Catalogue of Tools & Metrics for Trustworthy AI

These tools and metrics are designed to help AI actors develop and use trustworthy AI systems and applications that respect human rights and are fair, transparent, explainable, robust, secure and safe.

Phoebe

Idea

Phoebe wants to add basic artificial intelligence capabilities to the Linux OS.

What problem Phoebe wants to solve

System-level tuning is a very complex activity: it requires knowledge and expertise of several (if not all) of the layers which compose the system, of how those layers interact with each other, and, quite often, an intimate knowledge of how the various layers are implemented.

Another big aspect of running systems is dealing with failure. Do not think of failure only as a machine catching fire; think of an overloaded system, caused by misconfiguration, which could lead to starvation of the available resources.

In many circumstances, operators are used to dealing with telemetry, live charts, alerts, etc., which help them identify the offending machine(s) and (re)act to fix any potential issues.

However, one question comes to mind: wouldn’t it be awesome if the machine could auto-tune itself and provide a self-healing capability to the user? Well, if that is enough to trigger your interest then this is what Phoeβe aims to provide.

Phoebe uses system telemetry as the input to its brain and produces a set of settings that are applied to the running system. The decision made by the brain is continuously re-evaluated (taking the grace_period setting into account) to eventually offer the best possible setup.

Architecture

Phoebe is designed with a plugin architecture in mind, providing an interface for new functionality to be added with ease.

Plugins are loaded at runtime and registered with the main body of execution. The only requirement is to implement the interface dictated by the structure plugin_t. The network_plugin.c represents a very good example of how to implement a new plugin for Phoeβe.
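As an illustration of that pattern, the sketch below shows what a callback-table plugin interface and a registration routine could look like. Note that this is an assumption for illustration only: the real plugin_t lives in Phoebe's sources and its actual fields may differ.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Illustrative sketch only: the real plugin_t is defined in Phoebe's
 * sources and may differ. The idea is that each plugin exposes a small
 * table of callbacks which the core registers at load time. */
typedef struct plugin {
    const char *name;
    int (*init)(void);           /* called once after the plugin is loaded */
    int (*collect_stats)(void);  /* feed telemetry into the core           */
    int (*apply_settings)(void); /* write tuned values to the system       */
} plugin_t;

#define MAX_PLUGINS 8
static plugin_t *registry[MAX_PLUGINS];
static size_t    registry_len;

/* Register a plugin with the main body of execution. */
static int register_plugin(plugin_t *p)
{
    if (registry_len >= MAX_PLUGINS || p == NULL || p->name == NULL)
        return -1;
    registry[registry_len++] = p;
    return 0;
}
```

A plugin such as network_plugin.c would fill in such a table and hand it to the core, which then drives every registered plugin from its stats-collection and inference loops.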

Disclaimer

The mathematical model implemented is a very basic one, a machine-learning 101 approach:
input * weight + bias. It does not use any fancy techniques, and its complexity is close to zero.
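To make the model concrete, here is a minimal sketch of that computation using the weights and bias from the example settings.json below. The structure and function names are illustrative, not Phoebe's real API.

```c
#include <assert.h>
#include <stdio.h>

/* Hypothetical sketch of the input * weight + bias model: each metric
 * is multiplied by its weight from settings.json and a bias is added.
 * Names are illustrative, not Phoebe's real API. */
typedef struct {
    double transfer_rate;
    double drop_rate;
    double errors_rate;
    double fifo_errors_rate;
} metrics_t;

static double compute_score(const metrics_t *m, double bias)
{
    /* Weights taken from the example settings.json */
    return m->transfer_rate    * 0.8  +
           m->drop_rate        * 0.1  +
           m->errors_rate      * 0.05 +
           m->fifo_errors_rate * 0.05 +
           bias;
}
```

With the example bias of 10, metrics of (100, 5, 2, 1) would score 90.65; there is no training loop or non-linearity involved, which is exactly why the complexity is described as close to zero.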

The plan is to eventually migrate towards a model created in TensorFlow and exported so that it can be used by Phoebe, but
we are not there yet.

10,000-foot view

The code allows for both training and inference: all the knobs which can modify the run-time behavior of the implementation are configurable via the settings.json file, where each parameter is explained in detail.

For the inference case, when a match is found, then the identified kernel parameters are configured accordingly.

The inference loop runs every N seconds, and the value is configurable via the inference_loop_period.
The quicker the system should react to a change in conditions, the smaller the value of inference_loop_period should be.

The code has a dedicated stats-collection thread that periodically collects system statistics and populates the structures used by the inference loop. The statistics are collected every N seconds, and this value is configurable via the stats_collection_period. Depending on the overall network demands, the value of stats_collection_period can be made smaller or bigger to react more quickly or more slowly to network events.

In case a high traffic rate is seen on the network and a matching entry is found, the code will not consider any lower values for a certain period of time; this period is configurable via the grace_period in
the settings.json file.

That behavior has been implemented to avoid causing too much reconfiguration on the system and to prevent sudden system reconfiguration due to network spikes.
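The grace-period rule described above can be sketched as a simple guard; the structure and function names here are illustrative, not taken from Phoebe's sources.

```c
#include <assert.h>

/* Sketch of the grace-period rule: once settings for a given transfer
 * rate have been applied, settings for a LOWER rate are ignored until
 * grace_period seconds have passed, so short network spikes do not
 * trigger constant reconfiguration. Names are illustrative. */
typedef struct {
    double last_rate;    /* transfer rate of the last applied entry */
    double last_applied; /* timestamp (seconds) of last application */
    double grace_period; /* from settings.json                      */
} tuner_state_t;

/* Return 1 if the new entry may be applied at time `now`, 0 otherwise. */
static int may_apply(const tuner_state_t *s, double new_rate, double now)
{
    if (new_rate >= s->last_rate)
        return 1; /* equal or higher rate: apply immediately */
    return (now - s->last_applied) >= s->grace_period;
}
```

Higher rates are always applied immediately; only downgrades are delayed, which matches the goal of reacting fast to load while damping transient dips.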

The code also supports a few approximation functions, also available via the settings.json file.

The approximation functions can tune the tolerance value (calculated at runtime) to further allow the user to fine-tune the matching criteria. Depending on the approximation function chosen, the matching criteria become narrower or broader.

Settings

Below is a detailed explanation of the configuration options available in settings.json, their possible values, and their effect. (Note that this is not valid JSON as shown, because of the inline comments: remove the lines starting with a double forward slash before using it.)

{
    "app_settings": {

        // path where the application expects to find plugins to load
        "plugins_path": "/home/mvarlese/REPOS/phoebe/bin",

        // max_learning_values: number of values learnt per iteration
        "max_learning_values": 1000,

        // save trained data to file every saving_loop iterations
        "saving_loop": 10,

        // accuracy: the level of accuracy used to find a potential entry
        // given the transfer rate considered.
        //
        // MaxValue: Undefined, MinValue: 0.00..1
        // Probably not very intuitive: a higher number corresponds to
        // a higher accuracy level.
        "accuracy": 0.5,

        // approx_function: the approximation function applied
        // to the calculated tolerance value used to find a
        // matching entry in values.
        //
        // Possible values:
        // 0 = no approximation function
        // 1 = square-root
        // 2 = power-of-two
        // 3 = log10
        // 4 = log
        "approx_function": 0,

        // grace_period: the time which must elapse
        // before applying new settings for a lower
        // transfer rate than the one previously measured.
        "grace_period": 10,

        // stats_collection_period: the time which must
        // elapse between two stats collections.
        // It is expressed in seconds but accepts non-integer
        // values; e.g. 0.5 represents half a second.
        "stats_collection_period": 0.5,

        // inference_loop_period: the time which must
        // elapse before running a new inference evaluation
        "inference_loop_period": 1

    },

    "labels": {
        // geography: valid options are EMEA, NA, LAT, APAC, NOT_SET
        "geography": "NOT_SET",
        // business: valid options are RETAIL, AUTOMOTIVE, SERVICE, NOT_SET
        "business": "NOT_SET",
        // behavior: valid options are THROUGHPUT, LATENCY, POWER
        "behavior": "THROUGHPUT"
    },

    "weights": {
        "transfer_rate_weight": 0.8,
        "drop_rate_weight": 0.1,
        "errors_rate_weight": 0.05,
        "fifo_errors_rate_weight": 0.05
    },

    "bias": 10
}

Building and installation

See BUILDING.md for build instructions. Packages for various distributions can be found in the Open Build Service.

Running

The code supports multiple modes of operation:

  • Training mode:
    ./build/src/phoebe -f ./csv_files/rates.csv -m training -s settings.json
  • Inference mode:
    ./build/src/phoebe -f ./csv_files/rates_trained_data.csv -i wlan0 -m inference -s settings.json

Feedback / Input / Collaboration

If you are curious about the project and want more information, please, do reach out to marco.varlese@suse.com.
I will be more than happy to talk to you more about this project and what other initiatives are in this area.

About the tool


Github stars:

  • 79

Github forks:

  • 12


Use Cases

There are no use cases for this tool yet.

Would you like to submit a use case for this tool?

If you have used this tool, we would love to know more about your experience.


Disclaimer: The tools and metrics featured herein are solely those of the originating authors and are not vetted or endorsed by the OECD or its member countries. The Organisation cannot be held responsible for possible issues resulting from the posting of links to third parties' tools and metrics on this catalogue. More on the methodology can be found at https://oecd.ai/catalogue/faq.