Giskard-AI/cicd
Giskard CI/CD runner (WIP)

Overview

The idea is to have a common CI/CD core that can interface with different input sources (loaders) and output destinations (reporters).

The core is responsible for running the tests and generating a report.

The loaders are responsible for loading the model and dataset, wrapped as Giskard objects, from a given source (for example the Hugging Face Hub, a GitHub repository, etc.).

The reporters are responsible for sending the report to the appropriate destination (e.g. a comment on a GitHub PR, a Hugging Face discussion, etc.).
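The loader and reporter roles described above could be captured as two small interfaces. This is a hedged sketch: the class names and method signatures below are assumptions for illustration, not the prototype's actual API.

```python
from abc import ABC, abstractmethod

class Loader(ABC):
    """Hypothetical interface: loads a model and dataset from some source."""

    @abstractmethod
    def load_model_dataset(self, model, dataset, **loader_args):
        """Return (giskard_model, giskard_dataset) wrapped from the source."""

class Reporter(ABC):
    """Hypothetical interface: delivers a report to some destination."""

    @abstractmethod
    def push_report(self, report, **reporter_args):
        """Send the generated report to its destination (PR comment, discussion, ...)."""
```

Concrete implementations (e.g. a Hugging Face loader or a GitHub PR reporter) would then subclass these and fill in the source- or destination-specific logic.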

Tasks

A task could be a data object containing all the information needed to run a CI/CD pipeline. For example:

{
    "loader_id": "huggingface",
    "model": "distilbert-base-uncased",
    "dataset": "sst2",
    "loader_args": {
        "dataset_split": "validation"
    },
    "reporter_id": "huggingface_discussion",
    "reporter_args": {
        "discussion_id": 1234
    }
}

or

{
    "loader_id": "github",
    "model": "my.package::load_model",
    "dataset": "my.package::load_test_dataset",
    "loader_args": {
        "repository": "My-Organization/my_project",
        "branch": "dev-test2"
    },
    "reporter_id": "github_pr",
    "reporter_args": {
        "repository": "My-Organization/my_project",
        "pr_id": 1234
    }
}

These tasks may be generated by a watcher (e.g. a GitHub Action, a Hugging Face webhook, etc.) and put in a queue. The CI/CD runner then picks them up and runs the pipeline.

Alternatively, a single task can be created to run a one-shot GitHub Action, without queueing.
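The task objects above could be modelled as a small dataclass that deserializes directly from the JSON payload. This is a sketch only; the field names mirror the JSON examples, but `Task` and `from_json` are hypothetical names, not part of the actual codebase.

```python
import json
from dataclasses import dataclass, field

@dataclass
class Task:
    """Hypothetical task object matching the JSON examples above."""
    loader_id: str
    model: str
    dataset: str
    loader_args: dict = field(default_factory=dict)
    reporter_id: str = ""
    reporter_args: dict = field(default_factory=dict)

    @classmethod
    def from_json(cls, raw: str) -> "Task":
        # A queue consumer or GitHub Action could hand the raw payload here.
        return cls(**json.loads(raw))

task = Task.from_json("""{
    "loader_id": "huggingface",
    "model": "distilbert-base-uncased",
    "dataset": "sst2",
    "loader_args": {"dataset_split": "validation"},
    "reporter_id": "huggingface_discussion",
    "reporter_args": {"discussion_id": 1234}
}""")
```

Keeping tasks as plain data makes them trivial to enqueue, log, and replay, regardless of which watcher produced them.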

CI/CD Core

In pseudocode, the CI/CD core could look like this:

# Fetch the next task from the queue, or read it from the environment
task = get_task_from_queue_or_environment()

# Resolve the loader and wrap the model and dataset as Giskard objects
loader = get_loader(task.loader_id)
gsk_model, gsk_dataset = loader.load_model_dataset(
    task.model,
    task.dataset,
    **task.loader_args,
)

# Run the test pipeline and generate a report
runner = PipelineRunner()
report = runner.run(gsk_model, gsk_dataset)

# Push the report to its destination
reporter = get_reporter(task.reporter_id)
reporter.push_report(report, **task.reporter_args)
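The `get_loader` / `get_reporter` lookups in the pseudocode could be backed by a simple string-id registry. The decorator-based registry below is an assumption for illustration, not the prototype's actual mechanism.

```python
# Hypothetical registry mapping the string ids used in tasks
# (e.g. "huggingface", "github") to concrete loader classes.
LOADERS = {}

def register_loader(loader_id):
    """Class decorator that registers a loader under a string id."""
    def wrap(cls):
        LOADERS[loader_id] = cls
        return cls
    return wrap

@register_loader("huggingface")
class HuggingFaceLoader:
    def load_model_dataset(self, model, dataset, **loader_args):
        ...  # fetch from the Hugging Face Hub and wrap as Giskard objects

def get_loader(loader_id):
    try:
        return LOADERS[loader_id]()
    except KeyError:
        raise ValueError(f"Unknown loader: {loader_id!r}")
```

A parallel registry would serve `get_reporter`; adding a new source or destination then only requires registering a new class, with no change to the core.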

Prototype

The current implementation has two loaders, along with reporting and mapping options:

  • The github loader, which can be run from the command line (after running python train.py in examples/github):

    $ python cli.py --loader github --model examples/github/artifacts/model --dataset examples/github/artifacts/dataset
  • The huggingface loader, which can be run from the command line:

    $ python cli.py --loader huggingface --model distilbert-base-uncased-finetuned-sst-2-english --dataset_split validation --output demo_report.html
  • Automatically post to the discussion area of a given repo:

    $ python cli.py --loader huggingface --model distilbert-base-uncased-finetuned-sst-2-english --dataset_split validation --output_format markdown --output_portal huggingface --discussion_repo [REPO_ID] --hf_token [HF_TOKEN]
    $ python cli.py --loader huggingface --model distilbert-base-uncased-finetuned-sst-2-english --dataset_split validation --scan_config [Path to scan_config.yaml] --hf_token [HF_TOKEN]
  • Manually input label and feature mappings

    Label Mapping: map the dataset labels to model label ids. Use the label2id or id2label fields in the model card to help you if needed. The mapping goes from label index (as a string) to label name. Example:

    --label_mapping '{"0":"negative","1":"positive"}'
    

    Feature Mapping: map dataset feature names to the feature names the model expects, key to key.

    --feature_mapping '{"text": "sentence"}'
    

This launches a pipeline that loads the model and dataset from the Hugging Face Hub, runs the Giskard scan, and generates a report (in HTML format, for now).
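The mapping flags above could be handled roughly as follows. This is a hedged sketch of the idea, not the CLI's actual implementation; the function names are hypothetical.

```python
import json

def parse_label_mapping(raw: str) -> dict:
    """Parse the --label_mapping JSON string.

    JSON object keys are always strings, but model label ids are integers,
    so convert the keys (idx -> label name).
    """
    return {int(k): v for k, v in json.loads(raw).items()}

def rename_features(row: dict, feature_mapping: dict) -> dict:
    """Rename dataset columns to the feature names the model expects (key -> key)."""
    return {feature_mapping.get(k, k): v for k, v in row.items()}

labels = parse_label_mapping('{"0": "negative", "1": "positive"}')
row = rename_features({"text": "a great movie"}, {"text": "sentence"})
```

With the mappings applied, the dataset's labels and column names line up with the model's expected inputs before the scan runs.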

About

Prototype of CI/CD functionality for Giskard.
