Evaluator API Schema

Evaluator API Schema#

Evaluator request#

Key	Value type - Required/Optional	Description: Value options	Example
`readout`	`string` - Required	Type of readout that is requested from the Predictor: [“point”,”track”, “interaction_matrix”].	“readout”: “track”
`prediction_tasks`	`array of objects` - Required	Each object must contain the following keys: `name`, `type`, `cell_type`, `species`, `scale`(optional).	“prediction_tasks”: [ { “name”: “task1”, “type”: “expression”, “cell_type”: “iPSC”, “scale”: “linear”, “species”: “homo_sapiens” } ]
`name`	`string` - Required	Unique identifier for each prediction task object.	“name”: “model_prediction”
`type`	`string` - Required	Prediction type you want predicted: [”`accessibility`”, “`binding_{molecule}`” , “`expression`”, “`conformation_{isoform}`”]. “`binding_{molecule}`” can be for any type of binding assay (e.g. CHIP-Seq, H3k27ac) and the text trailing the `_` should be all lower case and without any special characters. `expression` by default refers to mRNA production [`expression_mRNA`] e.g. RNA-seq. The other valid types are [”`expression_pol1`”, “`expression_pol2`”, “`expression_pol3`”, “`expression_splicing_acceptor`”, “`expression_splicing_donor`”]. “`conformation_{isoform}`” can represent the conformation of any isoform e.g. “`conformation_chromatin`”.	“type”: “expression”
`cell_type`	`string` - Required	What cell type you want predicted for `type`.	“cell_type”: “K562”
`species`	`string` - Required	What species you want predicted for `type`. Species names should be all lower case with words separated by a “_”.	“species”: “homo_sapiens”
`scale`	`string` - Optional	How would you like the predictions scaled upon return (if at all): [“linear”, “log”].	“scale” : “linear”
`upstream_seq`	`string`- Optional	Upstream flanking sequences to add to each sequence in `sequences`.	“upstream_seq”: “AATTA”
`downstream_seq`	`string`- Optional	Downstream flanking sequences to add to each sequence in `sequences`.	“downstream_seq”: “CCCAAAA”
`sequences`	`object` - Required	A collection of key-value pairs (strings). Keys are unique sequence ID keys – any characters `[A-Z][a-z][0-9][-.\_\~#\@%^&\*()]`.	“sequences”: { “seq1”: “ATGC…”, “seq2”: “ATGC…”, “random_seq”: “ATGC…”, “enhancer”: “ATGC…”, “control”: “ATGC…” }
`prediction_ranges`	`object` - Optional	A collection of key-value pairs, where the keys should be identical to sequence ID keys and values are arrays of integers with the start and end region you want predictions for, within the provided sequence context. Start and end are 0 indexed and inclusive, respectively (e.g. [0,1] is the first two bases).	“prediction_ranges”: { “seq1”: [0,1000], “seq2”: [100,110], “random_seq”: [], “enhancer”: [210,500], “control”: [] }

Notes#

keys in sequences must be unique or will be overwritten during the reading in
all indexing is 0 based
to minimize any bias from the predictors we suggested randomizing your sequences so that there is no dependency on the order

Evaluator Output File Specifications#

All evaluation metrics are saved into a single tab-separated file: evaluation_summary_[evaluator_name].csv

The evaluator_name is constructed automatically in config.py by appending the container’s build timestamp (from Apptainer’s /.singularity.d/labels.json) to the evaluator’s base name. The format is {EvaluatorName}_{YYYYMMDD-HHMMSS}_{TZ}. In development mode (outside a container), _dev is appended instead. This follows the same convention as predictor_name (see Predictor API Schema).

The file uses append mode, so multiple evaluation runs against different Predictors accumulate in the same file. The description column distinguishes between different metric types (e.g. per-task correlation vs. cell-type specificity).

Column	Data Type	Description	Example
`evaluator_name`	`string`	The unique, automatically versioned identifier for the evaluator module.	`agarwal_2025_joint_lib_20260407-134539_PDT`
`description`	`string`	Human-readable description of what is being measured. For per-task correlations, this identifies the task (e.g. `Agarwal Joint MPRA (K562)`). For cell-type specificity, this identifies the pair being compared (e.g. `Cell type specific expression (HepG2 - K562)`).	`Agarwal Joint MPRA (K562)`
`predictor_name`	`string`	The `predictor_name` returned by the Predictor in its response payload.	`DREAM-RNN_Human_K562_20260407-140628_PDT`
`time_stamp`	`string`	UTC timestamp in `YYYYMMDD-HHMMSS.f` format, generated at the time of metric calculation. Ensures unique entries when the same Evaluator-Predictor pair is run multiple times.	`20260407-212607.595087`
`metric`	`string`	The evaluation metric used.	`pearson_r`
`value`	`float` or `string`	The computed metric value. Set to “NaN” (as a string) only if the metric could not be calculated due to a protocol failure (e.g. missing predictions, HTTP errors, missing sequences, scale mismatch). A value of 0.0 is recorded if the Predictor successfully returned valid predictions, but the mathematical result is zero or undefined due to data invariance (e.g. the model predicted a constant flat-line value resulting in zero variance, or identical predictions across cell types).	`-0.0486` or `"NaN"`
`prediction_task(s)_data`	`string`	Serialized prediction task metadata (excluding the `predictions` values) as a list of dictionaries. For metrics involving two tasks (e.g. cell-type specificity), the metadata for both tasks is included, separated by `-`.	`[{'name': 'task_k562', 'type_requested': 'expression', ...}]`

The codebase for the Agarwal MPRA Joint Library Evaluator can be found on GitHub and serves as a reference for implementing custom metric calculations.

Evaluator API Schema

Contents

Evaluator API Schema#

Evaluator request#

Notes#

Evaluator Output File Specifications#