Diagnose / Remove / Prove
Rosa diagnoses bias, removes it, and hands you the evidence.
Bias hides in data, directly and through proxies. Rosa strips it out without changing your data's statistics, and every run leaves an immutable, verifiable record. Everyone else measures and manages bias. Rosa removes it.
Free to try · Run it live in the browser
model race-disparity 0.54 column distribution
with Rosa 0.09 same distribution, preserved
See the methodology
See the bias in the data you supervise.
Free, independent, reproducible. Diagnose is free for supervisory authorities on the shared trial, with no end date.
Your path AuditorsArticle 10 bias-audit evidence in one run.
A standard, hash-verifiable evidence artefact you can reuse across engagements.
Your path EnterprisesComply before the deadline. Keep your model's performance.
Remove dataset bias and produce the evidence in one motion. UK processing, or a dedicated instance in your region.
Your pathMeasured, sourced, reproducible
The numbers do the persuading.
R² (R-squared) measures how well a model's predictions fit reality; higher is better. Reproducible. See the methodology.
Why Rosa is different
The only tool in the conversation that changes the dataset and proves the change.
Open-source fairness toolkits
Assess and mitigate at the data scientist's desk. A toolbox of metrics and algorithms you assemble and operate yourself. No evidence artefact, no service.
Governance and GRC platforms
Measure, monitor, and document bias across the enterprise. Useful paperwork, but the dataset leaves the platform exactly as biased as it arrived.
Rosa
Removes the bias at source, preserves the data's statistics, and proves the change with an immutable, hash-verifiable record of every run.
The product in three beats
Diagnose. Remove. Prove.
Find the bias
Point Rosa at a dataset and one protected attribute. It finds where bias is encoded, directly or by proxy (a feature that stands in for the protected attribute, like a postcode for race), and scores it.
Remove it at source
Rosa transforms the data so downstream models cannot recover the protected attribute, while preserving every column's statistics.
Keep the evidence
Every run emits an immutable Run Manifest and a PDF report. The audit evidence is generated by doing the work, not written up afterwards.
EU AI Act · Article 10
High-risk AI must be examined for dataset bias, with mitigation in place. The deadline is 2 August 2026.
Article 10 of the European Union Artificial Intelligence Act requires the training, validation, and testing datasets behind high-risk AI systems to be examined for possible biases, with appropriate measures to detect, prevent, and mitigate them. Rosa produces exactly that examination, the mitigation, and the evidence, in one run.
Product as proof
Do not take our word for it. Run it.
The Customer Portal includes Test 1: the Apple Card gender-bias scenario, debiased in your browser in one click. Submit the dataset, watch the job run, download the debiased CSV, the PDF report, and the Run Manifest (an immutable, hash-verifiable record of exactly what Rosa did to the data).
- job_id
- 550e8400-e29b-41d4-a716-446655440000
- mode
- remove_bias_training
- job_status
- complete
- timestamp_submitted
- 2026-06-10T09:14:02Z
- timestamp_completed
- 2026-06-10T09:31:47Z
- row_count
- 2,000
- bias_columns
- ["race"]
- input_hash
- sha256:9f1c…e7a2
- schema_hash
- sha256:4b08…21cd
- config_hash
- sha256:d3aa…90f4
- container_digest
- sha256:71be…0c55
- bias (pre)
- 0.21
- residual_bias
- 0.001
- artifacts
- remove_bias_report.pdf, compas_preconditioned_fair.csv
Where bias hides
Use cases with numbers, not adjectives.
Credit and lending
The Apple Card scenario: proxy-encoded gender bias removed with no fairness/accuracy trade-off. Run it live.
Read the case Criminal justiceRecidivism scoring
COMPAS: the canonical biased dataset. Rosa cuts a recidivism model's measured racial disparity by ~75-90% while preserving the data.
Read the case Hiring and HRShortlisting and scoring
Career gaps, postcodes, schools, names: proxies Rosa finds and removes without you labelling them.
Read the case HealthcareClinical risk models
The published heart-disease result: equal risk estimates for men and women after Rosa, gender gap effectively eliminated.
Read the caseWhat sets Rosa apart
- No proxy labelling required. Name one protected attribute; Rosa finds the proxies itself.
- Any model or stack. Rosa outputs a clean CSV. No rewrite of your training code.
- Data residency you choose. UK region for the trial; a dedicated instance in your own region for a PoC.
- Works for human decisions too. Cleans the data analysts use directly, not just model training data.
- No fairness/accuracy trade-off. Fairness as a transformation of the data, not a constraint fighting your model.
- Honest scope. Univariate: one protected attribute per run. Rosa says what it can and cannot detect.
Designed to support organisations implementing these frameworks at the data layer
Rosa is an evidence-producing instrument, not a certification. Standards and compliance
Remove the bias from your data. Keep the evidence.
Free to try. Free for regulators, with no end date. Processed in the UK.