Accelerate your drug discovery programs and power the next generation of AI models with access to some of the largest and most comprehensive single cell datasets available for commercial use. All datasets are generated using Parse’s Evercode combinatorial barcoding technology, enabling unmatched scale and quality. The following datasets are available for commercial use following completion of a license agreement.
Access raw and processed data, complete metadata, and experimental details to fast-track your internal research programs. This supplements your existing internal pipelines and shortens discovery cycles.
What’s Included with a Licensed Dataset
Each dataset license is fully configurable to meet the requirements of your organization. Parse provides a range of deliverables, including raw FASTQ files, processed gene count matrices, metadata, and detailed experimental documentation, and will tailor access to match your analytical pipelines and research objectives. Our team collaborates with you to determine the scope, format, and level of support that advance your research objectives.
Overview of Available Datasets for Commercial Licensing
Below is the current catalog of datasets available for licensing. New large-scale drug perturbation datasets are in progress and will be released soon.


Figure 2: Experimental design for the high-throughput cancer cell line drug screen.
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| A | GW9508 | Zebularine | Acarbose | CHIR-99021 (CT99021) | Alogliptin Benzoate | Bardoxolone methyl | Bilobalide | Curcumin | SRT2104 (GSK2245840) | Teneligliptin hydrobromide | Trelagliptin | DMSO |
| B | Troglitazone | Glimepiride | Linagliptin (BI-1356) | Sitagliptin phosphate monohydrate | Vildagliptin (LAF-237) | Alogliptin (SYR-322) | Rucaparib (AG-014699; PF-01367338) | SRT1720 HCl | Resveratrol | Rosiglitazone maleate | GSK3787 | DMSO |
| C | Rosiglitazone | GW0742 | GW501516 | Rosiglitazone HCl | Roflumilast | Pioglitazone HCl | MK-8245 | PF-04620110 | Piperlongumine | Streptozocin | Empagliflozin (BI 10773) | DMSO |
| D | Repaglinide | Dapagliflozin | R406 (free base) | R788 disodium | Canagliflozin | TAK-875 | CHIR-99021 (CT99021) HCl | Glipizide | GSK1292263 | R406 | Fingolimod (FTY720) | DMSO |
| E | Saxagliptin | Alvelestat | AZD7545 | Glibenclamide | Phenformin HCl | RVX-208 | Bethanechol chloride | Bosentan | Chlorpropamide | Methyldopa | Voglibose | DMSO |
| F | AdipoRon | Fenofibrate | Gemfibrozil | Inulin | Metformin HCl | Sodium Salicylate | Miglitol | Pioglitazone | LY2608204 | Tolbutamide | Gliclazide | DMSO |
| G | Gliquidone | Nateglinide | Ramipril | Valsartan | WAY-100635 maleate salt | Fostamatinib (R788) | Lisinopril dihydrate | Hydroxychloroquine sulfate | WS6 | Obeticholic acid | MK3102 | DMSO |
| H | Triacetyl Resveratrol | Diflunisal | Bromocriptine mesylate | (R)-(+)-Etomoxir sodium salt | Astaxanthin | Tauroursodeoxycholic acid | L-(-)-Fucose | Maslinic acid | Morin | L-Leucine | (2S,3S)-2-Amino-3-methylpentanoic acid | DMSO |
Figure 3: Plate layout of the 88-compound anti-diabetic drug screen.

Figure 4: UMAP of the 5 million mouse single nuclei atlas.
Dataset Comparison
| Dataset | Number of Cells / Nuclei | Number of Samples | Number of Reads per Cell | Treatments / Condition | Timepoints | Species | Ideal Applications |
|---|---|---|---|---|---|---|---|
| 10M PBMC Cytokine Screen | ~9.7M | 1092 | ~31,000 | 90 cytokines | 24h | Human | Immunomodulatory drug discovery Donor variability and stratification studies Training ML models for immune response prediction |
| Cancer Drug Screen | 800,114 | 384 | ~10,000 | 88 chromatin remodeling drugs & control | 6–24 hrs | Human | Mechanism-of-action (MoA) profiling Synergy prediction and combination hypothesis generation Benchmarking computational phenotyping |
| Anti-Diabetic Screen | 622,042 | 356 | ~2,000–3,000 | 88 anti-diabetic drugs & control | 1–24 hrs | Human & Mouse | Comparative MoA analysis across compound classes Identifying off-target or compensatory transcriptional effects Training predictive drug response models |
| Mouse 5M Nuclei Atlas | ~5M | 14 | ~10,000 | 7 tissues from male and female mice | N/A | Mouse | Atlasing Cross-tissue comparative analyses Preclinical species-to-human alignment studies |