Paired VH/VL datasets


#1

Hi everyone,

I was wondering if anybody has, or is aware of, VH/VL datasets, other than the ones from DeKosky papers?

Cheers,
Pej.


#2

Yep:
Tumor-infiltrating immune repertoires captured by single-cell barcoding in emulsion

And 10x might have some smaller datasets too.


#3

Hi,
We at 10x have released some datasets for public use here:

https://support.10xgenomics.com/single-cell-vdj/datasets

In particular,
Pan T Cells isolated from mononuclear cells of a healthy donor
CD8+ Cytotoxic T cells from mononuclear cells of a healthy donor
CD4+ helper T cells from mononuclear cells of a healthy donor
Peripheral blood mononuclear cells (PBMCs) from a healthy donor
Jurkat, a lymphoblast cell line cultured in suspension
Anti EBV specific T cells of a healthy donor
Dissociated primary tumor cells from a clear cell renal carcinoma (CCRC)
Enriched CD3+ T cells of dissociated primary cells from a clear cell renal carcinoma (CCRC)
CD19+ B cells isolated from PBMCs of a healthy donor - Direct Ig enrichment
GM12878 cell line - Direct Ig enrichment
NSCLC tumor - Ig enrichment from amplified cDNA
NSCLC tumor - TCR enrichment from amplified cDNA
PBMCs of a healthy donor - Ig enrichment from amplified cDNA
PBMCs of a healthy donor - TCR enrichment from amplified cDNA


#4

Thank you, that was a fantastic paper. Though, I could not find a link to where the data is deposited.

Cheers


#5

Hi,

I have seen some of those datasets, but unfortunately I need a lot of sequences, since, it is for a machine learning application.

I have two questions, however:

  1. Are there, in total, over 100000 sequences in these datasets?
  2. Is any of the available sequence data from these projects processed?

Cheers,