Hi all,
I’m new to the BCR/antibody repertoire field. The following data would help me a lot in preparing a presentation on the reasoning behind primer design for scFv library construction from B cells, but I cannot find one complete set.
For a single human BCR (or antibody; IgG, IgM, or IgE would all be fine), I am looking for:
- (Optional) full gDNA sequence (after VDJ recombination, with annotation: Heavy chain [ VDJ and C ] and Light chain [ VJ and C ])
- Full mRNA sequence (with annotation)
- Amino acid sequence (with annotation; it could also be deduced)
- Structure (like those in the PDB, which I can view in software such as PyMOL)
I spent a whole day looking for such a set without success. What I have found so far:
- Two IgG precursor sequences with one puzzling link to a light-chain structure [one set]
anti-RhD monoclonal T125 precursor
Heavy chain
Light chain
Notes: The light chain record has one structure link, but I do not understand the connection between “anti-RhD monoclonal T125” in the sequence title and “Kir3dl1*015” in the structure title.
Also, the sequences do not have detailed V(D)J and C annotations.
- IgM full mRNA sequences with no structure information [two sets]
a. Homo sapiens 2G9 monoclonal IgM
Heavy chain
Light chain
b. Homo sapiens 9F11 monoclonal IgM
Heavy chain
Light chain
Notes: The sequences look good, but I do not know how to search for the corresponding structure data in a database like the PDB. Specifically, I do not know which keywords to use or where to search. When I put “2G9” in the PDB search box, the results are “4MZN”, “4DUC”, and “4DUF”. Again, I am lost. What is the relationship between them?
- IgG structure data with no sequence data [one set]:
1HZH on PDB
Notes: This is the only intact human antibody structure mentioned in the PDB Education Portal for Antibody, under the section “Exploring the Structure”.
(Two small questions here. First, in the .pdb 3D file, the 233rd and 234th amino acids are adjacent in the sequence of chain “/K/1” but are not connected in the cartoon structure. Is this normal? Second, the sequence lengths differ between the FASTA file and the model in the .pdb file, e.g., 457 aa for the heavy chain in the FASTA but 487 aa in the structure model. Why is that?)
Note 2:
When I searched for the keyword “1HZH” in NCBI Nucleotide, the results were also confusing. I wanted to find the annotated sequence for 1HZH (I guess it should be somewhere online).
Also, I noticed that it is easier to find Fab structures than intact antibody structures. A complete set for a Fab would also be a great help.
Thanks in advance!
Roden