The VBASE2 entries can be found on the VBASE2 web page. My favourite one is the entry for the V186.2 gene.
You can look at it here http://www.vbase2.org/vgene.php?id=musIGHV057&ref=all
I just go through all the entries of this sample and it would be worth to discuss what to add in the new gremline database.
1) VBASE2 ID (unique number) Nothing really associated to this number. It is just unique in the database. (this is not quite true, I added MUS for mouse and added IgH for the type of the Vgene. But you could treat the letters like numbers). If I would do it again, I would use just a number and add the species information elsewhere and the type of V gene elsewhere.
2) Class (as discussed before) The example above is a class I gene (the best).
3) functionality (of course functional as it is a class I gene), functionality is tested by looking at the sequence at the time of the database construction and anything unusual could make the gene potentially non-functional, for example if conserved positions are impaired.
4) V gene names (a list of all names known, like in this case V186.2)
5) V gene family (here I display the trivial name but should display the number, the scientists working with mouse sequences know the VHJ558 Family so well that they would not remember the number (VH1).
6) Last update (as you can see, I did the latest update in 2007 (about 9 years ago).
7) The nucleotide sequence (in fasta format with name and sequence length)
8) The protein sequence (in fasta format with name, I would not use bp now but number of aa).
9) Nucleotide sequence structure Listing key positions within the sequence like
FR1, FR2, FR3, CDR1, CDR2, CDR3 (more important for the light chains...)
Conceived positions 1st and 2nd Cysteine and conserved Tryptophan in FR2.
10) genomic sequence(s) all entries of the DNA databases where the genomic sequence can be found (Evidence for a class I sequence)
11) All rearranged sequences in the EMBL database
and cross references
12) All rearranged sequences in the IMGT database
13) All rearranged sequences in the KABAT database