Germline Set Overview v3.pdf (715.2 KB)
Attached is a revised overview of the Germline Set proposal, reflecting comments on the previous version and including an entity-relationship diagram for the database. I have also produced a new Germline Scheme and File Formats document. This includes a description of each field in the database, and a separate illustrative file format, showing the format in which a germline set could be downloaded for use by a parser. This is illustrative only at this stage, because there are details mentioned in previous threads which haven't as yet been resolved, such as whether all chains should be in the same file or in separate files. I think we could do with some discussion on a call when time allows. But for the time being I did think it would be valuable to show some details of a possible format, in order to draw out some of the features it offers, which are highlighted in the last slide of the overview.
Terminology continues for the time being to be aligned with IMGT standards and I have revised one or two field names with this in mind.
Please let me have any comments - either here or in the Google sheet.
Thanks, and Happy New Year to you all