Hi AIRR Community,
Our BCR/TCR clustering software, InterClone, is in the final stages of development. InterClone accepts AIRR-compliant TSV files containing BCR or TCR amino acid (AA) sequences and returns clusters under various similarity thresholds. Our thinking is that it would be easiest for users if the output consisted of the original AIRR files with cluster IDs added as extra columns, one per threshold. There does not seem to be a concept of a “cluster” in the current schema, but given the rate of data growth, I think the need already exists. Any thoughts on this approach, or on how best to handle clusters within the current standards?
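To make the idea concrete, here is a minimal sketch of the kind of output described above. It is an illustration only, not InterClone's actual code: the helper `add_cluster_columns`, the column name `cluster_id_80`, and the cluster labels are all hypothetical placeholders. Only `sequence_id` and `junction_aa` are existing AIRR Rearrangement fields. The sketch assumes every row has a `sequence_id` and leaves the cell blank for sequences that were not clustered.

```python
import csv
import io


def add_cluster_columns(airr_tsv, clusters, out):
    """Copy an AIRR TSV, appending one cluster-ID column per threshold.

    `clusters` maps a (hypothetical) column name such as "cluster_id_80"
    to a dict of sequence_id -> cluster ID.
    """
    reader = csv.DictReader(airr_tsv, delimiter="\t")
    # Only append columns that are not already in the header.
    new_cols = [col for col in clusters if col not in reader.fieldnames]
    writer = csv.DictWriter(
        out,
        fieldnames=list(reader.fieldnames) + new_cols,
        delimiter="\t",
        lineterminator="\n",
    )
    writer.writeheader()
    for row in reader:
        for col, assignment in clusters.items():
            # Blank cell when a sequence has no cluster at this threshold.
            row[col] = assignment.get(row["sequence_id"], "")
        writer.writerow(row)


# Toy input: two rearrangements with made-up junction sequences.
airr = io.StringIO(
    "sequence_id\tjunction_aa\n"
    "seq1\tCARDYW\n"
    "seq2\tCARDFW\n"
)
out = io.StringIO()
add_cluster_columns(airr, {"cluster_id_80": {"seq1": "c1"}}, out)
print(out.getvalue())
```

All original columns pass through unchanged, so the result should still be readable by tools that ignore columns they do not recognize. Whether such custom columns are acceptable, and how they should be named, is exactly what we would like feedback on.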
Thanks in advance for your thoughts.