We want to find if one CDR3 is mutated or derived from another CDR3 in our own data pool. Is it possible to do that?
Or can I search for one CDR3 sequence in our data pool with the Levenshtein Distance (the edit distance between two sequences, i.e. sum of insertions, deletions or substitutions)
in tcR, vis.gene.usage(twb, HUMAN_TRBJ, .main = âtwb J-usage dodgeâ, .dodge = T) is based on âRead.countâ, can I use âRead.proportionâ to plot this figure?
Try .quant option. Like this: vis.gene.usage(twb, HUMAN_TRBJ, .main = âtwb J-usage dodgeâ, .dodge = T, .quant=âread.propâ)
The full list of parameters for vis.gen.usage is described in the main function called geneUsage.
Actually, for this particular task (searching for clonotypes with hamming/levensthein distance) Iâm using tcR.
As for VDJTools, it seems I was wrong. Just briefly went through its parameters - looks like the tool canât search with not exact matching. May be, it would be better to ask Michael (mikhail.shugay) directly..
Itâs also worth remembering that just because a CDR3 is very similar to another one it doesnât mean that one was derived from the other. Even TCR repertoires (which will not be undergoing SHM) will display vast networks of CDR3s which differ from each other by a Levenshtein distance of 1, presumably reflecting the biases inherent in their generation and selection.