As some of you already know, there is an emerging community, led by @Felix_Breden, Jamie Scott, and @tbkepler, which is working hard to coordinate our emerging field of repertoire analysis. There is an upcoming meeting, which unfortunately is already full, but there will be opportunities for remote participation and I’m dedicated to making sure that we get contributions from the broader community.
As part of this effort, there are several working groups that are putting together white papers on standards and recommendations. I’m part of the tools and resources working group along with @caschramm @sdwfrost @davide.bagnara @kiplingd @bussec @dooley, and I’m posting here to ask for your suggestions to direct that work. Other working groups have started a nice pattern in which we first decide on a set of philosophical principles, then talk more about specifics.
Below is a quick sketch of some principles that I came up with @caschramm, and I hope you’ll tear them apart and make better ones. [I know that some of the following will read like pie-in-the-sky, but that’s the point here.] Thanks!
Software tools and resources
Community goals
- Promote transparent sharing of tools, methods and information, in order to enable review, contribution and continued development
- Provide a platform for understanding current experience, methods and their limitations
- Discuss and communicate current challenges, providing a channel between tool/resource developers and users
- Find ways of describing, sharing and versioning alternative germline gene sets
- Develop standardized benchmark data sets and testing frameworks
- Strive for modularity and interoperability in independent software components
Software recommendations
Software should
- Use OSI-approved open-source licenses and hosted on publicly available repositories
- Be versioned
- Include metadata about run parameters should be part of the output
- Use community-curated standard file formats
- Include example data and checks for expected output
- Clearly list dependencies and/or provide scripts to build a virtual machine
- Have a public support forum or channel, describing the level of support users can expect
- Make it clear whether the software is still under active development
Biological tools and resources
Community goals
- Choose a central, searchable, location to list available protocols and reagents
- Develop a framework for a standardized MTA to facilitate sharing of reagents
- Develop standardized benchmark biological resources to test library preparation methods
Protocol recommendations
Protocols should
- Be made publicly available in full detail
- Be kept in a versioned repository and updated as necessary
- Novel reagents (plasmids, cell lines, antibodies, etc.) should be made available to all qualified researchers, preferably through an established repository