A computational biologist's personal views on new technologies & publications on genomics & proteomics and their impact on drug discovery
Friday, February 27, 2026
Element Previews High-Throughput VITARI
On the Thursday before AGBT, Element Biosciences unveiled their VITARI high throughput sequencer, expected to ship in the second half of this year. VITARI surprised much of the sequencing community, including this pundit, as it seemed as though Element was devoting their development efforts to their AVITI24 multi-omic spatial system - which they continue to evolve.
Wednesday, February 25, 2026
Illumina Constellation Becomes TruPath Genome
Illumina is rebranding their Constellation mapped read technology to TruPath Genome and launching it into the marketplace. Priced at $395 per genome, TruPath Genome was presented as being superior to true long read approaches for generating haplotype blocks and for resolving difficult genetic disease loci. TruPath Genome relies on on-flowcell tagmentation of long input molecules, meaning that library prep moves entirely to the flowcell. TruPath genome runs on new NovaSeq X C8 flowcells (8 lanes) or the new C2 flowcell (2 lanes), with one sample per lane.
Tuesday, February 24, 2026
Canal Biosciences: Ligation Libraries without Distortions from End Repair
Canal Biosciences is a small startup in the library prep space that is making a debut at AGBT this year as a contributing sponsor. Based in the historic mill town of Lowell Massachusetts with its many canals, the company is bringing forward a ligation chemistry which is free from end repair - and that offers many interesting advantages. Three of Canal's scientists (half the team!) chatted with me on the eve of AGBT about their TrueLink chemistry.
Enzymatic ligation lies at the heart of a large number of NGS library preparation protocols. Tagmentation is probably the next most common, and then there is a long tail of niche methods that introduce sequencing adaptors by random priming, click chemistry or other chemistry. Ligation traces back to the earliest days of molecular cloning. At first such techniques relied on the sticky overhangs generated by Type II restriction enzymes, but later methods were developed for blunt ends. DNA sheared by mechanical or non-specific nucleases can have a variety of ends, so "end polishing" or "end repair" protocols were developed, taking advantage of the properties of T4 DNA polymerase. Under the right conditions, this enzyme will extend a recessed 3' hydroxyl and trim back a 3' overhang. Later came A-tailing and Y-adaptors to ensure directional generation of sequencing library fragments - no i5-i5 or i3-i3 fragments (to use Illumina nomenclature) which will soon drop out during PCR or just waste flowcell space in a PCR-free protocol.
End repair into ligation (with possibly a stop at A-tailing) has served the sequencing community well for decades, so don't fix what isn't broken, right? Except there are issues becoming increasingly apparent with this approach, especially as DNA sequencing moves beyond genome assembly, transcriptome discovery and variant calling into epigenomic markings and naturally fragmented DNA. Plus the sequencing instruments deliver increasingly accurate reading of the library fragments they are given, so it is important to not alter the sequence from its native state.
Those cracks have been showing. First, there are fields using DNA that is extracted as fragments - ancient DNA, forensic DNA and cell-free DNA. Many of these are short to start with, so discarding any of it via trimming 3' overhangs is painful. Worse, there is an emerging field of fragmentomics which has shown that the lengths and termini of cell-free DNA carry critical information about the processes that generated the cell-free DNA.
T4 DNA polymerase also lacks the fidelity of many polymerases, and so potentially generates base substitutions. Element Biosciences dealt with this in their Q50 protocol by running dark cycles across the bases most often filled in by T4, which they showed had lower accuracy. But the Broad has published data showing that end repair often rewrites DNA far from the ends of the molecule, with all the problems of rewriting the native DNA. And if you are interested in methylation marks, rewriting will erase them and the bases filled-in won't have the marks of the opposite strand copied over.
One solution to this is to denature the sample and use a single stranded DNA prep, such as marketed by Claret Biosciences. Canal is avoiding that step by creating a double stranded ligation chemistry which can still ligate to recessed 3' ends, recessed 5' ends or blunt ends. It also functions on single stranded DNA. The company is a cagey about the exact chemistry - tiny startups must be careful not to give fast follower competitors more information than strictly necessary - it uses to accomplish this, but their issued patents discuss using mutant ligases and modified oligonucleotides. One gets the impression that co-founder Yu Zheng, a veteran protein engineer with stops at IDT and NEB and a previous founder of a protein engineering startup - probably regularly dreams of atoms dancing the complex molecular choreography of a ligation reaction.
Canal's first TrueLink product is a PCR-indexed DNA kit, with 24 reactions for $1.2K and 96 for $4.35K. This is a bit of a premium to existing ligation kits, but you are getting a superior chemistry which won't trim ends or erase methylation. Soon to follow - making and validating all the indexing oligos isn't a small task - will be TruLink Absolute PCR-Free. These are intended to take in DNA either from cell-free inputs or downstream of mechanical shearing; commercial nuclease fragmentation kits typically contain end repair enzymes. Canal plans to offer their own nuclease mix in the future to enable enzymatic shearing
Canal's libraries nearly eliminate the higher error rates seen at the ends of reads. Interestingly, on Illumina platforms the first few bases still show a highly elevated rate. Canal hasn't tested this on another platform - I'd suggest Element or PacBio HiFi would be very interesting. Is it just a basecalling algorithm weakness?
An interesting additional benefit of TrueLink is reducing the number of chimaeras - despite all the Y-adapter trickery ligation libraries contain as much as 1% chimaeric fragments. Those are certainly unwelcome when trying to detect structural variants, particularly in low coverage data.
Canal has plenty more in development - kits tuned for RNA-Seq (targeted for next year) and for methylation sequencing. The latter would support both cytosine-conversion chemistries and 5mC conversion chemistries.
So expect even more from this small company that wants to steer sequencing library preparation away from the time tested, but problematic, paradigm of end repair followed by A-tailing.
Monday, February 23, 2026
Wish: Scientific Poster Snap Organizer App
AGBT is another conference with a wonderful wealth of scientific posters and never enough time to view them all properly. I definitely enjoy engaging scientists - particularly early stage scientists such as graduate students - at their posters, but poster session times are traps for other conversations and all too often are booked-over with meetings. And I'm not adept at time management. Just a bit over a week ago, I found myself furiously running through the posters at SLAS on the last day, trying to capture them on my phone while lamenting the ones I already saw being carted away. So now I've converted the problem of spending time on each poster to a future time - but can I have more than just a collection of photos on my devices?
Sunday, February 22, 2026
AI Agent -Driven Biochemical Optimization
When I started this space, I vowed not to become just a mouthpiece of my employer - but when my employer does something interesting, of course I'm not going to reflexively avoid writing about it. On the autonomous labs front, there's an interesting preprint showing how an intelligent agent interacting with an autonomous laboratory system can generate new knowledge. I'd like to think I would have covered it anyways, but since it is the autonomous laboratory at my workplace, I'm especially eager to do so. In a collaboration between Ginkgo and OpenAI, an agent was used to optimize a cell-free protein synthesis reaction using our autonomous laboratory based on Reconfigurable Automation Cart (RAC) technology. This isn't the first time this has been done, but I think it has a number of interesting elements. Plus, optimizing complex biochemical reactions is a broadly interesting area to explore. And, thinking up new services running on our autonomous laboratory is exactly my bailiwick, and I certainly reserve the right to use this space to be a mouthpiece of myself!
Wednesday, February 18, 2026
AGBT 2026 Preview
AGBT starts on Monday at the Signia by Hilton Bonnet Creek in Orlando on Monday, but the pre-conference flurry of announcements is already rolling out. Here is a quick scan of what's already been telegraphed to be in store next week - and expect more announcements over the next few days. I'm getting to Orlando on Friday - far before most people so perhaps I can actually focus on writing (though I also have a backlog of follow-ups from last week's SLAS lab automation conference). I've supplied links for many of these items, though in some cases it isn't easy to find a single link to include
Sunday, February 08, 2026
Pancreatic Adenocarcinoma Mutation Trajectories at Single Cell Resolution
Single cell RNA sequencing is nearly ubiquitous; single cell DNA sequencing has been much rarer. I'm going to dive in a bit - but not a full review - of a recent paper which described single cell DNA sequencing of pancreatic ductal adenocarcinomas. The paper applied the Mission Bio droplet-based PCR approach, which hasn't had quite the buzz of its close technological cousin 10X Genomics. And there are some interesting implications of the findings in this paper to impending developments in oncology therapeutics - including one in which I have more than academic interest.
Monday, February 02, 2026
Non-coding DNA's Alpha Moment
I had been meaning to read the AlphaGenome paper on non-coding variant effect prediction from Google DeepMind which recently showed up in Nature, but had found excuses not to dive in. Then DeciBio’s Stephane Budel posted on LinkedIn with some incisive comments and few things spur me to action better than my sense of competition! So here's a quick, incomplete & imperfect take on this giant paper.
Thursday, January 29, 2026
Sequencer vs. Library Prep Market Structures
Something I started pondering as I was prepping my JP Morgan items: the structure of the market for sequencing library prep is very different from that for sequencers. Perhaps this is painfully obvious to many, but I hadn't turned it over before in my mind. It also follows that the sequencer market was overweighted in who was invited to give talks at the banking conference - nearly every sequencer maker was there - Complete Genomics and Singular Genomics were the missing parties (and Singular is a bit of a quasi-sequencer company now). Note that in this piece I will not claim a complete census of companies in this space - though please feel free to note exceptions in the comments
Wednesday, January 28, 2026
P2 Solo: ONT Extends Support But Not Sales
A brief update to the earlier P2 Solo piece - Oxford Nanopore has now announced they will extend their support for the device until 2030, but they are still ending sales this summer. So a cosmetic victory for Solo fans, but not really much gain - and certainly disappointing for anyone saving their quid for a future purchase. Solos may become very valuable, though it's hard to see ONT ever countenancing a healthy resale market.
Sunday, January 25, 2026
ONT Axes P2 Solo, Roiling Community
Oxford Nanopore has a serious case of user community unrest, triggered by the announcement that the P2 Solo (aka P2s) will be phased out starting this summer. P2 Solo is the two flowcell version of PromethION which required external GPU for basecalling; P2i is the sibling with onboard GPU. P2 Solo clearly has a very passionate constituency, one which feels betrayed by its discontinuation. This ongoing episode - ONT could always change their mind and restore the P2 Solo - is a cautionary tale of the challenges of changing corporate strategy in the context of a brand which has built a near cult-like loyalty.
Thursday, January 22, 2026
JP Morgan 2026 Roundup
The J.P. Morgan conference last week was mostly small tidbits of info on the sequencing front, with Element delivering the one big teaser announcement.
Sunday, January 11, 2026
JPM 2026 Schedule Through A Molecular Tools & Testing Lens
The J.P. Morgan Healthcare Conference is this week in San Francisco. As with every other one, I'm tracking any news remotely from Boston. Some year I really should go to observe the sheer spectacle of it, but this isn't the year - and if I were in California now it probably would be more valuable to attend Plant & Animal Genomes down in San Diego. If you do go, be sure to check out the 4 instrument autonomous laboratory Ginkgo will have running in the Marriott Marquis lobby. But here are some notes to keep track of when different companies are presenting. Public companies will make their presentations available shortly thereafter to comply with insider trading regulations. Unfortunately, several years ago J.P. Morgan ended posting presentations from private companies - these companies could choose to post their decks but I'm not optimistic to see many.
Friday, January 09, 2026
Stalking the Elusive Product Market Fit
I had a wonderful breakfast conversation back in November with Arima Genomics Founder (and now President and Chief Operating Officer) Sid Selvaraj where he brought up the topic of product market fit. I've often kibitzed on 'omics companies trials and tribulations trying to achieve product market fit, but now that I'm in an outward-facing product development role the concept is central to my daily work. What is it and how does one achieve it - or fail to?
Wednesday, January 07, 2026
Scales of Instrumentation and Automation
Back in September I moved into a product role within the automation unit at Ginkgo, where we sell autonomous laboratories. I'm going to end up posting a bunch on the topic, as I spend a lot of time thinking about it and this is an interesting place to expose my thinking to critique. It's also a way to solicit feedback on semi-formed ideas. I referenced autonomous laboratories in my previous post, which might have some wondering "what exactly is an autonomous laboratory?" So this post will lay out a description of different levels of automation (or instrumentation) so I can refer back to it in future posts - and there will certainly be further posts focused on these different scales of automation. So here I will lay out four levels of laboratory instrumentation: Bench, Walkup, Workcell and Autonomous. BTW, if you are in San Francisco next week during J.P. Morgan, there will be a small autonomous lab running in the Marriott Marquis lobby - and it can be viewed regardless of whether you are registered for the conference
