Tuesday, February 24, 2026

Canal Biosciences: Ligation Libraries without Distortions from End Repair

Canal Biosciences is a small startup in the library prep space that is making a debut at AGBT this year as a contributing sponsor.  Based in the historic mill town of Lowell Massachusetts with its many canals, the company is bringing forward a ligation chemistry which is free from end repair - and that offers many interesting advantages.  Three of Canal's scientists (half the team!) chatted with me on the eve of AGBT about their TrueLink chemistry.

Enzymatic ligation lies at the heart of a large number of NGS library preparation protocols.  Tagmentation is probably the next most common, and then there is a long tail of niche methods that introduce sequencing adaptors by random priming, click chemistry or other chemistry.  Ligation traces back to the earliest days of molecular cloning.  At first such techniques relied on the sticky overhangs generated by Type II restriction enzymes, but later methods were developed for blunt ends. DNA sheared by mechanical or non-specific nucleases can have a variety of ends, so "end polishing" or "end repair" protocols were developed, taking advantage of the properties of T4 DNA polymerase.  Under the right conditions, this enzyme will extend a recessed 3' hydroxyl and trim back a 3' overhang.  Later came A-tailing and Y-adaptors to ensure directional generation of sequencing library fragments - no i5-i5 or i3-i3 fragments (to use Illumina nomenclature) which will soon drop out during PCR or just waste flowcell space in a PCR-free protocol.

End repair into ligation (with possibly a stop at A-tailing) has served the sequencing community well for decades, so don't fix what isn't broken, right?  Except there are issues becoming increasingly apparent with this approach, especially as DNA sequencing moves beyond genome assembly, transcriptome discovery and variant calling into epigenomic markings and naturally fragmented DNA.  Plus the sequencing instruments deliver increasingly accurate reading of the library fragments they are given, so it is important to not alter the sequence from its native state.

Those cracks have been showing.  First, there are fields using DNA that is extracted as fragments - ancient DNA, forensic DNA and cell-free DNA.  Many of these are short to start with, so discarding any of it via trimming 3' overhangs is painful.  Worse, there is an emerging field of fragmentomics which has shown that the lengths and termini of cell-free DNA carry critical information about the processes that generated the cell-free DNA.  

T4 DNA polymerase also lacks the fidelity of many polymerases, and so potentially generates base substitutions.  Element Biosciences dealt with this in their Q50 protocol by running dark cycles across the bases most often filled in by T4, which they showed had lower accuracy.  But the Broad has published data showing that end repair often rewrites DNA far from the ends of the molecule, with all the problems of rewriting the native DNA.  And if you are interested in methylation marks, rewriting will erase them and the bases filled-in won't have the marks of the opposite strand copied over.

One solution to this is to denature the sample and use a single stranded DNA prep, such as marketed by Claret Biosciences.  Canal is avoiding that step by creating a double stranded ligation chemistry which can still ligate to recessed 3' ends, recessed 5' ends or blunt ends. It also functions on single stranded DNA.   The company is a cagey about the exact chemistry - tiny startups must be careful not to give fast follower competitors more information than strictly necessary - it uses to accomplish this, but their issued patents discuss using mutant ligases and modified oligonucleotides.  One gets the impression that co-founder Yu Zheng, a veteran protein engineer with stops at IDT and NEB and a previous founder of a protein engineering startup - probably regularly dreams of atoms dancing the complex molecular choreography of a ligation reaction.  




Canal's first TrueLink product is a PCR-indexed DNA kit, with 24 reactions for $1.2K and 96 for $4.35K.  This is a bit of a premium to existing ligation kits, but you are getting a superior chemistry which won't trim ends or erase methylation.  Soon to follow - making and validating all the indexing oligos isn't a small task - will be TruLink Absolute PCR-Free. These are intended to take in DNA either from cell-free inputs or downstream of mechanical shearing; commercial nuclease fragmentation kits typically contain end repair enzymes.  Canal plans to offer their own nuclease mix in the future to enable enzymatic shearing

Canal's libraries nearly eliminate the higher error rates seen at the ends of reads.  Interestingly, on Illumina platforms the first few bases still show a highly elevated rate.  Canal hasn't tested this on another platform - I'd suggest Element or PacBio HiFi would be very interesting.  Is it just a basecalling algorithm weakness?

An interesting additional benefit of TrueLink is reducing the number of chimaeras - despite all the Y-adapter trickery ligation libraries contain as much as 1% chimaeric fragments.  Those are certainly unwelcome when trying to detect structural variants, particularly in low coverage data.  

Canal has plenty more in development - kits tuned for RNA-Seq (targeted for next year) and for methylation sequencing.  The latter would support both cytosine-conversion chemistries and 5mC conversion chemistries.  

So expect even more from this small company that wants to steer sequencing library preparation away from the time tested, but problematic, paradigm of end repair followed by A-tailing.

Monday, February 23, 2026

Wish: Scientific Poster Snap Organizer App

AGBT is another conference with a wonderful wealth of scientific posters and never enough time to view them all properly. I definitely enjoy engaging scientists - particularly early stage scientists such as graduate students - at their posters, but poster session times are traps for other conversations and all too often are booked-over with meetings.  And I'm not adept at time management.  Just a bit over a week ago, I found myself furiously running through the posters at SLAS on the last day, trying to capture them on my phone while lamenting the ones I already saw being carted away.  So now I've converted the problem of spending time on each poster to a future time - but can I have more than just a collection of photos on my devices?

Sunday, February 22, 2026

AI Agent -Driven Biochemical Optimization

When I started this space, I vowed not to become just a mouthpiece of my employer - but when my employer does something interesting, of course I'm not going to reflexively avoid writing about it.  On the autonomous labs front, there's an interesting preprint showing how an intelligent agent interacting with an autonomous laboratory system can generate new knowledge. I'd like to think I would have covered it anyways, but since it is the autonomous laboratory at my workplace, I'm especially eager to do so.  In a collaboration between Ginkgo and OpenAI, an agent was used to optimize a cell-free protein synthesis reaction using our autonomous laboratory based on Reconfigurable Automation Cart (RAC) technology.  This isn't the first time this has been done, but I think it has a number of interesting elements. Plus, optimizing complex biochemical reactions is a broadly interesting area to explore.  And, thinking up new services running on our autonomous laboratory is exactly my bailiwick, and I certainly reserve the right to use this space to be a mouthpiece of myself!

Wednesday, February 18, 2026

AGBT 2026 Preview

AGBT starts on Monday at the Signia by Hilton Bonnet Creek in Orlando on Monday, but the pre-conference flurry of announcements is already rolling out.  Here is a quick scan of what's already been telegraphed to be in store next week - and expect more announcements over the next few days.  I'm getting to Orlando on Friday - far before most people so perhaps I can actually focus on writing (though I also have a backlog of follow-ups from last week's SLAS lab automation conference).   I've supplied links for many of these items, though in some cases it isn't easy to find a single link to include

Sunday, February 08, 2026

Pancreatic Adenocarcinoma Mutation Trajectories at Single Cell Resolution

Single cell RNA sequencing is nearly ubiquitous; single cell DNA sequencing has been much rarer.  I'm going to dive in a bit - but not a full review - of a recent paper which described single cell DNA sequencing of pancreatic ductal adenocarcinomas.  The paper applied the Mission Bio droplet-based PCR approach, which hasn't had quite the buzz of its close technological cousin 10X Genomics.  And there are some interesting implications of the findings in this paper to impending developments in oncology therapeutics - including one in which I have more than academic interest.

Monday, February 02, 2026

Non-coding DNA's Alpha Moment

I had been meaning to read the AlphaGenome paper on non-coding variant effect prediction from Google DeepMind which recently showed up in Nature, but had found excuses not to dive in.  Then DeciBio’s Stephane Budel posted on LinkedIn with some incisive comments and few things spur me to action better than my sense of competition! So here's a quick, incomplete & imperfect take on this giant paper.


Thursday, January 29, 2026

Sequencer vs. Library Prep Market Structures

Something I started pondering as I was prepping my JP Morgan items: the structure of the market for sequencing library prep is very different from that for sequencers.  Perhaps this is painfully obvious to many, but I hadn't turned it over before in my mind.   It also follows that the sequencer market was overweighted in who was invited to give talks at the banking conference - nearly every sequencer maker was there - Complete Genomics and Singular Genomics were the missing parties (and Singular is a bit of a quasi-sequencer company now).  Note that in this piece I will not claim a complete census of companies in this space - though please feel free to note exceptions in the comments