A computational biologist's personal views on new technologies & publications on genomics & proteomics and their impact on drug discovery
Wednesday, November 06, 2024
Benchtop HiFi: PacBio Unveils Vega
Revio Refresh
Wednesday, October 09, 2024
MiSeq Makeover
Friday, September 27, 2024
QuantumScale: Two Million Cells is the Opening Offer
I'm always excited by sequencing technology going bigger. Every time the technology can generate significantly more data, experiments that previously could only be run as proof-of-concept can move to routine, and what was previously completely impractical enters the realm of proof-of-concept. These shifts have steadily enabled scientists to look farther and broader into biology - though the complexity of the living world always dwarves our approaches. So it was easy to say yes several weeks ago to an overture from Scale Bio to again chat with CEO Giovanna Prout about their newest leap forward: QuantumScale, which will start out enabling single cell 3' RNA sequencing experiments with two million cells of output- but that's just the beginning. And to help with it, they're collaborating with three other organizations sharing the vision of sequencing at unprecedented scale: Ultima Genomics on the data generation side, NVIDIA for data analysis, and Chan Zuckerberg Initiative (CZI) which will subsidize the program and make the research publicly available on Chan Zuckerberg Cell by Gene Discover.
Scale Bio is launching QuantumScale as an Early Access offering, originally aiming for 100 million cells across all participants - though since I spoke with Prout they've received over 140 million cells in submitted proposals. First 50 million cells would be converted to libraries at Scale Bio and sequenced by Ultima (with CZI covering the cost), with the second 50 million cells prepped in the participants labs with Scale Bio covering the library costs (and CZI subsidizing sequencing cost). Data return would include CRAMs and gene count matrices. Labs running their own sequencing have a choice of Ultima or NovaSeq X - the libraries are agnostic, but it isn't practical to run these libraries on anything smaller. Prout mentioned that a typical target is 20K reads per cell, though Scale Bio and NVIDIA are exploring ways to reduce this, so with 2M cells that's 40B reads required - or about two 25B flowcells on NovaSeq X.
How do they do it? The typical Scale Bio workflow has gotten a new last step, for which two million cells is expected to be only the beginning. The ScalePlex reagent can be first used to tag samples prior to the initial fixation, with up to 1000 samples per pool (as I covered in June). Samples are fixed and then distributed to a 96-well plate in which reverse transcription and a round of barcoding take place. Then pool those and split into a new 96-well plate which performs the "Quantum Barcoding", with around 800K barcodes within each well. Prout says full technical details of that process aren't being released now but will be soon, but hinted that it might involve microwells within each well. Indexing primers during the PCR add another level of coding, generating over 600 million possible barcode combinations. This gives Scale Bio, according to Prout, a roadmap to experiments with 10 million, 30 million or perhaps even more cells per experiment - and multiplet rates "like nothing".
As noted above, the scale of data generation is enormous, and that might stress or break some existing pipelines. Prout suggested that Seurat probably won't work, but scanpy "might". So having NVIDIA on board makes great sense - they're already on the Ultima UG100 performing alignment, but part of the program will be NVIDIA working with participants to build out secondary and tertiary analyses using the Parabricks framework.
What might someone do with all that? I don't run single cell 3' RNA experiments myself, but reaching back to my pharma days I can start imagining. In particular, there are a set of experiment schemes known as Perturb-Seq or CROP-Seq which use single cell RNA readouts from pools of CRISPR constructs - the single cell data both provides a fingerprint of cellular state and reveals which guide RNA (or guide RNAs; some of these have multiple per construct) are present.
Suppose there is a Perturb-Seq experiment and the statisticians say we require 10K cells per sample to properly sample the complexity of the CRISPR pool we are using. Two million cells just became 200 samples. Two hundred seems like a big number, but suppose we want to run each perturbation in quadruplicate to deal with noise. For example, I'd like to spread those four cells around the geometry of a plate, knowing that there are often corner and edge effects and even more complex location effects from where the plate is in the incubator. So now only 50 perturbations - perhaps my 49 favorite drugs plus a vehicle control. Suddenly 2M cells isn't so enormous any more - I didn't even get into timepoints or using different cell lines or different compound concentrations or any of numerous other experimental variables I might wish to explore. But Perturb-Seq on 49 drugs in quadruplicate at a single concentration in a single cell line is still many orders of magnitude more perturbation data than we could dream about two decades ago at Millennium to pack into three 96-well plates.
And that, as I started with, is the continuing story: 'omics gets bigger and our dreams of what we might explore just ratchet up to the new level of just in reach.
The announcement of QuantumScale also has interesting timing in the industry, arriving a bit over a month after Illumina announced it was entering the single cell RNA-Seq library prep market with the purchase of Fluent Biosciences. While nobody (except perhaps BGI/MGI/Complete Genomics) makes their single cell solution tied exclusively to one sequencing platform, the connection of Scale Bio and Ultima makes clear business sense - Illumina is now a frenemy to be treated more cautiously and boosting an alternative is good business. Ultima would of course love if QuantumScale nudges more labs into their orbit, and these 3' counting assays perform very well on Ultima with few concerns about homopolymers confusing the results (and Prout assures me that all the Scale Bio multiplex tags are read very effectively) . And as is so often the case, NVIDIA finds itself in the center of a new data hungry computing trend.
Will many labs jump into QuantumScale? Greater reach is wonderful, but one must have the budgets to run the experiments and grind the data. PacBio in particular and to a degree Illumina have seen their big new machines face limited demand - or in the case of Revio the real possibility that everyone is spending the same money to get more data (great for science, not great for PacBio's bottom line). But perhaps academic labs won't be the main drivers here, but instead pharma and perhaps even more so the emerging space of tech companies hungry for biological data to train foundation models - sometimes not even having their own labs but instead relying on companies such as my employer to run the experiments.
A favorite quote of mine is from late 1800s architect Daniel Burnham; among his masterpieces is Washington DC's Union Station. "Make no little plans. They have no magic to stir men's blood and probably will not themselves be realized." I can't wait to see what magic is stirred in women's and men's blood by QuantumScale, which is certainly not the stuff of little plans.
[2024-10-02 tweaked working around how program is funded]
Thursday, August 29, 2024
Illumina Would Like to Change the Conversation
Two New Library Preps -- in the Future
Monday, July 29, 2024
Musings on Possible Fixes To PacBio & ONT's Achilles Heels
Tuesday, July 09, 2024
Tagify: seqWell's Line of Tagmentation Reagents Awaits Your Creative Thoughts!
One of the most important enzymes in the sequencing world, one which enables spectacular creativity on the part of novel assay designers, is Tn5 transposase. Personally, I spend many times each month thinking about how to use Tn5 and its ability to tagment - both tag and fragment - input DNA. There’s even reports that Tn5 can tagment RNA-DNA hybrids such as from reverse transcription or even long single-stranded DNA. I’ve covered seqWell in the past,with their fully kitted reagents; now the company (which just turned ten) is launching a Tagify product line that is focused on enabling NGS dreamers to easily explore new Tn5-based library preparation methods.
Friday, June 28, 2024
mRNA Therapeutic / Vaccine Quality Control: A Major ONT Opportunity?
Thursday, June 27, 2024
ONT T2T Genome Bundle: Hot New Thing or Flash in Pan?
Wednesday, June 26, 2024
ScalePlex: Easing High Sample Count 3’ scRNA Sequencing
Monday, June 24, 2024
Aftermath
Friday, June 07, 2024
CariGenetics: Breakthrough Breast Cancer Genetics in the Caribbean - but Also a Template for ONT Clinical Push?
Tuesday, June 04, 2024
ElysION vs. TraxION: Divergent Shots at Applied Market End-To-End Automation
Tuesday, May 21, 2024
Thoughts on A Decade of Oxford Nanopore Sequencing
Monday, May 20, 2024
FOMO Index at All Time High This Week
Friday, May 17, 2024
HiFi WGS As A (Nearly) Unified Tool For Rare Genetic Disease Diagnosis
Sunday, May 12, 2024
DoveTail Transposes Their Hi-C Methodology
Friday, May 10, 2024
AQTUAL: Arthritis Drug Selection Via Assaying Cell Free Chromatin
[Note: after I initially released this, Dr. Abdueva spotted some glitches; I pulled it back for editing & then got swept into London Calling; this revised version is finally emerging]
Liquid biopsies - the idea of peering into the disease state somewhere in the body by looking at “cell-free DNA” in the blood - is quite the rage these days. There are a host of companies and approaches, and I haven’t quite found the discipline to start trying to build a census of all of them. The field started with Non-Invasive Prenatal Testing (NIPT), and then some early NIPT cases had odd DNA that looked like oncogenic in healthy mothers - who turned out to actually have the cancer. Oncology has been the primary focus, but there’s been many hints that liquid biopsies may be valuable in a wide range of diseases. A bit over a week ago, Dr. Diana Abdueva founder and CEO of AQTUAL, walked me through (over Zoom) that company’s liquid biopsy approach to inflammatory disease management.
Tuesday, May 07, 2024
On The Expanding Versatility of Single Molecule Sequencing for Detecting Anomalous DNA
Wednesday, May 01, 2024
First Illumina Complete Long Reads Preprint
Tuesday, April 30, 2024
A Peek At QuantumSI's Protein Sequencer
If QuantumSI is the Answer, What is the Question?
Tuesday, April 23, 2024
Bruker Wins NanoString Auction
Wednesday, April 17, 2024
PacBio Plummets
Wednesday, April 10, 2024
Thoughts on RNU4-2 Mutation Paper
Thursday, March 28, 2024
Post-AGBT: VizGen & Scale Biosciences Partner
Monday, March 11, 2024
BioNano In Peril Again
Thursday, February 29, 2024
Post-AGBT: Sequencing Hardware Roundup
Post-AGBT: Element AVITI Sequencing Updates
Tuesday, February 27, 2024
Post-AGBT: Both Element & Singular Want Spatial to Go With The Flow(cells)
Tuesday, February 20, 2024
AGBT Follow-up: Ultima Genomics UG100, Volta Labs Callisto, N6Tec iconPCR
Wednesday, February 07, 2024
VoltaLabs Launches Callisto for DNA Extraction & Library Prep
Here at AGBT, VoltaLabs has unveiled their 24-sample DNA extraction and NGS library prep Callisto instrument, which is particularly suited for long read applications but is also suited for short read work. Volta has matured liquid handling automation to a novel open top electrowetting technology. Priced at $125K and planning to ship in the second quarter, Callisto is designed as a walk-away solution requiring no human interaction during a run. Personally, not only do I love the a new medium-throughput instrument for HMW DNA extraction and manipulation, but I also can at least pretend I helped steer the company In that directions
Tuesday, February 06, 2024
iconPCR: Super-Flexible qPCR Thermocycler Oft Dreamed, Now Delivered
Has there ever been a product you’ve just wanted to have, but it doesn’t exist? That keeps popping up in discussions - “if only we had X this project would go so much faster!”. Well, N6 Tec’s automation-friendly $99K i96 well iconPCR thermocycler is that to me. Launching at AGBT, it’s the gadget I’ve wanted repeatedly at Codon Devices, Warp Drive Bio and now Ginkgo Bioworks. It won’t solve all your PCR challenges, but it certainly gives new options to customize PCR like never before. And for many NGS labs, it offers major streamlining of PCR-based library construction protocols while also delivering superior data. How? By being a thermocycler where every well can run its own thermal profile and each well can go dormant once a desired level of amplification is achieved
Monday, February 05, 2024
Want to Build A Sequencer? 454.bio Opens Up Their Plans
Thursday, February 01, 2024
Ultima Launches
As part of the run-up to Gold sponsorship at AGBT, Ultima Genomics held a multi-day event in early December, with tours of the headquarters facility and factory floor in the Bay Area and a day at a beautiful Wine Country resort. The resort session included talks from the company, early access collaborators and a pair of big name early backers, with a few hundred current customers and many contemplating the leap. So confident was the company in their product, they even invited a blogger to moderate one of the panel discussions! The UG100 is now officially launched as a fully commercial product, with ambitions to replace panels, exomes and microarrays with whole genome sequences at $100 apiece. All in an instrument package designed for continuous industrial-scale operation. Please note that Ultima did review this piece to ensure I didn’t disclose information they did not wish public, but for the most part just gave me some very good proofreading support. Photos are my own, except as noted.