I've been intrigued for a few months now since hearing about a St. Louis company called Cofactor Genomics. Right on their front webpage they advertise they will generate & assemble 680Mb of sequence (from an Illumina machine) for the paltry sum of $4.7K.
Wow! That would fit on my credit card when I was a graduate student (though it would have been a few months' stipend). 680Mb is 100+X coverage of an E. coli-class genome, or about 50X coverage of Saccharomyces. It's even well over 0.5X coverage of an awful lot of interesting eukaryotes.
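For anyone who wants to check that back-of-the-envelope arithmetic, here's a minimal sketch. The genome sizes are my own approximate reference values (roughly 4.6Mb for E. coli K-12 and 12.1Mb for S. cerevisiae), not anything from Cofactor's site, and the function names are made up for illustration.

```python
# Approximate genome sizes in megabases. These are assumed round
# reference values for illustration, not figures from any vendor.
GENOME_SIZES_MB = {
    "Escherichia coli K-12": 4.6,
    "Saccharomyces cerevisiae": 12.1,
}

SEQUENCE_YIELD_MB = 680  # the advertised amount of sequence


def fold_coverage(yield_mb: float, genome_mb: float) -> float:
    """Average depth of coverage: total sequence divided by genome size."""
    return yield_mb / genome_mb


def max_genome_mb(yield_mb: float, target_coverage: float) -> float:
    """Largest genome (in Mb) that still reaches the target average coverage."""
    return yield_mb / target_coverage


if __name__ == "__main__":
    for name, size_mb in GENOME_SIZES_MB.items():
        depth = fold_coverage(SEQUENCE_YIELD_MB, size_mb)
        print(f"{name}: ~{depth:.0f}X")
    # Any genome up to this size gets at least 0.5X average coverage.
    print(f"0.5X ceiling: {max_genome_mb(SEQUENCE_YIELD_MB, 0.5):.0f} Mb")
```

With those assumed sizes this works out to roughly 148X for E. coli and 56X for yeast, and 0.5X coverage holds for anything up to about 1,360Mb. Keep in mind this is average depth only; it says nothing about how evenly the reads land.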
As an aside, I feel obligated to stress that I don't have any personal stake in, or direct relationship with, Cofactor Genomics. I also have no experience with them or any of their competitors. It's just that their easily accessible pricing matrix makes them easy to talk about.
At those prices, the idea of doing my own personal genome project can't be easily shooed away. Not a Personal Genome Project -- I worry I'd develop genomania -- but some small genome sequenced on my whim. There's probably still no shortage of interesting genomes in species I could easily & safely grow up with some forbearance from my shop's management or at a friendly academic lab. There must be some left; there are even some industrially-interesting E. coli strains that seem to lack public sequences. However, even if it wouldn't violate my town's zoning laws to do it in my basement, neither growing biological samples nor the $5K budget would fly with my spouse.
So I'll float a different idea. My only wish is that anyone who tries it post back here, and if you're already doing the same thing I invite your response as well. If I can't do it, why not some class?
Now $5K isn't chicken feed. I'm sure that is far beyond the typical budget for lab experiments in a college class, let alone a high school. Maybe a donor could step in, but these days donors are particularly tough to find. But suppose the cost were spread over a lot of students?
One scenario would be for a very large university to make this the project for an entire class. A really huge state school I would guess could have 500+ students a year taking first-year biology. Now we're talking less than $10/student -- perhaps still a significant hit (what is a typical per student budget for such a course?). Each student would get about 1/500th of the genome as their very own research project.
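Here's a quick sketch of that split, assuming the $4.7K price, a 500-student class, and an E. coli-class genome of about 4.6Mb (the genome size is my assumption; the function names are just for illustration).

```python
PROJECT_COST_USD = 4700       # advertised price for the sequencing run
CLASS_SIZE = 500              # guessed enrollment at a very large school
GENOME_SIZE_BP = 4_600_000    # assumed E. coli-class genome, ~4.6Mb


def cost_per_student(total_cost: float, n_students: int) -> float:
    """Even split of the project cost across the class."""
    return total_cost / n_students


def bases_per_student(genome_bp: int, n_students: int) -> float:
    """Even split of the genome into one slice per student."""
    return genome_bp / n_students


if __name__ == "__main__":
    cost = cost_per_student(PROJECT_COST_USD, CLASS_SIZE)
    share = bases_per_student(GENOME_SIZE_BP, CLASS_SIZE)
    print(f"Cost per student: ${cost:.2f}")
    print(f"Genome per student: {share:,.0f} bp")
```

Under those assumptions it comes to $9.40 and about 9,200 bp per student. At the usual bacterial rule of thumb of roughly one gene per kilobase, that's on the order of nine genes apiece, though a real assignment would presumably split on gene or operon boundaries rather than evenly by base count.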
At a smaller school, could a genome project become a departmental initiative? A bioinformatics class could set up the analysis pipeline & develop reporting tools. A biochemistry class could map the ORFs to the known biochemical pathways and identify both missing pathways and predicted novel (to the species) enzyme activities. Genetics classes could focus on operon structure or identifying possible regions recently transferred horizontally from another species. Evolution classes could tackle that, or building a bazillion gene trees. It's a bit of a stretch to work this into a human physiology curriculum, though a comparative look at how another biological system manages homeostasis isn't completely absurd.
Of course, when it comes time to publish it will be a very long author list!
I think I've heard of a genome project being run as an undergraduate effort, but I'm guessing a lot of that involved doing the actual sequencing. While there's merit to that, these days even with free labor, large-scale Sanger sequencing isn't cost competitive. Perhaps some departments have one of the next-gen machines & are willing to let some undergraduates play with them -- but I'm guessing that's pretty rare (like a NotI site in an AT-rich genome).
Will sequencing costs ever crash low enough that someone will sequence a genome for a grade school science fair project? I'm not holding my breath, but I certainly wouldn't rule it out.