To the ancient Greeks, a chimera was a kind of monster, with the body of a goat, the tail of a dragon, and a lion's head. To geneticists, a chimera can be an animal that's derived from two embryos, such as a transgenic mouse. Or if the organism is a plant, it can be a plant with a graft. We have a chimeric cherry tree in our back yard with branches from Rainier cherries, Bing cherries, and Van cherries. And you should see the chimeras that hang out at evolgen
Naturally, the DNA cloning and sequencing world has it's chimeras, too. There are two main kinds that I know. Sometimes chimeras are created during a cloning step when two fragments of DNA that are normally separated, become joined by accident in the cloning step.
Can you spot the chimera?
Sometimes chimeras are created during electrophoresis. In the early days of DNA sequencing instruments, when fluorescently-labeled DNA fragments were separated by size in polyacrylamide gels, chimeras could appear during the gel runs.
This image shows what the DNA in a gel would look like. Labeled DNA fragments from 8 different samples, are lined up in 8 lanes, straight up and down. Occasionally, while the gel was running, the tracking software would shift lanes, and start reading information from the next lane. These events produced chimeric reads, that is, reads with DNA sequences originating from two different samples. (What is a read?
A chimeric read.
Believe it or not, one of my former students had a night job at a genome center that involved staring at computer screens and making corrections if the gel tracking software started to misbehave.
I'm sure you're wondering, about now, how this all relates to genome sequencing.
Yesterday, I wrote about some of our experiences with analyzing some genomic libraries from phage
(1). Some of the libraries had been created from DNA that was broken into random pieces by sonication. Other libraries were created from genomic DNA that was digested with restriction enzymes.
Remember, there are 3 main steps in sequencing genomes (and I suspect if you read a few of these posts you will never forget this
- Break the genome into lots of small pieces at random positions.
- Determine the sequence of each small piece of DNA.
- Use an assembly program to figure out which pieces fit together.
We got such strange results from trying to reconstruct genome sequences from the RE (restriction enzyme) libraries that we knew there must be something strange going on. Yesterday, we established that the libraries weren't random, but we thought there must be other problems, too.
Hunting for chimeras.
This is what happens when you assemble DNA that's been broken at random positions.
You can imagine that if the DNA sequences in our reads are from different locations, themselves, things can get pretty confusing, pretty fast. We thought that if some of the reads were chimeras, that might explain the bizarre results from our assemblies.
Fortunately, Phrap (2), the assembly program that we use in the Finch Suite
, has an option to detect chimeras and problem reads. We put it to the test. We assembled different sets of sequences that had been sampled over time, and to each assembly, we added different numbers of new reads.
What happens when we add RE reads to our assemblies?
The first half of the graph shows the results from assembling 3148 reads obtained from the sonicated DNA library. We started reads from the RE libraries at the position of the vertical bar. In total, we added 1187 reads from the AseI and DraI libraries. Phrap detected 67 chimeras in the last assembly (4.7% of the added reads) and comparatively few in the reads from the sonicated DNA library.
I'm sure glad we weren't sequencing manatees.
1. E. Green. 2001. Strategies for the systematic sequencing of complex genomes. Nature Reviews Genetics 2:573-583.
2. Porter, S., Slagel, J., and T. Smith. 2004. Analysis of Genomic DNA Library Quality with the Finch®-Server. Geospiza, Inc. You can download the paper as a pdf document from here: http://www.geospiza.com/research/white-papers.htm
Look in the middle of the page.
The earlier installments are here:
Part I: Introduction
Part II: Sequencing strategies
Part III: Reads and chromats
Part IV: How many reads does it take?
Part V: Checking out the library