Presentation
Accelerating DNA Long Read Mapping with Emerging Technologies
Author
Event Type
ACM Student Research Competition
Poster
TP
EX
TimeTuesday, November 13th8:30am - 5pm
LocationC2/3/4 Ballroom
DescriptionDNA sequencing technologies output only short fragments of a genome, called reads. New single-molecule real-time sequencing technologies can produce long reads, up to tens of thousands base pairs, within minutes. However, these long reads may contain up to 15% errors.
To construct a genome from DNA reads, a computationally expensive bioinformatics task, read mapping, is required. Read mapping finds the best-fitting location for each DNA read on a long reference sequence. The length and error rate of long reads poses a challenge for existing read mapping hardware solutions, designed for short reads with low error rates. This work presents a novel DNA read mapping hardware architecture, RASSA. RASSA is a Resistive Approximate Similarity Search Accelerator that exploits charge distribution and parallel in-memory processing to reflect a mismatch count between DNA sequences. RASSA implementation of long read DNA mapping outperforms state-of-the-art long read mapping solution by 16-77x with comparable accuracy.
To construct a genome from DNA reads, a computationally expensive bioinformatics task, read mapping, is required. Read mapping finds the best-fitting location for each DNA read on a long reference sequence. The length and error rate of long reads poses a challenge for existing read mapping hardware solutions, designed for short reads with low error rates. This work presents a novel DNA read mapping hardware architecture, RASSA. RASSA is a Resistive Approximate Similarity Search Accelerator that exploits charge distribution and parallel in-memory processing to reflect a mismatch count between DNA sequences. RASSA implementation of long read DNA mapping outperforms state-of-the-art long read mapping solution by 16-77x with comparable accuracy.
Archive