An Encoder-Decoder Based Basecaller for Nanopore DNA Sequencing
dc.contributor.advisor | Magierowski, Sebastian | |
dc.creator | Abbaszadegan, Mahdieh | |
dc.date.accessioned | 2019-07-02T16:13:29Z | |
dc.date.available | 2019-07-02T16:13:29Z | |
dc.date.copyright | 2019-02-12 | |
dc.date.issued | 2019-07-02 | |
dc.date.updated | 2019-07-02T16:13:29Z | |
dc.degree.discipline | Electrical and Computer Engineering | |
dc.degree.level | Master's | |
dc.degree.name | MASc - Master of Applied Science | |
dc.description.abstract | Nanopore DNA sequencing is a method in which DNA bases are determined (basecalled) using electric current signals generated by passing DNA through nanopore sensors. The raw measured signals can be aggregated into event data representing new bases entering the nanopore. This thesis makes two contributions. First, we implemented RNN-based single- and double-strand basecallers for simulated event data to analyze the effect of signal noise. As the SNR decreased from 20 dB to 5 dB, the accuracy of the single-strand basecaller dropped 9% while the accuracy of the double-strand basecaller dropped only 0.5%. Second, we implemented an end-to-end single-strand basecaller that directly processes the raw signal using an encoder-decoder model with attention instead of the CTC-style approach used in available basecallers. We achieved an accuracy of 81.9% for a viral sample and an accuracy of 90.9% for a bacterial sample. Our accuracy is comparable to that of state-of-the-art basecallers, with a considerably smaller model. | |
dc.identifier.uri | http://hdl.handle.net/10315/36268 | |
dc.language.iso | en | |
dc.rights | Author owns copyright, except where explicitly noted. Please contact the author directly with licensing requests. | |
dc.subject | Computer science | |
dc.subject.keywords | DNA Sequencing | |
dc.subject.keywords | Nanopore Sequencing | |
dc.subject.keywords | Deep Learning | |
dc.subject.keywords | Recurrent Neural Networks | |
dc.subject.keywords | Seq2seq | |
dc.subject.keywords | Attention Mechanism | |
dc.title | An Encoder-Decoder Based Basecaller for Nanopore DNA Sequencing | |
dc.type | Electronic Thesis or Dissertation |
Files
Original bundle
- Name: Abbaszadegan_Mahdieh_2019_Masters.pdf
- Size: 1.78 MB
- Format: Adobe Portable Document Format