A Non-Traditional Approach to Whole Genome Ultra- Fast, Inexpensive Nanopore-Based Nucleic Acid Sequencing

Special Issue - DNA Sequencing Technology

Austin J Proteomics Bioinform & Genomics. 2015; 2(1): 1012.

A Non-Traditional Approach to Whole Genome Ultra- Fast, Inexpensive Nanopore-Based Nucleic Acid Sequencing

Kanavarioti A*

Yenos Analytical LLC, USA

*Corresponding author: Anastassia Kanavarioti, Yenos Analytical LLC, El Dorado Hills, CA, USA

Received: November 02, 2015; Accepted: November 19, 2015; Published: November 24, 2015

Abstract

DNA sequencing claims responsibility for breakthroughs in understanding the molecular basis of life, and improving quality of life through advances in prognosis, diagnosis, treatment, and cure of disease. The last 30 years have seen an exponential improvement onto the original Sanger sequencing by synthesis, as well as the emergence of new technologies. Still the mandate for cheaper, faster, longer, and more accurate reads hasn’t been satisfied. We are proposing a single molecule approach by combining unassisted nanoporebased sequencing with labeled DNA, where the ion-channel readout of current vs. time (i-t) may represent base sequence. Pyrimidines on DNA are labeled selectively with Osmium tetroxide 2,2’-bipyridine (OsBp) ahead of sequencing. The OsBp label slows down the translocation to detectable levels, and provides base discrimination between labeled deoxythymidine, labeled deoxycytidine, and an intact base. This technology promises to sequence DNA with no limit in strand length, without amplification, and without the use of a processing enzyme; it requires consensus building, but no assembly and no scaffolding. To facilitate consensus building for a human chromosome long DNA, highly repetitive DNA sequences, such as the Alu repeats, may serve as markers. Observed translocation times of a series of osmylated oligos via the wt a-Hemolysin nanopore are exploited to estimate the time it takes (1 hour) to sequence a 100,000,000 bp genome at 128x coverage using one MinION™ device from Oxford Nanopore Technologies. This technology has the potential for mapping protein bound regions in dsDNA, sequencing RNA, as well as identifying methylated and other rare bases.

Keywords: a-Hemolysin Nanopore; DNA sequencing technology; Ionchannel measurements; Osmium tetroxide bipyridine; Whole genome

Abbreviations

C: Cytidine; T: Thymidine; ds: Double Stranded; ss: Single Stranded; i-t: Current vs. Time; OsBp: Osmium Tetroxide 2,2’-Bipyridine; bp: Base-Pairs; nt: Nucleotides; a-HL: a-Hemolysin; SBS: Sequencing By Synthesis

Introduction

DNA sequencing was enabled by the Sanger approach exploiting the enzymatic synthesis of the complementary of a target DNA strand using deoxynucleotide triphosphates and a small amount of the dideoxynucleotides that serve as chain terminators [1]. Each dideoxynucleotide carried a different fluorescent label, so that it was optically identifiable and distinguishable from the others. Since the seminal paper of Sanger in 1977 [1] remarkable technical progress resulted in the 2001 sequencing of the human genome, valued at 2.7 billion in FY 1991 dollars, that took over a decade to complete [2,3]. Cost and analysis time have dramatically decreased since then, but expensive instrumentation, consumables, as well as analysis time are still prohibiting the effort from being routinely implemented [4-6]. The progress is primarily due to engineering advances in miniaturization, parallelization, and computing speed. Most commercial DNA sequencers use the Sequencing by Synthesis (SBS) approach [7-9]. Besides much leverage gained by bioinformatics, the issues still are: (i) the four chemistries behind labeling nucleotides are less than 100% efficient, introducing insertion and deletion errors [9]; (ii) the amplification process of the target DNA, if required ahead of sequencing, has its own limits in amplifying sequence repeats [7]; (iii) library construction, primer incorporation, and amplification add complexity and expense to sample preparation, (iv) the polymerase enzymes typically synthesize up to a few thousand bases of the complementary strand, and then dissociate [8], limiting the maximum output length, and (v) the enzymatic synthesis is a relatively slow process [10] that is further slowed down by the change of reactants between mononucleotide additions. These issues are overshadowed by the fact that the SBS approach yields lengths that are a miniscule fraction of a whole genome, and presents challenges in the analysis of the data by requiring error correction, assembly, and scaffolding bioinformatics [11]. Read length in sequencing directly impacts sequencing accuracy. Specifically a read should be long enough to span a repetitive region in the genome. Practically speaking it takes a village, i.e., a number of scientists with completely different background and skills, to sequence a small genome and, despite all of the de novo assembly efforts, de novo sequencing is still a challenge [12,13].

To move forward a non-traditional approach may be more suitable. One such strategy is using nanopores, with sub 2nm diameter, located within an isolated membrane that separates two compartments filled with electrolyte (Figure 1A). Applying a voltage across the two compartments leads to a constant flow of ions via the nanopore; this flow is partially blocked by the occasional passage of a single molecule through the pore [14]. Numerous studies have explored translocation of single stranded (ss) nucleic acids via the a-Hemolysin pore (a-HL) and show that conductance measurement (i-t) yield current modulation and translocation time (Figure 1B) with sequencing information attributed to nanopore/nucleobase interactions [15-20]. Even though this strategy avoids synthesis of the complementary strand, it is still being explored assisted by an enzyme to slow down the otherwise too fast translocation, and act as a motor to move the DNA strand one base at a time [21]. In this report we review DNA translocation via a-HL, in the absence of an enzyme, and the use of labeled DNA to differentiate the bases, and simultaneously slow down translocation [22,23]. We will discuss what are the potential gains from this “nanopore/labeled DNA” approach, and outline the requirements to make it a preferable alternative to current technologies.