SequencErr: measuring and suppressing sequencer errors in next-generation sequencing data
Tuesday, 2021/07/06 | 06:21:58

Eric M Davis, Yu Sun, Yanling Liu, Pandurang Kolekar, Ying Shao, Karol Szlachta, Heather L Mulder, Dongren Ren, Stephen V Rice, Zhaoming Wang, Joy Nakitandwe, Alexander M Gout, Bridget Shaner, Salina Hall, Leslie L Robison, Stanley Pounds, Jeffery M Klco, John Easton, Xiaotu Ma.

 

Genome Biol.; 2021 Jan 25; 22(1):37.  doi: 10.1186/s13059-020-02254-2.

 

Background: There is currently no method to precisely measure the errors that occur in the sequencing instrument/sequencer, which is critical for next-generation sequencing applications aimed at discovering the genetic makeup of heterogeneous cellular populations.

 

Results: We propose a novel computational method, SequencErr, to address this challenge by measuring the base correspondence between overlapping regions in forward and reverse reads. An analysis of 3777 public datasets from 75 research institutions in 18 countries revealed the sequencer error rate to be ~ 10 per million (pm) and 1.4% of sequencers and 2.7% of flow cells have error rates > 100 pm. At the flow cell level, error rates are elevated in the bottom surfaces and > 90% of HiSeq and NovaSeq flow cells have at least one outlier error-prone tile. By sequencing a common DNA library on different sequencers, we demonstrate that sequencers with high error rates have reduced overall sequencing accuracy, and removal of outlier error-prone tiles improves sequencing accuracy. We demonstrate that SequencErr can reveal novel insights relative to the popular quality control method FastQC and achieve a 10-fold lower error rate than popular error correction methods including Lighter and Musket.
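The core measurement described above, base correspondence within the region where the forward and reverse reads overlap, can be illustrated with a minimal Python sketch. This is not the SequencErr implementation: the function names, the assumption that the overlap length is already known, and the choice to skip N bases are all illustrative assumptions.

```python
from typing import Tuple

# Hypothetical helpers for illustration; not taken from SequencErr.
_COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def count_discordant(forward: str, reverse: str, overlap: int) -> Tuple[int, int]:
    """Return (mismatches, compared bases) over the overlapping region.

    Assumes the last `overlap` bases of the forward read and the first
    `overlap` bases of the reverse-complemented reverse read cover the
    same positions of the DNA fragment. Positions with an N are skipped.
    """
    if overlap < 0 or overlap > min(len(forward), len(reverse)):
        raise ValueError("overlap must fit inside both reads")
    if overlap == 0:
        return 0, 0
    fwd_part = forward[-overlap:]
    rev_part = reverse_complement(reverse)[:overlap]
    mismatches = 0
    compared = 0
    for f, r in zip(fwd_part, rev_part):
        if f == "N" or r == "N":
            continue  # no-calls carry no information about discordance
        compared += 1
        if f != r:
            mismatches += 1
    return mismatches, compared


def discordance_per_million(mismatches: int, compared: int) -> float:
    """Discordant bases per million compared bases (0.0 if none compared)."""
    if compared == 0:
        return 0.0
    return mismatches * 1_000_000 / compared
```

For a 12-base fragment sequenced with two 8-base reads, the overlap is 4 bases, so `count_discordant(fragment[:8], reverse_complement(fragment[4:]), 4)` compares exactly those 4 positions. How the published method locates the overlap and normalizes the rate is not stated in this abstract.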

 

Conclusions: Our study reveals novel insights into the nature of DNA sequencing errors incurred on DNA sequencers. Our method can be used to assess, calibrate, and monitor sequencer accuracy, and to computationally suppress sequencer errors in existing datasets.

 

See: https://pubmed.ncbi.nlm.nih.gov/33487172/

Figure 1: Measuring sequencer error rates. (a, b) Reference DNA method, where large amounts of reference DNA are needed. This can be achieved by starting from (a) small amounts of DNA/cells (to minimize inter-molecule/cell genetic heterogeneity) followed by a large number of PCR cycles and sequencing. Alternatively, we can start from (b) large amounts of starting DNA/cells followed by a small number of PCR cycles (to minimize PCR errors) and sequencing. In both approaches, mutations/PCR errors (red dots) before sequencing can confound the sequencer error rate estimate (red triangles). (c) We interrogate the sequencer errors by focusing on discordant bases between forward and reverse reads of the same DNA segment within the overlapping regions. Such mismatches must have happened in the sequencer. (d) Public datasets produced by HiSeq, NextSeq, and NovaSeq as of December 2019. Datasets without proper read names, with very small sizes, or with very short reads (so that overlap is minimal) are not suitable for our analysis (see the “Methods” section). HiSeq has the most suitable datasets and we downloaded and analyzed ~50% of these. (e-g) Tile-level error rate across representative sequencers for (e) HiSeq, (f) NextSeq, and (g) NovaSeq. In each panel, a “good” sequencer (top) is illustrated with a “problematic” sequencer (bottom), where sequencer identifiers are indicated on the right. (h) Comparison of overall error rate (oER) and sequencer error rate (with or without computational error suppression) measurements on a common DNA library (generated by PCR enzymes Kapa and Q5) sequenced by two sequencing providers (St. Jude Children’s Research Hospital Computational Biology Genomics Laboratory (SJ) and HudsonAlpha Institute of Biotechnology (HAIB)), with two different NovaSeq sequencers. Tile arrangements are determined according to vendor documentation (see the “Methods” section). Tile-level error rates are capped at 200 per million for visualization purposes.
*** Significant Wilcoxon rank-sum test (two-sided) P value (< 0.01); n.s., not significant (P > 0.01).
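The caption notes that datasets need proper read names and that tile-level rates are capped at 200 per million for display. A rough Python sketch of how per-tile rates might be aggregated is given below. It assumes the common Illumina read-name layout `instrument:run:flowcell:lane:tile:x:y`; the function names and the input record shape (read name, mismatches, compared bases) are hypothetical, and the paper's own parsing and outlier criteria are not described here.

```python
from collections import defaultdict
from typing import Dict, Iterable, Optional, Tuple

TileKey = Tuple[str, int, int]  # (flow cell ID, lane, tile)


def parse_tile(read_name: str) -> Optional[TileKey]:
    """Extract (flow cell, lane, tile) from an Illumina-style read name.

    Assumes seven colon-separated fields before any whitespace:
    instrument:run:flowcell:lane:tile:x:y. Returns None otherwise.
    """
    parts = read_name.split()
    if not parts:
        return None
    fields = parts[0].lstrip("@").split(":")
    if len(fields) != 7:
        return None
    try:
        return fields[2], int(fields[3]), int(fields[4])
    except ValueError:
        return None


def tile_error_rates(
    records: Iterable[Tuple[str, int, int]]
) -> Dict[TileKey, float]:
    """Aggregate (read name, mismatches, compared) records per tile.

    Returns discordant bases per million compared bases for each tile.
    Records whose names cannot be parsed are skipped.
    """
    totals = defaultdict(lambda: [0, 0])
    for name, mismatches, compared in records:
        key = parse_tile(name)
        if key is None:
            continue
        totals[key][0] += mismatches
        totals[key][1] += compared
    return {k: m * 1_000_000 / n for k, (m, n) in totals.items() if n > 0}


def cap_for_display(rates: Dict[TileKey, float], cap: float = 200.0) -> Dict[TileKey, float]:
    """Cap rates for plotting, mirroring the 200 per million cap in the caption."""
    return {k: min(r, cap) for k, r in rates.items()}
```

Capping affects only the plotted values; the uncapped dictionary from `tile_error_rates` would still be the one to inspect when looking for error-prone tiles.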
