Fast randomized approximate string matching with succinct hash data structures / Policriti, Alberto; Prezza, Nicola. - In: BMC BIOINFORMATICS. - ISSN 1471-2105. - 16 (Suppl 9):S4(2015), pp. 1-8. [10.1186/1471-2105-16-S9-S4]
Fast randomized approximate string matching with succinct hash data structures
PREZZA, Nicola
2015
Abstract
The high throughput of modern NGS sequencers, coupled with the huge sizes of the genomes currently analysed, poses ever greater algorithmic challenges for aligning short reads quickly and accurately against a reference sequence. A crucial additional requirement is that the data structures used be lightweight. Available solutions are usually a compromise between these constraints: in particular, indexes based on the Burrows-Wheeler transform offer reduced memory requirements at the price of lower sensitivity, while hash-based text indexes guarantee high sensitivity at the price of significant memory consumption.

File | Access | Type | License | Size | Format |
---|---|---|---|---|---|
BMC fast randomized.pdf | Open Access | Publisher's version | DRM (digital rights management) not defined | 572.34 kB | Adobe PDF |