The Musa acuminata genome sequence, a new template for banana genetics

We produced a reference genome sequence of banana. The Musa accession selected for sequencing is a doubled haploid of the accession 'Pahang' (DH-Pahang), which belongs to Musa acuminata (AA genome), subspecies malaccensis. We generated 20.5x coverage of the 523 Mb DH-Pahang genome (size estimated by flow cytometry) using paired and single 454 reads, complemented by Sanger plasmid-end and BAC-end sequences. An additional ~50x coverage of Illumina shotgun data was produced to ensure a high-quality draft sequence. The assembly, performed with the Newbler software, covers 472.2 Mb, representing 90% of the DH-Pahang genome. A high-density genetic map built from SSR and DArT markers enabled anchoring 70% of the assembly to 11 pseudo-chromosomes. We identified 36,542 protein-coding gene models using a reconciliation approach that takes ESTs, protein sequences and ab initio predictions as input. Progress on the characterization of the sequence will be presented, considering its potential to support Musa genetics and breeding. This project is carried out within the framework of the Global Musa Genomics Consortium, with the participation of its partners.
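
The figures quoted in the abstract can be cross-checked with simple arithmetic. The sketch below is only an illustration: the function and constant names are hypothetical, and the anchored length and the total 454 read volume are values derived here from the reported percentages and coverage, not numbers stated in the abstract.

```python
# Values taken from the abstract.
GENOME_SIZE_MB = 523.0     # DH-Pahang genome size, estimated by flow cytometry
ASSEMBLY_MB = 472.2        # total length of the Newbler assembly
ANCHORED_FRACTION = 0.70   # share of the assembly anchored to pseudo-chromosomes
COVERAGE_454 = 20.5        # coverage from 454 reads plus Sanger end sequences


def assembly_fraction(assembly_mb: float, genome_mb: float) -> float:
    """Fraction of the estimated genome size covered by the assembly."""
    return assembly_mb / genome_mb


def anchored_mb(assembly_mb: float, fraction: float) -> float:
    """Assembly length (Mb) placed on pseudo-chromosomes (derived value)."""
    return assembly_mb * fraction


def sequenced_gb(coverage: float, genome_mb: float) -> float:
    """Approximate read volume (Gb) implied by a given coverage (derived value)."""
    return coverage * genome_mb / 1000.0


if __name__ == "__main__":
    print(f"Assembly covers {assembly_fraction(ASSEMBLY_MB, GENOME_SIZE_MB):.1%} of the genome")
    print(f"Anchored sequence: about {anchored_mb(ASSEMBLY_MB, ANCHORED_FRACTION):.1f} Mb")
    print(f"Read volume at {COVERAGE_454}x: about {sequenced_gb(COVERAGE_454, GENOME_SIZE_MB):.1f} Gb")
```

The first ratio agrees with the 90% reported in the abstract; the other two values are rough estimates, since the abstract gives the anchored share and the coverage only as rounded figures.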

Bibliographic Details
Main Authors: D'Hont, Angélique, Wincker, Patrick
Format: conference_item biblioteca
Language: eng
Published: s.n.
Subjects: F30 - Plant genetics and breeding
Online Access: http://agritrop.cirad.fr/561780/