Data from: Shoot transcriptome of the giant reed, Arundo donax

The giant reed, Arundo donax, is a perennial grass species that has become an invasive plant in many countries. Expansive stands of A. donax have significant negative impacts on available water resources, and efforts are underway to identify biological control agents against this species. The giant reed grows under adverse environmental conditions, displaying insensitivity to drought stress, flooding, heavy metals, salinity and herbaceous competition, thus hampering control programs. To establish a foundational molecular dataset, we used an Illumina Hi-Seq protocol to sequence the transcriptome of actively growing shoots from an invasive genotype collected along the Rio Grande River, bordering Texas and Mexico. We report the assembly of 27,491 high confidence transcripts (≥200 bp) with at least 70% coverage of known genes in other Poaceae species. Of these, 13,080 (47.58%), 6,165 (22.43%) and 8,246 (30.0%) transcripts have sequence similarity to known, domain-containing and conserved hypothetical proteins, respectively. We also report 75,590 low confidence transcripts supported by both the Trans-ABySS and Velvet-Oases de novo assembly pipelines. Within the low confidence subset of transcripts we identified partial hits to known (19,021; 25.16%), domain-containing (7,093; 9.38%) and conserved hypothetical (16,647; 22.02%) proteins. Additionally, 32,829 (43.43%) transcripts encode putative hypothetical proteins unique to A. donax. Functional annotation resulted in 5,550 and 6,070 transcripts with assigned Gene Ontology and KEGG pathway information, respectively. The most abundant KEGG pathways are spliceosome, ribosome, ubiquitin mediated proteolysis, plant–pathogen interaction, RNA degradation and oxidative phosphorylation. Furthermore, we found 12, 9, and 4 transcripts annotated as stress-related, heat stress, and water stress proteins, respectively.
It is envisaged that these resources will promote and facilitate studies of the abiotic stress capabilities of this exotic plant species, which facilitate its invasive capacity. Supplemental Excel data files with the article detail the functional annotation of Arundo donax high confidence and low confidence genes. Data are also available at https://www.ncbi.nlm.nih.gov/nuccore/GBRH01000000. The assembled and annotated A. donax USA genotype Rio Grande RNA transcriptome has been deposited at DDBJ/EMBL/GenBank under the project accession PRJNA256910.

Resources in this dataset:
Resource Title: Shoot transcriptome of the giant reed, Arundo donax.
File Name: Web Page, url: https://www.sciencedirect.com/science/article/pii/S2352340914000377
Data in Brief article reporting the assembly of 27,491 high confidence transcripts (≥200 bp) for the giant reed, Arundo donax, with at least 70% coverage of known genes in other Poaceae species.
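
The percentages quoted in the abstract can be recomputed from the transcript counts it reports. The following is a minimal sketch of that arithmetic, not part of the original dataset; the dictionary names and the `shares` helper are hypothetical and introduced only for illustration.

```python
# Transcript counts copied from the abstract; the category labels are
# shorthand for the protein classes named there.
HIGH_CONFIDENCE = {
    "known": 13080,
    "domain-containing": 6165,
    "conserved hypothetical": 8246,
}
LOW_CONFIDENCE = {
    "known": 19021,
    "domain-containing": 7093,
    "conserved hypothetical": 16647,
    "unique hypothetical": 32829,
}


def shares(counts):
    """Return (total, {category: percentage of total}) for a count table."""
    total = sum(counts.values())
    return total, {name: 100.0 * n / total for name, n in counts.items()}


if __name__ == "__main__":
    for label, table in (("high confidence", HIGH_CONFIDENCE),
                         ("low confidence", LOW_CONFIDENCE)):
        total, pct = shares(table)
        print(f"{label}: {total} transcripts")
        for name, value in pct.items():
            print(f"  {name}: {value:.2f}%")
```

Under these assumptions the category counts sum to the two reported totals (27,491 and 75,590), which is a quick consistency check on the figures in the abstract.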

Bibliographic Details
Main Authors: Roberto Barrero (246219), Felix D. Guerrero (17477454), Paula M. Moolhuijzen (11519701), John A. Goolsby (3273768), Jason Tidwell (17478660), Stanley E. Bellgard (8357334), Matthew I. Bellgard (11519719)
Format: Dataset (library)
Published: 2018
Subjects: Crop and pasture production, Genomics and transcriptomics, Genetics, giant reed, Illumina Hi-Seq protocol, RNA de novo assembly, data.gov, ARS
Online Access: https://figshare.com/articles/dataset/Data_from_Shoot_transcriptome_of_the_giant_reed_Arundo_donax/24852591
id dat-usda-us-article24852591
record_format figshare
spelling dat-usda-us-article24852591 2018-01-19T00:00:00Z Dataset 10.1016/j.dib.2014.12.007 https://figshare.com/articles/dataset/Data_from_Shoot_transcriptome_of_the_giant_reed_Arundo_donax/24852591 U.S. Public Domain
institution USDA US
collection Figshare
country United States
countrycode US
component Research data
access Online
databasecode dat-usda-us
tag library
region North America
libraryname National Agricultural Library of USDA
topic Crop and pasture production
Genomics and transcriptomics
Genetics
giant reed
Illumina Hi-Seq protocol
RNA de novo assembly
data.gov
ARS
format Dataset
author Roberto Barrero (246219)
Felix D. Guerrero (17477454)
Paula M. Moolhuijzen (11519701)
John A. Goolsby (3273768)
Jason Tidwell (17478660)
Stanley E. Bellgard (8357334)
Matthew I. Bellgard (11519719)
title Data from: Shoot transcriptome of the giant reed, Arundo donax
publishDate 2018
url https://figshare.com/articles/dataset/Data_from_Shoot_transcriptome_of_the_giant_reed_Arundo_donax/24852591