Comprehensive transcriptome analysis of the highly complex Pisum sativum genome using next generation sequencing

Publication Overview


Title	Comprehensive transcriptome analysis of the highly complex Pisum sativum genome using next generation sequencing
Authors	Franssen SU, Shrestha RP, Bräutigam A, Bornberg-Bauer E, Weber AP
Type	Journal Article
Journal Name	BMC genomics
Volume	12
Year	2011
Page(s)	227
Citation	Franssen SU, Shrestha RP, Bräutigam A, Bornberg-Bauer E, Weber AP. Comprehensive transcriptome analysis of the highly complex Pisum sativum genome using next generation sequencing. BMC genomics. 2011; 12:227.

Abstract

BACKGROUND
The garden pea, Pisum sativum, is among the best-investigated legume plants and of significant agro-commercial relevance. Pisum sativum has a large and complex genome and accordingly few comprehensive genomic resources exist.

RESULTS
We analyzed the pea transcriptome at the highest possible amount of accuracy by current technology. We used next generation sequencing with the Roche/454 platform and evaluated and compared a variety of approaches, including diverse tissue libraries, normalization, alternative sequencing technologies, saturation estimation and diverse assembly strategies. We generated libraries from flowers, leaves, cotyledons, epi- and hypocotyl, and etiolated and light treated etiolated seedlings, comprising a total of 450 megabases. Libraries were assembled into 324,428 unigenes in a first pass assembly.A second pass assembly reduced the amount to 81,449 unigenes but caused a significant number of chimeras. Analyses of the assemblies identified the assembly step as a major possibility for improvement. By recording frequencies of Arabidopsis orthologs hit by randomly drawn reads and fitting parameters of the saturation curve we concluded that sequencing was exhaustive. For leaf libraries we found normalization allows partial recovery of expression strength aside the desired effect of increased coverage. Based on theoretical and biological considerations we concluded that the sequence reads in the database tagged the vast majority of transcripts in the aerial tissues. A pathway representation analysis showed the merits of sampling multiple aerial tissues to increase the number of tagged genes. All results have been made available as a fully annotated database in fasta format.

CONCLUSIONS
We conclude that the approach taken resulted in a high quality - dataset which serves well as a first comprehensive reference set for the model legume pea. We suggest future deep sequencing transcriptome projects of species lacking a genomics backbone will need to concentrate mainly on resolving the issues of redundancy and paralogy during transcriptome assembly.

Features

This publication contains information about 84,267 features:


Feature Name	Uniquename	Type
JI905697	JI905697.1	region
JI905696	JI905696.1	region
JI905695	JI905695.1	region
JI905694	JI905694.1	region
JI905693	JI905693.1	region
JI905692	JI905692.1	region
JI905691	JI905691.1	region
JI905690	JI905690.1	region
JI905689	JI905689.1	region
JI905688	JI905688.1	region
JI905687	JI905687.1	region
JI905686	JI905686.1	region
JI905685	JI905685.1	region
JI905684	JI905684.1	region
JI905683	JI905683.1	region
JI905682	JI905682.1	region
JI905681	JI905681.1	region
JI905680	JI905680.1	region
JI905679	JI905679.1	region
JI905678	JI905678.1	region
JI905677	JI905677.1	region
JI905676	JI905676.1	region
JI905675	JI905675.1	region
JI905674	JI905674.1	region
JI905673	JI905673.1	region

Pages

Properties

Additional details for this publication include:


Property Name	Value
Journal Country	England
Language	English
Language Abbr	eng
Publication Type	Journal Article
Elocation	10.1186/1471-2164-12-227
Publication Model	Electronic
ISSN	1471-2164
eISSN	1471-2164
Publication Date	2011
Journal Abbreviation	BMC Genomics
DOI	10.1186/1471-2164-12-227
Publication Type	Research Support, Non-U.S. Gov't

Search form

Comprehensive transcriptome analysis of the highly complex Pisum sativum genome using next generation sequencing

Pages