Stone pine (Pinus Pinea L.) is an emblematic tree currently distributed around the whole Mediterranean basin. The species is well known for the economics of its timber and its seeds, the pine nuts, which are commercialized in food industry. The Iberian Peninsula accounts for the 75% of the world stone pine distribution, being Portugal the second country with the largest area of stone pine, after Spain.
Despite its relevance, the genomic information available for the species is scarce due to the limited studies and data available. Until now no reference genome available. However, advances in sequencing technologies, such as RNAseq, have allowed the characterization of the transcriptome even for species with no reference genome available. The main purpose of this study was to characterize the stone pine transcriptome of seven tissues (needles, xylem, stem bark, terminal bud, first and second year pine cone and pine nuts) by performing a de novo transcriptome assembly. A transcriptome comprising 165,179 contigs with an accumulative size of 81,31 Mbps was generated. A total of 55,328 genes were predicted, which from 50,161 (90.66%) genes had at least one homologous hit against the SWISS-PROT and/or non-redundant NCBI databases. Additionally, InterPro signatures were assigned identifying protein domains and associating GO terms and KEGG pathways. Together, these results represent the first transcriptome characterization of multiple tissues of stone pine. The assembled and annotated sequences will be deposited in a public repository.
Further analysis are ongoing to identify differences in gene expression between tissues.
Acknowledgements: This work was co-financed by Program Alentejo 2020, through the European Fund for Regional Development under the scope SelectPinea- Development of genetic markers for relevant traits in stone pine (ALT20-03-0145-FEDER-000041). Authors also acknowledge FCT for Contrato – Programa to L. Marum (CEECINST/00131/2018) and FCT for UIDB/05183/2020