BBEST 2011
Resume:365-1


Poster (Painel)
365-1BAC combined with overgos gene-rich strategies for sugarcane genome sequencing
Authors:NUNES, S.L (IQ/USP - Instituto de Bioquimica, Universidade Sao Paulo) ; CAMPOS RA (IQ/USP - Instituto de Bioquimica, Universidade Sao Paulo) ; ALMEIDA, A.S.S.A. (IQ/USP - Instituto de Bioquimica, Universidade Sao Paulo) ; NISHIYAMA, M.Y.J (IQ/USP - Instituto de Bioquimica, Universidade Sao Paulo) ; KIM, C (GRA - University of Georgia) ; SETTA, N (ICB - Instituto de Biociencias, Universidade de Sao Paulo, Brazil) ; LEMBKE, C.G (IQ/USP - Instituto de Bioquimica, Universidade Sao Paulo) ; SOUZA, GM (IQ/USP - Instituto de Bioquimica, Universidade Sao Paulo)

Resume

Sugarcane is a major feedstock used for biofuel production worldwide. Sequencing of sugarcane genome is a key step for the development of the of molecular biology tools that will provide genes and promoters to the use in the generation of transgenic varieties and molecular markers for breeding. Hybridization using overgo probes is an established approach for screening arrayed bacterial artificial chromosome (BAC) libraries. We have improved the use of overgos by increasing the yield of positive clones without use radioisotopes, instead overgo gene-rich regions for construct BAC sugarcane libraries. Also, a total of 6,021 overgos hybridized to sugarcane BACs were extracted from the BACMan database with corresponding nucleotide sequences. The overgos were blasted against sorghum, rice, and selected sugarcane EST clusters. Since an overgo is a short sequence (40 nucleotides). Aiming to increase accuracy and generate quality data for assembly in one-contig sugarcane BAC, with overgo approach we combine a technique called pooling BAC, for the use of the same space physical gasket, with specially designed adapters for sequence connection and identification called Mid-tag (Multiplex Identifiers). The Mid-tag had unique sequence for identification and fishing each BAC sequence on gasket. We obtained a first race in 454 Genome Sequencer FLX System (Roche), with 18 sugarcane BACs sequenced. Resulting in 1,169,584 sequences considered good quality (Genome Analyzer 454, Roche), 495 bp average size, resulting 377 Mbp of high quality. All generated sequences were assembled with PHRAP Program. Assemblies generated one-contig size of average 110 kb. The BACs had 150 kb size, but contains vector-related sequences (E. coli ≤ 1% and vector pBeloBAC11 ≤ 10%), trimmed. Genes analysis in AUGUSTUS Predictor Pipeline conclude that approximately 30% of BACs selected using overgos contained target genes sequences, others 20% of overgos BACs had genes from the same structural family or orthologous genes.


Keyword:  BAC, gene-rich, Mid-tag, overgos, pBeloBAC11