• Title/Summary/Keyword: unigene

Search Result 33, Processing Time 0.032 seconds

Application of Pac-Bio Sequencing, Trinity, and rnaSPAdes Assembly for Transcriptome Analysis in Medicinal Crop Astragalus membranaceus

  • Ji-Nam Kang;Si Myung Lee
    • Proceedings of the Korean Society of Crop Science Conference
    • /
    • 2022.10a
    • /
    • pp.254-254
    • /
    • 2022
  • Astragalus membranaceus (A. membranaceus) has traditionally been used as a medicinal plant in East Asia for the treatment ofvarious diseases. A. membranaceus belongs to the legume family and is known to be rich in substances such as flavonoids and saponins. Recent pharmacological studies of A. membranaceus have shown that the plant has immunomodulatory, anti-oxidant, anti-cancer, and anti-inflammatory effects. However, knowledge of major biosynthetic pathways in A. membranaceu is still lacking. Recently developed sequencing techniques enable high-quality transcriptome analysis in plants, which is recognized as an important part in elucidating the regulatory mechanisms of many plant secondary metabolic pathways. However, it is difficult to predict the number of transcripts because plant transcripts contain a large number of isoforms due to alternative splicing events, which can vary depending on the assembly platform used. In this study, we constructed three unigene sets using Pac-Bio isoform sequencing, Trinity and rnaSPAdes assembly for detailed transcriptome analysis mA. membranaceus. Furthermore, all genes involved in the flavonoid biosynthetic pathway were searched from three unigene sets, and structural comparisons and expression profiles between these genes were analyzed. The isoflavone synthesis was active in most tissues. Flavonol synthesis was mainly active in leaves and flowers, and anthocyanin synthesis was specific in flowers. Gene structural analysis revealed structural differences in the flavonoid-related genes derived from the three unigene sets. This study suggests the need for the application of multiple unigene sets for the analysis of key biosynthetic pathways in plants.

  • PDF

Comparative Analysis of Expressed Sequence Tags from Flammulina velutipes at Different Developmental Stages

  • Joh, Joong-Ho;Kim, Kyung-Yun;Lim, Jong-Hyun;Son, Eun-Suk;Park, Hye-Ran;Park, Young-Jin;Kong, Won-Sik;Yoo, Young-Bok;Lee, Chang-Soo
    • Journal of Microbiology and Biotechnology
    • /
    • v.19 no.8
    • /
    • pp.774-780
    • /
    • 2009
  • Flammulina velutipes is a popular edible basidiomycete mushroom found in East Asia and is commonly known as winter mushroom. Mushroom development showing dramatic morphological changes by different environmental factors is scientifically and commercially interesting. To create a genetic database and isolate genes regulated during mushroom development, cDNA libraries were constructed from three developmental stages of mycelium, primordium, and fruit body in F. velutipes. We generated a total of 5,431 expressed sequence tags (ESTs) from randomly selected clones from the three cDNA libraries. Of these, 3,332 different unique genes (unigenes) were consistent with 2,442 (73%) singlets and 890 (27%) contigs. This corresponds to a redundancy of 39%. Using a homology search in the gene ontology database, the EST unigenes were classified into the three categories of molecular function (28%), biological process (29%), and cellular component (6%). Comparative analysis found great variations in the unigene expression pattern among the three different unigene sets generated from the cDNA libraries of mycelium, primordium, and fruit body. The 19-34% of total unigenes were unique to each unigene set and only 3% were shared among all three unigene sets. The unique and common representation in F. velutipes unigenes from the three different cDNA libraries suggests great differential gene expression profiles during the different developmental stages of F. velutipes mushroom.

Construction of a Full-length cDNA Library from Korean Stewartia (Stewartia koreana Nakai) and Characterization of EST Dataset (노각나무(Stewartia koreana Nakai)의 cDNA library 제작 및 EST 분석)

  • Im, Su-Bin;Kim, Joon-Ki;Choi, Young-In;Choi, Sun-Hee;Kwon, Hye-Jin;Song, Ho-Kyung;Lim, Yong-Pyo
    • Horticultural Science & Technology
    • /
    • v.29 no.2
    • /
    • pp.116-122
    • /
    • 2011
  • In this study, we report the generation and analysis of 1,392 expressed sequence tags (ESTs) from Korean Stewartia (Stewartia koreana Nakai). A cDNA library was generated from the young leaf tissue and a total of 1,392 cDNA were partially sequenced. EST and unigene sequence quality were determined by computational filtering, manual review, and BLAST analyses. Finally, 1,301 ESTs were acquired after the removal of the vector sequence and filtering over a minimum length 100 nucleotides. A total of 893 unigene, consisting of 150 contigs and 743 singletons, was identified after assembling. Also, we identified 95 new microsatellite-containing sequences from the unigenes and classified the structure according to their repeat unit. According to homology search with BLASTX against the NCBI database, 65% of ESTs were homologous with known function and 11.6% of ESTs were matched with putative or unknown function. The remaining 23.2% of ESTs showed no significant similarity to any protein sequences found in the public database. Annotation based searches against multiple databases including wine grape and populus sequences helped to identify putative functions of ESTs and unigenes. Gene ontology (GO) classification showed that the most abundant GO terms were transport, nucleotide binding, plastid, in terms biological process, molecular function and cellular component, respectively. The sequence data will be used to characterize potential roles of new genes in Stewartia and provided for the useful tools as a genetic resource.

Characterization of tissue-specific mbu-3 gene expression in the mouse central nervous system

  • Lee, Chae-Jin;Cho, Eun-Young;Kim, Sun-Jung
    • BMB Reports
    • /
    • v.41 no.12
    • /
    • pp.875-880
    • /
    • 2008
  • Mbu-3 is a novel mouse brain unigene that was identified by digital differential display. In this study, expression of the gene was chased through developmental stages and the protein product was identified in the brain. The cDNA sequence was 3,995-bp long and contained an ORF of 745 AA. Database searches revealed that the chicken SST273 gene containing LRR- and Ig-domain was an mbu-3 orthologue. Tissue specificity for the gene was examined in embryos and in brains at post-natal and adult stages. During the embryonic stages, mbu-3 was localized to the central nervous system in the brain and spinal cord. In the early post-natal stages, the gene was evenly expressed in the brain. However, with aging, expression was confined to specific regions, particularly the hippocampus. The protein was approximately 95 kDa as determined by Western blot analysis of brain extracts.

Construction of a full-length cDNA library from Pinus koraiensis and analysis of EST dataset (잣나무(Pinus koraiensis)의 cDNA library 제작 및 EST 분석)

  • Kim, Joon-Ki;Im, Su-Bin;Choi, Sun-Hee;Lee, Jong-Suk;Roh, Mark S.;Lim, Yong-Pyo
    • Korean Journal of Agricultural Science
    • /
    • v.38 no.1
    • /
    • pp.11-16
    • /
    • 2011
  • In this study, we report the generation and analysis of a total of 1,211 expressed sequence tags (ESTs) from Pinus koraiensis. A cDNA library was generated from the young leaf tissue and a total of 1,211 cDNA were partially sequenced. EST and unigene sequence quality were determined by computational filtering, manual review, and BLAST analyses. In all, 857 ESTs were acquired after the removal of the vector sequence and filtering over a minimum length 50 nucleotides. A total of 411 unigene, consisting of 89 contigs and 322 singletons, was identified after assembling. Also, we identified 77 new microsatellite-containing sequences from the unigenes and classified the structure according to their repeat unit. According to homology search with BLASTX against the NCBI database, 63.1% of ESTs were homologous with known function and 22.2% of ESTs were matched with putative or unknown function. The remaining 14.6% of ESTs showed no significant similarity to any protein sequences found in the public database. Gene ontology (GO) classification showed that the most abundant GO terms were transport, nucleotide binding, plastid, in terms biological process, molecular function and cellular component, respectively. The sequence data will be used to characterize potential roles of new genes in Pinus and provided for the useful tools as a genetic resource.

Analysis of Seed Hair Formation Related Genes by EST Profiling in Carrot (Daucus carota var. sativa) (EST profiling을 통한 당근(Daucus carota var. sativa)의 종모 형성에 관련된 유전자 분석)

  • Hwang, Eun-Mi;Oh, Gyu-Dong;Shim, Eun-Jo;Jeon, Sang-Jin;Park, Young-Doo
    • Horticultural Science & Technology
    • /
    • v.28 no.6
    • /
    • pp.1039-1050
    • /
    • 2010
  • Carrot is one of the useful crops used abundantly in cooking in Western as well as Asia regions such as China and Korea. However, seed coats have hairs which should be removed to increase germination rate. Furthermore, because of seed hairs, farmers face several additional losses, such as time consumption, manpower, capital and so on, for seed handling. To prevent these problems, study of gene related hair formation using short-hair seed lines is required. We analyzed genes related to hair formation from seed through expressed sequenced tag (EST) profiling, based on the fact that the development of carrot seed hair is related to cellulose synthesis pathway in secondary cell wall synthesis stage. To study the gene expression related to hair formation of the carrot seed, a cDNA library was constructed by using the early maturation stage of the short-hair line (659-1) and hairy seed line (677-14). In short-hair (659-1) and hairy seed (677-14) lines, results from of EST profiling through BLASTX search analysis using the NCBI database showed that 172 and 224 unigenes had significant homology with known protein sequences, whereas 233 and 192 unigenes were not, respectively. All ESTs were grouped into 16 categories according to their putative functions. Twenty nine unigenes among all ESTs were considered to be genes regulating seed hair development from cellulose synthesis pathway during secondary cell wall synthesis stage; in results, 14 unigenes related to seed hair development were found only in hairy seed line.

Construction and Characterization of a cDNA Library from the Camelina sativa L. as an Alternative Oil-Seed Crop (신 바이오디젤 원료 작물인 Camelina의 cDNA library 제작 및 유전자 특성)

  • Park, Won;Jang, Young-Seok;Ahn, Sung-Ju
    • KOREAN JOURNAL OF CROP SCIENCE
    • /
    • v.55 no.2
    • /
    • pp.151-158
    • /
    • 2010
  • Camelina sativa L., known as popular names "gold-of-pleasure" or "false flax" is an alternative oilseed crop that can be grown under different climatic and soil conditions. Up to date, however, the genomic information of Camelina has not been studied in detail. Therefore, a cDNA library was constructed and characterized from young leaves. The constructed cDNA library incorporated of 1334 cDNA clones and the size of the insertion fragments average was 736 base pair. We generated a total of 1269 high-quality expressed sequence tags (ESTs) sequences. The result of cluster analysis of EST sequences showed that the number of unigene was 851. According to subsequent analysis, the 476 (55.9%) unigenes were highly homologous to known function genes and the other 375 (44.1%) unigenes were unknown. Remaining 63 (7.4%) unigenes had no homology with any other peptide in NCBI database, indicating that these seemed to be novel genes expressed in leaves of Camelina. The database-matched ESTs were further classified into 17 categories according to their functional annotation. The most abundant of categories were "protein with binding function or cofactor requirement (27%)", "metabolism (11%)", "subcellular localization (11%)", "cellular transport, transport facilities and transport routes (7%)", "energy (6%)", "regulation of metabolism and protein function (6%)". Our result in this study provides an overview of mRNA expression profile and a basal genetic information of Camelina as an oilseed crop.

Comprehensive Expression Analysis of Triterpenoid Biosynthesis Genes Using Pac-Bio Sequencing and rnaSPAdes assembly in Codonopsis lanceolata

  • Ji-Nam Kang;Si Myung Lee;Mi-Hwa Choi;Chang-Kug Kim
    • Proceedings of the Korean Society of Crop Science Conference
    • /
    • 2022.10a
    • /
    • pp.253-253
    • /
    • 2022
  • Codonopsis lanceolata (C. lanceolata) has been widely used in East Asia as a traditional medicine to treat various diseases such as bronchitis, convulsions, cough, obesity, and hepatitis. C. lanceolata belonging to Campanulaceae contains bioactive compounds such as polyphenols, saponins, and steroids. However, despite the pharmacological significance of C. lanceolata, the genetic information of this plant is limited and there are few studies of its transcriptome. In this study, we constructed a unigene set of C. lanceolata using Pac-Bio sequencing. Furthermore, the reads generated from Pac-bio and Illumina sequencing were mixed and assembled using rnaSPAdes. All genes involved in the triterpenoid pathway, a major bioactive compounds of C. lanceolata, were searched from the two unigene sets and the expression profiles of these genes were analyzed. The results showed that lupeol, beta-amyrin, and dammarenediol synthesis genes were activated in the leaves and roots of C. lanceolata. In particular, the expression of genes related to lupeol synthesis was relatively high, suggesting that the main triterpenoid of C. lanceolata is lupeol. Transcriptome studies related to lupeol synthesis in C. lanceolata have been rarely reported. Lupeol has been reported to have pharmacological effects such as anti-inflammatory, anti-cancer, and anti-bacterial. This study suggests the importance of C. lanceolata as a lupeol producing plant.

  • PDF

Analysis of Expressed Sequence Tags from the Red Alga Griffithsia okiensis

  • Lee, Hyoung-Seok;Lee, Hong-Kum;An, Gyn-Heung;Lee, Yoo-Kyung
    • Journal of Microbiology
    • /
    • v.45 no.6
    • /
    • pp.541-546
    • /
    • 2007
  • Red algae are distributed globally, and the group contains several commercially important species. Griffithsia okiensis is one of the most extensively studied red algal species. In this study, we conducted expressed sequence tag (ESTs) analysis and synonymous codon usage analysis using cultured G. okiensis samples. A total of 1,104 cDNA clones were sequenced using a cDNA library made from samples collected from Dolsan Island, on the southern coast of Korea. The clustering analysis of these sequences allowed for the identification of 1,048 unigene clusters consisting of 36 consensus and 1,012 singleton sequences. BLASTX searches generated 532 significant hits (E-value <$10^{-4}$) and via further Gene Ontology analysis, we constructed a functional classification of 434 unigenes. Our codon usage analysis showed that unigene clusters with more than three ESTs had higher GC contents (76.5%) at the third position of the codons than the singletons. Also, the majority of the optimal codons of G. okiensis and Chondrus crispus belonging to Bangiophycidae were G-ending, whereas those of Porphyra yezoensis belonging to Florideophycidae were G-ending. An orthologous gene search for the P. yezoensis EST database resulted in the identification of 39 unigenes commonly expressed in two rhodophytes, which have putative functions for structural proteins, protein degradation, signal transduction, stress response, and physiological processes. Although experiments have been conducted on a limited scale, this study provides a material basis for the development of microarrays useful for gene expression studies, as well as useful information for the comparative genomic analysis of red algae.

Construction of a Full-length cDNA Library from Cardamine manshurica Nakai and Characterization of EST Dataset

  • Im, Subin;Lee, Sung-Ho;Kim, Yoon-Young;Kim, Ju-Sang;Kim, Dasom;Lim, Yong Pyo
    • Korean Journal of Agricultural Science
    • /
    • v.43 no.1
    • /
    • pp.33-39
    • /
    • 2016
  • Brassicaceae consists of important species that have significant amounts of metabolites, and many studies have been carried out in order to understand the mechanism that improves the content of these metabolites. In Brassicacea, Cardamine manshurica Nakai is one of the important edible plants and is rich in oil, fiber, and various nutrients. In this study, we constructed cDNA library using leaves from 4 week-old plants and analyzed the ESTs of C. manshurica Nakai. One thousand thirty-nine ESTs were discovered which assembled to form 468 unigenes. The latter contained 116 contigs and 352 singletons. Similarity search of these ESTs with BLASTX revealed similarities with Arabidopsis thaliana 285 (31.9%), Arabidopsis lyrata 172 (19.3%), Capsella rubella 162 (18.1%), and Eutrema salsugineum 137 (15.3%). ESTs were functionally categorized into molecular function, biological process, and cellular component, and each category took 10.6%, 58.5%, and 30.9%, respectively. The functional analysis also found that 94.9% of ESTs showed at least one GO ID. Microsatellite analysis of 468 unigene sequences revealed 225 structures of which Di-, Tri-, Tetra-, Penta-repeats were 35.6% (80/225), 63.1% (142/225), 0.9% (2/225), and 0.4% (1/225), respectively. The results from our study can be a valuable resource for Cardamine research.