Complete Chloroplast Genome assembly and Annotation of Milk Thistle (Silybum marianum) and Phylogenetic Analysis

  • Hwajin Jung (Genomics Division, National Institute of Agricultural Sciences, RDA) ;
  • Yedomon Ange Bovys Zoclanclounon (Genomics Division, National Institute of Agricultural Sciences, RDA) ;
  • Jeongwoo Lee (Genomics Division, National Institute of Agricultural Sciences, RDA) ;
  • Taeho Lee (Genomics Division, National Institute of Agricultural Sciences, RDA) ;
  • Jeonggu Kim (Genomics Division, National Institute of Agricultural Sciences, RDA) ;
  • Guhwang Park (Genomics Division, National Institute of Agricultural Sciences, RDA) ;
  • Keunpyo Lee (Genomics Division, National Institute of Agricultural Sciences, RDA) ;
  • Kwanghoon An (EL&I, Co, Ltd.) ;
  • Jeehyoung Shim (EL&I, Co, Ltd.) ;
  • Joonghyoun Chin (Department of Integrative Biological Sciences and Industry, Sejong University) ;
  • Suyoung Hong (Genomics Division, National Institute of Agricultural Sciences, RDA)
  • Published : 2022.10.13

Abstract

Silybum marianum is an annual or biennial plant from the Asteraceae family. It can grow in low-nutrient soil and drought conditions, making it easy to cultivate. From the seed, a specialized plant metabolite called silymarin (flavonolignan complex) is produced and is known to alleviate the liver from hepatitis and toxins damages. To infer the phylogenetic placement of a Korean milk thistle, we conducted a chloroplast assembly and annotation following by a comparison with existing Chinese reference genome (NC_028027). The chloroplast genome structure was highly similar with an assembly size of 152,642 bp, an 153,202 bp for Korean and Chinese milk thistle respectively. Moreover, there were similarities at the gene level, coding sequence (n = 82), transfer RNA (n = 31) and ribosomal RNA (n = 4). From all coding sequences gene set, the phylogenetic tree inference placed the Korean cultivar into the milk thistle clade; corroborating the expected tree. Moreover, an investigation the tree based only on the ycf1 gene confirmed the same tree; suggesting that ycf1 gene is a potential marker for DNA barcoding and population diversity study in milk thistle genus. Overall, the provided data represents a valuable resource for population genomics and species-centered determination since several species have been reported in the Silybum genus.

Keywords

Acknowledgement

This research was supported by Rural Development Administration of South Korea, under the project No. PJ015988.