شناسایی مناطق ژنومی کاندیدای مرتبط با وزن تولد در گوسفندان لری-بختیاری با استفاده از الگوریتم گرادیان بوستینگ

نوع مقاله : مقاله پژوهشی

نویسندگان

1 استاد، دانشکده کشاورزی و منابع طبیعی دانشگاه تهران

2 گروه علوم دامی ، دانشگاه تهران

3 گروه علوم دامی، دانشگاه تهران

4 گروه علوم دامی، دانشگاه بوعلی همدان

5 گروه علوم دامی، دانشگاه تربیت مدرس

چکیده

تا به امروز بیشتر پیشرفتهای ژنتیکی مربوط به صفات کمی، با استفاده از ارزش اصلاحی برآورد شده از
اطلاعات فنوتیپی صفات به دست آمده است. وزن تولد به عنوان یک صفت کمی و اقتصادی در نژادهای گوسفند گوشتی ایرانی در نظر گرفته می‌شود. این مطالعه با هدف شناسایی جایگاه‌های ژنومیکی مرتبط با وزن تولد در گوسفند نژاد لری-بختیاری با استفاده از الگوریتم گرادیان بوستینگ انجام شد. در مجموع ۱۳۲ رأس گوسفند لری-بختیاری بر اساس ارزش‌های اصلاحی برای وزن بدن انتخاب شدند. حیوانات مورد مطالعه با استفاده از تراشه های 50K گوسفندی شرکت ایلومینا، تعیین ژنوتیپ شدند. برنامه R برای تجزیه و تحلیل گرادیان بوستینگ استفاده شد. نتایج نشان داد که گرادیان بوستینگ روشی کارآمد برای شناسایی SNPهای مؤثر با نقش‌های عملکردی در وزن تولد است. از بین 32 ژن کاندید شناسایی شده، ژن های KAKN1، ZNF277 و DOCK4 با SNPهای دارای بالاترین رتبه مرتبط بودند، که میتوان آنها را به عنوان ژن های مؤثر برای صفات وزن معرفی کرد. نتایج هستی شناسی ژن، منجر به شناسایی ژن های کاندید بیشتری از جمله ARHGAP10، GDNF، AGO2، LZTFL1 و APOD شد. در تحلیل شبکه ژنی، ارتباط عملکردی بین این ژن‌ها و نقش بالقوه آنها به عنوان ژن‌های کاندید مؤثر بر وزن تولد، مورد تأیید قرار گرفت. به طور کلی، یافته‌های ما بینش‌های جدیدی در مورد معماری ژنتیکی وزن تولد در گوسفندان ارائه می‌دهد و اهداف مهمی را برای برنامه‌های بهبود ژنتیکی در گوسفندان گوشتی با استفاده از الگوریتم گرادیان بوستینگ پیشنهاد می‌کند.

کلیدواژه‌ها

موضوعات


عنوان مقاله [English]

Identification of candidate genomic regions associated with birth weight in Lori-Bakhtiari sheep using gradient boosting algorithm

نویسندگان [English]

  • mohammad moradi shahrbabak 1
  • Maryam Ghasemi 2
  • Hossain Moradi Shahrbabak 3
  • Pouya Zamani 4
  • Mohammad Hossain Moradi 2
  • Hossain Banisaadat 5
1 Professor, Department of Animal Science, College of Agriculture and Natural Resources, University of Tehran
2 Department of Animal Science< University of Tehran
3 Department of Animal Science, University of Tehran
4 Department of Animal Science, , University of Bu Ali Sina
5 Department of Animal Science, University of Tarbiatmodares
چکیده [English]

Introduction
Growth traits are quantitative traits controlled by a large number of genes with small effects, each of which has an additive effect on the phenotype. Today, machine learning methods have been used in animal science not only in GWAS but also in other important topics such as image processing, genomic evaluation, and prediction of important traits. Machine learning methods and their algorithms such as deep learning, random forest, support vector machine, and boosting have been introduced into GWAS topics. The advantages of machine learning methods are their high potential and efficiency, especially for large-volume data and estimation of non-additive effects such as dominance and epistasis, as well as the investigation of complex relationships between variables (such as interactions between markers). Therefore, the aim of this study is to use new technologies, such as genome-wide association studies (GWAS) along with artificial intelligence-based methods such as gradient boosting, to identify mutations in loci associated with growth traits in Lori-Bakhtiari sheep.
Materials and Methods
For the present study, blood samples were collected from 132 Lori-Bakhtiari sheep. Genomic DNA was extracted from whole blood using the Cinagene kit (Cinagene Company, Iran). The samples were genotyped using Illumina 50K Ovine SNPChip, which included 51,135 SNP markers. The gbm package in R software was used to implement the GB method. All markers were ranked based on their importance after applying the gradient boosting method based on the best combination of parameters. After identifying these markers, the genes reported within ± 500 kb of the top 50 SNPs were identified using the NCBI and Ensembl databases from the sheep reference genome Oar_v4.0. To explore the biological and functional processes of the identified genes, ontology studies were performed using the DAVID online database. Finally, gene network analyses were performed using Cytoscape software.
Results and Discussion
Genetic parameters and birth weight breeding values were estimated in this study. After quality control, a total of 44,796 SNP markers from 122 sheep remained for final analysis. Manhattan plots were drawn for birth weight and genome-wide SNP values using relative influence. The distribution of ranked SNPs (from most important to least important) was shown based on GB analysis. Our results showed that 2,509 SNPs (5/6%) had positive effects and 42,287 SNPs (94/4%) had neutral effects. The results of the investigation of genes reported within or adjacent to the 50 significant SNP markers identified in the present study led to the identification of 32 candidate genes. Gene ontology analysis of the 50 significant SNP markers and 32 candidate genes identified showed significant associations between the identified ontology pathways and biological processes related to weight gain. Among the top 10 SNPs and the genes identified around them using gradient boosting, there are 4 important genes including KANK1, ZNF277, DOCK4, and APOD, which are involved in the biological processes of body weight traits.
Conclusion
In this study, The top 10 SNPs were identified and introduced as potential essential SNPs for analysis. 32 candidate genes for birth weight, some of which are introduced for the first time. The results of the present study can provide valuable information for future research to identify genes, precisely determine candidate genomic regions for important economic traits such as birth weight, and better understand biological mechanisms using gradient boosting algorithm in Iranian sheep breeds.

کلیدواژه‌ها [English]

  • Body weight
  • single nucleotide polymorphism
  • candidate gene
  • Gradient bossing
  • genome-wide association studies