DOI QR코드

DOI QR Code

EfficientNet-B0 outperforms other CNNs in image-based five-class embryo grading: a comparative analysis

  • Vincent Jaehyun Shim (Cellular Reprogramming and Embryo Biotechnology Laboratory, Dental Research Institute, Seoul National University School of Dentistry) ;
  • Hosup Shim (Department of Nanobiomedical Science, Dankook University) ;
  • Sangho Roh (Cellular Reprogramming and Embryo Biotechnology Laboratory, Dental Research Institute, Seoul National University School of Dentistry)
  • Received : 2024.11.28
  • Accepted : 2024.12.16
  • Published : 2024.12.31

Abstract

Background: Evaluating embryo quality is crucial for the success of in vitro fertilization procedures. Traditional methods, such as the Gardner grading system, rely on subjective human assessment of morphological features, leading to potential inconsistencies and errors. Artificial intelligence-powered grading systems offer a more objective and consistent approach by reducing human biases and enhancing accuracy and reliability. Methods: We evaluated the performance of five convolutional neural network architectures-EfficientNet-B0, InceptionV3, ResNet18, ResNet50, and VGG16- in grading blastocysts into five quality classes using only embryo images, without incorporating clinical or patient data. Transfer learning was applied to adapt pretrained models to our dataset, and data augmentation techniques were employed to improve model generalizability and address class imbalance. Results: EfficientNet-B0 outperformed the other architectures, achieving the highest accuracy, area under the receiver operating characteristic curve, and F1-score across all evaluation metrics. Gradient-weighted Class Activation Mapping was used to interpret the models' decision-making processes, revealing that the most successful models predominantly focused on the inner cell mass, a critical determinant of embryo quality. Conclusions: Convolutional neural networks, particularly EfficientNet-B0, can significantly enhance the reliability and consistency of embryo grading in in vitro fertilization procedures by providing objective assessments based solely on embryo images. This approach offers a promising alternative to traditional subjective morphological evaluations.

Keywords

Acknowledgement

Thank to Jihye Park for her valuable assistance in creating and refining the figures presented in this study.

References

  1. Alpha Scientists in Reproductive Medicine and ESHRE Special Interest Group of Embryology. 2011. The Istanbul consensus workshop on embryo assessment: proceedings of an expert meeting. Hum. Reprod. 26:1270-1283. https://doi.org/10.1093/humrep/der037
  2. Baxter Bendus AE, Mayer JF, Shipley SK, Catherino WH. 2006. Interobserver and intraobserver variation in day 3 embryo grading. Fertil. Steril. 86:1608-1615. https://doi.org/10.1016/j.fertnstert.2006.05.037
  3. Esteva A, Kuprel B, Novoa RA, Ko J, Swetter SM, Blau HM, Thrun S. 2017. Dermatologist-level classification of skin cancer with deep neural networks. Nature 542:115-118. (Erratum published 2017, Nature 28: 686) https://doi.org/10.1038/nature21056
  4. Gardner DK Lane M, Stevens J, Schlenker T, Schoolcraft WB. 2000. Blastocyst score affects implantation and pregnancy outcome: towards a single blastocyst transfer. Fertil. Steril. 73:1155-1158. https://doi.org/10.1016/S0015-0282(00)00518-5
  5. Gardner DK and Schoolcraft WB. 1999. Culture and transfer of human blastocysts. Curr. Opin. Obstet. Gynecol. 11:307-311. https://doi.org/10.1097/00001703-199906000-00013
  6. Guerif F, Lemseffer M, Bidault R, Gasnier O, Saussereau MH, Cadoret V, Jamet C, Royere D. 2009. Single day 2 embryo versus blastocyst-stage transfer: a prospective study integrating fresh and frozen embryo transfers. Hum. Reprod. 24:1051-1058. https://doi.org/10.1093/humrep/dep018
  7. Gulshan V, Peng L, Coram M, Stumpe MC, Wu D, Narayanaswamy A, Venugopalan S, Widner K, Madams T, Cuadros J, Kim R, Raman R, Nelson PC, Mega JL, Webster DR. 2016. Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. JAMA 316:2402-2410. https://doi.org/10.1001/jama.2016.17216
  8. He K, Zhang X, Ren S, Sun J. 2015. Deep residual learning for image recognition. arXiv. https://doi.org/10.48550/arXiv.1512.03385
  9. Khosravi P, Kazemi E, Zhan Q, Malmsten JE, Toschi M, Zisimopoulos P, Sigaras A, Lavery S, Cooper LAD, Hickman C, Meseguer M, Rosenwaks Z, Elemento O, Zaninovic N, Hajirasouliha I. 2019. Deep learning enables robust assessment and selection of human blastocysts after in vitro fertilization. NPJ Digit. Med. 2:21. https://doi.org/10.1038/s41746-019-0096-y
  10. LeCun Y, Bengio Y, Hinton G. 2015. Deep learning. Nature 521: 436-444. https://doi.org/10.1038/nature14539
  11. Manna C, Nanni L, Lumini A, Pappalardo S. 2013. Artificial intelligence techniques for embryo and oocyte classification. Reprod. Biomed. Online 26:42-49. https://doi.org/10.1016/j.rbmo.2012.09.015
  12. Paternot G, Wetzels AM, Thonon F, Vansteenbrugge A, Willemen D, Devroe J, Debrock S, D'Hooghe TM, Spiessens C. 2011. Intra- and interobserver analysis in the morphological assessment of early stage embryos during an IVF procedure: a multicentre study. Reprod. Biol. Endocrinol. 9:127. https://doi.org/10.1186/1477-7827-9-127
  13. Rawat W and Wang Z. 2017. Deep convolutional neural networks for image classification: a comprehensive review. Neural Comput. 29:2352-2449. https://doi.org/10.1162/neco_a_00990
  14. Salih M, Austin C, Warty RR, Tiktin C, Rolnik DL, Momeni M, Rezatofighi H, Reddy S, Smith V, Vollenhoven B, Horta F. 2023. Embryo selection through artificial intelligence versus embryologists: a systematic review. Hum. Reprod. Open 2023:hoad031. https://doi.org/10.1093/hropen/hoad031
  15. Selvaraju PR, Cogswell M, Das A, Vedantam R, Parikh D, Batra D. 2017. Grad-CAM: visual explanations from deep networks via gradient-based localization: Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV). Venice: IEEE.
  16. Simonyan K and Zisserman A. 2014. Very deep convolutional networks for large-scale image recognition. arXiv. https://doi.org/10.48550/arXiv.1409.1556
  17. Steptoe PC and Edwards RG. 1978. Birth after the reimplantation of a human embryo. Lancet 2:366. https://doi.org/10.1016/S0140-6736(78)92957-4
  18. Szegedy C, Vanhoucke V, Ioffe S, Shlens J, Wojna Z. 2015. Rethinking the inception architecture for computer vision. arXiv. https://doi.org/10.48550/arXiv.1512.00567
  19. Tan M and Le QV. 2019. EfficientNet: rethinking model scaling for convolutional neural networks. arXiv. https://doi.org/10.48550/arXiv.1905.11946
  20. Thirumalaraju P, Kanakasabapathy MK, Bormann CL, Gupta R, Pooniwala R, Kandula H, Souter I, Dimitriadis I, Shafiee H. 2021. Evaluation of deep convolutional neural networks in classifying human embryo images based on their morphological quality. Heliyon 7:e06298. https://doi.org/10.1016/j.heliyon.2021.e06298
  21. Tran D, Cooke S, Illingworth PJ, Gardner DK. 2019. Deep learning as a predictive tool for fetal heart pregnancy following time-lapse incubation and blastocyst transfer. Hum. Reprod. 34:1011-1018. https://doi.org/10.1093/humrep/dez064
  22. VerMilyea M, Hall JMM, Diakiw SM, Johnston A, Nguyen T, Perugini D, Miller A, Picou A, Murphy AP, Perugini M. 2020. Development of an artificial intelligence-based assessment model for prediction of embryo viability using static images captured by optical light microscopy during IVF. Hum. Reprod. 35:770-784. https://doi.org/10.1093/humrep/deaa013
  23. Zaninovic N and Rosenwaks Z. 2020. Artificial intelligence in human in vitro fertilization and embryology. Fertil. Steril. 114:914-920. https://doi.org/10.1016/j.fertnstert.2020.09.157