Comparison of Galaxy and Unix tools for analyzing the exome sequencing data from syndactyly abnormalities

Nguyen Thy Ngoc , Huynh Minh Huong
Author affiliations

Authors

  • Nguyen Thy Ngoc University of Science and Technology of Hanoi, Vietnam Academy of Science and Technology, 18 Hoang Quoc Viet, Càu Giay, Ha Noi, Viet Nam https://orcid.org/0000-0002-3181-9209
  • Huynh Minh Huong University of Science and Technology of Hanoi, Vietnam Academy of Science and Technology. 18 Hoang Quoc Viet, Cua Giay, Hà Noi, Viet Nam

DOI:

https://doi.org/10.15625/2525-2518/20054

Keywords:

Galaxy, Exome sequencing, High performance computing, syndactyly, pipeline

Abstract

Syndactyly is a congenital limb abnormality, which manifests as the fusion of digits due to incomplete separation during embryonic development, and its pathogenesis involves intricate genetic and molecular processes. Since Exome sequencing has gained widespread utilization as an invaluable tool for exploring genetic disorders during prenatal development, the Bioinformatic platforms, such as GALAXY and UNIX, play a central role in the analysis process of exome sequencing data, facilitating precise identification and interpretation of genetic variations linked to congenital abnormalities. In this study, we conducted a comparative analysis of exome sequencing data from a 1.5-year-old syndactyly patient using two platforms: GALAXY and UNIX. The UNIX platform identified a total of 275,572 variants, and the GALAXY platform identified 140,291 variants when compared with the Grch38/hg38 reference genome. A comparative analysis identified 126,848 common variants between the platforms. After filtration with the 200 syndactyly-related genes, 1,345 variants were remained. The distribution of these 1,345 variants spans the entirety of the patient's genome, with focal concentrations observed on specific chromosomes including chromosomes 2, 4, and 11. Concurrently, within the top 200 genes implicated in syndactyly, the genes FRAS1, CACNA1C, GLI2, and NOTCH1 exhibit the highest frequency of variants. These data emphasized the impact of the chosen analytical platform on genetic variation detection in congenital limb abnormalities, provided critical insights into the selection of bioinformatic tools for optimizing exome sequencing workflows in the context of limb malformations, contributed to advancements in genetic research and diagnostic methodologies.

Downloads

Download data is not yet available.

References

Ahmed H., Akbari H., Emami A., Akbari M. R. - Genetic Overview of Syndactyly and Polydactyly. Plastic and reconstructive surgery. Global open, 5 (2017) e1549.

Mandal K., Phadke S. R., Kalita J. - Congenital swan neck deformity of fingers with syndactyly. Clinical dysmorphology, 17 (2008) 109-111.

Malik S., Afzal M., Gul S., Wahab A., Ahmad M. - Autosomal dominant syndrome of camptodactyly, clinodactyly, syndactyly, and bifid toes. American journal of medical genetics. Part A, 152A (2010) 2313-2317.

Vieira C., Teixeira N., Cadilhe A., Reis I. - Apert syndrome: prenatal diagnosis challenge. BMJ case reports, 12 (2019)

Turnpenny P. D., Dean J. C., Duffty P., Reid J. A., Carter P. - Weyers' ulnar ray/oligodactyly syndrome and the association of midline malformations with ulnar ray defects. Journal of medical genetics, 29 (1992) 659-662.

Patel R., Singh S. K., Bhattacharya V., Ali A. - Novel HOXD13 variants in syndactyly type 1b and type 1c, and a new spectrum of TP63-related disorders. Journal of human genetics, 67 (2022) 43-49.

Ngoc N. T., Duong N. T., Quynh D. H., Ton N. D., Duc H. H., Huong L. T. M., Anh L. T. L., Hai N. V. - Identification of novel missense mutations associated with non-syndromic syndactyly in two vietnamese trios by whole exome sequencing. Clinica chimica acta; international journal of clinical chemistry, 506 (2020) 16-21.

Deng H., Tan T. - Advances in the Molecular Genetics of Non-syndromic Syndactyly. Current genomics, 16 (2015) 183-193.

Jelin A. C., Vora N. - Whole Exome Sequencing: Applications in Prenatal Genetics. Obstetrics and gynecology clinics of North America, 45 (2018) 69-81.

Blankenberg D., Gordon A., Von Kuster G., Coraor N., Taylor J., Nekrutenko A., Galaxy T. - Manipulation of FASTQ data with Galaxy. Bioinformatics, 26 (2010) 1783-1785.

Van der Auwera G. A., Carneiro M. O., Hartl C., Poplin R., Del Angel G., Levy-Moonshine A., Jordan T., Shakir K., Roazen D., Thibault J., Banks E., Garimella K. V., Altshuler D., Gabriel S., DePristo M. A. - From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline. Current protocols in bioinformatics, 43 (2013) 11 10 11-11 10 33.

Al-Qattan M. M. - A Review of the Genetics and Pathogenesis of Syndactyly in Humans and Experimental Animals: A 3-Step Pathway of Pathogenesis. BioMed research international, 2019 (2019) 9652649.

Cassim A., Hettiarachchi D., Dissanayake V. H. W. - Genetic determinants of syndactyly: perspectives on pathogenesis and diagnosis. Orphanet journal of rare diseases, 17 (2022) 198.

Afgan E., Baker D., Batut B., van den Beek M., Bouvier D., Cech M., Chilton J., Clements D., Coraor N., Gruning B. A., Guerler A., Hillman-Jackson J., Hiltemann S., Jalili V., Rasche H., Soranzo N., Goecks J., Taylor J., Nekrutenko A., Blankenberg D. - The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update. Nucleic acids research, 46 (2018) W537-W544.

Sims D., Ilott N. E., Sansom S. N., Sudbery I. M., Johnson J. S., Fawcett K. A., Berlanga-Taylor A. J., Luna-Valero S., Ponting C. P., Heger A. - CGAT: computational genomics analysis toolkit. Bioinformatics, 30 (2014) 1290-1291.

Batut B., van den Beek M., Doyle M. A., Soranzo N. - RNA-Seq Data Analysis in Galaxy. Methods in molecular biology, 2284 (2021) 367-392.

Wee S. K., Yap E. P. H. - GALAXY Workflow for Bacterial Next-Generation Sequencing De Novo Assembly and Annotation. Current protocols, 1 (2021) e242.

Thang M. W. C., Chua X. Y., Price G., Gorse D., Field M. A. - MetaDEGalaxy: Galaxy workflow for differential abundance analysis of 16s metagenomic data. F1000Research, 8 (2019) 726.

Chappell K., Francou B., Habib C., Huby T., Leoni M., Cottin A., Nadal F., Adnet E., Paoli E., Oliveira C., Verstuyft C., Davit-Spraul A., Gaignard P., Lebigot E., Duclos-Vallee J. C., Young J., Kamenicky P., Adams D., Echaniz-Laguna A., Gonzales E., Bouvattier C., Linglart A., Picard V., Bergoin E., Jacquemin E., Guiochon-Mantel A., Proust A., Bouligand J. - Galaxy Is a Suitable Bioinformatics Platform for the Molecular Diagnosis of Human Genetic Disorders Using High-Throughput Sequencing Data Analysis: Five Years of Experience in a Clinical Laboratory. Clinical chemistry, 68 (2022) 313-321.

Downloads

Published

08-10-2024

How to Cite

[1]
T. N. Nguyen and M. H. Huynh, “Comparison of Galaxy and Unix tools for analyzing the exome sequencing data from syndactyly abnormalities”, Vietnam J. Sci. Technol., vol. 61, no. 4, Oct. 2024.

Issue

Section

Natural Products

Funding data