Comparative Analysis of Romanian and Swiss Bovine Populations Using Whole Genome Sequencing and SNP Microarrays

Authors

  • Alexandru Eugeniu Mizeranschi Research and Development Station for Bovine - Arad, 310059, Arad, Bodrogului 32, Romania
  • Radu Ionel Neamț Research and Development Station for Bovine - Arad, 310059, Arad, Bodrogului 32, Romania
  • Ciprian Valentin Mihali Research and Development Station for Bovine - Arad, 310059, Arad, Bodrogului 32, Romania
  • Vlad Goilean Research and Development Station for Bovine - Arad, 310059, Arad, Bodrogului 32, Romania
  • Mihai Carabaș Politehnica University of Bucharest, Splaiul Independenţei nr. 313, Bucharest, Romania
  • Daniela Elena Ilie Research and Development Station for Bovine - Arad, 310059, Arad, Bodrogului 32, Romania

Keywords:

whole genome sequencing, variant calling, SNP microarray, population structure

Abstract

The use of SNP microarrays has gained distinguished attention in recent years and the identification of numerous SNPs has proven valuable for genetic evaluation and selection in farm animals. In the current study we compared several bovine populations from Romania and Switzerland at the level of SNPs. Romanian Brown (N=39) and Romanian Spotted (N=245) cattle were genotyped using the Axiom Bovine BovMDv3 SNP microarray. For the Swiss population, we acquired sequencing data from the NCBI SRA database for 80 individuals from three breeds: Brown Swiss (N=20), Original Braunvieh (N=20) and Simmental (N=40). Sequencing data were processed using the Bcbio-nextgen data analysis pipeline and variants were called based on the UMD3.1 reference genome. Common SNPs found from both microarray and sequencing data were retained and genotypes from all the Romanian and Swiss animals were pooled together, resulting in a combined dataset of 48,291 SNPs for 364 individuals. Pairwise comparisons were assessed on the five subpopulations according to Weir and Cockerham’s FST index. Small genetic differences (FST < 0.05) were found between the Romanian Brown and Swiss Brown Swiss subpopulations and between Romanian Spotted and Swiss Simmental subpopulations. For all the other pairwise comparisons, FST values were between 0.05 and 0.1, indicating a moderate level of genetic difference among the corresponding subpopulations. The results of fastSTRUCTURE indicated that the most likely number (K) of subpopulations from the pooled dataset was between 8 and 12. Bar plots for K = 5, 8 and 12 confirmed that Romanian Brown and Swiss Brown Swiss subpopulations were genetically similar. However, they also revealed a surprisingly high level of heterogeneity among the Romanian Spotted individuals. As such, future research is required to zoom in on the genetic make-up and explain the most likely sources of heterogeneity for the Romanian Spotted breed. Current results will facilitate a better understanding of genomic selection and its application for improved breeding programs in Romanian cattle breeds.

References

Zimin A.V., Delcher A.L., Florea L., Kelley D.R., Schatz M.C., Puiu D. A whole genome assembly of the domestic cow, Bos taurus, Genome Biology, 2009, 10, 4: R42, doi: 10.1186/gb-2009-10-4-r42: 10 p.

Dekkers J.C.M., Commercial application of marker- and gene-assisted selection in livestock: Strategies and lessons, Journal of Animal Science, 2004, 82, 313–328

https://bovinegenome.elsiklab.missouri.edu

Rosen, B.D., Bickhart, D.M., Schnabel, R.D., Koren, S., Elsik, C.G., Zimin, A., Dreischer, C., Schultheiss, S., Hall, R., Schroeder, S.G., Van Tassell, C.P., Smith, T.P.L., Medrano, J.F., Modernizing the bovine reference genome assembly. Proc. World Congr. Genet. Appl. Livest. Prod., Vol. Mol. Genet. 2018, 3, 802.

Null, D.J., Van Raden, P.M., Rosen, B.D., O'Connell, J.R., Bickhart, D.M., Using the ARS-UCD1.2 reference genome in U.S. evaluations. Interbull Bulletin., 2019, 55, 30-34.

Boison, S.A., Utsunomiya, A.T.H., Santos, D.J.A., Neves, H.H.R., Carvalheiro, R., Mészáros, G., Utsunomiya, Y.T., do Carmo, A.S., Verneque, R.S., Machado, M.A. and Panetto, J.C.C., Accuracy of genomic predictions in Gyr (Bos indicus) dairy cattle. Journal of Dairy Science, 2017, 100(7), pp.5479-5490.

Brown, A., Ojango, J., Gibson, J., Coffey, M., Okeyo, M., Mrode, R., Genomic selection in a crossbred cattle population using data from the dairy genetics East Africa project. J Dairy Sci, 2016, 99, 7308–7312. doi: 10.3168/jds.2016-11083

Murgiano, L., Jagannathan, V., Benazzi, C., Bolcato, M., Brunetti, B., Muscatello, L.V., Dittmer, K., Piffer, C., Gentile, A., Drögemüller, C., Deletion in the EVC2 gene causes chondrodysplastic dwarfism in Tyrolean Grey cattle. PloS One, 2014, 9, e94861. doi: 10.1371/journal.pone.0094861

Tiezzi F, Parker-Gaddis KL, Cole JB, Clay JS, Maltecca, C. A Genome-Wide Association Study for Clinical Mastitis in First Parity US Holstein Cows Using Single-Step Approach and Genomic Matrix Re-Weighting Procedure, PLoS ONE, 2015, 10(2), e0114919. doi:10.1371/journal.pone.0114919

Häfliger, I.M., Hirter, N., Paris, J.M., Wolf Hofstetter, S., Seefried, F.R. and Drögemüller, C., A de novo germline mutation of KIT in a white‐spotted Brown Swiss cow. Animal genetics, 2020, 51(3), pp.449-452.

Häfliger, I.M., Sickinger, M., Holsteg, M., Raeder, L.M., Henrich, M., Marquardt, S., Drögemüller, C. and Lühken, G., An IL17RA frameshift variant in a Holstein cattle family with psoriasis-like skin alterations and immunodeficiency. BMC genetics, 2020, 21, pp.1-10.

https://github.com/bcbio/bcbio-nextgen

Li, H. and Durbin, R., Fast and accurate short read alignment with Burrows–Wheeler transform. bioinformatics, 2009, 25(14), pp.1754-1760.

Tischler, G. and Leonard, S., Biobambam: tools for read pair collation based algorithms on BAM files. Source Code for Biology and Medicine, 2014, 9(1), p.13.

Poplin, R., Ruano-Rubio, V., DePristo, M.A., Fennell, T.J., Carneiro, M.O., Van der Auwera, G.A., Kling, D.E., Gauthier, L.D., Levy-Moonshine, A., Roazen, D. and Shakir, K., Scaling accurate genetic variant discovery to tens of thousands of samples. BioRxiv, 2017, p.201178.

FastQC: A Quality Control Tool for High Throughput Sequence Data. Available online at: http://www.bioinformatics.babraham.ac.uk/projects/fastqc/

Li, H., Handsaker, B., Wysoker, A., Fennell, T., Ruan, J., Homer, N., Marth, G., Abecasis, G. and Durbin, R., The sequence alignment/map format and SAMtools. Bioinformatics, 2009, 25(16), pp.2078-2079.

García-Alcalde, F., Okonechnikov, K., Carbonell, J., Cruz, L.M., Götz, S., Tarazona, S., Dopazo, J., Meyer, T.F. and Conesa, A., Qualimap: evaluating next-generation sequencing alignment data. Bioinformatics, 2012, 28(20), pp.2678-2679.

Mizeranschi, A.E., Mihali, C.V., Neamţ, R.I., Carabaş, M. and Ilie, D.E., September. Distributed bioinformatics analyses on an SGE cluster, for variant calling on bovine whole-genome sequencing samples. In 2019 IEEE International Conference on Advanced Scientific Computing (ICASC) (pp. 1-6). IEEE, 2019

Bos Taurus reference genome UMD3.1.1: https://www.ncbi.nlm.nih.gov/assembly/GCF_000003055.6/

Merchant, S., Wood, D.E. and Salzberg, S.L., Unexpected cross-species contamination in genome sequencing projects. PeerJ, 2014, 2, p.e675.

Danecek, P., Auton, A., Abecasis, G., Albers, C.A., Banks, E., DePristo, M.A., Handsaker, R.E., Lunter, G., Marth, G.T., Sherry, S.T. and McVean, G., The variant call format and VCFtools. Bioinformatics, 2011, 27(15), pp.2156-2158.

McLaren, W., Gil, L., Hunt, S.E., Riat, H.S., Ritchie, G.R., Thormann, A., Flicek, P. and Cunningham, F., The Ensembl Variant Effect Predictor. Genome biology, 2016, 17(1), p.122.

Weir, B.S. and Cockerham, C.C., Estimating F-statistics for the analysis of population structure. Evolution, 1984, pp.1358-1370.

Raj, A., Stephens, M. and Pritchard, J.K., fastSTRUCTURE: variational inference of population structure in large SNP data sets. Genetics, 2014, 197(2), pp.573-589.

Grüning, B., Dale, R., Sjödin, A., Chapman, B.A., Rowe, J., Tomkins-Tinch, C.H., Valieris, R. and Köster, J., Bioconda: sustainable and comprehensive software distribution for the life sciences. Nature methods, 2018, 15(7), pp.475-476.

Hartl, D.L., Clark, A.G. and Clark, A.G., Principles of population genetics, 1997, (Vol. 116). Sunderland, MA: Sinauer associates.

Downloads

Published

2023-09-05