Identification of patterns related to linkage groups or disequilibrium by factor analysis

ABSTRACT: Empirical patterns of linkage disequilibrium (LD) can be used to increase the statistical power of genetic mapping. This study aimed to verify the efficacy of factor analysis (FA) applied to single-nucleotide polymorphism (SNP) marker data sets for identifying linkage groups and haplotype blocks. The SNP data set was obtained by simulating an F2 population and contained 2000 markers genotyped in 500 individuals. The factor loadings were estimated in two ways: from the matrix of distances between markers (A) and from the correlation matrix (R). The number of factors (k) was chosen using two criteria: the scree plot and the proportion of total variance explained. Matrices A and R led to similar results. Based on the scree plot, k was set to 10, and the factors were interpreted as representing linkage groups. The second criterion led to k = 50, and the factors were interpreted as representing haplotype blocks. These results show the potential of the technique, which can provide results applicable to any type of population and can support or corroborate the interpretation of genomic studies. The study demonstrated that FA was able to identify patterns of association between markers, identifying subgroups of markers that reflect linkage groups and also linkage disequilibrium groups.
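
The workflow described in the abstract can be illustrated with a small Python sketch. It is not the authors' code: the simulation settings (three unlinked groups of 40, 25 and 12 markers, recombination fraction 0.005 between adjacent markers), the function names, and the use of unrotated principal-component loadings from the correlation matrix R are all assumptions made for illustration. The distance matrix A is not covered, and a real analysis would likely apply a rotation before interpreting the factors.

```python
import numpy as np


def simulate_f2(n_ind, group_sizes, r, rng):
    """Simulate F2 genotypes coded 0/1/2 (hypothetical helper).

    Each group is a set of linked markers; groups are mutually unlinked.
    r is the assumed recombination fraction between adjacent markers.
    """
    blocks = []
    for m in group_sizes:
        genotype = np.zeros((n_ind, m), dtype=int)
        for _ in range(2):  # an F2 individual receives two F1 gametes
            start = rng.integers(0, 2, size=(n_ind, 1))  # allele at first marker
            crossover = rng.random((n_ind, m - 1)) < r   # crossover per interval
            n_switch = np.hstack([np.zeros((n_ind, 1), dtype=int),
                                  np.cumsum(crossover, axis=1)])
            genotype += (start + n_switch) % 2           # allele flips at crossovers
        blocks.append(genotype)
    return np.hstack(blocks)


def pc_loadings(genotypes, k):
    """Loadings of the first k factors from the marker correlation matrix R,
    using the principal-component extraction method (no rotation)."""
    R = np.corrcoef(genotypes, rowvar=False)
    eigval, eigvec = np.linalg.eigh(R)
    order = np.argsort(eigval)[::-1]                     # largest eigenvalue first
    eigval, eigvec = eigval[order], eigvec[:, order]
    loadings = eigvec[:, :k] * np.sqrt(eigval[:k])
    explained = np.cumsum(eigval) / eigval.sum()         # cumulative variance share
    return loadings, explained


def assign_markers(loadings):
    """Assign each marker to the factor with the largest absolute loading."""
    return np.argmax(np.abs(loadings), axis=1)


rng = np.random.default_rng(0)
sizes = [40, 25, 12]
genotypes = simulate_f2(500, sizes, 0.005, rng)
loadings, explained = pc_loadings(genotypes, k=3)
groups = assign_markers(loadings)

print("cumulative variance explained by 3 factors:", round(explained[2], 3))
print("markers per factor:", np.bincount(groups, minlength=3))
```

Here k is passed explicitly. In practice it would be read off a scree plot of the eigenvalues or taken as the smallest k whose cumulative proportion in `explained` reaches a chosen threshold, which are the two criteria the abstract names.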

Bibliographic Details
Main Authors: Oliveira, Cristiano Ferreira de; Teixeira, Gabriely; Temoteo, Alex da Silva; Nascimento, Moysés; Cruz, Cosme Damião
Format: Digital journal
Language: English
Published: Universidade Federal de Santa Maria 2021
Online Access: http://old.scielo.br/scielo.php?script=sci_arttext&pid=S0103-84782021000500401
id oai:scielo:S0103-84782021000500401
record_format ojs
spelling oai:scielo:S0103-84782021000500401; 2021-03-03; linkage disequilibrium, factor analysis, SNP, haplotype blocks, linkage groups, QTL; Ciência Rural v.51 n.5 2021; 2021-01-01; info:eu-repo/semantics/openAccess; info:eu-repo/semantics/article; text/html; en; 10.1590/0103-8478cr20190984
institution SCIELO
collection OJS
country Brazil
countrycode BR
component Journal
access Online
databasecode rev-scielo-br
tag journal
region South America
libraryname SciELO
language English
format Digital
author Oliveira, Cristiano Ferreira de
Teixeira, Gabriely
Temoteo, Alex da Silva
Nascimento, Moysés
Cruz, Cosme Damião
title Identification of patterns related to linkage groups or disequilibrium by factor analysis
publisher Universidade Federal de Santa Maria
publishDate 2021
url http://old.scielo.br/scielo.php?script=sci_arttext&pid=S0103-84782021000500401