genio - Genetics Input/Output Functions
Implements readers and writers for file formats associated with genetics data. Reading and writing Plink BED/BIM/FAM and GCTA binary GRM formats is fully supported, including a lightning-fast BED reader and writer implementations. Other functions are 'readr' wrappers that are more constrained, user-friendly, and efficient for these particular applications; handles Plink and Eigenstrat tables (FAM, BIM, IND, and SNP files). There are also make functions for FAM and BIM tables with default values to go with simulated genotype data.
Last updated 3 months ago
7.42 score 17 stars 3 packages 173 scripts 736 downloadslfa - Logistic Factor Analysis for Categorical Data
Logistic Factor Analysis is a method for a PCA analogue on Binomial data via estimation of latent structure in the natural parameter. The main method estimates genetic population structure from genotype data. There are also methods for estimating individual-specific allele frequencies using the population structure. Lastly, a structured Hardy-Weinberg equilibrium (HWE) test is developed, which quantifies the goodness of fit of the genotype data to the estimated population structure, via the estimated individual-specific allele frequencies (all of which generalizes traditional HWE tests).
Last updated 24 days ago
snpdimensionreductionprincipalcomponentregression
7.02 score 16 stars 1 packages 54 scripts 605 downloadspopkin - Estimate Kinship and FST under Arbitrary Population Structure
Provides functions to estimate the kinship matrix of individuals from a large set of biallelic SNPs, and extract inbreeding coefficients and the generalized FST (Wright's fixation index). Method described in Ochoa and Storey (2021) <doi:10.1371/journal.pgen.1009241>.
Last updated 2 months ago
6.08 score 19 stars 64 scripts 312 downloadsgcatest - Genotype Conditional Association TEST
GCAT is an association test for genome wide association studies that controls for population structure under a general class of trait models. This test conditions on the trait, which makes it immune to confounding by unmodeled environmental factors. Population structure is modeled via logistic factors, which are estimated using the `lfa` package.
Last updated 24 days ago
snpdimensionreductionprincipalcomponentgenomewideassociation
5.18 score 5 stars 4 scripts 240 downloads