
genio - Genetics Input/Output Functions
Implements readers and writers for file formats associated with genetics data. Reading and writing Plink BED/BIM/FAM and GCTA binary GRM formats is fully supported, including a lightning-fast BED reader and writer implementations. Other functions are 'readr' wrappers that are more constrained, user-friendly, and efficient for these particular applications; handles Plink and Eigenstrat tables (FAM, BIM, IND, and SNP files). There are also make functions for FAM and BIM tables with default values to go with simulated genotype data.
Last updated 14 days ago
cpp
7.97 score 17 stars 4 dependents 213 scripts 1.1k downloadslfa - Logistic Factor Analysis for Categorical Data
Logistic Factor Analysis is a method for a PCA analogue on Binomial data via estimation of latent structure in the natural parameter. The main method estimates genetic population structure from genotype data. There are also methods for estimating individual-specific allele frequencies using the population structure. Lastly, a structured Hardy-Weinberg equilibrium (HWE) test is developed, which quantifies the goodness of fit of the genotype data to the estimated population structure, via the estimated individual-specific allele frequencies (all of which generalizes traditional HWE tests).
Last updated 4 months ago
snpdimensionreductionprincipalcomponentregressionopenblas
7.04 score 16 stars 1 dependents 57 scripts 671 downloads
popkin - Estimate Kinship and FST under Arbitrary Population Structure
Provides functions to estimate the kinship matrix of individuals from a large set of biallelic SNPs, and extract inbreeding coefficients and the generalized FST (Wright's fixation index). Method described in Ochoa and Storey (2021) <doi:10.1371/journal.pgen.1009241>.
Last updated 5 months ago
6.11 score 20 stars 65 scripts 456 downloadsgcatest - Genotype Conditional Association TEST
GCAT is an association test for genome wide association studies that controls for population structure under a general class of trait models. This test conditions on the trait, which makes it immune to confounding by unmodeled environmental factors. Population structure is modeled via logistic factors, which are estimated using the `lfa` package.
Last updated 4 months ago
snpdimensionreductionprincipalcomponentgenomewideassociation
5.18 score 5 stars 4 scripts 267 downloads