CandidatePrograms

Revision as of 18:20, 17 August 2015 by Beerli (talk | contribs)

Candidate Programs for our Class

Keep in mind that we will definitely not be able to talk about all of them.

  • adegenet is an R-package dedicated to the exploratory analysis of genetic data. It implements a set of tools ranging from multivariate methods to spatial genetics and genome-wise SNP data analysis.
  • BEDASSLE This is a method I worked on in collaboration with Peter Ralph and Graham Coop. It allows users to quantify the relative contributions of geographic and environmental distance to patterns of genetic differentiation.
  • mtml-msBayes msBayes uses hierarchical approximate Bayesian computation (HABC) for comparative (multi-taxa) phylogeographic inference. HABC allows testing hypotheses and estimating hyper-parameters that quantify levels and patterns of congruence across co-distributed taxon-pairs in divergence/colonization times and/or categorical biogeographic histories.
  • GPHoCS G-PhoCS is a software package for inferring ancestral population sizes, population divergence times, and migration rates from individual genome sequences. G-PhoCS accepts as input a set of multiple sequence alignments from separate neutrally evolving loci along the genome. Parameter inference is done in a Bayesian manner, using a Markov Chain Monte Carlo (MCMC) to jointly sample model parameters and genealogies at the input loci.
  • TESS implements ancestry estimation algorithms for spatial population genetic analyses. The program performs individual geographical assignment, admixture analysis, and can be used to run genome scans for selection. TESS is particularly suited to seeking genetic discontinuities in continuous populations and estimating spatially varying individual admixture proportions.
  • STRUCTURE is a free software package for using multi-locus genotype data to investigate population structure. Its uses include inferring the presence of distinct populations, assigning individuals to populations, studying hybrid zones, identifying migrants and admixed individuals, and estimating population allele frequencies in situations where many individuals are migrants or admixed.
  • BAPS introduces the spatial clustering model for DNA sequence data such that the output can be displayed on Google Maps.
  • Geneland is a computer program for statistical analysis of population genetics data. Its main goal is to detect population structure in form of systematic variation of allele frequency that can be detected from departure from Hardy-Weinberg and linkage equilibrium.
  • Structurama is a program for inferring population structure from genetic data. The program assumes that the sampled loci are in linkage equilibrium and that the allele frequencies for each population are drawn from a Dirichlet probability distribution.
  • Partition finder is free open source software to select best-fit partitioning schemes and models of molecular evolution for phylogenetic analyses.(this seems to have a recent spinoff by Frandsen, but his github site pbfrandsen's github needs better organization before diving into this.)
  • RaXML is a fast maximum likelihood phylogenetic inference program
  • SVDquartets is a program to compute a score based on singular value decomposition of a matrix of site pattern frequencies corresponding to a split on a phylogenetic tree. These quartet scores can be used to select the best-supported topology for quartets of taxa, which in turn can be used to infer the species phylogeny using quartet methods. It is implemented in PAUP*
  • PAUP* is the reference phylogeny inference program including a kitchen sink of methods.
  • ASTRAL is a Java program for estimating a species tree given a set of unrooted gene trees. ASTRAL is statistically consistent under multi-species coalescent model (and thus is useful for handling ILS). It finds the tree that maximizes the number of induced quartet trees in the set of gene trees that are shared by the species tree.
  • STRAW:STAR,MP-EST, NJst The species tree reconstruction webserver consists of a series of phylogenetic tools for estimating species trees from phylogenomic data: 1) rooting gene trees with outgroup species, 2) estimating species trees from the rooted gene trees using STAR , MP-EST , NJst methods, 3) bootstrap analyses for the species tree reconstruction methods (STAR, MP-EST, and NJst).
  • SATE,PASTA,Superfine SATE:Simultaneous Alignment and Tree Estimation (SATé) is a software package for constructing phylogenies and alignments from sequence data; PASTA (Practical Alignment using SATé and TrAnsitivity) is an improvement to SATé: it uses some of the algorithmic design of SATé but is faster, produces more accurate alignments and trees, and can scale to much larger datasets; SuperFine is software for constructing supertrees from source phylogenies.
  • GARLI Genetic Algorithm for Rapid Likelihood Inference is a program for inferring phylogenetic trees. Using an approach similar to a classical genetic algorithm, it rapidly searches the space of evolutionary trees and model parameters to find the solution maximizing the likelihood score. It implements nucleotide, amino acid and codon-based models of sequence evolution, and runs on all platforms. The latest version adds support for partitioned models and morphology-like datatypes.
  • ExaBayes ExaBayes is a software package for Bayesian tree inference. It is particularly suitable for large-scale analyses on computer clusters. {see MrBayes
  • MrBayes is a program for Bayesian inference and model choice across a wide range of phylogenetic and evolutionary models. MrBayes uses Markov chain Monte Carlo (MCMC) methods to estimate the posterior distribution of model parameters.
  • RevBayes Bayesian phylogenetic inference using probabilistic graphical models and an interactive language. Successor of MrBayes.
  • MIGRATE Migrate estimates effective population sizes, past migration rates, and divergence times between n population.
  • IMa was developed for the fitting of an isolation model with migration to haplotype data drawn from two closely related species or populations.
  • angsd: Analyses of nextgen sequencing data.

NOT FINISHED