====== GENEPOP ====== **[[http://genepop.curtin.edu.au/genepop_op1.html|GENEPOP]]**\\ [[http://kimura.univ-montp2.fr/~rousset/Genepop.pdf|documentation]]\\ [[http://genepop.curtin.edu.au/help_input.html|GENEPOP input file]] \\ Genepop 4.1 (24.03.2011)\\ It computes exact tests for Hardy-Weinberg equilibrium, for population differentiation and for genotypic disequilibrium among pairs of loci;\\ It computes estimates of F-statistics, null allele frequencies, allele sizebased statistics for microsatellites, etc., and of number of immigrants by Barton & Slatkin's 1986 private allele method;\\ It performs analyses of isolation by distance from pairwise comparisons of individuals or population samples, including confidence intervals for "neighborhood size";\\ Mantel test ===== Program information ===== * Windows * Linux * Mac OSX ===== Data type handled ===== * haploid/diploid * Microsattelite * Standard (multi-allelic markers) ===== Input Files ===== * **First line:** anything Use this line to store information about your data * **Locus names:** They may be given one per line, or on the same line but separated by commas. This could be useful to clearly label each column Title line: "Grape populations in southern France" Loc1,Loc2, ADH3,ADH4,ADH5,mtDNA Pop Grange des Peres , 0201 003003 0102 0302 1011 01 ... * **Pop sample indicator** (Capitalization does not matter): Each sample from a different geographical original is declared by a line with a pop statement * **Information for first individual:** ind#001 fem ,0101 0202 0000 0410 * Here ind#001 fem is an identifier for your personal use. You can use any character (except a comma!).The last identifier of every sub-population is used by Genepop as the sample name in output files. The comma between the identifier and the list of genotypes is required. * 0101 indicates that this individual is homozygous for the 01 allele at the first locus. * The same individual is homozygous for the 02 allele at the second locus (0202). * Unordered List Item Data are missing at the third locus (0000). * At the fourth locus, the genotype is 0410, which indicates the presence of alleles 04 and 10. * **More individuals**: Each individual information starts on a new line, but may extend over several lines (do not start a new line in the middle of a onelocus genotype!) * **More samples:** each declared by a pop statement on a new line * Do not leave blank lines at the end of the file \\ ==== example: ==== Title line: "Grape populations in southern France" ADH Locus 1 ADH #2 ADH three ADH-4 ADH-5 mtDNA Pop Grange des Peres , 0201 003003 0102 0302 1011 01 Grange des Peres , 0202 003001 0102 0303 1111 01 Grange des Peres , 0102 004001 0202 0102 1010 01 Grange des Peres , 0103 002002 0101 0202 1011 01 Grange des Peres , 0203 002004 0101 0102 1010 01 POP Tertre Roteboeuf , 0102 002002 0201 0405 0807 01 Tertre Roteboeuf , 0102 002001 0201 0405 0307 01 Tertre Roteboeuf , 0201 002003 0101 0505 0402 01 Tertre Roteboeuf , 0201 003003 0301 0303 0603 01 Tertre Roteboeuf , 0101 002001 0301 0505 0807 01 pop Bonneau 01 , 0101 002002 0304 0805 0304 01 Bonneau 02 , 0201 002002 0404 0505 0304 01 Bonneau 03 , 0101 002100 0304 0505 0101 01 Bonneau 04 , 0101 100100 0204 0805 0304 01 Bonneau 05 , 0101 100002 0104 0808 0304 01 Pop , 0000 002001 0202 0402 0007 01 , 0200 002001 0202 0205 0707 01 , 0010 002001 0101 0105 0807 01 last pop, 0101 002001 0101 0401 0807 02 This example shows some useful features: * There is no constraint on the number of blanks separating the various fields * The individual identifer has a free format * Alleles are numbered from 01 to 99 or 001 to 999 if needed. 2-digits and 3-digits coding of alleles can be intermixed (among loci, not within loci!) * To designate alleles, consecutive numbers are not required * haploid and diploid data can be intermixed. (6-digits genotypes are recognized as 3-digits diploid genotypes; 4-digits genotypes are recognized as 2-digits diploid genotypes; 2- and 3-digits genotypes are recognized as haploid genotypes. The same coding should be used consistently within each locus (for haplo-diploid data haploid data should be coded as diploid data with one unknown allele).) * Genotypes can extend on more than one line * To group various samples, just remove each relevant Pop separator ==== constraints: ==== * Missing data should be indicated with 00 (or 000 for 3-digits coding) and not with blanks * The number of locus names should correspond to the number of genotypes in each individual * No empty line should be present in the data file * accepts input file names either with the extension .txt or without any extension * input files are ASCII text files ===== How to cite ===== Rousset F, 2008. GENEPOP'007: a complete re-implementation of the genepop software for Windows and Linux. Molecular Ecology Resources, 1:103-106