User Tools

Site Tools


mega

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
mega [2008/05/20 11:50] heidimega [2011/07/07 11:50] (current) heidi
Line 8: Line 8:
  
 \\ \\
-Version (May 12008)\\+Version (Aril 242011)\\
 MEGA is an integrated tool for conducting automatic and manual sequence alignment, inferring phylogenetic trees, mining web-based databases, estimating rates of molecular evolution, and testing evolutionary hypotheses. MEGA is an integrated tool for conducting automatic and manual sequence alignment, inferring phylogenetic trees, mining web-based databases, estimating rates of molecular evolution, and testing evolutionary hypotheses.
 +
  
  
Line 15: Line 16:
  
 ===== Program information ===== ===== Program information =====
-  * Windows 95/98, NT, 2000, XP, and Vista (with at least 64 MB of RAM, 20 MB of available hard disk space)+  * Windows XP, Vista, 7 (with at least 64 MB of RAM, 20 MB of available hard disk space)
   * MEGA also can be run on other operating systems for which Windows emulators are available:   * MEGA also can be run on other operating systems for which Windows emulators are available:
     * Macintosh: Windows using VirtualPC     * Macintosh: Windows using VirtualPC
     * Sun Workstation: SoftWindows95     * Sun Workstation: SoftWindows95
     * Linux: Windows using VMWare     * Linux: Windows using VMWare
 +
  
  
Line 27: Line 29:
   * RNA   * RNA
   * nucleotide   * nucleotide
 +  * distance
   * (protein sequences)   * (protein sequences)
 +
  
 ===== Input Files ===== ===== Input Files =====
   * ASCII-text files   * ASCII-text files
   * extension: *.MEG   * extension: *.MEG
 +  * Importing Data from Other Formats:
 +    * CLUSTAL
 +    * [[NEXUS]]
 +    * [[PHYLIP]] (Interleaved/Noninterleaved)
 +    * GCG
 +    * [[FASTA]]
 +    * PIR
 +    * NBRF
 +    * MSF
 +    * IG
 +    * Internet (NCBI) XML format
 +
  
  
Line 60: Line 76:
  
 \\ \\
-  * Comments: can be written anywhere in the data file and can span multiple lines. They must always be enclosed in square brackets ([and]) brackets and can be nested. +  * Comments:  
-  * keywords: can be written in any combination of lower- and upper-case letters +    * anywhere in the data file 
-  * Rules for Taxa Names: Distance matrices as well as sequence data may come from species, populations, or individuals. These evolutionary entities are designated as OTUs (Operational Taxonomic Units) or taxa. Each taxon must have an identification tag, i.e., a taxon Iabel according to the following conventions+    * can span multiple lines 
-    * ‘#’ Sign: Every Iabel must be written on a new line, and a '#' sign must precede the label. There are no restrictions on the length of the Iabels in the data file. The labels are not required to be uniquealthough identical labels may result in ambiguities and should be avoided. +    * enclosed in square brackets ([and]) 
-    * Characters: Taxa labels must start with alphanumeric characters (0-9, a-z, and A-Z) or a special character: ''-, + or .''After the first character, taxa labels may contain the following additional special characters:''_, *, :, ( ), |, \, /''For multiple word labels, an underscore can be used to represent a blank space+    * can be nested 
 +  * keywords:  
 +    * written in any combination of lower- and upper-case letters 
 +  * Taxa Names:  
 +    * ‘#’ Sign: Every Iabel must be written on a new line, and a '#' sign must precede the label 
 +    * no restrictions on the length of the Iabels 
 +    * not required to be unique (although identical labels may result in ambiguities and should be avoided) 
 +    * must start with alphanumeric characters (0-9, a-z, and A-Z) or a special character: ''-, + or .'' 
 +    * After the first character, taxa labels may contain the following additional special characters:''_, *, :, ( ), |, \, /'' 
 +    * For multiple word labels, an underscore can be used to represent a blank space  
 + 
 +\\ 
  
  
Line 89: Line 117:
 | MatchChar | single character | Synonymous with the identical keyword | MatchChar = . | | MatchChar | single character | Synonymous with the identical keyword | MatchChar = . |
 | Missing | single character | use question mark (?) to indicate missing data | Missing = ? | | Missing | single character | use question mark (?) to indicate missing data | Missing = ? |
-| CodeTable | A name This instruction gives the name of the code table for the protein coding domains of the data | CodeTable = Standard |+| CodeTable | A name This instruction gives the name of the code table for the protein coding domains of the data | CodeTable = Standard |
  
  
Line 106: Line 134:
     * there should be no spaces between the taxa name and group name      * there should be no spaces between the taxa name and group name 
  
-  *  **Labelling Individual Sites:**¨+  *  **Labelling Individual Sites:**
     * The individual sites in nucleotide or amino acid data can be labeled to construct non-contiguous sets of sites.      * The individual sites in nucleotide or amino acid data can be labeled to construct non-contiguous sets of sites. 
     * Each site can be associated with only one label      * Each site can be associated with only one label 
-    * A label can be a letter or a +    * A label can be a letter or a number.
-number.+
  
 === example === === example ===
Line 130: Line 157:
 !Label +++__-+++-a-+++-L-+++-k-+++123+++-_-+++---+++; !Label +++__-+++-a-+++-L-+++-k-+++123+++-_-+++---+++;
 </code> </code>
 +
 +
  
  
Line 140: Line 169:
   * **Keywords for Format Statement:**   * **Keywords for Format Statement:**
  
-^ Command ^ Setting ^ Remark ^ Example ^ +^ Command ^ Setting ^ Remark ^ Example ^  
-| DataType | DNA, RNA, nucleotide, protein | | DataType=DNA +| DataType | Distance Specifies that the distance data is in the file | DataType=distance |  
-| NSeqs | integer | Number of sequences | NSeqs=85 | +| NSeqs | integer | Number of sequences | NSeqs=85 |  
-| NTaxa | integer | Synonymous with NSeqs | NTaxa=85 | +| NTaxa | integer | Same as NSeqs | NTaxa=85 |  
-NSites integer | Number of nucleotides | Nsites=4592 | +DataFormat Lowerleftupperright | Specifies whether the data is in lower left triangular matrix or the upper right triangular matrix DataFormat=lowerleft 
-| Property | ExonIntron, Coding, Noncoding, and End | Specifies whether a domain is protein coding Property=cyt_b + 
-| Indel | single character | Use dash (-) | Indel = - | +
-| Identical | single character | Use period (.) | Identical = . | +
-| MatchChar | single character | Synonymous with identical | MatchChar = . | +
-| Missing | single character | Use a question mark (?) | Missing = ? | +
-| CodeTable | A name | This instruction gives the name of the code table for the protein coding domains of the data | CodeTable = Standard |+
  
   * **Defining Groups:**   * **Defining Groups:**
Line 157: Line 181:
 === example === === example ===
 <code> <code>
-#one +#mega 
-#two +!Title: Concatenated Files; 
-#three +!Format DataType=Distance DataFormat=LowerLeft NTaxa=6;
-#four +
-#five+
  
-1.0 2.0 3.0 4.0 +#Rodent 
-    3.0 2.5 4.6 +#Primate 
-        1.3 3.+#Lagomorpha 
-            4.2+#Artiodactyla 
 +#Carnivora 
 +#Perissodactyla 
 +       
 +0.514        
 +0.535 0.436        
 +0.530 0.388 0.418        
 +0.521 0.353 0.417 0.345        
 +0.500 0.331 0.402 0.327 0.349
 </code> </code>
 +
  
  
Line 173: Line 204:
  
 ===== How to cite ===== ===== How to cite =====
-  * When referring to MEGA in the main text of your publication, you may choose a format such as:\\ +Citation for MEGA 5
-    Phylogenetic and molecular evolutionary analyses were conducted using MEGA version 4 (Tamura, Dudley, Nei, and Kumar 2007). +  * Tamura K, Peterson D, Peterson N, Stecher G, Nei M, and Kumar S (2011MEGA5: Molecular Evolutionary Genetics Analysis using Maximum Likelihood, Evolutionary Distance, and Maximum Parsimony Methods. Molecular Biology and Evolution doi10.1093/molbev/msr121
-  * When including a MEGA citation in the Literature Cited/Bibliography section, you may use the following:\\ +
-Tamura K, Dudley J, Nei M Kumar S (2007MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. Molecular Biology and Evolution 24:1596-1599(Publication PDF at http://www.kumarlab.net/publications)+
  
 +\\
 +Citation for MEGA 4:
 +  * Tamura K, Dudley J, Nei M & Kumar S (2007) MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. Molecular Biology and Evolution 24: 1596-1599. 
mega.1211277039.txt.gz · Last modified: 2008/07/22 13:30 (external edit)