Pan-genome

From CSBLwiki

(Difference between revisions)
Jump to: navigation, search
(MDS)
(Data)
Line 2: Line 2:
==Data==
==Data==
*<b>Pan-Genome data version 0.1 Feb 2010</b>
*<b>Pan-Genome data version 0.1 Feb 2010</b>
-
*Download the frequency profile (1,137 genomes x 11,912 [http://pfam.sanger.ac.uk Pfam] domains)
+
**The frequency profile (1,137 genomes x 11,912 [http://pfam.sanger.ac.uk Pfam] domains)
-
**download: [[media:profile.csv|The FP(Frequency Profile) matrix]]
+
***<b>download: [[media:profile.csv|The FP(Frequency Profile) matrix]]</b>
==Results==
==Results==

Revision as of 07:53, 12 March 2010

Contents

Pan-Genomic Universe

Data

Results

MDS

  1. build a pairwise distance matrix (11,912 x 11,912) from the FP matrix (1,137 x 11,912) - euclidean distance
  2. do MDS (multidimensional scaling) of the distance matrix
  3. get coordinates from the first three dimension and plot them in a 3D-space
  1. from Exact distance matrix:
    Mds1.png
  2. from distance matrix cutting off at 99% of the maximum distance:
    Mds99.png
  3. from distance matrix cutting off at 98% of the maximum distance:
    Mds98.png
  4. from distance matrix cutting off at 97% of the maximum distance:
    Mds97.png

R script sources

Personal tools
Namespaces
Variants
Actions
Site
Choi lab
Resources
Toolbox