Pan-genome
From CSBLwiki
(Difference between revisions)
(→Data) |
(→Results) |
||
Line 6: | Line 6: | ||
==Results== | ==Results== | ||
+ | ===MDS=== | ||
+ | *Procedures | ||
+ | #build a pairwise distance matrix (11,912 x 11,912) from the FP matrix (1,137 x 11,912) - [ref:euclidean distance] | ||
+ | #do MDS (multidimensional scaling) of the distance matrix | ||
+ | #get coordinates from the first three dimension and plot them in a 3D-space | ||
+ | *Exact distance matrix | ||
+ | |||
==R script sources== | ==R script sources== |
Revision as of 07:40, 12 March 2010
Contents |
Pan-Genomic Universe
Data
- Pan-Genome data version 0.1 Feb 2010
- Download the frequency profile (1,137 genomes x 11,912 pfam domains)
- download: The FP(Frequency Profile) matrix
Results
MDS
- Procedures
- build a pairwise distance matrix (11,912 x 11,912) from the FP matrix (1,137 x 11,912) - [ref:euclidean distance]
- do MDS (multidimensional scaling) of the distance matrix
- get coordinates from the first three dimension and plot them in a 3D-space
- Exact distance matrix