Download Pathways

From WikiPathways

(Difference between revisions)
Jump to: navigation, search
Current revision (14:50, 11 April 2024) (view source)
(April data release)
 
(135 intermediate revisions not shown.)
Line 1: Line 1:
__NOEDITSECTION__ <!-- Turn off section editing -->
__NOEDITSECTION__ <!-- Turn off section editing -->
  {| align="right"
  {| align="right"
-
   | __TOC__
+
   | __NOTOC__
   |}
   |}
-
Choose one of the file types below to download a set of pathways in that format. Choose between the following sets:
+
=== Versioned Releases ===
-
;Analysis collection pathways: Only pathways that have been explicitly marked with the 'Analysis collection' curation tag. This includes pathways that have been carefully curated and typically excludes draft or test pathways not intended for distribution. ''(recommended)''
+
Each month we release an updated set of pathways in various data and image formats. These pathways have been reviewed and tagged as approved, and are considered ready for analysis and data overlays.  
-
;All pathways: Includes non-featured pathways, but still excludes test or tutorial pathways (such as the [[Pathway:Homo sapiens:Sandbox|Sandbox pathway]]).
+
-
== GPML ==
 
-
Click on one of the links below to download all pathways in GPML format. You can view and edit GPML files in [http://pathvisio.org PathVisio]. From [http://pathvisio.org PathVisio] you can also export these files to the [www.genmapp.org GenMAPP] mapp format and [http://en.wikipedia.org/wiki/Portable_Document_Format PDF].
 
-
{|class="prettytable"
+
<font size=4>'''Current version: [http://data.wikipathways.org/20240410/ 20240410 (10 April 2024)]'''</font>
-
!All pathways
+
-
!Analysis collection pathways
+
-
|- valign="top"
+
-
|<batchDownload filetype="gpml" excludetags="Curation:Tutorial"></batchDownload>
+
-
|<batchDownload filetype="gpml" tag="Curation:AnalysisCollection"></batchDownload>
+
-
|}
+
-
'''KEGG pathways''' for select species are available in GPML format at [http://www.pathvisio.org/Download#Step_3 PathVisio.org]. ''(KEGG content current as of September 14th, 2010)''
 
-
== BioPAX ==
+
=== Vertebrates ===
-
Click on one of the links below to download all pathways in [http://www.biopax.org BioPAX] level 3 format.
+
-
{|class="prettytable"
+
* '''[http://data.wikipathways.org/20240410/gpml/wikipathways-20240410-gpml-Bos_taurus.zip Bos taurus]'''
-
!All pathways
+
* '''[http://data.wikipathways.org/20240410/gpml/wikipathways-20240410-gpml-Canis_familiaris.zip Canis familiaris]'''
-
!Analysis collection pathways
+
* '''[http://data.wikipathways.org/20240410/gpml/wikipathways-20240410-gpml-Danio_rerio.zip Danio rerio]'''
-
|- valign="top"
+
* '''[http://data.wikipathways.org/20240410/gpml/wikipathways-20240410-gpml-Equus_caballus.zip Equus caballus]'''
-
|<batchDownload filetype="owl" excludetags="Curation:Tutorial"></batchDownload>
+
* '''[http://data.wikipathways.org/20240410/gpml/wikipathways-20240410-gpml-Gallus_gallus.zip Gallus gallus]'''
-
|<batchDownload filetype="owl" tag="Curation:AnalysisCollection"></batchDownload>
+
* '''[http://data.wikipathways.org/20240410/gpml/wikipathways-20240410-gpml-Homo_sapiens.zip Homo sapiens]'''
-
|}
+
* '''[http://data.wikipathways.org/20240410/gpml/wikipathways-20240410-gpml-Mus_musculus.zip Mus musculus]'''
 +
* '''[http://data.wikipathways.org/20240410/gpml/wikipathways-20240410-gpml-Pan_troglodytes.zip Pan troglodytes]'''
 +
* '''[http://data.wikipathways.org/20240410/gpml/wikipathways-20240410-gpml-Rattus_norvegicus.zip Rattus norvegicus]'''
 +
* '''[http://data.wikipathways.org/20240410/gpml/wikipathways-20240410-gpml-Sus_scrofa.zip Sus scrofa]'''
-
== Eu.Gene ==
+
=== Invertebrates ===
-
Click on one of the links below to download all pathways in the [http://www.ducciocavalieri.org/bio.htm Eu.Gene] format (pwf). Eu.Gene is a tool for microarray analysis in context of biological pathways ([http://www.ncbi.nlm.nih.gov/sites/entrez?cmd=retrieve&db=pubmed&list_uids=17599938&dopt=AbstractPlus read more]).
+
-
{|class="prettytable"
+
* '''[http://data.wikipathways.org/20240410/gpml/wikipathways-20240410-gpml-Anopheles_gambiae.zip Anopheles gambiae]'''
-
!All pathways
+
* '''[http://data.wikipathways.org/20240410/gpml/wikipathways-20240410-gpml-Caenorhabditis_elegans.zip Caenorhabditis elegans]'''
-
!Analysis collection pathways
+
* '''[http://data.wikipathways.org/20240410/gpml/wikipathways-20240410-gpml-Drosophila_melanogaster.zip Drosophila melanogaster]'''
-
|- valign="top"
+
-
|<batchDownload filetype="pwf" excludetags="Curation:Tutorial"></batchDownload>
+
-
|<batchDownload filetype="pwf" tag="Curation:AnalysisCollection"></batchDownload>
+
-
|}
+
-
== Plain text ==
+
=== Plants ===
-
Click on one of the links below to download all pathways in plain text format. This format contains a list of all datanodes, with the identifier in the first
+
* '''[http://data.wikipathways.org/20240410/gpml/wikipathways-20240410-gpml-Arabidopsis_thaliana.zip Arabidopsis thaliana]'''
-
column, and the database system in the second column.
+
* '''[http://data.wikipathways.org/20240410/gpml/wikipathways-20240410-gpml-Hordeum_vulgare.zip Hordeum vulgare]'''
 +
* '''[http://data.wikipathways.org/20240410/gpml/wikipathways-20240410-gpml-Oryza_sativa.zip Oryza sativa]'''
 +
* '''[http://data.wikipathways.org/20240410/gpml/wikipathways-20240410-gpml-Populus_trichocarpa.zip Populus trichocarpa]'''
 +
* '''[http://data.wikipathways.org/20240410/gpml/wikipathways-20240410-gpml-Solanum_lycopersicum.zip Solanum lycopersicum]'''
 +
* '''[http://data.wikipathways.org/20240410/gpml/wikipathways-20240410-gpml-Zea_mays.zip Zea mays]'''
 +
<!-- * '''[http://data.wikipathways.org/20240410/gpml/wikipathways-20240410-gpml-Beta_vulgaris.zip Beta vulgaris]''' -->
-
{|class="prettytable"
+
=== Eukaryotic microorganisms ===
-
!All pathways
+
-
!Analysis collection pathways
+
-
|- valign="top"
+
-
|<batchDownload filetype="txt" excludetags="Curation:Tutorial"></batchDownload>
+
-
|<batchDownload filetype="txt" tag="Curation:AnalysisCollection"></batchDownload>
+
-
|}
+
-
== PDF ==
+
* '''[http://data.wikipathways.org/20240410/gpml/wikipathways-20240410-gpml-Gibberella_zeae.zip Gibberella zeae]'''
-
Click on one of the links below to download all pathways in Portable Document Format (PDF).
+
* '''[http://data.wikipathways.org/20240410/gpml/wikipathways-20240410-gpml-Saccharomyces_cerevisiae.zip Saccharomyces cerevisiae]'''
 +
* '''[http://data.wikipathways.org/20240410/gpml/wikipathways-20240410-gpml-Plasmodium_falciparum.zip Plasmodium falciparum]'''
-
{|class="prettytable"
+
=== Bacteria ===
-
!All pathways
+
-
!Analysis collection pathways
+
-
|- valign="top"
+
-
|<batchDownload filetype="pdf" excludetags="Curation:Tutorial"></batchDownload>
+
-
|<batchDownload filetype="pdf" tag="Curation:AnalysisCollection"></batchDownload>
+
-
|}
+
-
== SVG ==
+
* '''[http://data.wikipathways.org/20240410/gpml/wikipathways-20240410-gpml-Bacillus_subtilis.zip Bacillus subtilis]'''
-
Click on one of the links below to download all pathways in [http://www.w3.org/Graphics/SVG/ SVG] format. The [http://www.w3.org/Graphics/SVG/ SVG] files can be used to create high quality images for publication purposes. You can edit and convert the SVG files using [http://www.inkscape.org Inkscape].
+
* '''[http://data.wikipathways.org/20240410/gpml/wikipathways-20240410-gpml-Escherichia_coli.zip E.coli]'''
 +
* '''[http://data.wikipathways.org/20240410/gpml/wikipathways-20240410-gpml-Mycobacterium_tuberculosis.zip Mycobacterium tuberculosis]'''
-
{|class="prettytable"
 
-
!All pathways
 
-
!Analysis collection pathways
 
-
|- valign="top"
 
-
|<batchDownload filetype="svg" excludetags="Curation:Tutorial"></batchDownload>
 
-
|<batchDownload filetype="svg" tag="Curation:AnalysisCollection"></batchDownload>
 
-
|}
 
-
== PNG ==
+
== Programmatic Access ==
-
Click on one of the links below to download all pathways in the Portable Network Graphics (png) format.
+
The archive of current and past collections of pathways in various formats at data.wikipathways.org is accessible programmatically as well. Depending on your preferences, there are many ways to identify and download the collection you need.
-
{|class="prettytable"
+
''Note: Our files contain the date of creation in their names so that you can be sure which collection your are using and to avoid overwriting local copies of these files.''
-
!All pathways
+
-
!Analysis collection pathways
+
-
|- valign="top"
+
-
|<batchDownload filetype="png" excludetags="Curation:Tutorial"></batchDownload>
+
-
|<batchDownload filetype="png" tag="Curation:AnalysisCollection"></batchDownload>
+
-
|}
+
-
== Flatfile ==
+
# '''[https://github.com/wikipathways/rwikipathways rWikiPathways]''' is an R package that provides an helper function called ''downloadPathwayArchive'' that will retrieve the latest file for you per species and format, e.g.,  <pre>downloadPathwayArchive(organism="Mus musculus”, format=‘gmt’)</pre>
-
Click on one of the links below to download all pathways in either a tab-delimited or HTML table format. The output includes gene, protein and small molecule contents. You can choose between "''original''" and "''mapped''" versions, which refer to whether only the original identifiers are provided or if you want them mapped to all available identifier systems.
+
# '''Filename pattern''' allows you to infer the filename of the latest collection given the current date. For example, since we always release our archive collections on the 10th of each month, you know that the latest filename is the nearest prior date matching that pattern, e.g., 20180910 would be the current file from Sep 10 to Oct 10, 2018. ''Caution: this might break if for some unforeseen reason we are unable to produce the archive on schedule.''
 +
# '''Bash scripting''' allows you to scrape the currently available filenames and guarantee that you are getting the latest file no matter what the name might be.  Here is an example of a one-liner to get a list of all the current GMT files: <pre>echo "cat //html/body/div/table/tbody/tr/td/a" |  xmllint --html --shell http://data.wikipathways.org/current/gmt/ | grep -o -E ">(.*gmt)<" | sed -E 's/(<|>)//g'</pre> And here is a version that would return the latest GMT for mouse: <pre>echo "cat //html/body/div/table/tbody/tr/td/a" |  xmllint --html --shell http://data.wikipathways.org/current/gmt/ | grep -o -E ">.*Mus_musculus.gmt<" | sed -E 's/(<|>)//g'</pre>
 +
-
* Anopheles gambiae  (''original'': [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Anopheles%20gambiae&mapping=off&output=tab tab], [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Anopheles%20gambiae&mapping=off&output=html html]) (''mapped'': [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Anopheles%20gambiae&output=tab tab], [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Anopheles%20gambiae&output=html html])
+
== Other Collections ==
-
* Arabidopsis thaliana  (''original'': [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Arabidopsis%20thaliana&mapping=off&output=tab tab], [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Arabidopsis%20thaliana&mapping=off&output=html html]) (''mapped'': [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Arabidopsis%20thaliana&output=tab tab], [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Arabidopsis%20thaliana&output=html html])
+
 
-
* Bos taurus (''original'': [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Bos%20taurus&mapping=off&output=tab tab], [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Bos%20taurus&mapping=off&output=html html]) (''mapped'': [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Bos%20taurus&output=tab tab], [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Bos%20taurus&output=html html])
+
<font size=3>
-
* Caenorhabditis elegans (''original'': [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Caenorhabditis%20elegans&mapping=off&output=tab tab], [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Caenorhabditis%20elegans&mapping=off&output=html html]) (''mapped'': [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Caenorhabditis%20elegans&output=tab tab], [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Caenorhabditis%20elegans&output=html html])
+
* [http://data.wikipathways.org Prior monthly releases]
-
* Danio rerio (''original'': [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Danio%20rerio&mapping=off&output=tab tab], [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Danio%20rerio&mapping=off&output=html html]) (''mapped'': [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Danio%20rerio&output=tab tab], [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Danio%20rerio&output=html html])
+
* [[Daily_Download|Daily curated releases]]
-
* Drosophila melanogaster (''original'': [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Drosophila%20melanogaster&mapping=off&output=tab tab], [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Drosophila%20melanogaster&mapping=off&output=html html]) (''mapped'': [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Drosophila%20melanogaster&output=tab tab], [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Drosophila%20melanogaster&output=html html])
+
* [http://www.wikipathways.org//wpi/batchDownload.php?species=Homo%20sapiens&fileType=gpml&tag=Curation:Reactome_Approved  Reactome Human Collection]
-
* Equus caballus  (''original'': [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Equus%20caballus&mapping=off&output=tab tab], [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Equus%20caballus&mapping=off&output=html html]) (''mapped'': [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Equus%20caballus&output=tab tab], [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Equus%20caballus&output=html html])
+
* [http://data.wikipathways.org/current/gmt Gene lists per pathway (GMT)]
-
* Gallus gallus  (''original'': [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Gallus%20gallus&mapping=off&output=tab tab], [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Gallus%20gallus&mapping=off&output=html html])  (''mapped'': [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Gallus%20gallus&output=tab tab], [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Gallus%20gallus&output=html html])
+
* [http://data.wikipathways.org/current/svg Pathway image files (SVG)]
-
* Homo sapiens (''original'': [{{SERVER}}{{SCRIPTPATH}}/wpi/cache/wikipathways_native_data_Homo%20sapiens.tab tab], [{{SERVER}}{{SCRIPTPATH}}/wpi/cache/wikipathways_native_data_Homo%20sapiens.html html]) (''mapped'': [{{SERVER}}{{SCRIPTPATH}}/wpi/cache/wikipathways_data_Homo%20sapiens.tab tab], [{{SERVER}}{{SCRIPTPATH}}/wpi/cache/wikipathways_data_Homo%20sapiens.html html])
+
* [http://data.wikipathways.org/current/rdf Linked data files (RDF)]
-
* Mus Musculus (''original'': [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Mus%20musculus&mapping=off&output=tab tab], [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Mus%20musculus&mapping=off&output=html html]) (''mapped'': [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Mus%20musculus&output=tab tab], [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Mus%20musculus&output=html html])
+
* [http://data.wikipathways.org/current/index Database index files (index)]
-
* Oryza sativa (''original'': [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Oryza%20sativa&mapping=off&output=tab tab], [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Oryza%20sativa&mapping=off&output=html html]) (''mapped'': [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Oryza%20sativa&output=tab tab], [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Oryza%20sativa&output=html html])
+
* [[Help:FileFormats|Other file formats]]
-
* Pan troglodytes (''original'': [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Pan%20troglodytes&mapping=off&output=tab tab], [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Pan%20troglodytes&mapping=off&output=html html]) (''mapped'': [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Pan%20troglodytes&output=tab tab], [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Pan%20troglodytes&output=html html])
+
</font>
-
* Rattus norvegicus (''original'': [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Rattus%20norvegicus&mapping=off&output=tab tab], [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Rattus%20norvegicus&mapping=off&output=html html]) (''mapped'': [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Rattus%20norvegicus&output=tab tab], [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Rattus%20norvegicus&output=html html])
+
-
* Saccharomyces cerevisiae (''original'': [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Saccharomyces%20cerevisiae&mapping=off&output=tab tab], [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Saccharomyces%20cerevisiae&mapping=off&output=html html]) (''mapped'': [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Saccharomyces%20cerevisiae&output=tab tab], [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Saccharomyces%20cerevisiae&output=html html])
+
-
* Zea mays (''original'': [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Zea%20mays&mapping=off&output=tab tab], [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Zea%20mays&mapping=off&output=html html]) (''mapped'': [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Zea%20mays&output=tab tab], [{{SERVER}}{{SCRIPTPATH}}/wpi/pathway_content_flatfile.php?species=Zea%20mays&output=html html])
+

Current revision

Versioned Releases

Each month we release an updated set of pathways in various data and image formats. These pathways have been reviewed and tagged as approved, and are considered ready for analysis and data overlays.


Current version: 20240410 (10 April 2024)


Vertebrates

Invertebrates

Plants

Eukaryotic microorganisms

Bacteria


Programmatic Access

The archive of current and past collections of pathways in various formats at data.wikipathways.org is accessible programmatically as well. Depending on your preferences, there are many ways to identify and download the collection you need.

Note: Our files contain the date of creation in their names so that you can be sure which collection your are using and to avoid overwriting local copies of these files.

  1. rWikiPathways is an R package that provides an helper function called downloadPathwayArchive that will retrieve the latest file for you per species and format, e.g.,
    downloadPathwayArchive(organism="Mus musculus”, format=‘gmt’)
  2. Filename pattern allows you to infer the filename of the latest collection given the current date. For example, since we always release our archive collections on the 10th of each month, you know that the latest filename is the nearest prior date matching that pattern, e.g., 20180910 would be the current file from Sep 10 to Oct 10, 2018. Caution: this might break if for some unforeseen reason we are unable to produce the archive on schedule.
  3. Bash scripting allows you to scrape the currently available filenames and guarantee that you are getting the latest file no matter what the name might be. Here is an example of a one-liner to get a list of all the current GMT files:
    echo "cat //html/body/div/table/tbody/tr/td/a" |  xmllint --html --shell http://data.wikipathways.org/current/gmt/ | grep -o -E ">(.*gmt)<" | sed -E 's/(<|>)//g'
    And here is a version that would return the latest GMT for mouse:
    echo "cat //html/body/div/table/tbody/tr/td/a" |  xmllint --html --shell http://data.wikipathways.org/current/gmt/ | grep -o -E ">.*Mus_musculus.gmt<" | sed -E 's/(<|>)//g'


Other Collections

Personal tools