Help:DataVisualizationInCytoscape

From WikiPathways

Revision as of 22:03, 12 January 2016 by Khanspers (Talk | contribs)
Jump to: navigation, search

UNDER CONSTRUCTION

This tutorial describes how to visualize experimental data on WikiPathways in Cytoscape. After importing pathways from WikiPathways, they are mapped to the same identifier system as used by the data, using BridgeDb. Data is imported, and a visual style is created to visualize the data on pathways.

Contents

Installation

Cytoscape

Cytoscape Apps

  • In Cytoscape, open the App Manager under Apps > App Manager.

  • In the search field, type WikiPathways to search for the WikiPathways app. Select the WikiPathways app from the results and click Install.
  • Next, find and install the BridgeDb app in the same way. When the installation is complete, click Close to exit the App Manager.

Alternatively, Cytoscape apps can be installed directly from the Cytoscape App Store. If Cytoscape is open on your computer, you can click Install on the relevant app page, and the app will install.

WikiPathways app

BridgeDb app

Pathway import and mapping

Import pathways from WikiPathways

Based on over-representation analysis, two pathways have been identified as significant for this dataset. To start, we will open these pathways.

  • Go to File > Import > Network > Public Databases.... In the Data Source drop-down, select WikiPathways.

  • Check the box for Only and select Mus musculus in the species drop-down.
  • In the search field, type in Electron Transport Chain. Click Enter to search.
  • In the pathway results, select the Electron Transport Chain and either double-click on the entry to open, or click ont he Import as Pathway button at the bottom right. Once opened, the pathway should look like this:

Once the pathway opens in Cytoscape, it should look like this:


  • Repeat the steps above to import the Proteasome degradation pathway.
  • You should now have both pathways open in Cytoscape:

Map pathways with BridgeDb

Since the experimental data is annotated with Ensembl identifiers and the pathways have a mix of identifiers, the pathways need to be mapped to Ensembl before the data can be visualized. The BridgeDb app does exactly this, given a mapping file. Before performing the mapping, we need to import the appropriate mapping file to BridegDb.

  • Open the BridgeDb app under Apps > BridegDb > Manage ID Mapping Resources.

BridgeDb works with mapping information from various sources. For example, you can either download a database here, and after extracting it select it under Databases in the ID Mapping Resources Configuration interface. You can also supply a local or remote mapping file. For this tutorial, we will use Web Services for the mappings.

  • Click on the Web Services entry in the list of ID mapping sources.
  • In the Webservice-based ID Mapping Resources Configuration interface, select BridegDb web service in the drop-down.

  • In the Base URL of BridgeDb web service drop-down, select the entry for mouse. Click OK to continue.

You will see a listing of supported ID systems. We are now ready to perform the mapping.


  • Close the ID Mapping Source Configuration interface.

We are now ready to map the identifiers from the pathways to Ensembl. BridgeDb works best when you know the type of identifier to map from (Source).

  • To find out which identifier type(s) is used in your pathway, first select all nodes on the pathway by clicking and dragging to select the entire pathway. Selected nodes will turn yellow.
  • Immediately below the network view in Cytoscape is the Table Panel. Expand the table either by clicking dragging on the table border, or by click on the Float Window icon on the right end of the Table Panel border.
  • The identifier types for data nodes are listed in the XrefDatasource column. The pathways in this tutorial are annotated with a mix of Entrez Gene, MGI and Ensembl identifiers.

  • With one of the networks selected, go to Apps > BridegDb > Map Identifiers.
  • In the BridgeDb interface, open the drop-down under Source ID Type(s), and select Ensembl, Entrez Gene and MGI in the list of identifier types.
  • In the Source Column in Node Table, select XrefId, which is the column that contains the identifier for WikiPathways data nodes.
  • In the Target ID Type, select Ensembl. The Target Column in Node Table will default to Ensembl, which will be a new column in the Table Panel for the pathway.
  • Finally, in the All Target ID(s) or First Only? drop-down, select Keep the first target ID only.

To summarize, we are mapping Entrez Gene, MGI and Ensembl identifiers to Ensembl identifiers. It may seem redundant to map Ensembl - > Ensembl, but BridgeDb works by placing the new identifiers (targets) in a new column, which we will use later to link to our data. We can not designate more than one column to link to, so existing Ensembl identifiers in the pathway will not be linked unless we effectively transfer them to this new column by performing the same-to-same mapping. The interface should now look like this:


  • Click OK to run the mapping. When the mapping is complete, you will get a message stating how many identifiers were converted, and a question about running additional mapping. Click No to close BridgeDb.
  • Repeat the mapping process with the Urea cycle pathway.

Data import

Download data

For this tutorial, we will use an example dataset describing intestinal changes in response to fasting in mice. The data gene expression array data, and is in the form of a text file.

The experimental data for this tutorial can be found here: Starvation_dataset_trans.txt.

Import data to Cytoscape

  • To import the experimental data, select File > Import > Table > File.... Select the starvation_dataset_trans.txt file and click Open.
  • Under Where to Import Table Data, choose To selected networks only.
  • To select both pathways, click the Select All button.
  • Under Key Column for Networks, select Ensembl. This is the column created during BridgeDb mapping.
  • In the Preview section, click the expand symbol at the top of the ensid column. This will open an interface where you can define the details of the column, such as data type etc.
  • For the purposes of this tutorial, click the key symbol. This defines the ensid column as being the key column, which will be used to link the data to the pathways.

  • Click OK to import the data.

Data visualization on pathways

The data and pathways are now both imported and linked and we can visualize the data on the pathways. For this tutorial, we will create a color scheme for all significant nodes, based on ANOVA values for a comparison across three experimental groups.

  • In the Control Panel on the left side of Cytoscape, select the Style tab.
  • In the list of Properties click on the triangle next to Fill Color to expand.
  • Click on Fill Color next to Column. In the drop-down, select ANOVA to select the ANOVA column from the data.
  • Under MappingType, select Continous Mapping.

  • Double-click on the default black and white continuum to customize it.
  • First, set the min/max values by clicking the Set Min and Max... button. Set Min to 0 and Max to 1.
  • Next, double-click the large triangle at the left and select a green color. Repeat with the smaller triangle handle right next to it.
  • Move the smaller left-most handle to the right, to the 0.05 position. Once the handle is selected, its position can be set in the Handle Position field.
  • For the large right-most triangle, set the color to a pale gray color.
  • Next, select the small right-most triangle and click Delete.

You now have a color scheme where nodes with an ANOVA value of 0.05 or below is colored green, and all other measured nodes are colored gray. Nodes that are not found will simply be white.


Export visualizations

Any pathway or network visualization in Cytoscape can be exported to graphical formats, which can easily be included in publications and websites.

  • To export the pathway visualization, select File > Export > Network View as Graphics...
  • Select the export file format, and designate a location name filename for the export.

Personal tools