WoPPER - Webserver for Position Related data analysis of gene Expression in Prokaryotes -is a web-tool for the analysis of bacteria transcriptomic data. Gene expression is integrated with genomic annotations to reveal connections between transcription regulation and structural organization of bacterial genomes.
Both Microarray and RNA-Seq data can be used as inputs for WoPPER. The analysis produces, as outputs, clusters of positionally related genes having the same trend of differential modulation of expression and graphical outputs of the modulated clusters.
WoPPER is implemented in AngularJS, a structural framework for dynamic web applications, on the basis of the R-package PREDA, developed by Ferrari F. and co-workers in 2011 1 to identify regional variations in genomic data.
WoPPER is available without registration at http://WoPPER.ba.itb.cnr.it/ or https://WoPPER.ba.itb.cnr.it/ - SSL secure connection.
WoPPER is based on the release 3.4 of PREDA package (Bioconductor Preda , last update: 17/10/2016). Since PREDA is currently under constant improvement and development, we are planning to keep WoPPER updated to the future releases of PREDA package.
WoPPER, based on AngularJS framework, has been tested with major modern browsers such as Google Chrome (5+), Mozilla Firefox (3+), Safari, and Safari for iOs (10+), as well as Microsoft Internet Explorer (11+) and Microsoft Edge (AngularJS FAQ ). Known Issues: in Internet Explorer PNG/SVG downloads don't work. Thus, WoPPER should work on any recently updated browser. For a better usage, we recommend users to access WoPPER using Chrome or Firefox from a computer with at least 4G RAM and 1200 x 800 screen resolution.
If you encounter an issue with your browser, try updating it to the latest version (Browse Happy ). If you still have problems, contact us: bigstaff@ba.itb.cnr.it
At the top of the WoPPER page, the first two drop-down buttons allow users to have a look at some datasets that can be used as examples of WoPPER experiment and functionalities and to show the executed experiments whereas the last button allows to set up form input maintenance.
The complete step-by-step analysis of the same datasets starting from GED files and (in Helicobacter G27 example) a custom annotation can be found in the Help/Tutorial ⟹ Tutorials page.
For each experiment, WoPPER allows the user to:
Mandatory inputs for WoPPER execution are:
WoPPER requires the user to upload at least one file containing a list of gene expression fold-changes (in log2 format). Custom genome annotation can be provided in different file formats, as long as they are built according to some specifics (see below, in the “Genome Annotation” section).
A file containing gene expression data (GED) for either RNA-Seq or microarray experiment is the only input that must be provided to WoPPER in all settings.
Clicking on the “Preview” button () allows quick check of the provided GED file, in order to correctly indicate column numbers for Gene Name and Log2 fold change and eventual header lines.
Note: GED file must contain the unfiltered gene expression values, that is the Log2 fold change of all the genes, without any previous selection. Loading such kind of data (e.g.: only differentially expressed genes) will cause WoPPER to give meaningless results.
Note: after loading the GED file, users must Validate it clicking on the corresponding radio-button. When the GED file is valid, the radio-button turns green, otherwise, it turns red.
GED file requires checking | Valid GED file | Non-valid GED file |
---|---|---|
Genome annotation can be provided either as a custom file (Custom Genome Annotation tab) or by selecting one of the annotations pre-loaded in the internal WoPPER database (derived from bacterial genome annotations by the National Center for Biotechnology Information of the National Institute of Health) (NCBI Genome Annotation).
Users can select one of the pre-loaded NCBI bacteria genome annotations (NCBI Ftp /genomes/archive/old_refseq/Bacteria/ , last updated on 13/06/2013), containing complete information about 2780 bacterial strains (including 2970 chromosomes and 2188 plasmids) 2. Organisms with more than one chromosome are listed with a separated record for each chromosome (named as “chr1”, “chr2”, etc.). Similarly, plasmids for each reference (where present) have been listed separately and can be used in case users have Gene Expression Data also from plasmid genes.
In order to select the proper strain from the drop-down menu, the user has to type just 3 or more characters of the strain name to activate the automatic search in the database.
Clicking on the “Preview” button () allows quick check of the Genome Annotation selected.
Note: check that the “Gene Name” field contains the same values as the GED file, otherwise WoPPER will not process the data.
Note: organisms with more than one chromosome or containing plasmids must be analyzed one chromosome-at-a-time, selecting the correct genome annotation from the database.
Alternatively, users can provide their own preferred genome annotation, provided the custom annotation file fulfills the specific requirements listed below.
Note: organisms with more than one chromosome or containing plasmids must be analyzed one chromosome-at-a-time. Thus, when building custom genome annotations, users have to provide separate files for each chromosome and plasmid.
Gene annotation in Browser Extensible Data format (BED) must be provided according to the UCSC standard, with at least 6 columns (also called “BED6” format) (https://genome.ucsc.edu/FAQ/FAQformat#format1).
BED file fields must contain the following information:
An example of a valid Genome annotation BED6 file for use in WoPPER is reported below.
Alternatively, annotations can be provided also in the General Feature Format (GFF) format, which has nine required fields that must be tab-separated. Please, refer to https://genome.ucsc.edu/FAQ/FAQformat#format3 and http://gmod.org/wiki/GFF3 for the complete description of this format.
GFF file fields must contain the following information:
Note: Please check that records defining gene features have a pair: “locus_tag = LOCUS_NAME” in the “attributes” section (column 9), since WoPPER infers the gene name from this tag-value pair.
Note: Valid GFF files are those downloaded from NCBI (https://www.ncbi.nlm.nih.gov/) and Patric (https://www.patricbrc.org/) and custom genome files formatted accordingly. Please, be sure to upload only plain text files to WoPPER since it does not accept compressed formats (e.g.: zip, gzip or bzip archives).
Example of a GFF downloaded from NCBI
Example of a GFF downloaded from Patric
Finally, annotations can be provided also as a separated-columns text. Column separator can be chosen among TAB, SPACE, comma (“,”) or semicolon (“;”). This file must have 5 columns.
CSV file fields must contain the following information:
An example of a valid Genome annotation CSV file for use in WoPPER is reported below.
Note: The Separated Strands Analysis could not be reliable for small bacterial genomes (indicatively, having less than 2000 genes), since separating the analysis would result in WoPPER building its statistical model on too few points and originating inaccurate clusters.
Q-value input mask | Separated Strands Analysis button |
---|---|
Once all the mandatory inputs have been provided, WoPPER can proceed to execution step.
All inputs correctly provided | |
---|---|
Some input missing or incorrect | |
A new page will show information about:
WoPPER provides the user three different types of outputs that summarize all the results obtained from the application of the PREDA analysis on the input dataset. Each output type is described briefly below.
Tabular output represents a text version of WoPPER results. A list of the identified clusters is available, as well as the possibility to sort, filter and download the cluster information. An example of tabular output is reported in figure 1. Numbers in the figure correspond to the related numbered list entry below, describing each field.
Figure 1: example of WoPPER Tabular Output.
The circular plot represents the bacterial chromosome as a circle in which genomic features and gene cluster determined by WoPPER are visualized. This is particularly useful to highlight cluster densities, location and strandness all in one plot. The image can be easily downloaded for use in publications or presentations.
An example of circular plot is reported in Figure 2. Numbers in the figure correspond to the related numbered list entry below, describing each field.
Figure 2: example of WoPPER Circular Plot Output.
The linear graph represents the bacterial chromosome in linear coordinates and reports the average Log2 fold change for gene clusters. An example of tabular output is reported in Figure 3. Numbers in the figure correspond to the related numbered list entry below, describing each field.
Figure 3: example of WoPPER Linear Graph Output.