The START app allows users to visualize RNA-seq data starting with count data.
Features
Visualize your data:
Data Format
Count or Expression Data
Analyzed Data
TIP: Save Data for Future UploadAfter uploading your data to START, click red button to download an .RData file to upload your data to START with one click. Next time use the “Input Data” tab –> “START RData file” option. More Help and InfoAdditional help information and more detailed instructions are provided under the “Instructions” tab. App InfoThe START app has been developed by Jessica Minnier, Jiri Sklenar, Anthony Paul Barnes, and Jonathan Nelson of Oregon Health & Science University, Knight Cardiovascular Institute and School of Public Health. Please cite our app: The source code of START is available on Github. We would appreciate reports of any issues with the app via the issues option of Github or by emailing start.app.help-at-gmail.com. group1 group2 group3 group2/group1 This panel constructs box and whisker plots of log2(CPM) or CPM values with dot plots superimposed to show the raw data. When there are three data points the median and interquartile ranges are precisely the data values. Medians are denoted by horizontal lines and averages are denoted by open diamonds.FiltersVisualization SettingsThe START app allows users to visualize RNA-seq data starting with count data.
InstructionsThe app is hosted on the website: https://kcvi./START/ Code can be found on github: https://github.com/jminnier/STARTapp To run this app locally on your machine, download R or RStudio and run the following commands once to set up the environment:
You may now run the shiny app with just one command in R:
Input DataYou may use this app by
Data Format
Count or Expression Data
Count data contains read counts for each gene for each sample, along with gene identifiers. Analysis: When raw counts are uploaded, the data is then analyzed by the app. The app uses the voom method from the ‘limma’ Bioconductor package to transform the raw counts into logged and normalized intensity values. These values are then analyzed via linear regression where gene intensity is regressed on the group factor. P-values from all pairwise regression tests for group effect are computed and Benjamini-Hochberg false discovery rate adjusted p-values are computed for each pairwise comparison. The “log2cpm” values are the log2-counts-per-million values. The “log2cpm_voom” values are the normalized logcpm values from the voom method. Both methods use an offset of 0.5, which means 0.5 is added to all count values before normalizing (in the case of voom) and log transforming so that 0 counts have non infinite values. Example file: https://github.com/jminnier/STARTapp/blob/master/data/examplecounts_short.csv Analyzed Data
Analyzed data must contain some kind of expression measure for each sample (i.e. counts, normalized intensities, CPMs), and a set of p-values with corresponding fold changes for those p-values. For instance, if you have a p-value for the comparison of group1 vs group2, you can upload the observed fold change or log2(fold change) between group1 vs group2. If you have a more complex design and do not have fold changes readily available, you may upload the test statistics or other similar measures of effect size as placeholders. The fold changes are mainly used in the volcano plots. We recommend uploading p-values that are adjusted for multiple comparisons (such as q-values from the qvalue package, or adjusted p-values from p.adjust() function in R). Example file: https://github.com/jminnier/STARTapp/blob/master/data/exampleanalysisres_short.csv TIP: Save Data for Future UploadAfter submitting a raw data or analyzed file, you may download the .csv file with the analysis results for your own use (or to upload as an “analyzed data”) or more conveniently click the button “Save Results as RData File for Future Upload” so that you may easily and quickly upload your data to the START app in the future under the “RData from previous START upload” option with one click. After uploading your data to START, click red button to download an .RData file to upload your data to START with one click. Next time use the “Input Data” tab –> “START RData file” option. VisualizationsGroup PlotsPCA PlotThis plot uses Principal Component Analysis (PCA) to calculate the principal components of the expression data using data from all genes. Euclidean distances between expression values are used. Samples are projected on the first two principal components (PCs) and the percent variance explained by those PCs are displayed along the x and y axes. Ideally your samples will cluster by group identifier. Sample Distance HeatmapThis plot displays unsupervised clustering of the Euclidean distances between samples using data from all genes. Again your data should cluster by group. Analysis PlotsThese plots use the p-values and fold changes to visualize your data. Volcano PlotThis is a scatter plot log fold changes vs –log10(p-values) so that genes with the largest fold changes and smallest p-values are shown on the extreme top left and top right of the plot. Hover over points to see which gene is represented by each point. (https://en./wiki/Volcano_plot_(statistics)) Scatter PlotThis is a scatter plot of average gene expression in one group against another group. This allows the viewer to observe which genes have the largest differences between two groups. The smallest distances will be along the diagonal line, and points far away from the diagonal show the most differences. Hover over points to see which gene is represented by each point. Gene Expression BoxplotUse the search bar to look up genes in your data set. For selected gene(s) the stripchart (dotplot) and boxplots of the expression values are presented for each group. You may plot one or multiple genes along side each other. Hover over points for more information about the data. HeatmapA heatmap of expression values are shown, with genes and samples arranged by unsupervised clustering. You may filter on test results as well as P-value cutoffs. By default the top 100 genes (with lowest P-values) are shown. News & ReleasesVersion HistoryVersion 1.0.0. September 23, 2016. START is published and the first publically released version is online. Version 0.9.0001 May 26, 2014. First development version (previously KCVI Heart Atlas) is online via alpha! NewsSeptember 23, 2016. START App manuscript is accepted for publication in Bioinformatics! Please read our manuscript and cite our app when you use it for your work:
Shinyapps.io Terms & ConditionsTerms of UseIf you have concerns about the terms of use for this web hosted application, please run the app locally on your computer. See the “Instructions” tab for more information on this. Shinyapps.io Terms of UseThis application is hosted on a Shinyapps.io server https://www./. By using this app you are agreeing to the terms of use as described by Shinnyaps.io: https://www./about/shinyapps-terms-use/ We (the authors and maintainers of this app) will not save your data on our servers. However, as the Shinyapps server is not HIPAA compliant, you must refrain from uploading protected health information or confidential data with this app. You may instead download the code and run the app locally on your private computer and network (see above). We are not responsible for the confidentiality, availability, security, loss, misuse or misappropriation of any data you submit to this application. From the Shinyapps terms of use (https://www./about/shinyapps-terms-use/): “If you choose to upload data to an application you are using via the RStudio Service, you acknowledge and agree you are giving certain legal rights to the licensor of the application to process and otherwise use your data. Please carefully review any license terms accompanying any application to which you submit your data for the legal rights which you are giving the application licensor. Further, RStudio does not claim ownership of your data; however, you hereby grant RStudio a worldwide, perpetual, irrevocable, royalty-free, fully paid up, transferable and non-exclusive license, as applicable, to use and copy your data in connection with making your data available to the application (and the licensor of such application) to which you have submitted your data. You acknowledge and agree that even if you remove your data from the RStudio Service, your data may have been downloaded by, and remain accessible to, the licensors of those applications to which you submitted your data. Accordingly, do not submit data which you desire to remain confidential or which you wish to limit the right to access or use. You should never submit to the RStudio Service any data which consists of personally identifiable information, credit card information, or protected health information, as such terms are defined by relevant laws, rules, and regulations. RSTUDIO IS NOT RESPONSIBLE FOR THE CONFIDENTIALITY, AVAILABILITY, SECURITY, LOSS, MISUSE OR MISAPPROPRIATION OF ANY DATA YOU SUBMIT TO THE RSTUDIO SERVICE OR ANY APPLICATION MADE AVAILABLE VIA THE RSTUDIO SERVICE.” |
|