---
output: github_document
---
```{r setup, include=FALSE}
knitr::opts_chunk$set(echo = TRUE)
```
# PhantasusLite
phantasusLite is a tool that integrates work with RNA-Seq count matrices into a single, fast R pipeline. The package loads datasets from GEO by their GSE identifier and provides URL access to a remote repository of archs4, archs4_zoo and dee2 HDF5 files, from which the count matrix is retrieved. The result is an ExpressionSet with the expression matrix filled in, ready for downstream differential expression analysis.
## Installation
It is recommended to install the release version of the package from Bioconductor using the following commands:
```{r eval=FALSE}
if (!require("BiocManager", quietly = TRUE))
    install.packages("BiocManager")
BiocManager::install("phantasusLite")
```
Alternatively, the most recent version of the package can be installed from the GitHub repository:
```{r message=FALSE, eval=FALSE}
library(devtools)
install_github("ctlab/phantasusLite")
```
Note that the latest version depends on `rhdf5client >= 1.25.1` from Bioconductor 3.19. On older systems it can be more convenient to install `rhdf5client` from GitHub:
```{r message=FALSE, eval=FALSE}
library(devtools)
install_github("vjcitn/rhdf5client")
```
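To check whether the installed `rhdf5client` already satisfies the version requirement, a minimal sketch along these lines may help. The helper `has_min_version` is hypothetical (it is not part of phantasusLite) and relies only on base R and `utils`:

```r
# Hypothetical helper (not part of phantasusLite):
# TRUE if `pkg` is installed at version >= `required`, FALSE otherwise.
has_min_version <- function(pkg, required) {
  if (!requireNamespace(pkg, quietly = TRUE)) {
    return(FALSE)
  }
  utils::packageVersion(pkg) >= package_version(required)
}

if (!has_min_version("rhdf5client", "1.25.1")) {
  message("rhdf5client >= 1.25.1 not found; consider one of the install options above.")
}
```

If the message appears, either of the installation routes described above should bring `rhdf5client` up to date.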
## Dependencies
To run the code you need:
- `GEOquery`
- `rhdf5client`
- `phantasusLite`
```{r message=FALSE, warning=FALSE}
library(GEOquery)
library(rhdf5client)
library(phantasusLite)
```
## Quick start
The example below walks through a typical session.
First, load the ExpressionSet from GEO:
```{r message=FALSE}
ess <- getGEO("GSE53053")
es <- ess[[1]]
```
An ExpressionSet loaded from GEO doesn't contain the expression matrix: `exprs(es)` is empty.
```{r}
head(exprs(es))
```
The function `loadCountsFromHSDS` returns an ExpressionSet with the expression matrix filled in, so `exprs(es)` now contains the counts. The default remote repository URL is '<https://alserglab.wustl.edu/hsds/?domain=/counts>'.
```{r}
# `url` is explicitly specified for illustration purposes and can be omitted
es <- loadCountsFromHSDS(es, url = 'https://alserglab.wustl.edu/hsds/?domain=/counts')
head(exprs(es))
```
The available gene annotations are also filled in:
```{r}
head(fData(es))
```
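As a quick sanity check that counts were actually loaded, the matrix can be summarised directly. The sketch below uses a hypothetical helper, `describe_counts`, which is not part of phantasusLite; it accepts any numeric matrix (for example `exprs(es)`) and is shown here on a made-up toy matrix so that it runs without network access:

```r
# Hypothetical helper (not part of phantasusLite): summarise a count matrix.
describe_counts <- function(counts) {
  stopifnot(is.matrix(counts))
  list(
    n_genes = nrow(counts),              # rows are genes
    n_samples = ncol(counts),            # columns are samples
    is_empty = nrow(counts) == 0L,       # TRUE when no counts were loaded
    library_sizes = colSums(counts)      # total counts per sample
  )
}

# Toy matrix standing in for exprs(es); the values are invented.
toy <- matrix(c(10, 0, 5, 3, 7, 1), nrow = 3,
              dimnames = list(c("g1", "g2", "g3"), c("s1", "s2")))
info <- describe_counts(toy)
info$is_empty
info$library_sizes
```

In a real session, `describe_counts(exprs(es))` would be expected to report `is_empty` as `TRUE` before `loadCountsFromHSDS` is called and `FALSE` afterwards, assuming the dataset is present in the remote repository.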