Cytotidyr

Cytotidyr is a wrapper for the CytobankAPI package which allows for all steps of a cytobank preprocessing workflow to be reproduced in R, such that you can go from an experiment on cytobank to a flowFrame or data.frame with all your preprocessed events as quickly as possible. It also includes a simple shiny application for converting data on cytobank into csv files, which can easily be adapted to other applications. Cytotidyr provides a set of function for applying the following preprocessing steps from cytobank:

Data scaling
Panel assignment
Compensation
Gating on populations
Sample tagging

All of these steps are designed to be applied to flowFrame or flowSet objects as defined in the flowCore package allowing it to integrate with existing analysis pipelines. New methods for the as.data.frame() function in base R also allow these flowCore objects to be converted to data.frames or tibbles consisting of tidy data for integration with tidyverse based workflows.

Getting Started

These instructions will get you a copy of the project up and running on your local machine for development and testing purposes. See deployment for notes on how to deploy the project on a live system.

Prerequisites

Cytotidyr was developed for R 3.6+. In order to fully utilize the features of cytotidyr, you'll need to install and load the flowCore and CytobankAPI pacakges. These are also dependencies, so if you install cytotidyr flowCore and CytobankAPI should be automatically installed.

if (!requireNamespace("BiocManager", quietly = TRUE))
install.packages("BiocManager")
BiocManager::install("flowCore")

install("CytobankAPI")

Installation

A step by step series of examples that tell you how to get a development env running

Say what the step will be

#install.packages("devtools")
devtools::install_github("bjreisman/cytotidyr")

Getting Started

First we'll load the dependencies neccessary to run this example:

library(cytotidyr)
library(flowCore)
library(CytobankAPI)
library(scico)
library(tidyverse)
library(flowWorkspace)
library(CytoML)

Next I'll import a cytobank experiment from cytobank using an API token (under "account settings") and the experiment ID ("from the experiment URL").

The token below is no longer valid, but I've saved the output and we can load it two chunks below:

token <- "eyJ0eXAiOiJKV1QiLCJhbGciOiJIUzI1NiJ9.eyJqdGkiOiI4ZjA2OTA5Y2FkZTUyZDM0OTIxNDE4ZjMxZTljMjc0MiIsImV4cCI6MTU4MDQ0MjYzNSwidXNlcl9pZCI6MTQ3LCJhdWQiOiJjeXRvYmFua19hcGlfdjFfdXNlcnMiLCJpYXQiOjE1ODA0MTM4MzUsImlzcyI6Imh0dHBzOi8vdmFuZGVyYmlsdC5jeXRvYmFuay5vcmcvIiwibmJmIjoxNTgwNDEzODM1LCJzdWIiOiJjeXRvYmFua19hcGlfdjEifQ.QIsddGuaaE6PUef1DveyyKChwQ0lAuQCt_xUTxVPR-E"

cyto_session <- CytobankAPI::authenticate("vanderbilt", auth_token = token)
experiment.id <- 29564

exp_info <- fetchCytobankExperiment(cyto_session = cyto_session, experiment.id)
saveRDS(exp_info, "exp_info_sample.rds")

In order to minimize the size of the API calls, fetchCytobankExperimet doesn't import the actual FCS files. In order to load those, we'll need to manually download them using CytobankAPI and read them in as FCS files. Again this requires access to a specific experiment, so we'll load a previously loaded version in the next chunk.

fcspath <- fcs_files.download_zip(cyto_session, experiment.id, exp_info$fcs_files$originalId)
fcspath_unzipped <- unzip(fcspath)
myflowset <- read.flowSet(fcspath_unzipped)

This chunk will load the same data as above, but will actually execute.

exp_info <- readRDS(system.file("extdata", "exp_info_sample.rds", package = "cytotidyr"))
exp_info$gates.path <- system.file("extdata", basename(exp_info$gates.path), package = "cytotidyr")

mygatingset <- CytoML::cytobank_to_gatingset(exp_info$gates.path,
  system.file(
  "extdata",
  c("Donor 2 mem post-sort.fcs", "Donor 2 pre-sort.fcs"),
  package = "cytotidyr"
))

#> intact
#> singles
#> purity UR
#> purity UL
#>  LL
#> purity LR
#>  FCS files have the same following channels:
#> FSC-A
#> FSC-W
#> FSC-H
#> SSC-A
#> SSC-W
#> SSC-H
#> FL-A
#> PE-A
#> APC-A
#> Alexa Fluor 700-A
#> Time
#>  Donor 2 mem post-sort.fcs to empty cdf slot...
#> write Donor 2 pre-sort.fcs to empty cdf slot...
#> done!
#> ..done!
#> intact
#> purity LR
#> purity LL
#> purity UL
#> purity UR
#> singles
#> ..done!

Here's an example of a typical workflow for going from an experiment + FCS files to a data.frame

mygatingset_compesated <- compensate(mygatingset, exp_info$compensations$`Identity Comp`)
mygatingset_scaled <- transform(mygatingset_compesated, exp_info$transforms)
mygatingset_tagged <- apply_sampletags(mygatingset_scaled, exp_info)
myflowset_singles <- gs_pop_get_data(mygatingset_tagged, "singles")

mytidydata <- as.data.frame(myflowset_singles, use_longnames = F)
str(mytidydata)
#> 'data.frame':	28735 obs. of  16 variables:
#> $ FSC-A            : num  156339 134925 105803 148191 151899 ...
#> $ FSC-W            : num  84410 80593 76462 83050 82579 ...
#> $ FSC-H            : num  121381 109717 90684 116939 120550 ...
#> $ SSC-A            : num  0.0254 0.024 0.0228 0.0228 0.023 ...
#> $ SSC-W            : num  88618 89930 81403 86246 83903 ...
#> $ SSC-H            : num  3.5 3.28 3.2 3.15 3.21 ...
#> $ FL-A             : num  0.007266 0.015847 0.012436 0.000837 0.020483 ...
#> $ PE-A             : num  5.246 1.537 4.78 5.274 -0.397 ...
#> $ APC-A            : num  1.125 3.114 -0.121 -0.116 3.957 ...
#> $ Alexa Fluor 700-A: num  0.000178 -0.000228 0.003316 0.003869 -0.00046 ...
#> $ Time             : num  0 0 0 0 0.1 ...
#> $ FCS Filename     : chr  "Donor 2 pre-sort.fcs" "Donor 2 pre-sort.fcs" "Donor 2 pre-sort.fcs" "Donor 2 pre-sort.fcs" ...
#> $ Sample.Type      : Factor w/ 2 levels "Post-Sort","Pre-Sort": 2 2 2 2 2 2 2 2 2 2 ...
#> $ Plate            : Factor w/ 1 level "Samples": 1 1 1 1 1 1 1 1 1 1 ...
#> $ FCS.File.Category: Factor w/ 1 level "Experiment Files": 1 1 1 1 1 1 1 1 1 1 ...
#> $ name             : chr  "Donor 2 pre-sort.fcs" "Donor 2 pre-sort.fcs" "Donor 2 pre-sort.fcs" "Donor 2 pre-sort.fcs" ...

License

This project is licensed under the MIT License - see the LICENSE.md file for details

Acknowledgments

Brian Bachmann
Jonathan Irish
P. Brent Ferrell
Sierra Barone
RGLab's set of packages including flowCore, flowWorkspace, and CytoML, on which Cytotidyr is built.

Name		Name	Last commit message	Last commit date
Latest commit History 109 Commits
R		R
inst		inst
man		man
vignettes		vignettes
.Rbuildignore		.Rbuildignore
.gitignore		.gitignore
CytExp_33997_Gates_v3.xml		CytExp_33997_Gates_v3.xml
DESCRIPTION		DESCRIPTION
LICENSE		LICENSE
LICENSE.txt		LICENSE.txt
NAMESPACE		NAMESPACE
README.md		README.md
README.rmd		README.rmd
cytotidyr.Rproj		cytotidyr.Rproj
debugging_gatingfxn.R		debugging_gatingfxn.R

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Licenses found

Repository files navigation

Cytotidyr

Getting Started

Prerequisites

Installation

Getting Started

License

Acknowledgments

About

Licenses found

Releases

Packages

Languages

License

Licenses found

bjreisman/cytotidyr

Folders and files

Latest commit

History

Repository files navigation

Cytotidyr

Getting Started

Prerequisites

Installation

Getting Started

License

Acknowledgments

About

Resources

License

Licenses found

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages