Volume 7, Issue 2 (Suppl)
J Ecosyst Ecography, an open access journal
ISSN:2157-7625
September 18-20, 2017
Page 30
Notes:
conference
series
.com
September 18-20, 2017 Toronto, Canada
Joint Conference
International Conference on
International Conference on
Environmental Microbiology and Microbial Ecology
&
Ecology and Ecosystems
Analyzing 'omics data using compositional data analysis
W
e will demonstrate that the microbiome and transcriptome datasets should be analyzed by a combination of Bayesian
estimation and compositional data approaches to examine the ratios between features giving robust insights into the
structure of high throughput sequencing datasets. Traditional methods of analyzing microbiome or RNA-seq datasets can
be misleading, and not use all the available information. This results in many analyses being dominated by either the most
abundant, or the rarest features. Data collected using high throughput sequencing (HTS) methods are sequence reads mapped
to genomic intervals, and are commonly analyzed as either 'normalized count data’ or 'relative abundance data’. One reason for
these normalizations is to attempt to compensate for the problem that the sequencing instrument imposes an upper bound on
the number of sequence reads. Positive data with an arbitrary bound are 'compositional data' and are subject to the problem of
spurious correlation. Thus ordination, clustering and network analysis become unreliable. A second problem is that the data
are sparse: i.e., contain many 0 values. A third problem is that the largest measurement error is at the low count margins in
these datasets. These issues are all addressed using our approach.
Biography
Greg B Gloor is a professor of biochemistry with broad experience in molecular biology, genetics and genomics. Most recently, he has developed tools to investigate
fundamentals of molecular evolution, microbial ecology and meta-transcriptomics. He is currently working on developing and adapting principled methods to characterize
correlation and differential abundance in sparse, high throughput sequencing data as generated in 16S rRNA gene sequencing surveys, meta-genomics and meta-
transcriptomics. He is the developer and maintainer of the ALDEx2 R package on Bioconductor.
ggloor@uwo.caGregory B Gloor
University of Western Ontario, Canada
Gregory B Gloor, J Ecosyst Ecography 2017, 7:2 (Suppl)
DOI: 10.4172/2157-7625-C1-029