Clustering Metagenomes using Expectation Maximization Algorithm

Show simple item record

dc.contributor.author Rifat, Jubair Ibn Malik
dc.contributor.author Ahmed, Istiaque
dc.date.accessioned 2021-01-22T09:36:55Z
dc.date.available 2021-01-22T09:36:55Z
dc.date.issued 2015-11-15
dc.identifier.citation [1] David Koslicki1,*, Simon Foucart2 and Gail Rosen3 1Mathematical Biosciences Institute, The Ohio State University, Columbus, OH 43201, USA and 2Department of Mathematics and 3Department of Electrical and Computer Engineering, Drexel University, Philadelphia, PA 19104, USA, Advance Access publication June 20, 2013. [2] Genivaldo Gueiros Z. Silva, Daniel A. Cuevas, Bas E. Dutilh and Robert A. Edwards,Computational Science Research Center, San Diego State University, San Diego, CA, USA;Department of Computer Science, San Diego State University, San Diego, CA, USA, Accepted 21 May 2014 ,Published 5 June 2014 [3] Turnbaugh,P.J., Hamady,M., Yatsunenko,T., Cantarel,B.L.,Duncan,A., Ley,R.E., Sogin,M.L., Jones,W.J., Roe,B.A., Affourtit,J.P. et al. (2009) A core gut microbiome in obese and lean twins. Nature, 457, 480–484. [4] N. Diaz, L. Krause, A. Goesmann, and et al., \TACOA - Taxonomic classification of environmental genomic fragments using a kernelized nearest neighbor approach," BMC Bioinformatics, vol. 10, no. 1, pp. 56+, 2009. [5] S. D. Bentley and J. Parkhill, \Comparative genomic structure of prokaryotes," Annual Review of Genetics,vol. 38, pp. 771791, December 2004. [6] Y.-W. Wu and Y. Ye, \A novel abundance-based algorithm for binning metagenomic sequences using l-tuples,"in Proceedings of the 14th annual international conference RECOMB'10, pp. 535{549, Springer, 2010. [7] D. L. Wheeler, T. Barrett, D. A. Benson, and et al., \Database resources of the National Center for Biotechnology Information.," Nucleic Acids Research, vol. 35, January 2007. [8] D. A. Benson, I. Karsch-Mizrachi, D. J. Lipman, and et al., \GenBank.," Nucleic acids research, vol. 37, pp. D26 31, January 2009. 35 | P a g e [9] Qichao Tu1, Zhili He1,* and Jizhong Zhou1,2,3,* 1Department of Microbiology and Plant Biology, Institute for Environmental Genomics, University of Oklahoma, Norman, OK 73072, USA, 2Earth Science Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA and 3State Key Joint Laboratory of Environmental Simulation and Pollution Control, School of Environment, Tsinghua University, Beijing 100084, China. Published online 12 February 2014 [10] Ley,R.E. (2010) Obesity and the human microbiome. Curr. Opin.Gastroenterol., 26, 5–11 . [11] Larsen,N., Vogensen,F.K., van den Berg,F.W.J., Nielsen,D.S.,Andreasen,A.S., Pedersen,B.K., Al-Soud,W.A., Sørensen,S.J.,Hansen,L.H. and Jakobsen,M. (2010) Gut microbiota in human adults with type 2 diabetes differs from non-diabetic adults. PLoS One, 5, e9085. [12] Qin,J., Li,Y., Cai,Z., Li,S., Zhu,J., Zhang,F., Liang,S., Zhang,W.,Guan,Y., Shen,D. et al. (2012) A metagenome-wide association study of gut microbiota in type 2 diabetes. Nature,490, 55–60. [13] M. Wendl and R. Waterston, \Generalized gap model for bacterial articial chromosome clone ngerprint mapping and shotgun sequencing," Genome Res, vol. 12, no. 1, p. 19431949, 2002. [14] X. Li and M. S. Waterman, \Estimating the Repeat Structure and Length of DNA Sequences Using l-Tuples,"Genome Research, vol. 13, pp. 1916{1922, August 2003. [15] Karlsson,F.H., Tremaroli,V., Nookaew,I., Bergstrom,G.,Behre,C.J., Fagerberg,B., Nielsen,J. and Backhed,F. (2013) Gut metagenome in European women with normal, impaired and diabetic glucose control. Nature, 498, 99–103. [16] Kau,A.L., Ahern,P.P., Griffin,N.W., Goodman,A.L. and Gordon,J.I. (2011) Human nutrition, the gut microbiome and the immune system. Nature, 474, 327–336. [17] Schwabe,R.F. and Jobin,C. (2013) The microbiome and cancer. Nat. Rev. Cancer, 13, 800–812. en_US
dc.identifier.uri http://hdl.handle.net/123456789/792
dc.description Supervised by Prof. Dr. M.A Mottalib, Head, Department of Computer Science and Engineering, Islamic University of Technology (IUT), Co-Supervisor: M. Arifur Rahman, Lecturer, Department of Computer Science and Engineering en_US
dc.description.abstract Clustering metagenome refers to group genes with similar expression patterns of a metagenomic data set into clusters with the hope that these clusters correspond to groups of functionally related genes. It allows access to uncultivated microbial populations that may have important roles in natural and engineered ecosystems. Proper clustering of Metgenome sequence is a very essential step in recovering genomes and understanding microbial functions. We took the distance matrix from the expression matrix of a metagenomic sequence and used Expectation Maximization (EM) algorithm for clustering the metagenome. After clustering we label the clusters with proper name, we match the cluster nucleotides with reference genome of bacteria in HMPDAC and name the clusters with the bacteria title given in database. Finally for healthy/ patient sample we will show the percentage of bacteria and infer that since this bacteria is higher it might be causing the problem. en_US
dc.language.iso en en_US
dc.title Clustering Metagenomes using Expectation Maximization Algorithm en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search IUT Repository


Advanced Search

Browse

My Account

Statistics