You are here: University of Vienna PHAIDRA Detail o:1248465
Title (eng)
Maximum likelihood estimation of Gene family specific duplication and deletion rates
Parallel title (deu)
Maximum Likelihood Schätzung von Genduplikations- und Gendeletionsraten für einzelne Genfamilien
Author
Andrea Führer
Adviser
Arndt von Haeseler
Assessor
Reinhard Bürger
Assessor
Volkmar Liebscher
Abstract (deu)
Genes can be divided into gene families, which are defined by homology, and let presume that the genes evolved from a common ancestral gene. In this content the size of a gene family is subject to evolutionary change. We model gene duplications and deletions using a birth-and-death process. Based on the number of gene copies in a set of extant species, we apply a maximum likelihood approach to infer the birth-rate and the death-rate due to duplications and deletions respectively. Furthermore we determine the number of gene copies of the most recent common ancestor. To validate this strategy, we performed simulation studies. Assuming a fixed number of gene copies for the ancestor and specific rates for duplication and deletion of genes, we simulated the evolution of the number of gene copies along a phylogenetic tree. Using our maximum likelihood framework, we subsequently estimated the rates and the ancestral number used for the simulation. A collection of different simulation studies showed that the maximum likelihood approach infers the given parameter quite good. We further applied our method to biological gene family data from vertebrates of the Inparanoid and the Ensembl databases. Compared to previous reported rates our estimates are about one magnitude lower. The data was also considered with regards to model violations, since it is assumed that e.g. large-scale duplication, like whole genome duplications, have occurred during evolution. Hence, extensions of our method and future work are discussed.
Abstract (eng)
Genes can be divided into gene families, which are defined by homology, and let presume that the genes evolved from a common ancestral gene. In this content the size of a gene family is subject to evolutionary change. We model gene duplications and deletions using a birth-and-death process. Based on the number of gene copies in a set of extant species, we apply a maximum likelihood approach to infer the birth-rate and the death-rate due to duplications and deletions respectively. Furthermore we determine the number of gene copies of the most recent common ancestor. To validate this strategy, we performed simulation studies. Assuming a fixed number of gene copies for the ancestor and specific rates for duplication and deletion of genes, we simulated the evolution of the number of gene copies along a phylogenetic tree. Using our maximum likelihood framework, we subsequently estimated the rates and the ancestral number used for the simulation. A collection of different simulation studies showed that the maximum likelihood approach infers the given parameter quite good. We further applied our method to biological gene family data from vertebrates of the Inparanoid and the Ensembl databases. Compared to previous reported rates our estimates are about one magnitude lower. The data was also considered with regards to model violations, since it is assumed that e.g. large-scale duplication, like whole genome duplications, have occurred during evolution. Hence, extensions of our method and future work are discussed.
Keywords (eng)
genome evolutiongene duplicationgene deletionmaximum likelihoodbirth and death processhigh accuracy computation
Keywords (deu)
GenomevolutionGenduplikationGendeletionMaximum LikelihoodGeburts- und Todes-Prozesse
Subject (deu)
Subject (deu)
Type (deu)
Persistent identifier
https://phaidra.univie.ac.at/o:1248465
rdau:P60550 (deu)
160 S.
Number of pages
160
Members (1)
Title (eng)
Maximum likelihood estimation of Gene family specific duplication and deletion rates
Parallel title (deu)
Maximum Likelihood Schätzung von Genduplikations- und Gendeletionsraten für einzelne Genfamilien
Author
Andrea Führer
Abstract (deu)
Genes can be divided into gene families, which are defined by homology, and let presume that the genes evolved from a common ancestral gene. In this content the size of a gene family is subject to evolutionary change. We model gene duplications and deletions using a birth-and-death process. Based on the number of gene copies in a set of extant species, we apply a maximum likelihood approach to infer the birth-rate and the death-rate due to duplications and deletions respectively. Furthermore we determine the number of gene copies of the most recent common ancestor. To validate this strategy, we performed simulation studies. Assuming a fixed number of gene copies for the ancestor and specific rates for duplication and deletion of genes, we simulated the evolution of the number of gene copies along a phylogenetic tree. Using our maximum likelihood framework, we subsequently estimated the rates and the ancestral number used for the simulation. A collection of different simulation studies showed that the maximum likelihood approach infers the given parameter quite good. We further applied our method to biological gene family data from vertebrates of the Inparanoid and the Ensembl databases. Compared to previous reported rates our estimates are about one magnitude lower. The data was also considered with regards to model violations, since it is assumed that e.g. large-scale duplication, like whole genome duplications, have occurred during evolution. Hence, extensions of our method and future work are discussed.
Abstract (eng)
Genes can be divided into gene families, which are defined by homology, and let presume that the genes evolved from a common ancestral gene. In this content the size of a gene family is subject to evolutionary change. We model gene duplications and deletions using a birth-and-death process. Based on the number of gene copies in a set of extant species, we apply a maximum likelihood approach to infer the birth-rate and the death-rate due to duplications and deletions respectively. Furthermore we determine the number of gene copies of the most recent common ancestor. To validate this strategy, we performed simulation studies. Assuming a fixed number of gene copies for the ancestor and specific rates for duplication and deletion of genes, we simulated the evolution of the number of gene copies along a phylogenetic tree. Using our maximum likelihood framework, we subsequently estimated the rates and the ancestral number used for the simulation. A collection of different simulation studies showed that the maximum likelihood approach infers the given parameter quite good. We further applied our method to biological gene family data from vertebrates of the Inparanoid and the Ensembl databases. Compared to previous reported rates our estimates are about one magnitude lower. The data was also considered with regards to model violations, since it is assumed that e.g. large-scale duplication, like whole genome duplications, have occurred during evolution. Hence, extensions of our method and future work are discussed.
Keywords (eng)
genome evolutiongene duplicationgene deletionmaximum likelihoodbirth and death processhigh accuracy computation
Keywords (deu)
GenomevolutionGenduplikationGendeletionMaximum LikelihoodGeburts- und Todes-Prozesse
Subject (deu)
Subject (deu)
Type (deu)
Persistent identifier
https://phaidra.univie.ac.at/o:1248466
Number of pages
160