Abstract
In this study, I take a new approach to modeling the evolutionary constraint of protein sequence, introducing the stabilizing selection of protein function into the nearly-neutral theory. In other words, protein function under stabilizing selection generates the evolutionary conservation at the sequence level. With the help of random mutational effects of nucleotides on protein function, I have derived the distribution of selection coefficient among sites, called the S-distribution whose parameters have clear biological interpretations. Moreover, I have studied the inverse relationship between the evolutionary rate and the effective population size, showing that the number of molecular phenotypes of protein function, i.e., independent components in the fitness of the organism, may play a key role for the molecular clock under the nearly-neutral theory. These results are helpful for having a better understanding of the underlying evolutionary mechanism of protein sequences, as well as human disease-related mutations.
References
Chao L, Carr DE (1993) The molecular clock and relationship between population size and generation time. Evolution 47:688–690
Gu X, Fu YX, Li WH (1995) Maximum likelihood estimation of the heterogeneity of substitution rate among nucleotide sites. Mol Biol Evol 12:546–557
Gu X, Zhang J (1997) A simple method for estimating the parameter of substitution rate variation among sites. Mol Biol Evol 14:1106–1113
Gu X (1999) Statistical methods for testing functional divergence after gene duplication. Mol Biol Evol 16:1664-1674
Hartl DL, Taubes CH (1996) Compensatory nearly neutral mutations: selection without adaptation. J Theor Biol 182:303–309
Imhof M, Schlotterer C (2001) Fitness effects of advantageous mutations in evolving Escherichia coli populations. PNAS 98: 1113–1117
Keightley PD (1994) The distribution of mutation effects of viability in Drosophila melanogaster. Genetics 138:1315–1322
Keightley PD, Caballero A (1997)Genomic mutation rates for lifetime reproductive output and lifespan in Caenorhabditis elegans. PNAS 94:3823–3827
Kimura M (1979) Model of effectively neutral mutations in which selective constraint is incorporated. PNAS 75:1934–1937
Kimura M (1983) The neutral theory of molecular evolution. Cambridge University Press, Cambridge
Kosakovsky Pond SL, Frost DW (2005) A simple hierarchical approach to modeling distributions of substitution rates. Mol Biol Evol 22:223–234
Nielsen R, Yang Z (1998) Likelihood models for detecting positively selected amino acid sites and applications to the HIV-1 envelope gene. Genetics 148:929–936
Nielsen R, Yang Z (2003) Estimating the distribution of selection coefficients from phylogenetic data with applications to mitochondrial and viral DNA. Mol Biol Evol 20:1231–1239
Ohta T (1973) Slightly deleterious mutant substitutions in evolution. Nature 246:96–98
Ohta T (1977) Extension to the neutral mutation random drift hypothesis. In: Kimura M (ed), Molecular evolution and polymorphysim. National Institute of Genetics, Mishima Japan, pp 148–176
Piganeau G, Eyre-Walker A (2003) Estimating the distribution of fitness effects from DNA sequence data: implications for the molecular clock. PNAS 100:10335–10340
Poon A, Otto SP (2000) Compensating for our load of mutants: freezing the meltdown of small populations. Evolution 54: 1467–1479
Sella G, Hirsh AE (2005) The application of statistical physics to evolutionary biology. PNAS 102: 9541–9546
Uzzel T, Corbin KW (1971) Fitting discrete probability distribution to evolutionary events. Science 172:1089–1096
Wagner GP (1989) Multivariate mutation-selection balance with constrained pleiotropic effects. Genetics 122:223-234
Waxman D, Peck JR (1998) Pleiotropy and the preservation of perfection. Science 279:1210–1213
Yang Z (1993) Maximum-likelihood estimation of phylogeny from DNA sequences when substitution rates differ over sites. Mol Biol Evol 10:1396–1401
Yang Z, Nielsen R, Goldman N, Pedersen AMK (2000) Codon-substitution models for variable selection pressure at amino acid sites. Genetics 155:431–449
Yampolsky LY, Kondrashov FA, Kondrashov AS (2005) Distribution of the strength of selection against amino acid replacements in human proteins. Hum Mol Genet 14:3191–3201
Acknowledgements
This study was supported by the NIH grants. The authors is grateful to Adam Eyre-Walker for constructive comments.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Gu, X. Stabilizing selection of protein function and distribution of selection coefficient among sites. Genetica 130, 93–97 (2007). https://doi.org/10.1007/s10709-006-0022-5
Received:
Accepted:
Published:
Issue date:
DOI: https://doi.org/10.1007/s10709-006-0022-5