TY - JOUR
T1 - Profiles of low complexity regions in Apicomplexa Genome evolution and evolutionary systems biology
AU - Battistuzzi, Fabia U.
AU - Schneider, Kristan A.
AU - Spencer, Matthew K.
AU - Fisher, David
AU - Chaudhry, Sophia
AU - Escalante, Ananias A.
N1 - Funding Information:
We thank two anonymous reviewers for their comments and suggestions and Luke Odisho for technical support. The work was supported by funds from Oakland University (FUB) and the grant R01GM080586 from NIH (AAE).
Publisher Copyright:
© 2016 Battistuzzi et al.
PY - 2016/2/29
Y1 - 2016/2/29
N2 - Background: Low complexity regions (LCRs) are a ubiquitous feature in genomes and yet their evolutionary history and functional roles are unclear. Previous studies have shown contrasting evidence in favor of both neutral and selective mechanisms of evolution for different sets of LCRs suggesting that modes of identification of these regions may play a role in our ability to discern their evolutionary history. To further investigate this issue, we used a multiple threshold approach to identify species-specific profiles of proteome complexity and, by comparing properties of these sets, determine the influence that starting parameters have on evolutionary inferences. Results: We find that, although qualitatively similar, quantitatively each species has a unique LCR profile which represents the frequency of these regions within each genome. Inferences based on these profiles are more accurate in comparative analyses of genome complexity as they allow to determine the relative complexity of multiple genomes as well as the type of repetitiveness that is most common in each. Based on the multiple threshold LCR sets obtained, we identified predominant evolutionary mechanisms at different complexity levels, which show neutral mechanisms acting on highly repetitive LCRs (e.g., homopolymers) and selective forces becoming more important as heterogeneity of the LCRs increases. Conclusions: Our results show how inferences based on LCRs are influenced by the parameters used to identify these regions. Sets of LCRs are heterogeneous aggregates of regions that include homo-and heteropolymers and, as such, evolve according to different mechanisms. LCR profiles provide a new way to investigate genome complexity across species and to determine the driving mechanism of their evolution.
AB - Background: Low complexity regions (LCRs) are a ubiquitous feature in genomes and yet their evolutionary history and functional roles are unclear. Previous studies have shown contrasting evidence in favor of both neutral and selective mechanisms of evolution for different sets of LCRs suggesting that modes of identification of these regions may play a role in our ability to discern their evolutionary history. To further investigate this issue, we used a multiple threshold approach to identify species-specific profiles of proteome complexity and, by comparing properties of these sets, determine the influence that starting parameters have on evolutionary inferences. Results: We find that, although qualitatively similar, quantitatively each species has a unique LCR profile which represents the frequency of these regions within each genome. Inferences based on these profiles are more accurate in comparative analyses of genome complexity as they allow to determine the relative complexity of multiple genomes as well as the type of repetitiveness that is most common in each. Based on the multiple threshold LCR sets obtained, we identified predominant evolutionary mechanisms at different complexity levels, which show neutral mechanisms acting on highly repetitive LCRs (e.g., homopolymers) and selective forces becoming more important as heterogeneity of the LCRs increases. Conclusions: Our results show how inferences based on LCRs are influenced by the parameters used to identify these regions. Sets of LCRs are heterogeneous aggregates of regions that include homo-and heteropolymers and, as such, evolve according to different mechanisms. LCR profiles provide a new way to investigate genome complexity across species and to determine the driving mechanism of their evolution.
KW - Apicomplexa
KW - Complexity threshold
KW - Composition bias
KW - Homopolymers
KW - Low complexity regions
KW - Plasmodium falciparum
KW - Repetitive regions
UR - http://www.scopus.com/inward/record.url?scp=84959328004&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84959328004&partnerID=8YFLogxK
U2 - 10.1186/s12862-016-0625-0
DO - 10.1186/s12862-016-0625-0
M3 - Article
AN - SCOPUS:84959328004
SN - 1472-6785
VL - 16
JO - BMC Evolutionary Biology
JF - BMC Evolutionary Biology
IS - 1
M1 - 47
ER -