Logo image
Serine substitutions are linked to codon usage and differ for variable and conserved protein regions
Journal article   Open access   Peer reviewed

Serine substitutions are linked to codon usage and differ for variable and conserved protein regions

Gregory W Schwartz, Tair Shauli, Michal Linial and Uri Hershberg
Scientific reports, v 9(1), pp 17238-9
21 Nov 2019
PMID: 31754132
url
https://doi.org/10.1038/s41598-019-53452-3View
Published, Version of Record (VoR)CC BY V4.0 Open

Abstract

Amino Acid Substitution - genetics Amino Acids - genetics Codon - genetics Codon Usage - genetics Evolution, Molecular Exome - genetics Humans Phosphorylation - genetics Protein Domains - genetics Serine - genetics
Serine is the only amino acid that is encoded by two disjoint codon sets (TCN & AGY) so that a tandem substitution of two nucleotides is required to switch between the two sets. We show that these codon sets underlie distinct substitution patterns at positions subject to purifying and diversifying selections. We found that in humans, positions that are conserved among ~100 vertebrates, and thus subjected to purifying selection, are enriched for substitutions involving serine (TCN, denoted S'), proline, and alanine, (S'PA). In contrast, the less conserved positions are enriched for serine encoded with AGY codons (denoted S″), glycine and asparagine, (GS″N). We tested this phenomenon in the HIV envelope glycoprotein (gp120), and the V-gene that encodes B-cell receptors/antibodies. These fast evolving proteins both have hypervariable positions, which are under diversifying selection, closely adjacent to highly conserved structural regions. In both instances, we identified an opposite abundance of two groups of serine substitutions, with enrichment of S'PA in the conserved positions, and GS″N in the hypervariable regions. Finally, we analyzed the substitutions across 60,000 individual human exomes to show that, when serine has a specific functional constraint of phosphorylation capability, S' codons are 32-folds less prone than S″ to substitutions to Threonine or Tyrosine that could potentially retain the phosphorylation site capacity. Combined, our results, that cover evolutionary signals at different temporal scales, demonstrate that through its encoding by two codon sets, serine allows for the existence of alternating substitution patterns within positions of functional maintenance versus sites of rapid diversification.

Metrics

4 Record Views
8 citations in Scopus

Details

UN Sustainable Development Goals (SDGs)

This publication has contributed to the advancement of the following goals:

#3 Good Health and Well-Being

InCites Highlights

Data related to this publication, from InCites Benchmarking & Analytics tool:

Collaboration types
Domestic collaboration
International collaboration
Web of Science research areas
Biochemistry & Molecular Biology
Logo image