Professor Laurence D Hurst explains why understanding the nucleotide mutations in viruses, including SARS-CoV-2, can have significant implications for vaccine design.
With 61 codons specifying 20 amino acids, some can be encoded by more than one codon and it is often presumed that it does not matter which one a gene uses. When I first studied genetics, some books I read taught that mutations between such alternative codons (eg, GGA->GGC, both giving glycine) were called synonymous mutations, while others referred to them as silent mutations. However, are synonymous mutations really silent meaning they are identical in terms of fitness and function? Although they may specify the same amino acid, does that mean they are all the same?
Figure 1: Intronless GFP transgene expression is higher for variants of GFP with higher GC content at synonymous sites5
Perhaps one of the biggest surprises over recent years has been the discovery that versions of the same gene, differing only at synonymous sites, can not only have different properties, but effects that are not modest.1-5 For example, two versions of green fluorescent protein (GFP) differing only at synonymous sites can have orders of magnitude differences in their expression level.4 We similarly recently discovered that for an intronless transgene to express in human cell lines it needs to be GC rich, which can be achieved by altering the synonymous sites,5 as seen in Figure 1. It is no accident, we suggest, that the well-expressed endogenous intronless genes in humans (such as histones) are all GC rich and that our functional retrogenes tend to be richer in GC content than their parental genes.
The realisation that synonymous sites matter has clear relevance to the design of transgenes or other artificial genes, be these for experiments, gene therapy, protein production (eg, in bacteria) or for vaccine design. In the case of vaccines, we might wish to modulate a viral protein to be effectively expressed in human cells to illicit a strong and robust immune response.6 Conversely to the design of attenuated vaccines, we seek to produce a tuned down version of the virus that can function but is weak.7
The challenge is knowing not just which synonymous sites can be altered but knowing how they should be altered. One approach is mass randomisation try many alternatives and see what works.4,8,9 In principle this is fine, but this approach requires many randomisations, which is still technically difficult for long attenuated viruses. An alternative strategy that we have been exploring is to let nature tell us; we can apply tools and ideas from population genetics to better understand what natural selection favours and disfavours and in turn to estimate the strength of selection.
it will be interesting to see if we can learn a lesson from nature as to how to weaken a virus
Estimation of the strength of selection is possible from knowledge of the site frequency spectrum, (ie, how common variants are) from which we can infer the distribution of fitness effects (DFE). If a site is under strong purifying selection, then mutations may occur in the population but these are rapidly eliminated, so variants are always rare. By contrast, if they are selectively neutral, we expect some variants to be quite common. We recently applied this methodology to show that synonymous mutations in human genes that disrupt exonic splice enhancer motifs are often under strong selection and affect many synonymous sites in our genes.10 This has implications for both diagnostics and for transgene design for gene therapy, as we often remove introns in heterologous genes, so freeing up these residues from their role in specifying exons ceases.11
The same DFE methodology cannot easily be applied to viruses, as the methods assume free recombination (ie, we assume one mutation does not impact the fate of others in the same genome). However, other population genetical tools can still be applied. Recently, we examined SARS-CoV-2 and identified the profile of mutations that we see at four-fold degenerate sites.12 From this profile we could estimate what the synonymous site composition would be, assuming that the only forces are mutational biases and neutral evolution (ie, no selection). We observed that in this genome there is a strikingly strong C->U mutation bias and a G->U one. In the raw data this is not so obvious as G and C are quite rare. However, the mutability of the sites per occurrence of the site reveals the underlying patterns.
Figure 2: The rate of mutational flux from one dinucleotide to another in the coding sequence of SARS-CoV-2. The direction of flux is indicated by the indentation of the connecting links: the inner layer represents flux out while the outermost layer represents flux into the node. The frequency of the flux exchange is represented by the width of any given link where it meets the outer axis. Dinucleotide nodes are coloured according to their GC-content. Hence, it is evident that there is high flux away from GC-rich dinucleotides whereas AU-rich dinucleotides are largely conserved.12
With knowledge of the mutational bias we then asked what the equilibrium frequency of the four nucleotides would be using four simultaneous equations. This is the nucleotide content at which for every mutation changing a particular base there is an equal and opposite one creating the same base somewhere else in the genome, ensuring overall unchanged nucleotide content. Given the strong C->U and G->U mutational biases, it is no surprise that the equilibrium content is very U rich (we estimate equilibrium U content should be about 65 percent). However, while the four-fold sites are indeed U rich, they are not that U rich, being closer to 50 percent. A clue as to why the mutation bias is so skewed to generating U comes from analysis of equilibrium UU content: UU residues are predicted to be very common, with CU residues being particularly mutable generating UU (Figure 2) this is expected due to human APOBEC proteins attacking and mutating/editing the virus.13
One probable explanation for this difference between predicted and observed nucleotide content is selection against U content. There may be many U residues appearing in the population, but many are pushed out of the population owing to purification selection, ie, because of the deleterious effects of the mutations. That such selection is happening in the SARS-CoV-2 genome is also clear from the sequence data. We estimate that for every 10 mutations that appear in the sequence databases, another six are lost because of selection prior to genome sequencing. Indeed, UU content is about a quarter of that predicted (Figure 3).
Figure 3: The predicted (under neutral mutational equilibrium) and observed dinucleotide content of SARS-CoV-2. Note the very high predicted levels of UU given the strong mutational flux to UU residues (see Figure 2) and the net underrepresentation in actual sequence.9
This leaves two problems: why is selection operating on SARS-CoV-2 and what can we do with this information? In some cases, we have a good idea as to why: many mutations to U at codon sites generate stop codons. However, we have observed that U destabilises the transcripts and is associated with lower-reported transcript levels;12 a full explanation of the causes of selection on nucleotide content therefore requires manipulation of the sequences.
The second question, what to do with this information, is perhaps more urgent. It has previously been noted that nucleotide content manipulation is a viable means to attenuate viruses.7 Currently there are three groups investigating this route to make a vaccine for SARS-CoV-2: Indian Immunologicals Ltd/Griffith University, Codagenix/Serum Institute of India and Acbadem Labmed Health Services/Mehmet Ali Aydinlar University. In prior attempts, attention has been paid to CpG levels and UpA levels (which we find to be correlated between SARS genes and between different viruses).12 CpGs attract the attention of zinc antiviral protein (ZAP) and UpA attracts an RNAase L. Not surprisingly, some viruses, including SARS-CoV-2, therefore have low levels of both dinucleotide pairs given the levels of the underlying nucleotides.
The challenge is knowing not just which synonymous sites can be altered but knowing how they should be altered
In the past, attenuation strategies have focused on modulating synonymous sites to increase CpG and UpA, making the virus more visible to antiviral proteins.14 We in turn suggest a general strategy to utilise this method and to increase U content as well.12 Given the evidence that selection on the virus is to reduce U content, while our antiviral proteins are mutating it to increase U content, it will be interesting to see if we can learn a lesson from nature as to how to weaken a virus. This is an unusual circumstance in which we predict that we should build in more of the already most common synonymous site nucleotides (U in this case) to degrade the virus. More generally, it is assumed that the most used codons are those that tend to increase the fitness of the organism. In the face of such a severe mutation bias, however, this simpler logic no longer holds.
Laurence D Hurst is Professor of Evolutionary Genetics and Director of the Milner Centre for Evolution at the University of Bath. He is currently also the President of the Genetics Society. He completed his D.Phil in Oxford, after which he won a research fellowship and then moved to Cambridge University as a Royal Society Research Fellow. While on the fellowship he assumed his current Chair at Bath University. In 2015 he was elected a Fellow of the Academy of Medical Sciences and a Fellow of the Royal Society. He is a recipient of the Genetics Society Medal and the Scientific Medal of the Zoological Society of London.
Related topicsDisease research, DNA, Gene Therapy, Genetic analysis, Genomics, Protein, Proteogenomics, Proteomics, Research & Development, RNAs, Vaccine
More here:
Tweaking synonymous sites for gene therapy and vaccines - Drug Target Review
- Faulty Circuits (preview) [Last Updated On: April 7th, 2010] [Originally Added On: April 7th, 2010]
- Faulty Circuits (preview) [Last Updated On: April 7th, 2010] [Originally Added On: April 7th, 2010]
- Rare flowers and common herbal supplements get unmasked with plant DNA barcoding [Last Updated On: April 20th, 2010] [Originally Added On: April 20th, 2010]
- Rare flowers and common herbal supplements get unmasked with plant DNA barcoding [Last Updated On: April 20th, 2010] [Originally Added On: April 20th, 2010]
- Biomarker Studies Could Realize Goal of More Effective and Personalized Cancer Medicine [Last Updated On: April 27th, 2010] [Originally Added On: April 27th, 2010]
- Biomarker Studies Could Realize Goal of More Effective and Personalized Cancer Medicine [Last Updated On: April 27th, 2010] [Originally Added On: April 27th, 2010]
- Schizophrenia shares genetic links with autism, genome study shows [Last Updated On: May 12th, 2010] [Originally Added On: May 12th, 2010]
- Schizophrenia shares genetic links with autism, genome study shows [Last Updated On: May 12th, 2010] [Originally Added On: May 12th, 2010]
- Alzheimer's: Forestalling the Darkness with New Approaches (preview) [Last Updated On: May 28th, 2010] [Originally Added On: May 28th, 2010]
- Alzheimer's: Forestalling the Darkness with New Approaches (preview) [Last Updated On: May 28th, 2010] [Originally Added On: May 28th, 2010]
- Large-Scale Autism Study Reveals Disorder's Genetic Complexity [Last Updated On: June 12th, 2010] [Originally Added On: June 12th, 2010]
- Large-Scale Autism Study Reveals Disorder's Genetic Complexity [Last Updated On: June 12th, 2010] [Originally Added On: June 12th, 2010]
- Cancer Therapy Goes Viral: Progress Is Made Tackling Tumors with Viruses [Last Updated On: June 24th, 2010] [Originally Added On: June 24th, 2010]
- Cancer Therapy Goes Viral: Progress Is Made Tackling Tumors with Viruses [Last Updated On: June 24th, 2010] [Originally Added On: June 24th, 2010]
- Vaccines Derived from Patients' Tumor Cells Are Individualizing Cancer Treatment [Last Updated On: June 26th, 2010] [Originally Added On: June 26th, 2010]
- Vaccines Derived from Patients' Tumor Cells Are Individualizing Cancer Treatment [Last Updated On: June 26th, 2010] [Originally Added On: June 26th, 2010]
- A genome story: 10th anniversary commentary by Francis Collins [Last Updated On: June 29th, 2010] [Originally Added On: June 29th, 2010]
- A genome story: 10th anniversary commentary by Francis Collins [Last Updated On: June 29th, 2010] [Originally Added On: June 29th, 2010]
- Hair Trigger: How a Cell's Primary Cilium Functions as a Molecular Antenna [Last Updated On: June 30th, 2010] [Originally Added On: June 30th, 2010]
- Hair Trigger: How a Cell's Primary Cilium Functions as a Molecular Antenna [Last Updated On: June 30th, 2010] [Originally Added On: June 30th, 2010]
- DNA Drugs Come of Age (preview) [Last Updated On: July 16th, 2010] [Originally Added On: July 16th, 2010]
- DNA Drugs Come of Age (preview) [Last Updated On: July 16th, 2010] [Originally Added On: July 16th, 2010]
- 2 Genes Linked to Embryonic Brain Impairment in Down's Syndrome [Last Updated On: July 22nd, 2010] [Originally Added On: July 22nd, 2010]
- 2 Genes Linked to Embryonic Brain Impairment in Down's Syndrome [Last Updated On: July 22nd, 2010] [Originally Added On: July 22nd, 2010]
- Stem Cells from Reprogrammed Adult Cells Found to Bring Along Genetic Defects of Their Donors [Last Updated On: October 11th, 2010] [Originally Added On: October 11th, 2010]
- Was Darwin a Punk? A Q&A with Punker-Paleontologist Greg Graffin [Last Updated On: October 11th, 2010] [Originally Added On: October 11th, 2010]
- Parkinsonian Power Failure: Neuron Degeneration May Be Caused by a Cellular Energy System Breakdown [Last Updated On: October 11th, 2010] [Originally Added On: October 11th, 2010]
- Was Darwin a Punk? A Q&A with Punker-Paleontologist Greg Graffin [Last Updated On: October 11th, 2010] [Originally Added On: October 11th, 2010]
- Desperation Drives Parents to Dubious Autism Treatments (preview) [Last Updated On: October 13th, 2010] [Originally Added On: October 13th, 2010]
- Revolution Postponed: Why the Human Genome Project Has Been Disappointing (preview) [Last Updated On: October 26th, 2010] [Originally Added On: October 26th, 2010]
- Controlling the Brain with Light (preview) [Last Updated On: October 26th, 2010] [Originally Added On: October 26th, 2010]
- Optogenetics: Controlling the Brain with Light [Extended Version] [Last Updated On: October 26th, 2010] [Originally Added On: October 26th, 2010]
- Clear New Insights into the Genetics of Depression [Last Updated On: November 7th, 2010] [Originally Added On: November 7th, 2010]
- TEDMED 2010: Technology and the people [Last Updated On: November 7th, 2010] [Originally Added On: November 7th, 2010]
- Bacteria, the anti-cancer soldier [Last Updated On: November 7th, 2010] [Originally Added On: November 7th, 2010]
- Clear New Insights into the Genetics of Depression [Last Updated On: November 7th, 2010] [Originally Added On: November 7th, 2010]
- TEDMED 2010: Technology and the people [Last Updated On: November 7th, 2010] [Originally Added On: November 7th, 2010]
- Bacteria, the anti-cancer soldier [Last Updated On: November 7th, 2010] [Originally Added On: November 7th, 2010]
- Scientific regress: When science goes backward [Last Updated On: November 29th, 2010] [Originally Added On: November 29th, 2010]
- Can You Live Forever? Maybe Not--But You Can Have Fun Trying [Last Updated On: December 26th, 2010] [Originally Added On: December 26th, 2010]
- How to Fix the Obesity Crisis (preview) [Last Updated On: February 14th, 2011] [Originally Added On: February 14th, 2011]
- Personalizing cancer medicine [Last Updated On: February 14th, 2011] [Originally Added On: February 14th, 2011]
- New Salmonella strain delivers gene-based therapy to fight virus in mice [Last Updated On: February 14th, 2011] [Originally Added On: February 14th, 2011]
- How to Fix the Obesity Crisis (preview) [Last Updated On: February 14th, 2011] [Originally Added On: February 14th, 2011]
- Personalizing cancer medicine [Last Updated On: February 14th, 2011] [Originally Added On: February 14th, 2011]
- New Salmonella strain delivers gene-based therapy to fight virus in mice [Last Updated On: February 14th, 2011] [Originally Added On: February 14th, 2011]
- Steps toward a Bionic Eye [Last Updated On: February 20th, 2011] [Originally Added On: February 20th, 2011]
- Steps toward a Bionic Eye [Last Updated On: February 20th, 2011] [Originally Added On: February 20th, 2011]
- Giving HIV a Poor Reception: New AIDS Treatment Tinkers with Immune Cell Genes [Last Updated On: March 6th, 2011] [Originally Added On: March 6th, 2011]
- Giving HIV a Poor Reception: New AIDS Treatment Tinkers with Immune Cell Genes [Last Updated On: March 6th, 2011] [Originally Added On: March 6th, 2011]
- Smaller, cheaper, faster: Does Moore's law apply to solar cells? [Last Updated On: March 27th, 2011] [Originally Added On: March 27th, 2011]
- Smaller, cheaper, faster: Does Moore's law apply to solar cells? [Last Updated On: March 27th, 2011] [Originally Added On: March 27th, 2011]
- New Drugs for Hepatitis C on the Horizon [Last Updated On: April 10th, 2011] [Originally Added On: April 10th, 2011]
- Can we capture all of the world's carbon emissions? [Last Updated On: April 10th, 2011] [Originally Added On: April 10th, 2011]
- New Drugs for Hepatitis C on the Horizon [Last Updated On: April 10th, 2011] [Originally Added On: April 10th, 2011]
- Can we capture all of the world's carbon emissions? [Last Updated On: April 10th, 2011] [Originally Added On: April 10th, 2011]
- Drug-resistant genes found in cholera and dysentery strains in New Delhi water supply [Last Updated On: May 1st, 2011] [Originally Added On: May 1st, 2011]
- Fast Track to Vaccines: How Systems Biology Speeds Drug Development (preview) [Last Updated On: May 1st, 2011] [Originally Added On: May 1st, 2011]
- Drug-resistant genes found in cholera and dysentery strains in New Delhi water supply [Last Updated On: May 1st, 2011] [Originally Added On: May 1st, 2011]
- Fast Track to Vaccines: How Systems Biology Speeds Drug Development (preview) [Last Updated On: May 1st, 2011] [Originally Added On: May 1st, 2011]
- Autism's Tangled Genetics Full of Rare and Varied Mutations [Last Updated On: June 19th, 2011] [Originally Added On: June 19th, 2011]
- A New Look at Obsessive-Compulsive Disorder (preview) [Last Updated On: June 19th, 2011] [Originally Added On: June 19th, 2011]
- Autism's Tangled Genetics Full of Rare and Varied Mutations [Last Updated On: June 19th, 2011] [Originally Added On: June 19th, 2011]
- A New Look at Obsessive-Compulsive Disorder (preview) [Last Updated On: June 19th, 2011] [Originally Added On: June 19th, 2011]
- Close Encounters of Science and Medicine [Last Updated On: July 3rd, 2011] [Originally Added On: July 3rd, 2011]
- Close Encounters of Science and Medicine [Last Updated On: July 3rd, 2011] [Originally Added On: July 3rd, 2011]
- New Report Details Uphill Battle to Solve the U.S.'s Pain Problem [Last Updated On: July 24th, 2011] [Originally Added On: July 24th, 2011]
- New Report Details Uphill Battle to Solve the U.S.'s Pain Problem [Last Updated On: July 24th, 2011] [Originally Added On: July 24th, 2011]
- A Breath of Fresh Air: New Hope for Cystic Fibrosis Treatment (preview) [Last Updated On: August 7th, 2011] [Originally Added On: August 7th, 2011]
- A Breath of Fresh Air: New Hope for Cystic Fibrosis Treatment (preview) [Last Updated On: August 7th, 2011] [Originally Added On: August 7th, 2011]
- Sickle Cell Anemia: Stem Cell Gene Therapy - Donald Kohn [Last Updated On: August 18th, 2011] [Originally Added On: August 18th, 2011]
- Sickle Cell Anemia: Stem Cell Gene Therapy - A Patient's Perspective [Last Updated On: October 8th, 2011] [Originally Added On: October 8th, 2011]
- Gene therapy improves stem cell transplantation - Video [Last Updated On: October 14th, 2011] [Originally Added On: October 14th, 2011]
- THE NEW MORGELLONS HAIR - Video [Last Updated On: October 14th, 2011] [Originally Added On: October 14th, 2011]
- Studying Mental Illness in a Dish [Last Updated On: November 13th, 2011] [Originally Added On: November 13th, 2011]
- The Puzzle of Pancreatic Cancer: How Steve Jobs Did Not Beat the Oddsbut Nobel Winner Ralph Steinman Did [Last Updated On: November 13th, 2011] [Originally Added On: November 13th, 2011]
- Did Alternative Medicine Extend or Abbreviate Steve Jobs's Life? [Last Updated On: November 13th, 2011] [Originally Added On: November 13th, 2011]
- Calendar: MIND Events in November and December [Last Updated On: November 13th, 2011] [Originally Added On: November 13th, 2011]
- Studying Mental Illness in a Dish [Last Updated On: November 13th, 2011] [Originally Added On: November 13th, 2011]
- The Puzzle of Pancreatic Cancer: How Steve Jobs Did Not Beat the Odds?but Nobel Winner Ralph Steinman Did [Last Updated On: November 13th, 2011] [Originally Added On: November 13th, 2011]