TR3.5

SNP Final Project

Huxley 2021-09-27 Mon 12:00

#ret #ref


1 SNP Project Write-up

%% Resources: KBxSNPPCR Instructions

1.0.1 Init Planning

  1. Outline
    • basics
    • function and regulation
    • SNP effect
  2. Writing!

1.0.2 Part One! org

The COMT gene, or catechol-O-methyltransferase, encodes the COMT enzyme which is responsible for breaking down neurotransmitters the brain's prefrontal cortex. More specifically, it acts as a catalyst for the transfer of a methyl group from S-adenosylmethionine to dopamine, epinephrine, and norepinephrine. This process, called O-methylation, leads to the degradation of the aforementioned neurotransmitters. The COMT enzyme also effects the metabolism of exogenous substances, but that is irrelevant for the mutation at hand citation. The COMT gene itself is 27.22kb long and located on chromosome 22q11.2 citation. It has ubiquitous expression in 27 tissues, including the placenta, the adrenal, and the lung citation. Val158Met, also known as rs4680, is a common missense mutation swapping a guanine for an adenine. It has the frequency G=0.510915, and thus, A=0.489085. Val158Met causes the COMT enzyme to be roughly 25% as effective compared to the wild type. Expression levels in mRNA, despite its reduced protein abundance, are not effected by Val158Met citation & citation. Thus, Val158Met must be located in a protein-coding region causing the COMT enzyme to have lower protein integrity, explaining the discrepancy between mRNA expression and protein expression. This lower protein integrity is most likely manifested as diminished thermostability of the enzyme citation. The higher level effect of this reduced enzyme efficacy is greatly debated, and linked to many different phenotypes. At a broad level, lower COMT activity leads to higher levels of catecholamines in the prefrontal cortex. The actual effect of these increased levels are not well understood. One proposed theory is the Warrior versus Worrier hypothesis, which outlines two groups of personality traits based upon the Val158Met mutation citation. The "Warrior" group, defined as the wild type group with lower levels of catecholamines like dopamine, are said to have an advantage in processing aversive stimuli. They are also said to have higher pain tolerance, be less prone to stress, less exploratory, and ect. However, many of these claims are not well defined and bordering on pseudoscience citation. The "Worrier" group, those with the mutation, are said to have an advantage in memory and attention tasks citation. The Val158Met mutation has also been linked to schizophrenia, but this claim is debated citation.


  1. Citations:

1.0.3 Part One!

%% ### Feedback and revisions

This is looking good so far. It is based in solid research and you're clear about what's still unknown or uncertain. I'm noticing that you don't discuss any known regulators of COMT expression (transcription factors, environmental signals, etc). Try to work that in. If there are none reported, you can mention that in the write-up. I also think you can add a bit more clarity in the portion where you're discussing the integrity/stability of the mutant protein. You've said it leads to lower abundance. Is the decreased enzyme activity related direction to enzyme function or based on lower total abundance?

%% The COMT gene, or catechol-O-methyltransferase, encodes the COMT enzyme which is responsible for breaking down neurotransmitters the brain's prefrontal cortex. More specifically, it acts as a catalyst for the transfer of a methyl group from S-adenosylmethionine to dopamine, epinephrine, and norepinephrine. This process, called O-methylation, leads to the degradation of the aforementioned neurotransmitters. The COMT enzyme also effects the metabolism of exogenous substances, but that is irrelevant for the mutation at hand citation. The COMT gene itself is 27.22kb long and located on chromosome 22q11.2 citation. It has ubiquitous expression in 27 tissues, including the placenta, the adrenal, and the lung citation. This expression is dynamically regulated during brain development and due to environmental stimuli. Despite much research into COMT regulation, the actual processes are still mostly unknown citation. COMT has two promoters — P2 functions constitutively, whereas P1 has tissue dependent regulation. This tissue specific regulation is most likely done by C/EBPalpha. citation. Val158Met, also known as rs4680, is a common missense mutation swapping a guanine for an adenine. It has the frequency G=0.510915, and thus, A=0.489085. Val158Met causes the COMT enzyme to be roughly 25% as effective compared to the wild type. Expression levels in mRNA, despite its reduced protein abundance, are not effected by Val158Met citation & citation. Thus, Val158Met must be located in a protein-coding region causing the COMT enzyme to have lower protein integrity, explaining the discrepancy between mRNA expression and protein expression. This lower protein integrity is most likely manifested as diminished thermostability of the enzyme, in turn leading to its reduced effectiveness citation. The higher level effect of this reduced enzyme efficacy is greatly debated, and linked to many different phenotypes. At a broad level, lower COMT activity leads to higher levels of catecholamines in the prefrontal cortex. The actual effect of these increased levels are not well understood. One proposed theory is the Warrior versus Worrier hypothesis, which outlines two groups of personality traits based upon the Val158Met mutation citation. The "Warrior" group, defined as the wild type group with lower levels of catecholamines like dopamine, are said to have an advantage in processing aversive stimuli. They are also said to have higher pain tolerance, be less prone to stress, less exploratory, and ect. However, many of these claims are not well defined and bordering on pseudoscience citation. The "Worrier" group, those with the mutation, are said to have an advantage in memory and attention tasks citation. The Val158Met mutation has also been linked to schizophrenia, but this claim is debated citation.

  1. Citations:

1.0.4 Part Two

%% infographic time.

  • *Create an infographic that diagrams the various connections between the gene/SNP genotypic variants and the known phenotypic associations. This graphic should visually show the biological effects of the gene/protein and studied genotypes on human traits/phenotypes. It is also important to highlight the ways in which the environment affects gene expression, protein function and/or phenotype (see the example infographic for APOE below, which makes connections to diet and traumatic brain injury). You can choose to visually organize and/or lay out your graphic in a variety of formats, but make sure that the following information is included:*
  • info to include
    • gene basics
      • Gene info (gene size, location of SNP, SNP variants, SNP frequency)
      • Protein info (protein size, protein variants if known)
      • reg
        • estrogen?
        • brain develpoment
        • eviromental
        • P2:
          • constitutively
        • P1:
          • tissue dependent, C/EBPalpha, ect.
    • SNP variants to human phenotype relationships
      • reduced COMT enzyme activity
      • warrior worrier stuff
    • Gene-environment Interactions

%% Submitted in a separate file. You can also check it out linked here. %%Letter - 1 (1).jpg%% %%COMTVal158Metpt2it1.pdf%%


1.0.5 Part Three

%% #### outlinin: prompt: In this section, you should try to provide some evolutionary insight on the SNP alleles that we see for your gene in the human population. You may focus on one particular allele if you see that it is better-researched. Using actual research into the evolution of the allele(s) and/or research about gene and gene variant functions, explain why/how the SNP allele(s) you studied were maintained in the human population (in over 1% of people studied). At some point the allele first appeared as a mutation but it subsequently spread into a relatively large proportion of people; why might that have been? - Although this section should be based on gene/SNP research and your understanding of evolution, your explanations may be somewhat speculative due to the difficulty in obtaining evidence for certain evolutionary predictions/hypotheses (because environmental pressures, migrations, and random events that influenced evolution likely happened long ago). That is okay; just show us that you're using research-backed reasoning about your allele(s) and that you have an understanding of evolutionary mechanisms. - In terms of evolutionary mechanisms, you should be thinking about possible selective pressures that may have maintained certain alleles in the human population (and disfavored others). Note that evolution typically operates over long timescales, so the selective pressures that are most likely to have played a role were operating before civilizations, agriculture, etc (with some exceptions that may have evolved more "recently", like lactase persistence, high-altitude adaptation, and disease resistances). - It's also important to remember the possible contributions of mechanisms like gene flow and genetic drift in getting to the allele frequencies that we see today. It would be harder to detect whether/when these happened without complex analyses of sequences, but you can still acknowledge their possible impact and explain how these mechanisms operate.

allele: g -> a

selection factors:

population makes it's own selection factors need for warrior / worrier

organism, gene, population

greedy epsilon, worrier as epsilon

evolution on multiple levels like genetic, epigenetic, symbolic, cultural, ect

fitness landscapes of organisms but also of tech

worriers are better for jumping out of local minima warriors are better for carrying out the best strategy

perhaps a collection of personality-ish traits that are all being balanced

research:

A comparison of human and mouse COMT confirms that the amino acid at the Val/Met locus is important for COMT activity and suggests that COMT activity has decreased during the course of evolution. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1182110/

derived allele uniqe to humans

https://www.jstage.jst.go.jp/article/ase/121/3/121_130731/_html/-char/en

%%

Not much is known about the evolution of the COMT*L allele. In the sea of speculation, only two facts emerge: COMT activity has decreased during the course of recent evolution, and COMT*L is a derived allele unique to humans citation & citation. For the following speculation, the phenotypes said to arise from different variations of the COMT gene will be assumed true. Speculating about evolution on the organism level is relatively straightforward: this organism evolved a patch of photosensitive skin so it could tell which way was up. However, this level of analysis breaks down when trying to explain altruism, and thus comes speculation on the gene level: these organisms evolved so they would "lay down [their] life for two brothers or eight cousins" - J.B.S. Haldane. The evolution of COMT*L is not explained solely by either of these levels — instead, its evolution operates on the population level.

It can be assumed that given a constant environment without changing selection factors, there is an optimal mix of so called Warriors and Worriers. On top of the hypothetically constant selection pressures from the environment, the population itself creates its own selection pressures based upon the ratio of Warriors to Worriers. If there are too many Warriors, then a selection pressure favoring Worriers would arise, and vice versa, leading the population towards the optimal ratio of Warriors and Worriers. With heavy speculation, one could suggest that there exists a whole set of personality traits that are constantly being balanced across the population. Of course, these selection pressures would on top of the environmental selection pressures; a Worrier is more likely to do better in a famine where new ways to get resources need to created. A Worrier would be more likely to advocate for saving some grain for the future despite the fact that people are hungry now. A Warrior would do better in situations like getting chased by a predator, where they need to act well under stress and have high resilience.

Of course, environmental selection pressures are not constant. Hence, the question becomes, what type of environment shifts the optimal ratio of Warriors to Worriers? To answer this question, one must think of evolution in terms of a fitness landscape. An organism can be stuck in a local minima and require some large mutation to jump out of it. Disregarding genetic drift, the only other mutations that would persist would be ones leading the species to the bottom of the local minima. In recent years, the theory of evolution has been extended into axes beyond just the organism itself. A good book on the topic is Evolution in four dimensions: Genetic, epigenetic, behavioral, and symbolic variation in the history of life. Each of these axes have their own fitness landscape associated with them. We can imagine an axis called, for lack of a better term, technological. This axis could include, of course, technology, but also human action and innovation. For example, traveling to this location through this new path, or hunting in this different patch of land instead of the old patch would be grouped into this axis.

In the fitness landscape associated with this new technological axis, which co-evolves with the biological evolution of humans, Warriors would be better at carrying out the current best strategy, whereas the Worriers would be best at jumping out of the local minima and finding new strategies. This setup of Warriors and Worriers is much like Epsilon-Greedy algorithms in the world of computer science. A Greedy algorithm is an algorithm which simply does the best action for itself in the current situation, disregarding the future. These algorithms sometimes work for very complex problems. An Epsilon-Greedy algorithm is often used when dealing with unknown probability distributions, like trying to navigate an unknown fitness landscape. It acts just like a normal Greedy algorithm, except for some epsilon amount of time where it explores instead of carrying out the current best action. Figuring out the optimal value of this epsilon is a massive problem in computer science. In this case, Warriors would be the ones best suited for carrying out the normal Greedy function, and the Worriers would be the epsilon. Thus, when the population is still catching up to the limits of the current strategy — when times are good — Warriors will do better. When a new strategy is needed, Worriers will be needed.

However, the fact that COMT activity has been decreasing over time is still not explained. I would propose that this decrease is due to the expansion of the adjacent possible. If one imagines the realm of all that is currently possible as a circle, a ring outside of that circle is the adjacent possible — that which is almost achievable. As the possible grows, the adjacent possible grows faster. As the possible expands, the adjacent possible becomes much larger. As the adjacent possible becomes larger, there becomes more local minima to jump out of, and thus, the optimal ratio of Warriors and Worriers shifts to more Worriers. Of course, this is all speculation, but until the circle of possible grows larger, that is all we can do.

  1. Citations

1.0.6 Part Four: Combined Citations

(order of appearance)

%% ### More Feedback

Huxley, Sorry, I should have been a bit more clear. I was asking you to revise the second paragraph of Part 3 to fully explain your reasoning there. If that's what you were trying to do in this comment, then yes I do think you can clarify further before incorporating it into the write-up: 1) Are you still using epsilon in the sense of the epsilon-greedy algorithm that you described, essentially "exploratory" behavior (in contrast to "exploitative" behavior)? If you're going to introduce this epsilon concept earlier in this Part (i.e. the second paragraph), you'd obviously need to define your terms. Or you could probably get your point across here without the analogy to the algorithm. 2) It sounds like you might be proposing that there is group selection at work in maintaining both alleles in the population. It also sounds a bit like you're invoking the idea of balancing selection. If you are indeed thinking along these lines, can you ground your explanations in relation to those concepts? If you're not familiar with those and this is a separate line of speculation, then the areas in your reasoning that could use more explanation are: -why it can be assumed that there is some optimal mix of "warriors" and "worriers." How does that relate to both individual and group fitness and how do you expect those to interact? -how the balancing mechanism that you're proposing at the population level would actually work, i.e. what is the selection pressure that would tip the scales toward one phenotype and then back to the other based on changing ratios.

%%