What is a gene, and what is it for?

ChromosomesThe gene is a fundamental concept in biology, and it has been since Mendel introduced it in the late 19th century, unbeknownst to Darwin, who was just at the same time looking for a theory of heredity, flirted with Lamarckism, and tiene came up with his own, incorrect, notion of blended inheritance.

Mendel’s work was rediscovered in 1900 (it’s bad for one’s academic career when one publishes in obscure journals and agrees to become an administrator, as the Augustinian friar did), and ever since it has been a crucial component of our understanding of biology. But scientists have developed a number of different concepts of gene, concepts that don’t always sit quite nicely and coherently with each other. Sounds like a job for philosophers of science…

And sure enough, a number of philosophers have written about genes and what they are “for” (including yours truly). By far the most active pair, however, is made of Paul Griffith and Karola Stotz, who have published a number of influential papers on gene concepts, and have now produced a new book on the topic, Genetics and Philosophy: An Introduction. Below I will comment on an in-depth essay about the book, published by Lindell Bromham (a biologist) in Biology and Philosophy. (Often book reviews in philosophy journals are a lot more than just book reviews, and provide the author a way to articulate his own thoughts on whatever subject matter is covered by the book.)

Before proceeding, one might well wonder what is the point of this sort of philosophical exercise. As Bromham puts it: “the word ‘gene’ belongs to all of us, and understanding its meaning is critically important for informed discussion of many important ideas in medicine, biology and in wider society. None of us is unconnected to debates about what is a ‘gene for’.”

One of the first things Bromham clarifies in his commentary is that common locutions such as “gene for bowel cancer” are nonsense. This is because of course genes don’t evolve (usually) to harm their carriers. What typically happens, though, is that we identify a gene when it misfire (say, contributing to causing bowel cancer), even though its normal function is something entirely different. It would be like trying to understand how cars work by studying when they break down and concluding that the brakes are “for” smashing into other cars when they unexpectedly cross your path.

Bromham correctly argues that one problem is that there is no common, i.e., non-technical, term to identify gene variants (the technical word is “allele”), so that people can make more sense of statements from doctors and other health care professionals: “So in most people the Huntington Disease gene does not give them Huntington’s Disease, only those people unlucky enough to inherit an allele of the gene with more than 40 repeats will get the disease.” Which means that people shouldn’t worry about whether they carry the gene “for” Huntington’s, but rather whether they have inherited a particular, lethal allele of a gene that does whatever it does when it works properly and causes Huntington’s when it doesn’t.

The bulk of the paper is then devoted to a section-by-section discussion of Griffith and Stotz’s classification of different concepts of genes, beginning, of course with the Mendelian one: “A mendelian ‘gene’ is a pattern of inheritance: it does not need to be a ‘gene’ in the sense of being a cistron [a DNA sequence that functions as an unbroken hereditary unit] that makes a product or influences phenotype.”

One of the complications is that sometimes blocks of cistrons (known as “haplotypes”) are inherited at once, perhaps because the individual genes making up the haplotype contribute to a particular biological function and natural selection therefore favored their inheritance as a larger, coherent unit (a “supergene”).

Then again, sometimes natural selection surprises biologists, as in the case of the mimicry in some species of butterflies, such as Papilio polytes. Their mimetic phenotype is the result of a coordinated action among a number of genes, so for a long time scientists thought these genes must be part of a single haplotype block and be inherited together, to maintain coordination. But no, it turns out that the whole shebang is orchestrated instead by a regulatory gene, known as doublesex, which sends signals to all the other genes involved in the production of the mimetic phenotype. No need for a supergene, if the work can be done by a single regulatory element.

As Bromham summarizes it: “Uncovering the rich complexity of gene action has not dissolved the mendelian gene, but it has disassociated the mendelian gene from any particular form of DNA sequence.” Moreover, there are genes whose influence is actually dependent on epigenetic factors, as in the case of the GNAS complex, defects in which are associated with a number of diseases in humans, including certain kinds of tumors. Some of these deleterious effects, however, are manifested only if the GNAS allele is inherited from the mother, while others take place only if the variant is inherited from the father. Not exactly a standard Mendelian factor.

There are also so-called “postgenomic genes,” which are defined as “the DNA sequences that have a linear correspondence to the gene product of interest, wherever these occur in the genome. This way you start with the product and work backwards to the DNA sequences that were needed for its construction. The ‘product’ is somewhat vague, as it will look different if we focus on the RNA transcript, a processed RNA molecule, or a translated protein.”

For instance, the above mentioned GNAS is actually a complex made of a whopping 17 different exons (transcribed units). The complex is controlled by a number of regulatory sequences, which cause differential “splicing” (i.e., different cuts off the larger molecule, resulting in different proteins after translation from the intermediate RNA). Some of these spliced versions are inherited maternally, other paternally, and yet others biparentally. Moreover, one of the exons can be “read” by the transcription enzymes in two different ways (two “reading frames,” in molecular biological jargon), resulting in two distinct proteins. What, exactly, constitutes a gene in the case of the GNAS complex?

Then we have the “reactive genome,” that is the ensemble of the gene that codes for a given protein plus the regulatory sequences that control when and where during the development or life of an organism the gene gets transcribed and then translated. This is affected by the environment, a term that includes not just the external conditions in which the organism happens to live, but also the intra-organismal one, such as which cells surround which others, their communication channels, and so forth.

Griffith and Stotz, and therefore also Bromham, also comment on the relationship between genes and the ever treacherous concept of information. The authors of the book — and Bromham seems to agree — correctly state that genes cannot really be said to be “coding” for anything other than sequences of amino acids (and not even that, in the case of many regulatory sequences). They certainly do not code for phenotypes, so that the phrase “gene for X” where X is the color of the eyes, a particular behavior, or whatever, is nonsensical from a strictly biological perspective: “in most usages, the use of the word ‘information’ to describe the functioning of genetic systems is at best a sloppy metaphor.”

Indeed, even the famous distinction between genotypes and phenotypes, which is at the center of simplistic metaphors such as Richard Dawkins’ “selfish gene” is undermined by advances in understanding within the field of molecular genomics. As Bromham puts it: “consider DNA sequences that do not act solely as a template but also act directly as a binding site for other molecules which recognize the particular sequence of nucleotides as a kind of ‘docking station’ … This means that DNA control elements that serve as recognition sequences are both genotype (a message that can be copied from one form to another) and phenotype (a structure that contributes to form and function).”

The last two sections of the paper address interesting questions concerning which I think Bromham comes across as reasonable but perhaps a bit too conservative.

The first one is dedicated to the issue of epigenetic inheritance. This is by now definitely a well established phenomenon in a number of organisms, but Bromham is right that the jury is still out there in terms of its long-term evolutionary significance. Most epigenetic variants are relatively short-lived, influencing phenotypes over the span of a few generations, very brief when compared to the stability of genetic sequences, whose shared ancestry is recognizable in organisms that have diverged for millions or even tens of millions of years. My take on this is that we have evidence at the moment that epigenetic inheritance can influence the short-term response of natural populations to environmental changes, but that long-term stability of phenotypes is encoded only in the DNA. This, however, doesn’t make the import of epigenetic factors negligible, since after all evolution happens one generation at a time (“evolutionary time” is simply the cumulation of a number of short moments), and because the epigenotype constantly interacts with the genetic system of inheritance, in highly non-linear and complex ways.

The last section of Bromham’s commentary concerns the so-called Extended Synthesis, the new version of evolutionary theory that some people (including yours truly, duly cited by Bromham) have been put forth in recent years, contrasting it with the standard Modern Synthesis.

Here is Bromham’s summary of the situation: “If ‘extended’ means a richer appreciation of the tangled web of interactions between genes, development and environment, then there are very few fields of biology that do not contribute to such an extension: the more we find out, the more we know, and the more fiendishly complicated everything looks … If ‘extended’ means that knowledge of the complex interactions between genotype and phenotype cannot be accommodated in current evolutionary theory, then there are no concrete examples here that illustrate how this new knowledge does not fit into the established framework.”

In my mind, “Extended” actually means something in between those extremes: neither trivial (like the first one), nor revolutionary (like the second one). But I’ve already explained my take on the ES vs MS debate elsewhere, so I will refer readers to that commentary.

151 thoughts on “What is a gene, and what is it for?

  1. synred

    ‘Blinded’ inheritance does kind of work for characteristics the depended on the additive effect of a bunch of gene’s. One might regard it as a ‘model’ that worked well on pigeons and such.

    Like

  2. garthdaisy

    I don’t think I inherited the gene for understanding genes. 😉 I applaud the great and complicated work of dedicated biologists. All hail the nerd gene.

    Also, any time you can squeeze the words “simplistic” and “Richard Dawkins into one sentence is a good day for any anti-new-atheist. 😉

    Like

  3. SocraticGadfly

    Good stuff in general.

    And:

    Indeed, even the famous distinction between genotypes and phenotypes, which is at the center of simplistic metaphors such as Richard Dawkins’ “selfish gene” is undermined by advances in understanding within the field of molecular genomics.

    Setting alarm for Coel’s first ‘response’

    Liked by 1 person

  4. synred

    I really hate the ‘Self Gene’ title. It’s not just simplistic, it’s wrong and misleading. Dawkins says as much in the book, but there it is blazoned on the cover.

    I am, however, thinking using ‘extended phenotype’ as basis for my story based on Well’s ;’Star Begotten’ in which the Martians invade by other means.

    First, I have to finish proofing ‘The Story of Eddy Puss’.

    Like

  5. Coel

    Three out of four comments about Dawkins (four out of five if you count this one)? Funny how he can dominate conversation. Methinks the man amounts to more than his critics. 🙂

    Socratic,

    Setting alarm for Coel’s first ‘response’

    It was excusing and overlooking it, since, by the standards of Massimo’s near-obligatory put downs of Dawkins accompanying any mention of him, this one was pretty mild.

    Synred,

    I really hate the ‘Self Gene’ title. It’s not just simplistic, it’s wrong and misleading.

    It’s only wrong and misleading if you try hard to misunderstand it. And, really, you do have to be trying hard to misunderstand it, given how clearly and straightforwardly he has stated what it’s intended to mean.

    Excellent, now we’re going to have yet another thread dominated by Dawkins! Well, it wasn’t me who started it. 🙂

    Liked by 2 people

  6. nannus

    Being especially interested in the history of ideas and the history of concepts, it is indeed interesting to learn what such a concept like “gene” meant in Mendel’s time or later, around 1900 or in the 1920s, before DNA was known or understood. It is also interesting to follow the development of such ideas, including those ideas that where later ruled out (and those of competing theories, like the vitalism of the 1920s) and the political connections such theories developed. If one really starts to look into the history of philosophy and science of that time, it turns out to be much more complicated than what you see comming out at the end, and quite interesting in its connections to general history. If you study biology today, you probably learn a neat theory of evolution which is what came out of all of these developments. But the actual history of this theory was more complicated (and more interesting).
    I am currently starting to delve into the history of some currents of German philosophy of the 1920s that include the Darwinians on one side (Haeckel and his followers etc.) but on the other side some neo-vitalists like Hans Driesch and some others (I am currently investigating a forgotten but rather interesting right-wing philosopher called Karl Faigl). The theories of some of these people helped pave the way for the Nazis and it is this ideological connection I am especially interested in, but they are also interesting in themselves as part of the histories of science and philosophy.
    Question: as a biologist turned philosopher, are you aware of any literature on the materialis-vitalism-debate that existed in those days. I am not sure these debates ever made it into the English-speaking world. I am currently investigating original German texts (some of then obviously totally neglected by historians or philosophers so far, including, for example, a text published by Faigl in the same obscure journal in which Mendel published decades earlier). In case you know of any publications on these topics, I would aprecciate any hint. I have started blogging about these topics on https://philosophicalexcavations.wordpress.com/category/field-season-1/ (some articles on topics related to biology are going follow there, among other things.). The history of the concepts and theories of genetics, evolution etc. is quite relevant in this respect, so I would be greatful for any hints on relevant literature.

    Like

  7. Daniel Kaufman

    Massimo: I know we discussed it in one of our dialogues not too long ago, but could you speak a little about the once common view that the genetic code can be understand as a program that runs in the human body and what’s wrong with that view?

    Like

  8. brodix

    Maybe biology will lead the way from the current quantified information centric view, toward one that balances it with more fluid processes.

    Like

  9. synred

    It’s not me who miss understands what Dawkins means by ‘selfish gene’, but it likely to miss lead others. I object to ‘intentional stance’ where ever it pops up whether its genes or ‘corporations are people, my friend.’ It is intellectually lazy and talks down to the reader.

    Like

  10. Robin Herbert

    Hi Massimo,

    Good article. I probably understood as little of the technical detail as garth says he did, but the points made about the concept of a gene in general are very interesting to me, something I have puzzled about.

    Liked by 1 person

  11. brodix

    Continuing the thought, it would seem one of the main differences between “mechanical” and “organic” is that mechanics involves quantifiable parts working exactly as intended(which does require intentionality, even if the effects are unpredictable), while organics are much more fluid and adaptive.

    Like

  12. Robin Herbert

    Speaking of Richard Dawkins…

    I have wanted to mention for some time that the chapter in “The Greatest Show on Earth” on embryonic development is one damn fine piece of science writing.

    That is not to diminish the rest of the book, or his other books, but it just stuck in my mind as an exemplar of the field.

    Or maybe it is just because it helped illuminate something which had puzzled me for some time.

    I am, in general a curmodgeonly consumer of science writing, but finishing that chapter was one of the few occasions that gave me the feeling “that is how it is done!”

    Like

  13. Robin Herbert

    Hi Coel,

    Methinks the man amounts to more than his critics.

    Possibly, he certainly amounts to more than his hagiologists.

    The point is that he amounts to a good deal less than the hagiology.

    Liked by 2 people

  14. Robin Herbert

    I am interested to read Massimo’s thoughts, but don’t think the programming metaphor is too bad unless it is taken too far. I think that the cell can be regarded as ‘programmable’, for example a virus invades the nucleus, inserts some ‘code’ which makes more viruses.

    Like

  15. synred

    So the supreme court is ‘deriving’ ought from ‘is’. In the Bob McDonnel case. They seem to have bought into ‘bub ma, everybody does it!’

    What hire authority could you want?

    <\;_(

    Like

  16. brodix

    Another difference between mechanical and organic systems is the first is entropic, while the primal function of the second is to convert energy to biology.
    That one is quantified and thus closed, while the other is fluid and adaptable would be an essential aspect of this.

    Like

  17. davidlduffy

    “genes as a program”: Down in the RNA world, we already have “genes” that can replicate themselves, or more accurately cooperate by cross-replicating each other:

    http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2652413/

    http://www.nature.com/nature/journal/v491/n7422/full/nature11549.html

    “When such cooperative networks are competed directly against selfish autocatalytic cycles, the former grow faster, indicating an intrinsic ability of RNA populations to evolve greater complexity through cooperation. We can observe the evolvability of networks through in vitro selection. Our experiments highlight the advantages of cooperative behaviour even at the molecular stages of nascent life.”

    The basic physics of maintaining a stable dynamic structure (retaining identity over time either by self-regulation or copying) is the thermodynamics of information – folks will have seen papers by Jeremy England and others discussing this point eg
    http://scitation.aip.org/content/aip/journal/jcp/139/12/10.1063/1.4818538#c9
    http://rsta.royalsocietypublishing.org/content/374/2063

    That is to say, even though the same molecule can be both gene and “phene”, at the basic level these are two different processes thermodynamically

    “[there is] no prior requirement for the system to be partitioned into separate plant and controller components.” The latter is referring to regulation and adaptation, but it is equally true of replication.

    Liked by 1 person

  18. Alan White

    Massimo–

    Do you then believe that evolution has an ineliminable chaotic factor that might explain large leaps between otherwise similar kinds, such as mammals and marsupials? Sorry if I seem thick on all this, but I just want the biggest accurate picture on your overall thesis, maybe by example. This was a great post.

    Liked by 1 person

  19. Coel

    Hi Robin,

    … but don’t think the programming metaphor is too bad unless it is taken too far.

    As with all of these analogies — “genes are a language”, “genes are information”, “genes are a program”, etc — they are right in some ways and wrong in others. So one can adopt them as useful and informative, or one can write articles saying they are wrong and misleading. When I read articles of the latter types I usually find myself thinking “ok but I never connoted it that way anyhow”.

    The important thing is to keep in mind the actual mechanical details of the matter, and refer to that if one has any issues.

    Like

  20. Massimo Post author

    Nannus,

    I know there is a fairly sizeable literature on the historical debate between materialism and vitalism, but frankly I have never been particularly interested in it myself, so I can’t suggest anything off the top of my head, except for a good search on Google Scholar.

    Dan,

    Here is a summary of what I think about the genes-as-coded information metaphor:

    https://www.dropbox.com/s/u1npnq4falcvxup/2011-Pigliucci-Why%20machine%20information%20metaphors%20are%20bad%20for%20science%20and%20science%20education-Science%20%26%20Education.pdf?dl=0

    From the abstract:

    Genes are often described by biologists using metaphors derived from computational science: they are thought of as carriers of information, as being the equivalent of ‘‘blueprints’’ for the construction of organisms. Likewise, cells are often characterized as ‘‘factories’’ and organisms themselves become analogous to machines. Accordingly, when the human genome project was initially announced, the promise was that we would soon know how a human being is made, just as we know how to make airplanes and buildings. Importantly, modern proponents of Intelligent Design, the latest version of creationism, have exploited biologists’ use of the language of information and blueprints to make their spurious case, based on pseudoscientific concepts such as ‘‘irreducible complexity’’ and on flawed analogies between living cells and mechanical factories. However, the living organism = machine analogy was criticized already by David Hume in his Dialogues Concerning Natural Religion. In line with Hume’s criticism, over the past several years a more nuanced and accurate understanding of what genes are and how they operate has emerged, ironically in part from the work of computational scientists who take biology, and in particular developmental biology, more seriously than some biologists seem to do. In this article we connect Hume’s original criticism of the living organism = machine analogy with the modern ID movement, and illustrate how the use of misleading and outdated metaphors in science can play into the hands of pseudoscientists. Thus, we argue that dropping the blueprint and similar metaphors will improve both the science of biology and its understanding by the general public.

    Alan,

    I don’t think there are huge evolutionary leaps among species, but I do think that the role of stasis over evolutionary time, as well as of developmental constraints over shorter periods, has been woefully underestimated by the architects of the Modern Synthesis (the standard model in evolutionary biology) and its current defenders. I also think — with Gould — that biologists need to take more seriously the fact that their discipline is fundamentally historical, and that the role played by contingency is very large, even in comparison with that play by deterministic forces like natural selection.

    Coel,

    The reason Dawkins’ metaphor is a bad one is that he should have known how people were going to misinterpret it and resisted the temptation for cuteness / sellability. More substantively, it is a metaphor deeply steeped in a reductionist / information theoretic view of genes that is simply not tenable any more, if it ever was.

    Liked by 3 people

Comments are closed.