A supergene is a chromosomal region encompassing multiple neighboring genes that are inherited together because of close genetic linkage, i.e. much less recombination than would normally be expected.[1] This mode of inheritance can be due to genomic rearrangements between supergene variants.
A supergene region can contain few, functionally related genes that clearly contribute to a shared phenotype.[2]
Phenotypes encoded by supergenes
Supergenes have cis-effects due to multiple loci (which may be within a gene, or within a single gene's regulatory region), and tight linkage. They are classically polymorphic, whereby different supergene variants code for different phenotypes.
Primula supergene. Pin and thrum morphs of Primula have effects on genetic compatibility (pin style x thrum pollen, or thrum style x pin pollen matings are successful, while pin x pin, and thrum x thrum matings are rarely successful due to pollen-style incompatibility), and have different style length, anther height in the corolla tube, pollen size, and papilla size on the stigma. Each of these effects is controlled by a different locus in the same supergene, but recombinants are occasionally found with traits combining those of "pin" and "thrum" morphs.
Origin
The earliest use of the term "supergene" may be in an article by A. Ernst (1936) in the journal Archiv der Julius Klaus-Stiftung für Vererbungsforschung, Sozialanthropologie und Rassenhygiene.[6]
Classically, supergenes were hypothesized to have evolved from less tightly-linked genes coming together via chromosomal rearrangement or reduced crossing over, due to selection for particular multilocus phenotypes. For instance, in Batesian mimicry supergenes in species such as Papilio memnon, genes are required to affect hind-wing, fore-wing, and body colour, and also the presence or absence of long projections (the "tails" of swallowtail butterflies).
The case for the accumulative origin for supergenes was originally based on the work of Nabours on polymorphism for colour and pattern in grouse locusts (Tetrigidae). In Acridium arenosum the colour-patterns are controlled by thirteen genes on the same chromosome, which reassort (recombine) fairly easily. They also occur in Apotettix eurycephalus where they form two tightly linked groups, between which there is 7% crossing-over. Furthermore, in Paratettix texanus there appears to be complete suppression of crossing-over among 24 out of 25 of the colour-pattern genes, which can be distinguished by comparing their effects with those found in other species. Analysis of Nabour's data by Darlington & Mather concluded that the genes responsible for the morphs of Paratettix texanus have been gradually aggregated into a group which acts as a single switch-mechanism.[7][8][9] This explanation was accepted by E.B. Ford and incorporated into his accounts of ecological genetics.[10][11]
This process might involve suppression of crossing-over, translocation of chromosome fragments and possibly occasional cistron duplication. That crossing-over can be suppressed by selection has been known for many years; Detlefsen and Roberts were able to reduce recombination between the loci for white eyes (w) and miniature wings (m) in Drosophila melanogaster from the normal 36% to 6% in one line and 0.6% in another.[12][13]
Debate has tended to centre round the question, could the component genes in a super-gene have started off on separate chromosomes, with subsequent reorganization, or is it necessary for them to start on the same chromosome? Many scientists today believe the latter, because some linkage disequilibrium is initially needed to select for tighter linkage, and linkage disequilibrium requires both the previous existence of polymorphisms via some other process, like natural selection, favouring gene combinations.[14] If genes are weakly linked, it is probable that the rarer advantageous haplotype dies out, leading to the loss of polymorphism at the other locus.
Most people, following J.R.G. Turner, therefore argue that supergenes arose in situ due to selection for correlated and epistatic traits, which just happened to have been possible to select via the existence of suitable loci closely linked to the original variant.[15] Turner calls this a "sieve" explanation, and the Turner explanation might be called the "Turner sieve" hypothesis.[16]Maynard Smith agreed with this view in his authoritative textbook.[17] Nevertheless, the question is not definitively settled. The problem is connected to an even larger question, the evolution of evolvability.
Genomic structure
Genomic rearrangements such as inversions can suppress recombination.
Suppressed recombination leads to accumulation of repetitive elements (including to degenerative expansion) in early supergene evolution [Ref Papaya, Fire ant], and to changes in gene expression [ref Fire ant, anther smut].
Gene complexes are not supergenes
Gene complexes, in contrast, are simply tightly linked groups of genes, often created via gene duplication (sometimes called tandem duplication if the duplicates remain side-by-side). Here, each gene has similar though slightly diverged function. For example, the human major histocompatibility complex (MHC) region is a complex of tightly linked genes all acting in the immune system, but has no claim to be a supergene, even though the component genes very likely have epistatic effects and are in strong disequilibrium due in part to selection.
^Ernst A. 1936. Weitere Untersuchungen zur Phänanalyse, zum Fertilitätsproblem und zur Genetik heterostyler Primeln. II. Primula hortensis. Wettst. Arch. J. K.-Stift. Ver. Sozialanthropol. Ras. 11, 1-280.
^Nabours R.K. 1929. The genetics of the Tettigidae [Tetrigidae] (Grouse Locusts). Bibliogr. genet. 5, 27-104.
^Nabours R.K., Larson L. & Hartwig N. 1933. Inheritance of colour-patterns in the grouse-locust Acridium arenosum Burmeister (Tettigidae [Tetrigidae]. Genetics 18, 159-71.
^Darlington C.D. & Mather K. 1949. The elements of genetics, p335-6. Allen & Unwin, London.
^Ford E.B. 1965. Genetic polymorphism, p17-25. MIT Press 1965.
^Detlefsen J.A. and Roberts E. 1921. Studies on crossing-over I. The effects of selection on crossover values. J Exp Zoology. 32, 333-54.
^Darlington C.D. 1956. Chromosome botany, p36. Allen & Unwin, London.
^Charlesworth D. and Charlesworth B. 1975. Theoretical genetics of Batesian mimicry I. Single-locus models. J. Theoret. Biol. 55, 283-303; II. Evolution of supergenes. J. Theoret. Biol. 55, 305-324; III. Evolution of dominance. J. Theoret. Biol. 55, 325-337.
^Thompson M.J. & Jiggins C.D. 2014 Supergenes and their role in evolution. Heredity 113, 1-8.
^Turner J.R.G. 1984. Mimicry: the palatability spectrum and its consequences. Chap. 14. In Vane-Wright R.I. & Ackery P.R. (eds) The Biology of Butterflies. Symposia of the Royal Entomological Society of London #11. Academic Press, London.
Berdan EL, Flatt T, Kozak
GM, Lotterhos KE, Wielstra B. 2022 Genomic
architecture of supergenes: connecting form
and function. Phil. Trans. R. Soc. B 377:
20210192