|
|
||||||||
APPETITE, OBESITY, DIGESTION, AND METABOLISM
Departments of Laboratory Medicine and Pathobiology and the Banting and Best Diabetes Centre, University of Toronto, Toronto, Ontario, Canada
Submitted 16 August 2005 ; accepted in final form 11 October 2005
| ABSTRACT |
|---|
|
|
|---|
glucagon; gene expression; transgenic mice; diabetes
Extensive knowledge has been gained on the sequences and DNA-binding proteins that are required for expression of the rat proglucagon gene (4, 15, 16). Approximately 300 bases of rat proglucagon 5' flanking sequence have been found to be sufficient to support expression of reporter genes in glucagon gene-expressing rodent islet cell lines (5). Within the immediate promoter of the rat proglucagon gene, four enhancer elements (designated G2 through G5) and an A-cell-specific promoter element (G1), as well as a large number of DNA-binding proteins have been identified and characterized (4, 15, 16). In contrast, much less is known about the regulation of the human proglucagon gene. Despite extensive sequence similarity of the immediate promoter region, short human proglucagon promoter-reporter gene constructs fail to generate significant reporter gene activity in rodent islet cell lines (21). It was concluded that divergent mechanisms are used for tissue-specific regulation of the human and rat proglucagon genes, as more than 3,300 bases of human proglucagon 5'-flanking sequences were required for expression in rodent islet cell lines (21). Similar conclusions were also drawn from observations with transgenic mice. Although 1,300 bases of rat proglucagon gene 5' flanking sequences allowed expression of a transgene in islets of mice (6), 1,600 bases of human proglucagon 5' flanking sequence did not generate expression in islets of mice, yet this human proglucagon promoter construct did support expression in two other proglucagon gene expressing tissues, the intestine, and selected neurons of the brain (21).
To help identify which lineage, either the primate lineage leading to human, or rodent lineage leading to rat, produces the pancreatic islet cell transcriptional activity change, proglucagon gene promoters from the chicken and diverse mammals have been characterized (Tsai B, Yue S, and Irwin DM, unpublished observations, Ref. 30). The chicken proglucagon gene promoter was found to have extremely limited sequence similarity to mammalian promoter, yet it could support reporter gene expression in rodent islet cell lines (30). This work suggested that significant changes in the sequence of the proglucagon promoter could occur yet retain promoter activity. A survey of proglucagon promoters from diverse mammals that included two primates (human and rhesus monkey), three rodents (rat, mouse, and hamster), a carnivore (dog), and an artiodactyl (cow) found that only the primate proglucagon promoters failed to support reporter gene expression in transient-transfection studies with rodent islet cell lines (Tsai B, Yue S, and Irwin DM, unpublished observations). These two studies strongly suggest that promoter sequence changes have occurred on the lineage leading to primates that have yielded an immediate promoter that fails to support expression in rodent islet cell lines. It was concluded that additional sequences, either further 5' or 3' to the mRNA start site, are necessary for the regulation of expression of the human proglucagon gene (Tsai B, Yue S, and Irwin DM, unpublished observations) (21, 30).
In this study, we have compared proglucagon gene sequences from diverse mammals and the chicken to identify evolutionarily conserved noncoding regions (ECRs) that potentially regulate expression of the human proglucagon gene. ECRs are sequences that do not code for protein yet have changed little through evolution, presumably because they have been maintained because they have a function (8, 23), potentially a regulatory function. Our genomic comparisons have identified three ECRs: the immediate promoter region, a second
5-kb upstream of the proglucagon promoter, and a third within intron 1. We have tested the transcription activity of these ECR sequences to determine whether they support gene expression in islet cells. We found that the ECR sequence present in intron 1 supports expression of the proglucagon gene in rodent islet cells, both in vitro and in vivo.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Bioinformatic analysis. Genomic sequences that encoded proglucagon genes were downloaded from the ensemble Web site (http://www.ensemble.org). Genome databases for human (NCBI35), mouse (NCBIm33), rat (RGSC3.4), dog (BROADD1), cow (Btau1.0), opossum (BROAD0.5), and chicken (WASUC1), were searched for proglucagon genes. The cow and opossum gene databases had not been annotated; thus proglucagon gene sequences were identified through BLAST searches. The cow proglucagon genomic sequence was identified using the BLASTN algorithm to find genomic sequences identical to the cow proglucagon cDNA sequence. The opossum proglucagon gene sequence was identified using the TBLASTN algorithm to find genomic sequences that had similarity to mammalian proglucagon amino acid sequences. Proglucagon genomic sequences were downloaded and further characterized using MacDNASIS version 3.7 DNA and protein sequence analysis software (Hitachi, San Bruno, CA) using methods we have previously described (12, 13, 32). Multiple sequence alignments of the proglucagon genomic sequences were generated using the program MultiPipMaker (25, 26).
Reporter gene plasmids. Genomic fragments containing putative promoter and intron 1 regions of the human proglucagon gene were amplified by PCR using primers 602f and 3071r (see Table 1) from a P1 genomic clone that has previous been characterized (21) and cloned into pCR2.1 TA cloning vector (Invitrogen). Smaller promoter and intron fragments were amplified from this promoter-intron clone using the primers listed in Table 1, were released from the TA vector by digestion with appropriate restriction endonucleases (see Table 1), and cloned into pGL3 basic, pGL3 promoter, or modified pGL3 luciferase reporter gene vectors (Promega). Large genomic fragments (>1 kb) were amplified using Turbo Pfu (Stratagene), while smaller fragments were amplified with Taq (Roche, Laval PQ). The 602-base and minimal 82-base human promoters were amplified using the primers 602f and 58r and 82f and 58r, respectively. Intron 1 sequences were amplified using the primers 98f and 3063r. A series of 500-base intron fragments were amplified using the following pairs of primers: 98f and 552r, 553f and 1018r, 1018f and 1454r, 1455f and 1920r, 1921f and 2378r, 2379f and 2862r, and 2862f and 3063r. The DNA sequence of all promoter-reporter constructs was verified.
|
TC-1, and baby hamster fibroblasts (BHK cells) were grown in DMEM (4.5 g glucose/l) supplemented with 5% fetal calf serum and 1% penicillin and streptomycin. Luciferase reporter gene constructs were transfected into the cell lines using lipofectamine (Invitrogen). Controls for transfections included the promoterless pGL3 basic, the positive transfection pGL3 control vector (containing the SV40 promoter and enhancer; Promega), and a construct containing 312 bases of promoter and 58 bases of exon 1 of the rat proglucagon gene cloned into pGL3 Basic. Reporter gene construct activity was measured by detecting luciferase activity using a Lumat LB 9501 luminometer (EG&G Berthold, Wellesley, MA). Data analysis. Results are presented as means ± SE from at least three independent experiments. All luciferase assays were performed in triplicate. Data were analyzed by ANOVA followed by Tukey's test or paired Student's t-tests using the computer software InStat 2.0 (GraphPad Software, San Diego, CA). Differences were considered statistically significant at P < 0.05.
Generation of transgenic mice.
The 5.7 glucagon-growth hormone (GLU-GH) transgene contained a 5.8-kb genomic fragment that included 5775 bp of 5' flanking sequence and 96 bp of exon 1 and was ligated upstream of the human growth hormone (hGH) reporter gene. The 3.7 GLUin1-
Gal transgene was a 3.7-kb genomic fragment containing 602 bases of 5' flanking sequence, all of exon 1 and intron 1, and 8 bases of untranslated region of exon 2 of the human proglucagon gene were ligated upstream of the
-galactosidase (
-gal) reporter gene. The University of Toronto Faculty of Medicine Transgene and Knockout Facility used the transgene constructs to generate transgenic founders in the FVB strain. Transgenic mice were characterized by Southern blot analysis or PCR analysis of tail DNA, as previously described (21). Primers for detection of transgene are listed in Table 1. Germ line transmission was identified from two founders for each transgene, with transgene expression characterized in mice from the second generation of each line. The University Health Network Animal Care Committee approved the experimental protocol for the generation and characterization of transgenic mice.
RNA isolation and analysis.
Total cellular RNA was isolated from various tissues using Trizol reagent (Invitrogen). cDNA was synthesized by reverse transcription using a cDNA synthesis kit (AP Biotech, Baie d'Urfe, PQ). Primers for detection of transgene expression are listed in Table 1. For
-gal, the primers were
-galf and
-galr. Human proglucagon intron 1 primers were 2379f and 2862r. Mouse proglucagon primers were mouse PG5' and mouse PG3'. GAPDH primers were GAPDH5' and GAPDH3'.
| RESULTS |
|---|
|
|
|---|
Conserved noncoding regions potentially have a role in the regulation of nearby genes (8, 23). To identify ECRs near the human proglucagon gene, we used the program MultiPipMaker (25, 26) to align genomic sequences that encode the human, mouse, rat, dog, cow, opossum, and chicken proglucagon genes. An output of the MultiPipMaker program is a PIP plot where for each species, the genomic sequence was compared with the human sequence, and portions that show gap-free alignments with more than 50% identity to the human sequence are indicated by dots, with the height indicating the level of identity. If longer sequences have similar identity to the human sequence, then they are shown as lines. Figure 1 shows the PIP plot of the mammalian and chicken proglucagon genes; similar PIP plots were generated using other mammalian sequences as the reference sequence (results not shown).
|
Comparisons within placental mammals (human, mouse, rat, dog, and cow) showed that most of the genomic sequences near and within the proglucagon gene possessed greater than 50% sequence identity, with a few areas showing higher levels of sequence identity (Fig. 1). The four regions that showed the highest identities are (1) about 5 kb upstream of the mRNA start site, (2) the immediate promoter region, (3) near the 3' end of intron 1, and (4) the middle of intron 3. The highly conserved sequences about 16 kb upstream of the proglucagon mRNA start site in Fig. 1 are the coding and 3' untranslated sequences encoded by the upstream FAP gene.
Comparison of the proglucagon genomic sequence from the more distantly related opossum (a marsupial) to placental mammalian proglucagon genomic sequences identified fewer portions of the sequence that show greater than 50% sequence identity (Fig. 1). All six exons of the proglucagon gene are well conserved, although the untranslated sequences within exons 1 and 6 show less conservation than the protein coding exons (Fig. 1). In addition to the exon sequences, three noncoding genomic areas, identified as ECR1, ECR2, and ECR3 in Fig. 1, show very high sequence conservation. ECR1 is the 300-base immediate promoter region and is known to be important for expression of the rat proglucagon gene (5). ECR2 is
5 kb upstream from the mRNA start site and contains a region and of more than 200 bases with greater than 90% identity between human and the opossum (and more than 300 bases showing greater than 90% identity within placental mammals). ECR3, which is found near the 3' end of intron 1 is even better conserved than the other two ECRs, composed of
300 bases, showing greater than 92% identity between opossum and human (and
300 bases with greater than 97% identity within placental mammals).
Expression of human proglucagon transgene with 5.7 kb of 5' flanking sequence. We have previously reported that 5.7 kb of human proglucagon gene 5' flanking sequence could drive expression of luciferase reporter gene in rodent islet cell lines (21). ECR2 is included within the 5.7 kb of 5' flanking sequence, thus potentially contributing to islet-specific expression of the proglucagon gene. To determine whether the 5.7 kb of 5' flanking sequence was sufficient to drive expression in glucagon-gene expressing cells in vivo, we generated the 5.7 GLU-GH transgene. Tissue-specific expression of the reporter gene was characterized in two founder lines of transgenic mice by RT-PCR (Fig. 2). hGH reporter transgene mRNA was detected in the stomach, duodenum, jejunum, ileum, colon, and brain, but not pancreas or liver of mice descending from founder no. 10 (Fig. 2A). In contrast, low levels of growth hormone transgene mRNA were detected by RT-PCR in all tested tissues of mice descendent from founder no. 6 (Fig. 2B). To confirm expression of the transgene product, immunocytochemistry was used to assay for expression of the transgene protein product hGH. hGH-immunoreactive cells were detected in the stomach, duodenum, jejunum, ileum, colon, and brain, but not in the pancreas or liver of mice from both line no. 10 and 6 (data not shown). Despite widespread expression of the transgene as detected by RT-PCR (Fig. 2B), immunocytochemistry could only detect the hGH reporter in the stomach, duodenum, jejunum, ileum, colon, and brain in mice from line no. 6, suggesting that only some tissues properly express the transgene to generate mRNA that can be translated to yield hGH.
|
|
Enhancer activity of the evolutionarily conserved region in intron 1.
To determine which portions of intron 1 (bases 97 to 3,063 of the human proglucagon gene) contribute to the enhancer activity, a series of smaller fragments of intron 1 were tested for enhancer activity (Fig. 4A). A total of seven fragments, each of
500 base pairs, spanning the intron were cloned upstream of the SV40 promoter in pGL3P. Only one fragment, spanning bases 2,379 to 2,862, significantly enhanced luciferase activity in rodent islet cell lines (Fig. 4A). This fragment from bases 2,379 to 2,862, which enhances reporter gene activity, corresponds to the ECR3 that was found within intron 1 of the proglucagon gene (Fig. 1). The short ECR3 portion of intron 1 was also found to enhance reporter gene activity when placed upstream of 332-base or 82-base human proglucagon gene promoters or a minimal 82-base rat proglucagon promoter, although to a lower extent than that generated from the 312-base rat promoter (Fig. 4B). No other fragment of intron 1 significantly enhanced reporter gene activity, but surprisingly, one portion of the intron, from bases 1,018 to 1,454, essentially silenced reporter activity (Fig. 4A). The sequence between bases 1,018 and 1,454 of the proglucagon gene does not appear to correspond to any highly conserved portion of intron 1, although most of intron 1 is better conserved than other noncoding sequences near the proglucagon gene (Fig. 1).
|
-gal reporter gene. This construct was identical to the construct in Fig. 3A, except for the reporter gene. A total of three founders were generated, two of which produced germ line transmission. To assess reporter gene expression, RT-PCR was conducted using primers for the
-gal sequence. As shown in Fig. 5A, reporter gene expression was detected in both lines showing germ line transmission in brain and intestinal tissues, as previously detected in 1.6 kb (21) and 5.7 kb (Fig. 2), 5' flanking transgenic mice. In addition, both of our intron 1-containing transgenic mice also expressed the transgene in the pancreas and did not express the transgene in tissues, such as liver and muscle (Fig. 5A), suggesting that this intron 1-containing transgene construct expresses the reporter gene in all major sites of proglucagon expression. An attempt to amplify a portion of human proglucagon intron 1 (Fig. 5B) in these same cDNA samples failed to generate the expected band, indicating that the mRNA was not contaminated with genomic DNA or partially processed mRNA, yet GAPDH could be amplified from all samples (Fig. 5D), suggesting that the mRNA was intact and converted to cDNA. The pattern of expression of the transgene (Fig. 5A) was identical to that of the endogenous mouse proglucagon gene (Fig. 5C).
|
| DISCUSSION |
|---|
|
|
|---|
We first compared the genomic sequences encoding the proglucagon gene of the chicken to those sequences encoding proglucagon genes from mammals, thus representing sequence divergence of
310 million years (9, 17), and found that the only conserved sequences are the exons of the gene (Fig. 1). The protein coding exons 2 through 5 that encode most of the proglucagon precursor show about 75% sequence conservation, while exon 6, which encodes the C-terminus of proglucagon and the 3' untranslated region, had more varied conservation (Fig. 1). Although our chicken proglucagon genomic sequence did contain the first exon of the gene (30), this exon was not conserved with the mammalian sequences. The mammalian proglucagon gene is flanked by the FAP gene, about 16 kb upstream, and the DPIV gene, about 65 kb downstream (12), with a similar structure observed in the chicken genome (30). The last exon of the upstream FAP gene is conserved between chicken and mammalian proglucagon genomic sequences (Fig. 1), whereas downstream, nothing similar was identified between the proglucagon and DPIV genes (data not shown). Similar results were observed with comparisons of proglucagon genomic sequences from fish or frogs to mammals (data not shown). Therefore, comparisons of chicken (or fish or frogs) and mammals fail to detect any ECRs, which potentially could be regulatory sequences.
Our next comparisons involved genomic sequences from the placental mammals (human, mouse, rat, dog, and cow), which diverged from each other over the past
100 million years (9, 17). In comparisons within placental mammals, most of the nonrepetitive sequence was found to align between species (Fig. 1), with most of the "gaps" in the alignment due to deletion or insertion of sequence rather than low conservation. A large number of noncoding sequences are highly conserved between proglucagon genes of placental mammals (see Fig. 1). Comparisons that included the proglucagon gene sequence from the marsupial opossum, which diverged from placental mammals about 170 million years ago (9, 17), result in the identification of fewer ECRs (Fig. 2). In comparisons between the opossum and placental mammals, the noncoding sequences immediately upstream of exon 1 (ECR1), a sequence about 5 kb 5' to exon 1 (ECR2), and intron 1, especially near the 3' end of intron 1 (ECR3) (Fig. 2). The sequences of the three ECRs are better conserved than other flanking or intronic sequences, suggesting that they are being maintained by selection because they have a function, possibly as regulatory elements. In the opossum-placental mammal comparisons, the best conserved genomic sequence is ECR3 located in intron 1, which shows higher sequence identity than any of the protein coding exons, implying that it is undergoing a very slow rate of sequence evolution. Yet despite a slow rate of sequence evolution in mammals for ECR3, no sequence similar to ECR3 was found within the chicken proglucagon gene (see Fig. 1). This suggests that the selective constraints, and hence function of the sequence, have changed between chicken and mammals. A similar change appears to have also occurred for part of ECR1 in the immediate promoter region between these species (Tsai B, Yue S, and Irwin DM, unpublished observations).
ECR1 has a clear role in the regulation of rat proglucagon gene expression. ECR1 corresponds to the well-characterized immediate promoter region of the rat proglucagon gene (4, 15, 16), which in many mammals is sufficient to support expression in rodent islet cell lines (Tsai B, Yue S, and Irwin DM, unpublished observations). Although this sequence is well conserved in the human proglucagon gene, it is not sufficient to support reporter gene expression in vitro or in vivo, suggesting that a few changes may prevent transcriptional activity (21). Additional sequences have been hypothesized to be necessary for proper expression of the human proglucagon gene (21). In vitro transfection assays suggested that human proglucagon gene sequences 3.3 to 5.7 kb upstream of the mRNA start site could support expression of a reporter gene in rodent islet cell lines (21). ECR2 at about 5 kb upstream of the human proglucagon mRNA start site falls within the sequence defined by transient transfection assays as having enhancer activity. Transgenic mice, which include the 5.7 kb of 5' flanking sequence, including ECR2, failed to express the reporter gene in the pancreas, yet did express the reporter in the other major sites of endogenous proglucagon gene expression, the intestine and brain (Fig. 3). Thus the presence of both ECR1 and ECR2 was not sufficient to drive expression of a reporter gene in all sites of expression of the endogenous proglucagon gene.
Genomic comparisons indicated that the most conserved portion of mammalian proglucagon genes is not a protein coding exon but rather is a 450-base segment (ECR3) near the 3' end of intron 1 (Fig. 1) and potentially confers a regulatory function. Testing the enhancer activity of the proglucagon gene intron 1 is greatly facilitated by the fact that the initiator methionine for human (and other mammalian) proglucagon gene is encoded by exon 2 (31). Thus all of the exon 1 and intron 1, together with the 5' end of exon 2, could be included in our constructs, as they would only contribute 5' untranslated sequence and would not interfere with the expression, or function, of downstream reporter genes. Both in vitro and in vivo, reporter gene constructs that contained 602 bases of 5' flanking sequence (containing ECR1), all of exon 1 and intron 1 (containing ECR3), and the first 12 bases of exon 2, allowed expression of reporter genes in rodent islet cells (Figs. 3 and 5). Because the 602 bases of 5' flanking sequence, as well as exon 1 sequences, did not support reporter gene expression (Fig. 3), these results suggest that sequences within intron 1 were acting as an enhancer. Analysis of fragments of intron 1 suggests that most of the enhancer activity is located in a fragment between bases 2,379 and 2,860 of intron 1 (Fig. 5), the sequence that corresponds to ECR3 (Fig. 2). Searches of the ECR3 sequence identified potential binding sites for transcription factors such as HNF3
, Cdx-2/3, GATA factors, homeobox-containing factors (e.g., Isl-1, Brn4), and Pax factors. The potential transcription factor binding sites included many factors that previously had been identified as having a role in the expression of the rat proglucagon gene (4, 15, 16).
Most of intron 1 is better conserved than most of the noncoding sequences flanking or within mammalian proglucagon genes (Fig. 1), suggesting that selective pressure is conserving the sequence, possibly for additional functions. The observation that a sequence between bases 1,018 and 1,454 may repress expression supports this suggestion. Sequences distributed through intron 1 may act to modulate expression of proglucagon rather than directly act as an enhancer. The disassociation of sequences necessary for expression and those necessary for regulation has previously been demonstrated for intestinal expression of the human proglucagon gene. While 1.6 kb of 5' flanking sequence is sufficient for expression in intestinal cell (21), these sequences were not sufficient for regulated expression via nutrient sensing (22).
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
The costs of publication of this article were defrayed in part by the payment of page charges. The article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
| REFERENCES |
|---|
|
|
|---|
-cell and regulation of glucagon gene transcriptions. In: Molecular Basis of Pancreas Development and Function, edited by Habener JF and Hussain M. Norwell, MA: Kluwer Academic, 2001, p. 6789.This article has been cited by other articles:
![]() |
D. Chen, G. Jin, Y. Wang, H. Wang, H. Liu, Y. Liu, W. Fan, H. Ma, R. Miao, Z. Hu, et al. Genetic variants in peroxisome proliferator-activated receptor-{gamma} gene are associated with risk of lung cancer in a Chinese population Carcinogenesis, February 1, 2008; 29(2): 342 - 350. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Wang, B. P. Flemming, C. C. Martin, S. R. Allen, J. Walters, J. K. Oeser, J. C. Hutton, and R. M. O'Brien Long-Range Enhancers Are Required to Maintain Expression of the Autoantigen Islet-Specific Glucose-6-Phosphatase Catalytic Subunit Related Protein in Adult Mouse Islets In Vivo Diabetes, January 1, 2008; 57(1): 133 - 141. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Visit Other APS Journals Online |