Bioinformatic Analysis of Retroelement-Associated Sequences in Human and Mouse Promoters

Authors: Nadezhda M. Usmanova, Nikolai V. Tomilin

Abstract:

Mammalian genomes contain a large number of retroelements (SINEs, LINEs and LTRs) which could affect expression of protein-coding genes through associated transcription factor binding sites (TFBS). The activity of retroelement-associated TFBS has been confirmed experimentally in many genes, but their global functional impact remains unclear. Human SINEs (Alu repeats) and mouse SINEs (B1 and B2 repeats) are known to be clustered in GC-rich, gene-rich genome segments, consistent with the view that they can contribute to regulation of gene expression. We have shown earlier that Alu repeats are involved in the formation of cis-regulatory modules (clusters of TFBS) in human promoters, and other authors reported that Alu repeats located near promoter CpG islands have an increased frequency of CpG dinucleotides, suggesting that these Alu repeats are undermethylated. Human Alu and mouse B1/B2 elements have an internal bipartite promoter for RNA polymerase III containing a conserved sequence motif called the B-box, which can bind the basal transcription complex TFIIIC. It has recently been shown that TFIIIC binding to the B-box leads to formation of a boundary which limits the spread of repressive chromatin modifications in S. pombe. SINE-associated B-boxes may have a similar function, but conservation of TFIIIC binding sites in SINEs located near mammalian promoters has not been studied previously. Here we analysed the abundance and distribution of retroelements (SINEs, LINEs and LTRs) in annotated sequences of the Database of mammalian transcription start sites (DBTSS). The fractions of SINEs in human and mouse promoters are slightly lower than in the genome as a whole, but >40% of human and mouse promoters contain Alu or B1/B2 elements within the -1000 to +200 bp interval relative to the transcription start site (TSS). Most of these SINEs are associated with distal segments of promoters (-1000 to -200 bp relative to the TSS), indicating that their insertion at distances >200 bp upstream of the TSS is tolerated during evolution.
The distribution of SINEs in promoters correlates negatively with the distribution of CpG sequences. Analysis of the abundance of 12-mer motifs from the B1 and Alu consensus sequences in the genome and in DBTSS confirmed that some subsegments of Alu and B1 elements are poorly conserved, which depends in part on the presence of CpG dinucleotides. One of these CpG-containing subsegments in B1 elements overlaps with the SINE-associated B-box, and it shows better conservation in DBTSS than in genomic sequences. We also studied the conservation, in DBTSS and in the genome, of the B-box-containing segments of old (AluJ, AluS) and young (AluY) Alu repeats and found that the CpG sequence of the B-box of old Alu repeats is better conserved in DBTSS than in the genome. This indicates that B-box-associated CpGs in promoters are better protected from methylation and mutation than B-box-associated CpGs in genomic SINEs. These results are consistent with the view that potential TFIIIC binding motifs in SINEs associated with human and mouse promoters may be functionally important. These motifs may protect promoters from repressive histone modifications which spread from adjacent sequences. This could potentially explain the well-known clustering of SINEs in GC-rich, gene-rich genome compartments and the existence of unmethylated CpG islands.
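
The 12-mer abundance comparison mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the abstract does not say how motifs were matched or normalised, so exact string matching and a per-kilobase rate are assumptions made here. The function names (kmers, count_occurrences, per_kb, compare_motifs) and the short toy sequences are hypothetical and are not real Alu, B1 or DBTSS data.

```python
from typing import Dict, List


def kmers(consensus: str, k: int = 12) -> List[str]:
    """Return every overlapping k-mer of a consensus sequence, in order."""
    consensus = consensus.upper()
    return [consensus[i:i + k] for i in range(len(consensus) - k + 1)]


def count_occurrences(motif: str, sequences: List[str]) -> int:
    """Count exact matches of motif in all sequences, overlaps included."""
    total = 0
    for seq in sequences:
        seq = seq.upper()
        start = seq.find(motif)
        while start != -1:
            total += 1
            start = seq.find(motif, start + 1)
    return total


def per_kb(count: int, sequences: List[str]) -> float:
    """Normalise a raw count to occurrences per 1000 bp of sequence."""
    total_len = sum(len(s) for s in sequences)
    return 1000.0 * count / total_len if total_len else 0.0


def compare_motifs(consensus: str, promoters: List[str],
                   background: List[str], k: int = 12) -> List[Dict]:
    """For each k-mer, report its rate in promoters and in background,
    and whether it contains a CpG dinucleotide."""
    rows = []
    for pos, motif in enumerate(kmers(consensus, k)):
        rows.append({
            "pos": pos,
            "motif": motif,
            "has_cpg": "CG" in motif,
            "promoter_per_kb": per_kb(
                count_occurrences(motif, promoters), promoters),
            "background_per_kb": per_kb(
                count_occurrences(motif, background), background),
        })
    return rows


if __name__ == "__main__":
    # Toy inputs (assumption): a made-up 16-nt "consensus" and two tiny
    # sequence sets standing in for promoter and genomic sequences.
    toy_consensus = "ACGTTGCAAGGCTTAC"
    toy_promoters = ["TTACGTTGCAAGGCTTACGG", "GGACGTTGCAAGGCAA"]
    toy_background = ["TTATGTTGCAAGGCTTACGG", "CCCCCCCCCCCCCCCC"]
    for row in compare_motifs(toy_consensus, toy_promoters, toy_background):
        print(row)
```

In such a comparison, a CpG-containing 12-mer whose rate is higher in the promoter set than in the background set would correspond to the "better conserved in DBTSS" pattern described above. A real analysis would also need to handle both strands and sequence divergence, which this sketch does not.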

Keywords: Retroelement, promoter, CpG island, DNA methylation.

Digital Object Identifier (DOI): doi.org/10.5281/zenodo.1085559

