The complexity of antisense transcription revealed by the ... · The complexity of antisense...

12
The complexity of antisense transcription revealed by the study of developing male germ cells Wai-Yee Chan a,b, , Shao-Ming Wu a , Lisa Ruszczyk a , Evelyn Law a , Tin-Lap Lee a , Vanessa Baxendale a , Alan Lap-Yin Pang a , Owen M. Rennert a a Laboratory of Clinical Genomics, National Institute of Child Health and Human Development, National Institutes of Health, Building 49, Room 2A08, 49 Convent Drive, MSC 4429, Bethesda, MD 20892-4429, USA b Department of Pediatrics, Georgetown University, Washington, DC 20007, USA Received 22 October 2005; accepted 13 December 2005 Available online 3 February 2006 Abstract Computational analyses have identified the widespread occurrence of antisense transcripts in the human and the mouse genome. However, the structure and the origin of the majority of the antisense transcripts are unknown. The presence of antisense transcripts for 19 of 64 differentially expressed genes during mouse spermatogenesis was demonstrated with orientation-specific RT-PCR. These antisense transcripts were derived from a wide variety of origins, including processed sense transcripts, intronic and exonic sequences of a single gene or multiple genes, intergenic sequences, and pseudogenes. They underwent normal and alternative splicing, 5capping, and 3polyadenylation, similar to the sense transcripts. There were also antisense transcripts that were not capped and/or polyadenylated. The testicular levels of the sense transcripts were higher than those of the antisense transcripts in all cases, while the relative expression in nontesticular tissues was variable. Thus antisense transcripts have complex origins and structures and the sense and antisense transcripts can be regulated independently. Published by Elsevier Inc. Keywords: Antisense transcription; Intron; Exon; Intergenic; Pseudogene; Mouse; Spermatogonia; Spermatocytes; Spermatids Although antisense transcription has been known to occur in prokaryotes for many years, the widespread occurrence of antisense transcripts in humans and mice has only recently been documented. Computational analyses estimated 8 to 20% of human and mouse genes form senseantisense transcript pairs [14]. A more recent study of 10 human chromosomes indicated about 61% of surveyed loci have antisense transcripts [5]. The significance of the majority of antisense transcripts is currently unknown, though a number of biological functions have been proposed [6]. A role for antisense transcription in disease processes has also been suggested [710]. Despite this apparent biological impor- tance of the antisense transcripts, very little is known of the mechanisms by which they are generated. Structural information of antisense transcripts is also very limited. Furthermore, the occurrence of antisense transcription in specific biological processes has rarely been studied. A number of biological events in spermatogenesis such as genomic imprinting, translation repression, and stage-specific alternative splicing [11] are frequently associated with antisense transcripts. A systematic search for antisense transcripts in spermatogenic cells has not been reported. Recent profiling of expressed genes in mouse germ cells at different stages of development, including type A spermato- gonia, pachytene spermatocytes, and round spermatids, by serial analysis of gene expression (SAGE) [12] offered a unique opportunity to examine the occurrence of antisense transcription during this critical process in development. This study documents the occurrence of antisense transcripts in these three types of germ cells. Characterization of antisense transcripts cloned revealed the complex origins and structural features of these RNA molecules. Expression studies showed that the sense and the antisense transcripts can be regulated independently. Genomics 87 (2006) 681 692 www.elsevier.com/locate/ygeno Corresponding author. Fax: +1 301 480 4700. E-mail address: [email protected] (W.-Y. Chan). 0888-7543/$ - see front matter. Published by Elsevier Inc. doi:10.1016/j.ygeno.2005.12.006

Transcript of The complexity of antisense transcription revealed by the ... · The complexity of antisense...

Page 1: The complexity of antisense transcription revealed by the ... · The complexity of antisense transcription revealed by the study of developing male germ cells Wai-Yee Chana,b,⁎,

6) 681–692www.elsevier.com/locate/ygeno

Genomics 87 (200

The complexity of antisense transcription revealed by the study ofdeveloping male germ cells

Wai-Yee Chan a,b,⁎, Shao-Ming Wu a, Lisa Ruszczyk a, Evelyn Law a, Tin-Lap Lee a,Vanessa Baxendale a, Alan Lap-Yin Pang a, Owen M. Rennert a

a Laboratory of Clinical Genomics, National Institute of Child Health and Human Development, National Institutes of Health, Building 49,Room 2A08, 49 Convent Drive, MSC 4429, Bethesda, MD 20892-4429, USA

b Department of Pediatrics, Georgetown University, Washington, DC 20007, USA

Received 22 October 2005; accepted 13 December 2005Available online 3 February 2006

Abstract

Computational analyses have identified the widespread occurrence of antisense transcripts in the human and the mouse genome. However, thestructure and the origin of the majority of the antisense transcripts are unknown. The presence of antisense transcripts for 19 of 64 differentiallyexpressed genes during mouse spermatogenesis was demonstrated with orientation-specific RT-PCR. These antisense transcripts were derivedfrom a wide variety of origins, including processed sense transcripts, intronic and exonic sequences of a single gene or multiple genes, intergenicsequences, and pseudogenes. They underwent normal and alternative splicing, 5′ capping, and 3′ polyadenylation, similar to the sense transcripts.There were also antisense transcripts that were not capped and/or polyadenylated. The testicular levels of the sense transcripts were higher thanthose of the antisense transcripts in all cases, while the relative expression in nontesticular tissues was variable. Thus antisense transcripts havecomplex origins and structures and the sense and antisense transcripts can be regulated independently.Published by Elsevier Inc.

Keywords: Antisense transcription; Intron; Exon; Intergenic; Pseudogene; Mouse; Spermatogonia; Spermatocytes; Spermatids

Although antisense transcription has been known to occurin prokaryotes for many years, the widespread occurrence ofantisense transcripts in humans and mice has only recentlybeen documented. Computational analyses estimated 8 to20% of human and mouse genes form sense–antisensetranscript pairs [1–4]. A more recent study of 10 humanchromosomes indicated about 61% of surveyed loci haveantisense transcripts [5]. The significance of the majority ofantisense transcripts is currently unknown, though a numberof biological functions have been proposed [6]. A role forantisense transcription in disease processes has also beensuggested [7–10]. Despite this apparent biological impor-tance of the antisense transcripts, very little is known of themechanisms by which they are generated. Structuralinformation of antisense transcripts is also very limited.

⁎ Corresponding author. Fax: +1 301 480 4700.E-mail address: [email protected] (W.-Y. Chan).

0888-7543/$ - see front matter. Published by Elsevier Inc.doi:10.1016/j.ygeno.2005.12.006

Furthermore, the occurrence of antisense transcription inspecific biological processes has rarely been studied. Anumber of biological events in spermatogenesis such asgenomic imprinting, translation repression, and stage-specificalternative splicing [11] are frequently associated withantisense transcripts. A systematic search for antisensetranscripts in spermatogenic cells has not been reported.Recent profiling of expressed genes in mouse germ cells atdifferent stages of development, including type A spermato-gonia, pachytene spermatocytes, and round spermatids, byserial analysis of gene expression (SAGE) [12] offered aunique opportunity to examine the occurrence of antisensetranscription during this critical process in development. Thisstudy documents the occurrence of antisense transcripts inthese three types of germ cells. Characterization of antisensetranscripts cloned revealed the complex origins and structuralfeatures of these RNA molecules. Expression studies showedthat the sense and the antisense transcripts can be regulatedindependently.

Page 2: The complexity of antisense transcription revealed by the ... · The complexity of antisense transcription revealed by the study of developing male germ cells Wai-Yee Chana,b,⁎,

Table 1Genes with presence of antisense transcripts in testis confirmed by orientation-specific RT-PCR using total RNA as template

Symbol Gene name SAGE tag No. inSpga–Spcy–Sptd(S/AS)

Mean ΔcycleNo. ± SDtesticulartotal RNA

ApproximateS/AS

Ch10 Heat shock 10-kDaprotein 1(chaperone 10)

106–2–2/6–0–0

3.85 ± 0.35 14

Calm2 Calmodulin 2 110–304–40/6–20–0

4.07 ± 0.44 17

Ppic Peptidylprolylisomerase C

14–1–0/7– 20–11 5.02 ± 0.75 32

Pdcl2 Phosducin-like 2 0–82–91/0–5–7 6.52 ± 0.22 92Ubb Ubiquitin B 198–769–510/

7–59–217.35 ± 0.16 163

Sh3-Stam

Associatedmolecule with theSH3 domain ofSTAM

1–7–0/0–7–0 7.72 ± 1.00 211

Tsg1 Testis-specificgene 1

0–310–419/0–31–51

8.70 ± 0.15 416

Gk-rs2 Glucokinaseactivity, relatedsequence 2

0–54–41/0–6–5 9.00 ± 0.32 512

Uba52 Ubiquitin A-52residue ribosomalprotein fusionproduct 1

531–113–203/20–4–13

9.20 ± 0.15 588

Tetc3 T-complex-associated testisexpressed 3

4–471–360/6–20–0

9.88 ± 1.09 942

Prm1 Proteamine 1 0–41–177/0–2–26 9.88 ± 0.20 942Dnajb3 DnaJ (Hsp40)

homolog,subfamily B,member 3

0–63–84/0–4–12 10.50 ± 0.21 1448

Ppp1cc Proteinphosphatase 1,catalytic subunit,γ isoform

82–363–382/5–18–6

10.70 ± 0.34 1663

Prm2 Protamine 2 0–103–147/0–1–9 12.50 ± 0.12 5792Dbil5 Diazeparm

binding inhibitor-like 5

2–64–119/0–1–4 14.50 ± 0.13 23,170

Ldh3C Lactatedehydrogenase 3,C chain, spermspecific

0–282–243/0–4–2 16.10 ± 0.89 70,239

Adam5 A disintegrin andmetalloproteasedomain 5

3–54–92/0–12–4 16.81 ± 1.54 114,898

Sap17 Spermautoantigenicprotein 17

2–116–377/1–21–30

17.47 ± 0.45 181,549

Fhl4 Four and a halfLIM domain 4

0–164–272/2–19–27

19.54 ± 1.72 762,300

The germ cell SAGE tag numbers are from Ref. [12]. SAGE tag No. in Spga–Spcy–Sptd, number of SAGE tags in type A spermatogonia, pachytenespermatocytes, and round spermatids; S, sense; AS, antisense; Mean Δ cycleNo. ± SD, means of difference in QPCR cycle number between sense andantisense transcripts and standard deviation of the means with testicular totalRNA as template; Approximate S/AS, approximate ratio of expression levels ofsense/antisense transcripts calculated from QPCR cycle numbers.

682 W.-Y. Chan et al. / Genomics 87 (2006) 681–692

Results and discussion

Orientation-specific reverse transcription-polymerase chainreaction (RT-PCR)

Sixty-four genes that had been confirmed to be differentiallyexpressed in mouse germ cells at different stages of spermato-genesis by microarray [13] and/or quantitative real-time RT-PCR (QPCR) and corroborated by SAGE [12] were selected forthis investigation. The list of the genes is shown inSupplementary Table S1. Thirteen differential expressionpatterns could be distinguished (results not shown). The mostcommon pattern of expression was 0 or very low inspermatogonia and equally high in spermatocytes and sperma-tids (18 genes). The second most common pattern of expressionwas 0 or very low in spermatogonia, increased in spermato-cytes, and peaked in spermatids (17 genes). The copy number ofthe representative SAGE tag of the majority of the genes (52genes) was either 0 (33 genes) or very low in spermatogonia. Toidentify the antisense transcripts, SAGE tags matching theUniGene cluster of the genes were identified and multipleSAGE tags matching the same UniGene cluster were alignedwith all mRNAs deposited in that UniGene cluster. Orientationof the transcript represented by the SAGE tag was determined inreference to the known mRNA and confirmed by orientation-specific RT-PCR [14].

Following alignment of the matching SAGE tags withsequences in their respective UniGene clusters, 41 genes (64%)were shown to have SAGE tags matching the antisense strand(Supplementary Table S1). The presence of sense–antisenseoverlapping transcript was confirmed for 19 genes withorientation-specific RT-PCR using testicular total RNA as thetemplate (Table 1). The results of analysis of the products oforientation-specific RT-PCR using polyacrylamide gel electro-phoresis (PAGE) are shown in Supplementary Fig. S1. Amongthe rest of the 41 genes, 10 genes were considered to be notconfirmed of having antisense transcripts because a band waspresent in the no-primer control of the RT-PCR. The remaining12 genes were not analyzed by orientation-specific RT-PCRbecause the copy number of the antisense SAGE tags wasextremely low. Expression of the antisense transcript wasconfirmed by cloning and sequencing of the amplicons of theorientation-specific RT-PCR. Thus, 19 of 64 (∼30%) differen-tially expressed genes had antisense transcripts.

Examination of the nucleotide sequence of the antisenseamplicons showed that they could be divided into three maingroups based on the comparison with their sense genes. Fig. 1shows the three groups of antisense amplicons. In group 1 theantisense amplicons were 100% complementary to the sensetranscripts. This group could be divided into two subgroups. Insubgroup 1A, the amplicon was contained in a single exon. Thissubgroup included the antisense transcript of a disintegrin andmetalloprotease domain 5 (Adam5); diazepam-binding inhibi-tor-like 5 (DbiL5); DnaJ (Hsp40) homolog, subfamily B,member 3 (DnajB3); four and a half LIM domains 4 (Fhl4);glucokinase activity-related sequence 2 (Gk-rs2); phosducin-like 2 (Pdcl2); peptidylprolyl isomerase C (Ppic); γ isoform of

Page 3: The complexity of antisense transcription revealed by the ... · The complexity of antisense transcription revealed by the study of developing male germ cells Wai-Yee Chana,b,⁎,

Fig. 1. Grouping of antisense amplicons based on their structure. For groups 1A, 1B, and 2, antisense amplicons are represented by yellow and brown bars and areshown on the top. Sense gene sequences are represented by red and pink bars and are shown on the bottom. For group 3, pseudogene is represented by a striped bar. Thenoncoding strand of the sense gene is represented by a white bar. Complementarity is represented by short black lines linking the antisense amplicon with the sensesequence. Thick black lines represent introns.

683W.-Y. Chan et al. / Genomics 87 (2006) 681–692

protein phosphatase 1, catalytic subunit (Ppp1cc); protamine 2(Prm2); sperm autoantigenic protein 17 (Sap17); associatedmolecule with the SH3 domain of STAM (Sh3-Stam); andtestis-specific gene 1 (Tsg1). Subgroup 1B antisense ampliconswere 100% complementary to the sense transcripts, whichcomprised two (T-complex-associated testis expressed 3(Tcte3)) or three (sperm-specific lactate dehydrogenase 3, Cchain (Ldh3c)) spliced exons. Similar to amplicons of group 1B,the group 2 antisense amplicon consisted of more than oneexon. However, different from group 1B amplicons, only part ofthe antisense amplicon was complementary to the exons in thesense transcript. Part of the antisense amplicon was comple-mentary to the intron between the two exons of the sense gene.The antisense amplicon of protamine 1 (Prm1-ASF1 (GenBankAccession No. DQ082992)) belonged to this group. Group 3antisense amplicons were not complementary to the sensetranscripts. Instead, they were complementary to the pseudo-genes. Four antisense amplicons, namely those of calmodulin 2(Calm-2), heat shock 10-kDa protein 1 (chaperonin 10) (Ch10),ubiquitin A-52 residue ribosomal protein fusion product 1(Uba52), and ubiquitin B (Ubb), belonged to this group. Theantisense amplicons of these genes, namely, Calm2-ASF1(GenBank Accession No. DQ082990), Ch10-ASF1 (GenBankAccession No. DQ082993), Uba52-ASF1 (GenBank AccessionNo. DQ082994), and Ubb-ASF1 (GenBank Accession No.DQ082991), were 88–94% identical to their sense transcripts,but were 99% identical to their pseudogenes on chromosomes12, 2, 11, and 2, respectively (Figs. 2B to 2E).

Total complementarity of the antisense amplicons of thegenes in group 1B to the spliced exons of the sense genessuggested that they were synthesized after the sense transcriptswere processed. Similar observations had been made withantisense transcripts of cardiac troponin 1 [15], hemoglobin β[16], and rat urocortin [17]. It had been postulated that theseantisense transcripts were transcribed from the sense mRNA inthe cytoplasm by RNA-dependent RNA polymerase activity[4,15], a biological activity hypothesized but not yet demon-strated in eukaryotic cells. Sequence of the antisense ampliconof the gene in group 2 suggested that it was generated by

splicing of two exons in accordance with the AG/GT(C) rule[18], implying a similar RNA processing mechanism in bothsense and antisense transcription. A number of antisensetranscripts had been reported to arise in a similar mannerpreviously [19–25]. Thus, the noncoding strand of a gene locuscould be transcribed and processed independently to produce amature transcript. Both functional genes and pseudogenes couldgive rise to antisense transcripts as shown by the group 3antisense amplicons. Pseudogene-derived antisense transcriptsare not unique to germ cells. The antisense transcript of neuralnitric oxide synthase (nNOS) was also reported to be transcribedfrom a pseudogene [26]. Judging from the proportion ofantisense transcripts derived from pseudogenes and the numberof such genes identified in the mouse and human genome [27],pseudogenes may be a rich source of antisense transcripts.

Molecular cloning and characterization of antisensetranscripts

T-complex-associated testis expressed 3To understand further the nature of the antisense transcripts,

we cloned and characterized the antisense transcripts of onemember each of group 1B, group 2, and group 3. Cloning of theantisense transcripts of Tcte3 of group 1B yielded 18 distinctclones, which could be assigned to two groups based on the sizeof the transcript. Fig. 3A shows a cartoon of the Tcte3 antisensetranscripts, A-Tcte3-a (GenBank Accession No. DQ072383)and A-Tcte3-b (GenBank Accession No. DQ072384). Thepresence of the longer antisense transcript, A-Tcte3-b, wasconfirmed by amplifying the entire transcript using the most 5′and 3′ primers in one RT-PCR. The nucleotide sequences of A-Tcte3-a and A-Tcte3-b are shown in Fig. 3B. A-Tcte3-a wasidentical to the first 532 nucleotides of A-Tcte3-b except for anadditional 18 A's. Blasting A-Tcte3-b against the mouse genomerevealed that the first 704 nucleotides of A-Tcte3-b were thereverse complement of exon 4 (164 bp) of Tcte3 (NT_039641),including 246 and 294 bp, respectively, on the 5′ and 3′ side ofthis exon (Fig. 3B). However, the 50 nucleotides upstream ofthe poly(A) tail could not be found in the Tcte3 gene. These 50

Page 4: The complexity of antisense transcription revealed by the ... · The complexity of antisense transcription revealed by the study of developing male germ cells Wai-Yee Chana,b,⁎,

Fig. 2. Nucleotide sequence of group 2 and group 3 antisense amplicons.Primers for amplification are represented by broken line with arrow. Notation ofthe primers refers to the sense transcript. (A) Prm1-ASF1. The antisenseamplicon sequence is on the top. The complementary sense sequence is on thebottom. Uppercase letters represent exon sequence and lowercase lettersrepresent intronic sequence. (B) Calm2-ASF1. The antisense amplicon ofcalmodulin 2. (C) Ch10-ASF1, antisense amplicon of chaperonin 10. (D)Uba52-ASF1, antisense amplicon of ubiquitin A-52 ribosomal protein fusionproduct 1. (E) Ubb-ASF1, antisense amplicon of ubiquitin B.

684 W.-Y. Chan et al. / Genomics 87 (2006) 681–692

nucleotides contained a polymorphic AAAAC repeat, withseven repeats in some clones and eight repeats in the othersinterrupted by an AAAC sequence. The SAGE tag sequencesidentified in the germ cell SAGE libraries were present in theantisense transcripts. Computer-assisted translation of theantisense transcripts revealed open reading frames (ORF) of117 to 144 bp. The largest ORF encoded a putativepolypeptide of 48 amino acids with no recognizable proteinmotif, indicating that the antisense transcript might serve as anoncoding regulatory RNA [28]. Unlike the antisenseamplicon identified by orientation-specific RT-PCR, neitherof the Tcte3 antisense transcripts overlapped more than oneexon. The cloning approaches selected transcripts with a poly(A) sequence at the 3′ end (3′ rapid amplification of cDNAends (RACE)) or a cap at the 5′ end (5′ RACE). Therefore theresults suggested there exist more than two species ofantisense transcripts of Tcte3, some of which were notpolyadenylated and capped, similar to that observed in anumber of noncoding regulatory RNAs [28].

Protamin 1 and 2Three full-length antisense transcripts (A-Prm[1-2]-a

(GenBank Accession No. DQ072380), A-Prm[1-2]-b (Gen-Bank Accession No. DQ072381), and A-Prm[1-2]-c (GenBankAccession No. DQ072383)) of the sole member of group 2, i.e.,Prm1, were cloned; upon analysis, these turned out to overlapPrm1 as well as the neighboring Prm2. These transcripts werealternative splice variants containing different numbers ofexons. The presence of these transcripts was confirmed bysingle RT-PCRs using the most 3′ and 5′ primers. The structureof the antisense transcripts is shown in Fig. 4A. The nucleotidesequence of A-Prm[1-2] is shown in Fig. 4B. The shortestantisense transcript was A-Prm[1-2]-a, which had 921 bp and apoly(A) tail. It had four exons, with the first exon overlappingthe most 3′ 126 bp of Prm2 (BC049612), and exons 2 and 3were the reverse complement of Prm1 (NM_013637). A-Prm[1-2]-b was identical to A-Prm[1-2]-a with an additional exon(exon 3) of 116 bp, which overlapped part of the intergenicsequence between Prm1 and Prm2. It had a poly(A) tail. Thelongest antisense transcript was A-Prm[1-2]-c, which wasidentical to A-Prm[1-2]-b with an additional exon (exon 2) of147 bp and a poly(A) tail. The 5′ end of exon 2 overlapped the5′-most 112 bp of exon 1 of Prm2. Computer-assistedtranslation of the three antisense transcripts identifieda similar ORF of 318 bp encoding a putative polypeptidecontaining four casein kinase II phosphorylation sites and fourmyristoylation sites as shown in Fig. 4B. A number of protein-encoding antisense transcripts had been reported previously[20,23,29,30]. Whether the Prm[1-2] antisense transcriptsencode any protein awaits validation.

The Prm[1-2] antisense transcripts contained multiple exonswhose splice variants followed the AG/GT(C) rule [18]. Theseantisense exons overlapped with exonic, intronic, and intergenicsequences of the sense genes. Similar to the sense transcripts,the antisense transcripts underwent alternative splicing. Thisphenomenon had also been demonstrated in other studies[21,23,31,32]. Thus, processing of some antisense transcripts

Page 5: The complexity of antisense transcription revealed by the ... · The complexity of antisense transcription revealed by the study of developing male germ cells Wai-Yee Chana,b,⁎,

Fig. 3. Antisense transcripts of T-complex testis expressed 3. (A) Cartoon of the Tcte3 antisense transcripts. The SAGE tags identifying these antisense transcripts areshown in the boxes with the number of SAGE tags in each cell type shown as spermatogonia/spermatocytes/spermatids [12]. The area of complementary to the senseexon is flanked by the dotted lines. The 3′ poly(A) tail and the 5′ cap site of the antisense transcripts are shown. For the sense gene, the boxes represent exons while thethick line represents introns and flanking sequence of the gene. Empty boxes represent untranslated region and filled boxes represents translated sequence. The sizes ofthe exons, introns, and intergenic regions are not to scale. (B) Nucleotide sequence of A-Tcte3-b.▵ indicates the end of sequence homology between A-Tcte3-a and A-Tcte3-b. For A-Tcte3-a, there are 18 A's after that point. The polymorphic AAAAC stretch is marked with asterisks. The number of AAAAC is polymorphic, with 6 insome clones and 7 in the others. The two SAGE tag sequences identified in germ cell SAGE libraries are underlined.⇒⇐ demarcate the boundaries of the sequencethat is reverse complementary to exon 4 of Tcte3.

685W.-Y. Chan et al. / Genomics 87 (2006) 681–692

follows rules similar to those of the sense transcripts. It wasinteresting to note that the Prm[1-2] antisense transcriptsspanned two neighboring genes. Thus, they might interact withboth sense transcripts. Examination of the SAGE tags in thegerm cell libraries indicated two antisense tags that overlappedwith the second exon of Prm2 [12]. Since SAGE tags are closeto the 3′ end of transcripts, this observation suggested thepresence of antisense transcripts that overlapped primarily withthe Prm2 gene.

Ubiquitin A-52 residue ribosomal protein fusion product 1The antisense transcripts of Uba52 in group 3 were cloned.

5′ RACE yielded two groups of clones, neither of which werederived from the functional Uba52 gene on chromosome8 (Fig. 5A). They were derived from two putative pseudo-genes, one on the tip of the short arm of chromosome 4(NT_039258) and the other on the long arm of chromosome 9(NT_039472). Both pseudogenes were 97–98% homologousto Uba52 mRNA (BC014772) with no introns and a putativeancestral poly(A) stretch with 24 and 17 A's, respectively. 3′RACE using primers based on the 5′ RACE productssubsequently yielded three groups of antisense transcripts

derived from the pseudogene on chromosome 4 (A-Uba52-4a(GenBank Accession No. DQ072385), A-Uba52-4b (GenBankAccession No. DQ072386), and A-Uba52-4c (GenBankAccession No. DQ072387)) and four groups of antisensetranscripts derived from the pseudogene on chromosome 9 (A-Uba52-9a (GenBank Accession No. DQ072388), A-Uba52 -9b(GenBank Accession No. DQ072389), A-Uba52-9c (GenBankAccession No. DQ072390), and A-Uba52-9d (GenBankAccession No. DQ072391)). The presence of these transcriptswas confirmed by successful RT-PCR with the most 5′ andmost 3′ primers. Fig. 5A shows the cartoon depicting theintron–exon structure of the Uba52 antisense transcripts. Thenucleotide sequences of the antisense transcripts of Uba52 areshown in Fig. 5B. It is interesting to notice that the shortamplicon generated by orientation-specific RT-PCR wasderived from another pseudogene of Uba52 on chromosome11, indicating that only a portion of the antisense transcripts ofthis gene were cloned in this exercise.

All the antisense transcripts derived from the chromosome 4pseudogene predict an extra exon downstream of the 3′ end ofthe pseudogene identified by the putative ancestral poly(A) tail(Fig. 5A, b). A-Uba52-4a comprised two exons of 170 and 415

Page 6: The complexity of antisense transcription revealed by the ... · The complexity of antisense transcription revealed by the study of developing male germ cells Wai-Yee Chana,b,⁎,

Fig. 4. Antisense transcripts of protamine 1 and protamine 2. (A) Cartoon of the Prm[1-2] antisense transcripts. The putative exons of the antisense transcripts areshown. The exons are numbered starting from the 5′ end of the antisense transcript. The rest of the notations are the same as described in the legend to Fig. 3B. Theintergenic sequence between the Prm1gene and the Prm2 gene is ∼4.2 kb. (B) Nucleotide sequence and computer-predicted amino acid sequence of polypeptideencoded by A-Prm[1-2]-c. A-Prm[1-2]-adoes not have exons 2 and 3; A-Prm[1-2]-b does not have exon 2. Exon–intron boundaries are indicated by←→. The SAGEtag sequences identified in germ cell SAGE libraries are underlined. Amino acids encoded by the ORF are shown underneath the nucleotide sequence as single-lettercode. The stop codon is represented by +++. Amino acids constituting the potential casein kinase II phosphorylation sites in the putative encoded polypeptide areunderlined. Amino acids constituting potential myristoylation site are marked by asterisks.

686 W.-Y. Chan et al. / Genomics 87 (2006) 681–692

bp, respectively. The first exon was complementary to a putativeexon 1830 bp downstream of the pseudogene. This exon wasspliced to an exon starting at 84 bp 5′ of the poly(A) stretch of

the pseudogene. Splicing of the exons of A-Uba52-4a followedthe AG/GT(C) rule [18]. A-Uba52-4b, with 1048 bp, was thelargest antisense transcript derived from this pseudogene. It was

Page 7: The complexity of antisense transcription revealed by the ... · The complexity of antisense transcription revealed by the study of developing male germ cells Wai-Yee Chana,b,⁎,

687W.-Y. Chan et al. / Genomics 87 (2006) 681–692

similar to A-Uba52-4a except the second exon was larger, with854 bp. A-Uba52-4cwas different from the other two transcriptsin that the first exon was 321 bp and was 1799 bp downstreamof the pseudogene. This exon was spliced to the second exon atthe same position as the other two antisense transcripts.Computer-assisted translation of A-Uba52-4b revealed anORF of 393 bp with no recognizable protein motif.

The longest antisense transcript derived from the pseudogeneon chromosome 9, A-Uba52-9c, had 1920 bp with 298 bp inexon 1, 161 bp in exon 2, and 1461 bp in exon 3, including apoly(A) tail of ∼21 A's. Exon 3 overlapped with thepseudogene of Uba52. Exon 1 was predicted to be 4060 bpand exon 2 to be 1868 bp, downstream of the ancestral poly(A)stretch of the pseudogene (Fig. 5A, c). A-Uba52-9b wasidentical to A-Uba52-9c except that exon 3 was shorter withonly 725 bp and a poly(A) tail of ∼14 A's. A-Uba52-9a did nothave exon 2. A-Uba52-9d was identical to A-Uba52-9a exceptwith a LINE 1 repeat of 88 bp inserted between exons 1 and 2.3′ extension of exon 3 of both A-Uba52-9a and A-Uba52-9dwas not successful and we sequenced only 170 bp from the 5′end and stopped at the poly(T) stretch. Different from A-Uba52-4b, no ORF of appreciable size could be identified in any of thechromosome 9 antisense transcripts.

The study of the antisense transcripts of Uba52 showed thatunlike the pseudogenes, the antisense transcripts derived fromthem could be composed of multiple exons and undergoposttranscription processing. It is likely that these transcriptshave biological activities, unlike pseudogenes that arepresumed to be nonfunctional. A-Uba52-4b had an appreciableORF. Even though no known motif could be identified, thepotential for it to encode a novel protein could not beprecluded. The other transcripts with no significant ORFmight have regulator activities similar to that proposed fornoncoding RNAs [28]. The transcription of an antisense RNAfrom a pseudogene was reported previously for neural nitricoxide synthase [26]. In that case, the pseudogene-derivedantisense transcript was shown to regulate the synthesis of theneural nitric oxide synthase protein. Whether the Uba52antisense transcripts serve as regulatory RNAs is currentlyunder investigation.

Expression studies of antisense transcripts

The relative expression levels in mouse testis of sense andantisense transcripts of the aforementioned 19 genes wereexamined using QPCR with primer sets described in Supple-mental Table S2. The data indicated that the expression level intestis of the sense transcript was always significantly higher(p b 0.001) than that of the antisense transcript. The results aresummarized in Table 1. This differential expression of the senseand antisense transcripts in whole testis was similar to thatobserved in the more differentiated germ cells such asspermatocytes and spermatids as shown by the SAGE analysis,with the exception of Ppic and Sh3-Stam ([12] and Table 1). Inmost cases, the difference in expression levels between thesense and the antisense transcripts as shown by QPCR wasmuch bigger than that revealed by PAGE analysis following RT-

PCR (Supplementary Fig. S1). The discrepancy between theQPCR and the PAGE results could be the consequence ofdifferences in the procedure. Since RT-PCR was done using theone-tube procedure while QPCR involved a transfer of RTproduct from one tube to another prior to performing QPCR,any loss of RT product due to adsorption onto to the wall of theRT reaction tube will greatly affect the result of the QPCR butnot that of the RT-PCR and PAGE. This effect will be especiallysignificant for low-abundance antisense transcripts. Thus theQPCR results will not accurately reflect the relative expressionlevels of the sense and antisense transcripts, particularly for thelow-abundance transcripts. Another cause of the discrepancycould be the excessive number of PCR cycles (30 cycles)usually performed to reveal the presence of the antisensetranscripts.

We also examined the expression of sense and antisensetranscripts of nine genes in nontesticular tissues, includingwhole embryo, ovary, brain, thymus, kidney, spleen, heart,lung, and liver, by semiquantitative orientation-specific RT-PCR. The primers for the RT-PCR were the same as those forstudying testicular expression of the sense and the antisensetranscripts (Supplementary Table S2). Results of electropho-resis of the RT-PCR products are shown in Fig. 6. The relativeexpression levels of the sense and the antisense transcripts aresummarized in Table 2. Expression of the sense and theantisense transcript of Pdcl2 was testis specific. Theexpression of the antisense transcript of Tsg1 and Tcte3 wasalso testis specific. Unlike the sense transcript, the antisensetranscript of Prm1 was ubiquitously expressed. Expression ofPrm1 and Prm2 in nontesticular tissues was reportedpreviously [33–35] and confirmed by nucleotide sequencingof the TA-cloned sense and antisense amplicons (data notshown). The sense and the antisense transcripts of Pdcl2 andPrm2 were coexpressed in selected tissues while those ofCalm2 and Uba52 were coexpressed in all tissues examined.On the other hand, the sense transcripts of Ppp1cc and Ppicwere ubiquitously expressed while their antisense transcriptswere found only in certain tissues.

Although the sense transcripts were often expressed at higherlevels than antisense transcripts, when the two were coex-pressed the relative expression pattern of the sense and theantisense transcripts varied for different genes. This wascontrary to that observed when mouse embryonic tail geneswere studied [36]. Coexpression of the sense and the antisensetranscripts in the same cell facilitates the formation of RNAduplexes and may result in modulation of gene expression ormRNA stability and processing [6,37,38]. Tissue-specificexpression of the antisense transcripts implies regulation ofthe appropriate tissue- and cell-type expression of the sensegene similar to that observed for Nphs1 and Dnm3 [37,39,40]. Itmay also indicate regulation or activity of the antisensetranscript independent of the sense transcript as observed inthe case of FGF-2 [41].

Differential expression of sense and antisense transcripts indifferent cells and tissues has been reported for a number ofimprinted genes [6,21,42]. Antisense transcripts have beensuggested to play important roles in the regulation of

Page 8: The complexity of antisense transcription revealed by the ... · The complexity of antisense transcription revealed by the study of developing male germ cells Wai-Yee Chana,b,⁎,

688 W.-Y. Chan et al. / Genomics 87 (2006) 681–692

monoallelic expression in X-chromosome inactivation andgenomic imprinting [43–45]. A number of the genes in thepresent study also showed differential expression of the sense–antisense transcripts. The sense transcript of these genes,

including Prm1, Prm2, Tcte3, and Tsg1, coincidentally werepreferentially expressed in meiotic and postmeiotic cells (Tables1 and 2). It is unknown whether these antisense transcripts havea particular function in transcription regulation at these stages.

Page 9: The complexity of antisense transcription revealed by the ... · The complexity of antisense transcription revealed by the study of developing male germ cells Wai-Yee Chana,b,⁎,

689W.-Y. Chan et al. / Genomics 87 (2006) 681–692

The present study demonstrates the occurrence of antisensetranscription in germ cells. It is tempting to speculate that thisvigorous control of gene expression is necessary to ensure theaccuracy or to facilitate the functioning of the biologicalprocesses occurring during spermatogenesis such as genome-wide methylation–demethylation, genomic imprinting, mono-allelic gene expression, etc. It suggests that mouse spermato-genesis may be a good model for studying the regulation andbiological activities of antisense transcripts.

It has been argued that antisense transcripts arise from leakytranscription of the noncoding strand. Even though thispossibility cannot be completely ruled out, the fact that mostof the antisense transcripts examined are polyadenylated orprocessed in a manner similar to the sense transcripts suggeststhat they are intentional transcripts. This is supported by theresults of a recent study comparing the genomic organization ofgenes with or without antisense transcripts between human,mouse, and pufferfish [46]. Previous studies on antisensetranscription either used a computational approach to look forthe global presence of antisense transcripts or focused on asingle gene [1–4,19–26,29,30,35,37]. Few reports documentthe mechanisms by which the antisense transcripts aregenerated. In the present study, similar to a recent report [47],we used a SAGE database for the identification of antisensetranscripts. We showed that a significant percentage ofdifferentially expressed genes in spermatogenic germ cells areassociated with antisense transcripts. These transcripts comefrom a wide spectrum of sources: processed sense mRNA, thesense gene locus, pseudogene, two neighboring genes, andintergenic sequence. Characterization of the cloned antisensetranscripts also showed that antisense transcripts can beprocessed, alternatively spliced, capped, and polyadenylated.There are also antisense transcripts that do not have a 5′ capand/or 3′ poly(A) tail. Thus the origin and structure of antisensetranscripts are very complex. Antisense transcripts have beenproposed to have functions in transcriptional and posttranscrip-tional regulation [6,14,28,43,44,48,49]. The complex nature ofthe antisense transcripts suggests that they are well suited toprovide an additional layer of vigorous regulation of geneexpression.

Materials and methods

Orientation-specific RT-PCR and QPCR

Presence of antisense transcript was confirmed by orientation-specific RT-PCR. Total RNA from mouse tissues was purchased from Ambion (Austin,

Fig. 5. Antisense transcripts of ubiquitin A 52-residue ribosomal protein fusion prodgenes and the pseudogenes, the empty and hatched boxes represent exons, while the tthe same as described in the legend to Fig. 3B. (a) Functional gene on chromosome 8.the pseudogene on chromosome 4. The filled box represents the putative exon preAntisense transcripts A-Uba52-9a, -9b, -9c, and -9d of the pseudogene on chromodownstream of the pseudogene are represented by the filled boxes. The hatched btranscripts of Uba52. (a) A-Uba52-4b.▵ indicates the end of sequence of A-Uba52-4⇐ indicates the 5′ end of the pseudogene. The SAGE tag sequences identified inindicates the end of A-Uba52-9a and A-Uba52-9b. There are 14 A's following▵ in Aunderlined. Exon 2 is missing in A-Uba52-9a. A-Uba52-9dis similar to A-Uba52-9aTCCATATGCC GACTATTCAAT GCTGTTTTTA CTTTGCGGTG CAGAAATCT

TX, USA). RNA integrity and concentration were checked with theBioanalyzer 2100 (Agilent Technologies, Germantown, MD, USA). Orienta-tion-specific RT-PCR was performed as described previously by Shendure andChurch [14]. The One Step RT-PCR kit (Qiagen, Valencia, CA, USA) wasused according to the manufacturer's protocol with a total volume of 25 μl.The primers were designed based on the published mRNA sequence such thatthey would prime the amplification of a 120- to 230-bp sequence covering themost 3′ sense–antisense pair. Primers were designed using Primer Expressversion 2.0 (Applied Biosystems, Branchburg, NJ, USA) and specificity wasconfirmed by a BLASTN search against the nonredundant and EST mousesets from NCBI. The sequence and location of the primers are described inSupplemental Table S2. One microgram of total RNA was used as template inall RT reactions. In general, 30 PCR cycles were performed for the orientation-specific RT-PCR with the exception of cases in which the expression level ofthe transcript was very high or very low. All orientation-specific RT-PCRexperiments were performed with controls in which no strand-specific primerwas added to the RT reaction [14]. Five microliters of the 25 μl product oforientation-specific RT-PCR was analyzed by PAGE. Quantitative real-timeRT-PCR analyses were carried out as described previously [50]. Sense-specificfirst-strand cDNA was synthesized as described earlier using the Qiagen OneStep RT-PCR kit. Products of the RT reaction were aliquoted for QPCR.Primers for QPCR were the same as those used for orientation-specific RT-PCR as described in Supplemental Table S2. QPCR was performed intriplicate in the 7900 HTS Sequence Detection System (Applied Biosystems,Foster City, CA, USA). Gene expression data were reported after normalizingto 18S rRNA content. The mean cycle number and the standard deviationswere reported. Significance of expression level difference was analyzed usingStudent's t test.

5′ and 3′ RACE and nucleotide sequencing

To clone the antisense transcripts, anchored PCR was performed using theGeneRacer Kit (Invitrogen, Gaithersburg, MD, USA) according to themanufacturer's recommendations. 5′ RACE using the GeneRacer Kitspecifically targets only 5′-capped mRNA. Reverse transcription of the ligatedmRNA to create RACE-ready first-strand cDNAwas effected using SuperscriptIII reverse transcriptase (Invitrogen) and orientation-specific primer. The RACEproduct was amplified using nested primers and gene-specific primers before TAcloned into the pCR 4-TOPO vector (Invitrogen) according to the manufac-turer's protocol. The sequences of specific primers used for 5′ and 3′ RACE areindicated in Figs. 3B, 4B, and 5B. The sequence of the cloned cDNA wasdetermined by cycle sequencing using BigDye Primer Cycle Sequencing Kits(Applied Biosystems). DNA sequences were analyzed using DNASIS softwareversion 2.5 (MiraiBio, Alameda, CA, USA). The presence of antisensetranscripts was confirmed by repeating the RT-PCR using the most 5′ and themost 3′ primers that were located in different exons in the antisense transcript. Ifboth primers were contained in the sense transcript, orientation-specific RT-PCRwas performed.

Open reading frames in the nucleotide sequence were predicted using theNCBI ORF finder program (http://www.ncbi.nlm.nih.gov/gorf/gorf.html) andORF Finder (http://www.bioinformatics.org/sms/orf_find.html). Translatedamino acid sequences in the potential ORFs in FASTA format were input intothe SMART 4.0 database [51]. The output was ranked by e values. Domainswith scores that were less significant than the required threshold or overlappedwith some other source of annotation were rejected.

uct 1. (A) Cartoon of the cloned Uba52 antisense transcripts. For the functionalhick line represents introns and flanking sequences. The rest of the notations areThe three exons are shown. (b) Antisense transcripts A-Uba52-4a, -4b,and -4c ofdicted by the antisense transcripts located downstream of the pseudogene. (c)some 9. The two putative exons predicted by the antisense transcripts locatedox represents the LINE1 repeat insert. (B) Nucleotide sequence of antisensea. There are 21 A's after this point.← → indicates the exon–intron boundaries.germ cell SAGE libraries are underlined. (b) A-Uba52-4c; (c) A-Uba52-9c. ▵-Uba52-9b. The SAGE tag sequences identified in germ cell SAGE libraries areexcept with the insertion of ATATATTGTT GGCAAAGATT TTTCTCCCAT

T TTAAATTC between exons 1 and 2.

Page 10: The complexity of antisense transcription revealed by the ... · The complexity of antisense transcription revealed by the study of developing male germ cells Wai-Yee Chana,b,⁎,

Fig. 6. Polyacrylamide gel electrophoresis of the product of orientation-specific RT-PCR of nine genes in different tissues. M, size marker; S, orientation-specific RT-PCR using primer specific for the sense transcript; As, orientation-specific RT-PCR using primer specific for the antisense strand. Control with no strand-specificprimer in the RT reaction was done in all experiments. The representative results of triplicate experiments are shown.

690 W.-Y. Chan et al. / Genomics 87 (2006) 681–692

Page 11: The complexity of antisense transcription revealed by the ... · The complexity of antisense transcription revealed by the study of developing male germ cells Wai-Yee Chana,b,⁎,

Table 2Tissue distribution of sense and antisense transcript pairs in different mouse tissues

Gene Embryo Testis Ovary Brain Thymus Kidney Spleen Heart Lung LiverS/AS S/AS S/AS S/AS S/AS S/AS S/AS S/AS S/AS S/AS

Calm2 +++/+++ +++/+ +++/+++ +++/+ ++/+ ++/+ +++/+++ +++/+ +++/+ +++/+Pdcl2 –/– ++/+ –/– –/– –/– –/– –/– –/– –/– –/–Ppic ++++/+ ++++/++ ++++/+ ++/– ++++/– ++++/+ ++++/– ++++/+ ++++/– ++++/–Ppp1cc ++++/– +++/+ ++/– ++/– ++/– ++/+ ++/+ ++/– ++/– ++/++Prm1 –/+ +++/+++ –/+ –/++ ++/+ ++/+ ++/+ ++/+ +/+ +/+Prm2 –/– ++++/+++ –/– –/– ++/++ –/– +/++ –/– +/++ –/–Tcte3 –/– ++++/++ +/– +/– ++/– +/– +++/– –/– ++/– –/–Tsg1 +/– +++/+ +/– –/– –/– +/– +/– –/– +/– +/–Uba52 ++++/+ ++++/++++ ++++/+ ++++/+ ++++/+++ ++++/+ ++++/++++ ++++/+ ++++/++ +++/+

S, sense transcript expression; AS, antisense transcript expression; + indicates the presence of a band and the number of +'s represents the visual intensity of theband; – indicates the absence of a band.

691W.-Y. Chan et al. / Genomics 87 (2006) 681–692

Acknowledgments

This research was supported by the Intramural ResearchProgram of the NIH, National Institute of Child Health andHuman Development.

Appendix A. Supplementary data

Supplementary data associated with this article can be foundin the online version at doi:10.1016/j.ygeno.2005.12.006.

References

[1] R. Yelin, et al., Widespread occurrence of antisense transcription in thehuman genome, Nat. Biotechnol. 21 (2003) 379–386.

[2] H. Kiyosawa, et al., Antisense transcripts with FANTOM2 clone set andtheir implications for gene regulation, Genome Res. 12 (2003) 1324–1334.

[3] J. Chen, et al., Over 20% of human transcripts might form sense–antisensepairs, Nucleic Acids Res. 32 (2004) 4812–4820.

[4] O. Rosok, M. Sioud, Systematic identification of sense–antisensetranscripts in mammalian cells, Nat. Biotechnol. 22 (2004) 104–108.

[5] J. Cheng, et al., Transcriptional maps of 10 human chromosomes at 5-nucleotide resolution, Science 308 (2005) 1149–1154.

[6] G. Lavorgna, D. Dahary, B. Lehner, R. Sorek, C.M. Sanderson, G. Casari,In search of antisense, Trends Biochem. Sci. 29 (2004) 88–94.

[7] C. Tufarelli, et al., Transcription of antisense RNA leading to genesilencing and methylation as a novel cause of human genetic disease, Nat.Genet. 34 (2003) 157–165.

[8] E.M. Reis, et al., Antisense intronic non-coding RNA levels correlate tothe degree of tumor differentiation in prostate cancer, Oncogene 23 (2004)6684–6692.

[9] E.M. Reis, R. Louro, H.I. Nakaya, S. Verjovski-Almeida, As antisenseRNA gets intronic, OMICS 9 (2005) 2012.

[10] A. Seitz, et al., Sense and antisense transcripts of the apolipoprotein E genein normal and ApoE knockout mice, their expression after spinal cordinjury and corresponding human transcripts, Hum. Mol. Genet. 14 (2005)2661–2670.

[11] K.C. Kleene, Patterns, mechanisms, and functions of translation regulationin mammalian spermatogenic cells, Cytogenet. Genome Res. 103 (2003)217–224.

[12] S.M. Wu, et al., Analysis of mouse germ cell transcriptome at differentstages of spermatogenesis: biological significance, Genomics 84 (2004)971–981.

[13] N. Schultz, F.K. Hamra, D.L. Garbers, A multitude of genes expressedsolely in meiotic or postmeiotic spermatogenic cells offers a myriadof contraceptive targets, Proc. Natl. Acad. Sci. USA 100 (2003)12201–12206.

[14] J. Shendure, G.M. Church, Computational discovery of sense–antisensetranscription in the human and mouse genome, Genome Biol. 3 (2002)research0044.1-0044.14.

[15] H. Bartsch, S. Voigtsberger, G. Baumann, I. Morano, H.P. Luther,Detection of a novel sense–antisense RNA-hybrid structure by RACEexperiments on endogenous troponin 1 antisense RNA, RNA 10 (2004)1215–1224.

[16] B. Bonafoux, et al., Analysis of remnant reticulocyte mRNA reveals newgenes and antisense transcripts expressed in the human erythroid lineage,Haematologica 89 (2004) 1434–1438.

[17] M. Shi, X. Yan, D.H. Ryan, R.B.S. Harris, Identification of urocortinmRNA antisense transcripts in rat tissue, Brain Res. Bull. 53 (2000)317–324.

[18] M. Burset, I.A. Seledtsov, V.V. Solovyev, Analysis of canonical and non-canonical splice sites in mammalian genomes, Nucleic Acids Res. 28(2000) 4364–4375.

[19] J.P. Nemes, K.A. Benzow, M.D. Koob, The SCA8 transcript is an antisenseRNA to a brain-specific transcript encoding a novel actin-binding protein(KLHL1), Hum. Mol. Genet. 9 (2000) 1543–1551.

[20] M.L. Hastings, H.A. Ingle, M.A. Lazar, S.H. Munroe, Post-transcriptionalregulation of thyroid hormone receptor expression by cis-acting sequencesand a naturally occurring antisense RNA, J. Biol. Chem. 275 (2000)11507–11513.

[21] T. Li, et al., An imprinted PEG1/MEST antisense expressed predominantlyin human testis and in mature spermatozoa, J. Biol. Chem. 277 (2002)13518–13527.

[22] T.H. Vu, N.V. Chuyen, T. Li, A.R. Hoffman, Loss of imprinting of IGF2sense and antisense transcripts in Wilms tumor, Cancer Res. 63 (2003)1900–1905.

[23] R. Knee, A.W. Li, P.R. Murphy, Characterization and tissue-specificexpression of the rat basic fibroblast growth factor antisense mRNA andprotein, Proc. Natl. Acad. Sci. USA 94 (1997) 4943–4947.

[24] G.B. Robb, et al., Post-transcriptional regulation of endothelial nitric-oxidesynthase by an overlapping antisense mRNA transcript, J. Biol. Chem. 279(2004) 37982–37996.

[25] A. Hernandez, M.E. Martinez, W. Croteau, D.L. St. Germain,Complex organization and structure of sense and antisense transcriptsexpressed from the DIO3 gene imprinted locus, Genomics 83 (2004)413–424.

[26] S.A. Korneev, J.-H. Park, M. O'Shea, Neuronal expression of neuralnitric oxide synthase (nNOS) protein is suppressed by an antisense RNAtranscribed from an NOS pseudogene, J. Neurosci. 19 (1999)7711–7720.

[27] Z. Zhang, M. Gerstein, Large-scale analysis of pseudogenes in the humangenome, Curr. Opin. Genet. Dev. 14 (2004) 328–335.

[28] J.S. Mattick, RNA regulation: a new genetics? Nat. Rev. Genet. 5 (2004)316–323.

[29] F. Runkel, M. Michels, T. Franz, Fxyd2 and Lgi4 expression in the adultmouse: a case of endogenous antisense expression, Mamm. Genome 14(2003) 665–670.

Page 12: The complexity of antisense transcription revealed by the ... · The complexity of antisense transcription revealed by the study of developing male germ cells Wai-Yee Chana,b,⁎,

692 W.-Y. Chan et al. / Genomics 87 (2006) 681–692

[30] A.-N. Spiess, N. Walther, N. Muller, M. Balvers, C. Hansis, R. Ivell,SPEER—A new family of testis-specific genes from the mouse, Biol.Reprod. 68 (2003) 2044–2054.

[31] A.W. Li, P.R. Murphy, Expression of alternatively spliced FGF-2 antisenseRNA transcripts in the central nervous system: regulation of FGF-2 mRNAtranslation, Mol. Cell. Endocrinol. 162 (2000) 69–78.

[32] G. Alfano, et al., Natural antisense transcripts associated with genesinvolved in eye development, Hum. Mol. Genet. 14 (2005) 913–923.

[33] R. Slomski, M. Schloesser, H. Chlebowska, J. Reiss, W. Engel, Detectionof human spermatid-specific transcripts in peripheral blood lymphocytesof males and females, Hum. Genet. 87 (1991) 307–310.

[34] Stanford Microarray Database [http://www.genome-www5.stanford.edu/cgi-bin/source].

[35] EMBL Bioinformatic Harvester [http://www.harvester.embl.de/harvester/AAH6/AAH66338.htm].

[36] M.B. Wahl, U. Heinzmann, K. Imai, LongSAGE analysis revealed thepresence of a large number of novel antisense genes in the mouse genome,Bioinformatics 21 (2005) 1389–1392.

[37] G. Solda, et al., In vivo RNA–RNA duplexes from human alpha3 andalpha5 nicotinic receptor subunit mRNAs, Gene 345 (2005) 155–164.

[38] H.P. Luther, Role of endogenous antisense RNA in cardiac generegulation, J. Mol. Med. 83 (2005) 26–32.

[39] P. Ihalmo, et al., Molecular cloning and characterization of an endogenousantisense transcript of Nphs1, Genomics 83 (2004) 1134–1140.

[40] D.A.F. Loebel, B. Tsoi, N. Wong, P.P.L. Tam, A conserved noncodingintronic transcript at the mouse Dnm3 locus, Genomics 85 (2005)782–789.

[41] A.W. Li, C.K.L. Too, R. Knee, M. Wilkinson, P.R. Murphy, FGF-2

antisense RNA encodes a nuclear protein with MutT-like antitumoractivity, Mol. Cell. Endocrinol. 133 (1997) 177–182.

[42] K. Yamasaki, K. Joh, T. Ohta, H. Masuzaki, T. Ishimaru, Neurons but notglial cells show reciprocal imprinting of sense and antisense transcripts ofUbe3a, Hum. Mol. Genet. 12 (2003) 837–847.

[43] H. Kiyosawa, K. Abe, Speculations on the role of natural antisensetranscripts in mammalian X chromosome evolution, Cytogenet. GenomeRes. 99 (2002) 151–156.

[44] Y. Ogawa, J.T. Lee, Antisense regulation in X inactivation and autosomalimprinting, Cytogenet. Genome Res. 99 (2002) 59–65.

[45] C.J. Brown, J.C. Chow, Beyond sense: the role of antisense RNA incontrolling Xist expression, Semin. Cell Dev. Biol. 14 (2003) 341–347.

[46] D. Dahary, O. Elroy-Stein, R. Sorek, Naturally occurring antisense:transcriptional leakage or real overlap? Genome Res. 15 (2005) 364–368.

[47] J. Chen, M. Sun, L.D. Hurst, G.G. Carmichael, J.D. Rowley, Humanantisense genes have unusually short introns: evidence for selection forrapid transcription, Trends Genet. 21 (2005) 203–207.

[48] C. Blin-Wakkach, et al., Endogenous Msx1 antisense transcript: in vivoand in vitro evidences, structure, and potential involvement in skeletondevelopment in mammals, Proc. Natl. Acad. Sci. USA 98 (2001)7336–7341.

[49] T. Imamura, S. Yamamoto, J. Ohgane, N. Hattori, S. Tanaka, K. Shiota,Non-coding RNA directed DNA demethylation of Sphk1 CpG island,Biochem. Biophys. Res. Commun. 322 (2004) 593–600.

[50] A.L.Y. Pang, et al., Identification of differentially expressed genes inspermatogenesis in the mouse, J. Androl. 24 (2003) 899–911.

[51] I. Letunic, et al., SMART 4.0: towards genomic data integration, NucleicAcids Res. 32(database issue) (2004) D142–D144.