RetrogeneDB ID: | retro_mmus_238 | ||
Retrocopylocation | Organism: | Mouse (Mus musculus) | |
Coordinates: | X:94636256..94638155(+) | ||
Located in intron of: | None | ||
Retrocopyinformation | Ensembl ID: | ENSMUSG00000071723 | |
Aliases: | None | ||
Status: | KNOWN_PROTEIN_CODING | ||
Parental geneinformation | Parental gene summary: | ||
Parental gene symbol: | Gspt1 | ||
Ensembl ID: | ENSMUSG00000062203 | ||
Aliases: | None | ||
Description: | G1 to S phase transition 1 [Source:MGI Symbol;Acc:MGI:1316728] |
Percent Identity: | 87.66 % |
Parental protein coverage: | 84.12 % |
Number of stop codons detected: | 0 |
Number of frameshifts detected | 0 |
Parental | GAAGGDHGAGSGAGGPSEPVESSQDQSCEGSNSTVSMELSEPVVENGETEMSPEESWEHKEEISEAEPGG |
G.AG...G.....G.P.EP.......S.EGS.S.V.MELSEPVVENGE.EM..EESWE.KE..SEA.P.. | |
Retrocopy | GGAGEPEGKRMEWGAPVEPSKDGPLVSWEGSSSVVTMELSEPVVENGEVEMALEESWELKE-VSEAKPEA |
Parental | GSSGDGRPPEESTQEMMEEEEEIPKPKSAVAPPGAPKKEHVNVVFIGHVDAGKSTIGGQIMYLTGMVDKR |
.S.GD..PPEES..E.MEE.EE..K.KS...P.GAPKKEHVNVVFIGHVDAGKSTIGGQIM.LTGMVD.R | |
Retrocopy | -SLGDAGPPEESVKEVMEEKEEVRKSKSVSIPSGAPKKEHVNVVFIGHVDAGKSTIGGQIMFLTGMVDRR |
Parental | TLEKYEREAKEKNRETWYLSWALDTNQEERDKGKTVEVGRAYFETEKKHFTILDAPGHKSFVPNMIGGAS |
TLEKYEREAKEKNRETWYLSWALDTNQEERDKGKTVEVGRAYFETEKKHFTILDAPGHKSFVPNMIGGAS | |
Retrocopy | TLEKYEREAKEKNRETWYLSWALDTNQEERDKGKTVEVGRAYFETEKKHFTILDAPGHKSFVPNMIGGAS |
Parental | QADLAVLVISARKGEFETGFEKGGQTREHAMLAKTAGVKHLIVLINKMDDPTVNWSNERYEECKEKLVPF |
QADLAVLVISARKGEFETGFEKGGQTREHAMLAKTAGVK.LIVLINKMDDPTV.WS.ERYEECKEKLVPF | |
Retrocopy | QADLAVLVISARKGEFETGFEKGGQTREHAMLAKTAGVKYLIVLINKMDDPTVDWSSERYEECKEKLVPF |
Parental | LKKVGFNPKKDIHFMPCSGLTGANLKEQSDFCPWYIGLPFIPYLDNLPNFNRSVDGPIRLPIVDKYKDMG |
LKKVGF.PKKDIHFMPCSGLTGAN.KEQSDFCPWY.GLPFIPYLD.LPNFNRS.DGPIRLPIVDKYKDMG | |
Retrocopy | LKKVGFSPKKDIHFMPCSGLTGANIKEQSDFCPWYTGLPFIPYLDSLPNFNRSIDGPIRLPIVDKYKDMG |
Parental | TVVLGKLESGSICKGQQLVMMPNKHNVEVLGILSDDVETDSVAPGENLKIRLKGIEEEEILPGFILCDLN |
TVVLGKLESGSI.KGQQLVMMPNKH.VEVLGI.SDD.ETD.VAPGENLKIRLKGIEEEEILPGFILC... | |
Retrocopy | TVVLGKLESGSIFKGQQLVMMPNKHSVEVLGIVSDDAETDFVAPGENLKIRLKGIEEEEILPGFILCEPS |
Parental | NLCHSGRTFDAQIVIIEHKSIICPGYNAVLHIHTCIEEVEITALICLVDKKSGEKSKTRPRFVKQDQVCI |
NLCHSGRTFD.QIVIIEHKSIICPGYNAVLHIHTCIEEVEITALI.LVDKKSGEKSKTRPRFVKQDQVCI | |
Retrocopy | NLCHSGRTFDVQIVIIEHKSIICPGYNAVLHIHTCIEEVEITALISLVDKKSGEKSKTRPRFVKQDQVCI |
Parental | ARLRTAGTICLETFKDFPQMGRFTLRDEGKTIAIGKVLKLVPEKD |
ARLRTAGTICLETFKDFPQMGRFTLRDEGKTIAIGKVLKLVPEKD | |
Retrocopy | ARLRTAGTICLETFKDFPQMGRFTLRDEGKTIAIGKVLKLVPEKD |
* | Stop codon |
> | Forward frameshift by one nucleotide |
< | Reverse frameshift by one nucleotide |
Library | Retrocopy expression | Parental gene expression |
---|---|---|
SRP007412_brain | 20 .79 RPM | 63 .34 RPM |
SRP007412_cerebellum | 16 .02 RPM | 55 .41 RPM |
SRP007412_heart | 0 .62 RPM | 60 .61 RPM |
SRP007412_kidney | 1 .44 RPM | 92 .32 RPM |
SRP007412_liver | 0 .25 RPM | 82 .72 RPM |
SRP007412_testis | 4 .94 RPM | 83 .66 RPM |
ENCODE library ID | Target | ChIP-Seq Peak coordinates |
---|---|---|
ENCFF001XVA | POLR2A | X:94636002..94636310 |
EST ID | Start | End | Identity | Match | Mis-match | Score |
---|---|---|---|---|---|---|
AV512414 | 94636801 | 94637281 | 99.8 | 479 | 1 | 478 |
BB180433 | 94636568 | 94636829 | 95.8 | 253 | 8 | 244 |
CA752205 | 94636990 | 94637623 | 99.9 | 630 | 1 | 629 |
CB196124 | 94636112 | 94636945 | 98.5 | 828 | 1 | 819 |
CF536599 | 94636740 | 94637362 | 100 | 622 | 0 | 622 |
CO808645 | 94636436 | 94637117 | 100 | 670 | 0 | 661 |
CX240774 | 94636110 | 94636916 | 99.4 | 800 | 5 | 794 |
TSS No. | TSS Name | TSS expression level (Expr) in TPM range: | ||||
---|---|---|---|---|---|---|
no expression | 0 < Expr ≤ 1 | 1 < Expr ≤ 5 | 5 < Expr ≤ 10 | Expr > 10 | ||
TSS #1 | TSS_158566 | 595 libraries | 386 libraries | 89 libraries | 2 libraries | 0 libraries |
TSS #2 | TSS_158567 | 214 libraries | 210 libraries | 432 libraries | 112 libraries | 104 libraries |
TSS #3 | TSS_158568 | 686 libraries | 307 libraries | 76 libraries | 3 libraries | 0 libraries |