RetrogeneDB ID:

retro_hsap_3069

Retrocopy
location
Organism:Human (Homo sapiens)
Coordinates:4:103650004..103651376(-)
Located in intron of:ENSG00000109323
Retrocopy
information
Ensembl ID:ENSG00000248971
Aliases:None
Status:KNOWN_PSEUDOGENE
Parental gene
information
Parental gene summary:
Parental gene symbol:KRT8
Ensembl ID:ENSG00000170421
Aliases:KRT8, CARD2, CK-8, CK8, CYK8, K2C8, K8, KO
Description:keratin 8 [Source:HGNC Symbol;Acc:6446]


Retrocopy-Parental alignment summary:






>retro_hsap_3069
GGGATCTCTATCTGGTTCGGCCTTCCTGCCCCCACTCCTGCCTCCACCATGTCCATCAGGGTGACCCAGAAGTCCTACAA
GGTGTCCACCTCTGGGCCCCGGGCCTTCAGCAGCCACTCCTACACAAGTGGGCCCAGTGTCCAGATAAGCTCCTCAGGCT
TCTCCCTAGTGGGCAGCAGCAGCTTCCAAGGTTGTCTGGGCGGAGGCTATGGTGAGGGCAGCAGCGTGGGTGGCATCACC
ACCATCACAGTCAACCAGAGCCTGCTGAGCCCCCTTAACCTGGAGGTGGACCCCAACATCCAGGCTGTGCGCACCCAGGA
GAAGCAGCAGATCAAGACCCTCAACAACAAGTTTGCCTCCTTCATAGACAAGGTATGGTTCCTGGAGCAGTAGAACAAGA
TGCTGGAGACTAAGTGGAGCCTCCTGCAGCAGCAGAAGATGGCTCAGAGCAACATGGACAACATGTTCCAGAGCTACATC
AACAACCTTAGGCAGCAGCTGAAGACTCTGGGCCAGGAGAAGCTGAAGCTGGAGGCGGAGCTTGGCAACATGCCCGGGCT
GGTGGAGGACTTCAAGAACAAGTATGAGGATGAGATCAATAAGTGTACAGAGGTGGAGAATGAATTTGTCCTCATCAAGA
GGATGTGGATGAAGGTTACATGAACAAGGTAGAGCTGGAGTCTCACCTGGAAGGGCTGACTGACGAGATCAATTTCCTCA
AGCAGCTGTATGAAGAGGAGATCCAGGAGCTGCAGTCCCAGATCTCGGACACATCTGTGGTGCTGCCCATGGACAACAGC
CGCTCCCTGGGCATGAACAGCATCATCGCTGAGGTCAAGGCACCATACGAGGAGATCGCCAACTGCAGCCAGGCCAAGGC
TGAGAGCATGTACCAGATCAAGGATGAGGAGCTGCAGATGCTGGCTGGGAAGCACGAGGATGACCTGCAGTGTACAAAGA
CTGAGATTTCTGAGATGAACCGGAACATCAGCCAGCTCCAGGCTGAGACTGAGGGCCTCAAAGGCCAGAGGGCTTCCCTG
GAGGGCACCATCACAGATGTCTGCAGCACAGGGGGCTGGCTGTTAAGGATGCCAACGCCAAGCTGTCCAAGCTGGAGGCC
GCCCTGCAGCAGGCCAAGCAGGACATGGCATGGCAACTGCTTGAGTACCAGGAACTGATGAACATCAAGCTGGCCCTGGA
CATCGAGATAGCCACCTACAGGAAGCTGCTGGAGGGCGAGGAGAGCCAGCTGGACTCTGGGATGCAGAACATGAGTATCC
ATACAAAGACCACCAGTGGTTATGCAGGTGATCTGAGCTCGGCCTGTGGGGGCCTCCCAAGCCCCAGCTTTGGCTCTGGT
GTGGGCTCCAGC

ORF - retro_hsap_3069 Open Reading Frame is not conserved.
Retrocopy - Parental Gene Alignment summary:
Percent Identity: 83.26 %
Parental protein coverage: 89.63 %
Number of stop codons detected: 1
Number of frameshifts detected 2


Retrocopy - Parental Gene Alignment:

ParentalGISAWFGPPASTPASTMSIRVTQKSYKVSTSGPRAFSSRSYTSGPGSRISSSSFSRVGSSNFRGGLGGGY
GIS.WFG.PA.TPASTMSIRVTQKSYKVSTSGPRAFSS.SYTSGP...ISSS.FS.VGSS.F.G.LGGGY
RetrocopyGISIWFGLPAPTPASTMSIRVTQKSYKVSTSGPRAFSSHSYTSGPSVQISSSGFSLVGSSSFQGCLGGGY
ParentalGGASGMGGITAVTVNQSLLSPLVLEVDPNIQAVRTQEKEQIKTLNNKFASFIDKVRFLEQQNKMLETKWS
G..S..GGIT..TVNQSLLSPL.LEVDPNIQAVRTQEK.QIKTLNNKFASFIDKV.FLEQ.NKMLETKWS
RetrocopyGEGSSVGGITTITVNQSLLSPLNLEVDPNIQAVRTQEKQQIKTLNNKFASFIDKVWFLEQ*NKMLETKWS
ParentalLLQQQKTARSNMDNMFESYINNLRRQLETLGQEKLKLEAELGNMQGLVEDFKNKYEDEINKRTEMENEFV
LLQQQK.A.SNMDNMF.SYINNLR.QL.TLGQEKLKLEAELGNM.GLVEDFKNKYEDEINK.TE.ENEFV
RetrocopyLLQQQKMAQSNMDNMFQSYINNLRQQLKTLGQEKLKLEAELGNMPGLVEDFKNKYEDEINKCTEVENEFV
ParentalLIKK-DVDEAYMNKVELESRLEGLTDEINFLRQLYEEEIRELQSQISDTSVVLSMDNSRSLDMDSIIAEV
LIK..DVDE.YMNKVELES.LEGLTDEINFL.QLYEEEI.ELQSQISDTSVVL.MDNSRSL.M.SIIAEV
RetrocopyLIKR<DVDEGYMNKVELESHLEGLTDEINFLKQLYEEEIQELQSQISDTSVVLPMDNSRSLGMNSIIAEV
ParentalKAQYEDIANRSRAEAESMYQIKYEELQSLAGKHGDDLRRTKTEISEMNRNISRLQAEIEGLKGQRASLEA
KA.YE.IAN.S.A.AESMYQIK.EELQ.LAGKH.DDL..TKTEISEMNRNIS.LQAE.EGLKGQRASLE.
RetrocopyKAPYEEIANCSQAKAESMYQIKDEELQMLAGKHEDDLQCTKTEISEMNRNISQLQAETEGLKGQRASLEG
ParentalAIADAEQRG-ELAIKDANAKLSELEAALQRAKQDMARQLREYQELMNVKLALDIEIATYRKLLEGEESRL
.I.D....G..LA.KDANAKLS.LEAALQ.AKQDMA.QL.EYQELMN.KLALDIEIATYRKLLEGEES.L
RetrocopyTITDVCSTG<GLAVKDANAKLSKLEAALQQAKQDMAWQLLEYQELMNIKLALDIEIATYRKLLEGEESQL
ParentalESGMQNMSIHTKTTSGYAGGLSSAYGGLTSPGLSYSLGSS
.SGMQNMSIHTKTTSGYAG.LSSA.GGL.SP......GSS
RetrocopyDSGMQNMSIHTKTTSGYAGDLSSACGGLPSPSFGSGVGSS

Legend:
*Stop codon
>Forward frameshift by one nucleotide
<Reverse frameshift by one nucleotide






(Hint: click retrocopy or parental gene accession number on the plot's legend, to show / hide expression level values)

Expression validation based on RNA-Seq data:
Library Retrocopy expression Parental gene expression
bodymap2_adipose 0 .00 RPM 6 .86 RPM
bodymap2_adrenal 0 .33 RPM 4 .42 RPM
bodymap2_brain 0 .05 RPM 0 .56 RPM
bodymap2_breast 0 .04 RPM 56 .07 RPM
bodymap2_colon 0 .00 RPM 132 .45 RPM
bodymap2_heart 0 .07 RPM 32 .09 RPM
bodymap2_kidney 0 .08 RPM 197 .95 RPM
bodymap2_liver 0 .00 RPM 168 .59 RPM
bodymap2_lung 0 .02 RPM 176 .67 RPM
bodymap2_lymph_node 0 .11 RPM 8 .63 RPM
bodymap2_ovary 0 .21 RPM 45 .99 RPM
bodymap2_prostate 0 .17 RPM 95 .50 RPM
bodymap2_skeletal_muscle 0 .00 RPM 0 .11 RPM
bodymap2_testis 0 .00 RPM 29 .77 RPM
bodymap2_thyroid 0 .08 RPM 375 .21 RPM
bodymap2_white_blood_cells 0 .57 RPM 0 .00 RPM
RNA Polymerase II actvity near the 5' end of retro_hsap_3069 was not detected
1 EST(s) were mapped to retro_hsap_3069 retrocopy
EST ID Start End Identity Match Mis-match Score
BX476129 103650114 103650765 99.6 646 3 643
No TSS is located nearby retro_hsap_3069 retrocopy 5' end.
retro_hsap_3069 was not experimentally validated.

Retrocopy orthology:
Retrocopy retro_hsap_3069 has 1 orthologous retrocopies within eutheria group .

Species RetrogeneDB ID
Pongo abelii retro_pabe_2544

Parental genes homology:
Parental genes homology involve 16 parental genes, and 151 retrocopies.

Species Parental gene accession Retrocopies number
Canis familiaris ENSCAFG000000071992 retrocopies
Callithrix jacchus ENSCJAG000000181473 retrocopies
Cavia porcellus ENSCPOG000000026155 retrocopies
Dasypus novemcinctus ENSDNOG000000074003 retrocopies
Felis catus ENSFCAG000000002982 retrocopies
Homo sapiens ENSG00000170421 28 retrocopies
Gorilla gorilla ENSGGOG0000001467318 retrocopies
Macaca mulatta ENSMMUG0000000216322 retrocopies
Mustela putorius furoENSMPUG000000064417 retrocopies
Mus musculus ENSMUSG000000493821 retrocopy
Nomascus leucogenys ENSNLEG000000177338 retrocopies
Oryctolagus cuniculus ENSOCUG000000069451 retrocopy
Otolemur garnettii ENSOGAG0000001174016 retrocopies
Pongo abelii ENSPPYG0000000456231 retrocopies
Pteropus vampyrus ENSPVAG000000093603 retrocopies
Rattus norvegicus ENSRNOG000000097791 retrocopy

Expression level across human populations :
image/svg+xml GBR_HG00142 GBR_HG00099 GBR_HG00114 GBR_HG00143 GBR_HG00131 GBR_HG00137 GBR_HG00133 GBR_HG00119 GBR_HG00111 GBR_HG00134 FIN_HG00378 FIN_HG00338 FIN_HG00349 FIN_HG00375 FIN_HG00315 FIN_HG00277 FIN_HG00328 FIN_HG00321 FIN_HG00377 FIN_HG00183 TSI_NA20756 TSI_NA20538 TSI_NA20798 TSI_NA20532 TSI_NA20765 TSI_NA20518 TSI_NA20513 TSI_NA20512 TSI_NA20771 TSI_NA20786 YRI_NA19114 YRI_NA19099 YRI_NA18870 YRI_NA18907 YRI_NA19223 YRI_NA19214 YRI_NA18916 YRI_NA19093 YRI_NA19118 YRI_NA19213 Toscaniin Italia: Finnish inFinland: British in England and Scotland: Utah Residents (CEPH) with Northernand Western European Ancestry: Yoruba in Ibadan, Nigeria: CEU_NA12760 CEU_NA12827 CEU_NA12872 CEU_NA12751 CEU_NA12873 CEU_NA12400 CEU_NA11930 CEU_NA12004 CEU_NA11831 CEU_NA11843 No expression ( = 0 RPM ) > 0 RPM = 1.66 RPM Legend:


Library Retrogene expression
CEU_NA11831 0 .71 RPM
CEU_NA11843 0 .14 RPM
CEU_NA11930 0 .68 RPM
CEU_NA12004 0 .23 RPM
CEU_NA12400 0 .39 RPM
CEU_NA12751 0 .70 RPM
CEU_NA12760 0 .63 RPM
CEU_NA12827 0 .47 RPM
CEU_NA12872 0 .82 RPM
CEU_NA12873 0 .80 RPM
FIN_HG00183 0 .74 RPM
FIN_HG00277 0 .85 RPM
FIN_HG00315 0 .30 RPM
FIN_HG00321 1 .66 RPM
FIN_HG00328 0 .59 RPM
FIN_HG00338 0 .51 RPM
FIN_HG00349 0 .26 RPM
FIN_HG00375 0 .64 RPM
FIN_HG00377 0 .15 RPM
FIN_HG00378 0 .76 RPM
GBR_HG00099 0 .81 RPM
GBR_HG00111 0 .26 RPM
GBR_HG00114 0 .66 RPM
GBR_HG00119 0 .98 RPM
GBR_HG00131 0 .66 RPM
GBR_HG00133 0 .84 RPM
GBR_HG00134 1 .38 RPM
GBR_HG00137 0 .38 RPM
GBR_HG00142 0 .56 RPM
GBR_HG00143 0 .38 RPM
TSI_NA20512 0 .11 RPM
TSI_NA20513 1 .55 RPM
TSI_NA20518 0 .80 RPM
TSI_NA20532 0 .99 RPM
TSI_NA20538 1 .01 RPM
TSI_NA20756 0 .47 RPM
TSI_NA20765 1 .04 RPM
TSI_NA20771 0 .88 RPM
TSI_NA20786 0 .73 RPM
TSI_NA20798 0 .37 RPM
YRI_NA18870 0 .54 RPM
YRI_NA18907 0 .49 RPM
YRI_NA18916 0 .63 RPM
YRI_NA19093 0 .68 RPM
YRI_NA19099 0 .69 RPM
YRI_NA19114 1 .02 RPM
YRI_NA19118 0 .64 RPM
YRI_NA19213 0 .93 RPM
YRI_NA19214 0 .89 RPM
YRI_NA19223 0 .82 RPM


Indel association:

No indels were associated with its genomic coordinates. Based on Kabza et al. 2015 (PubMed).




Copyright © RetrogeneDB 2014-2017