RetrogeneDB ID:

retro_hsap_4038

Retrocopy
location
Organism:Human (Homo sapiens)
Coordinates:8:33579820..33580596(-)
Located in intron of:ENSG00000253642
Retrocopy
information
Ensembl ID:ENSG00000253569
Aliases:None
Status:KNOWN_PSEUDOGENE
Parental gene
information
Parental gene summary:
Parental gene symbol:VENTX
Ensembl ID:ENSG00000151650
Aliases:VENTX, HPX42B, NA88A, VENTX2
Description:VENT homeobox [Source:HGNC Symbol;Acc:13639]


Retrocopy-Parental alignment summary:






>retro_hsap_4038
ATGCGTCCGTCCTCCTCCCCACCTCGTGGCCGGCAGCAGCCCTCCAGCTTTGGCTCCGTGGACTGGCTCTCCCAGAACAG
CTGCTCAGGGCCGACCCACACCCCCAGGCCTGCCGACGTCTCCCCGGGGAGCCACCCTGGCCCGGGCCAGACATCCGGCA
CCCGGGAGCCCCCTCAGGCCGTCAGCATCAAGAAGGCTCGGTCCTCAAATCTGCCTGCGCCGGAGAGGGCCGTGGCTGGG
TTGGGTAAGAAGCCAAACCCCTGGCAGGCCCCCTGTGTAGGCACAGCCTTCACCACCGAGCAGGTCCGCGCCTTGGAGGG
CGTCTTCCAGCACCACCAGGACCTGGGCCCTCTGGAACGGAAAAGGCTGGCAGGGAGATGCAGCTCTCGGACGTCTAGAC
AAAAACCTGGTTTCAAAAACGCCAGATGAAACAGAAACGACAAACGCAGGACTCCCAGCTGAACAGCCCCTTCTCTGTGT
CTCTCCACGCGCCCCCTGCTTTCCACTCACCGTCTTCTGGCCTTGTCAATACCTGCAGCTGCCGTGCCCTTGGGCACCCC
TGCCAGGGCCCCAGGCTCTGATGCTGCCCCCTGGCTCTCTCTGGGGTCTCTGCCAAGTGGAACAAGAGGCCCTGGCCTCT
GCGGGGGTTTCCTGCTATGGGCAGCCTCTGGCGTACCACCGGCCAAGCCCAGGAAGTGGCGTGCGTACGCTGGGACCAGC
TCTGTCCACGGGGCCCTGGGGCCTGAGCGCCCTGCAGGAGACGGGGGGATGCATTT

ORF - retro_hsap_4038 Open Reading Frame is not conserved.
Retrocopy - Parental Gene Alignment summary:
Percent Identity: 77.19 %
Parental protein coverage: 100. %
Number of stop codons detected: 1
Number of frameshifts detected 3


Retrocopy - Parental Gene Alignment:

ParentalMRLSSSPPRGPQQLSSFGSVDWLSQSSCSGPTHTPRPADFSLGSLPGPGQTSGAREPPQAVSIKEAAGSS
MR.SSSPPRG.QQ.SSFGSVDWLSQ.SCSGPTHTPRPAD.S.GS.PGPGQTSG.REPPQAVSIK.A..SS
RetrocopyMRPSSSPPRGRQQPSSFGSVDWLSQNSCSGPTHTPRPADVSPGSHPGPGQTSGTREPPQAVSIKKAR-SS
ParentalNLPAPERTMAGLSKEPNTLRAPRVRTAFTMEQVRTLEGVFQHHQYLSPLERKRLA-REMQLSEVQIKTWF
NLPAPER..AGL.K.PN...AP.V.TAFT.EQVR.LEGVFQHHQ.L.PLERKRLA.REMQLS.V..KTWF
RetrocopyNLPAPERAVAGLGKKPNPWQAPCVGTAFTTEQVRALEGVFQHHQDLGPLERKRLA<REMQLSDV*TKTWF
ParentalQNRRMKHKRQMQDPQLHSPFSGSLHAPPAFYSTSSGLANG-LQLLCPWAPLSGPQALMLPPGSFWGLCQV
Q.R.MK.KRQ.QD.QL.SPFS.SLHAPPAF.S.SSGL.N..LQL.CPWAPL.GPQALMLPPGS.WGLCQV
RetrocopyQKRQMKQKRQTQDSQLNSPFSVSLHAPPAFHSPSSGLVNT<LQLPCPWAPLPGPQALMLPPGSLWGLCQV
ParentalAQEALASAGASCCGQPLASHPPTPGR--PSLGPALSTGPRGLCAMPQT-GDAF
.QEALASAG.SC.GQPLA.H.P.PG.....LGPALSTGP.GL.A...T.GDAF
RetrocopyEQEALASAGVSCYGQPLAYHRPSPGSGVRTLGPALSTGPWGLSALQET>GDAF

Legend:
*Stop codon
>Forward frameshift by one nucleotide
<Reverse frameshift by one nucleotide






(Hint: click retrocopy or parental gene accession number on the plot's legend, to show / hide expression level values)

Expression validation based on RNA-Seq data:
Library Retrocopy expression Parental gene expression
bodymap2_adipose 0 .00 RPM 0 .44 RPM
bodymap2_adrenal 0 .00 RPM 0 .53 RPM
bodymap2_brain 0 .07 RPM 0 .28 RPM
bodymap2_breast 0 .00 RPM 0 .53 RPM
bodymap2_colon 0 .00 RPM 0 .10 RPM
bodymap2_heart 0 .00 RPM 0 .59 RPM
bodymap2_kidney 0 .00 RPM 0 .45 RPM
bodymap2_liver 0 .00 RPM 0 .15 RPM
bodymap2_lung 0 .00 RPM 1 .23 RPM
bodymap2_lymph_node 0 .00 RPM 0 .93 RPM
bodymap2_ovary 0 .00 RPM 2 .83 RPM
bodymap2_prostate 0 .00 RPM 0 .40 RPM
bodymap2_skeletal_muscle 0 .00 RPM 0 .09 RPM
bodymap2_testis 0 .06 RPM 0 .60 RPM
bodymap2_thyroid 0 .00 RPM 0 .30 RPM
bodymap2_white_blood_cells 0 .00 RPM 8 .94 RPM
RNA Polymerase II actvity near the 5' end of retro_hsap_4038 was not detected
No EST(s) were mapped for retro_hsap_4038 retrocopy.
No TSS is located nearby retro_hsap_4038 retrocopy 5' end.
retro_hsap_4038 was not experimentally validated.

Retrocopy orthology:
Retrocopy retro_hsap_4038 has 2 orthologous retrocopies within eutheria group .

Species RetrogeneDB ID
Pan troglodytes retro_ptro_2736
Gorilla gorilla retro_ggor_2708

Parental genes homology:
Parental genes homology involve 5 parental genes, and 18 retrocopies.

Species Parental gene accession Retrocopies number
Homo sapiens ENSG00000151650 4 retrocopies
Gorilla gorilla ENSGGOG000000033403 retrocopies
Macaca mulatta ENSMMUG000000005432 retrocopies
Nomascus leucogenys ENSNLEG000000020475 retrocopies
Pan troglodytes ENSPTRG000000030874 retrocopies

Expression level across human populations :
image/svg+xml GBR_HG00142 GBR_HG00099 GBR_HG00114 GBR_HG00143 GBR_HG00131 GBR_HG00137 GBR_HG00133 GBR_HG00119 GBR_HG00111 GBR_HG00134 FIN_HG00378 FIN_HG00338 FIN_HG00349 FIN_HG00375 FIN_HG00315 FIN_HG00277 FIN_HG00328 FIN_HG00321 FIN_HG00377 FIN_HG00183 TSI_NA20756 TSI_NA20538 TSI_NA20798 TSI_NA20532 TSI_NA20765 TSI_NA20518 TSI_NA20513 TSI_NA20512 TSI_NA20771 TSI_NA20786 YRI_NA19114 YRI_NA19099 YRI_NA18870 YRI_NA18907 YRI_NA19223 YRI_NA19214 YRI_NA18916 YRI_NA19093 YRI_NA19118 YRI_NA19213 Toscaniin Italia: Finnish inFinland: British in England and Scotland: Utah Residents (CEPH) with Northernand Western European Ancestry: Yoruba in Ibadan, Nigeria: CEU_NA12760 CEU_NA12827 CEU_NA12872 CEU_NA12751 CEU_NA12873 CEU_NA12400 CEU_NA11930 CEU_NA12004 CEU_NA11831 CEU_NA11843 No expression ( = 0 RPM ) Legend:


Library Retrogene expression
CEU_NA11831 0 .00 RPM
CEU_NA11843 0 .00 RPM
CEU_NA11930 0 .00 RPM
CEU_NA12004 0 .00 RPM
CEU_NA12400 0 .00 RPM
CEU_NA12751 0 .00 RPM
CEU_NA12760 0 .00 RPM
CEU_NA12827 0 .00 RPM
CEU_NA12872 0 .00 RPM
CEU_NA12873 0 .00 RPM
FIN_HG00183 0 .00 RPM
FIN_HG00277 0 .00 RPM
FIN_HG00315 0 .00 RPM
FIN_HG00321 0 .00 RPM
FIN_HG00328 0 .00 RPM
FIN_HG00338 0 .00 RPM
FIN_HG00349 0 .00 RPM
FIN_HG00375 0 .00 RPM
FIN_HG00377 0 .00 RPM
FIN_HG00378 0 .00 RPM
GBR_HG00099 0 .00 RPM
GBR_HG00111 0 .00 RPM
GBR_HG00114 0 .00 RPM
GBR_HG00119 0 .00 RPM
GBR_HG00131 0 .00 RPM
GBR_HG00133 0 .00 RPM
GBR_HG00134 0 .00 RPM
GBR_HG00137 0 .00 RPM
GBR_HG00142 0 .00 RPM
GBR_HG00143 0 .00 RPM
TSI_NA20512 0 .00 RPM
TSI_NA20513 0 .00 RPM
TSI_NA20518 0 .00 RPM
TSI_NA20532 0 .00 RPM
TSI_NA20538 0 .00 RPM
TSI_NA20756 0 .00 RPM
TSI_NA20765 0 .00 RPM
TSI_NA20771 0 .00 RPM
TSI_NA20786 0 .00 RPM
TSI_NA20798 0 .00 RPM
YRI_NA18870 0 .00 RPM
YRI_NA18907 0 .00 RPM
YRI_NA18916 0 .00 RPM
YRI_NA19093 0 .00 RPM
YRI_NA19099 0 .00 RPM
YRI_NA19114 0 .00 RPM
YRI_NA19118 0 .00 RPM
YRI_NA19213 0 .00 RPM
YRI_NA19214 0 .00 RPM
YRI_NA19223 0 .00 RPM


Indel association:

No indels were associated with its genomic coordinates. Based on Kabza et al. 2015 (PubMed).




Copyright © RetrogeneDB 2014-2017