XB-FEAT-5768883: Difference between revisions

From XenWiki
Jump to navigation Jump to search
 
(8 intermediate revisions by the same user not shown)
Line 3: Line 3:


=orthology analysis=
=orthology analysis=
DIOPT/EggNog analysis finds no matches/homology groups for the. Xtrop protein XP_002933698.2
DIOPT/EggNog analysis finds no matches/homology groups for the X.laevis and  X.trop proteins [NP_001087818.1 and XP_002933698.2]


Human gene [Gene ID: 4744] is called NEFH neurofilament heavy chain.  
DIOPT/EggNog matches the human NEFH to 100+ other species proteins with high confidence.  


DIOPT/EggNog confirms that the 3rd Xenopus accession, XP_002934335.2, is also quickly matched to the other vertebrate NEFH homolog groups.


=Xtropicalis protein=
NB: the Human gene [Gene ID: 4744] is called NEFH neurofilament heavy chain.
>XP_002933698.2 neurofilament heavy polypeptide [Xenopus tropicalis]


NCBI/COBALT alignment of the human, Xtrop and model vertebrate NEFH genes and these uncharacterized ''Xenopus'' proteins shows that X.laevis and  X.trop proteins [NP_001087818.1 and XP_002933698.2] are much shorter and have no significant similarity to the other vertebrate NEFH genes, although they are almost identical to each other.
Accessions used in COBALT MSA:
NP_066554.2 neurofilament heavy polypeptide [Homo sapiens] (1020aa)
XP_002933698.2 neurofilament heavy polypeptide [Xenopus tropicalis] (626aa)
NP_001087818.1 uncharacterized protein LOC447642 [Xenopus laevis](644aa)
NP_035034.2 neurofilament heavy polypeptide [Mus musculus] (1090aa)
NP_036739.2 neurofilament heavy polypeptide [Rattus norvegicus] (1064aa)
XP_040541104.1 neurofilament heavy polypeptide isoform X2 [Gallus gallus] (939aa)
XP_002934335.2 neurofilament heavy polypeptide [Xenopus tropicalis] (1125aa)
=Human NEFH protein=
>NP_066554.2 neurofilament heavy polypeptide ['''Homo sapiens]'''
MMSFGGADALLGAPFAPLHGGGSLHYALARKGGAGGTRSAAGSSSGFHSWTRTSVSSVSASPSRFRGAGA
ASSTDSLDTLSNGPEGCMVAVATSRSEKEQLQALNDRFAGYIDKVRQLEAHNRSLEGEAAALRQQQAGRS
AMGELYEREVREMRGAVLRLGAARGQLRLEQEHLLEDIAHVRQRLDDEARQREEAEAAARALARFAQEAE
AARVDLQKKAQALQEECGYLRRHHQEEVGELLGQIQGSGAAQAQMQAETRDALKCDVTSALREIRAQLEG
HAVQSTLQSEEWFRVRLDRLSEAAKVNTDAMRSAQEEITEYRRQLQARTTELEALKSTKDSLERQRSELE
DRHQADIASYQEAIQQLDAELRNTKWEMAAQLREYQDLLNVKMALDIEIAAYRKLLEGEECRIGFGPIPF
SLPEGLPKIPSVSTHIKVKSEEKIKVVEKSEKETVIVEEQTEETQVTEEVTEEEEKEAKEEEGKEEEGGE
EEEAEGGEEETKSPPAEEAASPEKEAKSPVKEEAKSPAEAKSPEKEEAKSPAEVKSPEKAKSPAKEEAKS
PPEAKSPEKEEAKSPAEVKSPEKAKSPAKEEAKSPAEAKSPEKAKSPVKEEAKSPAEAKSPVKEEAKSPA
EVKSPEKAKSPTKEEAKSPEKAKSPEKEEAKSPEKAKSPVKAEAKSPEKAKSPVKAEAKSPEKAKSPVKE
EAKSPEKAKSPVKEEAKSPEKAKSPVKEEAKTPEKAKSPVKEEAKSPEKAKSPEKAKTLDVKSPEAKTPA
KEEARSPADKFPEKAKSPVKEEVKSPEKAKSPLKEDAKAPEKEIPKKEEVKSPVKEEEKPQEVKVKEPPK
KAEEEKAPATPKTEEKKDSKKEEAPKKEAPKPKVEEKKEPAVEKPKESKVEAKKEEAEDKKKVPTPEKEA
PAKVEVKEDAKPKEKTEVAKKEPDDAKAKEPSKPAEKKEAAPEKKDTKEEKAKKPEEKPKTEAKAKEDDK
TLSKEPSKPKAEKAEKSSSTDQKDSKPPEKATEDKAAKGK
=NCBI protein accessions=
>XP_002933698.2 neurofilament heavy polypeptide '''[Xenopus tropicalis]
'''
MGAKFSKKKKSYCLGAGKDGESTETTEVEQKETETQSGQKDAVVPKEETPQPDKTVANGTSEEQNPQAEK
MGAKFSKKKKSYCLGAGKDGESTETTEVEQKETETQSGQKDAVVPKEETPQPDKTVANGTSEEQNPQAEK
QLDLETKPDEPTKDDQNAGPVSENNLESNKAETDETPQEKAESNCLSAVQEQSSEVAKGDSSSKAQTAPP
QLDLETKPDEPTKDDQNAGPVSENNLESNKAETDETPQEKAESNCLSAVQEQSSEVAKGDSSSKAQTAPP
Line 20: Line 60:
DLESEDQSSDKHSLEQEKSVEEESKAVVDVVQETLNQEQRDSNHISSSPVDKQDISGSAPQPLTEDTIEN
DLESEDQSSDKHSLEQEKSVEEESKAVVDVVQETLNQEQRDSNHISSSPVDKQDISGSAPQPLTEDTIEN
HATLAEESVHEKILADVQIVDKHVTLNGLPSKEEIKVKENVENGSNHSISDLNGQNENHEIEVCNE
HATLAEESVHEKILADVQIVDKHVTLNGLPSKEEIKVKENVENGSNHSISDLNGQNENHEIEVCNE
>NP_001087818.1 uncharacterized protein LOC447642 ['''Xenopus laevis]'''
MGAKFSKKKKSYCLGAGKDGESTETTEVEQKGTETQNGQKDAVAPKEETPQPDKPVGNGTSEEQKPQVEK
QSDPETKQDEPTKDEQNAISMPENNLESNKAEAEETPQEKTESNCSSAVQEHSSEVAKDDRSNLTQKAPP
SHEPEKTEEEQLNKNLDSSSDPPCDAVAEPEVKQSSVEKHQVIVQPVDHLQEATVKEAAEEHPPASQLTA
EKENQLVTDIDPIPENTVPLPEEVPEVVVTIADNNEKVTTPVIMTESSPMQMQTPNVAEISKLEEESKQS
LEGQSPLPEHEIPEPVQTPQPIAQTEKPDELPVLATPEPVAEQLVPLVDPVSEEIESLEHPQAHMPLTTE
PATEKELVDLEATPQELKSVAEPENPDHVAEPVTSEGLQQSQTETGKVCENETKQSVQEDEQPQPKTSEV
PEQPAHDEVNTANQESSTFGNKESEVLCSQSSDVSSNDKPELISPKQEICTEIPSEVSPLQTLQDSAAEL
PVPLEPNAHGMKLEDQSSDKHSVEQEKCVEEENKAEVDVVQETLNQEPIDGNHTASSPTAVQDTISDPST
ADVSGSAPLPIIEDKIESHVTLAEESVDEKIPSDVQIVDKHVTLNGLPSKEEIKVKENLENGSNHAISEL
NGQNEKHEIEVCNE
neurofilament heavy polypeptide [Xenopus tropicalis]
NCBI Reference Sequence: XP_002934335.2
GenPept Identical Proteins Graphics
>XP_002934335.2 neurofilament heavy polypeptide [Xenopus tropicalis]
MLSFSLERSMGPVTYRRPPGDHYLRSSSVSLTHSGSSSFQSHSRSRRSAPSYSESPEPSNGPRDEKEALQ
GLNERFAGYIEKVRRLEEQNRSLQQEATALRKQQAGISAIGQLYEREIRDMRNQLLKINSENGQEQLERD
RLSEDIELLRLKTQEEERLKEEALNSERALRQFLQECSLDREQLGRKVQSLEDEAAHLCKCHQEDVEEMI
QQIHGSQVTTQQMGEVPSLELASALKDIRSQLEGHIGKNNLQTQGWFQAQLNKLNEASQVNTQAIRSASD
EITNYRHQLQSQVTQLEVLKSSQQSLERQCLDMEDRHQAEMASYQDTLQQLDNELRTTKWEMAAQIREYQ
DLLNVKMALDIEIAAYRKLLEGEECRLGSGFDPFSFDEKIPRGPSTPKHIKVKKEEKIKIVEKSDKETVI
VEKQTEETHVTEEVTEEDEVVGSPEEEQEEEKEKTPSEEQEEIESKEEEQGEEKDEENEEEHDTVEEEEE
TKDKQGEDSADTEEGEGEKAGETKDEELDKKEEKKPTKATDKGPELKIKEGEPKEKSDKEEGKGEQKSKE
DKPVKKEEKEEPAKKEDKAYKEGKGEATKTEDEAAVKKEKKGEPTKKEEKEPAKKNEQTGIEEKPEPAKK
KEKEEPKKEEKKEAAKTEDKKEQTRADEKPEPAKKEEKGEQIKKDQKEPAKKEEKVDSKKKEEKDAAKKE
EKMEQTKTVEKPDLAKKGEKGESKTEEKPKEDEPTKKEPASKEEKKEQAKTEEKPEPVKKEEKGEPKAKE
QKEPVKKEEKTEPVKKDEGKESSKKDVPKKEESAKKDDKEIAKKEEKGAAQKDEELSKKDEPMKKEEKQE
PAKKEEPSKKEEKKEPSTKVVPTEEEQNLPEKVEKKESAKKEEKKQPVTEEKKEAPKKEVAADKKEDKET
TKKETKESTKKEDKPESAKKEDSPKIDEKKETPKKEEKKEPTKKEEPSKQEEKTEPPKKEVKDQVKDDKE
PAKKEEKSPKAEVKTPSKEPEKQPKLDGKPAKDDVKKSAKVEESKPATDEEKIPSKVEKPPKTEEKKESS
KDNEPLKKAEKEPTKKITEVTKIEEKSKEDSQKITKKDPVGGNTQNVKPKTEKIEKSSGTDQLESESTQG
AKSEK

Latest revision as of 12:14, 26 April 2023

XB5768883

This is the community wiki page for the gene XB5768883 please feel free to add any information that is relevant to this gene that is not already captured elsewhere in Xenbase.

orthology analysis

DIOPT/EggNog analysis finds no matches/homology groups for the X.laevis and X.trop proteins [NP_001087818.1 and XP_002933698.2]

DIOPT/EggNog matches the human NEFH to 100+ other species proteins with high confidence.

DIOPT/EggNog confirms that the 3rd Xenopus accession, XP_002934335.2, is also quickly matched to the other vertebrate NEFH homolog groups.

NB: the Human gene [Gene ID: 4744] is called NEFH neurofilament heavy chain.


NCBI/COBALT alignment of the human, Xtrop and model vertebrate NEFH genes and these uncharacterized Xenopus proteins shows that X.laevis and X.trop proteins [NP_001087818.1 and XP_002933698.2] are much shorter and have no significant similarity to the other vertebrate NEFH genes, although they are almost identical to each other.

Accessions used in COBALT MSA:

NP_066554.2 neurofilament heavy polypeptide [Homo sapiens] (1020aa)

XP_002933698.2 neurofilament heavy polypeptide [Xenopus tropicalis] (626aa)

NP_001087818.1 uncharacterized protein LOC447642 [Xenopus laevis](644aa)

NP_035034.2 neurofilament heavy polypeptide [Mus musculus] (1090aa)

NP_036739.2 neurofilament heavy polypeptide [Rattus norvegicus] (1064aa)

XP_040541104.1 neurofilament heavy polypeptide isoform X2 [Gallus gallus] (939aa)

XP_002934335.2 neurofilament heavy polypeptide [Xenopus tropicalis] (1125aa)

Human NEFH protein

>NP_066554.2 neurofilament heavy polypeptide [Homo sapiens] MMSFGGADALLGAPFAPLHGGGSLHYALARKGGAGGTRSAAGSSSGFHSWTRTSVSSVSASPSRFRGAGA ASSTDSLDTLSNGPEGCMVAVATSRSEKEQLQALNDRFAGYIDKVRQLEAHNRSLEGEAAALRQQQAGRS AMGELYEREVREMRGAVLRLGAARGQLRLEQEHLLEDIAHVRQRLDDEARQREEAEAAARALARFAQEAE AARVDLQKKAQALQEECGYLRRHHQEEVGELLGQIQGSGAAQAQMQAETRDALKCDVTSALREIRAQLEG HAVQSTLQSEEWFRVRLDRLSEAAKVNTDAMRSAQEEITEYRRQLQARTTELEALKSTKDSLERQRSELE DRHQADIASYQEAIQQLDAELRNTKWEMAAQLREYQDLLNVKMALDIEIAAYRKLLEGEECRIGFGPIPF SLPEGLPKIPSVSTHIKVKSEEKIKVVEKSEKETVIVEEQTEETQVTEEVTEEEEKEAKEEEGKEEEGGE EEEAEGGEEETKSPPAEEAASPEKEAKSPVKEEAKSPAEAKSPEKEEAKSPAEVKSPEKAKSPAKEEAKS PPEAKSPEKEEAKSPAEVKSPEKAKSPAKEEAKSPAEAKSPEKAKSPVKEEAKSPAEAKSPVKEEAKSPA EVKSPEKAKSPTKEEAKSPEKAKSPEKEEAKSPEKAKSPVKAEAKSPEKAKSPVKAEAKSPEKAKSPVKE EAKSPEKAKSPVKEEAKSPEKAKSPVKEEAKTPEKAKSPVKEEAKSPEKAKSPEKAKTLDVKSPEAKTPA KEEARSPADKFPEKAKSPVKEEVKSPEKAKSPLKEDAKAPEKEIPKKEEVKSPVKEEEKPQEVKVKEPPK KAEEEKAPATPKTEEKKDSKKEEAPKKEAPKPKVEEKKEPAVEKPKESKVEAKKEEAEDKKKVPTPEKEA PAKVEVKEDAKPKEKTEVAKKEPDDAKAKEPSKPAEKKEAAPEKKDTKEEKAKKPEEKPKTEAKAKEDDK TLSKEPSKPKAEKAEKSSSTDQKDSKPPEKATEDKAAKGK

NCBI protein accessions

>XP_002933698.2 neurofilament heavy polypeptide [Xenopus tropicalis] MGAKFSKKKKSYCLGAGKDGESTETTEVEQKETETQSGQKDAVVPKEETPQPDKTVANGTSEEQNPQAEK QLDLETKPDEPTKDDQNAGPVSENNLESNKAETDETPQEKAESNCLSAVQEQSSEVAKGDSSSKAQTAPP SQEPEKTGEEQLNKDLVSSSDPPCAAVAEPEVIQSVSVEQQVTVQQVDHLQEATVKEAAEELPPASKLAA EEENQLVTEHSMSENYVPLPEVVPEVVSNSEVDNKEKDTTPVFMPESSTKQIAANVPEIRISVEEPKQSF EDQGPLPETDIPEPVQTPQLTEEVEEPDEVPVLAASEPAAEQLVPLADTVSSETESLGHPQVHMPLTTVP AHEKDAESAPQELKSVAEPEDHIAEAVTSEAPQQSKSETEVCEKETKQDAQEDDQQAKTSEVPEQPAHDE KNNANQESTTFDNKEEALCPESSDVVSNEQPKLTSPEQQICTEIASEVTPLQNLQDCAAELSVPLEPNAN DLESEDQSSDKHSLEQEKSVEEESKAVVDVVQETLNQEQRDSNHISSSPVDKQDISGSAPQPLTEDTIEN HATLAEESVHEKILADVQIVDKHVTLNGLPSKEEIKVKENVENGSNHSISDLNGQNENHEIEVCNE

>NP_001087818.1 uncharacterized protein LOC447642 [Xenopus laevis] MGAKFSKKKKSYCLGAGKDGESTETTEVEQKGTETQNGQKDAVAPKEETPQPDKPVGNGTSEEQKPQVEK QSDPETKQDEPTKDEQNAISMPENNLESNKAEAEETPQEKTESNCSSAVQEHSSEVAKDDRSNLTQKAPP SHEPEKTEEEQLNKNLDSSSDPPCDAVAEPEVKQSSVEKHQVIVQPVDHLQEATVKEAAEEHPPASQLTA EKENQLVTDIDPIPENTVPLPEEVPEVVVTIADNNEKVTTPVIMTESSPMQMQTPNVAEISKLEEESKQS LEGQSPLPEHEIPEPVQTPQPIAQTEKPDELPVLATPEPVAEQLVPLVDPVSEEIESLEHPQAHMPLTTE PATEKELVDLEATPQELKSVAEPENPDHVAEPVTSEGLQQSQTETGKVCENETKQSVQEDEQPQPKTSEV PEQPAHDEVNTANQESSTFGNKESEVLCSQSSDVSSNDKPELISPKQEICTEIPSEVSPLQTLQDSAAEL PVPLEPNAHGMKLEDQSSDKHSVEQEKCVEEENKAEVDVVQETLNQEPIDGNHTASSPTAVQDTISDPST ADVSGSAPLPIIEDKIESHVTLAEESVDEKIPSDVQIVDKHVTLNGLPSKEEIKVKENLENGSNHAISEL NGQNEKHEIEVCNE

neurofilament heavy polypeptide [Xenopus tropicalis] NCBI Reference Sequence: XP_002934335.2 GenPept Identical Proteins Graphics >XP_002934335.2 neurofilament heavy polypeptide [Xenopus tropicalis] MLSFSLERSMGPVTYRRPPGDHYLRSSSVSLTHSGSSSFQSHSRSRRSAPSYSESPEPSNGPRDEKEALQ GLNERFAGYIEKVRRLEEQNRSLQQEATALRKQQAGISAIGQLYEREIRDMRNQLLKINSENGQEQLERD RLSEDIELLRLKTQEEERLKEEALNSERALRQFLQECSLDREQLGRKVQSLEDEAAHLCKCHQEDVEEMI QQIHGSQVTTQQMGEVPSLELASALKDIRSQLEGHIGKNNLQTQGWFQAQLNKLNEASQVNTQAIRSASD EITNYRHQLQSQVTQLEVLKSSQQSLERQCLDMEDRHQAEMASYQDTLQQLDNELRTTKWEMAAQIREYQ DLLNVKMALDIEIAAYRKLLEGEECRLGSGFDPFSFDEKIPRGPSTPKHIKVKKEEKIKIVEKSDKETVI VEKQTEETHVTEEVTEEDEVVGSPEEEQEEEKEKTPSEEQEEIESKEEEQGEEKDEENEEEHDTVEEEEE TKDKQGEDSADTEEGEGEKAGETKDEELDKKEEKKPTKATDKGPELKIKEGEPKEKSDKEEGKGEQKSKE DKPVKKEEKEEPAKKEDKAYKEGKGEATKTEDEAAVKKEKKGEPTKKEEKEPAKKNEQTGIEEKPEPAKK KEKEEPKKEEKKEAAKTEDKKEQTRADEKPEPAKKEEKGEQIKKDQKEPAKKEEKVDSKKKEEKDAAKKE EKMEQTKTVEKPDLAKKGEKGESKTEEKPKEDEPTKKEPASKEEKKEQAKTEEKPEPVKKEEKGEPKAKE QKEPVKKEEKTEPVKKDEGKESSKKDVPKKEESAKKDDKEIAKKEEKGAAQKDEELSKKDEPMKKEEKQE PAKKEEPSKKEEKKEPSTKVVPTEEEQNLPEKVEKKESAKKEEKKQPVTEEKKEAPKKEVAADKKEDKET TKKETKESTKKEDKPESAKKEDSPKIDEKKETPKKEEKKEPTKKEEPSKQEEKTEPPKKEVKDQVKDDKE PAKKEEKSPKAEVKTPSKEPEKQPKLDGKPAKDDVKKSAKVEESKPATDEEKIPSKVEKPPKTEEKKESS KDNEPLKKAEKEPTKKITEVTKIEEKSKEDSQKITKKDPVGGNTQNVKPKTEKIEKSSGTDQLESESTQG AKSEK