XB-FEAT-22065629: Difference between revisions

From XenWiki
Jump to navigation Jump to search
(Created page with "=uncharacterized protein XB22065629= 05JAN2023 =protein identity and function= 04JAN2023 The following protein accessions where run through DIOPT Eggnog to attempt to identif...")
 
Line 3: Line 3:


=protein identity and function=
=protein identity and function=
04JAN2023
04JAN2023 (cjz)
The following protein accessions where run through DIOPT Eggnog to attempt to identify their gene and possible molecular function:
The following protein accessions where run through DIOPT Eggnog to attempt to identify their gene and possible molecular function with alsmost identical results ( confirming both models are paralogs in ''Xenopus laevis'' sub-genomes despite ~100aa difference in length):


1) ''uncharacterized protein XB22065629''.L [Xenopus laevis]
1) ''uncharacterized protein XB22065629''.L [Xenopus laevis]
Line 16: Line 16:
PPDVGREAVNTREIHLPHPPNMGRSYTGANEKKTNHGDQDMTQVVRRKGSVTGSHQSGDSMRRETSTESSSVSSRTNSYAHGITHPKP
PPDVGREAVNTREIHLPHPPNMGRSYTGANEKKTNHGDQDMTQVVRRKGSVTGSHQSGDSMRRETSTESSSVSSRTNSYAHGITHPKP


this protein hits the RBM43 homology group in 3 ( score 91) and 16 species (score 76), but also has a Nmi/IFP 35 domain (NID), matching NMI and IFP35 gens in 100+ species.


2)  
2) uncharacterized protein XB22065629.S isoform X1 [Xenopus laevis]
NCBI Reference Sequence: XP_018124435.1 646 aa            linear  VRT 14-MAY-2021
 
MTWYKRGVQVCGVPGHLFEEELLRDKLIIHFLRPKNQGGEIQELRYPTKDSGVAILTFEDEEVAERILKTTHLLDVNGQSFVLEVMRLQF
SMLVITSLDLSRFKNLNLLVDLLEKHRVTILGRRDETLNISAEFEDLRKFRSEIMAKALTCDTSPLGQRSRKQISVSGLESPVKDNLGVR
PTSRQPSGNANNMEKPQSRRTPNMSSSHEKLPSRSSSHEKLPSRSNSHEKPPSRSNSHDKPPSRSNSHEKPPSRSNSHEKPPSRSNSYEI
LPSRSNSHEKPPSRSNSHEKPPSRSNSHEKPPSRSNSHEKPPSRSNSYEILPSRTMRREADMAPTFRNADTLRVRPMASTIDSVVQGSNT
TSEGGMSFKATMNPRVRMNNDSVSQKLTDLNLVSSTIVRSRPSGEAIRTTRKPSAVTKDFPSEKALVKSFPVDKDVLHYIGVFKKQYIDE
VLRRGFTDIEVAEGEDISYVTLKSQSPFLQVLFTDCCDKISELFSKGQNSLRTEDLDLTQVPPLARRDITKEIENLGRANSVAVVDYKDL
LHLIGGSSHIHLLKERWKTISQSRPPYAGQEALSSGQISLPHPPDMGRSYAGANEKKTNHGDQDVAQAVRHKAGVTSSPQTVRTGNQSKG
FVRHETVGVTSPKPNR
 
these Xlaevis proteins hit the RBM43 homology group in 3 ( score 91) and 16 reptile/vertbrate species (score 76), but also has a Nmi/IFP 35 domain (NID), matching NMI and IFP35 genes/domains in 100+ species.
 
 
Human RBM43 (RNA binding motif protein 43) Gene ID: 375287 , 357aa NP_940959
MASVLNVKESKAPERTVVVAGLPVDLFSDQLLAVLVKSHFQDIKNEGGDVEDVIYPTRTKGVAYVIFKEKKVAENVIRQKKHWLARKTRH
AELTVSLRVSHFGDKIFSSVNAILDLSVFGKEVTLETLVKDLKKKIPSLSFSPLKPNGRISVEGSFLAVKRLRESLLARACSLLEKDRNF
TSEERKWNRQNPQRNLQRSNNSLASVRTLVPETARSGEMLVLDTDVFLYLKHKCGSYESTLKKFHILSQEKVDGEITTICLKSIQVGSQP
NNAKHVKELIEEWSHALYLKLRKETFILEGKENREKRMIKRACEQLSSRYLEVLINLYRTHIDIIGSSSDTYLFKKGVMKLIGQKVS
 
human NMI [N-myc and STAT interactor]  Gene ID: 9111  307aa NP_004679.2 
MEADKDDTQQILKEHSPDEFIKDEQNKGLIDEITKKNIQLKKEIQKLETELQEATKEFQIKEDIPETKMKFLSVETPENDSQLSNISCSF
QVSSKVPYEIQKGQALITFEKEEVAQNVVSMSKHHVQIKDVNLEVTAKPVPLNSGVRFQVYVEVSKMKINVTEIPDTLREDQMRDKLELS
FSKSRNGGGEVDRVDYDRQSGSAVITFVEIGVADKILKKKEYPLYINQTCHRVTVSPYTEIHLKKYQIFSGTSKRTVLLTGMEGIQMDEE
IVEDLINIHFQRAKNGGGEVDVVKCSLGQPHIAYFEE
 
sequence conservation
THe Xla.L and Xla.S proteins are moderateyl well conserved at 3' and 5" ends , with the Xal.S isoform has longer central aa string.
 
synteny
synteny not conserved between human and Xenopus, using NMI as anchor, (note Xenopus has no annotated fab5p10  gene models)
 
human  chr 2: LOC107985826[ncRNA]> ... FABP5P10< ... '''RBM43'''< ... '''NMI'''< ... LOC107985827[ncRNA<]...
 
Xenopus chr 9_10.L:  rnd3.L[Rho family GTPase 3 ]< ... '''nmi'''< ... LOC108701325[protein mono-ADP-ribosyltransferase PARP15]> ... LOC108701839[protein mono-ADP-ribosyltransferase PARP15]


=nomenclature changes=
=nomenclature changes=
05JAN2023
05JAN2023

Revision as of 08:16, 5 January 2023

uncharacterized protein XB22065629

05JAN2023

protein identity and function

04JAN2023 (cjz) The following protein accessions where run through DIOPT Eggnog to attempt to identify their gene and possible molecular function with alsmost identical results ( confirming both models are paralogs in Xenopus laevis sub-genomes despite ~100aa difference in length):

1) uncharacterized protein XB22065629.L [Xenopus laevis] NCBI Reference Sequence: XP_018122493.1 538 aa linear VRT 14-MAY-2021

MSCYKRGLRVCGVPGHLFEEELLRDKLLIHFLRPKNQGGEIQNLHYPTKDEGVAILTFEEEEVAERILKTQHLLDVNGQLFPLEVMRLQF SMPVITSLDLSRLKNKKLLVDLFEKHKVMILNKRDEMFIISAEFEDLRQLRSEIMATVLTCDTSPLGQRSQKQISESGPESPVKDNLGVK PSSQQPSGNDNNVEKPQSRRTSSRSNSQEKHPSRARRREANMAPTFSNAHILGHRPTTDSVAQGSNPTSEGGMSFKTTRDPIVRMNNDSA SQMLSDLNLGSSTNRGQPSIEATRTTRNPSAITKDFPSQASLVKSFSVDNDVLYYIRIFETERVDEVLKRCAVNIQVAEGEDISNVTLKS QSPSLKLLFSDCCDEILQIFSEWQSNLRTEDLDLTQFPTLERRKMTEKIQHLGRAHGVAIIASEDGLHLIGGPSQIHWLIDWWKTISQPR PPDVGREAVNTREIHLPHPPNMGRSYTGANEKKTNHGDQDMTQVVRRKGSVTGSHQSGDSMRRETSTESSSVSSRTNSYAHGITHPKP


2) uncharacterized protein XB22065629.S isoform X1 [Xenopus laevis] NCBI Reference Sequence: XP_018124435.1 646 aa linear VRT 14-MAY-2021

MTWYKRGVQVCGVPGHLFEEELLRDKLIIHFLRPKNQGGEIQELRYPTKDSGVAILTFEDEEVAERILKTTHLLDVNGQSFVLEVMRLQF SMLVITSLDLSRFKNLNLLVDLLEKHRVTILGRRDETLNISAEFEDLRKFRSEIMAKALTCDTSPLGQRSRKQISVSGLESPVKDNLGVR PTSRQPSGNANNMEKPQSRRTPNMSSSHEKLPSRSSSHEKLPSRSNSHEKPPSRSNSHDKPPSRSNSHEKPPSRSNSHEKPPSRSNSYEI LPSRSNSHEKPPSRSNSHEKPPSRSNSHEKPPSRSNSHEKPPSRSNSYEILPSRTMRREADMAPTFRNADTLRVRPMASTIDSVVQGSNT TSEGGMSFKATMNPRVRMNNDSVSQKLTDLNLVSSTIVRSRPSGEAIRTTRKPSAVTKDFPSEKALVKSFPVDKDVLHYIGVFKKQYIDE VLRRGFTDIEVAEGEDISYVTLKSQSPFLQVLFTDCCDKISELFSKGQNSLRTEDLDLTQVPPLARRDITKEIENLGRANSVAVVDYKDL LHLIGGSSHIHLLKERWKTISQSRPPYAGQEALSSGQISLPHPPDMGRSYAGANEKKTNHGDQDVAQAVRHKAGVTSSPQTVRTGNQSKG FVRHETVGVTSPKPNR

these Xlaevis proteins hit the RBM43 homology group in 3 ( score 91) and 16 reptile/vertbrate species (score 76), but also has a Nmi/IFP 35 domain (NID), matching NMI and IFP35 genes/domains in 100+ species.


Human RBM43 (RNA binding motif protein 43) Gene ID: 375287 , 357aa NP_940959 MASVLNVKESKAPERTVVVAGLPVDLFSDQLLAVLVKSHFQDIKNEGGDVEDVIYPTRTKGVAYVIFKEKKVAENVIRQKKHWLARKTRH AELTVSLRVSHFGDKIFSSVNAILDLSVFGKEVTLETLVKDLKKKIPSLSFSPLKPNGRISVEGSFLAVKRLRESLLARACSLLEKDRNF TSEERKWNRQNPQRNLQRSNNSLASVRTLVPETARSGEMLVLDTDVFLYLKHKCGSYESTLKKFHILSQEKVDGEITTICLKSIQVGSQP NNAKHVKELIEEWSHALYLKLRKETFILEGKENREKRMIKRACEQLSSRYLEVLINLYRTHIDIIGSSSDTYLFKKGVMKLIGQKVS

human NMI [N-myc and STAT interactor] Gene ID: 9111 307aa NP_004679.2 MEADKDDTQQILKEHSPDEFIKDEQNKGLIDEITKKNIQLKKEIQKLETELQEATKEFQIKEDIPETKMKFLSVETPENDSQLSNISCSF QVSSKVPYEIQKGQALITFEKEEVAQNVVSMSKHHVQIKDVNLEVTAKPVPLNSGVRFQVYVEVSKMKINVTEIPDTLREDQMRDKLELS FSKSRNGGGEVDRVDYDRQSGSAVITFVEIGVADKILKKKEYPLYINQTCHRVTVSPYTEIHLKKYQIFSGTSKRTVLLTGMEGIQMDEE IVEDLINIHFQRAKNGGGEVDVVKCSLGQPHIAYFEE

sequence conservation THe Xla.L and Xla.S proteins are moderateyl well conserved at 3' and 5" ends , with the Xal.S isoform has longer central aa string.

synteny synteny not conserved between human and Xenopus, using NMI as anchor, (note Xenopus has no annotated fab5p10 gene models)

human chr 2: LOC107985826[ncRNA]> ... FABP5P10< ... RBM43< ... NMI< ... LOC107985827[ncRNA<]...

Xenopus chr 9_10.L: rnd3.L[Rho family GTPase 3 ]< ... nmi< ... LOC108701325[protein mono-ADP-ribosyltransferase PARP15]> ... LOC108701839[protein mono-ADP-ribosyltransferase PARP15]

nomenclature changes

05JAN2023