XB-FEAT-22065629: Difference between revisions
Line 4: | Line 4: | ||
=protein identity and function= | =protein identity and function= | ||
04JAN2023 (cjz) | 04JAN2023 (cjz) | ||
The following protein accessions where run through DIOPT Eggnog to attempt to identify their gene and possible molecular function with | The following protein accessions where both run through DIOPT Eggnog to attempt to identify their gene and possible molecular function with asmost identical results ( confirming both models are paralogs in ''Xenopus laevis'' sub-genomes despite ~100aa difference in length): | ||
1) ''uncharacterized protein XB22065629''.L [Xenopus laevis] | 1) ''uncharacterized protein XB22065629''.L [Xenopus laevis] | ||
Line 32: | Line 32: | ||
=sequence conservation= | |||
alignment via Cobalt RID VDPN03A0212 (2 seqs) showed that the Xla.L and Xla.S proteins are moderately well conserved at 3' and 5" ends , with the Xal.S isoform has longer central aa string. | |||
alignment of the Xenopus seqs v human RBM3 aligns best to teh 3' end of the Xla.L/S seqs (Cobalt RID VDR6U4B0212 (3 seqs)) as does the human NMI (Cobalt RID VDRBCG61212 (3 seqs)) | |||
Human RBM43 (RNA binding motif protein 43) Gene ID: 375287 , 357aa NP_940959 | Human RBM43 (RNA binding motif protein 43) Gene ID: 375287 , 357aa NP_940959 | ||
MASVLNVKESKAPERTVVVAGLPVDLFSDQLLAVLVKSHFQDIKNEGGDVEDVIYPTRTKGVAYVIFKEKKVAENVIRQKKHWLARKTRH | MASVLNVKESKAPERTVVVAGLPVDLFSDQLLAVLVKSHFQDIKNEGGDVEDVIYPTRTKGVAYVIFKEKKVAENVIRQKKHWLARKTRH | ||
Line 43: | Line 49: | ||
FSKSRNGGGEVDRVDYDRQSGSAVITFVEIGVADKILKKKEYPLYINQTCHRVTVSPYTEIHLKKYQIFSGTSKRTVLLTGMEGIQMDEE | FSKSRNGGGEVDRVDYDRQSGSAVITFVEIGVADKILKKKEYPLYINQTCHRVTVSPYTEIHLKKYQIFSGTSKRTVLLTGMEGIQMDEE | ||
IVEDLINIHFQRAKNGGGEVDVVKCSLGQPHIAYFEE | IVEDLINIHFQRAKNGGGEVDVVKCSLGQPHIAYFEE | ||
synteny | synteny | ||
Line 53: | Line 56: | ||
Xenopus chr 9_10.L: rnd3.L[Rho family GTPase 3 ]< ... '''nmi'''< ... LOC108701325[protein mono-ADP-ribosyltransferase PARP15]> ... LOC108701839[protein mono-ADP-ribosyltransferase PARP15] | Xenopus chr 9_10.L: rnd3.L[Rho family GTPase 3 ]< ... '''nmi'''< ... LOC108701325[protein mono-ADP-ribosyltransferase PARP15]> ... LOC108701839[protein mono-ADP-ribosyltransferase PARP15] | ||
Xenopus chr6.L: ''XB22065629'' | |||
=nomenclature changes= | =nomenclature changes= | ||
05JAN2023 | 05JAN2023 |
Revision as of 07:25, 5 January 2023
uncharacterized protein XB22065629
05JAN2023
protein identity and function
04JAN2023 (cjz) The following protein accessions where both run through DIOPT Eggnog to attempt to identify their gene and possible molecular function with asmost identical results ( confirming both models are paralogs in Xenopus laevis sub-genomes despite ~100aa difference in length):
1) uncharacterized protein XB22065629.L [Xenopus laevis] NCBI Reference Sequence: XP_018122493.1 538 aa linear VRT 14-MAY-2021
MSCYKRGLRVCGVPGHLFEEELLRDKLLIHFLRPKNQGGEIQNLHYPTKDEGVAILTFEEEEVAERILKTQHLLDVNGQLFPLEVMRLQF SMPVITSLDLSRLKNKKLLVDLFEKHKVMILNKRDEMFIISAEFEDLRQLRSEIMATVLTCDTSPLGQRSQKQISESGPESPVKDNLGVK PSSQQPSGNDNNVEKPQSRRTSSRSNSQEKHPSRARRREANMAPTFSNAHILGHRPTTDSVAQGSNPTSEGGMSFKTTRDPIVRMNNDSA SQMLSDLNLGSSTNRGQPSIEATRTTRNPSAITKDFPSQASLVKSFSVDNDVLYYIRIFETERVDEVLKRCAVNIQVAEGEDISNVTLKS QSPSLKLLFSDCCDEILQIFSEWQSNLRTEDLDLTQFPTLERRKMTEKIQHLGRAHGVAIIASEDGLHLIGGPSQIHWLIDWWKTISQPR PPDVGREAVNTREIHLPHPPNMGRSYTGANEKKTNHGDQDMTQVVRRKGSVTGSHQSGDSMRRETSTESSSVSSRTNSYAHGITHPKP
2) uncharacterized protein XB22065629.S isoform X1 [Xenopus laevis]
NCBI Reference Sequence: XP_018124435.1 646 aa linear VRT 14-MAY-2021
MTWYKRGVQVCGVPGHLFEEELLRDKLIIHFLRPKNQGGEIQELRYPTKDSGVAILTFEDEEVAERILKTTHLLDVNGQSFVLEVMRLQF SMLVITSLDLSRFKNLNLLVDLLEKHRVTILGRRDETLNISAEFEDLRKFRSEIMAKALTCDTSPLGQRSRKQISVSGLESPVKDNLGVR PTSRQPSGNANNMEKPQSRRTPNMSSSHEKLPSRSSSHEKLPSRSNSHEKPPSRSNSHDKPPSRSNSHEKPPSRSNSHEKPPSRSNSYEI LPSRSNSHEKPPSRSNSHEKPPSRSNSHEKPPSRSNSHEKPPSRSNSYEILPSRTMRREADMAPTFRNADTLRVRPMASTIDSVVQGSNT TSEGGMSFKATMNPRVRMNNDSVSQKLTDLNLVSSTIVRSRPSGEAIRTTRKPSAVTKDFPSEKALVKSFPVDKDVLHYIGVFKKQYIDE VLRRGFTDIEVAEGEDISYVTLKSQSPFLQVLFTDCCDKISELFSKGQNSLRTEDLDLTQVPPLARRDITKEIENLGRANSVAVVDYKDL LHLIGGSSHIHLLKERWKTISQSRPPYAGQEALSSGQISLPHPPDMGRSYAGANEKKTNHGDQDVAQAVRHKAGVTSSPQTVRTGNQSKG FVRHETVGVTSPKPNR
these Xlaevis proteins hit the RBM43 homology group in 3 ( score 91) and 16 reptile/vertbrate species (score 76), but also has a Nmi/IFP 35 domain (NID), matching NMI and IFP35 genes/domains in 100+ species.
sequence conservation
alignment via Cobalt RID VDPN03A0212 (2 seqs) showed that the Xla.L and Xla.S proteins are moderately well conserved at 3' and 5" ends , with the Xal.S isoform has longer central aa string.
alignment of the Xenopus seqs v human RBM3 aligns best to teh 3' end of the Xla.L/S seqs (Cobalt RID VDR6U4B0212 (3 seqs)) as does the human NMI (Cobalt RID VDRBCG61212 (3 seqs))
Human RBM43 (RNA binding motif protein 43) Gene ID: 375287 , 357aa NP_940959 MASVLNVKESKAPERTVVVAGLPVDLFSDQLLAVLVKSHFQDIKNEGGDVEDVIYPTRTKGVAYVIFKEKKVAENVIRQKKHWLARKTRH AELTVSLRVSHFGDKIFSSVNAILDLSVFGKEVTLETLVKDLKKKIPSLSFSPLKPNGRISVEGSFLAVKRLRESLLARACSLLEKDRNF TSEERKWNRQNPQRNLQRSNNSLASVRTLVPETARSGEMLVLDTDVFLYLKHKCGSYESTLKKFHILSQEKVDGEITTICLKSIQVGSQP NNAKHVKELIEEWSHALYLKLRKETFILEGKENREKRMIKRACEQLSSRYLEVLINLYRTHIDIIGSSSDTYLFKKGVMKLIGQKVS
human NMI [N-myc and STAT interactor] Gene ID: 9111 307aa NP_004679.2 MEADKDDTQQILKEHSPDEFIKDEQNKGLIDEITKKNIQLKKEIQKLETELQEATKEFQIKEDIPETKMKFLSVETPENDSQLSNISCSF QVSSKVPYEIQKGQALITFEKEEVAQNVVSMSKHHVQIKDVNLEVTAKPVPLNSGVRFQVYVEVSKMKINVTEIPDTLREDQMRDKLELS FSKSRNGGGEVDRVDYDRQSGSAVITFVEIGVADKILKKKEYPLYINQTCHRVTVSPYTEIHLKKYQIFSGTSKRTVLLTGMEGIQMDEE IVEDLINIHFQRAKNGGGEVDVVKCSLGQPHIAYFEE
synteny synteny not conserved between human and Xenopus, using NMI as anchor, (note Xenopus has no annotated fab5p10 gene models)
human chr 2: LOC107985826[ncRNA]> ... FABP5P10< ... RBM43< ... NMI< ... LOC107985827[ncRNA<]...
Xenopus chr 9_10.L: rnd3.L[Rho family GTPase 3 ]< ... nmi< ... LOC108701325[protein mono-ADP-ribosyltransferase PARP15]> ... LOC108701839[protein mono-ADP-ribosyltransferase PARP15]
Xenopus chr6.L: XB22065629
nomenclature changes
05JAN2023