BLASTX nr result
ID: Mentha25_contig00019010
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Mentha25_contig00019010 (1602 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CAN79213.1| hypothetical protein VITISV_025939 [Vitis vinifera] 822 0.0 ref|XP_006483724.1| PREDICTED: LOW QUALITY PROTEIN: U2 snRNP-ass... 820 0.0 ref|XP_006450262.1| hypothetical protein CICLE_v10007357mg [Citr... 820 0.0 ref|XP_002324341.2| RNA recognition motif-containing family prot... 818 0.0 ref|XP_004234429.1| PREDICTED: U2 snRNP-associated SURP motif-co... 818 0.0 gb|EYU29204.1| hypothetical protein MIMGU_mgv1a000894mg [Mimulus... 813 0.0 ref|XP_006353899.1| PREDICTED: U2 snRNP-associated SURP motif-co... 812 0.0 ref|XP_006353898.1| PREDICTED: U2 snRNP-associated SURP motif-co... 812 0.0 ref|XP_006353897.1| PREDICTED: U2 snRNP-associated SURP motif-co... 812 0.0 ref|XP_007225360.1| hypothetical protein PRUPE_ppa000894mg [Prun... 812 0.0 emb|CBI21155.3| unnamed protein product [Vitis vinifera] 808 0.0 gb|EXC01118.1| U2 snRNP-associated SURP motif-containing protein... 807 0.0 ref|XP_002515412.1| RNA binding protein, putative [Ricinus commu... 806 0.0 ref|XP_006599196.1| PREDICTED: U2 snRNP-associated SURP motif-co... 804 0.0 ref|XP_004291970.1| PREDICTED: U2 snRNP-associated SURP motif-co... 803 0.0 ref|XP_002308714.1| RNA recognition motif-containing family prot... 801 0.0 ref|XP_006585862.1| PREDICTED: U2 snRNP-associated SURP motif-co... 800 0.0 ref|XP_006585860.1| PREDICTED: U2 snRNP-associated SURP motif-co... 800 0.0 ref|XP_004138695.1| PREDICTED: U2 snRNP-associated SURP motif-co... 798 0.0 ref|XP_007011692.1| RNA recognition motif-containing protein iso... 796 0.0 >emb|CAN79213.1| hypothetical protein VITISV_025939 [Vitis vinifera] Length = 1384 Score = 822 bits (2123), Expect = 0.0 Identities = 414/536 (77%), Positives = 463/536 (86%), Gaps = 2/536 (0%) Frame = -1 Query: 1602 QALPAPPPGHMAIRSKEGATVILSGPSGPPVTTVPSQNSELVLTPNVPDINVAPPDDNHL 1423 QALPAPPPGHMAIRSKEGATVILSGPSGPPVT+VP+QNSELVLTPNVPDI V+PP+D+HL Sbjct: 434 QALPAPPPGHMAIRSKEGATVILSGPSGPPVTSVPNQNSELVLTPNVPDIMVSPPEDDHL 493 Query: 1422 QHVIDTMALYVLDGGCAFEQAIMERGRGNPLFSFLFELGSKEHTYYVWRLYSFAQGDTLQ 1243 HVIDTMALYVLDGGCAFEQAIMERGRGNPLF+FLFELGSKEHTYYVWRLYSFAQGDTLQ Sbjct: 494 HHVIDTMALYVLDGGCAFEQAIMERGRGNPLFNFLFELGSKEHTYYVWRLYSFAQGDTLQ 553 Query: 1242 RWRTEPFIMITGSGRWIPPSLPTAKGPDHEKEAGSTYAAGKSKRVEMERTLTDTQRDEFE 1063 RWRTEPFIMITGSGRW+PP LPT + P+HEKE+G+T+AAG+S+RVE+ERTLTD QRDEFE Sbjct: 554 RWRTEPFIMITGSGRWMPPPLPTVRSPEHEKESGTTFAAGRSRRVELERTLTDPQRDEFE 613 Query: 1062 DMLRALTLERSQIKEAMGFALDNADAAGEVVEVLTESLTLKETSIPTKVARLMLVSDILH 883 DMLRALTLERSQIKEAMGFALDNADAAGE+VEVLTESLTLKET IPTKVARLMLVSD+LH Sbjct: 614 DMLRALTLERSQIKEAMGFALDNADAAGEIVEVLTESLTLKETPIPTKVARLMLVSDVLH 673 Query: 882 NSSAPVKNASAYRTKFEATLPDIMESFNDLYRSVTGRITAEALKERVLKVLQVWADWFLF 703 NSSAPVKNASAYRTKFEATLPDIMESFNDLYRSVTGRITAEALKERV+KVLQVWADWFLF Sbjct: 674 NSSAPVKNASAYRTKFEATLPDIMESFNDLYRSVTGRITAEALKERVMKVLQVWADWFLF 733 Query: 702 SDAYVNGLRATFLRFSNSGVIPFHSICGDAPELERKAGSAETGDGGKINQDAALAIGKGA 523 SDAYVNGLRATFLR NSGV PFHSICGDAPE+E+K S +TG+GGK NQDAALA+GKGA Sbjct: 734 SDAYVNGLRATFLRSGNSGVTPFHSICGDAPEIEKKTSSEDTGEGGKSNQDAALAMGKGA 793 Query: 522 AMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEKQRGFEIDDELKSAHSQS 343 AMKEL +LPI ELERRCRHNGLSLVGGRE+MVARLL LEEAEKQRG+++DD+LK A S S Sbjct: 794 AMKELLSLPIAELERRCRHNGLSLVGGREIMVARLLSLEEAEKQRGYDLDDDLKYAQSHS 853 Query: 342 SSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLPPKDLNLQPDIN--SSEG 169 +SGRY S +KE E + +S +D + S G+GS+ L P QP++ +++G Sbjct: 854 NSGRYPSSRKEIGVETESVGLSGWNRYGEDEIQSQGKGSVPLAPTIPIPQPELKAFTNKG 913 Query: 168 KNESILPSSKWAXXXXXXXXXXXXXXXDLALAYSSSGSENAGDVLSKTEELEVTTD 1 K + +LP+SKWA L L+YSSSGSENAGD K +E+E T+ Sbjct: 914 KTDPVLPASKWAREDDDSDDEQKRSARGLGLSYSSSGSENAGDGPXKADEMEFATE 969 >ref|XP_006483724.1| PREDICTED: LOW QUALITY PROTEIN: U2 snRNP-associated SURP motif-containing protein-like [Citrus sinensis] Length = 1017 Score = 820 bits (2117), Expect = 0.0 Identities = 416/536 (77%), Positives = 459/536 (85%), Gaps = 2/536 (0%) Frame = -1 Query: 1602 QALPAPPPGHMAIRSKEGATVILSGPSGPPVTTVPSQNSELVLTPNVPDINVAPPDDNHL 1423 QALPAPPPG MAIRSKEGATVILSGPSGPPVTTVPSQNSELVLTPNVPDI V PP+D HL Sbjct: 317 QALPAPPPGQMAIRSKEGATVILSGPSGPPVTTVPSQNSELVLTPNVPDIMVIPPEDRHL 376 Query: 1422 QHVIDTMALYVLDGGCAFEQAIMERGRGNPLFSFLFELGSKEHTYYVWRLYSFAQGDTLQ 1243 +HVIDT+ALYVLDGGCAFEQAIMERGRGNPLF+FLFELGSKEHTYYVWRLYSFAQGDTLQ Sbjct: 377 RHVIDTLALYVLDGGCAFEQAIMERGRGNPLFNFLFELGSKEHTYYVWRLYSFAQGDTLQ 436 Query: 1242 RWRTEPFIMITGSGRWIPPSLPTAKGPDHEKEAGSTYAAGKSKRVEMERTLTDTQRDEFE 1063 RWRTEPFIMITGSGRWIPP+LPT+K P+HEKE+G+TYAAG+S+R E ERTLTD+QRDEFE Sbjct: 437 RWRTEPFIMITGSGRWIPPALPTSKSPEHEKESGTTYAAGRSRRAEPERTLTDSQRDEFE 496 Query: 1062 DMLRALTLERSQIKEAMGFALDNADAAGEVVEVLTESLTLKETSIPTKVARLMLVSDILH 883 DMLRALTLERSQIKEAMGFALDNADAAGE+VEVLTESLTLKET IPTKVARLMLVSD+LH Sbjct: 497 DMLRALTLERSQIKEAMGFALDNADAAGEIVEVLTESLTLKETPIPTKVARLMLVSDVLH 556 Query: 882 NSSAPVKNASAYRTKFEATLPDIMESFNDLYRSVTGRITAEALKERVLKVLQVWADWFLF 703 NSSAPVKNASAYRTKFEATLPDIMESFNDLYRS+TGRITAEALKERVLKVLQVW+DWFLF Sbjct: 557 NSSAPVKNASAYRTKFEATLPDIMESFNDLYRSITGRITAEALKERVLKVLQVWSDWFLF 616 Query: 702 SDAYVNGLRATFLRFSNSGVIPFHSICGDAPELERKAGSAETGDGGKINQDAALAIGKGA 523 SDAYVNGLRATFLR NSGV PFHSICGDAPE+++K S +T D K NQD ALA+GKGA Sbjct: 617 SDAYVNGLRATFLRSGNSGVTPFHSICGDAPEIDKKNNSEDTCDLSKTNQDTALAMGKGA 676 Query: 522 AMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEKQRGFEIDDELKSAHSQS 343 A+KEL NLP+ ELERRCRHNGLSLVGGREMMVARLL LE+AEKQRG+E+DD+LKSAHSQS Sbjct: 677 AIKELMNLPLSELERRCRHNGLSLVGGREMMVARLLSLEDAEKQRGYELDDDLKSAHSQS 736 Query: 342 SSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLPPKDLNLQPDIN--SSEG 169 SSGRY+ G KE N E + +S G +D S GS+ L QP+I + + Sbjct: 737 SSGRYSRGWKETNMEAESMGLSGWNGYEEDEKLSQAVGSVPLGTMLTTPQPEIKAFTKKE 796 Query: 168 KNESILPSSKWAXXXXXXXXXXXXXXXDLALAYSSSGSENAGDVLSKTEELEVTTD 1 KN+ +LP+SKWA L L+YSSSGSENAGD SK ++++ T D Sbjct: 797 KNDPVLPASKWALEDDESDDEQKRSSRGLGLSYSSSGSENAGDGPSKADDVDFTID 852 >ref|XP_006450262.1| hypothetical protein CICLE_v10007357mg [Citrus clementina] gi|567916514|ref|XP_006450263.1| hypothetical protein CICLE_v10007357mg [Citrus clementina] gi|557553488|gb|ESR63502.1| hypothetical protein CICLE_v10007357mg [Citrus clementina] gi|557553489|gb|ESR63503.1| hypothetical protein CICLE_v10007357mg [Citrus clementina] Length = 973 Score = 820 bits (2117), Expect = 0.0 Identities = 416/536 (77%), Positives = 459/536 (85%), Gaps = 2/536 (0%) Frame = -1 Query: 1602 QALPAPPPGHMAIRSKEGATVILSGPSGPPVTTVPSQNSELVLTPNVPDINVAPPDDNHL 1423 QALPAPPPG MAIRSKEGATVILSGPSGPPVTTVPSQNSELVLTPNVPDI V PP+D HL Sbjct: 273 QALPAPPPGQMAIRSKEGATVILSGPSGPPVTTVPSQNSELVLTPNVPDIMVIPPEDRHL 332 Query: 1422 QHVIDTMALYVLDGGCAFEQAIMERGRGNPLFSFLFELGSKEHTYYVWRLYSFAQGDTLQ 1243 +HVIDT+ALYVLDGGCAFEQAIMERGRGNPLF+FLFELGSKEHTYYVWRLYSFAQGDTLQ Sbjct: 333 RHVIDTLALYVLDGGCAFEQAIMERGRGNPLFNFLFELGSKEHTYYVWRLYSFAQGDTLQ 392 Query: 1242 RWRTEPFIMITGSGRWIPPSLPTAKGPDHEKEAGSTYAAGKSKRVEMERTLTDTQRDEFE 1063 RWRTEPFIMITGSGRWIPP+LPT+K P+HEKE+G+TYAAG+S+R E ERTLTD+QRDEFE Sbjct: 393 RWRTEPFIMITGSGRWIPPALPTSKSPEHEKESGTTYAAGRSRRAEPERTLTDSQRDEFE 452 Query: 1062 DMLRALTLERSQIKEAMGFALDNADAAGEVVEVLTESLTLKETSIPTKVARLMLVSDILH 883 DMLRALTLERSQIKEAMGFALDNADAAGE+VEVLTESLTLKET IPTKVARLMLVSD+LH Sbjct: 453 DMLRALTLERSQIKEAMGFALDNADAAGEIVEVLTESLTLKETPIPTKVARLMLVSDVLH 512 Query: 882 NSSAPVKNASAYRTKFEATLPDIMESFNDLYRSVTGRITAEALKERVLKVLQVWADWFLF 703 NSSAPVKNASAYRTKFEATLPDIMESFNDLYRS+TGRITAEALKERVLKVLQVW+DWFLF Sbjct: 513 NSSAPVKNASAYRTKFEATLPDIMESFNDLYRSITGRITAEALKERVLKVLQVWSDWFLF 572 Query: 702 SDAYVNGLRATFLRFSNSGVIPFHSICGDAPELERKAGSAETGDGGKINQDAALAIGKGA 523 SDAYVNGLRATFLR NSGV PFHSICGDAPE+++K S +T D K NQD ALA+GKGA Sbjct: 573 SDAYVNGLRATFLRSGNSGVTPFHSICGDAPEIDKKNNSEDTCDLSKTNQDTALAMGKGA 632 Query: 522 AMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEKQRGFEIDDELKSAHSQS 343 A+KEL NLP+ ELERRCRHNGLSLVGGREMMVARLL LE+AEKQRG+E+DD+LKSAHSQS Sbjct: 633 AIKELMNLPLSELERRCRHNGLSLVGGREMMVARLLSLEDAEKQRGYELDDDLKSAHSQS 692 Query: 342 SSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLPPKDLNLQPDIN--SSEG 169 SSGRY+ G KE N E + +S G +D S GS+ L QP+I + + Sbjct: 693 SSGRYSRGWKETNMEAESMGLSGWNGYEEDEKLSQAVGSVPLGTMLTTPQPEIKAFTKKE 752 Query: 168 KNESILPSSKWAXXXXXXXXXXXXXXXDLALAYSSSGSENAGDVLSKTEELEVTTD 1 KN+ +LP+SKWA L L+YSSSGSENAGD SK ++++ T D Sbjct: 753 KNDPVLPASKWALEDDESDDEQKRSSRGLGLSYSSSGSENAGDGPSKADDVDFTID 808 >ref|XP_002324341.2| RNA recognition motif-containing family protein [Populus trichocarpa] gi|550317898|gb|EEF02906.2| RNA recognition motif-containing family protein [Populus trichocarpa] Length = 969 Score = 818 bits (2113), Expect = 0.0 Identities = 416/538 (77%), Positives = 464/538 (86%), Gaps = 4/538 (0%) Frame = -1 Query: 1602 QALPAPPPGHMAIRSKEGATVILSGPSGPPVTTVPSQNSELVLTPNVPDINVAPPDDNHL 1423 QALPAPPPG MAIRSKEGATVILSGPSGPPVT+VP+QNSELVLTPNVPDI VAPP+D+HL Sbjct: 274 QALPAPPPGQMAIRSKEGATVILSGPSGPPVTSVPNQNSELVLTPNVPDIMVAPPEDDHL 333 Query: 1422 QHVIDTMALYVLDGGCAFEQAIMERGRGNPLFSFLFELGSKEHTYYVWRLYSFAQGDTLQ 1243 H+IDTMALYVLDGGCAFEQAIM+RGRGNPLF+FLFELGSKEHTYYVWRLYSFAQGDTLQ Sbjct: 334 HHMIDTMALYVLDGGCAFEQAIMQRGRGNPLFNFLFELGSKEHTYYVWRLYSFAQGDTLQ 393 Query: 1242 RWRTEPFIMITGSGRWIPPSLPTAKGPDHEKEAGSTYAAGKSKRVEMERTLTDTQRDEFE 1063 RWRTEPFIMITGSGRW+PP LPTAK P+HEKE+GSTYAAG+S+RV+ ERTLTD QRDEFE Sbjct: 394 RWRTEPFIMITGSGRWVPPPLPTAKSPEHEKESGSTYAAGRSRRVDSERTLTDPQRDEFE 453 Query: 1062 DMLRALTLERSQIKEAMGFALDNADAAGEVVEVLTESLTLKETSIPTKVARLMLVSDILH 883 DMLRALTLERSQIK+AMGF+LDNADAAGEVVEVLTESLTLKET IPTKVARLMLVSDILH Sbjct: 454 DMLRALTLERSQIKDAMGFSLDNADAAGEVVEVLTESLTLKETPIPTKVARLMLVSDILH 513 Query: 882 NSSAPVKNASAYRTKFEATLPDIMESFNDLYRSVTGRITAEALKERVLKVLQVWADWFLF 703 NSSAPVKNASAYRTKFEA LPDIMESFNDLYRS+TGRITAEALKERVLKVLQVW+DWFLF Sbjct: 514 NSSAPVKNASAYRTKFEAALPDIMESFNDLYRSITGRITAEALKERVLKVLQVWSDWFLF 573 Query: 702 SDAYVNGLRATFLRFSNSGVIPFHSICGDAPELERKAGSAETGDGGKINQDAALAIGKGA 523 SDAYVNGLRATFLR SNSGVIPFHSICGDAPE+E+K+ S + +G KINQDAALA+GKGA Sbjct: 574 SDAYVNGLRATFLRSSNSGVIPFHSICGDAPEIEKKSSSEDAVEGAKINQDAALAMGKGA 633 Query: 522 AMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEKQRGFEIDDELKSAHSQS 343 A+KEL NLP+ ELERRCRHNGLSLVGGREMMVARLL LEEAE+QRG+E+DD+LK A S S Sbjct: 634 AVKELMNLPLAELERRCRHNGLSLVGGREMMVARLLSLEEAERQRGYELDDDLKIAQSNS 693 Query: 342 SSGRYTSGQKEPNSEMDMGQISVRGGNM--DDGMPSIGRGSMLLPPKDLNLQPDIN--SS 175 SS RY+S +E N E + + G N+ +D MPS +GS+ + L QP++ + Sbjct: 694 SSSRYSSVHREMNVEAE--PVGSTGWNVYGEDEMPSQNKGSVSVASTLLIKQPELKAFAK 751 Query: 174 EGKNESILPSSKWAXXXXXXXXXXXXXXXDLALAYSSSGSENAGDVLSKTEELEVTTD 1 + KN+ +LP+SKWA DL L+YSSSGSENAGD K +E+E TD Sbjct: 752 KEKNDPVLPASKWARDDDESDDEQKRSARDLGLSYSSSGSENAGDGQGKADEMEFATD 809 >ref|XP_004234429.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like [Solanum lycopersicum] Length = 947 Score = 818 bits (2112), Expect = 0.0 Identities = 421/538 (78%), Positives = 460/538 (85%), Gaps = 4/538 (0%) Frame = -1 Query: 1602 QALPAPPPGHMAIRSKEGATVILSGPSGPPVTTVPSQNSELVLTPNVPDINVAPPDDNHL 1423 QALPAPPPGHMAIRSKEGATVILSGPSGPPVTTVP QNSELVLTPNVPDI V PP+D+HL Sbjct: 267 QALPAPPPGHMAIRSKEGATVILSGPSGPPVTTVPGQNSELVLTPNVPDIMVIPPEDDHL 326 Query: 1422 QHVIDTMALYVLDGGCAFEQAIMERGRGNPLFSFLFELGSKEHTYYVWRLYSFAQGDTLQ 1243 +HVIDTMAL VLDGGCAFEQAIMERGRGNPLFSFLFELGSKEHTYYVWRLYSFAQGDTLQ Sbjct: 327 RHVIDTMALCVLDGGCAFEQAIMERGRGNPLFSFLFELGSKEHTYYVWRLYSFAQGDTLQ 386 Query: 1242 RWRTEPFIMITGSGRWIPPSLPTAKGPDHEKEAGSTYAAGKSKRVEMERTLTDTQRDEFE 1063 RWRT PFIMITGSGRWIPPSLPT KG DHEKEAGSTYAAG+S+RV++ERTLTD QRDEFE Sbjct: 387 RWRTVPFIMITGSGRWIPPSLPTPKGADHEKEAGSTYAAGRSRRVDVERTLTDAQRDEFE 446 Query: 1062 DMLRALTLERSQIKEAMGFALDNADAAGEVVEVLTESLTLKETSIPTKVARLMLVSDILH 883 DMLR+LTLERSQIKEAMGF+LDNADAAGEVVEVLTESLTLKET IPTKV+RLMLVSDILH Sbjct: 447 DMLRSLTLERSQIKEAMGFSLDNADAAGEVVEVLTESLTLKETPIPTKVSRLMLVSDILH 506 Query: 882 NSSAPVKNASAYRTKFEATLPDIMESFNDLYRSVTGRITAEALKERVLKVLQVWADWFLF 703 NSSAPVKNASAYRTKFEA+LPDIMESFNDLYRS+TGRITAEALKERVLKVLQVWADWFLF Sbjct: 507 NSSAPVKNASAYRTKFEASLPDIMESFNDLYRSITGRITAEALKERVLKVLQVWADWFLF 566 Query: 702 SDAYVNGLRATFLRFSNSGVIPFHSICGDAPELERKAGSAETGDGGKINQDAALAIGKGA 523 SDAYVNGLRATFLR NSGV PFHS+CGDAP++E++ S + GDGGK+N D ALAIGKGA Sbjct: 567 SDAYVNGLRATFLRTGNSGVTPFHSLCGDAPDVEQRTSSDDAGDGGKVNPDGALAIGKGA 626 Query: 522 AMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEKQRGFEIDDELKSAHSQS 343 AMKEL +LP+ ELERRCRHNGLS+VGGREMMVARLLYLEEAEKQRG E+D++LK A S S Sbjct: 627 AMKELLSLPLTELERRCRHNGLSIVGGREMMVARLLYLEEAEKQRGHELDEDLKFA-SHS 685 Query: 342 SSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLPPKDLNLQPDIN----SS 175 SS R+ S +K+ N E+D S R MD + R S + +N P N SS Sbjct: 686 SSARFPSTRKDSNLELDRMAPSERNSQMDYDVQLKQRES--VSSHQINSAPHYNSIDFSS 743 Query: 174 EGKNESILPSSKWAXXXXXXXXXXXXXXXDLALAYSSSGSENAGDVLSKTEELEVTTD 1 +GK+E+ILP+SKWA DL L YSSSGSENAGD LSK ++ E+TTD Sbjct: 744 DGKSETILPTSKWAREDDESDDEQKRSSRDLGLTYSSSGSENAGDGLSKIKDAELTTD 801 >gb|EYU29204.1| hypothetical protein MIMGU_mgv1a000894mg [Mimulus guttatus] Length = 949 Score = 813 bits (2100), Expect = 0.0 Identities = 428/543 (78%), Positives = 458/543 (84%), Gaps = 9/543 (1%) Frame = -1 Query: 1602 QALPAPPPGHMAIRSKEGATVILSGPSGP---PVTTVPSQNSELVLTPNVPDINVAPPDD 1432 QALPAPPPG MAIRSKEGATVILSGPSGP PV ++P NSELVLTPNVPDI V PPDD Sbjct: 268 QALPAPPPGQMAIRSKEGATVILSGPSGPSGPPVNSIPGHNSELVLTPNVPDIKVVPPDD 327 Query: 1431 NHLQHVIDTMALYVLDGGCAFEQAIMERGRGNPLFSFLFELGSKEHTYYVWRLYSFAQGD 1252 NH++HVIDTMALYVLDGGCAFEQAIMERGRGNPLFSFLFELGS+ H+YYVWRLYSFAQGD Sbjct: 328 NHVRHVIDTMALYVLDGGCAFEQAIMERGRGNPLFSFLFELGSEGHSYYVWRLYSFAQGD 387 Query: 1251 TLQRWRTEPFIMITGSGRWIPPSLPTAKGPDHEKEAGSTYAAGKSKRVEMERTLTDTQRD 1072 TLQRWRTEPFIMITGSGRWIPPSLPTAKGP+HEKE G TYAAGKSKRVEMERTLTD QRD Sbjct: 388 TLQRWRTEPFIMITGSGRWIPPSLPTAKGPEHEKEGGGTYAAGKSKRVEMERTLTDAQRD 447 Query: 1071 EFEDMLRALTLERSQIKEAMGFALDNADAAGEVVEVLTESLTLKETSIPTKVARLMLVSD 892 EFEDMLRALTLERSQIKEAMGFALDNADAAGEVVEVLTESLTLKET IPTKVARLMLVSD Sbjct: 448 EFEDMLRALTLERSQIKEAMGFALDNADAAGEVVEVLTESLTLKETPIPTKVARLMLVSD 507 Query: 891 ILHNSSAPVKNASAYRTKFEATLPDIMESFNDLYRSVTGRITAEALKERVLKVLQVWADW 712 ILHNSSAPVKNASAYRTKFEAT+PDIMESFNDLYRSVTGR+TAEALKERVLKVLQVWADW Sbjct: 508 ILHNSSAPVKNASAYRTKFEATIPDIMESFNDLYRSVTGRMTAEALKERVLKVLQVWADW 567 Query: 711 FLFSDAYVNGLRATFLRFSNSGVIPFHSICGDAPELERKAGSAETGDGGKIN--QDAALA 538 FLFSDAYVNGLRATF+R +SGV FHSICGDAPELERK GSA+ G G KIN QDAALA Sbjct: 568 FLFSDAYVNGLRATFIRSGSSGVTTFHSICGDAPELERKPGSADHGQGEKINHGQDAALA 627 Query: 537 IGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEKQRGFEIDDELKS 358 IGKGAAMKEL LP+ ELERRCRHNGLSLVGGRE MVARLLYLEEAEKQRG EIDDELKS Sbjct: 628 IGKGAAMKELLTLPLNELERRCRHNGLSLVGGRETMVARLLYLEEAEKQRGSEIDDELKS 687 Query: 357 AHSQSSSGRYTSGQKEPNSEMDMGQISVRGGN---MDDGMPSIGRGSMLLPPKDLNLQPD 187 SQ SGRY SGQ+E S+ + G G N +D+ +P + G++ LPP D + Sbjct: 688 GRSQLGSGRYQSGQRE--SKFEAGPAETSGWNSSRVDEMVPKV-TGAVFLPPSD--QKEL 742 Query: 186 INSSEGKNESILPSSKWAXXXXXXXXXXXXXXXDLALAYSSSGSENAGDV-LSKTEELEV 10 IN+ +G +ESILP+SKWA +L L YSSSGS+ AGD KTEE + Sbjct: 743 INARDGGSESILPASKWARENEESDDENERSTKELGLTYSSSGSDMAGDSDPYKTEERGI 802 Query: 9 TTD 1 T D Sbjct: 803 TND 805 >ref|XP_006353899.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like isoform X3 [Solanum tuberosum] Length = 857 Score = 812 bits (2097), Expect = 0.0 Identities = 420/538 (78%), Positives = 458/538 (85%), Gaps = 4/538 (0%) Frame = -1 Query: 1602 QALPAPPPGHMAIRSKEGATVILSGPSGPPVTTVPSQNSELVLTPNVPDINVAPPDDNHL 1423 QALPAPPPGHMAIRSKEGATVILSGPSGPPVTTVP QNSELVLTPNVPDI V PP+D+HL Sbjct: 177 QALPAPPPGHMAIRSKEGATVILSGPSGPPVTTVPGQNSELVLTPNVPDIMVIPPEDDHL 236 Query: 1422 QHVIDTMALYVLDGGCAFEQAIMERGRGNPLFSFLFELGSKEHTYYVWRLYSFAQGDTLQ 1243 +HVIDTMAL VLDGGCAFEQAIMERGRGNPLFSFLFELGSKEHTYYVWRLYSFAQGDTLQ Sbjct: 237 RHVIDTMALCVLDGGCAFEQAIMERGRGNPLFSFLFELGSKEHTYYVWRLYSFAQGDTLQ 296 Query: 1242 RWRTEPFIMITGSGRWIPPSLPTAKGPDHEKEAGSTYAAGKSKRVEMERTLTDTQRDEFE 1063 RWRT PFIMITGSGRWIPPSL T KG DHEKEAGSTYAAG+S+RVE+ERTLTD QRDEFE Sbjct: 297 RWRTVPFIMITGSGRWIPPSLSTPKGADHEKEAGSTYAAGRSRRVEVERTLTDAQRDEFE 356 Query: 1062 DMLRALTLERSQIKEAMGFALDNADAAGEVVEVLTESLTLKETSIPTKVARLMLVSDILH 883 DMLR+LTLERSQIK AMGF+LDNADAAGEVVEVLTESLTLKET IPTKV+RLMLVSDILH Sbjct: 357 DMLRSLTLERSQIKAAMGFSLDNADAAGEVVEVLTESLTLKETPIPTKVSRLMLVSDILH 416 Query: 882 NSSAPVKNASAYRTKFEATLPDIMESFNDLYRSVTGRITAEALKERVLKVLQVWADWFLF 703 NSSAPVKNASAYRTKFEA+LPDIMESFNDLYRS+TGRITAEALKERVLKVLQVWADWFLF Sbjct: 417 NSSAPVKNASAYRTKFEASLPDIMESFNDLYRSITGRITAEALKERVLKVLQVWADWFLF 476 Query: 702 SDAYVNGLRATFLRFSNSGVIPFHSICGDAPELERKAGSAETGDGGKINQDAALAIGKGA 523 SDAYVNGLRATFLR NSGV PFHS+CGDAP++E++A S + GDGGKIN D ALAIGKGA Sbjct: 477 SDAYVNGLRATFLRTGNSGVTPFHSLCGDAPDVEQRASSDDAGDGGKINPDGALAIGKGA 536 Query: 522 AMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEKQRGFEIDDELKSAHSQS 343 AMKEL +LP+ ELERRCRHNGLS+VGGREMMVARLLYLEEAEKQRG E+D++LK A S S Sbjct: 537 AMKELLSLPLTELERRCRHNGLSIVGGREMMVARLLYLEEAEKQRGHELDEDLKFA-SHS 595 Query: 342 SSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLPPKDLNLQPDIN----SS 175 SS R+ S +K+ N E+D S R +D + R S + N P N SS Sbjct: 596 SSARFPSTRKDSNLELDRMAPSERNSQVDYDVQLKQRES--VSSHQTNSAPHYNSIDFSS 653 Query: 174 EGKNESILPSSKWAXXXXXXXXXXXXXXXDLALAYSSSGSENAGDVLSKTEELEVTTD 1 EGK+E+ILP+SKWA DL L YSSSGSENAGD ++K ++ E+TTD Sbjct: 654 EGKSETILPTSKWAREDDESDDEQKRSSRDLGLTYSSSGSENAGDGINKIKDAELTTD 711 >ref|XP_006353898.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like isoform X2 [Solanum tuberosum] Length = 947 Score = 812 bits (2097), Expect = 0.0 Identities = 420/538 (78%), Positives = 458/538 (85%), Gaps = 4/538 (0%) Frame = -1 Query: 1602 QALPAPPPGHMAIRSKEGATVILSGPSGPPVTTVPSQNSELVLTPNVPDINVAPPDDNHL 1423 QALPAPPPGHMAIRSKEGATVILSGPSGPPVTTVP QNSELVLTPNVPDI V PP+D+HL Sbjct: 267 QALPAPPPGHMAIRSKEGATVILSGPSGPPVTTVPGQNSELVLTPNVPDIMVIPPEDDHL 326 Query: 1422 QHVIDTMALYVLDGGCAFEQAIMERGRGNPLFSFLFELGSKEHTYYVWRLYSFAQGDTLQ 1243 +HVIDTMAL VLDGGCAFEQAIMERGRGNPLFSFLFELGSKEHTYYVWRLYSFAQGDTLQ Sbjct: 327 RHVIDTMALCVLDGGCAFEQAIMERGRGNPLFSFLFELGSKEHTYYVWRLYSFAQGDTLQ 386 Query: 1242 RWRTEPFIMITGSGRWIPPSLPTAKGPDHEKEAGSTYAAGKSKRVEMERTLTDTQRDEFE 1063 RWRT PFIMITGSGRWIPPSL T KG DHEKEAGSTYAAG+S+RVE+ERTLTD QRDEFE Sbjct: 387 RWRTVPFIMITGSGRWIPPSLSTPKGADHEKEAGSTYAAGRSRRVEVERTLTDAQRDEFE 446 Query: 1062 DMLRALTLERSQIKEAMGFALDNADAAGEVVEVLTESLTLKETSIPTKVARLMLVSDILH 883 DMLR+LTLERSQIK AMGF+LDNADAAGEVVEVLTESLTLKET IPTKV+RLMLVSDILH Sbjct: 447 DMLRSLTLERSQIKAAMGFSLDNADAAGEVVEVLTESLTLKETPIPTKVSRLMLVSDILH 506 Query: 882 NSSAPVKNASAYRTKFEATLPDIMESFNDLYRSVTGRITAEALKERVLKVLQVWADWFLF 703 NSSAPVKNASAYRTKFEA+LPDIMESFNDLYRS+TGRITAEALKERVLKVLQVWADWFLF Sbjct: 507 NSSAPVKNASAYRTKFEASLPDIMESFNDLYRSITGRITAEALKERVLKVLQVWADWFLF 566 Query: 702 SDAYVNGLRATFLRFSNSGVIPFHSICGDAPELERKAGSAETGDGGKINQDAALAIGKGA 523 SDAYVNGLRATFLR NSGV PFHS+CGDAP++E++A S + GDGGKIN D ALAIGKGA Sbjct: 567 SDAYVNGLRATFLRTGNSGVTPFHSLCGDAPDVEQRASSDDAGDGGKINPDGALAIGKGA 626 Query: 522 AMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEKQRGFEIDDELKSAHSQS 343 AMKEL +LP+ ELERRCRHNGLS+VGGREMMVARLLYLEEAEKQRG E+D++LK A S S Sbjct: 627 AMKELLSLPLTELERRCRHNGLSIVGGREMMVARLLYLEEAEKQRGHELDEDLKFA-SHS 685 Query: 342 SSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLPPKDLNLQPDIN----SS 175 SS R+ S +K+ N E+D S R +D + R S + N P N SS Sbjct: 686 SSARFPSTRKDSNLELDRMAPSERNSQVDYDVQLKQRES--VSSHQTNSAPHYNSIDFSS 743 Query: 174 EGKNESILPSSKWAXXXXXXXXXXXXXXXDLALAYSSSGSENAGDVLSKTEELEVTTD 1 EGK+E+ILP+SKWA DL L YSSSGSENAGD ++K ++ E+TTD Sbjct: 744 EGKSETILPTSKWAREDDESDDEQKRSSRDLGLTYSSSGSENAGDGINKIKDAELTTD 801 >ref|XP_006353897.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like isoform X1 [Solanum tuberosum] Length = 948 Score = 812 bits (2097), Expect = 0.0 Identities = 420/538 (78%), Positives = 458/538 (85%), Gaps = 4/538 (0%) Frame = -1 Query: 1602 QALPAPPPGHMAIRSKEGATVILSGPSGPPVTTVPSQNSELVLTPNVPDINVAPPDDNHL 1423 QALPAPPPGHMAIRSKEGATVILSGPSGPPVTTVP QNSELVLTPNVPDI V PP+D+HL Sbjct: 268 QALPAPPPGHMAIRSKEGATVILSGPSGPPVTTVPGQNSELVLTPNVPDIMVIPPEDDHL 327 Query: 1422 QHVIDTMALYVLDGGCAFEQAIMERGRGNPLFSFLFELGSKEHTYYVWRLYSFAQGDTLQ 1243 +HVIDTMAL VLDGGCAFEQAIMERGRGNPLFSFLFELGSKEHTYYVWRLYSFAQGDTLQ Sbjct: 328 RHVIDTMALCVLDGGCAFEQAIMERGRGNPLFSFLFELGSKEHTYYVWRLYSFAQGDTLQ 387 Query: 1242 RWRTEPFIMITGSGRWIPPSLPTAKGPDHEKEAGSTYAAGKSKRVEMERTLTDTQRDEFE 1063 RWRT PFIMITGSGRWIPPSL T KG DHEKEAGSTYAAG+S+RVE+ERTLTD QRDEFE Sbjct: 388 RWRTVPFIMITGSGRWIPPSLSTPKGADHEKEAGSTYAAGRSRRVEVERTLTDAQRDEFE 447 Query: 1062 DMLRALTLERSQIKEAMGFALDNADAAGEVVEVLTESLTLKETSIPTKVARLMLVSDILH 883 DMLR+LTLERSQIK AMGF+LDNADAAGEVVEVLTESLTLKET IPTKV+RLMLVSDILH Sbjct: 448 DMLRSLTLERSQIKAAMGFSLDNADAAGEVVEVLTESLTLKETPIPTKVSRLMLVSDILH 507 Query: 882 NSSAPVKNASAYRTKFEATLPDIMESFNDLYRSVTGRITAEALKERVLKVLQVWADWFLF 703 NSSAPVKNASAYRTKFEA+LPDIMESFNDLYRS+TGRITAEALKERVLKVLQVWADWFLF Sbjct: 508 NSSAPVKNASAYRTKFEASLPDIMESFNDLYRSITGRITAEALKERVLKVLQVWADWFLF 567 Query: 702 SDAYVNGLRATFLRFSNSGVIPFHSICGDAPELERKAGSAETGDGGKINQDAALAIGKGA 523 SDAYVNGLRATFLR NSGV PFHS+CGDAP++E++A S + GDGGKIN D ALAIGKGA Sbjct: 568 SDAYVNGLRATFLRTGNSGVTPFHSLCGDAPDVEQRASSDDAGDGGKINPDGALAIGKGA 627 Query: 522 AMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEKQRGFEIDDELKSAHSQS 343 AMKEL +LP+ ELERRCRHNGLS+VGGREMMVARLLYLEEAEKQRG E+D++LK A S S Sbjct: 628 AMKELLSLPLTELERRCRHNGLSIVGGREMMVARLLYLEEAEKQRGHELDEDLKFA-SHS 686 Query: 342 SSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLPPKDLNLQPDIN----SS 175 SS R+ S +K+ N E+D S R +D + R S + N P N SS Sbjct: 687 SSARFPSTRKDSNLELDRMAPSERNSQVDYDVQLKQRES--VSSHQTNSAPHYNSIDFSS 744 Query: 174 EGKNESILPSSKWAXXXXXXXXXXXXXXXDLALAYSSSGSENAGDVLSKTEELEVTTD 1 EGK+E+ILP+SKWA DL L YSSSGSENAGD ++K ++ E+TTD Sbjct: 745 EGKSETILPTSKWAREDDESDDEQKRSSRDLGLTYSSSGSENAGDGINKIKDAELTTD 802 >ref|XP_007225360.1| hypothetical protein PRUPE_ppa000894mg [Prunus persica] gi|462422296|gb|EMJ26559.1| hypothetical protein PRUPE_ppa000894mg [Prunus persica] Length = 968 Score = 812 bits (2097), Expect = 0.0 Identities = 413/536 (77%), Positives = 459/536 (85%), Gaps = 2/536 (0%) Frame = -1 Query: 1602 QALPAPPPGHMAIRSKEGATVILSGPSGPPVTTVPSQNSELVLTPNVPDINVAPPDDNHL 1423 QALPAPPPGHMAIRSKEGATVILSGPSGPPVT+VPSQNSELVLTPNVPDI V PP+D+HL Sbjct: 274 QALPAPPPGHMAIRSKEGATVILSGPSGPPVTSVPSQNSELVLTPNVPDITVVPPEDDHL 333 Query: 1422 QHVIDTMALYVLDGGCAFEQAIMERGRGNPLFSFLFELGSKEHTYYVWRLYSFAQGDTLQ 1243 +HV+DTMALYVLDGGCAFEQAIMERGRGNPLF+FLFELGSKEHTYYVWRLYSFAQGDTLQ Sbjct: 334 RHVVDTMALYVLDGGCAFEQAIMERGRGNPLFTFLFELGSKEHTYYVWRLYSFAQGDTLQ 393 Query: 1242 RWRTEPFIMITGSGRWIPPSLPTAKGPDHEKEAGSTYAAGKSKRVEMERTLTDTQRDEFE 1063 RWRTEPFIMITGSGRWIPP LPT K P+H KEAG+TYAAG+S+RVE ERTLTD+QRDEFE Sbjct: 394 RWRTEPFIMITGSGRWIPPPLPTVKSPEHGKEAGTTYAAGRSRRVEPERTLTDSQRDEFE 453 Query: 1062 DMLRALTLERSQIKEAMGFALDNADAAGEVVEVLTESLTLKETSIPTKVARLMLVSDILH 883 DMLRALTLERSQIK+AMGFALDNADAAGE+VEVLTESLTLKET IPTKVARLMLVSD+LH Sbjct: 454 DMLRALTLERSQIKDAMGFALDNADAAGEIVEVLTESLTLKETPIPTKVARLMLVSDVLH 513 Query: 882 NSSAPVKNASAYRTKFEATLPDIMESFNDLYRSVTGRITAEALKERVLKVLQVWADWFLF 703 NSSAPVKNASAYRT+FEATLPDIMESFNDLYRS+TGRITAEALKERVLKVLQVW+DWFLF Sbjct: 514 NSSAPVKNASAYRTRFEATLPDIMESFNDLYRSITGRITAEALKERVLKVLQVWSDWFLF 573 Query: 702 SDAYVNGLRATFLRFSNSGVIPFHSICGDAPELERKAGSAETGDGGKINQDAALAIGKGA 523 SDAYVNGLRATFLR NSGV+PFHSICGDAPE+++K S +TGD K NQDAALA+GKGA Sbjct: 574 SDAYVNGLRATFLRSGNSGVVPFHSICGDAPEIDKKITSEDTGDACKTNQDAALAMGKGA 633 Query: 522 AMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEKQRGFEIDDELKSAHSQS 343 AM+EL +LP+ ELERRCRHNGLSLVGGRE MVARLL LEEAEKQRG+E+DD+LK A S S Sbjct: 634 AMRELLSLPLAELERRCRHNGLSLVGGRETMVARLLSLEEAEKQRGYELDDDLKYAQSHS 693 Query: 342 SSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLPPKDLNLQPDIN--SSEG 169 SS RY+S ++E N E D IS + G+GS+ L QP++ + + Sbjct: 694 SSARYSSSRREMNIEPDSMGISAQ-----------GKGSLPLVQTLPIPQPELKALTKKE 742 Query: 168 KNESILPSSKWAXXXXXXXXXXXXXXXDLALAYSSSGSENAGDVLSKTEELEVTTD 1 K++ +LP+SKWA DL L+YSSSGSENAGD SK +E+EV TD Sbjct: 743 KSDPVLPASKWAREDDDSDDEQKRSARDLGLSYSSSGSENAGDGPSKADEMEVATD 798 >emb|CBI21155.3| unnamed protein product [Vitis vinifera] Length = 941 Score = 808 bits (2088), Expect = 0.0 Identities = 411/536 (76%), Positives = 456/536 (85%), Gaps = 2/536 (0%) Frame = -1 Query: 1602 QALPAPPPGHMAIRSKEGATVILSGPSGPPVTTVPSQNSELVLTPNVPDINVAPPDDNHL 1423 QALPAPPPGHMAIRSKEGATVILSGPSGPPVT+VP+QNSELVLTPNVPDI V+PP+D+HL Sbjct: 274 QALPAPPPGHMAIRSKEGATVILSGPSGPPVTSVPNQNSELVLTPNVPDIMVSPPEDDHL 333 Query: 1422 QHVIDTMALYVLDGGCAFEQAIMERGRGNPLFSFLFELGSKEHTYYVWRLYSFAQGDTLQ 1243 HVIDTMALYVLDGGCAFEQAIMERGRGNPLF+FLFELGSKEHTYYVWRLYSFAQGDTLQ Sbjct: 334 HHVIDTMALYVLDGGCAFEQAIMERGRGNPLFNFLFELGSKEHTYYVWRLYSFAQGDTLQ 393 Query: 1242 RWRTEPFIMITGSGRWIPPSLPTAKGPDHEKEAGSTYAAGKSKRVEMERTLTDTQRDEFE 1063 RWRTEPFIMITGSGRW+PP LPT + P+HEKE+G+T+AAG+S+RVE+ERTLTD QRDEFE Sbjct: 394 RWRTEPFIMITGSGRWMPPPLPTVRSPEHEKESGTTFAAGRSRRVELERTLTDPQRDEFE 453 Query: 1062 DMLRALTLERSQIKEAMGFALDNADAAGEVVEVLTESLTLKETSIPTKVARLMLVSDILH 883 DMLRALTLERSQIKEAMGFALDNADAAGE+VEVLTESLTLKET IPTKVARLMLVSD+LH Sbjct: 454 DMLRALTLERSQIKEAMGFALDNADAAGEIVEVLTESLTLKETPIPTKVARLMLVSDVLH 513 Query: 882 NSSAPVKNASAYRTKFEATLPDIMESFNDLYRSVTGRITAEALKERVLKVLQVWADWFLF 703 NSSAPVKNASAYRTKFEATLPDIMESFNDLYRSVTGRITAEALKERV+KVLQVWADWFLF Sbjct: 514 NSSAPVKNASAYRTKFEATLPDIMESFNDLYRSVTGRITAEALKERVMKVLQVWADWFLF 573 Query: 702 SDAYVNGLRATFLRFSNSGVIPFHSICGDAPELERKAGSAETGDGGKINQDAALAIGKGA 523 SDAYVNGLRATFLR NSGV PFHSICGDAPE+E+K S +TG+GGK NQDAALA+GKGA Sbjct: 574 SDAYVNGLRATFLRSGNSGVTPFHSICGDAPEIEKKTSSEDTGEGGKSNQDAALAMGKGA 633 Query: 522 AMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEKQRGFEIDDELKSAHSQS 343 AMKEL +LPI ELERRCRHNGLSLVGGRE+MVARLL LEEAEKQRG+++DD+LK A S S Sbjct: 634 AMKELLSLPIAELERRCRHNGLSLVGGREIMVARLLSLEEAEKQRGYDLDDDLKYAQSHS 693 Query: 342 SSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLPPKDLNLQPDIN--SSEG 169 +SGRY PN + S G+GS+ L P QP++ +++G Sbjct: 694 NSGRY------PNE-----------------IQSQGKGSVPLAPTIPIPQPELKAFTNKG 730 Query: 168 KNESILPSSKWAXXXXXXXXXXXXXXXDLALAYSSSGSENAGDVLSKTEELEVTTD 1 K + +LP+SKWA L L+YSSSGSENAGD SK +E+E T+ Sbjct: 731 KTDPVLPASKWAREDDDSDDEQKRSARGLGLSYSSSGSENAGDGPSKADEMEFATE 786 >gb|EXC01118.1| U2 snRNP-associated SURP motif-containing protein [Morus notabilis] Length = 999 Score = 807 bits (2085), Expect = 0.0 Identities = 412/536 (76%), Positives = 452/536 (84%), Gaps = 2/536 (0%) Frame = -1 Query: 1602 QALPAPPPGHMAIRSKEGATVILSGPSGPPVTTVPSQNSELVLTPNVPDINVAPPDDNHL 1423 QALPAPPPG MAIRSKEGATVILSGPSGPPVT+VPSQNSELVLTPNVPDI V PP+D+HL Sbjct: 292 QALPAPPPGQMAIRSKEGATVILSGPSGPPVTSVPSQNSELVLTPNVPDIMVVPPEDDHL 351 Query: 1422 QHVIDTMALYVLDGGCAFEQAIMERGRGNPLFSFLFELGSKEHTYYVWRLYSFAQGDTLQ 1243 +HVIDTMA+YVLDGGCAFEQAIMERGRGNPLF+FLFELGSKEHTYYVWRLYSFAQGDTLQ Sbjct: 352 RHVIDTMAIYVLDGGCAFEQAIMERGRGNPLFNFLFELGSKEHTYYVWRLYSFAQGDTLQ 411 Query: 1242 RWRTEPFIMITGSGRWIPPSLPTAKGPDHEKEAGSTYAAGKSKRVEMERTLTDTQRDEFE 1063 RWRTEPFIMITGSGRWIPPSLPTAK PD EKE+G+TYAAG+S+RVE ERTLTD+QRDEFE Sbjct: 412 RWRTEPFIMITGSGRWIPPSLPTAKSPDLEKESGATYAAGRSRRVEPERTLTDSQRDEFE 471 Query: 1062 DMLRALTLERSQIKEAMGFALDNADAAGEVVEVLTESLTLKETSIPTKVARLMLVSDILH 883 DMLRALTLERSQIKEAMGFALDNADAAGE+VEVLTESLTLKET IPTKVARLMLVSD+LH Sbjct: 472 DMLRALTLERSQIKEAMGFALDNADAAGEIVEVLTESLTLKETPIPTKVARLMLVSDVLH 531 Query: 882 NSSAPVKNASAYRTKFEATLPDIMESFNDLYRSVTGRITAEALKERVLKVLQVWADWFLF 703 NSSAPVKNASAYRTKFE TLPDIMESFNDLYRS+TGRITAEALKERVLKVLQVWADWFLF Sbjct: 532 NSSAPVKNASAYRTKFEGTLPDIMESFNDLYRSITGRITAEALKERVLKVLQVWADWFLF 591 Query: 702 SDAYVNGLRATFLRFSNSGVIPFHSICGDAPELERKAGSAETGDGGKINQDAALAIGKGA 523 SDAYVNGLRATFLR NSGV PFHSICGDAPE+E+ +TGD GK N+DAALA+GKGA Sbjct: 592 SDAYVNGLRATFLRLGNSGVTPFHSICGDAPEIEKIISFEDTGDAGKTNEDAALAMGKGA 651 Query: 522 AMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEKQRGFEIDDELKSAHSQS 343 AM+EL NLP ELERRCRHNGLSLVGGREMMVARLL LEEAEKQRG+E+D++LK A S Sbjct: 652 AMQELMNLPFAELERRCRHNGLSLVGGREMMVARLLSLEEAEKQRGYELDEDLKYAQGHS 711 Query: 342 SSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLPPKDLNLQPDINS--SEG 169 SSGRY+ G++E N E + S D + S +GS+ L QP++ + Sbjct: 712 SSGRYSGGRRETNVEGEPMGSSGWNHYAGDEIDSQAKGSVPLAQTIPIPQPELKPFVKKE 771 Query: 168 KNESILPSSKWAXXXXXXXXXXXXXXXDLALAYSSSGSENAGDVLSKTEELEVTTD 1 K++ +LP+SKWA L L YSSSGSENAGD SK +E+E D Sbjct: 772 KSDPVLPASKWAREDDDSDDEQKRSSRGLGLGYSSSGSENAGDGPSKADEMESAAD 827 >ref|XP_002515412.1| RNA binding protein, putative [Ricinus communis] gi|223545356|gb|EEF46861.1| RNA binding protein, putative [Ricinus communis] Length = 979 Score = 806 bits (2081), Expect = 0.0 Identities = 412/537 (76%), Positives = 460/537 (85%), Gaps = 3/537 (0%) Frame = -1 Query: 1602 QALPAPPPGHMAIRSKEGATVILSGPSGPPVTTVPSQNSELVLTPNVPDINVAPPDDNHL 1423 QALPAPPPGHMAIRSKEGATVILSGPSGPPVT+VP+ NSELVLTPNVPDI V PPDD+HL Sbjct: 272 QALPAPPPGHMAIRSKEGATVILSGPSGPPVTSVPNHNSELVLTPNVPDIMVVPPDDDHL 331 Query: 1422 QHVIDTMALYVLDGGCAFEQAIMERGRGNPLFSFLFELGSKEHTYYVWRLYSFAQGDTLQ 1243 +HVIDTMALYVLDGGCAFEQAIMERGRGN LF+FLFELGSKEHTYYVWRLYSFAQGDTLQ Sbjct: 332 RHVIDTMALYVLDGGCAFEQAIMERGRGNSLFNFLFELGSKEHTYYVWRLYSFAQGDTLQ 391 Query: 1242 RWRTEPFIMITGSGRWIPPSLPTAKGPDHEKEAGSTYAAGKSKRVEMERTLTDTQRDEFE 1063 RWRTEPFIMITGSGRWIPPSLPTAK P+HEKE+G+TYAAGKS+RV+ ERTLTD QRDEFE Sbjct: 392 RWRTEPFIMITGSGRWIPPSLPTAKSPEHEKESGNTYAAGKSRRVDPERTLTDPQRDEFE 451 Query: 1062 DMLRALTLERSQIKEAMGFALDNADAAGEVVEVLTESLTLKETSIPTKVARLMLVSDILH 883 DMLRALTLERSQIK+AMGFALDNADAAGE+VEVLTESLTLKET IPTKVAR+MLVSDILH Sbjct: 452 DMLRALTLERSQIKDAMGFALDNADAAGEIVEVLTESLTLKETPIPTKVARIMLVSDILH 511 Query: 882 NSSAPVKNASAYRTKFEATLPDIMESFNDLYRSVTGRITAEALKERVLKVLQVWADWFLF 703 NSSAPVKNASAYRTKFEATLPDIMESFNDLYRS+TGRITAEALKERV+KVLQVW+DWFLF Sbjct: 512 NSSAPVKNASAYRTKFEATLPDIMESFNDLYRSITGRITAEALKERVMKVLQVWSDWFLF 571 Query: 702 SDAYVNGLRATFLRFSNSGVIPFHSICGDAPELERKAGSAETGDGGKINQDAALAIGKGA 523 SDAYVNGLRATFLR S SGVIPFHSICGDAP +E+K S +TGDGGK +QDAALA+GKGA Sbjct: 572 SDAYVNGLRATFLRSSTSGVIPFHSICGDAPAIEKKVTSEDTGDGGKTSQDAALAMGKGA 631 Query: 522 AMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEKQRGFEIDDELKSAHSQS 343 AMKEL +LP+ ELERRCRHNGLSLVGGREMMVARLL LEEAEKQRG+E+DD LK + S Sbjct: 632 AMKELLSLPLAELERRCRHNGLSLVGGREMMVARLLSLEEAEKQRGYELDDNLKVSQSHL 691 Query: 342 SSGRYTSGQKEPNSEMD-MGQISVRGGNMDDGMPSIGRGSMLLPPKDL-NLQPDINSSEG 169 SS +++SG++E N E++ + + +V G +D + S R S L + + + + Sbjct: 692 SSSKFSSGRRETNVELEPVSEWNVYG---EDDVQSQSRASASLATFPIPQAELKAFTKKE 748 Query: 168 KNESILPSSKWAXXXXXXXXXXXXXXXDLALAYSSSGSENAGDVLSKT-EELEVTTD 1 KN+ +LP+SKWA L L+YSSSGSENAGD L K +E+E TD Sbjct: 749 KNDPVLPASKWARDDDDSDDEQKRSSRGLGLSYSSSGSENAGDGLGKADDEMEFATD 805 >ref|XP_006599196.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like [Glycine max] Length = 969 Score = 804 bits (2076), Expect = 0.0 Identities = 411/536 (76%), Positives = 455/536 (84%), Gaps = 2/536 (0%) Frame = -1 Query: 1602 QALPAPPPGHMAIRSKEGATVILSGPSGPPVTTVPSQNSELVLTPNVPDINVAPPDDNHL 1423 QALPAPPPGHMAIRSKEG+TVILSGPSGPPVTTVP+QNSELVLTPNVPDI V PP+D+HL Sbjct: 273 QALPAPPPGHMAIRSKEGSTVILSGPSGPPVTTVPNQNSELVLTPNVPDIMVTPPEDDHL 332 Query: 1422 QHVIDTMALYVLDGGCAFEQAIMERGRGNPLFSFLFELGSKEHTYYVWRLYSFAQGDTLQ 1243 +HVIDTMALYVLDGGCAFEQAIMERGRGNPLF+FLF LGSKEHTYYVWRLYSFAQGDTLQ Sbjct: 333 RHVIDTMALYVLDGGCAFEQAIMERGRGNPLFNFLFVLGSKEHTYYVWRLYSFAQGDTLQ 392 Query: 1242 RWRTEPFIMITGSGRWIPPSLPTAKGPDHEKEAGSTYAAGKSKRVEMERTLTDTQRDEFE 1063 RWRTEPFIMITGSGRWIPP LP +K P+HEKE G T+A G+S+RVE ERTLTD QRDEFE Sbjct: 393 RWRTEPFIMITGSGRWIPPPLPMSKSPEHEKEPGPTHAGGRSRRVEPERTLTDAQRDEFE 452 Query: 1062 DMLRALTLERSQIKEAMGFALDNADAAGEVVEVLTESLTLKETSIPTKVARLMLVSDILH 883 DMLRALTLERSQIKEAMGF+LDNADAAGEVVEVLTESLTLKET IPTK+ARLMLVSDILH Sbjct: 453 DMLRALTLERSQIKEAMGFSLDNADAAGEVVEVLTESLTLKETPIPTKIARLMLVSDILH 512 Query: 882 NSSAPVKNASAYRTKFEATLPDIMESFNDLYRSVTGRITAEALKERVLKVLQVWADWFLF 703 NSSAPV+NASAYRTKFEATLPDIMESFNDLYRS+ GRITAEALKERVLKVLQVWADWFLF Sbjct: 513 NSSAPVRNASAYRTKFEATLPDIMESFNDLYRSIMGRITAEALKERVLKVLQVWADWFLF 572 Query: 702 SDAYVNGLRATFLRFSNSGVIPFHSICGDAPELERKAGSAETGDGGKINQDAALAIGKGA 523 SDAYVNGLRATFLR NSGVIPFHSICGDAPE+E+K S + GGK NQDAALA+G+GA Sbjct: 573 SDAYVNGLRATFLRPGNSGVIPFHSICGDAPEIEQKTASEDMVVGGKTNQDAALAMGRGA 632 Query: 522 AMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEKQRGFEIDDELKSAHSQS 343 AMKEL +LP+ ELERRCRHNGLSLVGGREMMVARLL LEEAEKQ+GFE+DDELK AH+Q Sbjct: 633 AMKELMSLPLAELERRCRHNGLSLVGGREMMVARLLSLEEAEKQKGFELDDELKYAHNQV 692 Query: 342 SSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLPPKDLNLQPDIN--SSEG 169 SSG+Y+S Q+E ++E+D +S D+ + S GR S+ L P QP + + + Sbjct: 693 SSGKYSSNQRETSAELDPVGLSAWNHYGDEDIQSQGRSSVPLAPTLPIPQPKLKAFTKKE 752 Query: 168 KNESILPSSKWAXXXXXXXXXXXXXXXDLALAYSSSGSENAGDVLSKTEELEVTTD 1 KN+ +LP+SKWA +L L+YSSSGSEN D L K +E E D Sbjct: 753 KNDPVLPASKWA-REDDESDDEQRSGKNLGLSYSSSGSENVDDGLVKADESESAAD 807 >ref|XP_004291970.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like [Fragaria vesca subsp. vesca] Length = 980 Score = 803 bits (2074), Expect = 0.0 Identities = 408/536 (76%), Positives = 457/536 (85%), Gaps = 2/536 (0%) Frame = -1 Query: 1602 QALPAPPPGHMAIRSKEGATVILSGPSGPPVTTVPSQNSELVLTPNVPDINVAPPDDNHL 1423 QALPAPPPGHMAIRSKEGATVILSGPSGPPVT+VPSQNSELVLTPNVPDI V PP+D+HL Sbjct: 274 QALPAPPPGHMAIRSKEGATVILSGPSGPPVTSVPSQNSELVLTPNVPDITVVPPEDDHL 333 Query: 1422 QHVIDTMALYVLDGGCAFEQAIMERGRGNPLFSFLFELGSKEHTYYVWRLYSFAQGDTLQ 1243 +HVIDTMALYVLDGGCAFEQAIMERGRGNPLF FLFELGSKEHTYYVWRLYSFAQGDTLQ Sbjct: 334 RHVIDTMALYVLDGGCAFEQAIMERGRGNPLFHFLFELGSKEHTYYVWRLYSFAQGDTLQ 393 Query: 1242 RWRTEPFIMITGSGRWIPPSLPTAKGPDHEKEAGSTYAAGKSKRVEMERTLTDTQRDEFE 1063 RWRTEPFIMITGSGRWIPPSLP + P+HEKE+ STYAAG+S+RVE ERTLTD QRDEFE Sbjct: 394 RWRTEPFIMITGSGRWIPPSLPALRSPEHEKESSSTYAAGRSRRVESERTLTDPQRDEFE 453 Query: 1062 DMLRALTLERSQIKEAMGFALDNADAAGEVVEVLTESLTLKETSIPTKVARLMLVSDILH 883 DMLRALTLERSQIK+AMGFALDNADAAGE+VEVLTESLTLKET IPTKVARLMLVSD+LH Sbjct: 454 DMLRALTLERSQIKDAMGFALDNADAAGEIVEVLTESLTLKETPIPTKVARLMLVSDVLH 513 Query: 882 NSSAPVKNASAYRTKFEATLPDIMESFNDLYRSVTGRITAEALKERVLKVLQVWADWFLF 703 NSSAPVKNASAYRTKFEATLPDIMESFNDLYR +TGRITAEALKERVLKVLQVW+DWFLF Sbjct: 514 NSSAPVKNASAYRTKFEATLPDIMESFNDLYRGITGRITAEALKERVLKVLQVWSDWFLF 573 Query: 702 SDAYVNGLRATFLRFSNSGVIPFHSICGDAPELERKAGSAETGDGGKINQDAALAIGKGA 523 SDAYVNGLRATFLR NSGV+PFHS+CGDAP++E+K S + GD K NQDAALA+GKGA Sbjct: 574 SDAYVNGLRATFLRSGNSGVVPFHSVCGDAPDIEKKTTSEDAGD-AKTNQDAALAMGKGA 632 Query: 522 AMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEKQRGFEIDDELKSAHSQS 343 A +EL NLP+ ELERRCRHNGLSLVGGREMMVARLL LEEAEKQRG+E+DD+LK + S Sbjct: 633 ATRELLNLPMAELERRCRHNGLSLVGGREMMVARLLSLEEAEKQRGYELDDDLKYGQNHS 692 Query: 342 SSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLPPKDLNLQPDIN--SSEG 169 SSGR++S +KE N E D +S ++D + S G+ S+ + QP++ +++ Sbjct: 693 SSGRHSSSRKEMNIEPDPLGLSGWNRYVEDEIQSEGKVSLSKAQTHTSPQPELKPFTTKE 752 Query: 168 KNESILPSSKWAXXXXXXXXXXXXXXXDLALAYSSSGSENAGDVLSKTEELEVTTD 1 K++ +LP+SKWA L L+Y SSGSENAGD SK +E+EV TD Sbjct: 753 KSDPVLPASKWAREDDDSDDDQKRSAKGLGLSY-SSGSENAGDGPSKADEMEVATD 807 >ref|XP_002308714.1| RNA recognition motif-containing family protein [Populus trichocarpa] gi|222854690|gb|EEE92237.1| RNA recognition motif-containing family protein [Populus trichocarpa] Length = 988 Score = 801 bits (2068), Expect = 0.0 Identities = 414/548 (75%), Positives = 461/548 (84%), Gaps = 14/548 (2%) Frame = -1 Query: 1602 QALPAPPPGHMAIRSKE----------GATVILSGPSGPPVTTVPSQNSELVLTPNVPDI 1453 QALPAPPPG MAIRSKE GATVILSGPSGPPVT+VP+QNSELVLTPNVPDI Sbjct: 274 QALPAPPPGQMAIRSKEVCYGFLPKPIGATVILSGPSGPPVTSVPNQNSELVLTPNVPDI 333 Query: 1452 NVAPPDDNHLQHVIDTMALYVLDGGCAFEQAIMERGRGNPLFSFLFELGSKEHTYYVWRL 1273 VAPP+D+HL+HVIDTMALYVLDGGCAFEQAIM+RGRGNPLF+FLFELGSKEHTYYVWRL Sbjct: 334 MVAPPEDDHLRHVIDTMALYVLDGGCAFEQAIMQRGRGNPLFNFLFELGSKEHTYYVWRL 393 Query: 1272 YSFAQGDTLQRWRTEPFIMITGSGRWIPPSLPTAKGPDHEKEAGSTYAAGKSKRVEMERT 1093 YSFAQGDTLQRWRTEPFIMITGSGRW+PPSLPTAK P+HEKE+GST+AAG+S+RV+ ERT Sbjct: 394 YSFAQGDTLQRWRTEPFIMITGSGRWVPPSLPTAKSPEHEKESGSTHAAGRSRRVDPERT 453 Query: 1092 LTDTQRDEFEDMLRALTLERSQIKEAMGFALDNADAAGEVVEVLTESLTLKETSIPTKVA 913 LTD QRDEFEDMLRALTLERSQIK+AMGFALDN DAAGEVVEVLTESLTLKET IPTKVA Sbjct: 454 LTDPQRDEFEDMLRALTLERSQIKDAMGFALDNVDAAGEVVEVLTESLTLKETPIPTKVA 513 Query: 912 RLMLVSDILHNSSAPVKNASAYRTKFEATLPDIMESFNDLYRSVTGRITAEALKERVLKV 733 RLMLVSDILHNSSAPVKNASAYRTKFEA LPDIMESFNDLYRS+TGRITAEALKERVLKV Sbjct: 514 RLMLVSDILHNSSAPVKNASAYRTKFEAALPDIMESFNDLYRSITGRITAEALKERVLKV 573 Query: 732 LQVWADWFLFSDAYVNGLRATFLRFSNSGVIPFHSICGDAPELERKAGSAETGDGGKINQ 553 LQVW+DWFLFSDAYVNGLRATFLR SNSGVIPFHS+CGDAPE+E+K + +T DGGK NQ Sbjct: 574 LQVWSDWFLFSDAYVNGLRATFLRSSNSGVIPFHSMCGDAPEIEKKNSTEDTVDGGKTNQ 633 Query: 552 DAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEKQRGFEID 373 DAALA+GKGAA KEL +LP+ ELERRCRHNGLSLVGGRE MVARLL LEEAEKQRG+E+D Sbjct: 634 DAALAMGKGAATKELMDLPLAELERRCRHNGLSLVGGRETMVARLLNLEEAEKQRGYELD 693 Query: 372 DELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNM--DDGMPSIGRGSMLLPPKDLN 199 +LK A S SSS RY+S +E N +D G + + G N+ +D PS + S+ L Sbjct: 694 GDLKIAQSNSSSSRYSSVHREVN--VDPGPVGLTGWNIYGEDDTPSQNKRSVSLVSTLPI 751 Query: 198 LQPDIN--SSEGKNESILPSSKWAXXXXXXXXXXXXXXXDLALAYSSSGSENAGDVLSKT 25 QP++ + + KN+ +LP+SKWA DL L+YSSSGSENAGD K Sbjct: 752 PQPELKAFAKKEKNDPVLPASKWARDDDESDDEQKRSVRDLGLSYSSSGSENAGDGQGKE 811 Query: 24 EELEVTTD 1 +E+E TD Sbjct: 812 DEMEFATD 819 >ref|XP_006585862.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like isoform X3 [Glycine max] gi|571473238|ref|XP_006585863.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like isoform X4 [Glycine max] Length = 874 Score = 800 bits (2067), Expect = 0.0 Identities = 407/536 (75%), Positives = 454/536 (84%), Gaps = 2/536 (0%) Frame = -1 Query: 1602 QALPAPPPGHMAIRSKEGATVILSGPSGPPVTTVPSQNSELVLTPNVPDINVAPPDDNHL 1423 QALPAPPPGHMAIRSKEG+TVILSGPSGPPVT+VP+QNSELVLTPNVPDI V PP+D HL Sbjct: 178 QALPAPPPGHMAIRSKEGSTVILSGPSGPPVTSVPNQNSELVLTPNVPDIMVTPPEDEHL 237 Query: 1422 QHVIDTMALYVLDGGCAFEQAIMERGRGNPLFSFLFELGSKEHTYYVWRLYSFAQGDTLQ 1243 +HVIDTMAL+VLDGGCAFEQAIMERGRGNPLF+FLF LGSKEHTYYVWRLYSFAQGDTLQ Sbjct: 238 RHVIDTMALHVLDGGCAFEQAIMERGRGNPLFNFLFILGSKEHTYYVWRLYSFAQGDTLQ 297 Query: 1242 RWRTEPFIMITGSGRWIPPSLPTAKGPDHEKEAGSTYAAGKSKRVEMERTLTDTQRDEFE 1063 RWRTEPFIMITGSGRWIPP LP +K P+HEKE+GST+A G+S+RVE +RTLTD QRDEFE Sbjct: 298 RWRTEPFIMITGSGRWIPPQLPMSKSPEHEKESGSTHAGGRSRRVEPDRTLTDAQRDEFE 357 Query: 1062 DMLRALTLERSQIKEAMGFALDNADAAGEVVEVLTESLTLKETSIPTKVARLMLVSDILH 883 DMLRALTLERSQIKEAMGF+LDNADAAGE+VEVLTESLTLKET IPTK+ARLMLVSDILH Sbjct: 358 DMLRALTLERSQIKEAMGFSLDNADAAGEIVEVLTESLTLKETPIPTKIARLMLVSDILH 417 Query: 882 NSSAPVKNASAYRTKFEATLPDIMESFNDLYRSVTGRITAEALKERVLKVLQVWADWFLF 703 NSSAPV+NASAYRTKFEATLPDIMESFNDLYRS+ GRITAEALKERVLKVLQVWADWFLF Sbjct: 418 NSSAPVRNASAYRTKFEATLPDIMESFNDLYRSIMGRITAEALKERVLKVLQVWADWFLF 477 Query: 702 SDAYVNGLRATFLRFSNSGVIPFHSICGDAPELERKAGSAETGDGGKINQDAALAIGKGA 523 SDAYVNGLRATFLR NSGVIPFHSICGDAPE+E+ S + GGK NQDAALA+G+GA Sbjct: 478 SDAYVNGLRATFLRPGNSGVIPFHSICGDAPEIEQNTTSKDMVVGGKTNQDAALAMGRGA 537 Query: 522 AMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEKQRGFEIDDELKSAHSQS 343 AMKEL +LP+ ELERRCRHNGLSLVGGREMMVARLL LEEAEKQRGFE+D+ELK AH+Q Sbjct: 538 AMKELMSLPLAELERRCRHNGLSLVGGREMMVARLLSLEEAEKQRGFELDEELKYAHNQV 597 Query: 342 SSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLPPKDLNLQPDIN--SSEG 169 SSG+Y+S Q+E + E D V D+ + S GR S+ L P QP++ + + Sbjct: 598 SSGKYSSNQRETSEEPD----PVWNHYGDEDLQSQGRSSVPLSPTLPIAQPELKAFTKKE 653 Query: 168 KNESILPSSKWAXXXXXXXXXXXXXXXDLALAYSSSGSENAGDVLSKTEELEVTTD 1 KN+ +LP+SKWA ++ L+YSSSGSEN GD L K +E E D Sbjct: 654 KNDPVLPASKWAWEGDESDDEQRRSGKNIGLSYSSSGSENVGDGLVKADESESAAD 709 >ref|XP_006585860.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like isoform X1 [Glycine max] gi|571473234|ref|XP_006585861.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like isoform X2 [Glycine max] Length = 969 Score = 800 bits (2067), Expect = 0.0 Identities = 407/536 (75%), Positives = 454/536 (84%), Gaps = 2/536 (0%) Frame = -1 Query: 1602 QALPAPPPGHMAIRSKEGATVILSGPSGPPVTTVPSQNSELVLTPNVPDINVAPPDDNHL 1423 QALPAPPPGHMAIRSKEG+TVILSGPSGPPVT+VP+QNSELVLTPNVPDI V PP+D HL Sbjct: 273 QALPAPPPGHMAIRSKEGSTVILSGPSGPPVTSVPNQNSELVLTPNVPDIMVTPPEDEHL 332 Query: 1422 QHVIDTMALYVLDGGCAFEQAIMERGRGNPLFSFLFELGSKEHTYYVWRLYSFAQGDTLQ 1243 +HVIDTMAL+VLDGGCAFEQAIMERGRGNPLF+FLF LGSKEHTYYVWRLYSFAQGDTLQ Sbjct: 333 RHVIDTMALHVLDGGCAFEQAIMERGRGNPLFNFLFILGSKEHTYYVWRLYSFAQGDTLQ 392 Query: 1242 RWRTEPFIMITGSGRWIPPSLPTAKGPDHEKEAGSTYAAGKSKRVEMERTLTDTQRDEFE 1063 RWRTEPFIMITGSGRWIPP LP +K P+HEKE+GST+A G+S+RVE +RTLTD QRDEFE Sbjct: 393 RWRTEPFIMITGSGRWIPPQLPMSKSPEHEKESGSTHAGGRSRRVEPDRTLTDAQRDEFE 452 Query: 1062 DMLRALTLERSQIKEAMGFALDNADAAGEVVEVLTESLTLKETSIPTKVARLMLVSDILH 883 DMLRALTLERSQIKEAMGF+LDNADAAGE+VEVLTESLTLKET IPTK+ARLMLVSDILH Sbjct: 453 DMLRALTLERSQIKEAMGFSLDNADAAGEIVEVLTESLTLKETPIPTKIARLMLVSDILH 512 Query: 882 NSSAPVKNASAYRTKFEATLPDIMESFNDLYRSVTGRITAEALKERVLKVLQVWADWFLF 703 NSSAPV+NASAYRTKFEATLPDIMESFNDLYRS+ GRITAEALKERVLKVLQVWADWFLF Sbjct: 513 NSSAPVRNASAYRTKFEATLPDIMESFNDLYRSIMGRITAEALKERVLKVLQVWADWFLF 572 Query: 702 SDAYVNGLRATFLRFSNSGVIPFHSICGDAPELERKAGSAETGDGGKINQDAALAIGKGA 523 SDAYVNGLRATFLR NSGVIPFHSICGDAPE+E+ S + GGK NQDAALA+G+GA Sbjct: 573 SDAYVNGLRATFLRPGNSGVIPFHSICGDAPEIEQNTTSKDMVVGGKTNQDAALAMGRGA 632 Query: 522 AMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEKQRGFEIDDELKSAHSQS 343 AMKEL +LP+ ELERRCRHNGLSLVGGREMMVARLL LEEAEKQRGFE+D+ELK AH+Q Sbjct: 633 AMKELMSLPLAELERRCRHNGLSLVGGREMMVARLLSLEEAEKQRGFELDEELKYAHNQV 692 Query: 342 SSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLPPKDLNLQPDIN--SSEG 169 SSG+Y+S Q+E + E D V D+ + S GR S+ L P QP++ + + Sbjct: 693 SSGKYSSNQRETSEEPD----PVWNHYGDEDLQSQGRSSVPLSPTLPIAQPELKAFTKKE 748 Query: 168 KNESILPSSKWAXXXXXXXXXXXXXXXDLALAYSSSGSENAGDVLSKTEELEVTTD 1 KN+ +LP+SKWA ++ L+YSSSGSEN GD L K +E E D Sbjct: 749 KNDPVLPASKWAWEGDESDDEQRRSGKNIGLSYSSSGSENVGDGLVKADESESAAD 804 >ref|XP_004138695.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like [Cucumis sativus] gi|449493301|ref|XP_004159248.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like [Cucumis sativus] Length = 961 Score = 798 bits (2061), Expect = 0.0 Identities = 405/536 (75%), Positives = 450/536 (83%), Gaps = 2/536 (0%) Frame = -1 Query: 1602 QALPAPPPGHMAIRSKEGATVILSGPSGPPVTTVPSQNSELVLTPNVPDINVAPPDDNHL 1423 QALPAPPPGHMAIRSKEG TVILSG SGPPVT+VP+QNSELVLTPN+PDI V PP+D+HL Sbjct: 273 QALPAPPPGHMAIRSKEGGTVILSGSSGPPVTSVPNQNSELVLTPNIPDITVEPPEDDHL 332 Query: 1422 QHVIDTMALYVLDGGCAFEQAIMERGRGNPLFSFLFELGSKEHTYYVWRLYSFAQGDTLQ 1243 +HVIDTMALYVLDGGC FEQAIMERGRGNPLF+FLFELGSKEHTYYVWRLYSFAQGDTLQ Sbjct: 333 RHVIDTMALYVLDGGCVFEQAIMERGRGNPLFNFLFELGSKEHTYYVWRLYSFAQGDTLQ 392 Query: 1242 RWRTEPFIMITGSGRWIPPSLPTAKGPDHEKEAGSTYAAGKSKRVEMERTLTDTQRDEFE 1063 RWRTEPFIMITGSGRW+PP LPTAK P+ EKE+G TYAAG+S+R+E+ERTLTD+QRDEFE Sbjct: 393 RWRTEPFIMITGSGRWVPPPLPTAKSPELEKESGPTYAAGRSRRMELERTLTDSQRDEFE 452 Query: 1062 DMLRALTLERSQIKEAMGFALDNADAAGEVVEVLTESLTLKETSIPTKVARLMLVSDILH 883 DMLRALTLERSQIKEAMGFALDNADAAGE+VEVLTESLTL+ET IPTKVARLMLVSDILH Sbjct: 453 DMLRALTLERSQIKEAMGFALDNADAAGEIVEVLTESLTLRETPIPTKVARLMLVSDILH 512 Query: 882 NSSAPVKNASAYRTKFEATLPDIMESFNDLYRSVTGRITAEALKERVLKVLQVWADWFLF 703 NSSAPVKNASAYRTKFEATLPDI+ESFNDLYRS+TGRITAEALKERVLK+LQVW+DWFLF Sbjct: 513 NSSAPVKNASAYRTKFEATLPDIIESFNDLYRSITGRITAEALKERVLKLLQVWSDWFLF 572 Query: 702 SDAYVNGLRATFLRFSNSGVIPFHSICGDAPELERKAGSAETGDGGKINQDAALAIGKGA 523 SDAYVNGLRATFLR NSGVIPFHS+CGDAPE+ERKA ++GDG KINQDA LA+GKG Sbjct: 573 SDAYVNGLRATFLRLGNSGVIPFHSLCGDAPEIERKANCDDSGDGSKINQDAELAMGKGG 632 Query: 522 AMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEKQRGFEIDDELKSAHSQS 343 AMKEL NLP ELERRCRHNGLSLVGGREMMVARLL LEEAEK G+E+D++LK +S S Sbjct: 633 AMKELMNLPFGELERRCRHNGLSLVGGREMMVARLLSLEEAEKLSGYELDEDLK--YSNS 690 Query: 342 SSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLPPKDLNLQPDINS--SEG 169 SGRY+S +E E + S DD GS+ L QP++ G Sbjct: 691 HSGRYSSSSRETKVERGPAETSGWSRFGDDEADFQRMGSVPLAQTLSIPQPELKGFIKSG 750 Query: 168 KNESILPSSKWAXXXXXXXXXXXXXXXDLALAYSSSGSENAGDVLSKTEELEVTTD 1 KN+ +LP+SKWA L L+YSSSGSENAGD SK +E+E+TT+ Sbjct: 751 KNDPVLPASKWAREDDESDSEQKGGTRGLGLSYSSSGSENAGDGPSKADEMEITTE 806 >ref|XP_007011692.1| RNA recognition motif-containing protein isoform 2 [Theobroma cacao] gi|508782055|gb|EOY29311.1| RNA recognition motif-containing protein isoform 2 [Theobroma cacao] Length = 733 Score = 796 bits (2055), Expect = 0.0 Identities = 406/536 (75%), Positives = 456/536 (85%), Gaps = 2/536 (0%) Frame = -1 Query: 1602 QALPAPPPGHMAIRSKEGATVILSGPSGPPVTTVPSQNSELVLTPNVPDINVAPPDDNHL 1423 QALPAPPPGHMAIRSKEG ++ILSGPSGPPVT+VP+QNSELVLTPNVPDI VAPP+D+H+ Sbjct: 179 QALPAPPPGHMAIRSKEGGSIILSGPSGPPVTSVPNQNSELVLTPNVPDIMVAPPEDSHV 238 Query: 1422 QHVIDTMALYVLDGGCAFEQAIMERGRGNPLFSFLFELGSKEHTYYVWRLYSFAQGDTLQ 1243 +HVIDTMALYVLDGGCAFEQAIMERGRGNPLF+FLF LGSKEHTYYVWRLYSFAQGDTLQ Sbjct: 239 RHVIDTMALYVLDGGCAFEQAIMERGRGNPLFNFLFVLGSKEHTYYVWRLYSFAQGDTLQ 298 Query: 1242 RWRTEPFIMITGSGRWIPPSLPTAKGPDHEKEAGSTYAAGKSKRVEMERTLTDTQRDEFE 1063 RWRTEPFIMITGSGRW+PP LPT K P+HEK++ +TYAAG+S+RVE ERTLTD QRDEFE Sbjct: 299 RWRTEPFIMITGSGRWVPPPLPTTKSPEHEKDSTATYAAGRSRRVEPERTLTDPQRDEFE 358 Query: 1062 DMLRALTLERSQIKEAMGFALDNADAAGEVVEVLTESLTLKETSIPTKVARLMLVSDILH 883 DMLRALTLERS IKEAMGFALDNADAAGE+VEVLTESLTLKET IPTKVARLMLVSDILH Sbjct: 359 DMLRALTLERSLIKEAMGFALDNADAAGEIVEVLTESLTLKETPIPTKVARLMLVSDILH 418 Query: 882 NSSAPVKNASAYRTKFEATLPDIMESFNDLYRSVTGRITAEALKERVLKVLQVWADWFLF 703 NSSAPVKNASAYRTKFEATLPDIMESFNDLYRSVTGRITAEALKERVLKVLQVW+DWFLF Sbjct: 419 NSSAPVKNASAYRTKFEATLPDIMESFNDLYRSVTGRITAEALKERVLKVLQVWSDWFLF 478 Query: 702 SDAYVNGLRATFLRFSNSGVIPFHSICGDAPELERKAGSAETGDGGKINQDAALAIGKGA 523 SDAYVNGLRATFLR NSGV PFHSICGDAPE+E+ S + GDG K NQDAALA+GKGA Sbjct: 479 SDAYVNGLRATFLRSGNSGVAPFHSICGDAPEIEKNTSSEDAGDGIKGNQDAALAMGKGA 538 Query: 522 AMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEKQRGFEIDDELKSAHSQS 343 AM+EL +LP+ ELERRCRHNGLSLVGGRE+MVARLL LE+AEKQR +E+DD+LK A S+S Sbjct: 539 AMRELMDLPLAELERRCRHNGLSLVGGREIMVARLLSLEDAEKQRSYELDDDLKLAQSRS 598 Query: 342 SSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLPPKDLNLQPDINS--SEG 169 SS RY+SGQ++ N+E + +S D+ + S +GS+ L QP+I + + Sbjct: 599 SSCRYSSGQRDINAEAEPVGLSGWTHYADNEIHSQRKGSVPLAETLPIPQPEIKAFLKKE 658 Query: 168 KNESILPSSKWAXXXXXXXXXXXXXXXDLALAYSSSGSENAGDVLSKTEELEVTTD 1 K + +LP+SKW+ L L+YSSSGSENAGD SK +ELE TD Sbjct: 659 KIDPVLPASKWSREDDDSDDEEKRSTRGLGLSYSSSGSENAGDGTSKADELEFGTD 714