BLASTX nr result
ID: Mentha28_contig00016388
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00016388 (1243 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gb|EYU29204.1| hypothetical protein MIMGU_mgv1a000894mg [Mimulus...   406   e-111
ref|XP_006483724.1| PREDICTED: LOW QUALITY PROTEIN: U2 snRNP-ass...   385   e-104
ref|XP_006450262.1| hypothetical protein CICLE_v10007357mg [Citr...   385   e-104
ref|XP_007225360.1| hypothetical protein PRUPE_ppa000894mg [Prun...   383   e-104
ref|XP_004234429.1| PREDICTED: U2 snRNP-associated SURP motif-co...   380   e-103
ref|XP_006585862.1| PREDICTED: U2 snRNP-associated SURP motif-co...   379   e-102
ref|XP_006585860.1| PREDICTED: U2 snRNP-associated SURP motif-co...   379   e-102
gb|EXC01118.1| U2 snRNP-associated SURP motif-containing protein...   378   e-102
ref|XP_002324341.2| RNA recognition motif-containing family prot...   378   e-102
ref|XP_002308714.1| RNA recognition motif-containing family prot...   378   e-102
ref|XP_006599196.1| PREDICTED: U2 snRNP-associated SURP motif-co...   377   e-102
emb|CAN79213.1| hypothetical protein VITISV_025939 [Vitis vinifera]   377   e-102
ref|XP_006353899.1| PREDICTED: U2 snRNP-associated SURP motif-co...   375   e-101
ref|XP_006353898.1| PREDICTED: U2 snRNP-associated SURP motif-co...   375   e-101
ref|XP_006353897.1| PREDICTED: U2 snRNP-associated SURP motif-co...   375   e-101
ref|XP_004291970.1| PREDICTED: U2 snRNP-associated SURP motif-co...   372   e-100
ref|XP_007011694.1| RNA recognition motif-containing protein iso...   372   e-100
ref|XP_007011693.1| RNA recognition motif-containing protein iso...   372   e-100
ref|XP_007011691.1| RNA recognition motif-containing protein iso...   372   e-100
ref|XP_002515412.1| RNA binding protein, putative [Ricinus commu...   371   e-100

>gb|EYU29204.1| hypothetical protein MIMGU_mgv1a000894mg [Mimulus guttatus]
          Length = 949

 Score = 406 bits (1044), Expect = e-111
 Identities = 225/337 (66%), Positives = 257/337 (76%), Gaps = 6/337 (1%)
 Frame = +1

Query: 1   KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180
           KERV+KVLQVWADWFLFSDAYVNGLRATF+R  +SGV  FHSICGDAPELERK GSA+ G
Sbjct: 554 KERVLKVLQVWADWFLFSDAYVNGLRATFIRSGSSGVTTFHSICGDAPELERKPGSADHG 613

Query: 181 DAGKIN--QDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEA 354
              KIN  QDAALAIGKGAAMKEL  LP+ ELERRCRHNGLSLVGGRE MVARLLYLEEA
Sbjct: 614 QGEKINHGQDAALAIGKGAAMKELLTLPLNELERRCRHNGLSLVGGRETMVARLLYLEEA 673

Query: 355 EKQRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGN---MDDGMPSIGRG 525
           EKQRG EIDDELKS  SQ  SGRY SGQ+E  S+ + G     G N   +D+ +P +  G
Sbjct: 674 EKQRGSEIDDELKSGRSQLGSGRYQSGQRE--SKFEAGPAETSGWNSSRVDEMVPKV-TG 730

Query: 526 SMLLPPKDLNLQPDINSSEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSEN 705
           ++ LPP D   +  IN+ +G +ESILP+SKWA                L L YSSSGS+
Sbjct: 731 AVFLPPSD--QKELINARDGGSESILPASKWARENEESDDENERSTKELGLTYSSSGSDM 788

Query: 706 AGDV-LSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEE 882
           AGD    KTEE  +T DA NS ++DGGMNEEQRQKLRRLEVALMEYRESLEE+G+K+ +E
Sbjct: 789 AGDSDPYKTEERGITNDATNSAYVDGGMNEEQRQKLRRLEVALMEYRESLEERGLKNSDE 848

Query: 883 IERKVAARRSRLQAEYGLVNSDADASGRKKSSLERGG 993
           IE+KVA  RSRLQAEYGL++S+ADASGRKKSSL+  G
Sbjct: 849 IEKKVAIHRSRLQAEYGLLDSNADASGRKKSSLDGRG 885

>ref|XP_006483724.1| PREDICTED: LOW QUALITY PROTEIN: U2 snRNP-associated SURP motif-containing protein-like [Citrus sinensis]
          Length = 1017

 Score = 385 bits (989), Expect = e-104
 Identities = 203/326 (62%), Positives = 242/326 (74%), Gaps = 2/326 (0%)
 Frame = +1

Query: 1   KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180
           KERV+KVLQVW+DWFLFSDAYVNGLRATFLR  NSGV PFHSICGDAPE+++K  S +T
Sbjct: 600
KERVLKVLQVWSDWFLFSDAYVNGLRATFLRSGNSGVTPFHSICGDAPEIDKKNNSEDTC 659 Query: 181 DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360 D K NQD ALA+GKGAA+KEL NLP+ ELERRCRHNGLSLVGGREMMVARLL LE+AEK Sbjct: 660 DLSKTNQDTALAMGKGAAIKELMNLPLSELERRCRHNGLSLVGGREMMVARLLSLEDAEK 719 Query: 361 QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLP 540 QRG+E+DD+LKSAHSQSSSGRY+ G KE N E + +S G +D S GS+ L Sbjct: 720 QRGYELDDDLKSAHSQSSSGRYSRGWKETNMEAESMGLSGWNGYEEDEKLSQAVGSVPLG 779 Query: 541 PKDLNLQPDIN--SSEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENAGD 714 QP+I + + KN+ +LP+SKWA L L+YSSSGSENAGD Sbjct: 780 TMLTTPQPEIKAFTKKEKNDPVLPASKWALEDDESDDEQKRSSRGLGLSYSSSGSENAGD 839 Query: 715 VLSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIERK 894 SK ++++ T DA+ V D GMNEEQRQKLRRLEV+L+EYRESLEE+GIK EEIE+K Sbjct: 840 GPSKADDVDFTIDASIPVQPDSGMNEEQRQKLRRLEVSLIEYRESLEERGIKSSEEIEKK 899 Query: 895 VAARRSRLQAEYGLVNSDADASGRKK 972 VA R RL++EYGL + + D SG K+ Sbjct: 900 VAIHRKRLESEYGLADPNEDVSGNKR 925 >ref|XP_006450262.1| hypothetical protein CICLE_v10007357mg [Citrus clementina] gi|567916514|ref|XP_006450263.1| hypothetical protein CICLE_v10007357mg [Citrus clementina] gi|557553488|gb|ESR63502.1| hypothetical protein CICLE_v10007357mg [Citrus clementina] gi|557553489|gb|ESR63503.1| hypothetical protein CICLE_v10007357mg [Citrus clementina] Length = 973 Score = 385 bits (989), Expect = e-104 Identities = 203/326 (62%), Positives = 242/326 (74%), Gaps = 2/326 (0%) Frame = +1 Query: 1 KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180 KERV+KVLQVW+DWFLFSDAYVNGLRATFLR NSGV PFHSICGDAPE+++K S +T Sbjct: 556 KERVLKVLQVWSDWFLFSDAYVNGLRATFLRSGNSGVTPFHSICGDAPEIDKKNNSEDTC 615 Query: 181 DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360 D K NQD ALA+GKGAA+KEL NLP+ ELERRCRHNGLSLVGGREMMVARLL LE+AEK Sbjct: 616 DLSKTNQDTALAMGKGAAIKELMNLPLSELERRCRHNGLSLVGGREMMVARLLSLEDAEK 675 Query: 361 QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLP 540 QRG+E+DD+LKSAHSQSSSGRY+ G KE N E 
+ +S G +D S GS+ L Sbjct: 676 QRGYELDDDLKSAHSQSSSGRYSRGWKETNMEAESMGLSGWNGYEEDEKLSQAVGSVPLG 735 Query: 541 PKDLNLQPDIN--SSEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENAGD 714 QP+I + + KN+ +LP+SKWA L L+YSSSGSENAGD Sbjct: 736 TMLTTPQPEIKAFTKKEKNDPVLPASKWALEDDESDDEQKRSSRGLGLSYSSSGSENAGD 795 Query: 715 VLSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIERK 894 SK ++++ T DA+ V D GMNEEQRQKLRRLEV+L+EYRESLEE+GIK EEIE+K Sbjct: 796 GPSKADDVDFTIDASIPVQPDSGMNEEQRQKLRRLEVSLIEYRESLEERGIKSSEEIEKK 855 Query: 895 VAARRSRLQAEYGLVNSDADASGRKK 972 VA R RL++EYGL + + D SG K+ Sbjct: 856 VAIHRKRLESEYGLADPNEDVSGNKR 881 >ref|XP_007225360.1| hypothetical protein PRUPE_ppa000894mg [Prunus persica] gi|462422296|gb|EMJ26559.1| hypothetical protein PRUPE_ppa000894mg [Prunus persica] Length = 968 Score = 383 bits (984), Expect = e-104 Identities = 205/331 (61%), Positives = 247/331 (74%), Gaps = 2/331 (0%) Frame = +1 Query: 1 KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180 KERV+KVLQVW+DWFLFSDAYVNGLRATFLR NSGV+PFHSICGDAPE+++K S +TG Sbjct: 557 KERVLKVLQVWSDWFLFSDAYVNGLRATFLRSGNSGVVPFHSICGDAPEIDKKITSEDTG 616 Query: 181 DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360 DA K NQDAALA+GKGAAM+EL +LP+ ELERRCRHNGLSLVGGRE MVARLL LEEAEK Sbjct: 617 DACKTNQDAALAMGKGAAMRELLSLPLAELERRCRHNGLSLVGGRETMVARLLSLEEAEK 676 Query: 361 QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLP 540 QRG+E+DD+LK A S SSS RY+S ++E N E D IS + G+GS+ L Sbjct: 677 QRGYELDDDLKYAQSHSSSARYSSSRREMNIEPDSMGISAQ-----------GKGSLPLV 725 Query: 541 PKDLNLQPDIN--SSEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENAGD 714 QP++ + + K++ +LP+SKWA L L+YSSSGSENAGD Sbjct: 726 QTLPIPQPELKALTKKEKSDPVLPASKWAREDDDSDDEQKRSARDLGLSYSSSGSENAGD 785 Query: 715 VLSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIERK 894 SK +E+EV TDA+ D G++EEQRQKLRRLEVAL+EYRESLEE+GIK+PEEIERK Sbjct: 786 GPSKADEMEVATDASIPAQPDSGISEEQRQKLRRLEVALIEYRESLEERGIKNPEEIERK 845 Query: 895 VAARRSRLQAEYGLVNSDADASGRKKSSLER 987 VA R RL++EYGL +S DA G K++S ER 
Sbjct: 846 VAIHRKRLESEYGLSDSSEDACGSKRTSSER 876 >ref|XP_004234429.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like [Solanum lycopersicum] Length = 947 Score = 380 bits (976), Expect = e-103 Identities = 207/334 (61%), Positives = 244/334 (73%), Gaps = 5/334 (1%) Frame = +1 Query: 1 KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180 KERV+KVLQVWADWFLFSDAYVNGLRATFLR NSGV PFHS+CGDAP++E++ S + G Sbjct: 550 KERVLKVLQVWADWFLFSDAYVNGLRATFLRTGNSGVTPFHSLCGDAPDVEQRTSSDDAG 609 Query: 181 DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360 D GK+N D ALAIGKGAAMKEL +LP+ ELERRCRHNGLS+VGGREMMVARLLYLEEAEK Sbjct: 610 DGGKVNPDGALAIGKGAAMKELLSLPLTELERRCRHNGLSIVGGREMMVARLLYLEEAEK 669 Query: 361 QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLP 540 QRG E+D++LK A S SSS R+ S +K+ N E+D S R MD + R S + Sbjct: 670 QRGHELDEDLKFA-SHSSSARFPSTRKDSNLELDRMAPSERNSQMDYDVQLKQRES--VS 726 Query: 541 PKDLNLQPDIN----SSEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENA 708 +N P N SS+GK+E+ILP+SKWA L L YSSSGSENA Sbjct: 727 SHQINSAPHYNSIDFSSDGKSETILPTSKWAREDDESDDEQKRSSRDLGLTYSSSGSENA 786 Query: 709 GDVLSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIE 888 GD LSK ++ E+TTD NS + + GMNEE RQKLRRLEVAL+EYRESLEEQGIK+P+EIE Sbjct: 787 GDGLSKIKDAELTTDTGNSAYPESGMNEELRQKLRRLEVALIEYRESLEEQGIKNPDEIE 846 Query: 889 RKVAARRSRLQAEYGLVNSDADASGR-KKSSLER 987 RKV R LQ+EYGL+N D S + +SS ER Sbjct: 847 RKVEIHRQCLQSEYGLLNFSEDTSKKGGRSSSER 880 >ref|XP_006585862.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like isoform X3 [Glycine max] gi|571473238|ref|XP_006585863.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like isoform X4 [Glycine max] Length = 874 Score = 379 bits (972), Expect = e-102 Identities = 202/331 (61%), Positives = 240/331 (72%), Gaps = 2/331 (0%) Frame = +1 Query: 1 KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180 KERV+KVLQVWADWFLFSDAYVNGLRATFLR NSGVIPFHSICGDAPE+E+ S + Sbjct: 461 
KERVLKVLQVWADWFLFSDAYVNGLRATFLRPGNSGVIPFHSICGDAPEIEQNTTSKDMV 520 Query: 181 DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360 GK NQDAALA+G+GAAMKEL +LP+ ELERRCRHNGLSLVGGREMMVARLL LEEAEK Sbjct: 521 VGGKTNQDAALAMGRGAAMKELMSLPLAELERRCRHNGLSLVGGREMMVARLLSLEEAEK 580 Query: 361 QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLP 540 QRGFE+D+ELK AH+Q SSG+Y+S Q+E + E D V D+ + S GR S+ L Sbjct: 581 QRGFELDEELKYAHNQVSSGKYSSNQRETSEEPD----PVWNHYGDEDLQSQGRSSVPLS 636 Query: 541 PKDLNLQPDIN--SSEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENAGD 714 P QP++ + + KN+ +LP+SKWA + L+YSSSGSEN GD Sbjct: 637 PTLPIAQPELKAFTKKEKNDPVLPASKWAWEGDESDDEQRRSGKNIGLSYSSSGSENVGD 696 Query: 715 VLSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIERK 894 L K +E E D S H D GMNEEQRQKLRRLEVAL+EYRESLEE+G+K+ EEIE+K Sbjct: 697 GLVKADESESAADTRFSAHADSGMNEEQRQKLRRLEVALIEYRESLEERGVKNLEEIEKK 756 Query: 895 VAARRSRLQAEYGLVNSDADASGRKKSSLER 987 V + R RLQ EYGL +S D G +++S R Sbjct: 757 VQSHRKRLQVEYGLSDSGEDGHGHRRTSERR 787 >ref|XP_006585860.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like isoform X1 [Glycine max] gi|571473234|ref|XP_006585861.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like isoform X2 [Glycine max] Length = 969 Score = 379 bits (972), Expect = e-102 Identities = 202/331 (61%), Positives = 240/331 (72%), Gaps = 2/331 (0%) Frame = +1 Query: 1 KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180 KERV+KVLQVWADWFLFSDAYVNGLRATFLR NSGVIPFHSICGDAPE+E+ S + Sbjct: 556 KERVLKVLQVWADWFLFSDAYVNGLRATFLRPGNSGVIPFHSICGDAPEIEQNTTSKDMV 615 Query: 181 DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360 GK NQDAALA+G+GAAMKEL +LP+ ELERRCRHNGLSLVGGREMMVARLL LEEAEK Sbjct: 616 VGGKTNQDAALAMGRGAAMKELMSLPLAELERRCRHNGLSLVGGREMMVARLLSLEEAEK 675 Query: 361 QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLP 540 QRGFE+D+ELK AH+Q SSG+Y+S Q+E + E D V D+ + S GR S+ L Sbjct: 676 QRGFELDEELKYAHNQVSSGKYSSNQRETSEEPD----PVWNHYGDEDLQSQGRSSVPLS 731 
Query: 541 PKDLNLQPDIN--SSEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENAGD 714 P QP++ + + KN+ +LP+SKWA + L+YSSSGSEN GD Sbjct: 732 PTLPIAQPELKAFTKKEKNDPVLPASKWAWEGDESDDEQRRSGKNIGLSYSSSGSENVGD 791 Query: 715 VLSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIERK 894 L K +E E D S H D GMNEEQRQKLRRLEVAL+EYRESLEE+G+K+ EEIE+K Sbjct: 792 GLVKADESESAADTRFSAHADSGMNEEQRQKLRRLEVALIEYRESLEERGVKNLEEIEKK 851 Query: 895 VAARRSRLQAEYGLVNSDADASGRKKSSLER 987 V + R RLQ EYGL +S D G +++S R Sbjct: 852 VQSHRKRLQVEYGLSDSGEDGHGHRRTSERR 882 >gb|EXC01118.1| U2 snRNP-associated SURP motif-containing protein [Morus notabilis] Length = 999 Score = 378 bits (971), Expect = e-102 Identities = 203/331 (61%), Positives = 242/331 (73%), Gaps = 2/331 (0%) Frame = +1 Query: 1 KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180 KERV+KVLQVWADWFLFSDAYVNGLRATFLR NSGV PFHSICGDAPE+E+ +TG Sbjct: 575 KERVLKVLQVWADWFLFSDAYVNGLRATFLRLGNSGVTPFHSICGDAPEIEKIISFEDTG 634 Query: 181 DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360 DAGK N+DAALA+GKGAAM+EL NLP ELERRCRHNGLSLVGGREMMVARLL LEEAEK Sbjct: 635 DAGKTNEDAALAMGKGAAMQELMNLPFAELERRCRHNGLSLVGGREMMVARLLSLEEAEK 694 Query: 361 QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLP 540 QRG+E+D++LK A SSSGRY+ G++E N E + S D + S +GS+ L Sbjct: 695 QRGYELDEDLKYAQGHSSSGRYSGGRRETNVEGEPMGSSGWNHYAGDEIDSQAKGSVPLA 754 Query: 541 PKDLNLQPDINS--SEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENAGD 714 QP++ + K++ +LP+SKWA L L YSSSGSENAGD Sbjct: 755 QTIPIPQPELKPFVKKEKSDPVLPASKWAREDDDSDDEQKRSSRGLGLGYSSSGSENAGD 814 Query: 715 VLSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIERK 894 SK +E+E D ++ V D GM+EEQR+KLRRLE AL+EYRESLEE+GI+ PEEIERK Sbjct: 815 GPSKADEMESAAD-SSVVQPDSGMSEEQRKKLRRLEAALIEYRESLEERGIRSPEEIERK 873 Query: 895 VAARRSRLQAEYGLVNSDADASGRKKSSLER 987 V R RL+AEYGL NS+ DA+G K++SLER Sbjct: 874 VTMHRKRLEAEYGLSNSNKDAAGSKRASLER 904 >ref|XP_002324341.2| RNA recognition motif-containing family protein [Populus trichocarpa] gi|550317898|gb|EEF02906.2| 
RNA recognition motif-containing family protein [Populus trichocarpa] Length = 969 Score = 378 bits (970), Expect = e-102 Identities = 202/333 (60%), Positives = 245/333 (73%), Gaps = 4/333 (1%) Frame = +1 Query: 1 KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180 KERV+KVLQVW+DWFLFSDAYVNGLRATFLR +NSGVIPFHSICGDAPE+E+K+ S + Sbjct: 557 KERVLKVLQVWSDWFLFSDAYVNGLRATFLRSSNSGVIPFHSICGDAPEIEKKSSSEDAV 616 Query: 181 DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360 + KINQDAALA+GKGAA+KEL NLP+ ELERRCRHNGLSLVGGREMMVARLL LEEAE+ Sbjct: 617 EGAKINQDAALAMGKGAAVKELMNLPLAELERRCRHNGLSLVGGREMMVARLLSLEEAER 676 Query: 361 QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNM--DDGMPSIGRGSML 534 QRG+E+DD+LK A S SSS RY+S +E N E + + G N+ +D MPS +GS+ Sbjct: 677 QRGYELDDDLKIAQSNSSSSRYSSVHREMNVEAE--PVGSTGWNVYGEDEMPSQNKGSVS 734 Query: 535 LPPKDLNLQPDIN--SSEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENA 708 + L QP++ + + KN+ +LP+SKWA L L+YSSSGSENA Sbjct: 735 VASTLLIKQPELKAFAKKEKNDPVLPASKWARDDDESDDEQKRSARDLGLSYSSSGSENA 794 Query: 709 GDVLSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIE 888 GD K +E+E TDAN D GMNEEQRQKLRRLEVAL+EYRESLEE+G+K EIE Sbjct: 795 GDGQGKADEMEFATDANIPTQPDSGMNEEQRQKLRRLEVALIEYRESLEERGMKSSVEIE 854 Query: 889 RKVAARRSRLQAEYGLVNSDADASGRKKSSLER 987 KVA R L++EYGL +S+ D + +K S ER Sbjct: 855 GKVAIHRKWLESEYGLSSSNEDVTSKKSISSER 887 >ref|XP_002308714.1| RNA recognition motif-containing family protein [Populus trichocarpa] gi|222854690|gb|EEE92237.1| RNA recognition motif-containing family protein [Populus trichocarpa] Length = 988 Score = 378 bits (970), Expect = e-102 Identities = 215/417 (51%), Positives = 263/417 (63%), Gaps = 4/417 (0%) Frame = +1 Query: 1 KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180 KERV+KVLQVW+DWFLFSDAYVNGLRATFLR +NSGVIPFHS+CGDAPE+E+K + +T Sbjct: 567 KERVLKVLQVWSDWFLFSDAYVNGLRATFLRSSNSGVIPFHSMCGDAPEIEKKNSTEDTV 626 Query: 181 DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360 D GK NQDAALA+GKGAA KEL +LP+ 
ELERRCRHNGLSLVGGRE MVARLL LEEAEK Sbjct: 627 DGGKTNQDAALAMGKGAATKELMDLPLAELERRCRHNGLSLVGGRETMVARLLNLEEAEK 686 Query: 361 QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNM--DDGMPSIGRGSML 534 QRG+E+D +LK A S SSS RY+S +E N +D G + + G N+ +D PS + S+ Sbjct: 687 QRGYELDGDLKIAQSNSSSSRYSSVHREVN--VDPGPVGLTGWNIYGEDDTPSQNKRSVS 744 Query: 535 LPPKDLNLQPDIN--SSEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENA 708 L QP++ + + KN+ +LP+SKWA L L+YSSSGSENA Sbjct: 745 LVSTLPIPQPELKAFAKKEKNDPVLPASKWARDDDESDDEQKRSVRDLGLSYSSSGSENA 804 Query: 709 GDVLSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIE 888 GD K +E+E TDA+ + GMNEEQRQKLRRLEVAL+EYRESLEEQG+K+ EE E Sbjct: 805 GDGQGKEDEMEFATDASIPTQPESGMNEEQRQKLRRLEVALIEYRESLEEQGMKNSEEFE 864 Query: 889 RKVAARRSRLQAEYGLVNSDADASGRKKSSLERGGXXXXXXXXXXXXXXXXXXXXXXXXX 1068 RKVA R RL++EYGL +S+ D +G K+ S ER Sbjct: 865 RKVAVHRKRLESEYGLSSSNEDVTGNKRISSERRDRRDDNHESSRKRHRSESRSESPQRK 924 Query: 1069 XXXXXXERESDANSNXXXXXXXXXPQELXXXXXXXXXXXXXXXXXXXKERDDHDRER 1239 ERE D++ + L KERDDHDR+R Sbjct: 925 LSLRDREREHDSDKDRERHRERDRGNNL----ESERRDRDYREKSGSKERDDHDRDR 977 >ref|XP_006599196.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like [Glycine max] Length = 969 Score = 377 bits (967), Expect = e-102 Identities = 203/331 (61%), Positives = 241/331 (72%), Gaps = 2/331 (0%) Frame = +1 Query: 1 KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180 KERV+KVLQVWADWFLFSDAYVNGLRATFLR NSGVIPFHSICGDAPE+E+K S + Sbjct: 556 KERVLKVLQVWADWFLFSDAYVNGLRATFLRPGNSGVIPFHSICGDAPEIEQKTASEDMV 615 Query: 181 DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360 GK NQDAALA+G+GAAMKEL +LP+ ELERRCRHNGLSLVGGREMMVARLL LEEAEK Sbjct: 616 VGGKTNQDAALAMGRGAAMKELMSLPLAELERRCRHNGLSLVGGREMMVARLLSLEEAEK 675 Query: 361 QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLP 540 Q+GFE+DDELK AH+Q SSG+Y+S Q+E ++E+D +S D+ + S GR S+ L Sbjct: 676 QKGFELDDELKYAHNQVSSGKYSSNQRETSAELDPVGLSAWNHYGDEDIQSQGRSSVPLA 735 Query: 541 
PKDLNLQPDIN--SSEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENAGD 714 P QP + + + KN+ +LP+SKWA L L+YSSSGSEN D Sbjct: 736 PTLPIPQPKLKAFTKKEKNDPVLPASKWA-REDDESDDEQRSGKNLGLSYSSSGSENVDD 794 Query: 715 VLSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIERK 894 L K +E E D + S H D GMNEEQRQKLRRLEVAL+EY ESLEE+GIK+ EEIE+K Sbjct: 795 GLVKADESESAADRSFSAHADSGMNEEQRQKLRRLEVALIEYGESLEERGIKNLEEIEKK 854 Query: 895 VAARRSRLQAEYGLVNSDADASGRKKSSLER 987 V R RLQ EYGL +S D G +++S R Sbjct: 855 VQLHRKRLQVEYGLSDSGEDGQGNRRTSERR 885 >emb|CAN79213.1| hypothetical protein VITISV_025939 [Vitis vinifera] Length = 1384 Score = 377 bits (967), Expect = e-102 Identities = 203/326 (62%), Positives = 241/326 (73%), Gaps = 3/326 (0%) Frame = +1 Query: 1 KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180 KERVMKVLQVWADWFLFSDAYVNGLRATFLR NSGV PFHSICGDAPE+E+K S +TG Sbjct: 717 KERVMKVLQVWADWFLFSDAYVNGLRATFLRSGNSGVTPFHSICGDAPEIEKKTSSEDTG 776 Query: 181 DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360 + GK NQDAALA+GKGAAMKEL +LPI ELERRCRHNGLSLVGGRE+MVARLL LEEAEK Sbjct: 777 EGGKSNQDAALAMGKGAAMKELLSLPIAELERRCRHNGLSLVGGREIMVARLLSLEEAEK 836 Query: 361 QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLP 540 QRG+++DD+LK A S S+SGRY S +KE E + +S +D + S G+GS+ L Sbjct: 837 QRGYDLDDDLKYAQSHSNSGRYPSSRKEIGVETESVGLSGWNRYGEDEIQSQGKGSVPLA 896 Query: 541 PKDLNLQPDIN--SSEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENAGD 714 P QP++ +++GK + +LP+SKWA L L+YSSSGSENAGD Sbjct: 897 PTIPIPQPELKAFTNKGKTDPVLPASKWAREDDDSDDEQKRSARGLGLSYSSSGSENAGD 956 Query: 715 VLSKTEELEVTTDANNSVHLDGG-MNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIER 891 K +E+E T+++ D G MNEE RQKLRRLEVAL+EYRESLEE+GIK EEIER Sbjct: 957 GPXKADEMEFATESSIPSQPDSGMMNEEHRQKLRRLEVALIEYRESLEERGIKSSEEIER 1016 Query: 892 KVAARRSRLQAEYGLVNSDADASGRK 969 KVA R RLQ+EYGL +S+ D S K Sbjct: 1017 KVAIHRKRLQSEYGLSDSNEDVSWNK 1042 >ref|XP_006353899.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like isoform X3 [Solanum tuberosum] Length = 857 Score = 375 bits 
(963), Expect = e-101 Identities = 206/334 (61%), Positives = 244/334 (73%), Gaps = 5/334 (1%) Frame = +1 Query: 1 KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180 KERV+KVLQVWADWFLFSDAYVNGLRATFLR NSGV PFHS+CGDAP++E++A S + G Sbjct: 460 KERVLKVLQVWADWFLFSDAYVNGLRATFLRTGNSGVTPFHSLCGDAPDVEQRASSDDAG 519 Query: 181 DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360 D GKIN D ALAIGKGAAMKEL +LP+ ELERRCRHNGLS+VGGREMMVARLLYLEEAEK Sbjct: 520 DGGKINPDGALAIGKGAAMKELLSLPLTELERRCRHNGLSIVGGREMMVARLLYLEEAEK 579 Query: 361 QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLP 540 QRG E+D++LK A S SSS R+ S +K+ N E+D S R +D + R S + Sbjct: 580 QRGHELDEDLKFA-SHSSSARFPSTRKDSNLELDRMAPSERNSQVDYDVQLKQRES--VS 636 Query: 541 PKDLNLQPDIN----SSEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENA 708 N P N SSEGK+E+ILP+SKWA L L YSSSGSENA Sbjct: 637 SHQTNSAPHYNSIDFSSEGKSETILPTSKWAREDDESDDEQKRSSRDLGLTYSSSGSENA 696 Query: 709 GDVLSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIE 888 GD ++K ++ E+TTD +NS + + GMNEE RQKLRRLEVAL+EYRESLEEQGIK+ +EIE Sbjct: 697 GDGINKIKDAELTTDTSNSAYPESGMNEELRQKLRRLEVALIEYRESLEEQGIKNLDEIE 756 Query: 889 RKVAARRSRLQAEYGLVNSDADASGR-KKSSLER 987 RKV R LQ+EYGL+N D S + +SS ER Sbjct: 757 RKVEIHRQCLQSEYGLLNFSEDTSKKGGRSSSER 790 >ref|XP_006353898.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like isoform X2 [Solanum tuberosum] Length = 947 Score = 375 bits (963), Expect = e-101 Identities = 206/334 (61%), Positives = 244/334 (73%), Gaps = 5/334 (1%) Frame = +1 Query: 1 KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180 KERV+KVLQVWADWFLFSDAYVNGLRATFLR NSGV PFHS+CGDAP++E++A S + G Sbjct: 550 KERVLKVLQVWADWFLFSDAYVNGLRATFLRTGNSGVTPFHSLCGDAPDVEQRASSDDAG 609 Query: 181 DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360 D GKIN D ALAIGKGAAMKEL +LP+ ELERRCRHNGLS+VGGREMMVARLLYLEEAEK Sbjct: 610 DGGKINPDGALAIGKGAAMKELLSLPLTELERRCRHNGLSIVGGREMMVARLLYLEEAEK 669 Query: 361 
QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLP 540 QRG E+D++LK A S SSS R+ S +K+ N E+D S R +D + R S + Sbjct: 670 QRGHELDEDLKFA-SHSSSARFPSTRKDSNLELDRMAPSERNSQVDYDVQLKQRES--VS 726 Query: 541 PKDLNLQPDIN----SSEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENA 708 N P N SSEGK+E+ILP+SKWA L L YSSSGSENA Sbjct: 727 SHQTNSAPHYNSIDFSSEGKSETILPTSKWAREDDESDDEQKRSSRDLGLTYSSSGSENA 786 Query: 709 GDVLSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIE 888 GD ++K ++ E+TTD +NS + + GMNEE RQKLRRLEVAL+EYRESLEEQGIK+ +EIE Sbjct: 787 GDGINKIKDAELTTDTSNSAYPESGMNEELRQKLRRLEVALIEYRESLEEQGIKNLDEIE 846 Query: 889 RKVAARRSRLQAEYGLVNSDADASGR-KKSSLER 987 RKV R LQ+EYGL+N D S + +SS ER Sbjct: 847 RKVEIHRQCLQSEYGLLNFSEDTSKKGGRSSSER 880 >ref|XP_006353897.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like isoform X1 [Solanum tuberosum] Length = 948 Score = 375 bits (963), Expect = e-101 Identities = 206/334 (61%), Positives = 244/334 (73%), Gaps = 5/334 (1%) Frame = +1 Query: 1 KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180 KERV+KVLQVWADWFLFSDAYVNGLRATFLR NSGV PFHS+CGDAP++E++A S + G Sbjct: 551 KERVLKVLQVWADWFLFSDAYVNGLRATFLRTGNSGVTPFHSLCGDAPDVEQRASSDDAG 610 Query: 181 DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360 D GKIN D ALAIGKGAAMKEL +LP+ ELERRCRHNGLS+VGGREMMVARLLYLEEAEK Sbjct: 611 DGGKINPDGALAIGKGAAMKELLSLPLTELERRCRHNGLSIVGGREMMVARLLYLEEAEK 670 Query: 361 QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLP 540 QRG E+D++LK A S SSS R+ S +K+ N E+D S R +D + R S + Sbjct: 671 QRGHELDEDLKFA-SHSSSARFPSTRKDSNLELDRMAPSERNSQVDYDVQLKQRES--VS 727 Query: 541 PKDLNLQPDIN----SSEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENA 708 N P N SSEGK+E+ILP+SKWA L L YSSSGSENA Sbjct: 728 SHQTNSAPHYNSIDFSSEGKSETILPTSKWAREDDESDDEQKRSSRDLGLTYSSSGSENA 787 Query: 709 GDVLSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIE 888 GD ++K ++ E+TTD +NS + + GMNEE RQKLRRLEVAL+EYRESLEEQGIK+ +EIE Sbjct: 788 GDGINKIKDAELTTDTSNSAYPESGMNEELRQKLRRLEVALIEYRESLEEQGIKNLDEIE 
847 Query: 889 RKVAARRSRLQAEYGLVNSDADASGR-KKSSLER 987 RKV R LQ+EYGL+N D S + +SS ER Sbjct: 848 RKVEIHRQCLQSEYGLLNFSEDTSKKGGRSSSER 881 >ref|XP_004291970.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like [Fragaria vesca subsp. vesca] Length = 980 Score = 372 bits (956), Expect = e-100 Identities = 200/332 (60%), Positives = 248/332 (74%), Gaps = 3/332 (0%) Frame = +1 Query: 1 KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180 KERV+KVLQVW+DWFLFSDAYVNGLRATFLR NSGV+PFHS+CGDAP++E+K S + G Sbjct: 557 KERVLKVLQVWSDWFLFSDAYVNGLRATFLRSGNSGVVPFHSVCGDAPDIEKKTTSEDAG 616 Query: 181 DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360 DA K NQDAALA+GKGAA +EL NLP+ ELERRCRHNGLSLVGGREMMVARLL LEEAEK Sbjct: 617 DA-KTNQDAALAMGKGAATRELLNLPMAELERRCRHNGLSLVGGREMMVARLLSLEEAEK 675 Query: 361 QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLP 540 QRG+E+DD+LK + SSSGR++S +KE N E D +S ++D + S G+ S+ Sbjct: 676 QRGYELDDDLKYGQNHSSSGRHSSSRKEMNIEPDPLGLSGWNRYVEDEIQSEGKVSLSKA 735 Query: 541 PKDLNLQPDIN--SSEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENAGD 714 + QP++ +++ K++ +LP+SKWA L L+Y SSGSENAGD Sbjct: 736 QTHTSPQPELKPFTTKEKSDPVLPASKWAREDDDSDDDQKRSAKGLGLSY-SSGSENAGD 794 Query: 715 VLSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIERK 894 SK +E+EV TD D G++EEQRQKLRRLEV+L+EYRESLEE+GI+ PEEIERK Sbjct: 795 GPSKADEMEVATDVRIPAQPDSGLSEEQRQKLRRLEVSLLEYRESLEERGIRSPEEIERK 854 Query: 895 VAARRSRLQAEYGLVNSDADASGR-KKSSLER 987 VA R RL++EYGL +S DASGR K++S ER Sbjct: 855 VAIHRKRLESEYGLSDSSEDASGRSKRTSSER 886 >ref|XP_007011694.1| RNA recognition motif-containing protein isoform 4 [Theobroma cacao] gi|590571807|ref|XP_007011695.1| RNA recognition motif-containing protein isoform 4 [Theobroma cacao] gi|508782057|gb|EOY29313.1| RNA recognition motif-containing protein isoform 4 [Theobroma cacao] gi|508782058|gb|EOY29314.1| RNA recognition motif-containing protein isoform 4 [Theobroma cacao] Length = 811 Score = 372 bits (954), Expect = e-100 Identities = 200/331 (60%), 
Positives = 245/331 (74%), Gaps = 2/331 (0%) Frame = +1 Query: 1 KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180 KERV+KVLQVW+DWFLFSDAYVNGLRATFLR NSGV PFHSICGDAPE+E+ S + G Sbjct: 383 KERVLKVLQVWSDWFLFSDAYVNGLRATFLRSGNSGVAPFHSICGDAPEIEKNTSSEDAG 442 Query: 181 DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360 D K NQDAALA+GKGAAM+EL +LP+ ELERRCRHNGLSLVGGRE+MVARLL LE+AEK Sbjct: 443 DGIKGNQDAALAMGKGAAMRELMDLPLAELERRCRHNGLSLVGGREIMVARLLSLEDAEK 502 Query: 361 QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLP 540 QR +E+DD+LK A S+SSS RY+SGQ++ N+E + +S D+ + S +GS+ L Sbjct: 503 QRSYELDDDLKLAQSRSSSCRYSSGQRDINAEAEPVGLSGWTHYADNEIHSQRKGSVPLA 562 Query: 541 PKDLNLQPDINS--SEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENAGD 714 QP+I + + K + +LP+SKW+ L L+YSSSGSENAGD Sbjct: 563 ETLPIPQPEIKAFLKKEKIDPVLPASKWSREDDDSDDEEKRSTRGLGLSYSSSGSENAGD 622 Query: 715 VLSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIERK 894 SK +ELE TDA+ + MNEEQRQKLRRLEVAL+EYRESLEE+GIK E+IER+ Sbjct: 623 GTSKADELEFGTDASIPAPSESAMNEEQRQKLRRLEVALIEYRESLEERGIKSAEDIERR 682 Query: 895 VAARRSRLQAEYGLVNSDADASGRKKSSLER 987 VAA R RL++EYGL +S D SGRK++S ER Sbjct: 683 VAAHRKRLESEYGLSDSSEDISGRKRTSSER 713 >ref|XP_007011693.1| RNA recognition motif-containing protein isoform 3 [Theobroma cacao] gi|508782056|gb|EOY29312.1| RNA recognition motif-containing protein isoform 3 [Theobroma cacao] Length = 819 Score = 372 bits (954), Expect = e-100 Identities = 200/331 (60%), Positives = 245/331 (74%), Gaps = 2/331 (0%) Frame = +1 Query: 1 KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180 KERV+KVLQVW+DWFLFSDAYVNGLRATFLR NSGV PFHSICGDAPE+E+ S + G Sbjct: 391 KERVLKVLQVWSDWFLFSDAYVNGLRATFLRSGNSGVAPFHSICGDAPEIEKNTSSEDAG 450 Query: 181 DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360 D K NQDAALA+GKGAAM+EL +LP+ ELERRCRHNGLSLVGGRE+MVARLL LE+AEK Sbjct: 451 DGIKGNQDAALAMGKGAAMRELMDLPLAELERRCRHNGLSLVGGREIMVARLLSLEDAEK 510 Query: 361 
QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLP 540 QR +E+DD+LK A S+SSS RY+SGQ++ N+E + +S D+ + S +GS+ L Sbjct: 511 QRSYELDDDLKLAQSRSSSCRYSSGQRDINAEAEPVGLSGWTHYADNEIHSQRKGSVPLA 570 Query: 541 PKDLNLQPDINS--SEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENAGD 714 QP+I + + K + +LP+SKW+ L L+YSSSGSENAGD Sbjct: 571 ETLPIPQPEIKAFLKKEKIDPVLPASKWSREDDDSDDEEKRSTRGLGLSYSSSGSENAGD 630 Query: 715 VLSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIERK 894 SK +ELE TDA+ + MNEEQRQKLRRLEVAL+EYRESLEE+GIK E+IER+ Sbjct: 631 GTSKADELEFGTDASIPAPSESAMNEEQRQKLRRLEVALIEYRESLEERGIKSAEDIERR 690 Query: 895 VAARRSRLQAEYGLVNSDADASGRKKSSLER 987 VAA R RL++EYGL +S D SGRK++S ER Sbjct: 691 VAAHRKRLESEYGLSDSSEDISGRKRTSSER 721 >ref|XP_007011691.1| RNA recognition motif-containing protein isoform 1 [Theobroma cacao] gi|508782054|gb|EOY29310.1| RNA recognition motif-containing protein isoform 1 [Theobroma cacao] Length = 985 Score = 372 bits (954), Expect = e-100 Identities = 200/331 (60%), Positives = 245/331 (74%), Gaps = 2/331 (0%) Frame = +1 Query: 1 KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180 KERV+KVLQVW+DWFLFSDAYVNGLRATFLR NSGV PFHSICGDAPE+E+ S + G Sbjct: 557 KERVLKVLQVWSDWFLFSDAYVNGLRATFLRSGNSGVAPFHSICGDAPEIEKNTSSEDAG 616 Query: 181 DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360 D K NQDAALA+GKGAAM+EL +LP+ ELERRCRHNGLSLVGGRE+MVARLL LE+AEK Sbjct: 617 DGIKGNQDAALAMGKGAAMRELMDLPLAELERRCRHNGLSLVGGREIMVARLLSLEDAEK 676 Query: 361 QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLP 540 QR +E+DD+LK A S+SSS RY+SGQ++ N+E + +S D+ + S +GS+ L Sbjct: 677 QRSYELDDDLKLAQSRSSSCRYSSGQRDINAEAEPVGLSGWTHYADNEIHSQRKGSVPLA 736 Query: 541 PKDLNLQPDINS--SEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENAGD 714 QP+I + + K + +LP+SKW+ L L+YSSSGSENAGD Sbjct: 737 ETLPIPQPEIKAFLKKEKIDPVLPASKWSREDDDSDDEEKRSTRGLGLSYSSSGSENAGD 796 Query: 715 VLSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIERK 894 SK +ELE TDA+ + MNEEQRQKLRRLEVAL+EYRESLEE+GIK E+IER+ Sbjct: 797 
GTSKADELEFGTDASIPAPSESAMNEEQRQKLRRLEVALIEYRESLEERGIKSAEDIERR 856 Query: 895 VAARRSRLQAEYGLVNSDADASGRKKSSLER 987 VAA R RL++EYGL +S D SGRK++S ER Sbjct: 857 VAAHRKRLESEYGLSDSSEDISGRKRTSSER 887 >ref|XP_002515412.1| RNA binding protein, putative [Ricinus communis] gi|223545356|gb|EEF46861.1| RNA binding protein, putative [Ricinus communis] Length = 979 Score = 371 bits (952), Expect = e-100 Identities = 202/333 (60%), Positives = 247/333 (74%), Gaps = 4/333 (1%) Frame = +1 Query: 1 KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180 KERVMKVLQVW+DWFLFSDAYVNGLRATFLR + SGVIPFHSICGDAP +E+K S +TG Sbjct: 555 KERVMKVLQVWSDWFLFSDAYVNGLRATFLRSSTSGVIPFHSICGDAPAIEKKVTSEDTG 614 Query: 181 DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360 D GK +QDAALA+GKGAAMKEL +LP+ ELERRCRHNGLSLVGGREMMVARLL LEEAEK Sbjct: 615 DGGKTSQDAALAMGKGAAMKELLSLPLAELERRCRHNGLSLVGGREMMVARLLSLEEAEK 674 Query: 361 QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMD-MGQISVRGGNMDDGMPSIGRGSMLL 537 QRG+E+DD LK + S SS +++SG++E N E++ + + +V G +D + S R S L Sbjct: 675 QRGYELDDNLKVSQSHLSSSKFSSGRRETNVELEPVSEWNVYG---EDDVQSQSRASASL 731 Query: 538 PPKDL-NLQPDINSSEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENAGD 714 + + + + KN+ +LP+SKWA L L+YSSSGSENAGD Sbjct: 732 ATFPIPQAELKAFTKKEKNDPVLPASKWARDDDDSDDEQKRSSRGLGLSYSSSGSENAGD 791 Query: 715 VLSKT-EELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIER 891 L K +E+E TD + SV D GMNEEQRQKLRRLEVAL+EYRESLEE+G+K EEIER Sbjct: 792 GLGKADDEMEFATDGSISVQPDSGMNEEQRQKLRRLEVALIEYRESLEERGMKSAEEIER 851 Query: 892 KVAARRSRLQAEYGLVNSDADASGR-KKSSLER 987 KVA+ R RLQ++YGL++S D G K++S ER Sbjct: 852 KVASHRKRLQSDYGLLDSSQDTPGNSKRASSER 884