BLASTX nr result

ID: Mentha28_contig00016388 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00016388
         (1243 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU29204.1| hypothetical protein MIMGU_mgv1a000894mg [Mimulus...   406   e-111
ref|XP_006483724.1| PREDICTED: LOW QUALITY PROTEIN: U2 snRNP-ass...   385   e-104
ref|XP_006450262.1| hypothetical protein CICLE_v10007357mg [Citr...   385   e-104
ref|XP_007225360.1| hypothetical protein PRUPE_ppa000894mg [Prun...   383   e-104
ref|XP_004234429.1| PREDICTED: U2 snRNP-associated SURP motif-co...   380   e-103
ref|XP_006585862.1| PREDICTED: U2 snRNP-associated SURP motif-co...   379   e-102
ref|XP_006585860.1| PREDICTED: U2 snRNP-associated SURP motif-co...   379   e-102
gb|EXC01118.1| U2 snRNP-associated SURP motif-containing protein...   378   e-102
ref|XP_002324341.2| RNA recognition motif-containing family prot...   378   e-102
ref|XP_002308714.1| RNA recognition motif-containing family prot...   378   e-102
ref|XP_006599196.1| PREDICTED: U2 snRNP-associated SURP motif-co...   377   e-102
emb|CAN79213.1| hypothetical protein VITISV_025939 [Vitis vinifera]   377   e-102
ref|XP_006353899.1| PREDICTED: U2 snRNP-associated SURP motif-co...   375   e-101
ref|XP_006353898.1| PREDICTED: U2 snRNP-associated SURP motif-co...   375   e-101
ref|XP_006353897.1| PREDICTED: U2 snRNP-associated SURP motif-co...   375   e-101
ref|XP_004291970.1| PREDICTED: U2 snRNP-associated SURP motif-co...   372   e-100
ref|XP_007011694.1| RNA recognition motif-containing protein iso...   372   e-100
ref|XP_007011693.1| RNA recognition motif-containing protein iso...   372   e-100
ref|XP_007011691.1| RNA recognition motif-containing protein iso...   372   e-100
ref|XP_002515412.1| RNA binding protein, putative [Ricinus commu...   371   e-100

>gb|EYU29204.1| hypothetical protein MIMGU_mgv1a000894mg [Mimulus guttatus]
          Length = 949

 Score =  406 bits (1044), Expect = e-111
 Identities = 225/337 (66%), Positives = 257/337 (76%), Gaps = 6/337 (1%)
 Frame = +1

Query: 1    KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180
            KERV+KVLQVWADWFLFSDAYVNGLRATF+R  +SGV  FHSICGDAPELERK GSA+ G
Sbjct: 554  KERVLKVLQVWADWFLFSDAYVNGLRATFIRSGSSGVTTFHSICGDAPELERKPGSADHG 613

Query: 181  DAGKIN--QDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEA 354
               KIN  QDAALAIGKGAAMKEL  LP+ ELERRCRHNGLSLVGGRE MVARLLYLEEA
Sbjct: 614  QGEKINHGQDAALAIGKGAAMKELLTLPLNELERRCRHNGLSLVGGRETMVARLLYLEEA 673

Query: 355  EKQRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGN---MDDGMPSIGRG 525
            EKQRG EIDDELKS  SQ  SGRY SGQ+E  S+ + G     G N   +D+ +P +  G
Sbjct: 674  EKQRGSEIDDELKSGRSQLGSGRYQSGQRE--SKFEAGPAETSGWNSSRVDEMVPKV-TG 730

Query: 526  SMLLPPKDLNLQPDINSSEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSEN 705
            ++ LPP D   +  IN+ +G +ESILP+SKWA                L L YSSSGS+ 
Sbjct: 731  AVFLPPSD--QKELINARDGGSESILPASKWARENEESDDENERSTKELGLTYSSSGSDM 788

Query: 706  AGDV-LSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEE 882
            AGD    KTEE  +T DA NS ++DGGMNEEQRQKLRRLEVALMEYRESLEE+G+K+ +E
Sbjct: 789  AGDSDPYKTEERGITNDATNSAYVDGGMNEEQRQKLRRLEVALMEYRESLEERGLKNSDE 848

Query: 883  IERKVAARRSRLQAEYGLVNSDADASGRKKSSLERGG 993
            IE+KVA  RSRLQAEYGL++S+ADASGRKKSSL+  G
Sbjct: 849  IEKKVAIHRSRLQAEYGLLDSNADASGRKKSSLDGRG 885


>ref|XP_006483724.1| PREDICTED: LOW QUALITY PROTEIN: U2 snRNP-associated SURP
            motif-containing protein-like [Citrus sinensis]
          Length = 1017

 Score =  385 bits (989), Expect = e-104
 Identities = 203/326 (62%), Positives = 242/326 (74%), Gaps = 2/326 (0%)
 Frame = +1

Query: 1    KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180
            KERV+KVLQVW+DWFLFSDAYVNGLRATFLR  NSGV PFHSICGDAPE+++K  S +T 
Sbjct: 600  KERVLKVLQVWSDWFLFSDAYVNGLRATFLRSGNSGVTPFHSICGDAPEIDKKNNSEDTC 659

Query: 181  DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360
            D  K NQD ALA+GKGAA+KEL NLP+ ELERRCRHNGLSLVGGREMMVARLL LE+AEK
Sbjct: 660  DLSKTNQDTALAMGKGAAIKELMNLPLSELERRCRHNGLSLVGGREMMVARLLSLEDAEK 719

Query: 361  QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLP 540
            QRG+E+DD+LKSAHSQSSSGRY+ G KE N E +   +S   G  +D   S   GS+ L 
Sbjct: 720  QRGYELDDDLKSAHSQSSSGRYSRGWKETNMEAESMGLSGWNGYEEDEKLSQAVGSVPLG 779

Query: 541  PKDLNLQPDIN--SSEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENAGD 714
                  QP+I   + + KN+ +LP+SKWA                L L+YSSSGSENAGD
Sbjct: 780  TMLTTPQPEIKAFTKKEKNDPVLPASKWALEDDESDDEQKRSSRGLGLSYSSSGSENAGD 839

Query: 715  VLSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIERK 894
              SK ++++ T DA+  V  D GMNEEQRQKLRRLEV+L+EYRESLEE+GIK  EEIE+K
Sbjct: 840  GPSKADDVDFTIDASIPVQPDSGMNEEQRQKLRRLEVSLIEYRESLEERGIKSSEEIEKK 899

Query: 895  VAARRSRLQAEYGLVNSDADASGRKK 972
            VA  R RL++EYGL + + D SG K+
Sbjct: 900  VAIHRKRLESEYGLADPNEDVSGNKR 925


>ref|XP_006450262.1| hypothetical protein CICLE_v10007357mg [Citrus clementina]
            gi|567916514|ref|XP_006450263.1| hypothetical protein
            CICLE_v10007357mg [Citrus clementina]
            gi|557553488|gb|ESR63502.1| hypothetical protein
            CICLE_v10007357mg [Citrus clementina]
            gi|557553489|gb|ESR63503.1| hypothetical protein
            CICLE_v10007357mg [Citrus clementina]
          Length = 973

 Score =  385 bits (989), Expect = e-104
 Identities = 203/326 (62%), Positives = 242/326 (74%), Gaps = 2/326 (0%)
 Frame = +1

Query: 1    KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180
            KERV+KVLQVW+DWFLFSDAYVNGLRATFLR  NSGV PFHSICGDAPE+++K  S +T 
Sbjct: 556  KERVLKVLQVWSDWFLFSDAYVNGLRATFLRSGNSGVTPFHSICGDAPEIDKKNNSEDTC 615

Query: 181  DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360
            D  K NQD ALA+GKGAA+KEL NLP+ ELERRCRHNGLSLVGGREMMVARLL LE+AEK
Sbjct: 616  DLSKTNQDTALAMGKGAAIKELMNLPLSELERRCRHNGLSLVGGREMMVARLLSLEDAEK 675

Query: 361  QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLP 540
            QRG+E+DD+LKSAHSQSSSGRY+ G KE N E +   +S   G  +D   S   GS+ L 
Sbjct: 676  QRGYELDDDLKSAHSQSSSGRYSRGWKETNMEAESMGLSGWNGYEEDEKLSQAVGSVPLG 735

Query: 541  PKDLNLQPDIN--SSEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENAGD 714
                  QP+I   + + KN+ +LP+SKWA                L L+YSSSGSENAGD
Sbjct: 736  TMLTTPQPEIKAFTKKEKNDPVLPASKWALEDDESDDEQKRSSRGLGLSYSSSGSENAGD 795

Query: 715  VLSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIERK 894
              SK ++++ T DA+  V  D GMNEEQRQKLRRLEV+L+EYRESLEE+GIK  EEIE+K
Sbjct: 796  GPSKADDVDFTIDASIPVQPDSGMNEEQRQKLRRLEVSLIEYRESLEERGIKSSEEIEKK 855

Query: 895  VAARRSRLQAEYGLVNSDADASGRKK 972
            VA  R RL++EYGL + + D SG K+
Sbjct: 856  VAIHRKRLESEYGLADPNEDVSGNKR 881


>ref|XP_007225360.1| hypothetical protein PRUPE_ppa000894mg [Prunus persica]
            gi|462422296|gb|EMJ26559.1| hypothetical protein
            PRUPE_ppa000894mg [Prunus persica]
          Length = 968

 Score =  383 bits (984), Expect = e-104
 Identities = 205/331 (61%), Positives = 247/331 (74%), Gaps = 2/331 (0%)
 Frame = +1

Query: 1    KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180
            KERV+KVLQVW+DWFLFSDAYVNGLRATFLR  NSGV+PFHSICGDAPE+++K  S +TG
Sbjct: 557  KERVLKVLQVWSDWFLFSDAYVNGLRATFLRSGNSGVVPFHSICGDAPEIDKKITSEDTG 616

Query: 181  DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360
            DA K NQDAALA+GKGAAM+EL +LP+ ELERRCRHNGLSLVGGRE MVARLL LEEAEK
Sbjct: 617  DACKTNQDAALAMGKGAAMRELLSLPLAELERRCRHNGLSLVGGRETMVARLLSLEEAEK 676

Query: 361  QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLP 540
            QRG+E+DD+LK A S SSS RY+S ++E N E D   IS +           G+GS+ L 
Sbjct: 677  QRGYELDDDLKYAQSHSSSARYSSSRREMNIEPDSMGISAQ-----------GKGSLPLV 725

Query: 541  PKDLNLQPDIN--SSEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENAGD 714
                  QP++   + + K++ +LP+SKWA                L L+YSSSGSENAGD
Sbjct: 726  QTLPIPQPELKALTKKEKSDPVLPASKWAREDDDSDDEQKRSARDLGLSYSSSGSENAGD 785

Query: 715  VLSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIERK 894
              SK +E+EV TDA+     D G++EEQRQKLRRLEVAL+EYRESLEE+GIK+PEEIERK
Sbjct: 786  GPSKADEMEVATDASIPAQPDSGISEEQRQKLRRLEVALIEYRESLEERGIKNPEEIERK 845

Query: 895  VAARRSRLQAEYGLVNSDADASGRKKSSLER 987
            VA  R RL++EYGL +S  DA G K++S ER
Sbjct: 846  VAIHRKRLESEYGLSDSSEDACGSKRTSSER 876


>ref|XP_004234429.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like
            [Solanum lycopersicum]
          Length = 947

 Score =  380 bits (976), Expect = e-103
 Identities = 207/334 (61%), Positives = 244/334 (73%), Gaps = 5/334 (1%)
 Frame = +1

Query: 1    KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180
            KERV+KVLQVWADWFLFSDAYVNGLRATFLR  NSGV PFHS+CGDAP++E++  S + G
Sbjct: 550  KERVLKVLQVWADWFLFSDAYVNGLRATFLRTGNSGVTPFHSLCGDAPDVEQRTSSDDAG 609

Query: 181  DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360
            D GK+N D ALAIGKGAAMKEL +LP+ ELERRCRHNGLS+VGGREMMVARLLYLEEAEK
Sbjct: 610  DGGKVNPDGALAIGKGAAMKELLSLPLTELERRCRHNGLSIVGGREMMVARLLYLEEAEK 669

Query: 361  QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLP 540
            QRG E+D++LK A S SSS R+ S +K+ N E+D    S R   MD  +    R S  + 
Sbjct: 670  QRGHELDEDLKFA-SHSSSARFPSTRKDSNLELDRMAPSERNSQMDYDVQLKQRES--VS 726

Query: 541  PKDLNLQPDIN----SSEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENA 708
               +N  P  N    SS+GK+E+ILP+SKWA                L L YSSSGSENA
Sbjct: 727  SHQINSAPHYNSIDFSSDGKSETILPTSKWAREDDESDDEQKRSSRDLGLTYSSSGSENA 786

Query: 709  GDVLSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIE 888
            GD LSK ++ E+TTD  NS + + GMNEE RQKLRRLEVAL+EYRESLEEQGIK+P+EIE
Sbjct: 787  GDGLSKIKDAELTTDTGNSAYPESGMNEELRQKLRRLEVALIEYRESLEEQGIKNPDEIE 846

Query: 889  RKVAARRSRLQAEYGLVNSDADASGR-KKSSLER 987
            RKV   R  LQ+EYGL+N   D S +  +SS ER
Sbjct: 847  RKVEIHRQCLQSEYGLLNFSEDTSKKGGRSSSER 880


>ref|XP_006585862.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like
            isoform X3 [Glycine max] gi|571473238|ref|XP_006585863.1|
            PREDICTED: U2 snRNP-associated SURP motif-containing
            protein-like isoform X4 [Glycine max]
          Length = 874

 Score =  379 bits (972), Expect = e-102
 Identities = 202/331 (61%), Positives = 240/331 (72%), Gaps = 2/331 (0%)
 Frame = +1

Query: 1    KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180
            KERV+KVLQVWADWFLFSDAYVNGLRATFLR  NSGVIPFHSICGDAPE+E+   S +  
Sbjct: 461  KERVLKVLQVWADWFLFSDAYVNGLRATFLRPGNSGVIPFHSICGDAPEIEQNTTSKDMV 520

Query: 181  DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360
              GK NQDAALA+G+GAAMKEL +LP+ ELERRCRHNGLSLVGGREMMVARLL LEEAEK
Sbjct: 521  VGGKTNQDAALAMGRGAAMKELMSLPLAELERRCRHNGLSLVGGREMMVARLLSLEEAEK 580

Query: 361  QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLP 540
            QRGFE+D+ELK AH+Q SSG+Y+S Q+E + E D     V     D+ + S GR S+ L 
Sbjct: 581  QRGFELDEELKYAHNQVSSGKYSSNQRETSEEPD----PVWNHYGDEDLQSQGRSSVPLS 636

Query: 541  PKDLNLQPDIN--SSEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENAGD 714
            P     QP++   + + KN+ +LP+SKWA                + L+YSSSGSEN GD
Sbjct: 637  PTLPIAQPELKAFTKKEKNDPVLPASKWAWEGDESDDEQRRSGKNIGLSYSSSGSENVGD 696

Query: 715  VLSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIERK 894
             L K +E E   D   S H D GMNEEQRQKLRRLEVAL+EYRESLEE+G+K+ EEIE+K
Sbjct: 697  GLVKADESESAADTRFSAHADSGMNEEQRQKLRRLEVALIEYRESLEERGVKNLEEIEKK 756

Query: 895  VAARRSRLQAEYGLVNSDADASGRKKSSLER 987
            V + R RLQ EYGL +S  D  G +++S  R
Sbjct: 757  VQSHRKRLQVEYGLSDSGEDGHGHRRTSERR 787


>ref|XP_006585860.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like
            isoform X1 [Glycine max] gi|571473234|ref|XP_006585861.1|
            PREDICTED: U2 snRNP-associated SURP motif-containing
            protein-like isoform X2 [Glycine max]
          Length = 969

 Score =  379 bits (972), Expect = e-102
 Identities = 202/331 (61%), Positives = 240/331 (72%), Gaps = 2/331 (0%)
 Frame = +1

Query: 1    KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180
            KERV+KVLQVWADWFLFSDAYVNGLRATFLR  NSGVIPFHSICGDAPE+E+   S +  
Sbjct: 556  KERVLKVLQVWADWFLFSDAYVNGLRATFLRPGNSGVIPFHSICGDAPEIEQNTTSKDMV 615

Query: 181  DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360
              GK NQDAALA+G+GAAMKEL +LP+ ELERRCRHNGLSLVGGREMMVARLL LEEAEK
Sbjct: 616  VGGKTNQDAALAMGRGAAMKELMSLPLAELERRCRHNGLSLVGGREMMVARLLSLEEAEK 675

Query: 361  QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLP 540
            QRGFE+D+ELK AH+Q SSG+Y+S Q+E + E D     V     D+ + S GR S+ L 
Sbjct: 676  QRGFELDEELKYAHNQVSSGKYSSNQRETSEEPD----PVWNHYGDEDLQSQGRSSVPLS 731

Query: 541  PKDLNLQPDIN--SSEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENAGD 714
            P     QP++   + + KN+ +LP+SKWA                + L+YSSSGSEN GD
Sbjct: 732  PTLPIAQPELKAFTKKEKNDPVLPASKWAWEGDESDDEQRRSGKNIGLSYSSSGSENVGD 791

Query: 715  VLSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIERK 894
             L K +E E   D   S H D GMNEEQRQKLRRLEVAL+EYRESLEE+G+K+ EEIE+K
Sbjct: 792  GLVKADESESAADTRFSAHADSGMNEEQRQKLRRLEVALIEYRESLEERGVKNLEEIEKK 851

Query: 895  VAARRSRLQAEYGLVNSDADASGRKKSSLER 987
            V + R RLQ EYGL +S  D  G +++S  R
Sbjct: 852  VQSHRKRLQVEYGLSDSGEDGHGHRRTSERR 882


>gb|EXC01118.1| U2 snRNP-associated SURP motif-containing protein [Morus notabilis]
          Length = 999

 Score =  378 bits (971), Expect = e-102
 Identities = 203/331 (61%), Positives = 242/331 (73%), Gaps = 2/331 (0%)
 Frame = +1

Query: 1    KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180
            KERV+KVLQVWADWFLFSDAYVNGLRATFLR  NSGV PFHSICGDAPE+E+     +TG
Sbjct: 575  KERVLKVLQVWADWFLFSDAYVNGLRATFLRLGNSGVTPFHSICGDAPEIEKIISFEDTG 634

Query: 181  DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360
            DAGK N+DAALA+GKGAAM+EL NLP  ELERRCRHNGLSLVGGREMMVARLL LEEAEK
Sbjct: 635  DAGKTNEDAALAMGKGAAMQELMNLPFAELERRCRHNGLSLVGGREMMVARLLSLEEAEK 694

Query: 361  QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLP 540
            QRG+E+D++LK A   SSSGRY+ G++E N E +    S       D + S  +GS+ L 
Sbjct: 695  QRGYELDEDLKYAQGHSSSGRYSGGRRETNVEGEPMGSSGWNHYAGDEIDSQAKGSVPLA 754

Query: 541  PKDLNLQPDINS--SEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENAGD 714
                  QP++     + K++ +LP+SKWA                L L YSSSGSENAGD
Sbjct: 755  QTIPIPQPELKPFVKKEKSDPVLPASKWAREDDDSDDEQKRSSRGLGLGYSSSGSENAGD 814

Query: 715  VLSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIERK 894
              SK +E+E   D ++ V  D GM+EEQR+KLRRLE AL+EYRESLEE+GI+ PEEIERK
Sbjct: 815  GPSKADEMESAAD-SSVVQPDSGMSEEQRKKLRRLEAALIEYRESLEERGIRSPEEIERK 873

Query: 895  VAARRSRLQAEYGLVNSDADASGRKKSSLER 987
            V   R RL+AEYGL NS+ DA+G K++SLER
Sbjct: 874  VTMHRKRLEAEYGLSNSNKDAAGSKRASLER 904


>ref|XP_002324341.2| RNA recognition motif-containing family protein [Populus trichocarpa]
            gi|550317898|gb|EEF02906.2| RNA recognition
            motif-containing family protein [Populus trichocarpa]
          Length = 969

 Score =  378 bits (970), Expect = e-102
 Identities = 202/333 (60%), Positives = 245/333 (73%), Gaps = 4/333 (1%)
 Frame = +1

Query: 1    KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180
            KERV+KVLQVW+DWFLFSDAYVNGLRATFLR +NSGVIPFHSICGDAPE+E+K+ S +  
Sbjct: 557  KERVLKVLQVWSDWFLFSDAYVNGLRATFLRSSNSGVIPFHSICGDAPEIEKKSSSEDAV 616

Query: 181  DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360
            +  KINQDAALA+GKGAA+KEL NLP+ ELERRCRHNGLSLVGGREMMVARLL LEEAE+
Sbjct: 617  EGAKINQDAALAMGKGAAVKELMNLPLAELERRCRHNGLSLVGGREMMVARLLSLEEAER 676

Query: 361  QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNM--DDGMPSIGRGSML 534
            QRG+E+DD+LK A S SSS RY+S  +E N E +   +   G N+  +D MPS  +GS+ 
Sbjct: 677  QRGYELDDDLKIAQSNSSSSRYSSVHREMNVEAE--PVGSTGWNVYGEDEMPSQNKGSVS 734

Query: 535  LPPKDLNLQPDIN--SSEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENA 708
            +    L  QP++   + + KN+ +LP+SKWA                L L+YSSSGSENA
Sbjct: 735  VASTLLIKQPELKAFAKKEKNDPVLPASKWARDDDESDDEQKRSARDLGLSYSSSGSENA 794

Query: 709  GDVLSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIE 888
            GD   K +E+E  TDAN     D GMNEEQRQKLRRLEVAL+EYRESLEE+G+K   EIE
Sbjct: 795  GDGQGKADEMEFATDANIPTQPDSGMNEEQRQKLRRLEVALIEYRESLEERGMKSSVEIE 854

Query: 889  RKVAARRSRLQAEYGLVNSDADASGRKKSSLER 987
             KVA  R  L++EYGL +S+ D + +K  S ER
Sbjct: 855  GKVAIHRKWLESEYGLSSSNEDVTSKKSISSER 887


>ref|XP_002308714.1| RNA recognition motif-containing family protein [Populus trichocarpa]
            gi|222854690|gb|EEE92237.1| RNA recognition
            motif-containing family protein [Populus trichocarpa]
          Length = 988

 Score =  378 bits (970), Expect = e-102
 Identities = 215/417 (51%), Positives = 263/417 (63%), Gaps = 4/417 (0%)
 Frame = +1

Query: 1    KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180
            KERV+KVLQVW+DWFLFSDAYVNGLRATFLR +NSGVIPFHS+CGDAPE+E+K  + +T 
Sbjct: 567  KERVLKVLQVWSDWFLFSDAYVNGLRATFLRSSNSGVIPFHSMCGDAPEIEKKNSTEDTV 626

Query: 181  DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360
            D GK NQDAALA+GKGAA KEL +LP+ ELERRCRHNGLSLVGGRE MVARLL LEEAEK
Sbjct: 627  DGGKTNQDAALAMGKGAATKELMDLPLAELERRCRHNGLSLVGGRETMVARLLNLEEAEK 686

Query: 361  QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNM--DDGMPSIGRGSML 534
            QRG+E+D +LK A S SSS RY+S  +E N  +D G + + G N+  +D  PS  + S+ 
Sbjct: 687  QRGYELDGDLKIAQSNSSSSRYSSVHREVN--VDPGPVGLTGWNIYGEDDTPSQNKRSVS 744

Query: 535  LPPKDLNLQPDIN--SSEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENA 708
            L       QP++   + + KN+ +LP+SKWA                L L+YSSSGSENA
Sbjct: 745  LVSTLPIPQPELKAFAKKEKNDPVLPASKWARDDDESDDEQKRSVRDLGLSYSSSGSENA 804

Query: 709  GDVLSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIE 888
            GD   K +E+E  TDA+     + GMNEEQRQKLRRLEVAL+EYRESLEEQG+K+ EE E
Sbjct: 805  GDGQGKEDEMEFATDASIPTQPESGMNEEQRQKLRRLEVALIEYRESLEEQGMKNSEEFE 864

Query: 889  RKVAARRSRLQAEYGLVNSDADASGRKKSSLERGGXXXXXXXXXXXXXXXXXXXXXXXXX 1068
            RKVA  R RL++EYGL +S+ D +G K+ S ER                           
Sbjct: 865  RKVAVHRKRLESEYGLSSSNEDVTGNKRISSERRDRRDDNHESSRKRHRSESRSESPQRK 924

Query: 1069 XXXXXXERESDANSNXXXXXXXXXPQELXXXXXXXXXXXXXXXXXXXKERDDHDRER 1239
                  ERE D++ +            L                   KERDDHDR+R
Sbjct: 925  LSLRDREREHDSDKDRERHRERDRGNNL----ESERRDRDYREKSGSKERDDHDRDR 977


>ref|XP_006599196.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like
            [Glycine max]
          Length = 969

 Score =  377 bits (967), Expect = e-102
 Identities = 203/331 (61%), Positives = 241/331 (72%), Gaps = 2/331 (0%)
 Frame = +1

Query: 1    KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180
            KERV+KVLQVWADWFLFSDAYVNGLRATFLR  NSGVIPFHSICGDAPE+E+K  S +  
Sbjct: 556  KERVLKVLQVWADWFLFSDAYVNGLRATFLRPGNSGVIPFHSICGDAPEIEQKTASEDMV 615

Query: 181  DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360
              GK NQDAALA+G+GAAMKEL +LP+ ELERRCRHNGLSLVGGREMMVARLL LEEAEK
Sbjct: 616  VGGKTNQDAALAMGRGAAMKELMSLPLAELERRCRHNGLSLVGGREMMVARLLSLEEAEK 675

Query: 361  QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLP 540
            Q+GFE+DDELK AH+Q SSG+Y+S Q+E ++E+D   +S      D+ + S GR S+ L 
Sbjct: 676  QKGFELDDELKYAHNQVSSGKYSSNQRETSAELDPVGLSAWNHYGDEDIQSQGRSSVPLA 735

Query: 541  PKDLNLQPDIN--SSEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENAGD 714
            P     QP +   + + KN+ +LP+SKWA                L L+YSSSGSEN  D
Sbjct: 736  PTLPIPQPKLKAFTKKEKNDPVLPASKWA-REDDESDDEQRSGKNLGLSYSSSGSENVDD 794

Query: 715  VLSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIERK 894
             L K +E E   D + S H D GMNEEQRQKLRRLEVAL+EY ESLEE+GIK+ EEIE+K
Sbjct: 795  GLVKADESESAADRSFSAHADSGMNEEQRQKLRRLEVALIEYGESLEERGIKNLEEIEKK 854

Query: 895  VAARRSRLQAEYGLVNSDADASGRKKSSLER 987
            V   R RLQ EYGL +S  D  G +++S  R
Sbjct: 855  VQLHRKRLQVEYGLSDSGEDGQGNRRTSERR 885


>emb|CAN79213.1| hypothetical protein VITISV_025939 [Vitis vinifera]
          Length = 1384

 Score =  377 bits (967), Expect = e-102
 Identities = 203/326 (62%), Positives = 241/326 (73%), Gaps = 3/326 (0%)
 Frame = +1

Query: 1    KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180
            KERVMKVLQVWADWFLFSDAYVNGLRATFLR  NSGV PFHSICGDAPE+E+K  S +TG
Sbjct: 717  KERVMKVLQVWADWFLFSDAYVNGLRATFLRSGNSGVTPFHSICGDAPEIEKKTSSEDTG 776

Query: 181  DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360
            + GK NQDAALA+GKGAAMKEL +LPI ELERRCRHNGLSLVGGRE+MVARLL LEEAEK
Sbjct: 777  EGGKSNQDAALAMGKGAAMKELLSLPIAELERRCRHNGLSLVGGREIMVARLLSLEEAEK 836

Query: 361  QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLP 540
            QRG+++DD+LK A S S+SGRY S +KE   E +   +S      +D + S G+GS+ L 
Sbjct: 837  QRGYDLDDDLKYAQSHSNSGRYPSSRKEIGVETESVGLSGWNRYGEDEIQSQGKGSVPLA 896

Query: 541  PKDLNLQPDIN--SSEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENAGD 714
            P     QP++   +++GK + +LP+SKWA                L L+YSSSGSENAGD
Sbjct: 897  PTIPIPQPELKAFTNKGKTDPVLPASKWAREDDDSDDEQKRSARGLGLSYSSSGSENAGD 956

Query: 715  VLSKTEELEVTTDANNSVHLDGG-MNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIER 891
               K +E+E  T+++     D G MNEE RQKLRRLEVAL+EYRESLEE+GIK  EEIER
Sbjct: 957  GPXKADEMEFATESSIPSQPDSGMMNEEHRQKLRRLEVALIEYRESLEERGIKSSEEIER 1016

Query: 892  KVAARRSRLQAEYGLVNSDADASGRK 969
            KVA  R RLQ+EYGL +S+ D S  K
Sbjct: 1017 KVAIHRKRLQSEYGLSDSNEDVSWNK 1042


>ref|XP_006353899.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like
            isoform X3 [Solanum tuberosum]
          Length = 857

 Score =  375 bits (963), Expect = e-101
 Identities = 206/334 (61%), Positives = 244/334 (73%), Gaps = 5/334 (1%)
 Frame = +1

Query: 1    KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180
            KERV+KVLQVWADWFLFSDAYVNGLRATFLR  NSGV PFHS+CGDAP++E++A S + G
Sbjct: 460  KERVLKVLQVWADWFLFSDAYVNGLRATFLRTGNSGVTPFHSLCGDAPDVEQRASSDDAG 519

Query: 181  DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360
            D GKIN D ALAIGKGAAMKEL +LP+ ELERRCRHNGLS+VGGREMMVARLLYLEEAEK
Sbjct: 520  DGGKINPDGALAIGKGAAMKELLSLPLTELERRCRHNGLSIVGGREMMVARLLYLEEAEK 579

Query: 361  QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLP 540
            QRG E+D++LK A S SSS R+ S +K+ N E+D    S R   +D  +    R S  + 
Sbjct: 580  QRGHELDEDLKFA-SHSSSARFPSTRKDSNLELDRMAPSERNSQVDYDVQLKQRES--VS 636

Query: 541  PKDLNLQPDIN----SSEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENA 708
                N  P  N    SSEGK+E+ILP+SKWA                L L YSSSGSENA
Sbjct: 637  SHQTNSAPHYNSIDFSSEGKSETILPTSKWAREDDESDDEQKRSSRDLGLTYSSSGSENA 696

Query: 709  GDVLSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIE 888
            GD ++K ++ E+TTD +NS + + GMNEE RQKLRRLEVAL+EYRESLEEQGIK+ +EIE
Sbjct: 697  GDGINKIKDAELTTDTSNSAYPESGMNEELRQKLRRLEVALIEYRESLEEQGIKNLDEIE 756

Query: 889  RKVAARRSRLQAEYGLVNSDADASGR-KKSSLER 987
            RKV   R  LQ+EYGL+N   D S +  +SS ER
Sbjct: 757  RKVEIHRQCLQSEYGLLNFSEDTSKKGGRSSSER 790


>ref|XP_006353898.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like
            isoform X2 [Solanum tuberosum]
          Length = 947

 Score =  375 bits (963), Expect = e-101
 Identities = 206/334 (61%), Positives = 244/334 (73%), Gaps = 5/334 (1%)
 Frame = +1

Query: 1    KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180
            KERV+KVLQVWADWFLFSDAYVNGLRATFLR  NSGV PFHS+CGDAP++E++A S + G
Sbjct: 550  KERVLKVLQVWADWFLFSDAYVNGLRATFLRTGNSGVTPFHSLCGDAPDVEQRASSDDAG 609

Query: 181  DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360
            D GKIN D ALAIGKGAAMKEL +LP+ ELERRCRHNGLS+VGGREMMVARLLYLEEAEK
Sbjct: 610  DGGKINPDGALAIGKGAAMKELLSLPLTELERRCRHNGLSIVGGREMMVARLLYLEEAEK 669

Query: 361  QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLP 540
            QRG E+D++LK A S SSS R+ S +K+ N E+D    S R   +D  +    R S  + 
Sbjct: 670  QRGHELDEDLKFA-SHSSSARFPSTRKDSNLELDRMAPSERNSQVDYDVQLKQRES--VS 726

Query: 541  PKDLNLQPDIN----SSEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENA 708
                N  P  N    SSEGK+E+ILP+SKWA                L L YSSSGSENA
Sbjct: 727  SHQTNSAPHYNSIDFSSEGKSETILPTSKWAREDDESDDEQKRSSRDLGLTYSSSGSENA 786

Query: 709  GDVLSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIE 888
            GD ++K ++ E+TTD +NS + + GMNEE RQKLRRLEVAL+EYRESLEEQGIK+ +EIE
Sbjct: 787  GDGINKIKDAELTTDTSNSAYPESGMNEELRQKLRRLEVALIEYRESLEEQGIKNLDEIE 846

Query: 889  RKVAARRSRLQAEYGLVNSDADASGR-KKSSLER 987
            RKV   R  LQ+EYGL+N   D S +  +SS ER
Sbjct: 847  RKVEIHRQCLQSEYGLLNFSEDTSKKGGRSSSER 880


>ref|XP_006353897.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like
            isoform X1 [Solanum tuberosum]
          Length = 948

 Score =  375 bits (963), Expect = e-101
 Identities = 206/334 (61%), Positives = 244/334 (73%), Gaps = 5/334 (1%)
 Frame = +1

Query: 1    KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180
            KERV+KVLQVWADWFLFSDAYVNGLRATFLR  NSGV PFHS+CGDAP++E++A S + G
Sbjct: 551  KERVLKVLQVWADWFLFSDAYVNGLRATFLRTGNSGVTPFHSLCGDAPDVEQRASSDDAG 610

Query: 181  DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360
            D GKIN D ALAIGKGAAMKEL +LP+ ELERRCRHNGLS+VGGREMMVARLLYLEEAEK
Sbjct: 611  DGGKINPDGALAIGKGAAMKELLSLPLTELERRCRHNGLSIVGGREMMVARLLYLEEAEK 670

Query: 361  QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLP 540
            QRG E+D++LK A S SSS R+ S +K+ N E+D    S R   +D  +    R S  + 
Sbjct: 671  QRGHELDEDLKFA-SHSSSARFPSTRKDSNLELDRMAPSERNSQVDYDVQLKQRES--VS 727

Query: 541  PKDLNLQPDIN----SSEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENA 708
                N  P  N    SSEGK+E+ILP+SKWA                L L YSSSGSENA
Sbjct: 728  SHQTNSAPHYNSIDFSSEGKSETILPTSKWAREDDESDDEQKRSSRDLGLTYSSSGSENA 787

Query: 709  GDVLSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIE 888
            GD ++K ++ E+TTD +NS + + GMNEE RQKLRRLEVAL+EYRESLEEQGIK+ +EIE
Sbjct: 788  GDGINKIKDAELTTDTSNSAYPESGMNEELRQKLRRLEVALIEYRESLEEQGIKNLDEIE 847

Query: 889  RKVAARRSRLQAEYGLVNSDADASGR-KKSSLER 987
            RKV   R  LQ+EYGL+N   D S +  +SS ER
Sbjct: 848  RKVEIHRQCLQSEYGLLNFSEDTSKKGGRSSSER 881


>ref|XP_004291970.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like
            [Fragaria vesca subsp. vesca]
          Length = 980

 Score =  372 bits (956), Expect = e-100
 Identities = 200/332 (60%), Positives = 248/332 (74%), Gaps = 3/332 (0%)
 Frame = +1

Query: 1    KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180
            KERV+KVLQVW+DWFLFSDAYVNGLRATFLR  NSGV+PFHS+CGDAP++E+K  S + G
Sbjct: 557  KERVLKVLQVWSDWFLFSDAYVNGLRATFLRSGNSGVVPFHSVCGDAPDIEKKTTSEDAG 616

Query: 181  DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360
            DA K NQDAALA+GKGAA +EL NLP+ ELERRCRHNGLSLVGGREMMVARLL LEEAEK
Sbjct: 617  DA-KTNQDAALAMGKGAATRELLNLPMAELERRCRHNGLSLVGGREMMVARLLSLEEAEK 675

Query: 361  QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLP 540
            QRG+E+DD+LK   + SSSGR++S +KE N E D   +S     ++D + S G+ S+   
Sbjct: 676  QRGYELDDDLKYGQNHSSSGRHSSSRKEMNIEPDPLGLSGWNRYVEDEIQSEGKVSLSKA 735

Query: 541  PKDLNLQPDIN--SSEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENAGD 714
                + QP++   +++ K++ +LP+SKWA                L L+Y SSGSENAGD
Sbjct: 736  QTHTSPQPELKPFTTKEKSDPVLPASKWAREDDDSDDDQKRSAKGLGLSY-SSGSENAGD 794

Query: 715  VLSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIERK 894
              SK +E+EV TD       D G++EEQRQKLRRLEV+L+EYRESLEE+GI+ PEEIERK
Sbjct: 795  GPSKADEMEVATDVRIPAQPDSGLSEEQRQKLRRLEVSLLEYRESLEERGIRSPEEIERK 854

Query: 895  VAARRSRLQAEYGLVNSDADASGR-KKSSLER 987
            VA  R RL++EYGL +S  DASGR K++S ER
Sbjct: 855  VAIHRKRLESEYGLSDSSEDASGRSKRTSSER 886


>ref|XP_007011694.1| RNA recognition motif-containing protein isoform 4 [Theobroma cacao]
            gi|590571807|ref|XP_007011695.1| RNA recognition
            motif-containing protein isoform 4 [Theobroma cacao]
            gi|508782057|gb|EOY29313.1| RNA recognition
            motif-containing protein isoform 4 [Theobroma cacao]
            gi|508782058|gb|EOY29314.1| RNA recognition
            motif-containing protein isoform 4 [Theobroma cacao]
          Length = 811

 Score =  372 bits (954), Expect = e-100
 Identities = 200/331 (60%), Positives = 245/331 (74%), Gaps = 2/331 (0%)
 Frame = +1

Query: 1    KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180
            KERV+KVLQVW+DWFLFSDAYVNGLRATFLR  NSGV PFHSICGDAPE+E+   S + G
Sbjct: 383  KERVLKVLQVWSDWFLFSDAYVNGLRATFLRSGNSGVAPFHSICGDAPEIEKNTSSEDAG 442

Query: 181  DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360
            D  K NQDAALA+GKGAAM+EL +LP+ ELERRCRHNGLSLVGGRE+MVARLL LE+AEK
Sbjct: 443  DGIKGNQDAALAMGKGAAMRELMDLPLAELERRCRHNGLSLVGGREIMVARLLSLEDAEK 502

Query: 361  QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLP 540
            QR +E+DD+LK A S+SSS RY+SGQ++ N+E +   +S      D+ + S  +GS+ L 
Sbjct: 503  QRSYELDDDLKLAQSRSSSCRYSSGQRDINAEAEPVGLSGWTHYADNEIHSQRKGSVPLA 562

Query: 541  PKDLNLQPDINS--SEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENAGD 714
                  QP+I +   + K + +LP+SKW+                L L+YSSSGSENAGD
Sbjct: 563  ETLPIPQPEIKAFLKKEKIDPVLPASKWSREDDDSDDEEKRSTRGLGLSYSSSGSENAGD 622

Query: 715  VLSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIERK 894
              SK +ELE  TDA+     +  MNEEQRQKLRRLEVAL+EYRESLEE+GIK  E+IER+
Sbjct: 623  GTSKADELEFGTDASIPAPSESAMNEEQRQKLRRLEVALIEYRESLEERGIKSAEDIERR 682

Query: 895  VAARRSRLQAEYGLVNSDADASGRKKSSLER 987
            VAA R RL++EYGL +S  D SGRK++S ER
Sbjct: 683  VAAHRKRLESEYGLSDSSEDISGRKRTSSER 713


>ref|XP_007011693.1| RNA recognition motif-containing protein isoform 3 [Theobroma cacao]
            gi|508782056|gb|EOY29312.1| RNA recognition
            motif-containing protein isoform 3 [Theobroma cacao]
          Length = 819

 Score =  372 bits (954), Expect = e-100
 Identities = 200/331 (60%), Positives = 245/331 (74%), Gaps = 2/331 (0%)
 Frame = +1

Query: 1    KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180
            KERV+KVLQVW+DWFLFSDAYVNGLRATFLR  NSGV PFHSICGDAPE+E+   S + G
Sbjct: 391  KERVLKVLQVWSDWFLFSDAYVNGLRATFLRSGNSGVAPFHSICGDAPEIEKNTSSEDAG 450

Query: 181  DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360
            D  K NQDAALA+GKGAAM+EL +LP+ ELERRCRHNGLSLVGGRE+MVARLL LE+AEK
Sbjct: 451  DGIKGNQDAALAMGKGAAMRELMDLPLAELERRCRHNGLSLVGGREIMVARLLSLEDAEK 510

Query: 361  QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLP 540
            QR +E+DD+LK A S+SSS RY+SGQ++ N+E +   +S      D+ + S  +GS+ L 
Sbjct: 511  QRSYELDDDLKLAQSRSSSCRYSSGQRDINAEAEPVGLSGWTHYADNEIHSQRKGSVPLA 570

Query: 541  PKDLNLQPDINS--SEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENAGD 714
                  QP+I +   + K + +LP+SKW+                L L+YSSSGSENAGD
Sbjct: 571  ETLPIPQPEIKAFLKKEKIDPVLPASKWSREDDDSDDEEKRSTRGLGLSYSSSGSENAGD 630

Query: 715  VLSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIERK 894
              SK +ELE  TDA+     +  MNEEQRQKLRRLEVAL+EYRESLEE+GIK  E+IER+
Sbjct: 631  GTSKADELEFGTDASIPAPSESAMNEEQRQKLRRLEVALIEYRESLEERGIKSAEDIERR 690

Query: 895  VAARRSRLQAEYGLVNSDADASGRKKSSLER 987
            VAA R RL++EYGL +S  D SGRK++S ER
Sbjct: 691  VAAHRKRLESEYGLSDSSEDISGRKRTSSER 721


>ref|XP_007011691.1| RNA recognition motif-containing protein isoform 1 [Theobroma cacao]
            gi|508782054|gb|EOY29310.1| RNA recognition
            motif-containing protein isoform 1 [Theobroma cacao]
          Length = 985

 Score =  372 bits (954), Expect = e-100
 Identities = 200/331 (60%), Positives = 245/331 (74%), Gaps = 2/331 (0%)
 Frame = +1

Query: 1    KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180
            KERV+KVLQVW+DWFLFSDAYVNGLRATFLR  NSGV PFHSICGDAPE+E+   S + G
Sbjct: 557  KERVLKVLQVWSDWFLFSDAYVNGLRATFLRSGNSGVAPFHSICGDAPEIEKNTSSEDAG 616

Query: 181  DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360
            D  K NQDAALA+GKGAAM+EL +LP+ ELERRCRHNGLSLVGGRE+MVARLL LE+AEK
Sbjct: 617  DGIKGNQDAALAMGKGAAMRELMDLPLAELERRCRHNGLSLVGGREIMVARLLSLEDAEK 676

Query: 361  QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMDMGQISVRGGNMDDGMPSIGRGSMLLP 540
            QR +E+DD+LK A S+SSS RY+SGQ++ N+E +   +S      D+ + S  +GS+ L 
Sbjct: 677  QRSYELDDDLKLAQSRSSSCRYSSGQRDINAEAEPVGLSGWTHYADNEIHSQRKGSVPLA 736

Query: 541  PKDLNLQPDINS--SEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENAGD 714
                  QP+I +   + K + +LP+SKW+                L L+YSSSGSENAGD
Sbjct: 737  ETLPIPQPEIKAFLKKEKIDPVLPASKWSREDDDSDDEEKRSTRGLGLSYSSSGSENAGD 796

Query: 715  VLSKTEELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIERK 894
              SK +ELE  TDA+     +  MNEEQRQKLRRLEVAL+EYRESLEE+GIK  E+IER+
Sbjct: 797  GTSKADELEFGTDASIPAPSESAMNEEQRQKLRRLEVALIEYRESLEERGIKSAEDIERR 856

Query: 895  VAARRSRLQAEYGLVNSDADASGRKKSSLER 987
            VAA R RL++EYGL +S  D SGRK++S ER
Sbjct: 857  VAAHRKRLESEYGLSDSSEDISGRKRTSSER 887


>ref|XP_002515412.1| RNA binding protein, putative [Ricinus communis]
            gi|223545356|gb|EEF46861.1| RNA binding protein, putative
            [Ricinus communis]
          Length = 979

 Score =  371 bits (952), Expect = e-100
 Identities = 202/333 (60%), Positives = 247/333 (74%), Gaps = 4/333 (1%)
 Frame = +1

Query: 1    KERVMKVLQVWADWFLFSDAYVNGLRATFLRFNNSGVIPFHSICGDAPELERKAGSAETG 180
            KERVMKVLQVW+DWFLFSDAYVNGLRATFLR + SGVIPFHSICGDAP +E+K  S +TG
Sbjct: 555  KERVMKVLQVWSDWFLFSDAYVNGLRATFLRSSTSGVIPFHSICGDAPAIEKKVTSEDTG 614

Query: 181  DAGKINQDAALAIGKGAAMKELSNLPIPELERRCRHNGLSLVGGREMMVARLLYLEEAEK 360
            D GK +QDAALA+GKGAAMKEL +LP+ ELERRCRHNGLSLVGGREMMVARLL LEEAEK
Sbjct: 615  DGGKTSQDAALAMGKGAAMKELLSLPLAELERRCRHNGLSLVGGREMMVARLLSLEEAEK 674

Query: 361  QRGFEIDDELKSAHSQSSSGRYTSGQKEPNSEMD-MGQISVRGGNMDDGMPSIGRGSMLL 537
            QRG+E+DD LK + S  SS +++SG++E N E++ + + +V G   +D + S  R S  L
Sbjct: 675  QRGYELDDNLKVSQSHLSSSKFSSGRRETNVELEPVSEWNVYG---EDDVQSQSRASASL 731

Query: 538  PPKDL-NLQPDINSSEGKNESILPSSKWAXXXXXXXXXXXXXXXXLALAYSSSGSENAGD 714
                +   +    + + KN+ +LP+SKWA                L L+YSSSGSENAGD
Sbjct: 732  ATFPIPQAELKAFTKKEKNDPVLPASKWARDDDDSDDEQKRSSRGLGLSYSSSGSENAGD 791

Query: 715  VLSKT-EELEVTTDANNSVHLDGGMNEEQRQKLRRLEVALMEYRESLEEQGIKDPEEIER 891
             L K  +E+E  TD + SV  D GMNEEQRQKLRRLEVAL+EYRESLEE+G+K  EEIER
Sbjct: 792  GLGKADDEMEFATDGSISVQPDSGMNEEQRQKLRRLEVALIEYRESLEERGMKSAEEIER 851

Query: 892  KVAARRSRLQAEYGLVNSDADASGR-KKSSLER 987
            KVA+ R RLQ++YGL++S  D  G  K++S ER
Sbjct: 852  KVASHRKRLQSDYGLLDSSQDTPGNSKRASSER 884

