BLASTX nr result

ID: Sinomenium21_contig00009350

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00009350
         (1183 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN79213.1| hypothetical protein VITISV_025939 [Vitis vinifera]   318   3e-84
ref|XP_002324341.2| RNA recognition motif-containing family prot...   311   3e-82
ref|XP_004291970.1| PREDICTED: U2 snRNP-associated SURP motif-co...   310   8e-82
ref|XP_006599196.1| PREDICTED: U2 snRNP-associated SURP motif-co...   306   1e-80
gb|EXC01118.1| U2 snRNP-associated SURP motif-containing protein...   302   2e-79
ref|XP_002308714.1| RNA recognition motif-containing family prot...   302   2e-79
ref|XP_007011694.1| RNA recognition motif-containing protein iso...   301   5e-79
ref|XP_007011693.1| RNA recognition motif-containing protein iso...   301   5e-79
ref|XP_007011691.1| RNA recognition motif-containing protein iso...   301   5e-79
ref|XP_002515412.1| RNA binding protein, putative [Ricinus commu...   299   1e-78
ref|XP_006858350.1| hypothetical protein AMTR_s00064p00173090 [A...   298   2e-78
ref|XP_006483724.1| PREDICTED: LOW QUALITY PROTEIN: U2 snRNP-ass...   296   2e-77
ref|XP_007156303.1| hypothetical protein PHAVU_003G2751000g, par...   296   2e-77
ref|XP_006450262.1| hypothetical protein CICLE_v10007357mg [Citr...   296   2e-77
ref|XP_004509625.1| PREDICTED: U2 snRNP-associated SURP motif-co...   292   2e-76
ref|XP_004509622.1| PREDICTED: U2 snRNP-associated SURP motif-co...   292   2e-76
ref|XP_007225360.1| hypothetical protein PRUPE_ppa000894mg [Prun...   292   2e-76
emb|CBI21155.3| unnamed protein product [Vitis vinifera]              287   5e-75
ref|XP_003628951.1| U2-associated protein SR140 [Medicago trunca...   285   2e-74
ref|XP_006585862.1| PREDICTED: U2 snRNP-associated SURP motif-co...   282   2e-73

>emb|CAN79213.1| hypothetical protein VITISV_025939 [Vitis vinifera]
          Length = 1384

 Score =  318 bits (815), Expect = 3e-84
 Identities = 177/309 (57%), Positives = 204/309 (66%), Gaps = 6/309 (1%)
 Frame = -1

Query: 1183 ATFIRSGNSGVIPFHSICGDAPEIENKTDTEVVAEGSKLNQDTALAMGKGAAVKXXXXXX 1004
            ATF+RSGNSGV PFHSICGDAPEIE KT +E   EG K NQD ALAMGKGAA+K      
Sbjct: 743  ATFLRSGNSGVTPFHSICGDAPEIEKKTSSEDTGEGGKSNQDAALAMGKGAAMKELLSLP 802

Query: 1003 XXXXERRCRHNGLSLVGGREMMVARLLSLEEAERQASEIRDDDTKYRQSHSNSGRFSKNE 824
                ERRCRHNGLSLVGGRE+MVARLLSLEEAE+Q     DDD KY QSHSNSGR+  + 
Sbjct: 803  IAELERRCRHNGLSLVGGREIMVARLLSLEEAEKQRGYDLDDDLKYAQSHSNSGRYPSSR 862

Query: 823  SSWNANDDACREAYHGIEPIGSSGWNHYGEDAMGQQGKSSLTVAPT-SLTQSDLKASAKK 647
                      +E     E +G SGWN YGED +  QGK S+ +APT  + Q +LKA   K
Sbjct: 863  ----------KEIGVETESVGLSGWNRYGEDEIQSQGKGSVPLAPTIPIPQPELKAFTNK 912

Query: 646  ENSDPILPVSKWAREDDGSDDEDKGTAQ----XXXXXXXXXXXXXXSRXXXXXXXXXVGV 479
              +DP+LP SKWAREDD SDDE K +A+                   +           +
Sbjct: 913  GKTDPVLPASKWAREDDDSDDEQKRSARGLGLSYSSSGSENAGDGPXKADEMEFATESSI 972

Query: 478  SSQPDSS-MNEEQRQKLRRMEVNLIEYREYLEERGIRSPEEIERKVAIHRRRLQVDFGIS 302
             SQPDS  MNEE RQKLRR+EV LIEYRE LEERGI+S EEIERKVAIHR+RLQ ++G+S
Sbjct: 973  PSQPDSGMMNEEHRQKLRRLEVALIEYRESLEERGIKSSEEIERKVAIHRKRLQSEYGLS 1032

Query: 301  DSNDDVQGN 275
            DSN+DV  N
Sbjct: 1033 DSNEDVSWN 1041


>ref|XP_002324341.2| RNA recognition motif-containing family protein [Populus trichocarpa]
            gi|550317898|gb|EEF02906.2| RNA recognition
            motif-containing family protein [Populus trichocarpa]
          Length = 969

 Score =  311 bits (798), Expect = 3e-82
 Identities = 173/305 (56%), Positives = 203/305 (66%), Gaps = 5/305 (1%)
 Frame = -1

Query: 1183 ATFIRSGNSGVIPFHSICGDAPEIENKTDTEVVAEGSKLNQDTALAMGKGAAVKXXXXXX 1004
            ATF+RS NSGVIPFHSICGDAPEIE K+ +E   EG+K+NQD ALAMGKGAAVK      
Sbjct: 583  ATFLRSSNSGVIPFHSICGDAPEIEKKSSSEDAVEGAKINQDAALAMGKGAAVKELMNLP 642

Query: 1003 XXXXERRCRHNGLSLVGGREMMVARLLSLEEAERQASEIRDDDTKYRQSHSNSGRFSKNE 824
                ERRCRHNGLSLVGGREMMVARLLSLEEAERQ     DDD K  QS+S+S R+S   
Sbjct: 643  LAELERRCRHNGLSLVGGREMMVARLLSLEEAERQRGYELDDDLKIAQSNSSSSRYSSVH 702

Query: 823  SSWNANDDACREAYHGIEPIGSSGWNHYGEDAMGQQGKSSLTVAPTSL-TQSDLKASAKK 647
               N             EP+GS+GWN YGED M  Q K S++VA T L  Q +LKA AKK
Sbjct: 703  REMNVE----------AEPVGSTGWNVYGEDEMPSQNKGSVSVASTLLIKQPELKAFAKK 752

Query: 646  ENSDPILPVSKWAREDDGSDDEDKGTAQ----XXXXXXXXXXXXXXSRXXXXXXXXXVGV 479
            E +DP+LP SKWAR+DD SDDE K +A+                   +           +
Sbjct: 753  EKNDPVLPASKWARDDDESDDEQKRSARDLGLSYSSSGSENAGDGQGKADEMEFATDANI 812

Query: 478  SSQPDSSMNEEQRQKLRRMEVNLIEYREYLEERGIRSPEEIERKVAIHRRRLQVDFGISD 299
             +QPDS MNEEQRQKLRR+EV LIEYRE LEERG++S  EIE KVAIHR+ L+ ++G+S 
Sbjct: 813  PTQPDSGMNEEQRQKLRRLEVALIEYRESLEERGMKSSVEIEGKVAIHRKWLESEYGLSS 872

Query: 298  SNDDV 284
            SN+DV
Sbjct: 873  SNEDV 877


>ref|XP_004291970.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like
            [Fragaria vesca subsp. vesca]
          Length = 980

 Score =  310 bits (794), Expect = 8e-82
 Identities = 188/402 (46%), Positives = 229/402 (56%), Gaps = 8/402 (1%)
 Frame = -1

Query: 1183 ATFIRSGNSGVIPFHSICGDAPEIENKTDTEVVAEGSKLNQDTALAMGKGAAVKXXXXXX 1004
            ATF+RSGNSGV+PFHS+CGDAP+IE KT +E   + +K NQD ALAMGKGAA +      
Sbjct: 583  ATFLRSGNSGVVPFHSVCGDAPDIEKKTTSEDAGD-AKTNQDAALAMGKGAATRELLNLP 641

Query: 1003 XXXXERRCRHNGLSLVGGREMMVARLLSLEEAERQASEIRDDDTKYRQSHSNSGRFSKNE 824
                ERRCRHNGLSLVGGREMMVARLLSLEEAE+Q     DDD KY Q+HS+SGR S + 
Sbjct: 642  MAELERRCRHNGLSLVGGREMMVARLLSLEEAEKQRGYELDDDLKYGQNHSSSGRHSSSR 701

Query: 823  SSWNANDDACREAYHGIEPIGSSGWNHYGEDAMGQQGKSSLTVAPTSLT-QSDLKASAKK 647
               N   D          P+G SGWN Y ED +  +GK SL+ A T  + Q +LK    K
Sbjct: 702  KEMNIEPD----------PLGLSGWNRYVEDEIQSEGKVSLSKAQTHTSPQPELKPFTTK 751

Query: 646  ENSDPILPVSKWAREDDGSDDEDKGTAQXXXXXXXXXXXXXXS---RXXXXXXXXXVGVS 476
            E SDP+LP SKWAREDD SDD+ K +A+                  +         V + 
Sbjct: 752  EKSDPVLPASKWAREDDDSDDDQKRSAKGLGLSYSSGSENAGDGPSKADEMEVATDVRIP 811

Query: 475  SQPDSSMNEEQRQKLRRMEVNLIEYREYLEERGIRSPEEIERKVAIHRRRLQVDFGISDS 296
            +QPDS ++EEQRQKLRR+EV+L+EYRE LEERGIRSPEEIERKVAIHR+RL+ ++G+SDS
Sbjct: 812  AQPDSGLSEEQRQKLRRLEVSLLEYRESLEERGIRSPEEIERKVAIHRKRLESEYGLSDS 871

Query: 295  NDDVQG--NXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSTRDRDREKENXXXXX 122
            ++D  G                                      S+RDRDRE +      
Sbjct: 872  SEDASGRSKRTSSERKDRRDDDSRDASRKRHRSGSQSDSPLQKSSSRDRDREYDLDRDRE 931

Query: 121  XXXXXXXXRTH--EPXXXXXXXXXXXXXXXRDDHDRDKGRER 2
                    R H  E                RDDHDRD+GR+R
Sbjct: 932  RQRDRDRDRAHDFEGNRGRDWDRDKSGSRERDDHDRDRGRDR 973


>ref|XP_006599196.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like
            [Glycine max]
          Length = 969

 Score =  306 bits (783), Expect = 1e-80
 Identities = 166/307 (54%), Positives = 202/307 (65%), Gaps = 4/307 (1%)
 Frame = -1

Query: 1183 ATFIRSGNSGVIPFHSICGDAPEIENKTDTEVVAEGSKLNQDTALAMGKGAAVKXXXXXX 1004
            ATF+R GNSGVIPFHSICGDAPEIE KT +E +  G K NQD ALAMG+GAA+K      
Sbjct: 582  ATFLRPGNSGVIPFHSICGDAPEIEQKTASEDMVVGGKTNQDAALAMGRGAAMKELMSLP 641

Query: 1003 XXXXERRCRHNGLSLVGGREMMVARLLSLEEAERQASEIRDDDTKYRQSHSNSGRFSKNE 824
                ERRCRHNGLSLVGGREMMVARLLSLEEAE+Q     DD+ KY  +  +SG++S N+
Sbjct: 642  LAELERRCRHNGLSLVGGREMMVARLLSLEEAEKQKGFELDDELKYAHNQVSSGKYSSNQ 701

Query: 823  SSWNANDDACREAYHGIEPIGSSGWNHYGEDAMGQQGKSSLTVAPT-SLTQSDLKASAKK 647
                      RE    ++P+G S WNHYG++ +  QG+SS+ +APT  + Q  LKA  KK
Sbjct: 702  ----------RETSAELDPVGLSAWNHYGDEDIQSQGRSSVPLAPTLPIPQPKLKAFTKK 751

Query: 646  ENSDPILPVSKWAREDDGSDDED---KGTAQXXXXXXXXXXXXXXSRXXXXXXXXXVGVS 476
            E +DP+LP SKWAREDD SDDE    K                   +            S
Sbjct: 752  EKNDPVLPASKWAREDDESDDEQRSGKNLGLSYSSSGSENVDDGLVKADESESAADRSFS 811

Query: 475  SQPDSSMNEEQRQKLRRMEVNLIEYREYLEERGIRSPEEIERKVAIHRRRLQVDFGISDS 296
            +  DS MNEEQRQKLRR+EV LIEY E LEERGI++ EEIE+KV +HR+RLQV++G+SDS
Sbjct: 812  AHADSGMNEEQRQKLRRLEVALIEYGESLEERGIKNLEEIEKKVQLHRKRLQVEYGLSDS 871

Query: 295  NDDVQGN 275
             +D QGN
Sbjct: 872  GEDGQGN 878


>gb|EXC01118.1| U2 snRNP-associated SURP motif-containing protein [Morus notabilis]
          Length = 999

 Score =  302 bits (773), Expect = 2e-79
 Identities = 180/398 (45%), Positives = 217/398 (54%), Gaps = 4/398 (1%)
 Frame = -1

Query: 1183 ATFIRSGNSGVIPFHSICGDAPEIENKTDTEVVAEGSKLNQDTALAMGKGAAVKXXXXXX 1004
            ATF+R GNSGV PFHSICGDAPEIE     E   +  K N+D ALAMGKGAA++      
Sbjct: 601  ATFLRLGNSGVTPFHSICGDAPEIEKIISFEDTGDAGKTNEDAALAMGKGAAMQELMNLP 660

Query: 1003 XXXXERRCRHNGLSLVGGREMMVARLLSLEEAERQASEIRDDDTKYRQSHSNSGRFSKNE 824
                ERRCRHNGLSLVGGREMMVARLLSLEEAE+Q     D+D KY Q HS+SGR+S   
Sbjct: 661  FAELERRCRHNGLSLVGGREMMVARLLSLEEAEKQRGYELDEDLKYAQGHSSSGRYSGGR 720

Query: 823  SSWNANDDACREAYHGIEPIGSSGWNHYGEDAMGQQGKSSLTVAPT-SLTQSDLKASAKK 647
               N             EP+GSSGWNHY  D +  Q K S+ +A T  + Q +LK   KK
Sbjct: 721  RETNVEG----------EPMGSSGWNHYAGDEIDSQAKGSVPLAQTIPIPQPELKPFVKK 770

Query: 646  ENSDPILPVSKWAREDDGSDDEDKGTAQXXXXXXXXXXXXXXSRXXXXXXXXXVGVSS-- 473
            E SDP+LP SKWAREDD SDDE K +++                             S  
Sbjct: 771  EKSDPVLPASKWAREDDDSDDEQKRSSRGLGLGYSSSGSENAGDGPSKADEMESAADSSV 830

Query: 472  -QPDSSMNEEQRQKLRRMEVNLIEYREYLEERGIRSPEEIERKVAIHRRRLQVDFGISDS 296
             QPDS M+EEQR+KLRR+E  LIEYRE LEERGIRSPEEIERKV +HR+RL+ ++G+S+S
Sbjct: 831  VQPDSGMSEEQRKKLRRLEAALIEYRESLEERGIRSPEEIERKVTMHRKRLEAEYGLSNS 890

Query: 295  NDDVQGNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSTRDRDREKENXXXXXXX 116
            N D  G+                                   + RDR+RE +        
Sbjct: 891  NKDAAGSKRASLERRDRRDNSHETSRKRHRSRSRSDSPTRRSTNRDREREHDLDRDRERH 950

Query: 115  XXXXXXRTHEPXXXXXXXXXXXXXXXRDDHDRDKGRER 2
                  R H+                RDD++RD+GRER
Sbjct: 951  RERDRDRGHD-FENERGKREKSGSRERDDNERDRGRER 987


>ref|XP_002308714.1| RNA recognition motif-containing family protein [Populus trichocarpa]
            gi|222854690|gb|EEE92237.1| RNA recognition
            motif-containing family protein [Populus trichocarpa]
          Length = 988

 Score =  302 bits (773), Expect = 2e-79
 Identities = 177/399 (44%), Positives = 219/399 (54%), Gaps = 5/399 (1%)
 Frame = -1

Query: 1183 ATFIRSGNSGVIPFHSICGDAPEIENKTDTEVVAEGSKLNQDTALAMGKGAAVKXXXXXX 1004
            ATF+RS NSGVIPFHS+CGDAPEIE K  TE   +G K NQD ALAMGKGAA K      
Sbjct: 593  ATFLRSSNSGVIPFHSMCGDAPEIEKKNSTEDTVDGGKTNQDAALAMGKGAATKELMDLP 652

Query: 1003 XXXXERRCRHNGLSLVGGREMMVARLLSLEEAERQASEIRDDDTKYRQSHSNSGRFSKNE 824
                ERRCRHNGLSLVGGRE MVARLL+LEEAE+Q     D D K  QS+S+S R+S   
Sbjct: 653  LAELERRCRHNGLSLVGGRETMVARLLNLEEAEKQRGYELDGDLKIAQSNSSSSRYSSVH 712

Query: 823  SSWNANDDACREAYHGIEPIGSSGWNHYGEDAMGQQGKSSLTVAPT-SLTQSDLKASAKK 647
               N +            P+G +GWN YGED    Q K S+++  T  + Q +LKA AKK
Sbjct: 713  REVNVDPG----------PVGLTGWNIYGEDDTPSQNKRSVSLVSTLPIPQPELKAFAKK 762

Query: 646  ENSDPILPVSKWAREDDGSDDEDKGTAQ----XXXXXXXXXXXXXXSRXXXXXXXXXVGV 479
            E +DP+LP SKWAR+DD SDDE K + +                   +           +
Sbjct: 763  EKNDPVLPASKWARDDDESDDEQKRSVRDLGLSYSSSGSENAGDGQGKEDEMEFATDASI 822

Query: 478  SSQPDSSMNEEQRQKLRRMEVNLIEYREYLEERGIRSPEEIERKVAIHRRRLQVDFGISD 299
             +QP+S MNEEQRQKLRR+EV LIEYRE LEE+G+++ EE ERKVA+HR+RL+ ++G+S 
Sbjct: 823  PTQPESGMNEEQRQKLRRLEVALIEYRESLEEQGMKNSEEFERKVAVHRKRLESEYGLSS 882

Query: 298  SNDDVQGNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSTRDRDREKENXXXXXX 119
            SN+DV GN                                   S RDR+RE ++      
Sbjct: 883  SNEDVTGNKRISSERRDRRDDNHESSRKRHRSESRSESPQRKLSLRDREREHDSDKDRER 942

Query: 118  XXXXXXXRTHEPXXXXXXXXXXXXXXXRDDHDRDKGRER 2
                      E                RDDHDRD+GR+R
Sbjct: 943  HRERDRGNNLESERRDRDYREKSGSKERDDHDRDRGRDR 981


>ref|XP_007011694.1| RNA recognition motif-containing protein isoform 4 [Theobroma cacao]
            gi|590571807|ref|XP_007011695.1| RNA recognition
            motif-containing protein isoform 4 [Theobroma cacao]
            gi|508782057|gb|EOY29313.1| RNA recognition
            motif-containing protein isoform 4 [Theobroma cacao]
            gi|508782058|gb|EOY29314.1| RNA recognition
            motif-containing protein isoform 4 [Theobroma cacao]
          Length = 811

 Score =  301 bits (770), Expect = 5e-79
 Identities = 180/404 (44%), Positives = 225/404 (55%), Gaps = 10/404 (2%)
 Frame = -1

Query: 1183 ATFIRSGNSGVIPFHSICGDAPEIENKTDTEVVAEGSKLNQDTALAMGKGAAVKXXXXXX 1004
            ATF+RSGNSGV PFHSICGDAPEIE  T +E   +G K NQD ALAMGKGAA++      
Sbjct: 409  ATFLRSGNSGVAPFHSICGDAPEIEKNTSSEDAGDGIKGNQDAALAMGKGAAMRELMDLP 468

Query: 1003 XXXXERRCRHNGLSLVGGREMMVARLLSLEEAERQASEIRDDDTKYRQSHSNSGRFSKNE 824
                ERRCRHNGLSLVGGRE+MVARLLSLE+AE+Q S   DDD K  QS S+S R+S  +
Sbjct: 469  LAELERRCRHNGLSLVGGREIMVARLLSLEDAEKQRSYELDDDLKLAQSRSSSCRYSSGQ 528

Query: 823  SSWNANDDACREAYHGIEPIGSSGWNHYGEDAMGQQGKSSLTVAPT-SLTQSDLKASAKK 647
               NA            EP+G SGW HY ++ +  Q K S+ +A T  + Q ++KA  KK
Sbjct: 529  RDINAE----------AEPVGLSGWTHYADNEIHSQRKGSVPLAETLPIPQPEIKAFLKK 578

Query: 646  ENSDPILPVSKWAREDDGSDDEDK----GTAQXXXXXXXXXXXXXXSRXXXXXXXXXVGV 479
            E  DP+LP SKW+REDD SDDE+K    G                 S+           +
Sbjct: 579  EKIDPVLPASKWSREDDDSDDEEKRSTRGLGLSYSSSGSENAGDGTSKADELEFGTDASI 638

Query: 478  SSQPDSSMNEEQRQKLRRMEVNLIEYREYLEERGIRSPEEIERKVAIHRRRLQVDFGISD 299
             +  +S+MNEEQRQKLRR+EV LIEYRE LEERGI+S E+IER+VA HR+RL+ ++G+SD
Sbjct: 639  PAPSESAMNEEQRQKLRRLEVALIEYRESLEERGIKSAEDIERRVAAHRKRLESEYGLSD 698

Query: 298  SNDDVQGNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSTRDRDREKENXXXXXX 119
            S++D+ G                                    S RDRDRE ++      
Sbjct: 699  SSEDISGRKRTSSERRERRDDAHDSSRKRHRSQSRSESPPRKSSNRDRDRENDSVNDREK 758

Query: 118  XXXXXXXRTHE-----PXXXXXXXXXXXXXXXRDDHDRDKGRER 2
                   R+H+                     RDDHDRD+GRER
Sbjct: 759  HRDRDRDRSHDLESERGRERERDRREKSGSRERDDHDRDRGRER 802


>ref|XP_007011693.1| RNA recognition motif-containing protein isoform 3 [Theobroma cacao]
            gi|508782056|gb|EOY29312.1| RNA recognition
            motif-containing protein isoform 3 [Theobroma cacao]
          Length = 819

 Score =  301 bits (770), Expect = 5e-79
 Identities = 180/404 (44%), Positives = 225/404 (55%), Gaps = 10/404 (2%)
 Frame = -1

Query: 1183 ATFIRSGNSGVIPFHSICGDAPEIENKTDTEVVAEGSKLNQDTALAMGKGAAVKXXXXXX 1004
            ATF+RSGNSGV PFHSICGDAPEIE  T +E   +G K NQD ALAMGKGAA++      
Sbjct: 417  ATFLRSGNSGVAPFHSICGDAPEIEKNTSSEDAGDGIKGNQDAALAMGKGAAMRELMDLP 476

Query: 1003 XXXXERRCRHNGLSLVGGREMMVARLLSLEEAERQASEIRDDDTKYRQSHSNSGRFSKNE 824
                ERRCRHNGLSLVGGRE+MVARLLSLE+AE+Q S   DDD K  QS S+S R+S  +
Sbjct: 477  LAELERRCRHNGLSLVGGREIMVARLLSLEDAEKQRSYELDDDLKLAQSRSSSCRYSSGQ 536

Query: 823  SSWNANDDACREAYHGIEPIGSSGWNHYGEDAMGQQGKSSLTVAPT-SLTQSDLKASAKK 647
               NA            EP+G SGW HY ++ +  Q K S+ +A T  + Q ++KA  KK
Sbjct: 537  RDINAE----------AEPVGLSGWTHYADNEIHSQRKGSVPLAETLPIPQPEIKAFLKK 586

Query: 646  ENSDPILPVSKWAREDDGSDDEDK----GTAQXXXXXXXXXXXXXXSRXXXXXXXXXVGV 479
            E  DP+LP SKW+REDD SDDE+K    G                 S+           +
Sbjct: 587  EKIDPVLPASKWSREDDDSDDEEKRSTRGLGLSYSSSGSENAGDGTSKADELEFGTDASI 646

Query: 478  SSQPDSSMNEEQRQKLRRMEVNLIEYREYLEERGIRSPEEIERKVAIHRRRLQVDFGISD 299
             +  +S+MNEEQRQKLRR+EV LIEYRE LEERGI+S E+IER+VA HR+RL+ ++G+SD
Sbjct: 647  PAPSESAMNEEQRQKLRRLEVALIEYRESLEERGIKSAEDIERRVAAHRKRLESEYGLSD 706

Query: 298  SNDDVQGNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSTRDRDREKENXXXXXX 119
            S++D+ G                                    S RDRDRE ++      
Sbjct: 707  SSEDISGRKRTSSERRERRDDAHDSSRKRHRSQSRSESPPRKSSNRDRDRENDSVNDREK 766

Query: 118  XXXXXXXRTHE-----PXXXXXXXXXXXXXXXRDDHDRDKGRER 2
                   R+H+                     RDDHDRD+GRER
Sbjct: 767  HRDRDRDRSHDLESERGRERERDRREKSGSRERDDHDRDRGRER 810


>ref|XP_007011691.1| RNA recognition motif-containing protein isoform 1 [Theobroma cacao]
            gi|508782054|gb|EOY29310.1| RNA recognition
            motif-containing protein isoform 1 [Theobroma cacao]
          Length = 985

 Score =  301 bits (770), Expect = 5e-79
 Identities = 180/404 (44%), Positives = 225/404 (55%), Gaps = 10/404 (2%)
 Frame = -1

Query: 1183 ATFIRSGNSGVIPFHSICGDAPEIENKTDTEVVAEGSKLNQDTALAMGKGAAVKXXXXXX 1004
            ATF+RSGNSGV PFHSICGDAPEIE  T +E   +G K NQD ALAMGKGAA++      
Sbjct: 583  ATFLRSGNSGVAPFHSICGDAPEIEKNTSSEDAGDGIKGNQDAALAMGKGAAMRELMDLP 642

Query: 1003 XXXXERRCRHNGLSLVGGREMMVARLLSLEEAERQASEIRDDDTKYRQSHSNSGRFSKNE 824
                ERRCRHNGLSLVGGRE+MVARLLSLE+AE+Q S   DDD K  QS S+S R+S  +
Sbjct: 643  LAELERRCRHNGLSLVGGREIMVARLLSLEDAEKQRSYELDDDLKLAQSRSSSCRYSSGQ 702

Query: 823  SSWNANDDACREAYHGIEPIGSSGWNHYGEDAMGQQGKSSLTVAPT-SLTQSDLKASAKK 647
               NA            EP+G SGW HY ++ +  Q K S+ +A T  + Q ++KA  KK
Sbjct: 703  RDINAE----------AEPVGLSGWTHYADNEIHSQRKGSVPLAETLPIPQPEIKAFLKK 752

Query: 646  ENSDPILPVSKWAREDDGSDDEDK----GTAQXXXXXXXXXXXXXXSRXXXXXXXXXVGV 479
            E  DP+LP SKW+REDD SDDE+K    G                 S+           +
Sbjct: 753  EKIDPVLPASKWSREDDDSDDEEKRSTRGLGLSYSSSGSENAGDGTSKADELEFGTDASI 812

Query: 478  SSQPDSSMNEEQRQKLRRMEVNLIEYREYLEERGIRSPEEIERKVAIHRRRLQVDFGISD 299
             +  +S+MNEEQRQKLRR+EV LIEYRE LEERGI+S E+IER+VA HR+RL+ ++G+SD
Sbjct: 813  PAPSESAMNEEQRQKLRRLEVALIEYRESLEERGIKSAEDIERRVAAHRKRLESEYGLSD 872

Query: 298  SNDDVQGNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSTRDRDREKENXXXXXX 119
            S++D+ G                                    S RDRDRE ++      
Sbjct: 873  SSEDISGRKRTSSERRERRDDAHDSSRKRHRSQSRSESPPRKSSNRDRDRENDSVNDREK 932

Query: 118  XXXXXXXRTHE-----PXXXXXXXXXXXXXXXRDDHDRDKGRER 2
                   R+H+                     RDDHDRD+GRER
Sbjct: 933  HRDRDRDRSHDLESERGRERERDRREKSGSRERDDHDRDRGRER 976


>ref|XP_002515412.1| RNA binding protein, putative [Ricinus communis]
            gi|223545356|gb|EEF46861.1| RNA binding protein, putative
            [Ricinus communis]
          Length = 979

 Score =  299 bits (766), Expect = 1e-78
 Identities = 182/400 (45%), Positives = 219/400 (54%), Gaps = 6/400 (1%)
 Frame = -1

Query: 1183 ATFIRSGNSGVIPFHSICGDAPEIENKTDTEVVAEGSKLNQDTALAMGKGAAVKXXXXXX 1004
            ATF+RS  SGVIPFHSICGDAP IE K  +E   +G K +QD ALAMGKGAA+K      
Sbjct: 581  ATFLRSSTSGVIPFHSICGDAPAIEKKVTSEDTGDGGKTSQDAALAMGKGAAMKELLSLP 640

Query: 1003 XXXXERRCRHNGLSLVGGREMMVARLLSLEEAERQASEIRDDDTKYRQSHSNSGRFSKNE 824
                ERRCRHNGLSLVGGREMMVARLLSLEEAE+Q     DD+ K  QSH +S +FS   
Sbjct: 641  LAELERRCRHNGLSLVGGREMMVARLLSLEEAEKQRGYELDDNLKVSQSHLSSSKFSSGR 700

Query: 823  SSWNANDDACREAYHGIEPIGSSGWNHYGEDAMGQQGKSSLTVAPTSLTQSDLKASAKKE 644
                      RE    +EP+  S WN YGED +  Q ++S ++A   + Q++LKA  KKE
Sbjct: 701  ----------RETNVELEPV--SEWNVYGEDDVQSQSRASASLATFPIPQAELKAFTKKE 748

Query: 643  NSDPILPVSKWAREDDGSDDEDKGTAQ-----XXXXXXXXXXXXXXSRXXXXXXXXXVGV 479
             +DP+LP SKWAR+DD SDDE K +++                                +
Sbjct: 749  KNDPVLPASKWARDDDDSDDEQKRSSRGLGLSYSSSGSENAGDGLGKADDEMEFATDGSI 808

Query: 478  SSQPDSSMNEEQRQKLRRMEVNLIEYREYLEERGIRSPEEIERKVAIHRRRLQVDFGISD 299
            S QPDS MNEEQRQKLRR+EV LIEYRE LEERG++S EEIERKVA HR+RLQ D+G+ D
Sbjct: 809  SVQPDSGMNEEQRQKLRRLEVALIEYRESLEERGMKSAEEIERKVASHRKRLQSDYGLLD 868

Query: 298  SNDDVQGN-XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSTRDRDREKENXXXXX 122
            S+ D  GN                                    STRDR+RE+E      
Sbjct: 869  SSQDTPGNSKRASSERRDRRDDSRESSRKRHRSESSSRSPQRKTSTRDRERERERENDSD 928

Query: 121  XXXXXXXXRTHEPXXXXXXXXXXXXXXXRDDHDRDKGRER 2
                       E                RDDHDRD+GRE+
Sbjct: 929  RDRERHRAHDLENERWERDHHEKSGSRERDDHDRDRGREK 968


>ref|XP_006858350.1| hypothetical protein AMTR_s00064p00173090 [Amborella trichopoda]
            gi|548862457|gb|ERN19817.1| hypothetical protein
            AMTR_s00064p00173090 [Amborella trichopoda]
          Length = 1011

 Score =  298 bits (764), Expect = 2e-78
 Identities = 185/399 (46%), Positives = 220/399 (55%), Gaps = 5/399 (1%)
 Frame = -1

Query: 1183 ATFIRSGNSGVIPFHSICGDAPEIENKTDTEVVAEGSKLNQDTALAMGKGAAVKXXXXXX 1004
            ATFIRS NSGVIPFHSICGD PE+ENKT +    EG+K+NQD ALAMGKGAAVK      
Sbjct: 615  ATFIRSSNSGVIPFHSICGDLPEMENKTTSTDSGEGAKVNQDAALAMGKGAAVKELLNLP 674

Query: 1003 XXXXERRCRHNGLSLVGGREMMVARLLSLEEAERQASEIRDDDTKYRQSHSNSGRFSKNE 824
                ERRCRHNGLSL GGREMMVARLLSLEEAE+Q S  RDDD +Y Q      R+S+ E
Sbjct: 675  LTELERRCRHNGLSLCGGREMMVARLLSLEEAEKQKSHDRDDDLRYGQ------RYSREE 728

Query: 823  SSWNANDDACREAYHGIEPIGSSGWNHYGEDAMGQQGKS-SLTVAPT-SLTQSDLKASA- 653
            S+WN  D   +E   G EP     W+HYGE+    Q K+ S ++ PT  + Q +LKA A 
Sbjct: 729  STWNVCDAGQKETNSGAEP-----WSHYGEEVFRSQSKAPSSSMTPTLPIPQPELKAFAI 783

Query: 652  KKENSDPILPVSKWAREDDGSDDED--KGTAQXXXXXXXXXXXXXXSRXXXXXXXXXVGV 479
            KK  SDP+LP+SKWAREDD SDD++  KG                  +           +
Sbjct: 784  KKGKSDPVLPISKWAREDDASDDDEDKKGLGLGYSSSGSEDGGDGPRKAGDPEVSGDASL 843

Query: 478  SSQPDSSMNEEQRQKLRRMEVNLIEYREYLEERGIRSPEEIERKVAIHRRRLQVDFGISD 299
             S  DS M+EE RQKLR +EV ++EYRE LEERGIR+PEEIERKVA HRRRLQ +FG+ D
Sbjct: 844  PSYADSLMSEEYRQKLRSLEVAVMEYRESLEERGIRNPEEIERKVAAHRRRLQSEFGLLD 903

Query: 298  SNDDVQGNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSTRDRDREKENXXXXXX 119
            S  D  GN                                    +  R+RE+EN      
Sbjct: 904  SFGDASGNSKHFSRSSERSSLERRERRDDRKRHRSQSRSPPQRKSSSRERERENEADRDR 963

Query: 118  XXXXXXXRTHEPXXXXXXXXXXXXXXXRDDHDRDKGRER 2
                      E                R+D DRDKGR+R
Sbjct: 964  DRERH----RERDRGSHDERERNESREREDFDRDKGRDR 998


>ref|XP_006483724.1| PREDICTED: LOW QUALITY PROTEIN: U2 snRNP-associated SURP
            motif-containing protein-like [Citrus sinensis]
          Length = 1017

 Score =  296 bits (757), Expect = 2e-77
 Identities = 185/401 (46%), Positives = 223/401 (55%), Gaps = 7/401 (1%)
 Frame = -1

Query: 1183 ATFIRSGNSGVIPFHSICGDAPEIENKTDTEVVAEGSKLNQDTALAMGKGAAVKXXXXXX 1004
            ATF+RSGNSGV PFHSICGDAPEI+ K ++E   + SK NQDTALAMGKGAA+K      
Sbjct: 626  ATFLRSGNSGVTPFHSICGDAPEIDKKNNSEDTCDLSKTNQDTALAMGKGAAIKELMNLP 685

Query: 1003 XXXXERRCRHNGLSLVGGREMMVARLLSLEEAERQASEIRDDDTKYRQSHSNSGRFSKNE 824
                ERRCRHNGLSLVGGREMMVARLLSLE+AE+Q     DDD K   S S+SGR+S+  
Sbjct: 686  LSELERRCRHNGLSLVGGREMMVARLLSLEDAEKQRGYELDDDLKSAHSQSSSGRYSR-- 743

Query: 823  SSWNANDDACREAYHGIEPIGSSGWNHYGEDAMGQQGKSSLTVAPTSLT--QSDLKASAK 650
              W       +E     E +G SGWN Y ED    Q   S+ +  T LT  Q ++KA  K
Sbjct: 744  -GW-------KETNMEAESMGLSGWNGYEEDEKLSQAVGSVPLG-TMLTTPQPEIKAFTK 794

Query: 649  KENSDPILPVSKWAREDDGSDDEDK----GTAQXXXXXXXXXXXXXXSRXXXXXXXXXVG 482
            KE +DP+LP SKWA EDD SDDE K    G                 S+           
Sbjct: 795  KEKNDPVLPASKWALEDDESDDEQKRSSRGLGLSYSSSGSENAGDGPSKADDVDFTIDAS 854

Query: 481  VSSQPDSSMNEEQRQKLRRMEVNLIEYREYLEERGIRSPEEIERKVAIHRRRLQVDFGIS 302
            +  QPDS MNEEQRQKLRR+EV+LIEYRE LEERGI+S EEIE+KVAIHR+RL+ ++G++
Sbjct: 855  IPVQPDSGMNEEQRQKLRRLEVSLIEYRESLEERGIKSSEEIEKKVAIHRKRLESEYGLA 914

Query: 301  DSNDDVQGNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSTRDRDREKENXXXXX 122
            D N+DV GN                                    +  RDRE+E+     
Sbjct: 915  DPNEDVSGN-------KRRDRRDEILDSRKRHRSQSQSESPPPRKSSIRDRERESDLDRD 967

Query: 121  XXXXXXXXRTHE-PXXXXXXXXXXXXXXXRDDHDRDKGRER 2
                    R H+                 RDDHDRD+GR+R
Sbjct: 968  RERHRDRDRAHDFESERGRERREKSGSRERDDHDRDRGRDR 1008


>ref|XP_007156303.1| hypothetical protein PHAVU_003G2751000g, partial [Phaseolus vulgaris]
            gi|593786527|ref|XP_007156304.1| hypothetical protein
            PHAVU_003G2751000g, partial [Phaseolus vulgaris]
            gi|561029657|gb|ESW28297.1| hypothetical protein
            PHAVU_003G2751000g, partial [Phaseolus vulgaris]
            gi|561029658|gb|ESW28298.1| hypothetical protein
            PHAVU_003G2751000g, partial [Phaseolus vulgaris]
          Length = 813

 Score =  296 bits (757), Expect = 2e-77
 Identities = 161/308 (52%), Positives = 196/308 (63%), Gaps = 5/308 (1%)
 Frame = -1

Query: 1183 ATFIRSGNSGVIPFHSICGDAPEIENKTDTEVVAEGSKLNQDTALAMGKGAAVKXXXXXX 1004
            ATF+R GNSGVIPFHSICGDAPEIE KT +E +  G K NQD ALAMG+GAA+K      
Sbjct: 425  ATFLRPGNSGVIPFHSICGDAPEIEQKTTSEDIVVGGKTNQDAALAMGRGAAMKELMSLP 484

Query: 1003 XXXXERRCRHNGLSLVGGREMMVARLLSLEEAERQASEIRDDDTKYRQSHSNSGRFSKNE 824
                ERRCRHNGLSLVGGREMMVARLLSLEEAE+Q     DD+ KY  +   SG++S N 
Sbjct: 485  LAELERRCRHNGLSLVGGREMMVARLLSLEEAEKQRGYELDDELKYAHNQGTSGKYSSNL 544

Query: 823  SSWNANDDACREAYHGIEPIGSSGWNHYGEDAMGQQGKSSLTVAPT-SLTQSDLKASAKK 647
               +A            EP+G S WN YG++ +  Q +SS+++A T  + Q +LKA  KK
Sbjct: 545  QETSAES----------EPVGLSAWNQYGDEDLQSQSRSSISLASTLPIPQPELKAFTKK 594

Query: 646  ENSDPILPVSKWAREDDGSDDED----KGTAQXXXXXXXXXXXXXXSRXXXXXXXXXVGV 479
            E SDP+LP SKWAREDD SDDE     K                   +            
Sbjct: 595  EKSDPVLPASKWAREDDESDDEQRKGGKNLGLSYSSSGSENVDDGPIKADELESAAGTSF 654

Query: 478  SSQPDSSMNEEQRQKLRRMEVNLIEYREYLEERGIRSPEEIERKVAIHRRRLQVDFGISD 299
             +  DS MNEEQRQKLRR+EV LIEYRE LEERGI++ EEI++KV  HR+RLQ ++G+SD
Sbjct: 655  PAHTDSGMNEEQRQKLRRLEVALIEYRESLEERGIKNLEEIDKKVESHRKRLQAEYGLSD 714

Query: 298  SNDDVQGN 275
            S +D +GN
Sbjct: 715  SGEDGKGN 722


>ref|XP_006450262.1| hypothetical protein CICLE_v10007357mg [Citrus clementina]
            gi|567916514|ref|XP_006450263.1| hypothetical protein
            CICLE_v10007357mg [Citrus clementina]
            gi|557553488|gb|ESR63502.1| hypothetical protein
            CICLE_v10007357mg [Citrus clementina]
            gi|557553489|gb|ESR63503.1| hypothetical protein
            CICLE_v10007357mg [Citrus clementina]
          Length = 973

 Score =  296 bits (757), Expect = 2e-77
 Identities = 185/401 (46%), Positives = 223/401 (55%), Gaps = 7/401 (1%)
 Frame = -1

Query: 1183 ATFIRSGNSGVIPFHSICGDAPEIENKTDTEVVAEGSKLNQDTALAMGKGAAVKXXXXXX 1004
            ATF+RSGNSGV PFHSICGDAPEI+ K ++E   + SK NQDTALAMGKGAA+K      
Sbjct: 582  ATFLRSGNSGVTPFHSICGDAPEIDKKNNSEDTCDLSKTNQDTALAMGKGAAIKELMNLP 641

Query: 1003 XXXXERRCRHNGLSLVGGREMMVARLLSLEEAERQASEIRDDDTKYRQSHSNSGRFSKNE 824
                ERRCRHNGLSLVGGREMMVARLLSLE+AE+Q     DDD K   S S+SGR+S+  
Sbjct: 642  LSELERRCRHNGLSLVGGREMMVARLLSLEDAEKQRGYELDDDLKSAHSQSSSGRYSR-- 699

Query: 823  SSWNANDDACREAYHGIEPIGSSGWNHYGEDAMGQQGKSSLTVAPTSLT--QSDLKASAK 650
              W       +E     E +G SGWN Y ED    Q   S+ +  T LT  Q ++KA  K
Sbjct: 700  -GW-------KETNMEAESMGLSGWNGYEEDEKLSQAVGSVPLG-TMLTTPQPEIKAFTK 750

Query: 649  KENSDPILPVSKWAREDDGSDDEDK----GTAQXXXXXXXXXXXXXXSRXXXXXXXXXVG 482
            KE +DP+LP SKWA EDD SDDE K    G                 S+           
Sbjct: 751  KEKNDPVLPASKWALEDDESDDEQKRSSRGLGLSYSSSGSENAGDGPSKADDVDFTIDAS 810

Query: 481  VSSQPDSSMNEEQRQKLRRMEVNLIEYREYLEERGIRSPEEIERKVAIHRRRLQVDFGIS 302
            +  QPDS MNEEQRQKLRR+EV+LIEYRE LEERGI+S EEIE+KVAIHR+RL+ ++G++
Sbjct: 811  IPVQPDSGMNEEQRQKLRRLEVSLIEYRESLEERGIKSSEEIEKKVAIHRKRLESEYGLA 870

Query: 301  DSNDDVQGNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSTRDRDREKENXXXXX 122
            D N+DV GN                                    +  RDRE+E+     
Sbjct: 871  DPNEDVSGN-------KRRDRRDEILDSRKRHRSQSQSESPPPRKSSIRDRERESDLDRD 923

Query: 121  XXXXXXXXRTHE-PXXXXXXXXXXXXXXXRDDHDRDKGRER 2
                    R H+                 RDDHDRD+GR+R
Sbjct: 924  RERHRDRDRAHDFESERGRERREKSGSRERDDHDRDRGRDR 964


>ref|XP_004509625.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like
            isoform X4 [Cicer arietinum]
          Length = 851

 Score =  292 bits (748), Expect = 2e-76
 Identities = 162/308 (52%), Positives = 200/308 (64%), Gaps = 5/308 (1%)
 Frame = -1

Query: 1183 ATFIRSGNSGVIPFHSICGDAPEIENKTDTEVVAEGSKLNQDTALAMGKGAAVKXXXXXX 1004
            ATF+R GNSGVIPFHSICGDAPEIE K  +E    G K +QD ALAMG+GAA +      
Sbjct: 456  ATFLRPGNSGVIPFHSICGDAPEIEQKMTSEDAVVGGKTDQDAALAMGRGAATQELMSLP 515

Query: 1003 XXXXERRCRHNGLSLVGGREMMVARLLSLEEAERQASEIRDDDTKYRQSHSNSGRFSKNE 824
                ERRCRHNGLSLVGGREMMVARLLSLEEAE+Q     DD+ KY  + ++SG++S + 
Sbjct: 516  LAELERRCRHNGLSLVGGREMMVARLLSLEEAEKQRGFELDDELKYPLNQASSGKYSSSR 575

Query: 823  SSWNANDDACREAYHGIEPIGSSGWNHYGEDAMGQQGKSSLTVAPT-SLTQSDLKASAKK 647
                      RE     EP+GSSGWNHY +D +  QGK S+ +APT  + Q +LKA  +K
Sbjct: 576  ----------RETSAEPEPMGSSGWNHYEDDDVQLQGKGSVPLAPTLPIPQPELKAFTRK 625

Query: 646  ENSDPILPVSKWAREDDGSDDED----KGTAQXXXXXXXXXXXXXXSRXXXXXXXXXVGV 479
            E SD +LP SKWAREDD SDDE     K                   +            
Sbjct: 626  EKSDIVLPASKWAREDDESDDEQTKGGKNLGLSYSSSGSENVGDGLIKADESEAAADSSF 685

Query: 478  SSQPDSSMNEEQRQKLRRMEVNLIEYREYLEERGIRSPEEIERKVAIHRRRLQVDFGISD 299
            S+  DS +NEEQRQKLRR+EV LIEYRE LEERGI++ EEIE+KV +HR+RLQV++G+S+
Sbjct: 686  SAHADSGLNEEQRQKLRRLEVALIEYRESLEERGIKNLEEIEKKVLMHRKRLQVEYGLSE 745

Query: 298  SNDDVQGN 275
            S++D QG+
Sbjct: 746  SSEDGQGS 753


>ref|XP_004509622.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like
            isoform X1 [Cicer arietinum]
            gi|502154215|ref|XP_004509623.1| PREDICTED: U2
            snRNP-associated SURP motif-containing protein-like
            isoform X2 [Cicer arietinum]
            gi|502154218|ref|XP_004509624.1| PREDICTED: U2
            snRNP-associated SURP motif-containing protein-like
            isoform X3 [Cicer arietinum]
          Length = 977

 Score =  292 bits (748), Expect = 2e-76
 Identities = 162/308 (52%), Positives = 200/308 (64%), Gaps = 5/308 (1%)
 Frame = -1

Query: 1183 ATFIRSGNSGVIPFHSICGDAPEIENKTDTEVVAEGSKLNQDTALAMGKGAAVKXXXXXX 1004
            ATF+R GNSGVIPFHSICGDAPEIE K  +E    G K +QD ALAMG+GAA +      
Sbjct: 582  ATFLRPGNSGVIPFHSICGDAPEIEQKMTSEDAVVGGKTDQDAALAMGRGAATQELMSLP 641

Query: 1003 XXXXERRCRHNGLSLVGGREMMVARLLSLEEAERQASEIRDDDTKYRQSHSNSGRFSKNE 824
                ERRCRHNGLSLVGGREMMVARLLSLEEAE+Q     DD+ KY  + ++SG++S + 
Sbjct: 642  LAELERRCRHNGLSLVGGREMMVARLLSLEEAEKQRGFELDDELKYPLNQASSGKYSSSR 701

Query: 823  SSWNANDDACREAYHGIEPIGSSGWNHYGEDAMGQQGKSSLTVAPT-SLTQSDLKASAKK 647
                      RE     EP+GSSGWNHY +D +  QGK S+ +APT  + Q +LKA  +K
Sbjct: 702  ----------RETSAEPEPMGSSGWNHYEDDDVQLQGKGSVPLAPTLPIPQPELKAFTRK 751

Query: 646  ENSDPILPVSKWAREDDGSDDED----KGTAQXXXXXXXXXXXXXXSRXXXXXXXXXVGV 479
            E SD +LP SKWAREDD SDDE     K                   +            
Sbjct: 752  EKSDIVLPASKWAREDDESDDEQTKGGKNLGLSYSSSGSENVGDGLIKADESEAAADSSF 811

Query: 478  SSQPDSSMNEEQRQKLRRMEVNLIEYREYLEERGIRSPEEIERKVAIHRRRLQVDFGISD 299
            S+  DS +NEEQRQKLRR+EV LIEYRE LEERGI++ EEIE+KV +HR+RLQV++G+S+
Sbjct: 812  SAHADSGLNEEQRQKLRRLEVALIEYRESLEERGIKNLEEIEKKVLMHRKRLQVEYGLSE 871

Query: 298  SNDDVQGN 275
            S++D QG+
Sbjct: 872  SSEDGQGS 879


>ref|XP_007225360.1| hypothetical protein PRUPE_ppa000894mg [Prunus persica]
            gi|462422296|gb|EMJ26559.1| hypothetical protein
            PRUPE_ppa000894mg [Prunus persica]
          Length = 968

 Score =  292 bits (747), Expect = 2e-76
 Identities = 182/401 (45%), Positives = 221/401 (55%), Gaps = 7/401 (1%)
 Frame = -1

Query: 1183 ATFIRSGNSGVIPFHSICGDAPEIENKTDTEVVAEGSKLNQDTALAMGKGAAVKXXXXXX 1004
            ATF+RSGNSGV+PFHSICGDAPEI+ K  +E   +  K NQD ALAMGKGAA++      
Sbjct: 583  ATFLRSGNSGVVPFHSICGDAPEIDKKITSEDTGDACKTNQDAALAMGKGAAMRELLSLP 642

Query: 1003 XXXXERRCRHNGLSLVGGREMMVARLLSLEEAERQASEIRDDDTKYRQSHSNSGRFSKNE 824
                ERRCRHNGLSLVGGRE MVARLLSLEEAE+Q     DDD KY QSHS+S R+S + 
Sbjct: 643  LAELERRCRHNGLSLVGGRETMVARLLSLEEAEKQRGYELDDDLKYAQSHSSSARYSSSR 702

Query: 823  SSWNANDDACREAYHGIEPIGSSGWNHYGEDAMG--QQGKSSLTVAPT-SLTQSDLKASA 653
               N            IEP           D+MG   QGK SL +  T  + Q +LKA  
Sbjct: 703  REMN------------IEP-----------DSMGISAQGKGSLPLVQTLPIPQPELKALT 739

Query: 652  KKENSDPILPVSKWAREDDGSDDEDKGTAQ----XXXXXXXXXXXXXXSRXXXXXXXXXV 485
            KKE SDP+LP SKWAREDD SDDE K +A+                  S+          
Sbjct: 740  KKEKSDPVLPASKWAREDDDSDDEQKRSARDLGLSYSSSGSENAGDGPSKADEMEVATDA 799

Query: 484  GVSSQPDSSMNEEQRQKLRRMEVNLIEYREYLEERGIRSPEEIERKVAIHRRRLQVDFGI 305
             + +QPDS ++EEQRQKLRR+EV LIEYRE LEERGI++PEEIERKVAIHR+RL+ ++G+
Sbjct: 800  SIPAQPDSGISEEQRQKLRRLEVALIEYRESLEERGIKNPEEIERKVAIHRKRLESEYGL 859

Query: 304  SDSNDDVQGNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSTRDRDREKENXXXX 125
            SDS++D  G+                                   S RDR+RE +     
Sbjct: 860  SDSSEDACGS-KRTSSERKDRRDDDNTSRKRHRSGSQSDSPLQRSSNRDREREHDLDRDR 918

Query: 124  XXXXXXXXXRTHEPXXXXXXXXXXXXXXXRDDHDRDKGRER 2
                     R H+                 DDH+RD+GRER
Sbjct: 919  ERQRGSDRDRAHDFEGDRVRDREKSGSREGDDHERDRGRER 959


>emb|CBI21155.3| unnamed protein product [Vitis vinifera]
          Length = 941

 Score =  287 bits (735), Expect = 5e-75
 Identities = 168/309 (54%), Positives = 191/309 (61%), Gaps = 6/309 (1%)
 Frame = -1

Query: 1183 ATFIRSGNSGVIPFHSICGDAPEIENKTDTEVVAEGSKLNQDTALAMGKGAAVKXXXXXX 1004
            ATF+RSGNSGV PFHSICGDAPEIE KT +E   EG K NQD ALAMGKGAA+K      
Sbjct: 583  ATFLRSGNSGVTPFHSICGDAPEIEKKTSSEDTGEGGKSNQDAALAMGKGAAMKELLSLP 642

Query: 1003 XXXXERRCRHNGLSLVGGREMMVARLLSLEEAERQASEIRDDDTKYRQSHSNSGRFSKNE 824
                ERRCRHNGLSLVGGRE+MVARLLSLEEAE+Q     DDD KY QSHSNSGR+    
Sbjct: 643  IAELERRCRHNGLSLVGGREIMVARLLSLEEAEKQRGYDLDDDLKYAQSHSNSGRYPNEI 702

Query: 823  SSWNANDDACREAYHGIEPIGSSGWNHYGEDAMGQQGKSSLTVAPT-SLTQSDLKASAKK 647
             S                                 QGK S+ +APT  + Q +LKA   K
Sbjct: 703  QS---------------------------------QGKGSVPLAPTIPIPQPELKAFTNK 729

Query: 646  ENSDPILPVSKWAREDDGSDDEDKGTAQ----XXXXXXXXXXXXXXSRXXXXXXXXXVGV 479
              +DP+LP SKWAREDD SDDE K +A+                  S+           +
Sbjct: 730  GKTDPVLPASKWAREDDDSDDEQKRSARGLGLSYSSSGSENAGDGPSKADEMEFATESSI 789

Query: 478  SSQPDSS-MNEEQRQKLRRMEVNLIEYREYLEERGIRSPEEIERKVAIHRRRLQVDFGIS 302
             SQPDS  MNEE RQKLRR+EV LIEYRE LEERGI+S EEIERKVAIHR+RLQ ++G+S
Sbjct: 790  PSQPDSGMMNEEHRQKLRRLEVALIEYRESLEERGIKSSEEIERKVAIHRKRLQSEYGLS 849

Query: 301  DSNDDVQGN 275
            DSN+DV  N
Sbjct: 850  DSNEDVSWN 858


>ref|XP_003628951.1| U2-associated protein SR140 [Medicago truncatula]
            gi|355522973|gb|AET03427.1| U2-associated protein SR140
            [Medicago truncatula]
          Length = 1139

 Score =  285 bits (730), Expect = 2e-74
 Identities = 165/308 (53%), Positives = 196/308 (63%), Gaps = 5/308 (1%)
 Frame = -1

Query: 1183 ATFIRSGNSGVIPFHSICGDAPEIENKTDTEVVAEGSKLNQDTALAMGKGAAVKXXXXXX 1004
            ATF+R GNSGVIPFHSICGDAP+IE K  ++    G K +QD ALAMG+GAA K      
Sbjct: 649  ATFLRPGNSGVIPFHSICGDAPDIEQKITSDDAIVGGKTDQDAALAMGRGAATKELMSLP 708

Query: 1003 XXXXERRCRHNGLSLVGGREMMVARLLSLEEAERQASEIRDDDTKYRQSHSNSGRFSKNE 824
                ERRCRHNGLSLVGGREMMVARLLSLEEAE+Q     DD  KY  + ++SG+     
Sbjct: 709  LAELERRCRHNGLSLVGGREMMVARLLSLEEAEKQRGYELDDGLKYPGNQTSSGK----- 763

Query: 823  SSWNANDDACREAYHGIEPIGSSGWNHYGEDAMGQQGKSSLTVAPT-SLTQSDLKASAKK 647
                 N    RE     EP+G SG NHYG++ +  QGK    +APT  + Q +LKA AKK
Sbjct: 764  -----NSSGQRETSADPEPMGLSGLNHYGDEDLQLQGKGYAPLAPTLPIPQPELKAFAKK 818

Query: 646  ENSDPILPVSKWAREDDGSDDED-KGTAQXXXXXXXXXXXXXXSRXXXXXXXXXVGVSSQ 470
            E +D +LP SKWAREDD SDDE  KG                               SS 
Sbjct: 819  EKNDLVLPASKWAREDDESDDEQGKGGKNLGLSYSSSGSENVGDDLIKADESEAAADSSF 878

Query: 469  P---DSSMNEEQRQKLRRMEVNLIEYREYLEERGIRSPEEIERKVAIHRRRLQVDFGISD 299
            P   DS MNEEQRQKLRR+EV LIEYRE LEERGI++ EEIE+KV +HR+RLQV++G+SD
Sbjct: 879  PAHADSGMNEEQRQKLRRLEVALIEYRESLEERGIKNLEEIEKKVLMHRKRLQVEYGLSD 938

Query: 298  SNDDVQGN 275
            SN+D QG+
Sbjct: 939  SNEDGQGS 946


>ref|XP_006585862.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like
            isoform X3 [Glycine max] gi|571473238|ref|XP_006585863.1|
            PREDICTED: U2 snRNP-associated SURP motif-containing
            protein-like isoform X4 [Glycine max]
          Length = 874

 Score =  282 bits (721), Expect = 2e-73
 Identities = 155/308 (50%), Positives = 198/308 (64%), Gaps = 5/308 (1%)
 Frame = -1

Query: 1183 ATFIRSGNSGVIPFHSICGDAPEIENKTDTEVVAEGSKLNQDTALAMGKGAAVKXXXXXX 1004
            ATF+R GNSGVIPFHSICGDAPEIE  T ++ +  G K NQD ALAMG+GAA+K      
Sbjct: 487  ATFLRPGNSGVIPFHSICGDAPEIEQNTTSKDMVVGGKTNQDAALAMGRGAAMKELMSLP 546

Query: 1003 XXXXERRCRHNGLSLVGGREMMVARLLSLEEAERQASEIRDDDTKYRQSHSNSGRFSKNE 824
                ERRCRHNGLSLVGGREMMVARLLSLEEAE+Q     D++ KY  +  +SG++S N+
Sbjct: 547  LAELERRCRHNGLSLVGGREMMVARLLSLEEAEKQRGFELDEELKYAHNQVSSGKYSSNQ 606

Query: 823  SSWNANDDACREAYHGIEPIGSSGWNHYGEDAMGQQGKSSLTVAPT-SLTQSDLKASAKK 647
                      RE     +P+    WNHYG++ +  QG+SS+ ++PT  + Q +LKA  KK
Sbjct: 607  ----------RETSEEPDPV----WNHYGDEDLQSQGRSSVPLSPTLPIAQPELKAFTKK 652

Query: 646  ENSDPILPVSKWAREDDGSDDEDKGTAQXXXXXXXXXXXXXXS----RXXXXXXXXXVGV 479
            E +DP+LP SKWA E D SDDE + + +                   +            
Sbjct: 653  EKNDPVLPASKWAWEGDESDDEQRRSGKNIGLSYSSSGSENVGDGLVKADESESAADTRF 712

Query: 478  SSQPDSSMNEEQRQKLRRMEVNLIEYREYLEERGIRSPEEIERKVAIHRRRLQVDFGISD 299
            S+  DS MNEEQRQKLRR+EV LIEYRE LEERG+++ EEIE+KV  HR+RLQV++G+SD
Sbjct: 713  SAHADSGMNEEQRQKLRRLEVALIEYRESLEERGVKNLEEIEKKVQSHRKRLQVEYGLSD 772

Query: 298  SNDDVQGN 275
            S +D  G+
Sbjct: 773  SGEDGHGH 780

