BLASTX nr result

ID: Akebia27_contig00003048 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00003048
         (1927 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI21155.3| unnamed protein product [Vitis vinifera]              545   e-152
ref|XP_007225360.1| hypothetical protein PRUPE_ppa000894mg [Prun...   540   e-151
emb|CAN79213.1| hypothetical protein VITISV_025939 [Vitis vinifera]   537   e-150
ref|XP_002324341.2| RNA recognition motif-containing family prot...   531   e-148
ref|XP_004291970.1| PREDICTED: U2 snRNP-associated SURP motif-co...   526   e-146
gb|EXC01118.1| U2 snRNP-associated SURP motif-containing protein...   525   e-146
ref|XP_004138695.1| PREDICTED: U2 snRNP-associated SURP motif-co...   522   e-145
ref|XP_007011691.1| RNA recognition motif-containing protein iso...   521   e-145
ref|XP_002308714.1| RNA recognition motif-containing family prot...   518   e-144
ref|XP_006483724.1| PREDICTED: LOW QUALITY PROTEIN: U2 snRNP-ass...   513   e-142
ref|XP_006450262.1| hypothetical protein CICLE_v10007357mg [Citr...   513   e-142
ref|XP_002515412.1| RNA binding protein, putative [Ricinus commu...   511   e-142
ref|XP_007156303.1| hypothetical protein PHAVU_003G2751000g, par...   504   e-140
ref|XP_004509625.1| PREDICTED: U2 snRNP-associated SURP motif-co...   504   e-140
ref|XP_004509622.1| PREDICTED: U2 snRNP-associated SURP motif-co...   504   e-140
ref|XP_006858350.1| hypothetical protein AMTR_s00064p00173090 [A...   502   e-139
ref|XP_006599196.1| PREDICTED: U2 snRNP-associated SURP motif-co...   501   e-139
ref|XP_006585862.1| PREDICTED: U2 snRNP-associated SURP motif-co...   490   e-135
ref|XP_006585860.1| PREDICTED: U2 snRNP-associated SURP motif-co...   490   e-135
dbj|BAD28014.1| putative U2-associated SR140 protein [Oryza sati...   476   e-131

>emb|CBI21155.3| unnamed protein product [Vitis vinifera]
          Length = 941

 Score =  545 bits (1405), Expect = e-152
 Identities = 319/520 (61%), Positives = 365/520 (70%), Gaps = 14/520 (2%)
 Frame = -3

Query: 1925 AQGDTLQRWRTEPFIMITGSGRWMPPPLQSTKSPENKGEFDTTFSVERSRYVELERYVDL 1746
            AQGDTLQRWRTEPFIMITGSGRWMPPPL + +SPE++ E  TTF+  RSR VELER    
Sbjct: 387  AQGDTLQRWRTEPFIMITGSGRWMPPPLPTVRSPEHEKESGTTFAAGRSRRVELER---- 442

Query: 1745 EQYVELEGILTVSQRDEFEDILRTLTLERRHIKEAMGFALDHADAAREIVEVLTESLTLK 1566
                     LT  QRDEFED+LR LTLER  IKEAMGFALD+ADAA EIVEVLTESLTLK
Sbjct: 443  --------TLTDPQRDEFEDMLRALTLERSQIKEAMGFALDNADAAGEIVEVLTESLTLK 494

Query: 1565 ETPISKKVARLMLVSDILHNSRAPIKNAFAYCSEFQSTLPDIMKSFNDLYHRIEGRITAE 1386
            ETPI  KVARLMLVSD+LHNS AP+KNA AY ++F++TLPDIM+SFNDLY  + GRITAE
Sbjct: 495  ETPIPTKVARLMLVSDVLHNSSAPVKNASAYRTKFEATLPDIMESFNDLYRSVTGRITAE 554

Query: 1385 ALKERVLKVLQVWSIWLLFSDAYVNELRATFLRPRIS------SVILDAPEIGNKSSSED 1224
            ALKERV+KVLQVW+ W LFSDAYVN LRATFLR   S      S+  DAPEI  K+SSED
Sbjct: 555  ALKERVMKVLQVWADWFLFSDAYVNGLRATFLRSGNSGVTPFHSICGDAPEIEKKTSSED 614

Query: 1223 MAEGSKITEDTVLSTGNETAM-VLSDLPLAELERCCRHNRISLVGGREMMVARLLNLKEA 1047
              EG K  +D  L+ G   AM  L  LP+AELER CRHN +SLVGGRE+MVARLL+L+EA
Sbjct: 615  TGEGGKSNQDAALAMGKGAAMKELLSLPIAELERRCRHNGLSLVGGREIMVARLLSLEEA 674

Query: 1046 -RQMSYNRDDDKKYGQSHSNSERYSKDESCLNANDGDHRQTIFGMKLESSRSNNYGEDVM 870
             +Q  Y+ DDD KY QSHSNS RY  +             TI   + E     N      
Sbjct: 675  EKQRGYDLDDDLKYAQSHSNSGRYPNEIQSQGKGSVPLAPTIPIPQPELKAFTN------ 728

Query: 869  QLHGQGKPNPVLPISNWAREDDGSDVEDKRSAWDLGLSYSFS----VGYGPIMAHDMKVA 702
                +GK +PVLP S WAREDD SD E KRSA  LGLSYS S     G GP  A +M+ A
Sbjct: 729  ----KGKTDPVLPASKWAREDDDSDDEQKRSARGLGLSYSSSGSENAGDGPSKADEMEFA 784

Query: 701  TDMSVLSQHDSGLA-EEQRQKLRHMEFALIDYREYLEERGIWSYEEIDKKVAIYRRRLHS 525
            T+ S+ SQ DSG+  EE RQKLR +E ALI+YRE LEERGI S EEI++KVAI+R+RL S
Sbjct: 785  TESSIPSQPDSGMMNEEHRQKLRRLEVALIEYRESLEERGIKSSEEIERKVAIHRKRLQS 844

Query: 524  EYGLSDSNQVVLGYDTSYLESY-RRDYSHESSRKRHCSHS 408
            EYGLSDSN+ V     S  E   RRD S E++RKRH S S
Sbjct: 845  EYGLSDSNEDVSWNKRSSAERRDRRDDSRETTRKRHRSRS 884


>ref|XP_007225360.1| hypothetical protein PRUPE_ppa000894mg [Prunus persica]
            gi|462422296|gb|EMJ26559.1| hypothetical protein
            PRUPE_ppa000894mg [Prunus persica]
          Length = 968

 Score =  540 bits (1392), Expect = e-151
 Identities = 314/525 (59%), Positives = 363/525 (69%), Gaps = 19/525 (3%)
 Frame = -3

Query: 1925 AQGDTLQRWRTEPFIMITGSGRWMPPPLQSTKSPENKGEFDTTFSVERSRYVELERYVDL 1746
            AQGDTLQRWRTEPFIMITGSGRW+PPPL + KSPE+  E  TT++  RSR VE ER    
Sbjct: 387  AQGDTLQRWRTEPFIMITGSGRWIPPPLPTVKSPEHGKEAGTTYAAGRSRRVEPER---- 442

Query: 1745 EQYVELEGILTVSQRDEFEDILRTLTLERRHIKEAMGFALDHADAAREIVEVLTESLTLK 1566
                     LT SQRDEFED+LR LTLER  IK+AMGFALD+ADAA EIVEVLTESLTLK
Sbjct: 443  --------TLTDSQRDEFEDMLRALTLERSQIKDAMGFALDNADAAGEIVEVLTESLTLK 494

Query: 1565 ETPISKKVARLMLVSDILHNSRAPIKNAFAYCSEFQSTLPDIMKSFNDLYHRIEGRITAE 1386
            ETPI  KVARLMLVSD+LHNS AP+KNA AY + F++TLPDIM+SFNDLY  I GRITAE
Sbjct: 495  ETPIPTKVARLMLVSDVLHNSSAPVKNASAYRTRFEATLPDIMESFNDLYRSITGRITAE 554

Query: 1385 ALKERVLKVLQVWSIWLLFSDAYVNELRATFLRPRISSVIL------DAPEIGNKSSSED 1224
            ALKERVLKVLQVWS W LFSDAYVN LRATFLR   S V+       DAPEI  K +SED
Sbjct: 555  ALKERVLKVLQVWSDWFLFSDAYVNGLRATFLRSGNSGVVPFHSICGDAPEIDKKITSED 614

Query: 1223 MAEGSKITEDTVLSTGNETAM-VLSDLPLAELERCCRHNRISLVGGREMMVARLLNLKEA 1047
              +  K  +D  L+ G   AM  L  LPLAELER CRHN +SLVGGRE MVARLL+L+EA
Sbjct: 615  TGDACKTNQDAALAMGKGAAMRELLSLPLAELERRCRHNGLSLVGGRETMVARLLSLEEA 674

Query: 1046 -RQMSYNRDDDKKYGQSHSNSERYSKDESCLNANDGDHRQTIFGMKLESSRSNNYGE--- 879
             +Q  Y  DDD KY QSHS+S RYS     +N           G+  +   S    +   
Sbjct: 675  EKQRGYELDDDLKYAQSHSSSARYSSSRREMNIEPDS-----MGISAQGKGSLPLVQTLP 729

Query: 878  ----DVMQLHGQGKPNPVLPISNWAREDDGSDVEDKRSAWDLGLSYSFS----VGYGPIM 723
                ++  L  + K +PVLP S WAREDD SD E KRSA DLGLSYS S     G GP  
Sbjct: 730  IPQPELKALTKKEKSDPVLPASKWAREDDDSDDEQKRSARDLGLSYSSSGSENAGDGPSK 789

Query: 722  AHDMKVATDMSVLSQHDSGLAEEQRQKLRHMEFALIDYREYLEERGIWSYEEIDKKVAIY 543
            A +M+VATD S+ +Q DSG++EEQRQKLR +E ALI+YRE LEERGI + EEI++KVAI+
Sbjct: 790  ADEMEVATDASIPAQPDSGISEEQRQKLRRLEVALIEYRESLEERGIKNPEEIERKVAIH 849

Query: 542  RRRLHSEYGLSDSNQVVLGYDTSYLESYRRDYSHESSRKRHCSHS 408
            R+RL SEYGLSDS++   G   +  E   R     +SRKRH S S
Sbjct: 850  RKRLESEYGLSDSSEDACGSKRTSSERKDRRDDDNTSRKRHRSGS 894


>emb|CAN79213.1| hypothetical protein VITISV_025939 [Vitis vinifera]
          Length = 1384

 Score =  537 bits (1384), Expect = e-150
 Identities = 313/516 (60%), Positives = 359/516 (69%), Gaps = 38/516 (7%)
 Frame = -3

Query: 1925 AQGDTLQRWRTEPFIMITGSGRWMPPPLQSTKSPENKGEFDTTFSVERSRYVELERYVDL 1746
            AQGDTLQRWRTEPFIMITGSGRWMPPPL + +SPE++ E  TTF+  RSR VELER    
Sbjct: 547  AQGDTLQRWRTEPFIMITGSGRWMPPPLPTVRSPEHEKESGTTFAAGRSRRVELER---- 602

Query: 1745 EQYVELEGILTVSQRDEFEDILRTLTLERRHIKEAMGFALDHADAAREIVEVLTESLTLK 1566
                     LT  QRDEFED+LR LTLER  IKEAMGFALD+ADAA EIVEVLTESLTLK
Sbjct: 603  --------TLTDPQRDEFEDMLRALTLERSQIKEAMGFALDNADAAGEIVEVLTESLTLK 654

Query: 1565 ETPISKKVARLMLVSDILHNSRAPIKNAFAYCSEFQSTLPDIMKSFNDLYHRIEGRITAE 1386
            ETPI  KVARLMLVSD+LHNS AP+KNA AY ++F++TLPDIM+SFNDLY  + GRITAE
Sbjct: 655  ETPIPTKVARLMLVSDVLHNSSAPVKNASAYRTKFEATLPDIMESFNDLYRSVTGRITAE 714

Query: 1385 ALKERVLKVLQVWSIWLLFSDAYVNELRATFLRPRIS------SVILDAPEIGNKSSSED 1224
            ALKERV+KVLQVW+ W LFSDAYVN LRATFLR   S      S+  DAPEI  K+SSED
Sbjct: 715  ALKERVMKVLQVWADWFLFSDAYVNGLRATFLRSGNSGVTPFHSICGDAPEIEKKTSSED 774

Query: 1223 MAEGSKITEDTVLSTGNETAM-VLSDLPLAELERCCRHNRISLVGGREMMVARLLNLKEA 1047
              EG K  +D  L+ G   AM  L  LP+AELER CRHN +SLVGGRE+MVARLL+L+EA
Sbjct: 775  TGEGGKSNQDAALAMGKGAAMKELLSLPIAELERRCRHNGLSLVGGREIMVARLLSLEEA 834

Query: 1046 -RQMSYNRDDDKKYGQSHSNSERYSKDESCLNANDGDHRQTIFGMKLES---SRSNNYGE 879
             +Q  Y+ DDD KY QSHSNS RY      +            G++ ES   S  N YGE
Sbjct: 835  EKQRGYDLDDDLKYAQSHSNSGRYPSSRKEI------------GVETESVGLSGWNRYGE 882

Query: 878  DVMQLHG----------------------QGKPNPVLPISNWAREDDGSDVEDKRSAWDL 765
            D +Q  G                      +GK +PVLP S WAREDD SD E KRSA  L
Sbjct: 883  DEIQSQGKGSVPLAPTIPIPQPELKAFTNKGKTDPVLPASKWAREDDDSDDEQKRSARGL 942

Query: 764  GLSYSFS----VGYGPIMAHDMKVATDMSVLSQHDSGLA-EEQRQKLRHMEFALIDYREY 600
            GLSYS S     G GP  A +M+ AT+ S+ SQ DSG+  EE RQKLR +E ALI+YRE 
Sbjct: 943  GLSYSSSGSENAGDGPXKADEMEFATESSIPSQPDSGMMNEEHRQKLRRLEVALIEYRES 1002

Query: 599  LEERGIWSYEEIDKKVAIYRRRLHSEYGLSDSNQVV 492
            LEERGI S EEI++KVAI+R+RL SEYGLSDSN+ V
Sbjct: 1003 LEERGIKSSEEIERKVAIHRKRLQSEYGLSDSNEDV 1038


>ref|XP_002324341.2| RNA recognition motif-containing family protein [Populus trichocarpa]
            gi|550317898|gb|EEF02906.2| RNA recognition
            motif-containing family protein [Populus trichocarpa]
          Length = 969

 Score =  531 bits (1367), Expect = e-148
 Identities = 315/541 (58%), Positives = 368/541 (68%), Gaps = 35/541 (6%)
 Frame = -3

Query: 1925 AQGDTLQRWRTEPFIMITGSGRWMPPPLQSTKSPENKGEFDTTFSVERSRYVELERYVDL 1746
            AQGDTLQRWRTEPFIMITGSGRW+PPPL + KSPE++ E  +T++  RSR V+ ER    
Sbjct: 387  AQGDTLQRWRTEPFIMITGSGRWVPPPLPTAKSPEHEKESGSTYAAGRSRRVDSER---- 442

Query: 1745 EQYVELEGILTVSQRDEFEDILRTLTLERRHIKEAMGFALDHADAAREIVEVLTESLTLK 1566
                     LT  QRDEFED+LR LTLER  IK+AMGF+LD+ADAA E+VEVLTESLTLK
Sbjct: 443  --------TLTDPQRDEFEDMLRALTLERSQIKDAMGFSLDNADAAGEVVEVLTESLTLK 494

Query: 1565 ETPISKKVARLMLVSDILHNSRAPIKNAFAYCSEFQSTLPDIMKSFNDLYHRIEGRITAE 1386
            ETPI  KVARLMLVSDILHNS AP+KNA AY ++F++ LPDIM+SFNDLY  I GRITAE
Sbjct: 495  ETPIPTKVARLMLVSDILHNSSAPVKNASAYRTKFEAALPDIMESFNDLYRSITGRITAE 554

Query: 1385 ALKERVLKVLQVWSIWLLFSDAYVNELRATFLRPRISSVIL------DAPEIGNKSSSED 1224
            ALKERVLKVLQVWS W LFSDAYVN LRATFLR   S VI       DAPEI  KSSSED
Sbjct: 555  ALKERVLKVLQVWSDWFLFSDAYVNGLRATFLRSSNSGVIPFHSICGDAPEIEKKSSSED 614

Query: 1223 MAEGSKITEDTVLSTGNETAM-VLSDLPLAELERCCRHNRISLVGGREMMVARLLNLKEA 1047
              EG+KI +D  L+ G   A+  L +LPLAELER CRHN +SLVGGREMMVARLL+L+EA
Sbjct: 615  AVEGAKINQDAALAMGKGAAVKELMNLPLAELERRCRHNGLSLVGGREMMVARLLSLEEA 674

Query: 1046 -RQMSYNRDDDKKYGQSHSNSERYSKDESCLNANDGDHRQTIFGMKLESSRSNNYGEDVM 870
             RQ  Y  DDD K  QS+S+S RYS     +N         +    + S+  N YGED M
Sbjct: 675  ERQRGYELDDDLKIAQSNSSSSRYSSVHREMN---------VEAEPVGSTGWNVYGEDEM 725

Query: 869  QLHGQG----------------------KPNPVLPISNWAREDDGSDVEDKRSAWDLGLS 756
                +G                      K +PVLP S WAR+DD SD E KRSA DLGLS
Sbjct: 726  PSQNKGSVSVASTLLIKQPELKAFAKKEKNDPVLPASKWARDDDESDDEQKRSARDLGLS 785

Query: 755  YSFS----VGYGPIMAHDMKVATDMSVLSQHDSGLAEEQRQKLRHMEFALIDYREYLEER 588
            YS S     G G   A +M+ ATD ++ +Q DSG+ EEQRQKLR +E ALI+YRE LEER
Sbjct: 786  YSSSGSENAGDGQGKADEMEFATDANIPTQPDSGMNEEQRQKLRRLEVALIEYRESLEER 845

Query: 587  GIWSYEEIDKKVAIYRRRLHSEYGLSDSNQVVLGYDTSYLESY-RRDYSHESSRKRHCSH 411
            G+ S  EI+ KVAI+R+ L SEYGLS SN+ V    +   E   RR  +H+SSRKRH + 
Sbjct: 846  GMKSSVEIEGKVAIHRKWLESEYGLSSSNEDVTSKKSISSERRDRRSDNHDSSRKRHRNE 905

Query: 410  S 408
            S
Sbjct: 906  S 906


>ref|XP_004291970.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like
            [Fragaria vesca subsp. vesca]
          Length = 980

 Score =  526 bits (1354), Expect = e-146
 Identities = 311/542 (57%), Positives = 371/542 (68%), Gaps = 36/542 (6%)
 Frame = -3

Query: 1925 AQGDTLQRWRTEPFIMITGSGRWMPPPLQSTKSPENKGEFDTTFSVERSRYVELERYVDL 1746
            AQGDTLQRWRTEPFIMITGSGRW+PP L + +SPE++ E  +T++  RSR VE ER    
Sbjct: 387  AQGDTLQRWRTEPFIMITGSGRWIPPSLPALRSPEHEKESSSTYAAGRSRRVESER---- 442

Query: 1745 EQYVELEGILTVSQRDEFEDILRTLTLERRHIKEAMGFALDHADAAREIVEVLTESLTLK 1566
                     LT  QRDEFED+LR LTLER  IK+AMGFALD+ADAA EIVEVLTESLTLK
Sbjct: 443  --------TLTDPQRDEFEDMLRALTLERSQIKDAMGFALDNADAAGEIVEVLTESLTLK 494

Query: 1565 ETPISKKVARLMLVSDILHNSRAPIKNAFAYCSEFQSTLPDIMKSFNDLYHRIEGRITAE 1386
            ETPI  KVARLMLVSD+LHNS AP+KNA AY ++F++TLPDIM+SFNDLY  I GRITAE
Sbjct: 495  ETPIPTKVARLMLVSDVLHNSSAPVKNASAYRTKFEATLPDIMESFNDLYRGITGRITAE 554

Query: 1385 ALKERVLKVLQVWSIWLLFSDAYVNELRATFLRPRISSVIL------DAPEIGNKSSSED 1224
            ALKERVLKVLQVWS W LFSDAYVN LRATFLR   S V+       DAP+I  K++SED
Sbjct: 555  ALKERVLKVLQVWSDWFLFSDAYVNGLRATFLRSGNSGVVPFHSVCGDAPDIEKKTTSED 614

Query: 1223 MAEGSKITEDTVLSTGNETA-MVLSDLPLAELERCCRHNRISLVGGREMMVARLLNLKEA 1047
             A  +K  +D  L+ G   A   L +LP+AELER CRHN +SLVGGREMMVARLL+L+EA
Sbjct: 615  -AGDAKTNQDAALAMGKGAATRELLNLPMAELERRCRHNGLSLVGGREMMVARLLSLEEA 673

Query: 1046 -RQMSYNRDDDKKYGQSHSNSERYSKDESCLNANDGDHRQTIFGMKLESSRSNNYGEDVM 870
             +Q  Y  DDD KYGQ+HS+S R+S     +N              L  S  N Y ED +
Sbjct: 674  EKQRGYELDDDLKYGQNHSSSGRHSSSRKEMNIEPD---------PLGLSGWNRYVEDEI 724

Query: 869  QLHG----------------------QGKPNPVLPISNWAREDDGSDVEDKRSAWDLGLS 756
            Q  G                      + K +PVLP S WAREDD SD + KRSA  LGLS
Sbjct: 725  QSEGKVSLSKAQTHTSPQPELKPFTTKEKSDPVLPASKWAREDDDSDDDQKRSAKGLGLS 784

Query: 755  YSF---SVGYGPIMAHDMKVATDMSVLSQHDSGLAEEQRQKLRHMEFALIDYREYLEERG 585
            YS    + G GP  A +M+VATD+ + +Q DSGL+EEQRQKLR +E +L++YRE LEERG
Sbjct: 785  YSSGSENAGDGPSKADEMEVATDVRIPAQPDSGLSEEQRQKLRRLEVSLLEYRESLEERG 844

Query: 584  IWSYEEIDKKVAIYRRRLHSEYGLSDSNQVVLGYD---TSYLESYRRDYSHESSRKRHCS 414
            I S EEI++KVAI+R+RL SEYGLSDS++   G     +S  +  R D S ++SRKRH S
Sbjct: 845  IRSPEEIERKVAIHRKRLESEYGLSDSSEDASGRSKRTSSERKDRRDDDSRDASRKRHRS 904

Query: 413  HS 408
             S
Sbjct: 905  GS 906


>gb|EXC01118.1| U2 snRNP-associated SURP motif-containing protein [Morus notabilis]
          Length = 999

 Score =  525 bits (1352), Expect = e-146
 Identities = 313/542 (57%), Positives = 364/542 (67%), Gaps = 36/542 (6%)
 Frame = -3

Query: 1925 AQGDTLQRWRTEPFIMITGSGRWMPPPLQSTKSPENKGEFDTTFSVERSRYVELERYVDL 1746
            AQGDTLQRWRTEPFIMITGSGRW+PP L + KSP+ + E   T++  RSR VE ER    
Sbjct: 405  AQGDTLQRWRTEPFIMITGSGRWIPPSLPTAKSPDLEKESGATYAAGRSRRVEPER---- 460

Query: 1745 EQYVELEGILTVSQRDEFEDILRTLTLERRHIKEAMGFALDHADAAREIVEVLTESLTLK 1566
                     LT SQRDEFED+LR LTLER  IKEAMGFALD+ADAA EIVEVLTESLTLK
Sbjct: 461  --------TLTDSQRDEFEDMLRALTLERSQIKEAMGFALDNADAAGEIVEVLTESLTLK 512

Query: 1565 ETPISKKVARLMLVSDILHNSRAPIKNAFAYCSEFQSTLPDIMKSFNDLYHRIEGRITAE 1386
            ETPI  KVARLMLVSD+LHNS AP+KNA AY ++F+ TLPDIM+SFNDLY  I GRITAE
Sbjct: 513  ETPIPTKVARLMLVSDVLHNSSAPVKNASAYRTKFEGTLPDIMESFNDLYRSITGRITAE 572

Query: 1385 ALKERVLKVLQVWSIWLLFSDAYVNELRATFLR------PRISSVILDAPEIGNKSSSED 1224
            ALKERVLKVLQVW+ W LFSDAYVN LRATFLR          S+  DAPEI    S ED
Sbjct: 573  ALKERVLKVLQVWADWFLFSDAYVNGLRATFLRLGNSGVTPFHSICGDAPEIEKIISFED 632

Query: 1223 MAEGSKITEDTVLSTGNETAM-VLSDLPLAELERCCRHNRISLVGGREMMVARLLNLKEA 1047
              +  K  ED  L+ G   AM  L +LP AELER CRHN +SLVGGREMMVARLL+L+EA
Sbjct: 633  TGDAGKTNEDAALAMGKGAAMQELMNLPFAELERRCRHNGLSLVGGREMMVARLLSLEEA 692

Query: 1046 -RQMSYNRDDDKKYGQSHSNSERYSKDESCLNANDGDHRQT-IFGMKLESSRSNNYGEDV 873
             +Q  Y  D+D KY Q HS+S RYS          G  R+T + G  + SS  N+Y  D 
Sbjct: 693  EKQRGYELDEDLKYAQGHSSSGRYS----------GGRRETNVEGEPMGSSGWNHYAGDE 742

Query: 872  MQLHGQG----------------------KPNPVLPISNWAREDDGSDVEDKRSAWDLGL 759
            +    +G                      K +PVLP S WAREDD SD E KRS+  LGL
Sbjct: 743  IDSQAKGSVPLAQTIPIPQPELKPFVKKEKSDPVLPASKWAREDDDSDDEQKRSSRGLGL 802

Query: 758  SYSFS----VGYGPIMAHDMKVATDMSVLSQHDSGLAEEQRQKLRHMEFALIDYREYLEE 591
             YS S     G GP  A +M+ A D SV+ Q DSG++EEQR+KLR +E ALI+YRE LEE
Sbjct: 803  GYSSSGSENAGDGPSKADEMESAADSSVV-QPDSGMSEEQRKKLRRLEAALIEYRESLEE 861

Query: 590  RGIWSYEEIDKKVAIYRRRLHSEYGLSDSNQVVLGYDTSYLESY-RRDYSHESSRKRHCS 414
            RGI S EEI++KV ++R+RL +EYGLS+SN+   G   + LE   RRD SHE+SRKRH S
Sbjct: 862  RGIRSPEEIERKVTMHRKRLEAEYGLSNSNKDAAGSKRASLERRDRRDNSHETSRKRHRS 921

Query: 413  HS 408
             S
Sbjct: 922  RS 923


>ref|XP_004138695.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like
            [Cucumis sativus] gi|449493301|ref|XP_004159248.1|
            PREDICTED: U2 snRNP-associated SURP motif-containing
            protein-like [Cucumis sativus]
          Length = 961

 Score =  522 bits (1345), Expect = e-145
 Identities = 312/540 (57%), Positives = 361/540 (66%), Gaps = 34/540 (6%)
 Frame = -3

Query: 1925 AQGDTLQRWRTEPFIMITGSGRWMPPPLQSTKSPENKGEFDTTFSVERSRYVELERYVDL 1746
            AQGDTLQRWRTEPFIMITGSGRW+PPPL + KSPE + E   T++  RSR +ELER    
Sbjct: 386  AQGDTLQRWRTEPFIMITGSGRWVPPPLPTAKSPELEKESGPTYAAGRSRRMELER---- 441

Query: 1745 EQYVELEGILTVSQRDEFEDILRTLTLERRHIKEAMGFALDHADAAREIVEVLTESLTLK 1566
                     LT SQRDEFED+LR LTLER  IKEAMGFALD+ADAA EIVEVLTESLTL+
Sbjct: 442  --------TLTDSQRDEFEDMLRALTLERSQIKEAMGFALDNADAAGEIVEVLTESLTLR 493

Query: 1565 ETPISKKVARLMLVSDILHNSRAPIKNAFAYCSEFQSTLPDIMKSFNDLYHRIEGRITAE 1386
            ETPI  KVARLMLVSDILHNS AP+KNA AY ++F++TLPDI++SFNDLY  I GRITAE
Sbjct: 494  ETPIPTKVARLMLVSDILHNSSAPVKNASAYRTKFEATLPDIIESFNDLYRSITGRITAE 553

Query: 1385 ALKERVLKVLQVWSIWLLFSDAYVNELRATFLRPRISSVIL------DAPEIGNKSSSED 1224
            ALKERVLK+LQVWS W LFSDAYVN LRATFLR   S VI       DAPEI  K++ +D
Sbjct: 554  ALKERVLKLLQVWSDWFLFSDAYVNGLRATFLRLGNSGVIPFHSLCGDAPEIERKANCDD 613

Query: 1223 MAEGSKITEDTVLSTGNETAM-VLSDLPLAELERCCRHNRISLVGGREMMVARLLNLKEA 1047
              +GSKI +D  L+ G   AM  L +LP  ELER CRHN +SLVGGREMMVARLL+L+EA
Sbjct: 614  SGDGSKINQDAELAMGKGGAMKELMNLPFGELERRCRHNGLSLVGGREMMVARLLSLEEA 673

Query: 1046 RQMS-YNRDDDKKYGQSHSNSERYSKDESCLNANDGDHRQTIFGMKLESSRSNNYGEDVM 870
             ++S Y  D+D KY  SHS   RYS          G           E+S  + +G+D  
Sbjct: 674  EKLSGYELDEDLKYSNSHSG--RYSSSSRETKVERG---------PAETSGWSRFGDDEA 722

Query: 869  -------------------QLHG---QGKPNPVLPISNWAREDDGSDVEDKRSAWDLGLS 756
                               +L G    GK +PVLP S WAREDD SD E K     LGLS
Sbjct: 723  DFQRMGSVPLAQTLSIPQPELKGFIKSGKNDPVLPASKWAREDDESDSEQKGGTRGLGLS 782

Query: 755  YSFS----VGYGPIMAHDMKVATDMSVLSQHDSGLAEEQRQKLRHMEFALIDYREYLEER 588
            YS S     G GP  A +M++ T++S L Q DSGL EEQRQKLR +E ALI+YRE LEER
Sbjct: 783  YSSSGSENAGDGPSKADEMEITTELSALMQPDSGLNEEQRQKLRRVEVALIEYRESLEER 842

Query: 587  GIWSYEEIDKKVAIYRRRLHSEYGLSDSNQVVLGYDTSYLESYRRDYSHESSRKRHCSHS 408
            GI S EEI++KV IYR++L SEYGLSDSN+      +      R D SHESSRK H S S
Sbjct: 843  GIKSTEEIERKVLIYRKQLESEYGLSDSNETA-SRKSKIERRDRPDDSHESSRKLHRSQS 901


>ref|XP_007011691.1| RNA recognition motif-containing protein isoform 1 [Theobroma cacao]
            gi|508782054|gb|EOY29310.1| RNA recognition
            motif-containing protein isoform 1 [Theobroma cacao]
          Length = 985

 Score =  521 bits (1343), Expect = e-145
 Identities = 308/532 (57%), Positives = 368/532 (69%), Gaps = 26/532 (4%)
 Frame = -3

Query: 1925 AQGDTLQRWRTEPFIMITGSGRWMPPPLQSTKSPENKGEFDTTFSVERSRYVELERYVDL 1746
            AQGDTLQRWRTEPFIMITGSGRW+PPPL +TKSPE++ +   T++  RSR VE ER    
Sbjct: 387  AQGDTLQRWRTEPFIMITGSGRWVPPPLPTTKSPEHEKDSTATYAAGRSRRVEPER---- 442

Query: 1745 EQYVELEGILTVSQRDEFEDILRTLTLERRHIKEAMGFALDHADAAREIVEVLTESLTLK 1566
                     LT  QRDEFED+LR LTLER  IKEAMGFALD+ADAA EIVEVLTESLTLK
Sbjct: 443  --------TLTDPQRDEFEDMLRALTLERSLIKEAMGFALDNADAAGEIVEVLTESLTLK 494

Query: 1565 ETPISKKVARLMLVSDILHNSRAPIKNAFAYCSEFQSTLPDIMKSFNDLYHRIEGRITAE 1386
            ETPI  KVARLMLVSDILHNS AP+KNA AY ++F++TLPDIM+SFNDLY  + GRITAE
Sbjct: 495  ETPIPTKVARLMLVSDILHNSSAPVKNASAYRTKFEATLPDIMESFNDLYRSVTGRITAE 554

Query: 1385 ALKERVLKVLQVWSIWLLFSDAYVNELRATFLRPRIS------SVILDAPEIGNKSSSED 1224
            ALKERVLKVLQVWS W LFSDAYVN LRATFLR   S      S+  DAPEI   +SSED
Sbjct: 555  ALKERVLKVLQVWSDWFLFSDAYVNGLRATFLRSGNSGVAPFHSICGDAPEIEKNTSSED 614

Query: 1223 MAEGSKITEDTVLSTGNETAM-VLSDLPLAELERCCRHNRISLVGGREMMVARLLNLKEA 1047
              +G K  +D  L+ G   AM  L DLPLAELER CRHN +SLVGGRE+MVARLL+L++A
Sbjct: 615  AGDGIKGNQDAALAMGKGAAMRELMDLPLAELERRCRHNGLSLVGGREIMVARLLSLEDA 674

Query: 1046 -RQMSYNRDDDKKYGQSHSNSERYSKDESCLNAND---GDHRQTIFG-MKLESSRSNNYG 882
             +Q SY  DDD K  QS S+S RYS  +  +NA     G    T +   ++ S R  +  
Sbjct: 675  EKQRSYELDDDLKLAQSRSSSCRYSSGQRDINAEAEPVGLSGWTHYADNEIHSQRKGSVP 734

Query: 881  ---------EDVMQLHGQGKPNPVLPISNWAREDDGSDVEDKRSAWDLGLSYSFS----V 741
                      ++     + K +PVLP S W+REDD SD E+KRS   LGLSYS S     
Sbjct: 735  LAETLPIPQPEIKAFLKKEKIDPVLPASKWSREDDDSDDEEKRSTRGLGLSYSSSGSENA 794

Query: 740  GYGPIMAHDMKVATDMSVLSQHDSGLAEEQRQKLRHMEFALIDYREYLEERGIWSYEEID 561
            G G   A +++  TD S+ +  +S + EEQRQKLR +E ALI+YRE LEERGI S E+I+
Sbjct: 795  GDGTSKADELEFGTDASIPAPSESAMNEEQRQKLRRLEVALIEYRESLEERGIKSAEDIE 854

Query: 560  KKVAIYRRRLHSEYGLSDSNQVVLGYD-TSYLESYRRDYSHESSRKRHCSHS 408
            ++VA +R+RL SEYGLSDS++ + G   TS     RRD +H+SSRKRH S S
Sbjct: 855  RRVAAHRKRLESEYGLSDSSEDISGRKRTSSERRERRDDAHDSSRKRHRSQS 906


>ref|XP_002308714.1| RNA recognition motif-containing family protein [Populus trichocarpa]
            gi|222854690|gb|EEE92237.1| RNA recognition
            motif-containing family protein [Populus trichocarpa]
          Length = 988

 Score =  518 bits (1335), Expect = e-144
 Identities = 308/532 (57%), Positives = 361/532 (67%), Gaps = 26/532 (4%)
 Frame = -3

Query: 1925 AQGDTLQRWRTEPFIMITGSGRWMPPPLQSTKSPENKGEFDTTFSVERSRYVELERYVDL 1746
            AQGDTLQRWRTEPFIMITGSGRW+PP L + KSPE++ E  +T +  RSR V+ ER    
Sbjct: 397  AQGDTLQRWRTEPFIMITGSGRWVPPSLPTAKSPEHEKESGSTHAAGRSRRVDPER---- 452

Query: 1745 EQYVELEGILTVSQRDEFEDILRTLTLERRHIKEAMGFALDHADAAREIVEVLTESLTLK 1566
                     LT  QRDEFED+LR LTLER  IK+AMGFALD+ DAA E+VEVLTESLTLK
Sbjct: 453  --------TLTDPQRDEFEDMLRALTLERSQIKDAMGFALDNVDAAGEVVEVLTESLTLK 504

Query: 1565 ETPISKKVARLMLVSDILHNSRAPIKNAFAYCSEFQSTLPDIMKSFNDLYHRIEGRITAE 1386
            ETPI  KVARLMLVSDILHNS AP+KNA AY ++F++ LPDIM+SFNDLY  I GRITAE
Sbjct: 505  ETPIPTKVARLMLVSDILHNSSAPVKNASAYRTKFEAALPDIMESFNDLYRSITGRITAE 564

Query: 1385 ALKERVLKVLQVWSIWLLFSDAYVNELRATFLRPRISSVIL------DAPEIGNKSSSED 1224
            ALKERVLKVLQVWS W LFSDAYVN LRATFLR   S VI       DAPEI  K+S+ED
Sbjct: 565  ALKERVLKVLQVWSDWFLFSDAYVNGLRATFLRSSNSGVIPFHSMCGDAPEIEKKNSTED 624

Query: 1223 MAEGSKITEDTVLSTGNETA-MVLSDLPLAELERCCRHNRISLVGGREMMVARLLNLKEA 1047
              +G K  +D  L+ G   A   L DLPLAELER CRHN +SLVGGRE MVARLLNL+EA
Sbjct: 625  TVDGGKTNQDAALAMGKGAATKELMDLPLAELERRCRHNGLSLVGGRETMVARLLNLEEA 684

Query: 1046 -RQMSYNRDDDKKYGQSHSNSERYSKDESCLNANDGDHRQT---IFGMKLESSRSNNYGE 879
             +Q  Y  D D K  QS+S+S RYS     +N + G    T   I+G     S++     
Sbjct: 685  EKQRGYELDGDLKIAQSNSSSSRYSSVHREVNVDPGPVGLTGWNIYGEDDTPSQNKRSVS 744

Query: 878  DVMQL----------HGQGKPNPVLPISNWAREDDGSDVEDKRSAWDLGLSYSFS----V 741
             V  L            + K +PVLP S WAR+DD SD E KRS  DLGLSYS S     
Sbjct: 745  LVSTLPIPQPELKAFAKKEKNDPVLPASKWARDDDESDDEQKRSVRDLGLSYSSSGSENA 804

Query: 740  GYGPIMAHDMKVATDMSVLSQHDSGLAEEQRQKLRHMEFALIDYREYLEERGIWSYEEID 561
            G G     +M+ ATD S+ +Q +SG+ EEQRQKLR +E ALI+YRE LEE+G+ + EE +
Sbjct: 805  GDGQGKEDEMEFATDASIPTQPESGMNEEQRQKLRRLEVALIEYRESLEEQGMKNSEEFE 864

Query: 560  KKVAIYRRRLHSEYGLSDSNQVVLGYDTSYLESY-RRDYSHESSRKRHCSHS 408
            +KVA++R+RL SEYGLS SN+ V G      E   RRD +HESSRKRH S S
Sbjct: 865  RKVAVHRKRLESEYGLSSSNEDVTGNKRISSERRDRRDDNHESSRKRHRSES 916


>ref|XP_006483724.1| PREDICTED: LOW QUALITY PROTEIN: U2 snRNP-associated SURP
            motif-containing protein-like [Citrus sinensis]
          Length = 1017

 Score =  513 bits (1320), Expect = e-142
 Identities = 309/534 (57%), Positives = 363/534 (67%), Gaps = 28/534 (5%)
 Frame = -3

Query: 1925 AQGDTLQRWRTEPFIMITGSGRWMPPPLQSTKSPENKGEFDTTFSVERSRYVELERYVDL 1746
            AQGDTLQRWRTEPFIMITGSGRW+PP L ++KSPE++ E  TT++  RSR  E ER    
Sbjct: 430  AQGDTLQRWRTEPFIMITGSGRWIPPALPTSKSPEHEKESGTTYAAGRSRRAEPER---- 485

Query: 1745 EQYVELEGILTVSQRDEFEDILRTLTLERRHIKEAMGFALDHADAAREIVEVLTESLTLK 1566
                     LT SQRDEFED+LR LTLER  IKEAMGFALD+ADAA EIVEVLTESLTLK
Sbjct: 486  --------TLTDSQRDEFEDMLRALTLERSQIKEAMGFALDNADAAGEIVEVLTESLTLK 537

Query: 1565 ETPISKKVARLMLVSDILHNSRAPIKNAFAYCSEFQSTLPDIMKSFNDLYHRIEGRITAE 1386
            ETPI  KVARLMLVSD+LHNS AP+KNA AY ++F++TLPDIM+SFNDLY  I GRITAE
Sbjct: 538  ETPIPTKVARLMLVSDVLHNSSAPVKNASAYRTKFEATLPDIMESFNDLYRSITGRITAE 597

Query: 1385 ALKERVLKVLQVWSIWLLFSDAYVNELRATFLRPRIS------SVILDAPEIGNKSSSED 1224
            ALKERVLKVLQVWS W LFSDAYVN LRATFLR   S      S+  DAPEI  K++SED
Sbjct: 598  ALKERVLKVLQVWSDWFLFSDAYVNGLRATFLRSGNSGVTPFHSICGDAPEIDKKNNSED 657

Query: 1223 MAEGSKITEDTVLSTGNETAM-VLSDLPLAELERCCRHNRISLVGGREMMVARLLNLKEA 1047
              + SK  +DT L+ G   A+  L +LPL+ELER CRHN +SLVGGREMMVARLL+L++A
Sbjct: 658  TCDLSKTNQDTALAMGKGAAIKELMNLPLSELERRCRHNGLSLVGGREMMVARLLSLEDA 717

Query: 1046 -RQMSYNRDDDKKYGQSHSNSERYSK--DESCLNA------------NDGDHRQTIFGMK 912
             +Q  Y  DDD K   S S+S RYS+   E+ + A             D    Q +  + 
Sbjct: 718  EKQRGYELDDDLKSAHSQSSSGRYSRGWKETNMEAESMGLSGWNGYEEDEKLSQAVGSVP 777

Query: 911  LESSRSNNYGEDVMQLHGQGKPNPVLPISNWAREDDGSDVEDKRSAWDLGLSYSFS---- 744
            L +  +    E +     + K +PVLP S WA EDD SD E KRS+  LGLSYS S    
Sbjct: 778  LGTMLTTPQPE-IKAFTKKEKNDPVLPASKWALEDDESDDEQKRSSRGLGLSYSSSGSEN 836

Query: 743  VGYGPIMAHDMKVATDMSVLSQHDSGLAEEQRQKLRHMEFALIDYREYLEERGIWSYEEI 564
             G GP  A D+    D S+  Q DSG+ EEQRQKLR +E +LI+YRE LEERGI S EEI
Sbjct: 837  AGDGPSKADDVDFTIDASIPVQPDSGMNEEQRQKLRRLEVSLIEYRESLEERGIKSSEEI 896

Query: 563  DKKVAIYRRRLHSEYGLSDSNQVVLGYDTSYLESYRRDYSHE--SSRKRHCSHS 408
            +KKVAI+R+RL SEYGL+D N+ V G       + RRD   E   SRKRH S S
Sbjct: 897  EKKVAIHRKRLESEYGLADPNEDVSG-------NKRRDRRDEILDSRKRHRSQS 943


>ref|XP_006450262.1| hypothetical protein CICLE_v10007357mg [Citrus clementina]
            gi|567916514|ref|XP_006450263.1| hypothetical protein
            CICLE_v10007357mg [Citrus clementina]
            gi|557553488|gb|ESR63502.1| hypothetical protein
            CICLE_v10007357mg [Citrus clementina]
            gi|557553489|gb|ESR63503.1| hypothetical protein
            CICLE_v10007357mg [Citrus clementina]
          Length = 973

 Score =  513 bits (1320), Expect = e-142
 Identities = 309/534 (57%), Positives = 363/534 (67%), Gaps = 28/534 (5%)
 Frame = -3

Query: 1925 AQGDTLQRWRTEPFIMITGSGRWMPPPLQSTKSPENKGEFDTTFSVERSRYVELERYVDL 1746
            AQGDTLQRWRTEPFIMITGSGRW+PP L ++KSPE++ E  TT++  RSR  E ER    
Sbjct: 386  AQGDTLQRWRTEPFIMITGSGRWIPPALPTSKSPEHEKESGTTYAAGRSRRAEPER---- 441

Query: 1745 EQYVELEGILTVSQRDEFEDILRTLTLERRHIKEAMGFALDHADAAREIVEVLTESLTLK 1566
                     LT SQRDEFED+LR LTLER  IKEAMGFALD+ADAA EIVEVLTESLTLK
Sbjct: 442  --------TLTDSQRDEFEDMLRALTLERSQIKEAMGFALDNADAAGEIVEVLTESLTLK 493

Query: 1565 ETPISKKVARLMLVSDILHNSRAPIKNAFAYCSEFQSTLPDIMKSFNDLYHRIEGRITAE 1386
            ETPI  KVARLMLVSD+LHNS AP+KNA AY ++F++TLPDIM+SFNDLY  I GRITAE
Sbjct: 494  ETPIPTKVARLMLVSDVLHNSSAPVKNASAYRTKFEATLPDIMESFNDLYRSITGRITAE 553

Query: 1385 ALKERVLKVLQVWSIWLLFSDAYVNELRATFLRPRIS------SVILDAPEIGNKSSSED 1224
            ALKERVLKVLQVWS W LFSDAYVN LRATFLR   S      S+  DAPEI  K++SED
Sbjct: 554  ALKERVLKVLQVWSDWFLFSDAYVNGLRATFLRSGNSGVTPFHSICGDAPEIDKKNNSED 613

Query: 1223 MAEGSKITEDTVLSTGNETAM-VLSDLPLAELERCCRHNRISLVGGREMMVARLLNLKEA 1047
              + SK  +DT L+ G   A+  L +LPL+ELER CRHN +SLVGGREMMVARLL+L++A
Sbjct: 614  TCDLSKTNQDTALAMGKGAAIKELMNLPLSELERRCRHNGLSLVGGREMMVARLLSLEDA 673

Query: 1046 -RQMSYNRDDDKKYGQSHSNSERYSK--DESCLNA------------NDGDHRQTIFGMK 912
             +Q  Y  DDD K   S S+S RYS+   E+ + A             D    Q +  + 
Sbjct: 674  EKQRGYELDDDLKSAHSQSSSGRYSRGWKETNMEAESMGLSGWNGYEEDEKLSQAVGSVP 733

Query: 911  LESSRSNNYGEDVMQLHGQGKPNPVLPISNWAREDDGSDVEDKRSAWDLGLSYSFS---- 744
            L +  +    E +     + K +PVLP S WA EDD SD E KRS+  LGLSYS S    
Sbjct: 734  LGTMLTTPQPE-IKAFTKKEKNDPVLPASKWALEDDESDDEQKRSSRGLGLSYSSSGSEN 792

Query: 743  VGYGPIMAHDMKVATDMSVLSQHDSGLAEEQRQKLRHMEFALIDYREYLEERGIWSYEEI 564
             G GP  A D+    D S+  Q DSG+ EEQRQKLR +E +LI+YRE LEERGI S EEI
Sbjct: 793  AGDGPSKADDVDFTIDASIPVQPDSGMNEEQRQKLRRLEVSLIEYRESLEERGIKSSEEI 852

Query: 563  DKKVAIYRRRLHSEYGLSDSNQVVLGYDTSYLESYRRDYSHE--SSRKRHCSHS 408
            +KKVAI+R+RL SEYGL+D N+ V G       + RRD   E   SRKRH S S
Sbjct: 853  EKKVAIHRKRLESEYGLADPNEDVSG-------NKRRDRRDEILDSRKRHRSQS 899


>ref|XP_002515412.1| RNA binding protein, putative [Ricinus communis]
            gi|223545356|gb|EEF46861.1| RNA binding protein, putative
            [Ricinus communis]
          Length = 979

 Score =  511 bits (1316), Expect = e-142
 Identities = 311/544 (57%), Positives = 364/544 (66%), Gaps = 37/544 (6%)
 Frame = -3

Query: 1925 AQGDTLQRWRTEPFIMITGSGRWMPPPLQSTKSPENKGEFDTTFSVERSRYVELERYVDL 1746
            AQGDTLQRWRTEPFIMITGSGRW+PP L + KSPE++ E   T++  +SR V+ ER    
Sbjct: 385  AQGDTLQRWRTEPFIMITGSGRWIPPSLPTAKSPEHEKESGNTYAAGKSRRVDPER---- 440

Query: 1745 EQYVELEGILTVSQRDEFEDILRTLTLERRHIKEAMGFALDHADAAREIVEVLTESLTLK 1566
                     LT  QRDEFED+LR LTLER  IK+AMGFALD+ADAA EIVEVLTESLTLK
Sbjct: 441  --------TLTDPQRDEFEDMLRALTLERSQIKDAMGFALDNADAAGEIVEVLTESLTLK 492

Query: 1565 ETPISKKVARLMLVSDILHNSRAPIKNAFAYCSEFQSTLPDIMKSFNDLYHRIEGRITAE 1386
            ETPI  KVAR+MLVSDILHNS AP+KNA AY ++F++TLPDIM+SFNDLY  I GRITAE
Sbjct: 493  ETPIPTKVARIMLVSDILHNSSAPVKNASAYRTKFEATLPDIMESFNDLYRSITGRITAE 552

Query: 1385 ALKERVLKVLQVWSIWLLFSDAYVNELRATFLRPRISSVIL------DAPEIGNKSSSED 1224
            ALKERV+KVLQVWS W LFSDAYVN LRATFLR   S VI       DAP I  K +SED
Sbjct: 553  ALKERVMKVLQVWSDWFLFSDAYVNGLRATFLRSSTSGVIPFHSICGDAPAIEKKVTSED 612

Query: 1223 MAEGSKITEDTVLSTGNETAM-VLSDLPLAELERCCRHNRISLVGGREMMVARLLNLKEA 1047
              +G K ++D  L+ G   AM  L  LPLAELER CRHN +SLVGGREMMVARLL+L+EA
Sbjct: 613  TGDGGKTSQDAALAMGKGAAMKELLSLPLAELERRCRHNGLSLVGGREMMVARLLSLEEA 672

Query: 1046 -RQMSYNRDDDKKYGQSHSNSERYSKDESCLNANDGDHRQTIFGMKLES-SRSNNYGEDV 873
             +Q  Y  DD+ K  QSH +S ++S             R+T   ++LE  S  N YGED 
Sbjct: 673  EKQRGYELDDNLKVSQSHLSSSKFS----------SGRRET--NVELEPVSEWNVYGEDD 720

Query: 872  MQLHGQG---------------------KPNPVLPISNWAREDDGSDVEDKRSAWDLGLS 756
            +Q   +                      K +PVLP S WAR+DD SD E KRS+  LGLS
Sbjct: 721  VQSQSRASASLATFPIPQAELKAFTKKEKNDPVLPASKWARDDDDSDDEQKRSSRGLGLS 780

Query: 755  YSFS----VGYGPIMAHD-MKVATDMSVLSQHDSGLAEEQRQKLRHMEFALIDYREYLEE 591
            YS S     G G   A D M+ ATD S+  Q DSG+ EEQRQKLR +E ALI+YRE LEE
Sbjct: 781  YSSSGSENAGDGLGKADDEMEFATDGSISVQPDSGMNEEQRQKLRRLEVALIEYRESLEE 840

Query: 590  RGIWSYEEIDKKVAIYRRRLHSEYGLSDSNQVVLGYD--TSYLESYRRDYSHESSRKRHC 417
            RG+ S EEI++KVA +R+RL S+YGL DS+Q   G     S     RRD S ESSRKRH 
Sbjct: 841  RGMKSAEEIERKVASHRKRLQSDYGLLDSSQDTPGNSKRASSERRDRRDDSRESSRKRHR 900

Query: 416  SHST 405
            S S+
Sbjct: 901  SESS 904


>ref|XP_007156303.1| hypothetical protein PHAVU_003G2751000g, partial [Phaseolus vulgaris]
            gi|593786527|ref|XP_007156304.1| hypothetical protein
            PHAVU_003G2751000g, partial [Phaseolus vulgaris]
            gi|561029657|gb|ESW28297.1| hypothetical protein
            PHAVU_003G2751000g, partial [Phaseolus vulgaris]
            gi|561029658|gb|ESW28298.1| hypothetical protein
            PHAVU_003G2751000g, partial [Phaseolus vulgaris]
          Length = 813

 Score =  504 bits (1298), Expect = e-140
 Identities = 302/540 (55%), Positives = 357/540 (66%), Gaps = 34/540 (6%)
 Frame = -3

Query: 1925 AQGDTLQRWRTEPFIMITGSGRWMPPPLQSTKSPENKGEFDTTFSVERSRYVELERYVDL 1746
            AQGDTLQRWRTEPFIMITGSGRW+PP L  +KSPE++ E  +T +  RSR VE ER    
Sbjct: 229  AQGDTLQRWRTEPFIMITGSGRWIPPSLPISKSPEHEKESGSTHAGGRSRRVEPER---- 284

Query: 1745 EQYVELEGILTVSQRDEFEDILRTLTLERRHIKEAMGFALDHADAAREIVEVLTESLTLK 1566
                     LT +QRDEFED+LR LTLER  IKEAMGF+LD+ADAA EIVEVLTESLTLK
Sbjct: 285  --------TLTDAQRDEFEDMLRALTLERSQIKEAMGFSLDNADAAGEIVEVLTESLTLK 336

Query: 1565 ETPISKKVARLMLVSDILHNSRAPIKNAFAYCSEFQSTLPDIMKSFNDLYHRIEGRITAE 1386
            ETPI  K+ARLMLVSDILHNS AP++NA AY ++F++TLPDIM+SFNDLY  I GRITAE
Sbjct: 337  ETPIPTKIARLMLVSDILHNSSAPVRNASAYRTKFEATLPDIMESFNDLYRSIMGRITAE 396

Query: 1385 ALKERVLKVLQVWSIWLLFSDAYVNELRATFLRPRISSVIL------DAPEIGNKSSSED 1224
            ALKERVLKVLQVW+ W LFSD YVN LRATFLRP  S VI       DAPEI  K++SED
Sbjct: 397  ALKERVLKVLQVWADWFLFSDGYVNGLRATFLRPGNSGVIPFHSICGDAPEIEQKTTSED 456

Query: 1223 MAEGSKITEDTVLSTGNETAM-VLSDLPLAELERCCRHNRISLVGGREMMVARLLNLKEA 1047
            +  G K  +D  L+ G   AM  L  LPLAELER CRHN +SLVGGREMMVARLL+L+EA
Sbjct: 457  IVVGGKTNQDAALAMGRGAAMKELMSLPLAELERRCRHNGLSLVGGREMMVARLLSLEEA 516

Query: 1046 -RQMSYNRDDDKKYGQSHSNSERYSKDESCLNANDGDHRQTIFGMKLESSRSNNYGEDVM 870
             +Q  Y  DD+ KY  +   S +YS      N  +        G+    S  N YG++ +
Sbjct: 517  EKQRGYELDDELKYAHNQGTSGKYSS-----NLQETSAESEPVGL----SAWNQYGDEDL 567

Query: 869  QLHGQG----------------------KPNPVLPISNWAREDDGSDVEDKRSAWDLGLS 756
            Q   +                       K +PVLP S WAREDD SD E ++   +LGLS
Sbjct: 568  QSQSRSSISLASTLPIPQPELKAFTKKEKSDPVLPASKWAREDDESDDEQRKGGKNLGLS 627

Query: 755  YSFS----VGYGPIMAHDMKVATDMSVLSQHDSGLAEEQRQKLRHMEFALIDYREYLEER 588
            YS S    V  GPI A +++ A   S  +  DSG+ EEQRQKLR +E ALI+YRE LEER
Sbjct: 628  YSSSGSENVDDGPIKADELESAAGTSFPAHTDSGMNEEQRQKLRRLEVALIEYRESLEER 687

Query: 587  GIWSYEEIDKKVAIYRRRLHSEYGLSDSNQVVLGYDTSYLESYRRDYSHESSRKRHCSHS 408
            GI + EEIDKKV  +R+RL +EYGLSDS +   G   +   S RRD  H+ SRKRH S S
Sbjct: 688  GIKNLEEIDKKVESHRKRLQAEYGLSDSGEDGKG---NRRTSERRD-RHDVSRKRHRSRS 743


>ref|XP_004509625.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like
            isoform X4 [Cicer arietinum]
          Length = 851

 Score =  504 bits (1298), Expect = e-140
 Identities = 302/540 (55%), Positives = 358/540 (66%), Gaps = 34/540 (6%)
 Frame = -3

Query: 1925 AQGDTLQRWRTEPFIMITGSGRWMPPPLQSTKSPENKGEFDTTFSVERSRYVELERYVDL 1746
            AQGDTLQRWRTEPFIMITGSGRW+PP L   KSPE+  E  +T +  RSR VE ER    
Sbjct: 260  AQGDTLQRWRTEPFIMITGSGRWIPPALPIAKSPEHDKESGSTHAAGRSRRVEPER---- 315

Query: 1745 EQYVELEGILTVSQRDEFEDILRTLTLERRHIKEAMGFALDHADAAREIVEVLTESLTLK 1566
                     LT +QRDEFED+LR LTLER  IKE MGF+LD+ADAA EIVEVLTESLTLK
Sbjct: 316  --------TLTDAQRDEFEDMLRALTLERSQIKETMGFSLDNADAAGEIVEVLTESLTLK 367

Query: 1565 ETPISKKVARLMLVSDILHNSRAPIKNAFAYCSEFQSTLPDIMKSFNDLYHRIEGRITAE 1386
            ETPI  K+ARLMLVSDILHNS AP++NA AY ++F++TLPD+M+SFNDLY  I GRITAE
Sbjct: 368  ETPIPTKIARLMLVSDILHNSSAPVRNASAYRTKFEATLPDVMESFNDLYRSIMGRITAE 427

Query: 1385 ALKERVLKVLQVWSIWLLFSDAYVNELRATFLRPRISSVIL------DAPEIGNKSSSED 1224
            ALKERVLKVLQVW+ W LFSDAYVN LRATFLRP  S VI       DAPEI  K +SED
Sbjct: 428  ALKERVLKVLQVWADWFLFSDAYVNGLRATFLRPGNSGVIPFHSICGDAPEIEQKMTSED 487

Query: 1223 MAEGSKITEDTVLSTGNETA-MVLSDLPLAELERCCRHNRISLVGGREMMVARLLNLKEA 1047
               G K  +D  L+ G   A   L  LPLAELER CRHN +SLVGGREMMVARLL+L+EA
Sbjct: 488  AVVGGKTDQDAALAMGRGAATQELMSLPLAELERRCRHNGLSLVGGREMMVARLLSLEEA 547

Query: 1046 -RQMSYNRDDDKKYGQSHSNSERYSKDESCLNANDGDHRQTIFGMKLESSRSNNYGEDVM 870
             +Q  +  DD+ KY  + ++S +YS      +A             + SS  N+Y +D +
Sbjct: 548  EKQRGFELDDELKYPLNQASSGKYSSSRRETSAEP---------EPMGSSGWNHYEDDDV 598

Query: 869  QLHGQGK---------PNP-------------VLPISNWAREDDGSDVEDKRSAWDLGLS 756
            QL G+G          P P             VLP S WAREDD SD E  +   +LGLS
Sbjct: 599  QLQGKGSVPLAPTLPIPQPELKAFTRKEKSDIVLPASKWAREDDESDDEQTKGGKNLGLS 658

Query: 755  YSFS----VGYGPIMAHDMKVATDMSVLSQHDSGLAEEQRQKLRHMEFALIDYREYLEER 588
            YS S    VG G I A + + A D S  +  DSGL EEQRQKLR +E ALI+YRE LEER
Sbjct: 659  YSSSGSENVGDGLIKADESEAAADSSFSAHADSGLNEEQRQKLRRLEVALIEYRESLEER 718

Query: 587  GIWSYEEIDKKVAIYRRRLHSEYGLSDSNQVVLGYDTSYLESYRRDYSHESSRKRHCSHS 408
            GI + EEI+KKV ++R+RL  EYGLS+S++   G  +    S RRD  H++SRKRH +HS
Sbjct: 719  GIKNLEEIEKKVLMHRKRLQVEYGLSESSED--GQGSRRTSSERRD-RHDASRKRHRTHS 775


>ref|XP_004509622.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like
            isoform X1 [Cicer arietinum]
            gi|502154215|ref|XP_004509623.1| PREDICTED: U2
            snRNP-associated SURP motif-containing protein-like
            isoform X2 [Cicer arietinum]
            gi|502154218|ref|XP_004509624.1| PREDICTED: U2
            snRNP-associated SURP motif-containing protein-like
            isoform X3 [Cicer arietinum]
          Length = 977

 Score =  504 bits (1298), Expect = e-140
 Identities = 302/540 (55%), Positives = 358/540 (66%), Gaps = 34/540 (6%)
 Frame = -3

Query: 1925 AQGDTLQRWRTEPFIMITGSGRWMPPPLQSTKSPENKGEFDTTFSVERSRYVELERYVDL 1746
            AQGDTLQRWRTEPFIMITGSGRW+PP L   KSPE+  E  +T +  RSR VE ER    
Sbjct: 386  AQGDTLQRWRTEPFIMITGSGRWIPPALPIAKSPEHDKESGSTHAAGRSRRVEPER---- 441

Query: 1745 EQYVELEGILTVSQRDEFEDILRTLTLERRHIKEAMGFALDHADAAREIVEVLTESLTLK 1566
                     LT +QRDEFED+LR LTLER  IKE MGF+LD+ADAA EIVEVLTESLTLK
Sbjct: 442  --------TLTDAQRDEFEDMLRALTLERSQIKETMGFSLDNADAAGEIVEVLTESLTLK 493

Query: 1565 ETPISKKVARLMLVSDILHNSRAPIKNAFAYCSEFQSTLPDIMKSFNDLYHRIEGRITAE 1386
            ETPI  K+ARLMLVSDILHNS AP++NA AY ++F++TLPD+M+SFNDLY  I GRITAE
Sbjct: 494  ETPIPTKIARLMLVSDILHNSSAPVRNASAYRTKFEATLPDVMESFNDLYRSIMGRITAE 553

Query: 1385 ALKERVLKVLQVWSIWLLFSDAYVNELRATFLRPRISSVIL------DAPEIGNKSSSED 1224
            ALKERVLKVLQVW+ W LFSDAYVN LRATFLRP  S VI       DAPEI  K +SED
Sbjct: 554  ALKERVLKVLQVWADWFLFSDAYVNGLRATFLRPGNSGVIPFHSICGDAPEIEQKMTSED 613

Query: 1223 MAEGSKITEDTVLSTGNETA-MVLSDLPLAELERCCRHNRISLVGGREMMVARLLNLKEA 1047
               G K  +D  L+ G   A   L  LPLAELER CRHN +SLVGGREMMVARLL+L+EA
Sbjct: 614  AVVGGKTDQDAALAMGRGAATQELMSLPLAELERRCRHNGLSLVGGREMMVARLLSLEEA 673

Query: 1046 -RQMSYNRDDDKKYGQSHSNSERYSKDESCLNANDGDHRQTIFGMKLESSRSNNYGEDVM 870
             +Q  +  DD+ KY  + ++S +YS      +A             + SS  N+Y +D +
Sbjct: 674  EKQRGFELDDELKYPLNQASSGKYSSSRRETSAEP---------EPMGSSGWNHYEDDDV 724

Query: 869  QLHGQGK---------PNP-------------VLPISNWAREDDGSDVEDKRSAWDLGLS 756
            QL G+G          P P             VLP S WAREDD SD E  +   +LGLS
Sbjct: 725  QLQGKGSVPLAPTLPIPQPELKAFTRKEKSDIVLPASKWAREDDESDDEQTKGGKNLGLS 784

Query: 755  YSFS----VGYGPIMAHDMKVATDMSVLSQHDSGLAEEQRQKLRHMEFALIDYREYLEER 588
            YS S    VG G I A + + A D S  +  DSGL EEQRQKLR +E ALI+YRE LEER
Sbjct: 785  YSSSGSENVGDGLIKADESEAAADSSFSAHADSGLNEEQRQKLRRLEVALIEYRESLEER 844

Query: 587  GIWSYEEIDKKVAIYRRRLHSEYGLSDSNQVVLGYDTSYLESYRRDYSHESSRKRHCSHS 408
            GI + EEI+KKV ++R+RL  EYGLS+S++   G  +    S RRD  H++SRKRH +HS
Sbjct: 845  GIKNLEEIEKKVLMHRKRLQVEYGLSESSED--GQGSRRTSSERRD-RHDASRKRHRTHS 901


>ref|XP_006858350.1| hypothetical protein AMTR_s00064p00173090 [Amborella trichopoda]
            gi|548862457|gb|ERN19817.1| hypothetical protein
            AMTR_s00064p00173090 [Amborella trichopoda]
          Length = 1011

 Score =  502 bits (1293), Expect = e-139
 Identities = 303/549 (55%), Positives = 362/549 (65%), Gaps = 39/549 (7%)
 Frame = -3

Query: 1925 AQGDTLQRWRTEPFIMITGSGRWMPPPLQSTKSPENKGEFDTTFSVE-RSRYVELERYVD 1749
            AQGDTLQRWRTEPFIMITGSGRW+PPPL  +KSPE + E  TTF+   RSR VELER   
Sbjct: 418  AQGDTLQRWRTEPFIMITGSGRWIPPPLPISKSPELEKESGTTFAAAGRSRRVELER--- 474

Query: 1748 LEQYVELEGILTVSQRDEFEDILRTLTLERRHIKEAMGFALDHADAAREIVEVLTESLTL 1569
                      LT  QRD+FED+LR LTLER  IKEAMGFALD+ADAA E+VEVLTESLTL
Sbjct: 475  ---------TLTDPQRDQFEDMLRALTLERSQIKEAMGFALDNADAAGEVVEVLTESLTL 525

Query: 1568 KETPISKKVARLMLVSDILHNSRAPIKNAFAYCSEFQSTLPDIMKSFNDLYHRIEGRITA 1389
            KET I  KVARLMLVSDILHNS AP+KNA AY ++F++TLPDIM+SFNDLY  I GRITA
Sbjct: 526  KETLIPTKVARLMLVSDILHNSSAPVKNASAYRTKFEATLPDIMESFNDLYRSITGRITA 585

Query: 1388 EALKERVLKVLQVWSIWLLFSDAYVNELRATFLRPRISSVIL------DAPEIGNKSSSE 1227
            EALKERVLKVLQVWS W LFSDAYVN LRATF+R   S VI       D PE+ NK++S 
Sbjct: 586  EALKERVLKVLQVWSDWFLFSDAYVNGLRATFIRSSNSGVIPFHSICGDLPEMENKTTST 645

Query: 1226 DMAEGSKITEDTVLSTGNETAM-VLSDLPLAELERCCRHNRISLVGGREMMVARLLNLKE 1050
            D  EG+K+ +D  L+ G   A+  L +LPL ELER CRHN +SL GGREMMVARLL+L+E
Sbjct: 646  DSGEGAKVNQDAALAMGKGAAVKELLNLPLTELERRCRHNGLSLCGGREMMVARLLSLEE 705

Query: 1049 A-RQMSYNRDDDKKYGQSHSNSERYSKDESCLNANDGDHRQTIFGMKLESSRSNNYGEDV 873
            A +Q S++RDDD +YGQ      RYS++ES  N  D   ++T  G +  S    +YGE+V
Sbjct: 706  AEKQKSHDRDDDLRYGQ------RYSREESTWNVCDAGQKETNSGAEPWS----HYGEEV 755

Query: 872  MQLHG------------------------QGKPNPVLPISNWAREDDGSDVEDKRSAWDL 765
             +                           +GK +PVLPIS WAREDD SD ++ +    L
Sbjct: 756  FRSQSKAPSSSMTPTLPIPQPELKAFAIKKGKSDPVLPISKWAREDDASDDDEDKKGLGL 815

Query: 764  GLSYSFSV--GYGPIMAHDMKVATDMSVLSQHDSGLAEEQRQKLRHMEFALIDYREYLEE 591
            G S S S   G GP  A D +V+ D S+ S  DS ++EE RQKLR +E A+++YRE LEE
Sbjct: 816  GYSSSGSEDGGDGPRKAGDPEVSGDASLPSYADSLMSEEYRQKLRSLEVAVMEYRESLEE 875

Query: 590  RGIWSYEEIDKKVAIYRRRLHSEYGLSDSNQVVLGYDTSYLESYRRD----YSHESSRKR 423
            RGI + EEI++KVA +RRRL SE+GL DS     G    +  S  R           RKR
Sbjct: 876  RGIRNPEEIERKVAAHRRRLQSEFGLLDSFGDASGNSKHFSRSSERSSLERRERRDDRKR 935

Query: 422  HCSHSTWPP 396
            H S S  PP
Sbjct: 936  HRSQSRSPP 944


>ref|XP_006599196.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like
            [Glycine max]
          Length = 969

 Score =  501 bits (1289), Expect = e-139
 Identities = 302/541 (55%), Positives = 364/541 (67%), Gaps = 35/541 (6%)
 Frame = -3

Query: 1925 AQGDTLQRWRTEPFIMITGSGRWMPPPLQSTKSPENKGEFDTTFSVERSRYVELERYVDL 1746
            AQGDTLQRWRTEPFIMITGSGRW+PPPL  +KSPE++ E   T +  RSR VE ER    
Sbjct: 386  AQGDTLQRWRTEPFIMITGSGRWIPPPLPMSKSPEHEKEPGPTHAGGRSRRVEPER---- 441

Query: 1745 EQYVELEGILTVSQRDEFEDILRTLTLERRHIKEAMGFALDHADAAREIVEVLTESLTLK 1566
                     LT +QRDEFED+LR LTLER  IKEAMGF+LD+ADAA E+VEVLTESLTLK
Sbjct: 442  --------TLTDAQRDEFEDMLRALTLERSQIKEAMGFSLDNADAAGEVVEVLTESLTLK 493

Query: 1565 ETPISKKVARLMLVSDILHNSRAPIKNAFAYCSEFQSTLPDIMKSFNDLYHRIEGRITAE 1386
            ETPI  K+ARLMLVSDILHNS AP++NA AY ++F++TLPDIM+SFNDLY  I GRITAE
Sbjct: 494  ETPIPTKIARLMLVSDILHNSSAPVRNASAYRTKFEATLPDIMESFNDLYRSIMGRITAE 553

Query: 1385 ALKERVLKVLQVWSIWLLFSDAYVNELRATFLRPRISSVIL------DAPEIGNKSSSED 1224
            ALKERVLKVLQVW+ W LFSDAYVN LRATFLRP  S VI       DAPEI  K++SED
Sbjct: 554  ALKERVLKVLQVWADWFLFSDAYVNGLRATFLRPGNSGVIPFHSICGDAPEIEQKTASED 613

Query: 1223 MAEGSKITEDTVLSTGNETAM-VLSDLPLAELERCCRHNRISLVGGREMMVARLLNLKEA 1047
            M  G K  +D  L+ G   AM  L  LPLAELER CRHN +SLVGGREMMVARLL+L+EA
Sbjct: 614  MVVGGKTNQDAALAMGRGAAMKELMSLPLAELERRCRHNGLSLVGGREMMVARLLSLEEA 673

Query: 1046 -RQMSYNRDDDKKYGQSHSNSERYSKDESCLNANDGDHRQTIFGM-KLESSRSNNYGEDV 873
             +Q  +  DD+ KY  +  +S +YS ++          R+T   +  +  S  N+YG++ 
Sbjct: 674  EKQKGFELDDELKYAHNQVSSGKYSSNQ----------RETSAELDPVGLSAWNHYGDED 723

Query: 872  MQLHGQG----------------------KPNPVLPISNWAREDDGSDVEDKRSAWDLGL 759
            +Q  G+                       K +PVLP S WAREDD SD +++RS  +LGL
Sbjct: 724  IQSQGRSSVPLAPTLPIPQPKLKAFTKKEKNDPVLPASKWAREDDESD-DEQRSGKNLGL 782

Query: 758  SYSFS----VGYGPIMAHDMKVATDMSVLSQHDSGLAEEQRQKLRHMEFALIDYREYLEE 591
            SYS S    V  G + A + + A D S  +  DSG+ EEQRQKLR +E ALI+Y E LEE
Sbjct: 783  SYSSSGSENVDDGLVKADESESAADRSFSAHADSGMNEEQRQKLRRLEVALIEYGESLEE 842

Query: 590  RGIWSYEEIDKKVAIYRRRLHSEYGLSDSNQVVLGYDTSYLESYRRDYSHESSRKRHCSH 411
            RGI + EEI+KKV ++R+RL  EYGLSDS +   G   +   S RRD  H+ SRKRH S 
Sbjct: 843  RGIKNLEEIEKKVQLHRKRLQVEYGLSDSGEDGQG---NRRTSERRD-RHDVSRKRHRSR 898

Query: 410  S 408
            S
Sbjct: 899  S 899


>ref|XP_006585862.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like
            isoform X3 [Glycine max] gi|571473238|ref|XP_006585863.1|
            PREDICTED: U2 snRNP-associated SURP motif-containing
            protein-like isoform X4 [Glycine max]
          Length = 874

 Score =  490 bits (1261), Expect = e-135
 Identities = 294/540 (54%), Positives = 360/540 (66%), Gaps = 34/540 (6%)
 Frame = -3

Query: 1925 AQGDTLQRWRTEPFIMITGSGRWMPPPLQSTKSPENKGEFDTTFSVERSRYVELERYVDL 1746
            AQGDTLQRWRTEPFIMITGSGRW+PP L  +KSPE++ E  +T +  RSR VE +R    
Sbjct: 291  AQGDTLQRWRTEPFIMITGSGRWIPPQLPMSKSPEHEKESGSTHAGGRSRRVEPDR---- 346

Query: 1745 EQYVELEGILTVSQRDEFEDILRTLTLERRHIKEAMGFALDHADAAREIVEVLTESLTLK 1566
                     LT +QRDEFED+LR LTLER  IKEAMGF+LD+ADAA EIVEVLTESLTLK
Sbjct: 347  --------TLTDAQRDEFEDMLRALTLERSQIKEAMGFSLDNADAAGEIVEVLTESLTLK 398

Query: 1565 ETPISKKVARLMLVSDILHNSRAPIKNAFAYCSEFQSTLPDIMKSFNDLYHRIEGRITAE 1386
            ETPI  K+ARLMLVSDILHNS AP++NA AY ++F++TLPDIM+SFNDLY  I GRITAE
Sbjct: 399  ETPIPTKIARLMLVSDILHNSSAPVRNASAYRTKFEATLPDIMESFNDLYRSIMGRITAE 458

Query: 1385 ALKERVLKVLQVWSIWLLFSDAYVNELRATFLRPRISSVIL------DAPEIGNKSSSED 1224
            ALKERVLKVLQVW+ W LFSDAYVN LRATFLRP  S VI       DAPEI   ++S+D
Sbjct: 459  ALKERVLKVLQVWADWFLFSDAYVNGLRATFLRPGNSGVIPFHSICGDAPEIEQNTTSKD 518

Query: 1223 MAEGSKITEDTVLSTGNETAM-VLSDLPLAELERCCRHNRISLVGGREMMVARLLNLKEA 1047
            M  G K  +D  L+ G   AM  L  LPLAELER CRHN +SLVGGREMMVARLL+L+EA
Sbjct: 519  MVVGGKTNQDAALAMGRGAAMKELMSLPLAELERRCRHNGLSLVGGREMMVARLLSLEEA 578

Query: 1046 -RQMSYNRDDDKKYGQSHSNSERYSKDESCLNANDGDHRQTIFGMKLESSRSNNYGEDVM 870
             +Q  +  D++ KY  +  +S +YS ++          R+T    +      N+YG++ +
Sbjct: 579  EKQRGFELDEELKYAHNQVSSGKYSSNQ----------RET---SEEPDPVWNHYGDEDL 625

Query: 869  QLHGQG----------------------KPNPVLPISNWAREDDGSDVEDKRSAWDLGLS 756
            Q  G+                       K +PVLP S WA E D SD E +RS  ++GLS
Sbjct: 626  QSQGRSSVPLSPTLPIAQPELKAFTKKEKNDPVLPASKWAWEGDESDDEQRRSGKNIGLS 685

Query: 755  YSFS----VGYGPIMAHDMKVATDMSVLSQHDSGLAEEQRQKLRHMEFALIDYREYLEER 588
            YS S    VG G + A + + A D    +  DSG+ EEQRQKLR +E ALI+YRE LEER
Sbjct: 686  YSSSGSENVGDGLVKADESESAADTRFSAHADSGMNEEQRQKLRRLEVALIEYRESLEER 745

Query: 587  GIWSYEEIDKKVAIYRRRLHSEYGLSDSNQVVLGYDTSYLESYRRDYSHESSRKRHCSHS 408
            G+ + EEI+KKV  +R+RL  EYGLSDS +   G+  +   S RRD+ ++ SRKRH S S
Sbjct: 746  GVKNLEEIEKKVQSHRKRLQVEYGLSDSGEDGHGHRRT---SERRDW-NDVSRKRHRSPS 801


>ref|XP_006585860.1| PREDICTED: U2 snRNP-associated SURP motif-containing protein-like
            isoform X1 [Glycine max] gi|571473234|ref|XP_006585861.1|
            PREDICTED: U2 snRNP-associated SURP motif-containing
            protein-like isoform X2 [Glycine max]
          Length = 969

 Score =  490 bits (1261), Expect = e-135
 Identities = 294/540 (54%), Positives = 360/540 (66%), Gaps = 34/540 (6%)
 Frame = -3

Query: 1925 AQGDTLQRWRTEPFIMITGSGRWMPPPLQSTKSPENKGEFDTTFSVERSRYVELERYVDL 1746
            AQGDTLQRWRTEPFIMITGSGRW+PP L  +KSPE++ E  +T +  RSR VE +R    
Sbjct: 386  AQGDTLQRWRTEPFIMITGSGRWIPPQLPMSKSPEHEKESGSTHAGGRSRRVEPDR---- 441

Query: 1745 EQYVELEGILTVSQRDEFEDILRTLTLERRHIKEAMGFALDHADAAREIVEVLTESLTLK 1566
                     LT +QRDEFED+LR LTLER  IKEAMGF+LD+ADAA EIVEVLTESLTLK
Sbjct: 442  --------TLTDAQRDEFEDMLRALTLERSQIKEAMGFSLDNADAAGEIVEVLTESLTLK 493

Query: 1565 ETPISKKVARLMLVSDILHNSRAPIKNAFAYCSEFQSTLPDIMKSFNDLYHRIEGRITAE 1386
            ETPI  K+ARLMLVSDILHNS AP++NA AY ++F++TLPDIM+SFNDLY  I GRITAE
Sbjct: 494  ETPIPTKIARLMLVSDILHNSSAPVRNASAYRTKFEATLPDIMESFNDLYRSIMGRITAE 553

Query: 1385 ALKERVLKVLQVWSIWLLFSDAYVNELRATFLRPRISSVIL------DAPEIGNKSSSED 1224
            ALKERVLKVLQVW+ W LFSDAYVN LRATFLRP  S VI       DAPEI   ++S+D
Sbjct: 554  ALKERVLKVLQVWADWFLFSDAYVNGLRATFLRPGNSGVIPFHSICGDAPEIEQNTTSKD 613

Query: 1223 MAEGSKITEDTVLSTGNETAM-VLSDLPLAELERCCRHNRISLVGGREMMVARLLNLKEA 1047
            M  G K  +D  L+ G   AM  L  LPLAELER CRHN +SLVGGREMMVARLL+L+EA
Sbjct: 614  MVVGGKTNQDAALAMGRGAAMKELMSLPLAELERRCRHNGLSLVGGREMMVARLLSLEEA 673

Query: 1046 -RQMSYNRDDDKKYGQSHSNSERYSKDESCLNANDGDHRQTIFGMKLESSRSNNYGEDVM 870
             +Q  +  D++ KY  +  +S +YS ++          R+T    +      N+YG++ +
Sbjct: 674  EKQRGFELDEELKYAHNQVSSGKYSSNQ----------RET---SEEPDPVWNHYGDEDL 720

Query: 869  QLHGQG----------------------KPNPVLPISNWAREDDGSDVEDKRSAWDLGLS 756
            Q  G+                       K +PVLP S WA E D SD E +RS  ++GLS
Sbjct: 721  QSQGRSSVPLSPTLPIAQPELKAFTKKEKNDPVLPASKWAWEGDESDDEQRRSGKNIGLS 780

Query: 755  YSFS----VGYGPIMAHDMKVATDMSVLSQHDSGLAEEQRQKLRHMEFALIDYREYLEER 588
            YS S    VG G + A + + A D    +  DSG+ EEQRQKLR +E ALI+YRE LEER
Sbjct: 781  YSSSGSENVGDGLVKADESESAADTRFSAHADSGMNEEQRQKLRRLEVALIEYRESLEER 840

Query: 587  GIWSYEEIDKKVAIYRRRLHSEYGLSDSNQVVLGYDTSYLESYRRDYSHESSRKRHCSHS 408
            G+ + EEI+KKV  +R+RL  EYGLSDS +   G+  +   S RRD+ ++ SRKRH S S
Sbjct: 841  GVKNLEEIEKKVQSHRKRLQVEYGLSDSGEDGHGHRRT---SERRDW-NDVSRKRHRSPS 896


>dbj|BAD28014.1| putative U2-associated SR140 protein [Oryza sativa Japonica Group]
          Length = 954

 Score =  476 bits (1224), Expect = e-131
 Identities = 293/539 (54%), Positives = 353/539 (65%), Gaps = 29/539 (5%)
 Frame = -3

Query: 1925 AQGDTLQRWRTEPFIMITGSGRWMPPPLQSTKSPENKGEFDTTFSVERSRYVELERYVDL 1746
            AQGDTLQRWRTEPFIMITGSGRW+PP L S++SPE + E  +TF+  RSR VE+ER    
Sbjct: 378  AQGDTLQRWRTEPFIMITGSGRWVPPALPSSRSPEREKE--STFAAGRSRRVEVER---- 431

Query: 1745 EQYVELEGILTVSQRDEFEDILRTLTLERRHIKEAMGFALDHADAAREIVEVLTESLTLK 1566
                     LT SQRDEFED+LR LTLER  IKEAMGFALD+ADAA EIVEVLTESLTLK
Sbjct: 432  --------TLTDSQRDEFEDMLRALTLERSQIKEAMGFALDNADAAGEIVEVLTESLTLK 483

Query: 1565 ETPISKKVARLMLVSDILHNSRAPIKNAFAYCSEFQSTLPDIMKSFNDLYHRIEGRITAE 1386
            ETPI  KVARLMLVSDILHNS AP+KNA A+ ++F++ LPD+++SFNDLY  I GRITAE
Sbjct: 484  ETPIPTKVARLMLVSDILHNSSAPVKNASAFRTKFEAALPDVIESFNDLYRSITGRITAE 543

Query: 1385 ALKERVLKVLQVWSIWLLFSDAYVNELRATFLRPRISSVIL------DAPEIGNKSSSED 1224
            ALKERVLKVLQVW+ W LFSDAY+N LRATFLR     VI       D PEI  K+SSED
Sbjct: 544  ALKERVLKVLQVWADWFLFSDAYLNGLRATFLRSSHLGVIPFHSLCGDTPEIEKKASSED 603

Query: 1223 MAEGSKITEDTVLSTGNETA-MVLSDLPLAELERCCRHNRISLVGGREMMVARLLNLKEA 1047
             ++G ++ ED  L+TG   A   L  LPLAELER CRHN +SL GG+EMMVARLL+L+EA
Sbjct: 604  GSDGFRLNEDGALATGKAAATRELLGLPLAELERRCRHNGLSLCGGKEMMVARLLSLEEA 663

Query: 1046 -RQMSYNRDDDKKYGQSHSNSERYSKDESCLNANDGDH-------------RQTIFGMKL 909
             ++  Y +D   KYGQ  S+  R  +D+  +NA +                  + + M+ 
Sbjct: 664  EKERVYEKDAGIKYGQGESH--RTGRDDIAVNARNASRPGEGTDSGESDMLGLSHYAMEA 721

Query: 908  ESSRSNNYGEDVMQLHGQGKPNPVLPISNWAREDDGSDVEDKRSAWDLGLSYSFSVGYGP 729
               RSN           + K +PVLP S W+REDD SD ED++    LGLSYS     G 
Sbjct: 722  GYKRSNESTPAEPVPSKKPKVDPVLPASKWSREDDVSDDEDRKGGRGLGLSYS----SGS 777

Query: 728  IMAHDMKVATDMSVLSQH-----DSGLAEEQRQKLRHMEFALIDYREYLEERGIWSYEEI 564
             +A D   A    V + H     D+ L EE R+KLR +E A++ YRE LEE+G+ + EEI
Sbjct: 778  DIAGDSGKADATEVSTDHSNHHQDTILDEEHRKKLRQIEIAVMQYRESLEEKGLRNTEEI 837

Query: 563  DKKVAIYRRRLHSEYGLSDSNQVVLGYDTS-YLESYRRDYSHESSRKRH--CSHSTWPP 396
            +KKVA +RRRL SEYGLS SN       +S    S RRD   +SSRKRH   S S  PP
Sbjct: 838  EKKVASHRRRLQSEYGLSFSNDGANSRRSSERTSSERRDRHDDSSRKRHRSLSRSRSPP 896

