BLASTX nr result

ID: Akebia23_contig00003817 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00003817
         (1777 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI34879.3| unnamed protein product [Vitis vinifera]              217   1e-53
ref|XP_002275607.1| PREDICTED: uncharacterized protein LOC100264...   217   1e-53
ref|XP_004232320.1| PREDICTED: uncharacterized protein LOC101257...   205   6e-50
emb|CAN60421.1| hypothetical protein VITISV_021069 [Vitis vinifera]   203   2e-49
ref|XP_006338590.1| PREDICTED: AT-rich interactive domain-contai...   202   4e-49
ref|XP_004299014.1| PREDICTED: uncharacterized protein LOC101298...   197   1e-47
ref|XP_007031556.1| Uncharacterized protein TCM_016944 [Theobrom...   194   8e-47
ref|XP_003525675.1| PREDICTED: arginine-glutamic acid dipeptide ...   191   1e-45
ref|XP_006833453.1| hypothetical protein AMTR_s00082p00053170 [A...   188   6e-45
ref|XP_003554822.1| PREDICTED: vacuolar protein sorting-associat...   183   2e-43
ref|XP_007217219.1| hypothetical protein PRUPE_ppa003425mg [Prun...   180   2e-42
gb|EXC20326.1| hypothetical protein L484_020546 [Morus notabilis]     180   2e-42
gb|ACZ74657.1| hypothetical protein [Phaseolus vulgaris]              175   7e-41
ref|XP_007150938.1| hypothetical protein PHAVU_004G007500g [Phas...   174   9e-41
ref|XP_007222918.1| hypothetical protein PRUPE_ppa003684mg [Prun...   174   1e-40
ref|NP_186805.2| uncharacterized protein [Arabidopsis thaliana] ...   168   8e-39
gb|AAF01541.1|AC009325_11 unknown protein [Arabidopsis thaliana]      167   1e-38
ref|XP_006470271.1| PREDICTED: COPII coat assembly protein sec16...   167   1e-38
ref|XP_002873675.1| predicted protein [Arabidopsis lyrata subsp....   167   1e-38
ref|XP_006289230.1| hypothetical protein CARUB_v10002686mg [Caps...   166   3e-38

>emb|CBI34879.3| unnamed protein product [Vitis vinifera]
          Length = 465

 Score =  217 bits (553), Expect = 1e-53
 Identities = 122/228 (53%), Positives = 157/228 (68%), Gaps = 3/228 (1%)
 Frame = +2

Query: 83  MNSPQLTDKQVMGLS-GSQKHDFLDRFNPQEEQLH-VFGDGLKKESKEEILPSYDFQPIQ 256
           MN+ Q  DKQ+M LS GSQ +DF++  +P+++ L  V G G     KEEI+PSYDF PI+
Sbjct: 1   MNTSQFMDKQIMDLSAGSQSNDFINLMSPEDDHLTGVGGGGGVGSKKEEIVPSYDFLPIR 60

Query: 257 PLRASHSINLEE-SNVGGIRGHNLVDSMSNSSNLRTYGSLGTIESAKITQKKDRNAYHSE 433
           P  +S   NL+     GG R  +  DS +N+  +R YGSL + E +KI+ +KDRN   + 
Sbjct: 61  PKGSSQFSNLDAVGGAGGPRAWSSTDSKTNTPGIRNYGSLDSNELSKISLEKDRNI-DAA 119

Query: 434 MVSEIDQTMKIYTDNLLHVLEGVSARLSQLESRTHQLESSIDDLKISVGNNHGNTDGKLR 613
           +VSEID+TMK + DNLLHVLEG+SARL+QLESRT  LE+S+DDLK+SVGNNHG+ DGK+R
Sbjct: 120 IVSEIDRTMKKHADNLLHVLEGLSARLTQLESRTRNLENSVDDLKVSVGNNHGSADGKMR 179

Query: 614 QLENLLREVQMGVQVLRDXXXXXXXXXXXXXXXXSKGDQKSENQNSST 757
           QLEN+LREVQ GVQVLRD                SK DQ+SE QN+ T
Sbjct: 180 QLENILREVQTGVQVLRDKQEIVEAHLQLAKLQVSKADQQSETQNTVT 227



 Score =  103 bits (256), Expect = 3e-19
 Identities = 66/161 (40%), Positives = 75/161 (46%), Gaps = 4/161 (2%)
 Frame = +3

Query: 1305 PQSYPPIVRQQPPSQQFCGAPPPHMYEPTSSRPNSGVSSGYAPPSGTNFNDNXXXXXXXX 1484
            PQ  PP+      +  F G PP HMYE  S R + G                        
Sbjct: 335  PQHQPPLGHHPEETSYFYG-PPSHMYEVPSGRSSGG------------------------ 369

Query: 1485 XXXXXXXXXXXXXXXXXXXXXXXNYPRLPTAKILPHALPTVNSDVXXXXXXXXX----NR 1652
                                    YP+LPTA++LPHALPT +  V             NR
Sbjct: 370  ----------------------SGYPQLPTARVLPHALPTASGPVGGSGPGSGSSGSGNR 407

Query: 1653 VPVDDVVDKVATMGFSRDQVRATVRRLTENGQSVDLNVVLD 1775
            VP+DDVVDKV  MGF RD VRATVR+LTENGQSVDLNVVLD
Sbjct: 408  VPIDDVVDKVTNMGFPRDVVRATVRKLTENGQSVDLNVVLD 448


>ref|XP_002275607.1| PREDICTED: uncharacterized protein LOC100264681 [Vitis vinifera]
          Length = 563

 Score =  217 bits (553), Expect = 1e-53
 Identities = 122/228 (53%), Positives = 157/228 (68%), Gaps = 3/228 (1%)
 Frame = +2

Query: 83  MNSPQLTDKQVMGLS-GSQKHDFLDRFNPQEEQLH-VFGDGLKKESKEEILPSYDFQPIQ 256
           MN+ Q  DKQ+M LS GSQ +DF++  +P+++ L  V G G     KEEI+PSYDF PI+
Sbjct: 1   MNTSQFMDKQIMDLSAGSQSNDFINLMSPEDDHLTGVGGGGGVGSKKEEIVPSYDFLPIR 60

Query: 257 PLRASHSINLEE-SNVGGIRGHNLVDSMSNSSNLRTYGSLGTIESAKITQKKDRNAYHSE 433
           P  +S   NL+     GG R  +  DS +N+  +R YGSL + E +KI+ +KDRN   + 
Sbjct: 61  PKGSSQFSNLDAVGGAGGPRAWSSTDSKTNTPGIRNYGSLDSNELSKISLEKDRNI-DAA 119

Query: 434 MVSEIDQTMKIYTDNLLHVLEGVSARLSQLESRTHQLESSIDDLKISVGNNHGNTDGKLR 613
           +VSEID+TMK + DNLLHVLEG+SARL+QLESRT  LE+S+DDLK+SVGNNHG+ DGK+R
Sbjct: 120 IVSEIDRTMKKHADNLLHVLEGLSARLTQLESRTRNLENSVDDLKVSVGNNHGSADGKMR 179

Query: 614 QLENLLREVQMGVQVLRDXXXXXXXXXXXXXXXXSKGDQKSENQNSST 757
           QLEN+LREVQ GVQVLRD                SK DQ+SE QN+ T
Sbjct: 180 QLENILREVQTGVQVLRDKQEIVEAHLQLAKLQVSKADQQSETQNTVT 227



 Score =  144 bits (364), Expect = 1e-31
 Identities = 98/257 (38%), Positives = 113/257 (43%), Gaps = 16/257 (6%)
 Frame = +3

Query: 1053 REPYFPPPGQLPETTHLXXXXXXXXXXXXXXXXXXXXXXXXXXLPQYTRXXXXXXXXXXX 1232
            REPYF PPGQ  E  +                           LPQY++           
Sbjct: 293  REPYFQPPGQAQEAPNQQYQLPPTQQPQPPPAAPSHQQYQPASLPQYSQPPQLPQQHHSI 352

Query: 1233 XXXXXXXXXXXXXXXXXEENLYMLPQSYPPIVRQ---QPPSQQFCGAPPP--------HM 1379
                             EE  Y+ PQ+YPP +RQ   QPPSQ   GAPP         HM
Sbjct: 353  APINPPPQHQPPLGHHPEETSYVPPQTYPPSLRQPPSQPPSQPLSGAPPSQQFYGPPSHM 412

Query: 1380 YEPTSSRPNSGVSSGYAPP-SGTNFNDNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXN 1556
            YE  S R +SG S+G+ PP SG                                      
Sbjct: 413  YEVPSGRSSSGFSTGFVPPPSGPT---EPYSYSGSPSQYGSNMPMKPQQFSSGVQGGGSG 469

Query: 1557 YPRLPTAKILPHALPTVNSDVXXXXXXXXX----NRVPVDDVVDKVATMGFSRDQVRATV 1724
            YP+LPTA++LPHALPT +  V             NRVP+DDVVDKV  MGF RD VRATV
Sbjct: 470  YPQLPTARVLPHALPTASGPVGGSGPGSGSSGSGNRVPIDDVVDKVTNMGFPRDVVRATV 529

Query: 1725 RRLTENGQSVDLNVVLD 1775
            R+LTENGQSVDLNVVLD
Sbjct: 530  RKLTENGQSVDLNVVLD 546


>ref|XP_004232320.1| PREDICTED: uncharacterized protein LOC101257918 [Solanum
           lycopersicum]
          Length = 545

 Score =  205 bits (521), Expect = 6e-50
 Identities = 116/230 (50%), Positives = 153/230 (66%), Gaps = 6/230 (2%)
 Frame = +2

Query: 83  MNSPQLTDKQVMGLSGSQKH----DFLDRFNPQEEQLHVFGDGLKKESKEEILPSYDFQP 250
           MNS Q  DKQ+M LS SQ      DF+D  NPQ ++ H+ GD  KKE   +I+PSY+F P
Sbjct: 1   MNSSQYMDKQIMDLSNSQNSNNNSDFIDLVNPQADR-HISGDDQKKE---DIVPSYEFHP 56

Query: 251 IQPLRASH-SINLEESNVGGIRGHNLVDSMSNS-SNLRTYGSLGTIESAKITQKKDRNAY 424
           I+P+ +S    N++ SNVG  R  N  DS +N+ S +R YGSL TI   K+  +KD  + 
Sbjct: 57  IRPIGSSSPKSNIDSSNVGVARAWNSADSKNNAESYIRNYGSLDTIGPTKVILEKDLGSV 116

Query: 425 HSEMVSEIDQTMKIYTDNLLHVLEGVSARLSQLESRTHQLESSIDDLKISVGNNHGNTDG 604
           +S  +SEID T+K Y DNLLH +EGVSARLSQLE+R  Q+++SID+LK+SVGNNHG TDG
Sbjct: 117 YSSQLSEIDHTVKKYADNLLHAVEGVSARLSQLETRNRQIDNSIDELKLSVGNNHGVTDG 176

Query: 605 KLRQLENLLREVQMGVQVLRDXXXXXXXXXXXXXXXXSKGDQKSENQNSS 754
           KLRQLEN+LREVQ GVQV+RD                 K +Q++E  +++
Sbjct: 177 KLRQLENILREVQDGVQVIRDKQEIMDAQLQLMKSQAPKIEQQAETHSTT 226



 Score =  117 bits (294), Expect = 1e-23
 Identities = 75/172 (43%), Positives = 90/172 (52%), Gaps = 8/172 (4%)
 Frame = +3

Query: 1284 EENLYMLPQSYPP-IVRQQP-------PSQQFCGAPPPHMYEPTSSRPNSGVSSGYAPPS 1439
            EE  ++  Q+YPP  +RQ P       PSQQ  G PP +++EP SSRP  G S  Y P S
Sbjct: 359  EETPFVPSQTYPPPSIRQPPHSSSGAPPSQQLYGTPP-NIFEPPSSRPGPGYSGVYGPSS 417

Query: 1440 GTNFNDNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNYPRLPTAKILPHALPTVNSDV 1619
                +                                  Y +LPTA+ILP ALPT ++  
Sbjct: 418  MPG-DPYPYSSSPGQYGSGSSMKPPQVSLPSMGQSGSSGYQQLPTARILPQALPTASAVS 476

Query: 1620 XXXXXXXXXNRVPVDDVVDKVATMGFSRDQVRATVRRLTENGQSVDLNVVLD 1775
                     NRVP+DDVVDKV  MGF RDQVRATVRRLTE+GQ+VDLN VLD
Sbjct: 477  GGSSSPGSGNRVPIDDVVDKVTNMGFPRDQVRATVRRLTESGQTVDLNTVLD 528


>emb|CAN60421.1| hypothetical protein VITISV_021069 [Vitis vinifera]
          Length = 604

 Score =  203 bits (517), Expect = 2e-49
 Identities = 112/213 (52%), Positives = 146/213 (68%), Gaps = 2/213 (0%)
 Frame = +2

Query: 125 SGSQKHDFLDRFNPQEEQLH-VFGDGLKKESKEEILPSYDFQPIQPLRASHSINLEE-SN 298
           +GSQ +DF++  +P+++ L  V G G     KEEI+PSYDFQPI+P+ +S   NL+    
Sbjct: 5   AGSQSNDFINLMSPEDDHLTGVGGGGGVGSKKEEIVPSYDFQPIRPMGSSQFSNLDAVGG 64

Query: 299 VGGIRGHNLVDSMSNSSNLRTYGSLGTIESAKITQKKDRNAYHSEMVSEIDQTMKIYTDN 478
            GG R  +  DS +N+  +R YGSL + E +KI+ +KDRN   + +VSEID+TMK   DN
Sbjct: 65  AGGPRAWSSTDSKTNTPGIRNYGSLDSNELSKISLEKDRNI-DAAIVSEIDRTMKKXADN 123

Query: 479 LLHVLEGVSARLSQLESRTHQLESSIDDLKISVGNNHGNTDGKLRQLENLLREVQMGVQV 658
           LLH LEG+SARL+QLESRT  LE+S+DDLK+SVGNNHG+ DGK+RQLEN+LREVQ GVQV
Sbjct: 124 LLHXLEGLSARLTQLESRTRNLENSVDDLKVSVGNNHGSADGKMRQLENILREVQTGVQV 183

Query: 659 LRDXXXXXXXXXXXXXXXXSKGDQKSENQNSST 757
           LRD                SK DQ+SE Q + T
Sbjct: 184 LRDKQEIVEAHLQLAKLQVSKADQQSETQKTVT 216



 Score =  142 bits (357), Expect = 6e-31
 Identities = 97/257 (37%), Positives = 112/257 (43%), Gaps = 16/257 (6%)
 Frame = +3

Query: 1053 REPYFPPPGQLPETTHLXXXXXXXXXXXXXXXXXXXXXXXXXXLPQYTRXXXXXXXXXXX 1232
            REPYF  PGQ  E  +                           LPQY++           
Sbjct: 282  REPYFQAPGQAQEAPNQQYQLPPTQQPQPPPAAPSHQQYQPASLPQYSQPPQLPQQHLSI 341

Query: 1233 XXXXXXXXXXXXXXXXXEENLYMLPQSYPPIVRQ---QPPSQQFCGAPPP--------HM 1379
                             EE  Y+ PQ+YPP +RQ   QPPSQ   GAPP         HM
Sbjct: 342  APINPPPQHQPPLGHHPEETSYVPPQTYPPSLRQPPSQPPSQPLSGAPPSQQFYGPPSHM 401

Query: 1380 YEPTSSRPNSGVSSGYAPP-SGTNFNDNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXN 1556
            YE  S R +SG S+G+ PP SG                                      
Sbjct: 402  YEAPSGRSSSGFSTGFGPPPSGPT---EPYSYSGSPSQYGSNMPMKPQQFSSGVQGGGSG 458

Query: 1557 YPRLPTAKILPHALPTVNSDVXXXXXXXXX----NRVPVDDVVDKVATMGFSRDQVRATV 1724
            YP+LPTA++LPHALPT +  V             NRVP+DDVVDKV  MGF RD VRATV
Sbjct: 459  YPQLPTARVLPHALPTASGPVGGPGPGSGSSGSGNRVPIDDVVDKVTNMGFPRDVVRATV 518

Query: 1725 RRLTENGQSVDLNVVLD 1775
            R+LTENGQSVDLNVVLD
Sbjct: 519  RKLTENGQSVDLNVVLD 535


>ref|XP_006338590.1| PREDICTED: AT-rich interactive domain-containing protein 1A-like
           [Solanum tuberosum]
          Length = 549

 Score =  202 bits (514), Expect = 4e-49
 Identities = 112/230 (48%), Positives = 154/230 (66%), Gaps = 6/230 (2%)
 Frame = +2

Query: 83  MNSPQLTDKQVMGLSGSQK----HDFLDRFNPQEEQLHVFGDGLKKESKEEILPSYDFQP 250
           MNS    DKQ+M LS SQ     +DF+D  NPQ +  H+ G   KKE   +I+PSY+F P
Sbjct: 1   MNSSHYMDKQIMDLSNSQNSSNNNDFIDLVNPQADH-HISGGDQKKE---DIVPSYEFHP 56

Query: 251 IQPLRASH-SINLEESNVGGIRGHNLVDSMSNS-SNLRTYGSLGTIESAKITQKKDRNAY 424
           I+P+ +S    N++ SNVG  R  N  DS +N+ SN+R YGSL +I+  K+  +KD  + 
Sbjct: 57  IRPIGSSSPKSNIDSSNVGVARAWNSADSKNNTESNIRNYGSLDSIDPTKVIVEKDLGSV 116

Query: 425 HSEMVSEIDQTMKIYTDNLLHVLEGVSARLSQLESRTHQLESSIDDLKISVGNNHGNTDG 604
           +S ++SEID T+K Y DNLLH +EGVSARLSQLE+R  Q+++S+D+LK+SVGN+HG TDG
Sbjct: 117 YSSLLSEIDHTVKKYADNLLHAVEGVSARLSQLETRNRQIDNSVDELKLSVGNSHGVTDG 176

Query: 605 KLRQLENLLREVQMGVQVLRDXXXXXXXXXXXXXXXXSKGDQKSENQNSS 754
           KLRQLEN+LREVQ GVQV+RD                 K +Q++E  +++
Sbjct: 177 KLRQLENILREVQDGVQVIRDKQEIMDAQLQLMKSQAPKIEQQAETHSTT 226



 Score =  117 bits (292), Expect = 2e-23
 Identities = 75/172 (43%), Positives = 89/172 (51%), Gaps = 8/172 (4%)
 Frame = +3

Query: 1284 EENLYMLPQSYPP-IVRQQP-------PSQQFCGAPPPHMYEPTSSRPNSGVSSGYAPPS 1439
            EE  ++  Q+YPP  +RQ P       PSQQ  G PP +++EP SSRP  G S  Y P S
Sbjct: 363  EETPFVPSQTYPPPSIRQPPHSSSGAPPSQQLYGTPP-NIFEPPSSRPGLGYSGVYGPSS 421

Query: 1440 GTNFNDNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNYPRLPTAKILPHALPTVNSDV 1619
                                                   Y +LPTA+ILP ALPT ++  
Sbjct: 422  VPG-EPYPYSSSPGQYGSGSSMKPLQVSLPTMGQSGSSGYQQLPTARILPQALPTASAVS 480

Query: 1620 XXXXXXXXXNRVPVDDVVDKVATMGFSRDQVRATVRRLTENGQSVDLNVVLD 1775
                     NRVP+DDVVDKV  MGF RDQVRATVRRLTE+GQ+VDLN VLD
Sbjct: 481  GGSSSPGTGNRVPIDDVVDKVTNMGFPRDQVRATVRRLTESGQTVDLNTVLD 532


>ref|XP_004299014.1| PREDICTED: uncharacterized protein LOC101298222 [Fragaria vesca
           subsp. vesca]
          Length = 561

 Score =  197 bits (501), Expect = 1e-47
 Identities = 110/204 (53%), Positives = 143/204 (70%), Gaps = 9/204 (4%)
 Frame = +2

Query: 83  MNSPQLTDKQVMGLS---GSQKHDFLDRFNPQEEQLHVFGDGLKKESKEEILPSYDFQPI 253
           MN+    DKQ+M LS       +DFLD  N  +E+ H  G G     KEEILP+YDF PI
Sbjct: 1   MNTTSFMDKQIMDLSHGSSQNNNDFLDLMNNSQEEEHQVGRGNGLTKKEEILPNYDFHPI 60

Query: 254 QPLR--ASHSINLEES-NVGGIRGHNLVDSMSNSSN---LRTYGSLGTIESAKITQKKDR 415
           +P+   +SHS N + + N+GG      V +MSNS+    +R YGS+ +++ AK   +KDR
Sbjct: 61  RPITGVSSHSQNFDATPNLGG----GGVSTMSNSNTNAPVRNYGSVDSLKPAKDIVEKDR 116

Query: 416 NAYHSEMVSEIDQTMKIYTDNLLHVLEGVSARLSQLESRTHQLESSIDDLKISVGNNHGN 595
           NA  + ++SEIDQTMK Y DNLL V+EG+SARL+QLESRT  LE+S+DDLK+SVGNNHGN
Sbjct: 117 NAPDATVISEIDQTMKKYADNLLQVMEGISARLTQLESRTCHLENSVDDLKVSVGNNHGN 176

Query: 596 TDGKLRQLENLLREVQMGVQVLRD 667
            DGK+RQLEN+LR+VQ GVQ L+D
Sbjct: 177 ADGKMRQLENILRDVQTGVQDLKD 200



 Score =  140 bits (354), Expect = 1e-30
 Identities = 92/250 (36%), Positives = 112/250 (44%), Gaps = 9/250 (3%)
 Frame = +3

Query: 1053 REPYFPP-PGQLPETTHLXXXXXXXXXXXXXXXXXXXXXXXXXXLPQYTRXXXXXXXXXX 1229
            R+PYFP  PGQ  ET +                            PQY +          
Sbjct: 297  RDPYFPAAPGQTQETPNQQYQLPAGQQSLPPPTVPPHQQFQPTSQPQYPQPPPQLPQQHH 356

Query: 1230 XXXXXXXXXXXXXXXXXXEENLYMLPQSYPPIVRQQPPSQQFCGAPPP--------HMYE 1385
                              EE  Y   Q+YPP +RQ PPSQ   G PP         ++YE
Sbjct: 357  SLPPVNHSQVQPTLGHHAEETPYAPSQTYPPSLRQ-PPSQTPTGLPPSQQYYNPTSNVYE 415

Query: 1386 PTSSRPNSGVSSGYAPPSGTNFNDNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNYPR 1565
            P SSRPNSG SSGY PPSG N                                    YP+
Sbjct: 416  PPSSRPNSGFSSGYGPPSGLN-EPYHYGGSPSQYGGTSSMKPQLSSATSQSQSGGSGYPQ 474

Query: 1566 LPTAKILPHALPTVNSDVXXXXXXXXXNRVPVDDVVDKVATMGFSRDQVRATVRRLTENG 1745
            LPTA++LPHA+PT +            N+VP+DDV+D+V +MGF RD VRATVR+LT+NG
Sbjct: 475  LPTARVLPHAVPTPSGVSDRSGSAGTGNKVPIDDVIDRVTSMGFPRDHVRATVRKLTDNG 534

Query: 1746 QSVDLNVVLD 1775
            Q+VDLNVVLD
Sbjct: 535  QAVDLNVVLD 544


>ref|XP_007031556.1| Uncharacterized protein TCM_016944 [Theobroma cacao]
           gi|508710585|gb|EOY02482.1| Uncharacterized protein
           TCM_016944 [Theobroma cacao]
          Length = 541

 Score =  194 bits (494), Expect = 8e-47
 Identities = 111/231 (48%), Positives = 148/231 (64%), Gaps = 8/231 (3%)
 Frame = +2

Query: 83  MNSPQLTDKQVMGLSGSQKH-------DFLDRFN-PQEEQLHVFGDGLKKESKEEILPSY 238
           MN+ Q  DKQ+M L+ S          DF+D  N PQ E  H  G G+   +KE I PSY
Sbjct: 1   MNTSQFMDKQIMDLTSSSSSPPHNTNKDFIDLMNNPQNEDNHNQGSGIS--NKEGIFPSY 58

Query: 239 DFQPIQPLRASHSINLEESNVGGIRGHNLVDSMSNSSNLRTYGSLGTIESAKITQKKDRN 418
           DFQPI+P+    S +L+ + V     +N     S  S  + YGSL ++E AK+  +KDRN
Sbjct: 59  DFQPIRPV----STSLDAAAVN----NNPRSWSSGDSKTKNYGSLDSVEPAKVILEKDRN 110

Query: 419 AYHSEMVSEIDQTMKIYTDNLLHVLEGVSARLSQLESRTHQLESSIDDLKISVGNNHGNT 598
           A+ + +V+EID+TMK +TDNL+H+LE VSARL+QLESRT  LE+S+DDLK+SVGNNHG+T
Sbjct: 111 AFDTSIVAEIDRTMKKHTDNLIHMLEVVSARLTQLESRTRNLENSVDDLKVSVGNNHGST 170

Query: 599 DGKLRQLENLLREVQMGVQVLRDXXXXXXXXXXXXXXXXSKGDQKSENQNS 751
           +GK+RQLEN+L EVQ GV VL++                +KGD  SE QN+
Sbjct: 171 EGKMRQLENILNEVQTGVHVLKEKQEIMEAQLHLAKLQVTKGDHPSETQNT 221



 Score =  139 bits (350), Expect = 4e-30
 Identities = 85/172 (49%), Positives = 98/172 (56%), Gaps = 8/172 (4%)
 Frame = +3

Query: 1284 EENLYMLPQSYPPIVRQ---QPPS-----QQFCGAPPPHMYEPTSSRPNSGVSSGYAPPS 1439
            EE  Y+  Q+YPP +RQ   QPPS     QQ+ GAPP  M+EP SSRP SG S+GY P S
Sbjct: 355  EEAPYVPSQNYPPNLRQPPSQPPSGPPSSQQYYGAPP-QMHEPPSSRPGSGFSAGYIPQS 413

Query: 1440 GTNFNDNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNYPRLPTAKILPHALPTVNSDV 1619
            G +                                    YP+LPTA+ILPHALPT +   
Sbjct: 414  GQS-EPYAYGGSPSQYGSGSPMKMQQLPSSPMGQSGGSGYPQLPTARILPHALPTASGVG 472

Query: 1620 XXXXXXXXXNRVPVDDVVDKVATMGFSRDQVRATVRRLTENGQSVDLNVVLD 1775
                     NRVPVDDV+DKV +MGF RD VRATVR+LTENGQSVDLNVVLD
Sbjct: 473  GGSGPSGPGNRVPVDDVIDKVTSMGFPRDHVRATVRKLTENGQSVDLNVVLD 524


>ref|XP_003525675.1| PREDICTED: arginine-glutamic acid dipeptide repeats protein-like
           [Glycine max]
          Length = 573

 Score =  191 bits (484), Expect = 1e-45
 Identities = 116/248 (46%), Positives = 154/248 (62%), Gaps = 24/248 (9%)
 Frame = +2

Query: 83  MNSPQLTDKQVMGLSG-----------SQKHDFLDRFN--PQEEQLHVFGDGLKKE---- 211
           MN+    DKQ+M L+            SQ  DF+D     PQ    H   D    E    
Sbjct: 1   MNTTPFMDKQIMDLTHGHGSSSSSTTQSQSKDFIDLMKEPPQHHHHHHLEDEDNDEEEKA 60

Query: 212 -----SKEEILPSYDFQPIQPLRASHSINLEESNVGGIRGHNLVDSMSNSSN--LRTYGS 370
                SK++I+PSYDFQPI+PL AS++ +    +    R  N  DS SN+S   ++ Y S
Sbjct: 61  RGNGISKDDIVPSYDFQPIRPLAASNNFD----SAAFSRPWNS-DSNSNASPPVIKNYSS 115

Query: 371 LGTIESAKITQKKDRNAYHSEMVSEIDQTMKIYTDNLLHVLEGVSARLSQLESRTHQLES 550
           L ++E AK+  +KDR+A+ + M+SEID+TMK + +N+LHVLEGVSARL+QLE+RTH LE+
Sbjct: 116 LDSMEPAKVIVEKDRSAFDATMLSEIDRTMKKHMENMLHVLEGVSARLTQLETRTHHLEN 175

Query: 551 SIDDLKISVGNNHGNTDGKLRQLENLLREVQMGVQVLRDXXXXXXXXXXXXXXXXSKGDQ 730
           S+DDLK+SVGNNHG+TDGKLRQLEN+LREVQ GVQ ++D                SK DQ
Sbjct: 176 SVDDLKVSVGNNHGSTDGKLRQLENILREVQSGVQTIKDKQDIVQAQLQLAKLQVSKTDQ 235

Query: 731 KSENQNSS 754
           +SE Q S+
Sbjct: 236 QSEMQTSA 243



 Score =  129 bits (325), Expect = 3e-27
 Identities = 92/250 (36%), Positives = 104/250 (41%), Gaps = 9/250 (3%)
 Frame = +3

Query: 1053 REPYFPPPGQLPETTHLXXXXXXXXXXXXXXXXXXXXXXXXXXLPQYTRXXXXXXXXXXX 1232
            R+PYFPPP Q  ET +                            PQY +           
Sbjct: 311  RDPYFPPPVQSQETPNQQYQMPLSQQPHAQPGAPPHQQYQQTPHPQYPQPAPHLPQQQPP 370

Query: 1233 XXXXXXXXXXXXXXXXXEENLYMLPQSYPPIVRQQP---------PSQQFCGAPPPHMYE 1385
                              E     PQ+YPP VRQ P         P QQF G P  H YE
Sbjct: 371  SHPSMNPPQLQSSLGHHVEEPPYPPQNYPPNVRQPPSPSPTGPPPPPQQFYGTPT-HAYE 429

Query: 1386 PTSSRPNSGVSSGYAPPSGTNFNDNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNYPR 1565
            P+SSR  SG SSGY   SG                                      YP+
Sbjct: 430  PSSSRSGSGYSSGYGTLSGPV---EQYRYGPPQYAGTPALKPQQLPTASLAPSSGSGYPQ 486

Query: 1566 LPTAKILPHALPTVNSDVXXXXXXXXXNRVPVDDVVDKVATMGFSRDQVRATVRRLTENG 1745
            LPTA++LP A+PT ++            RV VDDVVDKVATMGF RD VRATVR+LTENG
Sbjct: 487  LPTARVLPQAIPTASAVSGGSGSTGTGGRVSVDDVVDKVATMGFPRDHVRATVRKLTENG 546

Query: 1746 QSVDLNVVLD 1775
            QSVDLN VLD
Sbjct: 547  QSVDLNAVLD 556


>ref|XP_006833453.1| hypothetical protein AMTR_s00082p00053170 [Amborella trichopoda]
           gi|548838159|gb|ERM98731.1| hypothetical protein
           AMTR_s00082p00053170 [Amborella trichopoda]
          Length = 695

 Score =  188 bits (478), Expect = 6e-45
 Identities = 108/222 (48%), Positives = 141/222 (63%)
 Frame = +2

Query: 83  MNSPQLTDKQVMGLSGSQKHDFLDRFNPQEEQLHVFGDGLKKESKEEILPSYDFQPIQPL 262
           MNS    DKQ+MGLSGSQ  DF +  NP  ++     +G KKE   ++LPSYDFQPI+P+
Sbjct: 1   MNSSHFMDKQIMGLSGSQNSDFFELLNPPSQE----HNGSKKE---DMLPSYDFQPIRPI 53

Query: 263 RASHSINLEESNVGGIRGHNLVDSMSNSSNLRTYGSLGTIESAKITQKKDRNAYHSEMVS 442
            +  S  L+                 +SSN R YGSL   E    +Q+ +R+A  + +VS
Sbjct: 54  VSPPSPELQ-----------------SSSNFRKYGSLELKEPTNASQEHERDASDAAIVS 96

Query: 443 EIDQTMKIYTDNLLHVLEGVSARLSQLESRTHQLESSIDDLKISVGNNHGNTDGKLRQLE 622
           EID+T+K + DNLLH LEGVSARLSQLESRT +LE+S+D+LK+SVGN+HG+TDGKLRQLE
Sbjct: 97  EIDRTVKKHVDNLLHSLEGVSARLSQLESRTRRLENSVDELKVSVGNSHGSTDGKLRQLE 156

Query: 623 NLLREVQMGVQVLRDXXXXXXXXXXXXXXXXSKGDQKSENQN 748
           N+LREVQ  VQVLRD                SK +Q +  ++
Sbjct: 157 NILREVQASVQVLRDKQEIAEAHSQLMKLQLSKSEQHAVTES 198



 Score =  121 bits (304), Expect = 9e-25
 Identities = 82/176 (46%), Positives = 90/176 (51%), Gaps = 19/176 (10%)
 Frame = +3

Query: 1305 PQSYPPIVRQQPPS-----------QQFCGAPPPHMYEPTSSRPN------SGVSSGYAP 1433
            P  YPP  RQ  P+           QQF G P  HMYEP+S  P       SG    Y P
Sbjct: 366  PSYYPPPGRQGGPAGPTGPVGPTPPQQFYG-PSGHMYEPSSPSPPALGRAVSGFPGPYGP 424

Query: 1434 P-SGTNFND-NXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNYPRLPTAKILPHALPTV 1607
            P SG NF + +                                YPRLPTA+ILPHALPT 
Sbjct: 425  PPSGPNFTEPSYSSYNGPVPYGSVGGNKVPQSSMPSAPSGAGGYPRLPTAQILPHALPTA 484

Query: 1608 NSDVXXXXXXXXXNRVPVDDVVDKVATMGFSRDQVRATVRRLTENGQSVDLNVVLD 1775
            +            NRVP+DDVVDKV  MGFSRDQVRATVR+LTENGQSVDLNVVLD
Sbjct: 485  SGG--GSGSSGGGNRVPIDDVVDKVTNMGFSRDQVRATVRKLTENGQSVDLNVVLD 538


>ref|XP_003554822.1| PREDICTED: vacuolar protein sorting-associated protein 27-like
           isoform X1 [Glycine max]
          Length = 578

 Score =  183 bits (465), Expect = 2e-43
 Identities = 113/254 (44%), Positives = 156/254 (61%), Gaps = 30/254 (11%)
 Frame = +2

Query: 83  MNSPQLTDKQVMGLSGS------------QKHDFLDRFN--PQEEQLHVF---------- 190
           MN+    DKQ+M L+ +            Q  DF+D     PQ +  H            
Sbjct: 1   MNTTPFMDKQIMDLTHAHGSSSSSSTTQLQSKDFIDLMKEPPQNQHNHHHHHLEDEDEEE 60

Query: 191 ----GDGLKKESKEEILPSYDFQPIQPLRASHSINLEESNVGGIRGHNLVDSMSNSSN-- 352
               G+G+   SK++I+PSYDFQPI+PL AS+S N + +     R  N  DS SN+S   
Sbjct: 61  KASRGNGI---SKDDIVPSYDFQPIRPLAASNSNNFDSAAFS--RPWNS-DSNSNASPPI 114

Query: 353 LRTYGSLGTIESAKITQKKDRNAYHSEMVSEIDQTMKIYTDNLLHVLEGVSARLSQLESR 532
           L+ Y SL ++E AK+  +KD++A+ + M+SEID+T+K + +N+LHVLEGVSARL+QLE+R
Sbjct: 115 LKNYNSLDSMEPAKVIVEKDQSAFDATMLSEIDRTVKKHMENMLHVLEGVSARLTQLETR 174

Query: 533 THQLESSIDDLKISVGNNHGNTDGKLRQLENLLREVQMGVQVLRDXXXXXXXXXXXXXXX 712
           TH LE+S+DDLK+SVGN+HG+TDGKLRQ+EN LREVQ GVQ ++D               
Sbjct: 175 THHLENSVDDLKVSVGNSHGSTDGKLRQMENSLREVQSGVQTIKDKQDIVQAQLQLAKLE 234

Query: 713 XSKGDQKSENQNSS 754
            SK D +SE Q S+
Sbjct: 235 VSKTDPQSETQTST 248



 Score =  129 bits (325), Expect = 3e-27
 Identities = 91/249 (36%), Positives = 104/249 (41%), Gaps = 8/249 (3%)
 Frame = +3

Query: 1053 REPYFPPPGQLPETTHLXXXXXXXXXXXXXXXXXXXXXXXXXXLPQYTRXXXXXXXXXXX 1232
            R+ YFPPP Q  ET +                            PQY +           
Sbjct: 316  RDQYFPPPVQSQETPNQQYQLPLSQQPHAQPGAPPHQQYQQIPHPQYPQPAPHLPQQQPP 375

Query: 1233 XXXXXXXXXXXXXXXXXEENLYMLPQSYPPIVRQQP--------PSQQFCGAPPPHMYEP 1388
                              E     PQ+YPP VRQ P        P QQF G PP H YEP
Sbjct: 376  SHPSMNPPQLQSSLGHHVEEPPYPPQNYPPNVRQPPSQSPTGPPPPQQFYGTPP-HAYEP 434

Query: 1389 TSSRPNSGVSSGYAPPSGTNFNDNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNYPRL 1568
             SSR  SG SSGY   SG    +                                 YP+L
Sbjct: 435  PSSRSGSGYSSGYGTLSGPA--EQYRYGGPPQYAGTPALKPQQLPTASVAPSGGSGYPQL 492

Query: 1569 PTAKILPHALPTVNSDVXXXXXXXXXNRVPVDDVVDKVATMGFSRDQVRATVRRLTENGQ 1748
            PTA++LP A+PT ++            RV VDDVVDKV+TMGF RD VRATVR+LTENGQ
Sbjct: 493  PTARVLPQAIPTASAVSGGSGSAGTGGRVSVDDVVDKVSTMGFPRDHVRATVRKLTENGQ 552

Query: 1749 SVDLNVVLD 1775
            SVDLN VLD
Sbjct: 553  SVDLNAVLD 561


>ref|XP_007217219.1| hypothetical protein PRUPE_ppa003425mg [Prunus persica]
           gi|462413369|gb|EMJ18418.1| hypothetical protein
           PRUPE_ppa003425mg [Prunus persica]
          Length = 575

 Score =  180 bits (457), Expect = 2e-42
 Identities = 107/214 (50%), Positives = 145/214 (67%), Gaps = 19/214 (8%)
 Frame = +2

Query: 83  MNSPQLTDKQVMGLS-GSQK---HDFLDRFN--------PQEEQLHVFGDGLKKESKEEI 226
           M++    DKQ+M LS GS +   +DF+D+           +EEQ    G+GL  +   E+
Sbjct: 1   MSTTSFMDKQIMDLSQGSPQQNNNDFIDQMKMNDNNHPKEEEEQQVGHGNGLSNKLYHEM 60

Query: 227 LPSYDFQPIQPL--RASHSINLEES-NVGGIRGHNLVDSMSNSSN----LRTYGSLGTIE 385
           LPSYDFQPI+P+   +S S +L+ + N+GG     + +S    SN    +R YGSL +IE
Sbjct: 61  LPSYDFQPIRPIVGTSSQSQSLDPAPNLGGGGAARVWNSGEPKSNTTAPIRNYGSLDSIE 120

Query: 386 SAKITQKKDRNAYHSEMVSEIDQTMKIYTDNLLHVLEGVSARLSQLESRTHQLESSIDDL 565
            AK+  +KDRN   + +VSEIDQ MK + DNLLHVLEGVSARL+QLESRT  LE+S+DDL
Sbjct: 121 PAKVILQKDRNVLDATVVSEIDQAMKKHADNLLHVLEGVSARLTQLESRTRHLENSVDDL 180

Query: 566 KISVGNNHGNTDGKLRQLENLLREVQMGVQVLRD 667
           K+SVGNNHGN DGK+ +LE++LR+VQ GV+ L+D
Sbjct: 181 KVSVGNNHGNADGKMIRLEDILRDVQTGVKDLKD 214



 Score =  129 bits (323), Expect = 6e-27
 Identities = 84/249 (33%), Positives = 109/249 (43%), Gaps = 8/249 (3%)
 Frame = +3

Query: 1053 REPYFPPPGQLPETTHLXXXXXXXXXXXXXXXXXXXXXXXXXXLPQYTRXXXXXXXXXXX 1232
            ++PYFPPPGQ     +                            PQY++           
Sbjct: 312  QDPYFPPPGQNQGAPNQQYQLPPGQQTVPLPPVPPHQQFQPTTQPQYSQPPPQLPQQHPS 371

Query: 1233 XXXXXXXXXXXXXXXXXEENLYMLPQSYPPIVRQQP--------PSQQFCGAPPPHMYEP 1388
                             EE  Y+  Q+YPP +RQ P        PSQQ+  +P    YEP
Sbjct: 372  HTPVNPSQLQPTLGHHAEETPYIPSQNYPPSLRQPPSHTPSGLPPSQQYY-SPASQAYEP 430

Query: 1389 TSSRPNSGVSSGYAPPSGTNFNDNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNYPRL 1568
             SSR +SG SSGY+PP+G                                      YP+L
Sbjct: 431  PSSRSSSGYSSGYSPPAGLG-ESYHYGGSPSQYGGSSSMKPPQLSSSATAQSGGSGYPQL 489

Query: 1569 PTAKILPHALPTVNSDVXXXXXXXXXNRVPVDDVVDKVATMGFSRDQVRATVRRLTENGQ 1748
            PTA++LP ALPT +            NRVP++DV+D V TMGF RD VRATVR++T++GQ
Sbjct: 490  PTARVLPQALPTPSGAGGGSASAGTGNRVPIEDVIDTVTTMGFPRDYVRATVRKMTDSGQ 549

Query: 1749 SVDLNVVLD 1775
            SVD+NVVLD
Sbjct: 550  SVDVNVVLD 558


>gb|EXC20326.1| hypothetical protein L484_020546 [Morus notabilis]
          Length = 591

 Score =  180 bits (456), Expect = 2e-42
 Identities = 109/239 (45%), Positives = 142/239 (59%), Gaps = 16/239 (6%)
 Frame = +2

Query: 83  MNSPQLTDKQVM----GLSGSQKHDFLDRFN---PQEEQLHVFGDGLKKESKEEILPSYD 241
           MN+    DKQ+M    G S  Q  DF+D  +     E+Q    G G     KEEI PSYD
Sbjct: 1   MNTTPYMDKQIMDLSQGSSSPQMKDFIDLMSHPREDEDQTGHGGTGNGISKKEEIFPSYD 60

Query: 242 FQPIQPLRASHSINLEESNV-------GGIRGHNLVDSMSNS-SNLRTYGSLGTIESAKI 397
           FQP++P+    + +    N        G  R  +  DS   + S  R Y SL ++E AK 
Sbjct: 61  FQPLRPIAGLGASSSPPPNFDSAPAIGGSTRAWSPGDSKPKTGSPFRNYSSLDSVEPAKF 120

Query: 398 TQKKDRNAYHSEMV-SEIDQTMKIYTDNLLHVLEGVSARLSQLESRTHQLESSIDDLKIS 574
             +KD++++ S  + +EID+TMK + DNLLHVL+GVSARL+QLESRT  LE+S+DDLK+S
Sbjct: 121 ILEKDQSSFDSSTIMAEIDKTMKKHADNLLHVLDGVSARLTQLESRTRNLENSVDDLKVS 180

Query: 575 VGNNHGNTDGKLRQLENLLREVQMGVQVLRDXXXXXXXXXXXXXXXXSKGDQKSENQNS 751
           VGNNHG+TDGK+RQLEN+LREVQ GVQVL+D                S  DQ  E QN+
Sbjct: 181 VGNNHGSTDGKMRQLENILREVQSGVQVLKDKQEIVEAQLQLAKVQLSNVDQHQETQNT 239



 Score =  137 bits (344), Expect = 2e-29
 Identities = 92/250 (36%), Positives = 111/250 (44%), Gaps = 9/250 (3%)
 Frame = +3

Query: 1053 REPYFPPPGQLPETTHLXXXXXXXXXXXXXXXXXXXXXXXXXXLPQYTRXXXXXXXXXXX 1232
            R+PYFP PGQ  E  +                            PQ+++           
Sbjct: 334  RDPYFPAPGQTQEPQNQQYPGQQQLPPSAIPQPQQYQPTPQ---PQFSQPPPQPPQQHPS 390

Query: 1233 XXXXXXXXXXXXXXXXXEENLYMLPQSYPPIVRQQP--------PSQQFCGAPPPHMYEP 1388
                             EE  Y+  Q+YPP +RQ P        PSQQF GAPP H YEP
Sbjct: 391  LAPVNPAQLQPPLSHHSEEPPYVPSQNYPPNLRQPPSQPPTGPPPSQQFYGAPPSHGYEP 450

Query: 1389 T-SSRPNSGVSSGYAPPSGTNFNDNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNYPR 1565
              SSRP+S   SGY   SG +                                    YP+
Sbjct: 451  PPSSRPSSSFPSGYGLTSGLS------EQFHYGGLPSQYTSGVKPHSPTTAQSGGSGYPQ 504

Query: 1566 LPTAKILPHALPTVNSDVXXXXXXXXXNRVPVDDVVDKVATMGFSRDQVRATVRRLTENG 1745
            +PTA++LPHALP  ++           NRVP+DDV+DKV TMGF RD VRATVR+LTENG
Sbjct: 505  MPTARVLPHALPAASTVGAGSGSSGTGNRVPIDDVIDKVTTMGFPRDHVRATVRKLTENG 564

Query: 1746 QSVDLNVVLD 1775
            QSVDLNVVLD
Sbjct: 565  QSVDLNVVLD 574


>gb|ACZ74657.1| hypothetical protein [Phaseolus vulgaris]
          Length = 574

 Score =  175 bits (443), Expect = 7e-41
 Identities = 108/247 (43%), Positives = 149/247 (60%), Gaps = 23/247 (9%)
 Frame = +2

Query: 83  MNSPQLTDKQVM----GLSGSQKH--DFLD--RFNPQEEQLH---------------VFG 193
           MN+    DKQ+M    G S + +H  DF+D  +  P ++Q H                 G
Sbjct: 1   MNTTPFMDKQIMDLTHGSSTAHQHTKDFIDLMKHEPPQQQHHHQHREEDDDEEEEEKARG 60

Query: 194 DGLKKESKEEILPSYDFQPIQPLRASHSINLEESNVGGIRGHNLVDSMSNSSNLRTYGSL 373
           +G+   SK++I+PSYDFQPI+PL AS   +         R  N      + SN + Y SL
Sbjct: 61  NGI---SKDDIVPSYDFQPIRPLAASSYDSAPSFAAAFSRPWN------SESNSKNYSSL 111

Query: 374 GTIESAKITQKKDRNAYHSEMVSEIDQTMKIYTDNLLHVLEGVSARLSQLESRTHQLESS 553
            +IE AK+  +KDR+A  + M++EID+TM+ + +N+L+VLEGVSARL+QLE+RTH LE+S
Sbjct: 112 DSIEPAKVIVEKDRSASDASMLAEIDRTMQKHMENMLNVLEGVSARLTQLETRTHHLENS 171

Query: 554 IDDLKISVGNNHGNTDGKLRQLENLLREVQMGVQVLRDXXXXXXXXXXXXXXXXSKGDQK 733
           +DDLK+SVGNNHG TDGKLRQLEN+LREVQ GV  ++D                S  +QK
Sbjct: 172 VDDLKVSVGNNHGITDGKLRQLENILREVQSGVLTIKDKQDIMQAQLQFAKLQMSNTNQK 231

Query: 734 SENQNSS 754
            E Q+S+
Sbjct: 232 PEAQSST 238



 Score =  131 bits (329), Expect = 1e-27
 Identities = 81/166 (48%), Positives = 90/166 (54%), Gaps = 9/166 (5%)
 Frame = +3

Query: 1305 PQSYPPIVRQQPPSQQFCGAPPP--------HMYEPTSSRPNSGVSSGYAPPSGTNFNDN 1460
            PQ+YPP VRQ PPSQ   G PPP        H YEP SSRP SG SSGY   SG    + 
Sbjct: 393  PQTYPPNVRQ-PPSQSPSGPPPPQQFYGTPSHSYEPPSSRPGSGYSSGYGTLSGPGPAEQ 451

Query: 1461 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXN-YPRLPTAKILPHALPTVNSDVXXXXXX 1637
                                           + YP+LPTA+ILP ALPT ++        
Sbjct: 452  YRYGGPPPQYGSNPAMKPAQLPTASVSPSGGSGYPQLPTARILPQALPTASAVSGSSGSA 511

Query: 1638 XXXNRVPVDDVVDKVATMGFSRDQVRATVRRLTENGQSVDLNVVLD 1775
                RV VDDVVDKVA+MGF RD VRATVR+LTENGQSVDLN VLD
Sbjct: 512  GTGGRVSVDDVVDKVASMGFPRDHVRATVRKLTENGQSVDLNTVLD 557


>ref|XP_007150938.1| hypothetical protein PHAVU_004G007500g [Phaseolus vulgaris]
           gi|561024247|gb|ESW22932.1| hypothetical protein
           PHAVU_004G007500g [Phaseolus vulgaris]
          Length = 575

 Score =  174 bits (442), Expect = 9e-41
 Identities = 108/248 (43%), Positives = 149/248 (60%), Gaps = 24/248 (9%)
 Frame = +2

Query: 83  MNSPQLTDKQVM----GLSGSQKH--DFLD--RFNPQEEQLH----------------VF 190
           MN+    DKQ+M    G S + +H  DF+D  +  P ++Q H                  
Sbjct: 1   MNTTPFMDKQIMDLTHGSSTAHQHTKDFIDLMKHEPPQQQHHHQHREEDDDEEEEEEKAR 60

Query: 191 GDGLKKESKEEILPSYDFQPIQPLRASHSINLEESNVGGIRGHNLVDSMSNSSNLRTYGS 370
           G+G+   SK++I+PSYDFQPI+PL AS   +         R  N      + SN + Y S
Sbjct: 61  GNGI---SKDDIVPSYDFQPIRPLAASSYDSAPSFAAAFSRPWN------SESNSKNYSS 111

Query: 371 LGTIESAKITQKKDRNAYHSEMVSEIDQTMKIYTDNLLHVLEGVSARLSQLESRTHQLES 550
           L +IE AK+  +KDR+A  + M++EID+TM+ + +N+L+VLEGVSARL+QLE+RTH LE+
Sbjct: 112 LDSIEPAKVIVEKDRSASDASMLAEIDRTMQKHMENMLNVLEGVSARLTQLETRTHHLEN 171

Query: 551 SIDDLKISVGNNHGNTDGKLRQLENLLREVQMGVQVLRDXXXXXXXXXXXXXXXXSKGDQ 730
           S+DDLK+SVGNNHG TDGKLRQLEN+LREVQ GV  ++D                S  +Q
Sbjct: 172 SVDDLKVSVGNNHGITDGKLRQLENILREVQSGVLTIKDKQDIMQAQLQFAKLQMSNTNQ 231

Query: 731 KSENQNSS 754
           K E Q+S+
Sbjct: 232 KPEAQSST 239



 Score =  131 bits (329), Expect = 1e-27
 Identities = 81/166 (48%), Positives = 90/166 (54%), Gaps = 9/166 (5%)
 Frame = +3

Query: 1305 PQSYPPIVRQQPPSQQFCGAPPP--------HMYEPTSSRPNSGVSSGYAPPSGTNFNDN 1460
            PQ+YPP VRQ PPSQ   G PPP        H YEP SSRP SG SSGY   SG    + 
Sbjct: 394  PQTYPPNVRQ-PPSQSPSGPPPPQQFYGTPSHSYEPPSSRPGSGYSSGYGTLSGPGPAEQ 452

Query: 1461 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXN-YPRLPTAKILPHALPTVNSDVXXXXXX 1637
                                           + YP+LPTA+ILP ALPT ++        
Sbjct: 453  YRYGGPPPQYGGNPALKPPQLPTASVSPSGGSGYPQLPTARILPQALPTASAVSGSSGSA 512

Query: 1638 XXXNRVPVDDVVDKVATMGFSRDQVRATVRRLTENGQSVDLNVVLD 1775
                RV VDDVVDKVA+MGF RD VRATVR+LTENGQSVDLN VLD
Sbjct: 513  GTGGRVSVDDVVDKVASMGFPRDHVRATVRKLTENGQSVDLNTVLD 558


>ref|XP_007222918.1| hypothetical protein PRUPE_ppa003684mg [Prunus persica]
           gi|462419854|gb|EMJ24117.1| hypothetical protein
           PRUPE_ppa003684mg [Prunus persica]
          Length = 556

 Score =  174 bits (441), Expect = 1e-40
 Identities = 106/208 (50%), Positives = 149/208 (71%), Gaps = 13/208 (6%)
 Frame = +2

Query: 83  MNSPQLTDKQVMGLS--GSQKHDFLDRFN-PQEEQLHV-FGDGLKKESKEEILPSYDFQP 250
           MN+    DKQ+M LS   SQ++DF+   N PQE +  V +G+GL K   E+IL  Y FQ 
Sbjct: 1   MNTTSFLDKQIMDLSQGSSQQNDFIGLMNHPQEVEQQVGYGNGLSKN--EKILSDY-FQS 57

Query: 251 IQPLRAS--HSINLE-ESNVGG-----IRGHNLVDSMSNSSN-LRTYGSLGTIESAKITQ 403
           I+P+  S   S N++ + N+GG      R  N  +S SN+++ +R YGSL +I+ +++  
Sbjct: 58  IRPIIGSSFQSPNIDAKHNLGGGGEGSTRAWNSSESKSNTTSPIRNYGSLDSIKPSELIL 117

Query: 404 KKDRNAYHSEMVSEIDQTMKIYTDNLLHVLEGVSARLSQLESRTHQLESSIDDLKISVGN 583
           +KD+N   + +VSEID+TMK + ++LLHVLEGVS +L+QLESRT  LE+S++DLKISVGN
Sbjct: 118 EKDQNVPDATIVSEIDRTMKKHVNSLLHVLEGVSEKLTQLESRTCHLENSVEDLKISVGN 177

Query: 584 NHGNTDGKLRQLENLLREVQMGVQVLRD 667
           NHGNTDGK+RQLEN+LR+VQ G+QVL+D
Sbjct: 178 NHGNTDGKMRQLENVLRDVQTGIQVLKD 205



 Score =  126 bits (316), Expect = 4e-26
 Identities = 83/249 (33%), Positives = 106/249 (42%), Gaps = 8/249 (3%)
 Frame = +3

Query: 1053 REPYFPPPGQLPETTHLXXXXXXXXXXXXXXXXXXXXXXXXXXLPQYTRXXXXXXXXXXX 1232
            R+PYFP PGQ  E  +                            PQ+++           
Sbjct: 296  RDPYFPVPGQTQEAPNQQYQLPPSQQSLPPPTAAPHQQFQPTTQPQHSQPPPQLPQQHPS 355

Query: 1233 XXXXXXXXXXXXXXXXXEENLYMLPQSYPPIVRQQP--------PSQQFCGAPPPHMYEP 1388
                             EE  Y+   SYPP + Q P        PSQQ+ G P  H YEP
Sbjct: 356  LAPVNPSQLRPTLGHHAEETPYVPSLSYPPNLPQPPYQTPSGLPPSQQYYG-PGSHAYEP 414

Query: 1389 TSSRPNSGVSSGYAPPSGTNFNDNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNYPRL 1568
             SS+ ++G SSGY PPS                                       YP+L
Sbjct: 415  PSSKSSTGFSSGYGPPSALG----ETYHYGGSLQDDSSSMKPRMPSSATAHSGGIGYPQL 470

Query: 1569 PTAKILPHALPTVNSDVXXXXXXXXXNRVPVDDVVDKVATMGFSRDQVRATVRRLTENGQ 1748
            P A++LPHA PT +            N+VP+DDV+D V TMGFSRD VRAT+R+LT+NGQ
Sbjct: 471  PVARVLPHASPTSSRVGGSSGSAGTGNKVPIDDVIDHVTTMGFSRDHVRATIRKLTDNGQ 530

Query: 1749 SVDLNVVLD 1775
            +VD+NVVLD
Sbjct: 531  AVDVNVVLD 539


>ref|NP_186805.2| uncharacterized protein [Arabidopsis thaliana]
           gi|332640167|gb|AEE73688.1| uncharacterized protein
           AT3G01560 [Arabidopsis thaliana]
          Length = 511

 Score =  168 bits (425), Expect = 8e-39
 Identities = 100/207 (48%), Positives = 135/207 (65%), Gaps = 12/207 (5%)
 Frame = +2

Query: 83  MNSPQLTDKQVMGLSGSQK---HDFLDRFNPQEEQLH----VFGDGLKKESKEEILPSYD 241
           MN+ Q  DKQ+M LS S      DF+D  N  +   H    V GD      KE I+PSYD
Sbjct: 1   MNTCQFMDKQIMDLSSSSSLPSTDFIDLMNNHDGDDHQKKQVIGDNGLDSKKEVIVPSYD 60

Query: 242 FQPIQPL---RASHS-INLEESNVGGIRGHNLVDSMSNSSNLRTYGSLGTIESAKITQKK 409
           F PI+P    R SHS ++L  S        +    +S +S    +GSL +IE +K+   K
Sbjct: 61  FHPIRPTTAARLSHSALDLAGSTTRVNWSASDYKPVSTTSPNTNFGSLDSIEPSKLVPDK 120

Query: 410 DRNAYHSEMVSEI-DQTMKIYTDNLLHVLEGVSARLSQLESRTHQLESSIDDLKISVGNN 586
            +N +++ ++SEI D+TMK +TD LLHV+EGVSARLSQLE+RTH LE+ +DDLK+SV N+
Sbjct: 121 GQNVFNTTIMSEIIDRTMKKHTDTLLHVMEGVSARLSQLETRTHNLENLVDDLKVSVDNS 180

Query: 587 HGNTDGKLRQLENLLREVQMGVQVLRD 667
           HG+TDGK+RQL+N+L EVQ GVQ+L+D
Sbjct: 181 HGSTDGKMRQLKNILVEVQSGVQLLKD 207



 Score =  104 bits (259), Expect = 1e-19
 Identities = 71/166 (42%), Positives = 82/166 (49%), Gaps = 10/166 (6%)
 Frame = +3

Query: 1308 QSYPPIV-RQQPP-----SQQFCGAPPPH--MYEPTSSRPNSGVSSGYAPPSGTNFNDNX 1463
            QSYPP   RQQPP     SQQF   P P   MY+    R NSG  SGY     T      
Sbjct: 346  QSYPPNPPRQQPPAGSTPSQQFYNPPQPQPSMYDGAGGRSNSGFPSGYLSEPYTYSGS-- 403

Query: 1464 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNYPRLPTAKILPHALPTVN--SDVXXXXXX 1637
                                           YP+L  ++ LPHALP V+  S        
Sbjct: 404  ---------------PMSSAKPPHISSNGTGYPQLSNSRPLPHALPMVSAVSSGGGSSSP 448

Query: 1638 XXXNRVPVDDVVDKVATMGFSRDQVRATVRRLTENGQSVDLNVVLD 1775
               +R P+DDV+D+V TMGF RDQVRATVR+LTENGQ+VDLNVVLD
Sbjct: 449  RSESRAPIDDVIDRVTTMGFPRDQVRATVRKLTENGQAVDLNVVLD 494


>gb|AAF01541.1|AC009325_11 unknown protein [Arabidopsis thaliana]
          Length = 493

 Score =  167 bits (424), Expect = 1e-38
 Identities = 96/203 (47%), Positives = 132/203 (65%), Gaps = 8/203 (3%)
 Frame = +2

Query: 83  MNSPQLTDKQVMGLSGSQK---HDFLDRFNPQEEQLH----VFGDGLKKESKEEILPSYD 241
           MN+ Q  DKQ+M LS S      DF+D  N  +   H    V GD      KE I+PSYD
Sbjct: 1   MNTCQFMDKQIMDLSSSSSLPSTDFIDLMNNHDGDDHQKKQVIGDNGLDSKKEVIVPSYD 60

Query: 242 FQPIQPLRASHSINLEESNVGGIRGHNLVDSMSNSSNLRTYGSLGTIESAKITQKKDRNA 421
           F PI+P  A+               H+ +D   +++  R +GSL +IE +K+   K +N 
Sbjct: 61  FHPIRPTTAARL------------SHSALDLAGSTT--RNFGSLDSIEPSKLVPDKGQNV 106

Query: 422 YHSEMVSEI-DQTMKIYTDNLLHVLEGVSARLSQLESRTHQLESSIDDLKISVGNNHGNT 598
           +++ ++SEI D+TMK +TD LLHV+EGVSARLSQLE+RTH LE+ +DDLK+SV N+HG+T
Sbjct: 107 FNTTIMSEIIDRTMKKHTDTLLHVMEGVSARLSQLETRTHNLENLVDDLKVSVDNSHGST 166

Query: 599 DGKLRQLENLLREVQMGVQVLRD 667
           DGK+RQL+N+L EVQ GVQ+L+D
Sbjct: 167 DGKMRQLKNILVEVQSGVQLLKD 189



 Score =  104 bits (259), Expect = 1e-19
 Identities = 71/166 (42%), Positives = 82/166 (49%), Gaps = 10/166 (6%)
 Frame = +3

Query: 1308 QSYPPIV-RQQPP-----SQQFCGAPPPH--MYEPTSSRPNSGVSSGYAPPSGTNFNDNX 1463
            QSYPP   RQQPP     SQQF   P P   MY+    R NSG  SGY     T      
Sbjct: 328  QSYPPNPPRQQPPAGSTPSQQFYNPPQPQPSMYDGAGGRSNSGFPSGYLSEPYTYSGS-- 385

Query: 1464 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNYPRLPTAKILPHALPTVN--SDVXXXXXX 1637
                                           YP+L  ++ LPHALP V+  S        
Sbjct: 386  ---------------PMSSAKPPHISSNGTGYPQLSNSRPLPHALPMVSAVSSGGGSSSP 430

Query: 1638 XXXNRVPVDDVVDKVATMGFSRDQVRATVRRLTENGQSVDLNVVLD 1775
               +R P+DDV+D+V TMGF RDQVRATVR+LTENGQ+VDLNVVLD
Sbjct: 431  RSESRAPIDDVIDRVTTMGFPRDQVRATVRKLTENGQAVDLNVVLD 476


>ref|XP_006470271.1| PREDICTED: COPII coat assembly protein sec16-like [Citrus sinensis]
          Length = 574

 Score =  167 bits (423), Expect = 1e-38
 Identities = 110/239 (46%), Positives = 144/239 (60%), Gaps = 16/239 (6%)
 Frame = +2

Query: 83  MNSPQLTDKQVMGL--SGSQKHDFLDRFN----PQEEQ----LHVFGDGLKKESKEEILP 232
           MN+ Q  DKQ+M L  S S   D +D  N    PQ E+    ++  G G+KKE   EI+P
Sbjct: 1   MNTSQFMDKQIMDLTSSPSMDKDLMDLTNHHRPPQHEEDDRDVNNNGIGIKKE---EIVP 57

Query: 233 SYDFQPIQPLRASHSINLEES-NVGGIRGHNLVDSMSNSSN-----LRTYGSLGTIESAK 394
           SYDF PI+    S S+NL+ S N     G  + +S  N  N     +R +GSL   +  K
Sbjct: 58  SYDFLPIRG-GLSQSLNLDSSVNTDAAVGARVWNSSENKPNSSLSPVRNFGSLDNFDCPK 116

Query: 395 ITQKKDRNAYHSEMVSEIDQTMKIYTDNLLHVLEGVSARLSQLESRTHQLESSIDDLKIS 574
                  N   + +VS+IDQTMK Y DNLLHVLEGVSARL+QL++RT  LESS+DDLK+S
Sbjct: 117 FNLG---NRSDATIVSDIDQTMKKYADNLLHVLEGVSARLTQLDARTRNLESSVDDLKVS 173

Query: 575 VGNNHGNTDGKLRQLENLLREVQMGVQVLRDXXXXXXXXXXXXXXXXSKGDQKSENQNS 751
           VG+NH +TDGK+RQ+EN+LREVQ GV VL+D                SK DQ SE++++
Sbjct: 174 VGSNHASTDGKMRQVENILREVQSGVLVLKDKQEMLEAQMQHGKLQGSKVDQPSESRST 232



 Score =  130 bits (327), Expect = 2e-27
 Identities = 74/172 (43%), Positives = 93/172 (54%), Gaps = 8/172 (4%)
 Frame = +3

Query: 1284 EENLYMLPQSYPPIVRQQ--------PPSQQFCGAPPPHMYEPTSSRPNSGVSSGYAPPS 1439
            EE  YM  Q+YPP +RQ         PPSQ + GAPP H+YE   SRPNSG  +GY   S
Sbjct: 388  EETAYMPSQNYPPNLRQSASQTPSVSPPSQTYYGAPPSHLYESPPSRPNSGFPTGYGTHS 447

Query: 1440 GTNFNDNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNYPRLPTAKILPHALPTVNSDV 1619
            G N  +                                 Y +LP+A++LP ++P+ +   
Sbjct: 448  GPN--EPHPYGGPPSQYVSGSTIKPQQHSSAMMHSGGSGYLQLPSARVLPQSIPSASGVS 505

Query: 1620 XXXXXXXXXNRVPVDDVVDKVATMGFSRDQVRATVRRLTENGQSVDLNVVLD 1775
                     NRVP+DDVVDKVA+MGF RD VRATV+++TENGQSVDLN VLD
Sbjct: 506  GGPGSPGTGNRVPIDDVVDKVASMGFPRDHVRATVQKMTENGQSVDLNKVLD 557


>ref|XP_002873675.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
           gi|297319512|gb|EFH49934.1| predicted protein
           [Arabidopsis lyrata subsp. lyrata]
          Length = 444

 Score =  167 bits (423), Expect = 1e-38
 Identities = 99/232 (42%), Positives = 143/232 (61%), Gaps = 9/232 (3%)
 Frame = +2

Query: 83  MNSPQLTDKQVMGLSGSQKHDFLDRFNPQE---EQLHVFGDGLKKESKEEILPSYDFQPI 253
           MN+   +DKQ+M L         D  N Q+    Q H  GD   + +KE I PSYDF PI
Sbjct: 1   MNTALFSDKQIMDLMN-------DNNNSQDGDHHQKHRVGDNGLESNKEAIFPSYDFHPI 53

Query: 254 QPLRA----SHSINLEES-NVGGIRGHNLVDSMS-NSSNLRTYGSLGTIESAKITQKKDR 415
           +P  +     H+++L  S N    R  +  D    ++S+ R+YGS+ ++E +K+  +KDR
Sbjct: 54  RPNASVGLSHHALDLAGSVNSTAARVWDASDPKPVSASSARSYGSMDSLEPSKLFAEKDR 113

Query: 416 NAYHSEMVSEIDQTMKIYTDNLLHVLEGVSARLSQLESRTHQLESSIDDLKISVGNNHGN 595
           N+  S ++S ID+TMK + D+LLHV+EGVSARL+QLE+RT  LE+ +DD+K+SVGN+HG 
Sbjct: 114 NSPESAIISAIDRTMKAHADSLLHVMEGVSARLTQLETRTRNLENLVDDVKVSVGNSHGK 173

Query: 596 TDGKLRQLENLLREVQMGVQVLRDXXXXXXXXXXXXXXXXSKGDQKSENQNS 751
           TDGKLRQLEN++ EVQ GVQ+L+D                SK +Q+ E  ++
Sbjct: 174 TDGKLRQLENIMLEVQSGVQLLKDKQEIVEAQLQLSKLQLSKVNQQPETHST 225


>ref|XP_006289230.1| hypothetical protein CARUB_v10002686mg [Capsella rubella]
           gi|482557936|gb|EOA22128.1| hypothetical protein
           CARUB_v10002686mg [Capsella rubella]
          Length = 541

 Score =  166 bits (420), Expect = 3e-38
 Identities = 97/229 (42%), Positives = 144/229 (62%), Gaps = 6/229 (2%)
 Frame = +2

Query: 83  MNSPQLTDKQVMGLSGSQKHDFLDRFNPQEEQLHVFGDGLKKESKEEILPSYDFQPIQPL 262
           MN+   +DKQ+M L     ++  D  + +    +   +GL+ + KE I PSYDFQPI+P 
Sbjct: 1   MNTALFSDKQIMDLMNDDNNNSQDGDHQKHRAGNCSNNGLESK-KEAIFPSYDFQPIRPN 59

Query: 263 RA----SHSINLEES-NVGGIRGHNLVDSMS-NSSNLRTYGSLGTIESAKITQKKDRNAY 424
            +     H+++L  S N    R  ++ D     +S+ R+YGS+ ++E +K+  +KDRNA 
Sbjct: 60  ASVGLSHHALDLAGSVNPTAARVWDVSDPKPVATSSARSYGSMDSLEPSKLFAEKDRNAP 119

Query: 425 HSEMVSEIDQTMKIYTDNLLHVLEGVSARLSQLESRTHQLESSIDDLKISVGNNHGNTDG 604
            S ++S ID+TMK + DNL+HV+E VSARL+QLE+RT  LE+ +DD+K+SVGN+HG TDG
Sbjct: 120 DSAILSAIDRTMKAHADNLIHVIECVSARLTQLETRTRNLENLVDDVKVSVGNSHGTTDG 179

Query: 605 KLRQLENLLREVQMGVQVLRDXXXXXXXXXXXXXXXXSKGDQKSENQNS 751
           KLRQLEN++ EVQ GVQ+L+D                SK +Q+ E  +S
Sbjct: 180 KLRQLENIMLEVQSGVQLLKDKQEIVEAQLQLSKLQLSKVNQQPETHSS 228



 Score =  101 bits (252), Expect = 1e-18
 Identities = 68/178 (38%), Positives = 85/178 (47%), Gaps = 22/178 (12%)
 Frame = +3

Query: 1308 QSYPPIVRQQPPS---------QQFCGAPP--PHMYEPTSSRPNSGVSSGYAP------- 1433
            QSYPP   +QPPS         QQ+  APP  P +Y+    R NSG +SGY+P       
Sbjct: 358  QSYPPNPPRQPPSHPPTVSAPSQQYYNAPPTPPSIYDGAGGRSNSGFASGYSPEPYPYTG 417

Query: 1434 PSGTNFNDNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNYPRLPTAKILPHALPTVNS 1613
            P  + + +                                 YP+LP A+ LP  LP  ++
Sbjct: 418  PPSSQYGNTSSVKPSHQSGSGSGA-----------------YPQLPMARPLPQGLPMASA 460

Query: 1614 ----DVXXXXXXXXXNRVPVDDVVDKVATMGFSRDQVRATVRRLTENGQSVDLNVVLD 1775
                           ++ PVDDV+DKV TMGF RDQVR TVR LTENGQ+VDLNVVLD
Sbjct: 461  ISSGGSGGSGSPRSGSQAPVDDVIDKVVTMGFPRDQVRGTVRTLTENGQAVDLNVVLD 518

