BLASTX nr result

ID: Paeonia22_contig00023078 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia22_contig00023078
         (1450 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007017377.1| Uncharacterized protein TCM_033924 [Theobrom...   383   e-103
ref|XP_002510285.1| signal peptidase I, putative [Ricinus commun...   369   1e-99
ref|XP_006375011.1| hypothetical protein POPTR_0014s03560g [Popu...   365   3e-98
ref|XP_007160813.1| hypothetical protein PHAVU_001G018600g [Phas...   351   4e-94
ref|XP_006354938.1| PREDICTED: uncharacterized protein LOC102588...   345   3e-92
ref|XP_004238590.1| PREDICTED: uncharacterized protein LOC101263...   342   2e-91
ref|XP_004499101.1| PREDICTED: uncharacterized protein LOC101493...   340   7e-91
gb|AGV54177.1| signal peptidase I [Phaseolus vulgaris]                337   8e-90
gb|EXB38625.1| putative thylakoidal processing peptidase 2 [Moru...   336   1e-89
ref|XP_003549415.1| PREDICTED: uncharacterized protein LOC100804...   330   7e-88
ref|XP_003589258.1| hypothetical protein MTR_1g021180 [Medicago ...   323   2e-85
ref|XP_004160620.1| PREDICTED: uncharacterized protein LOC101229...   320   8e-85
ref|XP_004141368.1| PREDICTED: uncharacterized protein LOC101221...   320   8e-85
gb|AAM61120.1| unknown [Arabidopsis thaliana]                         319   2e-84
ref|NP_564503.1| uncharacterized protein [Arabidopsis thaliana] ...   318   3e-84
gb|EYU36280.1| hypothetical protein MIMGU_mgv1a007411mg [Mimulus...   314   7e-83
ref|XP_002891379.1| hypothetical protein ARALYDRAFT_473912 [Arab...   314   7e-83
ref|XP_006393531.1| hypothetical protein EUTSA_v10011550mg [Eutr...   311   6e-82
ref|XP_006307471.1| hypothetical protein CARUB_v10009097mg [Caps...   306   1e-80
ref|XP_007225638.1| hypothetical protein PRUPE_ppa008077mg [Prun...   292   3e-76

>ref|XP_007017377.1| Uncharacterized protein TCM_033924 [Theobroma cacao]
            gi|508722705|gb|EOY14602.1| Uncharacterized protein
            TCM_033924 [Theobroma cacao]
          Length = 387

 Score =  383 bits (984), Expect = e-103
 Identities = 197/358 (55%), Positives = 254/358 (70%), Gaps = 1/358 (0%)
 Frame = +2

Query: 194  LQDVLKAISARQRWDLGDIRVSKLDVRKAKFGRAQRIEFQIHIGKNDLIFGFPDEVGLWK 373
            LQDVL+ I+ +Q W+L  +  SKL+V KA+FG  +R EF+I  GK  L+F FPDEV  W 
Sbjct: 34   LQDVLEKIALKQEWELEGLNFSKLEVSKARFGAGKRYEFRIRFGKTHLLFKFPDEVSSWS 93

Query: 374  DLSEG-GDDFGFLINEVSSSMSVLDTFKVEGPMELWVSGDDELSIMLPLNTSHTGLKRIL 550
               +G GDDF   + E++S+   LD+FK+EGP EL ++ + + S++LPLNTSHT LKR+L
Sbjct: 94   KFRKGSGDDFLDFVKEINSTAG-LDSFKMEGPFELRLAPNHQASLLLPLNTSHTDLKRVL 152

Query: 551  VGEGITVEVNGAREVSLFHTSDLGLSVNGSVVVKKDRSEFWPLWHSLCKPLPPIHIFGSS 730
            VGEGITVEV+GA+EVSLFH    GL VN S V  ++++ +WP   S C PL P+++ GS 
Sbjct: 153  VGEGITVEVSGAQEVSLFHAFSFGLPVNESEV--EEKTGYWPFRQSFCMPLLPVNVLGSV 210

Query: 731  SLVAFRTRNRDAYIEITSLSKDMIELLPEKCYSGRHINKKRACPIDSXXXXXXXXXXXXX 910
            SLVA++TRN DA+IE   LS D IELLPEKCY  R    K++ P+DS             
Sbjct: 211  SLVAYQTRNPDAHIEAVFLSSDTIELLPEKCYGDRAY-MKQSYPMDSISLRISKLRKVLR 269

Query: 911  SFLGDRIRQNGLLGSLKAENKASTVVRFQLELERDIRSNESFKGILAEWRTKPSVERVSF 1090
            +FLGDR   NG   SL  + KAS ++ FQLELE+ I  NE+ +G+LAEWR+KP+VER+ F
Sbjct: 270  TFLGDRDNGNGFSSSLNVKTKASPIIHFQLELEKTIGKNETVRGMLAEWRSKPTVERLWF 329

Query: 1091 EVVARVEAERLKPLTVKKVRPFIEADLTSWSNLMSNISFTKFPSILVPPEAFMLDVKW 1264
            +V AR+EAE+LKPL +KKVRPF+  D  SWSNL+SNISFTKFPSILVPPEA  LDVKW
Sbjct: 330  DVTARIEAEKLKPLMIKKVRPFVGVDTVSWSNLLSNISFTKFPSILVPPEALTLDVKW 387


>ref|XP_002510285.1| signal peptidase I, putative [Ricinus communis]
            gi|223550986|gb|EEF52472.1| signal peptidase I, putative
            [Ricinus communis]
          Length = 831

 Score =  369 bits (948), Expect = 1e-99
 Identities = 196/380 (51%), Positives = 253/380 (66%)
 Frame = +2

Query: 125  PSLNASSPTPSTTHESRPQPSHFLQDVLKAISARQRWDLGDIRVSKLDVRKAKFGRAQRI 304
            P+L  +   P   H      +  L+DVLK IS R  WDL  IR SKL V K +FG AQR 
Sbjct: 457  PTLILAINIPDPNHHITNNNTDILEDVLKEISERHNWDLERIRTSKLKVSKIRFGTAQRY 516

Query: 305  EFQIHIGKNDLIFGFPDEVGLWKDLSEGGDDFGFLINEVSSSMSVLDTFKVEGPMELWVS 484
            EF+I  GK  LIF FPDEV  WK  ++  DDF   + E+ ++ +VLDTFKVEGP +LW+ 
Sbjct: 517  EFRIRFGKMSLIFKFPDEVYSWKRYNKKNDDFENSVKEIGTA-AVLDTFKVEGPFDLWIG 575

Query: 485  GDDELSIMLPLNTSHTGLKRILVGEGITVEVNGAREVSLFHTSDLGLSVNGSVVVKKDRS 664
            G D LS+ LPLN SH+ LKR+LVGEGITVEV  A+++S+F T D   S+NG V + K +S
Sbjct: 576  GQDHLSLSLPLNVSHSSLKRMLVGEGITVEVKDAQQLSIFQTFDPSFSMNGRVKINKGKS 635

Query: 665  EFWPLWHSLCKPLPPIHIFGSSSLVAFRTRNRDAYIEITSLSKDMIELLPEKCYSGRHIN 844
             F   W  LC PL PI + GS+SL+A++TRN DA +E T LS+  I+LL EKCYS   + 
Sbjct: 636  GFCLFWRQLCMPLLPIRVIGSASLIAYKTRNPDAPVETTLLSEGTIKLLSEKCYSD-DLY 694

Query: 845  KKRACPIDSXXXXXXXXXXXXXSFLGDRIRQNGLLGSLKAENKASTVVRFQLELERDIRS 1024
            K +A                  +FLG+++    L G L++  KA+T++RFQLELE++I S
Sbjct: 695  KNQAQLSHFLSLKIDRLGKLLRTFLGNQME---LSGFLRSNVKAATIIRFQLELEKNIGS 751

Query: 1025 NESFKGILAEWRTKPSVERVSFEVVARVEAERLKPLTVKKVRPFIEADLTSWSNLMSNIS 1204
            + +    L +WRT+P++ERV FEV+ARVE E+L+P+ VKKVRPFI  D  SWSNLMSN+S
Sbjct: 752  SATLHDALEDWRTRPTIERVYFEVLARVEDEKLRPVVVKKVRPFIAVDSASWSNLMSNLS 811

Query: 1205 FTKFPSILVPPEAFMLDVKW 1264
            FTKFPSILVPPEA  LDVKW
Sbjct: 812  FTKFPSILVPPEALTLDVKW 831


>ref|XP_006375011.1| hypothetical protein POPTR_0014s03560g [Populus trichocarpa]
            gi|550323325|gb|ERP52808.1| hypothetical protein
            POPTR_0014s03560g [Populus trichocarpa]
          Length = 398

 Score =  365 bits (937), Expect = 3e-98
 Identities = 201/375 (53%), Positives = 257/375 (68%), Gaps = 3/375 (0%)
 Frame = +2

Query: 149  TPSTTHESRPQPSHFLQDVLKAISARQRWDLGDIRVSKLDVRKAKFGRAQRIEFQIHIGK 328
            TP+  +++  Q   FL+DVLK IS +Q WDL  I +SKL+V K +   +QR EF+I +GK
Sbjct: 29   TPNHLNDNNTQ---FLKDVLKEISVKQDWDLEGIEISKLEVSKVRIFSSQRYEFKIRVGK 85

Query: 329  NDLIFGFPDEVGLWKDLSE--GGDDFGFLINEVSSSMSVLDTFKVEGPMELWVSGDDELS 502
            + ++  FPDE+   K LS+     DFG LI E  S + VLDT K++GP +LWVSG D  S
Sbjct: 86   SYMLLKFPDEIDSRKKLSKPKSSIDFGDLIKEFGS-VPVLDTLKLQGPFDLWVSGHDNFS 144

Query: 503  IMLPLNTSHTGLKRILVGEGITVEVNGAREVSLFHTSDLGLSVNGSVVVK-KDRSEFWPL 679
            ++LP+N S+ GLKRI+VGEGI+VEV GA+EVSLF   DL L++NGS +   K  + F+P 
Sbjct: 145  LLLPMNASYGGLKRIIVGEGISVEVKGAKEVSLFQDFDLSLALNGSDINNNKGGNGFYPF 204

Query: 680  WHSLCKPLPPIHIFGSSSLVAFRTRNRDAYIEITSLSKDMIELLPEKCYSGRHINKKRAC 859
              S+C PL PI I GS+SLVA +  + DA IE   LSK  IEL+ +KCY  R++ K RA 
Sbjct: 205  GDSICPPLLPIRIIGSASLVANKNWDPDAEIETRLLSKKTIELVSDKCYD-RNVYKIRAS 263

Query: 860  PIDSXXXXXXXXXXXXXSFLGDRIRQNGLLGSLKAENKASTVVRFQLELERDIRSNESFK 1039
             +               SFLGDRI +NGL   L+A  KAST++RFQLELE+   SNE+ +
Sbjct: 264  TMHFLSSSIARLEEVLRSFLGDRITRNGLSSFLRATAKASTLIRFQLELEKSFGSNETAQ 323

Query: 1040 GILAEWRTKPSVERVSFEVVARVEAERLKPLTVKKVRPFIEADLTSWSNLMSNISFTKFP 1219
             + AEWRT+P+VERV FEV+ARVE E+LKP+ VKKVRPFI  D  SWSNLMSNISFT FP
Sbjct: 324  EVFAEWRTRPTVERVWFEVIARVEGEKLKPVIVKKVRPFIAVDSASWSNLMSNISFTNFP 383

Query: 1220 SILVPPEAFMLDVKW 1264
            S+LVPPEA  LDVKW
Sbjct: 384  SVLVPPEALTLDVKW 398


>ref|XP_007160813.1| hypothetical protein PHAVU_001G018600g [Phaseolus vulgaris]
            gi|561034277|gb|ESW32807.1| hypothetical protein
            PHAVU_001G018600g [Phaseolus vulgaris]
          Length = 384

 Score =  351 bits (901), Expect = 4e-94
 Identities = 181/369 (49%), Positives = 242/369 (65%)
 Frame = +2

Query: 158  TTHESRPQPSHFLQDVLKAISARQRWDLGDIRVSKLDVRKAKFGRAQRIEFQIHIGKNDL 337
            T   S    +H LQDVL+A+SA+Q+WD  D+RV+KLD  K +FG +Q  EF+I +G  + 
Sbjct: 18   TAFASSSNLTHILQDVLRAVSAKQKWDSNDVRVAKLDAAKVRFGTSQSYEFRIGLGTGNF 77

Query: 338  IFGFPDEVGLWKDLSEGGDDFGFLINEVSSSMSVLDTFKVEGPMELWVSGDDELSIMLPL 517
               F D+V  W        D   L++ + S   +L T K+EGP  L V     LS+ LP+
Sbjct: 78   TLKFADQVATWNKFRTPFPDLPSLVHRLGS-FPLLPTLKLEGPFSLRVDSLHNLSLFLPM 136

Query: 518  NTSHTGLKRILVGEGITVEVNGAREVSLFHTSDLGLSVNGSVVVKKDRSEFWPLWHSLCK 697
            N S+TGLK+ILVGEGITVEV GA+E+SLF++SD+ L +NGS +    +S+ WP  HS C 
Sbjct: 137  NVSYTGLKQILVGEGITVEVKGAQEISLFYSSDIDLLMNGSAMCSGGKSDIWPFLHSTCM 196

Query: 698  PLPPIHIFGSSSLVAFRTRNRDAYIEITSLSKDMIELLPEKCYSGRHINKKRACPIDSXX 877
             + PI I GS+SLVA+R RN  A+I  T +S+D IE+LPEKCY GR + KK+ACP+DS  
Sbjct: 197  AVIPIRISGSASLVAYRARNPYAHIATTLISEDAIEMLPEKCYHGR-MFKKQACPLDSVS 255

Query: 878  XXXXXXXXXXXSFLGDRIRQNGLLGSLKAENKASTVVRFQLELERDIRSNESFKGILAEW 1057
                       S LG +I Q    G LKA  KAS VV+F++ELERDIR+N +    + +W
Sbjct: 256  LKLSMLEKVLRSLLGRKILQGQSFGLLKANIKASAVVKFRIELERDIRNNVTLNRTIPDW 315

Query: 1058 RTKPSVERVSFEVVARVEAERLKPLTVKKVRPFIEADLTSWSNLMSNISFTKFPSILVPP 1237
            RT+PS ER  FE++ARVE  RLKPL++KKV+PFIE+   SW+NLMSN+S+T    + +PP
Sbjct: 316  RTRPSFERFWFEILARVEENRLKPLSIKKVKPFIESVSVSWANLMSNMSYTMLRPVFLPP 375

Query: 1238 EAFMLDVKW 1264
            E   LDVKW
Sbjct: 376  EPLTLDVKW 384


>ref|XP_006354938.1| PREDICTED: uncharacterized protein LOC102588271 [Solanum tuberosum]
          Length = 420

 Score =  345 bits (885), Expect = 3e-92
 Identities = 182/369 (49%), Positives = 245/369 (66%), Gaps = 3/369 (0%)
 Frame = +2

Query: 167  ESRPQPSHFLQDVLKAISARQRWDLGDIRVSKLDVRKAKFGRAQRIEFQIHIGKNDLIFG 346
            +S P P  FL+DVL+ I+ R++WDL D+RVSKLDV+K+KFG  ++ EF++ IGK + +F 
Sbjct: 54   QSSPNPPSFLEDVLEGIAEREKWDLQDLRVSKLDVKKSKFGTFRKYEFRVRIGKTEFVFM 113

Query: 347  FPDEVGLWKDL---SEGGDDFGFLINEVSSSMSVLDTFKVEGPMELWVSGDDELSIMLPL 517
              DEV  WK     ++   DF  L+ E+ S ++ LD  K++GP EL+ +GDD LS+  PL
Sbjct: 114  MADEVSQWKSFHFPNKNESDFESLVKEIGSKVT-LDVLKIQGPFELYATGDDYLSLTFPL 172

Query: 518  NTSHTGLKRILVGEGITVEVNGAREVSLFHTSDLGLSVNGSVVVKKDRSEFWPLWHSLCK 697
            N+S+TGLK+ILVGEGITVEV GA E+S+F+ SDL   VNGS++ K    +F  +  S C 
Sbjct: 173  NSSYTGLKKILVGEGITVEVKGADEISMFNISDLLKLVNGSILTKSGSGQFRYMSQSSCI 232

Query: 698  PLPPIHIFGSSSLVAFRTRNRDAYIEITSLSKDMIELLPEKCYSGRHINKKRACPIDSXX 877
            PL P+H+ G +S++A+ TRN D  IE  S+SK  I+LL EKCY+ RHI +K +   D   
Sbjct: 233  PLLPVHVRGPASVLAYITRNPDLRIETASVSKRSIKLLSEKCYT-RHIYRKWSLYNDFLS 291

Query: 878  XXXXXXXXXXXSFLGDRIRQNGLLGSLKAENKASTVVRFQLELERDIRSNESFKGILAEW 1057
                        FLG +  +      +K + K  T+ RFQLELER I++N+++   L EW
Sbjct: 292  QKITLLEKILRRFLGGKTSEIARFNLIKVKVKDLTLFRFQLELERGIQNNDTYWTTLGEW 351

Query: 1058 RTKPSVERVSFEVVARVEAERLKPLTVKKVRPFIEADLTSWSNLMSNISFTKFPSILVPP 1237
            RT+P+VE   FEV AR EAE LKP  +KKVRPFIE D +SWSNLMSN+SFTK  S LVPP
Sbjct: 352  RTRPAVEHSWFEVTARFEAEILKPRLIKKVRPFIEVDSSSWSNLMSNMSFTKISSFLVPP 411

Query: 1238 EAFMLDVKW 1264
            E   LDV+W
Sbjct: 412  EPLTLDVRW 420


>ref|XP_004238590.1| PREDICTED: uncharacterized protein LOC101263904 [Solanum
            lycopersicum]
          Length = 853

 Score =  342 bits (877), Expect = 2e-91
 Identities = 182/369 (49%), Positives = 244/369 (66%), Gaps = 3/369 (0%)
 Frame = +2

Query: 167  ESRPQPSHFLQDVLKAISARQRWDLGDIRVSKLDVRKAKFGRAQRIEFQIHIGKNDLIFG 346
            +S P P  FL+DVLK I+ R++WDL D+RVSKLDV+K+KFG  +R EF++ IGK + +F 
Sbjct: 487  QSSPNPPSFLEDVLKGIAEREKWDLQDLRVSKLDVKKSKFGTLRRYEFRVRIGKTEFVFM 546

Query: 347  FPDEVGLWKDL---SEGGDDFGFLINEVSSSMSVLDTFKVEGPMELWVSGDDELSIMLPL 517
              DEV  WK L   ++   DF  L+ E+ S  + LD  K++GP EL+ +GDD LS+ LPL
Sbjct: 547  MADEVSQWKGLHFPNKNESDFESLVKEIGSK-ATLDVLKIQGPFELYATGDDYLSLTLPL 605

Query: 518  NTSHTGLKRILVGEGITVEVNGAREVSLFHTSDLGLSVNGSVVVKKDRSEFWPLWHSLCK 697
            N+S+TGLK+ILV EGITVEV GA E+S+F+ SDL   VNGS++ K    ++  +  S C 
Sbjct: 606  NSSYTGLKKILVDEGITVEVKGADEISMFNISDLLKLVNGSMLTKSGSGQYRYMLQSSCI 665

Query: 698  PLPPIHIFGSSSLVAFRTRNRDAYIEITSLSKDMIELLPEKCYSGRHINKKRACPIDSXX 877
            PL P+H+ G +S++A+ TRN D  IE   +S+  I+LL +KCY+ RHI +K +   D   
Sbjct: 666  PLLPVHVKGPASVLAYITRNPDLRIETVFVSRRSIKLLSQKCYT-RHIYRKWSSYNDFQS 724

Query: 878  XXXXXXXXXXXSFLGDRIRQNGLLGSLKAENKASTVVRFQLELERDIRSNESFKGILAEW 1057
                        FLG +  Q G    LK + K  T+ RFQLELER I++N+++   L EW
Sbjct: 725  QKIALLEKVLRRFLGGKTSQIGRYNLLKVKVKDLTLFRFQLELERGIQNNDTYWTTLGEW 784

Query: 1058 RTKPSVERVSFEVVARVEAERLKPLTVKKVRPFIEADLTSWSNLMSNISFTKFPSILVPP 1237
            RT+P+VE   FEV AR EA+ LKP  +KKV PFIE D +SWSNLMSN+SFTK  S LVPP
Sbjct: 785  RTRPAVEHSWFEVTARFEADILKPRLIKKVSPFIEVDSSSWSNLMSNMSFTKISSFLVPP 844

Query: 1238 EAFMLDVKW 1264
            E   LDV+W
Sbjct: 845  EPLTLDVRW 853


>ref|XP_004499101.1| PREDICTED: uncharacterized protein LOC101493524 [Cicer arietinum]
          Length = 390

 Score =  340 bits (873), Expect = 7e-91
 Identities = 179/375 (47%), Positives = 242/375 (64%), Gaps = 3/375 (0%)
 Frame = +2

Query: 149  TPSTTHESRPQPSHFLQDVLKAISARQRWDLGDIRVSKLDVRKAKFGRAQRIEFQIHIGK 328
            T S+TH +   P+H  QD+LKAISA+Q+WD  D+RV   D+ K +FG +Q   F+I  G 
Sbjct: 22   TSSSTHSN---PTHIFQDILKAISAKQKWDFNDVRVYNFDLAKLRFGTSQTYHFRIGSGN 78

Query: 329  NDLIFGFPDEVGLWKDLSEGGD---DFGFLINEVSSSMSVLDTFKVEGPMELWVSGDDEL 499
            ++    F D+V  W + +       D   L++  +S ++ LD  K+EGP EL V      
Sbjct: 79   DNFTLKFSDQVSSWNNNNNFATPKLDLETLVDRFTS-IAFLDDIKLEGPFELHVDELHHF 137

Query: 500  SIMLPLNTSHTGLKRILVGEGITVEVNGAREVSLFHTSDLGLSVNGSVVVKKDRSEFWPL 679
            S+ LP+N S+TGLK ++VGEGITVEV  ARE+S F+  DL    NGSV   K +SEFWP 
Sbjct: 138  SLSLPMNVSYTGLKHVIVGEGITVEVRRAREMSFFYRPDLDRQTNGSVACSKGKSEFWPF 197

Query: 680  WHSLCKPLPPIHIFGSSSLVAFRTRNRDAYIEITSLSKDMIELLPEKCYSGRHINKKRAC 859
              S C PL P++I GS+SL+A+  RN   +I  T +S+D +ELLPEKCY GR + +KRAC
Sbjct: 198  LQSTCVPLIPLNIIGSASLIAYGARNPYTHIGTTLISEDTVELLPEKCYHGR-VFRKRAC 256

Query: 860  PIDSXXXXXXXXXXXXXSFLGDRIRQNGLLGSLKAENKASTVVRFQLELERDIRSNESFK 1039
            P+ S             S LG +I Q+   G +KA  KA   V+F LELERD+ +N + +
Sbjct: 257  PVASLSLRLSMLEKILRSLLGHKILQDRFSGLIKANIKAYAAVKFPLELERDVGNNVT-R 315

Query: 1040 GILAEWRTKPSVERVSFEVVARVEAERLKPLTVKKVRPFIEADLTSWSNLMSNISFTKFP 1219
              L +WRT+PSVERV FE++ARVE  RLKP+ +KKV+PFIE+D  SW+NLMSN+S+TK  
Sbjct: 316  SALPDWRTRPSVERVWFEILARVEENRLKPVLIKKVKPFIESDSVSWANLMSNMSYTKLR 375

Query: 1220 SILVPPEAFMLDVKW 1264
             +L+PPEA  LDVKW
Sbjct: 376  PVLLPPEALTLDVKW 390


>gb|AGV54177.1| signal peptidase I [Phaseolus vulgaris]
          Length = 384

 Score =  337 bits (864), Expect = 8e-90
 Identities = 175/369 (47%), Positives = 237/369 (64%)
 Frame = +2

Query: 158  TTHESRPQPSHFLQDVLKAISARQRWDLGDIRVSKLDVRKAKFGRAQRIEFQIHIGKNDL 337
            T   S    +H LQDVL+A+SA+Q+WD  D+RV+KLD  K +FG +   EF+I +G  + 
Sbjct: 18   TAFASSSNLTHILQDVLRAVSAKQKWDSNDVRVAKLDAAKVRFGTSLSYEFRIGLGTGNF 77

Query: 338  IFGFPDEVGLWKDLSEGGDDFGFLINEVSSSMSVLDTFKVEGPMELWVSGDDELSIMLPL 517
               F D+V  W        D   L++ + S   +L T K+EGP  L V     LS+ LP+
Sbjct: 78   TLKFADQVATWNKFRTPFPDLPSLVHRLGS-FPLLPTLKLEGPFSLRVDSLHNLSLFLPM 136

Query: 518  NTSHTGLKRILVGEGITVEVNGAREVSLFHTSDLGLSVNGSVVVKKDRSEFWPLWHSLCK 697
            N S+TGLK+ILVGEGITVEV GA+E+SLF++SD+ L +NGS +    +S+ WP  HS C 
Sbjct: 137  NVSYTGLKQILVGEGITVEVKGAQEISLFYSSDIDLLMNGSAMCSGGKSDIWPFLHSTCM 196

Query: 698  PLPPIHIFGSSSLVAFRTRNRDAYIEITSLSKDMIELLPEKCYSGRHINKKRACPIDSXX 877
             + PI I GS+SLVA+R RN  A+I  T +S+D IE+LPEKCY G  + KK+ACP+DS  
Sbjct: 197  AVIPIRISGSASLVAYRARNPYAHIATTLISEDAIEMLPEKCYHG-CMFKKQACPLDSVS 255

Query: 878  XXXXXXXXXXXSFLGDRIRQNGLLGSLKAENKASTVVRFQLELERDIRSNESFKGILAEW 1057
                       S  G +I Q    G LKA  KAS VV+F++ELERDI ++ +F   + +W
Sbjct: 256  WKLSRLEKVLRSLFGRKIVQGQSFGLLKANIKASAVVKFRIELERDISNSVTFNRTIPDW 315

Query: 1058 RTKPSVERVSFEVVARVEAERLKPLTVKKVRPFIEADLTSWSNLMSNISFTKFPSILVPP 1237
            RT+PS ER  FE++ARVE   LKPL++K+V+PFIE    SW+NLMSN+S+T    + +PP
Sbjct: 316  RTRPSFERFWFEILARVEENSLKPLSIKRVKPFIEFVSVSWANLMSNMSYTMLRPVFLPP 375

Query: 1238 EAFMLDVKW 1264
            E   LDVKW
Sbjct: 376  EPLTLDVKW 384


>gb|EXB38625.1| putative thylakoidal processing peptidase 2 [Morus notabilis]
          Length = 787

 Score =  336 bits (862), Expect = 1e-89
 Identities = 181/356 (50%), Positives = 236/356 (66%)
 Frame = +2

Query: 197  QDVLKAISARQRWDLGDIRVSKLDVRKAKFGRAQRIEFQIHIGKNDLIFGFPDEVGLWKD 376
            +DVLK IS +Q+WDL  I+VS+LD+RK +FG + R EF++ IGK  L   F DEV  W +
Sbjct: 438  KDVLKEISVKQKWDLDAIKVSRLDLRKLRFGTSNRYEFRVGIGKTHLSAIFSDEVSSWNN 497

Query: 377  LSEGGDDFGFLINEVSSSMSVLDTFKVEGPMELWVSGDDELSIMLPLNTSHTGLKRILVG 556
                  D G L++EV S  ++LDTFK+EGP EL V   +  S++LP+N +H G  RILVG
Sbjct: 498  FRNPTADLGSLLDEVRS-FALLDTFKLEGPFELRVGDSNYSSLLLPMNRTHAGFNRILVG 556

Query: 557  EGITVEVNGAREVSLFHTSDLGLSVNGSVVVKKDRSEFWPLWHSLCKPLPPIHIFGSSSL 736
            EGIT+EV GA+EVS F  SD   +VN S  +   ++EFWP+ HS C  L  I +FGS++L
Sbjct: 557  EGITIEVRGAQEVSAFQASDFSSTVNVSHEIGNGKTEFWPIRHSFCGVLVQIQVFGSAAL 616

Query: 737  VAFRTRNRDAYIEITSLSKDMIELLPEKCYSGRHINKKRACPIDSXXXXXXXXXXXXXSF 916
             A+RT+N D  I+   +SK+ IELL EKCY G +I+KKR CP+DS             S+
Sbjct: 617  AAYRTKNPDNCIKTKRISKETIELLAEKCY-GNNIHKKRNCPVDSLGLRIAMLEKVLRSY 675

Query: 917  LGDRIRQNGLLGSLKAENKASTVVRFQLELERDIRSNESFKGILAEWRTKPSVERVSFEV 1096
             G+R+  NG +G  + +  A  ++RFQLELE D RSN++ +   A WRT+PSVERV F+V
Sbjct: 676  FGERL--NGTVGLFRGKISALALIRFQLELEMDSRSNDT-QQAKASWRTRPSVERVWFDV 732

Query: 1097 VARVEAERLKPLTVKKVRPFIEADLTSWSNLMSNISFTKFPSILVPPEAFMLDVKW 1264
            +ARVEAERLK L  K+  P    D   WSNL SNISFTKFPS+LVP EA  LDVKW
Sbjct: 733  LARVEAERLKLLVAKETNPSFVTDTAGWSNL-SNISFTKFPSLLVPSEALTLDVKW 787


>ref|XP_003549415.1| PREDICTED: uncharacterized protein LOC100804093 [Glycine max]
          Length = 393

 Score =  330 bits (847), Expect = 7e-88
 Identities = 182/376 (48%), Positives = 237/376 (63%), Gaps = 6/376 (1%)
 Frame = +2

Query: 155  STTHESRPQPSHFLQDVLKAISARQRWDLG---DIRVSKLDVRKAKFGRAQRIEFQIHIG 325
            S+TH +    +H LQDVLKA+SA+Q+WD     D+RV+K DV K  FG +   EF+I  G
Sbjct: 23   SSTHSNL---THILQDVLKAVSAKQKWDSSNNDDVRVTKFDVGKVMFGTSLSYEFRIRFG 79

Query: 326  ---KNDLIFGFPDEVGLWKDLSEGGDDFGFLINEVSSSMSVLDTFKVEGPMELWVSGDDE 496
                ++    F D+V  W        D   L++ + S   +L T K+EGP  L V     
Sbjct: 80   TDNNDNFTLKFVDQVATWNKFRTPFTDLPPLVHRLGS-FPLLHTLKLEGPFALRVDALHN 138

Query: 497  LSIMLPLNTSHTGLKRILVGEGITVEVNGAREVSLFHTSDLGLSVNGSVVVKKDRSEFWP 676
            LS+ LP+N S+TGLK ILVGEGITVEV  A+E+SLF++SDL L +NGS +  + +S+ WP
Sbjct: 139  LSLSLPMNVSYTGLKHILVGEGITVEVRRAQEISLFYSSDLDLQMNGSAMCSEGKSDLWP 198

Query: 677  LWHSLCKPLPPIHIFGSSSLVAFRTRNRDAYIEITSLSKDMIELLPEKCYSGRHINKKRA 856
               S C  L PI I GS+SLVA+R RN  A I  T +S+D IELLPEKCY G H+ +KRA
Sbjct: 199  FMRSTCMALIPIRISGSASLVAYRARNAYAQIATTLISEDAIELLPEKCYHG-HVFRKRA 257

Query: 857  CPIDSXXXXXXXXXXXXXSFLGDRIRQNGLLGSLKAENKASTVVRFQLELERDIRSNESF 1036
            CPIDS             SFL  +I ++ L G LKA  KAS VV+F LELERDI +N + 
Sbjct: 258  CPIDSLSLRLSLLEKVLRSFLDHKILKDQLFGLLKANIKASAVVKFPLELERDISNNATL 317

Query: 1037 KGILAEWRTKPSVERVSFEVVARVEAERLKPLTVKKVRPFIEADLTSWSNLMSNISFTKF 1216
               + +WRT+P  ER  FE++ARVE  +LKPL +K+VRPFIE+   SW+NLMSN+S+TK 
Sbjct: 318  NRTIPDWRTRPGFERFWFEILARVEENKLKPLLIKEVRPFIESVSVSWANLMSNMSYTKL 377

Query: 1217 PSILVPPEAFMLDVKW 1264
              +   PE   LDVKW
Sbjct: 378  RPVFFLPEPLTLDVKW 393


>ref|XP_003589258.1| hypothetical protein MTR_1g021180 [Medicago truncatula]
            gi|355478306|gb|AES59509.1| hypothetical protein
            MTR_1g021180 [Medicago truncatula]
          Length = 451

 Score =  323 bits (827), Expect = 2e-85
 Identities = 186/416 (44%), Positives = 248/416 (59%), Gaps = 39/416 (9%)
 Frame = +2

Query: 134  NASSPTPSTTHESRPQPSHFLQDVLKAISARQRWDLGDIRVSKLDVRKAKFGRAQRIEFQ 313
            NASS + S+TH +    +H  QD+LKAIS+RQ+WDL D+RV   DV K +FG +Q   F+
Sbjct: 25   NASSSS-SSTHSNI---THIFQDILKAISSRQKWDLNDVRVFNFDVAKIRFGTSQNYLFR 80

Query: 314  IHIGKNDLIFGFPDEVGLW---KDLSEGGDDFGFLINEVSSSMSVLDTFKVEGPMELWVS 484
            I   KN+    F DE+  W   K  +    D   L++++ SS++ LD  K+EGP EL V 
Sbjct: 81   IGSSKNNFTVKFSDEISSWNHNKFTTTPKPDLASLVDQL-SSIAFLDYIKLEGPFELRVH 139

Query: 485  GDDELSIMLP------------------------------------LNTSHTGLKRILVG 556
                LS+ LP                                    +N S+ GLK I+VG
Sbjct: 140  ESHHLSLSLPSSQITRGKRKLRKIIREIVKKDLEINEFDRRMIYDTMNVSYNGLKHIIVG 199

Query: 557  EGITVEVNGAREVSLFHTSDLGLSVNGSVVVKKDRSEFWPLWHSLCKPLPPIHIFGSSSL 736
            +GITVEV  ARE+S ++ SDL L  NGSV+    ++EFWP   S+C PL PI I GS+SL
Sbjct: 200  KGITVEVRRAREISFYYQSDLDLQRNGSVICSNQKNEFWPFLQSMCVPLIPIRIIGSASL 259

Query: 737  VAFRTRNRDAYIEITSLSKDMIELLPEKCYSGRHINKKRACPIDSXXXXXXXXXXXXXSF 916
            +A+  RN    I    +S+D +ELLPEKCY G  + +K+ACP+ S             S 
Sbjct: 260  IAYVARNPYVQIGTALISEDAVELLPEKCYHG-CVFRKQACPVASLNLRLILLEKILRSL 318

Query: 917  LGDRIRQNGLLGSLKAENKASTVVRFQLELERDIRSNESFKGILAEWRTKPSVERVSFEV 1096
            LG +I Q+ L G +KA  KA   V+F LELERD+ +N +    L +WRT+PSVERV FEV
Sbjct: 319  LGHKILQDRLSGLIKANIKAYAGVKFPLELERDVGNNATL-STLPDWRTRPSVERVWFEV 377

Query: 1097 VARVEAERLKPLTVKKVRPFIEADLTSWSNLMSNISFTKFPSILVPPEAFMLDVKW 1264
            +ARVE  RLKPL++KKV+PFIE+D  SW+NLMSN+S+TK   +L+PPEA  LDVKW
Sbjct: 378  MARVEDSRLKPLSIKKVKPFIESDSVSWANLMSNLSYTKLRPVLLPPEALTLDVKW 433


>ref|XP_004160620.1| PREDICTED: uncharacterized protein LOC101229456 [Cucumis sativus]
          Length = 763

 Score =  320 bits (821), Expect = 8e-85
 Identities = 182/389 (46%), Positives = 242/389 (62%), Gaps = 3/389 (0%)
 Frame = +2

Query: 107  IEAFISPSLNASSPTPSTTHE--SRPQPSHFLQDVLKAISARQRWDLGDIRVSKLDVRKA 280
            ++AF+   LN+ S   S  H   +    +H LQDVL  ++A+Q+WDL  I++ +LDV   
Sbjct: 380  LQAFLF--LNSLSIASSLNHSISNDDDNAHLLQDVLNDLAAKQKWDLEGIKILELDVESL 437

Query: 281  KFGRAQRIEFQIHIGKNDLIFGFPDEVGLWKDLSEGGDD-FGFLINEVSSSMSVLDTFKV 457
            +FG A+  E ++ +GK  L+  F DEV  WK  S      FG LIN + S M+ + TFK+
Sbjct: 438  RFGFAESYEIRLGLGKTRLLAKFSDEVSSWKKPSSANQTRFGSLINGIGS-MAAIRTFKI 496

Query: 458  EGPMELWVSGDDELSIMLPLNTSHTGLKRILVGEGITVEVNGAREVSLFHTSDLGLSVNG 637
             GP +L V G+  LS+ LP N +H G+KRILVGEGITVEV+ A EVS+F++SDL   +N 
Sbjct: 497  VGPFDLMVEGEARLSVSLPKNATHVGVKRILVGEGITVEVSEAEEVSVFYSSDLSKLLN- 555

Query: 638  SVVVKKDRSEFWPLWHSLCKPLPPIHIFGSSSLVAFRTRNRDAYIEITSLSKDMIELLPE 817
                   +   +P     C PL P+ + GS++L A+RT+N D YI    LSKD IELLP 
Sbjct: 556  ETRRSNGKIRTYPFRLPFCSPLLPLRVLGSATLSAYRTQNPDDYIRTRFLSKDSIELLPN 615

Query: 818  KCYSGRHINKKRACPIDSXXXXXXXXXXXXXSFLGDRIRQNGLLGSLKAENKASTVVRFQ 997
            KCY GR+ + + +  + S              +L + I QNGLL  +K + +A  VVRFQ
Sbjct: 616  KCY-GRNTHIENSPLLGSLKPQFHMLDTVFQRYLRNWILQNGLLAFVKVKMRACVVVRFQ 674

Query: 998  LELERDIRSNESFKGILAEWRTKPSVERVSFEVVARVEAERLKPLTVKKVRPFIEADLTS 1177
            LELE    +N S    LAEWRTKP+VER SFEV+AR++  RLKPL VKK++P I AD T 
Sbjct: 675  LELENTFGTNSSLYARLAEWRTKPTVERASFEVLARLDTVRLKPLAVKKLKPLIVADSTE 734

Query: 1178 WSNLMSNISFTKFPSILVPPEAFMLDVKW 1264
            W NL+ NISFTKFPS+LV PEA  LDVKW
Sbjct: 735  WRNLLPNISFTKFPSLLVSPEALTLDVKW 763


>ref|XP_004141368.1| PREDICTED: uncharacterized protein LOC101221060, partial [Cucumis
            sativus]
          Length = 761

 Score =  320 bits (821), Expect = 8e-85
 Identities = 182/389 (46%), Positives = 242/389 (62%), Gaps = 3/389 (0%)
 Frame = +2

Query: 107  IEAFISPSLNASSPTPSTTHE--SRPQPSHFLQDVLKAISARQRWDLGDIRVSKLDVRKA 280
            ++AF+   LN+ S   S  H   +    +H LQDVL  ++A+Q+WDL  I++ +LDV   
Sbjct: 378  LQAFLF--LNSLSIASSLNHSISNDDDNAHLLQDVLNDLAAKQKWDLEGIKILELDVESL 435

Query: 281  KFGRAQRIEFQIHIGKNDLIFGFPDEVGLWKDLSEGGDD-FGFLINEVSSSMSVLDTFKV 457
            +FG A+  E ++ +GK  L+  F DEV  WK  S      FG LIN + S M+ + TFK+
Sbjct: 436  RFGFAESYEIRLGLGKTRLLAKFSDEVSSWKKPSSANQTRFGSLINGIGS-MAAIRTFKI 494

Query: 458  EGPMELWVSGDDELSIMLPLNTSHTGLKRILVGEGITVEVNGAREVSLFHTSDLGLSVNG 637
             GP +L V G+  LS+ LP N +H G+KRILVGEGITVEV+ A EVS+F++SDL   +N 
Sbjct: 495  VGPFDLMVEGEARLSVSLPKNATHVGVKRILVGEGITVEVSEAEEVSVFYSSDLSKLLN- 553

Query: 638  SVVVKKDRSEFWPLWHSLCKPLPPIHIFGSSSLVAFRTRNRDAYIEITSLSKDMIELLPE 817
                   +   +P     C PL P+ + GS++L A+RT+N D YI    LSKD IELLP 
Sbjct: 554  ETRRSNGKIRTYPFRLPFCSPLLPLRVLGSATLSAYRTQNPDDYIRTRFLSKDSIELLPN 613

Query: 818  KCYSGRHINKKRACPIDSXXXXXXXXXXXXXSFLGDRIRQNGLLGSLKAENKASTVVRFQ 997
            KCY GR+ + + +  + S              +L + I QNGLL  +K + +A  VVRFQ
Sbjct: 614  KCY-GRNTHIENSPLLGSLKPQFHMLDTVFQRYLRNWILQNGLLAFVKVKMRACVVVRFQ 672

Query: 998  LELERDIRSNESFKGILAEWRTKPSVERVSFEVVARVEAERLKPLTVKKVRPFIEADLTS 1177
            LELE    +N S    LAEWRTKP+VER SFEV+AR++  RLKPL VKK++P I AD T 
Sbjct: 673  LELENTFGTNSSLYARLAEWRTKPTVERASFEVLARLDTVRLKPLAVKKLKPLIVADSTE 732

Query: 1178 WSNLMSNISFTKFPSILVPPEAFMLDVKW 1264
            W NL+ NISFTKFPS+LV PEA  LDVKW
Sbjct: 733  WRNLLPNISFTKFPSLLVSPEALTLDVKW 761


>gb|AAM61120.1| unknown [Arabidopsis thaliana]
          Length = 395

 Score =  319 bits (817), Expect = 2e-84
 Identities = 179/387 (46%), Positives = 237/387 (61%), Gaps = 1/387 (0%)
 Frame = +2

Query: 107  IEAFISPSLNASSPTPSTTHESRPQPSHFLQDVLKAISARQRWDLGDIRVSKLDVRKAKF 286
            +  FI     A +  PS   ES    +  LQDVLK IS +Q+W+L ++R SKL+V+K + 
Sbjct: 12   LSLFIQVLTLAVALDPSQPDESNITATPILQDVLKEISVKQKWNLEEVRFSKLEVKKIRI 71

Query: 287  GRAQRIEFQIHIGKNDLIFGFPDEVGLWKDLSEGGD-DFGFLINEVSSSMSVLDTFKVEG 463
            G ++R E +I +GK+  +F FPDEV  W+    G D +   L+ EV+SS  +     ++G
Sbjct: 72   GTSRRFEIRIRLGKSRFVFIFPDEVTDWRRSGGGRDVELQELVREVNSSKVLDPPLVLKG 131

Query: 464  PMELWVSGDDELSIMLPLNTSHTGLKRILVGEGITVEVNGAREVSLFHTSDLGLSVNGSV 643
            P EL V GDD LS+ LP+N SH+GLKR+LV EGI+VE+  A+ VSLFH+S    +     
Sbjct: 132  PFELRVDGDDRLSLSLPMNISHSGLKRVLVSEGISVEIREAQAVSLFHSSHRRYAATVDP 191

Query: 644  VVKKDRSEFWPLWHSLCKPLPPIHIFGSSSLVAFRTRNRDAYIEITSLSKDMIELLPEKC 823
            V  K  S  W  W S+C PLPPI I GS+SLVAFRT N    I+ + LS + I L  EKC
Sbjct: 192  VNIKQGSSLWSFWGSVCVPLPPIQIIGSASLVAFRTSNATTQIKTSYLSDEAIHLYAEKC 251

Query: 824  YSGRHINKKRACPIDSXXXXXXXXXXXXXSFLGDRIRQNGLLGSLKAENKASTVVRFQLE 1003
            Y   H  ++   P D              S LG+  RQ   + S+ A+ KAS +VRFQLE
Sbjct: 252  YYKAHTYRQHRFPNDLLGLKIHKLEKVLNS-LGNGTRQT--VSSVTAKLKASGMVRFQLE 308

Query: 1004 LERDIRSNESFKGILAEWRTKPSVERVSFEVVARVEAERLKPLTVKKVRPFIEADLTSWS 1183
            +ER I  NES       WRTKP +ERV FEV A++E ++LK + ++KV PFIE D  +WS
Sbjct: 309  IERSIGKNESVISKKVAWRTKPKIERVWFEVTAKIEGDKLKAVRLRKVVPFIEVDTEAWS 368

Query: 1184 NLMSNISFTKFPSILVPPEAFMLDVKW 1264
            +LMSN+SFTKFPS+LVP EA  LDVKW
Sbjct: 369  SLMSNMSFTKFPSLLVPQEALTLDVKW 395


>ref|NP_564503.1| uncharacterized protein [Arabidopsis thaliana]
            gi|9993349|gb|AAG11422.1|AC015449_4 Unknown protein
            [Arabidopsis thaliana] gi|30102708|gb|AAP21272.1|
            At1g47310 [Arabidopsis thaliana]
            gi|110736510|dbj|BAF00222.1| hypothetical protein
            [Arabidopsis thaliana] gi|332194034|gb|AEE32155.1|
            uncharacterized protein AT1G47310 [Arabidopsis thaliana]
          Length = 395

 Score =  318 bits (816), Expect = 3e-84
 Identities = 177/387 (45%), Positives = 238/387 (61%), Gaps = 1/387 (0%)
 Frame = +2

Query: 107  IEAFISPSLNASSPTPSTTHESRPQPSHFLQDVLKAISARQRWDLGDIRVSKLDVRKAKF 286
            +  FI     A +  PS   ES    +  LQDVLK IS +Q+W+L ++R SKL+V+K + 
Sbjct: 12   LSLFIQVLTLAVALDPSQPDESNITATPILQDVLKEISVKQKWNLEEVRFSKLEVKKIRI 71

Query: 287  GRAQRIEFQIHIGKNDLIFGFPDEVGLWKDLSEGGD-DFGFLINEVSSSMSVLDTFKVEG 463
            G ++R E +I +GK+  +F FPDE+  W+    G D +   L+ EV+SS  +     ++G
Sbjct: 72   GTSRRFEIRIRLGKSRFVFIFPDEITDWRRSGGGSDVELQELVREVNSSKVLDPPLVLKG 131

Query: 464  PMELWVSGDDELSIMLPLNTSHTGLKRILVGEGITVEVNGAREVSLFHTSDLGLSVNGSV 643
            P EL V G+D LS+ LP+N SH+GLKR+LV EGI+VE+  A+ VSLFH+S    +     
Sbjct: 132  PFELLVDGNDRLSLSLPMNISHSGLKRVLVSEGISVEIREAQAVSLFHSSHRRYAATVDP 191

Query: 644  VVKKDRSEFWPLWHSLCKPLPPIHIFGSSSLVAFRTRNRDAYIEITSLSKDMIELLPEKC 823
            V  K+ S  W  W S+C PLPPI I GS+SLVAFRT N    I+ + LS + I L  EKC
Sbjct: 192  VNIKEGSSLWSFWGSVCVPLPPIQIIGSASLVAFRTSNATTQIKTSYLSDEAIHLYAEKC 251

Query: 824  YSGRHINKKRACPIDSXXXXXXXXXXXXXSFLGDRIRQNGLLGSLKAENKASTVVRFQLE 1003
            Y   H  ++   P D              S LG+  RQ   + S+ A+ KAS +VRFQLE
Sbjct: 252  YYKAHTYRQHRFPNDLLGLKIHKLEKVLNS-LGNGTRQT--VSSVTAKLKASGMVRFQLE 308

Query: 1004 LERDIRSNESFKGILAEWRTKPSVERVSFEVVARVEAERLKPLTVKKVRPFIEADLTSWS 1183
            +ER I  NES       WRTKP +ERV FEV A++E ++LK + ++KV PFIE D  +WS
Sbjct: 309  IERSIGKNESVISKKVAWRTKPKIERVWFEVTAKIEGDKLKAVRLRKVVPFIEVDTEAWS 368

Query: 1184 NLMSNISFTKFPSILVPPEAFMLDVKW 1264
            +LMSN+SFTKFPS+LVP EA  LDVKW
Sbjct: 369  SLMSNMSFTKFPSLLVPQEALTLDVKW 395


>gb|EYU36280.1| hypothetical protein MIMGU_mgv1a007411mg [Mimulus guttatus]
          Length = 408

 Score =  314 bits (804), Expect = 7e-83
 Identities = 180/378 (47%), Positives = 238/378 (62%), Gaps = 6/378 (1%)
 Frame = +2

Query: 149  TPSTTHESRPQPSHFLQDVLKAISARQRWDLGDIRVSKLDVRKAKFGRAQRIEFQIHIGK 328
            T  T  +  PQP  FLQ VL  I+ R++W +  IRVS+LDV+KAKF   QR EF++  GK
Sbjct: 42   TALTESDITPQPPQFLQGVLDVIANREKWTVEYIRVSELDVKKAKFRSVQRYEFRVRAGK 101

Query: 329  NDLIFGFPDEVGLWKDLSE----GGDDFGFLINEVSSSMSVLDTFKVEGPMELWVSGD-D 493
             +++    +E   WK L E       DF  +  ++ S  +V+D+FK+EGP EL V+ D D
Sbjct: 102  AEIVLKMSEEGSEWKKLLEVRTNETSDFESVARKIGSK-AVIDSFKIEGPFELRVARDVD 160

Query: 494  ELSIMLPLNTSHTGLKRILVGEGITVEVNGAREVSLFHTSDLGLSVNGSVVVKKDRSEFW 673
            +LS+MLPLNTSH+GL RI V EGITVEV GA E++++H SD  L  +     K       
Sbjct: 161  QLSLMLPLNTSHSGLHRISVSEGITVEVKGAEEITIYHPSDNHLPRDFFTYGK------- 213

Query: 674  PLWHSLCKPLPPIHIFGSSSLVAFRTRNRDAYIEITSLSKDMIELLPEKCYSGRHINKKR 853
              W + C  L PIHI GS+S+VA+++R  D+ I+    S+D I+LLP+KC+   +  K R
Sbjct: 214  --WPAFCTALLPIHIVGSASVVAYKSRRPDSLIQTALTSEDAIKLLPDKCHIQPNYKKPR 271

Query: 854  ACPIDSXXXXXXXXXXXXXSFLGDRI-RQNGLLGSLKAENKASTVVRFQLELERDIRSNE 1030
               + S             SFL DR    N  LGSLK   +A  V RF L LERDIR+N+
Sbjct: 272  HL-LSSLRIRITLLEELLRSFLSDRGGNANVALGSLKTRIRAVNVFRFHLGLERDIRAND 330

Query: 1031 SFKGILAEWRTKPSVERVSFEVVARVEAERLKPLTVKKVRPFIEADLTSWSNLMSNISFT 1210
            +    LAEWRTKP++ERV FEVVAR+E E LKP+TVKKV P I+ D  SW++++SN+SFT
Sbjct: 331  TSWTTLAEWRTKPTIERVWFEVVARIEGEVLKPITVKKVGPLIDTDSFSWNSILSNLSFT 390

Query: 1211 KFPSILVPPEAFMLDVKW 1264
            K PS+LVPPEA  LDVKW
Sbjct: 391  KLPSVLVPPEALTLDVKW 408


>ref|XP_002891379.1| hypothetical protein ARALYDRAFT_473912 [Arabidopsis lyrata subsp.
            lyrata] gi|297337221|gb|EFH67638.1| hypothetical protein
            ARALYDRAFT_473912 [Arabidopsis lyrata subsp. lyrata]
          Length = 391

 Score =  314 bits (804), Expect = 7e-83
 Identities = 180/388 (46%), Positives = 244/388 (62%), Gaps = 2/388 (0%)
 Frame = +2

Query: 107  IEAFISPSLNASSPTPSTTHESRPQPSHFLQDVLKAISARQRWDLGDIRVSKLDVRKAKF 286
            +  FI     A +  PS   ES    +  LQDVLK IS +Q+W+L ++R SKL+V+K + 
Sbjct: 12   LSLFIQVLTLAIALDPSQPDESNITATPILQDVLKEISVKQKWNLEEVRFSKLEVKKIRI 71

Query: 287  GRAQRIEFQIHIGKNDLIFGFPDEVGLWKDLSEGGDDFGF--LINEVSSSMSVLDTFKVE 460
            G  +R E +I +GK+  +F FPDEV  W+  S GG D     ++ EV+SS  VLD+  ++
Sbjct: 72   GTGRRFEIRIRLGKSRFVFIFPDEVTDWRR-SVGGKDVELQEVVREVNSS-KVLDSLVLK 129

Query: 461  GPMELWVSGDDELSIMLPLNTSHTGLKRILVGEGITVEVNGAREVSLFHTSDLGLSVNGS 640
            GP EL V GDD LS+ LP+N SH GLKR+LV EGI+VE+  A+ VSLFH+S    +   +
Sbjct: 130  GPFELRVDGDDRLSLALPMNISHNGLKRVLVSEGISVEIREAQAVSLFHSSHRRYA---A 186

Query: 641  VVVKKDRSEFWPLWHSLCKPLPPIHIFGSSSLVAFRTRNRDAYIEITSLSKDMIELLPEK 820
             V  K+ +       S+C PLPPI I GS+SLVAFRT N D+ I+ + LS + I++ P+K
Sbjct: 187  TVDMKNGNCLLSFLGSVCVPLPPIQILGSASLVAFRTSNTDSQIKTSYLSDEAIQIHPDK 246

Query: 821  CYSGRHINKKRACPIDSXXXXXXXXXXXXXSFLGDRIRQNGLLGSLKAENKASTVVRFQL 1000
            CY   H  ++   P D              S LG+  RQ   + S+ A+ KAS +VRFQL
Sbjct: 247  CYDKAHTYRQHRFPTDLLGLKINKLEKVLSS-LGNGTRQT--VSSVTAKLKASGMVRFQL 303

Query: 1001 ELERDIRSNESFKGILAEWRTKPSVERVSFEVVARVEAERLKPLTVKKVRPFIEADLTSW 1180
            E+ER I  NES      EWRTKP +ERV FE+ A++E ++LK + ++KV PFIE D  +W
Sbjct: 304  EIERSIGKNESVISKRVEWRTKPKIERVWFEITAKIEGDKLKAVGMRKVVPFIEVDTEAW 363

Query: 1181 SNLMSNISFTKFPSILVPPEAFMLDVKW 1264
            S+LMSN+SFTKFPS+LVP EA  LDVKW
Sbjct: 364  SSLMSNMSFTKFPSLLVPQEALTLDVKW 391


>ref|XP_006393531.1| hypothetical protein EUTSA_v10011550mg [Eutrema salsugineum]
            gi|557090109|gb|ESQ30817.1| hypothetical protein
            EUTSA_v10011550mg [Eutrema salsugineum]
          Length = 398

 Score =  311 bits (796), Expect = 6e-82
 Identities = 184/390 (47%), Positives = 246/390 (63%), Gaps = 4/390 (1%)
 Frame = +2

Query: 107  IEAFISPSLNASSPTPSTTHESRPQPSHFLQDVLKAISARQRWDLGDIRVSKLDVRKAKF 286
            +  FI     A +  PS   ES    +  LQDVLK IS RQ+W+L ++R SKL+V+K + 
Sbjct: 13   LSLFIQALTLAVALDPSQPDESTITATPILQDVLKEISVRQKWNLTEVRFSKLEVKKLRV 72

Query: 287  GRAQRIEFQIHIGKNDLIFGFPDEVGLWKDLSEGGDDFGFL--INEVSSSMSVLDTFKVE 460
            G  +  E +I +GK+  +F FPDEV  W+  S GG     +  + EV+SS  VLD   ++
Sbjct: 73   GTGRSFEIRIRLGKSRFVFVFPDEVTDWRR-SGGGKQVELMEVVREVNSS-KVLDPIVLK 130

Query: 461  GPMELWVSGDDEL-SIMLPLNTSHTGLKRILVGEGITVEVNGAREVSLFHTSDLGLSVNG 637
            GP+EL V+G+D L S+ LP+N SH GLKR+LV EGI+VE+  A+ VSLFH+S+   + + 
Sbjct: 131  GPLELRVAGEDNLLSLALPMNISHNGLKRVLVSEGISVEIRKAQTVSLFHSSNRRFAASV 190

Query: 638  SVVVKKDRSEFWP-LWHSLCKPLPPIHIFGSSSLVAFRTRNRDAYIEITSLSKDMIELLP 814
              V   +RS  W  L  S+C PLPPI I GS+SLVAFRT  +D+ I+ + L+ + I+LLP
Sbjct: 191  EPVDMNERSCLWSSLGGSVCVPLPPIQIDGSASLVAFRTPYKDSRIKTSYLTNEAIQLLP 250

Query: 815  EKCYSGRHINKKRACPIDSXXXXXXXXXXXXXSFLGDRIRQNGLLGSLKAENKASTVVRF 994
            EKCY   H  K+     D              S LG++      + S+ A+ KAS +VRF
Sbjct: 251  EKCYHKAHTYKQNHLSTDLLGLKIKKLERVLSS-LGNKGNAE-TVSSMTAKLKASGMVRF 308

Query: 995  QLELERDIRSNESFKGILAEWRTKPSVERVSFEVVARVEAERLKPLTVKKVRPFIEADLT 1174
            QLE+ER I SNES      EWRTKP +ERV FEV A+VE ++LK + ++KV PFIE D  
Sbjct: 309  QLEIERRIGSNESVTSKRLEWRTKPKIERVWFEVAAKVEGDKLKAVGMRKVVPFIEVDTE 368

Query: 1175 SWSNLMSNISFTKFPSILVPPEAFMLDVKW 1264
            +WS+LMSN+SFTKFPSILVP EA  LDVKW
Sbjct: 369  AWSSLMSNMSFTKFPSILVPQEALTLDVKW 398


>ref|XP_006307471.1| hypothetical protein CARUB_v10009097mg [Capsella rubella]
            gi|482576182|gb|EOA40369.1| hypothetical protein
            CARUB_v10009097mg [Capsella rubella]
          Length = 454

 Score =  306 bits (785), Expect = 1e-80
 Identities = 177/387 (45%), Positives = 235/387 (60%), Gaps = 1/387 (0%)
 Frame = +2

Query: 107  IEAFISPSLNASSPTPSTTHESRPQPSHFLQDVLKAISARQRWDLGDIRVSKLDVRKAKF 286
            +  FI     A +  PS   ES       LQDVLK IS +Q+W+L ++R  KL+V+K + 
Sbjct: 72   LSLFIQALTLAVALDPSQPDESNITAIPILQDVLKEISMKQKWNLEEVRFKKLEVKKLRI 131

Query: 287  GRAQRIEFQIHIGKNDLIFGFPDEVGLWKDLSEGGD-DFGFLINEVSSSMSVLDTFKVEG 463
            G  +R E +I +GK+  +F FPDEV  W     G D +   ++ EV+S+  VLD   ++G
Sbjct: 132  GVGRRFEIRIRLGKSRFVFVFPDEVTDWSRSGGGRDVELHEVVREVNST-KVLDPIVLKG 190

Query: 464  PMELWVSGDDELSIMLPLNTSHTGLKRILVGEGITVEVNGAREVSLFHTSDLGLSVNGSV 643
            P EL V GD   S+ LP+N SH+GLKR+LV EGI+VE+ GA+ VSLFH+S    +     
Sbjct: 191  PFELRVDGDSRFSLALPMNISHSGLKRVLVSEGISVEIRGAQAVSLFHSSHRRYAATVDP 250

Query: 644  VVKKDRSEFWPLWHSLCKPLPPIHIFGSSSLVAFRTRNRDAYIEITSLSKDMIELLPEKC 823
            V  K+ +       S+C PLPPI I GS+SLVAFRTRN D+ I+ + LS + I L  EKC
Sbjct: 251  VNIKEGNCLRLFRSSVCAPLPPIQIIGSASLVAFRTRNADSQIKTSYLSNEAIHLHAEKC 310

Query: 824  YSGRHINKKRACPIDSXXXXXXXXXXXXXSFLGDRIRQNGLLGSLKAENKASTVVRFQLE 1003
            Y   H  ++   P D              S LG+  RQ   + S+ A+ K S +VRFQLE
Sbjct: 311  YYKAHTYRQHGFPTDLLGLKINKLEKVLSS-LGNGTRQT--VTSVTAKLKPSGMVRFQLE 367

Query: 1004 LERDIRSNESFKGILAEWRTKPSVERVSFEVVARVEAERLKPLTVKKVRPFIEADLTSWS 1183
            +ER I  NES      EWRTKP +ERV FEV A+VE ++LK   ++KV PFIE D  +WS
Sbjct: 368  IERSIGKNESVTSKKIEWRTKPKIERVWFEVTAKVERDKLKAAGMRKVVPFIEVDTEAWS 427

Query: 1184 NLMSNISFTKFPSILVPPEAFMLDVKW 1264
            ++MSN+SFTKFPS+LVP EA  LDVKW
Sbjct: 428  SMMSNMSFTKFPSLLVPQEALTLDVKW 454


>ref|XP_007225638.1| hypothetical protein PRUPE_ppa008077mg [Prunus persica]
            gi|462422574|gb|EMJ26837.1| hypothetical protein
            PRUPE_ppa008077mg [Prunus persica]
          Length = 346

 Score =  292 bits (747), Expect = 3e-76
 Identities = 150/271 (55%), Positives = 193/271 (71%)
 Frame = +2

Query: 200  DVLKAISARQRWDLGDIRVSKLDVRKAKFGRAQRIEFQIHIGKNDLIFGFPDEVGLWKDL 379
            DVLK ISA+ +W L DIRVS+LD  + +FG AQR EF++  GK  +   F D+V  WK  
Sbjct: 68   DVLKKISAKHKWYLQDIRVSRLDASRVRFGSAQRYEFRVGFGKIPVGVLFSDDVASWKKF 127

Query: 380  SEGGDDFGFLINEVSSSMSVLDTFKVEGPMELWVSGDDELSIMLPLNTSHTGLKRILVGE 559
             +    FG L+ E+SS M+V+DTFKVEGP EL V G   LS+ LP+NT+++G KR+LVG+
Sbjct: 128  RQPRTHFGSLVKELSS-MAVVDTFKVEGPFELRVGGIHHLSLSLPMNTTYSGFKRVLVGK 186

Query: 560  GITVEVNGAREVSLFHTSDLGLSVNGSVVVKKDRSEFWPLWHSLCKPLPPIHIFGSSSLV 739
            GITVEV+GA EVS+FH SDLGLS  GS  + K++SEFWP+WHS C PL PI + G ++LV
Sbjct: 187  GITVEVSGATEVSVFHASDLGLSSKGSGAIGKEKSEFWPIWHSYCTPLFPIRVLGPATLV 246

Query: 740  AFRTRNRDAYIEITSLSKDMIELLPEKCYSGRHINKKRACPIDSXXXXXXXXXXXXXSFL 919
            A++TRN DAYIE   +SK++IE LPEKCY   H  KKRACPIDS             SFL
Sbjct: 247  AYKTRNPDAYIETKFMSKEIIEFLPEKCYRS-HAYKKRACPIDSLRLRISMLESIWKSFL 305

Query: 920  GDRIRQNGLLGSLKAENKASTVVRFQLELER 1012
            GDRIRQ+GL G ++ + KASTVVRF++++ R
Sbjct: 306  GDRIRQSGLSGFVEGKIKASTVVRFKIKVAR 336

