BLASTX nr result

ID: Rehmannia23_contig00014718 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia23_contig00014718
         (1609 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006354938.1| PREDICTED: uncharacterized protein LOC102588...   360   1e-96
ref|XP_004238590.1| PREDICTED: uncharacterized protein LOC101263...   352   2e-94
gb|EOY14602.1| Uncharacterized protein TCM_033924 [Theobroma cacao]   321   5e-85
ref|XP_002326915.1| predicted protein [Populus trichocarpa] gi|5...   305   5e-80
ref|XP_002510285.1| signal peptidase I, putative [Ricinus commun...   290   9e-76
ref|XP_004160620.1| PREDICTED: uncharacterized protein LOC101229...   275   3e-71
ref|XP_004141368.1| PREDICTED: uncharacterized protein LOC101221...   275   3e-71
gb|ESW32807.1| hypothetical protein PHAVU_001G018600g [Phaseolus...   274   9e-71
gb|AGV54177.1| signal peptidase I [Phaseolus vulgaris]                273   2e-70
gb|EXB38625.1| putative thylakoidal processing peptidase 2 [Moru...   270   1e-69
gb|EPS59742.1| hypothetical protein M569_15063, partial [Genlise...   269   3e-69
ref|XP_004499101.1| PREDICTED: uncharacterized protein LOC101493...   267   8e-69
ref|XP_002891379.1| hypothetical protein ARALYDRAFT_473912 [Arab...   267   8e-69
gb|AAM61120.1| unknown [Arabidopsis thaliana]                         266   2e-68
ref|XP_003549415.1| PREDICTED: uncharacterized protein LOC100804...   266   2e-68
ref|XP_006393531.1| hypothetical protein EUTSA_v10011550mg [Eutr...   263   1e-67
ref|NP_564503.1| uncharacterized protein [Arabidopsis thaliana] ...   263   2e-67
ref|XP_006307471.1| hypothetical protein CARUB_v10009097mg [Caps...   258   5e-66
ref|XP_003589258.1| hypothetical protein MTR_1g021180 [Medicago ...   251   6e-64
gb|EMJ26837.1| hypothetical protein PRUPE_ppa008077mg [Prunus pe...   204   1e-49
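
Note: the summary rows above can be read programmatically. The following is a minimal sketch, not part of the BLAST output. It assumes each row ends with two whitespace-separated fields (bit score, then E-value) and that the first token is the database identifier. The names Hit and parse_hit_line are hypothetical helpers introduced for illustration only.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    accession: str      # e.g. "gb|AGV54177.1|"
    description: str    # possibly truncated with "..." in the report
    bits: float
    evalue: float


def parse_hit_line(line: str) -> Hit:
    """Split one summary row into identifier, description, score and E-value."""
    # The last two whitespace-separated fields are the bit score and E-value.
    head, bits, evalue = line.rstrip().rsplit(None, 2)
    # The first token of the remainder is the database identifier.
    accession, _, description = head.partition(" ")
    # Legacy BLAST may print very small E-values without a leading digit
    # (for example "e-140"); prefix "1" so float() accepts the value.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return Hit(accession, description.strip(), float(bits), float(evalue))


row = "gb|AGV54177.1| signal peptidase I [Phaseolus vulgaris]                273   2e-70"
hit = parse_hit_line(row)
print(hit.accession, hit.bits, hit.evalue)
```

Descriptions ending in "..." are truncated in the report itself and are kept as they appear; the full titles are given in the alignment section.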

>ref|XP_006354938.1| PREDICTED: uncharacterized protein LOC102588271 [Solanum tuberosum]
          Length = 420

 Score =  360 bits (923), Expect = 1e-96
 Identities = 192/387 (49%), Positives = 257/387 (66%), Gaps = 6/387 (1%)
 Frame = +3

Query: 216  FNLSHFLYPKVTTFTESNTPSQPPNFLQGVLDAIVKKEKWVLEDIRVSELDVKRDKYRSG 395
            FN+SHFLYP++       +   PP+FL+ VL+ I ++EKW L+D+RVS+LDVK+ K+ + 
Sbjct: 37   FNISHFLYPRINYEEYPQSSPNPPSFLEDVLEGIAEREKWDLQDLRVSKLDVKKSKFGTF 96

Query: 396  LRYELQVRVGKAEIVLKMYDEVSEWKKLVAPWKNGTSDFEALARSIASKAVIDSLKIEGP 575
             +YE +VR+GK E V  M DEVS+WK    P KN  SDFE+L + I SK  +D LKI+GP
Sbjct: 97   RKYEFRVRIGKTEFVFMMADEVSQWKSFHFPNKN-ESDFESLVKEIGSKVTLDVLKIQGP 155

Query: 576  FELRVTGDDDQFSLMLPLNTSHYGLRRISVGEGITIEVKGAEEISFFHPS--VNQLPYNV 749
            FEL  TGDD   SL  PLN+S+ GL++I VGEGIT+EVKGA+EIS F+ S  +  +  ++
Sbjct: 156  FELYATGDD-YLSLTFPLNSSYTGLKKILVGEGITVEVKGADEISMFNISDLLKLVNGSI 214

Query: 750  LTWSNVGSIWH---SLCTALLPIRILGSASVVAYRNQRPTALIQTAFSSTDSVELLPDKC 920
            LT S  G   +   S C  LLP+ + G ASV+AY  + P   I+TA  S  S++LL +KC
Sbjct: 215  LTKSGSGQFRYMSQSSCIPLLPVHVRGPASVLAYITRNPDLRIETASVSKRSIKLLSEKC 274

Query: 921  YIRPNYTKPWHLLSS-LSNRIALLENVLRSFINERGNIDAALGSVKTRIRPLTIFRFQLE 1097
            Y R  Y K W L +  LS +I LLE +LR F+  + +  A    +K +++ LT+FRFQLE
Sbjct: 275  YTRHIYRK-WSLYNDFLSQKITLLEKILRRFLGGKTSEIARFNLIKVKVKDLTLFRFQLE 333

Query: 1098 LERDIKINDTFWSTLAEWRTRPTIERVLFEVVARIEGEVLKPLVIKRVRPLMDTDSFAWX 1277
            LER I+ NDT+W+TL EWRTRP +E   FEV AR E E+LKP +IK+VRP ++ DS +W 
Sbjct: 334  LERGIQNNDTYWTTLGEWRTRPAVEHSWFEVTARFEAEILKPRLIKKVRPFIEVDSSSWS 393

Query: 1278 XXXXXXXFTKFPSVLVPPEALTLDVKW 1358
                   FTK  S LVPPE LTLDV+W
Sbjct: 394  NLMSNMSFTKISSFLVPPEPLTLDVRW 420
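
Note: the header statistics of the alignment above can be checked against its coordinates. The sketch below is an illustration, not part of the BLAST output. It assumes a forward-strand hit, 1-based nucleotide coordinates on the Query lines, and that the percentages are truncated rather than rounded, which is what the figures in this report suggest (192/387 is about 49.6% but is printed as 49%). The function names are hypothetical.

```python
def forward_frame(query_start: int) -> int:
    """Reading frame (1, 2 or 3) implied by a 1-based nucleotide start on the plus strand."""
    return (query_start - 1) % 3 + 1


def translated_length(query_start: int, query_end: int) -> int:
    """Number of codons covered by an inclusive nucleotide range."""
    return (query_end - query_start + 1) // 3


def truncated_percent(count: int, alignment_length: int) -> int:
    """Integer percentage, truncated (not rounded), as the report appears to print it."""
    return (100 * count) // alignment_length


# Values taken from the first alignment (XP_006354938.1).
print(forward_frame(216))            # 3, matching "Frame = +3"
print(translated_length(216, 1358))  # 381 translated query residues
print(truncated_percent(192, 387))   # 49, matching "Identities = 192/387 (49%)"
print(truncated_percent(257, 387))   # 66, matching "Positives = 257/387 (66%)"
```

The alignment length (387) exceeds the 381 translated query residues because gap characters inserted into the query row also occupy alignment columns.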


>ref|XP_004238590.1| PREDICTED: uncharacterized protein LOC101263904 [Solanum
            lycopersicum]
          Length = 853

 Score =  352 bits (904), Expect = 2e-94
 Identities = 191/386 (49%), Positives = 252/386 (65%), Gaps = 5/386 (1%)
 Frame = +3

Query: 216  FNLSHFLYPKVTTFTESNTPSQPPNFLQGVLDAIVKKEKWVLEDIRVSELDVKRDKYRSG 395
            FN+SHFLYP++       +   PP+FL+ VL  I ++EKW L+D+RVS+LDVK+ K+ + 
Sbjct: 470  FNISHFLYPRINYEEYPQSSPNPPSFLEDVLKGIAEREKWDLQDLRVSKLDVKKSKFGTL 529

Query: 396  LRYELQVRVGKAEIVLKMYDEVSEWKKLVAPWKNGTSDFEALARSIASKAVIDSLKIEGP 575
             RYE +VR+GK E V  M DEVS+WK L  P KN  SDFE+L + I SKA +D LKI+GP
Sbjct: 530  RRYEFRVRIGKTEFVFMMADEVSQWKGLHFPNKN-ESDFESLVKEIGSKATLDVLKIQGP 588

Query: 576  FELRVTGDDDQFSLMLPLNTSHYGLRRISVGEGITIEVKGAEEISFFHPS--VNQLPYNV 749
            FEL  TGDD   SL LPLN+S+ GL++I V EGIT+EVKGA+EIS F+ S  +  +  ++
Sbjct: 589  FELYATGDD-YLSLTLPLNSSYTGLKKILVDEGITVEVKGADEISMFNISDLLKLVNGSM 647

Query: 750  LTWSNVGSIWHSL---CTALLPIRILGSASVVAYRNQRPTALIQTAFSSTDSVELLPDKC 920
            LT S  G   + L   C  LLP+ + G ASV+AY  + P   I+T F S  S++LL  KC
Sbjct: 648  LTKSGSGQYRYMLQSSCIPLLPVHVKGPASVLAYITRNPDLRIETVFVSRRSIKLLSQKC 707

Query: 921  YIRPNYTKPWHLLSSLSNRIALLENVLRSFINERGNIDAALGSVKTRIRPLTIFRFQLEL 1100
            Y R  Y K        S +IALLE VLR F+  + +       +K +++ LT+FRFQLEL
Sbjct: 708  YTRHIYRKWSSYNDFQSQKIALLEKVLRRFLGGKTSQIGRYNLLKVKVKDLTLFRFQLEL 767

Query: 1101 ERDIKINDTFWSTLAEWRTRPTIERVLFEVVARIEGEVLKPLVIKRVRPLMDTDSFAWXX 1280
            ER I+ NDT+W+TL EWRTRP +E   FEV AR E ++LKP +IK+V P ++ DS +W  
Sbjct: 768  ERGIQNNDTYWTTLGEWRTRPAVEHSWFEVTARFEADILKPRLIKKVSPFIEVDSSSWSN 827

Query: 1281 XXXXXXFTKFPSVLVPPEALTLDVKW 1358
                  FTK  S LVPPE LTLDV+W
Sbjct: 828  LMSNMSFTKISSFLVPPEPLTLDVRW 853


>gb|EOY14602.1| Uncharacterized protein TCM_033924 [Theobroma cacao]
          Length = 387

 Score =  321 bits (823), Expect = 5e-85
 Identities = 172/383 (44%), Positives = 238/383 (62%), Gaps = 3/383 (0%)
 Frame = +3

Query: 219  NLSHFLYPKVTTFTESNTPSQPPNFLQGVLDAIVKKEKWVLEDIRVSELDVKRDKYRSGL 398
            +L  FL  ++ + + +   S+ P  LQ VL+ I  K++W LE +  S+L+V + ++ +G 
Sbjct: 9    SLLFFLLSEILSLSFNPLQSKNPQILQDVLEKIALKQEWELEGLNFSKLEVSKARFGAGK 68

Query: 399  RYELQVRVGKAEIVLKMYDEVSEWKKLVAPWKNGTSDFEALARSIASKAVIDSLKIEGPF 578
            RYE ++R GK  ++ K  DEVS W K     K    DF    + I S A +DS K+EGPF
Sbjct: 69   RYEFRIRFGKTHLLFKFPDEVSSWSKFR---KGSGDDFLDFVKEINSTAGLDSFKMEGPF 125

Query: 579  ELRVTGDDDQFSLMLPLNTSHYGLRRISVGEGITIEVKGAEEISFFHPSVNQLPYNVLTW 758
            ELR+   + Q SL+LPLNTSH  L+R+ VGEGIT+EV GA+E+S FH     LP N    
Sbjct: 126  ELRLA-PNHQASLLLPLNTSHTDLKRVLVGEGITVEVSGAQEVSLFHAFSFGLPVNESEV 184

Query: 759  SNVGSIW---HSLCTALLPIRILGSASVVAYRNQRPTALIQTAFSSTDSVELLPDKCYIR 929
                  W    S C  LLP+ +LGS S+VAY+ + P A I+  F S+D++ELLP+KCY  
Sbjct: 185  EEKTGYWPFRQSFCMPLLPVNVLGSVSLVAYQTRNPDAHIEAVFLSSDTIELLPEKCYGD 244

Query: 930  PNYTKPWHLLSSLSNRIALLENVLRSFINERGNIDAALGSVKTRIRPLTIFRFQLELERD 1109
              Y K  + + S+S RI+ L  VLR+F+ +R N +    S+  + +   I  FQLELE+ 
Sbjct: 245  RAYMKQSYPMDSISLRISKLRKVLRTFLGDRDNGNGFSSSLNVKTKASPIIHFQLELEKT 304

Query: 1110 IKINDTFWSTLAEWRTRPTIERVLFEVVARIEGEVLKPLVIKRVRPLMDTDSFAWXXXXX 1289
            I  N+T    LAEWR++PT+ER+ F+V ARIE E LKPL+IK+VRP +  D+ +W     
Sbjct: 305  IGKNETVRGMLAEWRSKPTVERLWFDVTARIEAEKLKPLMIKKVRPFVGVDTVSWSNLLS 364

Query: 1290 XXXFTKFPSVLVPPEALTLDVKW 1358
               FTKFPS+LVPPEALTLDVKW
Sbjct: 365  NISFTKFPSILVPPEALTLDVKW 387


>ref|XP_002326915.1| predicted protein [Populus trichocarpa]
            gi|566202275|ref|XP_006375011.1| hypothetical protein
            POPTR_0014s03560g [Populus trichocarpa]
            gi|550323325|gb|ERP52808.1| hypothetical protein
            POPTR_0014s03560g [Populus trichocarpa]
          Length = 398

 Score =  305 bits (780), Expect = 5e-80
 Identities = 172/381 (45%), Positives = 235/381 (61%), Gaps = 6/381 (1%)
 Frame = +3

Query: 234  LYPKVTTFTESNTPSQPPNFLQGVLDAIVKKEKWVLEDIRVSELDVKRDKYRSGLRYELQ 413
            + P   +FT ++       FL+ VL  I  K+ W LE I +S+L+V + +  S  RYE +
Sbjct: 21   ILPLTISFTPNHLNDNNTQFLKDVLKEISVKQDWDLEGIEISKLEVSKVRIFSSQRYEFK 80

Query: 414  VRVGKAEIVLKMYDEVSEWKKLVAPWKNGTSDFEALARSIASKAVIDSLKIEGPFELRVT 593
            +RVGK+ ++LK  DE+   KKL  P    + DF  L +   S  V+D+LK++GPF+L V+
Sbjct: 81   IRVGKSYMLLKFPDEIDSRKKLSKP--KSSIDFGDLIKEFGSVPVLDTLKLQGPFDLWVS 138

Query: 594  GDDDQFSLMLPLNTSHYGLRRISVGEGITIEVKGAEEISFFHPSVNQLPYNVLTWSNVGS 773
            G D+ FSL+LP+N S+ GL+RI VGEGI++EVKGA+E+S F      L  N    +N   
Sbjct: 139  GHDN-FSLLLPMNASYGGLKRIIVGEGISVEVKGAKEVSLFQDFDLSLALNGSDINNNKG 197

Query: 774  ------IWHSLCTALLPIRILGSASVVAYRNQRPTALIQTAFSSTDSVELLPDKCYIRPN 935
                     S+C  LLPIRI+GSAS+VA +N  P A I+T   S  ++EL+ DKCY R  
Sbjct: 198  GNGFYPFGDSICPPLLPIRIIGSASLVANKNWDPDAEIETRLLSKKTIELVSDKCYDRNV 257

Query: 936  YTKPWHLLSSLSNRIALLENVLRSFINERGNIDAALGSVKTRIRPLTIFRFQLELERDIK 1115
            Y      +  LS+ IA LE VLRSF+ +R   +     ++   +  T+ RFQLELE+   
Sbjct: 258  YKIRASTMHFLSSSIARLEEVLRSFLGDRITRNGLSSFLRATAKASTLIRFQLELEKSFG 317

Query: 1116 INDTFWSTLAEWRTRPTIERVLFEVVARIEGEVLKPLVIKRVRPLMDTDSFAWXXXXXXX 1295
             N+T     AEWRTRPT+ERV FEV+AR+EGE LKP+++K+VRP +  DS +W       
Sbjct: 318  SNETAQEVFAEWRTRPTVERVWFEVIARVEGEKLKPVIVKKVRPFIAVDSASWSNLMSNI 377

Query: 1296 XFTKFPSVLVPPEALTLDVKW 1358
             FT FPSVLVPPEALTLDVKW
Sbjct: 378  SFTNFPSVLVPPEALTLDVKW 398


>ref|XP_002510285.1| signal peptidase I, putative [Ricinus communis]
            gi|223550986|gb|EEF52472.1| signal peptidase I, putative
            [Ricinus communis]
          Length = 831

 Score =  290 bits (743), Expect = 9e-76
 Identities = 160/372 (43%), Positives = 223/372 (59%), Gaps = 5/372 (1%)
 Frame = +3

Query: 258  TESNTPSQPPNFLQGVLDAIVKKEKWVLEDIRVSELDVKRDKYRSGLRYELQVRVGKAEI 437
            T +NT     + L+ VL  I ++  W LE IR S+L V + ++ +  RYE ++R GK  +
Sbjct: 473  TNNNT-----DILEDVLKEISERHNWDLERIRTSKLKVSKIRFGTAQRYEFRIRFGKMSL 527

Query: 438  VLKMYDEVSEWKKLVAPWKNGTSDFEALARSIASKAVIDSLKIEGPFELRVTGDDDQFSL 617
            + K  DEV  WK+    +     DFE   + I + AV+D+ K+EGPF+L + G  D  SL
Sbjct: 528  IFKFPDEVYSWKR----YNKKNDDFENSVKEIGTAAVLDTFKVEGPFDLWI-GGQDHLSL 582

Query: 618  MLPLNTSHYGLRRISVGEGITIEVKGAEEISFFHPSVNQLPYNVLTWSNVGS-----IWH 782
             LPLN SH  L+R+ VGEGIT+EVK A+++S F         N     N G       W 
Sbjct: 583  SLPLNVSHSSLKRMLVGEGITVEVKDAQQLSIFQTFDPSFSMNGRVKINKGKSGFCLFWR 642

Query: 783  SLCTALLPIRILGSASVVAYRNQRPTALIQTAFSSTDSVELLPDKCYIRPNYTKPWHLLS 962
             LC  LLPIR++GSAS++AY+ + P A ++T   S  +++LL +KCY    Y     L  
Sbjct: 643  QLCMPLLPIRVIGSASLIAYKTRNPDAPVETTLLSEGTIKLLSEKCYSDDLYKNQAQLSH 702

Query: 963  SLSNRIALLENVLRSFINERGNIDAALGSVKTRIRPLTIFRFQLELERDIKINDTFWSTL 1142
             LS +I  L  +LR+F+   GN     G +++ ++  TI RFQLELE++I  + T    L
Sbjct: 703  FLSLKIDRLGKLLRTFL---GNQMELSGFLRSNVKAATIIRFQLELEKNIGSSATLHDAL 759

Query: 1143 AEWRTRPTIERVLFEVVARIEGEVLKPLVIKRVRPLMDTDSFAWXXXXXXXXFTKFPSVL 1322
             +WRTRPTIERV FEV+AR+E E L+P+V+K+VRP +  DS +W        FTKFPS+L
Sbjct: 760  EDWRTRPTIERVYFEVLARVEDEKLRPVVVKKVRPFIAVDSASWSNLMSNLSFTKFPSIL 819

Query: 1323 VPPEALTLDVKW 1358
            VPPEALTLDVKW
Sbjct: 820  VPPEALTLDVKW 831


>ref|XP_004160620.1| PREDICTED: uncharacterized protein LOC101229456 [Cucumis sativus]
          Length = 763

 Score =  275 bits (704), Expect = 3e-71
 Identities = 157/362 (43%), Positives = 219/362 (60%), Gaps = 5/362 (1%)
 Frame = +3

Query: 288  NFLQGVLDAIVKKEKWVLEDIRVSELDVKRDKYRSGLRYELQVRVGKAEIVLKMYDEVSE 467
            + LQ VL+ +  K+KW LE I++ ELDV+  ++     YE+++ +GK  ++ K  DEVS 
Sbjct: 407  HLLQDVLNDLAAKQKWDLEGIKILELDVESLRFGFAESYEIRLGLGKTRLLAKFSDEVSS 466

Query: 468  WKKLVAPWKNGTSDFEALARSIASKAVIDSLKIEGPFELRVTGDDDQFSLMLPLNTSHYG 647
            WKK   P     + F +L   I S A I + KI GPF+L V G+  + S+ LP N +H G
Sbjct: 467  WKK---PSSANQTRFGSLINGIGSMAAIRTFKIVGPFDLMVEGEA-RLSVSLPKNATHVG 522

Query: 648  LRRISVGEGITIEVKGAEEISFFHPSVNQLPYNVLTWSNVGSIWH-----SLCTALLPIR 812
            ++RI VGEGIT+EV  AEE+S F+ S      N    SN G I         C+ LLP+R
Sbjct: 523  VKRILVGEGITVEVSEAEEVSVFYSSDLSKLLNETRRSN-GKIRTYPFRLPFCSPLLPLR 581

Query: 813  ILGSASVVAYRNQRPTALIQTAFSSTDSVELLPDKCYIRPNYTKPWHLLSSLSNRIALLE 992
            +LGSA++ AYR Q P   I+T F S DS+ELLP+KCY R  + +   LL SL  +  +L+
Sbjct: 582  VLGSATLSAYRTQNPDDYIRTRFLSKDSIELLPNKCYGRNTHIENSPLLGSLKPQFHMLD 641

Query: 993  NVLRSFINERGNIDAALGSVKTRIRPLTIFRFQLELERDIKINDTFWSTLAEWRTRPTIE 1172
             V + ++      +  L  VK ++R   + RFQLELE     N + ++ LAEWRT+PT+E
Sbjct: 642  TVFQRYLRNWILQNGLLAFVKVKMRACVVVRFQLELENTFGTNSSLYARLAEWRTKPTVE 701

Query: 1173 RVLFEVVARIEGEVLKPLVIKRVRPLMDTDSFAWXXXXXXXXFTKFPSVLVPPEALTLDV 1352
            R  FEV+AR++   LKPL +K+++PL+  DS  W        FTKFPS+LV PEALTLDV
Sbjct: 702  RASFEVLARLDTVRLKPLAVKKLKPLIVADSTEWRNLLPNISFTKFPSLLVSPEALTLDV 761

Query: 1353 KW 1358
            KW
Sbjct: 762  KW 763


>ref|XP_004141368.1| PREDICTED: uncharacterized protein LOC101221060, partial [Cucumis
            sativus]
          Length = 761

 Score =  275 bits (704), Expect = 3e-71
 Identities = 157/362 (43%), Positives = 219/362 (60%), Gaps = 5/362 (1%)
 Frame = +3

Query: 288  NFLQGVLDAIVKKEKWVLEDIRVSELDVKRDKYRSGLRYELQVRVGKAEIVLKMYDEVSE 467
            + LQ VL+ +  K+KW LE I++ ELDV+  ++     YE+++ +GK  ++ K  DEVS 
Sbjct: 405  HLLQDVLNDLAAKQKWDLEGIKILELDVESLRFGFAESYEIRLGLGKTRLLAKFSDEVSS 464

Query: 468  WKKLVAPWKNGTSDFEALARSIASKAVIDSLKIEGPFELRVTGDDDQFSLMLPLNTSHYG 647
            WKK   P     + F +L   I S A I + KI GPF+L V G+  + S+ LP N +H G
Sbjct: 465  WKK---PSSANQTRFGSLINGIGSMAAIRTFKIVGPFDLMVEGEA-RLSVSLPKNATHVG 520

Query: 648  LRRISVGEGITIEVKGAEEISFFHPSVNQLPYNVLTWSNVGSIWH-----SLCTALLPIR 812
            ++RI VGEGIT+EV  AEE+S F+ S      N    SN G I         C+ LLP+R
Sbjct: 521  VKRILVGEGITVEVSEAEEVSVFYSSDLSKLLNETRRSN-GKIRTYPFRLPFCSPLLPLR 579

Query: 813  ILGSASVVAYRNQRPTALIQTAFSSTDSVELLPDKCYIRPNYTKPWHLLSSLSNRIALLE 992
            +LGSA++ AYR Q P   I+T F S DS+ELLP+KCY R  + +   LL SL  +  +L+
Sbjct: 580  VLGSATLSAYRTQNPDDYIRTRFLSKDSIELLPNKCYGRNTHIENSPLLGSLKPQFHMLD 639

Query: 993  NVLRSFINERGNIDAALGSVKTRIRPLTIFRFQLELERDIKINDTFWSTLAEWRTRPTIE 1172
             V + ++      +  L  VK ++R   + RFQLELE     N + ++ LAEWRT+PT+E
Sbjct: 640  TVFQRYLRNWILQNGLLAFVKVKMRACVVVRFQLELENTFGTNSSLYARLAEWRTKPTVE 699

Query: 1173 RVLFEVVARIEGEVLKPLVIKRVRPLMDTDSFAWXXXXXXXXFTKFPSVLVPPEALTLDV 1352
            R  FEV+AR++   LKPL +K+++PL+  DS  W        FTKFPS+LV PEALTLDV
Sbjct: 700  RASFEVLARLDTVRLKPLAVKKLKPLIVADSTEWRNLLPNISFTKFPSLLVSPEALTLDV 759

Query: 1353 KW 1358
            KW
Sbjct: 760  KW 761


>gb|ESW32807.1| hypothetical protein PHAVU_001G018600g [Phaseolus vulgaris]
          Length = 384

 Score =  274 bits (700), Expect = 9e-71
 Identities = 157/386 (40%), Positives = 229/386 (59%), Gaps = 5/386 (1%)
 Frame = +3

Query: 216  FNLSHFLYPKVTTFTESNTPSQPPNFLQGVLDAIVKKEKWVLEDIRVSELDVKRDKYRSG 395
            F LS F+   +  FT   + S   + LQ VL A+  K+KW   D+RV++LD  + ++ + 
Sbjct: 6    FPLSFFIL--LLQFTAFASSSNLTHILQDVLRAVSAKQKWDSNDVRVAKLDAAKVRFGTS 63

Query: 396  LRYELQVRVGKAEIVLKMYDEVSEWKKLVAPWKNGTSDFEALARSIASKAVIDSLKIEGP 575
              YE ++ +G     LK  D+V+ W K   P+     D  +L   + S  ++ +LK+EGP
Sbjct: 64   QSYEFRIGLGTGNFTLKFADQVATWNKFRTPFP----DLPSLVHRLGSFPLLPTLKLEGP 119

Query: 576  FELRVTGDDDQFSLMLPLNTSHYGLRRISVGEGITIEVKGAEEISFFHPSVNQLPYNVLT 755
            F LRV    +  SL LP+N S+ GL++I VGEGIT+EVKGA+EIS F+ S   L  N   
Sbjct: 120  FSLRVDSLHN-LSLFLPMNVSYTGLKQILVGEGITVEVKGAQEISLFYSSDIDLLMNGSA 178

Query: 756  WSNVGS--IW---HSLCTALLPIRILGSASVVAYRNQRPTALIQTAFSSTDSVELLPDKC 920
              + G   IW   HS C A++PIRI GSAS+VAYR + P A I T   S D++E+LP+KC
Sbjct: 179  MCSGGKSDIWPFLHSTCMAVIPIRISGSASLVAYRARNPYAHIATTLISEDAIEMLPEKC 238

Query: 921  YIRPNYTKPWHLLSSLSNRIALLENVLRSFINERGNIDAALGSVKTRIRPLTIFRFQLEL 1100
            Y    + K    L S+S ++++LE VLRS +  +     + G +K  I+   + +F++EL
Sbjct: 239  YHGRMFKKQACPLDSVSLKLSMLEKVLRSLLGRKILQGQSFGLLKANIKASAVVKFRIEL 298

Query: 1101 ERDIKINDTFWSTLAEWRTRPTIERVLFEVVARIEGEVLKPLVIKRVRPLMDTDSFAWXX 1280
            ERDI+ N T   T+ +WRTRP+ ER  FE++AR+E   LKPL IK+V+P +++ S +W  
Sbjct: 299  ERDIRNNVTLNRTIPDWRTRPSFERFWFEILARVEENRLKPLSIKKVKPFIESVSVSWAN 358

Query: 1281 XXXXXXFTKFPSVLVPPEALTLDVKW 1358
                  +T    V +PPE LTLDVKW
Sbjct: 359  LMSNMSYTMLRPVFLPPEPLTLDVKW 384


>gb|AGV54177.1| signal peptidase I [Phaseolus vulgaris]
          Length = 384

 Score =  273 bits (698), Expect = 2e-70
 Identities = 159/386 (41%), Positives = 227/386 (58%), Gaps = 5/386 (1%)
 Frame = +3

Query: 216  FNLSHFLYPKVTTFTESNTPSQPPNFLQGVLDAIVKKEKWVLEDIRVSELDVKRDKYRSG 395
            F LS F+   +  FT   + S   + LQ VL A+  K+KW   D+RV++LD  + ++ + 
Sbjct: 6    FPLSFFIL--LLQFTAFASSSNLTHILQDVLRAVSAKQKWDSNDVRVAKLDAAKVRFGTS 63

Query: 396  LRYELQVRVGKAEIVLKMYDEVSEWKKLVAPWKNGTSDFEALARSIASKAVIDSLKIEGP 575
            L YE ++ +G     LK  D+V+ W K   P+     D  +L   + S  ++ +LK+EGP
Sbjct: 64   LSYEFRIGLGTGNFTLKFADQVATWNKFRTPFP----DLPSLVHRLGSFPLLPTLKLEGP 119

Query: 576  FELRVTGDDDQFSLMLPLNTSHYGLRRISVGEGITIEVKGAEEISFFHPSVNQLPYNVLT 755
            F LRV    +  SL LP+N S+ GL++I VGEGIT+EVKGA+EIS F+ S   L  N   
Sbjct: 120  FSLRVDSLHN-LSLFLPMNVSYTGLKQILVGEGITVEVKGAQEISLFYSSDIDLLMNGSA 178

Query: 756  WSNVGS--IW---HSLCTALLPIRILGSASVVAYRNQRPTALIQTAFSSTDSVELLPDKC 920
              + G   IW   HS C A++PIRI GSAS+VAYR + P A I T   S D++E+LP+KC
Sbjct: 179  MCSGGKSDIWPFLHSTCMAVIPIRISGSASLVAYRARNPYAHIATTLISEDAIEMLPEKC 238

Query: 921  YIRPNYTKPWHLLSSLSNRIALLENVLRSFINERGNIDAALGSVKTRIRPLTIFRFQLEL 1100
            Y    + K    L S+S +++ LE VLRS    +     + G +K  I+   + +F++EL
Sbjct: 239  YHGCMFKKQACPLDSVSWKLSRLEKVLRSLFGRKIVQGQSFGLLKANIKASAVVKFRIEL 298

Query: 1101 ERDIKINDTFWSTLAEWRTRPTIERVLFEVVARIEGEVLKPLVIKRVRPLMDTDSFAWXX 1280
            ERDI  + TF  T+ +WRTRP+ ER  FE++AR+E   LKPL IKRV+P ++  S +W  
Sbjct: 299  ERDISNSVTFNRTIPDWRTRPSFERFWFEILARVEENSLKPLSIKRVKPFIEFVSVSWAN 358

Query: 1281 XXXXXXFTKFPSVLVPPEALTLDVKW 1358
                  +T    V +PPE LTLDVKW
Sbjct: 359  LMSNMSYTMLRPVFLPPEPLTLDVKW 384


>gb|EXB38625.1| putative thylakoidal processing peptidase 2 [Morus notabilis]
          Length = 787

 Score =  270 bits (690), Expect = 1e-69
 Identities = 160/357 (44%), Positives = 217/357 (60%), Gaps = 5/357 (1%)
 Frame = +3

Query: 303  VLDAIVKKEKWVLEDIRVSELDVKRDKYRSGLRYELQVRVGKAEIVLKMYDEVSEWKKLV 482
            VL  I  K+KW L+ I+VS LD+++ ++ +  RYE +V +GK  +     DEVS W    
Sbjct: 440  VLKEISVKQKWDLDAIKVSRLDLRKLRFGTSNRYEFRVGIGKTHLSAIFSDEVSSWNN-- 497

Query: 483  APWKNGTSDFEALARSIASKAVIDSLKIEGPFELRVTGDDDQFSLMLPLNTSHYGLRRIS 662
              ++N T+D  +L   + S A++D+ K+EGPFELRV GD +  SL+LP+N +H G  RI 
Sbjct: 498  --FRNPTADLGSLLDEVRSFALLDTFKLEGPFELRV-GDSNYSSLLLPMNRTHAGFNRIL 554

Query: 663  VGEGITIEVKGAEEISFFHPSVNQLPYNVLTWSNVGS-----IWHSLCTALLPIRILGSA 827
            VGEGITIEV+GA+E+S F  S      NV      G      I HS C  L+ I++ GSA
Sbjct: 555  VGEGITIEVRGAQEVSAFQASDFSSTVNVSHEIGNGKTEFWPIRHSFCGVLVQIQVFGSA 614

Query: 828  SVVAYRNQRPTALIQTAFSSTDSVELLPDKCYIRPNYTKPWHLLSSLSNRIALLENVLRS 1007
            ++ AYR + P   I+T   S +++ELL +KCY    + K    + SL  RIA+LE VLRS
Sbjct: 615  ALAAYRTKNPDNCIKTKRISKETIELLAEKCYGNNIHKKRNCPVDSLGLRIAMLEKVLRS 674

Query: 1008 FINERGNIDAALGSVKTRIRPLTIFRFQLELERDIKINDTFWSTLAEWRTRPTIERVLFE 1187
            +  ER  ++  +G  + +I  L + RFQLELE D + NDT     A WRTRP++ERV F+
Sbjct: 675  YFGER--LNGTVGLFRGKISALALIRFQLELEMDSRSNDT-QQAKASWRTRPSVERVWFD 731

Query: 1188 VVARIEGEVLKPLVIKRVRPLMDTDSFAWXXXXXXXXFTKFPSVLVPPEALTLDVKW 1358
            V+AR+E E LK LV K   P   TD+  W        FTKFPS+LVP EALTLDVKW
Sbjct: 732  VLARVEAERLKLLVAKETNPSFVTDTAGW-SNLSNISFTKFPSLLVPSEALTLDVKW 787


>gb|EPS59742.1| hypothetical protein M569_15063, partial [Genlisea aurea]
          Length = 338

 Score =  269 bits (687), Expect = 3e-69
 Identities = 163/358 (45%), Positives = 222/358 (62%), Gaps = 8/358 (2%)
 Frame = +3

Query: 300  GVLDAIVKKEKWVLEDIRVSELDVKRDKYRSGLRYELQVRVGKAEIVLKMYDEVSEWKKL 479
            GVLD I  KEKW LEDIRVSE+D+K+ K+R+   YE ++   K  I +KM++ VSEWKKL
Sbjct: 1    GVLDVIASKEKWNLEDIRVSEVDLKKAKFRTVKLYEFRIPHRKTVIHVKMHEVVSEWKKL 60

Query: 480  VAPWKNGTSDFEALARSIASKAV-IDSLKIEGPFELRVTGDDDQFSLMLPLNTSHYGLRR 656
                    S+ E L+  I SK   IDS  +EGPFEL  +G+DD  +LMLP+N +H  L++
Sbjct: 61   ----NMAASNLEDLSAEIESKTTAIDSFTLEGPFELTASGNDDALTLMLPMNKTHSKLQK 116

Query: 657  ISVGEGITIEVKGAEEISFFHPSVNQLPYNVLTWSNVGSIWHSLCTALLPIRILGSASVV 836
            ISVG+GI + VKGA+ IS F+PS            +  ++   +C A   I I GSASV 
Sbjct: 117  ISVGQGIAVVVKGADAISGFYPS-----------HHPATLICGICRATPRIHINGSASVS 165

Query: 837  AYRNQRPTA-LIQTAFSS-TDSVELLPDKCYIRPNYTKPWHLLSSLSNRIALLENVLRSF 1010
            AY + RPT+ +I+T  SS TD++ LLPDKCY     T    L  S  ++ ALL+ VL +F
Sbjct: 166  AYTSTRPTSPIIRTQISSSTDAITLLPDKCYDDDKTTSL--LRGSFGSKFALLKRVLSTF 223

Query: 1011 INERGNIDAALGS--VKTRIRPLTIFRFQLELERDIKINDTFWSTLAEWRTRPTIERVLF 1184
            +++     A L    +K   R  T++RF+LELERD++ ND +W+   EWRTRP++ER  F
Sbjct: 224  LDDTA---ATLRGPPIKASARASTVYRFRLELERDVRKNDAYWTAFGEWRTRPSVERAWF 280

Query: 1185 EVVARIEGEVLKPLVIKRV---RPLMDTDSFAWXXXXXXXXFTKFPSVLVPPEALTLD 1349
            EV AR+E   LKP  +KRV    P++D D ++         FTKFPS+LV PEALTL+
Sbjct: 281  EVAARVEDGELKPAAVKRVVGLGPVIDADRYS-SGLVSNVSFTKFPSLLVAPEALTLE 337


>ref|XP_004499101.1| PREDICTED: uncharacterized protein LOC101493524 [Cicer arietinum]
          Length = 390

 Score =  267 bits (683), Expect = 8e-69
 Identities = 153/375 (40%), Positives = 219/375 (58%), Gaps = 5/375 (1%)
 Frame = +3

Query: 249  TTFTESNTPSQPPNFLQGVLDAIVKKEKWVLEDIRVSELDVKRDKYRSGLRYELQVRVGK 428
            T FT S+T S P +  Q +L AI  K+KW   D+RV   D+ + ++ +   Y  ++  G 
Sbjct: 19   TAFTSSSTHSNPTHIFQDILKAISAKQKWDFNDVRVYNFDLAKLRFGTSQTYHFRIGSGN 78

Query: 429  AEIVLKMYDEVSEWKKLVAPWKNGTSDFEALARSIASKAVIDSLKIEGPFELRVTGDDDQ 608
                LK  D+VS W      +     D E L     S A +D +K+EGPFEL V  +   
Sbjct: 79   DNFTLKFSDQVSSWNNN-NNFATPKLDLETLVDRFTSIAFLDDIKLEGPFELHVD-ELHH 136

Query: 609  FSLMLPLNTSHYGLRRISVGEGITIEVKGAEEISFFH-PSVN-QLPYNVLTWSNVGSIW- 779
            FSL LP+N S+ GL+ + VGEGIT+EV+ A E+SFF+ P ++ Q   +V         W 
Sbjct: 137  FSLSLPMNVSYTGLKHVIVGEGITVEVRRAREMSFFYRPDLDRQTNGSVACSKGKSEFWP 196

Query: 780  --HSLCTALLPIRILGSASVVAYRNQRPTALIQTAFSSTDSVELLPDKCYIRPNYTKPWH 953
               S C  L+P+ I+GSAS++AY  + P   I T   S D+VELLP+KCY    + K   
Sbjct: 197  FLQSTCVPLIPLNIIGSASLIAYGARNPYTHIGTTLISEDTVELLPEKCYHGRVFRKRAC 256

Query: 954  LLSSLSNRIALLENVLRSFINERGNIDAALGSVKTRIRPLTIFRFQLELERDIKINDTFW 1133
             ++SLS R+++LE +LRS +  +   D   G +K  I+     +F LELERD+  N T  
Sbjct: 257  PVASLSLRLSMLEKILRSLLGHKILQDRFSGLIKANIKAYAAVKFPLELERDVGNNVTR- 315

Query: 1134 STLAEWRTRPTIERVLFEVVARIEGEVLKPLVIKRVRPLMDTDSFAWXXXXXXXXFTKFP 1313
            S L +WRTRP++ERV FE++AR+E   LKP++IK+V+P +++DS +W        +TK  
Sbjct: 316  SALPDWRTRPSVERVWFEILARVEENRLKPVLIKKVKPFIESDSVSWANLMSNMSYTKLR 375

Query: 1314 SVLVPPEALTLDVKW 1358
             VL+PPEALTLDVKW
Sbjct: 376  PVLLPPEALTLDVKW 390


>ref|XP_002891379.1| hypothetical protein ARALYDRAFT_473912 [Arabidopsis lyrata subsp.
            lyrata] gi|297337221|gb|EFH67638.1| hypothetical protein
            ARALYDRAFT_473912 [Arabidopsis lyrata subsp. lyrata]
          Length = 391

 Score =  267 bits (683), Expect = 8e-69
 Identities = 157/391 (40%), Positives = 236/391 (60%), Gaps = 10/391 (2%)
 Frame = +3

Query: 216  FNLSHFLYPKVTTFTESNTPSQPPN-------FLQGVLDAIVKKEKWVLEDIRVSELDVK 374
            F+L   L+ +V T   +  PSQP          LQ VL  I  K+KW LE++R S+L+VK
Sbjct: 8    FSLVLSLFIQVLTLAIALDPSQPDESNITATPILQDVLKEISVKQKWNLEEVRFSKLEVK 67

Query: 375  RDKYRSGLRYELQVRVGKAEIVLKMYDEVSEWKKLVAPWKNGTSDFEALARSIASKAVID 554
            + +  +G R+E+++R+GK+  V    DEV++W++ V        + + + R + S  V+D
Sbjct: 68   KIRIGTGRRFEIRIRLGKSRFVFIFPDEVTDWRRSVG---GKDVELQEVVREVNSSKVLD 124

Query: 555  SLKIEGPFELRVTGDDDQFSLMLPLNTSHYGLRRISVGEGITIEVKGAEEISFFHPSVNQ 734
            SL ++GPFELRV GDD + SL LP+N SH GL+R+ V EGI++E++ A+ +S FH S  +
Sbjct: 125  SLVLKGPFELRVDGDD-RLSLALPMNISHNGLKRVLVSEGISVEIREAQAVSLFHSSHRR 183

Query: 735  LPYNV--LTWSNVGSIWHSLCTALLPIRILGSASVVAYRNQRPTALIQTAFSSTDSVELL 908
                V     + + S   S+C  L PI+ILGSAS+VA+R     + I+T++ S +++++ 
Sbjct: 184  YAATVDMKNGNCLLSFLGSVCVPLPPIQILGSASLVAFRTSNTDSQIKTSYLSDEAIQIH 243

Query: 909  PDKCYIRPN-YTKPWHLLSSLSNRIALLENVLRSFINERGNIDAALGSVKTRIRPLTIFR 1085
            PDKCY + + Y +       L  +I  LE VL S  N        + SV  +++   + R
Sbjct: 244  PDKCYDKAHTYRQHRFPTDLLGLKINKLEKVLSSLGN---GTRQTVSSVTAKLKASGMVR 300

Query: 1086 FQLELERDIKINDTFWSTLAEWRTRPTIERVLFEVVARIEGEVLKPLVIKRVRPLMDTDS 1265
            FQLE+ER I  N++  S   EWRT+P IERV FE+ A+IEG+ LK + +++V P ++ D+
Sbjct: 301  FQLEIERSIGKNESVISKRVEWRTKPKIERVWFEITAKIEGDKLKAVGMRKVVPFIEVDT 360

Query: 1266 FAWXXXXXXXXFTKFPSVLVPPEALTLDVKW 1358
             AW        FTKFPS+LVP EALTLDVKW
Sbjct: 361  EAWSSLMSNMSFTKFPSLLVPQEALTLDVKW 391


>gb|AAM61120.1| unknown [Arabidopsis thaliana]
          Length = 395

 Score =  266 bits (680), Expect = 2e-68
 Identities = 157/391 (40%), Positives = 232/391 (59%), Gaps = 16/391 (4%)
 Frame = +3

Query: 234  LYPKVTTFTESNTPSQPPN-------FLQGVLDAIVKKEKWVLEDIRVSELDVKRDKYRS 392
            L+ +V T   +  PSQP          LQ VL  I  K+KW LE++R S+L+VK+ +  +
Sbjct: 14   LFIQVLTLAVALDPSQPDESNITATPILQDVLKEISVKQKWNLEEVRFSKLEVKKIRIGT 73

Query: 393  GLRYELQVRVGKAEIVLKMYDEVSEWKKLVAPWKNGTSDFEA--LARSIASKAVIDS-LK 563
              R+E+++R+GK+  V    DEV++W++       G  D E   L R + S  V+D  L 
Sbjct: 74   SRRFEIRIRLGKSRFVFIFPDEVTDWRR-----SGGGRDVELQELVREVNSSKVLDPPLV 128

Query: 564  IEGPFELRVTGDDDQFSLMLPLNTSHYGLRRISVGEGITIEVKGAEEISFFHPSVNQL-- 737
            ++GPFELRV GDD + SL LP+N SH GL+R+ V EGI++E++ A+ +S FH S  +   
Sbjct: 129  LKGPFELRVDGDD-RLSLSLPMNISHSGLKRVLVSEGISVEIREAQAVSLFHSSHRRYAA 187

Query: 738  ---PYNVLTWSNVGSIWHSLCTALLPIRILGSASVVAYRNQRPTALIQTAFSSTDSVELL 908
               P N+   S++ S W S+C  L PI+I+GSAS+VA+R    T  I+T++ S +++ L 
Sbjct: 188  TVDPVNIKQGSSLWSFWGSVCVPLPPIQIIGSASLVAFRTSNATTQIKTSYLSDEAIHLY 247

Query: 909  PDKCYIRPNYTKPWHLLSSLSN-RIALLENVLRSFINERGNIDAALGSVKTRIRPLTIFR 1085
             +KCY + +  +     + L   +I  LE VL S  N        + SV  +++   + R
Sbjct: 248  AEKCYYKAHTYRQHRFPNDLLGLKIHKLEKVLNSLGN---GTRQTVSSVTAKLKASGMVR 304

Query: 1086 FQLELERDIKINDTFWSTLAEWRTRPTIERVLFEVVARIEGEVLKPLVIKRVRPLMDTDS 1265
            FQLE+ER I  N++  S    WRT+P IERV FEV A+IEG+ LK + +++V P ++ D+
Sbjct: 305  FQLEIERSIGKNESVISKKVAWRTKPKIERVWFEVTAKIEGDKLKAVRLRKVVPFIEVDT 364

Query: 1266 FAWXXXXXXXXFTKFPSVLVPPEALTLDVKW 1358
             AW        FTKFPS+LVP EALTLDVKW
Sbjct: 365  EAWSSLMSNMSFTKFPSLLVPQEALTLDVKW 395


>ref|XP_003549415.1| PREDICTED: uncharacterized protein LOC100804093 [Glycine max]
          Length = 393

 Score =  266 bits (679), Expect = 2e-68
 Identities = 163/393 (41%), Positives = 226/393 (57%), Gaps = 12/393 (3%)
 Frame = +3

Query: 216  FNLSHFLYP-KVTTFTESNTPSQPPNFLQGVLDAIVKKEKWVL---EDIRVSELDVKRDK 383
            F LS F++  +   F  S+T S   + LQ VL A+  K+KW     +D+RV++ DV +  
Sbjct: 6    FLLSFFIFLLQFIAFASSSTHSNLTHILQDVLKAVSAKQKWDSSNNDDVRVTKFDVGKVM 65

Query: 384  YRSGLRYELQVRVG---KAEIVLKMYDEVSEWKKLVAPWKNGTSDFEALARSIASKAVID 554
            + + L YE ++R G        LK  D+V+ W K   P+    +D   L   + S  ++ 
Sbjct: 66   FGTSLSYEFRIRFGTDNNDNFTLKFVDQVATWNKFRTPF----TDLPPLVHRLGSFPLLH 121

Query: 555  SLKIEGPFELRVTGDDDQFSLMLPLNTSHYGLRRISVGEGITIEVKGAEEISFFHPSVNQ 734
            +LK+EGPF LRV    +  SL LP+N S+ GL+ I VGEGIT+EV+ A+EIS F+ S   
Sbjct: 122  TLKLEGPFALRVDALHN-LSLSLPMNVSYTGLKHILVGEGITVEVRRAQEISLFYSSDLD 180

Query: 735  LPYNVLTWSNVGS--IW---HSLCTALLPIRILGSASVVAYRNQRPTALIQTAFSSTDSV 899
            L  N     + G   +W    S C AL+PIRI GSAS+VAYR +   A I T   S D++
Sbjct: 181  LQMNGSAMCSEGKSDLWPFMRSTCMALIPIRISGSASLVAYRARNAYAQIATTLISEDAI 240

Query: 900  ELLPDKCYIRPNYTKPWHLLSSLSNRIALLENVLRSFINERGNIDAALGSVKTRIRPLTI 1079
            ELLP+KCY    + K    + SLS R++LLE VLRSF++ +   D   G +K  I+   +
Sbjct: 241  ELLPEKCYHGHVFRKRACPIDSLSLRLSLLEKVLRSFLDHKILKDQLFGLLKANIKASAV 300

Query: 1080 FRFQLELERDIKINDTFWSTLAEWRTRPTIERVLFEVVARIEGEVLKPLVIKRVRPLMDT 1259
             +F LELERDI  N T   T+ +WRTRP  ER  FE++AR+E   LKPL+IK VRP +++
Sbjct: 301  VKFPLELERDISNNATLNRTIPDWRTRPGFERFWFEILARVEENKLKPLLIKEVRPFIES 360

Query: 1260 DSFAWXXXXXXXXFTKFPSVLVPPEALTLDVKW 1358
             S +W        +TK   V   PE LTLDVKW
Sbjct: 361  VSVSWANLMSNMSYTKLRPVFFLPEPLTLDVKW 393


>ref|XP_006393531.1| hypothetical protein EUTSA_v10011550mg [Eutrema salsugineum]
            gi|557090109|gb|ESQ30817.1| hypothetical protein
            EUTSA_v10011550mg [Eutrema salsugineum]
          Length = 398

 Score =  263 bits (673), Expect = 1e-67
 Identities = 153/400 (38%), Positives = 239/400 (59%), Gaps = 19/400 (4%)
 Frame = +3

Query: 216  FNLSHFLYPKVTTFTESNTPSQPPN-------FLQGVLDAIVKKEKWVLEDIRVSELDVK 374
            F++   L+ +  T   +  PSQP          LQ VL  I  ++KW L ++R S+L+VK
Sbjct: 9    FSVVLSLFIQALTLAVALDPSQPDESTITATPILQDVLKEISVRQKWNLTEVRFSKLEVK 68

Query: 375  RDKYRSGLRYELQVRVGKAEIVLKMYDEVSEWKKLVAPWKNGTSDFEAL--ARSIASKAV 548
            + +  +G  +E+++R+GK+  V    DEV++W++       G    E +   R + S  V
Sbjct: 69   KLRVGTGRSFEIRIRLGKSRFVFVFPDEVTDWRR-----SGGGKQVELMEVVREVNSSKV 123

Query: 549  IDSLKIEGPFELRVTGDDDQFSLMLPLNTSHYGLRRISVGEGITIEVKGAEEISFFHPSV 728
            +D + ++GP ELRV G+D+  SL LP+N SH GL+R+ V EGI++E++ A+ +S FH S 
Sbjct: 124  LDPIVLKGPLELRVAGEDNLLSLALPMNISHNGLKRVLVSEGISVEIRKAQTVSLFHSSN 183

Query: 729  NQLPYNV---------LTWSNVGSIWHSLCTALLPIRILGSASVVAYRNQRPTALIQTAF 881
             +   +V           WS++G    S+C  L PI+I GSAS+VA+R     + I+T++
Sbjct: 184  RRFAASVEPVDMNERSCLWSSLGG---SVCVPLPPIQIDGSASLVAFRTPYKDSRIKTSY 240

Query: 882  SSTDSVELLPDKCYIRPNYTKPWHLLSSLSN-RIALLENVLRSFINERGNIDAALGSVKT 1058
             + ++++LLP+KCY + +  K  HL + L   +I  LE VL S  N +GN +  + S+  
Sbjct: 241  LTNEAIQLLPEKCYHKAHTYKQNHLSTDLLGLKIKKLERVLSSLGN-KGNAET-VSSMTA 298

Query: 1059 RIRPLTIFRFQLELERDIKINDTFWSTLAEWRTRPTIERVLFEVVARIEGEVLKPLVIKR 1238
            +++   + RFQLE+ER I  N++  S   EWRT+P IERV FEV A++EG+ LK + +++
Sbjct: 299  KLKASGMVRFQLEIERRIGSNESVTSKRLEWRTKPKIERVWFEVAAKVEGDKLKAVGMRK 358

Query: 1239 VRPLMDTDSFAWXXXXXXXXFTKFPSVLVPPEALTLDVKW 1358
            V P ++ D+ AW        FTKFPS+LVP EALTLDVKW
Sbjct: 359  VVPFIEVDTEAWSSLMSNMSFTKFPSILVPQEALTLDVKW 398


>ref|NP_564503.1| uncharacterized protein [Arabidopsis thaliana]
            gi|9993349|gb|AAG11422.1|AC015449_4 Unknown protein
            [Arabidopsis thaliana] gi|30102708|gb|AAP21272.1|
            At1g47310 [Arabidopsis thaliana]
            gi|110736510|dbj|BAF00222.1| hypothetical protein
            [Arabidopsis thaliana] gi|332194034|gb|AEE32155.1|
            uncharacterized protein AT1G47310 [Arabidopsis thaliana]
          Length = 395

 Score =  263 bits (672), Expect = 2e-67
 Identities = 155/391 (39%), Positives = 232/391 (59%), Gaps = 16/391 (4%)
 Frame = +3

Query: 234  LYPKVTTFTESNTPSQPPN-------FLQGVLDAIVKKEKWVLEDIRVSELDVKRDKYRS 392
            L+ +V T   +  PSQP          LQ VL  I  K+KW LE++R S+L+VK+ +  +
Sbjct: 14   LFIQVLTLAVALDPSQPDESNITATPILQDVLKEISVKQKWNLEEVRFSKLEVKKIRIGT 73

Query: 393  GLRYELQVRVGKAEIVLKMYDEVSEWKKLVAPWKNGTSDFEA--LARSIASKAVIDS-LK 563
              R+E+++R+GK+  V    DE+++W++       G SD E   L R + S  V+D  L 
Sbjct: 74   SRRFEIRIRLGKSRFVFIFPDEITDWRR-----SGGGSDVELQELVREVNSSKVLDPPLV 128

Query: 564  IEGPFELRVTGDDDQFSLMLPLNTSHYGLRRISVGEGITIEVKGAEEISFFHPSVNQL-- 737
            ++GPFEL V G+D + SL LP+N SH GL+R+ V EGI++E++ A+ +S FH S  +   
Sbjct: 129  LKGPFELLVDGND-RLSLSLPMNISHSGLKRVLVSEGISVEIREAQAVSLFHSSHRRYAA 187

Query: 738  ---PYNVLTWSNVGSIWHSLCTALLPIRILGSASVVAYRNQRPTALIQTAFSSTDSVELL 908
               P N+   S++ S W S+C  L PI+I+GSAS+VA+R    T  I+T++ S +++ L 
Sbjct: 188  TVDPVNIKEGSSLWSFWGSVCVPLPPIQIIGSASLVAFRTSNATTQIKTSYLSDEAIHLY 247

Query: 909  PDKCYIRPNYTKPWHLLSSLSN-RIALLENVLRSFINERGNIDAALGSVKTRIRPLTIFR 1085
             +KCY + +  +     + L   +I  LE VL S  N        + SV  +++   + R
Sbjct: 248  AEKCYYKAHTYRQHRFPNDLLGLKIHKLEKVLNSLGN---GTRQTVSSVTAKLKASGMVR 304

Query: 1086 FQLELERDIKINDTFWSTLAEWRTRPTIERVLFEVVARIEGEVLKPLVIKRVRPLMDTDS 1265
            FQLE+ER I  N++  S    WRT+P IERV FEV A+IEG+ LK + +++V P ++ D+
Sbjct: 305  FQLEIERSIGKNESVISKKVAWRTKPKIERVWFEVTAKIEGDKLKAVRLRKVVPFIEVDT 364

Query: 1266 FAWXXXXXXXXFTKFPSVLVPPEALTLDVKW 1358
             AW        FTKFPS+LVP EALTLDVKW
Sbjct: 365  EAWSSLMSNMSFTKFPSLLVPQEALTLDVKW 395


>ref|XP_006307471.1| hypothetical protein CARUB_v10009097mg [Capsella rubella]
            gi|482576182|gb|EOA40369.1| hypothetical protein
            CARUB_v10009097mg [Capsella rubella]
          Length = 454

 Score =  258 bits (659), Expect = 5e-66
 Identities = 153/396 (38%), Positives = 231/396 (58%), Gaps = 15/396 (3%)
 Frame = +3

Query: 216  FNLSHFLYPKVTTFTESNTPSQPPN-------FLQGVLDAIVKKEKWVLEDIRVSELDVK 374
            F++   L+ +  T   +  PSQP          LQ VL  I  K+KW LE++R  +L+VK
Sbjct: 68   FSVILSLFIQALTLAVALDPSQPDESNITAIPILQDVLKEISMKQKWNLEEVRFKKLEVK 127

Query: 375  RDKYRSGLRYELQVRVGKAEIVLKMYDEVSEWKKLVAPWKNGTSDFEA--LARSIASKAV 548
            + +   G R+E+++R+GK+  V    DEV++W +       G  D E   + R + S  V
Sbjct: 128  KLRIGVGRRFEIRIRLGKSRFVFVFPDEVTDWSR-----SGGGRDVELHEVVREVNSTKV 182

Query: 549  IDSLKIEGPFELRVTGDDDQFSLMLPLNTSHYGLRRISVGEGITIEVKGAEEISFFHPSV 728
            +D + ++GPFELRV GD  +FSL LP+N SH GL+R+ V EGI++E++GA+ +S FH S 
Sbjct: 183  LDPIVLKGPFELRVDGDS-RFSLALPMNISHSGLKRVLVSEGISVEIRGAQAVSLFHSSH 241

Query: 729  NQL-----PYNVLTWSNVGSIWHSLCTALLPIRILGSASVVAYRNQRPTALIQTAFSSTD 893
             +      P N+   + +     S+C  L PI+I+GSAS+VA+R +   + I+T++ S +
Sbjct: 242  RRYAATVDPVNIKEGNCLRLFRSSVCAPLPPIQIIGSASLVAFRTRNADSQIKTSYLSNE 301

Query: 894  SVELLPDKCYIRPN-YTKPWHLLSSLSNRIALLENVLRSFINERGNIDAALGSVKTRIRP 1070
            ++ L  +KCY + + Y +       L  +I  LE VL S  N        + SV  +++P
Sbjct: 302  AIHLHAEKCYYKAHTYRQHGFPTDLLGLKINKLEKVLSSLGN---GTRQTVTSVTAKLKP 358

Query: 1071 LTIFRFQLELERDIKINDTFWSTLAEWRTRPTIERVLFEVVARIEGEVLKPLVIKRVRPL 1250
              + RFQLE+ER I  N++  S   EWRT+P IERV FEV A++E + LK   +++V P 
Sbjct: 359  SGMVRFQLEIERSIGKNESVTSKKIEWRTKPKIERVWFEVTAKVERDKLKAAGMRKVVPF 418

Query: 1251 MDTDSFAWXXXXXXXXFTKFPSVLVPPEALTLDVKW 1358
            ++ D+ AW        FTKFPS+LVP EALTLDVKW
Sbjct: 419  IEVDTEAWSSMMSNMSFTKFPSLLVPQEALTLDVKW 454


>ref|XP_003589258.1| hypothetical protein MTR_1g021180 [Medicago truncatula]
            gi|355478306|gb|AES59509.1| hypothetical protein
            MTR_1g021180 [Medicago truncatula]
          Length = 451

 Score =  251 bits (641), Expect = 6e-64
 Identities = 161/423 (38%), Positives = 228/423 (53%), Gaps = 43/423 (10%)
 Frame = +3

Query: 258  TESNTPSQPPNFLQGVLDAIVKKEKWVLEDIRVSELDVKRDKYRSGLRYELQVRVGKAEI 437
            + S+T S   +  Q +L AI  ++KW L D+RV   DV + ++ +   Y  ++   K   
Sbjct: 29   SSSSTHSNITHIFQDILKAISSRQKWDLNDVRVFNFDVAKIRFGTSQNYLFRIGSSKNNF 88

Query: 438  VLKMYDEVSEWK--KLVAPWKNGTSDFEALARSIASKAVIDSLKIEGPFELRVTGDDDQF 611
             +K  DE+S W   K     K    D  +L   ++S A +D +K+EGPFELRV  +    
Sbjct: 89   TVKFSDEISSWNHNKFTTTPK---PDLASLVDQLSSIAFLDYIKLEGPFELRVH-ESHHL 144

Query: 612  SLMLP------------------------------------LNTSHYGLRRISVGEGITI 683
            SL LP                                    +N S+ GL+ I VG+GIT+
Sbjct: 145  SLSLPSSQITRGKRKLRKIIREIVKKDLEINEFDRRMIYDTMNVSYNGLKHIIVGKGITV 204

Query: 684  EVKGAEEISFFHPSVNQLPYN--VLTWSNVGSIW---HSLCTALLPIRILGSASVVAYRN 848
            EV+ A EISF++ S   L  N  V+  +     W    S+C  L+PIRI+GSAS++AY  
Sbjct: 205  EVRRAREISFYYQSDLDLQRNGSVICSNQKNEFWPFLQSMCVPLIPIRIIGSASLIAYVA 264

Query: 849  QRPTALIQTAFSSTDSVELLPDKCYIRPNYTKPWHLLSSLSNRIALLENVLRSFINERGN 1028
            + P   I TA  S D+VELLP+KCY    + K    ++SL+ R+ LLE +LRS +  +  
Sbjct: 265  RNPYVQIGTALISEDAVELLPEKCYHGCVFRKQACPVASLNLRLILLEKILRSLLGHKIL 324

Query: 1029 IDAALGSVKTRIRPLTIFRFQLELERDIKINDTFWSTLAEWRTRPTIERVLFEVVARIEG 1208
             D   G +K  I+     +F LELERD+  N T  STL +WRTRP++ERV FEV+AR+E 
Sbjct: 325  QDRLSGLIKANIKAYAGVKFPLELERDVGNNATL-STLPDWRTRPSVERVWFEVMARVED 383

Query: 1209 EVLKPLVIKRVRPLMDTDSFAWXXXXXXXXFTKFPSVLVPPEALTLDVKW*ESLCLLLKN 1388
              LKPL IK+V+P +++DS +W        +TK   VL+PPEALTLDVKW   L L+  N
Sbjct: 384  SRLKPLSIKKVKPFIESDSVSWANLMSNLSYTKLRPVLLPPEALTLDVKW---LLLVEPN 440

Query: 1389 SKI 1397
            S +
Sbjct: 441  SLV 443


>gb|EMJ26837.1| hypothetical protein PRUPE_ppa008077mg [Prunus persica]
          Length = 346

 Score =  204 bits (518), Expect = 1e-49
 Identities = 116/280 (41%), Positives = 166/280 (59%), Gaps = 12/280 (4%)
 Frame = +3

Query: 303  VLDAIVKKEKWVLEDIRVSELDVKRDKYRSGLRYELQVRVGKAEIVLKMYDEVSEWKKLV 482
            VL  I  K KW L+DIRVS LD  R ++ S  RYE +V  GK  + +   D+V+ WKK  
Sbjct: 69   VLKKISAKHKWYLQDIRVSRLDASRVRFGSAQRYEFRVGFGKIPVGVLFSDDVASWKKFR 128

Query: 483  APWKNGTSDFEALARSIASKAVIDSLKIEGPFELRVTGDDDQFSLMLPLNTSHYGLRRIS 662
             P     + F +L + ++S AV+D+ K+EGPFELRV G     SL LP+NT++ G +R+ 
Sbjct: 129  QP----RTHFGSLVKELSSMAVVDTFKVEGPFELRV-GGIHHLSLSLPMNTTYSGFKRVL 183

Query: 663  VGEGITIEVKGAEEISFFHPSVNQLPYNVLTWSNVGS------------IWHSLCTALLP 806
            VG+GIT+EV GA E+S FH S        L  S+ GS            IWHS CT L P
Sbjct: 184  VGKGITVEVSGATEVSVFHASD-------LGLSSKGSGAIGKEKSEFWPIWHSYCTPLFP 236

Query: 807  IRILGSASVVAYRNQRPTALIQTAFSSTDSVELLPDKCYIRPNYTKPWHLLSSLSNRIAL 986
            IR+LG A++VAY+ + P A I+T F S + +E LP+KCY    Y K    + SL  RI++
Sbjct: 237  IRVLGPATLVAYKTRNPDAYIETKFMSKEIIEFLPEKCYRSHAYKKRACPIDSLRLRISM 296

Query: 987  LENVLRSFINERGNIDAALGSVKTRIRPLTIFRFQLELER 1106
            LE++ +SF+ +R       G V+ +I+  T+ RF++++ R
Sbjct: 297  LESIWKSFLGDRIRQSGLSGFVEGKIKASTVVRFKIKVAR 336

