BLASTX nr result

ID: Ephedra26_contig00010792 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra26_contig00010792
         (2094 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EMJ22095.1| hypothetical protein PRUPE_ppa003056mg [Prunus pe...   741   0.0  
gb|EOY17057.1| Nitroreductase family protein isoform 1 [Theobrom...   739   0.0  
ref|XP_002274671.1| PREDICTED: uncharacterized protein LOC100243...   733   0.0  
ref|XP_002302529.2| nitroreductase family protein [Populus trich...   719   0.0  
ref|XP_002511685.1| oxidoreductase, putative [Ricinus communis] ...   717   0.0  
ref|XP_006347883.1| PREDICTED: uncharacterized protein LOC102602...   716   0.0  
ref|XP_004306936.1| PREDICTED: uncharacterized protein LOC101299...   712   0.0  
ref|XP_006445286.1| hypothetical protein CICLE_v10019231mg [Citr...   711   0.0  
gb|EXB99759.1| hypothetical protein L484_023290 [Morus notabilis]     707   0.0  
ref|XP_004229802.1| PREDICTED: uncharacterized protein LOC101249...   706   0.0  
ref|XP_003521959.2| PREDICTED: uncharacterized protein LOC100791...   699   0.0  
gb|ESW06843.1| hypothetical protein PHAVU_010G081400g [Phaseolus...   697   0.0  
ref|XP_004155780.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   696   0.0  
ref|XP_004133924.1| PREDICTED: uncharacterized protein LOC101216...   696   0.0  
ref|XP_002889341.1| hypothetical protein ARALYDRAFT_333455 [Arab...   689   0.0  
ref|XP_004506526.1| PREDICTED: uncharacterized protein LOC101496...   687   0.0  
gb|EOY17058.1| Nitroreductase family protein isoform 2 [Theobrom...   677   0.0  
ref|XP_006306995.1| hypothetical protein CARUB_v10008571mg [Caps...   675   0.0  
ref|NP_171704.2| nitroreductase family protein [Arabidopsis thal...   671   0.0  
gb|EPS65206.1| hypothetical protein M569_09571, partial [Genlise...   663   0.0  

>gb|EMJ22095.1| hypothetical protein PRUPE_ppa003056mg [Prunus persica]
          Length = 608

 Score =  741 bits (1914), Expect = 0.0
 Identities = 361/577 (62%), Positives = 439/577 (76%), Gaps = 12/577 (2%)
 Frame = +1

Query: 49   YHNKTKHFFTKYARGPHGLDWKNQPNPFRRYTNAPTVDLLHCPIDNS--------DIPYP 204
            YHN+TKH FTKYARGPHGLDW NQPNPFRRY +AP + LLH P +N         D  Y 
Sbjct: 40   YHNQTKHHFTKYARGPHGLDWANQPNPFRRYVSAPLLPLLHFPTENQNPNSSSTQDPLYS 99

Query: 205  QVFKGIPPPKPLNKATISQLFYDSLALSAWKTTGISTWSLRVNPSSGNLHPTEGYILSGP 384
             +F  +PPPKP++K+TISQ FYDSLALSAWKTTG STWSLRVNPSSGNLHPTE YI+S P
Sbjct: 100  SLFLNLPPPKPISKSTISQFFYDSLALSAWKTTGFSTWSLRVNPSSGNLHPTEAYIISPP 159

Query: 385  IDGVSDLPFLAHYSPKEHRLEVRAEIPSQIFGALVKGFPKGSFFVGLSSIFWRESWKYGE 564
            I+ +SD  F+AHY+PKEH LE+RAE+PS +F   +   PK SF +GLSSIFWRE+WKYGE
Sbjct: 160  IESLSDSSFVAHYAPKEHALELRAEVPSWVFTNFL---PKDSFLIGLSSIFWREAWKYGE 216

Query: 565  RAFRYCNHDVGHAIGAISMAAAVLGWDVRVVDELGHDEVGQLLGLVGSNKMDFEIPEQAV 744
            RAFRYCNHDVGHAI A+SMAAA LGWDV+++D LG++++ +L+GL    K  F+IP + V
Sbjct: 217  RAFRYCNHDVGHAIAAVSMAAAGLGWDVKLLDGLGYEDLEKLMGLERFPK--FQIPSRPV 274

Query: 745  RGYFPQLEKEHGDCLLVVFPSGSQGELNINPRDFASVASEFLGLEFRGRSNALSREHVCW 924
            +G FP++E EH DC+LVVFP+G+ GE ++N +  +   SEF  LE++G+ N LS+EH+CW
Sbjct: 275  KGRFPEMEFEHPDCILVVFPNGA-GEFDVNYKQLSLAISEFSKLEWKGKPNLLSKEHICW 333

Query: 925  DIIYRTANATKKPLADQASVLKITPLPEGVAVSEGSYK-FGVREVIRKRRSAVDMDPGVS 1101
            DIIYRTA A KK ++   +   + P       SEGSYK F  REV+RKRRSAVDMD   +
Sbjct: 334  DIIYRTAEAVKKEIS-LGNTFLVDPFQSSGICSEGSYKGFTAREVVRKRRSAVDMDGVTA 392

Query: 1102 IDRNTFYQILAKVLPSGIKEDQGEQSQ---IPFRALPWEVNIHLMIFVNRVAGLKPGLYF 1272
            +DRNTFYQIL   LPSG +   G+Q +   +PFR LPW+  +H  +FV+RV GL  GLYF
Sbjct: 393  MDRNTFYQILLHCLPSGSRNG-GKQKKPLALPFRGLPWDAEVHAALFVHRVEGLPQGLYF 451

Query: 1273 LVRNKRHFDALRKATRSEFQWAIPEGCPAGLPLYLLASGDCKDLAMKLSCHQEIAGNGCF 1452
            LVRN+ H D L+K+ RS F+W  PEGCP  LPLY L   DC+ LA +LSCHQEIA +GCF
Sbjct: 452  LVRNEDHLDKLKKSMRSGFKWMKPEGCPENLPLYELDRTDCRTLAERLSCHQEIASHGCF 511

Query: 1453 SLGMLAQFQNSLSDGHAWMYPRLFWEAGLLGQMLYLEAHAVGISATGIGCYFDDPVHSVL 1632
            SLGM+A F   L D + WMYPRLFWE G+LGQ+LYLEAHAVGISATGIGCYFDDPVH +L
Sbjct: 512  SLGMVACFDRLLHDKNMWMYPRLFWETGVLGQVLYLEAHAVGISATGIGCYFDDPVHELL 571

Query: 1633 GLSGNEFQSLYHFTVGAAVSDKRIMSLPAYPGPEIDS 1743
            GL G+ FQSLYHFTVG  V DKRIMSLPAYPGP++D+
Sbjct: 572  GLKGSNFQSLYHFTVGGPVVDKRIMSLPAYPGPDVDA 608


>gb|EOY17057.1| Nitroreductase family protein isoform 1 [Theobroma cacao]
          Length = 638

 Score =  739 bits (1908), Expect = 0.0
 Identities = 353/574 (61%), Positives = 436/574 (75%), Gaps = 6/574 (1%)
 Frame = +1

Query: 40   AVDYHNKTKHFFTKYARGPHGLDWKNQPNPFRRYTNAPTVDLLHCPIDNSDIP-----YP 204
            A+ YH++TKH FT YARGP GLDW NQPNPFRRY +AP + LLH P +   I      Y 
Sbjct: 71   ALKYHHQTKHSFTNYARGPRGLDWANQPNPFRRYISAPLIPLLHFPAEKQAITDDAPLYS 130

Query: 205  QVFKGIPPPKPLNKATISQLFYDSLALSAWKTTGISTWSLRVNPSSGNLHPTEGYILSGP 384
             +F  +PPPKP++++TISQLFYDSLALSAWKTTG STWSLRVNPSSGNLHPTE Y++S P
Sbjct: 131  SLFHSLPPPKPISQSTISQLFYDSLALSAWKTTGYSTWSLRVNPSSGNLHPTEAYLISPP 190

Query: 385  IDGVSDLPFLAHYSPKEHRLEVRAEIPSQIFGALVKGFPKGSFFVGLSSIFWRESWKYGE 564
            I  +SD PF+AHY+PKEH LEVRA IPS  F    K FP+ SF +G+SSIFWRE+WKYGE
Sbjct: 191  IQSLSDSPFVAHYAPKEHSLEVRATIPSGFFP---KFFPENSFLIGISSIFWREAWKYGE 247

Query: 565  RAFRYCNHDVGHAIGAISMAAAVLGWDVRVVDELGHDEVGQLLGLVGSNKMDFEIPEQAV 744
            RAFRYCNHDVGHAIGA++MAAA LGWDV+++D  G+D++ +L+GL      +F++P + +
Sbjct: 248  RAFRYCNHDVGHAIGAVAMAAATLGWDVKLLDGFGYDDLQKLMGL--DIFPEFKVPSRPI 305

Query: 745  RGYFPQLEKEHGDCLLVVFPSGSQGELNINPRDFASVASEFLGLEFRGRSNALSREHVCW 924
            +G FP +E EH DCLL+VFP+GS  + ++N ++ +S   EFL LE++G+ N+LSREHVCW
Sbjct: 306  KGKFPDIEFEHPDCLLLVFPNGSN-QFHVNYKELSSAVKEFLNLEWKGKPNSLSREHVCW 364

Query: 925  DIIYRTANATKKPLADQASVLKITPLPEGVAVSEGSYK-FGVREVIRKRRSAVDMDPGVS 1101
            DIIYRTA A KKPL  Q+    +         SE SYK   VREV+RKRRSAVDMD    
Sbjct: 365  DIIYRTAEAVKKPLTVQSGEFPVDQFQSSGICSENSYKGLTVREVVRKRRSAVDMDGVTV 424

Query: 1102 IDRNTFYQILAKVLPSGIKEDQGEQSQIPFRALPWEVNIHLMIFVNRVAGLKPGLYFLVR 1281
            ++R TFYQIL   +PSG       Q  +PFRAL W+  +H  +FV+RV GL  GLYFLVR
Sbjct: 425  MERETFYQILLHCVPSGNGGKHRRQLALPFRALSWDAEVHAALFVHRVVGLPKGLYFLVR 484

Query: 1282 NKRHFDALRKATRSEFQWAIPEGCPAGLPLYLLASGDCKDLAMKLSCHQEIAGNGCFSLG 1461
            N+ H + L++ATR EF W  P GCP  LPLY LA+ +C++LA +LSCHQ+IA +GCFSLG
Sbjct: 485  NEDHLEELKRATRPEFNWEKPAGCPDDLPLYELATDNCQELAKRLSCHQDIASDGCFSLG 544

Query: 1462 MLAQFQNSLSDGHAWMYPRLFWEAGLLGQMLYLEAHAVGISATGIGCYFDDPVHSVLGLS 1641
            M+A F+ +LSD  AWMYPRLFWE G+LGQ+LYLEAHAVGISATGIGC+FDDPVH +LG  
Sbjct: 545  MVAHFEPALSDNGAWMYPRLFWETGVLGQVLYLEAHAVGISATGIGCFFDDPVHELLGFR 604

Query: 1642 GNEFQSLYHFTVGAAVSDKRIMSLPAYPGPEIDS 1743
            G++FQSLYHFT+G  V DKRIMSLPAYPGP ID+
Sbjct: 605  GSKFQSLYHFTIGGPVLDKRIMSLPAYPGPGIDT 638


>ref|XP_002274671.1| PREDICTED: uncharacterized protein LOC100243840 [Vitis vinifera]
          Length = 586

 Score =  733 bits (1893), Expect = 0.0
 Identities = 365/579 (63%), Positives = 438/579 (75%), Gaps = 5/579 (0%)
 Frame = +1

Query: 22   EDSTRIAV-DYHNKTKHFFTKYARGPHGLDWKNQPNPFRRYTNAPTVDLLHCPIDN-SDI 195
            +DS    V  YHN+TKH FT YARGP GLDW NQP PFRR+ +AP V LLH P  N +  
Sbjct: 15   QDSNEAQVLKYHNQTKHSFTNYARGPRGLDWANQPKPFRRFDSAPLVPLLHPPPPNQTPP 74

Query: 196  PYPQVFKGIPPPKPLNKATISQLFYDSLALSAWKTTGISTWSLRVNPSSGNLHPTEGYIL 375
            PY  VF  +PPPKP++K+TISQLF+DSLA+SAWKTTG STWSLRVNPSSGNLHPTE YI+
Sbjct: 75   PYSSVFLNLPPPKPISKSTISQLFFDSLAISAWKTTGFSTWSLRVNPSSGNLHPTESYII 134

Query: 376  SGPIDGVSDLPFLAHYSPKEHRLEVRAEIPSQIFGALVKGFPKGSFFVGLSSIFWRESWK 555
            +  I+ VSD  F+AHY+PKEH LEVRAEI S   G L K FPKGSF +G SSIFWRE+WK
Sbjct: 135  APAIESVSDSAFVAHYAPKEHSLEVRAEISS---GFLPKFFPKGSFLIGFSSIFWREAWK 191

Query: 556  YGERAFRYCNHDVGHAIGAISMAAAVLGWDVRVVDELGHDEVGQLLGLVGSNKMDFEIPE 735
            YGERAFRYCNHDVGHAI A+SMAAA LGWDV+V+D LG++++ +L+GL      +FEIP 
Sbjct: 192  YGERAFRYCNHDVGHAIAAVSMAAAELGWDVKVLDGLGYEDLKKLMGL--EIFPEFEIPA 249

Query: 736  QAVRGYFPQLEKEHGDCLLVVFPSGSQGELNINPRDFASVASEFLGLEFRGRSNALSREH 915
            + V+G FP +E +H DC+LVVFP+G  GE N+N R+ +   S F  L+++G+ N LSREH
Sbjct: 250  RPVKGKFPVIEFDHPDCVLVVFPNGV-GEFNVNYRELSMAISRFSELKWKGKPNVLSREH 308

Query: 916  VCWDIIYRTANATKKPLADQASVLKITPLPEGVAVSEGSYK-FGVREVIRKRRSAVDMDP 1092
            +CWDIIYRTA A KKPL  +     I P       +E SYK   V EV+RKRRSAVDMD 
Sbjct: 309  ICWDIIYRTAEAVKKPLMIEHK-FSIDPFHSSRLFNESSYKNLTVSEVVRKRRSAVDMDG 367

Query: 1093 GVSIDRNTFYQILAKVLPSGIKED--QGEQSQIPFRALPWEVNIHLMIFVNRVAGLKPGL 1266
               + R+TFYQIL   LPSG +    QG Q  +PFR L W+  +H ++FV++VAGL  GL
Sbjct: 368  VHVMQRDTFYQILLHCLPSGSQNGGKQGRQLGLPFRVLSWDSEVHAVLFVHKVAGLPSGL 427

Query: 1267 YFLVRNKRHFDALRKATRSEFQWAIPEGCPAGLPLYLLASGDCKDLAMKLSCHQEIAGNG 1446
            YFLVRN+ HFD L+K TRS F+WA PEGCP  LPLY L  GD ++LA ++SCHQ+IAG+G
Sbjct: 428  YFLVRNEDHFDDLKKVTRSNFKWAKPEGCPDDLPLYELTRGDFQELAKRISCHQDIAGDG 487

Query: 1447 CFSLGMLAQFQNSLSDGHAWMYPRLFWEAGLLGQMLYLEAHAVGISATGIGCYFDDPVHS 1626
            CFSLGM+A F+ +L +  AWMYPRLFWE G+LGQ+LYLEAHAVGISATGIGCYFDD VH 
Sbjct: 488  CFSLGMVAHFEGTLQNKSAWMYPRLFWETGVLGQVLYLEAHAVGISATGIGCYFDDAVHE 547

Query: 1627 VLGLSGNEFQSLYHFTVGAAVSDKRIMSLPAYPGPEIDS 1743
            +LGL G+ FQSLYHFTVG  V DKRIMSLPAYPGP +DS
Sbjct: 548  LLGLRGSSFQSLYHFTVGGPVLDKRIMSLPAYPGPAVDS 586


>ref|XP_002302529.2| nitroreductase family protein [Populus trichocarpa]
            gi|550345028|gb|EEE81802.2| nitroreductase family protein
            [Populus trichocarpa]
          Length = 631

 Score =  719 bits (1856), Expect = 0.0
 Identities = 351/588 (59%), Positives = 433/588 (73%), Gaps = 13/588 (2%)
 Frame = +1

Query: 19   HEDSTRI---AVDYHNKTKHFFTKYARGPHGLDWKNQPNPFRRYTNAPTVDLLHCPIDNS 189
            H+D   +    + YHN+TKHFFT YARGPHGLDW NQPNPFRRY ++P + LLH P++N+
Sbjct: 51   HKDPENLIAQVLKYHNQTKHFFTNYARGPHGLDWANQPNPFRRYVSSPLLSLLHFPVENN 110

Query: 190  DIP-------YPQVFKGIPPPKPLNKATISQLFYDSLALSAWKTTGISTWSLRVNPSSGN 348
                      Y  +F  +P PKP++K++ISQLFYDSLALSAWKTTG STWSLRVNPSSGN
Sbjct: 111  QDSTSVSAPLYHSLFNSLPSPKPISKSSISQLFYDSLALSAWKTTGFSTWSLRVNPSSGN 170

Query: 349  LHPTEGYILSGPIDGVSDLPFLAHYSPKEHRLEVRAEIPSQIFGALVKGFPKGSFFVGLS 528
            LHPTE YI+S  +D V D  F+AHY+PKEH LE+RA+IP     +    FP  +F +G+S
Sbjct: 171  LHPTEAYIISPAVDSVCDSAFVAHYAPKEHSLELRAKIPDTFLPSF---FPSNAFLIGVS 227

Query: 529  SIFWRESWKYGERAFRYCNHDVGHAIGAISMAAAVLGWDVRVVDELGHDEVGQLLGLVGS 708
            SIFWRE+WKYGERAFRYCNHDVGHAI AIS+AAA LGWDV+++D LG  E+ +L+GL   
Sbjct: 228  SIFWREAWKYGERAFRYCNHDVGHAIAAISLAAAELGWDVKLLDGLGSKELERLMGL--G 285

Query: 709  NKMDFEIPEQAVRGYFPQLEKEHGDCLLVVFPSGSQGELNINPRDFASVASEFLGLEFRG 888
                F IP++ ++G FP++E EH DC+LVVFP+G   + N+N ++ +    EF  LE+ G
Sbjct: 286  IYQGFRIPDKPIKGKFPEIEFEHPDCVLVVFPNGVN-DFNVNYKELSLAIMEFGNLEWIG 344

Query: 889  RSNALSREHVCWDIIYRTANATKKPLADQASVLKITPLPEGVAVSEGSYK-FGVREVIRK 1065
              N+LS++HVCWD+IY TA A KKPL      L       GV  SEGSYK F  RE+IRK
Sbjct: 345  NPNSLSKKHVCWDVIYSTAEAVKKPLKIDDRFLVDKFQSSGVC-SEGSYKGFSAREIIRK 403

Query: 1066 RRSAVDMDPGVSIDRNTFYQILAKVLPSGIK--EDQGEQSQIPFRALPWEVNIHLMIFVN 1239
            RRSAVDMD    I+R+TFYQI+   LPSG    E Q  Q  +PFRAL W+  +H ++FV+
Sbjct: 404  RRSAVDMDGVTKIERDTFYQIMLHCLPSGCGSGEKQKRQLALPFRALSWDAEVHAVLFVH 463

Query: 1240 RVAGLKPGLYFLVRNKRHFDALRKATRSEFQWAIPEGCPAGLPLYLLASGDCKDLAMKLS 1419
            RV GL  GLYFLVRN+ H D L+K+TR+EF+W  PEGCP  LPLY LA  DC+ +A +LS
Sbjct: 464  RVVGLPKGLYFLVRNEDHLDELKKSTRAEFKWEKPEGCPVDLPLYELARSDCQQIAKQLS 523

Query: 1420 CHQEIAGNGCFSLGMLAQFQNSLSDGHAWMYPRLFWEAGLLGQMLYLEAHAVGISATGIG 1599
            CHQ+IA +GCFSLGM+A F+ +L    AWMYPRLFWE G+LGQ+LYLEAHAVGISATGIG
Sbjct: 524  CHQDIASDGCFSLGMVAHFEPTLHSKGAWMYPRLFWETGVLGQVLYLEAHAVGISATGIG 583

Query: 1600 CYFDDPVHSVLGLSGNEFQSLYHFTVGAAVSDKRIMSLPAYPGPEIDS 1743
            C+FDDPVH +LGL G+ FQSLYHFTVG  V DKRIM+LPAYPGP  D+
Sbjct: 584  CFFDDPVHEILGLRGSNFQSLYHFTVGGPVLDKRIMNLPAYPGPSTDA 631


>ref|XP_002511685.1| oxidoreductase, putative [Ricinus communis]
            gi|223548865|gb|EEF50354.1| oxidoreductase, putative
            [Ricinus communis]
          Length = 639

 Score =  717 bits (1850), Expect = 0.0
 Identities = 348/574 (60%), Positives = 429/574 (74%), Gaps = 9/574 (1%)
 Frame = +1

Query: 49   YHNKTKHFFTKYARGPHGLDWKNQPNPFRRYTNAPTVDLLHCPIDNSDIP------YPQV 210
            YHN+TKH FT YARGP GLDW NQPNPFRRY +AP + LLH P DN D        Y  V
Sbjct: 73   YHNQTKHSFTNYARGPRGLDWANQPNPFRRYISAPLLSLLHFPTDNQDPGVDSAPLYHSV 132

Query: 211  FKGIPPPKPLNKATISQLFYDSLALSAWKTTGISTWSLRVNPSSGNLHPTEGYILSGPID 390
            F  +P PK ++K++ISQLFY+SLALSAWKTTG STWSLRVNPSSGNLHPTE Y+++ PI+
Sbjct: 133  FNSLPSPKSISKSSISQLFYNSLALSAWKTTGFSTWSLRVNPSSGNLHPTEAYLIAPPIE 192

Query: 391  GVSDLPFLAHYSPKEHRLEVRAEIPSQIFGALVKGFPKGSFFVGLSSIFWRESWKYGERA 570
             +SD  F++HY+PKEH LE+RA IPS  F    K FP+ SF +G+SSIFWRE+WKYGERA
Sbjct: 193  SISDSAFVSHYAPKEHSLELRATIPSNFFP---KYFPRNSFLIGISSIFWREAWKYGERA 249

Query: 571  FRYCNHDVGHAIGAISMAAAVLGWDVRVVDELGHDEVGQLLGLVGSNKMDFEIPEQAVRG 750
            FRYCNHDVGHAI AISMAAA LGWDV+++D LG  E+ +L+GL       F+IP+  ++G
Sbjct: 250  FRYCNHDVGHAIAAISMAAAGLGWDVKLLDGLGSKELERLMGL--EMYQGFQIPDNPIKG 307

Query: 751  YFPQLEKEHGDCLLVVFPSGSQGELNINPRDFASVASEFLGLEFRGRSNALSREHVCWDI 930
              P++E EH DCLL+VFP+G + + ++N ++ +S   EF  LE++G+ N+LS+EH+CWDI
Sbjct: 308  KMPEIEFEHPDCLLLVFPNGVK-DFDVNYKELSSAIMEFRNLEWKGKPNSLSKEHICWDI 366

Query: 931  IYRTANATKKPLADQASVLKITPLPEGVAVSEGSYK-FGVREVIRKRRSAVDMDPGVSID 1107
            IY+TA A KKP       L I P       SEGSYK F VRE++RKRRSAVDMD    ID
Sbjct: 367  IYKTAEAVKKPFT-LGDDLSIYPFQSSGVCSEGSYKSFTVREIVRKRRSAVDMDGVTEID 425

Query: 1108 RNTFYQILAKVLPSGIK--EDQGEQSQIPFRALPWEVNIHLMIFVNRVAGLKPGLYFLVR 1281
            R+TFYQIL   +PSG    E Q     +PFRAL W+  +H  +FV+RV  L  GLYFLVR
Sbjct: 426  RDTFYQILLHCVPSGSGSGERQKRLLALPFRALSWDAEVHAALFVHRVTRLSKGLYFLVR 485

Query: 1282 NKRHFDALRKATRSEFQWAIPEGCPAGLPLYLLASGDCKDLAMKLSCHQEIAGNGCFSLG 1461
            N+ H + L+KATR+ F W  PEGCP  LPLY LA GDC+ ++ +LSCHQ+IA +GCFSLG
Sbjct: 486  NEDHLNDLKKATRAGFTWEKPEGCPDDLPLYELAGGDCQQISKQLSCHQDIASDGCFSLG 545

Query: 1462 MLAQFQNSLSDGHAWMYPRLFWEAGLLGQMLYLEAHAVGISATGIGCYFDDPVHSVLGLS 1641
            M+A F+ +L +   WMYPRLFWE G+LGQ+LYLEAHA+GISATGIGC+FDDPVH +LGL 
Sbjct: 546  MVAHFEPTLRNKGVWMYPRLFWETGVLGQVLYLEAHAIGISATGIGCFFDDPVHEILGLR 605

Query: 1642 GNEFQSLYHFTVGAAVSDKRIMSLPAYPGPEIDS 1743
            G+ +QSLYHFTVG  V DKRIMSLPAYPGP ID+
Sbjct: 606  GSNYQSLYHFTVGGPVLDKRIMSLPAYPGPGIDA 639


>ref|XP_006347883.1| PREDICTED: uncharacterized protein LOC102602495 [Solanum tuberosum]
          Length = 620

 Score =  716 bits (1847), Expect = 0.0
 Identities = 347/585 (59%), Positives = 436/585 (74%), Gaps = 8/585 (1%)
 Frame = +1

Query: 13   QTHEDSTRIA-----VDYHNKTKHFFTKYARGPHGLDWKNQPNPFRRYTNAPTVDLLHCP 177
            Q  ED  + A     + YH +TKH FT YARGP GLDW NQPNPFRRY ++P + LLH P
Sbjct: 43   QQEEDEEKQASLAQVLKYHKETKHSFTNYARGPRGLDWANQPNPFRRYVSSPLIPLLHPP 102

Query: 178  IDNSDIPYPQVFKGIPPPKPLNKATISQLFYDSLALSAWKTTGISTWSLRVNPSSGNLHP 357
              +    Y  VFK +P PKP++ +TISQLFYDSLALSAWK+TG STWSLRVNPSSGNLHP
Sbjct: 103  YSDESPLYASVFKTLPFPKPISDSTISQLFYDSLALSAWKSTGFSTWSLRVNPSSGNLHP 162

Query: 358  TEGYILSGPIDGVSDLPFLAHYSPKEHRLEVRAEIPSQIFGALVKGFPKGSFFVGLSSIF 537
            TE YI+  P++ VSD  F+AHY+PKEH LE+RA+  S IF    + FP+ SF +GLSSIF
Sbjct: 163  TEAYIICPPVESVSDKGFVAHYAPKEHSLEIRAQFSSGIF---TRFFPENSFLIGLSSIF 219

Query: 538  WRESWKYGERAFRYCNHDVGHAIGAISMAAAVLGWDVRVVDELGHDEVGQLLGLVGSNKM 717
            WRE+WKYGERAFRYCNHDVGHAI A+SMAAA LGWDV+V+D LG++E+ +L G+   N  
Sbjct: 220  WREAWKYGERAFRYCNHDVGHAIAAVSMAAAGLGWDVKVLDGLGYEELEKLTGV--ENFP 277

Query: 718  DFEIPEQAVRGYFPQLEKEHGDCLLVVFPSGSQGELNINPRDFASVASEFLGLEFRGRSN 897
             F+IP + V+G  P++E E  DC+L+VFPSG   E  ++  + +   SEF GL+++G+ N
Sbjct: 278  KFKIPSRPVKGAMPEIEFELPDCVLLVFPSGLS-EFEVDYEELSCAISEFSGLDWKGKPN 336

Query: 898  ALSREHVCWDIIYRTANATKKPLADQASVLKITPLPEGVAVSEGSYK-FGVREVIRKRRS 1074
             LS+EH+CWDIIYRTA A KKPL   +++  + P       SE SYK   +REV+RKRRS
Sbjct: 337  VLSKEHICWDIIYRTAEAAKKPLT-MSNLSAVDPFQSSGTFSESSYKDLTLREVVRKRRS 395

Query: 1075 AVDMDPGVSIDRNTFYQILAKVLPSGIK--EDQGEQSQIPFRALPWEVNIHLMIFVNRVA 1248
            AVDMD   ++ + TFYQIL   +PSG    +    Q  +PFR+L W+  +H  +FV+R+ 
Sbjct: 396  AVDMDGSTAMSKETFYQILLHCMPSGSHGGKKHVRQLALPFRSLDWDSEVHAALFVHRIV 455

Query: 1249 GLKPGLYFLVRNKRHFDALRKATRSEFQWAIPEGCPAGLPLYLLASGDCKDLAMKLSCHQ 1428
            GL  GLYFLVRN+ H D L+KATR EF+W  P+GCP  LPLY LASGDC++L+ +LSCHQ
Sbjct: 456  GLPNGLYFLVRNESHLDDLKKATRDEFKWVKPDGCPDDLPLYELASGDCRELSKRLSCHQ 515

Query: 1429 EIAGNGCFSLGMLAQFQNSLSDGHAWMYPRLFWEAGLLGQMLYLEAHAVGISATGIGCYF 1608
            +IA +GCFSLGM+A F+ +L +  +WMYPRLFWE G+LGQ+LYLE+HAVGISATGIGC+F
Sbjct: 516  DIASDGCFSLGMIAHFEPTLRNKGSWMYPRLFWETGVLGQVLYLESHAVGISATGIGCFF 575

Query: 1609 DDPVHSVLGLSGNEFQSLYHFTVGAAVSDKRIMSLPAYPGPEIDS 1743
            DDPVH VLGL G++FQSLYHFTVG+ V DKRIMSLPAYPGP  D+
Sbjct: 576  DDPVHEVLGLKGSKFQSLYHFTVGSPVVDKRIMSLPAYPGPSDDA 620


>ref|XP_004306936.1| PREDICTED: uncharacterized protein LOC101299228 [Fragaria vesca
            subsp. vesca]
          Length = 621

 Score =  712 bits (1839), Expect = 0.0
 Identities = 344/566 (60%), Positives = 425/566 (75%), Gaps = 1/566 (0%)
 Frame = +1

Query: 49   YHNKTKHFFTKYARGPHGLDWKNQPNPFRRYTNAPTVDLLHCPIDNSDIPYPQVFKGIPP 228
            YH  TKH FT+YARGPHGLDW NQPNPFRRY ++P + LLH P  +    Y  +F  +PP
Sbjct: 64   YHTSTKHHFTRYARGPHGLDWANQPNPFRRYLSSPLLPLLH-PTSSPSPLYSSIFTSLPP 122

Query: 229  PKPLNKATISQLFYDSLALSAWKTTGISTWSLRVNPSSGNLHPTEGYILSGPIDGVSDLP 408
            P+P++ +T+SQL YDSL+LSAWK+T  STWSLRVNPSSGNLHPTE Y++S  I+ +SD  
Sbjct: 123  PQPISISTLSQLLYDSLSLSAWKSTPFSTWSLRVNPSSGNLHPTEAYVISPAINSLSDTA 182

Query: 409  FLAHYSPKEHRLEVRAEIPSQIFGALVKGFPKGSFFVGLSSIFWRESWKYGERAFRYCNH 588
            F+AHY+PKEH LE+RAEIPS +F  L+   P  SF +GLSSIFWRE+WKYGERAFRYCNH
Sbjct: 183  FVAHYAPKEHSLELRAEIPSWVFRDLL---PDDSFLIGLSSIFWREAWKYGERAFRYCNH 239

Query: 589  DVGHAIGAISMAAAVLGWDVRVVDELGHDEVGQLLGLVGSNKMDFEIPEQAVRGYFPQLE 768
            DVGHAIGA+++AAA LGWDV+++D LG++++ ++LG+       FEIP +AV+G FP++E
Sbjct: 240  DVGHAIGAVAVAAAELGWDVKILDGLGYEDLEKVLGV--GRDSHFEIPVRAVKGRFPEME 297

Query: 769  KEHGDCLLVVFPSGSQGELNINPRDFASVASEFLGLEFRGRSNALSREHVCWDIIYRTAN 948
             EH DC+++VFPSG+ G + ++          F GLE++G  N LS+EH+CWD+IYRTA 
Sbjct: 298  FEHPDCVMLVFPSGN-GRVEVDYEKLRLAVKGFEGLEWKGERNVLSKEHICWDLIYRTAE 356

Query: 949  ATKKPLADQASVLKITPLPEGVAVSEGSYK-FGVREVIRKRRSAVDMDPGVSIDRNTFYQ 1125
            A KK   D    L +         SEGSYK F VREV+RKRRSAVDMD    ++R+TFYQ
Sbjct: 357  AVKKE-RDLGEKLVVDEFRSSGCCSEGSYKGFTVREVVRKRRSAVDMDGVTVMERDTFYQ 415

Query: 1126 ILAKVLPSGIKEDQGEQSQIPFRALPWEVNIHLMIFVNRVAGLKPGLYFLVRNKRHFDAL 1305
            IL   LPSG   +Q  Q  +PFRAL W+  +H ++FV+RV GL  GLYFLVRN+ HFD L
Sbjct: 416  ILLHCLPSGSGGEQKRQLAMPFRALSWDAEVHAVLFVHRVKGLPEGLYFLVRNEDHFDKL 475

Query: 1306 RKATRSEFQWAIPEGCPAGLPLYLLASGDCKDLAMKLSCHQEIAGNGCFSLGMLAQFQNS 1485
            +K+ RS F+W  PEGCP  LPLY L   DC+ LA KLSCHQEIA +GCFSLGM+A F+  
Sbjct: 476  KKSMRSSFKWVKPEGCPEELPLYELHRIDCQALAEKLSCHQEIASHGCFSLGMVACFEPL 535

Query: 1486 LSDGHAWMYPRLFWEAGLLGQMLYLEAHAVGISATGIGCYFDDPVHSVLGLSGNEFQSLY 1665
            L D   WMYPRLFWE G+LGQ+LYLEAHA+GISATGIGCYFDDPVH +LGL G+ FQSLY
Sbjct: 536  LKDKKVWMYPRLFWETGVLGQVLYLEAHAIGISATGIGCYFDDPVHELLGLQGSNFQSLY 595

Query: 1666 HFTVGAAVSDKRIMSLPAYPGPEIDS 1743
            HFTVG  V DKRIMSLPAYPGP +D+
Sbjct: 596  HFTVGGPVIDKRIMSLPAYPGPNVDA 621


>ref|XP_006445286.1| hypothetical protein CICLE_v10019231mg [Citrus clementina]
            gi|568875625|ref|XP_006490893.1| PREDICTED:
            uncharacterized protein LOC102618582 [Citrus sinensis]
            gi|557547548|gb|ESR58526.1| hypothetical protein
            CICLE_v10019231mg [Citrus clementina]
          Length = 650

 Score =  711 bits (1836), Expect = 0.0
 Identities = 349/585 (59%), Positives = 430/585 (73%), Gaps = 20/585 (3%)
 Frame = +1

Query: 49   YHNKTKHFFTKYARGPHGLDWKNQPNPFRRYTNAPTVDLLHCPI---------------- 180
            YH++TKH FTKYARGPHGLDW NQPNPFRRY +AP + L+H P                 
Sbjct: 72   YHDQTKHSFTKYARGPHGLDWANQPNPFRRYISAPLLPLMHLPNRTDHRTQTPSSLSNYN 131

Query: 181  -DNSDIPYPQVFKGIPPPKPLNKATISQLFYDSLALSAWKTTGISTWSLRVNPSSGNLHP 357
             DN+ + Y  +F  +PPP+PL  ++ISQLFYDSLALSAWKTTG STWSLRVNPSSGNLHP
Sbjct: 132  HDNAPL-YSSLFTSLPPPQPLTVSSISQLFYDSLALSAWKTTGYSTWSLRVNPSSGNLHP 190

Query: 358  TEGYILSGPIDGVSDLPFLAHYSPKEHRLEVRAEIPSQIFGALVKGFPKGSFFVGLSSIF 537
            TE YI++  I+ + D PF+AHY+PKEH LE+RA+IPS+ F      FPK SF VG SSIF
Sbjct: 191  TEAYIIAPAIESLCDSPFVAHYAPKEHALELRAKIPSR-FDLFNNFFPKNSFLVGFSSIF 249

Query: 538  WRESWKYGERAFRYCNHDVGHAIGAISMAAAVLGWDVRVVDELGHDEVGQLLGLVGSNKM 717
            WRE+WKYGERAFRYCNHDVGHAI A++MAAA LGWDV++++ +G+ E+ +L+GL      
Sbjct: 250  WREAWKYGERAFRYCNHDVGHAIAAVAMAAAELGWDVKILEGMGYKELKKLMGL--DIFP 307

Query: 718  DFEIPEQAVRGYFPQLEKEHGDCLLVVFPSGSQGELNINPRDFASVASEFLGLEFRGRSN 897
            +F IP + ++G  P++E EH DC+LVVFPSG+ G  ++N      +  EF  L+++G+ N
Sbjct: 308  EFVIPSKPIKGKIPEIEFEHPDCVLVVFPSGATG-FDVNYEKLRLLMEEFSALDWKGKPN 366

Query: 898  ALSREHVCWDIIYRTANATKKPLADQASVLKITPLPEGVAVSEGSYK-FGVREVIRKRRS 1074
             LS+EH CWDIIY TA   KKPL  + +   + P       SE SYK F VREV+RKRRS
Sbjct: 367  LLSKEHFCWDIIYSTAEVVKKPLTIR-NAFSVDPFSSSGVCSESSYKGFTVREVVRKRRS 425

Query: 1075 AVDMDPGVSIDRNTFYQILAKVLPSGIK--EDQGEQSQIPFRALPWEVNIHLMIFVNRVA 1248
            AVDMD   +IDR TFYQI+   LPSG +  E Q  Q  +P+R L W+  +H  +F++RV 
Sbjct: 426  AVDMDGVTAIDRETFYQIMLHCLPSGSRSREKQKRQLALPYRVLSWDAEVHAALFIHRVK 485

Query: 1249 GLKPGLYFLVRNKRHFDALRKATRSEFQWAIPEGCPAGLPLYLLASGDCKDLAMKLSCHQ 1428
            GL  GLYFLVRN+ H   L+KA RS F W  PEGCP  LPLY LA GDC+ LA  LSCHQ
Sbjct: 486  GLPKGLYFLVRNEDHLGELKKAVRSGFVWEKPEGCPRDLPLYELARGDCQQLAKGLSCHQ 545

Query: 1429 EIAGNGCFSLGMLAQFQNSLSDGHAWMYPRLFWEAGLLGQMLYLEAHAVGISATGIGCYF 1608
            +IAG+GCFSLGM+A F+ +LS+ + WMYPRLFWE G+LGQ+LYLEAHAVGISATGIGC+F
Sbjct: 546  DIAGDGCFSLGMVAHFEPTLSNKNVWMYPRLFWETGVLGQVLYLEAHAVGISATGIGCFF 605

Query: 1609 DDPVHSVLGLSGNEFQSLYHFTVGAAVSDKRIMSLPAYPGPEIDS 1743
            DDPVH VLGL+G++FQSLYHFTVG  V D+RIMSLPAYPGP ID+
Sbjct: 606  DDPVHEVLGLTGSKFQSLYHFTVGGPVVDRRIMSLPAYPGPNIDA 650


>gb|EXB99759.1| hypothetical protein L484_023290 [Morus notabilis]
          Length = 607

 Score =  707 bits (1824), Expect = 0.0
 Identities = 352/594 (59%), Positives = 430/594 (72%), Gaps = 15/594 (2%)
 Frame = +1

Query: 7    SSQTHEDSTRIAVDYHNKTKHFFTKYARGPHGLDWKNQPNPFRRYTNAPTVDLLHCP--- 177
            SS+    +    + YH++TKH FTKYARGPHGLDW NQPNPFRR+ ++P + LLH     
Sbjct: 21   SSENDNQNLSQILHYHDQTKHAFTKYARGPHGLDWANQPNPFRRFLSSPLLPLLHLSTPD 80

Query: 178  -------IDNSDIP-YPQVFKGIPPPKPLNKATISQLFYDSLALSAWKTTGISTWSLRVN 333
                   ID +  P Y  VF  +PPPKPL+K  ISQ  YDSLALSAWKTTG STWSLRVN
Sbjct: 81   QTPSSSAIDGAQAPLYHSVFLSLPPPKPLSKPAISQFLYDSLALSAWKTTGFSTWSLRVN 140

Query: 334  PSSGNLHPTEGYILSGPIDGVSDLPFLAHYSPKEHRLEVRAEIPSQIFGALVKGFPKGSF 513
            PSSGNLHPTE YI++ PI+ +S+  F+AHY+PKEH LE+RAE+P+  F    K FP+ +F
Sbjct: 141  PSSGNLHPTEAYIVALPIESLSNSGFVAHYAPKEHGLEIRAEVPAGFFA---KFFPENAF 197

Query: 514  FVGLSSIFWRESWKYGERAFRYCNHDVGHAIGAISMAAAVLGWDVRVVDELGHDEVGQLL 693
             VGLSSIFWRE+WKYGERAFRYCNHDVGHAIGA++M+AA LGWDV+V+D LG++++ +L+
Sbjct: 198  LVGLSSIFWREAWKYGERAFRYCNHDVGHAIGAVAMSAASLGWDVKVLDGLGYEDMKKLM 257

Query: 694  GLVGSNKMDFEIPEQAVRGYFPQLEKEHGDCLLVVFPSGSQGELNINPRDFASVASEFLG 873
            GL      +F IP + VRG  P++E EH DC+L VFPSG   E  +N  + + V SEF  
Sbjct: 258  GL--DKFPEFRIPSRPVRGKIPEIEFEHPDCVLAVFPSGIT-EFGLNYEELSKVISEFSS 314

Query: 874  LEFRGRSNALSREHVCWDIIYRTANATKKPL-ADQASVLKITPLPEGVAVSEGSYK-FGV 1047
             E++G  N LS+EHVCWDIIYRTA A KKP+  D      + P       S  +YK + V
Sbjct: 315  FEWKGNPNLLSKEHVCWDIIYRTAEAVKKPIDIDNKDRFFVDPFVSSGLFSVNAYKGYTV 374

Query: 1048 REVIRKRRSAVDMDPGVSIDRNTFYQILAKVLPSGIKEDQGEQSQI--PFRALPWEVNIH 1221
            RE++RKRRSAVDMD   ++ RNTFYQIL   LPSG    +G++  +  PFRAL W+  +H
Sbjct: 375  REIVRKRRSAVDMDGVTAMQRNTFYQILLHCLPSGCGTGEGQKQPLALPFRALSWDAEVH 434

Query: 1222 LMIFVNRVAGLKPGLYFLVRNKRHFDALRKATRSEFQWAIPEGCPAGLPLYLLASGDCKD 1401
              +FV+RV GL  GLYFLVRN+ HF  L+K+ R  F+W  PEGCP  LPLY L  GD + 
Sbjct: 435  AALFVHRVVGLPQGLYFLVRNEEHFGELKKSMRPGFKWTKPEGCPDELPLYELDRGDYRL 494

Query: 1402 LAMKLSCHQEIAGNGCFSLGMLAQFQNSLSDGHAWMYPRLFWEAGLLGQMLYLEAHAVGI 1581
            L+ +LSCHQEIA +GCFSLGM+A F+  L +  AWMYPRLFWE G+LGQ+LYLEAHA GI
Sbjct: 495  LSQRLSCHQEIASDGCFSLGMVAHFEPVLRE-KAWMYPRLFWETGVLGQVLYLEAHAAGI 553

Query: 1582 SATGIGCYFDDPVHSVLGLSGNEFQSLYHFTVGAAVSDKRIMSLPAYPGPEIDS 1743
            SATGIGCYFDDPVH VLGL G+ FQSLYHFTVG  V DKRIMSLPAYPGP ID+
Sbjct: 554  SATGIGCYFDDPVHEVLGLKGSNFQSLYHFTVGGPVLDKRIMSLPAYPGPGIDA 607


>ref|XP_004229802.1| PREDICTED: uncharacterized protein LOC101249301 [Solanum
            lycopersicum]
          Length = 617

 Score =  706 bits (1823), Expect = 0.0
 Identities = 343/585 (58%), Positives = 432/585 (73%), Gaps = 8/585 (1%)
 Frame = +1

Query: 13   QTHEDSTRIA-----VDYHNKTKHFFTKYARGPHGLDWKNQPNPFRRYTNAPTVDLLHCP 177
            Q  ED  + A     + YH +TKH F  YARGP GLDW NQPNPFRRY ++P + LLH P
Sbjct: 40   QQEEDEEKQASLAQVLKYHKETKHSFNNYARGPRGLDWANQPNPFRRYVSSPLISLLHPP 99

Query: 178  IDNSDIPYPQVFKGIPPPKPLNKATISQLFYDSLALSAWKTTGISTWSLRVNPSSGNLHP 357
              +    Y  +FK +P PKPL+ +TISQLFYDSLALSAWK+TG STWSLRVNPSSGNLHP
Sbjct: 100  YSDESPLYSSLFKTLPFPKPLSDSTISQLFYDSLALSAWKSTGFSTWSLRVNPSSGNLHP 159

Query: 358  TEGYILSGPIDGVSDLPFLAHYSPKEHRLEVRAEIPSQIFGALVKGFPKGSFFVGLSSIF 537
            TE  I+  P++ VSD  F+AHY+PKEH LE+RA+  S IF    + FP+ SF +GLSSIF
Sbjct: 160  TEACIICPPVESVSDKGFVAHYAPKEHSLEIRAQFSSGIF---TRFFPENSFLIGLSSIF 216

Query: 538  WRESWKYGERAFRYCNHDVGHAIGAISMAAAVLGWDVRVVDELGHDEVGQLLGLVGSNKM 717
            WRE+WKYGERAFRYCNHDVGHAI A+SMAAA LGWDV+V+D LG++E+ +L G+   N  
Sbjct: 217  WREAWKYGERAFRYCNHDVGHAIAAVSMAAAGLGWDVKVLDGLGYEELEKLTGV--ENFP 274

Query: 718  DFEIPEQAVRGYFPQLEKEHGDCLLVVFPSGSQGELNINPRDFASVASEFLGLEFRGRSN 897
             F+IP + V+G  P++E EH DC+L+VFPSG   E  +  ++ +   S+F GL+++G+ N
Sbjct: 275  KFKIPSRPVKGSMPEIEFEHPDCVLLVFPSGLS-EFKVAYKELSCAISDFSGLDWKGKPN 333

Query: 898  ALSREHVCWDIIYRTANATKKPLADQASVLKITPLPEGVAVSEGSYK-FGVREVIRKRRS 1074
             LS+EH+CWDIIYRTA A KKPL   +++  + P       SE SYK   +RE++RKRRS
Sbjct: 334  VLSKEHICWDIIYRTAEAAKKPLT-MSNLSVVDPFQSSGTFSESSYKDLSLRELVRKRRS 392

Query: 1075 AVDMDPGVSIDRNTFYQILAKVLPSGIK--EDQGEQSQIPFRALPWEVNIHLMIFVNRVA 1248
            AVDMD    + + TFYQIL   +PSG    +    Q  +PFR+L W+  +H  +FV+RV 
Sbjct: 393  AVDMDGSTVMSKETFYQILLHCVPSGSHGGKKHVRQLTLPFRSLAWDSEVHAALFVHRVV 452

Query: 1249 GLKPGLYFLVRNKRHFDALRKATRSEFQWAIPEGCPAGLPLYLLASGDCKDLAMKLSCHQ 1428
            GL  GLYFLVRN+ H D L+K TR+EF+W  P+GCP  LPLY LASGDC++L+ +LSCHQ
Sbjct: 453  GLPKGLYFLVRNENHLDDLKKDTRAEFKWVKPDGCPDDLPLYELASGDCRELSKRLSCHQ 512

Query: 1429 EIAGNGCFSLGMLAQFQNSLSDGHAWMYPRLFWEAGLLGQMLYLEAHAVGISATGIGCYF 1608
            +IA +GCFSLGM+A F+ +L +  +WMYPRLFWE G+LGQ+LYLE+HAVGISATGIGC+F
Sbjct: 513  DIASDGCFSLGMIAHFEPTLRNKGSWMYPRLFWETGVLGQVLYLESHAVGISATGIGCFF 572

Query: 1609 DDPVHSVLGLSGNEFQSLYHFTVGAAVSDKRIMSLPAYPGPEIDS 1743
            DDPVH VLGL G++FQSLYHFTVG  V DKRIMSLPAYPG   D+
Sbjct: 573  DDPVHEVLGLKGSKFQSLYHFTVGGPVVDKRIMSLPAYPGTSDDA 617


>ref|XP_003521959.2| PREDICTED: uncharacterized protein LOC100791269 [Glycine max]
          Length = 647

 Score =  699 bits (1803), Expect = 0.0
 Identities = 343/583 (58%), Positives = 430/583 (73%), Gaps = 8/583 (1%)
 Frame = +1

Query: 19   HEDSTRIAVDYHNKTKHFFTKYARGPHGLDWKNQPNPFRRYTNAPTVDLLHCPIDNSDIP 198
            HE      + YHN+TKH FT YARGPH LDW NQPNPFRRY ++P + LLH   +   + 
Sbjct: 72   HEHKLSHVLKYHNQTKHSFTNYARGPHNLDWANQPNPFRRYLSSPLLPLLHSEPNPQTLT 131

Query: 199  ----YPQVFKGIPPPKP-LNKATISQLFYDSLALSAWKTTGISTWSLRVNPSSGNLHPTE 363
                Y  +F  +P P P ++K+T+SQL +DSL+LSAWK+TG STWSLRVNPSSGNLHPTE
Sbjct: 132  LTPLYHSLFLSLPSPHPPVSKSTLSQLLFDSLSLSAWKSTGFSTWSLRVNPSSGNLHPTE 191

Query: 364  GYILSGPIDGVSDLPFLAHYSPKEHRLEVRAEIPSQIFGALVKGFPKGSFFVGLSSIFWR 543
             Y+++ PI GVSD  F+AHY+PKEH LE+RAEIPS  F    K FP  SF VGLSS+FWR
Sbjct: 192  AYVVAPPIPGVSDSAFVAHYAPKEHSLELRAEIPSGFFP---KFFPPNSFLVGLSSVFWR 248

Query: 544  ESWKYGERAFRYCNHDVGHAIGAISMAAAVLGWDVRVVDELGHDEVGQLLGLVGSNKMDF 723
            E+WKYGERAFRYCNHDVGHAIGA++MAAA LGWDV+V+D LG +E+  L+GL      +F
Sbjct: 249  EAWKYGERAFRYCNHDVGHAIGAVAMAAAGLGWDVKVLDSLGCEELKSLMGL--HVFPEF 306

Query: 724  EIPEQAVRGYFPQLEKEHGDCLLVVFPSGSQGELNINPRDFASVASEFLGLEFRGRSNAL 903
            EIP +AVRG  P++E EH DC+++V+PSG  G  ++N ++ +     F  L+++G+ N+L
Sbjct: 307  EIPSRAVRGKIPEIEFEHPDCVMLVYPSGVGG-FDVNWKELSEAILGFDKLDWKGKPNSL 365

Query: 904  SREHVCWDIIYRTANATKKPLADQASVLKITPLPEGVAVSEGSYK-FGVREVIRKRRSAV 1080
            S+EHVCW++IYRTA A KKPL      L + PL       E +YK   VREV+RKRRSAV
Sbjct: 366  SKEHVCWEVIYRTAEAVKKPLTLGERFL-VEPLQRSGVCGESAYKGLTVREVVRKRRSAV 424

Query: 1081 DMDPGVSIDRNTFYQILAKVLPSGIKED--QGEQSQIPFRALPWEVNIHLMIFVNRVAGL 1254
            DMD    I+R+ FYQIL   LPSG +    QG +  +PFRALPW+  +H  +FV+RV GL
Sbjct: 425  DMDGVTEIERDAFYQILLHCLPSGCQGGGRQGRELALPFRALPWDAEVHAALFVHRVVGL 484

Query: 1255 KPGLYFLVRNKRHFDALRKATRSEFQWAIPEGCPAGLPLYLLASGDCKDLAMKLSCHQEI 1434
              GLYFLVRN+ HFD L+KA   +F W  PEGCP  LPLY L   DC+ L+ +LSCHQ+I
Sbjct: 485  PQGLYFLVRNEDHFDKLKKAMLPDFLWTKPEGCPDDLPLYELLRLDCRQLSKQLSCHQDI 544

Query: 1435 AGNGCFSLGMLAQFQNSLSDGHAWMYPRLFWEAGLLGQMLYLEAHAVGISATGIGCYFDD 1614
            A +GCFSLGMLA+ + +L + + WMYPRLFWE G+LGQ+LYLEAHA+GISATGIGC+FD+
Sbjct: 545  ASDGCFSLGMLARMEPTLREKNVWMYPRLFWETGVLGQVLYLEAHAIGISATGIGCFFDN 604

Query: 1615 PVHSVLGLSGNEFQSLYHFTVGAAVSDKRIMSLPAYPGPEIDS 1743
            PVH +LGL G+ FQSLYHFTVG  V DKRIMSLPAYPGP++D+
Sbjct: 605  PVHQLLGLKGSTFQSLYHFTVGGPVLDKRIMSLPAYPGPDVDA 647


>gb|ESW06843.1| hypothetical protein PHAVU_010G081400g [Phaseolus vulgaris]
            gi|561007895|gb|ESW06844.1| hypothetical protein
            PHAVU_010G081400g [Phaseolus vulgaris]
          Length = 642

 Score =  697 bits (1799), Expect = 0.0
 Identities = 338/568 (59%), Positives = 418/568 (73%), Gaps = 3/568 (0%)
 Frame = +1

Query: 49   YHNKTKHFFTKYARGPHGLDWKNQPNPFRRYTNAPTVDLLHCPIDNSDIPYPQVFKGIPP 228
            YHN+TKH F  +ARGPHGLDW NQPNPFRRY ++P + LLH         Y  +F  +P 
Sbjct: 82   YHNQTKHNFNHFARGPHGLDWANQPNPFRRYLSSPLISLLHPQPPYLPPLYHSLFLSLPS 141

Query: 229  PKPLNKATISQLFYDSLALSAWKTTGISTWSLRVNPSSGNLHPTEGYILSGPIDGVSDLP 408
            P P++++T+SQ  +DSLALSAWKTT  STWSLRVNPSSGNLHPTE YI++ PI  +SD  
Sbjct: 142  PHPISQSTVSQFLFDSLALSAWKTTSFSTWSLRVNPSSGNLHPTEAYIVAPPIPSISDSA 201

Query: 409  FLAHYSPKEHRLEVRAEIPSQIFGALVKGFPKGSFFVGLSSIFWRESWKYGERAFRYCNH 588
            F+AHY+PKEH LE+RA+IPS  F    K FP  SF VGLSS+FWRE+WKYGERAFRYCNH
Sbjct: 202  FVAHYAPKEHALELRAQIPSGFFP---KYFPPNSFLVGLSSVFWREAWKYGERAFRYCNH 258

Query: 589  DVGHAIGAISMAAAVLGWDVRVVDELGHDEVGQLLGLVGSNKMDFEIPEQAVRGYFPQLE 768
            DVGHAIGA++MAAA LGWDV+V+D LG +E+  L+GL      DFEIP +AVRG  P++E
Sbjct: 259  DVGHAIGAVAMAAAGLGWDVKVLDSLGCEELKSLMGL--HVFPDFEIPSRAVRGKIPEIE 316

Query: 769  KEHGDCLLVVFPSGSQGELNINPRDFASVASEFLGLEFRGRSNALSREHVCWDIIYRTAN 948
             EH DC+++V+PSG  G  ++N ++ +     F  L+++G+ N+LS+EHVCWD+IYRTA 
Sbjct: 317  FEHPDCVMLVYPSGVGG-FDVNWKELSEAILGFDKLDWKGKPNSLSKEHVCWDVIYRTAE 375

Query: 949  ATKKPLADQASVLKITPLPEGVAVSEGSYK-FGVREVIRKRRSAVDMDPGVSIDRNTFYQ 1125
            A KKPL        + P        EG Y    VREV+R RRSAVDMD    I+R+ FYQ
Sbjct: 376  AVKKPLT-LGDKFSVEPFQRSGVCGEGLYNGLTVREVVRNRRSAVDMDGVTEIERDAFYQ 434

Query: 1126 ILAKVLPSGIKED--QGEQSQIPFRALPWEVNIHLMIFVNRVAGLKPGLYFLVRNKRHFD 1299
            IL   LPSG +    Q  Q  +PFRALPW+V +H  +FV+RV GL  GLYFLVRN+ +FD
Sbjct: 435  ILLHCLPSGCQSGGRQRRQLALPFRALPWDVEVHAALFVHRVVGLPQGLYFLVRNENNFD 494

Query: 1300 ALRKATRSEFQWAIPEGCPAGLPLYLLASGDCKDLAMKLSCHQEIAGNGCFSLGMLAQFQ 1479
             L+KA   +F W  PEGCP  LPLY L   DC+ LA KLSCHQ+IA +GCFSLGMLA+ +
Sbjct: 495  ELKKAMLPDFLWTKPEGCPDELPLYELLRSDCRPLAKKLSCHQDIASDGCFSLGMLARME 554

Query: 1480 NSLSDGHAWMYPRLFWEAGLLGQMLYLEAHAVGISATGIGCYFDDPVHSVLGLSGNEFQS 1659
             +L + + WMYPRLFWE G+LGQ+LYLEAHA+GISATGIGC+FDDPVH +LGL G+ FQS
Sbjct: 555  PTLCEKNVWMYPRLFWETGVLGQVLYLEAHAIGISATGIGCFFDDPVHQLLGLKGSTFQS 614

Query: 1660 LYHFTVGAAVSDKRIMSLPAYPGPEIDS 1743
            LYHFTVG+ V DKRIMSLP YPGP++D+
Sbjct: 615  LYHFTVGSPVLDKRIMSLPTYPGPDVDA 642


>ref|XP_004155780.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101228460
            [Cucumis sativus]
          Length = 599

 Score =  696 bits (1796), Expect = 0.0
 Identities = 337/578 (58%), Positives = 427/578 (73%), Gaps = 13/578 (2%)
 Frame = +1

Query: 49   YHNKTKHFFTKYARGPHGLDWKNQPNPFRRYTNAPTVDLLHCPIDNSDIP---------- 198
            YH++TKH F+ YARGPHGLDW NQPNPFRRY +AP + L H PI N              
Sbjct: 29   YHSQTKHGFSNYARGPHGLDWANQPNPFRRYISAPLLPLSHFPILNQTAASDDETHEASL 88

Query: 199  YPQVFKGIPPPKPLNKATISQLFYDSLALSAWKTTGISTWSLRVNPSSGNLHPTEGYILS 378
            Y  +F  +PPPKP+ KATISQ FYDSLALSAWK+TG STWSLRVNPSSGNLHPTE Y+++
Sbjct: 89   YDSLFVSLPPPKPVCKATISQFFYDSLALSAWKSTGFSTWSLRVNPSSGNLHPTEAYLIA 148

Query: 379  GPIDGVSDLPFLAHYSPKEHRLEVRAEIPSQIFGALVKGFPKGSFFVGLSSIFWRESWKY 558
             P+  +SD  F+AHY+PKEH LE+R +IP   F    K FP+ SF +GLSSIFWRE+WKY
Sbjct: 149  PPVTSLSDYGFVAHYAPKEHALEIRTQIPPGFFS---KFFPENSFLIGLSSIFWREAWKY 205

Query: 559  GERAFRYCNHDVGHAIGAISMAAAVLGWDVRVVDELGHDEVGQLLGLVGSNKMDFEIPEQ 738
            GERAFRYCNHDVGHAI A++MAAA LGWDV+V+D LG+ ++ +L+GL      +FEIP Q
Sbjct: 206  GERAFRYCNHDVGHAIAAVAMAAAGLGWDVKVLDGLGYADLKKLMGL--HTFPEFEIPSQ 263

Query: 739  AVRGYFPQLEKEHGDCLLVVFPSGSQGELNINPRDFASVASEFLGLEFRGRSNALSREHV 918
             V+G FP +E EH DC+L VFPSG+  + ++N  + +S   +F  L+++G+ N LS++H+
Sbjct: 264  PVKGSFPVIEFEHPDCVLAVFPSGT-ADFSMNYEELSSAVLKFSELDWKGKXNLLSKQHI 322

Query: 919  CWDIIYRTANATKKPLADQASVLKITPLPEGVAVSEGSYK-FGVREVIRKRRSAVDMDPG 1095
            CWDIIYRTA A +KPL  ++  L + P      + E  YK F  REV+RKRRSAVDMD  
Sbjct: 323  CWDIIYRTAMAVEKPLTGESGSL-VEPFQSSGVLGERPYKGFTWREVVRKRRSAVDMDGV 381

Query: 1096 VSIDRNTFYQILAKVLPSGIKEDQGEQSQI--PFRALPWEVNIHLMIFVNRVAGLKPGLY 1269
             ++ R+TFYQIL   +PSG  E + ++ ++  PFRALPW+  +H  +FV+RV GL  GLY
Sbjct: 382  TTMARDTFYQILLHCVPSGSIEGERQRRELALPFRALPWDAEVHAALFVHRVVGLPQGLY 441

Query: 1270 FLVRNKRHFDALRKATRSEFQWAIPEGCPAGLPLYLLASGDCKDLAMKLSCHQEIAGNGC 1449
            FLVRN+ HFD L+KAT  +F+W  P+GCP+ LPLY L  G+ + L+ +LSCHQ+IA +GC
Sbjct: 442  FLVRNEDHFDELKKATNPDFKWVKPDGCPSSLPLYELRRGNYQTLSKRLSCHQDIASDGC 501

Query: 1450 FSLGMLAQFQNSLSDGHAWMYPRLFWEAGLLGQMLYLEAHAVGISATGIGCYFDDPVHSV 1629
            FSLGM+A ++ +L +    MYPRLFWE G++GQ+LYLEAHAV ISATGIGC+FDDPVH  
Sbjct: 502  FSLGMIAHYEPTLREKGVHMYPRLFWETGVIGQVLYLEAHAVDISATGIGCFFDDPVHEA 561

Query: 1630 LGLSGNEFQSLYHFTVGAAVSDKRIMSLPAYPGPEIDS 1743
            LGL G+ FQSLYHFTVG  V DKRIMSLPAYPGP +DS
Sbjct: 562  LGLKGSNFQSLYHFTVGGPVLDKRIMSLPAYPGPNVDS 599


>ref|XP_004133924.1| PREDICTED: uncharacterized protein LOC101216535 [Cucumis sativus]
          Length = 645

 Score =  696 bits (1796), Expect = 0.0
 Identities = 337/578 (58%), Positives = 427/578 (73%), Gaps = 13/578 (2%)
 Frame = +1

Query: 49   YHNKTKHFFTKYARGPHGLDWKNQPNPFRRYTNAPTVDLLHCPIDNSDIP---------- 198
            YH++TKH F+ YARGPHGLDW NQPNPFRRY +AP + L H PI N              
Sbjct: 75   YHSQTKHGFSNYARGPHGLDWANQPNPFRRYISAPLLPLSHFPILNQTAASDDETHEASL 134

Query: 199  YPQVFKGIPPPKPLNKATISQLFYDSLALSAWKTTGISTWSLRVNPSSGNLHPTEGYILS 378
            Y  +F  +PPPKP+ KATISQ FYDSLALSAWK+TG STWSLRVNPSSGNLHPTE Y+++
Sbjct: 135  YDSLFVSLPPPKPVCKATISQFFYDSLALSAWKSTGFSTWSLRVNPSSGNLHPTEAYLIA 194

Query: 379  GPIDGVSDLPFLAHYSPKEHRLEVRAEIPSQIFGALVKGFPKGSFFVGLSSIFWRESWKY 558
             P+  +SD  F+AHY+PKEH LE+R +IP   F    K FP+ SF +GLSSIFWRE+WKY
Sbjct: 195  PPVTSLSDYGFVAHYAPKEHALEIRTQIPPGFFS---KFFPENSFLIGLSSIFWREAWKY 251

Query: 559  GERAFRYCNHDVGHAIGAISMAAAVLGWDVRVVDELGHDEVGQLLGLVGSNKMDFEIPEQ 738
            GERAFRYCNHDVGHAI A++MAAA LGWDV+V+D LG+ ++ +L+GL      +FEIP Q
Sbjct: 252  GERAFRYCNHDVGHAIAAVAMAAAGLGWDVKVLDGLGYADLKKLMGL--HTFPEFEIPSQ 309

Query: 739  AVRGYFPQLEKEHGDCLLVVFPSGSQGELNINPRDFASVASEFLGLEFRGRSNALSREHV 918
             V+G FP +E EH DC+L VFPSG+  + ++N  + +S   +F  L+++G+ N LS++H+
Sbjct: 310  PVKGSFPVIEFEHPDCVLAVFPSGT-ADFSMNYEELSSAVLKFSELDWKGKPNLLSKQHI 368

Query: 919  CWDIIYRTANATKKPLADQASVLKITPLPEGVAVSEGSYK-FGVREVIRKRRSAVDMDPG 1095
            CWDIIYRTA A +KPL  ++  L + P      + E  YK F  REV+RKRRSAVDMD  
Sbjct: 369  CWDIIYRTAMAVEKPLTGESGSL-VEPFQSSGVLGERPYKGFTWREVVRKRRSAVDMDGV 427

Query: 1096 VSIDRNTFYQILAKVLPSGIKEDQGEQSQI--PFRALPWEVNIHLMIFVNRVAGLKPGLY 1269
             ++ R+TFYQIL   +PSG  E + ++ ++  PFRALPW+  +H  +FV+RV GL  GLY
Sbjct: 428  TTMARDTFYQILLHCVPSGSIEGERQRRELALPFRALPWDAEVHAALFVHRVVGLPQGLY 487

Query: 1270 FLVRNKRHFDALRKATRSEFQWAIPEGCPAGLPLYLLASGDCKDLAMKLSCHQEIAGNGC 1449
            FLVRN+ HFD L+KAT  +F+W  P+GCP+ LPLY L  G+ + L+ +LSCHQ+IA +GC
Sbjct: 488  FLVRNEDHFDELKKATNPDFKWVKPDGCPSSLPLYELRRGNYQTLSKRLSCHQDIASDGC 547

Query: 1450 FSLGMLAQFQNSLSDGHAWMYPRLFWEAGLLGQMLYLEAHAVGISATGIGCYFDDPVHSV 1629
            FSLGM+A ++ +L +    MYPRLFWE G++GQ+LYLEAHAV ISATGIGC+FDDPVH  
Sbjct: 548  FSLGMIAHYEPTLREKGVHMYPRLFWETGVIGQVLYLEAHAVDISATGIGCFFDDPVHEA 607

Query: 1630 LGLSGNEFQSLYHFTVGAAVSDKRIMSLPAYPGPEIDS 1743
            LGL G+ FQSLYHFTVG  V DKRIMSLPAYPGP +DS
Sbjct: 608  LGLKGSNFQSLYHFTVGGPVLDKRIMSLPAYPGPNVDS 645


>ref|XP_002889341.1| hypothetical protein ARALYDRAFT_333455 [Arabidopsis lyrata subsp.
            lyrata] gi|297335183|gb|EFH65600.1| hypothetical protein
            ARALYDRAFT_333455 [Arabidopsis lyrata subsp. lyrata]
          Length = 872

 Score =  689 bits (1779), Expect = 0.0
 Identities = 344/585 (58%), Positives = 422/585 (72%), Gaps = 10/585 (1%)
 Frame = +1

Query: 7    SSQTHEDSTRIAVDYHNKTKHFFTKYARGPHGLDWKNQPNPFRRYTNAPTVDLLHC---P 177
            SS +   S  + ++YHN+TKH FT YARGP GLDW NQPNPFRRY +AP + L H     
Sbjct: 295  SSSSSSSSLELVLEYHNQTKHSFTGYARGPRGLDWANQPNPFRRYLSAPLLPLQHPNHDD 354

Query: 178  IDNSDIP-YPQVFKGIPPPKPLNKATISQLFYDSLALSAWKTTGISTWSLRVNPSSGNLH 354
             DN D P Y  +F  +PPPKP++ ATIS LFY SLALSAWKTTG STW LRVNPSSGNLH
Sbjct: 355  DDNDDSPLYSCLFDSLPPPKPISLATISHLFYHSLALSAWKTTGSSTWPLRVNPSSGNLH 414

Query: 355  PTEGYILSGPIDGVSDLPFLAHYSPKEHRLEVRAEIPSQIFGALVKGFPKGSFFVGLSSI 534
            PTE Y+++ PI  +S   F+AHY+PKEH LEVRA IPS  F       P+ SF +G+SSI
Sbjct: 415  PTEAYLIAPPIPSLSQSAFVAHYAPKEHSLEVRAHIPSSFF-------PENSFLIGISSI 467

Query: 535  FWRESWKYGERAFRYCNHDVGHAIGAISMAAAVLGWDVRVVDELGHDEVGQLLGLVGSNK 714
            FWRE+WKYGERAFRYCNHDVGHAI A+S+AAA LGWD++++D  G D++ +L+GL     
Sbjct: 468  FWREAWKYGERAFRYCNHDVGHAIAALSIAAAELGWDLKLLDGFGADDLKRLMGLP---- 523

Query: 715  MDFEIPEQAVRGYFPQLEKEHGDCLLVVFPSG-SQGELNINPRDFASVASEFLGLEFRGR 891
             +F+IP  + +G  P++E EH DCLL+VFP+G S+G+LN++    +S   +F  LE+ G 
Sbjct: 524  -EFQIPSSSGKGKLPEIEFEHPDCLLLVFPNGTSRGDLNLDYLGISSALRDFPSLEWNGN 582

Query: 892  SNALSREHVCWDIIYRTANATKKP-LADQASVLKITPLPEGVAVSEGSY-KFGVREVIRK 1065
             N LS+EH+CWDIIYRTA A +KP L    S     P       S  SY K   R+V+R 
Sbjct: 583  PNTLSKEHLCWDIIYRTAKAVEKPSLIYSTSSSFDAPFTSSALFSHTSYNKLTARQVVRT 642

Query: 1066 RRSAVDMDPGVSIDRNTFYQILAKVLPSGIK--EDQGEQSQIPFRALPWEV-NIHLMIFV 1236
            RRSAVDMD    ID + FYQIL   LPSG    E Q EQ  +PFRALPW+   +HL +FV
Sbjct: 643  RRSAVDMDAVTCIDMSAFYQILMHCLPSGSTRGEPQKEQLALPFRALPWDTAEVHLALFV 702

Query: 1237 NRVAGLKPGLYFLVRNKRHFDALRKATRSEFQWAIPEGCPAGLPLYLLASGDCKDLAMKL 1416
            +RV GL  GLYFLVRN+ H   L+ ATR EF+W  P+GCPA LPLY L  GDC+ LA  L
Sbjct: 703  HRVLGLPKGLYFLVRNEDHLSDLKTATRPEFEWKKPDGCPADLPLYKLTEGDCQKLAKGL 762

Query: 1417 SCHQEIAGNGCFSLGMLAQFQNSLSDGHAWMYPRLFWEAGLLGQMLYLEAHAVGISATGI 1596
            SCHQ+IAG+GCFSLGM+A+F+ +L +  +W+YPRLFWE G++GQ+LYLEAHA+GISATGI
Sbjct: 763  SCHQDIAGDGCFSLGMVARFEPALREKGSWVYPRLFWETGVIGQVLYLEAHAMGISATGI 822

Query: 1597 GCYFDDPVHSVLGLSGNEFQSLYHFTVGAAVSDKRIMSLPAYPGP 1731
            GCYFDDPVH VLG+  + FQSLYHFTVG  V DKRIM+LPAYPGP
Sbjct: 823  GCYFDDPVHEVLGIKDSSFQSLYHFTVGGPVVDKRIMTLPAYPGP 867


>ref|XP_004506526.1| PREDICTED: uncharacterized protein LOC101496891 [Cicer arietinum]
          Length = 642

 Score =  687 bits (1772), Expect = 0.0
 Identities = 339/578 (58%), Positives = 419/578 (72%), Gaps = 11/578 (1%)
 Frame = +1

Query: 43   VDYHNKTKHFFTKYARGPHGLDWKNQPNPFRRYTNAPTVDLLHCPIDNSDIP--YPQVFK 216
            + YHN+TKH F  YARGPHGLDW NQPNPFRRY ++P + LLH           Y  +F 
Sbjct: 72   IKYHNQTKHNFNNYARGPHGLDWANQPNPFRRYLSSPLLPLLHFTTQQQQQQPLYSSLFN 131

Query: 217  GIPPPKPLNKATISQLFYDSLALSAWKTTGISTWSLRVNPSSGNLHPTEGYILSGPIDGV 396
             +P PKP++K TISQ  YDSL+LSAWK+T  STWSLRVNPSSGNLHPTE YI++  I+ +
Sbjct: 132  SLPSPKPISKTTISQFLYDSLSLSAWKSTSFSTWSLRVNPSSGNLHPTEAYIIAPSIESI 191

Query: 397  SDLPFLAHYSPKEHRLEVRAEIPSQIFGALVKGFPKGSFFVGLSSIFWRESWKYGERAFR 576
            SD PF+AHY+PKEH LE+RA+IPS  F    K FP  SF VG SSIFWRESWKYGER FR
Sbjct: 192  SDSPFVAHYAPKEHSLELRAQIPSGFFP---KFFPPNSFLVGFSSIFWRESWKYGERGFR 248

Query: 577  YCNHDVGHAIGAISMAAAVLGWDVRVVDELGHDEVGQLLGLVGSNKMDFEIPEQAVRGYF 756
            YCNHDVGHAI A+SMAAA LGWDV+++D LG DE+  L+G+      +FE P  AV+G  
Sbjct: 249  YCNHDVGHAIAAVSMAAASLGWDVKLLDSLGFDELKFLMGV--HVFPEFETPSNAVKGKI 306

Query: 757  PQLEKEHGDCLLVVFPSGSQGELNINPRDFASVASEFLGLEFRGRSNALSREHVCWDIIY 936
            P++E EH DC+++VFPSG  G  +++ ++ +S    F  LE++G+ N+LS+EHVCWDIIY
Sbjct: 307  PEIEFEHPDCVMLVFPSGVSG-FDLDYKELSSDILLFSKLEWKGKPNSLSKEHVCWDIIY 365

Query: 937  RTANATKKPLADQASVLKITPLPEGVAVSEGS------YK-FGVREVIRKRRSAVDMDPG 1095
            +T+   KK L      L + P       SE        YK   VREV+RKRRSAVDMD  
Sbjct: 366  KTSEVVKKNLTLGDRFL-VDPFQRSGLCSENENDCESCYKGLTVREVVRKRRSAVDMDGV 424

Query: 1096 VSIDRNTFYQILAKVLPSGIK--EDQGEQSQIPFRALPWEVNIHLMIFVNRVAGLKPGLY 1269
              ++R+TFYQIL++ LPSG +  + Q  Q  +PFRALPW+  +H  +FV+RV GL  GLY
Sbjct: 425  TGMERDTFYQILSRCLPSGSENGKKQRRQLSLPFRALPWDAEVHAALFVHRVVGLPQGLY 484

Query: 1270 FLVRNKRHFDALRKATRSEFQWAIPEGCPAGLPLYLLASGDCKDLAMKLSCHQEIAGNGC 1449
            FLVRN+ HF  L+KA   +F W  PEGCP  LPLY L   DC+ LA +LSCHQ+IA +GC
Sbjct: 485  FLVRNESHFGELKKAMLPDFVWTKPEGCPDDLPLYELLRSDCRRLAKQLSCHQDIASDGC 544

Query: 1450 FSLGMLAQFQNSLSDGHAWMYPRLFWEAGLLGQMLYLEAHAVGISATGIGCYFDDPVHSV 1629
            FSLGMLA+ + +L + + WMYPRLFWE G+LGQ+LYLEAHAVGISATGIGC+FDDPVH +
Sbjct: 545  FSLGMLARMEPTLREKNVWMYPRLFWETGVLGQVLYLEAHAVGISATGIGCFFDDPVHQL 604

Query: 1630 LGLSGNEFQSLYHFTVGAAVSDKRIMSLPAYPGPEIDS 1743
            LGL G+ FQSLYHFT+GA V DKRIMSLPAYPGP+ D+
Sbjct: 605  LGLKGSTFQSLYHFTLGAPVEDKRIMSLPAYPGPDADA 642


>gb|EOY17058.1| Nitroreductase family protein isoform 2 [Theobroma cacao]
          Length = 599

 Score =  677 bits (1747), Expect = 0.0
 Identities = 324/533 (60%), Positives = 402/533 (75%), Gaps = 6/533 (1%)
 Frame = +1

Query: 40   AVDYHNKTKHFFTKYARGPHGLDWKNQPNPFRRYTNAPTVDLLHCPIDNSDIP-----YP 204
            A+ YH++TKH FT YARGP GLDW NQPNPFRRY +AP + LLH P +   I      Y 
Sbjct: 71   ALKYHHQTKHSFTNYARGPRGLDWANQPNPFRRYISAPLIPLLHFPAEKQAITDDAPLYS 130

Query: 205  QVFKGIPPPKPLNKATISQLFYDSLALSAWKTTGISTWSLRVNPSSGNLHPTEGYILSGP 384
             +F  +PPPKP++++TISQLFYDSLALSAWKTTG STWSLRVNPSSGNLHPTE Y++S P
Sbjct: 131  SLFHSLPPPKPISQSTISQLFYDSLALSAWKTTGYSTWSLRVNPSSGNLHPTEAYLISPP 190

Query: 385  IDGVSDLPFLAHYSPKEHRLEVRAEIPSQIFGALVKGFPKGSFFVGLSSIFWRESWKYGE 564
            I  +SD PF+AHY+PKEH LEVRA IPS  F    K FP+ SF +G+SSIFWRE+WKYGE
Sbjct: 191  IQSLSDSPFVAHYAPKEHSLEVRATIPSGFFP---KFFPENSFLIGISSIFWREAWKYGE 247

Query: 565  RAFRYCNHDVGHAIGAISMAAAVLGWDVRVVDELGHDEVGQLLGLVGSNKMDFEIPEQAV 744
            RAFRYCNHDVGHAIGA++MAAA LGWDV+++D  G+D++ +L+GL      +F++P + +
Sbjct: 248  RAFRYCNHDVGHAIGAVAMAAATLGWDVKLLDGFGYDDLQKLMGL--DIFPEFKVPSRPI 305

Query: 745  RGYFPQLEKEHGDCLLVVFPSGSQGELNINPRDFASVASEFLGLEFRGRSNALSREHVCW 924
            +G FP +E EH DCLL+VFP+GS  + ++N ++ +S   EFL LE++G+ N+LSREHVCW
Sbjct: 306  KGKFPDIEFEHPDCLLLVFPNGSN-QFHVNYKELSSAVKEFLNLEWKGKPNSLSREHVCW 364

Query: 925  DIIYRTANATKKPLADQASVLKITPLPEGVAVSEGSYK-FGVREVIRKRRSAVDMDPGVS 1101
            DIIYRTA A KKPL  Q+    +         SE SYK   VREV+RKRRSAVDMD    
Sbjct: 365  DIIYRTAEAVKKPLTVQSGEFPVDQFQSSGICSENSYKGLTVREVVRKRRSAVDMDGVTV 424

Query: 1102 IDRNTFYQILAKVLPSGIKEDQGEQSQIPFRALPWEVNIHLMIFVNRVAGLKPGLYFLVR 1281
            ++R TFYQIL   +PSG       Q  +PFRAL W+  +H  +FV+RV GL  GLYFLVR
Sbjct: 425  MERETFYQILLHCVPSGNGGKHRRQLALPFRALSWDAEVHAALFVHRVVGLPKGLYFLVR 484

Query: 1282 NKRHFDALRKATRSEFQWAIPEGCPAGLPLYLLASGDCKDLAMKLSCHQEIAGNGCFSLG 1461
            N+ H + L++ATR EF W  P GCP  LPLY LA+ +C++LA +LSCHQ+IA +GCFSLG
Sbjct: 485  NEDHLEELKRATRPEFNWEKPAGCPDDLPLYELATDNCQELAKRLSCHQDIASDGCFSLG 544

Query: 1462 MLAQFQNSLSDGHAWMYPRLFWEAGLLGQMLYLEAHAVGISATGIGCYFDDPV 1620
            M+A F+ +LSD  AWMYPRLFWE G+LGQ+LYLEAHAVGISATGIGC+FDDPV
Sbjct: 545  MVAHFEPALSDNGAWMYPRLFWETGVLGQVLYLEAHAVGISATGIGCFFDDPV 597


>ref|XP_006306995.1| hypothetical protein CARUB_v10008571mg [Capsella rubella]
            gi|482575706|gb|EOA39893.1| hypothetical protein
            CARUB_v10008571mg [Capsella rubella]
          Length = 630

 Score =  675 bits (1741), Expect = 0.0
 Identities = 332/581 (57%), Positives = 421/581 (72%), Gaps = 6/581 (1%)
 Frame = +1

Query: 7    SSQTHEDSTRIAVDYHNKTKHFFTKYARGPHGLDWKNQPNPFRRYTNAPTVDLLHCPIDN 186
            S      S ++ ++YHN+TKH F  YARGP GLDW NQPNPFRRY +AP + L H P D 
Sbjct: 55   SCSCSSSSLQLVLEYHNQTKHSFNGYARGPRGLDWANQPNPFRRYLSAPLLPLQH-PDDP 113

Query: 187  SDIPYPQVFKGIPPPKPLNKATISQLFYDSLALSAWKTTGISTWSLRVNPSSGNLHPTEG 366
                Y  +F   PPPKP++ +TIS LFY SLALSAWKTTG+STW LRVNPSSGNLHPTE 
Sbjct: 114  LQSSYASLFDSPPPPKPVSLSTISDLFYHSLALSAWKTTGVSTWPLRVNPSSGNLHPTEA 173

Query: 367  YILSGPIDGVSDLPFLAHYSPKEHRLEVRAE-IPSQIFGALVKGFPKGSFFVGLSSIFWR 543
            Y+++ PID +S+  F+AHY+P+EH LEVRA  IPS  F +    FP  SF +GLSSIFWR
Sbjct: 174  YLIAPPIDSLSESAFVAHYAPREHSLEVRAPTIPSSFFPSF---FPANSFLIGLSSIFWR 230

Query: 544  ESWKYGERAFRYCNHDVGHAIGAISMAAAVLGWDVRVVDELGHDEVGQLLGLVGSNKMDF 723
            E+WKYGERAFRYCNHDVGHAI ++S+AAA LGW ++++D  G D++ +L+GL      DF
Sbjct: 231  EAWKYGERAFRYCNHDVGHAIASLSIAAAELGWHLKLLDAFGADDLKRLMGLP-----DF 285

Query: 724  EIPEQAVRGYFPQLEKEHGDCLLVVFPSGSQGELNINPRDFASVASEFLGLEFRGRSNAL 903
              P    +G+ P++E EH DCLL+VFP G+ G+++++     S   +F  L++ G+ N L
Sbjct: 286  HFPSAKGKGHLPEIEFEHPDCLLLVFPYGT-GDIHLDYLGICSAIRDFPTLDWIGKPNVL 344

Query: 904  SREHVCWDIIYRTANATKKPLADQASV-LKITPLPEGVAVSEGSYK-FGVREVIRKRRSA 1077
            SREH+CWD+IY TA A +KP +  AS  +  T        S+GSY     R+V+RKRRSA
Sbjct: 345  SREHLCWDVIYTTAKAVEKPSSIPASSSIDDTSFRSSALFSQGSYNDLTARQVVRKRRSA 404

Query: 1078 VDMDPGVSIDRNTFYQILAKVLPSGIK--EDQGEQSQIPFRALPWEV-NIHLMIFVNRVA 1248
            VDMD    ID+++F+Q+L   LPSG    E Q EQ  +PFRALPW+   +HL +FV+RV+
Sbjct: 405  VDMDAVTFIDKSSFFQMLMHCLPSGSTRGEPQREQLALPFRALPWDTAEVHLALFVHRVS 464

Query: 1249 GLKPGLYFLVRNKRHFDALRKATRSEFQWAIPEGCPAGLPLYLLASGDCKDLAMKLSCHQ 1428
            GL  G YFLVRN+ H   L+ ATR EF+W  P+GCP  LPLY LA GDC+ LA  LSCHQ
Sbjct: 465  GLPKGFYFLVRNEDHLSDLKTATRPEFEWKKPDGCPDDLPLYKLAQGDCQKLAKGLSCHQ 524

Query: 1429 EIAGNGCFSLGMLAQFQNSLSDGHAWMYPRLFWEAGLLGQMLYLEAHAVGISATGIGCYF 1608
            +IAG+GCFSLGM+A+F+ +L +  +WMYPRLFWE G++GQ+LYLEAHA+GISATGIGCYF
Sbjct: 525  DIAGDGCFSLGMVARFEPALREKGSWMYPRLFWETGVVGQVLYLEAHAMGISATGIGCYF 584

Query: 1609 DDPVHSVLGLSGNEFQSLYHFTVGAAVSDKRIMSLPAYPGP 1731
            DDPVH +LG+  + FQSLYHFTVG  V DKRIM+LPAYPGP
Sbjct: 585  DDPVHEILGMKDSSFQSLYHFTVGGPVVDKRIMTLPAYPGP 625


>ref|NP_171704.2| nitroreductase family protein [Arabidopsis thaliana]
            gi|2317902|gb|AAC24366.1| hypothetical protein
            [Arabidopsis thaliana] gi|17979093|gb|AAL49814.1| unknown
            protein [Arabidopsis thaliana] gi|21689753|gb|AAM67520.1|
            unknown protein [Arabidopsis thaliana]
            gi|332189246|gb|AEE27367.1| nitroreductase family protein
            [Arabidopsis thaliana]
          Length = 642

 Score =  671 bits (1731), Expect = 0.0
 Identities = 335/579 (57%), Positives = 420/579 (72%), Gaps = 11/579 (1%)
 Frame = +1

Query: 28   STRIAVDYHNKTKHFFTKYARGPHGLDWKNQPNPFRRYTNAPTVDLLHCPID---NSDIP 198
            S  + + YHN+TKH    YARGP GLDW NQPNPFRRY +AP + L H   D   +SD P
Sbjct: 68   SLELVLKYHNQTKHSLNGYARGPRGLDWANQPNPFRRYLSAPLLPLQHPNHDIDDDSDSP 127

Query: 199  -YPQVFKGIPPPKPLNKATISQLFYDSLALSAWKTTGISTWSLRVNPSSGNLHPTEGYIL 375
             Y  +F  +PPPKP++  TIS LFY SLALSAWKTTG STW LRVNPSSGNLHPTE Y++
Sbjct: 128  LYSTLFDSLPPPKPISLPTISHLFYHSLALSAWKTTGSSTWPLRVNPSSGNLHPTEAYLI 187

Query: 376  SGPIDGVSDLPFLAHYSPKEHRLEVRAEIPSQIFGALVKGFPKGSFFVGLSSIFWRESWK 555
            + PI  +S   F++HY+PKEH LEVRA IPS  F      FP+ SF +G+SSIFWRE+WK
Sbjct: 188  APPIPSLSQSAFVSHYAPKEHSLEVRAHIPSSFFPNF---FPENSFLIGISSIFWREAWK 244

Query: 556  YGERAFRYCNHDVGHAIGAISMAAAVLGWDVRVVDELGHDEVGQLLGLVGSNKMDFEIPE 735
            YGERAFRYCNHDVGHAI A+S+AAA LGWD++++D  G D++ +L+GL      +F++PE
Sbjct: 245  YGERAFRYCNHDVGHAIAALSIAAADLGWDLKLLDAFGADDLKRLMGLP-----EFQLPE 299

Query: 736  QAVRGYFPQLEKEHGDCLLVVFPSGSQGE-LNINPRDFASVASEFLGLEFRGRSNALSRE 912
               +   P++E EH DCLL+VFP+G+  E LN++    +S   +F  LE+ G  N LS+E
Sbjct: 300  GKGKAELPEIEFEHPDCLLLVFPNGTSREHLNLDYLAISSALRDFPSLEWTGNPNTLSKE 359

Query: 913  HVCWDIIYRTANATKKP---LADQASVLKITPLPEGVAV-SEGSY-KFGVREVIRKRRSA 1077
            H+CWDIIYRTA A +KP    +  +S + +       A+ S  SY K  VR+V+R RRSA
Sbjct: 360  HLCWDIIYRTAKAVEKPPLIYSTSSSSIDVASFTSSRALFSHSSYNKLTVRQVVRTRRSA 419

Query: 1078 VDMDPGVSIDRNTFYQILAKVLPSGIKEDQGEQSQIPFRALPWEV-NIHLMIFVNRVAGL 1254
            VDMD    ID ++FYQ+L   LPS   E Q EQ  +PFRALPW+   +HL +FV+RV+GL
Sbjct: 420  VDMDAVTCIDMSSFYQMLMHCLPS-TGESQKEQLALPFRALPWDTAEVHLALFVHRVSGL 478

Query: 1255 KPGLYFLVRNKRHFDALRKATRSEFQWAIPEGCPAGLPLYLLASGDCKDLAMKLSCHQEI 1434
              GLY LVRN+ H   L+ ATR EF+W  P+GCP  LPLY LA GDC+ LA  LSCHQ+I
Sbjct: 479  PKGLYLLVRNEDHLSDLKTATRPEFEWTKPDGCPDNLPLYKLAEGDCQRLAKGLSCHQDI 538

Query: 1435 AGNGCFSLGMLAQFQNSLSDGHAWMYPRLFWEAGLLGQMLYLEAHAVGISATGIGCYFDD 1614
            AG+GCFSLGM+A+F+ +L +  +WMYPRLFWE G++GQ+LYLEAHA+GISATGIGCYFDD
Sbjct: 539  AGDGCFSLGMIARFEPALREKGSWMYPRLFWETGVVGQVLYLEAHAMGISATGIGCYFDD 598

Query: 1615 PVHSVLGLSGNEFQSLYHFTVGAAVSDKRIMSLPAYPGP 1731
            PVH VLG++ + FQSLYHFTVG  V DKRIM+LPAYPGP
Sbjct: 599  PVHEVLGINDSSFQSLYHFTVGGPVVDKRIMTLPAYPGP 637


>gb|EPS65206.1| hypothetical protein M569_09571, partial [Genlisea aurea]
          Length = 555

 Score =  663 bits (1710), Expect = 0.0
 Identities = 324/568 (57%), Positives = 411/568 (72%), Gaps = 9/568 (1%)
 Frame = +1

Query: 49   YHNKTKHFFTKYARGPHGLDWKNQPNPFRRYTNAPTVDLLHCPIDNSD-------IPYPQ 207
            YH +TKH FT YARGP GLDW NQP+PFRRY+ AP V LLH    + D         Y  
Sbjct: 7    YHRRTKHSFTDYARGPRGLDWANQPDPFRRYSPAPLVHLLHPSAADGDGDCCYPTPSYSS 66

Query: 208  VFKGIPPPKPLNKATISQLFYDSLALSAWKTTGISTWSLRVNPSSGNLHPTEGYILSGPI 387
            +F  +P PKP++++TIS+LF++SLALSAWKT+G STWSLRVNPSSGNLHPTE YI+S  +
Sbjct: 67   LFDSLPAPKPISRSTISELFHNSLALSAWKTSGFSTWSLRVNPSSGNLHPTEAYIVSPAV 126

Query: 388  DGVSDLPFLAHYSPKEHRLEVRAEIPSQIFGALVKGFPKGSFFVGLSSIFWRESWKYGER 567
            DG+S+  F+AHY+P+EH LEVRA IP+  F      FP GSF VG SSIFWRE+WKYGER
Sbjct: 127  DGLSEHAFVAHYAPEEHALEVRARIPTDFFPEC---FPDGSFLVGFSSIFWREAWKYGER 183

Query: 568  AFRYCNHDVGHAIGAISMAAAVLGWDVRVVDELGHDEVGQLLGLVGSNKMDFEIPEQAVR 747
            AFRYCNHDVGHAI ++SMAAA LGW VR++D LGH+++G+L+GL    +  F +P + V+
Sbjct: 184  AFRYCNHDVGHAIASVSMAAAALGWTVRILDGLGHEDLGKLMGLQPPVR-SFSMPTKPVK 242

Query: 748  GYFPQLEKEHGDCLLVVFPSGSQGELNINPRDFASVASEFLGLEFRGRSNALSREHVCWD 927
            G  P++E EH DC L+VFP G  G+  +   D+  + S+F  L++ G++NALS EH+CWD
Sbjct: 243  GKMPEIEFEHPDCALLVFP-GDSGDFEV---DYDRLRSKFSHLDWIGKANALSEEHICWD 298

Query: 928  IIYRTANATKKP--LADQASVLKITPLPEGVAVSEGSYKFGVREVIRKRRSAVDMDPGVS 1101
            IIYRTA A +KP   + + S++     P        S    + +V RKRRSAVDMD   S
Sbjct: 299  IIYRTAEAVQKPPTTSKKESIIINDNPPSNAIDDPSSLGLTLSQVARKRRSAVDMDGSTS 358

Query: 1102 IDRNTFYQILAKVLPSGIKEDQGEQSQIPFRALPWEVNIHLMIFVNRVAGLKPGLYFLVR 1281
            I R TFY+I+   LPS             F AL W  N+H ++FV+RVAGL  GLYFLVR
Sbjct: 359  ISRETFYRIMLHCLPSN-----------KFGALTWACNLHAVVFVHRVAGLPAGLYFLVR 407

Query: 1282 NKRHFDALRKATRSEFQWAIPEGCPAGLPLYLLASGDCKDLAMKLSCHQEIAGNGCFSLG 1461
            N  HF  L+ + RS+F+WA+P+GCP  LPL+ L+ GDC+DL+ +LSCHQ+IAG+GCFSLG
Sbjct: 408  NAAHFPELKSSMRSDFKWAVPDGCPDDLPLFELSRGDCRDLSKRLSCHQDIAGDGCFSLG 467

Query: 1462 MLAQFQNSLSDGHAWMYPRLFWEAGLLGQMLYLEAHAVGISATGIGCYFDDPVHSVLGLS 1641
            M+A+F+ +L D  AWMYPRLFWE+G+LGQ+LYLEAH VG+SATGIGC+FDDPVH V+G+ 
Sbjct: 468  MVARFEPTLRDVGAWMYPRLFWESGVLGQILYLEAHEVGVSATGIGCFFDDPVHEVMGVK 527

Query: 1642 GNEFQSLYHFTVGAAVSDKRIMSLPAYP 1725
            G  +QSLYHFTVG AV D RI SLPAYP
Sbjct: 528  GWAYQSLYHFTVGGAVVDTRITSLPAYP 555

