BLASTX nr result

ID: Mentha26_contig00041545 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00041545
         (1181 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU25506.1| hypothetical protein MIMGU_mgv11b017110mg, partia...   338   2e-90
ref|XP_006362061.1| PREDICTED: QWRF motif-containing protein 2-l...   290   1e-75
ref|XP_006362059.1| PREDICTED: QWRF motif-containing protein 2-l...   285   3e-74
ref|XP_004238098.1| PREDICTED: uncharacterized protein LOC101261...   281   3e-73
ref|XP_007042615.1| Family of Uncharacterized protein function (...   269   2e-69
ref|XP_007018531.1| Family of Uncharacterized protein function, ...   268   3e-69
ref|XP_007018530.1| Family of Uncharacterized protein function, ...   268   3e-69
ref|XP_002283295.1| PREDICTED: uncharacterized protein LOC100242...   268   4e-69
ref|XP_006393204.1| hypothetical protein EUTSA_v10011296mg [Eutr...   260   7e-67
ref|XP_002527498.1| conserved hypothetical protein [Ricinus comm...   259   1e-66
ref|XP_002298769.1| hypothetical protein POPTR_0001s30290g [Popu...   259   2e-66
ref|XP_007225113.1| hypothetical protein PRUPE_ppa002663mg [Prun...   258   3e-66
ref|XP_007042616.1| Family of Uncharacterized protein function, ...   258   3e-66
gb|EXB80036.1| hypothetical protein L484_003678 [Morus notabilis]     258   5e-66
ref|XP_004136940.1| PREDICTED: uncharacterized protein LOC101215...   257   8e-66
gb|AAG60167.1|AC074110_5 hypothetical protein [Arabidopsis thali...   256   1e-65
gb|AAG51768.1|AC079674_1 unknown protein; 38618-41990 [Arabidops...   256   1e-65
ref|XP_002891540.1| hypothetical protein ARALYDRAFT_891910 [Arab...   256   1e-65
ref|NP_564558.1| uncharacterized protein [Arabidopsis thaliana] ...   256   1e-65
ref|XP_006306956.1| hypothetical protein CARUB_v10008528mg [Caps...   256   2e-65

>gb|EYU25506.1| hypothetical protein MIMGU_mgv11b017110mg, partial [Mimulus guttatus]
          Length = 473

 Score =  338 bits (868), Expect = 2e-90
 Identities = 187/342 (54%), Positives = 234/342 (68%), Gaps = 19/342 (5%)
 Frame = +1

Query: 1    PSNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGVTPVR 180
            PS+AEKMLV S+RSLSVSFQG+SYS+           VGTP   RKGTPERRKAGVTP R
Sbjct: 113  PSSAEKMLVTSMRSLSVSFQGESYSIPVSKVKPPPAAVGTPSALRKGTPERRKAGVTPTR 172

Query: 181  DRTAK--ETPRPIEH---QQHLWPGRLRSENPSLLSRSLDYGTDRAKWNGSSPALTELRK 345
            DR  +  E  RP +H   QQH WPGRLR E+ S L+RSLDYG++R K +GS  AL E RK
Sbjct: 173  DRRERDIENSRPSDHHQLQQHRWPGRLRREDSSFLTRSLDYGSERVKCSGSGAALKEFRK 232

Query: 346  SVNAE---------LKLDNREIEDPAVSD-EGRSRLGGXXXXXXXXXXXXXXXXXXXXQL 495
            SV  E         LKL+  ++E   + + E R R G                     QL
Sbjct: 233  SVGEENSSDKVGNDLKLETNDVEVRGIGELENRPRSGSSLNLDVESTATT--------QL 284

Query: 496  RGGPRGVIVAARFYQDASNRVQKVLDPASPL----SNRTTGSPRIMVAKKFQNDNPISSP 663
            RGGPR V+V  R +Q+ +NRV KV DPASPL    SNRT G  ++++AKKFQND+P+SSP
Sbjct: 285  RGGPRAVVVPQRCWQE-TNRVNKVRDPASPLPISTSNRTIGPSKLVLAKKFQNDSPVSSP 343

Query: 664  REICPNRGLSPLRGGSRAASPSRALTSGSGALLRGMSSPSRARNGAGSVMNDSNMCSTPS 843
            RE+  +RG+SPLRGG RAASP +AL+S SG++ RGM+SP+RAR+G G+ MN++N CSTPS
Sbjct: 344  REVSSSRGISPLRGGVRAASPCKALSSSSGSVSRGMASPTRARSGVGNSMNENNTCSTPS 403

Query: 844  MISFPLELRRGKLGENRMADAHDLRMLYNRLLQWRLANAQVE 969
            ++ F ++ RRG  GEN+MADAH LR+LYNRLLQWRLANA+ +
Sbjct: 404  VLRFAVDARRGNSGENQMADAHVLRLLYNRLLQWRLANARAD 445


>ref|XP_006362061.1| PREDICTED: QWRF motif-containing protein 2-like isoform X3 [Solanum
            tuberosum]
          Length = 665

 Score =  290 bits (741), Expect = 1e-75
 Identities = 183/432 (42%), Positives = 240/432 (55%), Gaps = 40/432 (9%)
 Frame = +1

Query: 4    SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGV----- 168
            S A K L+ S RSLSVSFQGQS+S+          T    G  RKGTPERRK        
Sbjct: 126  SAAAKQLLNSSRSLSVSFQGQSFSIPVSKAKPPPAT-NNIGSVRKGTPERRKVTAEFVTP 184

Query: 169  ----------TPVRDRTAKETPRPIEHQ--------QHLWPGRLRSENPSLLSRSLDYGT 294
                      TP R + A E   P   +        QH WPGR +S N S L+RS+D G+
Sbjct: 185  ERKKATAEFFTPERSKVAAELSTPARDRTENVKTSDQHRWPGRSKSLNSSFLTRSMDCGS 244

Query: 295  -DRAKWNGSSPALTEL--------RKSVNAELK--LDNREIEDPAVSDEGRSR--LGGXX 435
             D+ K+   S   + +        R  + A LK   DN E++  +      S   L    
Sbjct: 245  IDKPKFGSGSVTSSSMKSVIDIYHRARIEARLKPQSDNGEVDMKSAYGSAMSADALASDS 304

Query: 436  XXXXXXXXXXXXXXXXXXQLRGGPRGVIVAARFYQDASNRVQKVLDPASPLSN---RTTG 606
                                +GGPRG++V ARF+Q+ +NR+++  +  S + N   +T  
Sbjct: 305  ESVSSGSTSGVHDGPTVTHGKGGPRGIVVPARFWQETNNRIRRGPELGSSMDNGNLKTVA 364

Query: 607  SPRIMVAKKFQNDNPISSPREICPNRGL-SPLRGGSRAASPSRALTSGSGALLRGMSSPS 783
            S + M  KKF  D+P +SPR +  +RGL SPLRGG R ASPS+ALT  +   LRGM SP+
Sbjct: 365  SSKQMGNKKFLIDSPRTSPRVVPASRGLVSPLRGGLRPASPSKALTPSANTPLRGMPSPT 424

Query: 784  RARNGAGSVMNDSNMCSTPSMISFPLELRRGKLGENRMADAHDLRMLYNRLLQWRLANAQ 963
            R +NG+  V   +N C  PS++SF  + RRGK+GENR+ DAH+LR+LYNR LQWR  NAQ
Sbjct: 425  RTKNGS-MVSISNNSCIMPSILSFAADARRGKVGENRIVDAHELRLLYNRNLQWRFVNAQ 483

Query: 964  VENTLLVQKHAAERNLYNAWVNTSKLRHSVRSKGIELQTLRLNLKLYSVLKEQELHLERW 1143
             E  L  Q   AER LYNAW+ T KLRHSV+SK I+LQ LR N+KL+S+LK Q   LE W
Sbjct: 484  AEAALRAQTTTAERTLYNAWLTTLKLRHSVKSKRIQLQLLRKNVKLHSILKGQRPCLENW 543

Query: 1144 GLLDRDYCNSIS 1179
             ++D D+CNS+S
Sbjct: 544  SMIDGDHCNSLS 555


>ref|XP_006362059.1| PREDICTED: QWRF motif-containing protein 2-like isoform X1 [Solanum
            tuberosum] gi|565392762|ref|XP_006362060.1| PREDICTED:
            QWRF motif-containing protein 2-like isoform X2 [Solanum
            tuberosum]
          Length = 677

 Score =  285 bits (729), Expect = 3e-74
 Identities = 183/444 (41%), Positives = 240/444 (54%), Gaps = 52/444 (11%)
 Frame = +1

Query: 4    SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGV----- 168
            S A K L+ S RSLSVSFQGQS+S+          T    G  RKGTPERRK        
Sbjct: 126  SAAAKQLLNSSRSLSVSFQGQSFSIPVSKAKPPPAT-NNIGSVRKGTPERRKVTAEFVTP 184

Query: 169  ----------------------TPVRDRTAKETPRPIEHQ--------QHLWPGRLRSEN 258
                                  TP R + A E   P   +        QH WPGR +S N
Sbjct: 185  ERRKVTAEFVTPERKKATAEFFTPERSKVAAELSTPARDRTENVKTSDQHRWPGRSKSLN 244

Query: 259  PSLLSRSLDYGT-DRAKWNGSSPALTEL--------RKSVNAELK--LDNREIEDPAVSD 405
             S L+RS+D G+ D+ K+   S   + +        R  + A LK   DN E++  +   
Sbjct: 245  SSFLTRSMDCGSIDKPKFGSGSVTSSSMKSVIDIYHRARIEARLKPQSDNGEVDMKSAYG 304

Query: 406  EGRSR--LGGXXXXXXXXXXXXXXXXXXXXQLRGGPRGVIVAARFYQDASNRVQKVLDPA 579
               S   L                        +GGPRG++V ARF+Q+ +NR+++  +  
Sbjct: 305  SAMSADALASDSESVSSGSTSGVHDGPTVTHGKGGPRGIVVPARFWQETNNRIRRGPELG 364

Query: 580  SPLSN---RTTGSPRIMVAKKFQNDNPISSPREICPNRGL-SPLRGGSRAASPSRALTSG 747
            S + N   +T  S + M  KKF  D+P +SPR +  +RGL SPLRGG R ASPS+ALT  
Sbjct: 365  SSMDNGNLKTVASSKQMGNKKFLIDSPRTSPRVVPASRGLVSPLRGGLRPASPSKALTPS 424

Query: 748  SGALLRGMSSPSRARNGAGSVMNDSNMCSTPSMISFPLELRRGKLGENRMADAHDLRMLY 927
            +   LRGM SP+R +NG+  V   +N C  PS++SF  + RRGK+GENR+ DAH+LR+LY
Sbjct: 425  ANTPLRGMPSPTRTKNGS-MVSISNNSCIMPSILSFAADARRGKVGENRIVDAHELRLLY 483

Query: 928  NRLLQWRLANAQVENTLLVQKHAAERNLYNAWVNTSKLRHSVRSKGIELQTLRLNLKLYS 1107
            NR LQWR  NAQ E  L  Q   AER LYNAW+ T KLRHSV+SK I+LQ LR N+KL+S
Sbjct: 484  NRNLQWRFVNAQAEAALRAQTTTAERTLYNAWLTTLKLRHSVKSKRIQLQLLRKNVKLHS 543

Query: 1108 VLKEQELHLERWGLLDRDYCNSIS 1179
            +LK Q   LE W ++D D+CNS+S
Sbjct: 544  ILKGQRPCLENWSMIDGDHCNSLS 567


>ref|XP_004238098.1| PREDICTED: uncharacterized protein LOC101261324 [Solanum
            lycopersicum]
          Length = 677

 Score =  281 bits (720), Expect = 3e-73
 Identities = 183/445 (41%), Positives = 240/445 (53%), Gaps = 53/445 (11%)
 Frame = +1

Query: 4    SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGV----- 168
            S A K L+ S RSLSVSFQGQS+S+          T    G  R+GTPERRK        
Sbjct: 126  SAAAKQLLNSSRSLSVSFQGQSFSIPVSKAKPPPAT-NNIGNVRRGTPERRKVTADFVTP 184

Query: 169  ----------------------TPVRDRTAKETPRPIEHQ--------QHLWPGRLRSEN 258
                                  TP R + A E   P   +        QH WPGR +S N
Sbjct: 185  ERRKVSANFVTPERKKATADFYTPERSKVAAELSTPARDRTENVKTSDQHRWPGRSKSLN 244

Query: 259  PSLLSRSLDYGT-DRAKWNGSSPALTEL--------RKSVNAELK--LDNREIEDPAVSD 405
             S L+RS+D G+ D+ K+   S   + +        R  + A+LK   DN E++  +   
Sbjct: 245  SSFLTRSMDCGSIDKPKFGSGSVTSSSMKSVIDIYHRARIEAKLKPQSDNDEVDMKSAYG 304

Query: 406  EGRS--RLGGXXXXXXXXXXXXXXXXXXXXQLRGGPRGVIVAARFYQDASNRVQKVLDPA 579
               S   L                        RGGPRG++V ARF+Q+ +NR+++  +  
Sbjct: 305  SAMSADTLASDSESVSSGSTSGVHDGPSVIHGRGGPRGIVVPARFWQETNNRIRRGPELG 364

Query: 580  SPLSN---RTTGSPRIMVAKKFQNDNPISSPREICPNRGL-SPLRGGSRAASPSRALTSG 747
            S + N   +T  S + M  KKF  D+P +S R +  +RGL SPLRGG R ASPS+ LT  
Sbjct: 365  SSMDNGNLKTVASSKQMGNKKFLTDSPRTSARVVPASRGLGSPLRGGLRPASPSKTLTPS 424

Query: 748  SGALLRGMSSPSRARNGA-GSVMNDSNMCSTPSMISFPLELRRGKLGENRMADAHDLRML 924
            +   LRGM SP+R +NG+ GS  N  N C  PS++SF  + RRGK+GENR+ DAH+LR+L
Sbjct: 425  ANTPLRGMPSPTRTKNGSMGSTSN--NSCIMPSILSFAADARRGKVGENRIVDAHELRLL 482

Query: 925  YNRLLQWRLANAQVENTLLVQKHAAERNLYNAWVNTSKLRHSVRSKGIELQTLRLNLKLY 1104
            YNR LQWR  NAQ E  L  Q   AER LYNAW+ T KLRHSV+SK I+LQ LR N+KL+
Sbjct: 483  YNRNLQWRFVNAQAEAALRAQTTTAERTLYNAWLTTLKLRHSVKSKRIQLQLLRKNVKLH 542

Query: 1105 SVLKEQELHLERWGLLDRDYCNSIS 1179
            S+LK Q   LE W ++D D+CNS+S
Sbjct: 543  SILKGQGPCLENWSMIDGDHCNSLS 567


>ref|XP_007042615.1| Family of Uncharacterized protein function (DUF566), putative isoform
            1 [Theobroma cacao] gi|508706550|gb|EOX98446.1| Family of
            Uncharacterized protein function (DUF566), putative
            isoform 1 [Theobroma cacao]
          Length = 684

 Score =  269 bits (688), Expect = 2e-69
 Identities = 174/428 (40%), Positives = 246/428 (57%), Gaps = 37/428 (8%)
 Frame = +1

Query: 4    SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGVTPVRD 183
            S A KML+ S RSLSVSFQG+++SL           VG+  ++RK TPERR+A  TPVRD
Sbjct: 161  SAATKMLITSTRSLSVSFQGEAFSLPISKTKAQ---VGS-AMTRKATPERRRA--TPVRD 214

Query: 184  RTAKETPRPIEHQQHLWPGRLRSENPSL--LSRSLDYGTDRAKWNGSSPALTELRKSV-- 351
                E  +P++  QH WPGR R  N     LSRSLDY ++R  +   +     L++S+  
Sbjct: 215  HG--ENSKPVD--QHRWPGRTRQGNSGTNPLSRSLDYSSERKMFGSGAIVAKSLQQSMML 270

Query: 352  -----------NAELKLD------------------NREIEDPAVSDEGRSRLGGXXXXX 444
                       ++ L LD                  N   E   VS +  +         
Sbjct: 271  DESSRRVSFDGSSRLSLDLGSSAELLKEATKQNSDANSINEASCVSCDLTASDTDSVSSG 330

Query: 445  XXXXXXXXXXXXXXXQLRGGPRGVIVAARFYQDASNRVQKVLDPASPLS----NRTTGSP 612
                           + R GPR ++V+ARF+Q+ ++R++++ DP SPLS    +R   S 
Sbjct: 331  STNSGMQECGGSGILKGRSGPRNIVVSARFWQETNSRLRRLQDPGSPLSTSPGSRIGASA 390

Query: 613  RIMVAKKFQNDNPISSPREICPNRGLSPLRGGSRAASPSRALTSGSGALLRGMSSPSRAR 792
            +   +K+F +D  +SSPR +      SP+RGG+R ASPS+  TS + + LRG+S P+R R
Sbjct: 391  KFSQSKRFSSDGVVSSPRTMA-----SPIRGGTRPASPSKLWTSATSSPLRGLS-PARVR 444

Query: 793  NGAGSVMNDSNMCSTPSMISFPLELRRGKLGENRMADAHDLRMLYNRLLQWRLANAQVEN 972
            N  G  M   N  +TPS++SF +++RRGK+GE+R+ DAH LR+LYNR LQWR ANA+ + 
Sbjct: 445  NAVGGQMM-GNSVNTPSILSFSVDIRRGKMGEDRIVDAHMLRLLYNRYLQWRFANARADA 503

Query: 973  TLLVQKHAAERNLYNAWVNTSKLRHSVRSKGIELQTLRLNLKLYSVLKEQELHLERWGLL 1152
            T ++QK +AE+NL+NAWV TS+LRHSV  K I+L  LR  LKL S+LK Q  +LE W LL
Sbjct: 504  TFMLQKLSAEKNLWNAWVTTSELRHSVTLKRIKLLLLRQKLKLTSILKGQIAYLEEWALL 563

Query: 1153 DRDYCNSI 1176
            DRD+ +S+
Sbjct: 564  DRDHSSSL 571


>ref|XP_007018531.1| Family of Uncharacterized protein function, putative isoform 2
            [Theobroma cacao] gi|508723859|gb|EOY15756.1| Family of
            Uncharacterized protein function, putative isoform 2
            [Theobroma cacao]
          Length = 609

 Score =  268 bits (686), Expect = 3e-69
 Identities = 167/407 (41%), Positives = 239/407 (58%), Gaps = 15/407 (3%)
 Frame = +1

Query: 4    SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGVTPVRD 183
            S  +K+L  S RSLSVSFQG+S+S              +P  +RKGTPERRK   T    
Sbjct: 135  SAVQKLLFTSTRSLSVSFQGESFSYQFSKAKPAP----SPSAARKGTPERRKP--TAATT 188

Query: 184  RTAKETPRPIEHQQHLWPGRLRSENPSLLSRSLDYGTDRA--KWNGSSPALTELRKSVNA 357
               + T +    +   WP RLR   P+ +SRS+D   +R   K +G+   +  L+ S+  
Sbjct: 189  TPGRATDQMENSKAERWPARLRQ--PNSMSRSMDCTDERKRLKGSGNGNVVRALQDSM-- 244

Query: 358  ELKLDNREIE-----------DPAVSDEGRSRLGGXXXXXXXXXXXXXXXXXXXXQLRGG 504
               +DNR++            DPAVSD      G                      ++ G
Sbjct: 245  ---IDNRDLTVVPAVGSEAQCDPAVSDTESVSSGSTSGALESSCNGNG-------DIKRG 294

Query: 505  PRGVIVAARFYQDASNRVQKVLDPASPLSNRTTGSPRIMVAKKFQNDNPISSPREICPNR 684
            PRG++V ARF+Q+ +NR+++  DP SP+S + T   +++  +KF  D+P+SSP+ +  +R
Sbjct: 295  PRGIVVPARFWQETNNRLRRS-DPGSPVSKKNTAQSKLIAPEKFGIDSPLSSPKSVVNSR 353

Query: 685  GLS-PLRGGSRAASPSRALTSGSGALLRGMSSPSRARNGAGSVMNDSNMCSTPSMISFPL 861
            G S P+RG  R ASPS+   S + + LRGMS PSR RNG GS     N+ +TPS++SF  
Sbjct: 354  GQSSPIRGPVRPASPSKLGVSSTSSPLRGMS-PSRVRNGLGS-----NLVNTPSILSFAG 407

Query: 862  EL-RRGKLGENRMADAHDLRMLYNRLLQWRLANAQVENTLLVQKHAAERNLYNAWVNTSK 1038
            ++ + GK+GEN+++DAH LR+L+NRLLQWR  NA+ +  L  Q+  AE++LYNAW+ TSK
Sbjct: 408  DVVKMGKIGENKVSDAHFLRLLHNRLLQWRFVNAREDAALSSQRSNAEKSLYNAWITTSK 467

Query: 1039 LRHSVRSKGIELQTLRLNLKLYSVLKEQELHLERWGLLDRDYCNSIS 1179
            LR SVR+K  ELQ LR NLKL S+LK Q + L+ W +LD DYC+S+S
Sbjct: 468  LRESVRTKRTELQLLRQNLKLMSILKGQMIVLDEWAILDHDYCSSLS 514


>ref|XP_007018530.1| Family of Uncharacterized protein function, putative isoform 1
            [Theobroma cacao] gi|508723858|gb|EOY15755.1| Family of
            Uncharacterized protein function, putative isoform 1
            [Theobroma cacao]
          Length = 615

 Score =  268 bits (686), Expect = 3e-69
 Identities = 167/407 (41%), Positives = 239/407 (58%), Gaps = 15/407 (3%)
 Frame = +1

Query: 4    SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGVTPVRD 183
            S  +K+L  S RSLSVSFQG+S+S              +P  +RKGTPERRK   T    
Sbjct: 135  SAVQKLLFTSTRSLSVSFQGESFSYQFSKAKPAP----SPSAARKGTPERRKP--TAATT 188

Query: 184  RTAKETPRPIEHQQHLWPGRLRSENPSLLSRSLDYGTDRA--KWNGSSPALTELRKSVNA 357
               + T +    +   WP RLR   P+ +SRS+D   +R   K +G+   +  L+ S+  
Sbjct: 189  TPGRATDQMENSKAERWPARLRQ--PNSMSRSMDCTDERKRLKGSGNGNVVRALQDSM-- 244

Query: 358  ELKLDNREIE-----------DPAVSDEGRSRLGGXXXXXXXXXXXXXXXXXXXXQLRGG 504
               +DNR++            DPAVSD      G                      ++ G
Sbjct: 245  ---IDNRDLTVVPAVGSEAQCDPAVSDTESVSSGSTSGALESSCNGNG-------DIKRG 294

Query: 505  PRGVIVAARFYQDASNRVQKVLDPASPLSNRTTGSPRIMVAKKFQNDNPISSPREICPNR 684
            PRG++V ARF+Q+ +NR+++  DP SP+S + T   +++  +KF  D+P+SSP+ +  +R
Sbjct: 295  PRGIVVPARFWQETNNRLRRS-DPGSPVSKKNTAQSKLIAPEKFGIDSPLSSPKSVVNSR 353

Query: 685  GLS-PLRGGSRAASPSRALTSGSGALLRGMSSPSRARNGAGSVMNDSNMCSTPSMISFPL 861
            G S P+RG  R ASPS+   S + + LRGMS PSR RNG GS     N+ +TPS++SF  
Sbjct: 354  GQSSPIRGPVRPASPSKLGVSSTSSPLRGMS-PSRVRNGLGS-----NLVNTPSILSFAG 407

Query: 862  EL-RRGKLGENRMADAHDLRMLYNRLLQWRLANAQVENTLLVQKHAAERNLYNAWVNTSK 1038
            ++ + GK+GEN+++DAH LR+L+NRLLQWR  NA+ +  L  Q+  AE++LYNAW+ TSK
Sbjct: 408  DVVKMGKIGENKVSDAHFLRLLHNRLLQWRFVNAREDAALSSQRSNAEKSLYNAWITTSK 467

Query: 1039 LRHSVRSKGIELQTLRLNLKLYSVLKEQELHLERWGLLDRDYCNSIS 1179
            LR SVR+K  ELQ LR NLKL S+LK Q + L+ W +LD DYC+S+S
Sbjct: 468  LRESVRTKRTELQLLRQNLKLMSILKGQMIVLDEWAILDHDYCSSLS 514


>ref|XP_002283295.1| PREDICTED: uncharacterized protein LOC100242050 [Vitis vinifera]
          Length = 743

 Score =  268 bits (684), Expect = 4e-69
 Identities = 179/427 (41%), Positives = 242/427 (56%), Gaps = 35/427 (8%)
 Frame = +1

Query: 4    SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGVTPVR- 180
            + A KML+ S RSLSVSFQG+S+SL          T   P   RKGTPERRK   TP R 
Sbjct: 229  TTASKMLITSARSLSVSFQGESFSLRVSK------TKPAPASVRKGTPERRKP--TPTRA 280

Query: 181  DRTAKETPRPIEHQQHLWPGRLRSENPSLLSRSLDYGTDRAKWNGSSPALTELRKSV--- 351
            D+T  E  +P++  QH WPGR R  N   L+RS+D   ++ K  GS      L++S+   
Sbjct: 281  DQT--ENSKPVD--QHRWPGRSRQVNS--LTRSMDCTDEKKKLGGSGIMARSLQQSMIDE 334

Query: 352  ---------------NAELKLDNREIE-----------DPAVSDEGRSRLGGXXXXXXXX 453
                           NAEL   N  +            DPA SD      G         
Sbjct: 335  RNRTPLDGRLNLDSGNAELGKANELVNANSVVGSTMTSDPAASDTESVSSGSTSGAQESG 394

Query: 454  XXXXXXXXXXXXQLRGGPRGVIVAARFYQDASNRVQKVLDPASPLSN----RTTG-SPRI 618
                        Q RG PRG++V ARF+Q+ SNR+++  +P+SP S     RT    P++
Sbjct: 395  GGGGGT------QGRGVPRGIMVPARFWQETSNRLRRTPEPSSPQSKSNGLRTPAVPPKL 448

Query: 619  MVAKKFQNDNPISSPREICPNRGLSPLRGGSRAASPSRALTSGSGALLRGMSSPSRARNG 798
            +  KK   D+P+SSPR I P+RG SPLRG  R ASPS+ +T+ + + LRGM SP+R R  
Sbjct: 449  IAPKKLLTDSPMSSPRGILPSRGQSPLRGPVRPASPSKLVTTSTYSPLRGMPSPTRVRAV 508

Query: 799  AGSVMNDSNMCSTPSMISFPLELRRGKLGENRMADAHDLRMLYNRLLQWRLANAQVENTL 978
             GS+  + N+ + PS++SF  ++RRGK+GENRM DAH LR+L+NR LQWR  NA+ + +L
Sbjct: 509  VGSL--NGNLSNNPSILSFAADVRRGKVGENRMVDAHLLRLLHNRYLQWRFINARADASL 566

Query: 979  LVQKHAAERNLYNAWVNTSKLRHSVRSKGIELQTLRLNLKLYSVLKEQELHLERWGLLDR 1158
            LVQ+  AE++L NA V    LR SVR K   LQ +R  LKL ++LK Q ++L+ WG +DR
Sbjct: 567  LVQRMNAEQSLCNARVAIVDLRDSVRDKRKMLQLMRQKLKLTTILKGQIMYLDEWGPMDR 626

Query: 1159 DYCNSIS 1179
            D+ NS+S
Sbjct: 627  DHSNSLS 633


>ref|XP_006393204.1| hypothetical protein EUTSA_v10011296mg [Eutrema salsugineum]
            gi|557089782|gb|ESQ30490.1| hypothetical protein
            EUTSA_v10011296mg [Eutrema salsugineum]
          Length = 661

 Score =  260 bits (665), Expect = 7e-67
 Identities = 171/432 (39%), Positives = 245/432 (56%), Gaps = 40/432 (9%)
 Frame = +1

Query: 4    SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGVTPVRD 183
            S A KML+ S RSLSVSFQG+++SL             TP   RK TPERR++  TPVRD
Sbjct: 133  SAATKMLITSTRSLSVSFQGEAFSLPISKKKEAT----TPVSHRKSTPERRRS--TPVRD 186

Query: 184  RTAKETPRPIEHQQHLWPGRLRSEN-----PSLLSRSLDYGTDRAKWNGSSPALTEL--- 339
            +  +E  +P++ Q+  WPG  R  N     P+ LSRSLD G+DR K        + L   
Sbjct: 187  Q--RENSKPVDQQR--WPGASRRGNSESVAPNPLSRSLDCGSDRGKLGSGYVGRSMLHSS 242

Query: 340  ------RKSVNAELKLDNREIEDPA-VSDEGRSRLGGXXXXXXXXXXXXXXXXXXXXQLR 498
                  R S+N  L LD    ++   + DE + R                          
Sbjct: 243  MIDESPRVSINGRLSLDMEGRDEYLEIGDESQRRPNNGLTSSVSCDFTASDTDSVSSGST 302

Query: 499  GG------------------PRGVIVAARFYQDASNRVQKVLDPASPLSNR-----TTGS 609
             G                  PR ++ +ARF+Q+ ++R++++ DP SPLS+      ++ S
Sbjct: 303  NGVQECGSGVNGDISKSKSLPRNIMASARFWQETNSRLRRLQDPGSPLSSSPGLKTSSVS 362

Query: 610  PRIMVAKKFQNDN-PISSPREICPNRGLSPLRGGS-RAASPSRALTSGSGALLRGMSSPS 783
             +  ++K+F +D  P+SSPR +      SP+RG + R+ASPS+   + + +  R +SSPS
Sbjct: 363  SKFGLSKRFSSDAAPLSSPRGMA-----SPVRGSAIRSASPSKLWATTTSSPARALSSPS 417

Query: 784  RARNGAGSVMNDSNMCSTPSMISFPLELRRGKLGENRMADAHDLRMLYNRLLQWRLANAQ 963
            RARNG    MN  N  +TPS++SF  ++RRGK+GE+R+ DAH +R+LYNR LQWR  NA+
Sbjct: 418  RARNGVSDQMNAYNRNNTPSILSFSADIRRGKIGEDRVMDAHLVRLLYNRYLQWRFVNAR 477

Query: 964  VENTLLVQKHAAERNLYNAWVNTSKLRHSVRSKGIELQTLRLNLKLYSVLKEQELHLERW 1143
             ++TL+VQ+  AE+NL+NAWV+ S+LRHSV  K I+L  LR  LKL S+L+ Q  +LE W
Sbjct: 478  ADSTLMVQRLNAEKNLWNAWVSISELRHSVTLKRIKLLLLRQKLKLASILRGQMGYLEEW 537

Query: 1144 GLLDRDYCNSIS 1179
             LLDRD+ NS+S
Sbjct: 538  SLLDRDHSNSLS 549


>ref|XP_002527498.1| conserved hypothetical protein [Ricinus communis]
            gi|223533138|gb|EEF34896.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 634

 Score =  259 bits (663), Expect = 1e-66
 Identities = 174/417 (41%), Positives = 239/417 (57%), Gaps = 26/417 (6%)
 Frame = +1

Query: 4    SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGVTPVRD 183
            S A +ML+ S RSLSVSFQG+++SL             +P V+RK TPERRK+  TPVRD
Sbjct: 126  SAATRMLITSTRSLSVSFQGEAFSLPISKAKAVS---SSPNVTRKVTPERRKS--TPVRD 180

Query: 184  RTAKETPRPIEHQQHLWPGRLRSENPSL------LSRSLD--YGTDRAKWNGS------- 318
            +   E  RP++  QH WPGR R  N +L      LSRS D   G D  +  GS       
Sbjct: 181  QG--ENSRPLD--QHRWPGRSRGGNLALNERNPSLSRSFDCSVGGDEKRVMGSGFMSVKS 236

Query: 319  ---SPALTELRKSV---NAELKLD-NREIEDPAVSDEGRSRLGGXXXXXXXXXXXXXXXX 477
               S  + E R S+   NA+   D N  + D  V+  G                      
Sbjct: 237  LQQSMIVDERRLSLDLGNAKRNPDVNSSVSDSFVT--GDLTASDSDSVSSGSTSGLQDFG 294

Query: 478  XXXXQLRGGPRGVIVAARFYQDASNRVQKVLDPASPLSN----RTTGSPRIMVAKKFQND 645
                + + GPRG+ V+ARF+Q+ ++R++++ DP SPLS     RT+ S + + +K+F +D
Sbjct: 295  SGISRAKTGPRGIAVSARFWQETNSRLRRLQDPGSPLSTSPNPRTSISSKTIQSKRFSSD 354

Query: 646  NPISSPREICPNRGLSPLRGGSRAASPSRALTSGSGALLRGMSSPSRARNGAGSVMNDSN 825
             P++SPR      G SP+RG +R ASPS+  T  + +  RG+SSPSR R  +      SN
Sbjct: 355  APVASPRTF----GSSPIRGATRPASPSKLWTHSASSPSRGISSPSRGRPMS------SN 404

Query: 826  MCSTPSMISFPLELRRGKLGENRMADAHDLRMLYNRLLQWRLANAQVENTLLVQKHAAER 1005
            + S PS++SF ++LRRGK+GE+R+ DAH LR+LYN  LQWR  NA+ + T  VQ+  AE+
Sbjct: 405  LSSMPSILSFAVDLRRGKMGEDRIGDAHMLRLLYNHYLQWRFVNARADATFFVQRVNAEK 464

Query: 1006 NLYNAWVNTSKLRHSVRSKGIELQTLRLNLKLYSVLKEQELHLERWGLLDRDYCNSI 1176
            NL+NAWV  S+LRHSV  K ++L  LR  LKL S+LK Q   LE W LLDRD+  S+
Sbjct: 465  NLWNAWVTISELRHSVTLKRVKLLLLRQKLKLTSILKGQITCLEEWSLLDRDHSTSL 521


>ref|XP_002298769.1| hypothetical protein POPTR_0001s30290g [Populus trichocarpa]
            gi|222846027|gb|EEE83574.1| hypothetical protein
            POPTR_0001s30290g [Populus trichocarpa]
          Length = 651

 Score =  259 bits (662), Expect = 2e-66
 Identities = 169/413 (40%), Positives = 234/413 (56%), Gaps = 22/413 (5%)
 Frame = +1

Query: 4    SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGVTPVRD 183
            S A KML+ S RSLSVSFQG+++SL          T     V+RK TPE+R+A  TPV D
Sbjct: 147  SAATKMLITSTRSLSVSFQGEAFSLPISKAKSV--TPPQNNVARKATPEKRRA--TPVGD 202

Query: 184  RTAKETPRPIEHQQHLWPGRLRS----ENPSLLSRSLDY------GTDRAKWNGSSPALT 333
            +   E  RP++H  H WPGR R     E   LLSRSLD       G D+         + 
Sbjct: 203  QG--ENSRPVDH--HRWPGRSREGNLKERNQLLSRSLDCSVVVGCGGDKRVVGSGLMGVK 258

Query: 334  ELRKSV------NAELKLDNREIEDPAVSDEGRSRLGGXXXXXXXXXXXXXXXXXXXX-- 489
             L++S+         L L N   ++P       S   G                      
Sbjct: 259  SLQQSMMVGEGRRLSLDLGNIAKQNPDTISVNESSYTGDLTASDSDSVSSGSTSGVTEIG 318

Query: 490  QLRGGPRGVIVAARFYQDASNRVQKVLDPASPLS----NRTTGSPRIMVAKKFQNDNPIS 657
            + + G RG+ V+ARF+Q+ ++R++++ DP SPLS    +R   SP+ + +K+F +D P++
Sbjct: 319  KWKTGARGIAVSARFWQETNSRMRRLQDPGSPLSTSPGSRMGVSPKAIQSKRFSSDGPLA 378

Query: 658  SPREICPNRGLSPLRGGSRAASPSRALTSGSGALLRGMSSPSRARNGAGSVMNDSNMCST 837
            SPR +      SP+RG +R ASP +  TS   +  RGMSSPSR R  + S         T
Sbjct: 379  SPRMMAA----SPIRGATRPASPGKLWTSSFSSPSRGMSSPSRVRPMSSS---------T 425

Query: 838  PSMISFPLELRRGKLGENRMADAHDLRMLYNRLLQWRLANAQVENTLLVQKHAAERNLYN 1017
            PS++SF ++LRRGK+GE+R+ DAH LR+LYNR LQWR  NA+ + T +VQ+ +AE+NL+N
Sbjct: 426  PSILSFSVDLRRGKMGEDRIVDAHMLRLLYNRYLQWRFVNARADATFMVQRLSAEKNLWN 485

Query: 1018 AWVNTSKLRHSVRSKGIELQTLRLNLKLYSVLKEQELHLERWGLLDRDYCNSI 1176
            AWV  S+LRHSV  + I+L  LR  LKL S+LK Q  HLE W LLDRD+ +S+
Sbjct: 486  AWVTISELRHSVTLRRIKLILLRQKLKLTSILKRQIAHLEEWSLLDRDHSSSL 538


>ref|XP_007225113.1| hypothetical protein PRUPE_ppa002663mg [Prunus persica]
            gi|462422049|gb|EMJ26312.1| hypothetical protein
            PRUPE_ppa002663mg [Prunus persica]
          Length = 647

 Score =  258 bits (660), Expect = 3e-66
 Identities = 175/414 (42%), Positives = 242/414 (58%), Gaps = 22/414 (5%)
 Frame = +1

Query: 4    SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVS-RKGTPERRKAGVTPVR 180
            S A+K+L  S RSLSVSFQG+SYSL             TP  S RKGTPERRKA  TP R
Sbjct: 128  SAAQKLLFTSTRSLSVSFQGESYSLQVSKVKP------TPSPSTRKGTPERRKA-TTPFR 180

Query: 181  DRTAKETPRPIEHQQHLWPGRLRSENPSLLSRSLDYGTDRAKWNGSSPALTELRKS---- 348
                 E  +P E Q+  WP RLR   P+ ++RSLD   +R + +GS   +    ++    
Sbjct: 181  -ADQSENSKPTEQQR--WPARLRQ--PNCMTRSLDCTDERRRMSGSGANVVRALQNSMVD 235

Query: 349  -VNAELKLDN---REIEDPAVSDEGRSRL---------GGXXXXXXXXXXXXXXXXXXXX 489
             V+  L+ ++     ++     D+G S                                 
Sbjct: 236  DVDGRLRSNSCNLGSVKATETVDDGTSATTQSEPVACSDTDSVSSGSTNSGPHESNGHGG 295

Query: 490  QLRGG-PRGVIVAARFYQDASNRVQKVLDP-ASPLSNRTTGSPRIMVAKKFQNDNPISSP 663
             L+G  PRG++V ARF+Q+ +NR+++  +  A     RT GSP+I  A +   D+P SSP
Sbjct: 296  ALQGPRPRGIVVPARFWQETNNRLRRQSESKAIGAGARTMGSPKIAEANRLSIDSPTSSP 355

Query: 664  REICPNRG-LSPLRGGSRAASPSRALTS-GSGALLRGMSSPSRARNGAGSVMNDSNMCST 837
            R +  +R  LSP+RG +R ASPS+   S  + + +RG+S PSR RNG  +  + SN+ +T
Sbjct: 356  RGVANSRAQLSPIRGTARPASPSKLSRSLMTSSPMRGVS-PSRVRNGVAATPS-SNLSNT 413

Query: 838  PSMISFPLELRRGKLGENRMADAHDLRMLYNRLLQWRLANAQVENTLLVQKHAAERNLYN 1017
            PS++SF  ++RRGK+GENR+ DAH +R+L+NRLLQWR  NA+   +L  Q+  AER+LYN
Sbjct: 414  PSILSFAADVRRGKVGENRIVDAHVVRLLHNRLLQWRFVNARANASLAAQRSNAERSLYN 473

Query: 1018 AWVNTSKLRHSVRSKGIELQTLRLNLKLYSVLKEQELHLERWGLLDRDYCNSIS 1179
            AWV +SKLR SVR+K IELQ LR NLKL S+LK Q ++LE   L+DRDY NS+S
Sbjct: 474  AWVTSSKLRESVRAKRIELQMLRQNLKLTSILKGQMIYLEELSLMDRDYSNSLS 527


>ref|XP_007042616.1| Family of Uncharacterized protein function, putative isoform 2
            [Theobroma cacao] gi|508706551|gb|EOX98447.1| Family of
            Uncharacterized protein function, putative isoform 2
            [Theobroma cacao]
          Length = 571

 Score =  258 bits (659), Expect = 3e-66
 Identities = 169/419 (40%), Positives = 238/419 (56%), Gaps = 37/419 (8%)
 Frame = +1

Query: 4    SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGVTPVRD 183
            S A KML+ S RSLSVSFQG+++SL           VG+  ++RK TPERR+A  TPVRD
Sbjct: 161  SAATKMLITSTRSLSVSFQGEAFSLPISKTKAQ---VGS-AMTRKATPERRRA--TPVRD 214

Query: 184  RTAKETPRPIEHQQHLWPGRLRSENPSL--LSRSLDYGTDRAKWNGSSPALTELRKSV-- 351
                E  +P++  QH WPGR R  N     LSRSLDY ++R  +   +     L++S+  
Sbjct: 215  HG--ENSKPVD--QHRWPGRTRQGNSGTNPLSRSLDYSSERKMFGSGAIVAKSLQQSMML 270

Query: 352  -----------NAELKLD------------------NREIEDPAVSDEGRSRLGGXXXXX 444
                       ++ L LD                  N   E   VS +  +         
Sbjct: 271  DESSRRVSFDGSSRLSLDLGSSAELLKEATKQNSDANSINEASCVSCDLTASDTDSVSSG 330

Query: 445  XXXXXXXXXXXXXXXQLRGGPRGVIVAARFYQDASNRVQKVLDPASPLS----NRTTGSP 612
                           + R GPR ++V+ARF+Q+ ++R++++ DP SPLS    +R   S 
Sbjct: 331  STNSGMQECGGSGILKGRSGPRNIVVSARFWQETNSRLRRLQDPGSPLSTSPGSRIGASA 390

Query: 613  RIMVAKKFQNDNPISSPREICPNRGLSPLRGGSRAASPSRALTSGSGALLRGMSSPSRAR 792
            +   +K+F +D  +SSPR +      SP+RGG+R ASPS+  TS + + LRG+S P+R R
Sbjct: 391  KFSQSKRFSSDGVVSSPRTMA-----SPIRGGTRPASPSKLWTSATSSPLRGLS-PARVR 444

Query: 793  NGAGSVMNDSNMCSTPSMISFPLELRRGKLGENRMADAHDLRMLYNRLLQWRLANAQVEN 972
            N  G  M   N  +TPS++SF +++RRGK+GE+R+ DAH LR+LYNR LQWR ANA+ + 
Sbjct: 445  NAVGGQMM-GNSVNTPSILSFSVDIRRGKMGEDRIVDAHMLRLLYNRYLQWRFANARADA 503

Query: 973  TLLVQKHAAERNLYNAWVNTSKLRHSVRSKGIELQTLRLNLKLYSVLKEQELHLERWGL 1149
            T ++QK +AE+NL+NAWV TS+LRHSV  K I+L  LR  LKL S+LK Q  +LE W L
Sbjct: 504  TFMLQKLSAEKNLWNAWVTTSELRHSVTLKRIKLLLLRQKLKLTSILKGQIAYLEEWAL 562


>gb|EXB80036.1| hypothetical protein L484_003678 [Morus notabilis]
          Length = 670

 Score =  258 bits (658), Expect = 5e-66
 Identities = 177/434 (40%), Positives = 243/434 (55%), Gaps = 43/434 (9%)
 Frame = +1

Query: 4    SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGVTPVR- 180
            S A K+LV S RSLSVSFQG+++SL             TP  +RK TPERR+   TP+R 
Sbjct: 139  SAATKLLVTSTRSLSVSFQGEAFSLPISKTKPT-----TPSGARKATPERRRT--TPLRG 191

Query: 181  -DRTAKETPRPIEHQQHLWPGRLRSENPS------LLSRSLDYGT--DRAKWNG------ 315
             +R   E  +P +  QH WP R R  N +      LLSRS+D+G   D  K NG      
Sbjct: 192  GERDQLENSKPGD--QHRWPARTRQGNSNSSNSNPLLSRSVDFGAGGDGRKLNGFRSGTV 249

Query: 316  ----SSPALTELRKSV----------NAEL-KLDNREIEDPAVSDEGRSRLGGXXXXXXX 450
                    L E R+S           +AEL K+++   E  A SD   S           
Sbjct: 250  VRALQQSLLDETRRSSFDGRLSLDLGSAELLKVNSSNNESSAPSDLTASDTDSVSSGSTS 309

Query: 451  XXXXXXXXXXXXXQLRGGPRGVIVAARFYQDASNRVQKVLDPASPLS----NRTTGSPRI 618
                            G PRG++V+ARF+Q+ ++R++++ DP SPLS    +R     + 
Sbjct: 310  GMQDANGVSKART---GTPRGIVVSARFWQETNSRLRRLQDPGSPLSTSPGSRMGAPAKF 366

Query: 619  MVAKKFQND-NPISSPREICPNRGLSPLRGGSRAASPSRALTSGSG-------ALLRGMS 774
            + +K++  D NP+SSPR +      SP+RG +R ASPS+  TS S        +  RG++
Sbjct: 367  VQSKRYSGDINPLSSPRTMA-----SPIRGANRPASPSKLWTSSSMPSPSRGMSPSRGIA 421

Query: 775  SPSRARNGAGSVMNDSNMCSTPSMISFPLELRRGKLGENRMADAHDLRMLYNRLLQWRLA 954
            SPSR RNG    MN S   +TPS++SF +++RRGK+GE+R+ DAH LR+LYNR LQWR  
Sbjct: 422  SPSRVRNGVAGSMNGSYGGNTPSILSFSVDIRRGKMGEDRIVDAHMLRLLYNRYLQWRFV 481

Query: 955  NAQVENTLLVQKHAAERNLYNAWVNTSKLRHSVRSKGIELQTLRLNLKLYSVLKEQELHL 1134
            NA+ + T +VQK  AE+NL+NAWV  S+LRHSV  K I+L  LR  LKL S++K Q  +L
Sbjct: 482  NARADATFMVQKLNAEKNLWNAWVTISELRHSVTLKRIKLLLLRQKLKLTSIIKGQITYL 541

Query: 1135 ERWGLLDRDYCNSI 1176
            E W LLDRD+ +S+
Sbjct: 542  EDWALLDRDHSSSL 555


>ref|XP_004136940.1| PREDICTED: uncharacterized protein LOC101215899 [Cucumis sativus]
          Length = 667

 Score =  257 bits (656), Expect = 8e-66
 Identities = 173/427 (40%), Positives = 240/427 (56%), Gaps = 36/427 (8%)
 Frame = +1

Query: 4    SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVS--RKG-TPERRKAGVTP 174
            S A K+LV S RSLSVSFQG+++SL             TP +S  RKG TPERR+A  TP
Sbjct: 141  SAAAKLLVTSTRSLSVSFQGEAFSLPISKTK----ATATPSLSNARKGSTPERRRA--TP 194

Query: 175  VRDRTAKETPRPIEHQ----QHLWPGRLRSEN--PSLLSRSLDYGTDRAKWNGSSPALT- 333
            +RD++     + +E+     QH WP R R  N   + LSRS D G ++ K NG    +  
Sbjct: 195  LRDKSDGSGVQ-VENSKLLDQHRWPARNRHANLEGNPLSRSFDCGGEQKKVNGIGSGMVV 253

Query: 334  ----------ELRKSVNAELKLDNREIE-------DPAVSDEGRSRLGGXXXXXXXXXXX 462
                        R S +  L LD    E       +P       S +             
Sbjct: 254  RALQQTISDDSRRASFDGRLSLDLNSSELIKAVRQNPDADSVNESSVPSDLTTSDTDSVS 313

Query: 463  XXXXXXXXX-----QLRGGPRGVIVAARFYQDASNRVQKVLDPASPLSNRT---TGSP-R 615
                          + R GPRG++V+ARF+Q+ ++R++++ DP SPLS       G+P +
Sbjct: 314  SGSTSGVQDCGSVAKGRNGPRGIVVSARFWQETNSRLRRLHDPGSPLSTSPGARVGAPSK 373

Query: 616  IMVAKKFQNDNPISSPREICPNRGLSPLRGGSRAASPSRALTSGSGALLRGMSSPSRARN 795
               +K+F ND P+SSPR +      SP+RGG+R  SPS+  TS   +  RG+SSPSR RN
Sbjct: 374  FSQSKRFSNDGPLSSPRTMA-----SPIRGGTRPPSPSKLWTSSVSSPSRGISSPSRTRN 428

Query: 796  GAGSVMNDSNMCSTPSMISFPLELRRGKLGENRMADAHDLRMLYNRLLQWRLANAQVENT 975
            G G  +  SN  STPS++SF +++RRGK+GE+R+ DAH LR+ +NR LQWR  NA+ + T
Sbjct: 429  GVGGSLV-SNSISTPSILSFSVDIRRGKMGEDRIVDAHVLRLHHNRYLQWRFVNARADAT 487

Query: 976  LLVQKHAAERNLYNAWVNTSKLRHSVRSKGIELQTLRLNLKLYSVLKEQELHLERWGLLD 1155
             ++Q+  AERN++NAWV  S+LRH+V  K I+L  LR  LKL SVLK Q  +LE W LLD
Sbjct: 488  FMLQRLNAERNVWNAWVTISELRHTVTLKRIKLLLLRQKLKLTSVLKGQISYLEEWALLD 547

Query: 1156 RDYCNSI 1176
            RD+ +S+
Sbjct: 548  RDHSSSM 554


>gb|AAG60167.1|AC074110_5 hypothetical protein [Arabidopsis thaliana]
          Length = 722

 Score =  256 bits (655), Expect = 1e-65
 Identities = 172/432 (39%), Positives = 244/432 (56%), Gaps = 40/432 (9%)
 Frame = +1

Query: 4    SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGVTPVRD 183
            S A KML+ S RSLSVSFQG+++SL          T  TP   RK TPERR++  TPVRD
Sbjct: 130  SAATKMLITSTRSLSVSFQGEAFSLPISKKKE---TTSTPVSHRKSTPERRRS--TPVRD 184

Query: 184  RTAKETPRPIEHQQHLWPGRLRSEN-----PSLLSRSLDYGTDRAKWNGSSPALTEL--- 339
            +  +E  +P++ Q+  WPG  R  N     P+ LSRSLD G+DR K        + L   
Sbjct: 185  Q--RENSKPVDQQR--WPGASRRGNSESVVPNSLSRSLDCGSDRGKLGSGFVGRSMLHNS 240

Query: 340  ------RKSVNAELKLD-NREIEDPAVSDEGRSRLGGXXXXXXXXXXXXXXXXXXXXQLR 498
                  R SVN  L LD     E   + D+ + R                          
Sbjct: 241  MIDESPRVSVNGRLSLDLGGRDEYLDIGDDIQRRPNNGLTSSVSCDFTASDTDSVSSGST 300

Query: 499  GG------------------PRGVIVAARFYQDASNRVQKVLDPASPLSNR-----TTGS 609
             G                  PR ++ +ARF+Q+ ++R++++ DP SPLS+      ++ S
Sbjct: 301  NGVQECGSGVNGEISKSKSLPRNIMASARFWQETNSRLRRLQDPGSPLSSSPGLKTSSIS 360

Query: 610  PRIMVAKKFQNDN-PISSPREICPNRGLSPLRGGS-RAASPSRALTSGSGALLRGMSSPS 783
             +  ++K+F +D  P+SSPR +      SP+RG + R+ASPS+   + + +  R +SSPS
Sbjct: 361  SKFGLSKRFSSDAVPLSSPRGMA-----SPVRGSAIRSASPSKLWATTTSSPARALSSPS 415

Query: 784  RARNGAGSVMNDSNMCSTPSMISFPLELRRGKLGENRMADAHDLRMLYNRLLQWRLANAQ 963
            RARNG    MN  N  +TPS++SF  ++RRGK+GE+R+ DAH LR+LYNR LQWR  NA+
Sbjct: 416  RARNGVSDQMNAYNRNNTPSILSFSADIRRGKIGEDRVMDAHLLRLLYNRDLQWRFVNAR 475

Query: 964  VENTLLVQKHAAERNLYNAWVNTSKLRHSVRSKGIELQTLRLNLKLYSVLKEQELHLERW 1143
             ++T++VQ+  AE+NL+NAWV+ S+LRHSV  K I+L  LR  LKL S+L+ Q   LE W
Sbjct: 476  ADSTVMVQRLNAEKNLWNAWVSISELRHSVTLKRIKLLLLRQKLKLASILRGQMGFLEEW 535

Query: 1144 GLLDRDYCNSIS 1179
             LLDRD+ +S+S
Sbjct: 536  SLLDRDHSSSLS 547


>gb|AAG51768.1|AC079674_1 unknown protein; 38618-41990 [Arabidopsis thaliana]
          Length = 718

 Score =  256 bits (655), Expect = 1e-65
 Identities = 172/432 (39%), Positives = 244/432 (56%), Gaps = 40/432 (9%)
 Frame = +1

Query: 4    SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGVTPVRD 183
            S A KML+ S RSLSVSFQG+++SL          T  TP   RK TPERR++  TPVRD
Sbjct: 130  SAATKMLITSTRSLSVSFQGEAFSLPISKKKE---TTSTPVSHRKSTPERRRS--TPVRD 184

Query: 184  RTAKETPRPIEHQQHLWPGRLRSEN-----PSLLSRSLDYGTDRAKWNGSSPALTEL--- 339
            +  +E  +P++ Q+  WPG  R  N     P+ LSRSLD G+DR K        + L   
Sbjct: 185  Q--RENSKPVDQQR--WPGASRRGNSESVVPNSLSRSLDCGSDRGKLGSGFVGRSMLHNS 240

Query: 340  ------RKSVNAELKLD-NREIEDPAVSDEGRSRLGGXXXXXXXXXXXXXXXXXXXXQLR 498
                  R SVN  L LD     E   + D+ + R                          
Sbjct: 241  MIDESPRVSVNGRLSLDLGGRDEYLDIGDDIQRRPNNGLTSSVSCDFTASDTDSVSSGST 300

Query: 499  GG------------------PRGVIVAARFYQDASNRVQKVLDPASPLSNR-----TTGS 609
             G                  PR ++ +ARF+Q+ ++R++++ DP SPLS+      ++ S
Sbjct: 301  NGVQECGSGVNGEISKSKSLPRNIMASARFWQETNSRLRRLQDPGSPLSSSPGLKTSSIS 360

Query: 610  PRIMVAKKFQNDN-PISSPREICPNRGLSPLRGGS-RAASPSRALTSGSGALLRGMSSPS 783
             +  ++K+F +D  P+SSPR +      SP+RG + R+ASPS+   + + +  R +SSPS
Sbjct: 361  SKFGLSKRFSSDAVPLSSPRGMA-----SPVRGSAIRSASPSKLWATTTSSPARALSSPS 415

Query: 784  RARNGAGSVMNDSNMCSTPSMISFPLELRRGKLGENRMADAHDLRMLYNRLLQWRLANAQ 963
            RARNG    MN  N  +TPS++SF  ++RRGK+GE+R+ DAH LR+LYNR LQWR  NA+
Sbjct: 416  RARNGVSDQMNAYNRNNTPSILSFSADIRRGKIGEDRVMDAHLLRLLYNRDLQWRFVNAR 475

Query: 964  VENTLLVQKHAAERNLYNAWVNTSKLRHSVRSKGIELQTLRLNLKLYSVLKEQELHLERW 1143
             ++T++VQ+  AE+NL+NAWV+ S+LRHSV  K I+L  LR  LKL S+L+ Q   LE W
Sbjct: 476  ADSTVMVQRLNAEKNLWNAWVSISELRHSVTLKRIKLLLLRQKLKLASILRGQMGFLEEW 535

Query: 1144 GLLDRDYCNSIS 1179
             LLDRD+ +S+S
Sbjct: 536  SLLDRDHSSSLS 547


>ref|XP_002891540.1| hypothetical protein ARALYDRAFT_891910 [Arabidopsis lyrata subsp.
            lyrata] gi|297337382|gb|EFH67799.1| hypothetical protein
            ARALYDRAFT_891910 [Arabidopsis lyrata subsp. lyrata]
          Length = 660

 Score =  256 bits (655), Expect = 1e-65
 Identities = 170/432 (39%), Positives = 242/432 (56%), Gaps = 40/432 (9%)
 Frame = +1

Query: 4    SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGVTPVRD 183
            S A KML+ S RSLSVSFQG+++SL             TP   RK TPERR++  TPVRD
Sbjct: 131  SAATKMLITSTRSLSVSFQGEAFSLPISKKKE---ATTTPVSHRKSTPERRRS--TPVRD 185

Query: 184  RTAKETPRPIEHQQHLWPGRLRSEN-----PSLLSRSLDYGTDRAKWNGSSPALTEL--- 339
            +  +E  +P++ Q+  WPG  R  N     P+ LSRSLD G+DR K        + L   
Sbjct: 186  Q--RENSKPVDQQR--WPGASRRGNSESVVPNSLSRSLDCGSDRGKLGSGFVGRSMLHNS 241

Query: 340  ------RKSVNAELKLD-NREIEDPAVSDEGRSRLGGXXXXXXXXXXXXXXXXXXXXQLR 498
                  R S+N  L LD     E   + DE + R                          
Sbjct: 242  MIDESPRVSINGRLSLDLGGRDEYLEIGDESQRRPNNGLTSSVSCDFTASDTDSVSSGST 301

Query: 499  GG------------------PRGVIVAARFYQDASNRVQKVLDPASPLSNR-----TTGS 609
             G                  PR ++ +ARF+Q+ ++R++++ DP SPLS+      ++ S
Sbjct: 302  NGVQECGSGVNGEISKSKSLPRNIMASARFWQETNSRLRRLQDPGSPLSSSPGLKTSSVS 361

Query: 610  PRIMVAKKFQNDN-PISSPREICPNRGLSPLRGGS-RAASPSRALTSGSGALLRGMSSPS 783
             +  ++K+F +D  P+SSPR +      SP+RG + R+ASPS+   + + +  R +SSPS
Sbjct: 362  SKFGLSKRFSSDAVPLSSPRGMA-----SPVRGSAIRSASPSKLWATTTSSPARALSSPS 416

Query: 784  RARNGAGSVMNDSNMCSTPSMISFPLELRRGKLGENRMADAHDLRMLYNRLLQWRLANAQ 963
            R RNG    MN  N  +TPS++SF  ++RRGK+GE+R+ DAH LR+LYNR LQWR  NA+
Sbjct: 417  RVRNGVSDQMNAYNRNNTPSILSFSADIRRGKIGEDRVMDAHLLRLLYNRYLQWRFVNAR 476

Query: 964  VENTLLVQKHAAERNLYNAWVNTSKLRHSVRSKGIELQTLRLNLKLYSVLKEQELHLERW 1143
             ++T++VQ+  AE+NL+NAWV+ S+LRHSV  K I+L  LR  LKL S+L+ Q   LE W
Sbjct: 477  ADSTVMVQRLNAEKNLWNAWVSISELRHSVTLKRIKLLLLRQKLKLASILRGQMGFLEEW 536

Query: 1144 GLLDRDYCNSIS 1179
             LLDRD+ +S+S
Sbjct: 537  SLLDRDHSSSLS 548


>ref|NP_564558.1| uncharacterized protein [Arabidopsis thaliana]
            gi|75164975|sp|Q94AI1.1|QWRF2_ARATH RecName: Full=QWRF
            motif-containing protein 2 gi|15028145|gb|AAK76696.1|
            unknown protein [Arabidopsis thaliana]
            gi|24030506|gb|AAN41399.1| unknown protein [Arabidopsis
            thaliana] gi|332194367|gb|AEE32488.1| uncharacterized
            protein AT1G49890 [Arabidopsis thaliana]
          Length = 659

 Score =  256 bits (655), Expect = 1e-65
 Identities = 172/432 (39%), Positives = 244/432 (56%), Gaps = 40/432 (9%)
 Frame = +1

Query: 4    SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGVTPVRD 183
            S A KML+ S RSLSVSFQG+++SL          T  TP   RK TPERR++  TPVRD
Sbjct: 130  SAATKMLITSTRSLSVSFQGEAFSLPISKKKE---TTSTPVSHRKSTPERRRS--TPVRD 184

Query: 184  RTAKETPRPIEHQQHLWPGRLRSEN-----PSLLSRSLDYGTDRAKWNGSSPALTEL--- 339
            +  +E  +P++ Q+  WPG  R  N     P+ LSRSLD G+DR K        + L   
Sbjct: 185  Q--RENSKPVDQQR--WPGASRRGNSESVVPNSLSRSLDCGSDRGKLGSGFVGRSMLHNS 240

Query: 340  ------RKSVNAELKLD-NREIEDPAVSDEGRSRLGGXXXXXXXXXXXXXXXXXXXXQLR 498
                  R SVN  L LD     E   + D+ + R                          
Sbjct: 241  MIDESPRVSVNGRLSLDLGGRDEYLDIGDDIQRRPNNGLTSSVSCDFTASDTDSVSSGST 300

Query: 499  GG------------------PRGVIVAARFYQDASNRVQKVLDPASPLSNR-----TTGS 609
             G                  PR ++ +ARF+Q+ ++R++++ DP SPLS+      ++ S
Sbjct: 301  NGVQECGSGVNGEISKSKSLPRNIMASARFWQETNSRLRRLQDPGSPLSSSPGLKTSSIS 360

Query: 610  PRIMVAKKFQNDN-PISSPREICPNRGLSPLRGGS-RAASPSRALTSGSGALLRGMSSPS 783
             +  ++K+F +D  P+SSPR +      SP+RG + R+ASPS+   + + +  R +SSPS
Sbjct: 361  SKFGLSKRFSSDAVPLSSPRGMA-----SPVRGSAIRSASPSKLWATTTSSPARALSSPS 415

Query: 784  RARNGAGSVMNDSNMCSTPSMISFPLELRRGKLGENRMADAHDLRMLYNRLLQWRLANAQ 963
            RARNG    MN  N  +TPS++SF  ++RRGK+GE+R+ DAH LR+LYNR LQWR  NA+
Sbjct: 416  RARNGVSDQMNAYNRNNTPSILSFSADIRRGKIGEDRVMDAHLLRLLYNRDLQWRFVNAR 475

Query: 964  VENTLLVQKHAAERNLYNAWVNTSKLRHSVRSKGIELQTLRLNLKLYSVLKEQELHLERW 1143
             ++T++VQ+  AE+NL+NAWV+ S+LRHSV  K I+L  LR  LKL S+L+ Q   LE W
Sbjct: 476  ADSTVMVQRLNAEKNLWNAWVSISELRHSVTLKRIKLLLLRQKLKLASILRGQMGFLEEW 535

Query: 1144 GLLDRDYCNSIS 1179
             LLDRD+ +S+S
Sbjct: 536  SLLDRDHSSSLS 547


>ref|XP_006306956.1| hypothetical protein CARUB_v10008528mg [Capsella rubella]
            gi|482575667|gb|EOA39854.1| hypothetical protein
            CARUB_v10008528mg [Capsella rubella]
          Length = 659

 Score =  256 bits (653), Expect = 2e-65
 Identities = 170/432 (39%), Positives = 242/432 (56%), Gaps = 40/432 (9%)
 Frame = +1

Query: 4    SNAEKMLVKSVRSLSVSFQGQSYSLXXXXXXXXXXTVGTPGVSRKGTPERRKAGVTPVRD 183
            S A KML+ S RSLSVSFQG+++SL          T  TP   RK TPERR++  TPVRD
Sbjct: 130  SAATKMLITSTRSLSVSFQGEAFSLPISKKKE---TTSTPVSHRKSTPERRRS--TPVRD 184

Query: 184  RTAKETPRPIEHQQHLWPGRLRSEN-----PSLLSRSLDYGTDRAKWNGSSPALTEL--- 339
            +  +E  +P++ Q+  WPG  R  N     P+ LSRSLD G+DR K        + L   
Sbjct: 185  Q--RENSKPVDQQR--WPGASRRGNSESVVPNSLSRSLDCGSDRGKLGSGFAGRSMLHNS 240

Query: 340  ------RKSVNAELKLD-NREIEDPAVSDEGRSRLGGXXXXXXXXXXXXXXXXXXXXQLR 498
                  R SVN  L LD     E   + D+ + R                          
Sbjct: 241  MIDESPRVSVNGRLSLDLGGRDEYLEIGDDSQRRPSNGLTSSVSCDFTASDTDSVSSGST 300

Query: 499  GG------------------PRGVIVAARFYQDASNRVQKVLDPASPLSNR-----TTGS 609
             G                  PR ++ +ARF+Q+ ++R++++ DP SPLS+      ++ S
Sbjct: 301  NGVQECGSGVNGEISKSKSLPRNIMASARFWQETNSRLRRLQDPGSPLSSSPGLKTSSIS 360

Query: 610  PRIMVAKKFQNDN-PISSPREICPNRGLSPLRGGS-RAASPSRALTSGSGALLRGMSSPS 783
             +  ++K+F +D  P SSPR +      SP+RG + R+ASPS+   + + +  R +SSPS
Sbjct: 361  SKFGLSKRFSSDAVPSSSPRGMA-----SPVRGSAIRSASPSKLWATTTSSPARALSSPS 415

Query: 784  RARNGAGSVMNDSNMCSTPSMISFPLELRRGKLGENRMADAHDLRMLYNRLLQWRLANAQ 963
            R RNG    MN  N  +TPS++SF  ++RRGK+GE+R+ DAH LR+LYNR LQWR  NA+
Sbjct: 416  RVRNGVSDQMNAYNRNNTPSILSFSADIRRGKIGEDRVMDAHLLRLLYNRYLQWRFVNAR 475

Query: 964  VENTLLVQKHAAERNLYNAWVNTSKLRHSVRSKGIELQTLRLNLKLYSVLKEQELHLERW 1143
             ++T++VQ+  AE+NL+NAWV+ S+LRHSV  K I+L  LR  LKL S+L+ Q   LE W
Sbjct: 476  ADSTVMVQRLNAEKNLWNAWVSISELRHSVTLKRIKLLLLRQKLKLASILRGQMGFLEEW 535

Query: 1144 GLLDRDYCNSIS 1179
             LLD+D+ +S+S
Sbjct: 536  SLLDKDHSSSLS 547

