BLASTX nr result

ID: Rehmannia23_contig00010992 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia23_contig00010992
         (1402 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006350037.1| PREDICTED: uncharacterized protein LOC102601...   199   2e-48
ref|XP_006350032.1| PREDICTED: uncharacterized protein LOC102601...   198   4e-48
ref|XP_004243407.1| PREDICTED: uncharacterized protein LOC101260...   197   9e-48
ref|XP_004251799.1| PREDICTED: uncharacterized protein LOC101256...   189   2e-45
gb|EXB78097.1| hypothetical protein L484_004798 [Morus notabilis]     179   2e-42
ref|XP_002525120.1| hypothetical protein RCOM_0553590 [Ricinus c...   172   2e-40
ref|XP_002279986.1| PREDICTED: uncharacterized protein LOC100250...   166   2e-38
gb|EOX92817.1| Uncharacterized protein isoform 1 [Theobroma caca...   165   5e-38
ref|XP_006446765.1| hypothetical protein CICLE_v10015391mg [Citr...   160   9e-37
ref|XP_006469032.1| PREDICTED: uncharacterized protein LOC102611...   159   3e-36
ref|XP_004303170.1| PREDICTED: uncharacterized protein LOC101303...   159   3e-36
ref|XP_006348849.1| PREDICTED: uncharacterized protein LOC102594...   148   5e-33
gb|EOX92819.1| Uncharacterized protein isoform 3, partial [Theob...   147   1e-32
ref|XP_002881158.1| hypothetical protein ARALYDRAFT_482041 [Arab...   144   1e-31
gb|EOX92820.1| Uncharacterized protein isoform 4, partial [Theob...   139   3e-30
ref|XP_006294244.1| hypothetical protein CARUB_v10023243mg, part...   139   4e-30
ref|XP_004149372.1| PREDICTED: uncharacterized protein LOC101205...   138   5e-30
ref|NP_565715.1| uncharacterized protein [Arabidopsis thaliana] ...   136   2e-29
ref|XP_006410238.1| hypothetical protein EUTSA_v10016698mg [Eutr...   135   3e-29
gb|EOX92821.1| Uncharacterized protein isoform 5 [Theobroma caca...   133   2e-28

>ref|XP_006350037.1| PREDICTED: uncharacterized protein LOC102601397 isoform X6 [Solanum
            tuberosum]
          Length = 420

 Score =  199 bits (506), Expect = 2e-48
 Identities = 162/483 (33%), Positives = 239/483 (49%), Gaps = 39/483 (8%)
 Frame = -3

Query: 1361 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1182
            MD KGI W G+IYQKFE MCLE+E+ MY+DT +YVENQVQ VG SVK+FYS+V+ DL P 
Sbjct: 1    MDLKGIAWVGDIYQKFEAMCLEMEDAMYQDTARYVENQVQTVGASVKRFYSDVVLDLHPQ 60

Query: 1181 SCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSLRVH 1002
              +DPVKVA  DLSLNPYAHT+++K                               L+  
Sbjct: 61   FNIDPVKVAAADLSLNPYAHTEISK------------------------------KLKAK 90

Query: 1001 NDANHLSSLSPRGLVENTHSDVCFTKSKKVGVYKR-PIGIKRISQNNH-PSKISRPMTSL 828
                H   ++ + L+++T   V   KSK  GVY+R  +GIK I ++NH PSK S  +  +
Sbjct: 91   LKGGHPMVIN-KELIDDT--QVIKGKSKSGGVYRRQSVGIKEIVRDNHPPSKKSDALCLV 147

Query: 827  SGDKSRLL----------VASDDMNVTTSVRCHPG--------EAVNNI--TPISEA--C 714
            SG+  +L           VASD M +T+ +    G        E  N+I  T +S A   
Sbjct: 148  SGNAIKLSSDSKVRGGFEVASDHMTMTSPLASVKGRSSAETGKEVSNHIIKTDVSAAGIS 207

Query: 713  VESLASDKILIAESVKEEKDDSECASHASDNILLAESVKQDKEDCECASRASDKKLSAES 534
            +   ASD+ L  + V + + D    S   D     +S   D+  C+  +  +  K+S+  
Sbjct: 208  INVAASDRSLSVDCVGQNQADLRNTSSVGD----LQSDSHDRGTCKELAGDTGLKISS-- 261

Query: 533  LRQKKDDSQCTSADHGLSTKPIDEFRQGGSFSQINPNIVSNTWDIESIGEDVFPCYEDNF 354
                       + D+ ++++ I+   +          I SNT D    GE++    ++  
Sbjct: 262  ----------NTGDNNIASEEINNIAK----------ISSNTGDNNITGEEINESCKERS 301

Query: 353  D---------MEVIENE-EAVEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYKKKIRE 204
            D          ++IE++ E VE  + SKLE+TC+LV+ +KLH V Q + K KSYKKK+R+
Sbjct: 302  DKSCSPPPEKYDLIESDVEIVEHYDESKLEETCVLVEAEKLH-VPQESVKQKSYKKKLRQ 360

Query: 203  ALSSKLRSTRKQDPCVSHCKDLGGQNNGGV----TTIPALEMDSDKRNL-PVHDSFESDW 39
              S K +STRK+         L G     +      +  L  +S+ + L    D  ES+W
Sbjct: 361  VFSMKKKSTRKE---YEQLGALHGDQQPNLEPEEKPMQVLSKNSNMKKLSSADDHSESEW 417

Query: 38   EIL 30
            E+L
Sbjct: 418  ELL 420


>ref|XP_006350032.1| PREDICTED: uncharacterized protein LOC102601397 isoform X1 [Solanum
            tuberosum] gi|565366720|ref|XP_006350033.1| PREDICTED:
            uncharacterized protein LOC102601397 isoform X2 [Solanum
            tuberosum] gi|565366722|ref|XP_006350034.1| PREDICTED:
            uncharacterized protein LOC102601397 isoform X3 [Solanum
            tuberosum] gi|565366724|ref|XP_006350035.1| PREDICTED:
            uncharacterized protein LOC102601397 isoform X4 [Solanum
            tuberosum] gi|565366726|ref|XP_006350036.1| PREDICTED:
            uncharacterized protein LOC102601397 isoform X5 [Solanum
            tuberosum]
          Length = 421

 Score =  198 bits (504), Expect = 4e-48
 Identities = 162/483 (33%), Positives = 238/483 (49%), Gaps = 39/483 (8%)
 Frame = -3

Query: 1361 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1182
            MD KGI W G+IYQKFE MCLE+E+ MY+DT +YVENQVQ VG SVK+FYS+V+ DL P 
Sbjct: 1    MDLKGIAWVGDIYQKFEAMCLEMEDAMYQDTARYVENQVQTVGASVKRFYSDVVLDLHPQ 60

Query: 1181 SCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSLRVH 1002
              +DPVKVA  DLSLNPYAHT+++K                               L+  
Sbjct: 61   FNIDPVKVAAADLSLNPYAHTEISK------------------------------KLKAK 90

Query: 1001 NDANHLSSLSPRGLVENTHSDVCFTKSKKVGVYKR-PIGIKRISQNNH-PSKISRPMTSL 828
                H   ++ + L+++T   V   KSK  GVY+R  +GIK I ++NH PSK S  +  +
Sbjct: 91   LKGGHPMVIN-KELIDDT--QVIKGKSKSGGVYRRQSVGIKEIVRDNHPPSKKSDALCLV 147

Query: 827  SGDKSRLL----------VASDDMNVTTSVRCHPG--------EAVNNI--TPISEA--C 714
            SG+  +L           VASD M +T+ +    G        E  N+I  T +S A   
Sbjct: 148  SGNAIKLSSDSKVRGGFEVASDHMTMTSPLASVKGRSSAETGKEVSNHIIKTDVSAAGIS 207

Query: 713  VESLASDKILIAESVKEEKDDSECASHASDNILLAESVKQDKEDCECASRASDKKLSAES 534
            +   ASD+ L  + V + + D    S   D   L      D+  C+  +  +  K+S+  
Sbjct: 208  INVAASDRSLSVDCVGQNQADLRNTSSVGD---LQSDSHADRGTCKELAGDTGLKISS-- 262

Query: 533  LRQKKDDSQCTSADHGLSTKPIDEFRQGGSFSQINPNIVSNTWDIESIGEDVFPCYEDNF 354
                       + D+ ++++ I+   +          I SNT D    GE++    ++  
Sbjct: 263  ----------NTGDNNIASEEINNIAK----------ISSNTGDNNITGEEINESCKERS 302

Query: 353  D---------MEVIENE-EAVEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYKKKIRE 204
            D          ++IE++ E VE  + SKLE+TC+LV+ +KLH V Q + K KSYKKK+R+
Sbjct: 303  DKSCSPPPEKYDLIESDVEIVEHYDESKLEETCVLVEAEKLH-VPQESVKQKSYKKKLRQ 361

Query: 203  ALSSKLRSTRKQDPCVSHCKDLGGQNNGGV----TTIPALEMDSDKRNL-PVHDSFESDW 39
              S K +STRK+         L G     +      +  L  +S+ + L    D  ES+W
Sbjct: 362  VFSMKKKSTRKE---YEQLGALHGDQQPNLEPEEKPMQVLSKNSNMKKLSSADDHSESEW 418

Query: 38   EIL 30
            E+L
Sbjct: 419  ELL 421


>ref|XP_004243407.1| PREDICTED: uncharacterized protein LOC101260247 [Solanum
            lycopersicum]
          Length = 374

 Score =  197 bits (501), Expect = 9e-48
 Identities = 164/463 (35%), Positives = 227/463 (49%), Gaps = 19/463 (4%)
 Frame = -3

Query: 1361 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1182
            MD K I+W GNIYQKFETMCLE+EE MY+DTVKYVENQ+  VG +VK+F SEVMQD+ P 
Sbjct: 1    MDLKSISWVGNIYQKFETMCLEMEEAMYQDTVKYVENQMNTVGTNVKRFCSEVMQDVHPQ 60

Query: 1181 SCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSLRVH 1002
              +DPVKVA  DLSLNPYAH +++K     L                      K S R  
Sbjct: 61   CNIDPVKVAAADLSLNPYAHYEIDKKLKANL----------------------KGSAR-- 96

Query: 1001 NDANHLSSLSPRGLVENTHSDVCFTKSKKVGVYKRP-IGIKRISQNNHPSKISRPMTSLS 825
              +N L          N  + V   KSK  GVYKR  +GIK I +++H +K    +   S
Sbjct: 97   GFSNKL----------NDDTQVIKGKSKSGGVYKRQNVGIKEIVRDSHLTKKPNAICLAS 146

Query: 824  GDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDKILIAESVKEEK--DD 651
            GD  +L         ++S     G                LASD + +  ++   K  D 
Sbjct: 147  GDALKL---------SSSAEVRGG--------------FELASDHVTLTSALASVKGSDS 183

Query: 650  SECASHASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDS-----QC-----T 501
             E AS  S++++          D    S AS   +S ES+ +K+ D+      C     T
Sbjct: 184  GEVASKVSNHVI---QTNVSTADTSITSEAS-VMMSVESVGKKQTDTCTKELACNTRFKT 239

Query: 500  SAD--HGLSTKPIDEFRQGGSFSQINPNIVSNTWDIESIGEDVFPCYEDNFDMEVIENEE 327
            S+D  + L+ + IDE     S  + + N++S    IES             D+E+     
Sbjct: 240  SSDVRNNLANEEIDE-----SHEEKSDNLLSKYDSIES-------------DLEI----- 276

Query: 326  AVEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYKKKIREALSSKLRSTRKQDPCVSHC 147
             VE  +  +L +TC+LV+ D++H V QG  K KSYKKK+R+A S+K R TRK+       
Sbjct: 277  -VEKFDEFQLNETCVLVEEDRIH-VPQGPVKQKSYKKKLRDAFSTKKRLTRKE---YEQL 331

Query: 146  KDLGGQNNGGVTT----IPALEMDSDKRNLPVHDSFESDWEIL 30
              L G     V +    +P L M+S+ + L  +D  ES+WEIL
Sbjct: 332  GALYGDQQIKVESEDKVMPVLAMNSNTKMLSANDHPESEWEIL 374


>ref|XP_004251799.1| PREDICTED: uncharacterized protein LOC101256948 [Solanum
            lycopersicum]
          Length = 421

 Score =  189 bits (481), Expect = 2e-45
 Identities = 158/483 (32%), Positives = 234/483 (48%), Gaps = 39/483 (8%)
 Frame = -3

Query: 1361 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1182
            MD KGI W G+IYQKFE MCLE+E+ MY+DT +YVENQVQ VG SVK+FYS+V+ DL P 
Sbjct: 1    MDLKGIAWVGDIYQKFEAMCLEMEDAMYQDTARYVENQVQTVGASVKRFYSDVVLDLHPQ 60

Query: 1181 SCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSLRVH 1002
              +DPVKVA  DLSLNPYAHT+++K     L                     +    RV 
Sbjct: 61   FNIDPVKVAAADLSLNPYAHTEISKKLKAQL---------------------KGGHPRVI 99

Query: 1001 NDANHLSSLSPRGLVENTHSDVCFTKSKKVGVYKR-PIGIKRISQNNH-PSKISRPMTSL 828
            N          + L+++T   V   KSK  GVY+R  +G+K I ++NH PSK S  +  +
Sbjct: 100  N----------KELIDDT--QVIKGKSKSGGVYRRQSVGMKEIVRDNHPPSKKSDALCLV 147

Query: 827  SGDKSRLL----------VASDDMNVTTSVRCHPG--------EAVNNI----TPISEAC 714
            SG+  +L           VASD M +T+ +    G        E  N+I     P +   
Sbjct: 148  SGNTIKLSSDSKVRGGFEVASDHMTMTSPLASVKGLKSTETGKEVSNHIIKTEVPAAGIS 207

Query: 713  VESLASDKILIAESVKEEKDDSECASHASDNILLAESVKQDKEDCECASRASDKKLSAES 534
            +   ASD  L  + V + + D         N      ++ D        R + K+L+ ++
Sbjct: 208  INIAASDTSLSVDCVGQNQADLR-------NTFSVGDLQSDSH----VDRGTRKELAGDT 256

Query: 533  LRQKKDDSQCTSADHGLSTKPIDEFRQGGSFSQINPNIVSNTWDIESIGEDVFPCYEDNF 354
              +   +    + D+ +++K ++   +          I SNT D    GE++    +   
Sbjct: 257  GLKISSN----TGDNNIASKEVNNIAK----------ISSNTDDNNIAGEEIKESCKARS 302

Query: 353  D---------MEVIENE-EAVEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYKKKIRE 204
            D          ++IE++ E VE  +  KLE+TC+LV+ +KLH V QG+ K KSYKKK+R+
Sbjct: 303  DKSCSPPPDKYDLIESDVEIVERYDEPKLEETCVLVEAEKLH-VPQGSVKRKSYKKKLRQ 361

Query: 203  ALSSKLRSTRKQDPCVSHCKDLGGQNNGGV----TTIPALEMDSDKRNL-PVHDSFESDW 39
              S K +STR +         L G     +      +  L  +S+ + L    D  ES+W
Sbjct: 362  VFSMKKKSTRTE---YEQLGALYGDQQPNLQPEEKQMQVLSKNSNPKKLSSADDHSESEW 418

Query: 38   EIL 30
            E+L
Sbjct: 419  ELL 421


>gb|EXB78097.1| hypothetical protein L484_004798 [Morus notabilis]
          Length = 443

 Score =  179 bits (455), Expect = 2e-42
 Identities = 158/485 (32%), Positives = 226/485 (46%), Gaps = 41/485 (8%)
 Frame = -3

Query: 1361 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1182
            MD KGITW GN+YQKFE MCLEVEE+MY+DTVKYVENQVQ VG SVK+FYS+VMQDLLPP
Sbjct: 1    MDVKGITWVGNVYQKFEAMCLEVEEIMYQDTVKYVENQVQTVGASVKRFYSDVMQDLLPP 60

Query: 1181 SCVDPVKV----------APGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDL 1032
            S  D  KV          +   +S  P       K KP   D     +  ++ ++     
Sbjct: 61   SSQDSEKVSLCGFIGKQDSDDGISKKPNV---AKKEKPAKADDEQLIRTLKVTSDSKDVY 117

Query: 1031 IAEKSSLRVHNDANHL---SSLSPRGLVENTHS-----DVCFTKSKKVGVYKRPIGIKRI 876
            +A   S+ V  D +++   S    +G   N  S     DV    S  + V +     K I
Sbjct: 118  LA--PSIHVRCDVDNMCRPSGECVKGACSNLRSRKKCRDVSVHSSSNLSVNENRSDKKLI 175

Query: 875  SQN-----NHPSKISRPMTSLSGDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACV 711
                         +SRP++S S   + +   S D   TT        +VN          
Sbjct: 176  PPETSCAITREKHLSRPLSSYSEFVNEIHEISLDQTGTTK-----APSVN---------- 220

Query: 710  ESLASDKILIAESVKEEKDDSECAS------HASDNILLAESVKQDKEDCECASRASDKK 549
            E  +SD I+  ES  E ++ SEC +      HAS  I+L +SV  D  + +  S      
Sbjct: 221  EDTSSDSIV--ESCDEIENSSECMADLSSSFHASSEIILVKSVGYDGNEMDVPSGGG--- 275

Query: 548  LSAESLRQKKDDSQCTSADHGLSTKPIDEFRQGGSFSQINPNIVSNTWDIESIGEDVF-- 375
            LS     Q   D     + + L++        GGS             + +   EDVF  
Sbjct: 276  LS----EQANGDYTSKCSSNSLAS-------TGGSSQN------EEARNDKYADEDVFVS 318

Query: 374  -PCYEDNFDMEVIENE-------EAVEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYK 219
             P   D++++ + E+E       E ++  +  KLE+TC+LV+ D+LH + Q   K + YK
Sbjct: 319  LPRKFDDWNLNITESEIATEHGTETIQQRDKVKLEETCVLVNEDELHILPQRGGKWRPYK 378

Query: 218  KKIREALSSKLRSTRKQ--DPCVSHCKDLGGQNNGGVTTIPALEMDSDKRNLPVHDSFES 45
            KKIR+AL S++RS RK+  +  V    D    N      +    +  +++ LP  DS ES
Sbjct: 379  KKIRDALYSRMRSARKEEYEQLVLQYGDNKKLNQDFGEALAPTLIVKERKKLPHLDSCES 438

Query: 44   DWEIL 30
            +WE+L
Sbjct: 439  EWELL 443


>ref|XP_002525120.1| hypothetical protein RCOM_0553590 [Ricinus communis]
            gi|223535579|gb|EEF37247.1| hypothetical protein
            RCOM_0553590 [Ricinus communis]
          Length = 490

 Score =  172 bits (437), Expect = 2e-40
 Identities = 164/524 (31%), Positives = 226/524 (43%), Gaps = 80/524 (15%)
 Frame = -3

Query: 1361 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1182
            MD KGI+W GNIYQKFE MCLEVEEVMY+DTVKYVENQVQ VG SVK+FYS+VMQDLLPP
Sbjct: 1    MDLKGISWVGNIYQKFEAMCLEVEEVMYQDTVKYVENQVQTVGSSVKRFYSDVMQDLLPP 60

Query: 1181 SCVDPVKVAPGDLSLNPYAHTDL-NKSKPTMLDSYGEFKKKEIENEDISDLIAEKS--SL 1011
            S VD  K A  D+ L  YA   +  K K  + +  G+   +E   ED      +KS   L
Sbjct: 61   SSVDAAKGAGVDVPLELYADLGIYMKPKVGVKEKQGKVDDRERLTEDPKITTDKKSMDPL 120

Query: 1010 RVHNDANHLSSLSPRGLVENTHSDVCFTKSKKVGVYKRPIGIKRISQNNHP-SKISRPMT 834
              H            GLVEN       ++    G   R  G + +S  ++P ++ +    
Sbjct: 121  TFHR----------LGLVENRFP---LSQGNSAGGASRQHGKRSLSNKSNPYTRKNSNRE 167

Query: 833  SLSGDKSRLLVASDDMNVTTSV----------------------RCHPGEA--------- 747
            ++S DK    ++  D  +  +                        C P +          
Sbjct: 168  NMSVDKKLEAISCLDKGLIRASFSERSNENLGDSGGGAPKQYGDSCLPKDTSLGTNGNSE 227

Query: 746  ----------------VNNITPISEACVESLASDKILIAESVK----------------E 663
                             N++T  S  C  S  + K  + +  K                E
Sbjct: 228  RQNIFLHEKARVVIPLYNDLTRASSICELSNENHKDCVDQQAKITTPGSVEMTGHDSVDE 287

Query: 662  EKDDSECASH----ASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQCTSA 495
             K + E AS       D +   ES      D  C+S  S   LSAE+     DD     A
Sbjct: 288  SKYEIENASEQIPDIPDMVNSTESGASKGMDMTCSSHGS---LSAEA--HAADDCMSHGA 342

Query: 494  DHGLSTKPIDEFRQG---GSFSQINPNIVSNTWDIESIGEDVFPCYEDNFDMEVIENEEA 324
            D      P D F  G   G  S  + + VSN+   +    DV+       D  +    E 
Sbjct: 343  DF-----PADSFVNGNGKGQSSDSDEDFVSNSGS-DDCNTDVY-----KIDFSISHEMEI 391

Query: 323  VEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYKKKIREALSSKLRSTRKQDPCVSHCK 144
            ++ V+ +KLE++CILV+ D+ H++ Q   K KSYKKKIR+  S + RS RK +  +S C 
Sbjct: 392  IQQVDKAKLEESCILVNRDECHYLPQSERKSKSYKKKIRDVFSPRKRSMRKHEQ-LSICP 450

Query: 143  DLGGQNNGGVTTIPALEM------DSDKRNLPVHDSFESDWEIL 30
              G  +N          M      D+D+ + P  D  +S+WE L
Sbjct: 451  --GSDSNPNQEECAKNSMPRHTIKDADRYSTP--DCCDSEWEFL 490


>ref|XP_002279986.1| PREDICTED: uncharacterized protein LOC100250516 [Vitis vinifera]
            gi|302143402|emb|CBI21963.3| unnamed protein product
            [Vitis vinifera]
          Length = 451

 Score =  166 bits (420), Expect = 2e-38
 Identities = 157/495 (31%), Positives = 236/495 (47%), Gaps = 51/495 (10%)
 Frame = -3

Query: 1361 MDFKGITWAGNIYQKFETMCLEVEEVMYEDT-------VKYVENQVQKVGVSVKKFYSEV 1203
            MDFKGITW GN+YQKFET+CLEVE++MY+DT       VKYVE+QV+ VG SVKKF SE+
Sbjct: 1    MDFKGITWVGNMYQKFETICLEVEDIMYQDTVKYFENHVKYVEDQVETVGESVKKFCSEI 60

Query: 1202 MQDLLPPSCVDPVKVAPGDLSLNPYAHTDLNKSKPTM---------------LDSYGEFK 1068
            +QDLL P   D ++V   +LSL+ + +  L K KP +               +    EF 
Sbjct: 61   VQDLLLP---DSLEVTDSNLSLDQHDNVKLCK-KPKVGIKEEAKVGFKEEPKVSIKEEFI 116

Query: 1067 KKEI----ENEDISDL---IAEKSSLRVHNDANHL----------SSLSPRGLVENTHSD 939
            K +I    E+ +I+DL   +  KSS    +  N+L           + S   LV+N    
Sbjct: 117  KFDIDRLTEHSEIADLNEDVEHKSSFTGLHGVNNLFQSYSGNSVTGACSDLHLVQNDDGV 176

Query: 938  VCFTKSKKVGVYKRPIGIKRISQNNHPSKISRPMTSLSGDKSRLLVASDDMNVTTSVRCH 759
            +C  K+   G+ + P+   ++SQ   P ++S  +  +SGD SRL      +N     +C+
Sbjct: 177  MC--KNLDAGIKRNPV---KVSQ--FPIEVSGVIAPISGDVSRL---PSSLNENCENKCN 226

Query: 758  PGEAVNNITPISEACVESLASDKILIAESVKEEKDDSECASHASDNILLAESVKQDKEDC 579
                 +     S A VE    +   +  ++  E  D    S    ++ L ESV ++  + 
Sbjct: 227  QMAITS-----SPASVEITDCN---LEGAICNEIADVTAISVDLPSVPLVESVGKEGREM 278

Query: 578  ECASRAS-DKKLSAESLRQKKDDSQCTSADHGLSTKPIDEFRQGGSFSQINPNIVSNTWD 402
              +SR     +L+A ++            D+G+ +         GSF  I  N  +   D
Sbjct: 279  VFSSRGGLSSELNAGNI----------PMDNGVGSLI-------GSFRDIQQNETAEKKD 321

Query: 401  IESIGEDVFPCYEDNFDMEVIENEEAVEP-VETS-------KLEDTCILVDGDKLHFVSQ 246
            + S  E       D ++++ IE  + +E  +ET+       KLED C++VDGD+LH VS 
Sbjct: 322  LLSHSEG-----SDGWNIDAIEINDVIEQGIETTKDLLDKMKLEDACVMVDGDELHVVSH 376

Query: 245  GTEKHKSYKKKIREALSSKLRSTRKQDPCVS---HCKDLGGQNNGGVTTIPALEMDSDKR 75
               K    KKK+R A  SK R  RK+   ++      D      G     P+   DSDKR
Sbjct: 377  REGKVWLVKKKLRNAFYSKRRLARKEYERLAVWHRVIDSESNQPGAEGLTPSPSTDSDKR 436

Query: 74   NLPVHDSFESDWEIL 30
              P  D  +S+WE+L
Sbjct: 437  TSPDDDFCQSEWELL 451


>gb|EOX92817.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508700922|gb|EOX92818.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 397

 Score =  165 bits (417), Expect = 5e-38
 Identities = 143/461 (31%), Positives = 221/461 (47%), Gaps = 16/461 (3%)
 Frame = -3

Query: 1364 TMDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYS----EVMQ 1197
            +MD KGITW G++Y+KFE MCLEVEEVMY+DTVKYVEN+VQ VG SVKKFYS    +VMQ
Sbjct: 3    SMDLKGITWVGHVYEKFEAMCLEVEEVMYQDTVKYVENRVQTVGASVKKFYSGMMQDVMQ 62

Query: 1196 DLLPPSCVDPVK-VAPGDLSLNPYAHTDLNKSKPTMLDS--YGEFKKKEIENEDISDLIA 1026
            DLL PS ++P+K VA  DL +  YA T L K    + +    G+ ++   ++E I+D+  
Sbjct: 63   DLLLPSSLEPMKAVAASDLPVEIYAET-LKKPNVGLKEDAIQGDSEQLTEDSEVIADVNE 121

Query: 1025 E----KSSLRVHNDANHLSSLSPRGLVENTHSDVCFTKSKKVGVYKRPIGIKRIS-QNNH 861
                  SS ++H   N   S S    VE   SD+        G +     + + + ++  
Sbjct: 122  NAAHVPSSCQLHMVDNIFESCS-GSFVERASSDLL------SGEHNNRCTLNKTNVEHLL 174

Query: 860  PSKISRPMTSLSGDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDKILI 681
            P++ S     +  +  R+     + N    V CH   A   +TP+S   VE    D   I
Sbjct: 175  PAETSSEAGCVENEFGRMSSFCGNANANHEVSCHQIPA--TLTPVS---VEEDDCDS--I 227

Query: 680  AESVKEEKDDSECASH-ASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQC 504
             ES  E K  S+       D + L   V++++ +  C+S   + + S   L   KD S  
Sbjct: 228  EESSNEIKSASDSVPEILPDGLHLVGIVEKNEMEMRCSSSIIESEESNGKLNWTKDAS-- 285

Query: 503  TSADHGLSTKPIDEFRQGGSFSQINPNIVSNTWDIESIGEDVFPCYEDNFDMEVIENEEA 324
                 G ST    E                                            E 
Sbjct: 286  -----GSSTVGRKEI-------------------------------------------ET 297

Query: 323  VEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYKKKIREALSSKLRSTRKQD---PCVS 153
            V+ ++  +++++C +V+G +LHF  Q   KHK+Y++KIR+A+SS++RS RK++     + 
Sbjct: 298  VQQLDKIRVDESCFMVNGAELHFHPQREGKHKTYQRKIRDAISSRMRSARKKEYEQLPLW 357

Query: 152  HCKDLGGQNNGGVTTIPALEMDSDKRNLPVHDSFESDWEIL 30
            +  D+    +   ++  AL  +  +R L  HD  +S+WE+L
Sbjct: 358  YGDDVKSDQDSEGSSTSALTREDTRRTLN-HDDLDSEWELL 397


>ref|XP_006446765.1| hypothetical protein CICLE_v10015391mg [Citrus clementina]
            gi|567908905|ref|XP_006446766.1| hypothetical protein
            CICLE_v10015391mg [Citrus clementina]
            gi|557549376|gb|ESR60005.1| hypothetical protein
            CICLE_v10015391mg [Citrus clementina]
            gi|557549377|gb|ESR60006.1| hypothetical protein
            CICLE_v10015391mg [Citrus clementina]
          Length = 416

 Score =  160 bits (406), Expect = 9e-37
 Identities = 141/458 (30%), Positives = 226/458 (49%), Gaps = 14/458 (3%)
 Frame = -3

Query: 1361 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1182
            MD KGITW G++YQKFE MCLEVEE+MY+DTVKYVENQVQ VG +VKKFYS+V++DLLPP
Sbjct: 1    MDLKGITWVGHVYQKFEAMCLEVEEIMYQDTVKYVENQVQTVGSTVKKFYSDVIEDLLPP 60

Query: 1181 SCVDPVKVA-PGDLSLNPYAHTDL-NKSKPTMLDSYGEFKKKEIENEDISDLIAEK---- 1020
              VD VK A   +L L   A   +  K K  + +       +++    ++    +K    
Sbjct: 61   PSVDLVKGAVASNLPLEQNADVGIYKKPKIGIKEEAMNVNNEQLSESSLATTDLDKGAGG 120

Query: 1019 --SSLRVH-NDANHLSSLSP--RGLVENTHSDVCFTKSKKVGVYKRPIGIKRISQNNH-- 861
              S  R H  D +   SL    +G+  + +S     +S   G  +  I +++IS+ ++  
Sbjct: 121  GQSFCRFHIEDTSFQPSLGDTLKGVFSDAYSKEYDIRS---GHNQSSICMQKISKEDNLP 177

Query: 860  PSKISRPMTSLSGDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDKILI 681
            PS++S     +     R            S  C   + +  ++        +  + ++  
Sbjct: 178  PSEMSGAGPHMERGLRR-----------ASSSCELLDKIQEVSDDQVVVDPTPVTTEVAS 226

Query: 680  AESVKEEKDDSECASHASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQCT 501
             +S +E  D+ E AS  +   L +    ++ ++ E A  +S   LSAE       +  CT
Sbjct: 227  CKSFEEIYDELEKASKGASGALTSSPAAKNCDESENA-HSSCSSLSAEL------NGICT 279

Query: 500  SADHGLSTKPIDEFRQGGSFSQINPNIVSNTW-DIESIGEDVFPCYEDNFDMEVIENEEA 324
            + D  +S          GSF  +N ++  + + D            E N D+E  +  E 
Sbjct: 280  N-DGVVSLV--------GSF--VNEDVQPSEFPDPGRSDYSTVDATESNIDVE--QGYET 326

Query: 323  VEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYKKKIREALSSKLRSTRKQDPCVSHCK 144
            V+ V+  ++E+TC+LV+GD+L FV     KH+ YKKKI++A+SS++RSTRK +      K
Sbjct: 327  VQRVDNIQVEETCVLVNGDELCFVPCREGKHRPYKKKIQDAISSRMRSTRKHE-----YK 381

Query: 143  DLGGQNNGGVTTIPALEMDSDKRNLPVHDSFESDWEIL 30
             L    N       + + +++ +  P H   E +WE+L
Sbjct: 382  QLAVWYN---EDEKSKQQNAEMKGKPSHGYCELEWELL 416


>ref|XP_006469032.1| PREDICTED: uncharacterized protein LOC102611541 isoform X1 [Citrus
            sinensis] gi|568829444|ref|XP_006469033.1| PREDICTED:
            uncharacterized protein LOC102611541 isoform X2 [Citrus
            sinensis] gi|568829446|ref|XP_006469034.1| PREDICTED:
            uncharacterized protein LOC102611541 isoform X3 [Citrus
            sinensis] gi|568829448|ref|XP_006469035.1| PREDICTED:
            uncharacterized protein LOC102611541 isoform X4 [Citrus
            sinensis]
          Length = 416

 Score =  159 bits (402), Expect = 3e-36
 Identities = 139/459 (30%), Positives = 223/459 (48%), Gaps = 15/459 (3%)
 Frame = -3

Query: 1361 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1182
            MD KGITW G++YQKFE MCLEVEE+MY+DTVKYVENQVQ VG +VKKFYS+V++DLLPP
Sbjct: 1    MDLKGITWVGHVYQKFEAMCLEVEEIMYQDTVKYVENQVQTVGSTVKKFYSDVIEDLLPP 60

Query: 1181 SCVDPVKVA-PGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSLRV 1005
              VD VK A   +L L   A   + K     +      +  ++ NE +S+     + L  
Sbjct: 61   PSVDLVKGAVASNLPLEQNADVGIYKKPKIGIKE----EAMKVNNEQLSESSLATTDLDK 116

Query: 1004 HNDAN------HLSSLSPRGLVENTHSDV---CFTKSKKV--GVYKRPIGIKRISQNNH- 861
                       H+   S +  + NT   V    + K   +  G  +  I +++IS+ ++ 
Sbjct: 117  GAGGGQSFCRFHIEDTSFQPSLGNTLKGVFSDAYPKEYDIRSGHNQSSICMQKISKEDNL 176

Query: 860  -PSKISRPMTSLSGDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDKIL 684
             PS++S     +     R            S  C   + +  ++        +  + ++ 
Sbjct: 177  PPSEMSGAGPHMERGLRR-----------ASSSCELLDKIQEVSDDQVVVDPTSVTTEVA 225

Query: 683  IAESVKEEKDDSECASHASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQC 504
              +S +E  D+ E AS  +   L +    ++ ++ E A  +S   LSAE       +  C
Sbjct: 226  SCKSFEEIYDELEKASKGASGALTSSPAAKNCDESESA-HSSCSSLSAEL------NGIC 278

Query: 503  TSADHGLSTKPIDEFRQGGSFSQINPNIVSNTW-DIESIGEDVFPCYEDNFDMEVIENEE 327
            T+ D  +S          GSF  +N ++  + + D            E N D+E  +  E
Sbjct: 279  TN-DGVVSLV--------GSF--VNEDVQPSEFPDPGRSDYSTVDATESNIDVE--QGYE 325

Query: 326  AVEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYKKKIREALSSKLRSTRKQDPCVSHC 147
             V+ V+  ++E+TC+LV+GD+L FV    +KH+  KKKI++A+SS++RSTRK +      
Sbjct: 326  TVQRVDNIQVEETCVLVNGDELCFVPCREDKHRPCKKKIQDAISSRMRSTRKHE-----Y 380

Query: 146  KDLGGQNNGGVTTIPALEMDSDKRNLPVHDSFESDWEIL 30
            K L    N       + + +++ +  P H   E +WE+L
Sbjct: 381  KQLAVWYN---EDEKSKQQNAETKGKPSHGYCELEWELL 416


>ref|XP_004303170.1| PREDICTED: uncharacterized protein LOC101303722 [Fragaria vesca
            subsp. vesca]
          Length = 389

 Score =  159 bits (402), Expect = 3e-36
 Identities = 147/459 (32%), Positives = 211/459 (45%), Gaps = 14/459 (3%)
 Frame = -3

Query: 1364 TMDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLP 1185
            TMD KGITW G +Y+KFE+MCLEVEE MYEDTVK+VE+QVQ VG SVKKFY++VMQDLL 
Sbjct: 3    TMDVKGITWVGCVYEKFESMCLEVEENMYEDTVKFVEDQVQTVGESVKKFYADVMQDLLC 62

Query: 1184 PSCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSL-R 1008
             S +D   V+ G   +  Y+  D +KSK          KKKE     + ++  +   +  
Sbjct: 63   DSSLDRDDVSAGGFPVEHYSDVDNSKSKIR--------KKKEHVKAGVEEVKGDSEVISA 114

Query: 1007 VHNDANHLSSLSPRGLVENTHSDVCFTKSK----KVGVYKRPIGI----KRISQNNHPSK 852
            V  D +H       GL          TKS     K+   ++  G+    K+I     P K
Sbjct: 115  VLKDVDH------TGLFHRQRVYDSCTKSSGNCAKLACSRQDHGVRSCNKKIVVRETPIK 168

Query: 851  ISRP--MTSLSGDKSR--LLVASDDMNVTTSVRC-HPGEAVNNITPISEACVESLASDKI 687
               P   T++  D SR  L   S+  N      C  P E +    P      +S+ S+  
Sbjct: 169  DRLPGANTAVGKDFSRESLSSCSEFSNEDRDTSCDQPDEVITPSKPPEGMRCDSM-SESC 227

Query: 686  LIAESVKEEKDDSECASHASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQ 507
            ++A + +   DD      +SD I+L      D  D                   K+ +  
Sbjct: 228  VVANASQCTGDDVSVNCQSSDMIVL------DNSD------------------GKRWNEL 263

Query: 506  CTSADHGLSTKPIDEFRQGGSFSQINPNIVSNTWDIESIGEDVFPCYEDNFDMEVIENEE 327
              S+  GLST+       GGS   INP++ +   +I + G ++                 
Sbjct: 264  LDSSIGGLSTE-----LNGGS---INPSMDAIESNIGTHGTEI----------------- 298

Query: 326  AVEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYKKKIREALSSKLRSTRKQDPCVSHC 147
             ++  +  KLE+TC++V G+ LHFV      +K YKKKI +A +S+  S RKQ+      
Sbjct: 299  -IQQSDKPKLEETCVMVSGEDLHFVHHTVANYKPYKKKIPKAFTSRTSSARKQE-----Y 352

Query: 146  KDLGGQNNGGVTTIPALEMDSDKRNLPVHDSFESDWEIL 30
            + L   +  G  T   LE   + +  P HD  ES+WEIL
Sbjct: 353  EQLALWH--GHHTKSILEGGEESKKSPTHDFCESEWEIL 389


>ref|XP_006348849.1| PREDICTED: uncharacterized protein LOC102594335 isoform X1 [Solanum
            tuberosum]
          Length = 260

 Score =  148 bits (374), Expect = 5e-33
 Identities = 107/291 (36%), Positives = 141/291 (48%), Gaps = 4/291 (1%)
 Frame = -3

Query: 1361 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1182
            MD K I+W GNIYQKFETMCLE+EE MY+DTVKYVENQV  VG +VK+F SEVMQD+ P 
Sbjct: 1    MDLKSISWVGNIYQKFETMCLEMEEAMYQDTVKYVENQVNTVGTNVKRFCSEVMQDVHPQ 60

Query: 1181 SCVDPVKVAPGDLSLNPYAHTDLNKS-KPTMLDSYGEFKKKEIENEDISDLIAEKSSLRV 1005
              +DPVKVA  DLS+NPYAH +++K  K  +  S   F  K                   
Sbjct: 61   CNIDPVKVAAADLSINPYAHYEIDKKLKANLKGSARRFSNK------------------- 101

Query: 1004 HNDANHLSSLSPRGLVENTHSDVCFTKSKKVGVYKRP-IGIKRISQNNHPSKISRPMTSL 828
                             N  + V   KSK  GVYKR  +GIK I +++HP+K    +   
Sbjct: 102  ----------------LNDDTQVIKGKSKSGGVYKRQNVGIKEIVRDSHPAKKPNAICLA 145

Query: 827  SGDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDKILIAESVKEEK--D 654
            SGD  +L         ++S     G                +ASD + +  ++   K  D
Sbjct: 146  SGDALKL---------SSSAEVRGG--------------FEMASDHVTLTSALASVKGSD 182

Query: 653  DSECASHASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQCT 501
              E AS   D+ +          D    S AS   +S ES+R+K+ D+ CT
Sbjct: 183  SGEAASKVRDHFI---QTNVSAADTSITSEAS-VTMSVESVRKKQTDT-CT 228


>gb|EOX92819.1| Uncharacterized protein isoform 3, partial [Theobroma cacao]
          Length = 343

 Score =  147 bits (370), Expect = 1e-32
 Identities = 128/406 (31%), Positives = 193/406 (47%), Gaps = 13/406 (3%)
 Frame = -3

Query: 1364 TMDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYS----EVMQ 1197
            +MD KGITW G++Y+KFE MCLEVEEVMY+DTVKYVEN+VQ VG SVKKFYS    +VMQ
Sbjct: 3    SMDLKGITWVGHVYEKFEAMCLEVEEVMYQDTVKYVENRVQTVGASVKKFYSGMMQDVMQ 62

Query: 1196 DLLPPSCVDPVK-VAPGDLSLNPYAHTDLNKSKPTMLDS--YGEFKKKEIENEDISDLIA 1026
            DLL PS ++P+K VA  DL +  YA T L K    + +    G+ ++   ++E I+D+  
Sbjct: 63   DLLLPSSLEPMKAVAASDLPVEIYAET-LKKPNVGLKEDAIQGDSEQLTEDSEVIADVNE 121

Query: 1025 E----KSSLRVHNDANHLSSLSPRGLVENTHSDVCFTKSKKVGVYKRPIGIKRIS-QNNH 861
                  SS ++H   N   S S    VE   SD+        G +     + + + ++  
Sbjct: 122  NAAHVPSSCQLHMVDNIFESCS-GSFVERASSDLL------SGEHNNRCTLNKTNVEHLL 174

Query: 860  PSKISRPMTSLSGDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDKILI 681
            P++ S     +  +  R+     + N    V CH   A   +TP+S   VE    D   I
Sbjct: 175  PAETSSEAGCVENEFGRMSSFCGNANANHEVSCHQIPA--TLTPVS---VEEDDCDS--I 227

Query: 680  AESVKEEKDDSECASH-ASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQC 504
             ES  E K  S+       D + L   V++++ +  C+S   + + S   L   KD S  
Sbjct: 228  EESSNEIKSASDSVPEILPDGLHLVGIVEKNEMEMRCSSSIIESEESNGKLNWTKDAS-- 285

Query: 503  TSADHGLSTKPIDEFRQGGSFSQINPNIVSNTWDIESIGEDVFPCYEDNFDMEVIENEEA 324
                 G ST    E                                            E 
Sbjct: 286  -----GSSTVGRKEI-------------------------------------------ET 297

Query: 323  VEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYKKKIREALSSKL 186
            V+ ++  +++++C +V+G +LHF  Q   KHK+Y++KIR+A+SS++
Sbjct: 298  VQQLDKIRVDESCFMVNGAELHFHPQREGKHKTYQRKIRDAISSRM 343


>ref|XP_002881158.1| hypothetical protein ARALYDRAFT_482041 [Arabidopsis lyrata subsp.
            lyrata] gi|297326997|gb|EFH57417.1| hypothetical protein
            ARALYDRAFT_482041 [Arabidopsis lyrata subsp. lyrata]
          Length = 418

 Score =  144 bits (362), Expect = 1e-31
 Identities = 126/421 (29%), Positives = 199/421 (47%), Gaps = 22/421 (5%)
 Frame = -3

Query: 1361 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1182
            MDFKGI W GN+YQKFE MCLEVEE++ +DT KYVENQVQ VG SVKKF S+V+QDLLP 
Sbjct: 1    MDFKGIKWVGNVYQKFEAMCLEVEEIIVQDTAKYVENQVQTVGNSVKKFCSDVVQDLLPD 60

Query: 1181 SCVDPVKVAPGDLSLNPYAHT-DLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSLRV 1005
              VD  K  P  + L+ YA      K + +M     + K+++   E   D  A+K     
Sbjct: 61   DSVDSGKPLPVSM-LHEYAPVCSFKKKRDSMNRKTRDVKQEQEVTEGKKDGCAQKF---- 115

Query: 1004 HNDANHLSSLSPRGLVENTHSDVCFTKSKKV--GVYKRP-IGIKRISQNNHPSKISRPMT 834
                        RGL  + + D+C +  +    G Y+R  +G K+I +    S+++RP  
Sbjct: 116  ------------RGLDADDY-DICTSPRQYSYGGPYRRTRVGRKQIFKKEELSQVTRPY- 161

Query: 833  SLSGDKSRLLV-----ASDD---MNVTTSVRCHPGEAVNNITPISEACVESLASDKILIA 678
             +  D S L +       DD   +N ++    H     +++  ++ + +  + S +  I 
Sbjct: 162  -MQKDSSSLSMVHSARVKDDVGTVNSSSLSMVHSARVKDDVGTVNSSSLTMVHSAR--IK 218

Query: 677  ESVKEEKDDSECASHASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQCTS 498
            + V   K            I   E  K DK         + + L+  +  ++ D      
Sbjct: 219  DDVGTVKSSDSPPGEVEKLIYKKECQKDDK-------TKNQQSLTVVNSVKRNDSEIRID 271

Query: 497  ADHGLSTKPIDEFRQGGSFSQINPNIVSNTWDIESIGEDVFPC-YEDNFDMEVI------ 339
             +HGL              S+I P++ ++     + G D   C  E N D +        
Sbjct: 272  NEHGL-------MGDSSQDSEIQPSVATSL----AAGSD--DCRKETNVDTKTSSSSVSE 318

Query: 338  ENEEAVEPVETSKLEDTCILVDGDKLHFV---SQGTEKHKSYKKKIREALSSKLRSTRKQ 168
            +  E ++P+    +E++CILVD D+ H V       +KHK Y KKIR+A+SS+++  R++
Sbjct: 319  QKSEILQPLSGRSVEESCILVDRDEFHCVFPDKMENDKHKPY-KKIRDAISSRMKQNREK 377

Query: 167  D 165
            +
Sbjct: 378  E 378


>gb|EOX92820.1| Uncharacterized protein isoform 4, partial [Theobroma cacao]
          Length = 341

 Score =  139 bits (350), Expect = 3e-30
 Identities = 127/406 (31%), Positives = 191/406 (47%), Gaps = 13/406 (3%)
 Frame = -3

Query: 1364 TMDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYS----EVMQ 1197
            +MD KGITW G++Y+KFE MCLEVEEVMY+DTVKYVEN+VQ VG SVKKFYS    +VMQ
Sbjct: 3    SMDLKGITWVGHVYEKFEAMCLEVEEVMYQDTVKYVENRVQTVGASVKKFYSGMMQDVMQ 62

Query: 1196 DLLPPSCVDPVK-VAPGDLSLNPYAHTDLNKSKPTMLDS--YGEFKKKEIENEDISDLIA 1026
            DLL PS ++P+K VA  DL +  YA T L K    + +    G+ ++   ++E I+D+  
Sbjct: 63   DLLLPSSLEPMKAVAASDLPVEIYAET-LKKPNVGLKEDAIQGDSEQLTEDSEVIADVNE 121

Query: 1025 E----KSSLRVHNDANHLSSLSPRGLVENTHSDVCFTKSKKVGVYKRPIGIKRIS-QNNH 861
                  SS ++H   N   S S    VE   SD+        G +     + + + ++  
Sbjct: 122  NAAHVPSSCQLHMVDNIFESCS-GSFVERASSDLL------SGEHNNRCTLNKTNVEHLL 174

Query: 860  PSKISRPMTSLSGDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDKILI 681
            P++ S     +  +  R+     + N    V CH   A   +TP+S   VE    D   I
Sbjct: 175  PAETSSEAGCVENEFGRMSSFCGNANANHEVSCHQIPA--TLTPVS---VEEDDCDS--I 227

Query: 680  AESVKEEKDDSECASH-ASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQC 504
             ES  E K  S+       D + L   V++++ +  C+S   + + S   L   KD S  
Sbjct: 228  EESSNEIKSASDSVPEILPDGLHLVGIVEKNEMEMRCSSSIIESEESNGKLNWTKDAS-- 285

Query: 503  TSADHGLSTKPIDEFRQGGSFSQINPNIVSNTWDIESIGEDVFPCYEDNFDMEVIENEEA 324
                 G ST    E                                            E 
Sbjct: 286  -----GSSTVGRKEI-------------------------------------------ET 297

Query: 323  VEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYKKKIREALSSKL 186
            V+ ++  +++++C +V+G +LHF  Q   KHK+Y  +IR+A+SS++
Sbjct: 298  VQQLDKIRVDESCFMVNGAELHFHPQREGKHKTY--QIRDAISSRM 341


>ref|XP_006294244.1| hypothetical protein CARUB_v10023243mg, partial [Capsella rubella]
            gi|482562952|gb|EOA27142.1| hypothetical protein
            CARUB_v10023243mg, partial [Capsella rubella]
          Length = 436

 Score =  139 bits (349), Expect = 4e-30
 Identities = 128/418 (30%), Positives = 200/418 (47%), Gaps = 19/418 (4%)
 Frame = -3

Query: 1361 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1182
            MDFKGI W GN+YQKFE MCLEVEE++ +DT KYVENQV  VG SVKKF S+V+QDLLP 
Sbjct: 13   MDFKGIKWVGNVYQKFEAMCLEVEEIIVQDTAKYVENQVHTVGNSVKKFCSDVVQDLLP- 71

Query: 1181 SCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKK-EIENEDISDLIAEKSSLRV 1005
               D   V  G     P   + LN+  P        FKKK E  N    D+  E+     
Sbjct: 72   ---DDDSVGSG----KPLPVSMLNEYAPVC-----SFKKKRESANRKTRDVKQEEEVTEG 119

Query: 1004 HNDANHLSSLSPRGLVENTHSDVCFTKSKKV--GVYKR-PIGIKRISQNNHPSKISRPMT 834
              D     +++ RGL  + + D+C +  +    G Y+R  +G K+I +    S+I+RP  
Sbjct: 120  KKDG---CAMNLRGLDADDY-DICTSPRQYSYGGPYRRGRVGRKQIFKKEELSQITRPY- 174

Query: 833  SLSGDKSRLLV-----ASDD---MNVTTSVRCHPGEAVNNITPISEACVESLASDKILIA 678
             +  D S L +       DD   +N ++    H G   +++  ++ + +  + S +  I 
Sbjct: 175  -IQKDSSNLTMVHSARVKDDVGTVNSSSLSMAHSGRVKDDVGTVNSSSLSMVHSAR--IK 231

Query: 677  ESVKEEKDDSECASHASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQCTS 498
              V+  K            I   E  K D+ D       +   L+  +  + KD    T 
Sbjct: 232  ADVETVKSSDSRPGEIERLISKKECQKDDRTD-------NQHGLTMVNSVRSKDSEIRTE 284

Query: 497  ADHGLSTKPIDEFRQGGSFSQINPNIVSNTWDIESIGEDVFPCYEDNFDMEVI----ENE 330
             +H L+   ++  R     S+I P++ ++     S  E      ED+ +        +  
Sbjct: 285  IEHSLTV--VNSVR--SQDSEILPSVATSLL-TGSSNEFRKETKEDSMEASSSSVSEQKS 339

Query: 329  EAVEPVETSKLEDTCILVDGDKLHFV---SQGTEKHKSYKKKIREALSSKLRSTRKQD 165
            E ++ +    +E++CILVD D+ H V       +KHK Y KKIR+A+SS+++  R+++
Sbjct: 340  EILQHLSGRSVEESCILVDRDEFHCVFPDKMENDKHKPY-KKIRDAISSRMKQNREKE 396


>ref|XP_004149372.1| PREDICTED: uncharacterized protein LOC101205697 [Cucumis sativus]
          Length = 379

 Score =  138 bits (348), Expect = 5e-30
 Identities = 129/452 (28%), Positives = 201/452 (44%), Gaps = 8/452 (1%)
 Frame = -3

Query: 1361 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1182
            MD KGI W G +Y+KFETMCLEVE+++ +DTVKYVENQV+ VG SVK+FYS+VMQD LPP
Sbjct: 1    MDVKGIAWVGRLYEKFETMCLEVEDIICQDTVKYVENQVEVVGASVKRFYSDVMQDFLPP 60

Query: 1181 SCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSS---- 1014
            S +   KVA  + +L  Y +  + K KPTM       K  E ++ + S + A+       
Sbjct: 61   SELSDEKVAVCNSALENYENVVICK-KPTMGMKIERSKFSEEKSNENSKVTADAKRDIAC 119

Query: 1013 --LRVHNDANHLSSLSPRGLVENTHSDVCFTKSKKVGVYKRPIGIKRISQN-NHPSKISR 843
               R HN AN+L  +S         S        ++  Y R    K+  +N +H   +  
Sbjct: 120  KLPRGHNHANYLYLVS---------SPYSAANRAQIDGYSR----KKDDENIHHKIDLDG 166

Query: 842  PMTSLSGDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDKILIAESVKE 663
              ++  G KS  L  +   N+              +   SEA  E   + + ++ +    
Sbjct: 167  RESTTRGCKS--LTETSPTNLEKKYENDASSCCTILNRKSEASSELAGNMETMLVK---- 220

Query: 662  EKDDSECASHASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQCTSADHGL 483
               D+ C            SV Q          A++ ++  +++      S     +   
Sbjct: 221  ---DTRC-----------NSVMQS---------ANETEIKTDNILPDTPSSAIVDTE--- 254

Query: 482  STKPIDEFRQGGSFSQINPNIVSNTWDIESIGEDVFPCYEDNFDMEVIENEEAVEPVETS 303
              K       G S ++++    S++W ++ I              E+ +    ++  + +
Sbjct: 255  --KETRLLSYGDSSAELDGR--SDSWSLDDI--------------ELEQGTHNIQQADET 296

Query: 302  KL-EDTCILVDGDKLHFVSQGTEKHKSYKKKIREALSSKLRSTRKQDPCVSHCKDLGGQN 126
            KL E+ C+LV GD LHF      K + Y KKI  A S   +S RKQ+      K+L  ++
Sbjct: 297  KLDEEACVLVKGDDLHFDFNEEVKQRHY-KKIAGAFSFTKKSKRKQE-----YKELAMKH 350

Query: 125  NGGVTTIPALEMDSDKRNLPVHDSFESDWEIL 30
              G  TIP      D++ L   D  E DW++L
Sbjct: 351  GYGFGTIP---NQQDEQKLTAEDVLEQDWQLL 379


>ref|NP_565715.1| uncharacterized protein [Arabidopsis thaliana]
            gi|16612317|gb|AAL27517.1|AF439849_1 At2g31130/T16B12.6
            [Arabidopsis thaliana] gi|20197328|gb|AAC63838.2|
            expressed protein [Arabidopsis thaliana]
            gi|23506163|gb|AAN31093.1| At2g31130/T16B12.6
            [Arabidopsis thaliana] gi|330253402|gb|AEC08496.1|
            uncharacterized protein AT2G31130 [Arabidopsis thaliana]
          Length = 419

 Score =  136 bits (342), Expect = 2e-29
 Identities = 136/466 (29%), Positives = 208/466 (44%), Gaps = 22/466 (4%)
 Frame = -3

Query: 1361 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1182
            MDFKGI W GN+YQKFE MCLEVEE++ +DT KYVENQVQ VG SVKKF S+V+ DLLP 
Sbjct: 1    MDFKGIKWVGNVYQKFEAMCLEVEEIIVQDTAKYVENQVQTVGNSVKKFCSDVVHDLLPD 60

Query: 1181 SCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSLRVH 1002
              VD  K  P  + L+ YA              Y   KKK+  N    D+  E+      
Sbjct: 61   ESVDSGKPLPVSM-LHEYAPV------------YSFKKKKDSMNRKTKDVTQEQEVTEGK 107

Query: 1001 NDANHLSSLSPRGLVENTHSDVCFTKSKKV--GVYKRP-IGIKRISQNNHPSKISRP--- 840
             D     +   RGL  + + D+C +  +    G Y+R  IG K+I +    S++ RP   
Sbjct: 108  KDG---FAKKLRGLDADDY-DICTSPRQYSYGGPYRRTRIGRKQIFKKEELSQVIRPYIQ 163

Query: 839  --MTSLSGDKSRLL------VASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDKIL 684
              +TSLS   S  +      V S  +++  S R +      N + +S     S+  D   
Sbjct: 164  KDLTSLSMVHSARVKDDLGTVNSSSLSMVHSARVNDDVGTVNSSSLSMVHHASMKDDVGT 223

Query: 683  IAESVKEEKDDSECASHASDNILLAESVKQDKEDCECASRASDKK-LSAESLRQKKDDSQ 507
            +  S     +  +  S               K+ C+   +A +++ L+  +  +  D   
Sbjct: 224  VKSSDSPPGEVEKLIS---------------KKKCQKDDKAKNQQSLTVVNSVKSNDSEV 268

Query: 506  CTSADHGLSTKPIDEFRQGGSFSQINPNIVSNTWDIESIGEDVFPCYEDNFDMEVIENEE 327
                +HGLS       +      +I P++ ++    ES         E +         E
Sbjct: 269  IVDNEHGLSADKSVRSQD----LEIQPSLATSL-PAESDDCRKETNVETSSSSVSEPKSE 323

Query: 326  AVEPVETSKLEDTCILVDGDKLHFV---SQGTEKHKSYKKKIREALSSKLRSTRKQD--- 165
             ++ +    +E++CILVD D+ H V       +KHK Y KKIR+A+SS+++  R+++   
Sbjct: 324  ILQHLSGRSVEESCILVDRDEFHSVFPDKMENDKHKPY-KKIRDAISSRMKQNREKEYKR 382

Query: 164  -PCVSHCKDLGGQNNGGVTTIPALEMDSDKRNLPVHDSFESDWEIL 30
                 + +D+      G    P  E  S         S ES+WE+L
Sbjct: 383  LARQWYAEDVENGRECGDNPKPIEENQS---------SEESEWELL 419


>ref|XP_006410238.1| hypothetical protein EUTSA_v10016698mg [Eutrema salsugineum]
            gi|567211021|ref|XP_006410239.1| hypothetical protein
            EUTSA_v10016698mg [Eutrema salsugineum]
            gi|557111407|gb|ESQ51691.1| hypothetical protein
            EUTSA_v10016698mg [Eutrema salsugineum]
            gi|557111408|gb|ESQ51692.1| hypothetical protein
            EUTSA_v10016698mg [Eutrema salsugineum]
          Length = 426

 Score =  135 bits (341), Expect = 3e-29
 Identities = 125/414 (30%), Positives = 200/414 (48%), Gaps = 15/414 (3%)
 Frame = -3

Query: 1361 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1182
            M FKGITW GN+YQKFE MCLEVEE++ +DT KYVENQVQ VG S+KKF S+V+ D LP 
Sbjct: 1    MAFKGITWVGNVYQKFEAMCLEVEEIIVQDTAKYVENQVQTVGNSMKKFCSDVVGDFLPD 60

Query: 1181 SCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSLRVH 1002
              V   K  P  + L+ YA     K            KK+E  N    D+  E+      
Sbjct: 61   ESVGSEKPLPVSM-LHEYAPVCSFK------------KKRESLNRKTRDVKQEQEVSEGK 107

Query: 1001 NDANHLSSLSPRGLVENTHSDVCFTKSKKV--GVYKRP-IGIKRISQNNHPSKISRPMTS 831
             D      +  RGL  + + D+C +  +    G Y+R  +G K+I +N    +++RP + 
Sbjct: 108  KDG---CEMKFRGLDADDY-DICTSPRQYSYGGPYRRTRLGRKQIYKNEEVFQVTRP-SY 162

Query: 830  LSGDKSRLLV-----ASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDK---ILIAE 675
            +  D S L +      ++D+    S    P E    I+   E C +   ++    + +  
Sbjct: 163  IQKDSSSLSMVHRSRVNNDVGAVKSSDSPPVEVERLIS--KEECQKDDRTENQHGLTVVN 220

Query: 674  SVKEEKDDSECASHASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQCTSA 495
            SV+ +  DSE  +     + + +SV+   +D E  ++         S+R  +D       
Sbjct: 221  SVRSQ--DSETRTKKEHGLTMVDSVR--SQDSETRTKNEHGLTMVNSVR-SEDSEIGIEN 275

Query: 494  DHGLSTKPIDEFRQGGSFSQINPNIVSNTWDIESIGEDVFPCYEDNFDMEVIENEEAVEP 315
            +HGL+   ++  R   S  Q + +  S     +   E      E +      +  E ++ 
Sbjct: 276  EHGLTV--VNSGRCQDSEIQTSVSTSSPAGSDDCRKETNENSMETSSSSVSEQKSEILQE 333

Query: 314  V-ETSKLEDTCILVDGDKLHFV---SQGTEKHKSYKKKIREALSSKLRSTRKQD 165
            + E   LE++CI+VD D+LH V    +  +KHK Y KKIR+A+SS+++  R+++
Sbjct: 334  LSEGRSLEESCIIVDRDELHCVFPDRKENDKHKPY-KKIRDAISSRMKQNREKE 386


>gb|EOX92821.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508700926|gb|EOX92822.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 334

 Score =  133 bits (334), Expect = 2e-28
 Identities = 122/395 (30%), Positives = 182/395 (46%), Gaps = 13/395 (3%)
 Frame = -3

Query: 1364 TMDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYS----EVMQ 1197
            +MD KGITW G++Y+KFE MCLEVEEVMY+DTVKYVEN+VQ VG SVKKFYS    +VMQ
Sbjct: 3    SMDLKGITWVGHVYEKFEAMCLEVEEVMYQDTVKYVENRVQTVGASVKKFYSGMMQDVMQ 62

Query: 1196 DLLPPSCVDPVK-VAPGDLSLNPYAHTDLNKSKPTMLDS--YGEFKKKEIENEDISDLIA 1026
            DLL PS ++P+K VA  DL +  YA T L K    + +    G+ ++   ++E I+D+  
Sbjct: 63   DLLLPSSLEPMKAVAASDLPVEIYAET-LKKPNVGLKEDAIQGDSEQLTEDSEVIADVNE 121

Query: 1025 E----KSSLRVHNDANHLSSLSPRGLVENTHSDVCFTKSKKVGVYKRPIGIKRIS-QNNH 861
                  SS ++H   N   S S    VE   SD+        G +     + + + ++  
Sbjct: 122  NAAHVPSSCQLHMVDNIFESCS-GSFVERASSDLL------SGEHNNRCTLNKTNVEHLL 174

Query: 860  PSKISRPMTSLSGDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDKILI 681
            P++ S     +  +  R+     + N    V CH   A   +TP+S   VE    D   I
Sbjct: 175  PAETSSEAGCVENEFGRMSSFCGNANANHEVSCHQIPA--TLTPVS---VEEDDCDS--I 227

Query: 680  AESVKEEKDDSECASH-ASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQC 504
             ES  E K  S+       D + L   V++++ +  C+S   + + S   L   KD S  
Sbjct: 228  EESSNEIKSASDSVPEILPDGLHLVGIVEKNEMEMRCSSSIIESEESNGKLNWTKDAS-- 285

Query: 503  TSADHGLSTKPIDEFRQGGSFSQINPNIVSNTWDIESIGEDVFPCYEDNFDMEVIENEEA 324
                 G ST    E                                            E 
Sbjct: 286  -----GSSTVGRKEI-------------------------------------------ET 297

Query: 323  VEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYK 219
            V+ ++  +++++C +V+G +LHF  Q   KHK+Y+
Sbjct: 298  VQQLDKIRVDESCFMVNGAELHFHPQREGKHKTYQ 332

