BLASTX nr result

ID: Rehmannia22_contig00014794 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00014794
         (913 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006350037.1| PREDICTED: uncharacterized protein LOC102601...   147   7e-33
ref|XP_006350032.1| PREDICTED: uncharacterized protein LOC102601...   146   1e-32
ref|XP_006348849.1| PREDICTED: uncharacterized protein LOC102594...   144   4e-32
ref|XP_004251799.1| PREDICTED: uncharacterized protein LOC101256...   142   1e-31
ref|XP_004243407.1| PREDICTED: uncharacterized protein LOC101260...   140   5e-31
gb|EXB78097.1| hypothetical protein L484_004798 [Morus notabilis]     132   1e-28
gb|EPS62712.1| hypothetical protein M569_12076, partial [Genlise...   132   2e-28
ref|XP_002525120.1| hypothetical protein RCOM_0553590 [Ricinus c...   126   1e-26
gb|EOX92821.1| Uncharacterized protein isoform 5 [Theobroma caca...   121   3e-25
gb|EOX92820.1| Uncharacterized protein isoform 4, partial [Theob...   121   3e-25
gb|EOX92819.1| Uncharacterized protein isoform 3, partial [Theob...   121   3e-25
gb|EOX92817.1| Uncharacterized protein isoform 1 [Theobroma caca...   121   3e-25
ref|XP_004149372.1| PREDICTED: uncharacterized protein LOC101205...   117   6e-24
ref|XP_004303170.1| PREDICTED: uncharacterized protein LOC101303...   115   2e-23
ref|XP_006469032.1| PREDICTED: uncharacterized protein LOC102611...   113   9e-23
ref|XP_006446765.1| hypothetical protein CICLE_v10015391mg [Citr...   112   1e-22
ref|XP_002279986.1| PREDICTED: uncharacterized protein LOC100250...   112   1e-22
ref|NP_565715.1| uncharacterized protein [Arabidopsis thaliana] ...   111   4e-22
ref|XP_002881158.1| hypothetical protein ARALYDRAFT_482041 [Arab...   110   7e-22
ref|XP_006294245.1| hypothetical protein CARUB_v10023243mg, part...   108   2e-21

>ref|XP_006350037.1| PREDICTED: uncharacterized protein LOC102601397 isoform X6 [Solanum
           tuberosum]
          Length = 420

 Score =  147 bits (370), Expect = 7e-33
 Identities = 110/310 (35%), Positives = 153/310 (49%), Gaps = 33/310 (10%)
 Frame = +1

Query: 67  MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 246
           MD KGI W G+IYQKFE MCLE+E+ MY+DT +YVENQVQ VG SVK+FYS+V+ DL P 
Sbjct: 1   MDLKGIAWVGDIYQKFEAMCLEMEDAMYQDTARYVENQVQTVGASVKRFYSDVVLDLHPQ 60

Query: 247 SCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSLRVH 426
             +DPVKVA  DLSLNPYAHT+++K     L          + N+++ D           
Sbjct: 61  FNIDPVKVAAADLSLNPYAHTEISKKLKAKLKG----GHPMVINKELID----------- 105

Query: 427 NDANHLSSPSPRGLVENTHSDVCFTKSKKVGVYKR-PIGIKRISQNNH-PSKISRPMTSL 600
                              + V   KSK  GVY+R  +GIK I ++NH PSK S  +  +
Sbjct: 106 ------------------DTQVIKGKSKSGGVYRRQSVGIKEIVRDNHPPSKKSDALCLV 147

Query: 601 SGDKSRLL----------VASDDMNVTTSVRCHPG--------EAVNNI--TPISEA--C 714
           SG+  +L           VASD M +T+ +    G        E  N+I  T +S A   
Sbjct: 148 SGNAIKLSSDSKVRGGFEVASDHMTMTSPLASVKGRSSAETGKEVSNHIIKTDVSAAGIS 207

Query: 715 VESLASDKILIAESVKEEKDDSECASHASDNILLAESVKQDKEDC---------ECASHA 867
           +   ASD+ L  + V + + D    S   D     +S   D+  C         + +S+ 
Sbjct: 208 INVAASDRSLSVDCVGQNQADLRNTSSVGD----LQSDSHDRGTCKELAGDTGLKISSNT 263

Query: 868 SDNILLAESV 897
            DN + +E +
Sbjct: 264 GDNNIASEEI 273


>ref|XP_006350032.1| PREDICTED: uncharacterized protein LOC102601397 isoform X1 [Solanum
           tuberosum] gi|565366720|ref|XP_006350033.1| PREDICTED:
           uncharacterized protein LOC102601397 isoform X2 [Solanum
           tuberosum] gi|565366722|ref|XP_006350034.1| PREDICTED:
           uncharacterized protein LOC102601397 isoform X3 [Solanum
           tuberosum] gi|565366724|ref|XP_006350035.1| PREDICTED:
           uncharacterized protein LOC102601397 isoform X4 [Solanum
           tuberosum] gi|565366726|ref|XP_006350036.1| PREDICTED:
           uncharacterized protein LOC102601397 isoform X5 [Solanum
           tuberosum]
          Length = 421

 Score =  146 bits (368), Expect = 1e-32
 Identities = 110/310 (35%), Positives = 152/310 (49%), Gaps = 33/310 (10%)
 Frame = +1

Query: 67  MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 246
           MD KGI W G+IYQKFE MCLE+E+ MY+DT +YVENQVQ VG SVK+FYS+V+ DL P 
Sbjct: 1   MDLKGIAWVGDIYQKFEAMCLEMEDAMYQDTARYVENQVQTVGASVKRFYSDVVLDLHPQ 60

Query: 247 SCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSLRVH 426
             +DPVKVA  DLSLNPYAHT+++K     L          + N+++ D           
Sbjct: 61  FNIDPVKVAAADLSLNPYAHTEISKKLKAKLKG----GHPMVINKELID----------- 105

Query: 427 NDANHLSSPSPRGLVENTHSDVCFTKSKKVGVYKR-PIGIKRISQNNH-PSKISRPMTSL 600
                              + V   KSK  GVY+R  +GIK I ++NH PSK S  +  +
Sbjct: 106 ------------------DTQVIKGKSKSGGVYRRQSVGIKEIVRDNHPPSKKSDALCLV 147

Query: 601 SGDKSRLL----------VASDDMNVTTSVRCHPG--------EAVNNI--TPISEA--C 714
           SG+  +L           VASD M +T+ +    G        E  N+I  T +S A   
Sbjct: 148 SGNAIKLSSDSKVRGGFEVASDHMTMTSPLASVKGRSSAETGKEVSNHIIKTDVSAAGIS 207

Query: 715 VESLASDKILIAESVKEEKDDSECASHASDNILLAESVKQDKEDC---------ECASHA 867
           +   ASD+ L  + V + + D    S   D   L      D+  C         + +S+ 
Sbjct: 208 INVAASDRSLSVDCVGQNQADLRNTSSVGD---LQSDSHADRGTCKELAGDTGLKISSNT 264

Query: 868 SDNILLAESV 897
            DN + +E +
Sbjct: 265 GDNNIASEEI 274


>ref|XP_006348849.1| PREDICTED: uncharacterized protein LOC102594335 isoform X1 [Solanum
           tuberosum]
          Length = 260

 Score =  144 bits (364), Expect = 4e-32
 Identities = 98/261 (37%), Positives = 136/261 (52%), Gaps = 23/261 (8%)
 Frame = +1

Query: 67  MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 246
           MD K I+W GNIYQKFETMCLE+EE MY+DTVKYVENQV  VG +VK+F SEVMQD+ P 
Sbjct: 1   MDLKSISWVGNIYQKFETMCLEMEEAMYQDTVKYVENQVNTVGTNVKRFCSEVMQDVHPQ 60

Query: 247 SCVDPVKVAPGDLSLNPYAHTDLNKS-KPTMLDSYGEFKKKEIENEDISDLIAEKSSLRV 423
             +DPVKVA  DLS+NPYAH +++K  K  +  S   F  K                   
Sbjct: 61  CNIDPVKVAAADLSINPYAHYEIDKKLKANLKGSARRFSNK------------------- 101

Query: 424 HNDANHLSSPSPRGLVENTHSDVCFTKSKKVGVYKRP-IGIKRISQNNHPSKISRPMTSL 600
                            N  + V   KSK  GVYKR  +GIK I +++HP+K    +   
Sbjct: 102 ----------------LNDDTQVIKGKSKSGGVYKRQNVGIKEIVRDSHPAKKPNAICLA 145

Query: 601 SGDKSRLL----------VASDDMNVTTSVR----CHPGEAVNNI------TPISEACVE 720
           SGD  +L           +ASD + +T+++        GEA + +      T +S A   
Sbjct: 146 SGDALKLSSSAEVRGGFEMASDHVTLTSALASVKGSDSGEAASKVRDHFIQTNVSAADTS 205

Query: 721 SLASDKILIA-ESVKEEKDDS 780
             +   + ++ ESV++++ D+
Sbjct: 206 ITSEASVTMSVESVRKKQTDT 226


>ref|XP_004251799.1| PREDICTED: uncharacterized protein LOC101256948 [Solanum
           lycopersicum]
          Length = 421

 Score =  142 bits (359), Expect = 1e-31
 Identities = 101/261 (38%), Positives = 136/261 (52%), Gaps = 24/261 (9%)
 Frame = +1

Query: 67  MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 246
           MD KGI W G+IYQKFE MCLE+E+ MY+DT +YVENQVQ VG SVK+FYS+V+ DL P 
Sbjct: 1   MDLKGIAWVGDIYQKFEAMCLEMEDAMYQDTARYVENQVQTVGASVKRFYSDVVLDLHPQ 60

Query: 247 SCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSLRVH 426
             +DPVKVA  DLSLNPYAHT+++K     L                     +    RV 
Sbjct: 61  FNIDPVKVAAADLSLNPYAHTEISKKLKAQL---------------------KGGHPRVI 99

Query: 427 NDANHLSSPSPRGLVENTHSDVCFTKSKKVGVYKR-PIGIKRISQNNH-PSKISRPMTSL 600
           N          + L+++T   V   KSK  GVY+R  +G+K I ++NH PSK S  +  +
Sbjct: 100 N----------KELIDDT--QVIKGKSKSGGVYRRQSVGMKEIVRDNHPPSKKSDALCLV 147

Query: 601 SGDKSRLL----------VASDDMNVTTSVRCHPG--------EAVNNI----TPISEAC 714
           SG+  +L           VASD M +T+ +    G        E  N+I     P +   
Sbjct: 148 SGNTIKLSSDSKVRGGFEVASDHMTMTSPLASVKGLKSTETGKEVSNHIIKTEVPAAGIS 207

Query: 715 VESLASDKILIAESVKEEKDD 777
           +   ASD  L  + V + + D
Sbjct: 208 INIAASDTSLSVDCVGQNQAD 228


>ref|XP_004243407.1| PREDICTED: uncharacterized protein LOC101260247 [Solanum
           lycopersicum]
          Length = 374

 Score =  140 bits (354), Expect = 5e-31
 Identities = 107/309 (34%), Positives = 149/309 (48%), Gaps = 27/309 (8%)
 Frame = +1

Query: 67  MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 246
           MD K I+W GNIYQKFETMCLE+EE MY+DTVKYVENQ+  VG +VK+F SEVMQD+ P 
Sbjct: 1   MDLKSISWVGNIYQKFETMCLEMEEAMYQDTVKYVENQMNTVGTNVKRFCSEVMQDVHPQ 60

Query: 247 SCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSLRVH 426
             +DPVKVA  DLSLNPYAH +++K     L                      K S R  
Sbjct: 61  CNIDPVKVAAADLSLNPYAHYEIDKKLKANL----------------------KGSAR-- 96

Query: 427 NDANHLSSPSPRGLVENTHSDVCFTKSKKVGVYKRP-IGIKRISQNNHPSKISRPMTSLS 603
             +N L          N  + V   KSK  GVYKR  +GIK I +++H +K    +   S
Sbjct: 97  GFSNKL----------NDDTQVIKGKSKSGGVYKRQNVGIKEIVRDSHLTKKPNAICLAS 146

Query: 604 GDKSRLL----------VASDDMNVTTSVR----CHPGEAVNNI------TPISEACVES 723
           GD  +L           +ASD + +T+++        GE  + +      T +S A    
Sbjct: 147 GDALKLSSSAEVRGGFELASDHVTLTSALASVKGSDSGEVASKVSNHVIQTNVSTADTSI 206

Query: 724 LASDKILIAESVKEEKDDSECASHASDNILLAESVK-----QDKEDCECASHASDNILLA 888
            +   ++++     +K    C    + N     S        ++E  E     SDN+L  
Sbjct: 207 TSEASVMMSVESVGKKQTDTCTKELACNTRFKTSSDVRNNLANEEIDESHEEKSDNLLSK 266

Query: 889 -ESVKQDKE 912
            +S++ D E
Sbjct: 267 YDSIESDLE 275


>gb|EXB78097.1| hypothetical protein L484_004798 [Morus notabilis]
          Length = 443

 Score =  132 bits (333), Expect = 1e-28
 Identities = 105/294 (35%), Positives = 140/294 (47%), Gaps = 29/294 (9%)
 Frame = +1

Query: 67  MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 246
           MD KGITW GN+YQKFE MCLEVEE+MY+DTVKYVENQVQ VG SVK+FYS+VMQDLLPP
Sbjct: 1   MDVKGITWVGNVYQKFEAMCLEVEEIMYQDTVKYVENQVQTVGASVKRFYSDVMQDLLPP 60

Query: 247 SCVDPVKV----------APGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDL 396
           S  D  KV          +   +S  P       K KP   D     +  ++ ++     
Sbjct: 61  SSQDSEKVSLCGFIGKQDSDDGISKKPNV---AKKEKPAKADDEQLIRTLKVTSDSKDVY 117

Query: 397 IAEKSSLRVHNDANHLSSPS---PRGLVENTHS-----DVCFTKSKKVGVYKRPIGIKRI 552
           +A   S+ V  D +++  PS    +G   N  S     DV    S  + V +     K I
Sbjct: 118 LA--PSIHVRCDVDNMCRPSGECVKGACSNLRSRKKCRDVSVHSSSNLSVNENRSDKKLI 175

Query: 553 SQN-----NHPSKISRPMTSLSGDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACV 717
                        +SRP++S S   + +   S D   TT        +VN          
Sbjct: 176 PPETSCAITREKHLSRPLSSYSEFVNEIHEISLDQTGTTK-----APSVN---------- 220

Query: 718 ESLASDKILIAESVKEEKDDSECAS------HASDNILLAESVKQDKEDCECAS 861
           E  +SD I+  ES  E ++ SEC +      HAS  I+L +SV  D  + +  S
Sbjct: 221 EDTSSDSIV--ESCDEIENSSECMADLSSSFHASSEIILVKSVGYDGNEMDVPS 272


>gb|EPS62712.1| hypothetical protein M569_12076, partial [Genlisea aurea]
          Length = 147

 Score =  132 bits (332), Expect = 2e-28
 Identities = 80/165 (48%), Positives = 95/165 (57%), Gaps = 1/165 (0%)
 Frame = +1

Query: 67  MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 246
           MDFKGI W GN+YQKFE MCLEVEEV+YEDTVKY+E Q+QKV  SVKKFY+E+M DL P 
Sbjct: 1   MDFKGIAWVGNVYQKFEAMCLEVEEVVYEDTVKYMEGQMQKVSGSVKKFYTEIMDDLNPS 60

Query: 247 SCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSLRVH 426
           S   P K +  DL  +P+ H  L K KP       +   +E E  D  D  A K      
Sbjct: 61  SGDAPAKYSESDLVWDPFGHVHLMK-KPR------DIVPEEKEVGDAFDFAAGKKD---- 109

Query: 427 NDANHLSSPSPRGLVENTH-SDVCFTKSKKVGVYKRPIGIKRISQ 558
                     P   VE+ H      TKS K+G  +RPIGIKRIS+
Sbjct: 110 ---------PPLVFVEDLHCGSRAATKSPKLGACRRPIGIKRISK 145


>ref|XP_002525120.1| hypothetical protein RCOM_0553590 [Ricinus communis]
           gi|223535579|gb|EEF37247.1| hypothetical protein
           RCOM_0553590 [Ricinus communis]
          Length = 490

 Score =  126 bits (317), Expect = 1e-26
 Identities = 80/171 (46%), Positives = 97/171 (56%), Gaps = 3/171 (1%)
 Frame = +1

Query: 67  MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 246
           MD KGI+W GNIYQKFE MCLEVEEVMY+DTVKYVENQVQ VG SVK+FYS+VMQDLLPP
Sbjct: 1   MDLKGISWVGNIYQKFEAMCLEVEEVMYQDTVKYVENQVQTVGSSVKRFYSDVMQDLLPP 60

Query: 247 SCVDPVKVAPGDLSLNPYAHTDL-NKSKPTMLDSYGEFKKKEIENEDISDLIAEKS--SL 417
           S VD  K A  D+ L  YA   +  K K  + +  G+   +E   ED      +KS   L
Sbjct: 61  SSVDAAKGAGVDVPLELYADLGIYMKPKVGVKEKQGKVDDRERLTEDPKITTDKKSMDPL 120

Query: 418 RVHNDANHLSSPSPRGLVENTHSDVCFTKSKKVGVYKRPIGIKRISQNNHP 570
             H            GLVEN       ++    G   R  G + +S  ++P
Sbjct: 121 TFHR----------LGLVENRFP---LSQGNSAGGASRQHGKRSLSNKSNP 158


>gb|EOX92821.1| Uncharacterized protein isoform 5 [Theobroma cacao]
           gi|508700926|gb|EOX92822.1| Uncharacterized protein
           isoform 5 [Theobroma cacao]
          Length = 334

 Score =  121 bits (304), Expect = 3e-25
 Identities = 100/279 (35%), Positives = 145/279 (51%), Gaps = 13/279 (4%)
 Frame = +1

Query: 64  TMDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYS----EVMQ 231
           +MD KGITW G++Y+KFE MCLEVEEVMY+DTVKYVEN+VQ VG SVKKFYS    +VMQ
Sbjct: 3   SMDLKGITWVGHVYEKFEAMCLEVEEVMYQDTVKYVENRVQTVGASVKKFYSGMMQDVMQ 62

Query: 232 DLLPPSCVDPVK-VAPGDLSLNPYAHTDLNKSKPTMLDS--YGEFKKKEIENEDISDLIA 402
           DLL PS ++P+K VA  DL +  YA T L K    + +    G+ ++   ++E I+D+  
Sbjct: 63  DLLLPSSLEPMKAVAASDLPVEIYAET-LKKPNVGLKEDAIQGDSEQLTEDSEVIADVNE 121

Query: 403 E----KSSLRVHNDANHLSSPSPRGLVENTHSDVCFTKSKKVGVYKRPIGIKRIS-QNNH 567
                 SS ++H   N   S S    VE   SD+        G +     + + + ++  
Sbjct: 122 NAAHVPSSCQLHMVDNIFESCS-GSFVERASSDLL------SGEHNNRCTLNKTNVEHLL 174

Query: 568 PSKISRPMTSLSGDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDKILI 747
           P++ S     +  +  R+     + N    V CH   A   +TP+S   VE    D   I
Sbjct: 175 PAETSSEAGCVENEFGRMSSFCGNANANHEVSCHQIPA--TLTPVS---VEEDDCDS--I 227

Query: 748 AESVKEEKDDSECASH-ASDNILLAESVKQDKEDCECAS 861
            ES  E K  S+       D + L   V++++ +  C+S
Sbjct: 228 EESSNEIKSASDSVPEILPDGLHLVGIVEKNEMEMRCSS 266


>gb|EOX92820.1| Uncharacterized protein isoform 4, partial [Theobroma cacao]
          Length = 341

 Score =  121 bits (304), Expect = 3e-25
 Identities = 100/279 (35%), Positives = 145/279 (51%), Gaps = 13/279 (4%)
 Frame = +1

Query: 64  TMDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYS----EVMQ 231
           +MD KGITW G++Y+KFE MCLEVEEVMY+DTVKYVEN+VQ VG SVKKFYS    +VMQ
Sbjct: 3   SMDLKGITWVGHVYEKFEAMCLEVEEVMYQDTVKYVENRVQTVGASVKKFYSGMMQDVMQ 62

Query: 232 DLLPPSCVDPVK-VAPGDLSLNPYAHTDLNKSKPTMLDS--YGEFKKKEIENEDISDLIA 402
           DLL PS ++P+K VA  DL +  YA T L K    + +    G+ ++   ++E I+D+  
Sbjct: 63  DLLLPSSLEPMKAVAASDLPVEIYAET-LKKPNVGLKEDAIQGDSEQLTEDSEVIADVNE 121

Query: 403 E----KSSLRVHNDANHLSSPSPRGLVENTHSDVCFTKSKKVGVYKRPIGIKRIS-QNNH 567
                 SS ++H   N   S S    VE   SD+        G +     + + + ++  
Sbjct: 122 NAAHVPSSCQLHMVDNIFESCS-GSFVERASSDLL------SGEHNNRCTLNKTNVEHLL 174

Query: 568 PSKISRPMTSLSGDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDKILI 747
           P++ S     +  +  R+     + N    V CH   A   +TP+S   VE    D   I
Sbjct: 175 PAETSSEAGCVENEFGRMSSFCGNANANHEVSCHQIPA--TLTPVS---VEEDDCDS--I 227

Query: 748 AESVKEEKDDSECASH-ASDNILLAESVKQDKEDCECAS 861
            ES  E K  S+       D + L   V++++ +  C+S
Sbjct: 228 EESSNEIKSASDSVPEILPDGLHLVGIVEKNEMEMRCSS 266


>gb|EOX92819.1| Uncharacterized protein isoform 3, partial [Theobroma cacao]
          Length = 343

 Score =  121 bits (304), Expect = 3e-25
 Identities = 100/279 (35%), Positives = 145/279 (51%), Gaps = 13/279 (4%)
 Frame = +1

Query: 64  TMDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYS----EVMQ 231
           +MD KGITW G++Y+KFE MCLEVEEVMY+DTVKYVEN+VQ VG SVKKFYS    +VMQ
Sbjct: 3   SMDLKGITWVGHVYEKFEAMCLEVEEVMYQDTVKYVENRVQTVGASVKKFYSGMMQDVMQ 62

Query: 232 DLLPPSCVDPVK-VAPGDLSLNPYAHTDLNKSKPTMLDS--YGEFKKKEIENEDISDLIA 402
           DLL PS ++P+K VA  DL +  YA T L K    + +    G+ ++   ++E I+D+  
Sbjct: 63  DLLLPSSLEPMKAVAASDLPVEIYAET-LKKPNVGLKEDAIQGDSEQLTEDSEVIADVNE 121

Query: 403 E----KSSLRVHNDANHLSSPSPRGLVENTHSDVCFTKSKKVGVYKRPIGIKRIS-QNNH 567
                 SS ++H   N   S S    VE   SD+        G +     + + + ++  
Sbjct: 122 NAAHVPSSCQLHMVDNIFESCS-GSFVERASSDLL------SGEHNNRCTLNKTNVEHLL 174

Query: 568 PSKISRPMTSLSGDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDKILI 747
           P++ S     +  +  R+     + N    V CH   A   +TP+S   VE    D   I
Sbjct: 175 PAETSSEAGCVENEFGRMSSFCGNANANHEVSCHQIPA--TLTPVS---VEEDDCDS--I 227

Query: 748 AESVKEEKDDSECASH-ASDNILLAESVKQDKEDCECAS 861
            ES  E K  S+       D + L   V++++ +  C+S
Sbjct: 228 EESSNEIKSASDSVPEILPDGLHLVGIVEKNEMEMRCSS 266


>gb|EOX92817.1| Uncharacterized protein isoform 1 [Theobroma cacao]
           gi|508700922|gb|EOX92818.1| Uncharacterized protein
           isoform 1 [Theobroma cacao]
          Length = 397

 Score =  121 bits (304), Expect = 3e-25
 Identities = 100/279 (35%), Positives = 145/279 (51%), Gaps = 13/279 (4%)
 Frame = +1

Query: 64  TMDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYS----EVMQ 231
           +MD KGITW G++Y+KFE MCLEVEEVMY+DTVKYVEN+VQ VG SVKKFYS    +VMQ
Sbjct: 3   SMDLKGITWVGHVYEKFEAMCLEVEEVMYQDTVKYVENRVQTVGASVKKFYSGMMQDVMQ 62

Query: 232 DLLPPSCVDPVK-VAPGDLSLNPYAHTDLNKSKPTMLDS--YGEFKKKEIENEDISDLIA 402
           DLL PS ++P+K VA  DL +  YA T L K    + +    G+ ++   ++E I+D+  
Sbjct: 63  DLLLPSSLEPMKAVAASDLPVEIYAET-LKKPNVGLKEDAIQGDSEQLTEDSEVIADVNE 121

Query: 403 E----KSSLRVHNDANHLSSPSPRGLVENTHSDVCFTKSKKVGVYKRPIGIKRIS-QNNH 567
                 SS ++H   N   S S    VE   SD+        G +     + + + ++  
Sbjct: 122 NAAHVPSSCQLHMVDNIFESCS-GSFVERASSDLL------SGEHNNRCTLNKTNVEHLL 174

Query: 568 PSKISRPMTSLSGDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDKILI 747
           P++ S     +  +  R+     + N    V CH   A   +TP+S   VE    D   I
Sbjct: 175 PAETSSEAGCVENEFGRMSSFCGNANANHEVSCHQIPA--TLTPVS---VEEDDCDS--I 227

Query: 748 AESVKEEKDDSECASH-ASDNILLAESVKQDKEDCECAS 861
            ES  E K  S+       D + L   V++++ +  C+S
Sbjct: 228 EESSNEIKSASDSVPEILPDGLHLVGIVEKNEMEMRCSS 266


>ref|XP_004149372.1| PREDICTED: uncharacterized protein LOC101205697 [Cucumis sativus]
          Length = 379

 Score =  117 bits (293), Expect = 6e-24
 Identities = 103/322 (31%), Positives = 145/322 (45%), Gaps = 43/322 (13%)
 Frame = +1

Query: 67  MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 246
           MD KGI W G +Y+KFETMCLEVE+++ +DTVKYVENQV+ VG SVK+FYS+VMQD LPP
Sbjct: 1   MDVKGIAWVGRLYEKFETMCLEVEDIICQDTVKYVENQVEVVGASVKRFYSDVMQDFLPP 60

Query: 247 SCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSS---- 414
           S +   KVA  + +L  Y +  + K KPTM       K  E ++ + S + A+       
Sbjct: 61  SELSDEKVAVCNSALENYENVVICK-KPTMGMKIERSKFSEEKSNENSKVTADAKRDIAC 119

Query: 415 --LRVHNDANHL---SSP------------SPRGLVENTHSDV------CFTKSKKVGVY 525
              R HN AN+L   SSP            S +   EN H  +        T+  K    
Sbjct: 120 KLPRGHNHANYLYLVSSPYSAANRAQIDGYSRKKDDENIHHKIDLDGRESTTRGCKSLTE 179

Query: 526 KRPIGIKRISQNNHPS------KISRPMTSLSGDKSRLLVASDDMNVTTSVRCHPGEAVN 687
             P  +++  +N+  S      + S   + L+G+   +LV     N             +
Sbjct: 180 TSPTNLEKKYENDASSCCTILNRKSEASSELAGNMETMLVKDTRCNSVMQSANETEIKTD 239

Query: 688 NITP--ISEACVESLASDKILIAESVKEEKD--------DSECASHASDNILLAESVKQD 837
           NI P   S A V++    ++L       E D        D       + NI  A+  K D
Sbjct: 240 NILPDTPSSAIVDTEKETRLLSYGDSSAELDGRSDSWSLDDIELEQGTHNIQQADETKLD 299

Query: 838 KEDCECASHASDNILLAESVKQ 903
           +E C        +    E VKQ
Sbjct: 300 EEACVLVKGDDLHFDFNEEVKQ 321


>ref|XP_004303170.1| PREDICTED: uncharacterized protein LOC101303722 [Fragaria vesca
           subsp. vesca]
          Length = 389

 Score =  115 bits (288), Expect = 2e-23
 Identities = 95/268 (35%), Positives = 131/268 (48%), Gaps = 14/268 (5%)
 Frame = +1

Query: 64  TMDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLP 243
           TMD KGITW G +Y+KFE+MCLEVEE MYEDTVK+VE+QVQ VG SVKKFY++VMQDLL 
Sbjct: 3   TMDVKGITWVGCVYEKFESMCLEVEENMYEDTVKFVEDQVQTVGESVKKFYADVMQDLLC 62

Query: 244 PSCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSL-R 420
            S +D   V+ G   +  Y+  D +KSK          KKKE     + ++  +   +  
Sbjct: 63  DSSLDRDDVSAGGFPVEHYSDVDNSKSKIR--------KKKEHVKAGVEEVKGDSEVISA 114

Query: 421 VHNDANHLSSPSPRGLVENTHSDVCFTKSK----KVGVYKRPIGI----KRISQNNHPSK 576
           V  D +H       GL          TKS     K+   ++  G+    K+I     P K
Sbjct: 115 VLKDVDH------TGLFHRQRVYDSCTKSSGNCAKLACSRQDHGVRSCNKKIVVRETPIK 168

Query: 577 ISRP--MTSLSGDKSR--LLVASDDMNVTTSVRC-HPGEAVNNITPISEACVESLASDKI 741
              P   T++  D SR  L   S+  N      C  P E +    P      +S+ S+  
Sbjct: 169 DRLPGANTAVGKDFSRESLSSCSEFSNEDRDTSCDQPDEVITPSKPPEGMRCDSM-SESC 227

Query: 742 LIAESVKEEKDDSECASHASDNILLAES 825
           ++A + +   DD      +SD I+L  S
Sbjct: 228 VVANASQCTGDDVSVNCQSSDMIVLDNS 255


>ref|XP_006469032.1| PREDICTED: uncharacterized protein LOC102611541 isoform X1 [Citrus
           sinensis] gi|568829444|ref|XP_006469033.1| PREDICTED:
           uncharacterized protein LOC102611541 isoform X2 [Citrus
           sinensis] gi|568829446|ref|XP_006469034.1| PREDICTED:
           uncharacterized protein LOC102611541 isoform X3 [Citrus
           sinensis] gi|568829448|ref|XP_006469035.1| PREDICTED:
           uncharacterized protein LOC102611541 isoform X4 [Citrus
           sinensis]
          Length = 416

 Score =  113 bits (283), Expect = 9e-23
 Identities = 90/287 (31%), Positives = 143/287 (49%), Gaps = 12/287 (4%)
 Frame = +1

Query: 67  MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 246
           MD KGITW G++YQKFE MCLEVEE+MY+DTVKYVENQVQ VG +VKKFYS+V++DLLPP
Sbjct: 1   MDLKGITWVGHVYQKFEAMCLEVEEIMYQDTVKYVENQVQTVGSTVKKFYSDVIEDLLPP 60

Query: 247 SCVDPVKVA-PGDLSLNPYAHTDL-NKSKPTMLDSYGEFKKKEIENEDISDLIAEK---- 408
             VD VK A   +L L   A   +  K K  + +   +   +++    ++    +K    
Sbjct: 61  PSVDLVKGAVASNLPLEQNADVGIYKKPKIGIKEEAMKVNNEQLSESSLATTDLDKGAGG 120

Query: 409 --SSLRVHNDANHLSSPSPRGLVENTHSDVCFTKSKKV--GVYKRPIGIKRISQNNH--P 570
             S  R H +      PS    ++   SD  + K   +  G  +  I +++IS+ ++  P
Sbjct: 121 GQSFCRFHIEDTSF-QPSLGNTLKGVFSD-AYPKEYDIRSGHNQSSICMQKISKEDNLPP 178

Query: 571 SKISRPMTSLSGDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDKILIA 750
           S++S     +     R            S  C   + +  ++        +  + ++   
Sbjct: 179 SEMSGAGPHMERGLRR-----------ASSSCELLDKIQEVSDDQVVVDPTSVTTEVASC 227

Query: 751 ESVKEEKDDSECASHASDNILLAESVKQDKEDCECASHASDNILLAE 891
           +S +E  D+ E AS  +   L +    ++ ++ E A H+S + L AE
Sbjct: 228 KSFEEIYDELEKASKGASGALTSSPAAKNCDESESA-HSSCSSLSAE 273


>ref|XP_006446765.1| hypothetical protein CICLE_v10015391mg [Citrus clementina]
           gi|567908905|ref|XP_006446766.1| hypothetical protein
           CICLE_v10015391mg [Citrus clementina]
           gi|557549376|gb|ESR60005.1| hypothetical protein
           CICLE_v10015391mg [Citrus clementina]
           gi|557549377|gb|ESR60006.1| hypothetical protein
           CICLE_v10015391mg [Citrus clementina]
          Length = 416

 Score =  112 bits (281), Expect = 1e-22
 Identities = 52/69 (75%), Positives = 60/69 (86%)
 Frame = +1

Query: 67  MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 246
           MD KGITW G++YQKFE MCLEVEE+MY+DTVKYVENQVQ VG +VKKFYS+V++DLLPP
Sbjct: 1   MDLKGITWVGHVYQKFEAMCLEVEEIMYQDTVKYVENQVQTVGSTVKKFYSDVIEDLLPP 60

Query: 247 SCVDPVKVA 273
             VD VK A
Sbjct: 61  PSVDLVKGA 69


>ref|XP_002279986.1| PREDICTED: uncharacterized protein LOC100250516 [Vitis vinifera]
           gi|302143402|emb|CBI21963.3| unnamed protein product
           [Vitis vinifera]
          Length = 451

 Score =  112 bits (281), Expect = 1e-22
 Identities = 86/222 (38%), Positives = 121/222 (54%), Gaps = 37/222 (16%)
 Frame = +1

Query: 67  MDFKGITWAGNIYQKFETMCLEVEEVMYEDT-------VKYVENQVQKVGVSVKKFYSEV 225
           MDFKGITW GN+YQKFET+CLEVE++MY+DT       VKYVE+QV+ VG SVKKF SE+
Sbjct: 1   MDFKGITWVGNMYQKFETICLEVEDIMYQDTVKYFENHVKYVEDQVETVGESVKKFCSEI 60

Query: 226 MQDLLPPSCVDPVKVAPGDLSLNPYAHTDLNKSKPTM---------------LDSYGEFK 360
           +QDLL P   D ++V   +LSL+ + +  L K KP +               +    EF 
Sbjct: 61  VQDLLLP---DSLEVTDSNLSLDQHDNVKLCK-KPKVGIKEEAKVGFKEEPKVSIKEEFI 116

Query: 361 KKEI----ENEDISDL---IAEKSSLRVHNDANHL----SSPSPRGLVENTH----SDVC 495
           K +I    E+ +I+DL   +  KSS    +  N+L    S  S  G   + H     D  
Sbjct: 117 KFDIDRLTEHSEIADLNEDVEHKSSFTGLHGVNNLFQSYSGNSVTGACSDLHLVQNDDGV 176

Query: 496 FTKSKKVGVYKRPIGIKRISQNNHPSKISRPMTSLSGDKSRL 621
             K+   G+ + P+   ++SQ   P ++S  +  +SGD SRL
Sbjct: 177 MCKNLDAGIKRNPV---KVSQ--FPIEVSGVIAPISGDVSRL 213


>ref|NP_565715.1| uncharacterized protein [Arabidopsis thaliana]
           gi|16612317|gb|AAL27517.1|AF439849_1 At2g31130/T16B12.6
           [Arabidopsis thaliana] gi|20197328|gb|AAC63838.2|
           expressed protein [Arabidopsis thaliana]
           gi|23506163|gb|AAN31093.1| At2g31130/T16B12.6
           [Arabidopsis thaliana] gi|330253402|gb|AEC08496.1|
           uncharacterized protein AT2G31130 [Arabidopsis thaliana]
          Length = 419

 Score =  111 bits (277), Expect = 4e-22
 Identities = 78/187 (41%), Positives = 101/187 (54%), Gaps = 8/187 (4%)
 Frame = +1

Query: 67  MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 246
           MDFKGI W GN+YQKFE MCLEVEE++ +DT KYVENQVQ VG SVKKF S+V+ DLLP 
Sbjct: 1   MDFKGIKWVGNVYQKFEAMCLEVEEIIVQDTAKYVENQVQTVGNSVKKFCSDVVHDLLPD 60

Query: 247 SCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSLRVH 426
             VD  K  P  + L+ YA              Y   KKK+  N    D+  E+      
Sbjct: 61  ESVDSGKPLPVSM-LHEYAPV------------YSFKKKKDSMNRKTKDVTQEQEVTEGK 107

Query: 427 NDANHLSSPSPRGLVENTHSDVCFTKSKKV--GVYKRP-IGIKRISQNNHPSKISRP--- 588
            D     +   RGL  + + D+C +  +    G Y+R  IG K+I +    S++ RP   
Sbjct: 108 KDG---FAKKLRGLDADDY-DICTSPRQYSYGGPYRRTRIGRKQIFKKEELSQVIRPYIQ 163

Query: 589 --MTSLS 603
             +TSLS
Sbjct: 164 KDLTSLS 170


>ref|XP_002881158.1| hypothetical protein ARALYDRAFT_482041 [Arabidopsis lyrata subsp.
           lyrata] gi|297326997|gb|EFH57417.1| hypothetical protein
           ARALYDRAFT_482041 [Arabidopsis lyrata subsp. lyrata]
          Length = 418

 Score =  110 bits (275), Expect = 7e-22
 Identities = 73/178 (41%), Positives = 100/178 (56%), Gaps = 4/178 (2%)
 Frame = +1

Query: 67  MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 246
           MDFKGI W GN+YQKFE MCLEVEE++ +DT KYVENQVQ VG SVKKF S+V+QDLLP 
Sbjct: 1   MDFKGIKWVGNVYQKFEAMCLEVEEIIVQDTAKYVENQVQTVGNSVKKFCSDVVQDLLPD 60

Query: 247 SCVDPVKVAPGDLSLNPYAHT-DLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSLRV 423
             VD  K  P  + L+ YA      K + +M     + K+++   E   D  A+K     
Sbjct: 61  DSVDSGKPLPVSM-LHEYAPVCSFKKKRDSMNRKTRDVKQEQEVTEGKKDGCAQKF---- 115

Query: 424 HNDANHLSSPSPRGLVENTHSDVCFTKSKKV--GVYKRP-IGIKRISQNNHPSKISRP 588
                       RGL  + + D+C +  +    G Y+R  +G K+I +    S+++RP
Sbjct: 116 ------------RGLDADDY-DICTSPRQYSYGGPYRRTRVGRKQIFKKEELSQVTRP 160


>ref|XP_006294245.1| hypothetical protein CARUB_v10023243mg, partial [Capsella rubella]
           gi|482562953|gb|EOA27143.1| hypothetical protein
           CARUB_v10023243mg, partial [Capsella rubella]
          Length = 432

 Score =  108 bits (271), Expect = 2e-21
 Identities = 101/313 (32%), Positives = 149/313 (47%), Gaps = 31/313 (9%)
 Frame = +1

Query: 67  MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 246
           MDFKGI W GN+YQKFE MCLEVEE++ +DT KYVENQV  VG SVKKF S+V+QDLLP 
Sbjct: 49  MDFKGIKWVGNVYQKFEAMCLEVEEIIVQDTAKYVENQVHTVGNSVKKFCSDVVQDLLP- 107

Query: 247 SCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKK-EIENEDISDLIAEKSSLRV 423
              D   V  G     P   + LN+  P        FKKK E  N    D+  E+     
Sbjct: 108 ---DDDSVGSG----KPLPVSMLNEYAPVC-----SFKKKRESANRKTRDVKQEEEVTEG 155

Query: 424 HNDANHLSSPSPRGLVENTHSDVCFTKSKKV--GVYKR-PIGIKRISQNNHPSKISRPMT 594
             D     + + RGL  + + D+C +  +    G Y+R  +G K+I +    S+I+RP  
Sbjct: 156 KKDG---CAMNLRGLDADDY-DICTSPRQYSYGGPYRRGRVGRKQIFKKEELSQITRPY- 210

Query: 595 SLSGDKSRLLV-----ASDD---MNVTTSVRCHPGEAVNNITPISEACVESLASDKILI- 747
            +  D S L +       DD   +N ++    H G   +++  ++ + +  + S +I   
Sbjct: 211 -IQKDSSNLTMVHSARVKDDVGTVNSSSLSMAHSGRVKDDVGTVNSSSLSMVHSARIKAD 269

Query: 748 AESVKE-----------------EKDDSECASHASDNILLAESVKQDKEDCECASHASDN 876
            E+VK                  +KDD     H    + +  SV+   +D E  +    +
Sbjct: 270 VETVKSSDSRPGEIERLISKKECQKDDRTDNQH---GLTMVNSVR--SKDSEIRTEIEHS 324

Query: 877 ILLAESVK-QDKE 912
           + +  SV+ QD E
Sbjct: 325 LTVVNSVRSQDSE 337


Top