BLASTX nr result

ID: Rehmannia27_contig00028385 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia27_contig00028385
         (2536 letters)

Database: ./nr 
           84,704,028 sequences; 31,038,470,784 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011092915.1| PREDICTED: uncharacterized protein LOC105172...   196   3e-51
ref|XP_011101871.1| PREDICTED: uncharacterized protein LOC105179...   192   5e-48
ref|XP_011071645.1| PREDICTED: uncharacterized protein LOC105157...   186   2e-47
ref|XP_012846704.1| PREDICTED: uncharacterized protein LOC105966...   173   2e-42
emb|CDP20930.1| unnamed protein product [Coffea canephora]            169   1e-41
ref|XP_007026454.1| Uncharacterized protein TCM_030494 [Theobrom...   166   9e-39
ref|XP_007026455.1| Uncharacterized protein TCM_021519 [Theobrom...   160   1e-37
ref|XP_007010391.1| Uncharacterized protein TCM_044158 [Theobrom...   159   1e-36
ref|XP_007031319.1| Uncharacterized protein TCM_016772 [Theobrom...   159   2e-36
ref|XP_007017130.1| Uncharacterized protein TCM_042329 [Theobrom...   157   1e-35
ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobrom...   157   1e-35
ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobrom...   156   2e-35
ref|XP_012065816.1| PREDICTED: uncharacterized protein LOC105628...   149   2e-35
ref|XP_007031317.1| Uncharacterized protein TCM_016768 [Theobrom...   146   5e-35
ref|XP_007040951.1| Uncharacterized protein TCM_016760 [Theobrom...   152   3e-34
ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobrom...   152   3e-34
ref|XP_007046403.1| Uncharacterized protein TCM_011922 [Theobrom...   151   4e-34
ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobrom...   150   2e-33
ref|XP_012841289.1| PREDICTED: uncharacterized protein LOC105961...   144   3e-33
emb|CDP14239.1| unnamed protein product [Coffea canephora]            145   4e-33

>ref|XP_011092915.1| PREDICTED: uncharacterized protein LOC105172985 [Sesamum indicum]
          Length = 470

 Score =  196 bits (498), Expect = 3e-51
 Identities = 93/209 (44%), Positives = 133/209 (63%)
 Frame = -1

Query: 2278 LGTTSFQDGRKVLNYSSVETNRLAGAWKLTLIGKFSFAIPQPKEITSGLSSLKLKGPHSW 2099
            +G  S   G KVL +SS E +RL+  ++  L+GKFS   P  + +   + +   +G  S 
Sbjct: 1    MGVLSRDQGMKVLRFSSDEISRLSLPFRYALVGKFSHGYPSMQNLRRWMLAQGFRGDFSV 60

Query: 2098 KFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPMRVFKWTPSFNSKEEAPLAPVWIRLPG 1919
               N  H+ IK  LEED+ +LW+ + W +   PMRVFKWTP+FN +EE+P+ PVW+RLP 
Sbjct: 61   GAINVRHVFIKFALEEDYTKLWIKSTWFVEGFPMRVFKWTPTFNPREESPIVPVWVRLPE 120

Query: 1918 LPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRLTMARICIELDLLKERLEEIVLCSDGA 1739
            LPI FFD  AL+SIA  +GTPL+ D  TA   R ++AR+C+E++LL+    EI L     
Sbjct: 121  LPIQFFDREALFSIAHLLGTPLRTDVSTATLVRPSVARVCVEINLLEPLQTEIGLGIGTE 180

Query: 1738 VHVQKIVYERVPEYCEFCKHVGHNIQACY 1652
            V +Q ++YER+P+YC  CKH+GH+   CY
Sbjct: 181  VIIQPVIYERLPKYCGACKHLGHDEDECY 209


>ref|XP_011101871.1| PREDICTED: uncharacterized protein LOC105179909 [Sesamum indicum]
          Length = 733

 Score =  192 bits (488), Expect = 5e-48
 Identities = 84/214 (39%), Positives = 134/214 (62%)
 Frame = -1

Query: 2281 PLGTTSFQDGRKVLNYSSVETNRLAGAWKLTLIGKFSFAIPQPKEITSGLSSLKLKGPHS 2102
            PLG  S   GR  +++++ ET  LA  ++ +L+GKFS   P   ++   ++ L ++G  +
Sbjct: 71   PLGIKSVNQGRPTISFTNTETEELAAPFRFSLVGKFSHGAPPYSQMHQLIARLGIQGAFT 130

Query: 2101 WKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPMRVFKWTPSFNSKEEAPLAPVWIRLP 1922
                N  H +I +  E D++RLW+  IW ++  PMR+FKWTP+F   +E+ + P+++  P
Sbjct: 131  VSMINSKHTLISLSCESDYSRLWLRRIWFLQGFPMRIFKWTPTFTPTQESSVVPIFVCFP 190

Query: 1921 GLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRLTMARICIELDLLKERLEEIVLCSDG 1742
             LP H F   AL+S+A  +G+PLQ+D+ T  K++L+ AR+C+E+DLLK  +EE  L  + 
Sbjct: 191  KLPAHLFHKEALFSVASMVGSPLQIDALTLNKSKLSQARVCVEIDLLKPIIEEFDLHIND 250

Query: 1741 AVHVQKIVYERVPEYCEFCKHVGHNIQACYMNGN 1640
               VQK+V+E +P+YC  CKHVGH    C+  GN
Sbjct: 251  VTIVQKVVFEYLPKYCFLCKHVGHKDSDCFSKGN 284


>ref|XP_011071645.1| PREDICTED: uncharacterized protein LOC105157045 [Sesamum indicum]
          Length = 507

 Score =  186 bits (472), Expect = 2e-47
 Identities = 89/213 (41%), Positives = 122/213 (57%)
 Frame = -1

Query: 2278 LGTTSFQDGRKVLNYSSVETNRLAGAWKLTLIGKFSFAIPQPKEITSGLSSLKLKGPHSW 2099
            +GT    D    L ++  ET  LA  +K  L+GKFS   P    +   ++   +K   + 
Sbjct: 102  IGTVLTGDKGPTLLFTDDETEVLAAPFKFALVGKFSHGAPSYSILHKLIAGTGIKNKFTV 161

Query: 2098 KFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPMRVFKWTPSFNSKEEAPLAPVWIRLPG 1919
               N  H++I +  E DF+RLW+  IW I+  PMRVFKWTP+F   +E+ + PVW+  P 
Sbjct: 162  SMLNTRHVLISLSCEADFSRLWLRRIWYIQGYPMRVFKWTPAFTPSKESSIVPVWVSFPE 221

Query: 1918 LPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRLTMARICIELDLLKERLEEIVLCSDGA 1739
            LP H F    L+++A  IGTPLQ+D  T  +++L+ AR CIELDLLK RLE   +   G 
Sbjct: 222  LPAHLFRKEVLFTVASMIGTPLQIDDATLNQSKLSKARACIELDLLKPRLENFQIQICGT 281

Query: 1738 VHVQKIVYERVPEYCEFCKHVGHNIQACYMNGN 1640
              VQ+I YE +P YC  CKHVGH    CY  G+
Sbjct: 282  TIVQRIEYEDIPHYCSLCKHVGHQDSDCYTKGD 314


>ref|XP_012846704.1| PREDICTED: uncharacterized protein LOC105966659 [Erythranthe guttata]
          Length = 582

 Score =  173 bits (439), Expect = 2e-42
 Identities = 86/219 (39%), Positives = 133/219 (60%), Gaps = 3/219 (1%)
 Frame = -1

Query: 2287 IVPLGTTSFQDGRKVLNYSSVETNRLAGAWKLTLIGKFSFAIPQPKEITSGLSSLKLKGP 2108
            I P+GT    DG+ VL +S  E +++    K TLIGKFS  I   K +   +  LK +G 
Sbjct: 88   IAPIGTIKVIDGKNVLYFSKEEVDKMLEPLKYTLIGKFSHGIHHYKVMEKFIYDLKPRGS 147

Query: 2107 HSWKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPMRVFKWTPSFNSKEEAPLAPVWIR 1928
                  N  H++I+  + + ++ L   +I  I   PMRVFK+TP FN K E  +APVW+ 
Sbjct: 148  FELHKLNYRHVLIQFSVLDYYSLLLRRSICYIDGLPMRVFKYTPGFNLKNETSIAPVWVN 207

Query: 1927 LPGLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRLTMARICIELDLLKERLEEIVLCS 1748
            +PG+P + ++  A++ +A  IG PL+ D  TA + +L++AR C+E+DLLK R+E+I + +
Sbjct: 208  VPGVPPYMYNREAIFFLASSIGNPLEFDDFTADRKKLSVARFCVEIDLLKPRVEQIPVMT 267

Query: 1747 ---DGAVHVQKIVYERVPEYCEFCKHVGHNIQACYMNGN 1640
               D  +    + YE VP++C FC H+GH+++ CYMNGN
Sbjct: 268  GYDDVEMISLPVNYENVPKFCTFCSHLGHSVENCYMNGN 306


>emb|CDP20930.1| unnamed protein product [Coffea canephora]
          Length = 497

 Score =  169 bits (429), Expect = 1e-41
 Identities = 85/212 (40%), Positives = 125/212 (58%), Gaps = 1/212 (0%)
 Frame = -1

Query: 2266 SFQDGRKVLNYSSVETNRLAGAWKLTLIGKFSFAIPQPKEITSGLSSLKLKGPHSWKFAN 2087
            S   G   + +S  + ++LA  ++  L+GKFS   P  ++I    +SL LK   S    +
Sbjct: 39   SVYKGEAAVVFSKADADKLAAPFQWALVGKFSHGRPSLEDIRKFFASLNLKDHVSIGLMD 98

Query: 2086 PSHIIIKVQLEEDFNRLWMGTIWSIRDCPMRVFKWTPSFNSKEEAPLAPVWIRLPGLPIH 1907
              H++IK   E DFNR+WM  IW +   PMRVF+WT  F+   E+ LAPVW+ LP LPIH
Sbjct: 99   YRHVLIKCMAEADFNRIWMRGIWQLGKYPMRVFRWTREFHVLRESSLAPVWVVLPALPIH 158

Query: 1906 FFDYHALYSIAKEIGTPLQVDSPTAKKTRLTMARICIELDLLKERLEEIVLCSDG-AVHV 1730
            +FD H+L+SI   +G PL +DS TA  TR ++AR+C+ELD+ K   + + +  +G +   
Sbjct: 159  YFDKHSLFSILSPVGRPLFLDSATAAGTRPSLARVCVELDVAKSFTQRVWVAVEGESGFW 218

Query: 1729 QKIVYERVPEYCEFCKHVGHNIQACYMNGNNV 1634
            Q+IV E +P YC  C  +GH+ + C  N   V
Sbjct: 219  QRIVPENMPLYCSSCSRLGHSQEQCKKNVTEV 250


>ref|XP_007026454.1| Uncharacterized protein TCM_030494 [Theobroma cacao]
            gi|508781820|gb|EOY29076.1| Uncharacterized protein
            TCM_030494 [Theobroma cacao]
          Length = 876

 Score =  166 bits (419), Expect = 9e-39
 Identities = 87/223 (39%), Positives = 126/223 (56%), Gaps = 7/223 (3%)
 Frame = -1

Query: 2287 IVPLGTTSFQDGRKVLNYSSVETNRLAGAWKLTLIGKFSFAIPQPKEITSGLSSLKLKGP 2108
            + P   T     +  + +   E   LA  +K  ++GKFS  +P+  EI     SL L G 
Sbjct: 103  VPPTRPTFRYKDKPAVRFFEDEIEALAQPFKFAIVGKFS-KMPRLTEIRQSFVSLGLSGV 161

Query: 2107 HSWKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPMRVFKWTPSFNSKEEAPLAPVWIR 1928
            ++ ++ N  HI+I +  E+DFNR+W    W I +  MRVFKWTP F + +E+P+ PVWI 
Sbjct: 162  YNIRWMNYKHILIHLSNEQDFNRIWTKQTWFITNQKMRVFKWTPDFETDKESPIVPVWIS 221

Query: 1927 LPGLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRLTMARICIELDLLKERLEE--IVL 1754
             P L  H F+  AL  IAK IG PL +D  TA  TR ++AR+CIE D LK  ++   IV+
Sbjct: 222  FPNLKAHLFEKSALLMIAKAIGNPLYIDEATANGTRPSVARVCIEYDCLKPPVDSVWIVV 281

Query: 1753 CSDGAV-----HVQKIVYERVPEYCEFCKHVGHNIQACYMNGN 1640
               G+      ++QK+ +  +PEYC  C HVGHN+  C + G+
Sbjct: 282  SKRGSEDMSGGYLQKVEFAPMPEYCNHCCHVGHNVSKCLILGS 324


>ref|XP_007026455.1| Uncharacterized protein TCM_021519 [Theobroma cacao]
            gi|508715060|gb|EOY06957.1| Uncharacterized protein
            TCM_021519 [Theobroma cacao]
          Length = 667

 Score =  160 bits (405), Expect = 1e-37
 Identities = 100/327 (30%), Positives = 165/327 (50%), Gaps = 17/327 (5%)
 Frame = -1

Query: 2356 KSYANVTGXXXXXSINLSFDPKKIVPLGTTSFQDGRKVLNYSSVETNRLAGAWKLTLIGK 2177
            KS+ ++        I L+ DP          ++D    + Y   E   LA  + L L+GK
Sbjct: 56   KSFLSIAAGSKPPVIPLNRDP--------AVYKDRPAAVFYED-EICILAKPFSLCLVGK 106

Query: 2176 FSFAIPQPKEITSGLSSLKLKGPHSWKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPM 1997
            F+  +P+ +E+ S    + L G +  K+ +  H++I +  ++DFNR+W    W I    M
Sbjct: 107  FT-RMPKLQEVRSAFKGIGLSGAYEIKWLDYKHVLIHLSNDQDFNRIWTRQQWFIVGQKM 165

Query: 1996 RVFKWTPSFNSKEEAPLAPVWIRLPGLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRL 1817
            R+FKW+P F +++E+P+ PVWI  P L  H ++  AL  IAK IG PL VD PTAK +R 
Sbjct: 166  RIFKWSPEFEAEKESPVVPVWISFPNLKAHLYEKSALLLIAKTIGKPLFVDEPTAKGSRP 225

Query: 1816 TMARICIELDLLKERLEEIVLCSD----GAV---HVQKIVYERVPEYCEFCKHVGHNIQA 1658
            ++AR+C+E D  +  ++++ + +     G V   + QK+ + ++P+YCE C HVGHN   
Sbjct: 226  SVARVCVEYDCREPPIDQVWIVTQKRETGMVTNGYAQKVEFSQMPDYCEHCCHVGHNETT 285

Query: 1657 CYMNGNNVXXXXXXXXXXXXXRGAGNQESLKQKQGGTSDISKDINTKQ--GEQTNPVHEQ 1484
            C + GNN                   +  LK +   T ++SK    ++  GE+ +     
Sbjct: 286  CLVLGNN------------SKSSGSMKAQLKGQTKQTLNMSKTQTREKTDGEKEDKAKGI 333

Query: 1483 VVEKGEPNPKE--------WNIIRKKG 1427
            +VE+  P  K+        W ++ K G
Sbjct: 334  MVEEIRPATKQTDMSKQSIWRVVGKAG 360


>ref|XP_007010391.1| Uncharacterized protein TCM_044158 [Theobroma cacao]
            gi|508727304|gb|EOY19201.1| Uncharacterized protein
            TCM_044158 [Theobroma cacao]
          Length = 830

 Score =  159 bits (401), Expect = 1e-36
 Identities = 88/317 (27%), Positives = 163/317 (51%), Gaps = 9/317 (2%)
 Frame = -1

Query: 2299 DPKKIVPLGTTSF-QDGRKVLNYSSVETNRLAGAWKLTLIGKFSFAIPQPKEITSGLSSL 2123
            +   ++PL    F    R   ++   E + LA  +K +++GKFS  + + +EI      +
Sbjct: 65   EKSSLIPLDREPFWYKDRPAASFFDDEISTLAQPFKFSMVGKFSRML-RMQEIRVAFKGI 123

Query: 2122 KLKGPHSWKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPMRVFKWTPSFNSKEEAPLA 1943
             L G +  ++ +  HI+I++  E D NR+W+  +W I +  MRVFKW+P F  ++E+ + 
Sbjct: 124  GLIGAYEIRWLDYKHILIQLSNEHDLNRIWLKQVWFISNQKMRVFKWSPEFQPEKESSMV 183

Query: 1942 PVWIRLPGLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRLTMARICIELDLLKERLEE 1763
            PVWI  P L  H ++  AL +I K +G PL VD  TA  TR ++AR+C+E D  +  +++
Sbjct: 184  PVWISFPNLKAHLYEKSALSAIVKTVGRPLMVDEATANGTRPSVARVCVEFDCQQPPIDQ 243

Query: 1762 IVLCS----DGAV---HVQKIVYERVPEYCEFCKHVGHNIQACYMNGNNVXXXXXXXXXX 1604
            + + +     G+V   ++QK+ + R+ E+C  C HVGH + +C + GN            
Sbjct: 244  VWIVTRNRQSGSVMGGYMQKVEFARLSEFCTHCSHVGHGVSSCMVIGNRPEKNKQPM--- 300

Query: 1603 XXXRGAGNQESLKQKQGGTSDISKDINTKQGEQTNPVHEQVVEKGEPNPKEWNIIRKKGP 1424
                  G ++  K+ +  T+    D+  ++ ++T P+  +     +     W ++ + GP
Sbjct: 301  -----GGKKQLKKEDKDRTNARKGDLKPQEEKETEPIQAE----QQKQSTRWQVMARPGP 351

Query: 1423 RET-GFISQEVLKLAKK 1376
                G   +E++  A+K
Sbjct: 352  SSAKGTRGEELVLNAQK 368


>ref|XP_007031319.1| Uncharacterized protein TCM_016772 [Theobroma cacao]
            gi|508710348|gb|EOY02245.1| Uncharacterized protein
            TCM_016772 [Theobroma cacao]
          Length = 1296

 Score =  159 bits (402), Expect = 2e-36
 Identities = 86/224 (38%), Positives = 130/224 (58%), Gaps = 8/224 (3%)
 Frame = -1

Query: 2287 IVPLGTT-SFQDGRKVLNYSSVETNRLAGAWKLTLIGKFSFAIPQPKEITSGLSSLKLKG 2111
            ++PL    S+   R   ++   E   LA ++K ++IGKF+  +P+ +EI +    + L G
Sbjct: 75   VIPLNREPSWYRDRPAASFFDNEIATLALSFKFSMIGKFT-RMPKLQEIRTAFKGIGLVG 133

Query: 2110 PHSWKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPMRVFKWTPSFNSKEEAPLAPVWI 1931
             ++ ++ +  HI+I +  E D NR+WM   W I +  MRVFKWTP F+ ++E+ L PVWI
Sbjct: 134  AYNIRWLDYKHILIHLSNEHDLNRIWMKQNWFIVNKKMRVFKWTPEFHPEKESSLVPVWI 193

Query: 1930 RLPGLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRLTMARICIELDLLKERLEEIVLC 1751
              P L  HF++   L  IAK +G PL VD  TA  TR  +ARIC+E D  K  L++I + 
Sbjct: 194  SFPNLRAHFYEKSTLMMIAKSVGRPLFVDEATANGTRPNVARICVEYDCQKSLLDQIWIV 253

Query: 1750 S----DGAV---HVQKIVYERVPEYCEFCKHVGHNIQACYMNGN 1640
            +     G V    +QK+ + ++P+YC  C HVGHN  AC + GN
Sbjct: 254  TRSRQTGEVTGGFIQKVEFVKMPDYCTHCCHVGHNASACLVLGN 297


>ref|XP_007017130.1| Uncharacterized protein TCM_042329 [Theobroma cacao]
            gi|508787493|gb|EOY34749.1| Uncharacterized protein
            TCM_042329 [Theobroma cacao]
          Length = 2606

 Score =  157 bits (398), Expect = 1e-35
 Identities = 99/322 (30%), Positives = 157/322 (48%), Gaps = 12/322 (3%)
 Frame = -1

Query: 2356 KSYANVTGXXXXXSINLSFDPKKIVPLGTTSFQDGRKVLNYSSVETNRLAGAWKLTLIGK 2177
            KS+ ++        I LS DP          F+D R    +   E   LA   KL+L+GK
Sbjct: 1688 KSFLSIVSGDKPPVIPLSRDP--------LVFKD-RPAAAFFEDEIQTLAQPLKLSLVGK 1738

Query: 2176 FSFAIPQPKEITSGLSSLKLKGPHSWKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPM 1997
            FS  +P+ +++ S    + L G +  ++ +  H++I +  E+D NR+W   +W I +  M
Sbjct: 1739 FS-RMPKLQDVRSAFKGIGLTGAYEVRWLDYKHVLIHLSNEQDCNRVWTKQVWFIANQKM 1797

Query: 1996 RVFKWTPSFNSKEEAPLAPVWIRLPGLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRL 1817
            RVFKWTP F  ++E+ + PVWI  P L  H F+  AL  IAK +G PL VD  TA  +R 
Sbjct: 1798 RVFKWTPEFEPEKESAVVPVWIAFPNLKAHLFEKSALLLIAKTVGKPLFVDEATANGSRP 1857

Query: 1816 TMARICIELDLLKERLEEIVLC----SDGAV---HVQKIVYERVPEYCEFCKHVGHNIQA 1658
            ++AR+CIE D  +  ++++ +       G V   + Q++ + ++P YC+ C HVGH    
Sbjct: 1858 SVARVCIEFDCRRPPIDQVWIVVQNRETGTVTSGYPQRVEFSQMPAYCDHCCHVGHKEND 1917

Query: 1657 CYMNGNNVXXXXXXXXXXXXXRGAGNQESLK-----QKQGGTSDISKDINTKQGEQTNPV 1493
            C + GN                G    +SL+     +K G      K++  ++    NP 
Sbjct: 1918 CIVLGNK-----------DKSLGLSKSQSLRTLAVEKKTGYGGGSEKNLEKRK----NPE 1962

Query: 1492 HEQVVEKGEPNPKEWNIIRKKG 1427
             E++V   EP    W  + K G
Sbjct: 1963 KEKIVRPEEPASLRWQQVSKAG 1984



 Score =  148 bits (374), Expect = 8e-33
 Identities = 81/227 (35%), Positives = 124/227 (54%), Gaps = 8/227 (3%)
 Frame = -1

Query: 2290 KIVPLGTTSF-QDGRKVLNYSSVETNRLAGAWKLTLIGKFSFAIPQPKEITSGLSSLKLK 2114
            +I+P     F    R  + +   E   LA  +K +++GKFS  +P+  +I +    + L 
Sbjct: 74   QIIPTNREPFWYRDRPAVAFFEDEIVALAQPFKHSMVGKFS-RMPKLNDIRAAFKGISLV 132

Query: 2113 GPHSWKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPMRVFKWTPSFNSKEEAPLAPVW 1934
            G +  ++ +  HI+I +  E+D NRLWM   W I +  MRVFKWTP F  ++E+ L PVW
Sbjct: 133  GVYEIRWLDYKHILIHLSNEQDLNRLWMRQAWFIANQKMRVFKWTPDFQPEKESSLVPVW 192

Query: 1933 IRLPGLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRLTMARICIELDLLKERLEEI-V 1757
            I  P L  H ++  AL  IAK +G PL VD  TA  TR ++AR+C+E D  +  LE+I +
Sbjct: 193  ISFPNLRAHLYEKSALLMIAKSVGRPLFVDEATANGTRPSVARVCVEYDCQQPPLEQIWI 252

Query: 1756 LCSDGAV------HVQKIVYERVPEYCEFCKHVGHNIQACYMNGNNV 1634
            +  D           QK+ + ++P YC  C HVGH+   C + G+ +
Sbjct: 253  VTRDRRTGDITGGFQQKVDFAKLPNYCTHCCHVGHSASTCLVMGHRM 299


>ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
            gi|508778198|gb|EOY25454.1| Uncharacterized protein
            TCM_026877 [Theobroma cacao]
          Length = 2367

 Score =  157 bits (397), Expect = 1e-35
 Identities = 100/321 (31%), Positives = 156/321 (48%), Gaps = 7/321 (2%)
 Frame = -1

Query: 2356 KSYANVTGXXXXXSINLSFDPKKIVPLGTTSFQDGRKVLNYSSVETNRLAGAWKLTLIGK 2177
            KS+ ++        + LS DP          F+D R    +   E   LA   KL+L+GK
Sbjct: 91   KSFLSIVSGQKPPVVPLSRDP--------FVFKD-RPAAAFYEDEIQTLAQPLKLSLVGK 141

Query: 2176 FSFAIPQPKEITSGLSSLKLKGPHSWKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPM 1997
            FS  +P+ +++ S    + L G +  ++ +  HI+I +  E D NR+W   +W I +  M
Sbjct: 142  FS-RMPKLQDVRSAFKGIGLAGAYEVRWLDYKHILIHLTNEHDCNRVWTKQVWFIANQKM 200

Query: 1996 RVFKWTPSFNSKEEAPLAPVWIRLPGLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRL 1817
            RVFKWTP F  ++E+ + PVWI  P L  H F+  AL  IAK +G PL VD  TA  +R 
Sbjct: 201  RVFKWTPEFEPEKESAMVPVWIAFPNLKAHLFEKSALLLIAKTVGKPLFVDEATANGSRP 260

Query: 1816 TMARICIELDLLKERLEEIVLC----SDGAV---HVQKIVYERVPEYCEFCKHVGHNIQA 1658
            ++AR+CIE D  K  ++++ +       G V   + QK+ + ++P YC+ C HVGH    
Sbjct: 261  SVARVCIEYDCRKPPIDQVWIVVQNRETGTVTSGYPQKVEFSQMPAYCDHCCHVGHKEID 320

Query: 1657 CYMNGNNVXXXXXXXXXXXXXRGAGNQESLKQKQGGTSDISKDINTKQGEQTNPVHEQVV 1478
            C + GN                 A      K+  GG+S+ + +      +  NP  E++ 
Sbjct: 321  CIVLGNKDKPLGSSKSQFLRVLEA----EKKKGYGGSSEKNLE------KSKNPEKEKIA 370

Query: 1477 EKGEPNPKEWNIIRKKGPRET 1415
             + EP  + W  + K G   T
Sbjct: 371  RQEEPVSQRWQPVNKAGTSGT 391


>ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
            gi|508710341|gb|EOY02238.1| Uncharacterized protein
            TCM_016762 [Theobroma cacao]
          Length = 2214

 Score =  156 bits (395), Expect = 2e-35
 Identities = 85/247 (34%), Positives = 137/247 (55%), Gaps = 7/247 (2%)
 Frame = -1

Query: 2356 KSYANVTGXXXXXSINLSFDPKKIVPLGTTSFQDGRKVLNYSSVETNRLAGAWKLTLIGK 2177
            KS+ ++        I L+ DP          ++D    + Y   E   LA  + L L+GK
Sbjct: 56   KSFLSIAAGSKPPVIPLNRDP--------AVYKDRPAAVFYED-EICILAKPFSLCLVGK 106

Query: 2176 FSFAIPQPKEITSGLSSLKLKGPHSWKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPM 1997
            F+  +P+ +E+ S    + L G +  K+ +  H++I +  ++DFNR+W    W I    M
Sbjct: 107  FT-RMPKLQEVRSAFKGIGLSGAYEIKWLDYKHVLIHLSNDQDFNRIWTRQQWFIVGQKM 165

Query: 1996 RVFKWTPSFNSKEEAPLAPVWIRLPGLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRL 1817
            R+FKW+P F +++E+P+ PVWI  P L  H ++  AL  IAK IG PL VD  TAK +R 
Sbjct: 166  RIFKWSPEFEAEKESPVVPVWISFPNLKAHLYEKSALLLIAKTIGKPLFVDEATAKGSRP 225

Query: 1816 TMARICIELDLLKERLEEIVLCSD----GAV---HVQKIVYERVPEYCEFCKHVGHNIQA 1658
            ++AR+C+E D  +  ++++ + +     G V   + QK+ + ++P+YCE C HVGHN   
Sbjct: 226  SVARVCVEYDCREPPIDQVWIVTQKRETGMVTNGYAQKVEFSQMPDYCEHCCHVGHNETT 285

Query: 1657 CYMNGNN 1637
            C + GNN
Sbjct: 286  CLVLGNN 292


>ref|XP_012065816.1| PREDICTED: uncharacterized protein LOC105628933 [Jatropha curcas]
          Length = 397

 Score =  149 bits (375), Expect = 2e-35
 Identities = 73/202 (36%), Positives = 113/202 (55%)
 Frame = -1

Query: 2242 LNYSSVETNRLAGAWKLTLIGKFSFAIPQPKEITSGLSSLKLKGPHSWKFANPSHIIIKV 2063
            +++S  E+ +LA  ++  L+G F    P  K +   +  +  KG  S    + SHI+IK 
Sbjct: 62   ISFSWDESMKLANQFRFALVGIFQSGRPNMKSLRQFMDKIGFKGEFSLGLLDSSHILIKF 121

Query: 2062 QLEEDFNRLWMGTIWSIRDCPMRVFKWTPSFNSKEEAPLAPVWIRLPGLPIHFFDYHALY 1883
            +LEEDF+R W+  IW  +   MR+ KWT +F    +  + P WI   GLPIH F   AL+
Sbjct: 122  ELEEDFHRCWLKQIWYFQGFSMRISKWTRNFRPNTDCSIVPTWILFEGLPIHLFAKAALF 181

Query: 1882 SIAKEIGTPLQVDSPTAKKTRLTMARICIELDLLKERLEEIVLCSDGAVHVQKIVYERVP 1703
             IA  IG PL+VD+ TA  +R ++AR+C+ELDL K+   ++ +        Q + YE +P
Sbjct: 182  PIANLIGKPLKVDAATATLSRPSVARVCVELDLSKDLPNKVWIDDGDLGFFQPVNYESLP 241

Query: 1702 EYCEFCKHVGHNIQACYMNGNN 1637
             +C  C  +GH I +C +N  +
Sbjct: 242  LFCTKCCRIGHEILSCPLNSTS 263


>ref|XP_007031317.1| Uncharacterized protein TCM_016768 [Theobroma cacao]
            gi|508710346|gb|EOY02243.1| Uncharacterized protein
            TCM_016768 [Theobroma cacao]
          Length = 351

 Score =  146 bits (369), Expect = 5e-35
 Identities = 89/260 (34%), Positives = 136/260 (52%), Gaps = 17/260 (6%)
 Frame = -1

Query: 2170 FAIPQPKEITSGLSSLKLKGPHSWKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPMRV 1991
            F +P+  EI      + L G +  K+ +  HI+I++  E D NR+W+  +W I +  M V
Sbjct: 83   FWMPRINEIRMAFKGIDLVGAYEIKWLDYKHILIQLSNEHDLNRIWLKQVWFISNQKMCV 142

Query: 1990 FKWTPSFNSKEEAPLAPVWIRLPGLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRLTM 1811
            FKWTP+F  ++E+ L PVWI  P L  H ++  AL  IAK +G PL VD  TAK TR ++
Sbjct: 143  FKWTPNFQPEKESSLVPVWISFPNLRAHLYEKFALLVIAKTVGRPLMVDEATAKGTRPSV 202

Query: 1810 ARICIELDLLKERLEEIVLCS----DGAV---HVQKIVYERVPEYCEFCKHVGHNIQACY 1652
            AR+CIE D  K  ++++ + +     G+V   ++QK+ + ++ EYC  C HVGH +  C 
Sbjct: 203  ARVCIEYDCQKPPIDQVWIVTRDRKTGSVIGGYMQKVDFAKLLEYCSHCCHVGHGVSTCI 262

Query: 1651 MNGNNVXXXXXXXXXXXXXRG--AGNQESLKQKQ----GGTSDISKDINTKQGEQTNPVH 1490
            M G+                G   G ++ ++ +Q    G  +D  + I  KQ  +     
Sbjct: 263  MLGHRPEKRLQPTKTRMKRNGDDEGKEKPIEGEQGMRDGNGTDRVQFIEPKQSTKW---- 318

Query: 1489 EQVVEK----GEPNPKEWNI 1442
             QVVEK    G  +PK  NI
Sbjct: 319  -QVVEKPGTSGVNDPKPINI 337


>ref|XP_007040951.1| Uncharacterized protein TCM_016760 [Theobroma cacao]
            gi|508778196|gb|EOY25452.1| Uncharacterized protein
            TCM_016760 [Theobroma cacao]
          Length = 1109

 Score =  152 bits (384), Expect = 3e-34
 Identities = 81/225 (36%), Positives = 124/225 (55%), Gaps = 7/225 (3%)
 Frame = -1

Query: 2296 PKKIVPLGTTSFQDGRKVLNYSSVETNRLAGAWKLTLIGKFSFAIPQPKEITSGLSSLKL 2117
            P  I P    S    R    +   E   LA  +  +L+GKFS  +P+ +EI      + L
Sbjct: 71   PPVIPPSRDPSVYKDRPAAIFYEDEIQTLARPFSHSLVGKFS-RMPKLQEIRHAFKGIGL 129

Query: 2116 KGPHSWKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPMRVFKWTPSFNSKEEAPLAPV 1937
             G +  ++ +  H++I +  E+DFNR+W+   W I +  MRVFKW P F +++E+ + PV
Sbjct: 130  SGAYEIRWMDYKHVLIHLSNEQDFNRVWVKQQWFIVNQKMRVFKWAPDFEAEKESAMVPV 189

Query: 1936 WIRLPGLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRLTMARICIELDLLKERLEEIV 1757
            WI  P L  H ++  AL  IAK +G PL VD  TA  +R ++AR+C+E D  K+ +EEI 
Sbjct: 190  WISFPNLKAHLYEKSALLLIAKTVGKPLYVDEATANGSRPSVARVCVEYDCRKQPVEEIW 249

Query: 1756 LC----SDGAV---HVQKIVYERVPEYCEFCKHVGHNIQACYMNG 1643
            +       GAV   + Q++ + R+P+YC +C HVGH    C + G
Sbjct: 250  IVIRNRETGAVTGGYSQRVEFARMPDYCGYCSHVGHKENECIVLG 294


>ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
            gi|508710342|gb|EOY02239.1| Uncharacterized protein
            TCM_016763 [Theobroma cacao]
          Length = 2127

 Score =  152 bits (385), Expect = 3e-34
 Identities = 84/231 (36%), Positives = 130/231 (56%), Gaps = 8/231 (3%)
 Frame = -1

Query: 2308 LSFDPKKIVPLGTTSF-QDGRKVLNYSSVETNRLAGAWKLTLIGKFSFAIPQPKEITSGL 2132
            +S +   +VPL    F    R    +   E + LA  +KL+L+GKFS  +P+ +E+ S  
Sbjct: 91   VSGEKPSVVPLTRDPFVYKDRPAAAFFEDEIHILAQPFKLSLVGKFS-RMPKLQEVRSAF 149

Query: 2131 SSLKLKGPHSWKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPMRVFKWTPSFNSKEEA 1952
              + L G +  ++ +  HI+I +  E+DFNR W    W I +  MRVFKWTP F  ++E+
Sbjct: 150  KGIGLAGSYEIRWLDYKHILIHLSNEQDFNRFWTKQAWFIANQKMRVFKWTPEFEPEKES 209

Query: 1951 PLAPVWIRLPGLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRLTMARICIELDLLKER 1772
             + PVWI  P L  H F+  AL  IAK +G PL +D  TA  +R ++AR+CIE D  +  
Sbjct: 210  AVVPVWISFPNLKAHLFEKSALLLIAKTVGKPLFIDEATANGSRPSVARVCIEYDCREPP 269

Query: 1771 LEEIVLC----SDGAV---HVQKIVYERVPEYCEFCKHVGHNIQACYMNGN 1640
            ++++ +     + GAV   + QK+ + ++P YC+ C HVGH    C + GN
Sbjct: 270  VDQVWIVVQNRATGAVTSGYPQKVEFAQMPAYCDHCCHVGHKEINCIVLGN 320


>ref|XP_007046403.1| Uncharacterized protein TCM_011922 [Theobroma cacao]
            gi|508710338|gb|EOY02235.1| Uncharacterized protein
            TCM_011922 [Theobroma cacao]
          Length = 928

 Score =  151 bits (382), Expect = 4e-34
 Identities = 85/246 (34%), Positives = 134/246 (54%), Gaps = 7/246 (2%)
 Frame = -1

Query: 2356 KSYANVTGXXXXXSINLSFDPKKIVPLGTTSFQDGRKVLNYSSVETNRLAGAWKLTLIGK 2177
            KS+ ++T       I L+ +P          ++D    + Y   E   LA  + L L+GK
Sbjct: 56   KSFLSITAGSKPPVIPLNRNP--------VVYKDRPAAVFYED-EICILAKPFSLCLVGK 106

Query: 2176 FSFAIPQPKEITSGLSSLKLKGPHSWKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPM 1997
            F+  +P+ +E+ S    + L G +  K+ +  H+II +  ++DFNR+W    W I    M
Sbjct: 107  FT-RMPKLQEVRSAFKGIGLSGAYEIKWLDYKHVIIHLSNDQDFNRIWTRQQWFIVGQKM 165

Query: 1996 RVFKWTPSFNSKEEAPLAPVWIRLPGLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRL 1817
            R+FKW+P F +++E+P+ PVWI  P L  H ++  AL  IAK IG PL VD  TAK +R 
Sbjct: 166  RIFKWSPEFEAEKESPVVPVWISFPNLKAHLYEKFALLLIAKTIGRPLFVDEATAKGSRP 225

Query: 1816 TMARICIELDLLKERLEEIVLCSD----GAV---HVQKIVYERVPEYCEFCKHVGHNIQA 1658
            ++AR+C E D  K  + ++ + +     G V   + QK+ + ++P YC+ C HVGHN   
Sbjct: 226  SVARVCAEYDCRKPPINQVWIVTQKRETGTVTNGYAQKVEFSQMPAYCDHCCHVGHNETN 285

Query: 1657 CYMNGN 1640
            C + GN
Sbjct: 286  CLVLGN 291


>ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobroma cacao]
            gi|508715063|gb|EOY06960.1| Uncharacterized protein
            TCM_021522 [Theobroma cacao]
          Length = 3503

 Score =  150 bits (379), Expect = 2e-33
 Identities = 95/313 (30%), Positives = 155/313 (49%), Gaps = 9/313 (2%)
 Frame = -1

Query: 2287 IVPLGTTSFQ-DGRKVLNYSSVETNRLAGAWKLTLIGKFSFAIPQPKEITSGLSSLKLKG 2111
            +VPL    F    R    +   E   LA  +KL+L+GKFS  +P+ +++ +    + L G
Sbjct: 1770 VVPLTRDPFVFKDRPAAAFFEDEIQTLAKPFKLSLVGKFS-RMPKLQDVRAAFKGIGLAG 1828

Query: 2110 PHSWKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPMRVFKWTPSFNSKEEAPLAPVWI 1931
             +  ++ +  H++I +  E+DFNR+W    W I    MRVFKWTP F  ++E+ + PVWI
Sbjct: 1829 AYEVRWLDYKHVLIHLSNEQDFNRIWTKQNWFIATQKMRVFKWTPEFEPEKESAVVPVWI 1888

Query: 1930 RLPGLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRLTMARICIELDLLKERLEEIVLC 1751
              P L  H F+  AL  IAK +G PL VD  TA  +R ++AR+C+E D  +  L+++ + 
Sbjct: 1889 SFPNLKAHLFEKSALLLIAKTVGKPLFVDEATANGSRPSVARVCVEFDCRQPPLDQVWIV 1948

Query: 1750 ----SDGAV---HVQKIVYERVPEYCEFCKHVGHNIQACYMNGNNVXXXXXXXXXXXXXR 1592
                  G +   + Q++ + ++P YC+ C HVGH    C + GN                
Sbjct: 1949 VQNRKTGEITNGYSQRVEFAQMPAYCDHCCHVGHKETDCILLGNKARPPGITKQPNSRLE 2008

Query: 1591 GAGNQESLKQKQGGTSDISKDINTKQGEQTNPVHEQVVEKGEPNPKEWNIIRKKG-PRET 1415
              G +   K+    T++  K+I   +     P +++++   EP PK     +K+G P   
Sbjct: 2009 DGGRRVGSKEDGEFTTEKRKNIENSK----KPQNDKILYPEEP-PKH----QKRGQPANK 2059

Query: 1414 GFISQEVLKLAKK 1376
            G  S   +   KK
Sbjct: 2060 GSTSGTKIWQGKK 2072



 Score =  144 bits (364), Expect = 1e-31
 Identities = 66/182 (36%), Positives = 107/182 (58%), Gaps = 7/182 (3%)
 Frame = -1

Query: 2164 IPQPKEITSGLSSLKLKGPHSWKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPMRVFK 1985
            +P+ +EI      + L G +  ++ +  HI+I +  E+DFNR+W    W I +  MRVFK
Sbjct: 1    MPKMQEIRQAFKGIGLTGAYVIRWLDYKHILIHLSNEQDFNRIWTKQQWFIANQKMRVFK 60

Query: 1984 WTPSFNSKEEAPLAPVWIRLPGLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRLTMAR 1805
            W+P F +++E+P+ PVWI  P L  H ++  AL  IAK +G PL +D  T+  +R ++AR
Sbjct: 61   WSPDFEAEKESPIVPVWISFPNLKAHLYEKSALLLIAKTVGKPLFIDEATSNASRPSVAR 120

Query: 1804 ICIELDLLKERLEEIVLCSDGAV-------HVQKIVYERVPEYCEFCKHVGHNIQACYMN 1646
            +C+E +     +EEI +     V       + QK+ + ++P+YCE C HVGH++  C + 
Sbjct: 121  VCVEYNCRNAPVEEIWIVIKDRVTGTVTGGYAQKVEFSKMPDYCEHCGHVGHSVSTCLVL 180

Query: 1645 GN 1640
            GN
Sbjct: 181  GN 182


>ref|XP_012841289.1| PREDICTED: uncharacterized protein LOC105961601 [Erythranthe guttata]
          Length = 449

 Score =  144 bits (362), Expect = 3e-33
 Identities = 67/166 (40%), Positives = 107/166 (64%), Gaps = 4/166 (2%)
 Frame = -1

Query: 2125 LKLKGPHSWKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPMRVFKWTPSFNSKEEAPL 1946
            LK +G       N  H++I+  + +D++ L   +I  I   PMRVFK+TP FN K E  +
Sbjct: 8    LKPRGSFELHKLNYRHVLIQFSVLDDYSLLLRRSICYIHGLPMRVFKYTPGFNLKNETSI 67

Query: 1945 APVWIRLPGLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRLTMARICIELDLLKERLE 1766
            APVW+ +PG+P + ++  A++ +A  IG PL+ D  TA + ++++AR C+E+DLLK R+E
Sbjct: 68   APVWVNVPGVPPYMYNREAIFFLASSIGNPLEFDDFTADRKKISVARFCVEIDLLKPRVE 127

Query: 1765 EIVLCSDGAVHVQKIV----YERVPEYCEFCKHVGHNIQACYMNGN 1640
            +I + + G   ++ I     YE VP++C FC H+GH+++ CYMNGN
Sbjct: 128  QIPVMT-GYDDIEMISLPGNYENVPKFCTFCSHLGHSVENCYMNGN 172


>emb|CDP14239.1| unnamed protein product [Coffea canephora]
          Length = 587

 Score =  145 bits (367), Expect = 4e-33
 Identities = 74/201 (36%), Positives = 111/201 (55%), Gaps = 1/201 (0%)
 Frame = -1

Query: 2254 GRKVLNYSSVETNRLAGAWKLTLIGKFSFAIPQPKEITSGLSSLKLKGPHSWKFANPSHI 2075
            G   + +S+ +   +A  ++ TL+GKFS   P   ++   LS+L LK   +    +  H+
Sbjct: 6    GEPAVVFSAADIAVVAAPFRYTLVGKFSKGRPLLPDLRKFLSTLDLKDTATVGLLDARHV 65

Query: 2074 IIKVQLEEDFNRLWMGTIWSIRDCPMRVFKWTPSFNSKEEAPLAPVWIRLPGLPIHFFDY 1895
            ++K Q E DF R+W  ++W +   PMRVFKWT  F+   E+ L P+W RLP LPIH F  
Sbjct: 66   LLKFQCEADFLRVWGRSLWYVNGSPMRVFKWTSKFHVNRESSLVPIWFRLPKLPIHLFAK 125

Query: 1894 HALYSIAKEIGTPLQVDSPTAKKTRLTMARICIELDLLKERLEEI-VLCSDGAVHVQKIV 1718
              L+ +   +GTPL VD+ T+  +R  +AR+C+E+DLLK     + V   DG    Q ++
Sbjct: 126  PCLFHLVSCLGTPLFVDAATSSFSRPNVARVCVEVDLLKSIPSRVWVDMGDGDGFWQVLI 185

Query: 1717 YERVPEYCEFCKHVGHNIQAC 1655
             E +P YC  C   GH    C
Sbjct: 186  PENLPNYCSHCYRQGHGEDQC 206

