BLASTX nr result

ID: Rehmannia27_contig00019221 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia27_contig00019221
         (3191 letters)

Database: ./nr 
           84,704,028 sequences; 31,038,470,784 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011092915.1| PREDICTED: uncharacterized protein LOC105172...   202   1e-52
ref|XP_011071645.1| PREDICTED: uncharacterized protein LOC105157...   189   5e-48
ref|XP_011101871.1| PREDICTED: uncharacterized protein LOC105179...   191   4e-47
ref|XP_012846704.1| PREDICTED: uncharacterized protein LOC105966...   179   5e-44
emb|CDP20930.1| unnamed protein product [Coffea canephora]            170   2e-41
ref|XP_012065816.1| PREDICTED: uncharacterized protein LOC105628...   159   1e-38
ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobrom...   162   6e-37
ref|XP_007026454.1| Uncharacterized protein TCM_030494 [Theobrom...   159   2e-36
ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobrom...   160   5e-36
ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobrom...   157   4e-35
ref|XP_007017130.1| Uncharacterized protein TCM_042329 [Theobrom...   156   6e-35
ref|XP_007031319.1| Uncharacterized protein TCM_016772 [Theobrom...   155   1e-34
ref|XP_007010391.1| Uncharacterized protein TCM_044158 [Theobrom...   153   1e-34
ref|XP_007026455.1| Uncharacterized protein TCM_021519 [Theobrom...   152   2e-34
ref|XP_007031317.1| Uncharacterized protein TCM_016768 [Theobrom...   142   2e-33
ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobrom...   151   2e-33
ref|XP_011075252.1| PREDICTED: uncharacterized protein LOC105159...   145   3e-33
ref|XP_012841289.1| PREDICTED: uncharacterized protein LOC105961...   143   8e-33
ref|XP_007023857.1| Uncharacterized protein TCM_028230 [Theobrom...   147   8e-33
ref|XP_011094921.1| PREDICTED: uncharacterized protein LOC105174...   134   1e-32

>ref|XP_011092915.1| PREDICTED: uncharacterized protein LOC105172985 [Sesamum indicum]
          Length = 470

 Score =  202 bits (513), Expect = 1e-52
 Identities = 98/209 (46%), Positives = 131/209 (62%)
 Frame = +3

Query: 342 IGTHKEKDGQQVLGFSSLENDRLAETWKLTLIGKFSFAIPHPKGIVSGFSALNLKGPFSW 521
           +G      G +VL FSS E  RL+  ++  L+GKFS   P  + +     A   +G FS 
Sbjct: 1   MGVLSRDQGMKVLRFSSDEISRLSLPFRYALVGKFSHGYPSMQNLRRWMLAQGFRGDFSV 60

Query: 522 SFANPSHIIIKLHLEEDYNKLWMGTIWSLADCPMRVFKWTPSFNPRMEAPLAPVWIRLPG 701
              N  H+ IK  LEEDY KLW+ + W +   PMRVFKWTP+FNPR E+P+ PVW+RLP 
Sbjct: 61  GAINVRHVFIKFALEEDYTKLWIKSTWFVEGFPMRVFKWTPTFNPREESPIVPVWVRLPE 120

Query: 702 LPIHFFDHNALFAISKVIGTPLQVDSPTAVRSRLSMARVCVELDLLKEKIPEIILEFAGT 881
           LPI FFD  ALF+I+ ++GTPL+ D  TA   R S+ARVCVE++LL+    EI L     
Sbjct: 121 LPIQFFDREALFSIAHLLGTPLRTDVSTATLVRPSVARVCVEINLLEPLQTEIGLGIGTE 180

Query: 882 IHVQKIVYERIPDYCTHCKHIGHSIEGCY 968
           + +Q ++YER+P YC  CKH+GH  + CY
Sbjct: 181 VIIQPVIYERLPKYCGACKHLGHDEDECY 209


>ref|XP_011071645.1| PREDICTED: uncharacterized protein LOC105157045 [Sesamum indicum]
          Length = 507

 Score =  189 bits (480), Expect = 5e-48
 Identities = 90/213 (42%), Positives = 124/213 (58%)
 Frame = +3

Query: 342 IGTHKEKDGQQVLGFSSLENDRLAETWKLTLIGKFSFAIPHPKGIVSGFSALNLKGPFSW 521
           IGT    D    L F+  E + LA  +K  L+GKFS   P    +    +   +K  F+ 
Sbjct: 102 IGTVLTGDKGPTLLFTDDETEVLAAPFKFALVGKFSHGAPSYSILHKLIAGTGIKNKFTV 161

Query: 522 SFANPSHIIIKLHLEEDYNKLWMGTIWSLADCPMRVFKWTPSFNPRMEAPLAPVWIRLPG 701
           S  N  H++I L  E D+++LW+  IW +   PMRVFKWTP+F P  E+ + PVW+  P 
Sbjct: 162 SMLNTRHVLISLSCEADFSRLWLRRIWYIQGYPMRVFKWTPAFTPSKESSIVPVWVSFPE 221

Query: 702 LPIHFFDHNALFAISKVIGTPLQVDSPTAVRSRLSMARVCVELDLLKEKIPEIILEFAGT 881
           LP H F    LF ++ +IGTPLQ+D  T  +S+LS AR C+ELDLLK ++    ++  GT
Sbjct: 222 LPAHLFRKEVLFTVASMIGTPLQIDDATLNQSKLSKARACIELDLLKPRLENFQIQICGT 281

Query: 882 IHVQKIVYERIPDYCTHCKHIGHSIEGCYMNGN 980
             VQ+I YE IP YC+ CKH+GH    CY  G+
Sbjct: 282 TIVQRIEYEDIPHYCSLCKHVGHQDSDCYTKGD 314


>ref|XP_011101871.1| PREDICTED: uncharacterized protein LOC105179909 [Sesamum indicum]
          Length = 733

 Score =  191 bits (485), Expect = 4e-47
 Identities = 87/214 (40%), Positives = 129/214 (60%)
 Frame = +3

Query: 339 PIGTHKEKDGQQVLGFSSLENDRLAETWKLTLIGKFSFAIPHPKGIVSGFSALNLKGPFS 518
           P+G      G+  + F++ E + LA  ++ +L+GKFS   P    +    + L ++G F+
Sbjct: 71  PLGIKSVNQGRPTISFTNTETEELAAPFRFSLVGKFSHGAPPYSQMHQLIARLGIQGAFT 130

Query: 519 WSFANPSHIIIKLHLEEDYNKLWMGTIWSLADCPMRVFKWTPSFNPRMEAPLAPVWIRLP 698
            S  N  H +I L  E DY++LW+  IW L   PMR+FKWTP+F P  E+ + P+++  P
Sbjct: 131 VSMINSKHTLISLSCESDYSRLWLRRIWFLQGFPMRIFKWTPTFTPTQESSVVPIFVCFP 190

Query: 699 GLPIHFFDHNALFAISKVIGTPLQVDSPTAVRSRLSMARVCVELDLLKEKIPEIILEFAG 878
            LP H F   ALF+++ ++G+PLQ+D+ T  +S+LS ARVCVE+DLLK  I E  L    
Sbjct: 191 KLPAHLFHKEALFSVASMVGSPLQIDALTLNKSKLSQARVCVEIDLLKPIIEEFDLHIND 250

Query: 879 TIHVQKIVYERIPDYCTHCKHIGHSIEGCYMNGN 980
              VQK+V+E +P YC  CKH+GH    C+  GN
Sbjct: 251 VTIVQKVVFEYLPKYCFLCKHVGHKDSDCFSKGN 284


>ref|XP_012846704.1| PREDICTED: uncharacterized protein LOC105966659 [Erythranthe guttata]
          Length = 582

 Score =  179 bits (454), Expect = 5e-44
 Identities = 132/399 (33%), Positives = 195/399 (48%), Gaps = 28/399 (7%)
 Frame = +3

Query: 333  IVPIGTHKEKDGQQVLGFSSLENDRLAETWKLTLIGKFSFAIPHPKGIVSGFSALNLKGP 512
            I PIGT K  DG+ VL FS  E D++ E  K TLIGKFS  I H K +      L  +G 
Sbjct: 88   IAPIGTIKVIDGKNVLYFSKEEVDKMLEPLKYTLIGKFSHGIHHYKVMEKFIYDLKPRGS 147

Query: 513  FSWSFANPSHIIIKLHLEEDYNKLWMGTIWSLADCPMRVFKWTPSFNPRMEAPLAPVWIR 692
            F     N  H++I+  + + Y+ L   +I  +   PMRVFK+TP FN + E  +APVW+ 
Sbjct: 148  FELHKLNYRHVLIQFSVLDYYSLLLRRSICYIDGLPMRVFKYTPGFNLKNETSIAPVWVN 207

Query: 693  LPGLPIHFFDHNALFAISKVIGTPLQVDSPTAVRSRLSMARVCVELDLLKEKIPEIILEF 872
            +PG+P + ++  A+F ++  IG PL+ D  TA R +LS+AR CVE+DLLK ++ +I +  
Sbjct: 208  VPGVPPYMYNREAIFFLASSIGNPLEFDDFTADRKKLSVARFCVEIDLLKPRVEQIPV-M 266

Query: 873  AGTIHVQKIV----YERIPDYCTHCKHIGHSIEGCYMNGNXXXXXXXXXXXXXXXXSHDQ 1040
             G   V+ I     YE +P +CT C H+GHS+E CYMNGN                    
Sbjct: 267  TGYDDVEMISLPVNYENVPKFCTFCSHLGHSVENCYMNGN-------------------- 306

Query: 1041 ASKGPIGNIEKLRGTMDKSKSLNKEIPIPNPGVNLAAAKNQDWMRVNKKG---------P 1193
             +K P  +       + K  +L KE               Q W RV K+          P
Sbjct: 307  -AKKP--DFPPPPQRIPKPTALPKE--------------KQVWRRVEKRKNVVVENMDIP 349

Query: 1194 RETGLVSKEVLNFAKTAKHTYFKGSDSIDEG--SGINRFSVLDNGVFE-----------S 1334
            + +G  S E  +F++    T    +D+I EG     N F +L++G  E           +
Sbjct: 350  KTSGTKSTENPSFSQAIVRT---TADNISEGDFEHYNPFELLESGAQEDIEAAQIQPDVA 406

Query: 1335 SDDLTQ-FMKGQDKVTELQLDVPIDPI-TMEDDLFKDVE 1445
            S  +T+   KG+   +  +  +  DPI T+ DD   D++
Sbjct: 407  SKHVTKSSKKGRKNTSNTKQPMSPDPIDTIADDQATDLD 445


>emb|CDP20930.1| unnamed protein product [Coffea canephora]
          Length = 497

 Score =  170 bits (430), Expect = 2e-41
 Identities = 88/204 (43%), Positives = 123/204 (60%), Gaps = 1/204 (0%)
 Frame = +3

Query: 366 GQQVLGFSSLENDRLAETWKLTLIGKFSFAIPHPKGIVSGFSALNLKGPFSWSFANPSHI 545
           G+  + FS  + D+LA  ++  L+GKFS   P  + I   F++LNLK   S    +  H+
Sbjct: 43  GEAAVVFSKADADKLAAPFQWALVGKFSHGRPSLEDIRKFFASLNLKDHVSIGLMDYRHV 102

Query: 546 IIKLHLEEDYNKLWMGTIWSLADCPMRVFKWTPSFNPRMEAPLAPVWIRLPGLPIHFFDH 725
           +IK   E D+N++WM  IW L   PMRVF+WT  F+   E+ LAPVW+ LP LPIH+FD 
Sbjct: 103 LIKCMAEADFNRIWMRGIWQLGKYPMRVFRWTREFHVLRESSLAPVWVVLPALPIHYFDK 162

Query: 726 NALFAISKVIGTPLQVDSPTAVRSRLSMARVCVELDLLKEKIPEIILEFAGTIHV-QKIV 902
           ++LF+I   +G PL +DS TA  +R S+ARVCVELD+ K     + +   G     Q+IV
Sbjct: 163 HSLFSILSPVGRPLFLDSATAAGTRPSLARVCVELDVAKSFTQRVWVAVEGESGFWQRIV 222

Query: 903 YERIPDYCTHCKHIGHSIEGCYMN 974
            E +P YC+ C  +GHS E C  N
Sbjct: 223 PENMPLYCSSCSRLGHSQEQCKKN 246


>ref|XP_012065816.1| PREDICTED: uncharacterized protein LOC105628933 [Jatropha curcas]
          Length = 397

 Score =  159 bits (402), Expect = 1e-38
 Identities = 81/203 (39%), Positives = 114/203 (56%)
 Frame = +3

Query: 366 GQQVLGFSSLENDRLAETWKLTLIGKFSFAIPHPKGIVSGFSALNLKGPFSWSFANPSHI 545
           G   + FS  E+ +LA  ++  L+G F    P+ K +      +  KG FS    + SHI
Sbjct: 58  GVPSISFSWDESMKLANQFRFALVGIFQSGRPNMKSLRQFMDKIGFKGEFSLGLLDSSHI 117

Query: 546 IIKLHLEEDYNKLWMGTIWSLADCPMRVFKWTPSFNPRMEAPLAPVWIRLPGLPIHFFDH 725
           +IK  LEED+++ W+  IW      MR+ KWT +F P  +  + P WI   GLPIH F  
Sbjct: 118 LIKFELEEDFHRCWLKQIWYFQGFSMRISKWTRNFRPNTDCSIVPTWILFEGLPIHLFAK 177

Query: 726 NALFAISKVIGTPLQVDSPTAVRSRLSMARVCVELDLLKEKIPEIILEFAGTIHVQKIVY 905
            ALF I+ +IG PL+VD+ TA  SR S+ARVCVELDL K+   ++ ++       Q + Y
Sbjct: 178 AALFPIANLIGKPLKVDAATATLSRPSVARVCVELDLSKDLPNKVWIDDGDLGFFQPVNY 237

Query: 906 ERIPDYCTHCKHIGHSIEGCYMN 974
           E +P +CT C  IGH I  C +N
Sbjct: 238 ESLPLFCTKCCRIGHEILSCPLN 260


>ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobroma cacao]
            gi|508715062|gb|EOY06959.1| Uncharacterized protein
            TCM_021521 [Theobroma cacao]
          Length = 1951

 Score =  162 bits (411), Expect = 6e-37
 Identities = 107/346 (30%), Positives = 171/346 (49%), Gaps = 21/346 (6%)
 Frame = +3

Query: 408  LAETWKLTLIGKFSFAIPHPKGIVSGFSALNLKGPFSWSFANPSHIIIKLHLEEDYNKLW 587
            LA+ +K +++GKFS  +P    I + F  + L G +   + +  HI+I L  E+D N+LW
Sbjct: 101  LAQPFKHSMVGKFS-RMPKLNDIRAAFKGIGLVGVYEIRWLDYKHILIHLSNEQDLNRLW 159

Query: 588  MGTIWSLADCPMRVFKWTPSFNPRMEAPLAPVWIRLPGLPIHFFDHNALFAISKVIGTPL 767
            M   W +A+  MRVFKW+P F P  E+ L PVWI  P L  H ++ +AL  I+K +G PL
Sbjct: 160  MRQAWFIANQKMRVFKWSPDFQPEKESSLVPVWISFPNLRAHLYEKSALLMIAKSVGRPL 219

Query: 768  QVDSPTAVRSRLSMARVCVELDLLKEKIPEIIL--------EFAGTIHVQKIVYERIPDY 923
             VD  TA  +R S+ARVCVE D  +  + +I +        +  G    QK+ + ++P+Y
Sbjct: 220  FVDEATANGTRPSVARVCVEYDCQQPPLEQIWIVSRDRRTGDITGGFQ-QKVDFAKLPNY 278

Query: 924  CTHCKHIGHSIEGCYMNG------NXXXXXXXXXXXXXXXXSHDQASKGPIGNIEKLRGT 1085
            CTHC H+GHS   C + G      N                  + A+K P G++   +GT
Sbjct: 279  CTHCCHVGHSASTCLVMGHRMEKANNSNAQPYTGRKQAENDGKEVANK-PTGDLMSCKGT 337

Query: 1086 MDKSKSLNKEIPIPNPGVNLAAAKNQDWMRVNKKGP-------RETGLVSKEVLNFAKTA 1244
              K+           PG ++AAA  +     +++ P       +E G + +  +  +  A
Sbjct: 338  DRKNIEERPTAADTVPGEDVAAAAEKKTKNPSREVPLKLFPRWQEVGSLDRPAVQVSIDA 397

Query: 1245 KHTYFKGSDSIDEGSGINRFSVLDNGVFESSDDLTQFMKGQDKVTE 1382
            +      ++  ++ S +NRF+VL +   E +D+  Q  K   K  E
Sbjct: 398  ETVL--ENEGKEQYSSLNRFTVLGSVEKEENDEQQQMEKQGQKDDE 441


>ref|XP_007026454.1| Uncharacterized protein TCM_030494 [Theobroma cacao]
           gi|508781820|gb|EOY29076.1| Uncharacterized protein
           TCM_030494 [Theobroma cacao]
          Length = 876

 Score =  159 bits (403), Expect = 2e-36
 Identities = 82/218 (37%), Positives = 123/218 (56%), Gaps = 7/218 (3%)
 Frame = +3

Query: 348 THKEKDGQQVLGFSSLENDRLAETWKLTLIGKFSFAIPHPKGIVSGFSALNLKGPFSWSF 527
           T + KD   V  F   E + LA+ +K  ++GKFS  +P    I   F +L L G ++  +
Sbjct: 109 TFRYKDKPAVRFFED-EIEALAQPFKFAIVGKFS-KMPRLTEIRQSFVSLGLSGVYNIRW 166

Query: 528 ANPSHIIIKLHLEEDYNKLWMGTIWSLADCPMRVFKWTPSFNPRMEAPLAPVWIRLPGLP 707
            N  HI+I L  E+D+N++W    W + +  MRVFKWTP F    E+P+ PVWI  P L 
Sbjct: 167 MNYKHILIHLSNEQDFNRIWTKQTWFITNQKMRVFKWTPDFETDKESPIVPVWISFPNLK 226

Query: 708 IHFFDHNALFAISKVIGTPLQVDSPTAVRSRLSMARVCVELDLLKEKIPEIIL------- 866
            H F+ +AL  I+K IG PL +D  TA  +R S+ARVC+E D LK  +  + +       
Sbjct: 227 AHLFEKSALLMIAKAIGNPLYIDEATANGTRPSVARVCIEYDCLKPPVDSVWIVVSKRGS 286

Query: 867 EFAGTIHVQKIVYERIPDYCTHCKHIGHSIEGCYMNGN 980
           E     ++QK+ +  +P+YC HC H+GH++  C + G+
Sbjct: 287 EDMSGGYLQKVEFAPMPEYCNHCCHVGHNVSKCLILGS 324


>ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
            gi|508778198|gb|EOY25454.1| Uncharacterized protein
            TCM_026877 [Theobroma cacao]
          Length = 2367

 Score =  160 bits (404), Expect = 5e-36
 Identities = 98/276 (35%), Positives = 140/276 (50%), Gaps = 7/276 (2%)
 Frame = +3

Query: 396  ENDRLAETWKLTLIGKFSFAIPHPKGIVSGFSALNLKGPFSWSFANPSHIIIKLHLEEDY 575
            E   LA+  KL+L+GKFS  +P  + + S F  + L G +   + +  HI+I L  E D 
Sbjct: 126  EIQTLAQPLKLSLVGKFS-RMPKLQDVRSAFKGIGLAGAYEVRWLDYKHILIHLTNEHDC 184

Query: 576  NKLWMGTIWSLADCPMRVFKWTPSFNPRMEAPLAPVWIRLPGLPIHFFDHNALFAISKVI 755
            N++W   +W +A+  MRVFKWTP F P  E+ + PVWI  P L  H F+ +AL  I+K +
Sbjct: 185  NRVWTKQVWFIANQKMRVFKWTPEFEPEKESAMVPVWIAFPNLKAHLFEKSALLLIAKTV 244

Query: 756  GTPLQVDSPTAVRSRLSMARVCVELDLLKEKIPEIIL----EFAGTI---HVQKIVYERI 914
            G PL VD  TA  SR S+ARVC+E D  K  I ++ +       GT+   + QK+ + ++
Sbjct: 245  GKPLFVDEATANGSRPSVARVCIEYDCRKPPIDQVWIVVQNRETGTVTSGYPQKVEFSQM 304

Query: 915  PDYCTHCKHIGHSIEGCYMNGNXXXXXXXXXXXXXXXXSHDQASKGPIGNIEKLRGTMDK 1094
            P YC HC H+GH    C + GN                  ++  KG  G+ EK    ++K
Sbjct: 305  PAYCDHCCHVGHKEIDCIVLGNKDKPLGSSKSQFLRVLEAEK-KKGYGGSSEK---NLEK 360

Query: 1095 SKSLNKEIPIPNPGVNLAAAKNQDWMRVNKKGPRET 1202
            SK+  KE              +Q W  VNK G   T
Sbjct: 361  SKNPEKE-----KIARQEEPVSQRWQPVNKAGTSGT 391


>ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
            gi|508710342|gb|EOY02239.1| Uncharacterized protein
            TCM_016763 [Theobroma cacao]
          Length = 2127

 Score =  157 bits (396), Expect = 4e-35
 Identities = 91/272 (33%), Positives = 136/272 (50%), Gaps = 7/272 (2%)
 Frame = +3

Query: 408  LAETWKLTLIGKFSFAIPHPKGIVSGFSALNLKGPFSWSFANPSHIIIKLHLEEDYNKLW 587
            LA+ +KL+L+GKFS  +P  + + S F  + L G +   + +  HI+I L  E+D+N+ W
Sbjct: 124  LAQPFKLSLVGKFS-RMPKLQEVRSAFKGIGLAGSYEIRWLDYKHILIHLSNEQDFNRFW 182

Query: 588  MGTIWSLADCPMRVFKWTPSFNPRMEAPLAPVWIRLPGLPIHFFDHNALFAISKVIGTPL 767
                W +A+  MRVFKWTP F P  E+ + PVWI  P L  H F+ +AL  I+K +G PL
Sbjct: 183  TKQAWFIANQKMRVFKWTPEFEPEKESAVVPVWISFPNLKAHLFEKSALLLIAKTVGKPL 242

Query: 768  QVDSPTAVRSRLSMARVCVELDLLKEKIPEIIL----EFAGTI---HVQKIVYERIPDYC 926
             +D  TA  SR S+ARVC+E D  +  + ++ +       G +   + QK+ + ++P YC
Sbjct: 243  FIDEATANGSRPSVARVCIEYDCREPPVDQVWIVVQNRATGAVTSGYPQKVEFAQMPAYC 302

Query: 927  THCKHIGHSIEGCYMNGNXXXXXXXXXXXXXXXXSHDQASKGPIGNIEKLRGTMDKSKSL 1106
             HC H+GH    C + GN                 H       + N+EK++   DK K +
Sbjct: 303  DHCCHVGHKEINCIVLGN-----KNGLQGSGKPQPHSVVDADKLRNLEKIKNP-DKGKIV 356

Query: 1107 NKEIPIPNPGVNLAAAKNQDWMRVNKKGPRET 1202
            + E           A   Q W  V K G   T
Sbjct: 357  STED---------QAKHQQKWQPVGKVGTSGT 379


>ref|XP_007017130.1| Uncharacterized protein TCM_042329 [Theobroma cacao]
            gi|508787493|gb|EOY34749.1| Uncharacterized protein
            TCM_042329 [Theobroma cacao]
          Length = 2606

 Score =  156 bits (395), Expect = 6e-35
 Identities = 110/348 (31%), Positives = 166/348 (47%), Gaps = 23/348 (6%)
 Frame = +3

Query: 408  LAETWKLTLIGKFSFAIPHPKGIVSGFSALNLKGPFSWSFANPSHIIIKLHLEEDYNKLW 587
            LA+ +K +++GKFS  +P    I + F  ++L G +   + +  HI+I L  E+D N+LW
Sbjct: 101  LAQPFKHSMVGKFS-RMPKLNDIRAAFKGISLVGVYEIRWLDYKHILIHLSNEQDLNRLW 159

Query: 588  MGTIWSLADCPMRVFKWTPSFNPRMEAPLAPVWIRLPGLPIHFFDHNALFAISKVIGTPL 767
            M   W +A+  MRVFKWTP F P  E+ L PVWI  P L  H ++ +AL  I+K +G PL
Sbjct: 160  MRQAWFIANQKMRVFKWTPDFQPEKESSLVPVWISFPNLRAHLYEKSALLMIAKSVGRPL 219

Query: 768  QVDSPTAVRSRLSMARVCVELDLLKEKIPEIIL--------EFAGTIHVQKIVYERIPDY 923
             VD  TA  +R S+ARVCVE D  +  + +I +        +  G    QK+ + ++P+Y
Sbjct: 220  FVDEATANGTRPSVARVCVEYDCQQPPLEQIWIVTRDRRTGDITGGFQ-QKVDFAKLPNY 278

Query: 924  CTHCKHIGHSIEGCYMNGN-----XXXXXXXXXXXXXXXXSHDQASKGPIGNIEKLRGTM 1088
            CTHC H+GHS   C + G+                        + +  P G+    +GT 
Sbjct: 279  CTHCCHVGHSASTCLVMGHRMEKAENSNAQPYTGRKQAENERKEVANKPTGDPMSSKGTD 338

Query: 1089 DKSKSLNKEIPIPNPGVNLAAA-----KNQDWMRVNKKGPR--ETGLVSKEVLNF---AK 1238
             K+           PG ++AAA     KN       K  PR    G + +  +     AK
Sbjct: 339  RKNIEKRPTAADTVPGGDVAAAVEKKKKNPSREIPTKVFPRWQVVGSLDRPAVQVSIGAK 398

Query: 1239 TAKHTYFKGSDSIDEGSGINRFSVLDNGVFESSDDLTQFMKGQDKVTE 1382
            T      K     ++ S +NRF+VL +   E +++  Q  K   K  E
Sbjct: 399  TVLENVGK-----EQYSSLNRFTVLGSVEKEENEEQQQMEKQGQKDDE 441



 Score =  155 bits (393), Expect = 1e-34
 Identities = 78/202 (38%), Positives = 117/202 (57%), Gaps = 7/202 (3%)
 Frame = +3

Query: 396  ENDRLAETWKLTLIGKFSFAIPHPKGIVSGFSALNLKGPFSWSFANPSHIIIKLHLEEDY 575
            E   LA+  KL+L+GKFS  +P  + + S F  + L G +   + +  H++I L  E+D 
Sbjct: 1723 EIQTLAQPLKLSLVGKFS-RMPKLQDVRSAFKGIGLTGAYEVRWLDYKHVLIHLSNEQDC 1781

Query: 576  NKLWMGTIWSLADCPMRVFKWTPSFNPRMEAPLAPVWIRLPGLPIHFFDHNALFAISKVI 755
            N++W   +W +A+  MRVFKWTP F P  E+ + PVWI  P L  H F+ +AL  I+K +
Sbjct: 1782 NRVWTKQVWFIANQKMRVFKWTPEFEPEKESAVVPVWIAFPNLKAHLFEKSALLLIAKTV 1841

Query: 756  GTPLQVDSPTAVRSRLSMARVCVELDLLKEKIPEIIL----EFAGTI---HVQKIVYERI 914
            G PL VD  TA  SR S+ARVC+E D  +  I ++ +       GT+   + Q++ + ++
Sbjct: 1842 GKPLFVDEATANGSRPSVARVCIEFDCRRPPIDQVWIVVQNRETGTVTSGYPQRVEFSQM 1901

Query: 915  PDYCTHCKHIGHSIEGCYMNGN 980
            P YC HC H+GH    C + GN
Sbjct: 1902 PAYCDHCCHVGHKENDCIVLGN 1923


>ref|XP_007031319.1| Uncharacterized protein TCM_016772 [Theobroma cacao]
           gi|508710348|gb|EOY02245.1| Uncharacterized protein
           TCM_016772 [Theobroma cacao]
          Length = 1296

 Score =  155 bits (391), Expect = 1e-34
 Identities = 78/199 (39%), Positives = 118/199 (59%), Gaps = 8/199 (4%)
 Frame = +3

Query: 408 LAETWKLTLIGKFSFAIPHPKGIVSGFSALNLKGPFSWSFANPSHIIIKLHLEEDYNKLW 587
           LA ++K ++IGKF+  +P  + I + F  + L G ++  + +  HI+I L  E D N++W
Sbjct: 101 LALSFKFSMIGKFT-RMPKLQEIRTAFKGIGLVGAYNIRWLDYKHILIHLSNEHDLNRIW 159

Query: 588 MGTIWSLADCPMRVFKWTPSFNPRMEAPLAPVWIRLPGLPIHFFDHNALFAISKVIGTPL 767
           M   W + +  MRVFKWTP F+P  E+ L PVWI  P L  HF++ + L  I+K +G PL
Sbjct: 160 MKQNWFIVNKKMRVFKWTPEFHPEKESSLVPVWISFPNLRAHFYEKSTLMMIAKSVGRPL 219

Query: 768 QVDSPTAVRSRLSMARVCVELDLLKEKIPEIIL--------EFAGTIHVQKIVYERIPDY 923
            VD  TA  +R ++AR+CVE D  K  + +I +        E  G   +QK+ + ++PDY
Sbjct: 220 FVDEATANGTRPNVARICVEYDCQKSLLDQIWIVTRSRQTGEVTGGF-IQKVEFVKMPDY 278

Query: 924 CTHCKHIGHSIEGCYMNGN 980
           CTHC H+GH+   C + GN
Sbjct: 279 CTHCCHVGHNASACLVLGN 297


>ref|XP_007010391.1| Uncharacterized protein TCM_044158 [Theobroma cacao]
           gi|508727304|gb|EOY19201.1| Uncharacterized protein
           TCM_044158 [Theobroma cacao]
          Length = 830

 Score =  153 bits (387), Expect = 1e-34
 Identities = 74/202 (36%), Positives = 120/202 (59%), Gaps = 7/202 (3%)
 Frame = +3

Query: 396 ENDRLAETWKLTLIGKFSFAIPHPKGIVSGFSALNLKGPFSWSFANPSHIIIKLHLEEDY 575
           E   LA+ +K +++GKFS  +   + I   F  + L G +   + +  HI+I+L  E D 
Sbjct: 91  EISTLAQPFKFSMVGKFSRML-RMQEIRVAFKGIGLIGAYEIRWLDYKHILIQLSNEHDL 149

Query: 576 NKLWMGTIWSLADCPMRVFKWTPSFNPRMEAPLAPVWIRLPGLPIHFFDHNALFAISKVI 755
           N++W+  +W +++  MRVFKW+P F P  E+ + PVWI  P L  H ++ +AL AI K +
Sbjct: 150 NRIWLKQVWFISNQKMRVFKWSPEFQPEKESSMVPVWISFPNLKAHLYEKSALSAIVKTV 209

Query: 756 GTPLQVDSPTAVRSRLSMARVCVELDLLKEKIPEIIL----EFAGTI---HVQKIVYERI 914
           G PL VD  TA  +R S+ARVCVE D  +  I ++ +      +G++   ++QK+ + R+
Sbjct: 210 GRPLMVDEATANGTRPSVARVCVEFDCQQPPIDQVWIVTRNRQSGSVMGGYMQKVEFARL 269

Query: 915 PDYCTHCKHIGHSIEGCYMNGN 980
            ++CTHC H+GH +  C + GN
Sbjct: 270 SEFCTHCSHVGHGVSSCMVIGN 291


>ref|XP_007026455.1| Uncharacterized protein TCM_021519 [Theobroma cacao]
           gi|508715060|gb|EOY06957.1| Uncharacterized protein
           TCM_021519 [Theobroma cacao]
          Length = 667

 Score =  152 bits (383), Expect = 2e-34
 Identities = 74/198 (37%), Positives = 115/198 (58%), Gaps = 7/198 (3%)
 Frame = +3

Query: 408 LAETWKLTLIGKFSFAIPHPKGIVSGFSALNLKGPFSWSFANPSHIIIKLHLEEDYNKLW 587
           LA+ + L L+GKF+  +P  + + S F  + L G +   + +  H++I L  ++D+N++W
Sbjct: 95  LAKPFSLCLVGKFT-RMPKLQEVRSAFKGIGLSGAYEIKWLDYKHVLIHLSNDQDFNRIW 153

Query: 588 MGTIWSLADCPMRVFKWTPSFNPRMEAPLAPVWIRLPGLPIHFFDHNALFAISKVIGTPL 767
               W +    MR+FKW+P F    E+P+ PVWI  P L  H ++ +AL  I+K IG PL
Sbjct: 154 TRQQWFIVGQKMRIFKWSPEFEAEKESPVVPVWISFPNLKAHLYEKSALLLIAKTIGKPL 213

Query: 768 QVDSPTAVRSRLSMARVCVELDLLKEKIPEIIL----EFAGTI---HVQKIVYERIPDYC 926
            VD PTA  SR S+ARVCVE D  +  I ++ +       G +   + QK+ + ++PDYC
Sbjct: 214 FVDEPTAKGSRPSVARVCVEYDCREPPIDQVWIVTQKRETGMVTNGYAQKVEFSQMPDYC 273

Query: 927 THCKHIGHSIEGCYMNGN 980
            HC H+GH+   C + GN
Sbjct: 274 EHCCHVGHNETTCLVLGN 291


>ref|XP_007031317.1| Uncharacterized protein TCM_016768 [Theobroma cacao]
            gi|508710346|gb|EOY02243.1| Uncharacterized protein
            TCM_016768 [Theobroma cacao]
          Length = 351

 Score =  142 bits (359), Expect = 2e-33
 Identities = 71/217 (32%), Positives = 115/217 (52%), Gaps = 7/217 (3%)
 Frame = +3

Query: 450  FAIPHPKGIVSGFSALNLKGPFSWSFANPSHIIIKLHLEEDYNKLWMGTIWSLADCPMRV 629
            F +P    I   F  ++L G +   + +  HI+I+L  E D N++W+  +W +++  M V
Sbjct: 83   FWMPRINEIRMAFKGIDLVGAYEIKWLDYKHILIQLSNEHDLNRIWLKQVWFISNQKMCV 142

Query: 630  FKWTPSFNPRMEAPLAPVWIRLPGLPIHFFDHNALFAISKVIGTPLQVDSPTAVRSRLSM 809
            FKWTP+F P  E+ L PVWI  P L  H ++  AL  I+K +G PL VD  TA  +R S+
Sbjct: 143  FKWTPNFQPEKESSLVPVWISFPNLRAHLYEKFALLVIAKTVGRPLMVDEATAKGTRPSV 202

Query: 810  ARVCVELDLLKEKIPEIIL----EFAGTI---HVQKIVYERIPDYCTHCKHIGHSIEGCY 968
            ARVC+E D  K  I ++ +       G++   ++QK+ + ++ +YC+HC H+GH +  C 
Sbjct: 203  ARVCIEYDCQKPPIDQVWIVTRDRKTGSVIGGYMQKVDFAKLLEYCSHCCHVGHGVSTCI 262

Query: 969  MNGNXXXXXXXXXXXXXXXXSHDQASKGPIGNIEKLR 1079
            M G+                  D+  + PI   + +R
Sbjct: 263  MLGHRPEKRLQPTKTRMKRNGDDEGKEKPIEGEQGMR 299


>ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobroma cacao]
            gi|508715063|gb|EOY06960.1| Uncharacterized protein
            TCM_021522 [Theobroma cacao]
          Length = 3503

 Score =  151 bits (382), Expect = 2e-33
 Identities = 77/202 (38%), Positives = 116/202 (57%), Gaps = 7/202 (3%)
 Frame = +3

Query: 396  ENDRLAETWKLTLIGKFSFAIPHPKGIVSGFSALNLKGPFSWSFANPSHIIIKLHLEEDY 575
            E   LA+ +KL+L+GKFS  +P  + + + F  + L G +   + +  H++I L  E+D+
Sbjct: 1792 EIQTLAKPFKLSLVGKFS-RMPKLQDVRAAFKGIGLAGAYEVRWLDYKHVLIHLSNEQDF 1850

Query: 576  NKLWMGTIWSLADCPMRVFKWTPSFNPRMEAPLAPVWIRLPGLPIHFFDHNALFAISKVI 755
            N++W    W +A   MRVFKWTP F P  E+ + PVWI  P L  H F+ +AL  I+K +
Sbjct: 1851 NRIWTKQNWFIATQKMRVFKWTPEFEPEKESAVVPVWISFPNLKAHLFEKSALLLIAKTV 1910

Query: 756  GTPLQVDSPTAVRSRLSMARVCVELDLLKEKIPEIIL----EFAGTI---HVQKIVYERI 914
            G PL VD  TA  SR S+ARVCVE D  +  + ++ +       G I   + Q++ + ++
Sbjct: 1911 GKPLFVDEATANGSRPSVARVCVEFDCRQPPLDQVWIVVQNRKTGEITNGYSQRVEFAQM 1970

Query: 915  PDYCTHCKHIGHSIEGCYMNGN 980
            P YC HC H+GH    C + GN
Sbjct: 1971 PAYCDHCCHVGHKETDCILLGN 1992



 Score =  148 bits (374), Expect = 2e-32
 Identities = 97/323 (30%), Positives = 157/323 (48%), Gaps = 19/323 (5%)
 Frame = +3

Query: 456  IPHPKGIVSGFSALNLKGPFSWSFANPSHIIIKLHLEEDYNKLWMGTIWSLADCPMRVFK 635
            +P  + I   F  + L G +   + +  HI+I L  E+D+N++W    W +A+  MRVFK
Sbjct: 1    MPKMQEIRQAFKGIGLTGAYVIRWLDYKHILIHLSNEQDFNRIWTKQQWFIANQKMRVFK 60

Query: 636  WTPSFNPRMEAPLAPVWIRLPGLPIHFFDHNALFAISKVIGTPLQVDSPTAVRSRLSMAR 815
            W+P F    E+P+ PVWI  P L  H ++ +AL  I+K +G PL +D  T+  SR S+AR
Sbjct: 61   WSPDFEAEKESPIVPVWISFPNLKAHLYEKSALLLIAKTVGKPLFIDEATSNASRPSVAR 120

Query: 816  VCVELDLLKEKIPEIIL----EFAGTI---HVQKIVYERIPDYCTHCKHIGHSIEGCYMN 974
            VCVE +     + EI +       GT+   + QK+ + ++PDYC HC H+GHS+  C + 
Sbjct: 121  VCVEYNCRNAPVEEIWIVIKDRVTGTVTGGYAQKVEFSKMPDYCEHCGHVGHSVSTCLVL 180

Query: 975  GNXXXXXXXXXXXXXXXXS---HDQASKGPIGNIEKLRGTMDKSKSLNKEIPIPNP---G 1136
            GN                S     Q      G   K    + ++K  +++I    P   G
Sbjct: 181  GNRSENLRKEKLSNVHSKSLAGKKQTENDDKGLDSKPMDDLKRNKETDRKISEERPMMTG 240

Query: 1137 VNLAAAKNQDWMRVNKKGPRETGLVSKEVLNFAKTAKHTYFKGSDS--IDEGS----GIN 1298
             N  A   +    +N++   +  L  + V +  +  K   FKG++    DEG+     +N
Sbjct: 241  RNTEATAEKRNKILNREVLAKHSLQWQAVGHLGQ-PKFNGFKGAERHLEDEGTKQFQNVN 299

Query: 1299 RFSVLDNGVFESSDDLTQFMKGQ 1367
            RFS L  G  + +++  Q  +G+
Sbjct: 300  RFSAL--GSVQDTENEEQIREGK 320


>ref|XP_011075252.1| PREDICTED: uncharacterized protein LOC105159763 [Sesamum indicum]
          Length = 476

 Score =  145 bits (365), Expect = 3e-33
 Identities = 80/235 (34%), Positives = 112/235 (47%)
 Frame = +3

Query: 276 ANVTASTSHQSNLPFDPKRIVPIGTHKEKDGQQVLGFSSLENDRLAETWKLTLIGKFSFA 455
           A+ TA ++H    P D       GT    D    L F+  E + LA  ++  L+GKFS  
Sbjct: 81  ASKTAKSAHHKYFPTDSPPAA-FGTVLTGDNGPTLQFTDAETEILAAPFRFALVGKFSHG 139

Query: 456 IPHPKGIVSGFSALNLKGPFSWSFANPSHIIIKLHLEEDYNKLWMGTIWSLADCPMRVFK 635
            P    +    +   +K  F+                                 PMRVFK
Sbjct: 140 APSYSMLHKLMAGTGIKNRFT-------------------------------GYPMRVFK 168

Query: 636 WTPSFNPRMEAPLAPVWIRLPGLPIHFFDHNALFAISKVIGTPLQVDSPTAVRSRLSMAR 815
           WTP+F P  E+ + P W+  P LP + F    LF ++ +IGTPLQ+D  T  +S+LS AR
Sbjct: 169 WTPTFTPSQESSIVPGWVSFPELPAYLFRKEVLFTVASMIGTPLQIDDATLNQSKLSKAR 228

Query: 816 VCVELDLLKEKIPEIILEFAGTIHVQKIVYERIPDYCTHCKHIGHSIEGCYMNGN 980
            C+ELDLLK ++    ++  GT  VQ+I YE IP YC+ CK +GH    CY  G+
Sbjct: 229 ACIELDLLKPRLENFQIQICGTTIVQRIEYEDIPHYCSLCKQVGHQDSDCYTKGD 283


>ref|XP_012841289.1| PREDICTED: uncharacterized protein LOC105961601 [Erythranthe
           guttata]
          Length = 449

 Score =  143 bits (360), Expect = 8e-33
 Identities = 68/166 (40%), Positives = 104/166 (62%), Gaps = 4/166 (2%)
 Frame = +3

Query: 495 LNLKGPFSWSFANPSHIIIKLHLEEDYNKLWMGTIWSLADCPMRVFKWTPSFNPRMEAPL 674
           L  +G F     N  H++I+  + +DY+ L   +I  +   PMRVFK+TP FN + E  +
Sbjct: 8   LKPRGSFELHKLNYRHVLIQFSVLDDYSLLLRRSICYIHGLPMRVFKYTPGFNLKNETSI 67

Query: 675 APVWIRLPGLPIHFFDHNALFAISKVIGTPLQVDSPTAVRSRLSMARVCVELDLLKEKIP 854
           APVW+ +PG+P + ++  A+F ++  IG PL+ D  TA R ++S+AR CVE+DLLK ++ 
Sbjct: 68  APVWVNVPGVPPYMYNREAIFFLASSIGNPLEFDDFTADRKKISVARFCVEIDLLKPRVE 127

Query: 855 EIILEFAGTIHVQKIV----YERIPDYCTHCKHIGHSIEGCYMNGN 980
           +I +   G   ++ I     YE +P +CT C H+GHS+E CYMNGN
Sbjct: 128 QIPV-MTGYDDIEMISLPGNYENVPKFCTFCSHLGHSVENCYMNGN 172


>ref|XP_007023857.1| Uncharacterized protein TCM_028230 [Theobroma cacao]
           gi|508779223|gb|EOY26479.1| Uncharacterized protein
           TCM_028230 [Theobroma cacao]
          Length = 748

 Score =  147 bits (371), Expect = 8e-33
 Identities = 70/180 (38%), Positives = 105/180 (58%), Gaps = 8/180 (4%)
 Frame = +3

Query: 465 PKGIVSGFSALNLKGPFSWSFANPSHIIIKLHLEEDYNKLWMGTIWSLADCPMRVFKWTP 644
           P  I + F  + L G +   + +  HI I L  E+D N++W+  +W +++  +RVFKWT 
Sbjct: 90  PTEIRNAFKGIGLAGAYDIRWLDYKHIHIGLSNEQDMNRIWLKQVWFISNQKLRVFKWTK 149

Query: 645 SFNPRMEAPLAPVWIRLPGLPIHFFDHNALFAISKVIGTPLQVDSPTAVRSRLSMARVCV 824
            F P  E+ L PVWI  P L  H ++ +A+  I+K +G PL VD  T   +R S+ARVC+
Sbjct: 150 DFQPEKESSLVPVWISFPNLRAHLYEKSAVLVIAKTVGRPLFVDEATDNGTRPSLARVCI 209

Query: 825 ELDLLKEKIPEIIL--------EFAGTIHVQKIVYERIPDYCTHCKHIGHSIEGCYMNGN 980
           E D LK  + ++ +        E  G   +QK+ +ER+PDYCTHC H+GHS+  C + GN
Sbjct: 210 EYDCLKPPLDQVWIVMRDRRTGEITGGF-MQKVDFERMPDYCTHCCHVGHSVSTCIVMGN 268


>ref|XP_011094921.1| PREDICTED: uncharacterized protein LOC105174492 [Sesamum indicum]
          Length = 171

 Score =  134 bits (337), Expect = 1e-32
 Identities = 58/120 (48%), Positives = 78/120 (65%)
 Frame = +3

Query: 621 MRVFKWTPSFNPRMEAPLAPVWIRLPGLPIHFFDHNALFAISKVIGTPLQVDSPTAVRSR 800
           MRVFKWTP+F P  E+ + PVW+  P LP H F    LF ++ +I TPLQ+D  T  +S+
Sbjct: 1   MRVFKWTPTFTPSKESSIVPVWVSFPKLPAHLFRKEVLFTVASMIETPLQIDDATLNQSK 60

Query: 801 LSMARVCVELDLLKEKIPEIILEFAGTIHVQKIVYERIPDYCTHCKHIGHSIEGCYMNGN 980
           LS AR C+ELDLLK ++ +  ++  G   VQ+I YE IP YC+ CKH+GH    CY  G+
Sbjct: 61  LSKARACIELDLLKPRLEDFQIQICGATIVQRIEYEDIPHYCSLCKHVGHRDSDCYTEGD 120


Top