BLASTX nr result

ID: Rehmannia27_contig00019744 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia27_contig00019744
         (1182 letters)

Database: ./nr 
           84,704,028 sequences; 31,038,470,784 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011092915.1| PREDICTED: uncharacterized protein LOC105172...   179   2e-48
ref|XP_011101871.1| PREDICTED: uncharacterized protein LOC105179...   168   1e-42
ref|XP_011071645.1| PREDICTED: uncharacterized protein LOC105157...   162   8e-42
emb|CDP20930.1| unnamed protein product [Coffea canephora]            153   2e-38
ref|XP_012846704.1| PREDICTED: uncharacterized protein LOC105966...   150   1e-36
ref|XP_007010391.1| Uncharacterized protein TCM_044158 [Theobrom...   149   7e-36
ref|XP_007026454.1| Uncharacterized protein TCM_030494 [Theobrom...   142   1e-33
ref|XP_007031319.1| Uncharacterized protein TCM_016772 [Theobrom...   141   4e-33
ref|XP_012065816.1| PREDICTED: uncharacterized protein LOC105628...   136   6e-33
ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobrom...   139   4e-32
ref|XP_007017130.1| Uncharacterized protein TCM_042329 [Theobrom...   137   2e-31
ref|XP_007026455.1| Uncharacterized protein TCM_021519 [Theobrom...   135   2e-31
ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobrom...   135   5e-31
ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobrom...   135   7e-31
ref|XP_007046403.1| Uncharacterized protein TCM_011922 [Theobrom...   134   1e-30
ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobrom...   134   1e-30
ref|XP_012084349.1| PREDICTED: uncharacterized protein LOC105643...   125   2e-30
ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobrom...   132   4e-30
ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobrom...   127   3e-28
ref|XP_007031317.1| Uncharacterized protein TCM_016768 [Theobrom...   122   6e-28

>ref|XP_011092915.1| PREDICTED: uncharacterized protein LOC105172985 [Sesamum indicum]
          Length = 470

 Score =  179 bits (455), Expect = 2e-48
 Identities = 87/209 (41%), Positives = 118/209 (56%)
 Frame = +1

Query: 439  IGTHKEKDGQQVIGFSSSENNRLAETWKLTLIGKFSFAIPHPKGIASGFSALNLKGPFSW 618
            +G      G +V+ FSS E +RL+  ++  L+GKFS   P  + +     A   +G FS 
Sbjct: 1    MGVLSRDQGMKVLRFSSDEISRLSLPFRYALVGKFSHGYPSMQNLRRWMLAQGFRGDFSV 60

Query: 619  SFANPSHIIIKLHLEEDYNKLWMGTLWSLGDCPMRVFKWTPSFNPKTEAPLAPVWIRLPG 798
               N  H+ IK  LEEDY KLW+ + W +   PMRVFKWTP+FNP+ E+P+ PVW+RLP 
Sbjct: 61   GAINVRHVFIKFALEEDYTKLWIKSTWFVEGFPMRVFKWTPTFNPREESPIVPVWVRLPE 120

Query: 799  LPIHFFDHNALFAICKIIGTPLQMDSPTATRTRLSMARXXXXXXXXXXXXXXXXXXFDGT 978
            LPI FFD  ALF+I  ++GTPL+ D  TAT  R S+AR                      
Sbjct: 121  LPIQFFDREALFSIAHLLGTPLRTDVSTATLVRPSVARVCVEINLLEPLQTEIGLGIGTE 180

Query: 979  THVQKIVFERTPDYCLHCKHIGHTIEGCY 1065
              +Q +++ER P YC  CKH+GH  + CY
Sbjct: 181  VIIQPVIYERLPKYCGACKHLGHDEDECY 209


>ref|XP_011101871.1| PREDICTED: uncharacterized protein LOC105179909 [Sesamum indicum]
          Length = 733

 Score =  168 bits (425), Expect = 1e-42
 Identities = 77/214 (35%), Positives = 117/214 (54%)
 Frame = +1

Query: 436  PIGTHKEKDGQQVIGFSSSENNRLAETWKLTLIGKFSFAIPHPKGIASGFSALNLKGPFS 615
            P+G      G+  I F+++E   LA  ++ +L+GKFS   P    +    + L ++G F+
Sbjct: 71   PLGIKSVNQGRPTISFTNTETEELAAPFRFSLVGKFSHGAPPYSQMHQLIARLGIQGAFT 130

Query: 616  WSFANPSHIIIKLHLEEDYNKLWMGTLWSLGDCPMRVFKWTPSFNPKTEAPLAPVWIRLP 795
             S  N  H +I L  E DY++LW+  +W L   PMR+FKWTP+F P  E+ + P+++  P
Sbjct: 131  VSMINSKHTLISLSCESDYSRLWLRRIWFLQGFPMRIFKWTPTFTPTQESSVVPIFVCFP 190

Query: 796  GLPIHFFDHNALFAICKIIGTPLQMDSPTATRTRLSMARXXXXXXXXXXXXXXXXXXFDG 975
             LP H F   ALF++  ++G+PLQ+D+ T  +++LS AR                   + 
Sbjct: 191  KLPAHLFHKEALFSVASMVGSPLQIDALTLNKSKLSQARVCVEIDLLKPIIEEFDLHIND 250

Query: 976  TTHVQKIVFERTPDYCLHCKHIGHTIEGCYMNGN 1077
             T VQK+VFE  P YC  CKH+GH    C+  GN
Sbjct: 251  VTIVQKVVFEYLPKYCFLCKHVGHKDSDCFSKGN 284


>ref|XP_011071645.1| PREDICTED: uncharacterized protein LOC105157045 [Sesamum indicum]
          Length = 507

 Score =  162 bits (411), Expect = 8e-42
 Identities = 90/258 (34%), Positives = 123/258 (47%), Gaps = 7/258 (2%)
 Frame = +1

Query: 325  SGSRPPEKPPLKHGSYANVTASTSRQSNLPFDPKRIVP-------IGTHKEKDGQQVIGF 483
            S S P    P K  ++A V A T         P +  P       IGT    D    + F
Sbjct: 59   SSSIPTSNFPKK--TFAEVLAPTRASKPATPAPHKYFPVDLPSPGIGTVLTGDKGPTLLF 116

Query: 484  SSSENNRLAETWKLTLIGKFSFAIPHPKGIASGFSALNLKGPFSWSFANPSHIIIKLHLE 663
            +  E   LA  +K  L+GKFS   P    +    +   +K  F+ S  N  H++I L  E
Sbjct: 117  TDDETEVLAAPFKFALVGKFSHGAPSYSILHKLIAGTGIKNKFTVSMLNTRHVLISLSCE 176

Query: 664  EDYNKLWMGTLWSLGDCPMRVFKWTPSFNPKTEAPLAPVWIRLPGLPIHFFDHNALFAIC 843
             D+++LW+  +W +   PMRVFKWTP+F P  E+ + PVW+  P LP H F    LF + 
Sbjct: 177  ADFSRLWLRRIWYIQGYPMRVFKWTPAFTPSKESSIVPVWVSFPELPAHLFRKEVLFTVA 236

Query: 844  KIIGTPLQMDSPTATRTRLSMARXXXXXXXXXXXXXXXXXXFDGTTHVQKIVFERTPDYC 1023
             +IGTPLQ+D  T  +++LS AR                    GTT VQ+I +E  P YC
Sbjct: 237  SMIGTPLQIDDATLNQSKLSKARACIELDLLKPRLENFQIQICGTTIVQRIEYEDIPHYC 296

Query: 1024 LHCKHIGHTIEGCYMNGN 1077
              CKH+GH    CY  G+
Sbjct: 297  SLCKHVGHQDSDCYTKGD 314


>emb|CDP20930.1| unnamed protein product [Coffea canephora]
          Length = 497

 Score =  153 bits (387), Expect = 2e-38
 Identities = 80/204 (39%), Positives = 115/204 (56%), Gaps = 1/204 (0%)
 Frame = +1

Query: 463  GQQVIGFSSSENNRLAETWKLTLIGKFSFAIPHPKGIASGFSALNLKGPFSWSFANPSHI 642
            G+  + FS ++ ++LA  ++  L+GKFS   P  + I   F++LNLK   S    +  H+
Sbjct: 43   GEAAVVFSKADADKLAAPFQWALVGKFSHGRPSLEDIRKFFASLNLKDHVSIGLMDYRHV 102

Query: 643  IIKLHLEEDYNKLWMGTLWSLGDCPMRVFKWTPSFNPKTEAPLAPVWIRLPGLPIHFFDH 822
            +IK   E D+N++WM  +W LG  PMRVF+WT  F+   E+ LAPVW+ LP LPIH+FD 
Sbjct: 103  LIKCMAEADFNRIWMRGIWQLGKYPMRVFRWTREFHVLRESSLAPVWVVLPALPIHYFDK 162

Query: 823  NALFAICKIIGTPLQMDSPTATRTRLSMARXXXXXXXXXXXXXXXXXXFDGTTHV-QKIV 999
            ++LF+I   +G PL +DS TA  TR S+AR                   +G +   Q+IV
Sbjct: 163  HSLFSILSPVGRPLFLDSATAAGTRPSLARVCVELDVAKSFTQRVWVAVEGESGFWQRIV 222

Query: 1000 FERTPDYCLHCKHIGHTIEGCYMN 1071
             E  P YC  C  +GH+ E C  N
Sbjct: 223  PENMPLYCSSCSRLGHSQEQCKKN 246


>ref|XP_012846704.1| PREDICTED: uncharacterized protein LOC105966659 [Erythranthe guttata]
          Length = 582

 Score =  150 bits (378), Expect = 1e-36
 Identities = 80/219 (36%), Positives = 117/219 (53%), Gaps = 3/219 (1%)
 Frame = +1

Query: 430  IVPIGTHKEKDGQQVIGFSSSENNRLAETWKLTLIGKFSFAIPHPKGIASGFSALNLKGP 609
            I PIGT K  DG+ V+ FS  E +++ E  K TLIGKFS  I H K +      L  +G 
Sbjct: 88   IAPIGTIKVIDGKNVLYFSKEEVDKMLEPLKYTLIGKFSHGIHHYKVMEKFIYDLKPRGS 147

Query: 610  FSWSFANPSHIIIKLHLEEDYNKLWMGTLWSLGDCPMRVFKWTPSFNPKTEAPLAPVWIR 789
            F     N  H++I+  + + Y+ L   ++  +   PMRVFK+TP FN K E  +APVW+ 
Sbjct: 148  FELHKLNYRHVLIQFSVLDYYSLLLRRSICYIDGLPMRVFKYTPGFNLKNETSIAPVWVN 207

Query: 790  LPGLPIHFFDHNALFAICKIIGTPLQMDSPTATRTRLSMAR--XXXXXXXXXXXXXXXXX 963
            +PG+P + ++  A+F +   IG PL+ D  TA R +LS+AR                   
Sbjct: 208  VPGVPPYMYNREAIFFLASSIGNPLEFDDFTADRKKLSVARFCVEIDLLKPRVEQIPVMT 267

Query: 964  XFDGTTHVQ-KIVFERTPDYCLHCKHIGHTIEGCYMNGN 1077
             +D    +   + +E  P +C  C H+GH++E CYMNGN
Sbjct: 268  GYDDVEMISLPVNYENVPKFCTFCSHLGHSVENCYMNGN 306


>ref|XP_007010391.1| Uncharacterized protein TCM_044158 [Theobroma cacao]
            gi|508727304|gb|EOY19201.1| Uncharacterized protein
            TCM_044158 [Theobroma cacao]
          Length = 830

 Score =  149 bits (376), Expect = 7e-36
 Identities = 93/290 (32%), Positives = 142/290 (48%), Gaps = 19/290 (6%)
 Frame = +1

Query: 268  PSALPTLYQVVFCPPSMADSGSRPP------EKPPLKHG-SYANVTASTSRQSNLPF--- 417
            P  LPTL  V    PSM  SG+ P        KP L HG + A V+  T ++S L     
Sbjct: 7    PDPLPTLPPVA--TPSMLQSGATPNALATENSKPSLSHGHTQAPVSPRTQKKSFLAVAAG 64

Query: 418  DPKRIVPIGTHK--EKDGQQVIGFSSSENNRLAETWKLTLIGKFSFAIPHPKGIASGFSA 591
            +   ++P+       KD +    F   E + LA+ +K +++GKFS  +   + I   F  
Sbjct: 65   EKSSLIPLDREPFWYKD-RPAASFFDDEISTLAQPFKFSMVGKFSRML-RMQEIRVAFKG 122

Query: 592  LNLKGPFSWSFANPSHIIIKLHLEEDYNKLWMGTLWSLGDCPMRVFKWTPSFNPKTEAPL 771
            + L G +   + +  HI+I+L  E D N++W+  +W + +  MRVFKW+P F P+ E+ +
Sbjct: 123  IGLIGAYEIRWLDYKHILIQLSNEHDLNRIWLKQVWFISNQKMRVFKWSPEFQPEKESSM 182

Query: 772  APVWIRLPGLPIHFFDHNALFAICKIIGTPLQMDSPTATRTRLSMARXXXXXXXXXXXXX 951
             PVWI  P L  H ++ +AL AI K +G PL +D  TA  TR S+AR             
Sbjct: 183  VPVWISFPNLKAHLYEKSALSAIVKTVGRPLMVDEATANGTRPSVARVCVEFDCQQPPID 242

Query: 952  XXXXXFDGTT-------HVQKIVFERTPDYCLHCKHIGHTIEGCYMNGNK 1080
                             ++QK+ F R  ++C HC H+GH +  C + GN+
Sbjct: 243  QVWIVTRNRQSGSVMGGYMQKVEFARLSEFCTHCSHVGHGVSSCMVIGNR 292


>ref|XP_007026454.1| Uncharacterized protein TCM_030494 [Theobroma cacao]
            gi|508781820|gb|EOY29076.1| Uncharacterized protein
            TCM_030494 [Theobroma cacao]
          Length = 876

 Score =  142 bits (359), Expect = 1e-33
 Identities = 83/256 (32%), Positives = 121/256 (47%), Gaps = 7/256 (2%)
 Frame = +1

Query: 334  RPPEKPPLKHGSYANVTASTSRQSNLPFDPKRIVPIGTHKEKDGQQVIGFSSSENNRLAE 513
            +PP  P     S+ +V  +       P  P       T + KD +  + F   E   LA+
Sbjct: 79   QPPASPRTAKKSFLSVVNAVKLALVPPTRP-------TFRYKD-KPAVRFFEDEIEALAQ 130

Query: 514  TWKLTLIGKFSFAIPHPKGIASGFSALNLKGPFSWSFANPSHIIIKLHLEEDYNKLWMGT 693
             +K  ++GKFS  +P    I   F +L L G ++  + N  HI+I L  E+D+N++W   
Sbjct: 131  PFKFAIVGKFS-KMPRLTEIRQSFVSLGLSGVYNIRWMNYKHILIHLSNEQDFNRIWTKQ 189

Query: 694  LWSLGDCPMRVFKWTPSFNPKTEAPLAPVWIRLPGLPIHFFDHNALFAICKIIGTPLQMD 873
             W + +  MRVFKWTP F    E+P+ PVWI  P L  H F+ +AL  I K IG PL +D
Sbjct: 190  TWFITNQKMRVFKWTPDFETDKESPIVPVWISFPNLKAHLFEKSALLMIAKAIGNPLYID 249

Query: 874  SPTATRTRLSMARXXXXXXXXXXXXXXXXXXFD-------GTTHVQKIVFERTPDYCLHC 1032
              TA  TR S+AR                              ++QK+ F   P+YC HC
Sbjct: 250  EATANGTRPSVARVCIEYDCLKPPVDSVWIVVSKRGSEDMSGGYLQKVEFAPMPEYCNHC 309

Query: 1033 KHIGHTIEGCYMNGNK 1080
             H+GH +  C + G++
Sbjct: 310  CHVGHNVSKCLILGSR 325


>ref|XP_007031319.1| Uncharacterized protein TCM_016772 [Theobroma cacao]
            gi|508710348|gb|EOY02245.1| Uncharacterized protein
            TCM_016772 [Theobroma cacao]
          Length = 1296

 Score =  141 bits (356), Expect = 4e-33
 Identities = 84/264 (31%), Positives = 128/264 (48%), Gaps = 7/264 (2%)
 Frame = +1

Query: 310  PSMADSGSRPPEKPPLKHGSYANVTASTSRQSNLPFDPKRIVPIGTHKEKDGQQVIGFSS 489
            P   +  + PP  P ++  S+ +V A      N P  P    P   ++++       F  
Sbjct: 44   PQATNLSNYPPISPRMQKKSFLSVVAG----ENPPVIPLNREP-SWYRDRPAAS---FFD 95

Query: 490  SENNRLAETWKLTLIGKFSFAIPHPKGIASGFSALNLKGPFSWSFANPSHIIIKLHLEED 669
            +E   LA ++K ++IGKF+  +P  + I + F  + L G ++  + +  HI+I L  E D
Sbjct: 96   NEIATLALSFKFSMIGKFT-RMPKLQEIRTAFKGIGLVGAYNIRWLDYKHILIHLSNEHD 154

Query: 670  YNKLWMGTLWSLGDCPMRVFKWTPSFNPKTEAPLAPVWIRLPGLPIHFFDHNALFAICKI 849
             N++WM   W + +  MRVFKWTP F+P+ E+ L PVWI  P L  HF++ + L  I K 
Sbjct: 155  LNRIWMKQNWFIVNKKMRVFKWTPEFHPEKESSLVPVWISFPNLRAHFYEKSTLMMIAKS 214

Query: 850  IGTPLQMDSPTATRTRLSMARXXXXXXXXXXXXXXXXXXFDGTT-------HVQKIVFER 1008
            +G PL +D  TA  TR ++AR                               +QK+ F +
Sbjct: 215  VGRPLFVDEATANGTRPNVARICVEYDCQKSLLDQIWIVTRSRQTGEVTGGFIQKVEFVK 274

Query: 1009 TPDYCLHCKHIGHTIEGCYMNGNK 1080
             PDYC HC H+GH    C + GNK
Sbjct: 275  MPDYCTHCCHVGHNASACLVLGNK 298


>ref|XP_012065816.1| PREDICTED: uncharacterized protein LOC105628933 [Jatropha curcas]
          Length = 397

 Score =  136 bits (343), Expect = 6e-33
 Identities = 71/203 (34%), Positives = 100/203 (49%)
 Frame = +1

Query: 463  GQQVIGFSSSENNRLAETWKLTLIGKFSFAIPHPKGIASGFSALNLKGPFSWSFANPSHI 642
            G   I FS  E+ +LA  ++  L+G F    P+ K +      +  KG FS    + SHI
Sbjct: 58   GVPSISFSWDESMKLANQFRFALVGIFQSGRPNMKSLRQFMDKIGFKGEFSLGLLDSSHI 117

Query: 643  IIKLHLEEDYNKLWMGTLWSLGDCPMRVFKWTPSFNPKTEAPLAPVWIRLPGLPIHFFDH 822
            +IK  LEED+++ W+  +W      MR+ KWT +F P T+  + P WI   GLPIH F  
Sbjct: 118  LIKFELEEDFHRCWLKQIWYFQGFSMRISKWTRNFRPNTDCSIVPTWILFEGLPIHLFAK 177

Query: 823  NALFAICKIIGTPLQMDSPTATRTRLSMARXXXXXXXXXXXXXXXXXXFDGTTHVQKIVF 1002
             ALF I  +IG PL++D+ TAT +R S+AR                         Q + +
Sbjct: 178  AALFPIANLIGKPLKVDAATATLSRPSVARVCVELDLSKDLPNKVWIDDGDLGFFQPVNY 237

Query: 1003 ERTPDYCLHCKHIGHTIEGCYMN 1071
            E  P +C  C  IGH I  C +N
Sbjct: 238  ESLPLFCTKCCRIGHEILSCPLN 260


>ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
            gi|508710342|gb|EOY02239.1| Uncharacterized protein
            TCM_016763 [Theobroma cacao]
          Length = 2127

 Score =  139 bits (349), Expect = 4e-32
 Identities = 82/255 (32%), Positives = 124/255 (48%), Gaps = 7/255 (2%)
 Frame = +1

Query: 337  PPEKPPLKHGSYANVTASTSRQSNLPFDPKRIVPIGTHKEKDGQQVIGFSSSENNRLAET 516
            PP  P  +  S+ ++  S  + S +P      V    +K++       F   E + LA+ 
Sbjct: 76   PPSSPRFQKKSFLSIV-SGEKPSVVPLTRDPFV----YKDRPAA---AFFEDEIHILAQP 127

Query: 517  WKLTLIGKFSFAIPHPKGIASGFSALNLKGPFSWSFANPSHIIIKLHLEEDYNKLWMGTL 696
            +KL+L+GKFS  +P  + + S F  + L G +   + +  HI+I L  E+D+N+ W    
Sbjct: 128  FKLSLVGKFS-RMPKLQEVRSAFKGIGLAGSYEIRWLDYKHILIHLSNEQDFNRFWTKQA 186

Query: 697  WSLGDCPMRVFKWTPSFNPKTEAPLAPVWIRLPGLPIHFFDHNALFAICKIIGTPLQMDS 876
            W + +  MRVFKWTP F P+ E+ + PVWI  P L  H F+ +AL  I K +G PL +D 
Sbjct: 187  WFIANQKMRVFKWTPEFEPEKESAVVPVWISFPNLKAHLFEKSALLLIAKTVGKPLFIDE 246

Query: 877  PTATRTRLSMARXXXXXXXXXXXXXXXXXXFDG-------TTHVQKIVFERTPDYCLHCK 1035
             TA  +R S+AR                            + + QK+ F + P YC HC 
Sbjct: 247  ATANGSRPSVARVCIEYDCREPPVDQVWIVVQNRATGAVTSGYPQKVEFAQMPAYCDHCC 306

Query: 1036 HIGHTIEGCYMNGNK 1080
            H+GH    C + GNK
Sbjct: 307  HVGHKEINCIVLGNK 321


>ref|XP_007017130.1| Uncharacterized protein TCM_042329 [Theobroma cacao]
            gi|508787493|gb|EOY34749.1| Uncharacterized protein
            TCM_042329 [Theobroma cacao]
          Length = 2606

 Score =  137 bits (344), Expect = 2e-31
 Identities = 88/288 (30%), Positives = 133/288 (46%), Gaps = 16/288 (5%)
 Frame = +1

Query: 265  SPSALPTLYQVVFCPPSMADSGSRPPEKPPLKHGSYANVTASTSRQSN-------LPFDP 423
            +P A  T   +   PPS+    SR P     +  +   +    S +S        +  D 
Sbjct: 1639 APPAAETTTLLTTNPPSIWTKNSRLPLSHGCQQTTPTQIQPPPSPRSQKKSFLSIVSGDK 1698

Query: 424  KRIVPIGTHKE--KDGQQVIGFSSSENNRLAETWKLTLIGKFSFAIPHPKGIASGFSALN 597
              ++P+       KD +    F   E   LA+  KL+L+GKFS  +P  + + S F  + 
Sbjct: 1699 PPVIPLSRDPLVFKD-RPAAAFFEDEIQTLAQPLKLSLVGKFS-RMPKLQDVRSAFKGIG 1756

Query: 598  LKGPFSWSFANPSHIIIKLHLEEDYNKLWMGTLWSLGDCPMRVFKWTPSFNPKTEAPLAP 777
            L G +   + +  H++I L  E+D N++W   +W + +  MRVFKWTP F P+ E+ + P
Sbjct: 1757 LTGAYEVRWLDYKHVLIHLSNEQDCNRVWTKQVWFIANQKMRVFKWTPEFEPEKESAVVP 1816

Query: 778  VWIRLPGLPIHFFDHNALFAICKIIGTPLQMDSPTATRTRLSMARXXXXXXXXXXXXXXX 957
            VWI  P L  H F+ +AL  I K +G PL +D  TA  +R S+AR               
Sbjct: 1817 VWIAFPNLKAHLFEKSALLLIAKTVGKPLFVDEATANGSRPSVARVCIEFDCRRPPIDQV 1876

Query: 958  XXXFD----GTT---HVQKIVFERTPDYCLHCKHIGHTIEGCYMNGNK 1080
                     GT    + Q++ F + P YC HC H+GH    C + GNK
Sbjct: 1877 WIVVQNRETGTVTSGYPQRVEFSQMPAYCDHCCHVGHKENDCIVLGNK 1924



 Score =  135 bits (340), Expect = 6e-31
 Identities = 84/259 (32%), Positives = 126/259 (48%), Gaps = 10/259 (3%)
 Frame = +1

Query: 334  RPPEKPPLKHGSYANVTASTSRQSNLPFDPKRIVPIGTHKEK---DGQQVIGFSSSENNR 504
            + P  P  +  S+ +V A    Q         I+P  T++E      +  + F   E   
Sbjct: 52   KTPVSPRAQKKSFLSVAAGEKLQ---------IIP--TNREPFWYRDRPAVAFFEDEIVA 100

Query: 505  LAETWKLTLIGKFSFAIPHPKGIASGFSALNLKGPFSWSFANPSHIIIKLHLEEDYNKLW 684
            LA+ +K +++GKFS  +P    I + F  ++L G +   + +  HI+I L  E+D N+LW
Sbjct: 101  LAQPFKHSMVGKFS-RMPKLNDIRAAFKGISLVGVYEIRWLDYKHILIHLSNEQDLNRLW 159

Query: 685  MGTLWSLGDCPMRVFKWTPSFNPKTEAPLAPVWIRLPGLPIHFFDHNALFAICKIIGTPL 864
            M   W + +  MRVFKWTP F P+ E+ L PVWI  P L  H ++ +AL  I K +G PL
Sbjct: 160  MRQAWFIANQKMRVFKWTPDFQPEKESSLVPVWISFPNLRAHLYEKSALLMIAKSVGRPL 219

Query: 865  QMDSPTATRTRLSMARXXXXXXXXXXXXXXXXXXF-DGTT------HVQKIVFERTPDYC 1023
             +D  TA  TR S+AR                    D  T        QK+ F + P+YC
Sbjct: 220  FVDEATANGTRPSVARVCVEYDCQQPPLEQIWIVTRDRRTGDITGGFQQKVDFAKLPNYC 279

Query: 1024 LHCKHIGHTIEGCYMNGNK 1080
             HC H+GH+   C + G++
Sbjct: 280  THCCHVGHSASTCLVMGHR 298


>ref|XP_007026455.1| Uncharacterized protein TCM_021519 [Theobroma cacao]
            gi|508715060|gb|EOY06957.1| Uncharacterized protein
            TCM_021519 [Theobroma cacao]
          Length = 667

 Score =  135 bits (341), Expect = 2e-31
 Identities = 79/260 (30%), Positives = 125/260 (48%), Gaps = 7/260 (2%)
 Frame = +1

Query: 319  ADSGSRPPEKPPLKHGSYANVTASTSRQSNLPFDPKRIVPIGTHKEKDGQQVIGFSSSEN 498
            +D+ ++PP  P  +  S+ ++ A     S  P  P    P   +K++       F   E 
Sbjct: 41   SDNHTQPPTSPRFQKKSFLSIAAG----SKPPVIPLNRDP-AVYKDRPAAV---FYEDEI 92

Query: 499  NRLAETWKLTLIGKFSFAIPHPKGIASGFSALNLKGPFSWSFANPSHIIIKLHLEEDYNK 678
              LA+ + L L+GKF+  +P  + + S F  + L G +   + +  H++I L  ++D+N+
Sbjct: 93   CILAKPFSLCLVGKFT-RMPKLQEVRSAFKGIGLSGAYEIKWLDYKHVLIHLSNDQDFNR 151

Query: 679  LWMGTLWSLGDCPMRVFKWTPSFNPKTEAPLAPVWIRLPGLPIHFFDHNALFAICKIIGT 858
            +W    W +    MR+FKW+P F  + E+P+ PVWI  P L  H ++ +AL  I K IG 
Sbjct: 152  IWTRQQWFIVGQKMRIFKWSPEFEAEKESPVVPVWISFPNLKAHLYEKSALLLIAKTIGK 211

Query: 859  PLQMDSPTATRTRLSMARXXXXXXXXXXXXXXXXXXFDGTT-------HVQKIVFERTPD 1017
            PL +D PTA  +R S+AR                              + QK+ F + PD
Sbjct: 212  PLFVDEPTAKGSRPSVARVCVEYDCREPPIDQVWIVTQKRETGMVTNGYAQKVEFSQMPD 271

Query: 1018 YCLHCKHIGHTIEGCYMNGN 1077
            YC HC H+GH    C + GN
Sbjct: 272  YCEHCCHVGHNETTCLVLGN 291


>ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
            gi|508778198|gb|EOY25454.1| Uncharacterized protein
            TCM_026877 [Theobroma cacao]
          Length = 2367

 Score =  135 bits (340), Expect = 5e-31
 Identities = 75/207 (36%), Positives = 105/207 (50%), Gaps = 7/207 (3%)
 Frame = +1

Query: 481  FSSSENNRLAETWKLTLIGKFSFAIPHPKGIASGFSALNLKGPFSWSFANPSHIIIKLHL 660
            F   E   LA+  KL+L+GKFS  +P  + + S F  + L G +   + +  HI+I L  
Sbjct: 122  FYEDEIQTLAQPLKLSLVGKFS-RMPKLQDVRSAFKGIGLAGAYEVRWLDYKHILIHLTN 180

Query: 661  EEDYNKLWMGTLWSLGDCPMRVFKWTPSFNPKTEAPLAPVWIRLPGLPIHFFDHNALFAI 840
            E D N++W   +W + +  MRVFKWTP F P+ E+ + PVWI  P L  H F+ +AL  I
Sbjct: 181  EHDCNRVWTKQVWFIANQKMRVFKWTPEFEPEKESAMVPVWIAFPNLKAHLFEKSALLLI 240

Query: 841  CKIIGTPLQMDSPTATRTRLSMARXXXXXXXXXXXXXXXXXXFD----GTT---HVQKIV 999
             K +G PL +D  TA  +R S+AR                        GT    + QK+ 
Sbjct: 241  AKTVGKPLFVDEATANGSRPSVARVCIEYDCRKPPIDQVWIVVQNRETGTVTSGYPQKVE 300

Query: 1000 FERTPDYCLHCKHIGHTIEGCYMNGNK 1080
            F + P YC HC H+GH    C + GNK
Sbjct: 301  FSQMPAYCDHCCHVGHKEIDCIVLGNK 327


>ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobroma cacao]
            gi|508715062|gb|EOY06959.1| Uncharacterized protein
            TCM_021521 [Theobroma cacao]
          Length = 1951

 Score =  135 bits (339), Expect = 7e-31
 Identities = 82/256 (32%), Positives = 121/256 (47%), Gaps = 7/256 (2%)
 Frame = +1

Query: 334  RPPEKPPLKHGSYANVTASTSRQSNLPFDPKRIVPIGTHKEKDGQQVIGFSSSENNRLAE 513
            +PP  P  +  S+ +V A        P  P    P         +  + F   E   LA+
Sbjct: 52   KPPVSPRAQKKSFLSVAAGEKP----PIIPTNREPFWYRD----RPAVAFFEDEIVALAQ 103

Query: 514  TWKLTLIGKFSFAIPHPKGIASGFSALNLKGPFSWSFANPSHIIIKLHLEEDYNKLWMGT 693
             +K +++GKFS  +P    I + F  + L G +   + +  HI+I L  E+D N+LWM  
Sbjct: 104  PFKHSMVGKFS-RMPKLNDIRAAFKGIGLVGVYEIRWLDYKHILIHLSNEQDLNRLWMRQ 162

Query: 694  LWSLGDCPMRVFKWTPSFNPKTEAPLAPVWIRLPGLPIHFFDHNALFAICKIIGTPLQMD 873
             W + +  MRVFKW+P F P+ E+ L PVWI  P L  H ++ +AL  I K +G PL +D
Sbjct: 163  AWFIANQKMRVFKWSPDFQPEKESSLVPVWISFPNLRAHLYEKSALLMIAKSVGRPLFVD 222

Query: 874  SPTATRTRLSMARXXXXXXXXXXXXXXXXXXF-DGTT------HVQKIVFERTPDYCLHC 1032
              TA  TR S+AR                    D  T        QK+ F + P+YC HC
Sbjct: 223  EATANGTRPSVARVCVEYDCQQPPLEQIWIVSRDRRTGDITGGFQQKVDFAKLPNYCTHC 282

Query: 1033 KHIGHTIEGCYMNGNK 1080
             H+GH+   C + G++
Sbjct: 283  CHVGHSASTCLVMGHR 298


>ref|XP_007046403.1| Uncharacterized protein TCM_011922 [Theobroma cacao]
            gi|508710338|gb|EOY02235.1| Uncharacterized protein
            TCM_011922 [Theobroma cacao]
          Length = 928

 Score =  134 bits (336), Expect = 1e-30
 Identities = 81/260 (31%), Positives = 126/260 (48%), Gaps = 7/260 (2%)
 Frame = +1

Query: 319  ADSGSRPPEKPPLKHGSYANVTASTSRQSNLPFDPKRIVPIGTHKEKDGQQVIGFSSSEN 498
            +D  ++PP  P  +  S+ ++TA  S+   +P +   +V       KD    + F   E 
Sbjct: 41   SDPHTQPPTSPRFQKKSFLSITAG-SKPPVIPLNRNPVV------YKDRPAAV-FYEDEI 92

Query: 499  NRLAETWKLTLIGKFSFAIPHPKGIASGFSALNLKGPFSWSFANPSHIIIKLHLEEDYNK 678
              LA+ + L L+GKF+  +P  + + S F  + L G +   + +  H+II L  ++D+N+
Sbjct: 93   CILAKPFSLCLVGKFT-RMPKLQEVRSAFKGIGLSGAYEIKWLDYKHVIIHLSNDQDFNR 151

Query: 679  LWMGTLWSLGDCPMRVFKWTPSFNPKTEAPLAPVWIRLPGLPIHFFDHNALFAICKIIGT 858
            +W    W +    MR+FKW+P F  + E+P+ PVWI  P L  H ++  AL  I K IG 
Sbjct: 152  IWTRQQWFIVGQKMRIFKWSPEFEAEKESPVVPVWISFPNLKAHLYEKFALLLIAKTIGR 211

Query: 859  PLQMDSPTATRTRLSMARXXXXXXXXXXXXXXXXXXFD----GTT---HVQKIVFERTPD 1017
            PL +D  TA  +R S+AR                        GT    + QK+ F + P 
Sbjct: 212  PLFVDEATAKGSRPSVARVCAEYDCRKPPINQVWIVTQKRETGTVTNGYAQKVEFSQMPA 271

Query: 1018 YCLHCKHIGHTIEGCYMNGN 1077
            YC HC H+GH    C + GN
Sbjct: 272  YCDHCCHVGHNETNCLVLGN 291


>ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobroma cacao]
            gi|508715063|gb|EOY06960.1| Uncharacterized protein
            TCM_021522 [Theobroma cacao]
          Length = 3503

 Score =  134 bits (337), Expect = 1e-30
 Identities = 70/207 (33%), Positives = 104/207 (50%), Gaps = 7/207 (3%)
 Frame = +1

Query: 481  FSSSENNRLAETWKLTLIGKFSFAIPHPKGIASGFSALNLKGPFSWSFANPSHIIIKLHL 660
            F   E   LA+ +KL+L+GKFS  +P  + + + F  + L G +   + +  H++I L  
Sbjct: 1788 FFEDEIQTLAKPFKLSLVGKFS-RMPKLQDVRAAFKGIGLAGAYEVRWLDYKHVLIHLSN 1846

Query: 661  EEDYNKLWMGTLWSLGDCPMRVFKWTPSFNPKTEAPLAPVWIRLPGLPIHFFDHNALFAI 840
            E+D+N++W    W +    MRVFKWTP F P+ E+ + PVWI  P L  H F+ +AL  I
Sbjct: 1847 EQDFNRIWTKQNWFIATQKMRVFKWTPEFEPEKESAVVPVWISFPNLKAHLFEKSALLLI 1906

Query: 841  CKIIGTPLQMDSPTATRTRLSMARXXXXXXXXXXXXXXXXXXFDG-------TTHVQKIV 999
             K +G PL +D  TA  +R S+AR                              + Q++ 
Sbjct: 1907 AKTVGKPLFVDEATANGSRPSVARVCVEFDCRQPPLDQVWIVVQNRKTGEITNGYSQRVE 1966

Query: 1000 FERTPDYCLHCKHIGHTIEGCYMNGNK 1080
            F + P YC HC H+GH    C + GNK
Sbjct: 1967 FAQMPAYCDHCCHVGHKETDCILLGNK 1993



 Score =  125 bits (315), Expect = 1e-27
 Identities = 61/183 (33%), Positives = 94/183 (51%), Gaps = 7/183 (3%)
 Frame = +1

Query: 553  IPHPKGIASGFSALNLKGPFSWSFANPSHIIIKLHLEEDYNKLWMGTLWSLGDCPMRVFK 732
            +P  + I   F  + L G +   + +  HI+I L  E+D+N++W    W + +  MRVFK
Sbjct: 1    MPKMQEIRQAFKGIGLTGAYVIRWLDYKHILIHLSNEQDFNRIWTKQQWFIANQKMRVFK 60

Query: 733  WTPSFNPKTEAPLAPVWIRLPGLPIHFFDHNALFAICKIIGTPLQMDSPTATRTRLSMAR 912
            W+P F  + E+P+ PVWI  P L  H ++ +AL  I K +G PL +D  T+  +R S+AR
Sbjct: 61   WSPDFEAEKESPIVPVWISFPNLKAHLYEKSALLLIAKTVGKPLFIDEATSNASRPSVAR 120

Query: 913  XXXXXXXXXXXXXXXXXXF----DGTT---HVQKIVFERTPDYCLHCKHIGHTIEGCYMN 1071
                                    GT    + QK+ F + PDYC HC H+GH++  C + 
Sbjct: 121  VCVEYNCRNAPVEEIWIVIKDRVTGTVTGGYAQKVEFSKMPDYCEHCGHVGHSVSTCLVL 180

Query: 1072 GNK 1080
            GN+
Sbjct: 181  GNR 183


>ref|XP_012084349.1| PREDICTED: uncharacterized protein LOC105643761 [Jatropha curcas]
          Length = 220

 Score =  125 bits (314), Expect = 2e-30
 Identities = 64/202 (31%), Positives = 106/202 (52%)
 Frame = +1

Query: 307 PPSMADSGSRPPEKPPLKHGSYANVTASTSRQSNLPFDPKRIVPIGTHKEKDGQQVIGFS 486
           PP++A  G           G   ++ A    +S    D  + +   T  +  G   + F+
Sbjct: 7   PPAVAPQGVAGRNFASALTGQSFSLKAVEDLRSFASIDINKRLSSKTPSKYKGFPAVNFT 66

Query: 487 SSENNRLAETWKLTLIGKFSFAIPHPKGIASGFSALNLKGPFSWSFANPSHIIIKLHLEE 666
             +  +L+  ++  LIG F F  P  + +   F  ++ KG F+    + +HI+I   LEE
Sbjct: 67  DDDIQKLSLPYRYALIGAFMFGRPSMQALKKAFDMISFKGDFTLGLIDSNHILINFVLEE 126

Query: 667 DYNKLWMGTLWSLGDCPMRVFKWTPSFNPKTEAPLAPVWIRLPGLPIHFFDHNALFAICK 846
           D+ + W+   W      MRV KW+ SF P  ++ +A +W+ LPGLPIHFFD  AL++I +
Sbjct: 127 DFLRCWLRQTWFFNGFSMRVSKWSASFRPDVDSSIALIWVSLPGLPIHFFDKEALYSIVE 186

Query: 847 IIGTPLQMDSPTATRTRLSMAR 912
           +IG PL++D+ TA+  R S+A+
Sbjct: 187 LIGRPLKVDASTASLIRPSVAK 208


>ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
            gi|508710341|gb|EOY02238.1| Uncharacterized protein
            TCM_016762 [Theobroma cacao]
          Length = 2214

 Score =  132 bits (333), Expect = 4e-30
 Identities = 78/260 (30%), Positives = 124/260 (47%), Gaps = 7/260 (2%)
 Frame = +1

Query: 319  ADSGSRPPEKPPLKHGSYANVTASTSRQSNLPFDPKRIVPIGTHKEKDGQQVIGFSSSEN 498
            +D+ ++PP  P  +  S+ ++ A     S  P  P    P   +K++       F   E 
Sbjct: 41   SDNHTQPPTSPRFQKKSFLSIAAG----SKPPVIPLNRDP-AVYKDRPAAV---FYEDEI 92

Query: 499  NRLAETWKLTLIGKFSFAIPHPKGIASGFSALNLKGPFSWSFANPSHIIIKLHLEEDYNK 678
              LA+ + L L+GKF+  +P  + + S F  + L G +   + +  H++I L  ++D+N+
Sbjct: 93   CILAKPFSLCLVGKFT-RMPKLQEVRSAFKGIGLSGAYEIKWLDYKHVLIHLSNDQDFNR 151

Query: 679  LWMGTLWSLGDCPMRVFKWTPSFNPKTEAPLAPVWIRLPGLPIHFFDHNALFAICKIIGT 858
            +W    W +    MR+FKW+P F  + E+P+ PVWI  P L  H ++ +AL  I K IG 
Sbjct: 152  IWTRQQWFIVGQKMRIFKWSPEFEAEKESPVVPVWISFPNLKAHLYEKSALLLIAKTIGK 211

Query: 859  PLQMDSPTATRTRLSMARXXXXXXXXXXXXXXXXXXFDGTT-------HVQKIVFERTPD 1017
            PL +D  TA  +R S+AR                              + QK+ F + PD
Sbjct: 212  PLFVDEATAKGSRPSVARVCVEYDCREPPIDQVWIVTQKRETGMVTNGYAQKVEFSQMPD 271

Query: 1018 YCLHCKHIGHTIEGCYMNGN 1077
            YC HC H+GH    C + GN
Sbjct: 272  YCEHCCHVGHNETTCLVLGN 291


>ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
            gi|508725617|gb|EOY17514.1| Uncharacterized protein
            TCM_042330 [Theobroma cacao]
          Length = 2249

 Score =  127 bits (319), Expect = 3e-28
 Identities = 67/186 (36%), Positives = 96/186 (51%)
 Frame = +1

Query: 523  LTLIGKFSFAIPHPKGIASGFSALNLKGPFSWSFANPSHIIIKLHLEEDYNKLWMGTLWS 702
            ++L+GKFS  +P  + I S F  + L G +   + +  HI+I L  E D N++W   +W 
Sbjct: 1    MSLVGKFS-RMPKLQDIRSAFKGIGLAGAYEVRWLDYKHILIHLTNEHDCNRVWTKQVWF 59

Query: 703  LGDCPMRVFKWTPSFNPKTEAPLAPVWIRLPGLPIHFFDHNALFAICKIIGTPLQMDSPT 882
            + +  MRVFKWTP F P+ E+ + PVWI  P L  H F+ +AL  I K +G PL +D  T
Sbjct: 60   IANQKMRVFKWTPDFEPEKESAVVPVWIAFPNLKAHLFEKSALLLIAKTVGKPLFVDEAT 119

Query: 883  ATRTRLSMARXXXXXXXXXXXXXXXXXXFDGTTHVQKIVFERTPDYCLHCKHIGHTIEGC 1062
            A  +R S+AR                         Q++ F + P YC HC H+GH    C
Sbjct: 120  ANGSRPSVARVCIEYDCRRPPID------------QRVEFSQMPAYCDHCCHVGHKEIDC 167

Query: 1063 YMNGNK 1080
             + GNK
Sbjct: 168  IVLGNK 173


>ref|XP_007031317.1| Uncharacterized protein TCM_016768 [Theobroma cacao]
            gi|508710346|gb|EOY02243.1| Uncharacterized protein
            TCM_016768 [Theobroma cacao]
          Length = 351

 Score =  122 bits (305), Expect = 6e-28
 Identities = 83/280 (29%), Positives = 121/280 (43%), Gaps = 31/280 (11%)
 Frame = +1

Query: 334  RPPEKP-PLKHGSYANVTASTSRQSNLPFDPKRIVPIGTHKE----KDGQQVIGFSSSEN 498
            RPP+ P P  HG                 D   I+P GT+++    KD Q  +      N
Sbjct: 5    RPPDPPLPFPHG-----------------DSSLIMPHGTNRDPTDPKDLQPPVNNGGLPN 47

Query: 499  NRL-------------------AETWKLTLIGKFSFAIPHPKGIASGFSALNLKGPFSWS 621
            N L                    E   L    +  F +P    I   F  ++L G +   
Sbjct: 48   NNLQNPPISPRAQKKSFLSVVAGEKPPLIPPTREPFWMPRINEIRMAFKGIDLVGAYEIK 107

Query: 622  FANPSHIIIKLHLEEDYNKLWMGTLWSLGDCPMRVFKWTPSFNPKTEAPLAPVWIRLPGL 801
            + +  HI+I+L  E D N++W+  +W + +  M VFKWTP+F P+ E+ L PVWI  P L
Sbjct: 108  WLDYKHILIQLSNEHDLNRIWLKQVWFISNQKMCVFKWTPNFQPEKESSLVPVWISFPNL 167

Query: 802  PIHFFDHNALFAICKIIGTPLQMDSPTATRTRLSMARXXXXXXXXXXXXXXXXXXF-DGT 978
              H ++  AL  I K +G PL +D  TA  TR S+AR                    D  
Sbjct: 168  RAHLYEKFALLVIAKTVGRPLMVDEATAKGTRPSVARVCIEYDCQKPPIDQVWIVTRDRK 227

Query: 979  T------HVQKIVFERTPDYCLHCKHIGHTIEGCYMNGNK 1080
            T      ++QK+ F +  +YC HC H+GH +  C M G++
Sbjct: 228  TGSVIGGYMQKVDFAKLLEYCSHCCHVGHGVSTCIMLGHR 267


Top