BLASTX nr result

ID: Catharanthus22_contig00007871 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00007871
         (1967 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EOY29076.1| Uncharacterized protein TCM_030494 [Theobroma cacao]   100   2e-18
gb|EOY34749.1| Uncharacterized protein TCM_042329 [Theobroma cacao]    98   1e-17
gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]    97   2e-17
gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao]    97   3e-17
gb|EOY06957.1| Uncharacterized protein TCM_021519 [Theobroma cacao]    97   3e-17
gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]    94   2e-16
gb|EOY19201.1| Uncharacterized protein TCM_044158 [Theobroma cacao]    94   2e-16
gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]    94   3e-16
gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao]    92   8e-16
gb|EOY02245.1| Uncharacterized protein TCM_016772 [Theobroma cacao]    92   1e-15
gb|EOY02235.1| Uncharacterized protein TCM_011922 [Theobroma cacao]    92   1e-15
gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]    63   1e-15
gb|EOY25452.1| Uncharacterized protein TCM_016760 [Theobroma cacao]    88   2e-14
gb|EOY02242.1| Uncharacterized protein TCM_016767 [Theobroma cacao]    81   1e-12
gb|EOY03450.1| Uncharacterized protein TCM_018528 [Theobroma cacao]    80   2e-12
gb|EOY19056.1| Uncharacterized protein TCM_043721 [Theobroma cacao]    56   4e-12
gb|EOX96783.1| Uncharacterized protein TCM_005954 [Theobroma cacao]    57   5e-12
gb|EOY17515.1| Uncharacterized protein TCM_042331 [Theobroma cacao]    79   7e-12
gb|EOX96782.1| Uncharacterized protein TCM_005953 [Theobroma cacao]    52   2e-11
ref|XP_006295777.1| hypothetical protein CARUB_v10024899mg, part...    63   4e-11

>gb|EOY29076.1| Uncharacterized protein TCM_030494 [Theobroma cacao]
          Length = 876

 Score =  100 bits (250), Expect = 2e-18
 Identities = 80/278 (28%), Positives = 128/278 (46%), Gaps = 19/278 (6%)
 Frame = +3

Query: 348  FQFQPAPMHGHRSL*PACLLLNPDLSTTL----KCVSSYKREPAIVFEDQEISSFIAPFK 515
            FQ QP P    R+   + L +   +   L    +    YK +PA+ F + EI +   PFK
Sbjct: 74   FQVQPQPPASPRTAKKSFLSVVNAVKLALVPPTRPTFRYKDKPAVRFFEDEIEALAQPFK 133

Query: 516  LSLIGKFSH*KSGMAVLRKEFHTIGLKGTFNIGWLSPRQILIRL*P*GGFHSFIAQRHMV 695
             +++GKFS     +  +R+ F ++GL G +NI W++ + ILI L     F+    ++   
Sbjct: 134  FAIVGKFSK-MPRLTEIRQSFVSLGLSGVYNIRWMNYKHILIHLSNEQDFNRIWTKQ--- 189

Query: 696  ISWISHEGVQMVYQFSPYIGIFCISSVD----CLRGIADSLFQQIKLFSIARVIGKPLRI 863
             +W        V++++P       S +         +   LF++  L  IA+ IG PL I
Sbjct: 190  -TWFITNQKMRVFKWTPDFETDKESPIVPVWISFPNLKAHLFEKSALLMIAKAIGNPLYI 248

Query: 864  GKPTACLSRPSIARVCVKIDLLKPLPVRVCIGTSSR------KGFWQPVANENLPPYCSS 1025
             + TA  +RPS+ARVC++ D LKP    V I  S R       G+ Q V    +P YC+ 
Sbjct: 249  DEATANGTRPSVARVCIEYDCLKPPVDSVWIVVSKRGSEDMSGGYLQKVEFAPMPEYCNH 308

Query: 1026 CFCLVHIVHSC-----KRVSHEGNGALMLKPQQLRNLV 1124
            C  + H V  C     +  +H+  G   L+    R  +
Sbjct: 309  CCHVGHNVSKCLILGSRSNTHKSGGKTALESSHERTQI 346


>gb|EOY34749.1| Uncharacterized protein TCM_042329 [Theobroma cacao]
          Length = 2606

 Score = 98.2 bits (243), Expect = 1e-17
 Identities = 77/263 (29%), Positives = 121/263 (46%), Gaps = 14/263 (5%)
 Frame = +3

Query: 450  YKREPAIVFEDQEISSFIAPFKLSLIGKFSH*KSGMAVLRKEFHTIGLKGTFNIGWLSPR 629
            +K  PA  F + EI +   P KLSL+GKFS     +  +R  F  IGL G + + WL  +
Sbjct: 1711 FKDRPAAAFFEDEIQTLAQPLKLSLVGKFSR-MPKLQDVRSAFKGIGLTGAYEVRWLDYK 1769

Query: 630  QILIRL*P*GGFHS-------FIAQRHM-VISWISHEGVQMVYQFSPYIGIFCISSVDCL 785
             +LI L      +        FIA + M V  W          +F P      +      
Sbjct: 1770 HVLIHLSNEQDCNRVWTKQVWFIANQKMRVFKWTP--------EFEPEKESAVVPVWIAF 1821

Query: 786  RGIADSLFQQIKLFSIARVIGKPLRIGKPTACLSRPSIARVCVKIDLLKPLPVRVCIGTS 965
              +   LF++  L  IA+ +GKPL + + TA  SRPS+ARVC++ D  +P   +V I   
Sbjct: 1822 PNLKAHLFEKSALLLIAKTVGKPLFVDEATANGSRPSVARVCIEFDCRRPPIDQVWIVVQ 1881

Query: 966  SRK------GFWQPVANENLPPYCSSCFCLVHIVHSCKRVSHEGNGALMLKPQQLRNLVL 1127
            +R+      G+ Q V    +P YC  C  + H  + C  + ++     + K Q LR L +
Sbjct: 1882 NRETGTVTSGYPQRVEFSQMPAYCDHCCHVGHKENDCIVLGNKDKSLGLSKSQSLRTLAV 1941

Query: 1128 QRHLLGWGKLIQ*RLKQPRGKEK 1196
            ++   G+G   +  L++ +  EK
Sbjct: 1942 EKK-TGYGGGSEKNLEKRKNPEK 1963



 Score = 90.1 bits (222), Expect = 3e-15
 Identities = 66/218 (30%), Positives = 104/218 (47%), Gaps = 10/218 (4%)
 Frame = +3

Query: 450  YKREPAIVFEDQEISSFIAPFKLSLIGKFSH*KSGMAVLRKEFHTIGLKGTFNIGWLSPR 629
            Y+  PA+ F + EI +   PFK S++GKFS     +  +R  F  I L G + I WL  +
Sbjct: 85   YRDRPAVAFFEDEIVALAQPFKHSMVGKFSR-MPKLNDIRAAFKGISLVGVYEIRWLDYK 143

Query: 630  QILIRL*P*GGFHSFIAQRHMVISWISHEGVQMVYQFSPYIGIFCISSVD----CLRGIA 797
             ILI L         + +  M  +W        V++++P       SS+         + 
Sbjct: 144  HILIHL----SNEQDLNRLWMRQAWFIANQKMRVFKWTPDFQPEKESSLVPVWISFPNLR 199

Query: 798  DSLFQQIKLFSIARVIGKPLRIGKPTACLSRPSIARVCVKIDLLKPLPVRVCIGTSSRK- 974
              L+++  L  IA+ +G+PL + + TA  +RPS+ARVCV+ D  +P   ++ I T  R+ 
Sbjct: 200  AHLYEKSALLMIAKSVGRPLFVDEATANGTRPSVARVCVEYDCQQPPLEQIWIVTRDRRT 259

Query: 975  -----GFWQPVANENLPPYCSSCFCLVHIVHSCKRVSH 1073
                 GF Q V    LP YC+ C  + H   +C  + H
Sbjct: 260  GDITGGFQQKVDFAKLPNYCTHCCHVGHSASTCLVMGH 297


>gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
          Length = 2127

 Score = 97.4 bits (241), Expect = 2e-17
 Identities = 76/238 (31%), Positives = 110/238 (46%), Gaps = 14/238 (5%)
 Frame = +3

Query: 450  YKREPAIVFEDQEISSFIAPFKLSLIGKFSH*KSGMAVLRKEFHTIGLKGTFNIGWLSPR 629
            YK  PA  F + EI     PFKLSL+GKFS     +  +R  F  IGL G++ I WL  +
Sbjct: 108  YKDRPAAAFFEDEIHILAQPFKLSLVGKFSR-MPKLQEVRSAFKGIGLAGSYEIRWLDYK 166

Query: 630  QILIRL*P*GGFHSFIAQRHMVISWISHEGVQMVYQFSPYIGIFCISSVDCLRGIADSLF 809
             ILI L     F+ F  ++   I+       +   +F P      +        +   LF
Sbjct: 167  HILIHLSNEQDFNRFWTKQAWFIANQKMRVFKWTPEFEPEKESAVVPVWISFPNLKAHLF 226

Query: 810  QQIKLFSIARVIGKPLRIGKPTACLSRPSIARVCVKIDLLKPLPVRVCIGTSSR------ 971
            ++  L  IA+ +GKPL I + TA  SRPS+ARVC++ D  +P   +V I   +R      
Sbjct: 227  EKSALLLIAKTVGKPLFIDEATANGSRPSVARVCIEYDCREPPVDQVWIVVQNRATGAVT 286

Query: 972  KGFWQPVANENLPPYCSSCFCLVHIVHSCKRVSHEGNGALMLKPQ--------QLRNL 1121
             G+ Q V    +P YC  C  + H   +C  + ++       KPQ        +LRNL
Sbjct: 287  SGYPQKVEFAQMPAYCDHCCHVGHKEINCIVLGNKNGLQGSGKPQPHSVVDADKLRNL 344


>gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao]
          Length = 3503

 Score = 96.7 bits (239), Expect = 3e-17
 Identities = 66/209 (31%), Positives = 99/209 (47%), Gaps = 6/209 (2%)
 Frame = +3

Query: 450  YKREPAIVFEDQEISSFIAPFKLSLIGKFSH*KSGMAVLRKEFHTIGLKGTFNIGWLSPR 629
            +K  PA  F + EI +   PFKLSL+GKFS     +  +R  F  IGL G + + WL  +
Sbjct: 1780 FKDRPAAAFFEDEIQTLAKPFKLSLVGKFSR-MPKLQDVRAAFKGIGLAGAYEVRWLDYK 1838

Query: 630  QILIRL*P*GGFHSFIAQRHMVISWISHEGVQMVYQFSPYIGIFCISSVDCLRGIADSLF 809
             +LI L     F+    +++  I+       +   +F P      +        +   LF
Sbjct: 1839 HVLIHLSNEQDFNRIWTKQNWFIATQKMRVFKWTPEFEPEKESAVVPVWISFPNLKAHLF 1898

Query: 810  QQIKLFSIARVIGKPLRIGKPTACLSRPSIARVCVKIDLLKPLPVRVCIGTSSRK----- 974
            ++  L  IA+ +GKPL + + TA  SRPS+ARVCV+ D  +P   +V I   +RK     
Sbjct: 1899 EKSALLLIAKTVGKPLFVDEATANGSRPSVARVCVEFDCRQPPLDQVWIVVQNRKTGEIT 1958

Query: 975  -GFWQPVANENLPPYCSSCFCLVHIVHSC 1058
             G+ Q V    +P YC  C  + H    C
Sbjct: 1959 NGYSQRVEFAQMPAYCDHCCHVGHKETDC 1987



 Score = 66.6 bits (161), Expect = 4e-08
 Identities = 56/199 (28%), Positives = 89/199 (44%), Gaps = 10/199 (5%)
 Frame = +3

Query: 555  MAVLRKEFHTIGLKGTFNIGWLSPRQILIRL*P*GGFHSFIAQRHMVISWISHEGVQMVY 734
            M  +R+ F  IGL G + I WL  + ILI L     F+    ++     W        V+
Sbjct: 4    MQEIRQAFKGIGLTGAYVIRWLDYKHILIHLSNEQDFNRIWTKQQ----WFIANQKMRVF 59

Query: 735  QFSPYIGIFCISSVD----CLRGIADSLFQQIKLFSIARVIGKPLRIGKPTACLSRPSIA 902
            ++SP       S +         +   L+++  L  IA+ +GKPL I + T+  SRPS+A
Sbjct: 60   KWSPDFEAEKESPIVPVWISFPNLKAHLYEKSALLLIAKTVGKPLFIDEATSNASRPSVA 119

Query: 903  RVCVKIDLLKPLPVRVCIGTSSR------KGFWQPVANENLPPYCSSCFCLVHIVHSCKR 1064
            RVCV+ +        + I    R       G+ Q V    +P YC  C  + H V +C  
Sbjct: 120  RVCVEYNCRNAPVEEIWIVIKDRVTGTVTGGYAQKVEFSKMPDYCEHCGHVGHSVSTCLV 179

Query: 1065 VSHEGNGALMLKPQQLRNL 1121
            +   GN +  L+ ++L N+
Sbjct: 180  L---GNRSENLRKEKLSNV 195


>gb|EOY06957.1| Uncharacterized protein TCM_021519 [Theobroma cacao]
          Length = 667

 Score = 96.7 bits (239), Expect = 3e-17
 Identities = 70/233 (30%), Positives = 107/233 (45%), Gaps = 14/233 (6%)
 Frame = +3

Query: 450  YKREPAIVFEDQEISSFIAPFKLSLIGKFSH*KSGMAVLRKEFHTIGLKGTFNIGWLSPR 629
            YK  PA VF + EI     PF L L+GKF+     +  +R  F  IGL G + I WL  +
Sbjct: 79   YKDRPAAVFYEDEICILAKPFSLCLVGKFTR-MPKLQEVRSAFKGIGLSGAYEIKWLDYK 137

Query: 630  QILIRL*P*GGFHSF--------IAQRHMVISWISHEGVQMVYQFSPYIGIFCISSVDCL 785
             +LI L     F+          + Q+  +  W      +   + SP + ++        
Sbjct: 138  HVLIHLSNDQDFNRIWTRQQWFIVGQKMRIFKWSPEFEAE---KESPVVPVWI-----SF 189

Query: 786  RGIADSLFQQIKLFSIARVIGKPLRIGKPTACLSRPSIARVCVKIDLLKPLPVRVCIGTS 965
              +   L+++  L  IA+ IGKPL + +PTA  SRPS+ARVCV+ D  +P   +V I T 
Sbjct: 190  PNLKAHLYEKSALLLIAKTIGKPLFVDEPTAKGSRPSVARVCVEYDCREPPIDQVWIVTQ 249

Query: 966  SRK------GFWQPVANENLPPYCSSCFCLVHIVHSCKRVSHEGNGALMLKPQ 1106
             R+      G+ Q V    +P YC  C  + H   +C  + +    +  +K Q
Sbjct: 250  KRETGMVTNGYAQKVEFSQMPDYCEHCCHVGHNETTCLVLGNNSKSSGSMKAQ 302


>gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
          Length = 2367

 Score = 94.0 bits (232), Expect = 2e-16
 Identities = 80/266 (30%), Positives = 118/266 (44%), Gaps = 17/266 (6%)
 Frame = +3

Query: 450  YKREPAIVFEDQEISSFIAPFKLSLIGKFSH*KSGMAVLRKEFHTIGLKGTFNIGWLSPR 629
            +K  PA  F + EI +   P KLSL+GKFS     +  +R  F  IGL G + + WL  +
Sbjct: 114  FKDRPAAAFYEDEIQTLAQPLKLSLVGKFSR-MPKLQDVRSAFKGIGLAGAYEVRWLDYK 172

Query: 630  QILIRL*P*GGFHS----------FIAQRHM-VISWISHEGVQMVYQFSPYIGIFCISSV 776
             ILI L      H           FIA + M V  W          +F P      +   
Sbjct: 173  HILIHL---TNEHDCNRVWTKQVWFIANQKMRVFKWTP--------EFEPEKESAMVPVW 221

Query: 777  DCLRGIADSLFQQIKLFSIARVIGKPLRIGKPTACLSRPSIARVCVKIDLLKPLPVRVCI 956
                 +   LF++  L  IA+ +GKPL + + TA  SRPS+ARVC++ D  KP   +V I
Sbjct: 222  IAFPNLKAHLFEKSALLLIAKTVGKPLFVDEATANGSRPSVARVCIEYDCRKPPIDQVWI 281

Query: 957  GTSSRK------GFWQPVANENLPPYCSSCFCLVHIVHSCKRVSHEGNGALMLKPQQLRN 1118
               +R+      G+ Q V    +P YC  C  + H    C  + ++       K Q LR 
Sbjct: 282  VVQNRETGTVTSGYPQKVEFSQMPAYCDHCCHVGHKEIDCIVLGNKDKPLGSSKSQFLRV 341

Query: 1119 LVLQRHLLGWGKLIQ*RLKQPRGKEK 1196
            L  ++   G+G   +  L++ +  EK
Sbjct: 342  LEAEKK-KGYGGSSEKNLEKSKNPEK 366


>gb|EOY19201.1| Uncharacterized protein TCM_044158 [Theobroma cacao]
          Length = 830

 Score = 94.0 bits (232), Expect = 2e-16
 Identities = 71/213 (33%), Positives = 109/213 (51%), Gaps = 10/213 (4%)
 Frame = +3

Query: 450  YKREPAIVFEDQEISSFIAPFKLSLIGKFSH*KSGMAVLRKEFHTIGLKGTFNIGWLSPR 629
            YK  PA  F D EIS+   PFK S++GKFS     M  +R  F  IGL G + I WL  +
Sbjct: 79   YKDRPAASFFDDEISTLAQPFKFSMVGKFSR-MLRMQEIRVAFKGIGLIGAYEIRWLDYK 137

Query: 630  QILIRL*P*GGFHSFIAQRHMVISWISHEGVQMVYQFSPYIGIFCISSVD----CLRGIA 797
             ILI+L      H         + +IS++ ++ V+++SP       SS+         + 
Sbjct: 138  HILIQL---SNEHDLNRIWLKQVWFISNQKMR-VFKWSPEFQPEKESSMVPVWISFPNLK 193

Query: 798  DSLFQQIKLFSIARVIGKPLRIGKPTACLSRPSIARVCVKIDLLKPLPVRVCIGTSSRK- 974
              L+++  L +I + +G+PL + + TA  +RPS+ARVCV+ D  +P   +V I T +R+ 
Sbjct: 194  AHLYEKSALSAIVKTVGRPLMVDEATANGTRPSVARVCVEFDCQQPPIDQVWIVTRNRQS 253

Query: 975  -----GFWQPVANENLPPYCSSCFCLVHIVHSC 1058
                 G+ Q V    L  +C+ C  + H V SC
Sbjct: 254  GSVMGGYMQKVEFARLSEFCTHCSHVGHGVSSC 286


>gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
          Length = 2214

 Score = 93.6 bits (231), Expect = 3e-16
 Identities = 69/233 (29%), Positives = 106/233 (45%), Gaps = 14/233 (6%)
 Frame = +3

Query: 450  YKREPAIVFEDQEISSFIAPFKLSLIGKFSH*KSGMAVLRKEFHTIGLKGTFNIGWLSPR 629
            YK  PA VF + EI     PF L L+GKF+     +  +R  F  IGL G + I WL  +
Sbjct: 79   YKDRPAAVFYEDEICILAKPFSLCLVGKFTR-MPKLQEVRSAFKGIGLSGAYEIKWLDYK 137

Query: 630  QILIRL*P*GGFHSF--------IAQRHMVISWISHEGVQMVYQFSPYIGIFCISSVDCL 785
             +LI L     F+          + Q+  +  W      +   + SP + ++        
Sbjct: 138  HVLIHLSNDQDFNRIWTRQQWFIVGQKMRIFKWSPEFEAE---KESPVVPVWI-----SF 189

Query: 786  RGIADSLFQQIKLFSIARVIGKPLRIGKPTACLSRPSIARVCVKIDLLKPLPVRVCIGTS 965
              +   L+++  L  IA+ IGKPL + + TA  SRPS+ARVCV+ D  +P   +V I T 
Sbjct: 190  PNLKAHLYEKSALLLIAKTIGKPLFVDEATAKGSRPSVARVCVEYDCREPPIDQVWIVTQ 249

Query: 966  SRK------GFWQPVANENLPPYCSSCFCLVHIVHSCKRVSHEGNGALMLKPQ 1106
             R+      G+ Q V    +P YC  C  + H   +C  + +    +  +K Q
Sbjct: 250  KRETGMVTNGYAQKVEFSQMPDYCEHCCHVGHNETTCLVLGNNSKSSGSMKAQ 302


>gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao]
          Length = 1951

 Score = 92.0 bits (227), Expect = 8e-16
 Identities = 67/218 (30%), Positives = 105/218 (48%), Gaps = 10/218 (4%)
 Frame = +3

Query: 450  YKREPAIVFEDQEISSFIAPFKLSLIGKFSH*KSGMAVLRKEFHTIGLKGTFNIGWLSPR 629
            Y+  PA+ F + EI +   PFK S++GKFS     +  +R  F  IGL G + I WL  +
Sbjct: 85   YRDRPAVAFFEDEIVALAQPFKHSMVGKFSR-MPKLNDIRAAFKGIGLVGVYEIRWLDYK 143

Query: 630  QILIRL*P*GGFHSFIAQRHMVISWISHEGVQMVYQFSPYIGIFCISSVD----CLRGIA 797
             ILI L         + +  M  +W        V+++SP       SS+         + 
Sbjct: 144  HILIHL----SNEQDLNRLWMRQAWFIANQKMRVFKWSPDFQPEKESSLVPVWISFPNLR 199

Query: 798  DSLFQQIKLFSIARVIGKPLRIGKPTACLSRPSIARVCVKIDLLKPLPVRVCIGTSSRK- 974
              L+++  L  IA+ +G+PL + + TA  +RPS+ARVCV+ D  +P   ++ I +  R+ 
Sbjct: 200  AHLYEKSALLMIAKSVGRPLFVDEATANGTRPSVARVCVEYDCQQPPLEQIWIVSRDRRT 259

Query: 975  -----GFWQPVANENLPPYCSSCFCLVHIVHSCKRVSH 1073
                 GF Q V    LP YC+ C  + H   +C  + H
Sbjct: 260  GDITGGFQQKVDFAKLPNYCTHCCHVGHSASTCLVMGH 297


>gb|EOY02245.1| Uncharacterized protein TCM_016772 [Theobroma cacao]
          Length = 1296

 Score = 91.7 bits (226), Expect = 1e-15
 Identities = 73/243 (30%), Positives = 116/243 (47%), Gaps = 12/243 (4%)
 Frame = +3

Query: 411  NPDLSTTLKCVSSYKREPAIVFEDQEISSFIAPFKLSLIGKFSH*KSGMAVLRKEFHTIG 590
            NP +    +  S Y+  PA  F D EI++    FK S+IGKF+     +  +R  F  IG
Sbjct: 72   NPPVIPLNREPSWYRDRPAASFFDNEIATLALSFKFSMIGKFTR-MPKLQEIRTAFKGIG 130

Query: 591  LKGTFNIGWLSPRQILIRL*P*GGFHSFIAQRHMVISWISHEGVQMVYQFSPYIGIFCIS 770
            L G +NI WL  + ILI L      H  + +  M  +W        V++++P       S
Sbjct: 131  LVGAYNIRWLDYKHILIHL---SNEHD-LNRIWMKQNWFIVNKKMRVFKWTPEFHPEKES 186

Query: 771  SVD----CLRGIADSLFQQIKLFSIARVIGKPLRIGKPTACLSRPSIARVCVKIDLLKPL 938
            S+         +    +++  L  IA+ +G+PL + + TA  +RP++AR+CV+ D  K L
Sbjct: 187  SLVPVWISFPNLRAHFYEKSTLMMIAKSVGRPLFVDEATANGTRPNVARICVEYDCQKSL 246

Query: 939  PVRVCIGTSSRK------GFWQPVANENLPPYCSSCFCLVHIVHSCKRVSH--EGNGALM 1094
              ++ I T SR+      GF Q V    +P YC+ C  + H   +C  + +  E  G + 
Sbjct: 247  LDQIWIVTRSRQTGEVTGGFIQKVEFVKMPDYCTHCCHVGHNASACLVLGNKPEKQGLVS 306

Query: 1095 LKP 1103
             KP
Sbjct: 307  TKP 309


>gb|EOY02235.1| Uncharacterized protein TCM_011922 [Theobroma cacao]
          Length = 928

 Score = 91.7 bits (226), Expect = 1e-15
 Identities = 67/233 (28%), Positives = 104/233 (44%), Gaps = 14/233 (6%)
 Frame = +3

Query: 450  YKREPAIVFEDQEISSFIAPFKLSLIGKFSH*KSGMAVLRKEFHTIGLKGTFNIGWLSPR 629
            YK  PA VF + EI     PF L L+GKF+     +  +R  F  IGL G + I WL  +
Sbjct: 79   YKDRPAAVFYEDEICILAKPFSLCLVGKFTR-MPKLQEVRSAFKGIGLSGAYEIKWLDYK 137

Query: 630  QILIRL*P*GGFHSF--------IAQRHMVISWISHEGVQMVYQFSPYIGIFCISSVDCL 785
             ++I L     F+          + Q+  +  W      +   + SP + ++        
Sbjct: 138  HVIIHLSNDQDFNRIWTRQQWFIVGQKMRIFKWSPEFEAE---KESPVVPVWI-----SF 189

Query: 786  RGIADSLFQQIKLFSIARVIGKPLRIGKPTACLSRPSIARVCVKIDLLKPLPVRVCIGTS 965
              +   L+++  L  IA+ IG+PL + + TA  SRPS+ARVC + D  KP   +V I T 
Sbjct: 190  PNLKAHLYEKFALLLIAKTIGRPLFVDEATAKGSRPSVARVCAEYDCRKPPINQVWIVTQ 249

Query: 966  SRK------GFWQPVANENLPPYCSSCFCLVHIVHSCKRVSHEGNGALMLKPQ 1106
             R+      G+ Q V    +P YC  C  + H   +C  + +       +K Q
Sbjct: 250  KRETGTVTNGYAQKVEFSQMPAYCDHCCHVGHNETNCLVLGNISKSLASMKSQ 302


>gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]
          Length = 1954

 Score = 63.2 bits (152), Expect(2) = 1e-15
 Identities = 41/107 (38%), Positives = 55/107 (51%), Gaps = 6/107 (5%)
 Frame = +3

Query: 804  LFQQIKLFSIARVIGKPLRIGKPTACLSRPSIARVCVKIDLLKPLPVRVCIGTSSRK--- 974
            L ++  L  +AR +GKPL + + TA  SRPS+ARVCV+ D  KP    V I + +RK   
Sbjct: 32   LHEKSALMMVARTVGKPLFVDEATANRSRPSVARVCVEYDCQKPPLDHVWIVSRNRKTET 91

Query: 975  ---GFWQPVANENLPPYCSSCFCLVHIVHSCKRVSHEGNGALMLKPQ 1106
               G  Q V    LP YC  C  + H V  C  +   GN  +  KP+
Sbjct: 92   MTGGLSQRVEFAKLPEYCQHCCHVGHAVTECMVL---GNKPVSTKPK 135



 Score = 48.9 bits (115), Expect(2) = 1e-15
 Identities = 22/37 (59%), Positives = 24/37 (64%)
 Frame = +1

Query: 712 MRVFKWSINFHPTLESSVFPAWIVFEGLPIHYFNKSS 822
           MRVFKWS +F P  ESSV P WI F  LP H   KS+
Sbjct: 1   MRVFKWSPDFQPEKESSVVPVWISFPNLPAHLHEKSA 37


>gb|EOY25452.1| Uncharacterized protein TCM_016760 [Theobroma cacao]
          Length = 1109

 Score = 87.8 bits (216), Expect = 2e-14
 Identities = 64/215 (29%), Positives = 100/215 (46%), Gaps = 10/215 (4%)
 Frame = +3

Query: 444  SSYKREPAIVFEDQEISSFIAPFKLSLIGKFSH*KSGMAVLRKEFHTIGLKGTFNIGWLS 623
            S YK  PA +F + EI +   PF  SL+GKFS     +  +R  F  IGL G + I W+ 
Sbjct: 81   SVYKDRPAAIFYEDEIQTLARPFSHSLVGKFSR-MPKLQEIRHAFKGIGLSGAYEIRWMD 139

Query: 624  PRQILIRL*P*GGFHSFIAQRHMVISWISHEGVQMVYQFSPYIGIFCISSVD----CLRG 791
             + +LI L     F+    ++     W        V++++P       S++         
Sbjct: 140  YKHVLIHLSNEQDFNRVWVKQQ----WFIVNQKMRVFKWAPDFEAEKESAMVPVWISFPN 195

Query: 792  IADSLFQQIKLFSIARVIGKPLRIGKPTACLSRPSIARVCVKIDLLKPLPVRVCIGTSSR 971
            +   L+++  L  IA+ +GKPL + + TA  SRPS+ARVCV+ D  K     + I   +R
Sbjct: 196  LKAHLYEKSALLLIAKTVGKPLYVDEATANGSRPSVARVCVEYDCRKQPVEEIWIVIRNR 255

Query: 972  K------GFWQPVANENLPPYCSSCFCLVHIVHSC 1058
            +      G+ Q V    +P YC  C  + H  + C
Sbjct: 256  ETGAVTGGYSQRVEFARMPDYCGYCSHVGHKENEC 290


>gb|EOY02242.1| Uncharacterized protein TCM_016767 [Theobroma cacao]
          Length = 1707

 Score = 81.3 bits (199), Expect = 1e-12
 Identities = 56/178 (31%), Positives = 90/178 (50%), Gaps = 4/178 (2%)
 Frame = +3

Query: 450 YKREPAIVFEDQEISSFIAPFKLSLIGKFSH*KSGMAVLRKEFHTIGLKGTFNIGWLSPR 629
           YK +PA+ F + EI +    F+ S++GKFS     +  +R+ F  +GL G +NI W+  +
Sbjct: 104 YKDKPAVRFYEDEIETLAKSFRFSIVGKFSR-TPRLVEIRQAFVGLGLSGAYNIRWMDYK 162

Query: 630 QILIRL*P*GGFHSFIAQRHMVISWISHEGVQMVYQFSPYIGIFCISSVD----CLRGIA 797
            +LI L     F+    ++    +W   +    V++ +P       SS+         + 
Sbjct: 163 HVLIHLSNEQDFNRIWTKQ----TWFIAKQKMRVFKGTPNFESDKESSIVPVWISFPNLR 218

Query: 798 DSLFQQIKLFSIARVIGKPLRIGKPTACLSRPSIARVCVKIDLLKPLPVRVCIGTSSR 971
             LF++  L  IA+ IG PL + + TA  +RPS+ARVC++ D LK     V I TS R
Sbjct: 219 AHLFEKSALLLIAKAIGNPLGVDEATANGTRPSVARVCIEYDCLKSPIKSVWIVTSKR 276


>gb|EOY03450.1| Uncharacterized protein TCM_018528 [Theobroma cacao]
          Length = 590

 Score = 80.5 bits (197), Expect = 2e-12
 Identities = 55/170 (32%), Positives = 91/170 (53%), Gaps = 5/170 (2%)
 Frame = +3

Query: 450 YKREPAIVFEDQEISSFIAPFKLSLIGKFSH*KSGMAVLRKEFHTIGLKGTFNIGWLSPR 629
           YK   A+ F + +I +   PF LSL+GKF+     +A +R  F  IGL G + IGWL  +
Sbjct: 86  YKDRHAVAFFEDKIQALAKPFMLSLVGKFTR-MPKLAEIRLAFKGIGLAGAYEIGWLDYK 144

Query: 630 QILIRL*P*GGFHSFIAQRHMVIS----WISHEGVQM-VYQFSPYIGIFCISSVDCLRGI 794
            ILI L     F+    +++  I+    W+    +++   + SP + I+          +
Sbjct: 145 HILIHLFNEHDFNWIWTKQNWFIANQKMWVFKCSLEIEAEKESPIVPIWI-----SFPKL 199

Query: 795 ADSLFQQIKLFSIARVIGKPLRIGKPTACLSRPSIARVCVKIDLLKPLPV 944
              L++++ L  +A+++GKPL + + T   SRPS+AR+CV+ D  K LPV
Sbjct: 200 KAHLYEKLVLLLVAKIVGKPLFVDEATTKGSRPSVARICVEYDCRK-LPV 248


>gb|EOY19056.1| Uncharacterized protein TCM_043721 [Theobroma cacao]
          Length = 359

 Score = 56.2 bits (134), Expect(2) = 4e-12
 Identities = 41/106 (38%), Positives = 55/106 (51%), Gaps = 12/106 (11%)
 Frame = +3

Query: 804  LFQQIKLFSIARVIGKPLRIGKPTACLSRPSIARVCVKIDLLKPLPV-RVCIGTSSR--- 971
            LF++  L  IA+ IG PL I + TA  +RPS+ARVC++ D LK LPV  V I  S R   
Sbjct: 84   LFEKSALLLIAKAIGNPLWIDEATANGTRPSVARVCIEYDCLK-LPVDSVWIVVSKRGSK 142

Query: 972  ---KGFWQPVANENLPPYCSSCFCLVHIVHSC-----KRVSHEGNG 1085
                G+ Q V    +  YC+ C  + H V  C     K  +H+  G
Sbjct: 143  DMLGGYLQKVEFSPMSEYCNHCCHVGHSVSECLIVGTKSTTHKQGG 188



 Score = 43.9 bits (102), Expect(2) = 4e-12
 Identities = 21/47 (44%), Positives = 23/47 (48%)
 Frame = +1

Query: 682 KGTW*FHGFPMRVFKWSINFHPTLESSVFPAWIVFEGLPIHYFNKSS 822
           K TW      MRVFKW+  F    E S  P WI F  L  H F KS+
Sbjct: 43  KQTWFIANQKMRVFKWTPEFETEKEPSTVPVWISFPNLKAHLFEKSA 89


>gb|EOX96783.1| Uncharacterized protein TCM_005954 [Theobroma cacao]
          Length = 1134

 Score = 57.4 bits (137), Expect(2) = 5e-12
 Identities = 35/91 (38%), Positives = 50/91 (54%), Gaps = 6/91 (6%)
 Frame = +3

Query: 804  LFQQIKLFSIARVIGKPLRIGKPTACLSRPSIARVCVKIDLLKPLPVRVCIGTSSR---- 971
            LF++  L  IA+ IG PL + + TA  +RPS+ARV ++ D LKP    V I TS R    
Sbjct: 32   LFEKSALLLIAKAIGNPLGVDEATANGTRPSVARVFIEYDCLKPPIESVWIVTSKRGSED 91

Query: 972  --KGFWQPVANENLPPYCSSCFCLVHIVHSC 1058
               G+ Q V    +P YC+ C  + H + +C
Sbjct: 92   VTGGYLQKVDFAPMPEYCNHCCHVGHGMENC 122



 Score = 42.4 bits (98), Expect(2) = 5e-12
 Identities = 18/37 (48%), Positives = 23/37 (62%)
 Frame = +1

Query: 712 MRVFKWSINFHPTLESSVFPAWIVFEGLPIHYFNKSS 822
           MRVFKW+++F    ESS+   WI F  L  H F KS+
Sbjct: 1   MRVFKWTLDFESDKESSIVQVWISFPNLRAHLFEKSA 37


>gb|EOY17515.1| Uncharacterized protein TCM_042331 [Theobroma cacao]
          Length = 1176

 Score = 79.0 bits (193), Expect = 7e-12
 Identities = 64/231 (27%), Positives = 105/231 (45%), Gaps = 12/231 (5%)
 Frame = +3

Query: 450  YKREPAIVFEDQEISSFIAPFKLSLIGKFSH*KSGMAVLRKEFHTIGLKGTFNIGWLSPR 629
            YK  PA +F + +I +   PF L L+GKF+     +  ++  F  I L G + I WL  +
Sbjct: 58   YKDRPAAIFYEDKICTLAKPFSLYLVGKFTR-MPKLQEVKFAFKGIDLLGAYEIKWLDYK 116

Query: 630  QILIRL*P*GGFHSFIAQRHMVISWISHEGVQMVYQF-----SPYIGIFCISSVDCLRGI 794
             ++I L     F+    ++   I+       +   +F     SP + ++          +
Sbjct: 117  HVIIHLSNDQDFNRIWTRQQWFIAGQKMRIFKWPLEFEAKTESPIVPVWI-----SFPNL 171

Query: 795  ADSLFQQIKLFSIARVIGKPLRIGKPTACLSRPSIARVCVKIDLLKPLPV-RVCIGTSSR 971
               L+++  L  I + IGKPL + + T   SRP++ARVCVK D  K LP+ +V I T  R
Sbjct: 172  KAHLYEKFALLLIVKTIGKPLFVDEATTKGSRPTMARVCVKYDCRK-LPIDQVWIVTQKR 230

Query: 972  ------KGFWQPVANENLPPYCSSCFCLVHIVHSCKRVSHEGNGALMLKPQ 1106
                   G+ Q V   ++P Y   C  + H   +C  + +       +KPQ
Sbjct: 231  NTGIVTNGYAQKVEFSHMPNYWDHCCHVGHNETNCLVLGNNSKSLGSMKPQ 281


>gb|EOX96782.1| Uncharacterized protein TCM_005953 [Theobroma cacao]
          Length = 1659

 Score = 52.0 bits (123), Expect(2) = 2e-11
 Identities = 37/107 (34%), Positives = 51/107 (47%), Gaps = 6/107 (5%)
 Frame = +3

Query: 804  LFQQIKLFSIARVIGKPLRIGKPTACLSRPSIARVCVKIDLLKPLPVRVCIGTSSRK--- 974
            L  +  L  +AR +GKPL + + TA    PS+ARVCV+ D  KP    + I + +RK   
Sbjct: 107  LHDKSALMMVARTVGKPLFVDEATANRICPSVARVCVEYDCQKPPLDHIWIVSRNRKTET 166

Query: 975  ---GFWQPVANENLPPYCSSCFCLVHIVHSCKRVSHEGNGALMLKPQ 1106
               G  Q V    LP YC   +C  H+ H+       GN     KP+
Sbjct: 167  MTGGLSQRVEFAKLPKYCRH-YC--HVGHAMTECMVLGNKPASTKPK 210



 Score = 45.4 bits (106), Expect(2) = 2e-11
 Identities = 21/45 (46%), Positives = 26/45 (57%)
 Frame = +1

Query: 688 TW*FHGFPMRVFKWSINFHPTLESSVFPAWIVFEGLPIHYFNKSS 822
           TW      +RVFKWS +F P  +SSV P WI F  L  H  +KS+
Sbjct: 68  TWFIANQKIRVFKWSPDFQPEKKSSVVPVWISFPNLSAHLHDKSA 112


>ref|XP_006295777.1| hypothetical protein CARUB_v10024899mg, partial [Capsella rubella]
            gi|482564485|gb|EOA28675.1| hypothetical protein
            CARUB_v10024899mg, partial [Capsella rubella]
          Length = 451

 Score = 62.8 bits (151), Expect(2) = 4e-11
 Identities = 36/85 (42%), Positives = 47/85 (55%)
 Frame = +3

Query: 822  LFSIARVIGKPLRIGKPTACLSRPSIARVCVKIDLLKPLPVRVCIGTSSRKGFWQPVANE 1001
            L +IA  +G PLR+   T  L R   ARVCV+++L +PL      GT    G W  VA E
Sbjct: 183  LMAIAEGVGTPLRVDLTTLSLERARFARVCVEVNLSEPLK-----GTLLINGEWFFVAYE 237

Query: 1002 NLPPYCSSCFCLVHIVHSCKRVSHE 1076
             L   CSSC    H+VH+C +V+ E
Sbjct: 238  GLTRICSSCGLYGHLVHACPKVALE 262



 Score = 33.9 bits (76), Expect(2) = 4e-11
 Identities = 16/46 (34%), Positives = 24/46 (52%), Gaps = 1/46 (2%)
 Frame = +1

Query: 685 GTW*FHGFPMRVFKWSINFHPTLESSVF-PAWIVFEGLPIHYFNKS 819
           G W   G  + V +WS NF P  +  V  P W+    +P+ Y++KS
Sbjct: 136 GPWRVFGNYLMVQQWSPNFDPLKDDIVTTPVWVRLMNIPVWYYHKS 181

