BLASTX nr result

ID: Mentha23_contig00017677 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha23_contig00017677
         (513 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU28232.1| hypothetical protein MIMGU_mgv1a006810mg [Mimulus...   125   8e-27
ref|XP_002282574.1| PREDICTED: protein TIC 40, chloroplastic [Vi...    86   5e-15
ref|XP_003553154.1| PREDICTED: protein TIC 40, chloroplastic-lik...    86   7e-15
ref|XP_003538352.1| PREDICTED: protein TIC 40, chloroplastic-lik...    82   6e-14
ref|XP_004500418.1| PREDICTED: protein TIC 40, chloroplastic-lik...    82   8e-14
ref|XP_006350341.1| PREDICTED: protein TIC 40, chloroplastic-lik...    82   1e-13
ref|XP_007146939.1| hypothetical protein PHAVU_006G083300g [Phas...    81   2e-13
ref|XP_007146938.1| hypothetical protein PHAVU_006G083300g [Phas...    81   2e-13
ref|XP_007146937.1| hypothetical protein PHAVU_006G083300g [Phas...    81   2e-13
ref|XP_007032984.1| Hydroxyproline-rich glycoprotein family prot...    80   4e-13
ref|XP_007032983.1| Hydroxyproline-rich glycoprotein family prot...    80   4e-13
emb|CAB50925.1| translocon Tic40 [Pisum sativum]                       79   5e-13
sp|Q8GT66.1|TIC40_PEA RecName: Full=Protein TIC 40, chloroplasti...    79   5e-13
ref|XP_004250413.1| PREDICTED: protein TIC 40, chloroplastic-lik...    79   8e-13
ref|XP_007032982.1| Hydroxyproline-rich glycoprotein family prot...    76   4e-12
ref|XP_007032985.1| Hydroxyproline-rich glycoprotein family prot...    76   5e-12
gb|EPS65548.1| hypothetical protein M569_09229, partial [Genlise...    75   7e-12
ref|XP_002531917.1| conserved hypothetical protein [Ricinus comm...    74   2e-11
gb|ABF19057.1| plastid Tic40 [Ricinus communis]                        74   2e-11
gb|EXB68023.1| hypothetical protein L484_009630 [Morus notabilis]      72   6e-11

>gb|EYU28232.1| hypothetical protein MIMGU_mgv1a006810mg [Mimulus guttatus]
          Length = 430

 Score =  125 bits (313), Expect = 8e-27
 Identities = 88/178 (49%), Positives = 101/178 (56%), Gaps = 7/178 (3%)
 Frame = -1

Query: 513 TFTQQMNTQNNPFGNXXXXXXXXXXXXXXXXXXXXSDSFKTGAP-SFQTTTS---SPFKS 346
           TFTQQMNTQN+PFGN                      +F  G+P  F   TS    PF++
Sbjct: 141 TFTQQMNTQNSPFGNA---------------------AFSPGSPFPFPPATSPALDPFRT 179

Query: 345 GA--ASQSVT-DVPVTKVEDPPSTSVKDKVEPEDAPKKYAFKDVSPEETVQKNAFEEDYK 175
               ASQ +T DVP +KVEDPPS SVKD+VE E  PKKYAF DVSPEET+QKNAF E+YK
Sbjct: 180 STPLASQPITVDVPASKVEDPPSISVKDEVEQETGPKKYAFVDVSPEETLQKNAF-ENYK 238

Query: 174 ESSVQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSVDALEKMMEDPT 1
           E S+QTDSP +         QN                    PLMSV+ALEKMMEDPT
Sbjct: 239 E-SIQTDSPKD-PQSSQSVSQNGTAWNQGAGGSEGPTTSKTAPLMSVEALEKMMEDPT 294


>ref|XP_002282574.1| PREDICTED: protein TIC 40, chloroplastic [Vitis vinifera]
           gi|296089465|emb|CBI39284.3| unnamed protein product
           [Vitis vinifera]
          Length = 436

 Score = 85.9 bits (211), Expect = 5e-15
 Identities = 63/172 (36%), Positives = 77/172 (44%), Gaps = 1/172 (0%)
 Frame = -1

Query: 513 TFTQQMNTQNNPFGNXXXXXXXXXXXXXXXXXXXXSD-SFKTGAPSFQTTTSSPFKSGAA 337
           T   QM++QNN F                      +  S  T +PS  TT  SP    A 
Sbjct: 132 TLMGQMDSQNNQFNTTTFSPGSPFPFPMPPPSGPSTSHSGPTTSPSGPTT--SPSTVAAQ 189

Query: 336 SQSVTDVPVTKVEDPPSTSVKDKVEPEDAPKKYAFKDVSPEETVQKNAFEEDYKESSVQT 157
           S    DVP TKVE PP+T VKD +E ++   KYAF DVSPEET+Q++ FE    E S +T
Sbjct: 190 SMVTVDVPATKVETPPATDVKDDIEKKNEQNKYAFVDVSPEETLQESPFEN--FEESTET 247

Query: 156 DSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSVDALEKMMEDPT 1
            S  +                               P +SVDALEKMMEDPT
Sbjct: 248 SSSKDAQFSAGVSQNGTPPRPGMGVSEDSQSTRNANPFLSVDALEKMMEDPT 299


>ref|XP_003553154.1| PREDICTED: protein TIC 40, chloroplastic-like [Glycine max]
          Length = 432

 Score = 85.5 bits (210), Expect = 7e-15
 Identities = 68/186 (36%), Positives = 88/186 (47%), Gaps = 15/186 (8%)
 Frame = -1

Query: 513 TFTQQMNTQNNPFGNXXXXXXXXXXXXXXXXXXXXSDSFKTGAPSFQTTTSS--PFKSGA 340
           T   QMN+QNN FGN                           AP+   TT S  P  S A
Sbjct: 135 TMMGQMNSQNNQFGNAAFSPGSPFPFPMPTAAGPT-------APASSATTQSRAPSASSA 187

Query: 339 ASQSVT-DVPVTKVEDPPSTSVKDKVEPEDAPKKYAFKDVSPEETVQKNAFE--EDYKES 169
           +  ++T D+P  KVE  P+T+VKD+VE ++ PKK AF DVSPEETVQ++ FE  +D + S
Sbjct: 188 SQSTITVDIPAAKVEVAPTTNVKDEVEVKNEPKKIAFVDVSPEETVQESPFESFKDDESS 247

Query: 168 SV----------QTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSVDALEK 19
           SV          Q  +P+N         Q+                     ++SVDALEK
Sbjct: 248 SVKEARVPDEVSQNGAPSNQGFGDFPGSQS-----------------TKKSVLSVDALEK 290

Query: 18  MMEDPT 1
           MMEDPT
Sbjct: 291 MMEDPT 296


>ref|XP_003538352.1| PREDICTED: protein TIC 40, chloroplastic-like [Glycine max]
          Length = 429

 Score = 82.4 bits (202), Expect = 6e-14
 Identities = 67/186 (36%), Positives = 87/186 (46%), Gaps = 15/186 (8%)
 Frame = -1

Query: 513 TFTQQMNTQNNPFGNXXXXXXXXXXXXXXXXXXXXSDSFKTGAPSFQTTTSS--PFKSGA 340
           T   QMN+QNN FGN                           AP+   TT S  P  S A
Sbjct: 132 TMMGQMNSQNNQFGNAAFSPGSPFPFPMPTAAGPT-------APASSATTQSRAPSASSA 184

Query: 339 ASQSVT-DVPVTKVEDPPSTSVKDKVEPEDAPKKYAFKDVSPEETVQKNAFE--EDYKES 169
           +  ++T D+P  KVE  P+T+VKD+VE ++ PKK AF DVSPEETV+++ FE  +D + S
Sbjct: 185 SQSTITVDLPAAKVEAAPTTNVKDEVELKNEPKKIAFVDVSPEETVRESPFESFKDDESS 244

Query: 168 SV----------QTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSVDALEK 19
           SV          Q  +P+N         Q+                      +SVDALEK
Sbjct: 245 SVKEAWVPDEVSQNGAPSNLGFGDFPGSQS-----------------TKKSALSVDALEK 287

Query: 18  MMEDPT 1
           MMEDPT
Sbjct: 288 MMEDPT 293


>ref|XP_004500418.1| PREDICTED: protein TIC 40, chloroplastic-like [Cicer arietinum]
          Length = 433

 Score = 82.0 bits (201), Expect = 8e-14
 Identities = 62/184 (33%), Positives = 83/184 (45%), Gaps = 13/184 (7%)
 Frame = -1

Query: 513 TFTQQMNTQNNPFGNXXXXXXXXXXXXXXXXXXXXSDSFKTGAPSFQTTTSSPFKSGAAS 334
           T   QMNTQNNPF +                    + +   G  S  T+     ++ + S
Sbjct: 134 TMMGQMNTQNNPFDSAAFSPGSPFPFPMPSSSGPAAPASSAGTQSQSTSA----RTASQS 189

Query: 333 QSVTDVPVTKVEDPPSTSVKDKVEPEDAPKKYAFKDVSPEETVQKNAFE--EDYKESS-- 166
               D+P TKVE  PST+ KD+VE ++ PKK  F DVSPEE+VQK+ FE  +D  ESS  
Sbjct: 190 TVTVDIPATKVEAAPSTNAKDEVEVKNEPKKIGFVDVSPEESVQKSPFESFKDVDESSSF 249

Query: 165 ---------VQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSVDALEKMM 13
                     Q  +P+N         Q+                     ++SV+ALEKMM
Sbjct: 250 KEARAPAEAFQNGAPSNQGFGNSPGSQS-----------------GGKSVLSVEALEKMM 292

Query: 12  EDPT 1
           EDPT
Sbjct: 293 EDPT 296


>ref|XP_006350341.1| PREDICTED: protein TIC 40, chloroplastic-like [Solanum tuberosum]
          Length = 443

 Score = 81.6 bits (200), Expect = 1e-13
 Identities = 63/172 (36%), Positives = 79/172 (45%), Gaps = 1/172 (0%)
 Frame = -1

Query: 513 TFTQQMNTQNNPFGNXXXXXXXXXXXXXXXXXXXXSDSFKTGAPSFQTTTSSPFKSGAAS 334
           T   QMN QN+ F N                    S    +  P    +TSS   +  AS
Sbjct: 139 TMMGQMNGQNSQFSNNAFSPGPGSPFPFPFPPPPVSGPASSSPPPPTASTSSTPSASFAS 198

Query: 333 QSVT-DVPVTKVEDPPSTSVKDKVEPEDAPKKYAFKDVSPEETVQKNAFEEDYKESSVQT 157
           Q VT DV  TKVE+PP+ +VK+  E    PKK AF D+SP+ET QK AF E++K+S   T
Sbjct: 199 QPVTVDVSATKVEEPPTVNVKNDTEAGKEPKKNAFVDISPDETFQKGAF-ENFKDS---T 254

Query: 156 DSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSVDALEKMMEDPT 1
           ++ +                                PLMSVDALEKMMEDPT
Sbjct: 255 ETASVTVDQVTQNGAASQLGFGPNTSDSTSSTGKSNPLMSVDALEKMMEDPT 306


>ref|XP_007146939.1| hypothetical protein PHAVU_006G083300g [Phaseolus vulgaris]
           gi|561020162|gb|ESW18933.1| hypothetical protein
           PHAVU_006G083300g [Phaseolus vulgaris]
          Length = 300

 Score = 80.9 bits (198), Expect = 2e-13
 Identities = 61/173 (35%), Positives = 79/173 (45%), Gaps = 2/173 (1%)
 Frame = -1

Query: 513 TFTQQMNTQNNPFGNXXXXXXXXXXXXXXXXXXXXSDSFKTGAPSFQTTTSSPFKSGAAS 334
           T   QMN+ NN FGN                    + + + GAPS          SG+ S
Sbjct: 7   TMMGQMNSPNNDFGNAAFSPGSPFPFSMPSAAGPTATA-QYGAPSTS--------SGSQS 57

Query: 333 QSVTDVPVTKVEDPPSTSVKDKVEPEDAPKKYAFKDVSPEETVQKNAFE--EDYKESSVQ 160
               D+P TKVE   +T +KD+VE ++ PKK AF DVSPEETVQK+ FE  +D + SSV+
Sbjct: 58  TVTVDIPATKVEATRTTDIKDEVEVQNKPKKIAFVDVSPEETVQKSPFESVKDNESSSVK 117

Query: 159 TDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSVDALEKMMEDPT 1
            ++                                    +SVDALEKMMEDPT
Sbjct: 118 EEA------RVPDEVSQNGAPFNQGFGGFPGSQSTKKSALSVDALEKMMEDPT 164


>ref|XP_007146938.1| hypothetical protein PHAVU_006G083300g [Phaseolus vulgaris]
           gi|561020161|gb|ESW18932.1| hypothetical protein
           PHAVU_006G083300g [Phaseolus vulgaris]
          Length = 367

 Score = 80.9 bits (198), Expect = 2e-13
 Identities = 61/173 (35%), Positives = 79/173 (45%), Gaps = 2/173 (1%)
 Frame = -1

Query: 513 TFTQQMNTQNNPFGNXXXXXXXXXXXXXXXXXXXXSDSFKTGAPSFQTTTSSPFKSGAAS 334
           T   QMN+ NN FGN                    + + + GAPS          SG+ S
Sbjct: 137 TMMGQMNSPNNDFGNAAFSPGSPFPFSMPSAAGPTATA-QYGAPSTS--------SGSQS 187

Query: 333 QSVTDVPVTKVEDPPSTSVKDKVEPEDAPKKYAFKDVSPEETVQKNAFE--EDYKESSVQ 160
               D+P TKVE   +T +KD+VE ++ PKK AF DVSPEETVQK+ FE  +D + SSV+
Sbjct: 188 TVTVDIPATKVEATRTTDIKDEVEVQNKPKKIAFVDVSPEETVQKSPFESVKDNESSSVK 247

Query: 159 TDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSVDALEKMMEDPT 1
            ++                                    +SVDALEKMMEDPT
Sbjct: 248 EEA------RVPDEVSQNGAPFNQGFGGFPGSQSTKKSALSVDALEKMMEDPT 294


>ref|XP_007146937.1| hypothetical protein PHAVU_006G083300g [Phaseolus vulgaris]
           gi|561020160|gb|ESW18931.1| hypothetical protein
           PHAVU_006G083300g [Phaseolus vulgaris]
          Length = 430

 Score = 80.9 bits (198), Expect = 2e-13
 Identities = 61/173 (35%), Positives = 79/173 (45%), Gaps = 2/173 (1%)
 Frame = -1

Query: 513 TFTQQMNTQNNPFGNXXXXXXXXXXXXXXXXXXXXSDSFKTGAPSFQTTTSSPFKSGAAS 334
           T   QMN+ NN FGN                    + + + GAPS          SG+ S
Sbjct: 137 TMMGQMNSPNNDFGNAAFSPGSPFPFSMPSAAGPTATA-QYGAPSTS--------SGSQS 187

Query: 333 QSVTDVPVTKVEDPPSTSVKDKVEPEDAPKKYAFKDVSPEETVQKNAFE--EDYKESSVQ 160
               D+P TKVE   +T +KD+VE ++ PKK AF DVSPEETVQK+ FE  +D + SSV+
Sbjct: 188 TVTVDIPATKVEATRTTDIKDEVEVQNKPKKIAFVDVSPEETVQKSPFESVKDNESSSVK 247

Query: 159 TDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSVDALEKMMEDPT 1
            ++                                    +SVDALEKMMEDPT
Sbjct: 248 EEA------RVPDEVSQNGAPFNQGFGGFPGSQSTKKSALSVDALEKMMEDPT 294


>ref|XP_007032984.1| Hydroxyproline-rich glycoprotein family protein isoform 3
           [Theobroma cacao] gi|508712013|gb|EOY03910.1|
           Hydroxyproline-rich glycoprotein family protein isoform
           3 [Theobroma cacao]
          Length = 368

 Score = 79.7 bits (195), Expect = 4e-13
 Identities = 64/174 (36%), Positives = 75/174 (43%), Gaps = 3/174 (1%)
 Frame = -1

Query: 513 TFTQQMNTQNNPFGNXXXXXXXXXXXXXXXXXXXXSDSFKTGAPSFQTTTSSPFKSGAAS 334
           T   QMNTQNN F N                       F   AP      +SP  S   +
Sbjct: 144 TMMGQMNTQNNQFSNAAFPLG---------------SPFPFPAPPSPGPVTSPSPSSQTA 188

Query: 333 QSVTDVPVTKVEDPPSTSVKDKVEPEDA---PKKYAFKDVSPEETVQKNAFEEDYKESSV 163
            +V DVP TKVE  P+T+   +V+ E     PKKYAF DVSPEETVQK+AFE+     + 
Sbjct: 189 VTV-DVPATKVEAAPATAPATEVKSETETAEPKKYAFVDVSPEETVQKSAFED-----AA 242

Query: 162 QTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSVDALEKMMEDPT 1
              S NN                               P +SVDALEKMMEDPT
Sbjct: 243 GISSSNNTQFPKDVSDNGAASKQDAGAFGGSQSTGSADPALSVDALEKMMEDPT 296


>ref|XP_007032983.1| Hydroxyproline-rich glycoprotein family protein isoform 2
           [Theobroma cacao] gi|508712012|gb|EOY03909.1|
           Hydroxyproline-rich glycoprotein family protein isoform
           2 [Theobroma cacao]
          Length = 433

 Score = 79.7 bits (195), Expect = 4e-13
 Identities = 64/174 (36%), Positives = 75/174 (43%), Gaps = 3/174 (1%)
 Frame = -1

Query: 513 TFTQQMNTQNNPFGNXXXXXXXXXXXXXXXXXXXXSDSFKTGAPSFQTTTSSPFKSGAAS 334
           T   QMNTQNN F N                       F   AP      +SP  S   +
Sbjct: 144 TMMGQMNTQNNQFSNAAFPLG---------------SPFPFPAPPSPGPVTSPSPSSQTA 188

Query: 333 QSVTDVPVTKVEDPPSTSVKDKVEPEDA---PKKYAFKDVSPEETVQKNAFEEDYKESSV 163
            +V DVP TKVE  P+T+   +V+ E     PKKYAF DVSPEETVQK+AFE+     + 
Sbjct: 189 VTV-DVPATKVEAAPATAPATEVKSETETAEPKKYAFVDVSPEETVQKSAFED-----AA 242

Query: 162 QTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSVDALEKMMEDPT 1
              S NN                               P +SVDALEKMMEDPT
Sbjct: 243 GISSSNNTQFPKDVSDNGAASKQDAGAFGGSQSTGSADPALSVDALEKMMEDPT 296


>emb|CAB50925.1| translocon Tic40 [Pisum sativum]
          Length = 436

 Score = 79.3 bits (194), Expect = 5e-13
 Identities = 61/178 (34%), Positives = 78/178 (43%), Gaps = 11/178 (6%)
 Frame = -1

Query: 501 QMNTQNNPFGNXXXXXXXXXXXXXXXXXXXXSDSFKTGAPSFQTTTSSPFKSGAASQSVT 322
           QMNTQNNPF +                    + +   G  S  T+T    +S + S    
Sbjct: 138 QMNTQNNPFDSGAFSSGPPFPFPMPSASGPATPAGFAGNQSQATST----RSASQSTVTV 193

Query: 321 DVPVTKVE---DPPSTSVKDKVEPEDAPKKYAFKDVSPEETVQKNAFE--------EDYK 175
           D+P TKVE     P  +VK++VE ++ PKK AF DVSPEETVQKNAFE          +K
Sbjct: 194 DIPATKVEAAAPAPDINVKEEVEVKNEPKKSAFVDVSPEETVQKNAFERFKDVDESSSFK 253

Query: 174 ESSVQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSVDALEKMMEDPT 1
           E+    ++  N                                 +SVDALEKMMEDPT
Sbjct: 254 EARAPAEASQN------------GTPFKQGFGDSPGSPSERKSALSVDALEKMMEDPT 299


>sp|Q8GT66.1|TIC40_PEA RecName: Full=Protein TIC 40, chloroplastic; AltName:
           Full=Translocon at the inner envelope membrane of
           chloroplasts 40; Short=PsTIC40; Flags: Precursor
           gi|26000725|gb|AAN75219.1| chloroplast protein
           translocon component Tic40 precursor [Pisum sativum]
          Length = 436

 Score = 79.3 bits (194), Expect = 5e-13
 Identities = 61/178 (34%), Positives = 78/178 (43%), Gaps = 11/178 (6%)
 Frame = -1

Query: 501 QMNTQNNPFGNXXXXXXXXXXXXXXXXXXXXSDSFKTGAPSFQTTTSSPFKSGAASQSVT 322
           QMNTQNNPF +                    + +   G  S  T+T    +S + S    
Sbjct: 138 QMNTQNNPFDSGAFSSGPPFPFPMPSASGPATPAGFAGNQSQATST----RSASQSTVTV 193

Query: 321 DVPVTKVE---DPPSTSVKDKVEPEDAPKKYAFKDVSPEETVQKNAFE--------EDYK 175
           D+P TKVE     P  +VK++VE ++ PKK AF DVSPEETVQKNAFE          +K
Sbjct: 194 DIPATKVEAAAPAPDINVKEEVEVKNEPKKSAFVDVSPEETVQKNAFERFKDVDESSSFK 253

Query: 174 ESSVQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSVDALEKMMEDPT 1
           E+    ++  N                                 +SVDALEKMMEDPT
Sbjct: 254 EARAPAEASQN------------GTPFKQGFGDSPSSPSERKSALSVDALEKMMEDPT 299


>ref|XP_004250413.1| PREDICTED: protein TIC 40, chloroplastic-like [Solanum
           lycopersicum]
          Length = 443

 Score = 78.6 bits (192), Expect = 8e-13
 Identities = 63/176 (35%), Positives = 80/176 (45%), Gaps = 5/176 (2%)
 Frame = -1

Query: 513 TFTQQMNTQNNPFGNXXXXXXXXXXXXXXXXXXXXSDSFKTGAPSFQTTTSSPFKSGAAS 334
           T   QMN QN+ F N                    S    +  P    ++SS   +  AS
Sbjct: 139 TMMGQMNGQNSQFSNTAFSPGPGSPFPFPFPPPPVSGPASSSPPPPTASSSSTPSASFAS 198

Query: 333 QSVT-DVPVTKVEDPPSTSVKDKVEPEDAPKKYAFKDVSPEETVQKNAFEEDYKES---- 169
           Q VT DV  TKVE+PP+ +VK+  E E  PKK AF D+SP+ET QK AF E++K+S    
Sbjct: 199 QPVTVDVSATKVEEPPTVNVKNDKEAEKEPKKNAFVDISPDETFQKGAF-ENFKDSAETA 257

Query: 168 SVQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSVDALEKMMEDPT 1
           +V  D              +                    PL+SVDALEKMMEDPT
Sbjct: 258 AVTVDQVTQNGAASQSGFGS-------NTSDSTSSTGKSNPLLSVDALEKMMEDPT 306


>ref|XP_007032982.1| Hydroxyproline-rich glycoprotein family protein isoform 1
           [Theobroma cacao] gi|508712011|gb|EOY03908.1|
           Hydroxyproline-rich glycoprotein family protein isoform
           1 [Theobroma cacao]
          Length = 531

 Score = 76.3 bits (186), Expect = 4e-12
 Identities = 65/180 (36%), Positives = 75/180 (41%), Gaps = 9/180 (5%)
 Frame = -1

Query: 513 TFTQQMNTQNNPFGNXXXXXXXXXXXXXXXXXXXXSDSFKTGAPSFQTTTSSPFKSGAAS 334
           T   QMNTQNN F N                       F   AP      +SP  S   +
Sbjct: 203 TMMGQMNTQNNQFSNAAFPLG---------------SPFPFPAPPSPGPVTSPSPSSQTA 247

Query: 333 QSVTDVPVTKVEDPPSTSVKDKVEPEDA---PKKYAFKDVSPEETVQKNAFEEDYKESSV 163
            +V DVP TKVE  P+T+   +V+ E     PKKYAF DVSPEETVQK+AFE+    SS 
Sbjct: 248 VTV-DVPATKVEAAPATAPATEVKSETETAEPKKYAFVDVSPEETVQKSAFEDAAGISSS 306

Query: 162 QTD------SPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSVDALEKMMEDPT 1
                    S N                                P +SVDALEKMMEDPT
Sbjct: 307 NNTQFPKDVSDNGAASKQDAGAFGGSQSTVKLNKHPIALAGSADPALSVDALEKMMEDPT 366


>ref|XP_007032985.1| Hydroxyproline-rich glycoprotein family protein isoform 4, partial
           [Theobroma cacao] gi|508712014|gb|EOY03911.1|
           Hydroxyproline-rich glycoprotein family protein isoform
           4, partial [Theobroma cacao]
          Length = 412

 Score = 75.9 bits (185), Expect = 5e-12
 Identities = 64/174 (36%), Positives = 75/174 (43%), Gaps = 3/174 (1%)
 Frame = -1

Query: 513 TFTQQMNTQNNPFGNXXXXXXXXXXXXXXXXXXXXSDSFKTGAPSFQTTTSSPFKSGAAS 334
           T   QMNTQNN F N                       F   AP      +SP  S   +
Sbjct: 144 TMMGQMNTQNNQFSNAAFPLG---------------SPFPFPAPPSPGPVTSPSPSSQTA 188

Query: 333 QSVTDVPVTKVEDPPSTSVKDKVEPEDA---PKKYAFKDVSPEETVQKNAFEEDYKESSV 163
            +V DVP TKVE  P+T+   +V+ E     PKKYAF DVSPEETVQK+AFE+     + 
Sbjct: 189 VTV-DVPATKVEAAPATAPATEVKSETETAEPKKYAFVDVSPEETVQKSAFED-----AA 242

Query: 162 QTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSVDALEKMMEDPT 1
              S NN                               P +SVDALEKMMEDPT
Sbjct: 243 GISSSNN----------TQFPKDDAGAFGGSQSTGSADPALSVDALEKMMEDPT 286


>gb|EPS65548.1| hypothetical protein M569_09229, partial [Genlisea aurea]
          Length = 247

 Score = 75.5 bits (184), Expect = 7e-12
 Identities = 65/179 (36%), Positives = 81/179 (45%), Gaps = 8/179 (4%)
 Frame = -1

Query: 513 TFTQQMNTQNNPFGNXXXXXXXXXXXXXXXXXXXXSDSFKTGA----PSFQTTTSSPFKS 346
           + +QQMN QN  FG+                      +F  G+    PS   T SS    
Sbjct: 73  SLSQQMNAQNISFGST---------------------NFTPGSNFPFPSAMFTRSSSVAE 111

Query: 345 GA---ASQSVT-DVPVTKVEDPPSTSVKDKVEPEDAPKKYAFKDVSPEETVQKNAFEEDY 178
                 SQ VT +VP ++V++ P  SV  K   E + KKYAF DVSPE+T+QKNAFE   
Sbjct: 112 SVIPLTSQPVTVEVPKSEVKEIPPLSVDGKKPSETSQKKYAFVDVSPEDTLQKNAFEN-- 169

Query: 177 KESSVQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSVDALEKMMEDPT 1
            E SV+TDS  N          +                     L+SVDALEKMMEDPT
Sbjct: 170 YEESVKTDSEAN-TSRPVTASASKREPVTATTASASSTTSKASSLLSVDALEKMMEDPT 227


>ref|XP_002531917.1| conserved hypothetical protein [Ricinus communis]
           gi|223528427|gb|EEF30461.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 465

 Score = 73.9 bits (180), Expect = 2e-11
 Identities = 63/178 (35%), Positives = 78/178 (43%), Gaps = 11/178 (6%)
 Frame = -1

Query: 501 QMNTQNNPFGNXXXXXXXXXXXXXXXXXXXXSDS-FKTGA-------PSFQTTTSSPFKS 346
           QMNTQN+ F N                    S   F T +       PS+ T+++S   S
Sbjct: 157 QMNTQNDQFNNPAFSPGSAFPFPTPPASVPASSPPFPTSSTSRPATSPSYPTSSASTSPS 216

Query: 345 GAASQSVT-DVPVTKVEDPPSTSVKDKVEPEDAPKKYAFKDVSPEETVQKNAFE--EDYK 175
            A+  +VT DV  TKVE    T  KD+ E    PKKYAF DVSPEET  K+ F+  ED  
Sbjct: 217 VASQPAVTVDVSATKVEAASVTDAKDEAEITKEPKKYAFVDVSPEETFPKSPFKSNEDIL 276

Query: 174 ESSVQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSVDALEKMMEDPT 1
           E+S   D+  N          N                      +SV+ALEKMMEDPT
Sbjct: 277 ETSTSKDTQFNPEVLQNGAASNQGAADFTGSQSTRKAGSG----LSVEALEKMMEDPT 330


>gb|ABF19057.1| plastid Tic40 [Ricinus communis]
          Length = 460

 Score = 73.9 bits (180), Expect = 2e-11
 Identities = 63/178 (35%), Positives = 78/178 (43%), Gaps = 11/178 (6%)
 Frame = -1

Query: 501 QMNTQNNPFGNXXXXXXXXXXXXXXXXXXXXSDS-FKTGA-------PSFQTTTSSPFKS 346
           QMNTQN+ F N                    S   F T +       PS+ T+++S   S
Sbjct: 152 QMNTQNDQFNNPAFSPGSAFPFPTPPASVPASSPPFPTSSTSRPATSPSYPTSSASTSPS 211

Query: 345 GAASQSVT-DVPVTKVEDPPSTSVKDKVEPEDAPKKYAFKDVSPEETVQKNAFE--EDYK 175
            A+  +VT DV  TKVE    T  KD+ E    PKKYAF DVSPEET  K+ F+  ED  
Sbjct: 212 VASQPAVTVDVSATKVEAASVTDAKDEAEITKEPKKYAFVDVSPEETFPKSPFKSNEDIL 271

Query: 174 ESSVQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSVDALEKMMEDPT 1
           E+S   D+  N          N                      +SV+ALEKMMEDPT
Sbjct: 272 ETSTSKDTQFNPEVLQNGAASNQGAADFTGSQSTRKAGSG----LSVEALEKMMEDPT 325


>gb|EXB68023.1| hypothetical protein L484_009630 [Morus notabilis]
          Length = 391

 Score = 72.4 bits (176), Expect = 6e-11
 Identities = 61/180 (33%), Positives = 72/180 (40%), Gaps = 9/180 (5%)
 Frame = -1

Query: 513 TFTQQMNTQNNPFGNXXXXXXXXXXXXXXXXXXXXSDSFKTGAPSFQTTTSSPFKSGAAS 334
           T   QMNTQNN F N                      +F  G P F     SP  SG AS
Sbjct: 116 TLMGQMNTQNNQFNNA---------------------AFSPGTP-FPFPPPSPSPSGLAS 153

Query: 333 QS---------VTDVPVTKVEDPPSTSVKDKVEPEDAPKKYAFKDVSPEETVQKNAFEED 181
                        DV  T VE  P+  VKD+ E +   KK+AF DVSPEET QK+ FE  
Sbjct: 154 TPRPAAFQPAVTVDVAATTVEATPAADVKDETEQKTEAKKFAFVDVSPEETKQKSPFESS 213

Query: 180 YKESSVQTDSPNNYXXXXXXXXQNXXXXXXXXXXXXXXXXXXXXPLMSVDALEKMMEDPT 1
            K++  +T S N                                  +SV+ALEKMMEDPT
Sbjct: 214 LKDAE-ETISSNE---GPTAGVSQNGTTSKHGVGASQESPPRQESTISVEALEKMMEDPT 269

