BLASTX nr result

ID: Glycyrrhiza31_contig00003627 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza31_contig00003627
         (1361 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_004511282.1 PREDICTED: presequence protease 1, chloroplastic/...   846   0.0  
GAU29533.1 hypothetical protein TSUD_115550 [Trifolium subterran...   840   0.0  
XP_003517606.1 PREDICTED: presequence protease 2, chloroplastic/...   840   0.0  
XP_013453279.1 presequence protease [Medicago truncatula] KEH273...   837   0.0  
KHN30412.1 Presequence protease 2, chloroplastic/mitochondrial [...   830   0.0  
KYP45082.1 hypothetical protein KK1_033364 [Cajanus cajan]            829   0.0  
XP_016175065.1 PREDICTED: presequence protease 1, chloroplastic/...   827   0.0  
XP_007157239.1 hypothetical protein PHAVU_002G054400g [Phaseolus...   825   0.0  
XP_017406762.1 PREDICTED: presequence protease 1, chloroplastic/...   824   0.0  
XP_014520661.1 PREDICTED: presequence protease 1, chloroplastic/...   823   0.0  
XP_015940159.1 PREDICTED: presequence protease 1, chloroplastic/...   822   0.0  
BAU00865.1 hypothetical protein VIGAN_10250100 [Vigna angularis ...   819   0.0  
EOX98218.1 Presequence protease 2 isoform 4 [Theobroma cacao]         799   0.0  
KJB77681.1 hypothetical protein B456_012G150300 [Gossypium raimo...   800   0.0  
EOX98219.1 Presequence protease 2 isoform 5 [Theobroma cacao]         799   0.0  
EOX98215.1 Presequence protease 2 isoform 1 [Theobroma cacao]         799   0.0  
EOX98216.1 Presequence protease 2 isoform 2 [Theobroma cacao]         799   0.0  
EOX98217.1 Presequence protease 2 isoform 3 [Theobroma cacao]         799   0.0  
XP_012459281.1 PREDICTED: presequence protease 2, chloroplastic/...   800   0.0  
XP_017615507.1 PREDICTED: presequence protease 2, chloroplastic/...   800   0.0  

>XP_004511282.1 PREDICTED: presequence protease 1, chloroplastic/mitochondrial-like
            [Cicer arietinum]
          Length = 1080

 Score =  846 bits (2186), Expect = 0.0
 Identities = 414/453 (91%), Positives = 429/453 (94%)
 Frame = +2

Query: 2    NAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDLQTFQQEGWHFELNDPSEDI 181
            NAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFP+CV+DLQTFQQEGWH+ELN PSEDI
Sbjct: 194  NAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPKCVDDLQTFQQEGWHYELNHPSEDI 253

Query: 182  TYKGVVFNEMKGVYSQPDNILGRTAQQALFPDTTYGVDSGGDPQVIPKLTFEEFKEFHRK 361
            TYKGVVFNEMKGVYSQPDNILGR AQQALFPD TYGVDSGGDP+VIP LTFEEFKEFHRK
Sbjct: 254  TYKGVVFNEMKGVYSQPDNILGRAAQQALFPDNTYGVDSGGDPRVIPNLTFEEFKEFHRK 313

Query: 362  YYHPSNSRIWFYGDDDPNERLRILSEYLDMFDASSAPNESKVEPQKLFSKPVRIVETYPA 541
            YYHPSNSRIWFYGDDDPNERLRILSEYL+MFDASSAPNESKVEPQKLFSKP+RIVETYPA
Sbjct: 314  YYHPSNSRIWFYGDDDPNERLRILSEYLNMFDASSAPNESKVEPQKLFSKPIRIVETYPA 373

Query: 542  GEGSDLKKHMVCLNWLLSDKPLDMETEXXXXXXXXXXXXXPASPLRKILLESGLGDAIVG 721
            GEG DLKKHMVCLNWLL+DKPLD+ETE             PASPLRK+LLES LGDAIVG
Sbjct: 374  GEGGDLKKHMVCLNWLLADKPLDLETELALGFLNHLLLGTPASPLRKVLLESRLGDAIVG 433

Query: 722  GGLEDELLQPQFSIGMKGVSEDDIHKVEELITSTLKKLAEEGFDTDAIEASMNTIEFSLR 901
            GGLEDELLQPQFSIGMKGVSEDDIHKVEELI STLKKLAEEGFDTDAIEASMNTIEFSLR
Sbjct: 434  GGLEDELLQPQFSIGMKGVSEDDIHKVEELIMSTLKKLAEEGFDTDAIEASMNTIEFSLR 493

Query: 902  ENNTGSFPRGLSLMLQSIGKWVYDMNPFEPLKYEKPLQDLKSKIAKEGSKSVFSPLIEKF 1081
            ENNTGSFPRGLSLMLQSIGKW+YDMNP EPLKYEKPLQDLKSKIAKEGSKSVFSPLIEKF
Sbjct: 494  ENNTGSFPRGLSLMLQSIGKWIYDMNPLEPLKYEKPLQDLKSKIAKEGSKSVFSPLIEKF 553

Query: 1082 ILNNSHQVTVEMQPDPEKAARDEATEKQILQKVKASMTTEDLAELTRATYELRLKQETPD 1261
            ILNN H+VTV+MQPDPEKAARDE TEKQ+LQK+KASMTTEDLAEL RAT+ELRLKQETPD
Sbjct: 554  ILNNPHKVTVQMQPDPEKAARDEETEKQVLQKIKASMTTEDLAELARATHELRLKQETPD 613

Query: 1262 PPEALKTVPSLSLQDIPKEPIRVPTEVGDINGV 1360
            PPEALKTVPSLSLQDIPKEPIRVPTEVGDINGV
Sbjct: 614  PPEALKTVPSLSLQDIPKEPIRVPTEVGDINGV 646


>GAU29533.1 hypothetical protein TSUD_115550 [Trifolium subterraneum]
          Length = 1056

 Score =  840 bits (2171), Expect = 0.0
 Identities = 417/459 (90%), Positives = 429/459 (93%), Gaps = 6/459 (1%)
 Frame = +2

Query: 2    NAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDLQTFQQEGWHFELNDPSEDI 181
            NAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFP+CV+D+QTFQQEGWH+ELN PSEDI
Sbjct: 197  NAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPKCVQDIQTFQQEGWHYELNHPSEDI 256

Query: 182  TYKGVVFNEMKGVYSQPDNILGRTAQQALF------PDTTYGVDSGGDPQVIPKLTFEEF 343
            TYKGVVFNEMKGVYSQPDNILGR AQQA F      PD TYGVDSGGDPQVIPKLTFEEF
Sbjct: 257  TYKGVVFNEMKGVYSQPDNILGRAAQQASFFLSALCPDNTYGVDSGGDPQVIPKLTFEEF 316

Query: 344  KEFHRKYYHPSNSRIWFYGDDDPNERLRILSEYLDMFDASSAPNESKVEPQKLFSKPVRI 523
            KEFHRKYYHPSNSRIWFYGDDDPNERLRILSEYLDMFDASSAP+ESKVEPQKLFSKPVRI
Sbjct: 317  KEFHRKYYHPSNSRIWFYGDDDPNERLRILSEYLDMFDASSAPSESKVEPQKLFSKPVRI 376

Query: 524  VETYPAGEGSDLKKHMVCLNWLLSDKPLDMETEXXXXXXXXXXXXXPASPLRKILLESGL 703
            +ETYPAGEG DLKKHMVCLNWLLSDKPLD+ETE             PASPLRKILLES L
Sbjct: 377  IETYPAGEGGDLKKHMVCLNWLLSDKPLDLETELTLGFLNHLLLGTPASPLRKILLESRL 436

Query: 704  GDAIVGGGLEDELLQPQFSIGMKGVSEDDIHKVEELITSTLKKLAEEGFDTDAIEASMNT 883
            GDAIVGGGLEDELLQPQFSIGMKGVSEDDIHKVEELI STLKKLAEEGFDTDAIEASMNT
Sbjct: 437  GDAIVGGGLEDELLQPQFSIGMKGVSEDDIHKVEELIMSTLKKLAEEGFDTDAIEASMNT 496

Query: 884  IEFSLRENNTGSFPRGLSLMLQSIGKWVYDMNPFEPLKYEKPLQDLKSKIAKEGSKSVFS 1063
            IEFSLRENNTGSFPRGLSLMLQSIGKW+YDMNP EPLKYEKPLQDLKSKIAKEGSK VFS
Sbjct: 497  IEFSLRENNTGSFPRGLSLMLQSIGKWIYDMNPLEPLKYEKPLQDLKSKIAKEGSKFVFS 556

Query: 1064 PLIEKFILNNSHQVTVEMQPDPEKAARDEATEKQILQKVKASMTTEDLAELTRATYELRL 1243
            PLIEKFILNN H+VTV+MQPDPEKAARDEATEKQILQ+VKASMTTEDLAELTRAT+ELRL
Sbjct: 557  PLIEKFILNNPHKVTVQMQPDPEKAARDEATEKQILQEVKASMTTEDLAELTRATHELRL 616

Query: 1244 KQETPDPPEALKTVPSLSLQDIPKEPIRVPTEVGDINGV 1360
            KQETPDPPEALKTVPSLSLQDIPKEPI VPTEVGDINGV
Sbjct: 617  KQETPDPPEALKTVPSLSLQDIPKEPIHVPTEVGDINGV 655


>XP_003517606.1 PREDICTED: presequence protease 2, chloroplastic/mitochondrial
            [Glycine max] KRH77969.1 hypothetical protein
            GLYMA_01G244900 [Glycine max]
          Length = 1078

 Score =  840 bits (2171), Expect = 0.0
 Identities = 416/454 (91%), Positives = 427/454 (94%), Gaps = 1/454 (0%)
 Frame = +2

Query: 2    NAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDLQTFQQEGWHFELNDPSEDI 181
            NAFTYPDRTCYPVASTN KDFYNLVDVYLDAVFFPRCVED Q FQQEGWHFELNDPSEDI
Sbjct: 192  NAFTYPDRTCYPVASTNAKDFYNLVDVYLDAVFFPRCVEDFQIFQQEGWHFELNDPSEDI 251

Query: 182  TYKGVVFNEMKGVYSQPDNILGRTAQQALFPDTTYGVDSGGDPQVIPKLTFEEFKEFHRK 361
            TYKGVVFNEMKGVYSQPDNILGR AQQALFPDTTYGVDSGGDP+VIPKLTFEEFKEFHRK
Sbjct: 252  TYKGVVFNEMKGVYSQPDNILGRAAQQALFPDTTYGVDSGGDPRVIPKLTFEEFKEFHRK 311

Query: 362  YYHPSNSRIWFYGDDDPNERLRILSEYLDMFDASSAPNESKVEPQKLFSKPVRIVETYPA 541
            YYHPSNSRIWFYGDDDPNERLRILSEYLD+FD+S A +ES+VEPQ LFSKPVRIVETYPA
Sbjct: 312  YYHPSNSRIWFYGDDDPNERLRILSEYLDLFDSSLASHESRVEPQTLFSKPVRIVETYPA 371

Query: 542  GEGSDLKK-HMVCLNWLLSDKPLDMETEXXXXXXXXXXXXXPASPLRKILLESGLGDAIV 718
            GEG DLKK HMVCLNWLLSDKPLD+ETE             PASPLRKILLES LGDAIV
Sbjct: 372  GEGGDLKKKHMVCLNWLLSDKPLDLETELTLGFLNHLLLGTPASPLRKILLESRLGDAIV 431

Query: 719  GGGLEDELLQPQFSIGMKGVSEDDIHKVEELITSTLKKLAEEGFDTDAIEASMNTIEFSL 898
            GGG+EDELLQPQFSIGMKGVSEDDIHKVEEL+TSTLKKLAEEGFDTDAIEASMNTIEFSL
Sbjct: 432  GGGVEDELLQPQFSIGMKGVSEDDIHKVEELVTSTLKKLAEEGFDTDAIEASMNTIEFSL 491

Query: 899  RENNTGSFPRGLSLMLQSIGKWVYDMNPFEPLKYEKPLQDLKSKIAKEGSKSVFSPLIEK 1078
            RENNTGSFPRGLSLMLQSIGKW+YDMNPFEPLKYEKPLQDLKS+IAKEGSKSVFSPLIEK
Sbjct: 492  RENNTGSFPRGLSLMLQSIGKWIYDMNPFEPLKYEKPLQDLKSRIAKEGSKSVFSPLIEK 551

Query: 1079 FILNNSHQVTVEMQPDPEKAARDEATEKQILQKVKASMTTEDLAELTRATYELRLKQETP 1258
            FILNN HQVTVEMQPDPEKAARDE  EKQILQKVKASMTTEDLAEL RAT+ELRLKQETP
Sbjct: 552  FILNNPHQVTVEMQPDPEKAARDEVAEKQILQKVKASMTTEDLAELARATHELRLKQETP 611

Query: 1259 DPPEALKTVPSLSLQDIPKEPIRVPTEVGDINGV 1360
            DPPEALKTVPSLSLQDIPKEPIRVPTEVGDINGV
Sbjct: 612  DPPEALKTVPSLSLQDIPKEPIRVPTEVGDINGV 645


>XP_013453279.1 presequence protease [Medicago truncatula] KEH27308.1 presequence
            protease [Medicago truncatula]
          Length = 1077

 Score =  837 bits (2161), Expect = 0.0
 Identities = 413/453 (91%), Positives = 426/453 (94%)
 Frame = +2

Query: 2    NAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDLQTFQQEGWHFELNDPSEDI 181
            NAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFP+CVED+QTFQQEGWH+ELN PSEDI
Sbjct: 191  NAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPKCVEDVQTFQQEGWHYELNHPSEDI 250

Query: 182  TYKGVVFNEMKGVYSQPDNILGRTAQQALFPDTTYGVDSGGDPQVIPKLTFEEFKEFHRK 361
            TYKGVVFNEMKGVYSQPDNILGR +QQALFPD TYGVDSGGDPQVIPKLTFEEFKEFHRK
Sbjct: 251  TYKGVVFNEMKGVYSQPDNILGRASQQALFPDNTYGVDSGGDPQVIPKLTFEEFKEFHRK 310

Query: 362  YYHPSNSRIWFYGDDDPNERLRILSEYLDMFDASSAPNESKVEPQKLFSKPVRIVETYPA 541
            YYHPSNSRIWFYGDDDP ERLRILSEYLDMFDASS+PNESK+EPQKLFSKPVRIVETYPA
Sbjct: 311  YYHPSNSRIWFYGDDDPTERLRILSEYLDMFDASSSPNESKIEPQKLFSKPVRIVETYPA 370

Query: 542  GEGSDLKKHMVCLNWLLSDKPLDMETEXXXXXXXXXXXXXPASPLRKILLESGLGDAIVG 721
            GEG DLKKHMV LNWLLSDKPLD+ETE             PASPLRKILLES LGDAIVG
Sbjct: 371  GEGGDLKKHMVSLNWLLSDKPLDLETELALSFLNHLLLGTPASPLRKILLESRLGDAIVG 430

Query: 722  GGLEDELLQPQFSIGMKGVSEDDIHKVEELITSTLKKLAEEGFDTDAIEASMNTIEFSLR 901
            GGLEDELLQPQFSIGMKGVSEDDI KVEELI +TLKKL EEGFDTDAIEASMNTIEFSLR
Sbjct: 431  GGLEDELLQPQFSIGMKGVSEDDIPKVEELIVNTLKKLVEEGFDTDAIEASMNTIEFSLR 490

Query: 902  ENNTGSFPRGLSLMLQSIGKWVYDMNPFEPLKYEKPLQDLKSKIAKEGSKSVFSPLIEKF 1081
            ENNTGSFPRGLSLMLQSIGKW+YDMNP EPLKYEKPLQDLKSKIAKEGSKSVFSPLIEKF
Sbjct: 491  ENNTGSFPRGLSLMLQSIGKWIYDMNPLEPLKYEKPLQDLKSKIAKEGSKSVFSPLIEKF 550

Query: 1082 ILNNSHQVTVEMQPDPEKAARDEATEKQILQKVKASMTTEDLAELTRATYELRLKQETPD 1261
            ILNN H+VTV+MQPDPEKAAR+EATEKQILQ+VKASMTTEDLAELTRAT ELRLKQETPD
Sbjct: 551  ILNNLHKVTVQMQPDPEKAAREEATEKQILQEVKASMTTEDLAELTRATQELRLKQETPD 610

Query: 1262 PPEALKTVPSLSLQDIPKEPIRVPTEVGDINGV 1360
            PPEALKTVPSLSLQDIPKEPI VPTEVGDINGV
Sbjct: 611  PPEALKTVPSLSLQDIPKEPIHVPTEVGDINGV 643


>KHN30412.1 Presequence protease 2, chloroplastic/mitochondrial [Glycine soja]
          Length = 1094

 Score =  830 bits (2144), Expect = 0.0
 Identities = 416/470 (88%), Positives = 427/470 (90%), Gaps = 17/470 (3%)
 Frame = +2

Query: 2    NAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDLQTFQQEGWHFELNDPSEDI 181
            NAFTYPDRTCYPVASTN KDFYNLVDVYLDAVFFPRCVED Q FQQEGWHFELNDPSEDI
Sbjct: 192  NAFTYPDRTCYPVASTNAKDFYNLVDVYLDAVFFPRCVEDFQIFQQEGWHFELNDPSEDI 251

Query: 182  TYKGVVFNEMKGVYSQPDNILGRTAQQA----------------LFPDTTYGVDSGGDPQ 313
            TYKGVVFNEMKGVYSQPDNILGR AQQA                LFPDTTYGVDSGGDP+
Sbjct: 252  TYKGVVFNEMKGVYSQPDNILGRAAQQASFLMACPFLIFISWMALFPDTTYGVDSGGDPR 311

Query: 314  VIPKLTFEEFKEFHRKYYHPSNSRIWFYGDDDPNERLRILSEYLDMFDASSAPNESKVEP 493
            VIPKLTFEEFKEFHRKYYHPSNSRIWFYGDDDPNERLRILSEYLD+FD+S A +ES+VEP
Sbjct: 312  VIPKLTFEEFKEFHRKYYHPSNSRIWFYGDDDPNERLRILSEYLDLFDSSLASHESRVEP 371

Query: 494  QKLFSKPVRIVETYPAGEGSDLKK-HMVCLNWLLSDKPLDMETEXXXXXXXXXXXXXPAS 670
            Q LFSKPVRIVETYPAGEG DLKK HMVCLNWLLSDKPLD+ETE             PAS
Sbjct: 372  QTLFSKPVRIVETYPAGEGGDLKKKHMVCLNWLLSDKPLDLETELTLGFLNHLLLGTPAS 431

Query: 671  PLRKILLESGLGDAIVGGGLEDELLQPQFSIGMKGVSEDDIHKVEELITSTLKKLAEEGF 850
            PLRKILLES LGDAIVGGG+EDELLQPQFSIGMKGVSEDDIHKVEEL+TSTLKKLAEEGF
Sbjct: 432  PLRKILLESRLGDAIVGGGVEDELLQPQFSIGMKGVSEDDIHKVEELVTSTLKKLAEEGF 491

Query: 851  DTDAIEASMNTIEFSLRENNTGSFPRGLSLMLQSIGKWVYDMNPFEPLKYEKPLQDLKSK 1030
            DTDAIEASMNTIEFSLRENNTGSFPRGLSLMLQSIGKW+YDMNPFEPLKYEKPLQDLKS+
Sbjct: 492  DTDAIEASMNTIEFSLRENNTGSFPRGLSLMLQSIGKWIYDMNPFEPLKYEKPLQDLKSR 551

Query: 1031 IAKEGSKSVFSPLIEKFILNNSHQVTVEMQPDPEKAARDEATEKQILQKVKASMTTEDLA 1210
            IAKEGSKSVFSPLIEKFILNN HQVTVEMQPDPEKAARDE  EKQILQKVKASMTTEDLA
Sbjct: 552  IAKEGSKSVFSPLIEKFILNNPHQVTVEMQPDPEKAARDEVAEKQILQKVKASMTTEDLA 611

Query: 1211 ELTRATYELRLKQETPDPPEALKTVPSLSLQDIPKEPIRVPTEVGDINGV 1360
            EL RAT+ELRLKQETPDPPEALKTVPSLSLQDIPKEPIRVPTEVGDINGV
Sbjct: 612  ELARATHELRLKQETPDPPEALKTVPSLSLQDIPKEPIRVPTEVGDINGV 661


>KYP45082.1 hypothetical protein KK1_033364 [Cajanus cajan]
          Length = 1091

 Score =  829 bits (2142), Expect = 0.0
 Identities = 412/469 (87%), Positives = 428/469 (91%), Gaps = 16/469 (3%)
 Frame = +2

Query: 2    NAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDLQTFQQEGWHFELNDPSEDI 181
            NAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVED Q FQQEGWHFELNDPSE+I
Sbjct: 190  NAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDFQIFQQEGWHFELNDPSEEI 249

Query: 182  TYKGVVFNEMKGVYSQPDNILGRTAQQA----------------LFPDTTYGVDSGGDPQ 313
            TYKGVVFNEMKGVYSQPDNILGR AQQA                LFPDTTYGVDSGGDP+
Sbjct: 250  TYKGVVFNEMKGVYSQPDNILGRAAQQASFLMACWPFLIFMYFALFPDTTYGVDSGGDPR 309

Query: 314  VIPKLTFEEFKEFHRKYYHPSNSRIWFYGDDDPNERLRILSEYLDMFDASSAPNESKVEP 493
            VIP LTFEEFKEFHRKYYHPSNSRIWFYGDDDPNERLRILSEYLD+FD+S AP+ES+VEP
Sbjct: 310  VIPNLTFEEFKEFHRKYYHPSNSRIWFYGDDDPNERLRILSEYLDLFDSSVAPDESRVEP 369

Query: 494  QKLFSKPVRIVETYPAGEGSDLKKHMVCLNWLLSDKPLDMETEXXXXXXXXXXXXXPASP 673
            Q LFSKPVRIVETY AGEG DLKKHMVCLNWLLSDKPLD+ETE             PASP
Sbjct: 370  QTLFSKPVRIVETYSAGEGGDLKKHMVCLNWLLSDKPLDLETELTLGFLNHLLLGSPASP 429

Query: 674  LRKILLESGLGDAIVGGGLEDELLQPQFSIGMKGVSEDDIHKVEELITSTLKKLAEEGFD 853
            LRKILLES LGDAIVGGG+EDELLQPQFSIGMKGVS DDIHKVEEL+TST KKLAEEGFD
Sbjct: 430  LRKILLESRLGDAIVGGGVEDELLQPQFSIGMKGVSADDIHKVEELVTSTFKKLAEEGFD 489

Query: 854  TDAIEASMNTIEFSLRENNTGSFPRGLSLMLQSIGKWVYDMNPFEPLKYEKPLQDLKSKI 1033
            TDAIEASMNTIEFSLRENNTGSFPRGLSLMLQSIGKW+YD+NPFEPLKYEKPLQDLKS+I
Sbjct: 490  TDAIEASMNTIEFSLRENNTGSFPRGLSLMLQSIGKWIYDLNPFEPLKYEKPLQDLKSRI 549

Query: 1034 AKEGSKSVFSPLIEKFILNNSHQVTVEMQPDPEKAARDEATEKQILQKVKASMTTEDLAE 1213
            AKEGSKSVFSPLIEKFILNN HQVTVEMQPDPEKAAR+EATEKQILQKVKA+MTTEDLAE
Sbjct: 550  AKEGSKSVFSPLIEKFILNNPHQVTVEMQPDPEKAAREEATEKQILQKVKANMTTEDLAE 609

Query: 1214 LTRATYELRLKQETPDPPEALKTVPSLSLQDIPKEPIRVPTEVGDINGV 1360
            LTRAT+ELRLKQETPDPPEALK+VPSLSLQDIPKEPIRVPTEVGDINGV
Sbjct: 610  LTRATHELRLKQETPDPPEALKSVPSLSLQDIPKEPIRVPTEVGDINGV 658


>XP_016175065.1 PREDICTED: presequence protease 1, chloroplastic/mitochondrial-like
            [Arachis ipaensis]
          Length = 1085

 Score =  827 bits (2135), Expect = 0.0
 Identities = 404/454 (88%), Positives = 426/454 (93%), Gaps = 1/454 (0%)
 Frame = +2

Query: 2    NAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDLQTFQQEGWHFELNDPSEDI 181
            NAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVED QTFQQEGWHFELNDPSEDI
Sbjct: 199  NAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDFQTFQQEGWHFELNDPSEDI 258

Query: 182  TYKGVVFNEMKGVYSQPDNILGRTAQQALFPDTTYGVDSGGDPQVIPKLTFEEFKEFHRK 361
            TYKGVVFNEMKGVYSQPDNILGRT+QQAL+PDTTYGVDSGGDPQVIPKLTFEEFKEFHRK
Sbjct: 259  TYKGVVFNEMKGVYSQPDNILGRTSQQALYPDTTYGVDSGGDPQVIPKLTFEEFKEFHRK 318

Query: 362  YYHPSNSRIWFYGDDDPNERLRILSEYLDMFDASSAPNESKVEPQKLFSKPVRIVETYPA 541
            YYHPSNSRIWFYGDDDPNERLRIL EYLDMFDASSAPNESK+EPQKLFSKPVRI+E YPA
Sbjct: 319  YYHPSNSRIWFYGDDDPNERLRILGEYLDMFDASSAPNESKIEPQKLFSKPVRIIEKYPA 378

Query: 542  GEGSDLKK-HMVCLNWLLSDKPLDMETEXXXXXXXXXXXXXPASPLRKILLESGLGDAIV 718
             EG+DLKK HMV LNWLLSDKPLD+ETE             PASPLRKILLESGLGDAIV
Sbjct: 379  SEGADLKKQHMVTLNWLLSDKPLDLETELALGFLDHLLLGTPASPLRKILLESGLGDAIV 438

Query: 719  GGGLEDELLQPQFSIGMKGVSEDDIHKVEELITSTLKKLAEEGFDTDAIEASMNTIEFSL 898
            GGG+EDELLQPQFSIG+KGVSE DIHKVEEL+ +TLKKLA EGFDTDA+EASMNTIEFSL
Sbjct: 439  GGGVEDELLQPQFSIGLKGVSEQDIHKVEELVMTTLKKLANEGFDTDAVEASMNTIEFSL 498

Query: 899  RENNTGSFPRGLSLMLQSIGKWVYDMNPFEPLKYEKPLQDLKSKIAKEGSKSVFSPLIEK 1078
            RENNTGSFPRGLSLML+SIGKW+YDMNPFEPLKYEKPLQDLKS++AKEGSK+VFSPLIEK
Sbjct: 499  RENNTGSFPRGLSLMLRSIGKWIYDMNPFEPLKYEKPLQDLKSRLAKEGSKAVFSPLIEK 558

Query: 1079 FILNNSHQVTVEMQPDPEKAARDEATEKQILQKVKASMTTEDLAELTRATYELRLKQETP 1258
            FILNN H+VTVEMQPDPEKAARDEATEK+ILQKVKA MT EDL EL++AT++LRLKQETP
Sbjct: 559  FILNNPHRVTVEMQPDPEKAARDEATEKEILQKVKAGMTKEDLEELSQATHDLRLKQETP 618

Query: 1259 DPPEALKTVPSLSLQDIPKEPIRVPTEVGDINGV 1360
            DPPEALKTVPSLSLQDIPKEPI VP EVGDINGV
Sbjct: 619  DPPEALKTVPSLSLQDIPKEPIYVPIEVGDINGV 652


>XP_007157239.1 hypothetical protein PHAVU_002G054400g [Phaseolus vulgaris]
            ESW29233.1 hypothetical protein PHAVU_002G054400g
            [Phaseolus vulgaris]
          Length = 1078

 Score =  825 bits (2131), Expect = 0.0
 Identities = 406/454 (89%), Positives = 425/454 (93%), Gaps = 1/454 (0%)
 Frame = +2

Query: 2    NAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDLQTFQQEGWHFELNDPSEDI 181
            NAFTYPDRTCYPVASTN+KDFYNLVDVYLDAVFFP+CVED Q FQQEGWHFELNDPSEDI
Sbjct: 192  NAFTYPDRTCYPVASTNSKDFYNLVDVYLDAVFFPKCVEDFQIFQQEGWHFELNDPSEDI 251

Query: 182  TYKGVVFNEMKGVYSQPDNILGRTAQQALFPDTTYGVDSGGDPQVIPKLTFEEFKEFHRK 361
            TYKGVVFNEMKGVYSQPDNILGR +QQALFPDTTYGVDSGGDP+VIPKLTFEEFKEFHRK
Sbjct: 252  TYKGVVFNEMKGVYSQPDNILGRASQQALFPDTTYGVDSGGDPRVIPKLTFEEFKEFHRK 311

Query: 362  YYHPSNSRIWFYGDDDPNERLRILSEYLDMFDASSAPNESKVEPQKLFSKPVRIVETYPA 541
            YYHPSNSRIWFYG+DDP ERLRILSEYLD+FD+S A  ES++EPQ LFSKPVRIVETYPA
Sbjct: 312  YYHPSNSRIWFYGNDDPKERLRILSEYLDLFDSSLASEESRIEPQTLFSKPVRIVETYPA 371

Query: 542  GEGSDLKK-HMVCLNWLLSDKPLDMETEXXXXXXXXXXXXXPASPLRKILLESGLGDAIV 718
            GEG DLKK HMVCLNWLLSDKPLD+ETE             PASPLRKILLESGLGDAIV
Sbjct: 372  GEGGDLKKKHMVCLNWLLSDKPLDLETELAIGFLNHLLLGTPASPLRKILLESGLGDAIV 431

Query: 719  GGGLEDELLQPQFSIGMKGVSEDDIHKVEELITSTLKKLAEEGFDTDAIEASMNTIEFSL 898
            GGG+EDELLQPQFSIG+KGVSEDDIHKVEEL+TSTLKKLAEEGFDTDAIEASMNTIEFSL
Sbjct: 432  GGGVEDELLQPQFSIGLKGVSEDDIHKVEELVTSTLKKLAEEGFDTDAIEASMNTIEFSL 491

Query: 899  RENNTGSFPRGLSLMLQSIGKWVYDMNPFEPLKYEKPLQDLKSKIAKEGSKSVFSPLIEK 1078
            RENNTGSFPRGLSLMLQSIGKW+YDMNPFEPLKYEKPLQ LKS+IA+EG KSVFSPLIEK
Sbjct: 492  RENNTGSFPRGLSLMLQSIGKWIYDMNPFEPLKYEKPLQGLKSRIAEEGPKSVFSPLIEK 551

Query: 1079 FILNNSHQVTVEMQPDPEKAARDEATEKQILQKVKASMTTEDLAELTRATYELRLKQETP 1258
            FILNN H+VTVEMQPDPEKAAR+EATEK ILQKVK SMTTEDLAELTRAT+ELRLKQETP
Sbjct: 552  FILNNPHKVTVEMQPDPEKAAREEATEKHILQKVKTSMTTEDLAELTRATHELRLKQETP 611

Query: 1259 DPPEALKTVPSLSLQDIPKEPIRVPTEVGDINGV 1360
            D PEALKTVPSLSLQDIPKEPIRVPTEVGDINGV
Sbjct: 612  DSPEALKTVPSLSLQDIPKEPIRVPTEVGDINGV 645


>XP_017406762.1 PREDICTED: presequence protease 1, chloroplastic/mitochondrial [Vigna
            angularis] KOM26648.1 hypothetical protein
            LR48_Vigan303s007000 [Vigna angularis]
          Length = 1081

 Score =  824 bits (2128), Expect = 0.0
 Identities = 405/454 (89%), Positives = 425/454 (93%), Gaps = 1/454 (0%)
 Frame = +2

Query: 2    NAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDLQTFQQEGWHFELNDPSEDI 181
            NAFTYPDRTCYPVASTN+KDFYNLVDVYLDAVFFPRCVED Q FQQEGWHFELNDPSEDI
Sbjct: 195  NAFTYPDRTCYPVASTNSKDFYNLVDVYLDAVFFPRCVEDFQIFQQEGWHFELNDPSEDI 254

Query: 182  TYKGVVFNEMKGVYSQPDNILGRTAQQALFPDTTYGVDSGGDPQVIPKLTFEEFKEFHRK 361
            TYKGVVFNEMKGVYSQPDNILGR +QQALFPDTTYGVDSGGDP++IP LTFEEFKEFHRK
Sbjct: 255  TYKGVVFNEMKGVYSQPDNILGRASQQALFPDTTYGVDSGGDPRIIPNLTFEEFKEFHRK 314

Query: 362  YYHPSNSRIWFYGDDDPNERLRILSEYLDMFDASSAPNESKVEPQKLFSKPVRIVETYPA 541
            YYHPSNSRIWFYG+DDPNERLRIL EYLD+FD+S A  ES+VEPQ LFSKPVRIVETYPA
Sbjct: 315  YYHPSNSRIWFYGNDDPNERLRILKEYLDLFDSSLASEESRVEPQTLFSKPVRIVETYPA 374

Query: 542  GEGSDLKK-HMVCLNWLLSDKPLDMETEXXXXXXXXXXXXXPASPLRKILLESGLGDAIV 718
            GE  DLKK HMVCLNWLLSDKPLD+ETE             PASPLRKILLESGLGDAIV
Sbjct: 375  GEEGDLKKKHMVCLNWLLSDKPLDLETELTIGFLNHLLLGTPASPLRKILLESGLGDAIV 434

Query: 719  GGGLEDELLQPQFSIGMKGVSEDDIHKVEELITSTLKKLAEEGFDTDAIEASMNTIEFSL 898
            GGG+EDELLQPQFSIGMKGVSEDDIHKVEEL+TSTLKKLAEEGFDTDAIEASMNTIEFSL
Sbjct: 435  GGGVEDELLQPQFSIGMKGVSEDDIHKVEELVTSTLKKLAEEGFDTDAIEASMNTIEFSL 494

Query: 899  RENNTGSFPRGLSLMLQSIGKWVYDMNPFEPLKYEKPLQDLKSKIAKEGSKSVFSPLIEK 1078
            RENNTGSFPRGLSLMLQS+GKW+YDMNPFEPLKYEKPL+DLKS+I+KEGSKSVFSPLIEK
Sbjct: 495  RENNTGSFPRGLSLMLQSMGKWIYDMNPFEPLKYEKPLEDLKSRISKEGSKSVFSPLIEK 554

Query: 1079 FILNNSHQVTVEMQPDPEKAARDEATEKQILQKVKASMTTEDLAELTRATYELRLKQETP 1258
            FILNN H+VTVEMQPDPEKAAR+EATEKQILQKVK SMT EDLAELTRAT+EL+LKQETP
Sbjct: 555  FILNNPHKVTVEMQPDPEKAAREEATEKQILQKVKTSMTAEDLAELTRATHELQLKQETP 614

Query: 1259 DPPEALKTVPSLSLQDIPKEPIRVPTEVGDINGV 1360
            DPPEALKTVPSLSLQDIPKEPIRVPTEV DINGV
Sbjct: 615  DPPEALKTVPSLSLQDIPKEPIRVPTEVCDINGV 648


>XP_014520661.1 PREDICTED: presequence protease 1, chloroplastic/mitochondrial-like
            [Vigna radiata var. radiata]
          Length = 1079

 Score =  823 bits (2127), Expect = 0.0
 Identities = 406/454 (89%), Positives = 424/454 (93%), Gaps = 1/454 (0%)
 Frame = +2

Query: 2    NAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDLQTFQQEGWHFELNDPSEDI 181
            NAFTYPDRTCYPVASTN+KDFYNLVDVYLDAVFFPRCVED Q FQQEGWHFELNDPSEDI
Sbjct: 193  NAFTYPDRTCYPVASTNSKDFYNLVDVYLDAVFFPRCVEDFQIFQQEGWHFELNDPSEDI 252

Query: 182  TYKGVVFNEMKGVYSQPDNILGRTAQQALFPDTTYGVDSGGDPQVIPKLTFEEFKEFHRK 361
            TYKGVVFNEMKGVYSQPDNILGR +QQALFPD TYGVDSGGDP+VIPKLTFEEFKEFHRK
Sbjct: 253  TYKGVVFNEMKGVYSQPDNILGRASQQALFPDNTYGVDSGGDPRVIPKLTFEEFKEFHRK 312

Query: 362  YYHPSNSRIWFYGDDDPNERLRILSEYLDMFDASSAPNESKVEPQKLFSKPVRIVETYPA 541
            YYHPSNSRIWFYG+DDPNERLRIL EYLD+FD+S A  ES+VEPQ LFSKPVRIVETYPA
Sbjct: 313  YYHPSNSRIWFYGNDDPNERLRILKEYLDLFDSSLASEESRVEPQALFSKPVRIVETYPA 372

Query: 542  GEGSDLKK-HMVCLNWLLSDKPLDMETEXXXXXXXXXXXXXPASPLRKILLESGLGDAIV 718
            GE  DLKK HMVCLNWLLSDKPLD+ETE             PASPLRKILLES LGDAIV
Sbjct: 373  GEEGDLKKKHMVCLNWLLSDKPLDLETELTIGFLNHLLLGTPASPLRKILLESELGDAIV 432

Query: 719  GGGLEDELLQPQFSIGMKGVSEDDIHKVEELITSTLKKLAEEGFDTDAIEASMNTIEFSL 898
            GGG+EDELLQPQFSIGMKGVSEDDIHKVEEL+TSTLKKLAEEGFDTDAIEASMNTIEFSL
Sbjct: 433  GGGVEDELLQPQFSIGMKGVSEDDIHKVEELVTSTLKKLAEEGFDTDAIEASMNTIEFSL 492

Query: 899  RENNTGSFPRGLSLMLQSIGKWVYDMNPFEPLKYEKPLQDLKSKIAKEGSKSVFSPLIEK 1078
            RENNTGSFPRGLSLMLQS+GKW+YDMNPFEPLKYEKPL+ LKS+I+KEGSKSVFSPLIEK
Sbjct: 493  RENNTGSFPRGLSLMLQSMGKWIYDMNPFEPLKYEKPLEGLKSRISKEGSKSVFSPLIEK 552

Query: 1079 FILNNSHQVTVEMQPDPEKAARDEATEKQILQKVKASMTTEDLAELTRATYELRLKQETP 1258
            FILNN H+VTVEMQPDPEKAAR+EATEKQILQKVK SMT EDLAELTRAT+ELRLKQETP
Sbjct: 553  FILNNPHKVTVEMQPDPEKAAREEATEKQILQKVKTSMTAEDLAELTRATHELRLKQETP 612

Query: 1259 DPPEALKTVPSLSLQDIPKEPIRVPTEVGDINGV 1360
            DPPEALKTVPSLSLQDIPKEPIRVPTEVGDINGV
Sbjct: 613  DPPEALKTVPSLSLQDIPKEPIRVPTEVGDINGV 646


>XP_015940159.1 PREDICTED: presequence protease 1, chloroplastic/mitochondrial-like
            [Arachis duranensis]
          Length = 1086

 Score =  822 bits (2124), Expect = 0.0
 Identities = 402/454 (88%), Positives = 424/454 (93%), Gaps = 1/454 (0%)
 Frame = +2

Query: 2    NAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDLQTFQQEGWHFELNDPSEDI 181
            NAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVED QTFQQEGWHFELNDPSEDI
Sbjct: 200  NAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDFQTFQQEGWHFELNDPSEDI 259

Query: 182  TYKGVVFNEMKGVYSQPDNILGRTAQQALFPDTTYGVDSGGDPQVIPKLTFEEFKEFHRK 361
            TYKGVVFNEMKGVYSQPDNILGR +QQAL+PDTTYGVDSGGDPQVIPKLTFEEFKEFHRK
Sbjct: 260  TYKGVVFNEMKGVYSQPDNILGRISQQALYPDTTYGVDSGGDPQVIPKLTFEEFKEFHRK 319

Query: 362  YYHPSNSRIWFYGDDDPNERLRILSEYLDMFDASSAPNESKVEPQKLFSKPVRIVETYPA 541
            YYHPSNSRIWFYGDDDPNERLRIL EYLDMFDASSAPNESK+EPQKLFSKPVRI+E YPA
Sbjct: 320  YYHPSNSRIWFYGDDDPNERLRILGEYLDMFDASSAPNESKIEPQKLFSKPVRIIEKYPA 379

Query: 542  GEGSDLKK-HMVCLNWLLSDKPLDMETEXXXXXXXXXXXXXPASPLRKILLESGLGDAIV 718
             EG+DLKK HMV LNWLLSDKPLD+ETE             PASPLRKILLESGLGDAIV
Sbjct: 380  SEGADLKKQHMVTLNWLLSDKPLDLETELALGFLDHLLLGTPASPLRKILLESGLGDAIV 439

Query: 719  GGGLEDELLQPQFSIGMKGVSEDDIHKVEELITSTLKKLAEEGFDTDAIEASMNTIEFSL 898
            GGG+EDELLQPQFSIG+KGVSE DIHKVEEL+ +TLKKLA EGFDTDA+EASMNTIEFSL
Sbjct: 440  GGGVEDELLQPQFSIGLKGVSEQDIHKVEELVMTTLKKLANEGFDTDAVEASMNTIEFSL 499

Query: 899  RENNTGSFPRGLSLMLQSIGKWVYDMNPFEPLKYEKPLQDLKSKIAKEGSKSVFSPLIEK 1078
            RENNTGSFPRGLSLML+SIGKW+YDMNPFEPLKYEKPLQDLKS++AKEGSK+VFSPLIEK
Sbjct: 500  RENNTGSFPRGLSLMLRSIGKWIYDMNPFEPLKYEKPLQDLKSRLAKEGSKAVFSPLIEK 559

Query: 1079 FILNNSHQVTVEMQPDPEKAARDEATEKQILQKVKASMTTEDLAELTRATYELRLKQETP 1258
            FILNN H+VTVEMQPDPEKAA DEATEK+ILQKVKA MT EDL EL++AT++LRLKQETP
Sbjct: 560  FILNNPHRVTVEMQPDPEKAAHDEATEKEILQKVKAGMTKEDLEELSQATHDLRLKQETP 619

Query: 1259 DPPEALKTVPSLSLQDIPKEPIRVPTEVGDINGV 1360
            DPPEALKTVPSLSLQDIPKEPI VP EVGDINGV
Sbjct: 620  DPPEALKTVPSLSLQDIPKEPIYVPIEVGDINGV 653


>BAU00865.1 hypothetical protein VIGAN_10250100 [Vigna angularis var. angularis]
          Length = 1082

 Score =  819 bits (2116), Expect = 0.0
 Identities = 405/455 (89%), Positives = 425/455 (93%), Gaps = 2/455 (0%)
 Frame = +2

Query: 2    NAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDLQTFQQEGWHFELNDPSEDI 181
            NAFTYPDRTCYPVASTN+KDFYNLVDVYLDAVFFPRCVED Q FQQEGWHFELNDPSEDI
Sbjct: 195  NAFTYPDRTCYPVASTNSKDFYNLVDVYLDAVFFPRCVEDFQIFQQEGWHFELNDPSEDI 254

Query: 182  TYKGVVFNEMKGVYSQPDNILGRTAQQ-ALFPDTTYGVDSGGDPQVIPKLTFEEFKEFHR 358
            TYKGVVFNEMKGVYSQPDNILGR +QQ ALFPDTTYGVDSGGDP++IP LTFEEFKEFHR
Sbjct: 255  TYKGVVFNEMKGVYSQPDNILGRASQQQALFPDTTYGVDSGGDPRIIPNLTFEEFKEFHR 314

Query: 359  KYYHPSNSRIWFYGDDDPNERLRILSEYLDMFDASSAPNESKVEPQKLFSKPVRIVETYP 538
            KYYHPSNSRIWFYG+DDPNERLRIL EYLD+FD+S A  ES+VEPQ LFSKPVRIVETYP
Sbjct: 315  KYYHPSNSRIWFYGNDDPNERLRILKEYLDLFDSSLASEESRVEPQTLFSKPVRIVETYP 374

Query: 539  AGEGSDLKK-HMVCLNWLLSDKPLDMETEXXXXXXXXXXXXXPASPLRKILLESGLGDAI 715
            AGE  DLKK HMVCLNWLLSDKPLD+ETE             PASPLRKILLESGLGDAI
Sbjct: 375  AGEEGDLKKKHMVCLNWLLSDKPLDLETELTIGFLNHLLLGTPASPLRKILLESGLGDAI 434

Query: 716  VGGGLEDELLQPQFSIGMKGVSEDDIHKVEELITSTLKKLAEEGFDTDAIEASMNTIEFS 895
            VGGG+EDELLQPQFSIGMKGVSEDDIHKVEEL+TSTLKKLAEEGFDTDAIEASMNTIEFS
Sbjct: 435  VGGGVEDELLQPQFSIGMKGVSEDDIHKVEELVTSTLKKLAEEGFDTDAIEASMNTIEFS 494

Query: 896  LRENNTGSFPRGLSLMLQSIGKWVYDMNPFEPLKYEKPLQDLKSKIAKEGSKSVFSPLIE 1075
            LRENNTGSFPRGLSLMLQS+GKW+YDMNPFEPLKYEKPL+DLKS+I+KEGSKSVFSPLIE
Sbjct: 495  LRENNTGSFPRGLSLMLQSMGKWIYDMNPFEPLKYEKPLEDLKSRISKEGSKSVFSPLIE 554

Query: 1076 KFILNNSHQVTVEMQPDPEKAARDEATEKQILQKVKASMTTEDLAELTRATYELRLKQET 1255
            KFILNN H+VTVEMQPDPEKAAR+EATEKQILQKVK SMT EDLAELTRAT+EL+LKQET
Sbjct: 555  KFILNNPHKVTVEMQPDPEKAAREEATEKQILQKVKTSMTAEDLAELTRATHELQLKQET 614

Query: 1256 PDPPEALKTVPSLSLQDIPKEPIRVPTEVGDINGV 1360
            PDPPEALKTVPSLSLQDIPKEPIRVPTEV DINGV
Sbjct: 615  PDPPEALKTVPSLSLQDIPKEPIRVPTEVCDINGV 649


>EOX98218.1 Presequence protease 2 isoform 4 [Theobroma cacao]
          Length = 849

 Score =  799 bits (2064), Expect = 0.0
 Identities = 389/454 (85%), Positives = 419/454 (92%), Gaps = 1/454 (0%)
 Frame = +2

Query: 2    NAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDLQTFQQEGWHFELNDPSEDI 181
            NAFTYPDRTCYPVASTN KDFYNLVDVYLDAVFFP+C+ED QTFQQEGWH+ELND SEDI
Sbjct: 199  NAFTYPDRTCYPVASTNAKDFYNLVDVYLDAVFFPKCIEDFQTFQQEGWHYELNDTSEDI 258

Query: 182  TYKGVVFNEMKGVYSQPDNILGRTAQQALFPDTTYGVDSGGDPQVIPKLTFEEFKEFHRK 361
            TYKGVVFNEMKGVYSQPDN+LGRTAQQALFPD TYGVDSGGDPQVIPKLT+EEFKEFHRK
Sbjct: 259  TYKGVVFNEMKGVYSQPDNLLGRTAQQALFPDNTYGVDSGGDPQVIPKLTYEEFKEFHRK 318

Query: 362  YYHPSNSRIWFYGDDDPNERLRILSEYLDMFDASSAPNESKVEPQKLFSKPVRIVETYPA 541
            YYHPSN+RIWFYGDDDP ERLRILSEYLDMFDAS+AP+ESKVEPQKLFS+PVR VE YP 
Sbjct: 319  YYHPSNARIWFYGDDDPIERLRILSEYLDMFDASTAPDESKVEPQKLFSEPVRFVEKYPV 378

Query: 542  GEGSDLKK-HMVCLNWLLSDKPLDMETEXXXXXXXXXXXXXPASPLRKILLESGLGDAIV 718
            GEG DLKK HMVCLNWLLSDKPLD++TE             PASPLRK+LLESGLGDAI+
Sbjct: 379  GEGGDLKKKHMVCLNWLLSDKPLDLQTELTLGFLDHLMLGTPASPLRKVLLESGLGDAII 438

Query: 719  GGGLEDELLQPQFSIGMKGVSEDDIHKVEELITSTLKKLAEEGFDTDAIEASMNTIEFSL 898
            GGG+EDELLQPQFSIG+KGVSEDDI KVEELI S+LKKLAEEGFDTDA+EASMNTIEFSL
Sbjct: 439  GGGVEDELLQPQFSIGLKGVSEDDIPKVEELIMSSLKKLAEEGFDTDAVEASMNTIEFSL 498

Query: 899  RENNTGSFPRGLSLMLQSIGKWVYDMNPFEPLKYEKPLQDLKSKIAKEGSKSVFSPLIEK 1078
            RENNTGSFPRGLSLML+SIGKW+YDM+PFEPLKYEKPL  LK++IA+EGSK+VFSPLIEK
Sbjct: 499  RENNTGSFPRGLSLMLRSIGKWIYDMDPFEPLKYEKPLMILKARIAEEGSKAVFSPLIEK 558

Query: 1079 FILNNSHQVTVEMQPDPEKAARDEATEKQILQKVKASMTTEDLAELTRATYELRLKQETP 1258
            FILNN H VT+EMQPDPEKA+RDEA EK+IL KVKASMT EDLAEL RAT EL+LKQETP
Sbjct: 559  FILNNPHCVTIEMQPDPEKASRDEAAEKEILNKVKASMTEEDLAELARATQELKLKQETP 618

Query: 1259 DPPEALKTVPSLSLQDIPKEPIRVPTEVGDINGV 1360
            DPPEAL++VPSLSL DIPKEPIRVPTEVGDINGV
Sbjct: 619  DPPEALRSVPSLSLHDIPKEPIRVPTEVGDINGV 652


>KJB77681.1 hypothetical protein B456_012G150300 [Gossypium raimondii]
          Length = 906

 Score =  800 bits (2067), Expect = 0.0
 Identities = 387/454 (85%), Positives = 423/454 (93%), Gaps = 1/454 (0%)
 Frame = +2

Query: 2    NAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDLQTFQQEGWHFELNDPSEDI 181
            NAFTYPDRTCYPVASTN+KDFYNLVDVYLDAVFFP+C+ED QTFQQEGWH+ELNDPSEDI
Sbjct: 203  NAFTYPDRTCYPVASTNSKDFYNLVDVYLDAVFFPKCIEDFQTFQQEGWHYELNDPSEDI 262

Query: 182  TYKGVVFNEMKGVYSQPDNILGRTAQQALFPDTTYGVDSGGDPQVIPKLTFEEFKEFHRK 361
            TYKGVVFNEMKGVYSQPDN+LGRTAQQALFPD TYGVDSGGDP VIPKLTFEEFKEFHRK
Sbjct: 263  TYKGVVFNEMKGVYSQPDNLLGRTAQQALFPDNTYGVDSGGDPLVIPKLTFEEFKEFHRK 322

Query: 362  YYHPSNSRIWFYGDDDPNERLRILSEYLDMFDASSAPNESKVEPQKLFSKPVRIVETYPA 541
            YYHPSN+RIWFYGDDDP+ERLRILSEYLDMFDAS+APNESKVEPQKLFS+PVRIVE YPA
Sbjct: 323  YYHPSNARIWFYGDDDPSERLRILSEYLDMFDASTAPNESKVEPQKLFSEPVRIVEKYPA 382

Query: 542  GEGSDLKK-HMVCLNWLLSDKPLDMETEXXXXXXXXXXXXXPASPLRKILLESGLGDAIV 718
            G+G DLKK HMVCLNWLLSDKPLD++TE             PASPLRK+LLESGLGDAI+
Sbjct: 383  GDGGDLKKKHMVCLNWLLSDKPLDLQTELTLGFLDHLLLGTPASPLRKVLLESGLGDAII 442

Query: 719  GGGLEDELLQPQFSIGMKGVSEDDIHKVEELITSTLKKLAEEGFDTDAIEASMNTIEFSL 898
            GGG+EDELLQPQFSIG+KGVS+DDI KVEELI S+L+KLAEEGFDT+A+EASMNTIEFSL
Sbjct: 443  GGGVEDELLQPQFSIGLKGVSDDDIPKVEELIMSSLRKLAEEGFDTEAVEASMNTIEFSL 502

Query: 899  RENNTGSFPRGLSLMLQSIGKWVYDMNPFEPLKYEKPLQDLKSKIAKEGSKSVFSPLIEK 1078
            RENNTGSFPRGLSLML+S+GKW+YDM+PFEPLKYE+PL DLK++IA+EGSK+VFSPLIEK
Sbjct: 503  RENNTGSFPRGLSLMLRSMGKWIYDMDPFEPLKYEQPLLDLKARIAEEGSKAVFSPLIEK 562

Query: 1079 FILNNSHQVTVEMQPDPEKAARDEATEKQILQKVKASMTTEDLAELTRATYELRLKQETP 1258
            FILNN H VT+EMQPDPEKA+RDEA EK+ L+KVKASMT EDLAEL RAT EL+LKQETP
Sbjct: 563  FILNNPHCVTIEMQPDPEKASRDEAAEKENLEKVKASMTEEDLAELARATEELKLKQETP 622

Query: 1259 DPPEALKTVPSLSLQDIPKEPIRVPTEVGDINGV 1360
            DPPEALK VPSLSL DIPKEPIR+PTEVGDINGV
Sbjct: 623  DPPEALKCVPSLSLHDIPKEPIRIPTEVGDINGV 656


>EOX98219.1 Presequence protease 2 isoform 5 [Theobroma cacao]
          Length = 971

 Score =  799 bits (2064), Expect = 0.0
 Identities = 389/454 (85%), Positives = 419/454 (92%), Gaps = 1/454 (0%)
 Frame = +2

Query: 2    NAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDLQTFQQEGWHFELNDPSEDI 181
            NAFTYPDRTCYPVASTN KDFYNLVDVYLDAVFFP+C+ED QTFQQEGWH+ELND SEDI
Sbjct: 199  NAFTYPDRTCYPVASTNAKDFYNLVDVYLDAVFFPKCIEDFQTFQQEGWHYELNDTSEDI 258

Query: 182  TYKGVVFNEMKGVYSQPDNILGRTAQQALFPDTTYGVDSGGDPQVIPKLTFEEFKEFHRK 361
            TYKGVVFNEMKGVYSQPDN+LGRTAQQALFPD TYGVDSGGDPQVIPKLT+EEFKEFHRK
Sbjct: 259  TYKGVVFNEMKGVYSQPDNLLGRTAQQALFPDNTYGVDSGGDPQVIPKLTYEEFKEFHRK 318

Query: 362  YYHPSNSRIWFYGDDDPNERLRILSEYLDMFDASSAPNESKVEPQKLFSKPVRIVETYPA 541
            YYHPSN+RIWFYGDDDP ERLRILSEYLDMFDAS+AP+ESKVEPQKLFS+PVR VE YP 
Sbjct: 319  YYHPSNARIWFYGDDDPIERLRILSEYLDMFDASTAPDESKVEPQKLFSEPVRFVEKYPV 378

Query: 542  GEGSDLKK-HMVCLNWLLSDKPLDMETEXXXXXXXXXXXXXPASPLRKILLESGLGDAIV 718
            GEG DLKK HMVCLNWLLSDKPLD++TE             PASPLRK+LLESGLGDAI+
Sbjct: 379  GEGGDLKKKHMVCLNWLLSDKPLDLQTELTLGFLDHLMLGTPASPLRKVLLESGLGDAII 438

Query: 719  GGGLEDELLQPQFSIGMKGVSEDDIHKVEELITSTLKKLAEEGFDTDAIEASMNTIEFSL 898
            GGG+EDELLQPQFSIG+KGVSEDDI KVEELI S+LKKLAEEGFDTDA+EASMNTIEFSL
Sbjct: 439  GGGVEDELLQPQFSIGLKGVSEDDIPKVEELIMSSLKKLAEEGFDTDAVEASMNTIEFSL 498

Query: 899  RENNTGSFPRGLSLMLQSIGKWVYDMNPFEPLKYEKPLQDLKSKIAKEGSKSVFSPLIEK 1078
            RENNTGSFPRGLSLML+SIGKW+YDM+PFEPLKYEKPL  LK++IA+EGSK+VFSPLIEK
Sbjct: 499  RENNTGSFPRGLSLMLRSIGKWIYDMDPFEPLKYEKPLMILKARIAEEGSKAVFSPLIEK 558

Query: 1079 FILNNSHQVTVEMQPDPEKAARDEATEKQILQKVKASMTTEDLAELTRATYELRLKQETP 1258
            FILNN H VT+EMQPDPEKA+RDEA EK+IL KVKASMT EDLAEL RAT EL+LKQETP
Sbjct: 559  FILNNPHCVTIEMQPDPEKASRDEAAEKEILNKVKASMTEEDLAELARATQELKLKQETP 618

Query: 1259 DPPEALKTVPSLSLQDIPKEPIRVPTEVGDINGV 1360
            DPPEAL++VPSLSL DIPKEPIRVPTEVGDINGV
Sbjct: 619  DPPEALRSVPSLSLHDIPKEPIRVPTEVGDINGV 652


>EOX98215.1 Presequence protease 2 isoform 1 [Theobroma cacao]
          Length = 1037

 Score =  799 bits (2064), Expect = 0.0
 Identities = 389/454 (85%), Positives = 419/454 (92%), Gaps = 1/454 (0%)
 Frame = +2

Query: 2    NAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDLQTFQQEGWHFELNDPSEDI 181
            NAFTYPDRTCYPVASTN KDFYNLVDVYLDAVFFP+C+ED QTFQQEGWH+ELND SEDI
Sbjct: 199  NAFTYPDRTCYPVASTNAKDFYNLVDVYLDAVFFPKCIEDFQTFQQEGWHYELNDTSEDI 258

Query: 182  TYKGVVFNEMKGVYSQPDNILGRTAQQALFPDTTYGVDSGGDPQVIPKLTFEEFKEFHRK 361
            TYKGVVFNEMKGVYSQPDN+LGRTAQQALFPD TYGVDSGGDPQVIPKLT+EEFKEFHRK
Sbjct: 259  TYKGVVFNEMKGVYSQPDNLLGRTAQQALFPDNTYGVDSGGDPQVIPKLTYEEFKEFHRK 318

Query: 362  YYHPSNSRIWFYGDDDPNERLRILSEYLDMFDASSAPNESKVEPQKLFSKPVRIVETYPA 541
            YYHPSN+RIWFYGDDDP ERLRILSEYLDMFDAS+AP+ESKVEPQKLFS+PVR VE YP 
Sbjct: 319  YYHPSNARIWFYGDDDPIERLRILSEYLDMFDASTAPDESKVEPQKLFSEPVRFVEKYPV 378

Query: 542  GEGSDLKK-HMVCLNWLLSDKPLDMETEXXXXXXXXXXXXXPASPLRKILLESGLGDAIV 718
            GEG DLKK HMVCLNWLLSDKPLD++TE             PASPLRK+LLESGLGDAI+
Sbjct: 379  GEGGDLKKKHMVCLNWLLSDKPLDLQTELTLGFLDHLMLGTPASPLRKVLLESGLGDAII 438

Query: 719  GGGLEDELLQPQFSIGMKGVSEDDIHKVEELITSTLKKLAEEGFDTDAIEASMNTIEFSL 898
            GGG+EDELLQPQFSIG+KGVSEDDI KVEELI S+LKKLAEEGFDTDA+EASMNTIEFSL
Sbjct: 439  GGGVEDELLQPQFSIGLKGVSEDDIPKVEELIMSSLKKLAEEGFDTDAVEASMNTIEFSL 498

Query: 899  RENNTGSFPRGLSLMLQSIGKWVYDMNPFEPLKYEKPLQDLKSKIAKEGSKSVFSPLIEK 1078
            RENNTGSFPRGLSLML+SIGKW+YDM+PFEPLKYEKPL  LK++IA+EGSK+VFSPLIEK
Sbjct: 499  RENNTGSFPRGLSLMLRSIGKWIYDMDPFEPLKYEKPLMILKARIAEEGSKAVFSPLIEK 558

Query: 1079 FILNNSHQVTVEMQPDPEKAARDEATEKQILQKVKASMTTEDLAELTRATYELRLKQETP 1258
            FILNN H VT+EMQPDPEKA+RDEA EK+IL KVKASMT EDLAEL RAT EL+LKQETP
Sbjct: 559  FILNNPHCVTIEMQPDPEKASRDEAAEKEILNKVKASMTEEDLAELARATQELKLKQETP 618

Query: 1259 DPPEALKTVPSLSLQDIPKEPIRVPTEVGDINGV 1360
            DPPEAL++VPSLSL DIPKEPIRVPTEVGDINGV
Sbjct: 619  DPPEALRSVPSLSLHDIPKEPIRVPTEVGDINGV 652


>EOX98216.1 Presequence protease 2 isoform 2 [Theobroma cacao]
          Length = 1040

 Score =  799 bits (2064), Expect = 0.0
 Identities = 389/454 (85%), Positives = 419/454 (92%), Gaps = 1/454 (0%)
 Frame = +2

Query: 2    NAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDLQTFQQEGWHFELNDPSEDI 181
            NAFTYPDRTCYPVASTN KDFYNLVDVYLDAVFFP+C+ED QTFQQEGWH+ELND SEDI
Sbjct: 199  NAFTYPDRTCYPVASTNAKDFYNLVDVYLDAVFFPKCIEDFQTFQQEGWHYELNDTSEDI 258

Query: 182  TYKGVVFNEMKGVYSQPDNILGRTAQQALFPDTTYGVDSGGDPQVIPKLTFEEFKEFHRK 361
            TYKGVVFNEMKGVYSQPDN+LGRTAQQALFPD TYGVDSGGDPQVIPKLT+EEFKEFHRK
Sbjct: 259  TYKGVVFNEMKGVYSQPDNLLGRTAQQALFPDNTYGVDSGGDPQVIPKLTYEEFKEFHRK 318

Query: 362  YYHPSNSRIWFYGDDDPNERLRILSEYLDMFDASSAPNESKVEPQKLFSKPVRIVETYPA 541
            YYHPSN+RIWFYGDDDP ERLRILSEYLDMFDAS+AP+ESKVEPQKLFS+PVR VE YP 
Sbjct: 319  YYHPSNARIWFYGDDDPIERLRILSEYLDMFDASTAPDESKVEPQKLFSEPVRFVEKYPV 378

Query: 542  GEGSDLKK-HMVCLNWLLSDKPLDMETEXXXXXXXXXXXXXPASPLRKILLESGLGDAIV 718
            GEG DLKK HMVCLNWLLSDKPLD++TE             PASPLRK+LLESGLGDAI+
Sbjct: 379  GEGGDLKKKHMVCLNWLLSDKPLDLQTELTLGFLDHLMLGTPASPLRKVLLESGLGDAII 438

Query: 719  GGGLEDELLQPQFSIGMKGVSEDDIHKVEELITSTLKKLAEEGFDTDAIEASMNTIEFSL 898
            GGG+EDELLQPQFSIG+KGVSEDDI KVEELI S+LKKLAEEGFDTDA+EASMNTIEFSL
Sbjct: 439  GGGVEDELLQPQFSIGLKGVSEDDIPKVEELIMSSLKKLAEEGFDTDAVEASMNTIEFSL 498

Query: 899  RENNTGSFPRGLSLMLQSIGKWVYDMNPFEPLKYEKPLQDLKSKIAKEGSKSVFSPLIEK 1078
            RENNTGSFPRGLSLML+SIGKW+YDM+PFEPLKYEKPL  LK++IA+EGSK+VFSPLIEK
Sbjct: 499  RENNTGSFPRGLSLMLRSIGKWIYDMDPFEPLKYEKPLMILKARIAEEGSKAVFSPLIEK 558

Query: 1079 FILNNSHQVTVEMQPDPEKAARDEATEKQILQKVKASMTTEDLAELTRATYELRLKQETP 1258
            FILNN H VT+EMQPDPEKA+RDEA EK+IL KVKASMT EDLAEL RAT EL+LKQETP
Sbjct: 559  FILNNPHCVTIEMQPDPEKASRDEAAEKEILNKVKASMTEEDLAELARATQELKLKQETP 618

Query: 1259 DPPEALKTVPSLSLQDIPKEPIRVPTEVGDINGV 1360
            DPPEAL++VPSLSL DIPKEPIRVPTEVGDINGV
Sbjct: 619  DPPEALRSVPSLSLHDIPKEPIRVPTEVGDINGV 652


>EOX98217.1 Presequence protease 2 isoform 3 [Theobroma cacao]
          Length = 1041

 Score =  799 bits (2064), Expect = 0.0
 Identities = 389/454 (85%), Positives = 419/454 (92%), Gaps = 1/454 (0%)
 Frame = +2

Query: 2    NAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDLQTFQQEGWHFELNDPSEDI 181
            NAFTYPDRTCYPVASTN KDFYNLVDVYLDAVFFP+C+ED QTFQQEGWH+ELND SEDI
Sbjct: 199  NAFTYPDRTCYPVASTNAKDFYNLVDVYLDAVFFPKCIEDFQTFQQEGWHYELNDTSEDI 258

Query: 182  TYKGVVFNEMKGVYSQPDNILGRTAQQALFPDTTYGVDSGGDPQVIPKLTFEEFKEFHRK 361
            TYKGVVFNEMKGVYSQPDN+LGRTAQQALFPD TYGVDSGGDPQVIPKLT+EEFKEFHRK
Sbjct: 259  TYKGVVFNEMKGVYSQPDNLLGRTAQQALFPDNTYGVDSGGDPQVIPKLTYEEFKEFHRK 318

Query: 362  YYHPSNSRIWFYGDDDPNERLRILSEYLDMFDASSAPNESKVEPQKLFSKPVRIVETYPA 541
            YYHPSN+RIWFYGDDDP ERLRILSEYLDMFDAS+AP+ESKVEPQKLFS+PVR VE YP 
Sbjct: 319  YYHPSNARIWFYGDDDPIERLRILSEYLDMFDASTAPDESKVEPQKLFSEPVRFVEKYPV 378

Query: 542  GEGSDLKK-HMVCLNWLLSDKPLDMETEXXXXXXXXXXXXXPASPLRKILLESGLGDAIV 718
            GEG DLKK HMVCLNWLLSDKPLD++TE             PASPLRK+LLESGLGDAI+
Sbjct: 379  GEGGDLKKKHMVCLNWLLSDKPLDLQTELTLGFLDHLMLGTPASPLRKVLLESGLGDAII 438

Query: 719  GGGLEDELLQPQFSIGMKGVSEDDIHKVEELITSTLKKLAEEGFDTDAIEASMNTIEFSL 898
            GGG+EDELLQPQFSIG+KGVSEDDI KVEELI S+LKKLAEEGFDTDA+EASMNTIEFSL
Sbjct: 439  GGGVEDELLQPQFSIGLKGVSEDDIPKVEELIMSSLKKLAEEGFDTDAVEASMNTIEFSL 498

Query: 899  RENNTGSFPRGLSLMLQSIGKWVYDMNPFEPLKYEKPLQDLKSKIAKEGSKSVFSPLIEK 1078
            RENNTGSFPRGLSLML+SIGKW+YDM+PFEPLKYEKPL  LK++IA+EGSK+VFSPLIEK
Sbjct: 499  RENNTGSFPRGLSLMLRSIGKWIYDMDPFEPLKYEKPLMILKARIAEEGSKAVFSPLIEK 558

Query: 1079 FILNNSHQVTVEMQPDPEKAARDEATEKQILQKVKASMTTEDLAELTRATYELRLKQETP 1258
            FILNN H VT+EMQPDPEKA+RDEA EK+IL KVKASMT EDLAEL RAT EL+LKQETP
Sbjct: 559  FILNNPHCVTIEMQPDPEKASRDEAAEKEILNKVKASMTEEDLAELARATQELKLKQETP 618

Query: 1259 DPPEALKTVPSLSLQDIPKEPIRVPTEVGDINGV 1360
            DPPEAL++VPSLSL DIPKEPIRVPTEVGDINGV
Sbjct: 619  DPPEALRSVPSLSLHDIPKEPIRVPTEVGDINGV 652


>XP_012459281.1 PREDICTED: presequence protease 2, chloroplastic/mitochondrial-like
            [Gossypium raimondii] KJB77679.1 hypothetical protein
            B456_012G150300 [Gossypium raimondii]
          Length = 1089

 Score =  800 bits (2067), Expect = 0.0
 Identities = 387/454 (85%), Positives = 423/454 (93%), Gaps = 1/454 (0%)
 Frame = +2

Query: 2    NAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDLQTFQQEGWHFELNDPSEDI 181
            NAFTYPDRTCYPVASTN+KDFYNLVDVYLDAVFFP+C+ED QTFQQEGWH+ELNDPSEDI
Sbjct: 203  NAFTYPDRTCYPVASTNSKDFYNLVDVYLDAVFFPKCIEDFQTFQQEGWHYELNDPSEDI 262

Query: 182  TYKGVVFNEMKGVYSQPDNILGRTAQQALFPDTTYGVDSGGDPQVIPKLTFEEFKEFHRK 361
            TYKGVVFNEMKGVYSQPDN+LGRTAQQALFPD TYGVDSGGDP VIPKLTFEEFKEFHRK
Sbjct: 263  TYKGVVFNEMKGVYSQPDNLLGRTAQQALFPDNTYGVDSGGDPLVIPKLTFEEFKEFHRK 322

Query: 362  YYHPSNSRIWFYGDDDPNERLRILSEYLDMFDASSAPNESKVEPQKLFSKPVRIVETYPA 541
            YYHPSN+RIWFYGDDDP+ERLRILSEYLDMFDAS+APNESKVEPQKLFS+PVRIVE YPA
Sbjct: 323  YYHPSNARIWFYGDDDPSERLRILSEYLDMFDASTAPNESKVEPQKLFSEPVRIVEKYPA 382

Query: 542  GEGSDLKK-HMVCLNWLLSDKPLDMETEXXXXXXXXXXXXXPASPLRKILLESGLGDAIV 718
            G+G DLKK HMVCLNWLLSDKPLD++TE             PASPLRK+LLESGLGDAI+
Sbjct: 383  GDGGDLKKKHMVCLNWLLSDKPLDLQTELTLGFLDHLLLGTPASPLRKVLLESGLGDAII 442

Query: 719  GGGLEDELLQPQFSIGMKGVSEDDIHKVEELITSTLKKLAEEGFDTDAIEASMNTIEFSL 898
            GGG+EDELLQPQFSIG+KGVS+DDI KVEELI S+L+KLAEEGFDT+A+EASMNTIEFSL
Sbjct: 443  GGGVEDELLQPQFSIGLKGVSDDDIPKVEELIMSSLRKLAEEGFDTEAVEASMNTIEFSL 502

Query: 899  RENNTGSFPRGLSLMLQSIGKWVYDMNPFEPLKYEKPLQDLKSKIAKEGSKSVFSPLIEK 1078
            RENNTGSFPRGLSLML+S+GKW+YDM+PFEPLKYE+PL DLK++IA+EGSK+VFSPLIEK
Sbjct: 503  RENNTGSFPRGLSLMLRSMGKWIYDMDPFEPLKYEQPLLDLKARIAEEGSKAVFSPLIEK 562

Query: 1079 FILNNSHQVTVEMQPDPEKAARDEATEKQILQKVKASMTTEDLAELTRATYELRLKQETP 1258
            FILNN H VT+EMQPDPEKA+RDEA EK+ L+KVKASMT EDLAEL RAT EL+LKQETP
Sbjct: 563  FILNNPHCVTIEMQPDPEKASRDEAAEKENLEKVKASMTEEDLAELARATEELKLKQETP 622

Query: 1259 DPPEALKTVPSLSLQDIPKEPIRVPTEVGDINGV 1360
            DPPEALK VPSLSL DIPKEPIR+PTEVGDINGV
Sbjct: 623  DPPEALKCVPSLSLHDIPKEPIRIPTEVGDINGV 656


>XP_017615507.1 PREDICTED: presequence protease 2, chloroplastic/mitochondrial-like
            [Gossypium arboreum]
          Length = 1089

 Score =  800 bits (2065), Expect = 0.0
 Identities = 386/454 (85%), Positives = 423/454 (93%), Gaps = 1/454 (0%)
 Frame = +2

Query: 2    NAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDLQTFQQEGWHFELNDPSEDI 181
            NAFTYPDRTCYPVASTN+KDFYNLVDVYLDAVFFP+C+ED QTFQQEGWH+ELNDPSEDI
Sbjct: 203  NAFTYPDRTCYPVASTNSKDFYNLVDVYLDAVFFPKCIEDFQTFQQEGWHYELNDPSEDI 262

Query: 182  TYKGVVFNEMKGVYSQPDNILGRTAQQALFPDTTYGVDSGGDPQVIPKLTFEEFKEFHRK 361
            TYKGVVFNEMKGVYSQPDN+LGRTAQQALFPD TYGVDSGGDP VIPKLTFEEFKEFHRK
Sbjct: 263  TYKGVVFNEMKGVYSQPDNLLGRTAQQALFPDNTYGVDSGGDPLVIPKLTFEEFKEFHRK 322

Query: 362  YYHPSNSRIWFYGDDDPNERLRILSEYLDMFDASSAPNESKVEPQKLFSKPVRIVETYPA 541
            YYHPSN+RIWFYGDDDP+ERLRILSEYLDMFDAS+APNESKVEPQKLFS+PVRIVE YPA
Sbjct: 323  YYHPSNARIWFYGDDDPSERLRILSEYLDMFDASTAPNESKVEPQKLFSEPVRIVEKYPA 382

Query: 542  GEGSDLKK-HMVCLNWLLSDKPLDMETEXXXXXXXXXXXXXPASPLRKILLESGLGDAIV 718
            G+G DLKK HMVCLNWLLSDKPLD++TE             PASPLRK+LLESGLGDAI+
Sbjct: 383  GDGGDLKKKHMVCLNWLLSDKPLDLQTELTLGFLDHLMLGTPASPLRKVLLESGLGDAII 442

Query: 719  GGGLEDELLQPQFSIGMKGVSEDDIHKVEELITSTLKKLAEEGFDTDAIEASMNTIEFSL 898
            GGG+EDELLQPQFSIG+KGVS++DI KVEELI S+L+KLAEEGFDT+A+EASMNTIEFSL
Sbjct: 443  GGGVEDELLQPQFSIGLKGVSDEDIPKVEELIMSSLRKLAEEGFDTEAVEASMNTIEFSL 502

Query: 899  RENNTGSFPRGLSLMLQSIGKWVYDMNPFEPLKYEKPLQDLKSKIAKEGSKSVFSPLIEK 1078
            RENNTGSFPRGLSLML+S+GKW+YDM+PFEPLKYE+PL DLK++IA+EGSK+VFSPLIEK
Sbjct: 503  RENNTGSFPRGLSLMLRSMGKWIYDMDPFEPLKYEQPLSDLKARIAEEGSKAVFSPLIEK 562

Query: 1079 FILNNSHQVTVEMQPDPEKAARDEATEKQILQKVKASMTTEDLAELTRATYELRLKQETP 1258
            FILNN H VT+EMQPDPEKA+RDEA EK+ L+KVKASMT EDLAEL RAT EL+LKQETP
Sbjct: 563  FILNNPHCVTIEMQPDPEKASRDEAAEKENLEKVKASMTEEDLAELARATEELKLKQETP 622

Query: 1259 DPPEALKTVPSLSLQDIPKEPIRVPTEVGDINGV 1360
            DPPEALK VPSLSL DIPKEPIR+PTEVGDINGV
Sbjct: 623  DPPEALKCVPSLSLHDIPKEPIRIPTEVGDINGV 656

