BLASTX nr result

ID: Mentha25_contig00015890 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00015890
         (938 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU40853.1| hypothetical protein MIMGU_mgv1a000693mg [Mimulus...   424   e-116
gb|EPS62321.1| hypothetical protein M569_12467, partial [Genlise...   419   e-114
ref|XP_002281154.2| PREDICTED: protein CHUP1, chloroplastic-like...   395   e-107
emb|CBI27077.3| unnamed protein product [Vitis vinifera]              395   e-107
emb|CAN78725.1| hypothetical protein VITISV_020008 [Vitis vinifera]   395   e-107
ref|XP_006362524.1| PREDICTED: protein CHUP1, chloroplastic-like...   392   e-106
ref|XP_004298311.1| PREDICTED: protein CHUP1, chloroplastic-like...   391   e-106
ref|XP_004159306.1| PREDICTED: protein CHUP1, chloroplastic-like...   390   e-106
ref|XP_004135119.1| PREDICTED: protein CHUP1, chloroplastic-like...   390   e-106
ref|XP_004238973.1| PREDICTED: uncharacterized protein LOC101267...   390   e-106
ref|XP_002315963.1| hypothetical protein POPTR_0010s14080g [Popu...   390   e-106
ref|XP_002875270.1| hypothetical protein ARALYDRAFT_484330 [Arab...   382   e-104
ref|XP_002524394.1| conserved hypothetical protein [Ricinus comm...   382   e-103
ref|NP_189197.2| protein CHUP1 [Arabidopsis thaliana] gi|3341856...   381   e-103
ref|NP_001189975.1| protein CHUP1 [Arabidopsis thaliana] gi|3326...   381   e-103
ref|XP_006395634.1| hypothetical protein EUTSA_v10003588mg [Eutr...   380   e-103
ref|XP_006395633.1| hypothetical protein EUTSA_v10003588mg [Eutr...   380   e-103
ref|XP_007046330.1| Hydroxyproline-rich glycoprotein family prot...   380   e-103
ref|XP_007046327.1| Hydroxyproline-rich glycoprotein family prot...   380   e-103
ref|XP_007227359.1| hypothetical protein PRUPE_ppa000786mg [Prun...   380   e-103

>gb|EYU40853.1| hypothetical protein MIMGU_mgv1a000693mg [Mimulus guttatus]
          Length = 1016

 Score =  424 bits (1091), Expect = e-116
 Identities = 227/302 (75%), Positives = 251/302 (83%), Gaps = 13/302 (4%)
 Frame = -2

Query: 889  RKNKELQHEKRELVVKLDSAESKVRTLSNITETEMVAKVREEVYELKHANEDLVKQVEGL 710
            RKNKEL +EKRELVVKLD+AE+ V+ LSN+TETEMVAKVREEV E++HANEDLVKQVEGL
Sbjct: 318  RKNKELHYEKRELVVKLDAAEANVKALSNMTETEMVAKVREEVNEMRHANEDLVKQVEGL 377

Query: 709  QMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLSKNLSPRSQEKAKQLMLEYA 530
            QMNRFSEVEELVYLRWVNACLRFELRNYQTPSGK+SARDL+K+LSPRSQE+AKQLMLE+A
Sbjct: 378  QMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKISARDLNKSLSPRSQERAKQLMLEFA 437

Query: 529  GSER-GGGDTDMESNFDNTSVESEDFDNMXXXXXXXXXXXXSKKPGLIQKLKRWGXXXXX 353
            GSER GGGDTDMESNFDNTSV+SEDFDN+            SKKP LIQKLKRWG     
Sbjct: 438  GSERGGGGDTDMESNFDNTSVDSEDFDNVSIDSSTSRFSTLSKKPSLIQKLKRWGGKSRD 497

Query: 352  XXXXXXSPARSFAGASPGRASL--KLRGPLEALMLRNASDGIAITSFGTGENDDL--NSP 185
                  SPARSFAG SP R+S+  K RGPLEALM+RNA DG+AITSFGT E D+   NSP
Sbjct: 498  DSSAFSSPARSFAGGSPSRSSVSQKPRGPLEALMIRNAGDGVAITSFGTAEMDESNNNSP 557

Query: 184  ETP--------NNVASSFHLMSKSVEGVLDEKYPAYKDRHKLALEREKHIKEKAQQARAV 29
             TP        N+VASSFHLMSKSVEGVL+EKYPAYKDRHK+A EREK IKE+AQQARAV
Sbjct: 558  VTPKLPTPDSLNSVASSFHLMSKSVEGVLEEKYPAYKDRHKIATEREKQIKERAQQARAV 617

Query: 28   KF 23
            +F
Sbjct: 618  RF 619


>gb|EPS62321.1| hypothetical protein M569_12467, partial [Genlisea aurea]
          Length = 950

 Score =  419 bits (1076), Expect = e-114
 Identities = 222/300 (74%), Positives = 251/300 (83%), Gaps = 6/300 (2%)
 Frame = -2

Query: 889  RKNKELQHEKRELVVKLDSAESKVRTLSNITETEMVAKVREEVYELKHANEDLVKQVEGL 710
            RKN+ELQHEKREL+VKLD+AES V+ LSN+TETEMVA +R EV EL+H N+DLVKQVEGL
Sbjct: 272  RKNRELQHEKRELMVKLDAAESNVKLLSNMTETEMVASIRGEVNELRHKNDDLVKQVEGL 331

Query: 709  QMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLSKNLSPRSQEKAKQLMLEYA 530
            QMNRFSEVEE+VYLRWVNACLRFELRN+QTPSG++SARDLSK+LSP+SQE+AKQL+LEYA
Sbjct: 332  QMNRFSEVEEMVYLRWVNACLRFELRNHQTPSGRISARDLSKSLSPKSQERAKQLLLEYA 391

Query: 529  GSERGGGDTDMESNFDNTSVESEDFDNMXXXXXXXXXXXXSKKPGLIQKLKRWGXXXXXX 350
            GSER GGDTD+ESNFDNTSV+SEDFD++            +KKPGLIQKLKRWG      
Sbjct: 392  GSER-GGDTDIESNFDNTSVDSEDFDSV-SVDSSSVTKFSNKKPGLIQKLKRWGGKGHED 449

Query: 349  XXXXXSPARSFAGASPGRASLKLRGPLEALMLRNASDGIAITSFGTGENDDLNSPETP-- 176
                 SPARS    SPGR +L+ +GPLEALMLRNA D +AITSFGTGEN+DLNSPETP  
Sbjct: 450  SSAMSSPARSSYAGSPGRVNLRPKGPLEALMLRNAGDNMAITSFGTGENEDLNSPETPVQ 509

Query: 175  ---NNVASSFHLMSKSVE-GVLDEKYPAYKDRHKLALEREKHIKEKAQQARAVKFGGVDS 8
               N+VASSF LMSKSVE GVLDEKYPA+KDRHKLA EREK IKEKAQQARAV+FGG  S
Sbjct: 510  VGLNSVASSFQLMSKSVEGGVLDEKYPAFKDRHKLASEREKQIKEKAQQARAVRFGGDSS 569


>ref|XP_002281154.2| PREDICTED: protein CHUP1, chloroplastic-like [Vitis vinifera]
          Length = 1003

 Score =  395 bits (1016), Expect = e-107
 Identities = 214/308 (69%), Positives = 240/308 (77%), Gaps = 18/308 (5%)
 Frame = -2

Query: 889  RKNKELQHEKRELVVKLDSAESKVRTLSNITETEMVAKVREEVYELKHANEDLVKQVEGL 710
            R+NKELQHEKREL+VKLD AE++V  LSN+TE+EMVAK RE+V  L+HANEDL+KQVEGL
Sbjct: 292  RRNKELQHEKRELLVKLDGAEARVAALSNMTESEMVAKAREDVNNLRHANEDLLKQVEGL 351

Query: 709  QMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLSKNLSPRSQEKAKQLMLEYA 530
            QMNRFSEVEELVYLRWVNACLR+ELRNYQTP GK+SARDLSK+LSPRSQE+AKQLMLEYA
Sbjct: 352  QMNRFSEVEELVYLRWVNACLRYELRNYQTPGGKISARDLSKSLSPRSQERAKQLMLEYA 411

Query: 529  GSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXSKKPGLIQKLKRWGXXXXX 353
            GSERG GDTD+ESNF + +S  SEDFDN             SKKP LIQKLK+WG     
Sbjct: 412  GSERGQGDTDLESNFSHPSSPGSEDFDNASIDSSTSRYSSLSKKPSLIQKLKKWG-KSRD 470

Query: 352  XXXXXXSPARSFAGASPGRASLKL--RGPLEALMLRNASDGIAITSFGTGENDDLNSPET 179
                  SPARSF G SPGR S+ L  RGPLEALMLRNA DG+AIT+FG  + +   SPET
Sbjct: 471  DSSVLSSPARSFGGGSPGRTSISLRPRGPLEALMLRNAGDGVAITTFGKIDQEAPESPET 530

Query: 178  P---------------NNVASSFHLMSKSVEGVLDEKYPAYKDRHKLALEREKHIKEKAQ 44
            P               NNVA+SF LMSKSVEGVLDEKYPAYKDRHKLALEREK IKEKA+
Sbjct: 531  PNLSHIRTRVSSSDSLNNVAASFQLMSKSVEGVLDEKYPAYKDRHKLALEREKQIKEKAE 590

Query: 43   QARAVKFG 20
            +ARA +FG
Sbjct: 591  KARAERFG 598


>emb|CBI27077.3| unnamed protein product [Vitis vinifera]
          Length = 969

 Score =  395 bits (1016), Expect = e-107
 Identities = 214/308 (69%), Positives = 240/308 (77%), Gaps = 18/308 (5%)
 Frame = -2

Query: 889  RKNKELQHEKRELVVKLDSAESKVRTLSNITETEMVAKVREEVYELKHANEDLVKQVEGL 710
            R+NKELQHEKREL+VKLD AE++V  LSN+TE+EMVAK RE+V  L+HANEDL+KQVEGL
Sbjct: 258  RRNKELQHEKRELLVKLDGAEARVAALSNMTESEMVAKAREDVNNLRHANEDLLKQVEGL 317

Query: 709  QMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLSKNLSPRSQEKAKQLMLEYA 530
            QMNRFSEVEELVYLRWVNACLR+ELRNYQTP GK+SARDLSK+LSPRSQE+AKQLMLEYA
Sbjct: 318  QMNRFSEVEELVYLRWVNACLRYELRNYQTPGGKISARDLSKSLSPRSQERAKQLMLEYA 377

Query: 529  GSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXSKKPGLIQKLKRWGXXXXX 353
            GSERG GDTD+ESNF + +S  SEDFDN             SKKP LIQKLK+WG     
Sbjct: 378  GSERGQGDTDLESNFSHPSSPGSEDFDNASIDSSTSRYSSLSKKPSLIQKLKKWG-KSRD 436

Query: 352  XXXXXXSPARSFAGASPGRASLKL--RGPLEALMLRNASDGIAITSFGTGENDDLNSPET 179
                  SPARSF G SPGR S+ L  RGPLEALMLRNA DG+AIT+FG  + +   SPET
Sbjct: 437  DSSVLSSPARSFGGGSPGRTSISLRPRGPLEALMLRNAGDGVAITTFGKIDQEAPESPET 496

Query: 178  P---------------NNVASSFHLMSKSVEGVLDEKYPAYKDRHKLALEREKHIKEKAQ 44
            P               NNVA+SF LMSKSVEGVLDEKYPAYKDRHKLALEREK IKEKA+
Sbjct: 497  PNLSHIRTRVSSSDSLNNVAASFQLMSKSVEGVLDEKYPAYKDRHKLALEREKQIKEKAE 556

Query: 43   QARAVKFG 20
            +ARA +FG
Sbjct: 557  KARAERFG 564


>emb|CAN78725.1| hypothetical protein VITISV_020008 [Vitis vinifera]
          Length = 955

 Score =  395 bits (1016), Expect = e-107
 Identities = 214/308 (69%), Positives = 240/308 (77%), Gaps = 18/308 (5%)
 Frame = -2

Query: 889  RKNKELQHEKRELVVKLDSAESKVRTLSNITETEMVAKVREEVYELKHANEDLVKQVEGL 710
            R+NKELQHEKREL+VKLD AE++V  LSN+TE+EMVAK RE+V  L+HANEDL+KQVEGL
Sbjct: 316  RRNKELQHEKRELLVKLDGAEARVAALSNMTESEMVAKAREDVNNLRHANEDLLKQVEGL 375

Query: 709  QMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLSKNLSPRSQEKAKQLMLEYA 530
            QMNRFSEVEELVYLRWVNACLR+ELRNYQTP GK+SARDLSK+LSPRSQE+AKQLMLEYA
Sbjct: 376  QMNRFSEVEELVYLRWVNACLRYELRNYQTPGGKISARDLSKSLSPRSQERAKQLMLEYA 435

Query: 529  GSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXSKKPGLIQKLKRWGXXXXX 353
            GSERG GDTD+ESNF + +S  SEDFDN             SKKP LIQKLK+WG     
Sbjct: 436  GSERGQGDTDLESNFSHPSSPGSEDFDNASIDSSTSRYSSLSKKPSLIQKLKKWG-KSRD 494

Query: 352  XXXXXXSPARSFAGASPGRASLKL--RGPLEALMLRNASDGIAITSFGTGENDDLNSPET 179
                  SPARSF G SPGR S+ L  RGPLEALMLRNA DG+AIT+FG  + +   SPET
Sbjct: 495  DSSVLSSPARSFGGGSPGRTSISLRPRGPLEALMLRNAGDGVAITTFGKIDQEAPESPET 554

Query: 178  P---------------NNVASSFHLMSKSVEGVLDEKYPAYKDRHKLALEREKHIKEKAQ 44
            P               NNVA+SF LMSKSVEGVLDEKYPAYKDRHKLALEREK IKEKA+
Sbjct: 555  PNLSHIRTRVSSSDSLNNVAASFQLMSKSVEGVLDEKYPAYKDRHKLALEREKQIKEKAE 614

Query: 43   QARAVKFG 20
            +ARA +FG
Sbjct: 615  KARAERFG 622


>ref|XP_006362524.1| PREDICTED: protein CHUP1, chloroplastic-like [Solanum tuberosum]
          Length = 991

 Score =  392 bits (1006), Expect = e-106
 Identities = 216/308 (70%), Positives = 238/308 (77%), Gaps = 19/308 (6%)
 Frame = -2

Query: 889  RKNKELQHEKRELVVKLDSAESKVRTLSNITETEMVAKVREEVYELKHANEDLVKQVEGL 710
            RKNKELQHEKRELV+KLD+AESK+  LSN+TE EMVA+VREEV  LKH N+DL+KQVEGL
Sbjct: 281  RKNKELQHEKRELVIKLDTAESKIAKLSNMTENEMVAQVREEVTNLKHTNDDLLKQVEGL 340

Query: 709  QMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLSKNLSPRSQEKAKQLMLEYA 530
            QMNRFSEVEELVYLRWVNACLRFELRNYQTP GKVSARDLSKNLSP+SQ+KAKQLMLEYA
Sbjct: 341  QMNRFSEVEELVYLRWVNACLRFELRNYQTPQGKVSARDLSKNLSPKSQQKAKQLMLEYA 400

Query: 529  GSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXSKKPGLIQKLKRWG-XXXX 356
            GSERG GDTD+ESNF   +S  SEDFDN             SKKP LIQKLK+WG     
Sbjct: 401  GSERGQGDTDLESNFSQPSSPGSEDFDNASIDSSTSRFSSFSKKPNLIQKLKKWGSRGGR 460

Query: 355  XXXXXXXSPARSFAGASPGRASLKL--RGPLEALMLRNASDGIAITSFGTGENDDLNSPE 182
                   SPARS  GASPGR S+ +  RGPLE+LMLRNA DG+AITSFGT E  +  SPE
Sbjct: 461  DDSSVMSSPARSLGGASPGRMSMSVRPRGPLESLMLRNAGDGVAITSFGTAE--EYGSPE 518

Query: 181  TP---------------NNVASSFHLMSKSVEGVLDEKYPAYKDRHKLALEREKHIKEKA 47
            TP               N+VASSF LMSKSVEGVLDEKYPA+KDRHKLA+EREK IK KA
Sbjct: 519  TPKLPPIRTQESSAETLNSVASSFTLMSKSVEGVLDEKYPAFKDRHKLAVEREKTIKVKA 578

Query: 46   QQARAVKF 23
            +QARA +F
Sbjct: 579  EQARAARF 586


>ref|XP_004298311.1| PREDICTED: protein CHUP1, chloroplastic-like [Fragaria vesca subsp.
            vesca]
          Length = 1001

 Score =  391 bits (1004), Expect = e-106
 Identities = 212/308 (68%), Positives = 242/308 (78%), Gaps = 18/308 (5%)
 Frame = -2

Query: 889  RKNKELQHEKRELVVKLDSAESKVRTLSNITETEMVAKVREEVYELKHANEDLVKQVEGL 710
            RKNKELQ EKREL +KL++AES+V  LSN+TETEMVA VR EV  LKHANEDL+KQVEGL
Sbjct: 289  RKNKELQIEKRELSIKLNAAESRVAELSNMTETEMVANVRSEVNNLKHANEDLLKQVEGL 348

Query: 709  QMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLSKNLSPRSQEKAKQLMLEYA 530
            QMNRFSEVEELVYLRWVNACLRFELRNYQTP GK+SARDL+KNLSP+SQEKAKQLMLEYA
Sbjct: 349  QMNRFSEVEELVYLRWVNACLRFELRNYQTPQGKISARDLNKNLSPKSQEKAKQLMLEYA 408

Query: 529  GSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXSKKPGLIQKLKRWGXXXXX 353
            GSERG GDTDMESN+   +S  SEDFDN             +K+P LIQKLK+WG     
Sbjct: 409  GSERGQGDTDMESNYSQPSSPGSEDFDNASIDSSTSRYSALTKRPSLIQKLKKWG-KSKD 467

Query: 352  XXXXXXSPARSFAGASPGRASLKL--RGPLEALMLRNASDGIAITSFGTGENDDLNSPET 179
                  SPARSF+G+SPGRAS+ +  RGPLE+LMLRNASDG+AIT+FG  + +  +SP+T
Sbjct: 468  DSSALSSPARSFSGSSPGRASMSVRPRGPLESLMLRNASDGVAITTFGKMDQELPDSPQT 527

Query: 178  ---------------PNNVASSFHLMSKSVEGVLDEKYPAYKDRHKLALEREKHIKEKAQ 44
                           PN+V+SSF LMSKSVEGVLDEKYPAYKDRHKLALERE+ IKE+A+
Sbjct: 528  PTLPSIRTQMPSSDSPNSVSSSFQLMSKSVEGVLDEKYPAYKDRHKLALERERQIKERAE 587

Query: 43   QARAVKFG 20
            QARA KFG
Sbjct: 588  QARAEKFG 595


>ref|XP_004159306.1| PREDICTED: protein CHUP1, chloroplastic-like [Cucumis sativus]
          Length = 987

 Score =  390 bits (1003), Expect = e-106
 Identities = 208/312 (66%), Positives = 242/312 (77%), Gaps = 16/312 (5%)
 Frame = -2

Query: 889  RKNKELQHEKRELVVKLDSAESKVRTLSNITETEMVAKVREEVYELKHANEDLVKQVEGL 710
            RKNKELQ EKREL +KLD+AE+K+ TLSN+TE+E+VA+ RE+V  L+HANEDL+KQVEGL
Sbjct: 273  RKNKELQIEKRELTIKLDAAENKISTLSNMTESELVAQTREQVSNLRHANEDLIKQVEGL 332

Query: 709  QMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLSKNLSPRSQEKAKQLMLEYA 530
            QMNRFSEVEELVYLRWVNACLR+ELRNYQ P+GK+SARDLSKNLSP+SQEKAKQLM+EYA
Sbjct: 333  QMNRFSEVEELVYLRWVNACLRYELRNYQAPTGKISARDLSKNLSPKSQEKAKQLMVEYA 392

Query: 529  GSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXSKKPGLIQKLKRWGXXXXX 353
            GSERG GDTD+ESN+   +S  SEDFDN             SKKP LIQKLK+WG     
Sbjct: 393  GSERGQGDTDLESNYSQPSSPGSEDFDNASIDSSFSRYSSLSKKPSLIQKLKKWGGRSKD 452

Query: 352  XXXXXXSPARSFAGASPGRA-SLKLRGPLEALMLRNASDGIAITSFGTGENDDLNSPETP 176
                  SPARSF+G SP  + S K RGPLE+LMLRNASD +AIT+FGT E + L+SP TP
Sbjct: 453  DSSALSSPARSFSGGSPRMSMSQKPRGPLESLMLRNASDSVAITTFGTMEQEPLDSPGTP 512

Query: 175  --------------NNVASSFHLMSKSVEGVLDEKYPAYKDRHKLALEREKHIKEKAQQA 38
                          N+V+SSF LMSKSVEGVLDEKYPAYKDRHKLAL REK +KE+A QA
Sbjct: 513  NLPSIRTQTPNDSLNSVSSSFQLMSKSVEGVLDEKYPAYKDRHKLALAREKQLKERADQA 572

Query: 37   RAVKFGGVDSNN 2
            RA KFG + ++N
Sbjct: 573  RAEKFGNLSNSN 584


>ref|XP_004135119.1| PREDICTED: protein CHUP1, chloroplastic-like [Cucumis sativus]
          Length = 987

 Score =  390 bits (1003), Expect = e-106
 Identities = 208/312 (66%), Positives = 242/312 (77%), Gaps = 16/312 (5%)
 Frame = -2

Query: 889  RKNKELQHEKRELVVKLDSAESKVRTLSNITETEMVAKVREEVYELKHANEDLVKQVEGL 710
            RKNKELQ EKREL +KLD+AE+K+ TLSN+TE+E+VA+ RE+V  L+HANEDL+KQVEGL
Sbjct: 273  RKNKELQIEKRELTIKLDAAENKISTLSNMTESELVAQTREQVSNLRHANEDLIKQVEGL 332

Query: 709  QMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLSKNLSPRSQEKAKQLMLEYA 530
            QMNRFSEVEELVYLRWVNACLR+ELRNYQ P+GK+SARDLSKNLSP+SQEKAKQLM+EYA
Sbjct: 333  QMNRFSEVEELVYLRWVNACLRYELRNYQAPTGKISARDLSKNLSPKSQEKAKQLMVEYA 392

Query: 529  GSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXSKKPGLIQKLKRWGXXXXX 353
            GSERG GDTD+ESN+   +S  SEDFDN             SKKP LIQKLK+WG     
Sbjct: 393  GSERGQGDTDLESNYSQPSSPGSEDFDNASIDSSFSRYSSLSKKPSLIQKLKKWGGRSKD 452

Query: 352  XXXXXXSPARSFAGASPGRA-SLKLRGPLEALMLRNASDGIAITSFGTGENDDLNSPETP 176
                  SPARSF+G SP  + S K RGPLE+LMLRNASD +AIT+FGT E + L+SP TP
Sbjct: 453  DSSALSSPARSFSGGSPRMSMSQKPRGPLESLMLRNASDSVAITTFGTMEQEPLDSPGTP 512

Query: 175  --------------NNVASSFHLMSKSVEGVLDEKYPAYKDRHKLALEREKHIKEKAQQA 38
                          N+V+SSF LMSKSVEGVLDEKYPAYKDRHKLAL REK +KE+A QA
Sbjct: 513  NLPSIRTQTPNDSLNSVSSSFQLMSKSVEGVLDEKYPAYKDRHKLALAREKQLKERADQA 572

Query: 37   RAVKFGGVDSNN 2
            RA KFG + ++N
Sbjct: 573  RAEKFGNLSNSN 584


>ref|XP_004238973.1| PREDICTED: uncharacterized protein LOC101267989 [Solanum
            lycopersicum]
          Length = 1174

 Score =  390 bits (1002), Expect = e-106
 Identities = 216/308 (70%), Positives = 237/308 (76%), Gaps = 19/308 (6%)
 Frame = -2

Query: 889  RKNKELQHEKRELVVKLDSAESKVRTLSNITETEMVAKVREEVYELKHANEDLVKQVEGL 710
            RKNKELQHEKRELV+KLD+AESK+  LSN+TE EMVA+VREEV  LKH N+DL+KQVEGL
Sbjct: 464  RKNKELQHEKRELVIKLDAAESKIAKLSNMTENEMVAQVREEVTNLKHTNDDLLKQVEGL 523

Query: 709  QMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLSKNLSPRSQEKAKQLMLEYA 530
            QMNRFSEVEELVYLRWVNACLRFELRNYQTP GKVSARDLSK+LSP+SQ KAKQLMLEYA
Sbjct: 524  QMNRFSEVEELVYLRWVNACLRFELRNYQTPQGKVSARDLSKSLSPKSQHKAKQLMLEYA 583

Query: 529  GSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXSKKPGLIQKLKRWG-XXXX 356
            GSERG GDTD+ESNF   +S  SEDFDN             SKKP LIQKLK+WG     
Sbjct: 584  GSERGQGDTDLESNFSQPSSPGSEDFDNASIDSSTSRFSTFSKKPNLIQKLKKWGSRGGK 643

Query: 355  XXXXXXXSPARSFAGASPGRASLKL--RGPLEALMLRNASDGIAITSFGTGENDDLNSPE 182
                   SPARS  GASPGR S+ +  RGPLE+LMLRNA DG+AITSFGT E  D  SPE
Sbjct: 644  DDSSIMSSPARSLGGASPGRMSMSVRPRGPLESLMLRNAGDGVAITSFGTAEEYD--SPE 701

Query: 181  TP---------------NNVASSFHLMSKSVEGVLDEKYPAYKDRHKLALEREKHIKEKA 47
            TP               N+VASSF LMSKSVEGVLDEKYPA+KDRHKLA+EREK IK KA
Sbjct: 702  TPKLPPIRTQESSAETLNSVASSFTLMSKSVEGVLDEKYPAFKDRHKLAVEREKTIKAKA 761

Query: 46   QQARAVKF 23
            +QARA +F
Sbjct: 762  EQARAARF 769


>ref|XP_002315963.1| hypothetical protein POPTR_0010s14080g [Populus trichocarpa]
            gi|222865003|gb|EEF02134.1| hypothetical protein
            POPTR_0010s14080g [Populus trichocarpa]
          Length = 955

 Score =  390 bits (1002), Expect = e-106
 Identities = 210/293 (71%), Positives = 245/293 (83%), Gaps = 4/293 (1%)
 Frame = -2

Query: 889  RKNKELQHEKRELVVKLDSAESKVRTLSNITETEMVAKVREEVYELKHANEDLVKQVEGL 710
            RKNKELQHEKREL++KL +AE+K+ +LSN++ETEMVAKVREEV  LKHANEDL+KQVEGL
Sbjct: 276  RKNKELQHEKRELIIKLGAAEAKLTSLSNLSETEMVAKVREEVNNLKHANEDLLKQVEGL 335

Query: 709  QMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLSKNLSPRSQEKAKQLMLEYA 530
            QMNRFSEVEELVYLRWVNACLR+ELRNYQTPSGKVSARDL+K+LSP+SQE+AKQL+LEYA
Sbjct: 336  QMNRFSEVEELVYLRWVNACLRYELRNYQTPSGKVSARDLNKSLSPKSQERAKQLLLEYA 395

Query: 529  GSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXSKKPGLIQKLKRWGXXXXX 353
            GSERG GDTDMESN+ + +S  SEDFDN             SKKP LIQKLK+WG     
Sbjct: 396  GSERGQGDTDMESNYSHPSSPGSEDFDN-TSIDSSSSRYSFSKKPNLIQKLKKWG-RSKD 453

Query: 352  XXXXXXSPARSFAGASPGRASL--KLRGPLEALMLRNASDGIAITSFGTGENDDLNSP-E 182
                  SP+RSF+G SP R+S+  + RGPLE+LM+RNASD +AITSFG  + D  +SP +
Sbjct: 454  DSSAFSSPSRSFSGVSPSRSSMSHRPRGPLESLMIRNASDTVAITSFGKMDQDAPDSPGD 513

Query: 181  TPNNVASSFHLMSKSVEGVLDEKYPAYKDRHKLALEREKHIKEKAQQARAVKF 23
            + N+VASSF +MSKSVEGVLDEKYPAYKDRHKLALEREKHIKEKA++ARAVKF
Sbjct: 514  SLNSVASSFQVMSKSVEGVLDEKYPAYKDRHKLALEREKHIKEKAEKARAVKF 566


>ref|XP_002875270.1| hypothetical protein ARALYDRAFT_484330 [Arabidopsis lyrata subsp.
            lyrata] gi|297321108|gb|EFH51529.1| hypothetical protein
            ARALYDRAFT_484330 [Arabidopsis lyrata subsp. lyrata]
          Length = 1002

 Score =  382 bits (982), Expect = e-104
 Identities = 205/313 (65%), Positives = 238/313 (76%), Gaps = 22/313 (7%)
 Frame = -2

Query: 889  RKNKELQHEKRELVVKLDSAESKVRTLSNITETEMVAKVREEVYELKHANEDLVKQVEGL 710
            RKN+ELQHEKREL +KLDSAE+++ TLSN+TE++ VAKVREEV  LKH NEDL+KQVEGL
Sbjct: 277  RKNRELQHEKRELSIKLDSAEARIATLSNMTESDKVAKVREEVNNLKHNNEDLLKQVEGL 336

Query: 709  QMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLSKNLSPRSQEKAKQLMLEYA 530
            QMNRFSEVEELVYLRWVNACLR+ELRNYQTP+GK+SARDLSKNLSP+SQ KAK+LMLEYA
Sbjct: 337  QMNRFSEVEELVYLRWVNACLRYELRNYQTPAGKISARDLSKNLSPKSQAKAKRLMLEYA 396

Query: 529  GSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXSKKPGLIQKLKRWGXXXXX 353
            GSERG GDTD+ESN+   +S  S+DFDN             SKKPGLIQKLKRWG     
Sbjct: 397  GSERGQGDTDLESNYSQPSSPGSDDFDNASMDSSTSRLSSFSKKPGLIQKLKRWG-KSKD 455

Query: 352  XXXXXXSPARSFAGASPGRASL---KLRGPLEALMLRNASDGIAITSFGTGENDDLNSPE 182
                  SP+RSF G SPGR S    K RGPLE+LM+RNA + +AIT+FG  + +   +PE
Sbjct: 456  DSSVQSSPSRSFYGGSPGRLSSSMNKQRGPLESLMIRNAGESVAITTFGQVDQESPGTPE 515

Query: 181  TP------------------NNVASSFHLMSKSVEGVLDEKYPAYKDRHKLALEREKHIK 56
            TP                  N+VA+SFH+MSKSV+ VLDEKYPAYKDRHKLA+EREKHIK
Sbjct: 516  TPNLPRIRTQQQASSPGEGLNSVATSFHVMSKSVDNVLDEKYPAYKDRHKLAVEREKHIK 575

Query: 55   EKAQQARAVKFGG 17
             KA QARA +FGG
Sbjct: 576  HKADQARAERFGG 588


>ref|XP_002524394.1| conserved hypothetical protein [Ricinus communis]
            gi|223536355|gb|EEF38005.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 998

 Score =  382 bits (981), Expect = e-103
 Identities = 205/309 (66%), Positives = 237/309 (76%), Gaps = 19/309 (6%)
 Frame = -2

Query: 889  RKNKELQHEKRELVVKLDSAESKVRTLSNITETEMVAKVREEVYELKHANEDLVKQVEGL 710
            RKNKELQHEKREL +KLD+A++K+ +LSN+TE+EMVAK R++V  L+HANEDL+KQVEGL
Sbjct: 287  RKNKELQHEKRELTIKLDAAQAKIVSLSNMTESEMVAKARDDVNNLRHANEDLLKQVEGL 346

Query: 709  QMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLSKNLSPRSQEKAKQLMLEYA 530
            QMNRFSEVEELVYLRWVNACLR+ELRNYQ P G+VSARDLSKNLSP+SQEKAK LMLEYA
Sbjct: 347  QMNRFSEVEELVYLRWVNACLRYELRNYQAPPGRVSARDLSKNLSPKSQEKAKHLMLEYA 406

Query: 529  GSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXSKKPGLIQKLKRWGXXXXX 353
            GSERG GDTD++SNF + +S  SEDFDN             SKKP LIQK+K+WG     
Sbjct: 407  GSERGQGDTDLDSNFSHPSSPGSEDFDNTSIDSSTSRYSSLSKKPSLIQKIKKWG-KSKD 465

Query: 352  XXXXXXSPARSFAGASPGRASLKL--RGPLEALMLRNASDGIAITSFGTGENDDLNSPET 179
                  SP+RSF+  SP R S+ L  RGPLEALMLRN  D +AIT+FG  E D  +SPET
Sbjct: 466  DSSALSSPSRSFSADSPSRTSMSLRSRGPLEALMLRNVGDSVAITTFGKSEQDVPDSPET 525

Query: 178  P----------------NNVASSFHLMSKSVEGVLDEKYPAYKDRHKLALEREKHIKEKA 47
            P                N+VASSF LMSKSVEGVLDEKYPAYKDRHKLALEREK IKE+A
Sbjct: 526  PSTLPQIRTRVASGDSLNSVASSFQLMSKSVEGVLDEKYPAYKDRHKLALEREKQIKERA 585

Query: 46   QQARAVKFG 20
            ++ARA +FG
Sbjct: 586  EKARAARFG 594


>ref|NP_189197.2| protein CHUP1 [Arabidopsis thaliana] gi|334185625|ref|NP_001189974.1|
            protein CHUP1 [Arabidopsis thaliana]
            gi|75273319|sp|Q9LI74.1|CHUP1_ARATH RecName: Full=Protein
            CHUP1, chloroplastic; AltName: Full=Protein CHLOROPLAST
            UNUSUAL POSITIONING 1 gi|11994760|dbj|BAB03089.1| unnamed
            protein product [Arabidopsis thaliana]
            gi|28071265|dbj|BAC55960.1| actin binding protein
            [Arabidopsis thaliana] gi|332643530|gb|AEE77051.1|
            protein CHUP1 [Arabidopsis thaliana]
            gi|332643531|gb|AEE77052.1| protein CHUP1 [Arabidopsis
            thaliana]
          Length = 1004

 Score =  381 bits (979), Expect = e-103
 Identities = 204/313 (65%), Positives = 238/313 (76%), Gaps = 22/313 (7%)
 Frame = -2

Query: 889  RKNKELQHEKRELVVKLDSAESKVRTLSNITETEMVAKVREEVYELKHANEDLVKQVEGL 710
            RKN+ELQHEKREL +KLDSAE+++ TLSN+TE++ VAKVREEV  LKH NEDL+KQVEGL
Sbjct: 278  RKNRELQHEKRELSIKLDSAEARIATLSNMTESDKVAKVREEVNNLKHNNEDLLKQVEGL 337

Query: 709  QMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLSKNLSPRSQEKAKQLMLEYA 530
            QMNRFSEVEELVYLRWVNACLR+ELRNYQTP+GK+SARDLSKNLSP+SQ KAK+LMLEYA
Sbjct: 338  QMNRFSEVEELVYLRWVNACLRYELRNYQTPAGKISARDLSKNLSPKSQAKAKRLMLEYA 397

Query: 529  GSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXSKKPGLIQKLKRWGXXXXX 353
            GSERG GDTD+ESN+   +S  S+DFDN             SKKPGLIQKLK+WG     
Sbjct: 398  GSERGQGDTDLESNYSQPSSPGSDDFDNASMDSSTSRFSSFSKKPGLIQKLKKWG-KSKD 456

Query: 352  XXXXXXSPARSFAGASPGRASL---KLRGPLEALMLRNASDGIAITSFGTGENDDLNSPE 182
                  SP+RSF G SPGR S    K RGPLE+LM+RNA + +AIT+FG  + +   +PE
Sbjct: 457  DSSVQSSPSRSFYGGSPGRLSSSMNKQRGPLESLMIRNAGESVAITTFGQVDQESPGTPE 516

Query: 181  TP------------------NNVASSFHLMSKSVEGVLDEKYPAYKDRHKLALEREKHIK 56
            TP                  N+VA+SFH+MSKSV+ VLDEKYPAYKDRHKLA+EREKHIK
Sbjct: 517  TPNLPRIRTQQQASSPGEGLNSVAASFHVMSKSVDNVLDEKYPAYKDRHKLAVEREKHIK 576

Query: 55   EKAQQARAVKFGG 17
             KA QARA +FGG
Sbjct: 577  HKADQARAERFGG 589


>ref|NP_001189975.1| protein CHUP1 [Arabidopsis thaliana] gi|332643532|gb|AEE77053.1|
            protein CHUP1 [Arabidopsis thaliana]
          Length = 863

 Score =  381 bits (979), Expect = e-103
 Identities = 204/313 (65%), Positives = 238/313 (76%), Gaps = 22/313 (7%)
 Frame = -2

Query: 889  RKNKELQHEKRELVVKLDSAESKVRTLSNITETEMVAKVREEVYELKHANEDLVKQVEGL 710
            RKN+ELQHEKREL +KLDSAE+++ TLSN+TE++ VAKVREEV  LKH NEDL+KQVEGL
Sbjct: 137  RKNRELQHEKRELSIKLDSAEARIATLSNMTESDKVAKVREEVNNLKHNNEDLLKQVEGL 196

Query: 709  QMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLSKNLSPRSQEKAKQLMLEYA 530
            QMNRFSEVEELVYLRWVNACLR+ELRNYQTP+GK+SARDLSKNLSP+SQ KAK+LMLEYA
Sbjct: 197  QMNRFSEVEELVYLRWVNACLRYELRNYQTPAGKISARDLSKNLSPKSQAKAKRLMLEYA 256

Query: 529  GSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXSKKPGLIQKLKRWGXXXXX 353
            GSERG GDTD+ESN+   +S  S+DFDN             SKKPGLIQKLK+WG     
Sbjct: 257  GSERGQGDTDLESNYSQPSSPGSDDFDNASMDSSTSRFSSFSKKPGLIQKLKKWG-KSKD 315

Query: 352  XXXXXXSPARSFAGASPGRASL---KLRGPLEALMLRNASDGIAITSFGTGENDDLNSPE 182
                  SP+RSF G SPGR S    K RGPLE+LM+RNA + +AIT+FG  + +   +PE
Sbjct: 316  DSSVQSSPSRSFYGGSPGRLSSSMNKQRGPLESLMIRNAGESVAITTFGQVDQESPGTPE 375

Query: 181  TP------------------NNVASSFHLMSKSVEGVLDEKYPAYKDRHKLALEREKHIK 56
            TP                  N+VA+SFH+MSKSV+ VLDEKYPAYKDRHKLA+EREKHIK
Sbjct: 376  TPNLPRIRTQQQASSPGEGLNSVAASFHVMSKSVDNVLDEKYPAYKDRHKLAVEREKHIK 435

Query: 55   EKAQQARAVKFGG 17
             KA QARA +FGG
Sbjct: 436  HKADQARAERFGG 448


>ref|XP_006395634.1| hypothetical protein EUTSA_v10003588mg [Eutrema salsugineum]
            gi|557092273|gb|ESQ32920.1| hypothetical protein
            EUTSA_v10003588mg [Eutrema salsugineum]
          Length = 1000

 Score =  380 bits (976), Expect = e-103
 Identities = 204/314 (64%), Positives = 238/314 (75%), Gaps = 23/314 (7%)
 Frame = -2

Query: 889  RKNKELQHEKRELVVKLDSAESKVRTLSNITETEMVAKVREEVYELKHANEDLVKQVEGL 710
            RKN+ELQHEKREL +KLDSAE+++  LSN+TE++ VAKVREEV  LKH NEDL+KQVEGL
Sbjct: 280  RKNRELQHEKRELTIKLDSAEARISALSNMTESDKVAKVREEVNNLKHNNEDLLKQVEGL 339

Query: 709  QMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLSKNLSPRSQEKAKQLMLEYA 530
            QMNRFSEVEELVYLRWVNACLR+ELRNYQTP+GK+SARDLSKNLSP+SQ KAK+LMLEYA
Sbjct: 340  QMNRFSEVEELVYLRWVNACLRYELRNYQTPAGKISARDLSKNLSPKSQAKAKRLMLEYA 399

Query: 529  GSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXSKKPGLIQKLKRWGXXXXX 353
            GSERG GDTD+ESNF   +S  S+DFDN             SKKPGLIQKLKRWG     
Sbjct: 400  GSERGQGDTDVESNFSQPSSPGSDDFDNASMDSSTSRFSSFSKKPGLIQKLKRWG-KSKD 458

Query: 352  XXXXXXSPARSFAGASPGRASL---KLRGPLEALMLRNASDGIAITSFGTGENDDLNSPE 182
                  SP+RSF G SPGR S+   K RGPLE+LM+RNA + +AIT+FG  + +  ++PE
Sbjct: 459  DSSVQSSPSRSFYGGSPGRLSVSMNKQRGPLESLMIRNAGESVAITTFGKVDQESPSTPE 518

Query: 181  TP-------------------NNVASSFHLMSKSVEGVLDEKYPAYKDRHKLALEREKHI 59
            TP                   N+VA+SF +MSKSV+ VLDEKYPAYKDRHKLA+EREKHI
Sbjct: 519  TPNLPRIRTQQQASSSPGEPLNSVAASFQVMSKSVDNVLDEKYPAYKDRHKLAVEREKHI 578

Query: 58   KEKAQQARAVKFGG 17
            K KA QARA +FGG
Sbjct: 579  KHKADQARAERFGG 592


>ref|XP_006395633.1| hypothetical protein EUTSA_v10003588mg [Eutrema salsugineum]
            gi|557092272|gb|ESQ32919.1| hypothetical protein
            EUTSA_v10003588mg [Eutrema salsugineum]
          Length = 998

 Score =  380 bits (976), Expect = e-103
 Identities = 204/314 (64%), Positives = 238/314 (75%), Gaps = 23/314 (7%)
 Frame = -2

Query: 889  RKNKELQHEKRELVVKLDSAESKVRTLSNITETEMVAKVREEVYELKHANEDLVKQVEGL 710
            RKN+ELQHEKREL +KLDSAE+++  LSN+TE++ VAKVREEV  LKH NEDL+KQVEGL
Sbjct: 278  RKNRELQHEKRELTIKLDSAEARISALSNMTESDKVAKVREEVNNLKHNNEDLLKQVEGL 337

Query: 709  QMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLSKNLSPRSQEKAKQLMLEYA 530
            QMNRFSEVEELVYLRWVNACLR+ELRNYQTP+GK+SARDLSKNLSP+SQ KAK+LMLEYA
Sbjct: 338  QMNRFSEVEELVYLRWVNACLRYELRNYQTPAGKISARDLSKNLSPKSQAKAKRLMLEYA 397

Query: 529  GSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXSKKPGLIQKLKRWGXXXXX 353
            GSERG GDTD+ESNF   +S  S+DFDN             SKKPGLIQKLKRWG     
Sbjct: 398  GSERGQGDTDVESNFSQPSSPGSDDFDNASMDSSTSRFSSFSKKPGLIQKLKRWG-KSKD 456

Query: 352  XXXXXXSPARSFAGASPGRASL---KLRGPLEALMLRNASDGIAITSFGTGENDDLNSPE 182
                  SP+RSF G SPGR S+   K RGPLE+LM+RNA + +AIT+FG  + +  ++PE
Sbjct: 457  DSSVQSSPSRSFYGGSPGRLSVSMNKQRGPLESLMIRNAGESVAITTFGKVDQESPSTPE 516

Query: 181  TP-------------------NNVASSFHLMSKSVEGVLDEKYPAYKDRHKLALEREKHI 59
            TP                   N+VA+SF +MSKSV+ VLDEKYPAYKDRHKLA+EREKHI
Sbjct: 517  TPNLPRIRTQQQASSSPGEPLNSVAASFQVMSKSVDNVLDEKYPAYKDRHKLAVEREKHI 576

Query: 58   KEKAQQARAVKFGG 17
            K KA QARA +FGG
Sbjct: 577  KHKADQARAERFGG 590


>ref|XP_007046330.1| Hydroxyproline-rich glycoprotein family protein isoform 4 [Theobroma
            cacao] gi|508710265|gb|EOY02162.1| Hydroxyproline-rich
            glycoprotein family protein isoform 4 [Theobroma cacao]
          Length = 933

 Score =  380 bits (975), Expect = e-103
 Identities = 204/308 (66%), Positives = 235/308 (76%), Gaps = 18/308 (5%)
 Frame = -2

Query: 889  RKNKELQHEKRELVVKLDSAESKVRTLSNITETEMVAKVREEVYELKHANEDLVKQVEGL 710
            RKNKELQHEKREL VKLD+AE+K+  LSN+TETE+  + REEV  L+HANEDL+KQVEGL
Sbjct: 287  RKNKELQHEKRELTVKLDAAEAKIAALSNMTETEIDVRAREEVSNLRHANEDLLKQVEGL 346

Query: 709  QMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLSKNLSPRSQEKAKQLMLEYA 530
            QMNRFSEVEELVYLRWVNACLR+ELRNYQTP GK+SARDL+K+LSP+SQE AKQL+LEYA
Sbjct: 347  QMNRFSEVEELVYLRWVNACLRYELRNYQTPEGKISARDLNKSLSPKSQETAKQLLLEYA 406

Query: 529  GSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXSKKPGLIQKLKRWGXXXXX 353
            GSERG GDTD+ESNF + +S  SED DN             SKKP LIQKLK+WG     
Sbjct: 407  GSERGQGDTDIESNFSHPSSTGSEDLDNASIYSSNSRYSSLSKKPSLIQKLKKWG-RSKD 465

Query: 352  XXXXXXSPARSFAGASPGRASLK--LRGPLEALMLRNASDGIAITSFGTGENDDLNSPET 179
                  SPARS +G SP R S+    RGPLEALMLRNA DG+AIT+FG  E +  +SPET
Sbjct: 466  DSSAVSSPARSLSGGSPSRISMSQHSRGPLEALMLRNAGDGVAITTFGKNEQEFTDSPET 525

Query: 178  ---------------PNNVASSFHLMSKSVEGVLDEKYPAYKDRHKLALEREKHIKEKAQ 44
                           PN+VA+SFHLMS+SV+G L+EKYPAYKDRHKLALEREK IK+KAQ
Sbjct: 526  PTIPNIRTQVSSGDSPNSVATSFHLMSRSVDGSLEEKYPAYKDRHKLALEREKQIKQKAQ 585

Query: 43   QARAVKFG 20
            QARA +FG
Sbjct: 586  QARAERFG 593


>ref|XP_007046327.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao] gi|590701143|ref|XP_007046328.1|
            Hydroxyproline-rich glycoprotein family protein isoform 1
            [Theobroma cacao] gi|590701146|ref|XP_007046329.1|
            Hydroxyproline-rich glycoprotein family protein isoform 1
            [Theobroma cacao] gi|590701152|ref|XP_007046331.1|
            Hydroxyproline-rich glycoprotein family protein isoform 1
            [Theobroma cacao] gi|590701156|ref|XP_007046332.1|
            Hydroxyproline-rich glycoprotein family protein isoform 1
            [Theobroma cacao] gi|590701159|ref|XP_007046333.1|
            Hydroxyproline-rich glycoprotein family protein isoform 1
            [Theobroma cacao] gi|590701163|ref|XP_007046334.1|
            Hydroxyproline-rich glycoprotein family protein isoform 1
            [Theobroma cacao] gi|508710262|gb|EOY02159.1|
            Hydroxyproline-rich glycoprotein family protein isoform 1
            [Theobroma cacao] gi|508710263|gb|EOY02160.1|
            Hydroxyproline-rich glycoprotein family protein isoform 1
            [Theobroma cacao] gi|508710264|gb|EOY02161.1|
            Hydroxyproline-rich glycoprotein family protein isoform 1
            [Theobroma cacao] gi|508710266|gb|EOY02163.1|
            Hydroxyproline-rich glycoprotein family protein isoform 1
            [Theobroma cacao] gi|508710267|gb|EOY02164.1|
            Hydroxyproline-rich glycoprotein family protein isoform 1
            [Theobroma cacao] gi|508710268|gb|EOY02165.1|
            Hydroxyproline-rich glycoprotein family protein isoform 1
            [Theobroma cacao] gi|508710269|gb|EOY02166.1|
            Hydroxyproline-rich glycoprotein family protein isoform 1
            [Theobroma cacao]
          Length = 996

 Score =  380 bits (975), Expect = e-103
 Identities = 204/308 (66%), Positives = 235/308 (76%), Gaps = 18/308 (5%)
 Frame = -2

Query: 889  RKNKELQHEKRELVVKLDSAESKVRTLSNITETEMVAKVREEVYELKHANEDLVKQVEGL 710
            RKNKELQHEKREL VKLD+AE+K+  LSN+TETE+  + REEV  L+HANEDL+KQVEGL
Sbjct: 287  RKNKELQHEKRELTVKLDAAEAKIAALSNMTETEIDVRAREEVSNLRHANEDLLKQVEGL 346

Query: 709  QMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLSKNLSPRSQEKAKQLMLEYA 530
            QMNRFSEVEELVYLRWVNACLR+ELRNYQTP GK+SARDL+K+LSP+SQE AKQL+LEYA
Sbjct: 347  QMNRFSEVEELVYLRWVNACLRYELRNYQTPEGKISARDLNKSLSPKSQETAKQLLLEYA 406

Query: 529  GSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXSKKPGLIQKLKRWGXXXXX 353
            GSERG GDTD+ESNF + +S  SED DN             SKKP LIQKLK+WG     
Sbjct: 407  GSERGQGDTDIESNFSHPSSTGSEDLDNASIYSSNSRYSSLSKKPSLIQKLKKWG-RSKD 465

Query: 352  XXXXXXSPARSFAGASPGRASLK--LRGPLEALMLRNASDGIAITSFGTGENDDLNSPET 179
                  SPARS +G SP R S+    RGPLEALMLRNA DG+AIT+FG  E +  +SPET
Sbjct: 466  DSSAVSSPARSLSGGSPSRISMSQHSRGPLEALMLRNAGDGVAITTFGKNEQEFTDSPET 525

Query: 178  ---------------PNNVASSFHLMSKSVEGVLDEKYPAYKDRHKLALEREKHIKEKAQ 44
                           PN+VA+SFHLMS+SV+G L+EKYPAYKDRHKLALEREK IK+KAQ
Sbjct: 526  PTIPNIRTQVSSGDSPNSVATSFHLMSRSVDGSLEEKYPAYKDRHKLALEREKQIKQKAQ 585

Query: 43   QARAVKFG 20
            QARA +FG
Sbjct: 586  QARAERFG 593


>ref|XP_007227359.1| hypothetical protein PRUPE_ppa000786mg [Prunus persica]
            gi|462424295|gb|EMJ28558.1| hypothetical protein
            PRUPE_ppa000786mg [Prunus persica]
          Length = 1004

 Score =  380 bits (975), Expect = e-103
 Identities = 207/314 (65%), Positives = 241/314 (76%), Gaps = 18/314 (5%)
 Frame = -2

Query: 889  RKNKELQHEKRELVVKLDSAESKVRTLSNITETEMVAKVREEVYELKHANEDLVKQVEGL 710
            RKNKELQ EKREL +KL++AE++V  LSN+TE++MVA VREEV  LKHANEDL KQVEGL
Sbjct: 296  RKNKELQIEKRELTIKLNAAEARVAALSNMTESDMVANVREEVNNLKHANEDLSKQVEGL 355

Query: 709  QMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLSKNLSPRSQEKAKQLMLEYA 530
            QMNRFSEVEELVYLRWVNACLR+ELRNYQTP GKVSARDL+K+LSP+SQEKAKQLMLEYA
Sbjct: 356  QMNRFSEVEELVYLRWVNACLRYELRNYQTPQGKVSARDLNKSLSPKSQEKAKQLMLEYA 415

Query: 529  GSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXSKKPGLIQKLKRWGXXXXX 353
            GSERG GDTD+ESNF + +S  SEDFDN+            SKKP ++QKLKRWG     
Sbjct: 416  GSERGQGDTDIESNFSHPSSPGSEDFDNVSIDSSTSRYNSLSKKPSIMQKLKRWG-KSKD 474

Query: 352  XXXXXXSPARSFAGASPGRASLKL--RGPLEALMLRNASDGIAITSFGTGENDDLNSPET 179
                  SP+RS +G SP RAS+ +  RGPLE+LM+RNA DG+AIT+FG  + +  +SP+T
Sbjct: 475  DSSALSSPSRSLSGGSPSRASMSVRPRGPLESLMIRNAGDGVAITTFGKVDQELPDSPQT 534

Query: 178  ---------------PNNVASSFHLMSKSVEGVLDEKYPAYKDRHKLALEREKHIKEKAQ 44
                           PN+VA+SF LMSKSVEGVLDEKYPAYKDRHKLALEREK I E+AQ
Sbjct: 535  PSLPNIRTQMSSSDSPNSVAASFQLMSKSVEGVLDEKYPAYKDRHKLALEREKQINERAQ 594

Query: 43   QARAVKFGGVDSNN 2
            QARA KFG   + N
Sbjct: 595  QARAEKFGDKSNVN 608

