BLASTX nr result

ID: Mentha26_contig00006291 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00006291
         (1150 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU40853.1| hypothetical protein MIMGU_mgv1a000693mg [Mimulus...   417   e-114
ref|XP_002281154.2| PREDICTED: protein CHUP1, chloroplastic-like...   378   e-102
emb|CAN78725.1| hypothetical protein VITISV_020008 [Vitis vinifera]   378   e-102
gb|EPS62321.1| hypothetical protein M569_12467, partial [Genlise...   375   e-101
ref|XP_002315963.1| hypothetical protein POPTR_0010s14080g [Popu...   375   e-101
ref|XP_002524394.1| conserved hypothetical protein [Ricinus comm...   358   2e-96
ref|XP_006362524.1| PREDICTED: protein CHUP1, chloroplastic-like...   357   4e-96
ref|XP_004238973.1| PREDICTED: uncharacterized protein LOC101267...   357   5e-96
gb|EXB53975.1| hypothetical protein L484_022943 [Morus notabilis]     355   3e-95
ref|XP_004159306.1| PREDICTED: protein CHUP1, chloroplastic-like...   355   3e-95
ref|XP_004135119.1| PREDICTED: protein CHUP1, chloroplastic-like...   355   3e-95
ref|XP_006573276.1| PREDICTED: protein CHUP1, chloroplastic-like...   354   3e-95
ref|XP_006574884.1| PREDICTED: protein CHUP1, chloroplastic-like...   351   4e-94
ref|XP_006484398.1| PREDICTED: protein CHUP1, chloroplastic-like...   350   8e-94
ref|XP_006395634.1| hypothetical protein EUTSA_v10003588mg [Eutr...   349   1e-93
ref|XP_006395633.1| hypothetical protein EUTSA_v10003588mg [Eutr...   349   1e-93
ref|XP_007046330.1| Hydroxyproline-rich glycoprotein family prot...   349   1e-93
ref|XP_007046327.1| Hydroxyproline-rich glycoprotein family prot...   349   1e-93
ref|XP_006290457.1| hypothetical protein CARUB_v10019508mg [Caps...   348   2e-93
ref|XP_007153329.1| hypothetical protein PHAVU_003G026100g [Phas...   348   3e-93

>gb|EYU40853.1| hypothetical protein MIMGU_mgv1a000693mg [Mimulus guttatus]
          Length = 1016

 Score =  417 bits (1073), Expect = e-114
 Identities = 232/385 (60%), Positives = 263/385 (68%), Gaps = 5/385 (1%)
 Frame = +3

Query: 3    GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182
            GEIDFP+PTDKYD +AN K+EKD++YE+EMA NA+                         
Sbjct: 135  GEIDFPIPTDKYDTSANSKSEKDKLYENEMAINATELERLRNLVRELEEREVKLEGELLE 194

Query: 183  XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362
                  QESSI+ELQKQLKIKTVEIDMLNITI+SLQAERKKLQEE+S G+A+RKELE+A+
Sbjct: 195  YYGLKEQESSISELQKQLKIKTVEIDMLNITISSLQAERKKLQEEVSHGVAARKELEIAK 254

Query: 363  XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQS----AXXXXXXXXXXXXXXXXX 530
                         AN                 +KEQ++    A                 
Sbjct: 255  KKMKDLQKQIQLEANQTKGQLLLLKQTVSGLQSKEQEAVTKDADVEKKLKAVKELEVEVM 314

Query: 531  XXXRKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710
               RKNKEL +EKRELVVKLD+A++ V+ LSN+TETEMVAKVREEV E++HANEDLVKQV
Sbjct: 315  ELKRKNKELHYEKRELVVKLDAAEANVKALSNMTETEMVAKVREEVNEMRHANEDLVKQV 374

Query: 711  EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890
            EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGK+SARDLNKSLSPRSQE+AKQLML
Sbjct: 375  EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKISARDLNKSLSPRSQERAKQLML 434

Query: 891  EYAGSER-GGGDTDMESNFDNTSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXX 1067
            E+AGSER GGGDTDMESNFDNTSV+SEDFDN+             KKP LIQKLKRWG  
Sbjct: 435  EFAGSERGGGGDTDMESNFDNTSVDSEDFDNVSIDSSTSRFSTLSKKPSLIQKLKRWGGK 494

Query: 1068 XXXXXXXXXXPARSFAGASPSRPSL 1142
                      PARSFAG SPSR S+
Sbjct: 495  SRDDSSAFSSPARSFAGGSPSRSSV 519


>ref|XP_002281154.2| PREDICTED: protein CHUP1, chloroplastic-like [Vitis vinifera]
          Length = 1003

 Score =  378 bits (971), Expect = e-102
 Identities = 213/385 (55%), Positives = 248/385 (64%), Gaps = 5/385 (1%)
 Frame = +3

Query: 3    GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182
            GEID PLP+DK+D     K EKDRVYE+EMANNA+                         
Sbjct: 109  GEIDIPLPSDKFDTETAAKVEKDRVYETEMANNANELERLRNLVKELEEREVKLEGELLE 168

Query: 183  XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362
                  QE+ IAELQ+QLKIKTVEIDMLNITI+SLQAERKKLQ+E++ G+++RKELE+AR
Sbjct: 169  YYGLKEQETDIAELQRQLKIKTVEIDMLNITISSLQAERKKLQDEVALGVSARKELEVAR 228

Query: 363  XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQS----AXXXXXXXXXXXXXXXXX 530
                         AN                 TKEQ++    A                 
Sbjct: 229  NKIKELQRQIQVEANQTKGHLLLLKQQVSGLQTKEQEAIKKDAEIEKKLKAAKELEVEVV 288

Query: 531  XXXRKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710
               R+NKELQHEKREL+VKLD A+++V  LSN+TE+EMVAK RE+V  L+HANEDL+KQV
Sbjct: 289  ELKRRNKELQHEKRELLVKLDGAEARVAALSNMTESEMVAKAREDVNNLRHANEDLLKQV 348

Query: 711  EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890
            EGLQMNRFSEVEELVYLRWVNACLR+ELRNYQTP GK+SARDL+KSLSPRSQE+AKQLML
Sbjct: 349  EGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPGGKISARDLSKSLSPRSQERAKQLML 408

Query: 891  EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXX 1067
            EYAGSERG GDTD+ESNF + +S  SEDFDN              KKP LIQKLK+WG  
Sbjct: 409  EYAGSERGQGDTDLESNFSHPSSPGSEDFDNASIDSSTSRYSSLSKKPSLIQKLKKWG-K 467

Query: 1068 XXXXXXXXXXPARSFAGASPSRPSL 1142
                      PARSF G SP R S+
Sbjct: 468  SRDDSSVLSSPARSFGGGSPGRTSI 492


>emb|CAN78725.1| hypothetical protein VITISV_020008 [Vitis vinifera]
          Length = 955

 Score =  378 bits (971), Expect = e-102
 Identities = 213/385 (55%), Positives = 248/385 (64%), Gaps = 5/385 (1%)
 Frame = +3

Query: 3    GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182
            GEID PLP+DK+D     K EKDRVYE+EMANNA+                         
Sbjct: 133  GEIDIPLPSDKFDTETAAKVEKDRVYETEMANNANELERLRNLVKELEEREVKLEGELLE 192

Query: 183  XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362
                  QE+ IAELQ+QLKIKTVEIDMLNITI+SLQAERKKLQ+E++ G+++RKELE+AR
Sbjct: 193  YYGLKEQETDIAELQRQLKIKTVEIDMLNITISSLQAERKKLQDEVALGVSARKELEVAR 252

Query: 363  XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQS----AXXXXXXXXXXXXXXXXX 530
                         AN                 TKEQ++    A                 
Sbjct: 253  NKIKELQRQIQVEANQTKGHLLLLKQQVSGLQTKEQEAIKKDAEIEKKLKAAKELEVEVV 312

Query: 531  XXXRKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710
               R+NKELQHEKREL+VKLD A+++V  LSN+TE+EMVAK RE+V  L+HANEDL+KQV
Sbjct: 313  ELKRRNKELQHEKRELLVKLDGAEARVAALSNMTESEMVAKAREDVNNLRHANEDLLKQV 372

Query: 711  EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890
            EGLQMNRFSEVEELVYLRWVNACLR+ELRNYQTP GK+SARDL+KSLSPRSQE+AKQLML
Sbjct: 373  EGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPGGKISARDLSKSLSPRSQERAKQLML 432

Query: 891  EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXX 1067
            EYAGSERG GDTD+ESNF + +S  SEDFDN              KKP LIQKLK+WG  
Sbjct: 433  EYAGSERGQGDTDLESNFSHPSSPGSEDFDNASIDSSTSRYSSLSKKPSLIQKLKKWG-K 491

Query: 1068 XXXXXXXXXXPARSFAGASPSRPSL 1142
                      PARSF G SP R S+
Sbjct: 492  SRDDSSVLSSPARSFGGGSPGRTSI 516


>gb|EPS62321.1| hypothetical protein M569_12467, partial [Genlisea aurea]
          Length = 950

 Score =  375 bits (962), Expect = e-101
 Identities = 212/386 (54%), Positives = 251/386 (65%), Gaps = 4/386 (1%)
 Frame = +3

Query: 3    GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182
            GEIDFPLPTDKY++A+   A  D+VYE EMANNAS                         
Sbjct: 90   GEIDFPLPTDKYESAS-ASAADDKVYEYEMANNASELERLRNLVKELEEREVKLEGELLE 148

Query: 183  XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362
                  QES+++ELQKQL IKT+EIDML ITINSLQAERKKLQEE+SQG++ + EL++AR
Sbjct: 149  YYGLKEQESNVSELQKQLHIKTLEIDMLQITINSLQAERKKLQEEVSQGVSVKNELDLAR 208

Query: 363  XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQS----AXXXXXXXXXXXXXXXXX 530
                         AN                  KEQ++                      
Sbjct: 209  KKINELQKQIQLDANQTKGQLLLLKQQVSTLQAKEQETIRKDGEFEKKFKALKELEVEVM 268

Query: 531  XXXRKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710
               RKN+ELQHEKREL+VKLD+A+S V+ LSN+TETEMVA +R EV EL+H N+DLVKQV
Sbjct: 269  ELKRKNRELQHEKRELMVKLDAAESNVKLLSNMTETEMVASIRGEVNELRHKNDDLVKQV 328

Query: 711  EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890
            EGLQMNRFSEVEE+VYLRWVNACLRFELRN+QTPSG++SARDL+KSLSP+SQE+AKQL+L
Sbjct: 329  EGLQMNRFSEVEEMVYLRWVNACLRFELRNHQTPSGRISARDLSKSLSPKSQERAKQLLL 388

Query: 891  EYAGSERGGGDTDMESNFDNTSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXXX 1070
            EYAGSER GGDTD+ESNFDNTSV+SEDFD++             KKPGLIQKLKRWG   
Sbjct: 389  EYAGSER-GGDTDIESNFDNTSVDSEDFDSV-SVDSSSVTKFSNKKPGLIQKLKRWGGKG 446

Query: 1071 XXXXXXXXXPARSFAGASPSRPSLKP 1148
                     PARS    SP R +L+P
Sbjct: 447  HEDSSAMSSPARSSYAGSPGRVNLRP 472


>ref|XP_002315963.1| hypothetical protein POPTR_0010s14080g [Populus trichocarpa]
            gi|222865003|gb|EEF02134.1| hypothetical protein
            POPTR_0010s14080g [Populus trichocarpa]
          Length = 955

 Score =  375 bits (962), Expect = e-101
 Identities = 215/385 (55%), Positives = 251/385 (65%), Gaps = 5/385 (1%)
 Frame = +3

Query: 3    GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182
            GEID+PLP +K+D     +AEKD++YE+EMANNAS                         
Sbjct: 98   GEIDYPLPGEKFD-----QAEKDKIYETEMANNASELECLRNLVRELEEREVKLEGELLE 152

Query: 183  XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362
                  QES + ELQ+QLKIKTVEIDMLNITINSLQAERKKLQEE+S G +S+KELE+AR
Sbjct: 153  YYGLKEQESDVVELQRQLKIKTVEIDMLNITINSLQAERKKLQEEISHGASSKKELELAR 212

Query: 363  XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQS----AXXXXXXXXXXXXXXXXX 530
                         AN                  KEQ++    A                 
Sbjct: 213  NKIKEFQRQIQLDANQTKGQLLLLKQQVSGLQAKEQEAVKKDAEVEKRLKAVKELEVEVV 272

Query: 531  XXXRKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710
               RKNKELQHEKREL++KL +A++K+ +LSN++ETEMVAKVREEV  LKHANEDL+KQV
Sbjct: 273  ELKRKNKELQHEKRELIIKLGAAEAKLTSLSNLSETEMVAKVREEVNNLKHANEDLLKQV 332

Query: 711  EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890
            EGLQMNRFSEVEELVYLRWVNACLR+ELRNYQTPSGKVSARDLNKSLSP+SQE+AKQL+L
Sbjct: 333  EGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPSGKVSARDLNKSLSPKSQERAKQLLL 392

Query: 891  EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXX 1067
            EYAGSERG GDTDMESN+ + +S  SEDFDN              KKP LIQKLK+WG  
Sbjct: 393  EYAGSERGQGDTDMESNYSHPSSPGSEDFDN-TSIDSSSSRYSFSKKPNLIQKLKKWG-R 450

Query: 1068 XXXXXXXXXXPARSFAGASPSRPSL 1142
                      P+RSF+G SPSR S+
Sbjct: 451  SKDDSSAFSSPSRSFSGVSPSRSSM 475


>ref|XP_002524394.1| conserved hypothetical protein [Ricinus communis]
            gi|223536355|gb|EEF38005.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 998

 Score =  358 bits (919), Expect = 2e-96
 Identities = 204/385 (52%), Positives = 244/385 (63%), Gaps = 5/385 (1%)
 Frame = +3

Query: 3    GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182
            GEID+PLP D+ D     KAEKD+VYE+EMANNAS                         
Sbjct: 109  GEIDYPLPGDRVD-----KAEKDKVYENEMANNASELERLRNLVRELEEREVKLEGELLE 163

Query: 183  XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362
                  QES +AE+ +QLKIKTVEIDMLNITINSLQAERKKLQEE++QG +++KELE AR
Sbjct: 164  YYGLKEQESDVAEIHRQLKIKTVEIDMLNITINSLQAERKKLQEEVAQGASAKKELEAAR 223

Query: 363  XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQS----AXXXXXXXXXXXXXXXXX 530
                         AN                  KE+++    A                 
Sbjct: 224  TKIKELQRQIQLDANQTKGQLLLLKQQVSGLQAKEEEAIKKDAELERKLKAVKDLEVEVV 283

Query: 531  XXXRKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710
               RKNKELQHEKREL +KLD+A +K+ +LSN+TE+EMVAK R++V  L+HANEDL+KQV
Sbjct: 284  ELRRKNKELQHEKRELTIKLDAAQAKIVSLSNMTESEMVAKARDDVNNLRHANEDLLKQV 343

Query: 711  EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890
            EGLQMNRFSEVEELVYLRWVNACLR+ELRNYQ P G+VSARDL+K+LSP+SQEKAK LML
Sbjct: 344  EGLQMNRFSEVEELVYLRWVNACLRYELRNYQAPPGRVSARDLSKNLSPKSQEKAKHLML 403

Query: 891  EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXX 1067
            EYAGSERG GDTD++SNF + +S  SEDFDN              KKP LIQK+K+WG  
Sbjct: 404  EYAGSERGQGDTDLDSNFSHPSSPGSEDFDNTSIDSSTSRYSSLSKKPSLIQKIKKWG-K 462

Query: 1068 XXXXXXXXXXPARSFAGASPSRPSL 1142
                      P+RSF+  SPSR S+
Sbjct: 463  SKDDSSALSSPSRSFSADSPSRTSM 487


>ref|XP_006362524.1| PREDICTED: protein CHUP1, chloroplastic-like [Solanum tuberosum]
          Length = 991

 Score =  357 bits (917), Expect = 4e-96
 Identities = 206/386 (53%), Positives = 241/386 (62%), Gaps = 6/386 (1%)
 Frame = +3

Query: 3    GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182
            GEI+FPLP+DKYD     + E++RVY++EMA NA+                         
Sbjct: 101  GEIEFPLPSDKYDTG---REERERVYQTEMAYNANELERLRNLVKELEEREVKLEGELLE 157

Query: 183  XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362
                  QES I ELQKQLKIK+VEIDMLNITIN+LQAE++KLQEE+  G  +RK+LE AR
Sbjct: 158  YYGLKEQESDILELQKQLKIKSVEIDMLNITINTLQAEKQKLQEEVFHGTTARKDLEAAR 217

Query: 363  XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQS----AXXXXXXXXXXXXXXXXX 530
                         AN                  KE+++    +                 
Sbjct: 218  SKIKELQRQMQLEANQTKAQLLLLKQHVTGLQEKEEEAFKRDSDVDKKLKLVKELEVEVM 277

Query: 531  XXXRKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710
               RKNKELQHEKRELV+KLD+A+SK+  LSN+TE EMVA+VREEV  LKH N+DL+KQV
Sbjct: 278  ELKRKNKELQHEKRELVIKLDTAESKIAKLSNMTENEMVAQVREEVTNLKHTNDDLLKQV 337

Query: 711  EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890
            EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTP GKVSARDL+K+LSP+SQ+KAKQLML
Sbjct: 338  EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPQGKVSARDLSKNLSPKSQQKAKQLML 397

Query: 891  EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWG-X 1064
            EYAGSERG GDTD+ESNF   +S  SEDFDN              KKP LIQKLK+WG  
Sbjct: 398  EYAGSERGQGDTDLESNFSQPSSPGSEDFDNASIDSSTSRFSSFSKKPNLIQKLKKWGSR 457

Query: 1065 XXXXXXXXXXXPARSFAGASPSRPSL 1142
                       PARS  GASP R S+
Sbjct: 458  GGRDDSSVMSSPARSLGGASPGRMSM 483


>ref|XP_004238973.1| PREDICTED: uncharacterized protein LOC101267989 [Solanum
            lycopersicum]
          Length = 1174

 Score =  357 bits (916), Expect = 5e-96
 Identities = 206/386 (53%), Positives = 239/386 (61%), Gaps = 6/386 (1%)
 Frame = +3

Query: 3    GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182
            GEI+FPLP+DKYD     + E++RVY++EMA NA+                         
Sbjct: 284  GEIEFPLPSDKYDTG---REERERVYQTEMAYNANELERLRNLVKELEEREVKLEGELLE 340

Query: 183  XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362
                  QES + ELQKQLKIK VEIDMLNITIN+LQAE++KLQEE+  G  +RK+LE AR
Sbjct: 341  YYGLKEQESDVLELQKQLKIKAVEIDMLNITINTLQAEKQKLQEEVFHGTTARKDLEAAR 400

Query: 363  XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQS----AXXXXXXXXXXXXXXXXX 530
                         AN                  KE+++    +                 
Sbjct: 401  SKIKELQRQMQLEANQTKAQLLLLKQHVTELQEKEEEAFKRDSEVDKKLKLVKELEVEVM 460

Query: 531  XXXRKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710
               RKNKELQHEKRELV+KLD+A+SK+  LSN+TE EMVA+VREEV  LKH N+DL+KQV
Sbjct: 461  ELKRKNKELQHEKRELVIKLDAAESKIAKLSNMTENEMVAQVREEVTNLKHTNDDLLKQV 520

Query: 711  EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890
            EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTP GKVSARDL+KSLSP+SQ KAKQLML
Sbjct: 521  EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPQGKVSARDLSKSLSPKSQHKAKQLML 580

Query: 891  EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWG-X 1064
            EYAGSERG GDTD+ESNF   +S  SEDFDN              KKP LIQKLK+WG  
Sbjct: 581  EYAGSERGQGDTDLESNFSQPSSPGSEDFDNASIDSSTSRFSTFSKKPNLIQKLKKWGSR 640

Query: 1065 XXXXXXXXXXXPARSFAGASPSRPSL 1142
                       PARS  GASP R S+
Sbjct: 641  GGKDDSSIMSSPARSLGGASPGRMSM 666


>gb|EXB53975.1| hypothetical protein L484_022943 [Morus notabilis]
          Length = 1617

 Score =  355 bits (910), Expect = 3e-95
 Identities = 206/385 (53%), Positives = 242/385 (62%), Gaps = 5/385 (1%)
 Frame = +3

Query: 3    GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182
            GEI+FPLP+ K D     K++KD+VYE+EMANNAS                         
Sbjct: 729  GEIEFPLPSSKSD-----KSQKDKVYETEMANNASELERLRKLVKELEEREVKLEGELLE 783

Query: 183  XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362
                  QES I ELQ+QLKIK+VE++MLNITINSLQAERKKLQ+E++QG ++RKELE AR
Sbjct: 784  YYGLKEQESDIDELQRQLKIKSVEVNMLNITINSLQAERKKLQDEIAQGASARKELEAAR 843

Query: 363  XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQS----AXXXXXXXXXXXXXXXXX 530
                         AN                  KE+++    A                 
Sbjct: 844  NKIKELQRQIQLDANQTKGQLLLLKQQVSGLQAKEEEAVKKDAELEKKLKAVKELEVEVV 903

Query: 531  XXXRKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710
               RKNKELQHEKREL+VKLD+A ++V  LS++TE+E VA  REEV  L+HANEDL+KQV
Sbjct: 904  ELKRKNKELQHEKRELIVKLDAAQARVTALSSMTESEKVANAREEVNNLRHANEDLLKQV 963

Query: 711  EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890
            EGLQMNRFSEVEELVYLRWVNACLR+ELRNYQ P GK+SARDLNKSLSPRSQEKAKQLML
Sbjct: 964  EGLQMNRFSEVEELVYLRWVNACLRYELRNYQAPPGKMSARDLNKSLSPRSQEKAKQLML 1023

Query: 891  EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXX 1067
            EYAGSERG GDTD+ESNF + +S  SEDFDN              KK  LIQKLK+WG  
Sbjct: 1024 EYAGSERGQGDTDIESNFSHPSSPGSEDFDNASIDSFTSRVSSLGKKTSLIQKLKKWG-R 1082

Query: 1068 XXXXXXXXXXPARSFAGASPSRPSL 1142
                      P+RS +G SPSR S+
Sbjct: 1083 SKDDSSALLSPSRSLSGGSPSRMSM 1107


>ref|XP_004159306.1| PREDICTED: protein CHUP1, chloroplastic-like [Cucumis sativus]
          Length = 987

 Score =  355 bits (910), Expect = 3e-95
 Identities = 204/380 (53%), Positives = 241/380 (63%), Gaps = 5/380 (1%)
 Frame = +3

Query: 3    GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182
            GEI+FPLP  + D++   KAEKDRVYE+EMANNAS                         
Sbjct: 95   GEIEFPLP--EIDDS---KAEKDRVYETEMANNASELERLRNLVKELEEREVKLEGELLE 149

Query: 183  XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362
                  QES I ELQ+QLKIK VEIDMLNITI+SLQAERKKLQEE++Q  A +KELE AR
Sbjct: 150  YYGLKEQESDITELQRQLKIKAVEIDMLNITISSLQAERKKLQEEIAQDAAVKKELEFAR 209

Query: 363  XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQS----AXXXXXXXXXXXXXXXXX 530
                         AN                 +KEQ++    A                 
Sbjct: 210  NKIKELQRQIQLDANQTKGQLLLLKQQVSGLQSKEQETIKKDAELEKKLKAVKELEVEVM 269

Query: 531  XXXRKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710
               RKNKELQ EKREL +KLD+A++K+ TLSN+TE+E+VA+ RE+V  L+HANEDL+KQV
Sbjct: 270  ELKRKNKELQIEKRELTIKLDAAENKISTLSNMTESELVAQTREQVSNLRHANEDLIKQV 329

Query: 711  EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890
            EGLQMNRFSEVEELVYLRWVNACLR+ELRNYQ P+GK+SARDL+K+LSP+SQEKAKQLM+
Sbjct: 330  EGLQMNRFSEVEELVYLRWVNACLRYELRNYQAPTGKISARDLSKNLSPKSQEKAKQLMV 389

Query: 891  EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXX 1067
            EYAGSERG GDTD+ESN+   +S  SEDFDN              KKP LIQKLK+WG  
Sbjct: 390  EYAGSERGQGDTDLESNYSQPSSPGSEDFDNASIDSSFSRYSSLSKKPSLIQKLKKWGGR 449

Query: 1068 XXXXXXXXXXPARSFAGASP 1127
                      PARSF+G SP
Sbjct: 450  SKDDSSALSSPARSFSGGSP 469


>ref|XP_004135119.1| PREDICTED: protein CHUP1, chloroplastic-like [Cucumis sativus]
          Length = 987

 Score =  355 bits (910), Expect = 3e-95
 Identities = 204/380 (53%), Positives = 241/380 (63%), Gaps = 5/380 (1%)
 Frame = +3

Query: 3    GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182
            GEI+FPLP  + D++   KAEKDRVYE+EMANNAS                         
Sbjct: 95   GEIEFPLP--EIDDS---KAEKDRVYETEMANNASELERLRNLVKELEEREVKLEGELLE 149

Query: 183  XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362
                  QES I ELQ+QLKIK VEIDMLNITI+SLQAERKKLQEE++Q  A +KELE AR
Sbjct: 150  YYGLKEQESDITELQRQLKIKAVEIDMLNITISSLQAERKKLQEEIAQDAAVKKELEFAR 209

Query: 363  XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQS----AXXXXXXXXXXXXXXXXX 530
                         AN                 +KEQ++    A                 
Sbjct: 210  NKIKELQRQIQLDANQTKGQLLLLKQQVSGLQSKEQETIKKDAELEKKLKAVKELEVEVM 269

Query: 531  XXXRKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710
               RKNKELQ EKREL +KLD+A++K+ TLSN+TE+E+VA+ RE+V  L+HANEDL+KQV
Sbjct: 270  ELKRKNKELQIEKRELTIKLDAAENKISTLSNMTESELVAQTREQVSNLRHANEDLIKQV 329

Query: 711  EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890
            EGLQMNRFSEVEELVYLRWVNACLR+ELRNYQ P+GK+SARDL+K+LSP+SQEKAKQLM+
Sbjct: 330  EGLQMNRFSEVEELVYLRWVNACLRYELRNYQAPTGKISARDLSKNLSPKSQEKAKQLMV 389

Query: 891  EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXX 1067
            EYAGSERG GDTD+ESN+   +S  SEDFDN              KKP LIQKLK+WG  
Sbjct: 390  EYAGSERGQGDTDLESNYSQPSSPGSEDFDNASIDSSFSRYSSLSKKPSLIQKLKKWGGR 449

Query: 1068 XXXXXXXXXXPARSFAGASP 1127
                      PARSF+G SP
Sbjct: 450  SKDDSSALSSPARSFSGGSP 469


>ref|XP_006573276.1| PREDICTED: protein CHUP1, chloroplastic-like [Glycine max]
          Length = 968

 Score =  354 bits (909), Expect = 3e-95
 Identities = 210/385 (54%), Positives = 238/385 (61%), Gaps = 5/385 (1%)
 Frame = +3

Query: 3    GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182
            GEI+FPLP DK         EKD+VYE EMANNAS                         
Sbjct: 82   GEIEFPLPPDK--------DEKDKVYEIEMANNASELERLRQLVKELEEREVKLEGELLE 133

Query: 183  XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362
                  QES I ELQ+QLKIKTVEIDMLNITINSLQAERKKLQEEL+QG +++KELE+AR
Sbjct: 134  YYGLKEQESDIVELQRQLKIKTVEIDMLNITINSLQAERKKLQEELTQGASAKKELEVAR 193

Query: 363  XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQSAXXXXXXXXXXXXXXXXXXXX- 539
                         AN                  KE+++A                     
Sbjct: 194  NKIKELQRQIQLEANQTKGQLLLLKQQVSTLLVKEEEAARKDAEVEKKLKAVNDLEVAVV 253

Query: 540  ---RKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710
               RKNKELQHEKREL VKL+ A+S+   LSN+TE+EMVAK +EEV  L+HANEDL+KQV
Sbjct: 254  ELKRKNKELQHEKRELTVKLNVAESRAAELSNMTESEMVAKAKEEVSNLRHANEDLLKQV 313

Query: 711  EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890
            EGLQMNRFSEVEELVYLRWVNACLR+ELRN QTP GKVSARDL+KSLSP+SQEKAKQLML
Sbjct: 314  EGLQMNRFSEVEELVYLRWVNACLRYELRNNQTPQGKVSARDLSKSLSPKSQEKAKQLML 373

Query: 891  EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXX 1067
            EYAGSERG GDTD+ESNF + +S  SEDFDN              KK  LIQK K+WG  
Sbjct: 374  EYAGSERGQGDTDLESNFSHPSSPGSEDFDNASIDSSTSKYSSLSKKTSLIQKFKKWG-K 432

Query: 1068 XXXXXXXXXXPARSFAGASPSRPSL 1142
                      PARSF+G SP R S+
Sbjct: 433  SKDDSSALSSPARSFSGGSPRRMSV 457


>ref|XP_006574884.1| PREDICTED: protein CHUP1, chloroplastic-like [Glycine max]
          Length = 977

 Score =  351 bits (900), Expect = 4e-94
 Identities = 206/385 (53%), Positives = 240/385 (62%), Gaps = 5/385 (1%)
 Frame = +3

Query: 3    GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182
            GEI+FP+P DK         EKD+VYE EMA+NA+                         
Sbjct: 88   GEIEFPIPPDK--------DEKDKVYEIEMAHNATELERLRQLVKELEEREVKLEGELLE 139

Query: 183  XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362
                  QES I ELQ+QLKIKTVEIDMLNITINSLQAERKKLQEEL+QG ++++ELE+AR
Sbjct: 140  YYGLKEQESDIVELQRQLKIKTVEIDMLNITINSLQAERKKLQEELTQGASAKRELEVAR 199

Query: 363  XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQSAXXXXXXXXXXXXXXXXXXXX- 539
                         AN                  KE+++A                     
Sbjct: 200  NKIKELQRQIQLEANQTKGQLLLLKQQVSTLLVKEEEAARKDAEVQKKLKAVNDLEVTVV 259

Query: 540  ---RKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710
               RKNKELQHEKREL+VKL++A+S+   LSN+TE+EMVAK +EEV  L+HANEDL+KQV
Sbjct: 260  ELKRKNKELQHEKRELMVKLNAAESRAAELSNMTESEMVAKAKEEVSNLRHANEDLLKQV 319

Query: 711  EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890
            EGLQMNRFSEVEELVYLRWVNACLR+ELRN QTP GKVSARDL+KSLSP+SQEKAKQLML
Sbjct: 320  EGLQMNRFSEVEELVYLRWVNACLRYELRNNQTPQGKVSARDLSKSLSPKSQEKAKQLML 379

Query: 891  EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXX 1067
            EYAGSERG GDTD+ESNF + +S  SEDFDN              KK  LIQK K+WG  
Sbjct: 380  EYAGSERGQGDTDLESNFSHPSSPGSEDFDNASIDSSTSKYSSLSKKTSLIQKFKKWG-K 438

Query: 1068 XXXXXXXXXXPARSFAGASPSRPSL 1142
                      PARSF+G SP R S+
Sbjct: 439  SKDDSSALSSPARSFSGGSPRRMSV 463


>ref|XP_006484398.1| PREDICTED: protein CHUP1, chloroplastic-like isoform X1 [Citrus
            sinensis] gi|568861823|ref|XP_006484399.1| PREDICTED:
            protein CHUP1, chloroplastic-like isoform X2 [Citrus
            sinensis]
          Length = 992

 Score =  350 bits (897), Expect = 8e-94
 Identities = 204/385 (52%), Positives = 241/385 (62%), Gaps = 5/385 (1%)
 Frame = +3

Query: 3    GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182
            GEI++ LP DKYD     +AEK++VYE+EMA+NA                          
Sbjct: 109  GEIEYQLPIDKYD-----EAEKNKVYETEMADNARELERLRSLVLELQEREVKLEGELLE 163

Query: 183  XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362
                  QES I ELQ+QLKIKTVEIDMLNITINSLQAERKKLQE+++Q    +KELE+AR
Sbjct: 164  YYGLKEQESDIVELQRQLKIKTVEIDMLNITINSLQAERKKLQEQIAQSSYVKKELEVAR 223

Query: 363  XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQS----AXXXXXXXXXXXXXXXXX 530
                         AN                  KE+++                      
Sbjct: 224  NKIKELQRQIQLDANQTKGQLLLLKQQVSGLQAKEEEAIKKDVELEKKLKSVKDLEVEVV 283

Query: 531  XXXRKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710
               RKNKELQ EKREL+VK D+A+SK+ +LSN+TE+E VAK REEV  L+HAN+DL+KQV
Sbjct: 284  ELKRKNKELQIEKRELLVKQDAAESKISSLSNMTESEKVAKAREEVNNLRHANDDLLKQV 343

Query: 711  EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890
            EGLQMNRFSEVEELVYLRWVNACLR+ELRNYQ P+GK SARDLNKSLSP+SQE+AKQLML
Sbjct: 344  EGLQMNRFSEVEELVYLRWVNACLRYELRNYQAPAGKTSARDLNKSLSPKSQERAKQLML 403

Query: 891  EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXX 1067
            EYAGSERG GDTD+ESNF + +S  SEDFDN              KKP LIQKLK+WG  
Sbjct: 404  EYAGSERGQGDTDLESNFSHPSSPGSEDFDNASIDSSTSKYSNLSKKPSLIQKLKKWG-K 462

Query: 1068 XXXXXXXXXXPARSFAGASPSRPSL 1142
                      PARS +G+SPSR S+
Sbjct: 463  SKDDLSALSSPARSISGSSPSRMSM 487


>ref|XP_006395634.1| hypothetical protein EUTSA_v10003588mg [Eutrema salsugineum]
            gi|557092273|gb|ESQ32920.1| hypothetical protein
            EUTSA_v10003588mg [Eutrema salsugineum]
          Length = 1000

 Score =  349 bits (895), Expect = 1e-93
 Identities = 205/385 (53%), Positives = 241/385 (62%), Gaps = 5/385 (1%)
 Frame = +3

Query: 3    GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182
            GEI++PLP+D  DN+   KAEK+R YE+EMA N S                         
Sbjct: 100  GEIEYPLPSD--DNSLE-KAEKEREYETEMAYNDSELERLRQLVKELEEREVKLEGELLE 156

Query: 183  XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362
                  QES I ELQ+QLKIKTVEIDMLNITINSLQAERKKLQEE++Q    RKELE+AR
Sbjct: 157  YYGLKEQESDIVELQRQLKIKTVEIDMLNITINSLQAERKKLQEEITQNGVVRKELEVAR 216

Query: 363  XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQS----AXXXXXXXXXXXXXXXXX 530
                         AN                  KE+++    +                 
Sbjct: 217  NKIKELQRQIQLDANQTKGQLLLLKQHVSSLQMKEEEAMNKDSEVDRKLKAVQGLEVEVM 276

Query: 531  XXXRKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710
               RKN+ELQHEKREL +KLDSA++++  LSN+TE++ VAKVREEV  LKH NEDL+KQV
Sbjct: 277  ELKRKNRELQHEKRELTIKLDSAEARISALSNMTESDKVAKVREEVNNLKHNNEDLLKQV 336

Query: 711  EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890
            EGLQMNRFSEVEELVYLRWVNACLR+ELRNYQTP+GK+SARDL+K+LSP+SQ KAK+LML
Sbjct: 337  EGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPAGKISARDLSKNLSPKSQAKAKRLML 396

Query: 891  EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXX 1067
            EYAGSERG GDTD+ESNF   +S  S+DFDN              KKPGLIQKLKRWG  
Sbjct: 397  EYAGSERGQGDTDVESNFSQPSSPGSDDFDNASMDSSTSRFSSFSKKPGLIQKLKRWG-K 455

Query: 1068 XXXXXXXXXXPARSFAGASPSRPSL 1142
                      P+RSF G SP R S+
Sbjct: 456  SKDDSSVQSSPSRSFYGGSPGRLSV 480


>ref|XP_006395633.1| hypothetical protein EUTSA_v10003588mg [Eutrema salsugineum]
            gi|557092272|gb|ESQ32919.1| hypothetical protein
            EUTSA_v10003588mg [Eutrema salsugineum]
          Length = 998

 Score =  349 bits (895), Expect = 1e-93
 Identities = 205/385 (53%), Positives = 241/385 (62%), Gaps = 5/385 (1%)
 Frame = +3

Query: 3    GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182
            GEI++PLP+D  DN+   KAEK+R YE+EMA N S                         
Sbjct: 98   GEIEYPLPSD--DNSLE-KAEKEREYETEMAYNDSELERLRQLVKELEEREVKLEGELLE 154

Query: 183  XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362
                  QES I ELQ+QLKIKTVEIDMLNITINSLQAERKKLQEE++Q    RKELE+AR
Sbjct: 155  YYGLKEQESDIVELQRQLKIKTVEIDMLNITINSLQAERKKLQEEITQNGVVRKELEVAR 214

Query: 363  XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQS----AXXXXXXXXXXXXXXXXX 530
                         AN                  KE+++    +                 
Sbjct: 215  NKIKELQRQIQLDANQTKGQLLLLKQHVSSLQMKEEEAMNKDSEVDRKLKAVQGLEVEVM 274

Query: 531  XXXRKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710
               RKN+ELQHEKREL +KLDSA++++  LSN+TE++ VAKVREEV  LKH NEDL+KQV
Sbjct: 275  ELKRKNRELQHEKRELTIKLDSAEARISALSNMTESDKVAKVREEVNNLKHNNEDLLKQV 334

Query: 711  EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890
            EGLQMNRFSEVEELVYLRWVNACLR+ELRNYQTP+GK+SARDL+K+LSP+SQ KAK+LML
Sbjct: 335  EGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPAGKISARDLSKNLSPKSQAKAKRLML 394

Query: 891  EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXX 1067
            EYAGSERG GDTD+ESNF   +S  S+DFDN              KKPGLIQKLKRWG  
Sbjct: 395  EYAGSERGQGDTDVESNFSQPSSPGSDDFDNASMDSSTSRFSSFSKKPGLIQKLKRWG-K 453

Query: 1068 XXXXXXXXXXPARSFAGASPSRPSL 1142
                      P+RSF G SP R S+
Sbjct: 454  SKDDSSVQSSPSRSFYGGSPGRLSV 478


>ref|XP_007046330.1| Hydroxyproline-rich glycoprotein family protein isoform 4 [Theobroma
            cacao] gi|508710265|gb|EOY02162.1| Hydroxyproline-rich
            glycoprotein family protein isoform 4 [Theobroma cacao]
          Length = 933

 Score =  349 bits (895), Expect = 1e-93
 Identities = 200/385 (51%), Positives = 240/385 (62%), Gaps = 5/385 (1%)
 Frame = +3

Query: 3    GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182
            GEI++PL  DK+      +AE++++YE+EMANNAS                         
Sbjct: 109  GEIEYPLSADKF-----ARAEREKIYETEMANNASELERLRNLVKELEEREVKLEGELLE 163

Query: 183  XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362
                  QES I EL++QLKIKTVEIDMLNITI+SLQ+ERKKLQE+++ G + +KELE+AR
Sbjct: 164  YYGLKEQESDIFELKRQLKIKTVEIDMLNITISSLQSERKKLQEDIAHGASVKKELEVAR 223

Query: 363  XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQS----AXXXXXXXXXXXXXXXXX 530
                         AN                  KEQ++    A                 
Sbjct: 224  NKIKELQRQIQLDANQTKAQLLFLKQQVSGLQAKEQEAIKNDAEVEKKLKAVKELEMEVM 283

Query: 531  XXXRKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710
               RKNKELQHEKREL VKLD+A++K+  LSN+TETE+  + REEV  L+HANEDL+KQV
Sbjct: 284  ELRRKNKELQHEKRELTVKLDAAEAKIAALSNMTETEIDVRAREEVSNLRHANEDLLKQV 343

Query: 711  EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890
            EGLQMNRFSEVEELVYLRWVNACLR+ELRNYQTP GK+SARDLNKSLSP+SQE AKQL+L
Sbjct: 344  EGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPEGKISARDLNKSLSPKSQETAKQLLL 403

Query: 891  EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXX 1067
            EYAGSERG GDTD+ESNF + +S  SED DN              KKP LIQKLK+WG  
Sbjct: 404  EYAGSERGQGDTDIESNFSHPSSTGSEDLDNASIYSSNSRYSSLSKKPSLIQKLKKWG-R 462

Query: 1068 XXXXXXXXXXPARSFAGASPSRPSL 1142
                      PARS +G SPSR S+
Sbjct: 463  SKDDSSAVSSPARSLSGGSPSRISM 487


>ref|XP_007046327.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao] gi|590701143|ref|XP_007046328.1|
            Hydroxyproline-rich glycoprotein family protein isoform 1
            [Theobroma cacao] gi|590701146|ref|XP_007046329.1|
            Hydroxyproline-rich glycoprotein family protein isoform 1
            [Theobroma cacao] gi|590701152|ref|XP_007046331.1|
            Hydroxyproline-rich glycoprotein family protein isoform 1
            [Theobroma cacao] gi|590701156|ref|XP_007046332.1|
            Hydroxyproline-rich glycoprotein family protein isoform 1
            [Theobroma cacao] gi|590701159|ref|XP_007046333.1|
            Hydroxyproline-rich glycoprotein family protein isoform 1
            [Theobroma cacao] gi|590701163|ref|XP_007046334.1|
            Hydroxyproline-rich glycoprotein family protein isoform 1
            [Theobroma cacao] gi|508710262|gb|EOY02159.1|
            Hydroxyproline-rich glycoprotein family protein isoform 1
            [Theobroma cacao] gi|508710263|gb|EOY02160.1|
            Hydroxyproline-rich glycoprotein family protein isoform 1
            [Theobroma cacao] gi|508710264|gb|EOY02161.1|
            Hydroxyproline-rich glycoprotein family protein isoform 1
            [Theobroma cacao] gi|508710266|gb|EOY02163.1|
            Hydroxyproline-rich glycoprotein family protein isoform 1
            [Theobroma cacao] gi|508710267|gb|EOY02164.1|
            Hydroxyproline-rich glycoprotein family protein isoform 1
            [Theobroma cacao] gi|508710268|gb|EOY02165.1|
            Hydroxyproline-rich glycoprotein family protein isoform 1
            [Theobroma cacao] gi|508710269|gb|EOY02166.1|
            Hydroxyproline-rich glycoprotein family protein isoform 1
            [Theobroma cacao]
          Length = 996

 Score =  349 bits (895), Expect = 1e-93
 Identities = 200/385 (51%), Positives = 240/385 (62%), Gaps = 5/385 (1%)
 Frame = +3

Query: 3    GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182
            GEI++PL  DK+      +AE++++YE+EMANNAS                         
Sbjct: 109  GEIEYPLSADKF-----ARAEREKIYETEMANNASELERLRNLVKELEEREVKLEGELLE 163

Query: 183  XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362
                  QES I EL++QLKIKTVEIDMLNITI+SLQ+ERKKLQE+++ G + +KELE+AR
Sbjct: 164  YYGLKEQESDIFELKRQLKIKTVEIDMLNITISSLQSERKKLQEDIAHGASVKKELEVAR 223

Query: 363  XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQS----AXXXXXXXXXXXXXXXXX 530
                         AN                  KEQ++    A                 
Sbjct: 224  NKIKELQRQIQLDANQTKAQLLFLKQQVSGLQAKEQEAIKNDAEVEKKLKAVKELEMEVM 283

Query: 531  XXXRKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710
               RKNKELQHEKREL VKLD+A++K+  LSN+TETE+  + REEV  L+HANEDL+KQV
Sbjct: 284  ELRRKNKELQHEKRELTVKLDAAEAKIAALSNMTETEIDVRAREEVSNLRHANEDLLKQV 343

Query: 711  EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890
            EGLQMNRFSEVEELVYLRWVNACLR+ELRNYQTP GK+SARDLNKSLSP+SQE AKQL+L
Sbjct: 344  EGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPEGKISARDLNKSLSPKSQETAKQLLL 403

Query: 891  EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXX 1067
            EYAGSERG GDTD+ESNF + +S  SED DN              KKP LIQKLK+WG  
Sbjct: 404  EYAGSERGQGDTDIESNFSHPSSTGSEDLDNASIYSSNSRYSSLSKKPSLIQKLKKWG-R 462

Query: 1068 XXXXXXXXXXPARSFAGASPSRPSL 1142
                      PARS +G SPSR S+
Sbjct: 463  SKDDSSAVSSPARSLSGGSPSRISM 487


>ref|XP_006290457.1| hypothetical protein CARUB_v10019508mg [Capsella rubella]
            gi|482559164|gb|EOA23355.1| hypothetical protein
            CARUB_v10019508mg [Capsella rubella]
          Length = 997

 Score =  348 bits (894), Expect = 2e-93
 Identities = 205/384 (53%), Positives = 238/384 (61%), Gaps = 5/384 (1%)
 Frame = +3

Query: 3    GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182
            GEI++PLP D  DN+   KAEK+R YE EMA N                           
Sbjct: 97   GEIEYPLPDD--DNSLE-KAEKERKYEVEMAYNDGELERLKQLVKELEEREVKLEGELLE 153

Query: 183  XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362
                  QES I ELQ+QLKIKTVEIDMLNITINSLQAERKKLQEE+SQ +  RKELE+AR
Sbjct: 154  YYGLKEQESDIVELQRQLKIKTVEIDMLNITINSLQAERKKLQEEISQNVIVRKELEVAR 213

Query: 363  XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQS----AXXXXXXXXXXXXXXXXX 530
                         AN                  KE+++                      
Sbjct: 214  NKIKELQRQIQLDANQTKGQLLLLKQHVSSLQMKEEEAMNKDTEVERKLKAVQDLEVEVM 273

Query: 531  XXXRKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710
               RKN+ELQHEKREL +KLDSA++++ TLSN+TE++ VAKVREEV  LKH NEDL+KQV
Sbjct: 274  ELKRKNRELQHEKRELSIKLDSAEARIATLSNMTESDKVAKVREEVNNLKHNNEDLLKQV 333

Query: 711  EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890
            EGLQMNRFSEVEELVYLRWVNACLR+ELRNYQTP+GK+SARDL+K+LSP+SQ KAK+LML
Sbjct: 334  EGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPAGKISARDLSKNLSPKSQAKAKRLML 393

Query: 891  EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXX 1067
            EYAGSERG GDTD+ESN+   +S  S+DFDN              KKPGLIQKLKRWG  
Sbjct: 394  EYAGSERGQGDTDLESNYSQPSSPGSDDFDNASMDSSTSRFSSFSKKPGLIQKLKRWG-K 452

Query: 1068 XXXXXXXXXXPARSFAGASPSRPS 1139
                      P+RSF G SP R S
Sbjct: 453  SKDDSSVQSSPSRSFYGGSPGRLS 476


>ref|XP_007153329.1| hypothetical protein PHAVU_003G026100g [Phaseolus vulgaris]
            gi|561026683|gb|ESW25323.1| hypothetical protein
            PHAVU_003G026100g [Phaseolus vulgaris]
          Length = 979

 Score =  348 bits (892), Expect = 3e-93
 Identities = 204/385 (52%), Positives = 237/385 (61%), Gaps = 5/385 (1%)
 Frame = +3

Query: 3    GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182
            GEI+FPLP D+         EKDRVYE EMANN S                         
Sbjct: 91   GEIEFPLPPDR--------DEKDRVYEIEMANNESELERLRLLVKELEEREVKLEGELLE 142

Query: 183  XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362
                  QES I ELQ+QLKIK VEIDMLNITINSLQAERKKLQEEL+QG ++++ELE+AR
Sbjct: 143  YYGLKEQESDIVELQRQLKIKAVEIDMLNITINSLQAERKKLQEELTQGASAKRELEVAR 202

Query: 363  XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQSAXXXXXXXXXXXXXXXXXXXX- 539
                         AN                  KE+++A                     
Sbjct: 203  NKIKELQRQMQLEANQTKGQLLLLKQQVLGLQVKEEEAATKDAQVEKKLKAVNDLEVAVV 262

Query: 540  ---RKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710
               R+NKELQHEKREL VKL++A+S+   LSN+TE++MVAK +EEV  L+HANEDL KQV
Sbjct: 263  ELKRRNKELQHEKRELTVKLNAAESRAAELSNMTESDMVAKAKEEVSNLRHANEDLQKQV 322

Query: 711  EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890
            EGLQ+NRFSEVEELVYLRWVNACLR+ELRNYQTP GKVSARDL+KSLSP+SQEKAKQLML
Sbjct: 323  EGLQINRFSEVEELVYLRWVNACLRYELRNYQTPQGKVSARDLSKSLSPKSQEKAKQLML 382

Query: 891  EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXX 1067
            EYAGSERG GDTD+ESNF + +S  S+DFDN              KK  LIQK K+WG  
Sbjct: 383  EYAGSERGQGDTDLESNFSHPSSPGSDDFDNASIDSYSSKYSTLSKKTSLIQKFKKWG-K 441

Query: 1068 XXXXXXXXXXPARSFAGASPSRPSL 1142
                      PARSF+G SP R S+
Sbjct: 442  SKDDSSALSSPARSFSGGSPRRMSV 466

