BLASTX nr result
ID: Mentha26_contig00006291
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00006291 (1150 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

Sequences producing significant alignments:   Score (bits)   E Value

gb|EYU40853.1| hypothetical protein MIMGU_mgv1a000693mg [Mimulus...   417   e-114
ref|XP_002281154.2| PREDICTED: protein CHUP1, chloroplastic-like...   378   e-102
emb|CAN78725.1| hypothetical protein VITISV_020008 [Vitis vinifera]   378   e-102
gb|EPS62321.1| hypothetical protein M569_12467, partial [Genlise...   375   e-101
ref|XP_002315963.1| hypothetical protein POPTR_0010s14080g [Popu...   375   e-101
ref|XP_002524394.1| conserved hypothetical protein [Ricinus comm...   358   2e-96
ref|XP_006362524.1| PREDICTED: protein CHUP1, chloroplastic-like...   357   4e-96
ref|XP_004238973.1| PREDICTED: uncharacterized protein LOC101267...   357   5e-96
gb|EXB53975.1| hypothetical protein L484_022943 [Morus notabilis]   355   3e-95
ref|XP_004159306.1| PREDICTED: protein CHUP1, chloroplastic-like...   355   3e-95
ref|XP_004135119.1| PREDICTED: protein CHUP1, chloroplastic-like...   355   3e-95
ref|XP_006573276.1| PREDICTED: protein CHUP1, chloroplastic-like...   354   3e-95
ref|XP_006574884.1| PREDICTED: protein CHUP1, chloroplastic-like...   351   4e-94
ref|XP_006484398.1| PREDICTED: protein CHUP1, chloroplastic-like...   350   8e-94
ref|XP_006395634.1| hypothetical protein EUTSA_v10003588mg [Eutr...   349   1e-93
ref|XP_006395633.1| hypothetical protein EUTSA_v10003588mg [Eutr...   349   1e-93
ref|XP_007046330.1| Hydroxyproline-rich glycoprotein family prot...   349   1e-93
ref|XP_007046327.1| Hydroxyproline-rich glycoprotein family prot...   349   1e-93
ref|XP_006290457.1| hypothetical protein CARUB_v10019508mg [Caps...
348 2e-93 ref|XP_007153329.1| hypothetical protein PHAVU_003G026100g [Phas... 348 3e-93 >gb|EYU40853.1| hypothetical protein MIMGU_mgv1a000693mg [Mimulus guttatus] Length = 1016 Score = 417 bits (1073), Expect = e-114 Identities = 232/385 (60%), Positives = 263/385 (68%), Gaps = 5/385 (1%) Frame = +3 Query: 3 GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182 GEIDFP+PTDKYD +AN K+EKD++YE+EMA NA+ Sbjct: 135 GEIDFPIPTDKYDTSANSKSEKDKLYENEMAINATELERLRNLVRELEEREVKLEGELLE 194 Query: 183 XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362 QESSI+ELQKQLKIKTVEIDMLNITI+SLQAERKKLQEE+S G+A+RKELE+A+ Sbjct: 195 YYGLKEQESSISELQKQLKIKTVEIDMLNITISSLQAERKKLQEEVSHGVAARKELEIAK 254 Query: 363 XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQS----AXXXXXXXXXXXXXXXXX 530 AN +KEQ++ A Sbjct: 255 KKMKDLQKQIQLEANQTKGQLLLLKQTVSGLQSKEQEAVTKDADVEKKLKAVKELEVEVM 314 Query: 531 XXXRKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710 RKNKEL +EKRELVVKLD+A++ V+ LSN+TETEMVAKVREEV E++HANEDLVKQV Sbjct: 315 ELKRKNKELHYEKRELVVKLDAAEANVKALSNMTETEMVAKVREEVNEMRHANEDLVKQV 374 Query: 711 EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890 EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGK+SARDLNKSLSPRSQE+AKQLML Sbjct: 375 EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKISARDLNKSLSPRSQERAKQLML 434 Query: 891 EYAGSER-GGGDTDMESNFDNTSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXX 1067 E+AGSER GGGDTDMESNFDNTSV+SEDFDN+ KKP LIQKLKRWG Sbjct: 435 EFAGSERGGGGDTDMESNFDNTSVDSEDFDNVSIDSSTSRFSTLSKKPSLIQKLKRWGGK 494 Query: 1068 XXXXXXXXXXPARSFAGASPSRPSL 1142 PARSFAG SPSR S+ Sbjct: 495 SRDDSSAFSSPARSFAGGSPSRSSV 519 >ref|XP_002281154.2| PREDICTED: protein CHUP1, chloroplastic-like [Vitis vinifera] Length = 1003 Score = 378 bits (971), Expect = e-102 Identities = 213/385 (55%), Positives = 248/385 (64%), Gaps = 5/385 (1%) Frame = +3 Query: 3 GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182 GEID PLP+DK+D K EKDRVYE+EMANNA+ Sbjct: 109 GEIDIPLPSDKFDTETAAKVEKDRVYETEMANNANELERLRNLVKELEEREVKLEGELLE 168 Query: 183 
XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362 QE+ IAELQ+QLKIKTVEIDMLNITI+SLQAERKKLQ+E++ G+++RKELE+AR Sbjct: 169 YYGLKEQETDIAELQRQLKIKTVEIDMLNITISSLQAERKKLQDEVALGVSARKELEVAR 228 Query: 363 XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQS----AXXXXXXXXXXXXXXXXX 530 AN TKEQ++ A Sbjct: 229 NKIKELQRQIQVEANQTKGHLLLLKQQVSGLQTKEQEAIKKDAEIEKKLKAAKELEVEVV 288 Query: 531 XXXRKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710 R+NKELQHEKREL+VKLD A+++V LSN+TE+EMVAK RE+V L+HANEDL+KQV Sbjct: 289 ELKRRNKELQHEKRELLVKLDGAEARVAALSNMTESEMVAKAREDVNNLRHANEDLLKQV 348 Query: 711 EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890 EGLQMNRFSEVEELVYLRWVNACLR+ELRNYQTP GK+SARDL+KSLSPRSQE+AKQLML Sbjct: 349 EGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPGGKISARDLSKSLSPRSQERAKQLML 408 Query: 891 EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXX 1067 EYAGSERG GDTD+ESNF + +S SEDFDN KKP LIQKLK+WG Sbjct: 409 EYAGSERGQGDTDLESNFSHPSSPGSEDFDNASIDSSTSRYSSLSKKPSLIQKLKKWG-K 467 Query: 1068 XXXXXXXXXXPARSFAGASPSRPSL 1142 PARSF G SP R S+ Sbjct: 468 SRDDSSVLSSPARSFGGGSPGRTSI 492 >emb|CAN78725.1| hypothetical protein VITISV_020008 [Vitis vinifera] Length = 955 Score = 378 bits (971), Expect = e-102 Identities = 213/385 (55%), Positives = 248/385 (64%), Gaps = 5/385 (1%) Frame = +3 Query: 3 GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182 GEID PLP+DK+D K EKDRVYE+EMANNA+ Sbjct: 133 GEIDIPLPSDKFDTETAAKVEKDRVYETEMANNANELERLRNLVKELEEREVKLEGELLE 192 Query: 183 XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362 QE+ IAELQ+QLKIKTVEIDMLNITI+SLQAERKKLQ+E++ G+++RKELE+AR Sbjct: 193 YYGLKEQETDIAELQRQLKIKTVEIDMLNITISSLQAERKKLQDEVALGVSARKELEVAR 252 Query: 363 XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQS----AXXXXXXXXXXXXXXXXX 530 AN TKEQ++ A Sbjct: 253 NKIKELQRQIQVEANQTKGHLLLLKQQVSGLQTKEQEAIKKDAEIEKKLKAAKELEVEVV 312 Query: 531 XXXRKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710 R+NKELQHEKREL+VKLD A+++V LSN+TE+EMVAK RE+V L+HANEDL+KQV Sbjct: 313 
ELKRRNKELQHEKRELLVKLDGAEARVAALSNMTESEMVAKAREDVNNLRHANEDLLKQV 372 Query: 711 EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890 EGLQMNRFSEVEELVYLRWVNACLR+ELRNYQTP GK+SARDL+KSLSPRSQE+AKQLML Sbjct: 373 EGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPGGKISARDLSKSLSPRSQERAKQLML 432 Query: 891 EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXX 1067 EYAGSERG GDTD+ESNF + +S SEDFDN KKP LIQKLK+WG Sbjct: 433 EYAGSERGQGDTDLESNFSHPSSPGSEDFDNASIDSSTSRYSSLSKKPSLIQKLKKWG-K 491 Query: 1068 XXXXXXXXXXPARSFAGASPSRPSL 1142 PARSF G SP R S+ Sbjct: 492 SRDDSSVLSSPARSFGGGSPGRTSI 516 >gb|EPS62321.1| hypothetical protein M569_12467, partial [Genlisea aurea] Length = 950 Score = 375 bits (962), Expect = e-101 Identities = 212/386 (54%), Positives = 251/386 (65%), Gaps = 4/386 (1%) Frame = +3 Query: 3 GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182 GEIDFPLPTDKY++A+ A D+VYE EMANNAS Sbjct: 90 GEIDFPLPTDKYESAS-ASAADDKVYEYEMANNASELERLRNLVKELEEREVKLEGELLE 148 Query: 183 XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362 QES+++ELQKQL IKT+EIDML ITINSLQAERKKLQEE+SQG++ + EL++AR Sbjct: 149 YYGLKEQESNVSELQKQLHIKTLEIDMLQITINSLQAERKKLQEEVSQGVSVKNELDLAR 208 Query: 363 XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQS----AXXXXXXXXXXXXXXXXX 530 AN KEQ++ Sbjct: 209 KKINELQKQIQLDANQTKGQLLLLKQQVSTLQAKEQETIRKDGEFEKKFKALKELEVEVM 268 Query: 531 XXXRKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710 RKN+ELQHEKREL+VKLD+A+S V+ LSN+TETEMVA +R EV EL+H N+DLVKQV Sbjct: 269 ELKRKNRELQHEKRELMVKLDAAESNVKLLSNMTETEMVASIRGEVNELRHKNDDLVKQV 328 Query: 711 EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890 EGLQMNRFSEVEE+VYLRWVNACLRFELRN+QTPSG++SARDL+KSLSP+SQE+AKQL+L Sbjct: 329 EGLQMNRFSEVEEMVYLRWVNACLRFELRNHQTPSGRISARDLSKSLSPKSQERAKQLLL 388 Query: 891 EYAGSERGGGDTDMESNFDNTSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXXX 1070 EYAGSER GGDTD+ESNFDNTSV+SEDFD++ KKPGLIQKLKRWG Sbjct: 389 EYAGSER-GGDTDIESNFDNTSVDSEDFDSV-SVDSSSVTKFSNKKPGLIQKLKRWGGKG 446 Query: 1071 XXXXXXXXXPARSFAGASPSRPSLKP 
1148 PARS SP R +L+P Sbjct: 447 HEDSSAMSSPARSSYAGSPGRVNLRP 472 >ref|XP_002315963.1| hypothetical protein POPTR_0010s14080g [Populus trichocarpa] gi|222865003|gb|EEF02134.1| hypothetical protein POPTR_0010s14080g [Populus trichocarpa] Length = 955 Score = 375 bits (962), Expect = e-101 Identities = 215/385 (55%), Positives = 251/385 (65%), Gaps = 5/385 (1%) Frame = +3 Query: 3 GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182 GEID+PLP +K+D +AEKD++YE+EMANNAS Sbjct: 98 GEIDYPLPGEKFD-----QAEKDKIYETEMANNASELECLRNLVRELEEREVKLEGELLE 152 Query: 183 XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362 QES + ELQ+QLKIKTVEIDMLNITINSLQAERKKLQEE+S G +S+KELE+AR Sbjct: 153 YYGLKEQESDVVELQRQLKIKTVEIDMLNITINSLQAERKKLQEEISHGASSKKELELAR 212 Query: 363 XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQS----AXXXXXXXXXXXXXXXXX 530 AN KEQ++ A Sbjct: 213 NKIKEFQRQIQLDANQTKGQLLLLKQQVSGLQAKEQEAVKKDAEVEKRLKAVKELEVEVV 272 Query: 531 XXXRKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710 RKNKELQHEKREL++KL +A++K+ +LSN++ETEMVAKVREEV LKHANEDL+KQV Sbjct: 273 ELKRKNKELQHEKRELIIKLGAAEAKLTSLSNLSETEMVAKVREEVNNLKHANEDLLKQV 332 Query: 711 EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890 EGLQMNRFSEVEELVYLRWVNACLR+ELRNYQTPSGKVSARDLNKSLSP+SQE+AKQL+L Sbjct: 333 EGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPSGKVSARDLNKSLSPKSQERAKQLLL 392 Query: 891 EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXX 1067 EYAGSERG GDTDMESN+ + +S SEDFDN KKP LIQKLK+WG Sbjct: 393 EYAGSERGQGDTDMESNYSHPSSPGSEDFDN-TSIDSSSSRYSFSKKPNLIQKLKKWG-R 450 Query: 1068 XXXXXXXXXXPARSFAGASPSRPSL 1142 P+RSF+G SPSR S+ Sbjct: 451 SKDDSSAFSSPSRSFSGVSPSRSSM 475 >ref|XP_002524394.1| conserved hypothetical protein [Ricinus communis] gi|223536355|gb|EEF38005.1| conserved hypothetical protein [Ricinus communis] Length = 998 Score = 358 bits (919), Expect = 2e-96 Identities = 204/385 (52%), Positives = 244/385 (63%), Gaps = 5/385 (1%) Frame = +3 Query: 3 GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182 
GEID+PLP D+ D KAEKD+VYE+EMANNAS Sbjct: 109 GEIDYPLPGDRVD-----KAEKDKVYENEMANNASELERLRNLVRELEEREVKLEGELLE 163 Query: 183 XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362 QES +AE+ +QLKIKTVEIDMLNITINSLQAERKKLQEE++QG +++KELE AR Sbjct: 164 YYGLKEQESDVAEIHRQLKIKTVEIDMLNITINSLQAERKKLQEEVAQGASAKKELEAAR 223 Query: 363 XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQS----AXXXXXXXXXXXXXXXXX 530 AN KE+++ A Sbjct: 224 TKIKELQRQIQLDANQTKGQLLLLKQQVSGLQAKEEEAIKKDAELERKLKAVKDLEVEVV 283 Query: 531 XXXRKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710 RKNKELQHEKREL +KLD+A +K+ +LSN+TE+EMVAK R++V L+HANEDL+KQV Sbjct: 284 ELRRKNKELQHEKRELTIKLDAAQAKIVSLSNMTESEMVAKARDDVNNLRHANEDLLKQV 343 Query: 711 EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890 EGLQMNRFSEVEELVYLRWVNACLR+ELRNYQ P G+VSARDL+K+LSP+SQEKAK LML Sbjct: 344 EGLQMNRFSEVEELVYLRWVNACLRYELRNYQAPPGRVSARDLSKNLSPKSQEKAKHLML 403 Query: 891 EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXX 1067 EYAGSERG GDTD++SNF + +S SEDFDN KKP LIQK+K+WG Sbjct: 404 EYAGSERGQGDTDLDSNFSHPSSPGSEDFDNTSIDSSTSRYSSLSKKPSLIQKIKKWG-K 462 Query: 1068 XXXXXXXXXXPARSFAGASPSRPSL 1142 P+RSF+ SPSR S+ Sbjct: 463 SKDDSSALSSPSRSFSADSPSRTSM 487 >ref|XP_006362524.1| PREDICTED: protein CHUP1, chloroplastic-like [Solanum tuberosum] Length = 991 Score = 357 bits (917), Expect = 4e-96 Identities = 206/386 (53%), Positives = 241/386 (62%), Gaps = 6/386 (1%) Frame = +3 Query: 3 GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182 GEI+FPLP+DKYD + E++RVY++EMA NA+ Sbjct: 101 GEIEFPLPSDKYDTG---REERERVYQTEMAYNANELERLRNLVKELEEREVKLEGELLE 157 Query: 183 XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362 QES I ELQKQLKIK+VEIDMLNITIN+LQAE++KLQEE+ G +RK+LE AR Sbjct: 158 YYGLKEQESDILELQKQLKIKSVEIDMLNITINTLQAEKQKLQEEVFHGTTARKDLEAAR 217 Query: 363 XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQS----AXXXXXXXXXXXXXXXXX 530 AN KE+++ + Sbjct: 218 SKIKELQRQMQLEANQTKAQLLLLKQHVTGLQEKEEEAFKRDSDVDKKLKLVKELEVEVM 277 Query: 531 
XXXRKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710 RKNKELQHEKRELV+KLD+A+SK+ LSN+TE EMVA+VREEV LKH N+DL+KQV Sbjct: 278 ELKRKNKELQHEKRELVIKLDTAESKIAKLSNMTENEMVAQVREEVTNLKHTNDDLLKQV 337 Query: 711 EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890 EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTP GKVSARDL+K+LSP+SQ+KAKQLML Sbjct: 338 EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPQGKVSARDLSKNLSPKSQQKAKQLML 397 Query: 891 EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWG-X 1064 EYAGSERG GDTD+ESNF +S SEDFDN KKP LIQKLK+WG Sbjct: 398 EYAGSERGQGDTDLESNFSQPSSPGSEDFDNASIDSSTSRFSSFSKKPNLIQKLKKWGSR 457 Query: 1065 XXXXXXXXXXXPARSFAGASPSRPSL 1142 PARS GASP R S+ Sbjct: 458 GGRDDSSVMSSPARSLGGASPGRMSM 483 >ref|XP_004238973.1| PREDICTED: uncharacterized protein LOC101267989 [Solanum lycopersicum] Length = 1174 Score = 357 bits (916), Expect = 5e-96 Identities = 206/386 (53%), Positives = 239/386 (61%), Gaps = 6/386 (1%) Frame = +3 Query: 3 GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182 GEI+FPLP+DKYD + E++RVY++EMA NA+ Sbjct: 284 GEIEFPLPSDKYDTG---REERERVYQTEMAYNANELERLRNLVKELEEREVKLEGELLE 340 Query: 183 XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362 QES + ELQKQLKIK VEIDMLNITIN+LQAE++KLQEE+ G +RK+LE AR Sbjct: 341 YYGLKEQESDVLELQKQLKIKAVEIDMLNITINTLQAEKQKLQEEVFHGTTARKDLEAAR 400 Query: 363 XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQS----AXXXXXXXXXXXXXXXXX 530 AN KE+++ + Sbjct: 401 SKIKELQRQMQLEANQTKAQLLLLKQHVTELQEKEEEAFKRDSEVDKKLKLVKELEVEVM 460 Query: 531 XXXRKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710 RKNKELQHEKRELV+KLD+A+SK+ LSN+TE EMVA+VREEV LKH N+DL+KQV Sbjct: 461 ELKRKNKELQHEKRELVIKLDAAESKIAKLSNMTENEMVAQVREEVTNLKHTNDDLLKQV 520 Query: 711 EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890 EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTP GKVSARDL+KSLSP+SQ KAKQLML Sbjct: 521 EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPQGKVSARDLSKSLSPKSQHKAKQLML 580 Query: 891 EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWG-X 1064 EYAGSERG 
GDTD+ESNF +S SEDFDN KKP LIQKLK+WG Sbjct: 581 EYAGSERGQGDTDLESNFSQPSSPGSEDFDNASIDSSTSRFSTFSKKPNLIQKLKKWGSR 640 Query: 1065 XXXXXXXXXXXPARSFAGASPSRPSL 1142 PARS GASP R S+ Sbjct: 641 GGKDDSSIMSSPARSLGGASPGRMSM 666 >gb|EXB53975.1| hypothetical protein L484_022943 [Morus notabilis] Length = 1617 Score = 355 bits (910), Expect = 3e-95 Identities = 206/385 (53%), Positives = 242/385 (62%), Gaps = 5/385 (1%) Frame = +3 Query: 3 GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182 GEI+FPLP+ K D K++KD+VYE+EMANNAS Sbjct: 729 GEIEFPLPSSKSD-----KSQKDKVYETEMANNASELERLRKLVKELEEREVKLEGELLE 783 Query: 183 XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362 QES I ELQ+QLKIK+VE++MLNITINSLQAERKKLQ+E++QG ++RKELE AR Sbjct: 784 YYGLKEQESDIDELQRQLKIKSVEVNMLNITINSLQAERKKLQDEIAQGASARKELEAAR 843 Query: 363 XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQS----AXXXXXXXXXXXXXXXXX 530 AN KE+++ A Sbjct: 844 NKIKELQRQIQLDANQTKGQLLLLKQQVSGLQAKEEEAVKKDAELEKKLKAVKELEVEVV 903 Query: 531 XXXRKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710 RKNKELQHEKREL+VKLD+A ++V LS++TE+E VA REEV L+HANEDL+KQV Sbjct: 904 ELKRKNKELQHEKRELIVKLDAAQARVTALSSMTESEKVANAREEVNNLRHANEDLLKQV 963 Query: 711 EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890 EGLQMNRFSEVEELVYLRWVNACLR+ELRNYQ P GK+SARDLNKSLSPRSQEKAKQLML Sbjct: 964 EGLQMNRFSEVEELVYLRWVNACLRYELRNYQAPPGKMSARDLNKSLSPRSQEKAKQLML 1023 Query: 891 EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXX 1067 EYAGSERG GDTD+ESNF + +S SEDFDN KK LIQKLK+WG Sbjct: 1024 EYAGSERGQGDTDIESNFSHPSSPGSEDFDNASIDSFTSRVSSLGKKTSLIQKLKKWG-R 1082 Query: 1068 XXXXXXXXXXPARSFAGASPSRPSL 1142 P+RS +G SPSR S+ Sbjct: 1083 SKDDSSALLSPSRSLSGGSPSRMSM 1107 >ref|XP_004159306.1| PREDICTED: protein CHUP1, chloroplastic-like [Cucumis sativus] Length = 987 Score = 355 bits (910), Expect = 3e-95 Identities = 204/380 (53%), Positives = 241/380 (63%), Gaps = 5/380 (1%) Frame = +3 Query: 3 GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182 GEI+FPLP + D++ 
KAEKDRVYE+EMANNAS Sbjct: 95 GEIEFPLP--EIDDS---KAEKDRVYETEMANNASELERLRNLVKELEEREVKLEGELLE 149 Query: 183 XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362 QES I ELQ+QLKIK VEIDMLNITI+SLQAERKKLQEE++Q A +KELE AR Sbjct: 150 YYGLKEQESDITELQRQLKIKAVEIDMLNITISSLQAERKKLQEEIAQDAAVKKELEFAR 209 Query: 363 XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQS----AXXXXXXXXXXXXXXXXX 530 AN +KEQ++ A Sbjct: 210 NKIKELQRQIQLDANQTKGQLLLLKQQVSGLQSKEQETIKKDAELEKKLKAVKELEVEVM 269 Query: 531 XXXRKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710 RKNKELQ EKREL +KLD+A++K+ TLSN+TE+E+VA+ RE+V L+HANEDL+KQV Sbjct: 270 ELKRKNKELQIEKRELTIKLDAAENKISTLSNMTESELVAQTREQVSNLRHANEDLIKQV 329 Query: 711 EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890 EGLQMNRFSEVEELVYLRWVNACLR+ELRNYQ P+GK+SARDL+K+LSP+SQEKAKQLM+ Sbjct: 330 EGLQMNRFSEVEELVYLRWVNACLRYELRNYQAPTGKISARDLSKNLSPKSQEKAKQLMV 389 Query: 891 EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXX 1067 EYAGSERG GDTD+ESN+ +S SEDFDN KKP LIQKLK+WG Sbjct: 390 EYAGSERGQGDTDLESNYSQPSSPGSEDFDNASIDSSFSRYSSLSKKPSLIQKLKKWGGR 449 Query: 1068 XXXXXXXXXXPARSFAGASP 1127 PARSF+G SP Sbjct: 450 SKDDSSALSSPARSFSGGSP 469 >ref|XP_004135119.1| PREDICTED: protein CHUP1, chloroplastic-like [Cucumis sativus] Length = 987 Score = 355 bits (910), Expect = 3e-95 Identities = 204/380 (53%), Positives = 241/380 (63%), Gaps = 5/380 (1%) Frame = +3 Query: 3 GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182 GEI+FPLP + D++ KAEKDRVYE+EMANNAS Sbjct: 95 GEIEFPLP--EIDDS---KAEKDRVYETEMANNASELERLRNLVKELEEREVKLEGELLE 149 Query: 183 XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362 QES I ELQ+QLKIK VEIDMLNITI+SLQAERKKLQEE++Q A +KELE AR Sbjct: 150 YYGLKEQESDITELQRQLKIKAVEIDMLNITISSLQAERKKLQEEIAQDAAVKKELEFAR 209 Query: 363 XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQS----AXXXXXXXXXXXXXXXXX 530 AN +KEQ++ A Sbjct: 210 NKIKELQRQIQLDANQTKGQLLLLKQQVSGLQSKEQETIKKDAELEKKLKAVKELEVEVM 269 Query: 531 
XXXRKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710 RKNKELQ EKREL +KLD+A++K+ TLSN+TE+E+VA+ RE+V L+HANEDL+KQV Sbjct: 270 ELKRKNKELQIEKRELTIKLDAAENKISTLSNMTESELVAQTREQVSNLRHANEDLIKQV 329 Query: 711 EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890 EGLQMNRFSEVEELVYLRWVNACLR+ELRNYQ P+GK+SARDL+K+LSP+SQEKAKQLM+ Sbjct: 330 EGLQMNRFSEVEELVYLRWVNACLRYELRNYQAPTGKISARDLSKNLSPKSQEKAKQLMV 389 Query: 891 EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXX 1067 EYAGSERG GDTD+ESN+ +S SEDFDN KKP LIQKLK+WG Sbjct: 390 EYAGSERGQGDTDLESNYSQPSSPGSEDFDNASIDSSFSRYSSLSKKPSLIQKLKKWGGR 449 Query: 1068 XXXXXXXXXXPARSFAGASP 1127 PARSF+G SP Sbjct: 450 SKDDSSALSSPARSFSGGSP 469 >ref|XP_006573276.1| PREDICTED: protein CHUP1, chloroplastic-like [Glycine max] Length = 968 Score = 354 bits (909), Expect = 3e-95 Identities = 210/385 (54%), Positives = 238/385 (61%), Gaps = 5/385 (1%) Frame = +3 Query: 3 GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182 GEI+FPLP DK EKD+VYE EMANNAS Sbjct: 82 GEIEFPLPPDK--------DEKDKVYEIEMANNASELERLRQLVKELEEREVKLEGELLE 133 Query: 183 XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362 QES I ELQ+QLKIKTVEIDMLNITINSLQAERKKLQEEL+QG +++KELE+AR Sbjct: 134 YYGLKEQESDIVELQRQLKIKTVEIDMLNITINSLQAERKKLQEELTQGASAKKELEVAR 193 Query: 363 XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQSAXXXXXXXXXXXXXXXXXXXX- 539 AN KE+++A Sbjct: 194 NKIKELQRQIQLEANQTKGQLLLLKQQVSTLLVKEEEAARKDAEVEKKLKAVNDLEVAVV 253 Query: 540 ---RKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710 RKNKELQHEKREL VKL+ A+S+ LSN+TE+EMVAK +EEV L+HANEDL+KQV Sbjct: 254 ELKRKNKELQHEKRELTVKLNVAESRAAELSNMTESEMVAKAKEEVSNLRHANEDLLKQV 313 Query: 711 EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890 EGLQMNRFSEVEELVYLRWVNACLR+ELRN QTP GKVSARDL+KSLSP+SQEKAKQLML Sbjct: 314 EGLQMNRFSEVEELVYLRWVNACLRYELRNNQTPQGKVSARDLSKSLSPKSQEKAKQLML 373 Query: 891 EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXX 1067 EYAGSERG GDTD+ESNF + +S SEDFDN KK LIQK K+WG 
Sbjct: 374 EYAGSERGQGDTDLESNFSHPSSPGSEDFDNASIDSSTSKYSSLSKKTSLIQKFKKWG-K 432 Query: 1068 XXXXXXXXXXPARSFAGASPSRPSL 1142 PARSF+G SP R S+ Sbjct: 433 SKDDSSALSSPARSFSGGSPRRMSV 457 >ref|XP_006574884.1| PREDICTED: protein CHUP1, chloroplastic-like [Glycine max] Length = 977 Score = 351 bits (900), Expect = 4e-94 Identities = 206/385 (53%), Positives = 240/385 (62%), Gaps = 5/385 (1%) Frame = +3 Query: 3 GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182 GEI+FP+P DK EKD+VYE EMA+NA+ Sbjct: 88 GEIEFPIPPDK--------DEKDKVYEIEMAHNATELERLRQLVKELEEREVKLEGELLE 139 Query: 183 XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362 QES I ELQ+QLKIKTVEIDMLNITINSLQAERKKLQEEL+QG ++++ELE+AR Sbjct: 140 YYGLKEQESDIVELQRQLKIKTVEIDMLNITINSLQAERKKLQEELTQGASAKRELEVAR 199 Query: 363 XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQSAXXXXXXXXXXXXXXXXXXXX- 539 AN KE+++A Sbjct: 200 NKIKELQRQIQLEANQTKGQLLLLKQQVSTLLVKEEEAARKDAEVQKKLKAVNDLEVTVV 259 Query: 540 ---RKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710 RKNKELQHEKREL+VKL++A+S+ LSN+TE+EMVAK +EEV L+HANEDL+KQV Sbjct: 260 ELKRKNKELQHEKRELMVKLNAAESRAAELSNMTESEMVAKAKEEVSNLRHANEDLLKQV 319 Query: 711 EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890 EGLQMNRFSEVEELVYLRWVNACLR+ELRN QTP GKVSARDL+KSLSP+SQEKAKQLML Sbjct: 320 EGLQMNRFSEVEELVYLRWVNACLRYELRNNQTPQGKVSARDLSKSLSPKSQEKAKQLML 379 Query: 891 EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXX 1067 EYAGSERG GDTD+ESNF + +S SEDFDN KK LIQK K+WG Sbjct: 380 EYAGSERGQGDTDLESNFSHPSSPGSEDFDNASIDSSTSKYSSLSKKTSLIQKFKKWG-K 438 Query: 1068 XXXXXXXXXXPARSFAGASPSRPSL 1142 PARSF+G SP R S+ Sbjct: 439 SKDDSSALSSPARSFSGGSPRRMSV 463 >ref|XP_006484398.1| PREDICTED: protein CHUP1, chloroplastic-like isoform X1 [Citrus sinensis] gi|568861823|ref|XP_006484399.1| PREDICTED: protein CHUP1, chloroplastic-like isoform X2 [Citrus sinensis] Length = 992 Score = 350 bits (897), Expect = 8e-94 Identities = 204/385 (52%), Positives = 241/385 (62%), Gaps = 5/385 (1%) Frame = +3 Query: 
3 GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182 GEI++ LP DKYD +AEK++VYE+EMA+NA Sbjct: 109 GEIEYQLPIDKYD-----EAEKNKVYETEMADNARELERLRSLVLELQEREVKLEGELLE 163 Query: 183 XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362 QES I ELQ+QLKIKTVEIDMLNITINSLQAERKKLQE+++Q +KELE+AR Sbjct: 164 YYGLKEQESDIVELQRQLKIKTVEIDMLNITINSLQAERKKLQEQIAQSSYVKKELEVAR 223 Query: 363 XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQS----AXXXXXXXXXXXXXXXXX 530 AN KE+++ Sbjct: 224 NKIKELQRQIQLDANQTKGQLLLLKQQVSGLQAKEEEAIKKDVELEKKLKSVKDLEVEVV 283 Query: 531 XXXRKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710 RKNKELQ EKREL+VK D+A+SK+ +LSN+TE+E VAK REEV L+HAN+DL+KQV Sbjct: 284 ELKRKNKELQIEKRELLVKQDAAESKISSLSNMTESEKVAKAREEVNNLRHANDDLLKQV 343 Query: 711 EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890 EGLQMNRFSEVEELVYLRWVNACLR+ELRNYQ P+GK SARDLNKSLSP+SQE+AKQLML Sbjct: 344 EGLQMNRFSEVEELVYLRWVNACLRYELRNYQAPAGKTSARDLNKSLSPKSQERAKQLML 403 Query: 891 EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXX 1067 EYAGSERG GDTD+ESNF + +S SEDFDN KKP LIQKLK+WG Sbjct: 404 EYAGSERGQGDTDLESNFSHPSSPGSEDFDNASIDSSTSKYSNLSKKPSLIQKLKKWG-K 462 Query: 1068 XXXXXXXXXXPARSFAGASPSRPSL 1142 PARS +G+SPSR S+ Sbjct: 463 SKDDLSALSSPARSISGSSPSRMSM 487 >ref|XP_006395634.1| hypothetical protein EUTSA_v10003588mg [Eutrema salsugineum] gi|557092273|gb|ESQ32920.1| hypothetical protein EUTSA_v10003588mg [Eutrema salsugineum] Length = 1000 Score = 349 bits (895), Expect = 1e-93 Identities = 205/385 (53%), Positives = 241/385 (62%), Gaps = 5/385 (1%) Frame = +3 Query: 3 GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182 GEI++PLP+D DN+ KAEK+R YE+EMA N S Sbjct: 100 GEIEYPLPSD--DNSLE-KAEKEREYETEMAYNDSELERLRQLVKELEEREVKLEGELLE 156 Query: 183 XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362 QES I ELQ+QLKIKTVEIDMLNITINSLQAERKKLQEE++Q RKELE+AR Sbjct: 157 YYGLKEQESDIVELQRQLKIKTVEIDMLNITINSLQAERKKLQEEITQNGVVRKELEVAR 216 Query: 363 
XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQS----AXXXXXXXXXXXXXXXXX 530 AN KE+++ + Sbjct: 217 NKIKELQRQIQLDANQTKGQLLLLKQHVSSLQMKEEEAMNKDSEVDRKLKAVQGLEVEVM 276 Query: 531 XXXRKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710 RKN+ELQHEKREL +KLDSA++++ LSN+TE++ VAKVREEV LKH NEDL+KQV Sbjct: 277 ELKRKNRELQHEKRELTIKLDSAEARISALSNMTESDKVAKVREEVNNLKHNNEDLLKQV 336 Query: 711 EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890 EGLQMNRFSEVEELVYLRWVNACLR+ELRNYQTP+GK+SARDL+K+LSP+SQ KAK+LML Sbjct: 337 EGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPAGKISARDLSKNLSPKSQAKAKRLML 396 Query: 891 EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXX 1067 EYAGSERG GDTD+ESNF +S S+DFDN KKPGLIQKLKRWG Sbjct: 397 EYAGSERGQGDTDVESNFSQPSSPGSDDFDNASMDSSTSRFSSFSKKPGLIQKLKRWG-K 455 Query: 1068 XXXXXXXXXXPARSFAGASPSRPSL 1142 P+RSF G SP R S+ Sbjct: 456 SKDDSSVQSSPSRSFYGGSPGRLSV 480 >ref|XP_006395633.1| hypothetical protein EUTSA_v10003588mg [Eutrema salsugineum] gi|557092272|gb|ESQ32919.1| hypothetical protein EUTSA_v10003588mg [Eutrema salsugineum] Length = 998 Score = 349 bits (895), Expect = 1e-93 Identities = 205/385 (53%), Positives = 241/385 (62%), Gaps = 5/385 (1%) Frame = +3 Query: 3 GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182 GEI++PLP+D DN+ KAEK+R YE+EMA N S Sbjct: 98 GEIEYPLPSD--DNSLE-KAEKEREYETEMAYNDSELERLRQLVKELEEREVKLEGELLE 154 Query: 183 XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362 QES I ELQ+QLKIKTVEIDMLNITINSLQAERKKLQEE++Q RKELE+AR Sbjct: 155 YYGLKEQESDIVELQRQLKIKTVEIDMLNITINSLQAERKKLQEEITQNGVVRKELEVAR 214 Query: 363 XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQS----AXXXXXXXXXXXXXXXXX 530 AN KE+++ + Sbjct: 215 NKIKELQRQIQLDANQTKGQLLLLKQHVSSLQMKEEEAMNKDSEVDRKLKAVQGLEVEVM 274 Query: 531 XXXRKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710 RKN+ELQHEKREL +KLDSA++++ LSN+TE++ VAKVREEV LKH NEDL+KQV Sbjct: 275 ELKRKNRELQHEKRELTIKLDSAEARISALSNMTESDKVAKVREEVNNLKHNNEDLLKQV 334 Query: 711 
EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890 EGLQMNRFSEVEELVYLRWVNACLR+ELRNYQTP+GK+SARDL+K+LSP+SQ KAK+LML Sbjct: 335 EGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPAGKISARDLSKNLSPKSQAKAKRLML 394 Query: 891 EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXX 1067 EYAGSERG GDTD+ESNF +S S+DFDN KKPGLIQKLKRWG Sbjct: 395 EYAGSERGQGDTDVESNFSQPSSPGSDDFDNASMDSSTSRFSSFSKKPGLIQKLKRWG-K 453 Query: 1068 XXXXXXXXXXPARSFAGASPSRPSL 1142 P+RSF G SP R S+ Sbjct: 454 SKDDSSVQSSPSRSFYGGSPGRLSV 478 >ref|XP_007046330.1| Hydroxyproline-rich glycoprotein family protein isoform 4 [Theobroma cacao] gi|508710265|gb|EOY02162.1| Hydroxyproline-rich glycoprotein family protein isoform 4 [Theobroma cacao] Length = 933 Score = 349 bits (895), Expect = 1e-93 Identities = 200/385 (51%), Positives = 240/385 (62%), Gaps = 5/385 (1%) Frame = +3 Query: 3 GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182 GEI++PL DK+ +AE++++YE+EMANNAS Sbjct: 109 GEIEYPLSADKF-----ARAEREKIYETEMANNASELERLRNLVKELEEREVKLEGELLE 163 Query: 183 XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362 QES I EL++QLKIKTVEIDMLNITI+SLQ+ERKKLQE+++ G + +KELE+AR Sbjct: 164 YYGLKEQESDIFELKRQLKIKTVEIDMLNITISSLQSERKKLQEDIAHGASVKKELEVAR 223 Query: 363 XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQS----AXXXXXXXXXXXXXXXXX 530 AN KEQ++ A Sbjct: 224 NKIKELQRQIQLDANQTKAQLLFLKQQVSGLQAKEQEAIKNDAEVEKKLKAVKELEMEVM 283 Query: 531 XXXRKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710 RKNKELQHEKREL VKLD+A++K+ LSN+TETE+ + REEV L+HANEDL+KQV Sbjct: 284 ELRRKNKELQHEKRELTVKLDAAEAKIAALSNMTETEIDVRAREEVSNLRHANEDLLKQV 343 Query: 711 EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890 EGLQMNRFSEVEELVYLRWVNACLR+ELRNYQTP GK+SARDLNKSLSP+SQE AKQL+L Sbjct: 344 EGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPEGKISARDLNKSLSPKSQETAKQLLL 403 Query: 891 EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXX 1067 EYAGSERG GDTD+ESNF + +S SED DN KKP LIQKLK+WG Sbjct: 404 EYAGSERGQGDTDIESNFSHPSSTGSEDLDNASIYSSNSRYSSLSKKPSLIQKLKKWG-R 
462 Query: 1068 XXXXXXXXXXPARSFAGASPSRPSL 1142 PARS +G SPSR S+ Sbjct: 463 SKDDSSAVSSPARSLSGGSPSRISM 487 >ref|XP_007046327.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] gi|590701143|ref|XP_007046328.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] gi|590701146|ref|XP_007046329.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] gi|590701152|ref|XP_007046331.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] gi|590701156|ref|XP_007046332.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] gi|590701159|ref|XP_007046333.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] gi|590701163|ref|XP_007046334.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] gi|508710262|gb|EOY02159.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] gi|508710263|gb|EOY02160.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] gi|508710264|gb|EOY02161.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] gi|508710266|gb|EOY02163.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] gi|508710267|gb|EOY02164.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] gi|508710268|gb|EOY02165.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] gi|508710269|gb|EOY02166.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] Length = 996 Score = 349 bits (895), Expect = 1e-93 Identities = 200/385 (51%), Positives = 240/385 (62%), Gaps = 5/385 (1%) Frame = +3 Query: 3 GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182 GEI++PL DK+ +AE++++YE+EMANNAS Sbjct: 109 GEIEYPLSADKF-----ARAEREKIYETEMANNASELERLRNLVKELEEREVKLEGELLE 163 Query: 183 XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362 QES I 
EL++QLKIKTVEIDMLNITI+SLQ+ERKKLQE+++ G + +KELE+AR Sbjct: 164 YYGLKEQESDIFELKRQLKIKTVEIDMLNITISSLQSERKKLQEDIAHGASVKKELEVAR 223 Query: 363 XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQS----AXXXXXXXXXXXXXXXXX 530 AN KEQ++ A Sbjct: 224 NKIKELQRQIQLDANQTKAQLLFLKQQVSGLQAKEQEAIKNDAEVEKKLKAVKELEMEVM 283 Query: 531 XXXRKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710 RKNKELQHEKREL VKLD+A++K+ LSN+TETE+ + REEV L+HANEDL+KQV Sbjct: 284 ELRRKNKELQHEKRELTVKLDAAEAKIAALSNMTETEIDVRAREEVSNLRHANEDLLKQV 343 Query: 711 EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890 EGLQMNRFSEVEELVYLRWVNACLR+ELRNYQTP GK+SARDLNKSLSP+SQE AKQL+L Sbjct: 344 EGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPEGKISARDLNKSLSPKSQETAKQLLL 403 Query: 891 EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXX 1067 EYAGSERG GDTD+ESNF + +S SED DN KKP LIQKLK+WG Sbjct: 404 EYAGSERGQGDTDIESNFSHPSSTGSEDLDNASIYSSNSRYSSLSKKPSLIQKLKKWG-R 462 Query: 1068 XXXXXXXXXXPARSFAGASPSRPSL 1142 PARS +G SPSR S+ Sbjct: 463 SKDDSSAVSSPARSLSGGSPSRISM 487 >ref|XP_006290457.1| hypothetical protein CARUB_v10019508mg [Capsella rubella] gi|482559164|gb|EOA23355.1| hypothetical protein CARUB_v10019508mg [Capsella rubella] Length = 997 Score = 348 bits (894), Expect = 2e-93 Identities = 205/384 (53%), Positives = 238/384 (61%), Gaps = 5/384 (1%) Frame = +3 Query: 3 GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182 GEI++PLP D DN+ KAEK+R YE EMA N Sbjct: 97 GEIEYPLPDD--DNSLE-KAEKERKYEVEMAYNDGELERLKQLVKELEEREVKLEGELLE 153 Query: 183 XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362 QES I ELQ+QLKIKTVEIDMLNITINSLQAERKKLQEE+SQ + RKELE+AR Sbjct: 154 YYGLKEQESDIVELQRQLKIKTVEIDMLNITINSLQAERKKLQEEISQNVIVRKELEVAR 213 Query: 363 XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQS----AXXXXXXXXXXXXXXXXX 530 AN KE+++ Sbjct: 214 NKIKELQRQIQLDANQTKGQLLLLKQHVSSLQMKEEEAMNKDTEVERKLKAVQDLEVEVM 273 Query: 531 XXXRKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710 RKN+ELQHEKREL +KLDSA++++ TLSN+TE++ VAKVREEV LKH NEDL+KQV Sbjct: 274 
ELKRKNRELQHEKRELSIKLDSAEARIATLSNMTESDKVAKVREEVNNLKHNNEDLLKQV 333 Query: 711 EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890 EGLQMNRFSEVEELVYLRWVNACLR+ELRNYQTP+GK+SARDL+K+LSP+SQ KAK+LML Sbjct: 334 EGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPAGKISARDLSKNLSPKSQAKAKRLML 393 Query: 891 EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXX 1067 EYAGSERG GDTD+ESN+ +S S+DFDN KKPGLIQKLKRWG Sbjct: 394 EYAGSERGQGDTDLESNYSQPSSPGSDDFDNASMDSSTSRFSSFSKKPGLIQKLKRWG-K 452 Query: 1068 XXXXXXXXXXPARSFAGASPSRPS 1139 P+RSF G SP R S Sbjct: 453 SKDDSSVQSSPSRSFYGGSPGRLS 476 >ref|XP_007153329.1| hypothetical protein PHAVU_003G026100g [Phaseolus vulgaris] gi|561026683|gb|ESW25323.1| hypothetical protein PHAVU_003G026100g [Phaseolus vulgaris] Length = 979 Score = 348 bits (892), Expect = 3e-93 Identities = 204/385 (52%), Positives = 237/385 (61%), Gaps = 5/385 (1%) Frame = +3 Query: 3 GEIDFPLPTDKYDNAANVKAEKDRVYESEMANNASXXXXXXXXXXXXXXXXXXXXXXXXX 182 GEI+FPLP D+ EKDRVYE EMANN S Sbjct: 91 GEIEFPLPPDR--------DEKDRVYEIEMANNESELERLRLLVKELEEREVKLEGELLE 142 Query: 183 XXXXXXQESSIAELQKQLKIKTVEIDMLNITINSLQAERKKLQEELSQGIASRKELEMAR 362 QES I ELQ+QLKIK VEIDMLNITINSLQAERKKLQEEL+QG ++++ELE+AR Sbjct: 143 YYGLKEQESDIVELQRQLKIKAVEIDMLNITINSLQAERKKLQEELTQGASAKRELEVAR 202 Query: 363 XXXXXXXXXXXXXANXXXXXXXXXXXXXXXXXTKEQQSAXXXXXXXXXXXXXXXXXXXX- 539 AN KE+++A Sbjct: 203 NKIKELQRQMQLEANQTKGQLLLLKQQVLGLQVKEEEAATKDAQVEKKLKAVNDLEVAVV 262 Query: 540 ---RKNKELQHEKRELVVKLDSADSKVRTLSNITETEMVAKVREEVYELKHANEDLVKQV 710 R+NKELQHEKREL VKL++A+S+ LSN+TE++MVAK +EEV L+HANEDL KQV Sbjct: 263 ELKRRNKELQHEKRELTVKLNAAESRAAELSNMTESDMVAKAKEEVSNLRHANEDLQKQV 322 Query: 711 EGLQMNRFSEVEELVYLRWVNACLRFELRNYQTPSGKVSARDLNKSLSPRSQEKAKQLML 890 EGLQ+NRFSEVEELVYLRWVNACLR+ELRNYQTP GKVSARDL+KSLSP+SQEKAKQLML Sbjct: 323 EGLQINRFSEVEELVYLRWVNACLRYELRNYQTPQGKVSARDLSKSLSPKSQEKAKQLML 382 Query: 891 EYAGSERGGGDTDMESNFDN-TSVESEDFDNMXXXXXXXXXXXXXKKPGLIQKLKRWGXX 1067 EYAGSERG GDTD+ESNF + +S S+DFDN KK LIQK K+WG Sbjct: 383 
EYAGSERGQGDTDLESNFSHPSSPGSDDFDNASIDSYSSKYSTLSKKTSLIQKFKKWG-K 441 Query: 1068 XXXXXXXXXXPARSFAGASPSRPSL 1142 PARSF+G SP R S+ Sbjct: 442 SKDDSSALSSPARSFSGGSPRRMSV 466
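
Each alignment in this report is introduced by a statistics line (Score, Expect, Identities, Positives, Gaps, Frame). The sketch below shows one way such a line could be parsed. It is a minimal illustration, not part of BLAST: the names STATS_RE, parse_evalue and parse_stats are hypothetical, and the regular expression assumes the legacy text layout seen above. Two details come from the report itself: E-values such as "e-114" omit the leading 1, and the printed percentages appear to be truncated rather than rounded (212/386 is shown as 54%).

```python
import re

# Hypothetical pattern for the statistics line of a legacy BLAST text report.
# \s* between fields lets it match both the flattened and the multi-line layout.
STATS_RE = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s*"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s*\(\d+%\),\s*"
    r"Gaps\s*=\s*(?P<gaps>\d+)/\d+\s*\(\d+%\)\s*"
    r"Frame\s*=\s*(?P<frame>[+-]\d)"
)


def parse_evalue(text):
    """Convert an E-value string to float; 'e-114' is read as 1e-114."""
    return float("1" + text if text.startswith("e") else text)


def parse_stats(line):
    """Extract the alignment statistics from one statistics line."""
    m = STATS_RE.search(line)
    if m is None:
        raise ValueError("no BLAST statistics found")
    length = int(m["length"])
    identities = int(m["ident"])
    positives = int(m["pos"])
    return {
        "bits": float(m["bits"]),
        "raw_score": int(m["raw"]),
        "evalue": parse_evalue(m["evalue"]),
        "identities": identities,
        "positives": positives,
        "gaps": int(m["gaps"]),
        "align_length": length,
        "frame": int(m["frame"]),
        # Integer division reproduces the truncated percentages in the report.
        "pct_identity": identities * 100 // length,
        "pct_positive": positives * 100 // length,
    }


# Statistics line of the top hit (gb|EYU40853.1|), copied from the report.
top_hit = parse_stats(
    "Score = 417 bits (1073), Expect = e-114 "
    "Identities = 232/385 (60%), Positives = 263/385 (68%), "
    "Gaps = 5/385 (1%) Frame = +3"
)
print(top_hit["pct_identity"], top_hit["pct_positive"], top_hit["evalue"])
```

Recomputing the percentages from the raw counts, instead of reading the printed values, keeps the fractional precision available if a stricter identity cutoff is needed later.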