BLASTX nr result
ID: Mentha26_contig00012172
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Mentha26_contig00012172 (974 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EYU23414.1| hypothetical protein MIMGU_mgv1a020543mg [Mimulus... 164 7e-38 ref|XP_006343917.1| PREDICTED: uncharacterized protein LOC102578... 121 5e-25 ref|XP_002272642.1| PREDICTED: uncharacterized protein LOC100241... 119 1e-24 ref|XP_007040371.1| Late embryogenesis abundant hydroxyproline-r... 108 3e-21 ref|XP_006477513.1| PREDICTED: uncharacterized protein LOC102620... 104 6e-20 ref|XP_006439452.1| hypothetical protein CICLE_v10023929mg [Citr... 103 8e-20 ref|XP_007210661.1| hypothetical protein PRUPE_ppa022176mg [Prun... 103 1e-19 ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-r... 100 1e-18 ref|XP_002509872.1| conserved hypothetical protein [Ricinus comm... 97 1e-17 ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-r... 96 2e-17 ref|XP_006573794.1| PREDICTED: uncharacterized protein LOC102667... 96 2e-17 gb|EXC34335.1| hypothetical protein L484_006690 [Morus notabilis] 96 3e-17 ref|XP_007022217.1| Late embryogenesis abundant hydroxyproline-r... 96 3e-17 ref|XP_002887922.1| hypothetical protein ARALYDRAFT_474948 [Arab... 95 4e-17 ref|XP_006368771.1| hypothetical protein POPTR_0001s09980g, part... 94 1e-16 ref|XP_004300835.1| PREDICTED: uncharacterized protein LOC101297... 94 1e-16 ref|NP_974086.1| late embryogenesis abundant hydroxyproline-rich... 91 7e-16 ref|XP_006300370.1| hypothetical protein CARUB_v10022353mg [Caps... 91 9e-16 ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-r... 90 1e-15 ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-r... 90 1e-15 >gb|EYU23414.1| hypothetical protein MIMGU_mgv1a020543mg [Mimulus guttatus] Length = 198 Score = 164 bits (414), Expect = 7e-38 Identities = 99/211 (46%), Positives = 125/211 (59%), Gaps = 2/211 (0%) Frame = -3 Query: 762 MDEETY--LNPEAKSDQKIVTNKALHPTPEQEEHRSSKTLVYILLAAVALSIVFLIFGLV 589 M+EE++ +NP KSD++ T + + SSK LVYIL+A V S+ FL+FGLV Sbjct: 1 MEEESHRIINPYIKSDEEEFTTTTKN-NRRGKGGGSSKCLVYILVAVVLQSVAFLVFGLV 59 Query: 588 VLRINAPSLRLSNVVVKDLRYSNSSFNATFIADIRLHNMNFGRFDFRGGSTALYYGNATX 409 LRI+ PSLRLS+ V LR+ ++S N T +A IRL N NFG F+F GGS +L YG AT Sbjct: 60 ALRISNPSLRLSSAAVAVLRHDSASLNMTVVAGIRLRNPNFGDFEFNGGSASLLYGEATV 119 Query: 408 XXXXXXXXXXXXXXXRNIDAAVEVMGGPSSAANYLNISRDIESNLVKLVGVAELRGEIRV 229 + I+ +EV+GG LVKL +AELRGE+RV Sbjct: 120 GVASIYGGRVGRRDKKEINVTMEVIGGGGG------------GELVKLRSMAELRGEVRV 167 Query: 228 MKIVNRWRTAMMNCTMDLNLTGQAIQGLSCQ 136 +KIV R R A MNCTMDLNLT QA Q LSCQ Sbjct: 168 VKIVKRRRIAFMNCTMDLNLTSQAFQDLSCQ 198 >ref|XP_006343917.1| PREDICTED: uncharacterized protein LOC102578735 [Solanum tuberosum] Length = 223 Score = 121 bits (303), Expect = 5e-25 Identities = 71/205 (34%), Positives = 114/205 (55%), Gaps = 8/205 (3%) Frame = -3 Query: 729 KSDQKIVTNKALH---PTPEQEEHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLR 559 KSDQ + +K+++ + +S K VY L V LSI+ LIF +V R +PS Sbjct: 18 KSDQGLNLSKSINYYNNHNNNKNRKSGKCFVYFLSTIVILSIIMLIFSMVFFRFKSPSFE 77 Query: 558 LSNVVVKDLRYSNS----SFNATFIADIRLHNMNFGRFDFRGGSTALY-YGNATXXXXXX 394 L ++ V++LR+SNS SFN +I + N NFG+ +++ S +++ Y N T Sbjct: 78 LDHINVQNLRFSNSTNSSSFNMNMGGEIIVDNDNFGQINYQDSSMSVFLYDNVTIGIANV 137 Query: 393 XXXXXXXXXXRNIDAAVEVMGGPSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVN 214 + I ++++ +Y N+S DI S ++KL E RG+++ MKI++ Sbjct: 138 NVGRVEARKSKRIGISLQLRTNYQLNYSYGNLSSDINSRMLKLTSFGEFRGKVKAMKIIS 197 Query: 213 RWRTAMMNCTMDLNLTGQAIQGLSC 139 + +T++MNCTM+LNLT QAIQ L C Sbjct: 198 KHKTSIMNCTMNLNLTSQAIQDLLC 222 >ref|XP_002272642.1| PREDICTED: uncharacterized protein LOC100241699 [Vitis vinifera] Length = 213 Score = 119 bits (299), Expect = 1e-24 Identities = 76/218 (34%), Positives = 109/218 (50%), Gaps = 9/218 (4%) Frame = -3 Query: 762 MDEETYLNPEA------KSDQKIVTNKALHPTPEQEEHRSSKTLVYILLAAVALSIVFLI 601 M E+ P A KSD++ K P + RSSK VY+L V L+ + L+ Sbjct: 1 MPEDNQFQPLAPARLHGKSDEEFGVFK---PRASKPPRRSSKCPVYVLAGLVTLAAIALV 57 Query: 600 FGLVVLRINAPSLRLSNVVVKDLRYSNS---SFNATFIADIRLHNMNFGRFDFRGGSTAL 430 F L VLR+ AP + L +V VK+L + S SFN T A++ + N NFG F+F G+ + Sbjct: 58 FALAVLRVEAPDVELKSVAVKNLTHGTSPSPSFNVTLTAEVSVQNKNFGAFNFENGTATV 117 Query: 429 YYGNATXXXXXXXXXXXXXXXXRNIDAAVEVMGGPSSAANYLNISRDIESNLVKLVGVAE 250 Y + ++ ++V N N+S DI S V L A+ Sbjct: 118 LYEGMVVGDEEFSKAHVESRKTKRMNVTLDVRS--DRLWNDKNLSSDISSGSVNLTTYAQ 175 Query: 249 LRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGLSCQ 136 + G++RVMK+V R TA MNC+M LNLT +IQ L C+ Sbjct: 176 VTGKVRVMKVVRRRTTARMNCSMTLNLTSSSIQDLVCR 213 >ref|XP_007040371.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508777616|gb|EOY24872.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 213 Score = 108 bits (270), Expect = 3e-21 Identities = 62/190 (32%), Positives = 97/190 (51%), Gaps = 3/190 (1%) Frame = -3 Query: 696 LHPTPEQEEHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNS 517 + PT Q + +SSK LVY+L+ V V LIF +VLR P + + +V V++L+Y NS Sbjct: 26 IKPTASQRKEKSSKCLVYVLVGMVIQGAVLLIFASIVLRARTPDVEIVSVTVRNLKYGNS 85 Query: 516 ---SFNATFIADIRLHNMNFGRFDFRGGSTALYYGNATXXXXXXXXXXXXXXXXRNIDAA 346 SFN T + ++ + N NFG F F + ++ G+ ++ + Sbjct: 86 SAPSFNLTLVTEVTVENSNFGDFKFENTTGTVWCGSVVVGKMKIPTGRAQARATERLNVS 145 Query: 345 VEVMGGPSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLT 166 V+V P + N+S +I S L++L +L G++ +M + R R MNC M LNLT Sbjct: 146 VDVSSLP--LPDTKNVSCNISSGLLELNSHVKLSGKVSIMNFMKRRRHPEMNCFMTLNLT 203 Query: 165 GQAIQGLSCQ 136 GQ Q C+ Sbjct: 204 GQTKQDFPCE 213 >ref|XP_006477513.1| PREDICTED: uncharacterized protein LOC102620163 [Citrus sinensis] Length = 214 Score = 104 bits (259), Expect = 6e-20 Identities = 59/204 (28%), Positives = 106/204 (51%), Gaps = 3/204 (1%) Frame = -3 Query: 741 NPEAKSDQKIVTNKALHPTPEQEEHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSL 562 N +SDQ+ P + + +SSK LVY+L+ V +S LI + LR N P + Sbjct: 15 NEYPRSDQEYA------PAVIESQRKSSKCLVYVLVTIVTVSAALLISASIFLRPNTPEV 68 Query: 561 RLSNVVVKDLRYSNS---SFNATFIADIRLHNMNFGRFDFRGGSTALYYGNATXXXXXXX 391 +L +V VK+L + N SFN T + ++ + N N+G F+++ S +++YG+ T Sbjct: 69 QLESVTVKNLSHGNGTSPSFNVTLVTELTIDNENYGYFEYKNCSGSVFYGSVTVGDVKIR 128 Query: 390 XXXXXXXXXRNIDAAVEVMGGPSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNR 211 + I+ V+V + + N+ DI S +VKL A+L G + + ++ + Sbjct: 129 DGRVEAREVKRINVTVDVDVRSNGNLDNQNLRSDINSGIVKLNSYAKLHGNVSLFNVLKK 188 Query: 210 WRTAMMNCTMDLNLTGQAIQGLSC 139 +T ++C+M+L L +A++ L C Sbjct: 189 TKTPELDCSMNLVLARRAVEDLVC 212 >ref|XP_006439452.1| hypothetical protein CICLE_v10023929mg [Citrus clementina] gi|557541714|gb|ESR52692.1| hypothetical protein CICLE_v10023929mg [Citrus clementina] Length = 214 Score = 103 bits (258), Expect = 8e-20 Identities = 59/204 (28%), Positives = 106/204 (51%), Gaps = 3/204 (1%) Frame = -3 Query: 741 NPEAKSDQKIVTNKALHPTPEQEEHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSL 562 N +SDQ+ P + + +SSK LVY+L+ V +S LI + LR N P + Sbjct: 15 NEYPRSDQEYA------PAVIESQRKSSKCLVYVLVTIVTVSAALLISASIFLRPNTPEV 68 Query: 561 RLSNVVVKDLRYSNS---SFNATFIADIRLHNMNFGRFDFRGGSTALYYGNATXXXXXXX 391 +L +V VK+L + N SFN T + ++ + N N+G F+++ S +++YG+ T Sbjct: 69 QLESVTVKNLSHGNGTSPSFNVTLVTELTIDNENYGYFEYKNCSGSVFYGSVTVGDVKIR 128 Query: 390 XXXXXXXXXRNIDAAVEVMGGPSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNR 211 + I+ V+V + + N+S D S +VKL A+L G + + ++ + Sbjct: 129 DGRVEAREVKRINVTVDVDVRSNGNLDNQNLSSDRNSGIVKLNSYAKLHGNVNLFNVLKK 188 Query: 210 WRTAMMNCTMDLNLTGQAIQGLSC 139 +T ++C+M+L L +A++ L C Sbjct: 189 TKTPELDCSMNLVLARRAVEDLVC 212 >ref|XP_007210661.1| hypothetical protein PRUPE_ppa022176mg [Prunus persica] gi|462406396|gb|EMJ11860.1| hypothetical protein PRUPE_ppa022176mg [Prunus persica] Length = 213 Score = 103 bits (256), Expect = 1e-19 Identities = 63/203 (31%), Positives = 104/203 (51%), Gaps = 5/203 (2%) Frame = -3 Query: 729 KSDQKIVTNKALHPTPEQEEHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSN 550 +SD++ T +A+ RS+K VY+ A V SI L+F LVVLR+ +P LS+ Sbjct: 19 RSDEENPTFRAIR------RERSNKCFVYVFAAIVLQSIFILVFALVVLRVKSPGFNLSS 72 Query: 549 VVVKDLRYS---NSSFNATFIADIRLHNMNFGRFDFRGGSTALYYGNATXXXXXXXXXXX 379 V VK L+++ SS NAT + ++ + N NFG + F G S +L+YG Sbjct: 73 VSVKSLKHTTSPTSSLNATLVTELAIKNKNFGEYKFEGSSASLWYGGFKVGEAKIGKGRV 132 Query: 378 XXXXXRNIDAAVEVMGG--PSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWR 205 R + +++V P A N ++ S +K+ A+L G++ +MKI+ + + Sbjct: 133 KARGTRRVSLSIDVRSNRLPQEAKN--GFEGEMNSGYLKISSYAKLTGKVNLMKIMKKRK 190 Query: 204 TAMMNCTMDLNLTGQAIQGLSCQ 136 T NCTM + L + ++ L C+ Sbjct: 191 TIDTNCTMVVVLKSRTVKDLFCR 213 >ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508777614|gb|EOY24870.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 188 Score = 100 bits (248), Expect = 1e-18 Identities = 58/180 (32%), Positives = 91/180 (50%), Gaps = 3/180 (1%) Frame = -3 Query: 666 RSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNSS---FNATFI 496 ++ K YI+ V +I+ L+F L V+RI PS RL +V V+ L Y+ S FN I Sbjct: 11 QNMKCYAYIIAGVVFQTIIILVFALTVMRIKTPSARLRSVTVQSLNYNASGVPHFNMRLI 70 Query: 495 ADIRLHNMNFGRFDFRGGSTALYYGNATXXXXXXXXXXXXXXXXRNIDAAVEVMGGPSSA 316 +I + N NFG F F + + +G+ + ++ V+V S Sbjct: 71 MEIAVKNKNFGHFRFDNTTANVTFGSVMVGDGEIVKSRARARKTKRMNVTVDVSSSAVSD 130 Query: 315 ANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGLSCQ 136 + L + S + L GVA LRG++ +MK++ + +TA MNCTM +NL A+Q L C+ Sbjct: 131 EDELRTK--LSSGTLTLTGVARLRGKVTLMKLMKKRKTAEMNCTMTVNLNSHAVQDLDCE 188 >ref|XP_002509872.1| conserved hypothetical protein [Ricinus communis] gi|223549771|gb|EEF51259.1| conserved hypothetical protein [Ricinus communis] Length = 221 Score = 97.1 bits (240), Expect = 1e-17 Identities = 61/189 (32%), Positives = 94/189 (49%), Gaps = 11/189 (5%) Frame = -3 Query: 672 EHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNSS------- 514 + RSSK LVY+L V LS V L+F LVVLR P+ LS V +KDL Y+ S Sbjct: 33 QERSSKCLVYVLAGIVILSAVILVFALVVLRPVNPNAELSFVRLKDLNYAAGSGGNGNNV 92 Query: 513 ----FNATFIADIRLHNMNFGRFDFRGGSTALYYGNATXXXXXXXXXXXXXXXXRNIDAA 346 FN T +++++ N NFG F + S ++YG ++ Sbjct: 93 SLPAFNMTLESELKIENSNFGEFKYDNTSARVFYGGMAVGEAILREGRVSARDTLRMNVK 152 Query: 345 VEVMGGPSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLT 166 VEV N +++ DI S ++KL A+ G + +++I + R+A M+C+ L+L Sbjct: 153 VEVR-SHKYIYNGTDLTSDINSGILKLNSHAKFSGRVNLLQIAKKRRSASMDCSFSLDLR 211 Query: 165 GQAIQGLSC 139 ++IQ L C Sbjct: 212 SRSIQDLVC 220 >ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508776113|gb|EOY23369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 201 Score = 96.3 bits (238), Expect = 2e-17 Identities = 55/194 (28%), Positives = 95/194 (48%), Gaps = 4/194 (2%) Frame = -3 Query: 705 NKALHPTPEQEEHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRY 526 N + E + + K Y V +IV L+F L V+RI P R+ ++ V+D+ Y Sbjct: 10 NIDMESAAELKRKKRMKLFAYAAAFVVFQTIVILVFSLTVMRIKNPKFRVRSITVEDIAY 69 Query: 525 SNS----SFNATFIADIRLHNMNFGRFDFRGGSTALYYGNATXXXXXXXXXXXXXXXXRN 358 +++ SFN F A++ + N NFG F F + + YG + Sbjct: 70 TSTPNPPSFNMKFNAEVAVKNTNFGHFKFDNTTISFDYGGVQVGEAFVAKGRAKARSTKK 129 Query: 357 IDAAVEVMGGPSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMD 178 ++ V++ A + N++ DI S + L +L G++ +MK++ + ++A MNCTM Sbjct: 130 MNVTVDLNSNNIPANS--NLASDISSGFLTLTTHTKLSGKVHLMKLIKKKKSAQMNCTMT 187 Query: 177 LNLTGQAIQGLSCQ 136 +NL +AIQ + CQ Sbjct: 188 VNLASRAIQDIKCQ 201 >ref|XP_006573794.1| PREDICTED: uncharacterized protein LOC102667904 [Glycine max] Length = 184 Score = 95.9 bits (237), Expect = 2e-17 Identities = 59/182 (32%), Positives = 88/182 (48%), Gaps = 1/182 (0%) Frame = -3 Query: 678 QEEHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNS-SFNAT 502 Q+E RS K VY+L A V L + L+F + LR+ P L+L + + YS S SFNAT Sbjct: 6 QQERRSGKCFVYLLAAFVILCALVLVFASL-LRVKNPYLKLRSATSNKISYSTSPSFNAT 64 Query: 501 FIADIRLHNMNFGRFDFRGGSTALYYGNATXXXXXXXXXXXXXXXXRNIDAAVEVMGGPS 322 I + + N NFG F + ++ Y + I+ V++M + Sbjct: 65 LIIFLGIKNPNFGAFSYNNNRVSVLYAGVKIADRQINGGRVRFRETKEINVTVKLMSAKA 124 Query: 321 SAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGLS 142 + N+S DI S + L + G + ++KI+N +T M C M LNLT IQG+ Sbjct: 125 PISE--NLSIDISSGSLNLTSNVKFSGTVHMLKIINIRKTIEMACAMKLNLTSHTIQGIQ 182 Query: 141 CQ 136 CQ Sbjct: 183 CQ 184 >gb|EXC34335.1| hypothetical protein L484_006690 [Morus notabilis] Length = 213 Score = 95.5 bits (236), Expect = 3e-17 Identities = 63/201 (31%), Positives = 97/201 (48%), Gaps = 4/201 (1%) Frame = -3 Query: 729 KSDQKIVTNKALHPTPEQEEHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSN 550 +SD++ KAL + R++K VYI V L + LIF L+VLR +P ++L + Sbjct: 19 RSDEENPAFKALR------KERTNKCFVYIFAGIVILGAILLIFALIVLRSKSPEIKLKS 72 Query: 549 VVVKDLRYSNS---SFNATFIADIRLHNMNFGRFDFRGGSTALY-YGNATXXXXXXXXXX 382 V VK L YS S S NAT IA + + N NFG + F ++A++ YG Sbjct: 73 VTVKSLDYSTSPWPSLNATLIATVAIKNPNFGPYRFGSNNSAVFLYGGGKLGEQRIRQGK 132 Query: 381 XXXXXXRNIDAAVEVMGGPSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRT 202 + ++ VE+ + N+ D+ S +V L + G + ++KI +T Sbjct: 133 ATAKATKRVNVTVEIRTSRLPQGSN-NLGGDLSSGMVNLSSYCKFTGRVHLIKIFENRKT 191 Query: 201 AMMNCTMDLNLTGQAIQGLSC 139 A MNC M L L + I+ L C Sbjct: 192 AEMNCAMTLVLKTKMIKNLRC 212 >ref|XP_007022217.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508721845|gb|EOY13742.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 192 Score = 95.5 bits (236), Expect = 3e-17 Identities = 55/181 (30%), Positives = 93/181 (51%), Gaps = 4/181 (2%) Frame = -3 Query: 666 RSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNSSFNATFI--- 496 R+ K ++ +A +I+ L+F L+V+RI P +RL V V++LR S+SS + +F Sbjct: 12 RNIKCWAIVVAGVIAKTIIILLFVLIVMRIRNPKVRLGGVTVENLRASSSSSSPSFSTKL 71 Query: 495 -ADIRLHNMNFGRFDFRGGSTALYYGNATXXXXXXXXXXXXXXXXRNIDAAVEVMGGPSS 319 A + + N NFG F F+ + + Y + + + + V Sbjct: 72 NAQVSVKNTNFGHFKFKNSTLTISYNGSPVGKATIVEGLARARSTKKFNVTILVSSNNKI 131 Query: 318 AANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGLSC 139 + N +S DIES + L A+L G+I + KI + ++A MNCTMD+N + + IQ L+C Sbjct: 132 SRNSDQLSSDIESGTINLSSHAKLEGKIHLFKIFKKKKSAEMNCTMDVNTSLKQIQKLTC 191 Query: 138 Q 136 + Sbjct: 192 K 192 >ref|XP_002887922.1| hypothetical protein ARALYDRAFT_474948 [Arabidopsis lyrata subsp. lyrata] gi|297333763|gb|EFH64181.1| hypothetical protein ARALYDRAFT_474948 [Arabidopsis lyrata subsp. lyrata] Length = 214 Score = 95.1 bits (235), Expect = 4e-17 Identities = 61/185 (32%), Positives = 92/185 (49%), Gaps = 6/185 (3%) Frame = -3 Query: 672 EHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNSS----FNA 505 E K LVY L V + + LI + LRI+ P + ++ +DLR+ +S FNA Sbjct: 33 EEPPGKCLVYSLTIIVIIFALCLILSSIFLRISKPEIETRSISTRDLRFGGNSTNPYFNA 92 Query: 504 TFIADIRLHNMNFGRFDFRGGSTALYYGN--ATXXXXXXXXXXXXXXXXRNIDAAVEVMG 331 T ++DI + N NFG F+F S + Y + R D VE+ Sbjct: 93 TLVSDISIRNSNFGAFEFEDSSLRVVYADHGVVGETKIAGRRVEAHKTVRITDVVVEI-- 150 Query: 330 GPSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQ 151 G N ++ D+ ++L VAE+RG I+V+ RW+ ++M+CTM LNLTG+ IQ Sbjct: 151 GSFRLLNTKDLDSDLRLGFLELRSVAEVRGRIKVLGR-RRWKVSVMSCTMRLNLTGRFIQ 209 Query: 150 GLSCQ 136 L C+ Sbjct: 210 NLLCE 214 >ref|XP_006368771.1| hypothetical protein POPTR_0001s09980g, partial [Populus trichocarpa] gi|550346930|gb|ERP65340.1| hypothetical protein POPTR_0001s09980g, partial [Populus trichocarpa] Length = 173 Score = 93.6 bits (231), Expect = 1e-16 Identities = 58/170 (34%), Positives = 92/170 (54%), Gaps = 3/170 (1%) Frame = -3 Query: 639 LLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNS---SFNATFIADIRLHNMN 469 L V LS + L+F ++V + P ++LS+V V+ L Y N+ SFN T A++ + N N Sbjct: 7 LALIVILSAIILVFAIIV-KPRTPRVKLSSVAVEHLSYGNNPIPSFNMTLAAEVSVKNSN 65 Query: 468 FGRFDFRGGSTALYYGNATXXXXXXXXXXXXXXXXRNIDAAVEVMGGPSSAANYLNISRD 289 F RF F S++ Y R ++ V++ G P S + N+S D Sbjct: 66 FVRFKFENTSSSALYKGMVVGEAKLRSGRVGARKTRRMNIVVKI-GSPGSLSEAKNLSSD 124 Query: 288 IESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGLSC 139 I S ++K+ A L+G++R+ IV RTA+M+C M+LNL+ ++IQ L C Sbjct: 125 INSGMLKMNSYATLKGDVRLFGIVKN-RTAVMSCGMNLNLSSRSIQDLEC 173 >ref|XP_004300835.1| PREDICTED: uncharacterized protein LOC101297644 [Fragaria vesca subsp. vesca] Length = 210 Score = 93.6 bits (231), Expect = 1e-16 Identities = 60/212 (28%), Positives = 103/212 (48%), Gaps = 4/212 (1%) Frame = -3 Query: 759 DEETYLNPEAKSDQKIVTNKALHPTPEQ-EEHRSSKTLVYILLAAVALSIVFLIFGLVVL 583 D+E+ + P A K+ +PT + RS+K VY+ V + L+F L+VL Sbjct: 3 DQESQIWPLAPG--KLHQRSEENPTFKAIRRERSNKCFVYVFSGIVFFCVTVLVFALLVL 60 Query: 582 RINAPSLRLSNVVVKDLRYSNS--SFNATFIADIRLHNMNFGRFDFRGGSTALYYGNATX 409 R+ +P +RL +V VK L+Y++S SFN + + + N NFG ++F + + Y Sbjct: 61 RVKSPEIRLRSVTVKSLKYTSSPPSFNVSLSGQMSVKNPNFGDYEFVPTTVSFLYSRGAV 120 Query: 408 XXXXXXXXXXXXXXXRNIDAAVEVMGGP-SSAANYLNISRDIESNLVKLVGVAELRGEIR 232 + V++ AN + DI S ++KL G ++ G++ Sbjct: 121 GSTKVAKGLAKVKKTERLSFGVDLRSNKLPEGAN--TLKSDINSGMLKLTGTGKVSGKVT 178 Query: 231 VMKIVNRWRTAMMNCTMDLNLTGQAIQGLSCQ 136 + KI+N+ +T M+CTM L L + I+ L C+ Sbjct: 179 LWKIINKRKTGKMDCTMTLVLKSKTIKDLVCR 210 >ref|NP_974086.1| late embryogenesis abundant hydroxyproline-rich glycoprotein [Arabidopsis thaliana] gi|49823490|gb|AAT68728.1| hypothetical protein At1g64065 [Arabidopsis thaliana] gi|55740529|gb|AAV63857.1| hypothetical protein At1g64065 [Arabidopsis thaliana] gi|332196066|gb|AEE34187.1| late embryogenesis abundant hydroxyproline-rich glycoprotein [Arabidopsis thaliana] Length = 214 Score = 90.9 bits (224), Expect = 7e-16 Identities = 57/183 (31%), Positives = 89/183 (48%), Gaps = 4/183 (2%) Frame = -3 Query: 672 EHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNSS----FNA 505 E K LVY L V + + LI + LRI+ P + ++ +DLR +S FNA Sbjct: 33 EEPPGKCLVYSLTIIVIIFALCLILSSIFLRISKPEIETRSISTRDLRSGGNSTNPYFNA 92 Query: 504 TFIADIRLHNMNFGRFDFRGGSTALYYGNATXXXXXXXXXXXXXXXXRNIDAAVEVMGGP 325 T ++DI + N NFG F+F + + Y + V V G Sbjct: 93 TLVSDISIRNSNFGAFEFEDSTLRVVYADHGVVGETKIEGRRVEAHKTVRITGVVVEIGS 152 Query: 324 SSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGL 145 + ++ +D+ ++L VAE+RG I+V+ RW+ ++M+CTM LNLTG+ IQ L Sbjct: 153 FRLLDTKDLDKDLRLGFLELRSVAEVRGRIKVLGR-KRWKVSVMSCTMRLNLTGRFIQNL 211 Query: 144 SCQ 136 C+ Sbjct: 212 LCE 214 >ref|XP_006300370.1| hypothetical protein CARUB_v10022353mg [Capsella rubella] gi|482569080|gb|EOA33268.1| hypothetical protein CARUB_v10022353mg [Capsella rubella] Length = 215 Score = 90.5 bits (223), Expect = 9e-16 Identities = 60/182 (32%), Positives = 86/182 (47%), Gaps = 4/182 (2%) Frame = -3 Query: 672 EHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNSS----FNA 505 E K LVY L V + V LI + LRI+ P + +V +DLR +S FNA Sbjct: 34 EEPPGKCLVYSLTIIVIVFAVCLILSSIFLRISKPEIETRSVSTRDLRSGGNSTNPYFNA 93 Query: 504 TFIADIRLHNMNFGRFDFRGGSTALYYGNATXXXXXXXXXXXXXXXXRNIDAAVEVMGGP 325 T ++DI + N NFG F+F S + Y + V V G Sbjct: 94 TLVSDISIRNSNFGAFEFEDSSLRVVYADHGVVGEATIPGRRVEAHKTVRITGVVVEIGS 153 Query: 324 SSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGL 145 + + D+ S ++L VAE+RG I+V+ RW+ ++M+CTM LNLT + IQ L Sbjct: 154 FRLLDRKGLELDLRSGFLELRSVAEVRGRIKVLG-RRRWKVSVMSCTMRLNLTNRFIQNL 212 Query: 144 SC 139 C Sbjct: 213 FC 214 >ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508777615|gb|EOY24871.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 215 Score = 90.1 bits (222), Expect = 1e-15 Identities = 54/206 (26%), Positives = 101/206 (49%), Gaps = 4/206 (1%) Frame = -3 Query: 741 NPEAKSDQKIVTNKALHPTPEQEEHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSL 562 N +SD++ A + E + + K VYI AV ++V LIF L V+R+ P + Sbjct: 15 NGHPRSDEE----SASLQSKELKRKKRIKYAVYIAAFAVFQTVVILIFALTVMRVKNPKV 70 Query: 561 RLSNVVVKDLRYSN----SSFNATFIADIRLHNMNFGRFDFRGGSTALYYGNATXXXXXX 394 R+ V V+ + SN +SFN FI + + N NFG + F + + Y Sbjct: 71 RIGKVTVETMETSNTEAAASFNLRFITQVTVKNTNFGHYKFDNATMSFLYDGVMVGEAII 130 Query: 393 XXXXXXXXXXRNIDAAVEVMGGPSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVN 214 + +D VEV + + + ++ S+++ L A+L+G++ +MK++ Sbjct: 131 PKARARARSTKKLDVTVEV-NSSALTSTTTGLGSELSSSVLTLNSQAKLKGKVELMKVMK 189 Query: 213 RWRTAMMNCTMDLNLTGQAIQGLSCQ 136 + ++ MNCT+ N++ +++Q L C+ Sbjct: 190 KKKSPEMNCTLIFNVSTRSLQDLKCK 215 >ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508721844|gb|EOY13741.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 259 Score = 90.1 bits (222), Expect = 1e-15 Identities = 48/188 (25%), Positives = 88/188 (46%), Gaps = 2/188 (1%) Frame = -3 Query: 696 LHPTPEQEEHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNS 517 + + E + + K L Y+ + + + L+F L V+RI P R+ +V+V DL ++NS Sbjct: 10 MEQSKELKRKKRMKCLAYVAAFVIFQTAIILVFALTVMRIKNPKFRIRSVLVDDLTFNNS 69 Query: 516 S--FNATFIADIRLHNMNFGRFDFRGGSTALYYGNATXXXXXXXXXXXXXXXXRNIDAAV 343 S FN FIA + + N NFG + F + Y + V Sbjct: 70 SPSFNMKFIAQVTVKNTNFGHYKFENSTVTFAYKGSQVGEALVTKGRARARARSTKKMNV 129 Query: 342 EVMGGPSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTG 163 + + AN ++ D+ S + L + L G++ +MK++ + ++ MNCTM +NL Sbjct: 130 TMDLNSNGVANDSDLGSDLNSGFLTLTSQSILNGKVHLMKVIKKKKSVEMNCTMTVNLAQ 189 Query: 162 QAIQGLSC 139 + ++ + C Sbjct: 190 KLVRDIKC 197