BLASTX nr result
ID: Mentha28_contig00010996
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00010996
         (1015 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gb|EYU23414.1| hypothetical protein MIMGU_mgv1a020543mg [Mimulus...    164   4e-38
ref|XP_002272642.1| PREDICTED: uncharacterized protein LOC100241...    122   3e-25
ref|XP_006343917.1| PREDICTED: uncharacterized protein LOC102578...    118   3e-24
ref|XP_007040371.1| Late embryogenesis abundant hydroxyproline-r...    110   1e-21
ref|XP_006477513.1| PREDICTED: uncharacterized protein LOC102620...    104   5e-20
ref|XP_006439452.1| hypothetical protein CICLE_v10023929mg [Citr...    104   7e-20
ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-r...    102   3e-19
ref|XP_007210661.1| hypothetical protein PRUPE_ppa022176mg [Prun...    101   4e-19
ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-r...     97   8e-18
ref|XP_002509872.1| conserved hypothetical protein [Ricinus comm...     97   8e-18
ref|XP_007022217.1| Late embryogenesis abundant hydroxyproline-r...     97   1e-17
ref|XP_002887922.1| hypothetical protein ARALYDRAFT_474948 [Arab...     96   2e-17
ref|XP_006573794.1| PREDICTED: uncharacterized protein LOC102667...     96   3e-17
gb|EXC34335.1| hypothetical protein L484_006690 [Morus notabilis]       94   7e-17
ref|XP_006368771.1| hypothetical protein POPTR_0001s09980g, part...     92   3e-16
ref|NP_974086.1| late embryogenesis abundant hydroxyproline-rich...     92   4e-16
ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-r...     91   6e-16
ref|XP_006300370.1| hypothetical protein CARUB_v10022353mg [Caps...     91   6e-16
ref|XP_004300835.1| PREDICTED: uncharacterized protein LOC101297...
91 8e-16 ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-r... 91 1e-15 >gb|EYU23414.1| hypothetical protein MIMGU_mgv1a020543mg [Mimulus guttatus] Length = 198 Score = 164 bits (416), Expect = 4e-38 Identities = 103/216 (47%), Positives = 127/216 (58%), Gaps = 7/216 (3%) Frame = -3 Query: 779 MDEETY--LNPEAKSDQKIVTNKGLHPTPEQEEHR-----SSKTLVYILLAAVALSIVFL 621 M+EE++ +NP KSD++ T T + R SSK LVYIL+A V S+ FL Sbjct: 1 MEEESHRIINPYIKSDEEEFT------TTTKNNRRGKGGGSSKCLVYILVAVVLQSVAFL 54 Query: 620 IFGLVVLRINAPSLRLSNVVVKDLRYSNSSFNATFIADIRLHNMNFGRFDFRGGSAALYY 441 +FGLV LRI+ PSLRLS+ V LR+ ++S N T +A IRL N NFG F+F GGSA+L Y Sbjct: 55 VFGLVALRISNPSLRLSSAAVAVLRHDSASLNMTVVAGIRLRNPNFGDFEFNGGSASLLY 114 Query: 440 GNATXXXXXXXXXXXXXXXXXXIDATVEVMGGPSSAANYLNISRDIESNLVKLVGVAELR 261 G AT I+ T+EV+GG LVKL +AELR Sbjct: 115 GEATVGVASIYGGRVGRRDKKEINVTMEVIGGGGG------------GELVKLRSMAELR 162 Query: 260 GEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGLSCQ 153 GE+RV+KIV R R A MNCTMDLNLT QA Q LSCQ Sbjct: 163 GEVRVVKIVKRRRIAFMNCTMDLNLTSQAFQDLSCQ 198 >ref|XP_002272642.1| PREDICTED: uncharacterized protein LOC100241699 [Vitis vinifera] Length = 213 Score = 122 bits (305), Expect = 3e-25 Identities = 78/218 (35%), Positives = 110/218 (50%), Gaps = 9/218 (4%) Frame = -3 Query: 779 MDEETYLNPEA------KSDQKIVTNKGLHPTPEQEEHRSSKTLVYILLAAVALSIVFLI 618 M E+ P A KSD++ K P + RSSK VY+L V L+ + L+ Sbjct: 1 MPEDNQFQPLAPARLHGKSDEEFGVFK---PRASKPPRRSSKCPVYVLAGLVTLAAIALV 57 Query: 617 FGLVVLRINAPSLRLSNVVVKDLRYSNS---SFNATFIADIRLHNMNFGRFDFRGGSAAL 447 F L VLR+ AP + L +V VK+L + S SFN T A++ + N NFG F+F G+A + Sbjct: 58 FALAVLRVEAPDVELKSVAVKNLTHGTSPSPSFNVTLTAEVSVQNKNFGAFNFENGTATV 117 Query: 446 YYGNATXXXXXXXXXXXXXXXXXXIDATVEVMGGPSSAANYLNISRDIESNLVKLVGVAE 267 Y ++ T++V N N+S DI S V L A+ Sbjct: 118 LYEGMVVGDEEFSKAHVESRKTKRMNVTLDVRS--DRLWNDKNLSSDISSGSVNLTTYAQ 175 Query: 266 LRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGLSCQ 153 + G++RVMK+V R TA MNC+M LNLT +IQ L C+ Sbjct: 176 VTGKVRVMKVVRRRTTARMNCSMTLNLTSSSIQDLVCR 213 >ref|XP_006343917.1| PREDICTED: 
uncharacterized protein LOC102578735 [Solanum tuberosum] Length = 223 Score = 118 bits (296), Expect = 3e-24 Identities = 71/205 (34%), Positives = 112/205 (54%), Gaps = 8/205 (3%) Frame = -3 Query: 746 KSDQKIVTNKGLH---PTPEQEEHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLR 576 KSDQ + +K ++ + +S K VY L V LSI+ LIF +V R +PS Sbjct: 18 KSDQGLNLSKSINYYNNHNNNKNRKSGKCFVYFLSTIVILSIIMLIFSMVFFRFKSPSFE 77 Query: 575 LSNVVVKDLRYSNS----SFNATFIADIRLHNMNFGRFDFRGGSAALY-YGNATXXXXXX 411 L ++ V++LR+SNS SFN +I + N NFG+ +++ S +++ Y N T Sbjct: 78 LDHINVQNLRFSNSTNSSSFNMNMGGEIIVDNDNFGQINYQDSSMSVFLYDNVTIGIANV 137 Query: 410 XXXXXXXXXXXXIDATVEVMGGPSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVN 231 I ++++ +Y N+S DI S ++KL E RG+++ MKI++ Sbjct: 138 NVGRVEARKSKRIGISLQLRTNYQLNYSYGNLSSDINSRMLKLTSFGEFRGKVKAMKIIS 197 Query: 230 RWRTAMMNCTMDLNLTGQAIQGLSC 156 + +T++MNCTM+LNLT QAIQ L C Sbjct: 198 KHKTSIMNCTMNLNLTSQAIQDLLC 222 >ref|XP_007040371.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508777616|gb|EOY24872.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 213 Score = 110 bits (274), Expect = 1e-21 Identities = 63/191 (32%), Positives = 98/191 (51%), Gaps = 3/191 (1%) Frame = -3 Query: 716 GLHPTPEQEEHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSN 537 G+ PT Q + +SSK LVY+L+ V V LIF +VLR P + + +V V++L+Y N Sbjct: 25 GIKPTASQRKEKSSKCLVYVLVGMVIQGAVLLIFASIVLRARTPDVEIVSVTVRNLKYGN 84 Query: 536 S---SFNATFIADIRLHNMNFGRFDFRGGSAALYYGNATXXXXXXXXXXXXXXXXXXIDA 366 S SFN T + ++ + N NFG F F + ++ G+ ++ Sbjct: 85 SSAPSFNLTLVTEVTVENSNFGDFKFENTTGTVWCGSVVVGKMKIPTGRAQARATERLNV 144 Query: 365 TVEVMGGPSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNL 186 +V+V P + N+S +I S L++L +L G++ +M + R R MNC M LNL Sbjct: 145 SVDVSSLP--LPDTKNVSCNISSGLLELNSHVKLSGKVSIMNFMKRRRHPEMNCFMTLNL 202 Query: 185 TGQAIQGLSCQ 153 TGQ Q C+ Sbjct: 203 TGQTKQDFPCE 213 >ref|XP_006477513.1| PREDICTED: uncharacterized protein LOC102620163 [Citrus sinensis] 
Length = 214 Score = 104 bits (260), Expect = 5e-20 Identities = 60/204 (29%), Positives = 106/204 (51%), Gaps = 3/204 (1%) Frame = -3 Query: 758 NPEAKSDQKIVTNKGLHPTPEQEEHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSL 579 N +SDQ+ P + + +SSK LVY+L+ V +S LI + LR N P + Sbjct: 15 NEYPRSDQEYA------PAVIESQRKSSKCLVYVLVTIVTVSAALLISASIFLRPNTPEV 68 Query: 578 RLSNVVVKDLRYSNS---SFNATFIADIRLHNMNFGRFDFRGGSAALYYGNATXXXXXXX 408 +L +V VK+L + N SFN T + ++ + N N+G F+++ S +++YG+ T Sbjct: 69 QLESVTVKNLSHGNGTSPSFNVTLVTELTIDNENYGYFEYKNCSGSVFYGSVTVGDVKIR 128 Query: 407 XXXXXXXXXXXIDATVEVMGGPSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNR 228 I+ TV+V + + N+ DI S +VKL A+L G + + ++ + Sbjct: 129 DGRVEAREVKRINVTVDVDVRSNGNLDNQNLRSDINSGIVKLNSYAKLHGNVSLFNVLKK 188 Query: 227 WRTAMMNCTMDLNLTGQAIQGLSC 156 +T ++C+M+L L +A++ L C Sbjct: 189 TKTPELDCSMNLVLARRAVEDLVC 212 >ref|XP_006439452.1| hypothetical protein CICLE_v10023929mg [Citrus clementina] gi|557541714|gb|ESR52692.1| hypothetical protein CICLE_v10023929mg [Citrus clementina] Length = 214 Score = 104 bits (259), Expect = 7e-20 Identities = 60/204 (29%), Positives = 106/204 (51%), Gaps = 3/204 (1%) Frame = -3 Query: 758 NPEAKSDQKIVTNKGLHPTPEQEEHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSL 579 N +SDQ+ P + + +SSK LVY+L+ V +S LI + LR N P + Sbjct: 15 NEYPRSDQEYA------PAVIESQRKSSKCLVYVLVTIVTVSAALLISASIFLRPNTPEV 68 Query: 578 RLSNVVVKDLRYSNS---SFNATFIADIRLHNMNFGRFDFRGGSAALYYGNATXXXXXXX 408 +L +V VK+L + N SFN T + ++ + N N+G F+++ S +++YG+ T Sbjct: 69 QLESVTVKNLSHGNGTSPSFNVTLVTELTIDNENYGYFEYKNCSGSVFYGSVTVGDVKIR 128 Query: 407 XXXXXXXXXXXIDATVEVMGGPSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNR 228 I+ TV+V + + N+S D S +VKL A+L G + + ++ + Sbjct: 129 DGRVEAREVKRINVTVDVDVRSNGNLDNQNLSSDRNSGIVKLNSYAKLHGNVNLFNVLKK 188 Query: 227 WRTAMMNCTMDLNLTGQAIQGLSC 156 +T ++C+M+L L +A++ L C Sbjct: 189 TKTPELDCSMNLVLARRAVEDLVC 212 >ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508777614|gb|EOY24870.1| Late embryogenesis abundant 
hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 188 Score = 102 bits (254), Expect = 3e-19 Identities = 60/180 (33%), Positives = 92/180 (51%), Gaps = 3/180 (1%) Frame = -3 Query: 683 RSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNSS---FNATFI 513 ++ K YI+ V +I+ L+F L V+RI PS RL +V V+ L Y+ S FN I Sbjct: 11 QNMKCYAYIIAGVVFQTIIILVFALTVMRIKTPSARLRSVTVQSLNYNASGVPHFNMRLI 70 Query: 512 ADIRLHNMNFGRFDFRGGSAALYYGNATXXXXXXXXXXXXXXXXXXIDATVEVMGGPSSA 333 +I + N NFG F F +A + +G+ ++ TV+V S Sbjct: 71 MEIAVKNKNFGHFRFDNTTANVTFGSVMVGDGEIVKSRARARKTKRMNVTVDVSSSAVSD 130 Query: 332 ANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGLSCQ 153 + L + S + L GVA LRG++ +MK++ + +TA MNCTM +NL A+Q L C+ Sbjct: 131 EDELRTK--LSSGTLTLTGVARLRGKVTLMKLMKKRKTAEMNCTMTVNLNSHAVQDLDCE 188 >ref|XP_007210661.1| hypothetical protein PRUPE_ppa022176mg [Prunus persica] gi|462406396|gb|EMJ11860.1| hypothetical protein PRUPE_ppa022176mg [Prunus persica] Length = 213 Score = 101 bits (252), Expect = 4e-19 Identities = 59/182 (32%), Positives = 95/182 (52%), Gaps = 5/182 (2%) Frame = -3 Query: 683 RSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYS---NSSFNATFI 513 RS+K VY+ A V SI L+F LVVLR+ +P LS+V VK L+++ SS NAT + Sbjct: 34 RSNKCFVYVFAAIVLQSIFILVFALVVLRVKSPGFNLSSVSVKSLKHTTSPTSSLNATLV 93 Query: 512 ADIRLHNMNFGRFDFRGGSAALYYGNATXXXXXXXXXXXXXXXXXXIDATVEVMGG--PS 339 ++ + N NFG + F G SA+L+YG + +++V P Sbjct: 94 TELAIKNKNFGEYKFEGSSASLWYGGFKVGEAKIGKGRVKARGTRRVSLSIDVRSNRLPQ 153 Query: 338 SAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGLS 159 A N ++ S +K+ A+L G++ +MKI+ + +T NCTM + L + ++ L Sbjct: 154 EAKN--GFEGEMNSGYLKISSYAKLTGKVNLMKIMKKRKTIDTNCTMVVVLKSRTVKDLF 211 Query: 158 CQ 153 C+ Sbjct: 212 CR 213 >ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508776113|gb|EOY23369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 201 Score = 97.4 bits (241), Expect 
= 8e-18 Identities = 56/194 (28%), Positives = 95/194 (48%), Gaps = 4/194 (2%) Frame = -3 Query: 722 NKGLHPTPEQEEHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRY 543 N + E + + K Y V +IV L+F L V+RI P R+ ++ V+D+ Y Sbjct: 10 NIDMESAAELKRKKRMKLFAYAAAFVVFQTIVILVFSLTVMRIKNPKFRVRSITVEDIAY 69 Query: 542 SNS----SFNATFIADIRLHNMNFGRFDFRGGSAALYYGNATXXXXXXXXXXXXXXXXXX 375 +++ SFN F A++ + N NFG F F + + YG Sbjct: 70 TSTPNPPSFNMKFNAEVAVKNTNFGHFKFDNTTISFDYGGVQVGEAFVAKGRAKARSTKK 129 Query: 374 IDATVEVMGGPSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMD 195 ++ TV++ A + N++ DI S + L +L G++ +MK++ + ++A MNCTM Sbjct: 130 MNVTVDLNSNNIPANS--NLASDISSGFLTLTTHTKLSGKVHLMKLIKKKKSAQMNCTMT 187 Query: 194 LNLTGQAIQGLSCQ 153 +NL +AIQ + CQ Sbjct: 188 VNLASRAIQDIKCQ 201 >ref|XP_002509872.1| conserved hypothetical protein [Ricinus communis] gi|223549771|gb|EEF51259.1| conserved hypothetical protein [Ricinus communis] Length = 221 Score = 97.4 bits (241), Expect = 8e-18 Identities = 62/189 (32%), Positives = 95/189 (50%), Gaps = 11/189 (5%) Frame = -3 Query: 689 EHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNSS------- 531 + RSSK LVY+L V LS V L+F LVVLR P+ LS V +KDL Y+ S Sbjct: 33 QERSSKCLVYVLAGIVILSAVILVFALVVLRPVNPNAELSFVRLKDLNYAAGSGGNGNNV 92 Query: 530 ----FNATFIADIRLHNMNFGRFDFRGGSAALYYGNATXXXXXXXXXXXXXXXXXXIDAT 363 FN T +++++ N NFG F + SA ++YG ++ Sbjct: 93 SLPAFNMTLESELKIENSNFGEFKYDNTSARVFYGGMAVGEAILREGRVSARDTLRMNVK 152 Query: 362 VEVMGGPSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLT 183 VEV N +++ DI S ++KL A+ G + +++I + R+A M+C+ L+L Sbjct: 153 VEVR-SHKYIYNGTDLTSDINSGILKLNSHAKFSGRVNLLQIAKKRRSASMDCSFSLDLR 211 Query: 182 GQAIQGLSC 156 ++IQ L C Sbjct: 212 SRSIQDLVC 220 >ref|XP_007022217.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508721845|gb|EOY13742.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 192 Score = 96.7 bits (239), Expect = 1e-17 Identities = 56/181 (30%), 
Positives = 93/181 (51%), Gaps = 4/181 (2%) Frame = -3 Query: 683 RSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNSSFNATFI--- 513 R+ K ++ +A +I+ L+F L+V+RI P +RL V V++LR S+SS + +F Sbjct: 12 RNIKCWAIVVAGVIAKTIIILLFVLIVMRIRNPKVRLGGVTVENLRASSSSSSPSFSTKL 71 Query: 512 -ADIRLHNMNFGRFDFRGGSAALYYGNATXXXXXXXXXXXXXXXXXXIDATVEVMGGPSS 336 A + + N NFG F F+ + + Y + + T+ V Sbjct: 72 NAQVSVKNTNFGHFKFKNSTLTISYNGSPVGKATIVEGLARARSTKKFNVTILVSSNNKI 131 Query: 335 AANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGLSC 156 + N +S DIES + L A+L G+I + KI + ++A MNCTMD+N + + IQ L+C Sbjct: 132 SRNSDQLSSDIESGTINLSSHAKLEGKIHLFKIFKKKKSAEMNCTMDVNTSLKQIQKLTC 191 Query: 155 Q 153 + Sbjct: 192 K 192 >ref|XP_002887922.1| hypothetical protein ARALYDRAFT_474948 [Arabidopsis lyrata subsp. lyrata] gi|297333763|gb|EFH64181.1| hypothetical protein ARALYDRAFT_474948 [Arabidopsis lyrata subsp. lyrata] Length = 214 Score = 95.9 bits (237), Expect = 2e-17 Identities = 59/183 (32%), Positives = 89/183 (48%), Gaps = 4/183 (2%) Frame = -3 Query: 689 EHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNSS----FNA 522 E K LVY L V + + LI + LRI+ P + ++ +DLR+ +S FNA Sbjct: 33 EEPPGKCLVYSLTIIVIIFALCLILSSIFLRISKPEIETRSISTRDLRFGGNSTNPYFNA 92 Query: 521 TFIADIRLHNMNFGRFDFRGGSAALYYGNATXXXXXXXXXXXXXXXXXXIDATVEVMGGP 342 T ++DI + N NFG F+F S + Y + V V G Sbjct: 93 TLVSDISIRNSNFGAFEFEDSSLRVVYADHGVVGETKIAGRRVEAHKTVRITDVVVEIGS 152 Query: 341 SSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGL 162 N ++ D+ ++L VAE+RG I+V+ RW+ ++M+CTM LNLTG+ IQ L Sbjct: 153 FRLLNTKDLDSDLRLGFLELRSVAEVRGRIKVLGR-RRWKVSVMSCTMRLNLTGRFIQNL 211 Query: 161 SCQ 153 C+ Sbjct: 212 LCE 214 >ref|XP_006573794.1| PREDICTED: uncharacterized protein LOC102667904 [Glycine max] Length = 184 Score = 95.5 bits (236), Expect = 3e-17 Identities = 60/182 (32%), Positives = 88/182 (48%), Gaps = 1/182 (0%) Frame = -3 Query: 695 QEEHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNS-SFNAT 519 Q+E RS K VY+L A V L + L+F + LR+ P L+L + + YS S SFNAT Sbjct: 6 
QQERRSGKCFVYLLAAFVILCALVLVFASL-LRVKNPYLKLRSATSNKISYSTSPSFNAT 64 Query: 518 FIADIRLHNMNFGRFDFRGGSAALYYGNATXXXXXXXXXXXXXXXXXXIDATVEVMGGPS 339 I + + N NFG F + ++ Y I+ TV++M + Sbjct: 65 LIIFLGIKNPNFGAFSYNNNRVSVLYAGVKIADRQINGGRVRFRETKEINVTVKLMSAKA 124 Query: 338 SAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGLS 159 + N+S DI S + L + G + ++KI+N +T M C M LNLT IQG+ Sbjct: 125 PISE--NLSIDISSGSLNLTSNVKFSGTVHMLKIINIRKTIEMACAMKLNLTSHTIQGIQ 182 Query: 158 CQ 153 CQ Sbjct: 183 CQ 184 >gb|EXC34335.1| hypothetical protein L484_006690 [Morus notabilis] Length = 213 Score = 94.4 bits (233), Expect = 7e-17 Identities = 59/182 (32%), Positives = 89/182 (48%), Gaps = 4/182 (2%) Frame = -3 Query: 689 EHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNS---SFNAT 519 + R++K VYI V L + LIF L+VLR +P ++L +V VK L YS S S NAT Sbjct: 32 KERTNKCFVYIFAGIVILGAILLIFALIVLRSKSPEIKLKSVTVKSLDYSTSPWPSLNAT 91 Query: 518 FIADIRLHNMNFGRFDFRGGSAALY-YGNATXXXXXXXXXXXXXXXXXXIDATVEVMGGP 342 IA + + N NFG + F ++A++ YG ++ TVE+ Sbjct: 92 LIATVAIKNPNFGPYRFGSNNSAVFLYGGGKLGEQRIRQGKATAKATKRVNVTVEIRTSR 151 Query: 341 SSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGL 162 + N+ D+ S +V L + G + ++KI +TA MNC M L L + I+ L Sbjct: 152 LPQGSN-NLGGDLSSGMVNLSSYCKFTGRVHLIKIFENRKTAEMNCAMTLVLKTKMIKNL 210 Query: 161 SC 156 C Sbjct: 211 RC 212 >ref|XP_006368771.1| hypothetical protein POPTR_0001s09980g, partial [Populus trichocarpa] gi|550346930|gb|ERP65340.1| hypothetical protein POPTR_0001s09980g, partial [Populus trichocarpa] Length = 173 Score = 92.4 bits (228), Expect = 3e-16 Identities = 57/170 (33%), Positives = 91/170 (53%), Gaps = 3/170 (1%) Frame = -3 Query: 656 LLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNS---SFNATFIADIRLHNMN 486 L V LS + L+F ++V + P ++LS+V V+ L Y N+ SFN T A++ + N N Sbjct: 7 LALIVILSAIILVFAIIV-KPRTPRVKLSSVAVEHLSYGNNPIPSFNMTLAAEVSVKNSN 65 Query: 485 FGRFDFRGGSAALYYGNATXXXXXXXXXXXXXXXXXXIDATVEVMGGPSSAANYLNISRD 306 F RF F S++ Y ++ V++ G P S + N+S D Sbjct: 66 
FVRFKFENTSSSALYKGMVVGEAKLRSGRVGARKTRRMNIVVKI-GSPGSLSEAKNLSSD 124 Query: 305 IESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGLSC 156 I S ++K+ A L+G++R+ IV RTA+M+C M+LNL+ ++IQ L C Sbjct: 125 INSGMLKMNSYATLKGDVRLFGIVKN-RTAVMSCGMNLNLSSRSIQDLEC 173 >ref|NP_974086.1| late embryogenesis abundant hydroxyproline-rich glycoprotein [Arabidopsis thaliana] gi|49823490|gb|AAT68728.1| hypothetical protein At1g64065 [Arabidopsis thaliana] gi|55740529|gb|AAV63857.1| hypothetical protein At1g64065 [Arabidopsis thaliana] gi|332196066|gb|AEE34187.1| late embryogenesis abundant hydroxyproline-rich glycoprotein [Arabidopsis thaliana] Length = 214 Score = 91.7 bits (226), Expect = 4e-16 Identities = 57/183 (31%), Positives = 89/183 (48%), Gaps = 4/183 (2%) Frame = -3 Query: 689 EHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNSS----FNA 522 E K LVY L V + + LI + LRI+ P + ++ +DLR +S FNA Sbjct: 33 EEPPGKCLVYSLTIIVIIFALCLILSSIFLRISKPEIETRSISTRDLRSGGNSTNPYFNA 92 Query: 521 TFIADIRLHNMNFGRFDFRGGSAALYYGNATXXXXXXXXXXXXXXXXXXIDATVEVMGGP 342 T ++DI + N NFG F+F + + Y + V V G Sbjct: 93 TLVSDISIRNSNFGAFEFEDSTLRVVYADHGVVGETKIEGRRVEAHKTVRITGVVVEIGS 152 Query: 341 SSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGL 162 + ++ +D+ ++L VAE+RG I+V+ RW+ ++M+CTM LNLTG+ IQ L Sbjct: 153 FRLLDTKDLDKDLRLGFLELRSVAEVRGRIKVLGR-KRWKVSVMSCTMRLNLTGRFIQNL 211 Query: 161 SCQ 153 C+ Sbjct: 212 LCE 214 >ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508721844|gb|EOY13741.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 259 Score = 91.3 bits (225), Expect = 6e-16 Identities = 48/188 (25%), Positives = 88/188 (46%), Gaps = 2/188 (1%) Frame = -3 Query: 713 LHPTPEQEEHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNS 534 + + E + + K L Y+ + + + L+F L V+RI P R+ +V+V DL ++NS Sbjct: 10 MEQSKELKRKKRMKCLAYVAAFVIFQTAIILVFALTVMRIKNPKFRIRSVLVDDLTFNNS 69 Query: 533 
S--FNATFIADIRLHNMNFGRFDFRGGSAALYYGNATXXXXXXXXXXXXXXXXXXIDATV 360 S FN FIA + + N NFG + F + Y + V Sbjct: 70 SPSFNMKFIAQVTVKNTNFGHYKFENSTVTFAYKGSQVGEALVTKGRARARARSTKKMNV 129 Query: 359 EVMGGPSSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTG 180 + + AN ++ D+ S + L + L G++ +MK++ + ++ MNCTM +NL Sbjct: 130 TMDLNSNGVANDSDLGSDLNSGFLTLTSQSILNGKVHLMKVIKKKKSVEMNCTMTVNLAQ 189 Query: 179 QAIQGLSC 156 + ++ + C Sbjct: 190 KLVRDIKC 197 >ref|XP_006300370.1| hypothetical protein CARUB_v10022353mg [Capsella rubella] gi|482569080|gb|EOA33268.1| hypothetical protein CARUB_v10022353mg [Capsella rubella] Length = 215 Score = 91.3 bits (225), Expect = 6e-16 Identities = 60/182 (32%), Positives = 86/182 (47%), Gaps = 4/182 (2%) Frame = -3 Query: 689 EHRSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNSS----FNA 522 E K LVY L V + V LI + LRI+ P + +V +DLR +S FNA Sbjct: 34 EEPPGKCLVYSLTIIVIVFAVCLILSSIFLRISKPEIETRSVSTRDLRSGGNSTNPYFNA 93 Query: 521 TFIADIRLHNMNFGRFDFRGGSAALYYGNATXXXXXXXXXXXXXXXXXXIDATVEVMGGP 342 T ++DI + N NFG F+F S + Y + V V G Sbjct: 94 TLVSDISIRNSNFGAFEFEDSSLRVVYADHGVVGEATIPGRRVEAHKTVRITGVVVEIGS 153 Query: 341 SSAANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGL 162 + + D+ S ++L VAE+RG I+V+ RW+ ++M+CTM LNLT + IQ L Sbjct: 154 FRLLDRKGLELDLRSGFLELRSVAEVRGRIKVLG-RRRWKVSVMSCTMRLNLTNRFIQNL 212 Query: 161 SC 156 C Sbjct: 213 FC 214 >ref|XP_004300835.1| PREDICTED: uncharacterized protein LOC101297644 [Fragaria vesca subsp. 
vesca] Length = 210 Score = 90.9 bits (224), Expect = 8e-16 Identities = 53/180 (29%), Positives = 90/180 (50%), Gaps = 3/180 (1%) Frame = -3 Query: 683 RSSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNS--SFNATFIA 510 RS+K VY+ V + L+F L+VLR+ +P +RL +V VK L+Y++S SFN + Sbjct: 33 RSNKCFVYVFSGIVFFCVTVLVFALLVLRVKSPEIRLRSVTVKSLKYTSSPPSFNVSLSG 92 Query: 509 DIRLHNMNFGRFDFRGGSAALYYGNATXXXXXXXXXXXXXXXXXXIDATVEVMGGP-SSA 333 + + N NFG ++F + + Y + V++ Sbjct: 93 QMSVKNPNFGDYEFVPTTVSFLYSRGAVGSTKVAKGLAKVKKTERLSFGVDLRSNKLPEG 152 Query: 332 ANYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGLSCQ 153 AN + DI S ++KL G ++ G++ + KI+N+ +T M+CTM L L + I+ L C+ Sbjct: 153 AN--TLKSDINSGMLKLTGTGKVSGKVTLWKIINKRKTGKMDCTMTLVLKSKTIKDLVCR 210 >ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein [Theobroma cacao] gi|508777612|gb|EOY24868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein [Theobroma cacao] Length = 185 Score = 90.5 bits (223), Expect = 1e-15 Identities = 49/179 (27%), Positives = 88/179 (49%), Gaps = 3/179 (1%) Frame = -3 Query: 680 SSKTLVYILLAAVALSIVFLIFGLVVLRINAPSLRLSNVVVKDLRYSNSS---FNATFIA 510 ++K L Y+ + V + + LIF L V+RI P +R V V++ NSS F+ +A Sbjct: 9 NAKCLAYVAVFVVFQTAIILIFALTVMRIKNPKVRFGAVTVENFSTGNSSSPFFDMRLMA 68 Query: 509 DIRLHNMNFGRFDFRGGSAALYYGNATXXXXXXXXXXXXXXXXXXIDATVEVMGGPSSAA 330 + + N NFG F + S + YG D T+++ S Sbjct: 69 QVTVKNTNFGHFKYENSSIRILYGGMPVGEATIVKARARARQTKKFDVTIDISSSKLSTN 128 Query: 329 NYLNISRDIESNLVKLVGVAELRGEIRVMKIVNRWRTAMMNCTMDLNLTGQAIQGLSCQ 153 + N+ DI S ++ L A+L G++ +MK++ + +++ M+CTM +N+ + +Q L C+ Sbjct: 129 S--NLGNDIASGVLPLSSEAKLSGKVHLMKVIKKKKSSEMSCTMGINIGTRTVQDLKCK 185
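
Each alignment in the report above carries a statistics line of the form "Score = ... bits (...), Expect = ... Identities = a/b (p%)". The following is a minimal sketch, not part of the BLAST output, of how those per-hit figures could be extracted from the flattened text and how the reported identity percentage follows from the a/b counts. The names STATS and parse_stats are hypothetical, and the whitespace-tolerant pattern is an assumption about how the text was collapsed.

```python
import re

# Hypothetical pattern for the per-hit statistics line; \s+ and \s* are used so
# that it matches whether the fields are separated by spaces or by newlines.
STATS = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s+bits\s+\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\de.+-]+?),?\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s+\((?P<pct>\d+)%\)"
)

def parse_stats(text):
    """Return one dict per statistics line found in the report text."""
    hits = []
    for m in STATS.finditer(text):
        evalue = m["evalue"]
        if evalue.startswith("e"):
            # Some BLAST versions print very small values as "e-100".
            evalue = "1" + evalue
        hits.append({
            "bits": float(m["bits"]),
            "raw_score": int(m["raw"]),
            "evalue": float(evalue),
            "identities": int(m["ident"]),
            "aligned_length": int(m["length"]),
            "reported_pct": int(m["pct"]),
        })
    return hits

# Statistics line of the first hit (gb|EYU23414.1|), copied from the report.
sample = ("Score = 164 bits (416), Expect = 4e-38 "
          "Identities = 103/216 (47%), Positives = 127/216 (58%), "
          "Gaps = 7/216 (3%)")

for hit in parse_stats(sample):
    # The report shows the percentage rounded down to a whole number.
    computed = int(100 * hit["identities"] / hit["aligned_length"])
    print(hit["bits"], hit["evalue"], computed, hit["reported_pct"])
```

For the first hit, 103 identical positions over an aligned length of 216 gives about 47.7%, which appears in the report as 47%.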