BLASTX nr result
ID: Rauwolfia21_contig00011544
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00011544
         (2001 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                        (bits) Value

ref|XP_006341680.1| PREDICTED: uncharacterized protein LOC102589...   231   9e-58
ref|XP_004235710.1| PREDICTED: uncharacterized protein LOC101250...   228   1e-56
gb|EOY26757.1| Late embryogenesis abundant hydroxyproline-rich g...   209   4e-51
gb|EOY26756.1| Late embryogenesis abundant hydroxyproline-rich g...   209   4e-51
gb|EOY26755.1| Late embryogenesis abundant hydroxyproline-rich g...   209   4e-51
ref|XP_002284574.1| PREDICTED: uncharacterized protein LOC100254...   202   4e-49
emb|CBI28084.3| unnamed protein product [Vitis vinifera]              202   4e-49
ref|XP_006427015.1| hypothetical protein CICLE_v10026444mg [Citr...   201   1e-48
ref|XP_006465533.1| PREDICTED: uncharacterized protein LOC102615...   200   2e-48
gb|EXB37026.1| hypothetical protein L484_020812 [Morus notabilis]     199   3e-48
gb|EOY30818.1| Late embryogenesis abundant (LEA) hydroxyproline-...   199   4e-48
ref|XP_002279706.1| PREDICTED: uncharacterized protein LOC100258...   198   8e-48
ref|XP_004287097.1| PREDICTED: uncharacterized protein LOC101303...   197   1e-47
ref|XP_002331158.1| predicted protein [Populus trichocarpa] gi|5...   197   2e-47
gb|EOY26762.1| Late embryogenesis abundant hydroxyproline-rich g...   196   3e-47
ref|XP_002863251.1| hypothetical protein ARALYDRAFT_497058 [Arab...   196   3e-47
ref|XP_006414934.1| hypothetical protein EUTSA_v10026222mg [Eutr...   196   4e-47
ref|XP_002304190.2| hypothetical protein POPTR_0003s05380g, part...   194   9e-47
ref|XP_002299644.2| hypothetical protein POPTR_0001s18090g [Popu...   193   2e-46
ref|NP_193063.2| late embryogenesis abundant hydroxyproline-rich...   193   3e-46

>ref|XP_006341680.1| PREDICTED: uncharacterized protein LOC102589613 [Solanum tuberosum]
          Length = 221

 Score = 231 bits (589), Expect = 9e-58
 Identities = 107/200 (53%), Positives = 149/200 (74%), Gaps = 1/200 (0%)
 Frame = -3

Query: 1837 PRREPQPQYVIVLPHYYN-PAXXXXXXXXXXXXCFASLLLLAGAIFFLWPSDPDVSIVRL 1661
            P+   QPQY+IVLP YY  P             C A  +LL+ A+FFLWPSDP++SI RL
Sbjct: 22   PQPHQQPQYIIVLPQYYRTPRQFLRRPTRRYVYCAAVFILLSAALFFLWPSDPELSIARL 81

Query: 1660 HLHHFRIRTFPIISLDINLALTVKVRNKDLYSLDYKSLVVSIGYRGKQLGLVRSNHGHVR 1481
             L H ++ +FP I++D+ L +T K+RNKD YS++++ +V+SIGYRGKQLG V S++G ++
Sbjct: 82   KLRHLKVHSFPKIAIDVTLDVTAKIRNKDFYSVNFRYVVISIGYRGKQLGHVISDYGRIK 141

Query: 1480 ARGSSYVSATLSLSGVEILADVIPLLEDVTRGEITFDTVSKIDGQLGLAFFELPLEGKVS 1301
            AR SSYV+ATL L+ + I +D+IPL+ED+ RG ITFDTV++I G+LGL  F++P++GKV
Sbjct: 142  ARASSYVNATLELTDISIFSDLIPLIEDLARGSITFDTVTQIGGELGLVLFDIPIKGKVV 201

Query: 1300 CEIIVDIHNQTIEHQNCYPQ 1241
            CEI+VD  N+TI HQNCYP+
Sbjct: 202  CEIVVDTRNETISHQNCYPE 221

>ref|XP_004235710.1| PREDICTED: uncharacterized protein LOC101250488 [Solanum lycopersicum]
          Length = 221

 Score = 228 bits (580), Expect = 1e-56
 Identities = 106/200 (53%), Positives = 147/200 (73%), Gaps = 1/200 (0%)
 Frame = -3

Query: 1837 PRREPQPQYVIVLPHYYN-PAXXXXXXXXXXXXCFASLLLLAGAIFFLWPSDPDVSIVRL 1661
            P+   QPQY+IVLP YY  P             C A  +LL+ A+F +WPSDP++SI RL
Sbjct: 22   PQPHQQPQYIIVLPQYYRTPRQFLRRPTRRYVCCAAVFILLSAALFLIWPSDPELSIARL 81

Query: 1660 HLHHFRIRTFPIISLDINLALTVKVRNKDLYSLDYKSLVVSIGYRGKQLGLVRSNHGHVR 1481
             L H ++ +FP I++D+ L +T K+RNKD YS+ ++ +V+SIGYRGKQLG V S++G ++
Sbjct: 82   KLRHLKVHSFPKIAIDVTLDVTAKIRNKDFYSVGFRYVVISIGYRGKQLGHVISDYGRIK 141

Query: 1480 ARGSSYVSATLSLSGVEILADVIPLLEDVTRGEITFDTVSKIDGQLGLAFFELPLEGKVS 1301
            AR SSYV+ATL L+ V I +D+IPL+ED+ RG ITFDTV++I G+LGL  F++P++GKV
Sbjct: 142  ARASSYVNATLELTDVSIFSDLIPLIEDLARGSITFDTVTQIGGELGLVLFDIPIKGKVV 201

Query: 1300 CEIIVDIHNQTIEHQNCYPQ 1241
            CEI+VD  N+TI HQNCYP+
Sbjct: 202  CEIVVDTRNETISHQNCYPE 221

>gb|EOY26757.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein isoform 3 [Theobroma cacao]
          Length = 222

 Score = 209 bits (532), Expect = 4e-51
 Identities = 104/195 (53%), Positives = 140/195 (71%)
 Frame = -3

Query: 1828 EPQPQYVIVLPHYYNPAXXXXXXXXXXXXCFASLLLLAGAIFFLWPSDPDVSIVRLHLHH 1649
            +P  Q  +VLP YY P               ASL+LLA +++  WPSDP+V IVR+H+
Sbjct: 28   QPPDQNYLVLP-YYRPTLRWCGCRILCT---ASLVLLATSVYIFWPSDPEVKIVRMHVDR 83

Query: 1648 FRIRTFPIISLDINLALTVKVRNKDLYSLDYKSLVVSIGYRGKQLGLVRSNHGHVRARGS 1469
             ++ T PII+LDI+L +T+KVRN D+YS+D+ SL V++GYRGK LG V S HGHVRA GS
Sbjct: 84   MQLHTIPIIALDISLLVTLKVRNSDVYSVDFTSLDVAVGYRGKMLGHVTSEHGHVRAWGS 143

Query: 1468 SYVSATLSLSGVEILADVIPLLEDVTRGEITFDTVSKIDGQLGLAFFELPLEGKVSCEII 1289
            SYV A L L+GVE+L+DV+ +LED+ RG + FDTV+++ G LGL+ F+ PL+ +VSCEI+
Sbjct: 144  SYVQAELELNGVEVLSDVVYMLEDLARGTVPFDTVTEVAGWLGLSLFKFPLKARVSCEIV 203

Query: 1288 VDIHNQTIEHQNCYP 1244
            V+  NQTI  QNCYP
Sbjct: 204  VNRTNQTIIRQNCYP 218

>gb|EOY26756.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein isoform 2 [Theobroma cacao]
          Length = 249

 Score = 209 bits (532), Expect = 4e-51
 Identities = 104/195 (53%), Positives = 140/195 (71%)
 Frame = -3

Query: 1828 EPQPQYVIVLPHYYNPAXXXXXXXXXXXXCFASLLLLAGAIFFLWPSDPDVSIVRLHLHH 1649
            +P  Q  +VLP YY P               ASL+LLA +++  WPSDP+V IVR+H+
Sbjct: 28   QPPDQNYLVLP-YYRPTLRWCGCRILCT---ASLVLLATSVYIFWPSDPEVKIVRMHVDR 83

Query: 1648 FRIRTFPIISLDINLALTVKVRNKDLYSLDYKSLVVSIGYRGKQLGLVRSNHGHVRARGS 1469
             ++ T PII+LDI+L +T+KVRN D+YS+D+ SL V++GYRGK LG V S HGHVRA GS
Sbjct: 84   MQLHTIPIIALDISLLVTLKVRNSDVYSVDFTSLDVAVGYRGKMLGHVTSEHGHVRAWGS 143

Query: 1468 SYVSATLSLSGVEILADVIPLLEDVTRGEITFDTVSKIDGQLGLAFFELPLEGKVSCEII 1289
            SYV A L L+GVE+L+DV+ +LED+ RG + FDTV+++ G LGL+ F+ PL+ +VSCEI+
Sbjct: 144  SYVQAELELNGVEVLSDVVYMLEDLARGTVPFDTVTEVAGWLGLSLFKFPLKARVSCEIV 203

Query: 1288 VDIHNQTIEHQNCYP 1244
            V+  NQTI  QNCYP
Sbjct: 204  VNRTNQTIIRQNCYP 218

>gb|EOY26755.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein isoform 1 [Theobroma cacao]
          Length = 220

 Score = 209 bits (532), Expect = 4e-51
 Identities = 104/195 (53%), Positives = 140/195 (71%)
 Frame = -3

Query: 1828 EPQPQYVIVLPHYYNPAXXXXXXXXXXXXCFASLLLLAGAIFFLWPSDPDVSIVRLHLHH 1649
            +P  Q  +VLP YY P               ASL+LLA +++  WPSDP+V IVR+H+
Sbjct: 28   QPPDQNYLVLP-YYRPTLRWCGCRILCT---ASLVLLATSVYIFWPSDPEVKIVRMHVDR 83

Query: 1648 FRIRTFPIISLDINLALTVKVRNKDLYSLDYKSLVVSIGYRGKQLGLVRSNHGHVRARGS 1469
             ++ T PII+LDI+L +T+KVRN D+YS+D+ SL V++GYRGK LG V S HGHVRA GS
Sbjct: 84   MQLHTIPIIALDISLLVTLKVRNSDVYSVDFTSLDVAVGYRGKMLGHVTSEHGHVRAWGS 143

Query: 1468 SYVSATLSLSGVEILADVIPLLEDVTRGEITFDTVSKIDGQLGLAFFELPLEGKVSCEII 1289
            SYV A L L+GVE+L+DV+ +LED+ RG + FDTV+++ G LGL+ F+ PL+ +VSCEI+
Sbjct: 144  SYVQAELELNGVEVLSDVVYMLEDLARGTVPFDTVTEVAGWLGLSLFKFPLKARVSCEIV 203

Query: 1288 VDIHNQTIEHQNCYP 1244
            V+  NQTI  QNCYP
Sbjct: 204  VNRTNQTIIRQNCYP 218

>ref|XP_002284574.1| PREDICTED: uncharacterized protein LOC100254347 [Vitis vinifera]
          Length = 212

 Score = 202 bits (514), Expect = 4e-49
 Identities = 96/189 (50%), Positives = 134/189 (70%)
 Frame = -3

Query: 1810 VIVLPHYYNPAXXXXXXXXXXXXCFASLLLLAGAIFFLWPSDPDVSIVRLHLHHFRIRTF 1631
            V++LP YY                 A  L ++ A++ L+PSDP V +V LHL+  ++ T
Sbjct: 23   VVLLPVYYPRRRRLLYRLCNAFLACAVFLSISAAVYLLYPSDPTVQVVGLHLNSVQVHTS 82

Query: 1630 PIISLDINLALTVKVRNKDLYSLDYKSLVVSIGYRGKQLGLVRSNHGHVRARGSSYVSAT 1451
            P+ISLD++L LT++VRN+D +S  Y SL  S+GYRG++LG V S+ G++RARGSSY++AT
Sbjct: 83   PVISLDLSLDLTIRVRNRDFFSFSYTSLTASVGYRGRRLGFVNSSGGYLRARGSSYINAT 142

Query: 1450 LSLSGVEILADVIPLLEDVTRGEITFDTVSKIDGQLGLAFFELPLEGKVSCEIIVDIHNQ 1271
            L L G+E+L DV  LLED+ RG I FDTVS++ G+LGL FFE+PL+ +VSCE+ V+  NQ
Sbjct: 143  LDLDGIEVLHDVFYLLEDLARGSIPFDTVSEVRGKLGLFFFEIPLKARVSCEVYVNTSNQ 202

Query: 1270 TIEHQNCYP 1244
            TI HQ+CYP
Sbjct: 203  TIIHQDCYP 211

>emb|CBI28084.3| unnamed protein product [Vitis vinifera]
          Length = 218

 Score = 202 bits (514), Expect = 4e-49
 Identities = 96/189 (50%), Positives = 134/189 (70%)
 Frame = -3

Query: 1810 VIVLPHYYNPAXXXXXXXXXXXXCFASLLLLAGAIFFLWPSDPDVSIVRLHLHHFRIRTF 1631
            V++LP YY                 A  L ++ A++ L+PSDP V +V LHL+  ++ T
Sbjct: 23   VVLLPVYYPRRRRLLYRLCNAFLACAVFLSISAAVYLLYPSDPTVQVVGLHLNSVQVHTS 82

Query: 1630 PIISLDINLALTVKVRNKDLYSLDYKSLVVSIGYRGKQLGLVRSNHGHVRARGSSYVSAT 1451
            P+ISLD++L LT++VRN+D +S  Y SL  S+GYRG++LG V S+ G++RARGSSY++AT
Sbjct: 83   PVISLDLSLDLTIRVRNRDFFSFSYTSLTASVGYRGRRLGFVNSSGGYLRARGSSYINAT 142

Query: 1450 LSLSGVEILADVIPLLEDVTRGEITFDTVSKIDGQLGLAFFELPLEGKVSCEIIVDIHNQ 1271
            L L G+E+L DV  LLED+ RG I FDTVS++ G+LGL FFE+PL+ +VSCE+ V+  NQ
Sbjct: 143  LDLDGIEVLHDVFYLLEDLARGSIPFDTVSEVRGKLGLFFFEIPLKARVSCEVYVNTSNQ 202

Query: 1270 TIEHQNCYP 1244
            TI HQ+CYP
Sbjct: 203  TIIHQDCYP 211

>ref|XP_006427015.1| hypothetical protein CICLE_v10026444mg [Citrus clementina]
 gi|557529005|gb|ESR40255.1| hypothetical protein CICLE_v10026444mg [Citrus clementina]
          Length = 219

 Score = 201 bits (510), Expect = 1e-48
 Identities = 96/198 (48%), Positives = 138/198 (69%), Gaps = 2/198 (1%)
 Frame = -3

Query: 1828 EPQPQYVIVLPHYY--NPAXXXXXXXXXXXXCFASLLLLAGAIFFLWPSDPDVSIVRLHL 1655
            +PQ +   +LP+YY  NP                SL+LLA  ++  WPS+P++ I +LHL
Sbjct: 27   QPQDENYTILPYYYLANPRRNWCATIAI------SLILLAALLYVFWPSEPELKIEKLHL 80

Query: 1654 HHFRIRTFPIISLDINLALTVKVRNKDLYSLDYKSLVVSIGYRGKQLGLVRSNHGHVRAR 1475
             HF +R  P I +DI+L +T+KV N+D+YS++YKSL VS+GYRG++LG V+SNHG V+A
Sbjct: 81   AHFHVRMKPAICIDISLNVTLKVHNRDVYSVNYKSLDVSVGYRGRKLGHVKSNHGRVKAL 140

Query: 1474 GSSYVSATLSLSGVEILADVIPLLEDVTRGEITFDTVSKIDGQLGLAFFELPLEGKVSCE 1295
             SSY+ A L L  V++L+DV+ LLED+ RG + FDT++K+ G LGL F E PLE +VSCE
Sbjct: 141  ASSYIDAELQLKCVKVLSDVVYLLEDLARGTVPFDTITKVTGHLGLFFLEFPLEARVSCE 200

Query: 1294 IIVDIHNQTIEHQNCYPQ 1241
            ++++  +QTI  QNCYP+
Sbjct: 201  VLINTTSQTIARQNCYPK 218

>ref|XP_006465533.1| PREDICTED: uncharacterized protein LOC102615257 [Citrus sinensis]
          Length = 219

 Score = 200 bits (509), Expect = 2e-48
 Identities = 96/198 (48%), Positives = 138/198 (69%), Gaps = 2/198 (1%)
 Frame = -3

Query: 1828 EPQPQYVIVLPHYY--NPAXXXXXXXXXXXXCFASLLLLAGAIFFLWPSDPDVSIVRLHL 1655
            +PQ +   +LP+YY  NP                SL+LLA  ++  WPS+P++ I RLHL
Sbjct: 27   QPQDENYTILPYYYLENPRRNWYATIAI------SLILLAALLYVFWPSEPELKIERLHL 80

Query: 1654 HHFRIRTFPIISLDINLALTVKVRNKDLYSLDYKSLVVSIGYRGKQLGLVRSNHGHVRAR 1475
             HF +R  P I +DI+L +T+KV N+D+YS++YKSL VS+GYRG++LG V+SNHG V+A
Sbjct: 81   AHFHVRMKPAICIDISLNVTLKVHNRDVYSVNYKSLDVSVGYRGRKLGHVKSNHGRVKAL 140

Query: 1474 GSSYVSATLSLSGVEILADVIPLLEDVTRGEITFDTVSKIDGQLGLAFFELPLEGKVSCE 1295
             SS++ A L L  V++L+DV+ LLED+ RG + FDT++K+ G LGL F E PLE +VSCE
Sbjct: 141  ASSFIDAELQLKCVKVLSDVVYLLEDLARGTVPFDTITKVTGHLGLFFLEFPLEARVSCE 200

Query: 1294 IIVDIHNQTIEHQNCYPQ 1241
            ++++  +QTI  QNCYP+
Sbjct: 201  VLINTTSQTIARQNCYPK 218

>gb|EXB37026.1| hypothetical protein L484_020812 [Morus notabilis]
          Length = 250

 Score = 199 bits (507), Expect = 3e-48
 Identities = 98/193 (50%), Positives = 137/193 (70%), Gaps = 1/193 (0%)
 Frame = -3

Query: 1816 QYVIVLPHYY-NPAXXXXXXXXXXXXCFASLLLLAGAIFFLWPSDPDVSIVRLHLHHFRI 1640
            Q V+VLP+Y  +P+              A++LLL  A+F L+PSDP + +VR+HL+  R+
Sbjct: 24   QNVVVLPYYRPSPSKRRSRRLCRCLLASAAVLLLIAAVFILYPSDPSLQLVRVHLNRVRV 83

Query: 1639 RTFPIISLDINLALTVKVRNKDLYSLDYKSLVVSIGYRGKQLGLVRSNHGHVRARGSSYV 1460
             + P ++LD++  LTVKV N+D +SLDY SL VS+GYRG++LG V S+ G +RARGSSYV
Sbjct: 84   NSSPDLTLDLSFFLTVKVFNRDFFSLDYDSLAVSVGYRGRELGFVNSDGGKIRARGSSYV 143

Query: 1459 SATLSLSGVEILADVIPLLEDVTRGEITFDTVSKIDGQLGLAFFELPLEGKVSCEIIVDI 1280
             ATL L+G  I+ DV  LLED+ RG I FDTV+K++G LGL  F++PL+  VSCE+ V+
Sbjct: 144  DATLDLNGFAIIQDVFYLLEDLARGVIPFDTVTKVEGNLGLFLFKIPLKASVSCEVYVNT 203

Query: 1279 HNQTIEHQNCYPQ 1241
            +NQTI  Q+CYP+
Sbjct: 204  NNQTIARQDCYPE 216

>gb|EOY30818.1| Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family isoform 1 [Theobroma cacao]
          Length = 214

 Score = 199 bits (506), Expect = 4e-48
 Identities = 96/195 (49%), Positives = 137/195 (70%)
 Frame = -3

Query: 1825 PQPQYVIVLPHYYNPAXXXXXXXXXXXXCFASLLLLAGAIFFLWPSDPDVSIVRLHLHHF 1646
            P  Q VIVLP YY+                  ++LL+ A+FFL+PSDP + +VRL L+H
Sbjct: 20   PNQQNVIVLPVYYSRPNQNYRCLRRCLIFTGIVVLLSAAVFFLYPSDPTLQLVRLQLNHV 79

Query: 1645 RIRTFPIISLDINLALTVKVRNKDLYSLDYKSLVVSIGYRGKQLGLVRSNHGHVRARGSS 1466
            R+ + P ++LD++ +LT++VRN+D +SLDY  LVVS+GYRG++LG+V S  G VRARGSS
Sbjct: 80   RVNSSPALTLDLSFSLTIRVRNRDFFSLDYDKLVVSVGYRGRELGVVSSEGGRVRARGSS 139

Query: 1465 YVSATLSLSGVEILADVIPLLEDVTRGEITFDTVSKIDGQLGLAFFELPLEGKVSCEIIV 1286
            YV+ATL L+G E++ DVI L+ D  +G I FDT +K+DG LGL  F+ P++ +VSCE+ V
Sbjct: 140  YVNATLDLNGFEVVHDVIYLIADWAKGVIPFDTNTKVDGDLGLFLFKAPIKAEVSCEVYV 199

Query: 1285 DIHNQTIEHQNCYPQ 1241
            + +NQTI  Q+CY +
Sbjct: 200  NTNNQTIVRQDCYAE 214

>ref|XP_002279706.1| PREDICTED: uncharacterized protein LOC100258307 [Vitis vinifera]
          Length = 237

 Score = 198 bits (503), Expect = 8e-48
 Identities = 95/165 (57%), Positives = 126/165 (76%)
 Frame = -3

Query: 1735 ASLLLLAGAIFFLWPSDPDVSIVRLHLHHFRIRTFPIISLDINLALTVKVRNKDLYSLDY 1556
            AS+LLL  + F LWPSDPDVSIVRL L    + TFP +SLD++++L VKVRN DLYS++Y
Sbjct: 73   ASVLLLVASTFVLWPSDPDVSIVRLRLRRIAVHTFPRLSLDVSMSLMVKVRNVDLYSMNY 132

Query: 1555 KSLVVSIGYRGKQLGLVRSNHGHVRARGSSYVSATLSLSGVEILADVIPLLEDVTRGEIT 1376
            +SL V+I YRGK+LG V S  GHVRARGSS V A+L L+GV +L+DVI +LED+ +G I
Sbjct: 133  RSLHVAIEYRGKELGNVTSEEGHVRARGSSLVDASLELNGVAVLSDVIFVLEDLAKGTIP 192

Query: 1375 FDTVSKIDGQLGLAFFELPLEGKVSCEIIVDIHNQTIEHQNCYPQ 1241
             DTV+++ G +G  FF+LPL  KVSC++ V+ + Q + HQNCYP+
Sbjct: 193  IDTVTEVRGSMGFLFFQLPLRTKVSCQVYVNTNTQKVLHQNCYPE 237

>ref|XP_004287097.1| PREDICTED: uncharacterized protein LOC101303354 [Fragaria vesca subsp. vesca]
          Length = 218

 Score = 197 bits (501), Expect = 1e-47
 Identities = 96/198 (48%), Positives = 140/198 (70%), Gaps = 2/198 (1%)
 Frame = -3

Query: 1828 EPQPQYVIVLPHY--YNPAXXXXXXXXXXXXCFASLLLLAGAIFFLWPSDPDVSIVRLHL 1655
            +PQ Q+V+VL HY  Y                 A+ L ++ A+FFL+PSDP +S+ R+ L
Sbjct: 22   QPQ-QHVVVLTHYRPYRDDYLERRRFRLCVSTTAAFLFISAAVFFLFPSDPALSLARIQL 80

Query: 1654 HHFRIRTFPIISLDINLALTVKVRNKDLYSLDYKSLVVSIGYRGKQLGLVRSNHGHVRAR 1475
            +H  + + P ++LD + +LT+KVRN+D +SL+Y SLVV IGYRG++LG V S+ G VRAR
Sbjct: 81   NHVGVHSSPKLTLDASFSLTIKVRNRDFFSLEYDSLVVKIGYRGRELGFVSSDGGRVRAR 140

Query: 1474 GSSYVSATLSLSGVEILADVIPLLEDVTRGEITFDTVSKIDGQLGLAFFELPLEGKVSCE 1295
            GSSYV+ATL + G+E++ DV  LLED+ RGEI FDT S +DG +GL FF +P++G+ SCE
Sbjct: 141  GSSYVNATLVVDGLEVIHDVFYLLEDLARGEIPFDTDSVVDGTVGLFFFRIPIKGRASCE 200

Query: 1294 IIVDIHNQTIEHQNCYPQ 1241
            + V+ ++QT+  Q+CYP+
Sbjct: 201  VYVNTNDQTVVRQDCYPE 218

>ref|XP_002331158.1| predicted protein [Populus trichocarpa]
 gi|566176693|ref|XP_006381711.1| hypothetical protein POPTR_0006s16220g [Populus trichocarpa]
 gi|550336462|gb|ERP59508.1| hypothetical protein POPTR_0006s16220g [Populus trichocarpa]
          Length = 210

 Score = 197 bits (500), Expect = 2e-47
 Identities = 91/193 (47%), Positives = 136/193 (70%)
 Frame = -3

Query: 1819 PQYVIVLPHYYNPAXXXXXXXXXXXXCFASLLLLAGAIFFLWPSDPDVSIVRLHLHHFRI 1640
            PQ VIVL +Y+ P                ++LLL+ A + L+PSDP + + R+ L+H R+
Sbjct: 21   PQNVIVLSYYHRPPNHILRRCLLFT---TAILLLSAAAYLLYPSDPAIQLSRIKLNHIRV 77

Query: 1639 RTFPIISLDINLALTVKVRNKDLYSLDYKSLVVSIGYRGKQLGLVRSNHGHVRARGSSYV 1460
             + P ++LD++ +LT+KV N+D +SLDY SLVVS+GYRG++LG V S  G +RAR SSYV
Sbjct: 78   NSSPELTLDVSFSLTIKVENRDFFSLDYDSLVVSVGYRGRELGFVNSKGGKIRARRSSYV 137

Query: 1459 SATLSLSGVEILADVIPLLEDVTRGEITFDTVSKIDGQLGLAFFELPLEGKVSCEIIVDI 1280
             A L L+G+E++ DV  L++D+ RG I FDT +++ G LGL  F++P+ G+VSC++ V+
Sbjct: 138  DARLDLNGLEVIKDVFYLIQDLARGVIIFDTDTQVKGDLGLLLFKIPINGRVSCQVFVNT 197

Query: 1279 HNQTIEHQNCYPQ 1241
            +NQT+EHQ+CYPQ
Sbjct: 198  NNQTVEHQDCYPQ 210

>gb|EOY26762.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein isoform 4 [Theobroma cacao]
          Length = 167

 Score = 196 bits (498), Expect = 3e-47
 Identities = 89/164 (54%), Positives = 128/164 (78%)
 Frame = -3

Query: 1735 ASLLLLAGAIFFLWPSDPDVSIVRLHLHHFRIRTFPIISLDINLALTVKVRNKDLYSLDY 1556
            A L+LLA +++  WPS P+V IVR+H+   ++ T PII+LDI+L +T+KVRN D+YS+D+
Sbjct: 2    AFLVLLAASVYIFWPSQPEVKIVRMHVKRMQMHTVPIIALDISLLVTLKVRNSDVYSMDF 61

Query: 1555 KSLVVSIGYRGKQLGLVRSNHGHVRARGSSYVSATLSLSGVEILADVIPLLEDVTRGEIT 1376
             SL +++GYRGK LG V+S H H+RA GSSY+ A L L+GVE+L+DV+ +LED+ RG +
Sbjct: 62   TSLDMAVGYRGKMLGHVKSEHDHLRAWGSSYLQAELELNGVEVLSDVVYMLEDLARGTVP 121

Query: 1375 FDTVSKIDGQLGLAFFELPLEGKVSCEIIVDIHNQTIEHQNCYP 1244
            FDT++++ G LGL+ F+ PL+ K+SCEI+V+  NQ I HQNCYP
Sbjct: 122  FDTITEVAGWLGLSLFKFPLKVKISCEIVVNRTNQIIIHQNCYP 165

>ref|XP_002863251.1| hypothetical protein ARALYDRAFT_497058 [Arabidopsis lyrata subsp. lyrata]
 gi|297309085|gb|EFH39510.1| hypothetical protein ARALYDRAFT_497058 [Arabidopsis lyrata subsp. lyrata]
          Length = 224

 Score = 196 bits (498), Expect = 3e-47
 Identities = 88/198 (44%), Positives = 142/198 (71%)
 Frame = -3

Query: 1840 VPRREPQPQYVIVLPHYYNPAXXXXXXXXXXXXCFASLLLLAGAIFFLWPSDPDVSIVRL 1661
            +P  +P    +++ P+  N                A++LLL+ A++ L+PSDPD+++ R+
Sbjct: 16   LPSSQPSQTVILLTPYRRNRYPSIFRNLRCSLLFTAAILLLSAAVYLLYPSDPDITVSRI 75

Query: 1660 HLHHFRIRTFPIISLDINLALTVKVRNKDLYSLDYKSLVVSIGYRGKQLGLVRSNHGHVR 1481
            +L+H  +     I+LD++ +LT+KVRN+D +SLDY SLVVSIGYRG++LGLV+S  GH++
Sbjct: 76   NLNHISVVDSHKIALDLSFSLTIKVRNRDFFSLDYDSLVVSIGYRGRELGLVKSKGGHLK 135

Query: 1480 ARGSSYVSATLSLSGVEILADVIPLLEDVTRGEITFDTVSKIDGQLGLAFFELPLEGKVS 1301
            AR SSY++ATL L G+E++ DVI L+ D+ +G I FDT++++ G LG+  F++P++GKVS
Sbjct: 136  ARDSSYINATLELDGLEVVHDVIYLIGDLAKGVIPFDTIAQVKGDLGVLLFQIPIQGKVS 195

Query: 1300 CEIIVDIHNQTIEHQNCY 1247
            CE+ V+++NQ I HQ+C+
Sbjct: 196  CEVYVNVNNQKISHQDCH 213

>ref|XP_006414934.1| hypothetical protein EUTSA_v10026222mg [Eutrema salsugineum]
 gi|557116104|gb|ESQ56387.1| hypothetical protein EUTSA_v10026222mg [Eutrema salsugineum]
          Length = 213

 Score = 196 bits (497), Expect = 4e-47
 Identities = 89/198 (44%), Positives = 141/198 (71%)
 Frame = -3

Query: 1840 VPRREPQPQYVIVLPHYYNPAXXXXXXXXXXXXCFASLLLLAGAIFFLWPSDPDVSIVRL 1661
            +P  +P    +++ P+  N                A++LLL+ A++FL+PSDPD+++ R+
Sbjct: 14   LPSSQPSQTVIVLTPYRRNRRPSFLRNLRCSLLFAAAILLLSAAVYFLYPSDPDINLSRI 73

Query: 1660 HLHHFRIRTFPIISLDINLALTVKVRNKDLYSLDYKSLVVSIGYRGKQLGLVRSNHGHVR 1481
             L+H R+      +LD++ +LT+KVRN+D +S DY SLVVSIGYRG++LGLV+S  GH+R
Sbjct: 74   QLNHIRVLDSLKPALDLSFSLTIKVRNRDFFSFDYDSLVVSIGYRGRELGLVKSRGGHLR 133

Query: 1480 ARGSSYVSATLSLSGVEILADVIPLLEDVTRGEITFDTVSKIDGQLGLAFFELPLEGKVS 1301
            AR SSYV+ATL L G+E++ DVI L+ D+ +G I FDT++++ G LG+  F++P+EG+VS
Sbjct: 134  ARDSSYVNATLELDGLEVVHDVIYLIGDLAKGVIPFDTIAQVTGDLGVLLFQIPVEGEVS 193

Query: 1300 CEIIVDIHNQTIEHQNCY 1247
            CE+ V++++Q I HQ+C+
Sbjct: 194  CEVFVNVNSQKISHQDCH 211

>ref|XP_002304190.2| hypothetical protein POPTR_0003s05380g, partial [Populus trichocarpa]
 gi|550342460|gb|EEE79169.2| hypothetical protein POPTR_0003s05380g, partial [Populus trichocarpa]
          Length = 171

 Score = 194 bits (494), Expect = 9e-47
 Identities = 88/166 (53%), Positives = 126/166 (75%)
 Frame = -3

Query: 1738 FASLLLLAGAIFFLWPSDPDVSIVRLHLHHFRIRTFPIISLDINLALTVKVRNKDLYSLD 1559
            F  LLLL+  ++  WPSDP + +V L L   RI T PII++D++L +T++VRN D+YS+D
Sbjct: 6    FLLLLLLSALVYVFWPSDPMIKVVGLRLDKIRIHTLPIINIDLSLYVTLRVRNVDVYSMD 65

Query: 1558 YKSLVVSIGYRGKQLGLVRSNHGHVRARGSSYVSATLSLSGVEILADVIPLLEDVTRGEI 1379
            ++SL V++ Y+GK+LG VRS+HGHVRA GSSYV A + L G+ +L+ V+ LLED+ RG +
Sbjct: 66   FRSLDVAVRYKGKRLGHVRSDHGHVRALGSSYVDAEIDLRGISVLSGVVSLLEDLGRGTV 125

Query: 1378 TFDTVSKIDGQLGLAFFELPLEGKVSCEIIVDIHNQTIEHQNCYPQ 1241
             FDTV+++ G+LGL FF  PL+ +VSCE++V+ HNQTI  Q CYP+
Sbjct: 126  PFDTVTEVSGKLGLLFFGFPLKARVSCEVLVNTHNQTIVRQTCYPE 171

>ref|XP_002299644.2| hypothetical protein POPTR_0001s18090g [Populus trichocarpa]
 gi|550347583|gb|EEE84449.2| hypothetical protein POPTR_0001s18090g [Populus trichocarpa]
          Length = 220

 Score = 193 bits (491), Expect = 2e-46
 Identities = 92/189 (48%), Positives = 133/189 (70%)
 Frame = -3

Query: 1807 IVLPHYYNPAXXXXXXXXXXXXCFASLLLLAGAIFFLWPSDPDVSIVRLHLHHFRIRTFP 1628
            +VLP Y +P                 LLLL+  ++  WPSDP V +VRL L+   I T P
Sbjct: 32   VVLPFYRHPTTQDCRRWPMIIAIIF-LLLLSTLVYVFWPSDPTVKVVRLRLNKIHIHTLP 90

Query: 1627 IISLDINLALTVKVRNKDLYSLDYKSLVVSIGYRGKQLGLVRSNHGHVRARGSSYVSATL 1448
            II++DI+L +++KVRN D+YS+D++SL V++ YRGK+LG VRS+HGHVRA GSSYV A +
Sbjct: 91   IINIDISLYVSLKVRNVDVYSMDFRSLDVAVKYRGKRLGHVRSDHGHVRALGSSYVHAGV 150

Query: 1447 SLSGVEILADVIPLLEDVTRGEITFDTVSKIDGQLGLAFFELPLEGKVSCEIIVDIHNQT 1268
              SG+ +L+DV+ LL+D+ RG + FDTV+++ G+LGL FF  P++ K+ C ++V+I+NQT
Sbjct: 151  DFSGISVLSDVVSLLDDLARGTVPFDTVTEVSGRLGLLFFGFPMKAKLFCAVLVNINNQT 210

Query: 1267 IEHQNCYPQ 1241
            I  Q CYP+
Sbjct: 211  IVRQTCYPE 219

>ref|NP_193063.2| late embryogenesis abundant hydroxyproline-rich glycoprotein [Arabidopsis thaliana]
 gi|62867635|gb|AAY17421.1| At4g13270 [Arabidopsis thaliana]
 gi|66841356|gb|AAY57315.1| At4g13270 [Arabidopsis thaliana]
 gi|332657857|gb|AEE83257.1| late embryogenesis abundant hydroxyproline-rich glycoprotein [Arabidopsis thaliana]
          Length = 215

 Score = 193 bits (490), Expect = 3e-46
 Identities = 87/198 (43%), Positives = 139/198 (70%)
 Frame = -3

Query: 1840 VPRREPQPQYVIVLPHYYNPAXXXXXXXXXXXXCFASLLLLAGAIFFLWPSDPDVSIVRL 1661
            +P  +P    +++ P+  +                A +LLL+ A++ L+PSDPD+++ R+
Sbjct: 16   LPSSQPSQSVILLTPYRRHRRPSLLRNLRCSLLFTAVILLLSAAVYLLYPSDPDITVSRI 75

Query: 1660 HLHHFRIRTFPIISLDINLALTVKVRNKDLYSLDYKSLVVSIGYRGKQLGLVRSNHGHVR 1481
            +L+H  +     I+LD++ +LT+KVRN+D +SLDY SLVVSIGYRG++LGLV+S  GH++
Sbjct: 76   NLNHISVVDSHKIALDLSFSLTIKVRNRDFFSLDYDSLVVSIGYRGRELGLVKSKGGHLK 135

Query: 1480 ARGSSYVSATLSLSGVEILADVIPLLEDVTRGEITFDTVSKIDGQLGLAFFELPLEGKVS 1301
            AR SSY+ ATL L G+E++ DVI L+ D+ +G I FDT++++ G LG+  F +P++GKVS
Sbjct: 136  ARDSSYIDATLELDGLEVVHDVIYLIGDLAKGVIPFDTIAQVQGDLGVLLFNIPIQGKVS
 195

Query: 1300 CEIIVDIHNQTIEHQNCY 1247
            CE+ V+++NQ I HQ+C+
Sbjct: 196  CEVYVNVNNQKISHQDCH 213