BLASTX nr result
ID: Rauwolfia21_contig00031643
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00031643
         (2066 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EMJ16210.1| hypothetical protein PRUPE_ppa002494mg [Prunus pe...   192   4e-46
gb|EOY27178.1| Hydroxyproline-rich glycoprotein family protein, ...   190   2e-45
ref|XP_006426604.1| hypothetical protein CICLE_v10025206mg [Citr...   172   5e-40
emb|CBI35923.3| unnamed protein product [Vitis vinifera]              171   9e-40
ref|XP_002521366.1| conserved hypothetical protein [Ricinus comm...   156   3e-35
ref|XP_002329058.1| predicted protein [Populus trichocarpa] gi|5...   133   3e-28
gb|EXB29688.1| hypothetical protein L484_013462 [Morus notabilis]     125   7e-26
ref|XP_006390619.1| hypothetical protein EUTSA_v10019633mg [Eutr...   123   3e-25
ref|XP_006302100.1| hypothetical protein CARUB_v10020090mg [Caps...   121   1e-24
ref|NP_177422.1| hydroxyproline-rich glycoprotein family protein...   118   1e-23
gb|AAM13859.1| unknown protein [Arabidopsis thaliana]                 114   1e-22
ref|XP_002887451.1| hypothetical protein ARALYDRAFT_476416 [Arab...   114   2e-22
gb|AAL50061.1| At1g72790/F28P22_2 [Arabidopsis thaliana] gi|1954...   114   2e-22
ref|XP_002279061.1| PREDICTED: uncharacterized protein LOC100261...   108   1e-20
gb|EXB38898.1| hypothetical protein L484_027333 [Morus notabilis]      96   5e-17
ref|NP_200517.2| hydroxyproline-rich glycoprotein family protein...    92   7e-16
dbj|BAA97357.1| unnamed protein product [Arabidopsis thaliana]         92   9e-16
ref|XP_004288965.1| PREDICTED: uncharacterized protein LOC101306...    90   4e-15
ref|XP_002864497.1| hypothetical protein ARALYDRAFT_495801 [Arab...
88 1e-14 ref|XP_006401247.1| hypothetical protein EUTSA_v10013114mg [Eutr... 88 2e-14 >gb|EMJ16210.1| hypothetical protein PRUPE_ppa002494mg [Prunus persica] Length = 666 Score = 192 bits (489), Expect = 4e-46 Identities = 177/624 (28%), Positives = 259/624 (41%), Gaps = 32/624 (5%) Frame = +1 Query: 106 PFIKSETRK*KPIYMEEDGEDFTPFWLQ-STAKRRRTDRVRDXXXXXXXXXXXXXXXXXV 282 P ++++T++ P MEE + PFWLQ S + R+ R+R Sbjct: 70 PKVENQTKQ-NPKTMEEKEDMLPPFWLQPSDSFRQANRRLRRSSSSVFFSSGAFILALLA 128 Query: 283 TAASFLVFVVPSTLSFSAQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSRATNQDRDSDEY 462 A F+ F++PS LSF++QIFRP++VKKSWDS SR TN D + Sbjct: 129 IALVFIFFIIPSVLSFTSQIFRPHSVKKSWDSLNLVLVLFAIVCGFLSRNTNNDGN---- 184 Query: 463 QTSPVSRDDVQKSSPSTPQQKWYNYSTDEDQKSNLSAFQQWYG-YSDQMADNQVNVSRGG 639 +SP S D V Q +N S+ + KSN S +QW+ YSD+ NQ + S Sbjct: 185 LSSPSSYDQVH-------NQTVFNSSSPQAPKSNPSTPRQWFDQYSDRTGYNQSSSSTSA 237 Query: 640 LR----RTSSSYPDLLEASSQFTSGDDPWRFYDDMTVDTYRVSETGQL--LRRRSWKDKF 801 RTSSSYPDL + + + + DD WRFYDD V YRVS + L R RSW ++ Sbjct: 238 AMNRGVRTSSSYPDLRQQEASWVARDDRWRFYDDTHVVNYRVSGSDPLHHRRHRSWHEES 297 Query: 802 ------EVSRVVESKNIHVDTFVNRPEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXMPKE 963 E + V++K I VDTF R E+ Sbjct: 298 VQLPVEEEAEQVQTKTIEVDTFAIRTEQPSPPRRSPSSSQPPPPTI-------------R 344 Query: 964 KPKRVHRSLAHK----GERSKKKDNELENREPISSPVTXXXXXXXXXXXXRFVDQKIGKT 1131 K KR ++++ K +S ++++ E ++ + P + + GK Sbjct: 345 KSKRTYQAIGEKENSGSTQSLERNDNFEAKKNLPPPPARPPPSPPSPPPR--ISKSAGKD 402 Query: 1132 DKKRSGGSATKDFLNSLYHXXXXXXXXXSIENLDSLLHLSQTPPLQFQXXXXXXXXXXXX 1311 KKR G + TK+FL + S+EN +SLL + + P + Sbjct: 403 VKKR-GVATTKEFLITSLRRKKKKQRQKSVENFESLLASASSAP--YSLLPPPSPPPPPP 459 Query: 1312 XXXXXXXVFHNLFPXXXXXXXXXDLDISASTFSQPKPPTPTPAIHSKPSTSEIIK----- 1476 VFHNLF + S P+PP P P + + ST+++ K Sbjct: 460 PLPPPPSVFHNLFSTKKSNKPRKTMQ------SIPQPP-PPPPVAATTSTAQLSKTKAQM 512 Query: 1477 ---VSTHKSPKPVKIRSFDSVXXXXXXXXXXXXXXXXXXXXXXXXFTKSLSWKFVVQGDY 1647 ++T K P PVK+ +F + F + KFVV GD+ Sbjct: 513 RPMMTTQKPPLPVKMSTFINGDDENTNSGGESPLARIPPPPPLPPF-RMPEMKFVVHGDF 571 Query: 
1648 XXXXXXXXXXXXXPDIDDAESD----GTPTATDGEDAV--AXXXXXXXXXXXDVNTKAEN 1809 PD+DD + +PT + DVNTKA+ Sbjct: 572 VRIKSNNSSRSGSPDLDDGDDPDSAVSSPTTETNRTPLESGESPKAMFCPSPDVNTKADT 631 Query: 1810 FITKFRAGLKLEKINSMNKRQGLG 1881 FI +FRAGL+LEK+NS+ R LG Sbjct: 632 FIARFRAGLRLEKMNSVRGRSNLG 655 >gb|EOY27178.1| Hydroxyproline-rich glycoprotein family protein, putative [Theobroma cacao] Length = 553 Score = 190 bits (482), Expect = 2e-45 Identities = 186/601 (30%), Positives = 252/601 (41%), Gaps = 23/601 (3%) Frame = +1 Query: 148 MEEDGEDFTPFWLQSTAKRRRTDRVRDXXXXXXXXXXXXXXXXXVTAASFLVFVVPSTLS 327 MEED ED TPFWLQ TA RR R R V A +F+ ++PS LS Sbjct: 1 MEED-EDVTPFWLQ-TADNRRIRRRRQPSSLFFNTGILIILLL-VVALAFIFVIIPSFLS 57 Query: 328 FSAQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSRATNQDRDSD--------EYQTSPV-S 480 F++QIF+P+ VKKSWDS + N + DSD ++ T+P Sbjct: 58 FTSQIFKPHLVKKSWDSLNLVLVLFAIICGFLGK-NNGNNDSDTRSTYEDYKFSTTPKHD 116 Query: 481 RDDVQKSSPSTPQQKWYNYSTDEDQKSNLSAFQQWYGYSDQMADNQVNVSRGGLRRTSSS 660 RD V +S+PSTP+Q WY+YS+ SD+ A N + R+S+S Sbjct: 117 RDHVGRSNPSTPRQ-WYDYSSS----------------SDRTAYNSLQ-----RLRSSNS 154 Query: 661 YPDLLEASSQFTSGDDPWRFYDDMTVDTYRVSETGQLLRRRSWKDKFEVSRVVESKNIHV 840 YPDL SS +GDD WRFYDD + YR R R D+ EV +K+I V Sbjct: 155 YPDLRPESSWMMNGDDRWRFYDDTPLYNYR-------SRSRREHDREEVYS-NNTKDIAV 206 Query: 841 DTFVNRPEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXMPKEKPKRVHRSLAHKGERSKKK 1020 DT V+RP + + KPKR + L K ERS++K Sbjct: 207 DT-VHRPPPPPSSSPPPAATASSPSPPQSPPPQPPKVV-RRKPKRTYEDLKPK-ERSERK 263 Query: 1021 D---NELENRE--PISSPVTXXXXXXXXXXXXRFVDQKIGKTDKKRSGGSATKDFLNSLY 1185 + +EL+ + P + P +++ K++KKR G TKDFL SL Sbjct: 264 EVINSELKIKHSLPSTPPAARPPPPPPPPPPPSVFEKRSNKSEKKR--GGVTKDFLISL- 320 Query: 1186 HXXXXXXXXXSIENLDSLLHLSQTPPLQFQXXXXXXXXXXXXXXXXXXXVFHNLFPXXXX 1365 S+ENLD LS P + N+FP Sbjct: 321 RRKKKKQRQRSVENLDEFFKLSTLP-----LYPPASPPPPPPPPPPLPSFYQNIFPSKKN 375 Query: 1366 XXXXXDLDISASTFSQPKPPTPTPAIHSKPS--TSEIIKVSTHKSPKPVKIRSFDSVXXX 1539 + P PP P P++ ++ S S+ V+T K P PVKIR+ +V Sbjct: 376 KAR------KNHSVPPPPPPPPLPSVEARASKRESQTPPVTTQKPPLPVKIRNMHNVEES 
429 Query: 1540 XXXXXXXXXXXXXXXXXXXXXFTKSLSWKFVVQGDYXXXXXXXXXXXXXPDIDD------ 1701 K +WKF V GD+ PD+DD Sbjct: 430 VESGNESPLNPIPPPPPPPPF--KMPAWKFEVHGDFVRLKSIRSSRSGSPDLDDPLSCEA 487 Query: 1702 AESDGTPTA-TDGEDAVAXXXXXXXXXXXDVNTKAENFITKFRAGLKLEKINSMNKRQGL 1878 + SDG T DG ++ DV+TKA+NFI +FRAGLKLEK+NS+ R L Sbjct: 488 SPSDGNKTGEMDGGESTT---GPLFCPSPDVDTKADNFIARFRAGLKLEKMNSVRGRSNL 544 Query: 1879 G 1881 G Sbjct: 545 G 545 >ref|XP_006426604.1| hypothetical protein CICLE_v10025206mg [Citrus clementina] gi|557528594|gb|ESR39844.1| hypothetical protein CICLE_v10025206mg [Citrus clementina] Length = 602 Score = 172 bits (436), Expect = 5e-40 Identities = 171/627 (27%), Positives = 245/627 (39%), Gaps = 44/627 (7%) Frame = +1 Query: 148 MEEDGEDFTPFWLQSTAKRRRTDRVRDXXXXXXXXXXXXXXXXXVTAASFLVFVVPSTLS 327 MEE+ + +PFWLQST R+ R R A +F+ V+PS S Sbjct: 1 MEEEEDASSPFWLQST--RQAGHRRRRSSSSFLFNSGALLVFLLAVAVAFIFIVIPSIQS 58 Query: 328 FSAQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSRATNQDR-----------------DSD 456 F++QIF+P+ VK+SWDS SR N + + + Sbjct: 59 FTSQIFKPHAVKRSWDSLNLVLVLFAIICGFLSRNHNNNESIATTPTTTSAAASASYEDE 118 Query: 457 EYQTSPVSRDDVQKSSPSTPQQKWYNYSTDEDQKSNLSAFQQWYGYSDQMADNQVNVSRG 636 EY+ S + QKS+P TP++ WY+++ + +N N N +RG Sbjct: 119 EYKFENRS-ESFQKSNPETPRRFWYDHAYSNNNNNN----------------NNDNNNRG 161 Query: 637 GLR-RTSSSYPDLLEASSQFTSGDDPWRFYDDMTVDTYRVSETGQLLRRRSW-------- 789 R R+ SS+PDL + + + + +D WRFYDD + R S + W Sbjct: 162 LSRLRSFSSHPDLRQEEALWANVEDQWRFYDDTHLYHNRFSSF-DYKQLHQWQPQPQPQP 220 Query: 790 --------KDKFEVSRVVESKNIHVDTFVN------RPEKXXXXXXXXXXXXXXXXXXXX 927 K+K E +V KN+ D + R E Sbjct: 221 KLEVLEEEKEKKENEKVDAVKNVDADNTTSSTVEESRKEIIYTPPQPPPASPSPEPAELP 280 Query: 928 XXXXXXXXMPKEKPKRVHRSL--AHKGERSKKKDNELENREPISSPVTXXXXXXXXXXXX 1101 +P + K V R + H+G ++N+LE + P+ P Sbjct: 281 PSPSPPPPLPPPQTKVVRRRVKRTHQGNGGNYRNNDLEVK-PLPPPPPQLQPPPLPAPPE 339 Query: 1102 RFVDQKIGKTDKKRSGGSATKDFLNSLYHXXXXXXXXXSIENLDSLLHLSQTPPLQFQXX 1281 V++ GK++KKR G SATK+FL SL S+ENLDS + + PL Sbjct: 340 
TEVEESGGKSEKKRGGTSATKEFLTSLRRKKKKKQRQKSVENLDSFFNYESSYPLP-PSL 398 Query: 1282 XXXXXXXXXXXXXXXXXVFHNLFPXXXXXXXXXDLDISASTFSQPKPPTPTPAIHSKPST 1461 F N+F I + S P PP P P K ST Sbjct: 399 IPPPSPPPPPPPPPPPPFFQNIFSSRKRKAK----KILSVIPSPPPPPPPPPTRTQKLST 454 Query: 1462 SEIIKV-STHKSPK-PVKIRSFDSVXXXXXXXXXXXXXXXXXXXXXXXXFTKSLSWKFVV 1635 S V ST + P PVKI+S+ +V K +WKF V Sbjct: 455 SRTRNVQSTSQEPSLPVKIKSYSNVEQNVNSGNESPLNAIPPPPPLPPF--KMPAWKFEV 512 Query: 1636 QGDYXXXXXXXXXXXXXPDIDDAESDGTPTATDGEDAVAXXXXXXXXXXXDVNTKAENFI 1815 GD+ DD E P +DG D + DVNTKA+NFI Sbjct: 513 HGDFVRLKSNNI------GADDVEESSCP--SDGGD--SPVVTPLSCPSPDVNTKADNFI 562 Query: 1816 TKFRAGLKLEKINSMNKRQGLGLSNLG 1896 +FRAGL+LEK+NS+ +++G SNLG Sbjct: 563 ARFRAGLRLEKMNSVKEKEGRRRSNLG 589 >emb|CBI35923.3| unnamed protein product [Vitis vinifera] Length = 628 Score = 171 bits (434), Expect = 9e-40 Identities = 175/639 (27%), Positives = 253/639 (39%), Gaps = 29/639 (4%) Frame = +1 Query: 148 MEEDGEDFTPFWLQSTAKRRRTDRVRDXXXXXXXXXXXXXXXXXVTAASFLVFVVPSTLS 327 MEE+G TPFW+ +++ RR R +TA F+VFV+P LS Sbjct: 39 MEEEGAT-TPFWMPASSGHRRRRSSRSPSSIFLSSGFLIIFLP-LTALLFIVFVLPPILS 96 Query: 328 FSAQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSRATNQDRDSDEYQTSPVSRDDVQKSSP 507 F++ IF+PN VKKSWDS SR E S V + Q+S+ Sbjct: 97 FTSYIFKPNMVKKSWDSLNLVLVLFAIICGFLSRGGGGGSSDMESSVSEVPEESTQRSNH 156 Query: 508 STPQQKWYNYSTDEDQKSNLSAFQQWYGYSDQMADNQVNVSRGGLRR--TSSSYPDLLEA 681 + Y ++++ GG+RR +SSSYPDL + Sbjct: 157 G-------------------------HCYEERIS------GYGGMRRMRSSSSYPDLRQE 185 Query: 682 SSQFTSGDDPWRFYDDMTVDTYRV-SETGQLLRRRSWKDKFEVSRVVESKNIHVDTFVNR 858 S+ + GD WR +DD +D +RV QL RR ++D+ E KNI VD Sbjct: 186 SA-WAGGDGRWRSFDDTQLDNHRVLGSHRQLYIRRRYEDQ----DYCEVKNIDVDNTSMI 240 Query: 859 PEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXMPKEKPKRVHRSLAHKGERSKKKDNELEN 1038 K + K K KR +++A + R ++++ E+ Sbjct: 241 SPKEKVLSHIPPRPPSPPLPPSPPPPPPPPPVVKRKVKRSFQAVAREERRETRENSSFES 300 Query: 1039 REPISSPVTXXXXXXXXXXXXRFVDQKIGKTDKKRSGGSATKDFLNSLYHXXXXXXXXX- 1215 + ++P V+++ K+D+KR G ATK+FL SLY+ Sbjct: 301 
KRVQAAPPPPPPPPPPPPPLA--VERRSEKSDRKRGG--ATKEFLTSLYYQRNKKKKQRQ 356 Query: 1216 -SIENLDSLLHLSQTPPLQFQXXXXXXXXXXXXXXXXXXXVFHNLFPXXXXXXXXXDLDI 1392 S+ENLD++LH S P VFHNLF + Sbjct: 357 KSMENLDTILHNS--PHSDQPLRPPPSPPPPPPLPPPPNSVFHNLFSSKKGKSKRF---L 411 Query: 1393 SASTFSQPKPPTPTPAIHSKPSTSEIIKVSTH---------KSPKPVKIRSFDSVXXXXX 1545 + P PP P ++ + ++I +H K P P K SF+SV Sbjct: 412 TVPPPPPPPPPPPASRAYAGKTKTKIALSRSHPYDHPLNASKPPIPEKSSSFNSVDGNPY 471 Query: 1546 XXXXXXXXXXXXXXXXXXXFTKSLSWKFVVQGDYXXXXXXXXXXXXXPDID--------- 1698 F K WKFVV GDY PD+D Sbjct: 472 AGSESLLIPVPPPPPPPPPF-KMPDWKFVVHGDYVRIKSTNSSRSGSPDLDYIGSPSSKG 530 Query: 1699 DAESDGTPTATDGEDAVAXXXXXXXXXXXDVNTKAENFITKFRAGLKLEKINSMNKRQGL 1878 + S + T+G D+ DVNTKA+ FI +FRAGLKLEKINS+ ++Q + Sbjct: 531 PSRSTSLKSETEGGDSAQPLFCPSP----DVNTKADTFIARFRAGLKLEKINSIKEKQEV 586 Query: 1879 GLSNLG------RGSDSTQL*RGLYFSSSTNYHLFICLF 1977 G+SNLG +GS + L + S NY L+ F Sbjct: 587 GMSNLGPEPGQAQGSGAWGLCGSGFPDSVCNYKLWYRFF 625 >ref|XP_002521366.1| conserved hypothetical protein [Ricinus communis] gi|223539444|gb|EEF41034.1| conserved hypothetical protein [Ricinus communis] Length = 553 Score = 156 bits (395), Expect = 3e-35 Identities = 159/614 (25%), Positives = 228/614 (37%), Gaps = 26/614 (4%) Frame = +1 Query: 154 EDGEDFTPFWLQSTAKRRRTDRVRDXXXXXXXXXXXXXXXXXVTAASFLVFVVPSTLSFS 333 E+ ED PFWLQ+T + R R+R V A F+ VVPS ++F+ Sbjct: 2 EEEEDVPPFWLQATDQHHRGRRLRRQASSIFLNSGVILIMLLVIAFVFVFVVVPSVVTFT 61 Query: 334 AQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSR--ATNQDRDSDEYQ--------TSPVSR 483 +Q+F+PN +KK WDS R + S YQ +S + Sbjct: 62 SQVFKPNLIKKGWDSLNFVLVLFAIVCGFLGRNSPNTSNESSTSYQRLSSSSSASSSNVQ 121 Query: 484 DDVQKSSPSTPQQKWYNYSTDEDQKSNLSAFQQWYGYSDQMADNQVNVSRGGLRRTSSSY 663 DVQ+S PSTP +WY+ +D+ ++ + F + R+ SY Sbjct: 122 QDVQRSYPSTPAYRWYDDGQYQDRTASYNTFNR--------------------LRSFRSY 161 Query: 664 PDLLEASSQFTSGDDPWRFYDDMTVDTYRVS---------------ETGQLLRRRSWKDK 798 PDL + S +++ D+ WRFYDD V+ Y+ S + Q + K Sbjct: 162 PDLRQ-ESLWSNNDERWRFYDDTRVNGYKFSSPLHQDELQDDHPPQQQQQEQDQEPRKQD 220 Query: 799 
FEVSRVVESKNIHVDTFVNRPEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXMPKEKPKRV 978 E + V +K+I VDTFV E+ K + KR Sbjct: 221 QEQEQDVSTKDIAVDTFVIHKEEVVQTPPPPMPPAPVSPPRLPTRSTV-----KRRAKRT 275 Query: 979 HRSLAHKGERSKKKDNELENREPISSPVTXXXXXXXXXXXXRFVDQKIGKTDKKRSGGSA 1158 + L GE K+++N+ + I+ P K+DK+R Sbjct: 276 YHDL---GEHEKRRENKNLEVKTINIPPPPPPPQLIS-----------SKSDKRRG---- 317 Query: 1159 TKDFLNSLYHXXXXXXXXXSIENLDSLLHLSQTPPLQFQXXXXXXXXXXXXXXXXXXXVF 1338 KD L SL S+ENL+SL + P + F Sbjct: 318 -KDLLISL-RRKRKKQRQKSVENLESLFNPEPLPSI--------IPPPPPPPPPPPPHFF 367 Query: 1339 HNLFPXXXXXXXXXDLDISASTFSQPKPPTPTPAIHSKPSTSEIIKVSTHKSPKPVKIRS 1518 NLF + + QP+PP+ T H +T + + +K K VK + Sbjct: 368 QNLFSSKKGKTKKD----HSHSVPQPQPPSRT---HRSRTTVQEATIEAYKPLKAVKTGN 420 Query: 1519 FDSVXXXXXXXXXXXXXXXXXXXXXXXXFT-KSLSWKFVVQGDYXXXXXXXXXXXXXPDI 1695 F SV K WKF+ GDY PDI Sbjct: 421 FSSVEENVERGNASPLIPIPPPPPPPPPPPFKMKPWKFISDGDYVRVASFNSSRSGSPDI 480 Query: 1696 DDAESDGTPTATDGEDAVAXXXXXXXXXXXDVNTKAENFITKFRAGLKLEKINSMNKRQG 1875 D + ++ + DVNTKAENFI +FRAGLKLEKINS+ Sbjct: 481 DSEDPSDKESSPMARNKEGDSAMPSFCPSPDVNTKAENFIARFRAGLKLEKINSVK---- 536 Query: 1876 LGLSNLGRGSDSTQ 1917 G SNLG G D + Sbjct: 537 -GRSNLGPGPDRVE 549 >ref|XP_002329058.1| predicted protein [Populus trichocarpa] gi|566150019|ref|XP_006369280.1| hydroxyproline-rich glycoprotein [Populus trichocarpa] gi|550347738|gb|ERP65849.1| hydroxyproline-rich glycoprotein [Populus trichocarpa] Length = 560 Score = 133 bits (335), Expect = 3e-28 Identities = 152/608 (25%), Positives = 216/608 (35%), Gaps = 32/608 (5%) Frame = +1 Query: 154 EDGEDFTPFWLQSTAKRRRTDRVRDXXXXXXXXXXXXXXXXXVTAASFLVFVVPSTLSFS 333 ++ ED FWLQ+T R R+ +R V A +F++ VV S S + Sbjct: 2 DEEEDMPLFWLQATNTRHRSRGLRRQTSSIFLNSGVFLVILLVVALAFVLVVVSSIGSLT 61 Query: 334 AQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSRATNQ-----------DRDSDEYQTSPVS 480 +QI RP ++KKSWDS S + D ++ Y S Sbjct: 62 SQILRPQSIKKSWDSLNLVLVLFAIVCGFLSSNNSSGSSGSGSGSGGDNENTSYYEDQ-S 120 Query: 481 RDDVQKSS--PSTPQQKWYNYSTDEDQKSNLSAFQQWYGYSDQMADNQVNVSRGGLRRTS 654 +VQK S STP +W+ + D V+ + R+ 
Sbjct: 121 LSNVQKPSHPSSTPSHRWFEHQ-----------------------DRTVSYNTLNRLRSF 157 Query: 655 SSYPDLLEASSQFTSGDDPWRFYDDMTVDTYRVSETGQLLRRRSWKDKFEVSRVVES--- 825 SSYPDL + S + WRFYDD ++ YR S + + + E + E Sbjct: 158 SSYPDLRQESLE------RWRFYDDTHLNNYRFSTSSDQIHHHYPQQVEETKKQEEGVGV 211 Query: 826 KNIHVDTFVNRPEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXMPKEKPKRVHRSLAHKGE 1005 K+I VDTFV ++ + + K KR ++ L ++ + Sbjct: 212 KDIDVDTFVINQKEVSYPSSPPPFPPPHPSSSPPLPPSPPPKLVRRKVKRTYQDLGYE-K 270 Query: 1006 RSKKKDNELENREPISSPVTXXXXXXXXXXXXRFVDQKIGKTDKKRSGGSATKDFLNSLY 1185 R+ ++ LEN I P K +K+R KDFL SL Sbjct: 271 RTDHEEKVLENFYNIPPPSPPPPPPPPPPPPPPI----FSKNEKRRG-----KDFLISL- 320 Query: 1186 HXXXXXXXXXSIENLDSLLHLSQTPPLQFQXXXXXXXXXXXXXXXXXXXVFHNLFPXXXX 1365 S+ENLDS + TP NLF Sbjct: 321 RRKKKKQRQKSVENLDSFFNPQPTPTSTLPLIPPPPPPPPPPHF------LQNLFSKKGK 374 Query: 1366 XXXXXDLDISASTFSQPKPPTPTPA-----IHSKPSTS----EIIKVSTHKSPKPVKIRS 1518 + P PP P P + S+ TS ++ +++ K P+P K R Sbjct: 375 TKKLHPV---------PPPPPPPPVTRVSKVVSQKVTSRTKVQVAPLTSDKPPEPAKTRR 425 Query: 1519 FDSVXXXXXXXXXXXXXXXXXXXXXXXXFTKSLSWKFVVQGDYXXXXXXXXXXXXXPDID 1698 F SV K +WKFV GDY PD+D Sbjct: 426 FHSVEENVERGNASRLIPLPPPPPPPPF--KMPAWKFVHDGDYVRVGSFNSSRSGSPDLD 483 Query: 1699 DAESDGTP-------TATDGEDAVAXXXXXXXXXXXDVNTKAENFITKFRAGLKLEKINS 1857 E + A G D+ A DVNTKA+NFI +FRAGL LEK+NS Sbjct: 484 SIEDASSEKDQSSPVAAASGSDSAATALFCPSP---DVNTKADNFIARFRAGLTLEKVNS 540 Query: 1858 MNKRQGLG 1881 N+R LG Sbjct: 541 ANRRSNLG 548 >gb|EXB29688.1| hypothetical protein L484_013462 [Morus notabilis] Length = 530 Score = 125 bits (314), Expect = 7e-26 Identities = 90/249 (36%), Positives = 122/249 (48%), Gaps = 9/249 (3%) Frame = +1 Query: 148 MEEDGED-FTPFWLQSTAKRRRTDRVR---DXXXXXXXXXXXXXXXXXVTAASFLVFVVP 315 ME D E+ TPFW QS+ RR D R VTA +F+ ++P Sbjct: 1 MEGDQENSLTPFWPQSSDSIRRADHRRRRLSRSSSLLFNSGAVLIALIVTALAFIFVIIP 60 Query: 316 STLSFSAQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSRATNQDRDSDEYQTSPVSRDDVQ 495 S LSF++QIFRP++VKKSWDS SR + ++ S+ + VS + Q Sbjct: 61 SFLSFTSQIFRPHSVKKSWDSLNLVLVLFAIVCGFLSRNSTENTSSN-HDDQRVSNEGGQ 
119 Query: 496 KSSPSTPQQKWYNYSTDEDQKSNLSAFQQWYGYSDQMADNQVNVSRGGLRRTSSSYPDLL 675 KS+PSTP QWY YSD+ + N R+SSSYPDL Sbjct: 120 KSNPSTP--------------------HQWYEYSDRTQSDSFNSRIYRRMRSSSSYPDLR 159 Query: 676 EASSQFTSGDDPWRFYDDMTVDTYRVSETGQ-----LLRRRSWKDKFEVSRVVESKNIHV 840 + SS + S D+ WRFYDD V YR S++ Q RRR W + E + +++KNI V Sbjct: 160 QESS-WVSRDEQWRFYDDTHVANYRPSDSDQHHHQLQYRRRPWNEPEEEIQ-LQTKNIEV 217 Query: 841 DTFVNRPEK 867 DTF R ++ Sbjct: 218 DTFEVRAKE 226 Score = 59.3 bits (142), Expect = 6e-06 Identities = 59/200 (29%), Positives = 80/200 (40%), Gaps = 11/200 (5%) Frame = +1 Query: 1333 VFHNLFPXXXXXXXXXDLDISASTFSQPKPPTPTPAIHSKPSTSEIIKVSTHKSPK-PVK 1509 VFH LF S S S P PP P P++ S ++ + + P PV Sbjct: 336 VFHTLFSSKKGKTKKVH-SFSQSPSSPPPPPPPPPSVRVSKSKAQSRPIPVTQKPSLPVH 394 Query: 1510 IRSFDSVXXXXXXXXXXXXXXXXXXXXXXXXFTKSLSWKFVVQGDYXXXXXXXXXXXXXP 1689 + DSV + W+FV GD+ P Sbjct: 395 TSNVDSVEENTKIGSESPLIPIPPPPPPPPF--RFQEWRFVRHGDFVRIKSDNSSRSGSP 452 Query: 1690 DIDDAESD----GTPTA-TDGEDAVAXXXXXXXXXXXDVNTKAENFITKFRAGLKLEKIN 1854 ++D E +P A T+G + A DV+TKA+NFI +FRAGL LEK N Sbjct: 453 ELDGCEDSPVGGASPLAVTEGGMSPARMFCPSP----DVDTKADNFIARFRAGLILEKEN 508 Query: 1855 SMNKR----QGLGL-SNLGR 1899 S+ +R LGL +NLGR Sbjct: 509 SIKERDRGKSRLGLEANLGR 528 >ref|XP_006390619.1| hypothetical protein EUTSA_v10019633mg [Eutrema salsugineum] gi|557087053|gb|ESQ27905.1| hypothetical protein EUTSA_v10019633mg [Eutrema salsugineum] Length = 548 Score = 123 bits (309), Expect = 3e-25 Identities = 157/600 (26%), Positives = 226/600 (37%), Gaps = 18/600 (3%) Frame = +1 Query: 151 EEDGEDFTPFWLQSTAKRRRTDRVRDXXXXXXXXXXXXXXXXXVTAASFLV--FVVPSTL 324 EEDG+ TPFWLQS RR R AA+ L+ F++P Sbjct: 3 EEDGDASTPFWLQS---RRNNTYFRRTSSLGGRATTVATQVFFAGAAAILIVFFIIPPFF 59 Query: 325 SFSAQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSRATNQDRDSDEYQTSPVSRDDVQKSS 504 + +QIFRP+ V+KSWD SR D + +++V K++ Sbjct: 60 TSVSQIFRPHLVRKSWDYLNFVLVLFAVLCGFLSRNAGNDESTHH-------KEEVSKNN 112 Query: 505 PSTPQQKWYNYST-----DEDQKSNLSAFQQWYG--YSDQMADNQVNVSRGGLRRTSSSY 663 + +S+ D + SN + + W DQ D+ V 
R R+SSSY Sbjct: 113 EVINGYGFNKFSSSPLIIDRGRVSNGATPRYWIDDRGGDQFPDHTV-YKRISRLRSSSSY 171 Query: 664 PDLLEASSQFTSGDDPWRFYDDMTVDTYRVSETGQLLRRRSWKDKFEVSRVVESKNIHVD 843 PDL +F + D WRFYDD V R + + + W + ++ + D Sbjct: 172 PDL--RLPEFDT-DQRWRFYDDTRVSQCRYEASDPIYQNPVWPEVKSPEEDIDQTD-GGD 227 Query: 844 TFVNRPEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXMPKEKPKRVHRSLAHKGERSKKKD 1023 N EK + + KRV++ +A K E+ ++ D Sbjct: 228 GGGNVTEK-VEVVATATAEVVEELSPPPPSAPASPPRAQRRTKRVYQDVARKEEKKERAD 286 Query: 1024 --NELENREPISSPVTXXXXXXXXXXXXRFVDQKIGKTDKKRSGGSATKDFLNSLYHXXX 1197 P+ P T V+QK K +KK+ GG ATK+FL +L Sbjct: 287 FVTATPPMTPVPPPAT--------------VNQKSNKQEKKKKGG-ATKEFLIAL-RRKK 330 Query: 1198 XXXXXXSIENLDSLLHLSQTPPLQFQXXXXXXXXXXXXXXXXXXXVFHNLFPXXXXXXXX 1377 SI+ LD LL S +PPL + F +LF Sbjct: 331 KKQRQQSIDGLD-LLFGSDSPPLAYS--MPPKSPPHPPPPPPPPPFFQSLFSSKKGK--- 384 Query: 1378 XDLDISASTFSQPKPPTPTPAIHSKPSTSEIIKV------STHKSPKP-VKIRSFDSVXX 1536 S T+S P PP P P + S + + K+ S P P K+ F Sbjct: 385 -----SKRTYSTPPPPPPPPPERNFESRASMAKIRKAPMESRTSKPNPAAKVSQFVGTGS 439 Query: 1537 XXXXXXXXXXXXXXXXXXXXXXFTKSLSWKFVVQGDYXXXXXXXXXXXXXPDIDDAESDG 1716 K +WKFV +GDY PD DD++ Sbjct: 440 ESPLMPIPPPPPPPPF--------KMPAWKFVKRGDYVRMASNISISSDEPD-DDSD--- 487 Query: 1717 TPTATDGEDAVAXXXXXXXXXXXDVNTKAENFITKFRAGLKLEKINSMNKRQGLGLSNLG 1896 DG+ A + DV+TKA++FI +FRAGLKLEK+NS+ + G SNLG Sbjct: 488 VAQLADGKAASS-----MFCPSPDVDTKADDFIARFRAGLKLEKMNSVKR----GRSNLG 538 >ref|XP_006302100.1| hypothetical protein CARUB_v10020090mg [Capsella rubella] gi|482570810|gb|EOA34998.1| hypothetical protein CARUB_v10020090mg [Capsella rubella] Length = 550 Score = 121 bits (304), Expect = 1e-24 Identities = 153/611 (25%), Positives = 214/611 (35%), Gaps = 29/611 (4%) Frame = +1 Query: 151 EEDGEDFTPFWLQSTAKRRRTDRVRDXXXXXXXXXXXXXXXXXVTAASFL--VFVVPSTL 324 EEDG+ +PFWLQS RR R AA+ L VF++P Sbjct: 3 EEDGDASSPFWLQS---RRNNTYFRRTASLGGRATTVATQIFFAGAAAILIVVFIIPPLF 59 Query: 325 SFSAQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSRATNQDRDSDEYQTSPVSRDDVQKSS 504 S +QIFRP+ V+KSWD SR TN D +T+ DD Sbjct: 60 
SSVSQIFRPHLVRKSWDYLNFVLVLFAVLCGFLSRNTNND------ETNHNKEDDESDKF 113 Query: 505 PSTPQQKWYNYSTDEDQKSNLSAFQQWYGYSDQMADNQVNVSRGGLRRTSSSYPDLLEAS 684 ++P + D+ SN + + W D +Q R R+ SSYPDL Sbjct: 114 LNSP-----SIVDRGDRVSNGATPRYWI---DDRGGDQTVYKRFSRLRSVSSYPDLRLRE 165 Query: 685 SQFTSGDDPWRFYDDMTVDTYRVSETGQLLRRRSWKDKFEVSR-----VVES-------- 825 + D+ WRFYDD V R T + +S+++ E ++ VV++ Sbjct: 166 YE---ADERWRFYDDTRVSQCRYEHTDPIYPNQSYRNWQEEAKPPPGDVVQTERDGSNGD 222 Query: 826 -KNIHVDTFVNRPEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXMP------KEKPKRVHR 984 + +H+D V + P K K KRV++ Sbjct: 223 ERKVHIDGSVAEKVEVVATAKAEVVEELPVPSAPPYIPSPPPSPPPQPKQAKRKTKRVYQ 282 Query: 985 SLAHKGERSKKKDNELENREPISSPVTXXXXXXXXXXXXRFVDQKIGKTDKKRSGGSATK 1164 + K E+ ++ D PI P T V Q+ K +KK+ GG ATK Sbjct: 283 DVPPKEEKKERADFSEVATPPILPPTT--------------VHQRSNKPEKKKKGG-ATK 327 Query: 1165 DFLNSLYHXXXXXXXXXSIENLDSLLHLSQTPPLQFQXXXXXXXXXXXXXXXXXXXVFHN 1344 DFL L SI+ LD L PPL + Sbjct: 328 DFLVVL-RRKKKKQRQQSIDGLDLL--FGSDPPLVYTMPLPPPPPPPPPPPPP------- 377 Query: 1345 LFPXXXXXXXXXDLDISASTFSQPKPPTPTPAIHSKPSTSEI-------IKVSTHKSPKP 1503 P I + + P PP P P S + + ++ T K P Sbjct: 378 --PFLRGLFSSKKGKIKRTNSNPPPPPPPPPPERRYESRASMAKSRKTPVQSRTSKPNPP 435 Query: 1504 VKIRSFDSVXXXXXXXXXXXXXXXXXXXXXXXXFTKSLSWKFVVQGDYXXXXXXXXXXXX 1683 K+ F K +WKFV +GDY Sbjct: 436 TKVTQFVGTGSESPLLPIPPPPPPPPF--------KMPAWKFVKRGDYVRMASDISISSD 487 Query: 1684 XPDIDDAESDGTPTATDGEDAVAXXXXXXXXXXXDVNTKAENFITKFRAGLKLEKINSMN 1863 PD D T + DV+TKA++FI +FRAGLKLEK+NS+ Sbjct: 488 EPDDTDVAQSATGS--------------MFCPSPDVDTKADDFIARFRAGLKLEKMNSVK 533 Query: 1864 KRQGLGLSNLG 1896 + G SNLG Sbjct: 534 R----GRSNLG 540 >ref|NP_177422.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] gi|12323765|gb|AAG51845.1|AC010926_8 unknown protein; 15669-13984 [Arabidopsis thaliana] gi|24030251|gb|AAN41301.1| unknown protein [Arabidopsis thaliana] gi|332197252|gb|AEE35373.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] Length = 561 Score = 118 bits (295), Expect = 1e-23 Identities 
= 159/621 (25%), Positives = 221/621 (35%), Gaps = 39/621 (6%) Frame = +1 Query: 151 EEDGEDFTPFWLQSTAKRRRTDRVRDXXXXXXXXXXXXXXXXXVTAASFLV-FVVPSTLS 327 E+DG+ TPFWLQS +R T R TAA +V F++P S Sbjct: 3 EDDGDASTPFWLQS--RRNNTYFRRTASLGGRTTTIATQIFFAGTAAILIVVFIIPPFFS 60 Query: 328 FSAQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSRATNQDRDS--------DEYQTSPVSR 483 +QIFRP+ V+KSWD SR TN D + +++ TSP Sbjct: 61 SVSQIFRPHLVRKSWDYLNFVLVLFAVLCGFLSRNTNNDESNHHKEEDIRNKFSTSPSII 120 Query: 484 DDVQKSSPSTPQQKWYNYSTDEDQKSNLSAFQQWYGYSDQMADNQVNVSRGGLRRTSSSY 663 D + S S +++N D + G DQ R R+ SSY Sbjct: 121 DRRSRVSNSGTTPRYWN-----DDRGG--------GGGDQTV-----YKRFSRLRSVSSY 162 Query: 664 PDLLEASSQFTSGDDPWRFYDDMTVDTYRVSETGQLLRRRSWK----------------- 792 PDL + D+ WRFYDD V R + + +S++ Sbjct: 163 PDLRLREYE---ADERWRFYDDTRVSQCRYEDVDPIYPNQSYRNWHEEGKPPPEDVDQTE 219 Query: 793 --DKFEVSRVVE--SKNIHVDTFVNRPEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXMPK 960 D E S+V S+ V+ + K Sbjct: 220 DGDNGEGSKVRNGGSETEKVEVVATAEAEVVEELKVPSAPPYIPSPPPSPPRPPPAKQAK 279 Query: 961 EKPKRVHRSLAHKGERSKKKDNELENREPISSPVTXXXXXXXXXXXXRFVDQKIGKTDKK 1140 K RV++ ++ + E K++D+ + PI P T V QK K +KK Sbjct: 280 RKTNRVYQDVSPQ-EEKKERDDFVATTTPIPPPAT--------------VYQKSNKQEKK 324 Query: 1141 RSGGSATKDFLNSLYHXXXXXXXXXSIENLDSLLHLSQTPPLQFQXXXXXXXXXXXXXXX 1320 + G ATKDFL +L SI+ LD L PPL + Sbjct: 325 K--GGATKDFLIAL-RRKKKKQRQQSIDGLDLL--FGSDPPLVYS---------PPPPPP 370 Query: 1321 XXXXVFHNLFPXXXXXXXXXDLDISASTFSQPKPPTPTP----AIHSKPSTSEI----IK 1476 F LF S S P PP P P S+ STS++ ++ Sbjct: 371 PPPPFFQGLFSSKKGK--------SKKNNSNPPPPPPPPPPERRYESRASTSKLRKAPVE 422 Query: 1477 VSTHKSPKPVKIRSFDSVXXXXXXXXXXXXXXXXXXXXXXXXFTKSLSWKFVVQGDYXXX 1656 T K P K+ + K +WKFV +GDY Sbjct: 423 SRTSKPNPPAKVTQYVGTGSESPLMPIPPPPPPPPF--------KMPAWKFVKRGDYVRM 474 Query: 1657 XXXXXXXXXXPDIDD-AESDGTPTATDGEDAVAXXXXXXXXXXXDVNTKAENFITKFRAG 1833 PD D A+S G+ A DV+TKA++FI +FRAG Sbjct: 475 ASDISISSDEPDDPDVAQSAGSKEAAGS----------MFCPSPDVDTKADDFIARFRAG 524 Query: 1834 LKLEKINSMNKRQGLGLSNLG 1896 LKLEK+NS+ + G SNLG Sbjct: 525 LKLEKMNSVKR----GRSNLG 541 
>gb|AAM13859.1| unknown protein [Arabidopsis thaliana] Length = 535 Score = 114 bits (286), Expect = 1e-22 Identities = 154/611 (25%), Positives = 216/611 (35%), Gaps = 39/611 (6%) Frame = +1 Query: 151 EEDGEDFTPFWLQSTAKRRRTDRVRDXXXXXXXXXXXXXXXXXVTAASFLV-FVVPSTLS 327 E+DG+ TPFWLQS +R T R TAA +V F++P S Sbjct: 3 EDDGDASTPFWLQS--RRNNTYFRRTASLGGRTTTIATQIFFAGTAAILIVVFIIPPFFS 60 Query: 328 FSAQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSRATNQDRDS--------DEYQTSPVSR 483 +QIFRP+ V+KSWD SR TN D + +++ TSP Sbjct: 61 SVSQIFRPHLVRKSWDYLNFVLVLFAVLCGFLSRNTNNDESNHHKEEDIRNKFSTSPSII 120 Query: 484 DDVQKSSPSTPQQKWYNYSTDEDQKSNLSAFQQWYGYSDQMADNQVNVSRGGLRRTSSSY 663 D + S S +++N D + G DQ R R+ SSY Sbjct: 121 DRRSRVSNSGTTPRYWN-----DDRGG--------GGGDQTV-----YKRFSRLRSVSSY 162 Query: 664 PDLLEASSQFTSGDDPWRFYDDMTVDTYRVSETGQLLRRRSWK----------------- 792 PDL + D+ WRFYDD V R + + +S++ Sbjct: 163 PDLRLREYE---ADERWRFYDDTRVSQCRYEDVDPIYPNQSYRNWHEEGKPPPEDVDQTE 219 Query: 793 --DKFEVSRVVE--SKNIHVDTFVNRPEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXMPK 960 D E S+V S+ V+ + K Sbjct: 220 DGDNGEGSKVRNGGSETEKVEVVATAEAEVVEELKVPSAPPYIPSPPPSPPRPPPAKQAK 279 Query: 961 EKPKRVHRSLAHKGERSKKKDNELENREPISSPVTXXXXXXXXXXXXRFVDQKIGKTDKK 1140 K RV++ ++ + E K++D+ + PI P T V QK K +KK Sbjct: 280 RKTNRVYQDVSPQ-EEKKERDDFVATTTPIPPPAT--------------VYQKSNKQEKK 324 Query: 1141 RSGGSATKDFLNSLYHXXXXXXXXXSIENLDSLLHLSQTPPLQFQXXXXXXXXXXXXXXX 1320 + G ATKDFL +L SI+ LD L PPL + Sbjct: 325 K--GGATKDFLIAL-RRKKKKQRQQSIDGLDLL--FGSDPPLVYS---------PPPPPP 370 Query: 1321 XXXXVFHNLFPXXXXXXXXXDLDISASTFSQPKPPTPTP----AIHSKPSTSEI----IK 1476 F LF S S P PP P P S+ STS++ ++ Sbjct: 371 PPPPFFQGLFSSKKGK--------SKKNNSNPPPPPPPPPPERRYESRASTSKLRKAPVE 422 Query: 1477 VSTHKSPKPVKIRSFDSVXXXXXXXXXXXXXXXXXXXXXXXXFTKSLSWKFVVQGDYXXX 1656 T K P K+ + K +WKFV +GDY Sbjct: 423 SRTSKPNPPAKVTQYVGTGSESPLMPIPPPPPPPPF--------KMPAWKFVKRGDYVRM 474 Query: 1657 XXXXXXXXXXPDIDD-AESDGTPTATDGEDAVAXXXXXXXXXXXDVNTKAENFITKFRAG 1833 PD D A+S G+ A DV+TKA++FI +FRAG Sbjct: 475 
ASDISISSDEPDDPDVAQSAGSKEAAGS----------MFCPSPDVDTKADDFIARFRAG 524 Query: 1834 LKLEKINSMNK 1866 LKLEK+NS+ + Sbjct: 525 LKLEKMNSVKR 535 >ref|XP_002887451.1| hypothetical protein ARALYDRAFT_476416 [Arabidopsis lyrata subsp. lyrata] gi|297333292|gb|EFH63710.1| hypothetical protein ARALYDRAFT_476416 [Arabidopsis lyrata subsp. lyrata] Length = 552 Score = 114 bits (285), Expect = 2e-22 Identities = 152/609 (24%), Positives = 214/609 (35%), Gaps = 27/609 (4%) Frame = +1 Query: 151 EEDGEDFTPFWLQSTAKRRRTDRVRDXXXXXXXXXXXXXXXXXVTAASFLVF-VVPSTLS 327 EEDG+ TPFWLQS +R T R TAA +VF ++P S Sbjct: 3 EEDGDASTPFWLQS--RRNNTYFRRTASLGGRATTVATQIFFAGTAAILIVFFIIPPLFS 60 Query: 328 FSAQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSRATNQDRDSDEYQTSPVSRDDVQKSSP 507 +Q+FRP+ V+KSWD SR TN D +T+ +D+ Sbjct: 61 SVSQVFRPHLVRKSWDYLNFVLVLFAVLCGFLSRNTNND------ETNHNKEEDISNKFS 114 Query: 508 STPQQKWYNYSTDEDQKSNLSAFQQWYGYSDQMADNQVNVSRGGLRRTSSSYPDLLEASS 687 ++P + + SN + + W +Q R R+ SSYPDL Sbjct: 115 NSP-----SIIDRGGRVSNSATPRYWIDDRGGGGGDQTVYKRFSRLRSVSSYPDLRLREY 169 Query: 688 QFTSGDDPWRFYDDMTVDTYRVSETGQLLRRRSWKD-KFEVSRVVE-------------- 822 + D+ WRFYDD V R + + +S+++ + EV E Sbjct: 170 E---ADERWRFYDDTRVSQCRYEDVDPIYPNQSYRNWQEEVKPPPEDLDQTEDGGNEGGG 226 Query: 823 ------SKNIHVDTFVNRPEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXMPKEKPKRVHR 984 S+ V+ F + K K KRV++ Sbjct: 227 KVHSGGSETEKVEVFETAEAEVVEELTVPSAPPYIPSPPPSPPRPPPPKQAKRKTKRVYQ 286 Query: 985 SLAHKGERSKKKDNELEN-REPISSPVTXXXXXXXXXXXXRFVDQKIGKTDKKRSGGSAT 1161 + K E +++ D PI P T V QK K +KK+ G AT Sbjct: 287 DVPPKEENNERSDFVAATPMTPIPPPAT--------------VYQKSNKQEKKK--GGAT 330 Query: 1162 KDFLNSLYHXXXXXXXXXSIENLDSLLHLSQTPPLQFQXXXXXXXXXXXXXXXXXXXVFH 1341 KDFL +L SI+ LD L PPL + F Sbjct: 331 KDFLIAL-RRKKKKQRQQSIDGLDLL--FGSDPPLVYS---------PPPPPPPPPPFFQ 378 Query: 1342 NLFPXXXXXXXXXDLDISASTFSQPKPPTPTPAIHSKPSTSEI----IKVSTHKSPKPVK 1509 LF +++ P PP P S+ S + I ++ T K P + Sbjct: 379 GLFSSKKGKGKKN----NSNPPPPPPPPPPERRYESRASMTSIRKAPVESRTSKPNPPAR 434 Query: 1510 
IRSFDSVXXXXXXXXXXXXXXXXXXXXXXXXFTKSLSWKFVVQGDYXXXXXXXXXXXXXP 1689 + F K +WKFV +GDY Sbjct: 435 VTQFVGTGSESPLMPIPPPPPPPPF--------KMPAWKFVKRGDYVRMASDI------- 479 Query: 1690 DIDDAESDGTPTATDGEDAVAXXXXXXXXXXXDVNTKAENFITKFRAGLKLEKINSMNKR 1869 I E D A E VA DV+TKA++FI +FRAGLKLEK+NS+ + Sbjct: 480 SISSDEPDDPDVAQSAEGKVA--AGSMFCPSPDVDTKADDFIARFRAGLKLEKMNSVKR- 536 Query: 1870 QGLGLSNLG 1896 G SNLG Sbjct: 537 ---GRSNLG 542 >gb|AAL50061.1| At1g72790/F28P22_2 [Arabidopsis thaliana] gi|19548027|gb|AAL87377.1| At1g72790/F28P22_2 [Arabidopsis thaliana] Length = 561 Score = 114 bits (285), Expect = 2e-22 Identities = 157/621 (25%), Positives = 218/621 (35%), Gaps = 39/621 (6%) Frame = +1 Query: 151 EEDGEDFTPFWLQSTAKRRRTDRVRDXXXXXXXXXXXXXXXXXVTAASFLV-FVVPSTLS 327 E+DG+ TPFWLQS +R T R TAA +V F++P S Sbjct: 3 EDDGDASTPFWLQS--RRNNTYFRRTASLGGRTTTIATQIFFAGTAAILIVVFIIPPFFS 60 Query: 328 FSAQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSRATNQDRDS--------DEYQTSPVSR 483 +QIFRP+ V+KSWD SR TN D + +++ TSP Sbjct: 61 SVSQIFRPHLVRKSWDYLNFVLVLFAVLCGFLSRNTNNDESNHHKEEDIRNKFSTSPSII 120 Query: 484 DDVQKSSPSTPQQKWYNYSTDEDQKSNLSAFQQWYGYSDQMADNQVNVSRGGLRRTSSSY 663 D + S S +++N D + G DQ R R+ SSY Sbjct: 121 DRRSRVSNSGTTPRYWN-----DDRGG--------GGGDQTV-----YKRFSRLRSVSSY 162 Query: 664 PDLLEASSQFTSGDDPWRFYDDMTVDTYRVSETGQLLRRRSWKDKFE------------- 804 PDL + D+ WRFYDD V R + +S ++ E Sbjct: 163 PDLRLREYE---ADERWRFYDDTRVSQCRYEDVDPTYPNQSCRNWHEEGKPPPEDVDQTE 219 Query: 805 --------VSRVVESKNIHVDTFVNRPEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXMPK 960 +R S+ V+ + K Sbjct: 220 DGDNGEGSKARNGGSETEKVEVVATAEAEVVEELKVPSAPPYIPSPPPSPPRPPPAKQAK 279 Query: 961 EKPKRVHRSLAHKGERSKKKDNELENREPISSPVTXXXXXXXXXXXXRFVDQKIGKTDKK 1140 K RV++ ++ + E K++D+ + PI P T V QK K +KK Sbjct: 280 RKTNRVYQDVSPQ-EEKKERDDFVATTTPIPPPAT--------------VCQKSNKQEKK 324 Query: 1141 RSGGSATKDFLNSLYHXXXXXXXXXSIENLDSLLHLSQTPPLQFQXXXXXXXXXXXXXXX 1320 + G ATKDFL +L SI+ LD L PPL + Sbjct: 325 K--GGATKDFLIAL-RRKKKKQRQQSIDGLDLL--FGSDPPLVYS---------PPPPPP 370 Query: 1321 
XXXXVFHNLFPXXXXXXXXXDLDISASTFSQPKPPTPTP----AIHSKPSTSEI----IK 1476 F LF S S P PP P P S+ STS++ ++ Sbjct: 371 PPPPFFQGLFSSKKGK--------SKKNNSNPPPPPPPPPPERRYESRASTSKLRKAPVE 422 Query: 1477 VSTHKSPKPVKIRSFDSVXXXXXXXXXXXXXXXXXXXXXXXXFTKSLSWKFVVQGDYXXX 1656 T K P K+ + K +WKFV +GDY Sbjct: 423 SRTSKPNPPAKVTQYVGTGSESPLMPIPPPPPPPPF--------KMPAWKFVKRGDYVRM 474 Query: 1657 XXXXXXXXXXPDIDD-AESDGTPTATDGEDAVAXXXXXXXXXXXDVNTKAENFITKFRAG 1833 PD D A+S G+ A DV+TKA++FI +FRAG Sbjct: 475 ASDISISSDEPDDPDVAQSAGSKEAAGS----------MFCPSPDVDTKADDFIARFRAG 524 Query: 1834 LKLEKINSMNKRQGLGLSNLG 1896 LKLEK+NS+ + G SNLG Sbjct: 525 LKLEKMNSVKR----GRSNLG 541 >ref|XP_002279061.1| PREDICTED: uncharacterized protein LOC100261010 [Vitis vinifera] Length = 555 Score = 108 bits (269), Expect = 1e-20 Identities = 137/562 (24%), Positives = 207/562 (36%), Gaps = 26/562 (4%) Frame = +1 Query: 280 VTAASFLVFVVPSTLSFSAQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSRATNQDRDSDE 459 + A + F VPS L+F++Q RPN+V+KSWDS +R N +++ D Sbjct: 41 ILAMIVVFFAVPSFLNFTSQFLRPNSVRKSWDSLNVLLVLFAILCGVFAR-KNDEKNDDV 99 Query: 460 YQTSPVSRDDVQKSSPSTPQQKWYNYSTDEDQKSNLSAFQQWYGYSDQMADNQVNVSRGG 639 + S V S + + +S D+K + D + Sbjct: 100 LENHGSSGSVVMGKSHESISHSLFEFS---DRK---------------IYDPPIQSGSVR 141 Query: 640 LRRTSSSYPDLLEASSQFTSGDDPWRFYDDMTVDTYRVSETGQLLRRRSWKDKFEVSR-V 816 LRR+SSSYPDL + S + +GDD RF+DD V+ YR + +RR + E+ R Sbjct: 142 LRRSSSSYPDLRQ-ESLWGAGDDRRRFFDDFEVNNYRSPASSDYVRRHR---RSELERDD 197 Query: 817 VESKNIHVDTFVNRPEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXMPKEKPKRVHRSLAH 996 E K I VDTF R + + KP+R + ++A Sbjct: 198 SEVKVIPVDTFAVRSS---------PSPSPAPPRTPPPPPPPPPPIVQRKPRRSYETVAR 248 Query: 997 KGERSKK-KDNELENREPISSPVTXXXXXXXXXXXXRFVDQKIGKTDKKRSGGSATKDFL 1173 K + S D ++R P + P +QK K+ ++ G ATKD Sbjct: 249 KEKLSNSDADQFKKSRSPPAPPPPPPPPPPPRVPGGHLPEQKSRKSARRM--GGATKDIA 306 Query: 1174 N---SLYHXXXXXXXXXSIENLDSLLHLSQTPPLQFQXXXXXXXXXXXXXXXXXXXVFHN 1344 SLY+ + +N+ + Q+PP + HN Sbjct: 307 TVFVSLYNQTRKKKKQRT-KNIHE--NAVQSPP-----SATTPTPPPPPPPPPPPSMLHN 358 Query: 1345 
LF-----------------PXXXXXXXXXDLDISASTFSQPKPPTPTPAIHSKPSTSEII 1473 LF P T P PPTP P P TS Sbjct: 359 LFRKGSKSKRIHSVSAPPPPPPPPPRPPPPRSSKRKTHIPPAPPTPPPP--PPPDTSR-- 414 Query: 1474 KVSTHKSPKPVKIRSF----DSVXXXXXXXXXXXXXXXXXXXXXXXXFTKSLSWKFVVQG 1641 + + K P P + SF D+V + K+VV+G Sbjct: 415 RRAAGKPPLPARKSSFYNRDDNVNSGGQSPLIPMPPPPPPF--------RMPELKYVVRG 466 Query: 1642 DYXXXXXXXXXXXXXPDIDDAESDGTPTATDGEDAVAXXXXXXXXXXXDVNTKAENFITK 1821 D+ P++DD + +A DG DA+ DVN KA+ FI + Sbjct: 467 DFVRIRSTHSSRCSSPELDDVDLSSNKSAMDGGDAIG----ATFCPSPDVNVKADTFIAR 522 Query: 1822 FRAGLKLEKINSMNKRQGLGLS 1887 R +LEKINS+ +R+ +GL+ Sbjct: 523 LRGEWRLEKINSLRERKNVGLT 544 >gb|EXB38898.1| hypothetical protein L484_027333 [Morus notabilis] Length = 509 Score = 96.3 bits (238), Expect = 5e-17 Identities = 104/415 (25%), Positives = 166/415 (40%), Gaps = 12/415 (2%) Frame = +1 Query: 295 FLVFVVPSTLSFSAQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSRATNQDRDSDEYQTSP 474 FL+ VP LSF++ IFRP VKKSWD +R + + +++ + Sbjct: 49 FLLSAVPPFLSFTSLIFRPIAVKKSWDLLNIFLVLFAILCGIFARRNDDESANNDVVPTA 108 Query: 475 VSRDDVQKSSPSTPQQKWYNYSTDEDQKSNLSAFQQWYGYSDQMADNQVNVSRGGLRRTS 654 V++S P+ P Q+W+ +S +D++S ++ Y D+ A++ S LRR+S Sbjct: 109 RRSGGVEESEPANP-QRWFAFS--DDRRS-----EKIYDSVDRTAESG---SLRRLRRSS 157 Query: 655 SSYPDLLEASSQFTSGDDP---WRFYDDMTVDTYRVSETGQLLRRRSWKDKFEVSRVVES 825 SSYPDL + S + +GDDP +RF+DD ++ YRV T R + + + E+ Sbjct: 158 SSYPDLRQ-ESLWETGDDPRFQFRFFDDFEINKYRV--TAPFDPSREIRGRRREADDGEA 214 Query: 826 KNIHVDTFVNRPEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXMPKEKPKRVHRSLAHKGE 1005 K I VDTFV RP + + KP+R +R++ + E Sbjct: 215 KEILVDTFVVRP------TTPPKSPSPSPSPATPSPPPPPPPVERHKPRRTYRAVGERKE 268 Query: 1006 RSKKKDNELEN-------REPISSPVTXXXXXXXXXXXXRFVDQKIGKTDKKRSGGSATK 1164 +++KK ++ + R P +P R +Q+ K ++++S Sbjct: 269 KAEKKQDDHNDADQFAKVRPPPPTPPPPPPRPPPSPARVR-PEQRHVKLERRKSNVKKEI 327 Query: 1165 DFLNSLYHXXXXXXXXXSIENLDSLLHLSQT--PPLQFQXXXXXXXXXXXXXXXXXXXVF 1338 + + I + S H S T PP + + VF Sbjct: 328 AMAFTSLYNQRKRKKKQKIASSGSHAHDSATSSPPEKTRFPPPSPPPPPPPLPPPPSSVF 387 Query: 1339 
HNLFPXXXXXXXXXDLDISASTFSQPKPPTPTPAIHSKPSTSEIIKVSTHKSPKP 1503 HNLF T P PP P P S P +S K P P Sbjct: 388 HNLFKKGIKSK-------RIHTIPPPPPPPPPPFSSSPPPSSRPSKHKNRSVPPP 435 >ref|NP_200517.2| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] gi|332009460|gb|AED96843.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] Length = 575 Score = 92.4 bits (228), Expect = 7e-16 Identities = 123/566 (21%), Positives = 210/566 (37%), Gaps = 40/566 (7%) Frame = +1 Query: 295 FLVFVVPSTLSFSAQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSRATNQDRDSDEYQ--- 465 F+ FVVP+ LS ++QI +P +VK+ WDS +R + S+ Sbjct: 44 FVTFVVPTFLSVTSQILQPASVKRGWDSINVVLVVFAILCGVLARRNDDGLSSESLHGGE 103 Query: 466 ----------TSPVSRDDVQK--SSPSTPQQKWYNYSTDEDQ-KSNLSAFQQWYGYSDQM 606 ++ ++ K SS ST ++W++ D D+ K S + + + + Sbjct: 104 EEEVGGGAVTNGEMTVGEISKISSSSSTVSEQWFDDVYDSDRLKIYESVSSRSFSHGLPV 163 Query: 607 ADNQVNVSRGGLRRTSSSYPDLLEASSQFTSGDDPWRFYDDMTVDTYRVSETGQLLRRRS 786 N LRR+SSSYPDL + + T GD +RFYDD +D YR ++ + ++ Sbjct: 164 TGNV------PLRRSSSSYPDLRQGVFRET-GDRRFRFYDDFEIDKYRSQDSSSYQQFQN 216 Query: 787 WKDKFEVSRVVESKNIHVDTFVNRPEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXMPKEK 966 E K I +DTFV +P +K Sbjct: 217 LSKTEIEEEESEPKEIQIDTFVVKPSSPPQQPPATPPPPPPPPPVEV----------PQK 266 Query: 967 PKRVHRSLAHKGERSKKKDNELENR---EPISSPVTXXXXXXXXXXXXRFVDQKIGKTDK 1137 P+R HRS+ ++ + K +E + + +P SP +K G + Sbjct: 267 PRRTHRSVRNRDLQENAKRSETKFKRTFQPPPSPPPPPPPPPPQPLIAATPPRKQGTLQR 326 Query: 1138 KRSGGS-ATKDFLNSLYHXXXXXXXXXSIENLD----SLLHLSQTPPLQFQ-----XXXX 1287 ++S + K SLY+ + + S + T P Q+Q Sbjct: 327 RKSNAAKEIKMVFASLYNQGKKKKKLQKSKRKERIESSPMVEDVTEPPQYQSLIPPPSPP 386 Query: 1288 XXXXXXXXXXXXXXXVFHNLFPXXXXXXXXXDLDISASTFSQPKPPTPTPAIHSK--PST 1461 VF+ LF + + S P PP P P +++ P T Sbjct: 387 PPPPPPPPPLRSSQSVFYGLF--------KKGVKSNKKIHSVPAPPPPPPPRYTQFDPQT 438 Query: 1462 SEIIKVSTHKSPKPVKIRSFDSVXXXXXXXXXXXXXXXXXXXXXXXXFTKSLSWKFVVQG 1641 +V + + P+P K ++F+ + K+VV G Sbjct: 439 PP-RRVKSGRPPRPTKPKNFNEENNGQGSPLIQITPPPPPPPPF-----RVPPLKYVVSG 492 Query: 1642 
DYXXXXXXXXXXXXXPD---------IDDAESDGTPTATDGEDAVAXXXXXXXXXXXDVN 1794 D+ P+ ++ +SDG + + AV+ DV+ Sbjct: 493 DFAKIRSNQSSRCSSPEREVFDIGWGLELTQSDG---GVETKAAVSGGGMPGFCPSPDVD 549 Query: 1795 TKAENFITKFRAGLKLEKINSMNKRQ 1872 TKA+NFI + R +L+KINS+N+++ Sbjct: 550 TKADNFIARLRDEWRLDKINSVNRKR 575 >dbj|BAA97357.1| unnamed protein product [Arabidopsis thaliana] Length = 607 Score = 92.0 bits (227), Expect = 9e-16 Identities = 123/565 (21%), Positives = 209/565 (36%), Gaps = 40/565 (7%) Frame = +1 Query: 295 FLVFVVPSTLSFSAQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSRATNQDRDSDEYQ--- 465 F+ FVVP+ LS ++QI +P +VK+ WDS +R + S+ Sbjct: 44 FVTFVVPTFLSVTSQILQPASVKRGWDSINVVLVVFAILCGVLARRNDDGLSSESLHGGE 103 Query: 466 ----------TSPVSRDDVQK--SSPSTPQQKWYNYSTDEDQ-KSNLSAFQQWYGYSDQM 606 ++ ++ K SS ST ++W++ D D+ K S + + + + Sbjct: 104 EEEVGGGAVTNGEMTVGEISKISSSSSTVSEQWFDDVYDSDRLKIYESVSSRSFSHGLPV 163 Query: 607 ADNQVNVSRGGLRRTSSSYPDLLEASSQFTSGDDPWRFYDDMTVDTYRVSETGQLLRRRS 786 N LRR+SSSYPDL + + T GD +RFYDD +D YR ++ + ++ Sbjct: 164 TGNV------PLRRSSSSYPDLRQGVFRET-GDRRFRFYDDFEIDKYRSQDSSSYQQFQN 216 Query: 787 WKDKFEVSRVVESKNIHVDTFVNRPEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXMPKEK 966 E K I +DTFV +P +K Sbjct: 217 LSKTEIEEEESEPKEIQIDTFVVKPSSPPQQPPATPPPPPPPPPVEV----------PQK 266 Query: 967 PKRVHRSLAHKGERSKKKDNELENR---EPISSPVTXXXXXXXXXXXXRFVDQKIGKTDK 1137 P+R HRS+ ++ + K +E + + +P SP +K G + Sbjct: 267 PRRTHRSVRNRDLQENAKRSETKFKRTFQPPPSPPPPPPPPPPQPLIAATPPRKQGTLQR 326 Query: 1138 KRSGGS-ATKDFLNSLYHXXXXXXXXXSIENLD----SLLHLSQTPPLQFQ-----XXXX 1287 ++S + K SLY+ + + S + T P Q+Q Sbjct: 327 RKSNAAKEIKMVFASLYNQGKKKKKLQKSKRKERIESSPMVEDVTEPPQYQSLIPPPSPP 386 Query: 1288 XXXXXXXXXXXXXXXVFHNLFPXXXXXXXXXDLDISASTFSQPKPPTPTPAIHSK--PST 1461 VF+ LF + + S P PP P P +++ P T Sbjct: 387 PPPPPPPPPLRSSQSVFYGLF--------KKGVKSNKKIHSVPAPPPPPPPRYTQFDPQT 438 Query: 1462 SEIIKVSTHKSPKPVKIRSFDSVXXXXXXXXXXXXXXXXXXXXXXXXFTKSLSWKFVVQG 1641 +V + + P+P K ++F+ + K+VV G Sbjct: 439 PP-RRVKSGRPPRPTKPKNFNEENNGQGSPLIQITPPPPPPPPF-----RVPPLKYVVSG 492 Query: 1642 
DYXXXXXXXXXXXXXPD---------IDDAESDGTPTATDGEDAVAXXXXXXXXXXXDVN 1794 D+ P+ ++ +SDG + + AV+ DV+ Sbjct: 493 DFAKIRSNQSSRCSSPEREVFDIGWGLELTQSDG---GVETKAAVSGGGMPGFCPSPDVD 549 Query: 1795 TKAENFITKFRAGLKLEKINSMNKR 1869 TKA+NFI + R +L+KINS+N++ Sbjct: 550 TKADNFIARLRDEWRLDKINSVNRK 574 >ref|XP_004288965.1| PREDICTED: uncharacterized protein LOC101306381 [Fragaria vesca subsp. vesca] Length = 548 Score = 89.7 bits (221), Expect = 4e-15 Identities = 128/584 (21%), Positives = 200/584 (34%), Gaps = 58/584 (9%) Frame = +1 Query: 295 FLVFVVPSTLSFSAQIFRPN-TVKKSWDSXXXXXXXXXXXXXXXSRATNQ------DRDS 453 FL++ +P LS ++ I +P +VKKSWDS +R + D D Sbjct: 29 FLIYAIPPFLSLTSHILQPTVSVKKSWDSLNVFLVIFAILCGVFARKHDDAGEGLPDPDH 88 Query: 454 DEYQTSPVSRDDVQKSSPSTPQQKWYNYSTDEDQKSNLSAFQQWYGYSDQMA-------- 609 + D + + S+PQ + + QQW+GYS++ + Sbjct: 89 HHVHIHNANSDPLLDRTTSSPQSESSVLHVPQ---------QQWFGYSERTSRMYDTTPV 139 Query: 610 DNQVNVSRGGLRRTSSSYPDLLEASSQFTSGDD---PWRFYDDMTVDTYRVSETGQLLRR 780 NVS G R SSYPD+ + S + + DD +RF+DD + Y+ Sbjct: 140 KTPENVSGDGRLRRRSSYPDMRQVESLWETLDDTKSQFRFFDDFEISNYKT--------H 191 Query: 781 RSWKDKFEVSRVVESKNIHVDTFVNRPEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXMPK 960 R +K+ + + K I VDTFV +P P+ Sbjct: 192 RHYKE----TTSDDVKEIKVDTFVLQPSPTPPPPPPPPP-------------------PR 228 Query: 961 EKPKRVHRSLAHKGERSKKKDNELENREPISSPVTXXXXXXXXXXXXRF---VDQKIGKT 1131 + +R + ++ G RSK+K E++ E S+P DQK G+ Sbjct: 229 REKQRTYETV---GRRSKEKVEEVKFEEVRSTPPPPPPPPSTVAPPSPMRVRSDQKHGRL 285 Query: 1132 DKKRSG--GSATKDFLNSLYHXXXXXXXXXSIENLDSLLHLSQTPPLQFQXXXXXXXXXX 1305 ++++S + + L + + N+ ++ PP Q Q Sbjct: 286 ERRKSNVKKEIAMVWNSVLSNQRKRKRKQKATRNIYDTAATTEPPPEQSQ-------PPP 338 Query: 1306 XXXXXXXXXVFHNLF-------------------PXXXXXXXXXDLDISASTFSQPKPPT 1428 VFHNLF P S ST P PPT Sbjct: 339 PPPPPPPSSVFHNLFKKGSKTKKVHSVPTAPPPPPPLPEVSVRTHQTRSRSTLPPPAPPT 398 Query: 1429 ---PTPAIHSKPSTSEIIKVSTHKSPKPVKIRSFDSVXXXXXXXXXXXXXXXXXXXXXXX 1599 P P+ HS+ + P P K + V Sbjct: 399 PPRPPPSAHSR-----------RRPPLPTKPSTSYEVDNVNSGCQSPLIPIPPPPPPF-- 445 Query: 1600 
XFTKSLSWKFVVQGDYXXXXXXXXXXXXXPDIDDAESD-------------GTPTATDGE 1740 K + KF V+GD+ P+ ++ +D T TDG Sbjct: 446 ---KMPAMKFFVKGDFVKIRSAQSSRSASPEPEEVVADHALPAGKEESTTTSTVNVTDGG 502 Query: 1741 DAVAXXXXXXXXXXXDVNTKAENFITKFRAGLKLEKINSMNKRQ 1872 D DVNTKA+NFI + R +LEKINS+ +++ Sbjct: 503 DGAGRASPSVFCPSPDVNTKADNFIARLRDEWRLEKINSLREKK 546 >ref|XP_002864497.1| hypothetical protein ARALYDRAFT_495801 [Arabidopsis lyrata subsp. lyrata] gi|297310332|gb|EFH40756.1| hypothetical protein ARALYDRAFT_495801 [Arabidopsis lyrata subsp. lyrata] Length = 566 Score = 88.2 bits (217), Expect = 1e-14 Identities = 121/566 (21%), Positives = 204/566 (36%), Gaps = 35/566 (6%) Frame = +1 Query: 280 VTAASFLVFV---VPSTLSFSAQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSRATNQDRD 450 ++AA FL+FV +P LS ++QI +P++VK+ WDS +R + Sbjct: 36 ISAAIFLLFVNFVIPPFLSVTSQILQPSSVKRGWDSINVVLVVFAILCGVLARRNDDGLS 95 Query: 451 SDEYQ------------TSPVSRDDVQK--SSPSTPQQKWYNYSTDEDQKSNLSAFQQWY 588 S+ + ++ ++ K SS S ++W++ D ++ + Sbjct: 96 SESLHGGEEEEVGGAVTSGEMTLGEISKISSSSSAVSEQWFDDVYDAERLKIYESVS--- 152 Query: 589 GYSDQMADNQVNVSRGGLRRTSSSYPDLLEASSQFTSGDDPWRFYDDMTVDTYRVSETGQ 768 S + LRR+ SSYPDL + + T GD +RFYDD + E Sbjct: 153 --SRSFSHGLPVTGTVPLRRSCSSYPDLRQGVFRET-GDRRFRFYDDFEIHNRSYEE--- 206 Query: 769 LLRRRSWKDKFEVSRVVESKNIHVDTFVNRPEKXXXXXXXXXXXXXXXXXXXXXXXXXXX 948 + RS K E+ E K I +DTFV +P Sbjct: 207 -FQNRS---KIEIEEESEPKEIQIDTFVVKPSSPPQQPPAPPTPPPPPPPPPVEV----- 257 Query: 949 XMPKEKPKRVHRSLAHKGERSKKKDNELENREPISSPVTXXXXXXXXXXXXRFVDQKIGK 1128 +KP+R HRS+ ++ + K N+++ + P +K G Sbjct: 258 ---SQKPRRTHRSVKNRDIQENVKRNDIKFKRAFQPPNPPPPPPPPPPLITATPPRKQGT 314 Query: 1129 TDKKRSGGS-ATKDFLNSLYHXXXXXXXXXSIENLD----SLLHLSQTPPLQFQ-----X 1278 +++S + K SLY+ + + S + + T P Q+Q Sbjct: 315 LQRRKSNAAKEIKMVFASLYNQGKRKKKIQKSKRKERIESSPVVVDVTEPPQYQSLIPPP 374 Query: 1279 XXXXXXXXXXXXXXXXXXVFHNLFPXXXXXXXXXDLDISASTFSQPKPPTPTPAIHSK-- 1452 VF+ LF + + S P PP P P H++ Sbjct: 375 SPPPPPPPPPPPPRTSQSVFYGLF--------KKGVKSNKKIHSVPAPPPPPPPRHTQFD 426 Query: 1453 
PSTSEIIKVSTHKSPKPVKIRSFDSVXXXXXXXXXXXXXXXXXXXXXXXXFTKSLSWKFV 1632 P T +V++ + P+P K +F+ + KFV Sbjct: 427 PQT-PTRRVNSGRPPRPTKPTNFNEENNGQGSPLIQITPPPPPPPPF-----RVPPLKFV 480 Query: 1633 VQGDYXXXXXXXXXXXXXPDIDDAESDGTPTATDGED------AVAXXXXXXXXXXXDVN 1794 V GD+ P+ + + T +D AV DV+ Sbjct: 481 VSGDFAKIRSNQSSRCSSPEREVIDIGWGLELTQSDDGVKTKAAVGGGGMPGFCPSPDVD 540 Query: 1795 TKAENFITKFRAGLKLEKINSMNKRQ 1872 TKA+NFI + R +L+KINS+N+++ Sbjct: 541 TKADNFIARLRDEWRLDKINSVNRKR 566 >ref|XP_006401247.1| hypothetical protein EUTSA_v10013114mg [Eutrema salsugineum] gi|557102337|gb|ESQ42700.1| hypothetical protein EUTSA_v10013114mg [Eutrema salsugineum] Length = 570 Score = 87.8 bits (216), Expect = 2e-14 Identities = 128/546 (23%), Positives = 195/546 (35%), Gaps = 24/546 (4%) Frame = +1 Query: 295 FLVFVVPSTLSFSAQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSR-------ATNQDRDS 453 F+ FV+P LS ++QIF+P +VKK WDS +R +++Q Sbjct: 47 FMTFVLPPFLSITSQIFQPASVKKGWDSINVVLVVFAILCGVLARQNDDGLSSSSQSSHV 106 Query: 454 DEYQTSPVSRDDVQKSSPSTPQQKWYNYSTDEDQKSNLSAFQQWYGYSDQMADNQVNVSR 633 +E + + +D + SS Q+W++ D D+ L ++ S + Sbjct: 107 EEEEDDVTNGEDSKISSSPVVSQQWFDDVYDADR---LKIYESLSNRS--FSPGLPVTGT 161 Query: 634 GGLRRTSSSYPDLLEASSQFTSGDDPWRFYDDMTVDTYRVSETGQLLRRRSWKDKFEVSR 813 LRR+SSSYPDL + + T+ D +RFYDD +D YR ++ K E+ Sbjct: 162 LPLRRSSSSYPDLRNGAFRETA-DRRFRFYDDFEIDKYRSQDS---------PSKIEIEE 211 Query: 814 VVESKNIHVDTFVNRPEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXMPKEKPKRVHRSLA 993 E K I VD FV RP +KPKR HRS+ Sbjct: 212 -SEPKEIPVDKFVVRPSS---PPPHPPQQPPAPPPPPPPLPESSPVQVSQKPKRTHRSVR 267 Query: 994 HKG--ERSKKKD-NELENREPISSPVTXXXXXXXXXXXXRFVDQKIGKTDKKRSGGS-AT 1161 ++ E+SK+ D N R + +K G +++S + Sbjct: 268 NRDIQEKSKRSDANSDSTRFKRAFQPPPPPPPPPPPFITATPPRKQGTLQRRKSNAAKEI 327 Query: 1162 KDFLNSLYHXXXXXXXXXSIENLDSL----LHLSQTPPLQFQ-----XXXXXXXXXXXXX 1314 K SLY+ + + + ++ T P Q+Q Sbjct: 328 KMVFASLYNQGKKKKKLQKPKRKERSESPEVVVAATEPPQYQSSFPPPSPPPPPPPPPPP 387 Query: 1315 XXXXXXVFHNLFPXXXXXXXXXDLDISASTFSQPKPPTPTP--AIHSKPSTSEIIKVSTH 1488 VF+ LF S S P PP P P I P T + + Sbjct: 388 
LRSSQSVFYGLFKKGVK---------SKKIHSVPAPPPPPPPRKIQLDPQTPP-RRSKSG 437 Query: 1489 KSPKPVKIRSFDSVXXXXXXXXXXXXXXXXXXXXXXXXFTKSLSWKFVVQGDYXXXXXXX 1668 + P+P+K +F+ L KFVV GD+ Sbjct: 438 RPPRPMKPTNFNEDSYVNNGHASPLIQTTPPPPPPPPFRVPPL--KFVVSGDFAKIRSNQ 495 Query: 1669 XXXXXXP--DIDDAESDGTPTATDGEDAVAXXXXXXXXXXXDVNTKAENFITKFRAGLKL 1842 P ++ D T +DG DVNTKA+NFI + R +L Sbjct: 496 SSRCSSPEREVIDLGWGLELTQSDGGAETLTAVGSGFCPSPDVNTKADNFIARLRDEWRL 555 Query: 1843 EKINSM 1860 +KINS+ Sbjct: 556 DKINSV 561