BLASTX nr result

ID: Rauwolfia21_contig00031643 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00031643
         (2066 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EMJ16210.1| hypothetical protein PRUPE_ppa002494mg [Prunus pe...   192   4e-46
gb|EOY27178.1| Hydroxyproline-rich glycoprotein family protein, ...   190   2e-45
ref|XP_006426604.1| hypothetical protein CICLE_v10025206mg [Citr...   172   5e-40
emb|CBI35923.3| unnamed protein product [Vitis vinifera]              171   9e-40
ref|XP_002521366.1| conserved hypothetical protein [Ricinus comm...   156   3e-35
ref|XP_002329058.1| predicted protein [Populus trichocarpa] gi|5...   133   3e-28
gb|EXB29688.1| hypothetical protein L484_013462 [Morus notabilis]     125   7e-26
ref|XP_006390619.1| hypothetical protein EUTSA_v10019633mg [Eutr...   123   3e-25
ref|XP_006302100.1| hypothetical protein CARUB_v10020090mg [Caps...   121   1e-24
ref|NP_177422.1| hydroxyproline-rich glycoprotein family protein...   118   1e-23
gb|AAM13859.1| unknown protein [Arabidopsis thaliana]                 114   1e-22
ref|XP_002887451.1| hypothetical protein ARALYDRAFT_476416 [Arab...   114   2e-22
gb|AAL50061.1| At1g72790/F28P22_2 [Arabidopsis thaliana] gi|1954...   114   2e-22
ref|XP_002279061.1| PREDICTED: uncharacterized protein LOC100261...   108   1e-20
gb|EXB38898.1| hypothetical protein L484_027333 [Morus notabilis]      96   5e-17
ref|NP_200517.2| hydroxyproline-rich glycoprotein family protein...    92   7e-16
dbj|BAA97357.1| unnamed protein product [Arabidopsis thaliana]         92   9e-16
ref|XP_004288965.1| PREDICTED: uncharacterized protein LOC101306...    90   4e-15
ref|XP_002864497.1| hypothetical protein ARALYDRAFT_495801 [Arab...    88   1e-14
ref|XP_006401247.1| hypothetical protein EUTSA_v10013114mg [Eutr...    88   2e-14

>gb|EMJ16210.1| hypothetical protein PRUPE_ppa002494mg [Prunus persica]
          Length = 666

 Score =  192 bits (489), Expect = 4e-46
 Identities = 177/624 (28%), Positives = 259/624 (41%), Gaps = 32/624 (5%)
 Frame = +1

Query: 106  PFIKSETRK*KPIYMEEDGEDFTPFWLQ-STAKRRRTDRVRDXXXXXXXXXXXXXXXXXV 282
            P ++++T++  P  MEE  +   PFWLQ S + R+   R+R                   
Sbjct: 70   PKVENQTKQ-NPKTMEEKEDMLPPFWLQPSDSFRQANRRLRRSSSSVFFSSGAFILALLA 128

Query: 283  TAASFLVFVVPSTLSFSAQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSRATNQDRDSDEY 462
             A  F+ F++PS LSF++QIFRP++VKKSWDS               SR TN D +    
Sbjct: 129  IALVFIFFIIPSVLSFTSQIFRPHSVKKSWDSLNLVLVLFAIVCGFLSRNTNNDGN---- 184

Query: 463  QTSPVSRDDVQKSSPSTPQQKWYNYSTDEDQKSNLSAFQQWYG-YSDQMADNQVNVSRGG 639
             +SP S D V         Q  +N S+ +  KSN S  +QW+  YSD+   NQ + S   
Sbjct: 185  LSSPSSYDQVH-------NQTVFNSSSPQAPKSNPSTPRQWFDQYSDRTGYNQSSSSTSA 237

Query: 640  LR----RTSSSYPDLLEASSQFTSGDDPWRFYDDMTVDTYRVSETGQL--LRRRSWKDKF 801
                  RTSSSYPDL +  + + + DD WRFYDD  V  YRVS +  L   R RSW ++ 
Sbjct: 238  AMNRGVRTSSSYPDLRQQEASWVARDDRWRFYDDTHVVNYRVSGSDPLHHRRHRSWHEES 297

Query: 802  ------EVSRVVESKNIHVDTFVNRPEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXMPKE 963
                  E +  V++K I VDTF  R E+                                
Sbjct: 298  VQLPVEEEAEQVQTKTIEVDTFAIRTEQPSPPRRSPSSSQPPPPTI-------------R 344

Query: 964  KPKRVHRSLAHK----GERSKKKDNELENREPISSPVTXXXXXXXXXXXXRFVDQKIGKT 1131
            K KR ++++  K      +S ++++  E ++ +  P                + +  GK 
Sbjct: 345  KSKRTYQAIGEKENSGSTQSLERNDNFEAKKNLPPPPARPPPSPPSPPPR--ISKSAGKD 402

Query: 1132 DKKRSGGSATKDFLNSLYHXXXXXXXXXSIENLDSLLHLSQTPPLQFQXXXXXXXXXXXX 1311
             KKR G + TK+FL +            S+EN +SLL  + + P  +             
Sbjct: 403  VKKR-GVATTKEFLITSLRRKKKKQRQKSVENFESLLASASSAP--YSLLPPPSPPPPPP 459

Query: 1312 XXXXXXXVFHNLFPXXXXXXXXXDLDISASTFSQPKPPTPTPAIHSKPSTSEIIK----- 1476
                   VFHNLF           +       S P+PP P P + +  ST+++ K     
Sbjct: 460  PLPPPPSVFHNLFSTKKSNKPRKTMQ------SIPQPP-PPPPVAATTSTAQLSKTKAQM 512

Query: 1477 ---VSTHKSPKPVKIRSFDSVXXXXXXXXXXXXXXXXXXXXXXXXFTKSLSWKFVVQGDY 1647
               ++T K P PVK+ +F +                         F +    KFVV GD+
Sbjct: 513  RPMMTTQKPPLPVKMSTFINGDDENTNSGGESPLARIPPPPPLPPF-RMPEMKFVVHGDF 571

Query: 1648 XXXXXXXXXXXXXPDIDDAESD----GTPTATDGEDAV--AXXXXXXXXXXXDVNTKAEN 1809
                         PD+DD +       +PT       +              DVNTKA+ 
Sbjct: 572  VRIKSNNSSRSGSPDLDDGDDPDSAVSSPTTETNRTPLESGESPKAMFCPSPDVNTKADT 631

Query: 1810 FITKFRAGLKLEKINSMNKRQGLG 1881
            FI +FRAGL+LEK+NS+  R  LG
Sbjct: 632  FIARFRAGLRLEKMNSVRGRSNLG 655


>gb|EOY27178.1| Hydroxyproline-rich glycoprotein family protein, putative [Theobroma
            cacao]
          Length = 553

 Score =  190 bits (482), Expect = 2e-45
 Identities = 186/601 (30%), Positives = 252/601 (41%), Gaps = 23/601 (3%)
 Frame = +1

Query: 148  MEEDGEDFTPFWLQSTAKRRRTDRVRDXXXXXXXXXXXXXXXXXVTAASFLVFVVPSTLS 327
            MEED ED TPFWLQ TA  RR  R R                  V A +F+  ++PS LS
Sbjct: 1    MEED-EDVTPFWLQ-TADNRRIRRRRQPSSLFFNTGILIILLL-VVALAFIFVIIPSFLS 57

Query: 328  FSAQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSRATNQDRDSD--------EYQTSPV-S 480
            F++QIF+P+ VKKSWDS                +  N + DSD        ++ T+P   
Sbjct: 58   FTSQIFKPHLVKKSWDSLNLVLVLFAIICGFLGK-NNGNNDSDTRSTYEDYKFSTTPKHD 116

Query: 481  RDDVQKSSPSTPQQKWYNYSTDEDQKSNLSAFQQWYGYSDQMADNQVNVSRGGLRRTSSS 660
            RD V +S+PSTP+Q WY+YS+                 SD+ A N +        R+S+S
Sbjct: 117  RDHVGRSNPSTPRQ-WYDYSSS----------------SDRTAYNSLQ-----RLRSSNS 154

Query: 661  YPDLLEASSQFTSGDDPWRFYDDMTVDTYRVSETGQLLRRRSWKDKFEVSRVVESKNIHV 840
            YPDL   SS   +GDD WRFYDD  +  YR        R R   D+ EV     +K+I V
Sbjct: 155  YPDLRPESSWMMNGDDRWRFYDDTPLYNYR-------SRSRREHDREEVYS-NNTKDIAV 206

Query: 841  DTFVNRPEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXMPKEKPKRVHRSLAHKGERSKKK 1020
            DT V+RP                              + + KPKR +  L  K ERS++K
Sbjct: 207  DT-VHRPPPPPSSSPPPAATASSPSPPQSPPPQPPKVV-RRKPKRTYEDLKPK-ERSERK 263

Query: 1021 D---NELENRE--PISSPVTXXXXXXXXXXXXRFVDQKIGKTDKKRSGGSATKDFLNSLY 1185
            +   +EL+ +   P + P                 +++  K++KKR  G  TKDFL SL 
Sbjct: 264  EVINSELKIKHSLPSTPPAARPPPPPPPPPPPSVFEKRSNKSEKKR--GGVTKDFLISL- 320

Query: 1186 HXXXXXXXXXSIENLDSLLHLSQTPPLQFQXXXXXXXXXXXXXXXXXXXVFHNLFPXXXX 1365
                      S+ENLD    LS  P                         + N+FP    
Sbjct: 321  RRKKKKQRQRSVENLDEFFKLSTLP-----LYPPASPPPPPPPPPPLPSFYQNIFPSKKN 375

Query: 1366 XXXXXDLDISASTFSQPKPPTPTPAIHSKPS--TSEIIKVSTHKSPKPVKIRSFDSVXXX 1539
                        +   P PP P P++ ++ S   S+   V+T K P PVKIR+  +V   
Sbjct: 376  KAR------KNHSVPPPPPPPPLPSVEARASKRESQTPPVTTQKPPLPVKIRNMHNVEES 429

Query: 1540 XXXXXXXXXXXXXXXXXXXXXFTKSLSWKFVVQGDYXXXXXXXXXXXXXPDIDD------ 1701
                                   K  +WKF V GD+             PD+DD      
Sbjct: 430  VESGNESPLNPIPPPPPPPPF--KMPAWKFEVHGDFVRLKSIRSSRSGSPDLDDPLSCEA 487

Query: 1702 AESDGTPTA-TDGEDAVAXXXXXXXXXXXDVNTKAENFITKFRAGLKLEKINSMNKRQGL 1878
            + SDG  T   DG ++             DV+TKA+NFI +FRAGLKLEK+NS+  R  L
Sbjct: 488  SPSDGNKTGEMDGGESTT---GPLFCPSPDVDTKADNFIARFRAGLKLEKMNSVRGRSNL 544

Query: 1879 G 1881
            G
Sbjct: 545  G 545


>ref|XP_006426604.1| hypothetical protein CICLE_v10025206mg [Citrus clementina]
            gi|557528594|gb|ESR39844.1| hypothetical protein
            CICLE_v10025206mg [Citrus clementina]
          Length = 602

 Score =  172 bits (436), Expect = 5e-40
 Identities = 171/627 (27%), Positives = 245/627 (39%), Gaps = 44/627 (7%)
 Frame = +1

Query: 148  MEEDGEDFTPFWLQSTAKRRRTDRVRDXXXXXXXXXXXXXXXXXVTAASFLVFVVPSTLS 327
            MEE+ +  +PFWLQST  R+   R R                    A +F+  V+PS  S
Sbjct: 1    MEEEEDASSPFWLQST--RQAGHRRRRSSSSFLFNSGALLVFLLAVAVAFIFIVIPSIQS 58

Query: 328  FSAQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSRATNQDR-----------------DSD 456
            F++QIF+P+ VK+SWDS               SR  N +                  + +
Sbjct: 59   FTSQIFKPHAVKRSWDSLNLVLVLFAIICGFLSRNHNNNESIATTPTTTSAAASASYEDE 118

Query: 457  EYQTSPVSRDDVQKSSPSTPQQKWYNYSTDEDQKSNLSAFQQWYGYSDQMADNQVNVSRG 636
            EY+    S +  QKS+P TP++ WY+++   +  +N                N  N +RG
Sbjct: 119  EYKFENRS-ESFQKSNPETPRRFWYDHAYSNNNNNN----------------NNDNNNRG 161

Query: 637  GLR-RTSSSYPDLLEASSQFTSGDDPWRFYDDMTVDTYRVSETGQLLRRRSW-------- 789
              R R+ SS+PDL +  + + + +D WRFYDD  +   R S      +   W        
Sbjct: 162  LSRLRSFSSHPDLRQEEALWANVEDQWRFYDDTHLYHNRFSSF-DYKQLHQWQPQPQPQP 220

Query: 790  --------KDKFEVSRVVESKNIHVDTFVN------RPEKXXXXXXXXXXXXXXXXXXXX 927
                    K+K E  +V   KN+  D   +      R E                     
Sbjct: 221  KLEVLEEEKEKKENEKVDAVKNVDADNTTSSTVEESRKEIIYTPPQPPPASPSPEPAELP 280

Query: 928  XXXXXXXXMPKEKPKRVHRSL--AHKGERSKKKDNELENREPISSPVTXXXXXXXXXXXX 1101
                    +P  + K V R +   H+G     ++N+LE + P+  P              
Sbjct: 281  PSPSPPPPLPPPQTKVVRRRVKRTHQGNGGNYRNNDLEVK-PLPPPPPQLQPPPLPAPPE 339

Query: 1102 RFVDQKIGKTDKKRSGGSATKDFLNSLYHXXXXXXXXXSIENLDSLLHLSQTPPLQFQXX 1281
              V++  GK++KKR G SATK+FL SL           S+ENLDS  +   + PL     
Sbjct: 340  TEVEESGGKSEKKRGGTSATKEFLTSLRRKKKKKQRQKSVENLDSFFNYESSYPLP-PSL 398

Query: 1282 XXXXXXXXXXXXXXXXXVFHNLFPXXXXXXXXXDLDISASTFSQPKPPTPTPAIHSKPST 1461
                              F N+F             I +   S P PP P P    K ST
Sbjct: 399  IPPPSPPPPPPPPPPPPFFQNIFSSRKRKAK----KILSVIPSPPPPPPPPPTRTQKLST 454

Query: 1462 SEIIKV-STHKSPK-PVKIRSFDSVXXXXXXXXXXXXXXXXXXXXXXXXFTKSLSWKFVV 1635
            S    V ST + P  PVKI+S+ +V                          K  +WKF V
Sbjct: 455  SRTRNVQSTSQEPSLPVKIKSYSNVEQNVNSGNESPLNAIPPPPPLPPF--KMPAWKFEV 512

Query: 1636 QGDYXXXXXXXXXXXXXPDIDDAESDGTPTATDGEDAVAXXXXXXXXXXXDVNTKAENFI 1815
             GD+                DD E    P  +DG D  +           DVNTKA+NFI
Sbjct: 513  HGDFVRLKSNNI------GADDVEESSCP--SDGGD--SPVVTPLSCPSPDVNTKADNFI 562

Query: 1816 TKFRAGLKLEKINSMNKRQGLGLSNLG 1896
             +FRAGL+LEK+NS+ +++G   SNLG
Sbjct: 563  ARFRAGLRLEKMNSVKEKEGRRRSNLG 589


>emb|CBI35923.3| unnamed protein product [Vitis vinifera]
          Length = 628

 Score =  171 bits (434), Expect = 9e-40
 Identities = 175/639 (27%), Positives = 253/639 (39%), Gaps = 29/639 (4%)
 Frame = +1

Query: 148  MEEDGEDFTPFWLQSTAKRRRTDRVRDXXXXXXXXXXXXXXXXXVTAASFLVFVVPSTLS 327
            MEE+G   TPFW+ +++  RR    R                  +TA  F+VFV+P  LS
Sbjct: 39   MEEEGAT-TPFWMPASSGHRRRRSSRSPSSIFLSSGFLIIFLP-LTALLFIVFVLPPILS 96

Query: 328  FSAQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSRATNQDRDSDEYQTSPVSRDDVQKSSP 507
            F++ IF+PN VKKSWDS               SR         E   S V  +  Q+S+ 
Sbjct: 97   FTSYIFKPNMVKKSWDSLNLVLVLFAIICGFLSRGGGGGSSDMESSVSEVPEESTQRSNH 156

Query: 508  STPQQKWYNYSTDEDQKSNLSAFQQWYGYSDQMADNQVNVSRGGLRR--TSSSYPDLLEA 681
                                      + Y ++++        GG+RR  +SSSYPDL + 
Sbjct: 157  G-------------------------HCYEERIS------GYGGMRRMRSSSSYPDLRQE 185

Query: 682  SSQFTSGDDPWRFYDDMTVDTYRV-SETGQLLRRRSWKDKFEVSRVVESKNIHVDTFVNR 858
            S+ +  GD  WR +DD  +D +RV     QL  RR ++D+       E KNI VD     
Sbjct: 186  SA-WAGGDGRWRSFDDTQLDNHRVLGSHRQLYIRRRYEDQ----DYCEVKNIDVDNTSMI 240

Query: 859  PEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXMPKEKPKRVHRSLAHKGERSKKKDNELEN 1038
              K                            + K K KR  +++A +  R  ++++  E+
Sbjct: 241  SPKEKVLSHIPPRPPSPPLPPSPPPPPPPPPVVKRKVKRSFQAVAREERRETRENSSFES 300

Query: 1039 REPISSPVTXXXXXXXXXXXXRFVDQKIGKTDKKRSGGSATKDFLNSLYHXXXXXXXXX- 1215
            +   ++P                V+++  K+D+KR G  ATK+FL SLY+          
Sbjct: 301  KRVQAAPPPPPPPPPPPPPLA--VERRSEKSDRKRGG--ATKEFLTSLYYQRNKKKKQRQ 356

Query: 1216 -SIENLDSLLHLSQTPPLQFQXXXXXXXXXXXXXXXXXXXVFHNLFPXXXXXXXXXDLDI 1392
             S+ENLD++LH S  P                        VFHNLF             +
Sbjct: 357  KSMENLDTILHNS--PHSDQPLRPPPSPPPPPPLPPPPNSVFHNLFSSKKGKSKRF---L 411

Query: 1393 SASTFSQPKPPTPTPAIHSKPSTSEIIKVSTH---------KSPKPVKIRSFDSVXXXXX 1545
            +      P PP P    ++  + ++I    +H         K P P K  SF+SV     
Sbjct: 412  TVPPPPPPPPPPPASRAYAGKTKTKIALSRSHPYDHPLNASKPPIPEKSSSFNSVDGNPY 471

Query: 1546 XXXXXXXXXXXXXXXXXXXFTKSLSWKFVVQGDYXXXXXXXXXXXXXPDID--------- 1698
                               F K   WKFVV GDY             PD+D         
Sbjct: 472  AGSESLLIPVPPPPPPPPPF-KMPDWKFVVHGDYVRIKSTNSSRSGSPDLDYIGSPSSKG 530

Query: 1699 DAESDGTPTATDGEDAVAXXXXXXXXXXXDVNTKAENFITKFRAGLKLEKINSMNKRQGL 1878
             + S    + T+G D+             DVNTKA+ FI +FRAGLKLEKINS+ ++Q +
Sbjct: 531  PSRSTSLKSETEGGDSAQPLFCPSP----DVNTKADTFIARFRAGLKLEKINSIKEKQEV 586

Query: 1879 GLSNLG------RGSDSTQL*RGLYFSSSTNYHLFICLF 1977
            G+SNLG      +GS +  L    +  S  NY L+   F
Sbjct: 587  GMSNLGPEPGQAQGSGAWGLCGSGFPDSVCNYKLWYRFF 625


>ref|XP_002521366.1| conserved hypothetical protein [Ricinus communis]
            gi|223539444|gb|EEF41034.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 553

 Score =  156 bits (395), Expect = 3e-35
 Identities = 159/614 (25%), Positives = 228/614 (37%), Gaps = 26/614 (4%)
 Frame = +1

Query: 154  EDGEDFTPFWLQSTAKRRRTDRVRDXXXXXXXXXXXXXXXXXVTAASFLVFVVPSTLSFS 333
            E+ ED  PFWLQ+T +  R  R+R                  V A  F+  VVPS ++F+
Sbjct: 2    EEEEDVPPFWLQATDQHHRGRRLRRQASSIFLNSGVILIMLLVIAFVFVFVVVPSVVTFT 61

Query: 334  AQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSR--ATNQDRDSDEYQ--------TSPVSR 483
            +Q+F+PN +KK WDS                R      +  S  YQ        +S   +
Sbjct: 62   SQVFKPNLIKKGWDSLNFVLVLFAIVCGFLGRNSPNTSNESSTSYQRLSSSSSASSSNVQ 121

Query: 484  DDVQKSSPSTPQQKWYNYSTDEDQKSNLSAFQQWYGYSDQMADNQVNVSRGGLRRTSSSY 663
             DVQ+S PSTP  +WY+    +D+ ++ + F +                     R+  SY
Sbjct: 122  QDVQRSYPSTPAYRWYDDGQYQDRTASYNTFNR--------------------LRSFRSY 161

Query: 664  PDLLEASSQFTSGDDPWRFYDDMTVDTYRVS---------------ETGQLLRRRSWKDK 798
            PDL +  S +++ D+ WRFYDD  V+ Y+ S               +  Q   +   K  
Sbjct: 162  PDLRQ-ESLWSNNDERWRFYDDTRVNGYKFSSPLHQDELQDDHPPQQQQQEQDQEPRKQD 220

Query: 799  FEVSRVVESKNIHVDTFVNRPEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXMPKEKPKRV 978
             E  + V +K+I VDTFV   E+                              K + KR 
Sbjct: 221  QEQEQDVSTKDIAVDTFVIHKEEVVQTPPPPMPPAPVSPPRLPTRSTV-----KRRAKRT 275

Query: 979  HRSLAHKGERSKKKDNELENREPISSPVTXXXXXXXXXXXXRFVDQKIGKTDKKRSGGSA 1158
            +  L   GE  K+++N+    + I+ P                      K+DK+R     
Sbjct: 276  YHDL---GEHEKRRENKNLEVKTINIPPPPPPPQLIS-----------SKSDKRRG---- 317

Query: 1159 TKDFLNSLYHXXXXXXXXXSIENLDSLLHLSQTPPLQFQXXXXXXXXXXXXXXXXXXXVF 1338
             KD L SL           S+ENL+SL +    P +                       F
Sbjct: 318  -KDLLISL-RRKRKKQRQKSVENLESLFNPEPLPSI--------IPPPPPPPPPPPPHFF 367

Query: 1339 HNLFPXXXXXXXXXDLDISASTFSQPKPPTPTPAIHSKPSTSEIIKVSTHKSPKPVKIRS 1518
             NLF               + +  QP+PP+ T   H   +T +   +  +K  K VK  +
Sbjct: 368  QNLFSSKKGKTKKD----HSHSVPQPQPPSRT---HRSRTTVQEATIEAYKPLKAVKTGN 420

Query: 1519 FDSVXXXXXXXXXXXXXXXXXXXXXXXXFT-KSLSWKFVVQGDYXXXXXXXXXXXXXPDI 1695
            F SV                           K   WKF+  GDY             PDI
Sbjct: 421  FSSVEENVERGNASPLIPIPPPPPPPPPPPFKMKPWKFISDGDYVRVASFNSSRSGSPDI 480

Query: 1696 DDAESDGTPTATDGEDAVAXXXXXXXXXXXDVNTKAENFITKFRAGLKLEKINSMNKRQG 1875
            D  +     ++    +              DVNTKAENFI +FRAGLKLEKINS+     
Sbjct: 481  DSEDPSDKESSPMARNKEGDSAMPSFCPSPDVNTKAENFIARFRAGLKLEKINSVK---- 536

Query: 1876 LGLSNLGRGSDSTQ 1917
             G SNLG G D  +
Sbjct: 537  -GRSNLGPGPDRVE 549


>ref|XP_002329058.1| predicted protein [Populus trichocarpa]
            gi|566150019|ref|XP_006369280.1| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
            gi|550347738|gb|ERP65849.1| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 560

 Score =  133 bits (335), Expect = 3e-28
 Identities = 152/608 (25%), Positives = 216/608 (35%), Gaps = 32/608 (5%)
 Frame = +1

Query: 154  EDGEDFTPFWLQSTAKRRRTDRVRDXXXXXXXXXXXXXXXXXVTAASFLVFVVPSTLSFS 333
            ++ ED   FWLQ+T  R R+  +R                  V A +F++ VV S  S +
Sbjct: 2    DEEEDMPLFWLQATNTRHRSRGLRRQTSSIFLNSGVFLVILLVVALAFVLVVVSSIGSLT 61

Query: 334  AQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSRATNQ-----------DRDSDEYQTSPVS 480
            +QI RP ++KKSWDS               S   +            D ++  Y     S
Sbjct: 62   SQILRPQSIKKSWDSLNLVLVLFAIVCGFLSSNNSSGSSGSGSGSGGDNENTSYYEDQ-S 120

Query: 481  RDDVQKSS--PSTPQQKWYNYSTDEDQKSNLSAFQQWYGYSDQMADNQVNVSRGGLRRTS 654
              +VQK S   STP  +W+ +                        D  V+ +     R+ 
Sbjct: 121  LSNVQKPSHPSSTPSHRWFEHQ-----------------------DRTVSYNTLNRLRSF 157

Query: 655  SSYPDLLEASSQFTSGDDPWRFYDDMTVDTYRVSETGQLLRRRSWKDKFEVSRVVES--- 825
            SSYPDL + S +       WRFYDD  ++ YR S +   +     +   E  +  E    
Sbjct: 158  SSYPDLRQESLE------RWRFYDDTHLNNYRFSTSSDQIHHHYPQQVEETKKQEEGVGV 211

Query: 826  KNIHVDTFVNRPEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXMPKEKPKRVHRSLAHKGE 1005
            K+I VDTFV   ++                            + + K KR ++ L ++ +
Sbjct: 212  KDIDVDTFVINQKEVSYPSSPPPFPPPHPSSSPPLPPSPPPKLVRRKVKRTYQDLGYE-K 270

Query: 1006 RSKKKDNELENREPISSPVTXXXXXXXXXXXXRFVDQKIGKTDKKRSGGSATKDFLNSLY 1185
            R+  ++  LEN   I  P                      K +K+R      KDFL SL 
Sbjct: 271  RTDHEEKVLENFYNIPPPSPPPPPPPPPPPPPPI----FSKNEKRRG-----KDFLISL- 320

Query: 1186 HXXXXXXXXXSIENLDSLLHLSQTPPLQFQXXXXXXXXXXXXXXXXXXXVFHNLFPXXXX 1365
                      S+ENLDS  +   TP                           NLF     
Sbjct: 321  RRKKKKQRQKSVENLDSFFNPQPTPTSTLPLIPPPPPPPPPPHF------LQNLFSKKGK 374

Query: 1366 XXXXXDLDISASTFSQPKPPTPTPA-----IHSKPSTS----EIIKVSTHKSPKPVKIRS 1518
                  +         P PP P P      + S+  TS    ++  +++ K P+P K R 
Sbjct: 375  TKKLHPV---------PPPPPPPPVTRVSKVVSQKVTSRTKVQVAPLTSDKPPEPAKTRR 425

Query: 1519 FDSVXXXXXXXXXXXXXXXXXXXXXXXXFTKSLSWKFVVQGDYXXXXXXXXXXXXXPDID 1698
            F SV                          K  +WKFV  GDY             PD+D
Sbjct: 426  FHSVEENVERGNASRLIPLPPPPPPPPF--KMPAWKFVHDGDYVRVGSFNSSRSGSPDLD 483

Query: 1699 DAESDGTP-------TATDGEDAVAXXXXXXXXXXXDVNTKAENFITKFRAGLKLEKINS 1857
              E   +         A  G D+ A           DVNTKA+NFI +FRAGL LEK+NS
Sbjct: 484  SIEDASSEKDQSSPVAAASGSDSAATALFCPSP---DVNTKADNFIARFRAGLTLEKVNS 540

Query: 1858 MNKRQGLG 1881
             N+R  LG
Sbjct: 541  ANRRSNLG 548


>gb|EXB29688.1| hypothetical protein L484_013462 [Morus notabilis]
          Length = 530

 Score =  125 bits (314), Expect = 7e-26
 Identities = 90/249 (36%), Positives = 122/249 (48%), Gaps = 9/249 (3%)
 Frame = +1

Query: 148 MEEDGED-FTPFWLQSTAKRRRTDRVR---DXXXXXXXXXXXXXXXXXVTAASFLVFVVP 315
           ME D E+  TPFW QS+   RR D  R                     VTA +F+  ++P
Sbjct: 1   MEGDQENSLTPFWPQSSDSIRRADHRRRRLSRSSSLLFNSGAVLIALIVTALAFIFVIIP 60

Query: 316 STLSFSAQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSRATNQDRDSDEYQTSPVSRDDVQ 495
           S LSF++QIFRP++VKKSWDS               SR + ++  S+ +    VS +  Q
Sbjct: 61  SFLSFTSQIFRPHSVKKSWDSLNLVLVLFAIVCGFLSRNSTENTSSN-HDDQRVSNEGGQ 119

Query: 496 KSSPSTPQQKWYNYSTDEDQKSNLSAFQQWYGYSDQMADNQVNVSRGGLRRTSSSYPDLL 675
           KS+PSTP                     QWY YSD+   +  N       R+SSSYPDL 
Sbjct: 120 KSNPSTP--------------------HQWYEYSDRTQSDSFNSRIYRRMRSSSSYPDLR 159

Query: 676 EASSQFTSGDDPWRFYDDMTVDTYRVSETGQ-----LLRRRSWKDKFEVSRVVESKNIHV 840
           + SS + S D+ WRFYDD  V  YR S++ Q       RRR W +  E  + +++KNI V
Sbjct: 160 QESS-WVSRDEQWRFYDDTHVANYRPSDSDQHHHQLQYRRRPWNEPEEEIQ-LQTKNIEV 217

Query: 841 DTFVNRPEK 867
           DTF  R ++
Sbjct: 218 DTFEVRAKE 226



 Score = 59.3 bits (142), Expect = 6e-06
 Identities = 59/200 (29%), Positives = 80/200 (40%), Gaps = 11/200 (5%)
 Frame = +1

Query: 1333 VFHNLFPXXXXXXXXXDLDISASTFSQPKPPTPTPAIHSKPSTSEIIKVSTHKSPK-PVK 1509
            VFH LF              S S  S P PP P P++    S ++   +   + P  PV 
Sbjct: 336  VFHTLFSSKKGKTKKVH-SFSQSPSSPPPPPPPPPSVRVSKSKAQSRPIPVTQKPSLPVH 394

Query: 1510 IRSFDSVXXXXXXXXXXXXXXXXXXXXXXXXFTKSLSWKFVVQGDYXXXXXXXXXXXXXP 1689
              + DSV                          +   W+FV  GD+             P
Sbjct: 395  TSNVDSVEENTKIGSESPLIPIPPPPPPPPF--RFQEWRFVRHGDFVRIKSDNSSRSGSP 452

Query: 1690 DIDDAESD----GTPTA-TDGEDAVAXXXXXXXXXXXDVNTKAENFITKFRAGLKLEKIN 1854
            ++D  E       +P A T+G  + A           DV+TKA+NFI +FRAGL LEK N
Sbjct: 453  ELDGCEDSPVGGASPLAVTEGGMSPARMFCPSP----DVDTKADNFIARFRAGLILEKEN 508

Query: 1855 SMNKR----QGLGL-SNLGR 1899
            S+ +R      LGL +NLGR
Sbjct: 509  SIKERDRGKSRLGLEANLGR 528


>ref|XP_006390619.1| hypothetical protein EUTSA_v10019633mg [Eutrema salsugineum]
            gi|557087053|gb|ESQ27905.1| hypothetical protein
            EUTSA_v10019633mg [Eutrema salsugineum]
          Length = 548

 Score =  123 bits (309), Expect = 3e-25
 Identities = 157/600 (26%), Positives = 226/600 (37%), Gaps = 18/600 (3%)
 Frame = +1

Query: 151  EEDGEDFTPFWLQSTAKRRRTDRVRDXXXXXXXXXXXXXXXXXVTAASFLV--FVVPSTL 324
            EEDG+  TPFWLQS   RR     R                    AA+ L+  F++P   
Sbjct: 3    EEDGDASTPFWLQS---RRNNTYFRRTSSLGGRATTVATQVFFAGAAAILIVFFIIPPFF 59

Query: 325  SFSAQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSRATNQDRDSDEYQTSPVSRDDVQKSS 504
            +  +QIFRP+ V+KSWD                SR    D  +         +++V K++
Sbjct: 60   TSVSQIFRPHLVRKSWDYLNFVLVLFAVLCGFLSRNAGNDESTHH-------KEEVSKNN 112

Query: 505  PSTPQQKWYNYST-----DEDQKSNLSAFQQWYG--YSDQMADNQVNVSRGGLRRTSSSY 663
                   +  +S+     D  + SN +  + W      DQ  D+ V   R    R+SSSY
Sbjct: 113  EVINGYGFNKFSSSPLIIDRGRVSNGATPRYWIDDRGGDQFPDHTV-YKRISRLRSSSSY 171

Query: 664  PDLLEASSQFTSGDDPWRFYDDMTVDTYRVSETGQLLRRRSWKDKFEVSRVVESKNIHVD 843
            PDL     +F + D  WRFYDD  V   R   +  + +   W +       ++  +   D
Sbjct: 172  PDL--RLPEFDT-DQRWRFYDDTRVSQCRYEASDPIYQNPVWPEVKSPEEDIDQTD-GGD 227

Query: 844  TFVNRPEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXMPKEKPKRVHRSLAHKGERSKKKD 1023
               N  EK                              + + KRV++ +A K E+ ++ D
Sbjct: 228  GGGNVTEK-VEVVATATAEVVEELSPPPPSAPASPPRAQRRTKRVYQDVARKEEKKERAD 286

Query: 1024 --NELENREPISSPVTXXXXXXXXXXXXRFVDQKIGKTDKKRSGGSATKDFLNSLYHXXX 1197
                     P+  P T              V+QK  K +KK+ GG ATK+FL +L     
Sbjct: 287  FVTATPPMTPVPPPAT--------------VNQKSNKQEKKKKGG-ATKEFLIAL-RRKK 330

Query: 1198 XXXXXXSIENLDSLLHLSQTPPLQFQXXXXXXXXXXXXXXXXXXXVFHNLFPXXXXXXXX 1377
                  SI+ LD LL  S +PPL +                     F +LF         
Sbjct: 331  KKQRQQSIDGLD-LLFGSDSPPLAYS--MPPKSPPHPPPPPPPPPFFQSLFSSKKGK--- 384

Query: 1378 XDLDISASTFSQPKPPTPTPAIHSKPSTSEIIKV------STHKSPKP-VKIRSFDSVXX 1536
                 S  T+S P PP P P   +  S + + K+      S    P P  K+  F     
Sbjct: 385  -----SKRTYSTPPPPPPPPPERNFESRASMAKIRKAPMESRTSKPNPAAKVSQFVGTGS 439

Query: 1537 XXXXXXXXXXXXXXXXXXXXXXFTKSLSWKFVVQGDYXXXXXXXXXXXXXPDIDDAESDG 1716
                                    K  +WKFV +GDY             PD DD++   
Sbjct: 440  ESPLMPIPPPPPPPPF--------KMPAWKFVKRGDYVRMASNISISSDEPD-DDSD--- 487

Query: 1717 TPTATDGEDAVAXXXXXXXXXXXDVNTKAENFITKFRAGLKLEKINSMNKRQGLGLSNLG 1896
                 DG+ A +           DV+TKA++FI +FRAGLKLEK+NS+ +    G SNLG
Sbjct: 488  VAQLADGKAASS-----MFCPSPDVDTKADDFIARFRAGLKLEKMNSVKR----GRSNLG 538


>ref|XP_006302100.1| hypothetical protein CARUB_v10020090mg [Capsella rubella]
            gi|482570810|gb|EOA34998.1| hypothetical protein
            CARUB_v10020090mg [Capsella rubella]
          Length = 550

 Score =  121 bits (304), Expect = 1e-24
 Identities = 153/611 (25%), Positives = 214/611 (35%), Gaps = 29/611 (4%)
 Frame = +1

Query: 151  EEDGEDFTPFWLQSTAKRRRTDRVRDXXXXXXXXXXXXXXXXXVTAASFL--VFVVPSTL 324
            EEDG+  +PFWLQS   RR     R                    AA+ L  VF++P   
Sbjct: 3    EEDGDASSPFWLQS---RRNNTYFRRTASLGGRATTVATQIFFAGAAAILIVVFIIPPLF 59

Query: 325  SFSAQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSRATNQDRDSDEYQTSPVSRDDVQKSS 504
            S  +QIFRP+ V+KSWD                SR TN D      +T+    DD     
Sbjct: 60   SSVSQIFRPHLVRKSWDYLNFVLVLFAVLCGFLSRNTNND------ETNHNKEDDESDKF 113

Query: 505  PSTPQQKWYNYSTDEDQKSNLSAFQQWYGYSDQMADNQVNVSRGGLRRTSSSYPDLLEAS 684
             ++P     +     D+ SN +  + W    D    +Q    R    R+ SSYPDL    
Sbjct: 114  LNSP-----SIVDRGDRVSNGATPRYWI---DDRGGDQTVYKRFSRLRSVSSYPDLRLRE 165

Query: 685  SQFTSGDDPWRFYDDMTVDTYRVSETGQLLRRRSWKDKFEVSR-----VVES-------- 825
             +    D+ WRFYDD  V   R   T  +   +S+++  E ++     VV++        
Sbjct: 166  YE---ADERWRFYDDTRVSQCRYEHTDPIYPNQSYRNWQEEAKPPPGDVVQTERDGSNGD 222

Query: 826  -KNIHVDTFVNRPEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXMP------KEKPKRVHR 984
             + +H+D  V    +                             P      K K KRV++
Sbjct: 223  ERKVHIDGSVAEKVEVVATAKAEVVEELPVPSAPPYIPSPPPSPPPQPKQAKRKTKRVYQ 282

Query: 985  SLAHKGERSKKKDNELENREPISSPVTXXXXXXXXXXXXRFVDQKIGKTDKKRSGGSATK 1164
             +  K E+ ++ D       PI  P T              V Q+  K +KK+ GG ATK
Sbjct: 283  DVPPKEEKKERADFSEVATPPILPPTT--------------VHQRSNKPEKKKKGG-ATK 327

Query: 1165 DFLNSLYHXXXXXXXXXSIENLDSLLHLSQTPPLQFQXXXXXXXXXXXXXXXXXXXVFHN 1344
            DFL  L           SI+ LD L      PPL +                        
Sbjct: 328  DFLVVL-RRKKKKQRQQSIDGLDLL--FGSDPPLVYTMPLPPPPPPPPPPPPP------- 377

Query: 1345 LFPXXXXXXXXXDLDISASTFSQPKPPTPTPAIHSKPSTSEI-------IKVSTHKSPKP 1503
              P            I  +  + P PP P P      S + +       ++  T K   P
Sbjct: 378  --PFLRGLFSSKKGKIKRTNSNPPPPPPPPPPERRYESRASMAKSRKTPVQSRTSKPNPP 435

Query: 1504 VKIRSFDSVXXXXXXXXXXXXXXXXXXXXXXXXFTKSLSWKFVVQGDYXXXXXXXXXXXX 1683
             K+  F                             K  +WKFV +GDY            
Sbjct: 436  TKVTQFVGTGSESPLLPIPPPPPPPPF--------KMPAWKFVKRGDYVRMASDISISSD 487

Query: 1684 XPDIDDAESDGTPTATDGEDAVAXXXXXXXXXXXDVNTKAENFITKFRAGLKLEKINSMN 1863
             PD  D     T +                    DV+TKA++FI +FRAGLKLEK+NS+ 
Sbjct: 488  EPDDTDVAQSATGS--------------MFCPSPDVDTKADDFIARFRAGLKLEKMNSVK 533

Query: 1864 KRQGLGLSNLG 1896
            +    G SNLG
Sbjct: 534  R----GRSNLG 540


>ref|NP_177422.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis
            thaliana] gi|12323765|gb|AAG51845.1|AC010926_8 unknown
            protein; 15669-13984 [Arabidopsis thaliana]
            gi|24030251|gb|AAN41301.1| unknown protein [Arabidopsis
            thaliana] gi|332197252|gb|AEE35373.1| hydroxyproline-rich
            glycoprotein family protein [Arabidopsis thaliana]
          Length = 561

 Score =  118 bits (295), Expect = 1e-23
 Identities = 159/621 (25%), Positives = 221/621 (35%), Gaps = 39/621 (6%)
 Frame = +1

Query: 151  EEDGEDFTPFWLQSTAKRRRTDRVRDXXXXXXXXXXXXXXXXXVTAASFLV-FVVPSTLS 327
            E+DG+  TPFWLQS  +R  T   R                   TAA  +V F++P   S
Sbjct: 3    EDDGDASTPFWLQS--RRNNTYFRRTASLGGRTTTIATQIFFAGTAAILIVVFIIPPFFS 60

Query: 328  FSAQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSRATNQDRDS--------DEYQTSPVSR 483
              +QIFRP+ V+KSWD                SR TN D  +        +++ TSP   
Sbjct: 61   SVSQIFRPHLVRKSWDYLNFVLVLFAVLCGFLSRNTNNDESNHHKEEDIRNKFSTSPSII 120

Query: 484  DDVQKSSPSTPQQKWYNYSTDEDQKSNLSAFQQWYGYSDQMADNQVNVSRGGLRRTSSSY 663
            D   + S S    +++N     D +          G  DQ         R    R+ SSY
Sbjct: 121  DRRSRVSNSGTTPRYWN-----DDRGG--------GGGDQTV-----YKRFSRLRSVSSY 162

Query: 664  PDLLEASSQFTSGDDPWRFYDDMTVDTYRVSETGQLLRRRSWK----------------- 792
            PDL     +    D+ WRFYDD  V   R  +   +   +S++                 
Sbjct: 163  PDLRLREYE---ADERWRFYDDTRVSQCRYEDVDPIYPNQSYRNWHEEGKPPPEDVDQTE 219

Query: 793  --DKFEVSRVVE--SKNIHVDTFVNRPEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXMPK 960
              D  E S+V    S+   V+       +                              K
Sbjct: 220  DGDNGEGSKVRNGGSETEKVEVVATAEAEVVEELKVPSAPPYIPSPPPSPPRPPPAKQAK 279

Query: 961  EKPKRVHRSLAHKGERSKKKDNELENREPISSPVTXXXXXXXXXXXXRFVDQKIGKTDKK 1140
             K  RV++ ++ + E  K++D+ +    PI  P T              V QK  K +KK
Sbjct: 280  RKTNRVYQDVSPQ-EEKKERDDFVATTTPIPPPAT--------------VYQKSNKQEKK 324

Query: 1141 RSGGSATKDFLNSLYHXXXXXXXXXSIENLDSLLHLSQTPPLQFQXXXXXXXXXXXXXXX 1320
            +  G ATKDFL +L           SI+ LD L      PPL +                
Sbjct: 325  K--GGATKDFLIAL-RRKKKKQRQQSIDGLDLL--FGSDPPLVYS---------PPPPPP 370

Query: 1321 XXXXVFHNLFPXXXXXXXXXDLDISASTFSQPKPPTPTP----AIHSKPSTSEI----IK 1476
                 F  LF              S    S P PP P P       S+ STS++    ++
Sbjct: 371  PPPPFFQGLFSSKKGK--------SKKNNSNPPPPPPPPPPERRYESRASTSKLRKAPVE 422

Query: 1477 VSTHKSPKPVKIRSFDSVXXXXXXXXXXXXXXXXXXXXXXXXFTKSLSWKFVVQGDYXXX 1656
              T K   P K+  +                             K  +WKFV +GDY   
Sbjct: 423  SRTSKPNPPAKVTQYVGTGSESPLMPIPPPPPPPPF--------KMPAWKFVKRGDYVRM 474

Query: 1657 XXXXXXXXXXPDIDD-AESDGTPTATDGEDAVAXXXXXXXXXXXDVNTKAENFITKFRAG 1833
                      PD  D A+S G+  A                   DV+TKA++FI +FRAG
Sbjct: 475  ASDISISSDEPDDPDVAQSAGSKEAAGS----------MFCPSPDVDTKADDFIARFRAG 524

Query: 1834 LKLEKINSMNKRQGLGLSNLG 1896
            LKLEK+NS+ +    G SNLG
Sbjct: 525  LKLEKMNSVKR----GRSNLG 541


>gb|AAM13859.1| unknown protein [Arabidopsis thaliana]
          Length = 535

 Score =  114 bits (286), Expect = 1e-22
 Identities = 154/611 (25%), Positives = 216/611 (35%), Gaps = 39/611 (6%)
 Frame = +1

Query: 151  EEDGEDFTPFWLQSTAKRRRTDRVRDXXXXXXXXXXXXXXXXXVTAASFLV-FVVPSTLS 327
            E+DG+  TPFWLQS  +R  T   R                   TAA  +V F++P   S
Sbjct: 3    EDDGDASTPFWLQS--RRNNTYFRRTASLGGRTTTIATQIFFAGTAAILIVVFIIPPFFS 60

Query: 328  FSAQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSRATNQDRDS--------DEYQTSPVSR 483
              +QIFRP+ V+KSWD                SR TN D  +        +++ TSP   
Sbjct: 61   SVSQIFRPHLVRKSWDYLNFVLVLFAVLCGFLSRNTNNDESNHHKEEDIRNKFSTSPSII 120

Query: 484  DDVQKSSPSTPQQKWYNYSTDEDQKSNLSAFQQWYGYSDQMADNQVNVSRGGLRRTSSSY 663
            D   + S S    +++N     D +          G  DQ         R    R+ SSY
Sbjct: 121  DRRSRVSNSGTTPRYWN-----DDRGG--------GGGDQTV-----YKRFSRLRSVSSY 162

Query: 664  PDLLEASSQFTSGDDPWRFYDDMTVDTYRVSETGQLLRRRSWK----------------- 792
            PDL     +    D+ WRFYDD  V   R  +   +   +S++                 
Sbjct: 163  PDLRLREYE---ADERWRFYDDTRVSQCRYEDVDPIYPNQSYRNWHEEGKPPPEDVDQTE 219

Query: 793  --DKFEVSRVVE--SKNIHVDTFVNRPEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXMPK 960
              D  E S+V    S+   V+       +                              K
Sbjct: 220  DGDNGEGSKVRNGGSETEKVEVVATAEAEVVEELKVPSAPPYIPSPPPSPPRPPPAKQAK 279

Query: 961  EKPKRVHRSLAHKGERSKKKDNELENREPISSPVTXXXXXXXXXXXXRFVDQKIGKTDKK 1140
             K  RV++ ++ + E  K++D+ +    PI  P T              V QK  K +KK
Sbjct: 280  RKTNRVYQDVSPQ-EEKKERDDFVATTTPIPPPAT--------------VYQKSNKQEKK 324

Query: 1141 RSGGSATKDFLNSLYHXXXXXXXXXSIENLDSLLHLSQTPPLQFQXXXXXXXXXXXXXXX 1320
            +  G ATKDFL +L           SI+ LD L      PPL +                
Sbjct: 325  K--GGATKDFLIAL-RRKKKKQRQQSIDGLDLL--FGSDPPLVYS---------PPPPPP 370

Query: 1321 XXXXVFHNLFPXXXXXXXXXDLDISASTFSQPKPPTPTP----AIHSKPSTSEI----IK 1476
                 F  LF              S    S P PP P P       S+ STS++    ++
Sbjct: 371  PPPPFFQGLFSSKKGK--------SKKNNSNPPPPPPPPPPERRYESRASTSKLRKAPVE 422

Query: 1477 VSTHKSPKPVKIRSFDSVXXXXXXXXXXXXXXXXXXXXXXXXFTKSLSWKFVVQGDYXXX 1656
              T K   P K+  +                             K  +WKFV +GDY   
Sbjct: 423  SRTSKPNPPAKVTQYVGTGSESPLMPIPPPPPPPPF--------KMPAWKFVKRGDYVRM 474

Query: 1657 XXXXXXXXXXPDIDD-AESDGTPTATDGEDAVAXXXXXXXXXXXDVNTKAENFITKFRAG 1833
                      PD  D A+S G+  A                   DV+TKA++FI +FRAG
Sbjct: 475  ASDISISSDEPDDPDVAQSAGSKEAAGS----------MFCPSPDVDTKADDFIARFRAG 524

Query: 1834 LKLEKINSMNK 1866
            LKLEK+NS+ +
Sbjct: 525  LKLEKMNSVKR 535


>ref|XP_002887451.1| hypothetical protein ARALYDRAFT_476416 [Arabidopsis lyrata subsp.
            lyrata] gi|297333292|gb|EFH63710.1| hypothetical protein
            ARALYDRAFT_476416 [Arabidopsis lyrata subsp. lyrata]
          Length = 552

 Score =  114 bits (285), Expect = 2e-22
 Identities = 152/609 (24%), Positives = 214/609 (35%), Gaps = 27/609 (4%)
 Frame = +1

Query: 151  EEDGEDFTPFWLQSTAKRRRTDRVRDXXXXXXXXXXXXXXXXXVTAASFLVF-VVPSTLS 327
            EEDG+  TPFWLQS  +R  T   R                   TAA  +VF ++P   S
Sbjct: 3    EEDGDASTPFWLQS--RRNNTYFRRTASLGGRATTVATQIFFAGTAAILIVFFIIPPLFS 60

Query: 328  FSAQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSRATNQDRDSDEYQTSPVSRDDVQKSSP 507
              +Q+FRP+ V+KSWD                SR TN D      +T+    +D+     
Sbjct: 61   SVSQVFRPHLVRKSWDYLNFVLVLFAVLCGFLSRNTNND------ETNHNKEEDISNKFS 114

Query: 508  STPQQKWYNYSTDEDQKSNLSAFQQWYGYSDQMADNQVNVSRGGLRRTSSSYPDLLEASS 687
            ++P     +      + SN +  + W         +Q    R    R+ SSYPDL     
Sbjct: 115  NSP-----SIIDRGGRVSNSATPRYWIDDRGGGGGDQTVYKRFSRLRSVSSYPDLRLREY 169

Query: 688  QFTSGDDPWRFYDDMTVDTYRVSETGQLLRRRSWKD-KFEVSRVVE-------------- 822
            +    D+ WRFYDD  V   R  +   +   +S+++ + EV    E              
Sbjct: 170  E---ADERWRFYDDTRVSQCRYEDVDPIYPNQSYRNWQEEVKPPPEDLDQTEDGGNEGGG 226

Query: 823  ------SKNIHVDTFVNRPEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXMPKEKPKRVHR 984
                  S+   V+ F     +                              K K KRV++
Sbjct: 227  KVHSGGSETEKVEVFETAEAEVVEELTVPSAPPYIPSPPPSPPRPPPPKQAKRKTKRVYQ 286

Query: 985  SLAHKGERSKKKDNELEN-REPISSPVTXXXXXXXXXXXXRFVDQKIGKTDKKRSGGSAT 1161
             +  K E +++ D        PI  P T              V QK  K +KK+  G AT
Sbjct: 287  DVPPKEENNERSDFVAATPMTPIPPPAT--------------VYQKSNKQEKKK--GGAT 330

Query: 1162 KDFLNSLYHXXXXXXXXXSIENLDSLLHLSQTPPLQFQXXXXXXXXXXXXXXXXXXXVFH 1341
            KDFL +L           SI+ LD L      PPL +                     F 
Sbjct: 331  KDFLIAL-RRKKKKQRQQSIDGLDLL--FGSDPPLVYS---------PPPPPPPPPPFFQ 378

Query: 1342 NLFPXXXXXXXXXDLDISASTFSQPKPPTPTPAIHSKPSTSEI----IKVSTHKSPKPVK 1509
             LF              +++    P PP P     S+ S + I    ++  T K   P +
Sbjct: 379  GLFSSKKGKGKKN----NSNPPPPPPPPPPERRYESRASMTSIRKAPVESRTSKPNPPAR 434

Query: 1510 IRSFDSVXXXXXXXXXXXXXXXXXXXXXXXXFTKSLSWKFVVQGDYXXXXXXXXXXXXXP 1689
            +  F                             K  +WKFV +GDY              
Sbjct: 435  VTQFVGTGSESPLMPIPPPPPPPPF--------KMPAWKFVKRGDYVRMASDI------- 479

Query: 1690 DIDDAESDGTPTATDGEDAVAXXXXXXXXXXXDVNTKAENFITKFRAGLKLEKINSMNKR 1869
             I   E D    A   E  VA           DV+TKA++FI +FRAGLKLEK+NS+ + 
Sbjct: 480  SISSDEPDDPDVAQSAEGKVA--AGSMFCPSPDVDTKADDFIARFRAGLKLEKMNSVKR- 536

Query: 1870 QGLGLSNLG 1896
               G SNLG
Sbjct: 537  ---GRSNLG 542


>gb|AAL50061.1| At1g72790/F28P22_2 [Arabidopsis thaliana] gi|19548027|gb|AAL87377.1|
            At1g72790/F28P22_2 [Arabidopsis thaliana]
          Length = 561

 Score =  114 bits (285), Expect = 2e-22
 Identities = 157/621 (25%), Positives = 218/621 (35%), Gaps = 39/621 (6%)
 Frame = +1

Query: 151  EEDGEDFTPFWLQSTAKRRRTDRVRDXXXXXXXXXXXXXXXXXVTAASFLV-FVVPSTLS 327
            E+DG+  TPFWLQS  +R  T   R                   TAA  +V F++P   S
Sbjct: 3    EDDGDASTPFWLQS--RRNNTYFRRTASLGGRTTTIATQIFFAGTAAILIVVFIIPPFFS 60

Query: 328  FSAQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSRATNQDRDS--------DEYQTSPVSR 483
              +QIFRP+ V+KSWD                SR TN D  +        +++ TSP   
Sbjct: 61   SVSQIFRPHLVRKSWDYLNFVLVLFAVLCGFLSRNTNNDESNHHKEEDIRNKFSTSPSII 120

Query: 484  DDVQKSSPSTPQQKWYNYSTDEDQKSNLSAFQQWYGYSDQMADNQVNVSRGGLRRTSSSY 663
            D   + S S    +++N     D +          G  DQ         R    R+ SSY
Sbjct: 121  DRRSRVSNSGTTPRYWN-----DDRGG--------GGGDQTV-----YKRFSRLRSVSSY 162

Query: 664  PDLLEASSQFTSGDDPWRFYDDMTVDTYRVSETGQLLRRRSWKDKFE------------- 804
            PDL     +    D+ WRFYDD  V   R  +       +S ++  E             
Sbjct: 163  PDLRLREYE---ADERWRFYDDTRVSQCRYEDVDPTYPNQSCRNWHEEGKPPPEDVDQTE 219

Query: 805  --------VSRVVESKNIHVDTFVNRPEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXMPK 960
                     +R   S+   V+       +                              K
Sbjct: 220  DGDNGEGSKARNGGSETEKVEVVATAEAEVVEELKVPSAPPYIPSPPPSPPRPPPAKQAK 279

Query: 961  EKPKRVHRSLAHKGERSKKKDNELENREPISSPVTXXXXXXXXXXXXRFVDQKIGKTDKK 1140
             K  RV++ ++ + E  K++D+ +    PI  P T              V QK  K +KK
Sbjct: 280  RKTNRVYQDVSPQ-EEKKERDDFVATTTPIPPPAT--------------VCQKSNKQEKK 324

Query: 1141 RSGGSATKDFLNSLYHXXXXXXXXXSIENLDSLLHLSQTPPLQFQXXXXXXXXXXXXXXX 1320
            +  G ATKDFL +L           SI+ LD L      PPL +                
Sbjct: 325  K--GGATKDFLIAL-RRKKKKQRQQSIDGLDLL--FGSDPPLVYS---------PPPPPP 370

Query: 1321 XXXXVFHNLFPXXXXXXXXXDLDISASTFSQPKPPTPTP----AIHSKPSTSEI----IK 1476
                 F  LF              S    S P PP P P       S+ STS++    ++
Sbjct: 371  PPPPFFQGLFSSKKGK--------SKKNNSNPPPPPPPPPPERRYESRASTSKLRKAPVE 422

Query: 1477 VSTHKSPKPVKIRSFDSVXXXXXXXXXXXXXXXXXXXXXXXXFTKSLSWKFVVQGDYXXX 1656
              T K   P K+  +                             K  +WKFV +GDY   
Sbjct: 423  SRTSKPNPPAKVTQYVGTGSESPLMPIPPPPPPPPF--------KMPAWKFVKRGDYVRM 474

Query: 1657 XXXXXXXXXXPDIDD-AESDGTPTATDGEDAVAXXXXXXXXXXXDVNTKAENFITKFRAG 1833
                      PD  D A+S G+  A                   DV+TKA++FI +FRAG
Sbjct: 475  ASDISISSDEPDDPDVAQSAGSKEAAGS----------MFCPSPDVDTKADDFIARFRAG 524

Query: 1834 LKLEKINSMNKRQGLGLSNLG 1896
            LKLEK+NS+ +    G SNLG
Sbjct: 525  LKLEKMNSVKR----GRSNLG 541


>ref|XP_002279061.1| PREDICTED: uncharacterized protein LOC100261010 [Vitis vinifera]
          Length = 555

 Score =  108 bits (269), Expect = 1e-20
 Identities = 137/562 (24%), Positives = 207/562 (36%), Gaps = 26/562 (4%)
 Frame = +1

Query: 280  VTAASFLVFVVPSTLSFSAQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSRATNQDRDSDE 459
            + A   + F VPS L+F++Q  RPN+V+KSWDS               +R  N +++ D 
Sbjct: 41   ILAMIVVFFAVPSFLNFTSQFLRPNSVRKSWDSLNVLLVLFAILCGVFAR-KNDEKNDDV 99

Query: 460  YQTSPVSRDDVQKSSPSTPQQKWYNYSTDEDQKSNLSAFQQWYGYSDQMADNQVNVSRGG 639
             +    S   V   S  +     + +S   D+K               + D  +      
Sbjct: 100  LENHGSSGSVVMGKSHESISHSLFEFS---DRK---------------IYDPPIQSGSVR 141

Query: 640  LRRTSSSYPDLLEASSQFTSGDDPWRFYDDMTVDTYRVSETGQLLRRRSWKDKFEVSR-V 816
            LRR+SSSYPDL +  S + +GDD  RF+DD  V+ YR   +   +RR     + E+ R  
Sbjct: 142  LRRSSSSYPDLRQ-ESLWGAGDDRRRFFDDFEVNNYRSPASSDYVRRHR---RSELERDD 197

Query: 817  VESKNIHVDTFVNRPEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXMPKEKPKRVHRSLAH 996
             E K I VDTF  R                               + + KP+R + ++A 
Sbjct: 198  SEVKVIPVDTFAVRSS---------PSPSPAPPRTPPPPPPPPPPIVQRKPRRSYETVAR 248

Query: 997  KGERSKK-KDNELENREPISSPVTXXXXXXXXXXXXRFVDQKIGKTDKKRSGGSATKDFL 1173
            K + S    D   ++R P + P                 +QK  K+ ++   G ATKD  
Sbjct: 249  KEKLSNSDADQFKKSRSPPAPPPPPPPPPPPRVPGGHLPEQKSRKSARRM--GGATKDIA 306

Query: 1174 N---SLYHXXXXXXXXXSIENLDSLLHLSQTPPLQFQXXXXXXXXXXXXXXXXXXXVFHN 1344
                SLY+         + +N+    +  Q+PP                       + HN
Sbjct: 307  TVFVSLYNQTRKKKKQRT-KNIHE--NAVQSPP-----SATTPTPPPPPPPPPPPSMLHN 358

Query: 1345 LF-----------------PXXXXXXXXXDLDISASTFSQPKPPTPTPAIHSKPSTSEII 1473
            LF                 P                T   P PPTP P     P TS   
Sbjct: 359  LFRKGSKSKRIHSVSAPPPPPPPPPRPPPPRSSKRKTHIPPAPPTPPPP--PPPDTSR-- 414

Query: 1474 KVSTHKSPKPVKIRSF----DSVXXXXXXXXXXXXXXXXXXXXXXXXFTKSLSWKFVVQG 1641
            + +  K P P +  SF    D+V                          +    K+VV+G
Sbjct: 415  RRAAGKPPLPARKSSFYNRDDNVNSGGQSPLIPMPPPPPPF--------RMPELKYVVRG 466

Query: 1642 DYXXXXXXXXXXXXXPDIDDAESDGTPTATDGEDAVAXXXXXXXXXXXDVNTKAENFITK 1821
            D+             P++DD +     +A DG DA+            DVN KA+ FI +
Sbjct: 467  DFVRIRSTHSSRCSSPELDDVDLSSNKSAMDGGDAIG----ATFCPSPDVNVKADTFIAR 522

Query: 1822 FRAGLKLEKINSMNKRQGLGLS 1887
             R   +LEKINS+ +R+ +GL+
Sbjct: 523  LRGEWRLEKINSLRERKNVGLT 544


>gb|EXB38898.1| hypothetical protein L484_027333 [Morus notabilis]
          Length = 509

 Score = 96.3 bits (238), Expect = 5e-17
 Identities = 104/415 (25%), Positives = 166/415 (40%), Gaps = 12/415 (2%)
 Frame = +1

Query: 295  FLVFVVPSTLSFSAQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSRATNQDRDSDEYQTSP 474
            FL+  VP  LSF++ IFRP  VKKSWD                +R  + +  +++   + 
Sbjct: 49   FLLSAVPPFLSFTSLIFRPIAVKKSWDLLNIFLVLFAILCGIFARRNDDESANNDVVPTA 108

Query: 475  VSRDDVQKSSPSTPQQKWYNYSTDEDQKSNLSAFQQWYGYSDQMADNQVNVSRGGLRRTS 654
                 V++S P+ P Q+W+ +S  +D++S     ++ Y   D+ A++    S   LRR+S
Sbjct: 109  RRSGGVEESEPANP-QRWFAFS--DDRRS-----EKIYDSVDRTAESG---SLRRLRRSS 157

Query: 655  SSYPDLLEASSQFTSGDDP---WRFYDDMTVDTYRVSETGQLLRRRSWKDKFEVSRVVES 825
            SSYPDL +  S + +GDDP   +RF+DD  ++ YRV  T      R  + +   +   E+
Sbjct: 158  SSYPDLRQ-ESLWETGDDPRFQFRFFDDFEINKYRV--TAPFDPSREIRGRRREADDGEA 214

Query: 826  KNIHVDTFVNRPEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXMPKEKPKRVHRSLAHKGE 1005
            K I VDTFV RP                              + + KP+R +R++  + E
Sbjct: 215  KEILVDTFVVRP------TTPPKSPSPSPSPATPSPPPPPPPVERHKPRRTYRAVGERKE 268

Query: 1006 RSKKKDNELEN-------REPISSPVTXXXXXXXXXXXXRFVDQKIGKTDKKRSGGSATK 1164
            +++KK ++  +       R P  +P              R  +Q+  K ++++S      
Sbjct: 269  KAEKKQDDHNDADQFAKVRPPPPTPPPPPPRPPPSPARVR-PEQRHVKLERRKSNVKKEI 327

Query: 1165 DFLNSLYHXXXXXXXXXSIENLDSLLHLSQT--PPLQFQXXXXXXXXXXXXXXXXXXXVF 1338
                +  +          I +  S  H S T  PP + +                   VF
Sbjct: 328  AMAFTSLYNQRKRKKKQKIASSGSHAHDSATSSPPEKTRFPPPSPPPPPPPLPPPPSSVF 387

Query: 1339 HNLFPXXXXXXXXXDLDISASTFSQPKPPTPTPAIHSKPSTSEIIKVSTHKSPKP 1503
            HNLF                 T   P PP P P   S P +S   K      P P
Sbjct: 388  HNLFKKGIKSK-------RIHTIPPPPPPPPPPFSSSPPPSSRPSKHKNRSVPPP 435


>ref|NP_200517.2| hydroxyproline-rich glycoprotein family protein [Arabidopsis
            thaliana] gi|332009460|gb|AED96843.1| hydroxyproline-rich
            glycoprotein family protein [Arabidopsis thaliana]
          Length = 575

 Score = 92.4 bits (228), Expect = 7e-16
 Identities = 123/566 (21%), Positives = 210/566 (37%), Gaps = 40/566 (7%)
 Frame = +1

Query: 295  FLVFVVPSTLSFSAQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSRATNQDRDSDEYQ--- 465
            F+ FVVP+ LS ++QI +P +VK+ WDS               +R  +    S+      
Sbjct: 44   FVTFVVPTFLSVTSQILQPASVKRGWDSINVVLVVFAILCGVLARRNDDGLSSESLHGGE 103

Query: 466  ----------TSPVSRDDVQK--SSPSTPQQKWYNYSTDEDQ-KSNLSAFQQWYGYSDQM 606
                         ++  ++ K  SS ST  ++W++   D D+ K   S   + + +   +
Sbjct: 104  EEEVGGGAVTNGEMTVGEISKISSSSSTVSEQWFDDVYDSDRLKIYESVSSRSFSHGLPV 163

Query: 607  ADNQVNVSRGGLRRTSSSYPDLLEASSQFTSGDDPWRFYDDMTVDTYRVSETGQLLRRRS 786
              N        LRR+SSSYPDL +   + T GD  +RFYDD  +D YR  ++    + ++
Sbjct: 164  TGNV------PLRRSSSSYPDLRQGVFRET-GDRRFRFYDDFEIDKYRSQDSSSYQQFQN 216

Query: 787  WKDKFEVSRVVESKNIHVDTFVNRPEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXMPKEK 966
                       E K I +DTFV +P                                 +K
Sbjct: 217  LSKTEIEEEESEPKEIQIDTFVVKPSSPPQQPPATPPPPPPPPPVEV----------PQK 266

Query: 967  PKRVHRSLAHKGERSKKKDNELENR---EPISSPVTXXXXXXXXXXXXRFVDQKIGKTDK 1137
            P+R HRS+ ++  +   K +E + +   +P  SP                  +K G   +
Sbjct: 267  PRRTHRSVRNRDLQENAKRSETKFKRTFQPPPSPPPPPPPPPPQPLIAATPPRKQGTLQR 326

Query: 1138 KRSGGS-ATKDFLNSLYHXXXXXXXXXSIENLD----SLLHLSQTPPLQFQ-----XXXX 1287
            ++S  +   K    SLY+           +  +    S +    T P Q+Q         
Sbjct: 327  RKSNAAKEIKMVFASLYNQGKKKKKLQKSKRKERIESSPMVEDVTEPPQYQSLIPPPSPP 386

Query: 1288 XXXXXXXXXXXXXXXVFHNLFPXXXXXXXXXDLDISASTFSQPKPPTPTPAIHSK--PST 1461
                           VF+ LF           +  +    S P PP P P  +++  P T
Sbjct: 387  PPPPPPPPPLRSSQSVFYGLF--------KKGVKSNKKIHSVPAPPPPPPPRYTQFDPQT 438

Query: 1462 SEIIKVSTHKSPKPVKIRSFDSVXXXXXXXXXXXXXXXXXXXXXXXXFTKSLSWKFVVQG 1641
                +V + + P+P K ++F+                            +    K+VV G
Sbjct: 439  PP-RRVKSGRPPRPTKPKNFNEENNGQGSPLIQITPPPPPPPPF-----RVPPLKYVVSG 492

Query: 1642 DYXXXXXXXXXXXXXPD---------IDDAESDGTPTATDGEDAVAXXXXXXXXXXXDVN 1794
            D+             P+         ++  +SDG     + + AV+           DV+
Sbjct: 493  DFAKIRSNQSSRCSSPEREVFDIGWGLELTQSDG---GVETKAAVSGGGMPGFCPSPDVD 549

Query: 1795 TKAENFITKFRAGLKLEKINSMNKRQ 1872
            TKA+NFI + R   +L+KINS+N+++
Sbjct: 550  TKADNFIARLRDEWRLDKINSVNRKR 575


>dbj|BAA97357.1| unnamed protein product [Arabidopsis thaliana]
          Length = 607

 Score = 92.0 bits (227), Expect = 9e-16
 Identities = 123/565 (21%), Positives = 209/565 (36%), Gaps = 40/565 (7%)
 Frame = +1

Query: 295  FLVFVVPSTLSFSAQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSRATNQDRDSDEYQ--- 465
            F+ FVVP+ LS ++QI +P +VK+ WDS               +R  +    S+      
Sbjct: 44   FVTFVVPTFLSVTSQILQPASVKRGWDSINVVLVVFAILCGVLARRNDDGLSSESLHGGE 103

Query: 466  ----------TSPVSRDDVQK--SSPSTPQQKWYNYSTDEDQ-KSNLSAFQQWYGYSDQM 606
                         ++  ++ K  SS ST  ++W++   D D+ K   S   + + +   +
Sbjct: 104  EEEVGGGAVTNGEMTVGEISKISSSSSTVSEQWFDDVYDSDRLKIYESVSSRSFSHGLPV 163

Query: 607  ADNQVNVSRGGLRRTSSSYPDLLEASSQFTSGDDPWRFYDDMTVDTYRVSETGQLLRRRS 786
              N        LRR+SSSYPDL +   + T GD  +RFYDD  +D YR  ++    + ++
Sbjct: 164  TGNV------PLRRSSSSYPDLRQGVFRET-GDRRFRFYDDFEIDKYRSQDSSSYQQFQN 216

Query: 787  WKDKFEVSRVVESKNIHVDTFVNRPEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXMPKEK 966
                       E K I +DTFV +P                                 +K
Sbjct: 217  LSKTEIEEEESEPKEIQIDTFVVKPSSPPQQPPATPPPPPPPPPVEV----------PQK 266

Query: 967  PKRVHRSLAHKGERSKKKDNELENR---EPISSPVTXXXXXXXXXXXXRFVDQKIGKTDK 1137
            P+R HRS+ ++  +   K +E + +   +P  SP                  +K G   +
Sbjct: 267  PRRTHRSVRNRDLQENAKRSETKFKRTFQPPPSPPPPPPPPPPQPLIAATPPRKQGTLQR 326

Query: 1138 KRSGGS-ATKDFLNSLYHXXXXXXXXXSIENLD----SLLHLSQTPPLQFQ-----XXXX 1287
            ++S  +   K    SLY+           +  +    S +    T P Q+Q         
Sbjct: 327  RKSNAAKEIKMVFASLYNQGKKKKKLQKSKRKERIESSPMVEDVTEPPQYQSLIPPPSPP 386

Query: 1288 XXXXXXXXXXXXXXXVFHNLFPXXXXXXXXXDLDISASTFSQPKPPTPTPAIHSK--PST 1461
                           VF+ LF           +  +    S P PP P P  +++  P T
Sbjct: 387  PPPPPPPPPLRSSQSVFYGLF--------KKGVKSNKKIHSVPAPPPPPPPRYTQFDPQT 438

Query: 1462 SEIIKVSTHKSPKPVKIRSFDSVXXXXXXXXXXXXXXXXXXXXXXXXFTKSLSWKFVVQG 1641
                +V + + P+P K ++F+                            +    K+VV G
Sbjct: 439  PP-RRVKSGRPPRPTKPKNFNEENNGQGSPLIQITPPPPPPPPF-----RVPPLKYVVSG 492

Query: 1642 DYXXXXXXXXXXXXXPD---------IDDAESDGTPTATDGEDAVAXXXXXXXXXXXDVN 1794
            D+             P+         ++  +SDG     + + AV+           DV+
Sbjct: 493  DFAKIRSNQSSRCSSPEREVFDIGWGLELTQSDG---GVETKAAVSGGGMPGFCPSPDVD 549

Query: 1795 TKAENFITKFRAGLKLEKINSMNKR 1869
            TKA+NFI + R   +L+KINS+N++
Sbjct: 550  TKADNFIARLRDEWRLDKINSVNRK 574


>ref|XP_004288965.1| PREDICTED: uncharacterized protein LOC101306381 [Fragaria vesca
            subsp. vesca]
          Length = 548

 Score = 89.7 bits (221), Expect = 4e-15
 Identities = 128/584 (21%), Positives = 200/584 (34%), Gaps = 58/584 (9%)
 Frame = +1

Query: 295  FLVFVVPSTLSFSAQIFRPN-TVKKSWDSXXXXXXXXXXXXXXXSRATNQ------DRDS 453
            FL++ +P  LS ++ I +P  +VKKSWDS               +R  +       D D 
Sbjct: 29   FLIYAIPPFLSLTSHILQPTVSVKKSWDSLNVFLVIFAILCGVFARKHDDAGEGLPDPDH 88

Query: 454  DEYQTSPVSRDDVQKSSPSTPQQKWYNYSTDEDQKSNLSAFQQWYGYSDQMA-------- 609
                    + D +   + S+PQ +       +         QQW+GYS++ +        
Sbjct: 89   HHVHIHNANSDPLLDRTTSSPQSESSVLHVPQ---------QQWFGYSERTSRMYDTTPV 139

Query: 610  DNQVNVSRGGLRRTSSSYPDLLEASSQFTSGDD---PWRFYDDMTVDTYRVSETGQLLRR 780
                NVS  G  R  SSYPD+ +  S + + DD    +RF+DD  +  Y+          
Sbjct: 140  KTPENVSGDGRLRRRSSYPDMRQVESLWETLDDTKSQFRFFDDFEISNYKT--------H 191

Query: 781  RSWKDKFEVSRVVESKNIHVDTFVNRPEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXMPK 960
            R +K+    +   + K I VDTFV +P                               P+
Sbjct: 192  RHYKE----TTSDDVKEIKVDTFVLQPSPTPPPPPPPPP-------------------PR 228

Query: 961  EKPKRVHRSLAHKGERSKKKDNELENREPISSPVTXXXXXXXXXXXXRF---VDQKIGKT 1131
             + +R + ++   G RSK+K  E++  E  S+P                    DQK G+ 
Sbjct: 229  REKQRTYETV---GRRSKEKVEEVKFEEVRSTPPPPPPPPSTVAPPSPMRVRSDQKHGRL 285

Query: 1132 DKKRSG--GSATKDFLNSLYHXXXXXXXXXSIENLDSLLHLSQTPPLQFQXXXXXXXXXX 1305
            ++++S         + + L +         +  N+      ++ PP Q Q          
Sbjct: 286  ERRKSNVKKEIAMVWNSVLSNQRKRKRKQKATRNIYDTAATTEPPPEQSQ-------PPP 338

Query: 1306 XXXXXXXXXVFHNLF-------------------PXXXXXXXXXDLDISASTFSQPKPPT 1428
                     VFHNLF                   P             S ST   P PPT
Sbjct: 339  PPPPPPPSSVFHNLFKKGSKTKKVHSVPTAPPPPPPLPEVSVRTHQTRSRSTLPPPAPPT 398

Query: 1429 ---PTPAIHSKPSTSEIIKVSTHKSPKPVKIRSFDSVXXXXXXXXXXXXXXXXXXXXXXX 1599
               P P+ HS+            + P P K  +   V                       
Sbjct: 399  PPRPPPSAHSR-----------RRPPLPTKPSTSYEVDNVNSGCQSPLIPIPPPPPPF-- 445

Query: 1600 XFTKSLSWKFVVQGDYXXXXXXXXXXXXXPDIDDAESD-------------GTPTATDGE 1740
               K  + KF V+GD+             P+ ++  +D              T   TDG 
Sbjct: 446  ---KMPAMKFFVKGDFVKIRSAQSSRSASPEPEEVVADHALPAGKEESTTTSTVNVTDGG 502

Query: 1741 DAVAXXXXXXXXXXXDVNTKAENFITKFRAGLKLEKINSMNKRQ 1872
            D              DVNTKA+NFI + R   +LEKINS+ +++
Sbjct: 503  DGAGRASPSVFCPSPDVNTKADNFIARLRDEWRLEKINSLREKK 546


>ref|XP_002864497.1| hypothetical protein ARALYDRAFT_495801 [Arabidopsis lyrata subsp.
            lyrata] gi|297310332|gb|EFH40756.1| hypothetical protein
            ARALYDRAFT_495801 [Arabidopsis lyrata subsp. lyrata]
          Length = 566

 Score = 88.2 bits (217), Expect = 1e-14
 Identities = 121/566 (21%), Positives = 204/566 (36%), Gaps = 35/566 (6%)
 Frame = +1

Query: 280  VTAASFLVFV---VPSTLSFSAQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSRATNQDRD 450
            ++AA FL+FV   +P  LS ++QI +P++VK+ WDS               +R  +    
Sbjct: 36   ISAAIFLLFVNFVIPPFLSVTSQILQPSSVKRGWDSINVVLVVFAILCGVLARRNDDGLS 95

Query: 451  SDEYQ------------TSPVSRDDVQK--SSPSTPQQKWYNYSTDEDQKSNLSAFQQWY 588
            S+               +  ++  ++ K  SS S   ++W++   D ++     +     
Sbjct: 96   SESLHGGEEEEVGGAVTSGEMTLGEISKISSSSSAVSEQWFDDVYDAERLKIYESVS--- 152

Query: 589  GYSDQMADNQVNVSRGGLRRTSSSYPDLLEASSQFTSGDDPWRFYDDMTVDTYRVSETGQ 768
              S   +          LRR+ SSYPDL +   + T GD  +RFYDD  +      E   
Sbjct: 153  --SRSFSHGLPVTGTVPLRRSCSSYPDLRQGVFRET-GDRRFRFYDDFEIHNRSYEE--- 206

Query: 769  LLRRRSWKDKFEVSRVVESKNIHVDTFVNRPEKXXXXXXXXXXXXXXXXXXXXXXXXXXX 948
              + RS   K E+    E K I +DTFV +P                             
Sbjct: 207  -FQNRS---KIEIEEESEPKEIQIDTFVVKPSSPPQQPPAPPTPPPPPPPPPVEV----- 257

Query: 949  XMPKEKPKRVHRSLAHKGERSKKKDNELENREPISSPVTXXXXXXXXXXXXRFVDQKIGK 1128
                +KP+R HRS+ ++  +   K N+++ +     P                  +K G 
Sbjct: 258  ---SQKPRRTHRSVKNRDIQENVKRNDIKFKRAFQPPNPPPPPPPPPPLITATPPRKQGT 314

Query: 1129 TDKKRSGGS-ATKDFLNSLYHXXXXXXXXXSIENLD----SLLHLSQTPPLQFQ-----X 1278
              +++S  +   K    SLY+           +  +    S + +  T P Q+Q      
Sbjct: 315  LQRRKSNAAKEIKMVFASLYNQGKRKKKIQKSKRKERIESSPVVVDVTEPPQYQSLIPPP 374

Query: 1279 XXXXXXXXXXXXXXXXXXVFHNLFPXXXXXXXXXDLDISASTFSQPKPPTPTPAIHSK-- 1452
                              VF+ LF           +  +    S P PP P P  H++  
Sbjct: 375  SPPPPPPPPPPPPRTSQSVFYGLF--------KKGVKSNKKIHSVPAPPPPPPPRHTQFD 426

Query: 1453 PSTSEIIKVSTHKSPKPVKIRSFDSVXXXXXXXXXXXXXXXXXXXXXXXXFTKSLSWKFV 1632
            P T    +V++ + P+P K  +F+                            +    KFV
Sbjct: 427  PQT-PTRRVNSGRPPRPTKPTNFNEENNGQGSPLIQITPPPPPPPPF-----RVPPLKFV 480

Query: 1633 VQGDYXXXXXXXXXXXXXPDIDDAESDGTPTATDGED------AVAXXXXXXXXXXXDVN 1794
            V GD+             P+ +  +       T  +D      AV            DV+
Sbjct: 481  VSGDFAKIRSNQSSRCSSPEREVIDIGWGLELTQSDDGVKTKAAVGGGGMPGFCPSPDVD 540

Query: 1795 TKAENFITKFRAGLKLEKINSMNKRQ 1872
            TKA+NFI + R   +L+KINS+N+++
Sbjct: 541  TKADNFIARLRDEWRLDKINSVNRKR 566


>ref|XP_006401247.1| hypothetical protein EUTSA_v10013114mg [Eutrema salsugineum]
            gi|557102337|gb|ESQ42700.1| hypothetical protein
            EUTSA_v10013114mg [Eutrema salsugineum]
          Length = 570

 Score = 87.8 bits (216), Expect = 2e-14
 Identities = 128/546 (23%), Positives = 195/546 (35%), Gaps = 24/546 (4%)
 Frame = +1

Query: 295  FLVFVVPSTLSFSAQIFRPNTVKKSWDSXXXXXXXXXXXXXXXSR-------ATNQDRDS 453
            F+ FV+P  LS ++QIF+P +VKK WDS               +R       +++Q    
Sbjct: 47   FMTFVLPPFLSITSQIFQPASVKKGWDSINVVLVVFAILCGVLARQNDDGLSSSSQSSHV 106

Query: 454  DEYQTSPVSRDDVQKSSPSTPQQKWYNYSTDEDQKSNLSAFQQWYGYSDQMADNQVNVSR 633
            +E +    + +D + SS     Q+W++   D D+   L  ++     S   +        
Sbjct: 107  EEEEDDVTNGEDSKISSSPVVSQQWFDDVYDADR---LKIYESLSNRS--FSPGLPVTGT 161

Query: 634  GGLRRTSSSYPDLLEASSQFTSGDDPWRFYDDMTVDTYRVSETGQLLRRRSWKDKFEVSR 813
              LRR+SSSYPDL   + + T+ D  +RFYDD  +D YR  ++           K E+  
Sbjct: 162  LPLRRSSSSYPDLRNGAFRETA-DRRFRFYDDFEIDKYRSQDS---------PSKIEIEE 211

Query: 814  VVESKNIHVDTFVNRPEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXMPKEKPKRVHRSLA 993
              E K I VD FV RP                                 +KPKR HRS+ 
Sbjct: 212  -SEPKEIPVDKFVVRPSS---PPPHPPQQPPAPPPPPPPLPESSPVQVSQKPKRTHRSVR 267

Query: 994  HKG--ERSKKKD-NELENREPISSPVTXXXXXXXXXXXXRFVDQKIGKTDKKRSGGS-AT 1161
            ++   E+SK+ D N    R   +                    +K G   +++S  +   
Sbjct: 268  NRDIQEKSKRSDANSDSTRFKRAFQPPPPPPPPPPPFITATPPRKQGTLQRRKSNAAKEI 327

Query: 1162 KDFLNSLYHXXXXXXXXXSIENLDSL----LHLSQTPPLQFQ-----XXXXXXXXXXXXX 1314
            K    SLY+           +  +      + ++ T P Q+Q                  
Sbjct: 328  KMVFASLYNQGKKKKKLQKPKRKERSESPEVVVAATEPPQYQSSFPPPSPPPPPPPPPPP 387

Query: 1315 XXXXXXVFHNLFPXXXXXXXXXDLDISASTFSQPKPPTPTP--AIHSKPSTSEIIKVSTH 1488
                  VF+ LF              S    S P PP P P   I   P T    +  + 
Sbjct: 388  LRSSQSVFYGLFKKGVK---------SKKIHSVPAPPPPPPPRKIQLDPQTPP-RRSKSG 437

Query: 1489 KSPKPVKIRSFDSVXXXXXXXXXXXXXXXXXXXXXXXXFTKSLSWKFVVQGDYXXXXXXX 1668
            + P+P+K  +F+                              L  KFVV GD+       
Sbjct: 438  RPPRPMKPTNFNEDSYVNNGHASPLIQTTPPPPPPPPFRVPPL--KFVVSGDFAKIRSNQ 495

Query: 1669 XXXXXXP--DIDDAESDGTPTATDGEDAVAXXXXXXXXXXXDVNTKAENFITKFRAGLKL 1842
                  P  ++ D       T +DG                DVNTKA+NFI + R   +L
Sbjct: 496  SSRCSSPEREVIDLGWGLELTQSDGGAETLTAVGSGFCPSPDVNTKADNFIARLRDEWRL 555

Query: 1843 EKINSM 1860
            +KINS+
Sbjct: 556  DKINSV 561