BLASTX nr result

ID: Catharanthus23_contig00010503 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00010503
         (3042 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002281644.1| PREDICTED: uncharacterized protein LOC100252...   654   0.0  
gb|EMJ26321.1| hypothetical protein PRUPE_ppa002630mg [Prunus pe...   629   e-177
emb|CAN65462.1| hypothetical protein VITISV_002198 [Vitis vinifera]   622   e-175
gb|EXC05040.1| hypothetical protein L484_019288 [Morus notabilis]     622   e-175
emb|CBI26785.3| unnamed protein product [Vitis vinifera]              615   e-173
ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309...   590   e-165
gb|EOY01300.1| Hydroxyproline-rich glycoprotein family protein, ...   588   e-165
ref|XP_006355042.1| PREDICTED: uncharacterized protein LOC102600...   586   e-164
ref|XP_006448091.1| hypothetical protein CICLE_v10014588mg [Citr...   585   e-164
gb|EOY01299.1| Hydroxyproline-rich glycoprotein family protein, ...   583   e-163
ref|XP_006469304.1| PREDICTED: uncharacterized protein LOC102618...   580   e-162
ref|XP_004236917.1| PREDICTED: uncharacterized protein LOC101261...   579   e-162
ref|XP_006585073.1| PREDICTED: uncharacterized protein LOC100794...   577   e-162
ref|XP_002527549.1| conserved hypothetical protein [Ricinus comm...   577   e-162
ref|XP_006379789.1| hypothetical protein POPTR_0008s13830g [Popu...   573   e-160
gb|ABK95394.1| unknown [Populus trichocarpa]                          572   e-160
ref|XP_002311547.2| hydroxyproline-rich glycoprotein [Populus tr...   566   e-158
ref|XP_003524142.1| PREDICTED: uncharacterized protein LOC100809...   565   e-158
ref|XP_004142291.1| PREDICTED: uncharacterized protein LOC101210...   560   e-156
gb|ESW30779.1| hypothetical protein PHAVU_002G181800g [Phaseolus...   557   e-156

>ref|XP_002281644.1| PREDICTED: uncharacterized protein LOC100252594 [Vitis vinifera]
          Length = 698

 Score =  654 bits (1687), Expect = 0.0
 Identities = 381/727 (52%), Positives = 459/727 (63%), Gaps = 38/727 (5%)
 Frame = -1

Query: 2511 MAMPSGNAVVPEKMQGRGGGGSELMQPQGRGXXXXXXXXXXXXXXXXQMDERDGFISWLR 2332
            MAMPSGN V+ +KMQ  GGGG      +G G                  DERDGFISWLR
Sbjct: 1    MAMPSGNVVISDKMQFPGGGG------RGGGGGAAEIHHHRQWFP----DERDGFISWLR 50

Query: 2331 GEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVNDVLYXXX 2152
            GEFAAANA+ID+LC+HLR++GEPGEYD VIGCIQQRR NW+ VLHMQ YFSV +V+Y   
Sbjct: 51   GEFAAANAIIDSLCNHLRLIGEPGEYDAVIGCIQQRRYNWSSVLHMQQYFSVAEVIYALQ 110

Query: 2151 XXXXXXXXXXXXAFEWGGKMGKEYXXXXXXXXXXGDFFKEGKEG-----GHHMN-SKAVP 1990
                          +  GK  K Y             +++G+ G      H+ N      
Sbjct: 111  QVGWRRQQRHLDPVKGAGKEYKRYGVA----------YRQGQRGETAKDSHNSNFENHSH 160

Query: 1989 NVNGNENLDAG--------DVKGG-KGEA--KVE-----SGEERK---DIVEESGGDG-- 1873
            + N +  L+ G        DVKGG KG+   K+E     + EE+K   D V +   +   
Sbjct: 161  DANSSGTLEKGERVSEIYDDVKGGDKGDVVGKLEDKDLAAAEEKKAGTDAVAKPNANSCS 220

Query: 1872 --SVESQGSREAVSTIKPEHSSENTDDGHLYDS-KENDCHSERILHEKQSPIVTPKTFVG 1702
              S  S+GSR  +S    E  + + DDG   +   EN+ H  +  +EK +P  +PKTFVG
Sbjct: 221  KSSENSEGSRCGIS----ETEANDMDDGGSCNMIMENNAHPVQNQNEKPNPTTSPKTFVG 276

Query: 1701 TEIYDGKSFNVVDGMKLYEELFDNSEVSKLITLVNDLRAAGRRGQLQGQTFIVSKRPMKG 1522
            TEI+DGK+ NVVDG+KLYEELFD+SEVSK ++LVNDLRAAG+RGQLQGQTF+VSKRPMKG
Sbjct: 277  TEIFDGKAVNVVDGLKLYEELFDDSEVSKFVSLVNDLRAAGKRGQLQGQTFVVSKRPMKG 336

Query: 1521 HGRETIQFGLPIADAPHEDEAAAGTSKDRKIEPIPGLFQDVIERLMANQVVNVKPDSCII 1342
            HGRE IQ G+PIADAP EDE+  GTSKDR+ E IP L QDVI  L+ +QV+ VKPD+CII
Sbjct: 337  HGREMIQLGVPIADAPLEDESVVGTSKDRRTESIPSLLQDVIGHLVGSQVLTVKPDACII 396

Query: 1341 DIFNEGDHSQPHMWPHSFGRPVCLLFLTECEMTFGKIIGVDHPGDYRGAFKHSLTPGSLL 1162
            D +NEGDHSQPH+WP  FGRPVC+LFLTEC+MTFG++IG DHPGDYRG+ K SL PGSLL
Sbjct: 397  DFYNEGDHSQPHIWPTWFGRPVCILFLTECDMTFGRVIGADHPGDYRGSLKLSLVPGSLL 456

Query: 1161 VLQGRSTDFAKHAIPSIRKQRILVTLTKSQPKRMGTADAQR-FPSAVGAAPSHWVPPPSR 985
            V+QG+S DFAKHAIPS+RKQRILVT TKSQPK+   +D QR  P A  A  SHWVPPPSR
Sbjct: 457  VMQGKSADFAKHAIPSLRKQRILVTFTKSQPKKTMASDGQRLLPPA--AQSSHWVPPPSR 514

Query: 984  SPNHMRHPVGPKHYGHVP-TGVL--SAPNTRXXXXXXXXXXXIFXXXXXXXXXXXXXXXX 814
            SPNHMRHP+GPKHYG VP TGVL   AP  R           +F                
Sbjct: 515  SPNHMRHPMGPKHYGAVPTTGVLPAPAPPMRPQLPPPNGMQPLF---VTTAVAPAMPFPA 571

Query: 813  XXXXXXASAGWPAATPRHPSPRLPVPGTGVFLXXXXXXXXXXXXSIATPAIESSITDTSA 634
                   S GWPAA PRHP PRLPVPGTGVFL             I+T A  +S+ +T+A
Sbjct: 572  PVPLPTGSPGWPAAPPRHPPPRLPVPGTGVFLPPPGSGNSSSPQHISTEATSTSV-ETAA 630

Query: 633  YSEKDI---KGNINGNTDSPRGKVDENLQYQECNGSVDGNGHTE-VIPKEEQQHQNSESK 466
             +EK+    K + N NT SP+GK+D  +  QECNGS+D  G  E  + KEEQQH N E K
Sbjct: 631  PTEKENGSGKSSSNSNTVSPKGKLDGKVHRQECNGSMDETGVDERAVTKEEQQH-NDELK 689

Query: 465  GTEKSAG 445
               K AG
Sbjct: 690  VASKPAG 696


>gb|EMJ26321.1| hypothetical protein PRUPE_ppa002630mg [Prunus persica]
          Length = 650

 Score =  629 bits (1623), Expect = e-177
 Identities = 352/694 (50%), Positives = 425/694 (61%), Gaps = 4/694 (0%)
 Frame = -1

Query: 2511 MAMPSGNAVVPEKMQGRGGGGSELMQPQGRGXXXXXXXXXXXXXXXXQMDERDGFISWLR 2332
            M MPSGN V+ +KMQ   GGG   +   G G                  DERDGFISWLR
Sbjct: 1    MTMPSGNVVLSDKMQFPSGGGGGAV---GGGEIAQHHRQWFP-------DERDGFISWLR 50

Query: 2331 GEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVNDVLYXXX 2152
            GEFAAANA+ID+LCHHLR VGEPGEYD VIGCIQQRR NWNPVLHMQ YFSV +V+Y   
Sbjct: 51   GEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWNPVLHMQQYFSVAEVIYALQ 110

Query: 2151 XXXXXXXXXXXXAFEWGGKMGKEYXXXXXXXXXXGDFFKEGKEGGHHMNSKAVPNVNGNE 1972
                          + G K  K             + FKEG       +S        N+
Sbjct: 111  HVAWRRQQRYYDPVKAGAKEFKRSGVGFNKGQQRAEAFKEGHNSTLESHS--------ND 162

Query: 1971 NLDAGDVKGGKGEAKVESGEERKDIVEESGGDGSVESQGSREAVSTIKPEHSSENTDDGH 1792
               +G V   K E   E GEE    VE  G  G +  +G   A                 
Sbjct: 163  GNSSGVVAPEKFERGSEVGEE----VEPGGEVGKLNDKGLAPAG---------------- 202

Query: 1791 LYDSKENDCHSERILHEKQSPIVTPKTFVGTEIYDGKSFNVVDGMKLYEELFDNSEVSKL 1612
              + K N+ HS +I ++KQ+  + PKTF+G EI DGK+ NVVDG+KLYE+   ++EVSKL
Sbjct: 203  --EKKVNESHSIQIQNQKQNLSIVPKTFIGNEISDGKTVNVVDGLKLYEDFLGDTEVSKL 260

Query: 1611 ITLVNDLRAAGRRGQLQGQTFIVSKRPMKGHGRETIQFGLPIADAPHEDEAAAGTSKDRK 1432
            ++LVNDLRAAG+R QLQGQT++VSKRPMKGHGRE IQ G+PIADAP EDE +AGTSKDRK
Sbjct: 261  VSLVNDLRAAGKRRQLQGQTYVVSKRPMKGHGREMIQLGIPIADAPPEDEISAGTSKDRK 320

Query: 1431 IEPIPGLFQDVIERLMANQVVNVKPDSCIIDIFNEGDHSQPHMWPHSFGRPVCLLFLTEC 1252
            IEPIP L QDVI+RL+   V+ VKPDSCIID++NEGDHSQPH WP  FGRPVC L+LTEC
Sbjct: 321  IEPIPSLLQDVIDRLVGMHVMTVKPDSCIIDVYNEGDHSQPHTWPSWFGRPVCALYLTEC 380

Query: 1251 EMTFGKIIGVDHPGDYRGAFKHSLTPGSLLVLQGRSTDFAKHAIPSIRKQRILVTLTKSQ 1072
            +MTFG+++ +DHPGDYRG+ + SLTPGS+L++QG+S DFAKHAIPSIRKQRILVTLTKSQ
Sbjct: 381  DMTFGRLLLMDHPGDYRGSLRLSLTPGSILLMQGKSADFAKHAIPSIRKQRILVTLTKSQ 440

Query: 1071 PKRMGTADAQRFPSAVGAAPSHWVPPPSRSPNHMRHPVGPKHYGHVP-TGVLSAPNTRXX 895
            PK+  T+D QRFP+   A  S+W PPPSRSPNH+RHP GPKHY  VP TGVL AP  R  
Sbjct: 441  PKKSTTSDGQRFPAPAPAQSSYWGPPPSRSPNHIRHPTGPKHYAAVPTTGVLPAPPIRSQ 500

Query: 894  XXXXXXXXXIFXXXXXXXXXXXXXXXXXXXXXXASAGWPAATPRHPSPRLPVPGTGVFLX 715
                     +F                       SAGWPAA PRHP PR+P+PGTGVFL 
Sbjct: 501  LPPQNGIQPLF---VPAPVGPAIPFAAAVPIPPGSAGWPAA-PRHPPPRIPLPGTGVFLP 556

Query: 714  XXXXXXXXXXXSIATPAIESSIT-DTSAYSEKDI-KGNINGNTD-SPRGKVDENLQYQEC 544
                        +   A E S T +T +  +KD   G  N +T  SP+GK D   Q Q+C
Sbjct: 557  PPGSGNSSAPQQLPGTATEMSPTVETPSPRDKDNGSGKSNHSTSASPKGKSDGKAQRQDC 616

Query: 543  NGSVDGNGHTEVIPKEEQQHQNSESKGTEKSAGV 442
            NGS +G G      KEE+Q    ++  + ++  V
Sbjct: 617  NGSAEGTGSGRTAVKEEEQQTYDKTAASNQAGAV 650


>emb|CAN65462.1| hypothetical protein VITISV_002198 [Vitis vinifera]
          Length = 1145

 Score =  622 bits (1605), Expect = e-175
 Identities = 371/741 (50%), Positives = 448/741 (60%), Gaps = 58/741 (7%)
 Frame = -1

Query: 2505 MPSGNAVVPEKMQGRGGGGSELMQPQGRGXXXXXXXXXXXXXXXXQMDERDGFISWLRGE 2326
            MPSGN V+ +KMQ  GGGG       G G                  DERDGFISWLRGE
Sbjct: 1    MPSGNVVISDKMQFPGGGGG------GGGGGAAEIHHHRQWFP----DERDGFISWLRGE 50

Query: 2325 FAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVNDVLYXXXXX 2146
            FAAANA+ID+LC+HLR++GEPGEYD VIGCIQQRR NW+ VLHMQ YFSV +V+Y     
Sbjct: 51   FAAANAIIDSLCNHLRLIGEPGEYDAVIGCIQQRRYNWSSVLHMQQYFSVAEVIYALQQV 110

Query: 2145 XXXXXXXXXXAFEWGGKMGKEYXXXXXXXXXXGDFFKEGKEG-----GHHMN-SKAVPNV 1984
                        +  GK  K Y             +++G+ G      H+ N      + 
Sbjct: 111  GWRRQQRHLDPVKGAGKEYKRYGVA----------YRQGQRGETAKDSHNSNFENHSHDA 160

Query: 1983 NGNENLDAG--------DVKGG-KGEA-------KVESGEERKDI--------------- 1897
            N +  L+ G        DVKGG KG+         + +  E+K++               
Sbjct: 161  NSSGTLEKGERVSEIYDDVKGGDKGDVVGKLEDKDLSAAAEKKEVMNFVIFGQLEQMLLQ 220

Query: 1896 ---------VEESGGDGSVESQGSREAVSTIKPEHSSENTDDGHLYDSKENDCHSERILH 1744
                     V+++  D  V  Q  R    T   E  S N          EN+ H  +  +
Sbjct: 221  NPMQIAVRRVQKTQKDPDVAFQRLRPM--TWMMEARSCNM-------IMENNAHPVQNQN 271

Query: 1743 EKQSPIVTPKTFVGTEIYDGKSFNVVDGMKLYEELFDNSEVSKLITLVNDLRAAGRRGQL 1564
            EK +P  +PKTFVGTEI+DGK+ NVVDG+KLYEELFD+SEVSK ++LVNDLRAAG+RGQL
Sbjct: 272  EKPNPTTSPKTFVGTEIFDGKAVNVVDGLKLYEELFDDSEVSKFVSLVNDLRAAGKRGQL 331

Query: 1563 QGQTFIVSKRPMKGHGRETIQFGLPIADAPHEDEAAAGTSK----DRKIEPIPGLFQDVI 1396
            QGQTF+VSKRPMKGHGRE IQ G+PIADAP EDE+  GTSK    +R+ E IP L QDVI
Sbjct: 332  QGQTFVVSKRPMKGHGREMIQLGVPIADAPLEDESVVGTSKGMFHNRRTESIPSLLQDVI 391

Query: 1395 ERLMANQVVNVKPDSCIIDIFNEGDHSQPHMWPHSFGRPVCLLFLTECEMTFGKIIGVDH 1216
             +L+ +QV+ VKPD+CIID +NEGDHSQPH+WP  FGRPVC+LFLTEC+MTFG++IG DH
Sbjct: 392  GQLVGSQVLTVKPDACIIDFYNEGDHSQPHIWPTWFGRPVCILFLTECDMTFGRVIGADH 451

Query: 1215 PGDYRGAFKHSLTPGSLLVLQGRSTDFAKHAIPSIRKQRILVTLTKSQPKRMGTADAQR- 1039
            PGDYRG+ K SL PGSLLV+QG+S DFAKHAIPS+RKQRILVT TKSQPK+   +D QR 
Sbjct: 452  PGDYRGSLKLSLVPGSLLVMQGKSADFAKHAIPSLRKQRILVTFTKSQPKKTTASDGQRL 511

Query: 1038 FPSAVGAAPSHWVPPPSRSPNHMRHPVGPKHYGHVP-TGVL--SAPNTRXXXXXXXXXXX 868
             P A  A  SHWVPPPSRSPNHMRHP+GPKHYG VP TGVL   AP  R           
Sbjct: 512  LPPA--AQSSHWVPPPSRSPNHMRHPMGPKHYGAVPTTGVLPAPAPPMRPQLPPPNGMQP 569

Query: 867  IFXXXXXXXXXXXXXXXXXXXXXXASAGWPAATPRHPSPRLPVPGTGVFLXXXXXXXXXX 688
            +F                       S GWPAA PRHP PRLPVPGTGVFL          
Sbjct: 570  LF---VTTAVAPAMPFPAPXPLPTGSPGWPAAPPRHPPPRLPVPGTGVFLPPPGSGNSSS 626

Query: 687  XXSIATPAIESSITDTSAYSEKDI---KGNINGNTDSPRGKVDENLQYQECNGSVDGNGH 517
               I+T A  +S+ +T+A +EK+    K + N NT SP+GK+D  +  QECNGS+D  G 
Sbjct: 627  PQHISTEATSTSV-ETAAPTEKENGSGKSSSNSNTVSPKGKLDGKVHRQECNGSMDETGV 685

Query: 516  TE-VIPKEEQQHQNSESKGTE 457
             E  + KEEQQH N E K  E
Sbjct: 686  DERAVTKEEQQH-NDELKELE 705


>gb|EXC05040.1| hypothetical protein L484_019288 [Morus notabilis]
          Length = 681

 Score =  622 bits (1603), Expect = e-175
 Identities = 360/707 (50%), Positives = 430/707 (60%), Gaps = 19/707 (2%)
 Frame = -1

Query: 2511 MAMPSGNAVVPEKMQGRGG--GGSELMQPQGRGXXXXXXXXXXXXXXXXQMDERDGFISW 2338
            MAMPSGN V  +KMQ   G  G  E+     R                   DERDGFISW
Sbjct: 1    MAMPSGNVVSSDKMQFPSGTAGAGEISHHNNRQWFP---------------DERDGFISW 45

Query: 2337 LRGEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVNDVLYX 2158
            LRGEFAAANAMID+LCHHLR VGEPGEYD VI CIQ RR NWNPVLHMQ YFSV +V++ 
Sbjct: 46   LRGEFAAANAMIDSLCHHLRAVGEPGEYDAVIACIQLRRCNWNPVLHMQQYFSVAEVMFA 105

Query: 2157 XXXXXXXXXXXXXXAFEWGGKMGKEYXXXXXXXXXXGDFFKEGKEGGHHMNSKAVPN-VN 1981
                            + G K  K             D FK+G+      NS A  + ++
Sbjct: 106  LQQVAWRRQQRFYDPVKMGNKEFKR-SGVGFKQWQRNDSFKDGR------NSAAESHCLD 158

Query: 1980 GNENL-DAGDVKGGKGEAKVESG-----------EERKDIVEESGGDGSVESQGSREAV- 1840
            GN +  +A   KGG  ++  E G           +E+ D   +S  DG+V+S G+ E V 
Sbjct: 159  GNSSFGNAASEKGGSDKSGDEVGNSDDRGSMPAAKEKNDSAAKSQEDGNVKSLGNFEGVV 218

Query: 1839 STIKPEHSSENTDDGHLYDSKENDCHSERILHEKQSPIVTPKTFVGTEIYDGKSFNVVDG 1660
            S  +PE  +   DDG    SKEND HS    +E  +    PKTF G E++DGK  NVV+G
Sbjct: 219  SGSEPEVHA--VDDGCTSSSKENDSHSTPKQNENSNLANVPKTFSGNEMFDGKPVNVVEG 276

Query: 1659 MKLYEELFDNSEVSKLITLVNDLRAAGRRGQLQGQTFIVSKRPMKGHGRETIQFGLPIAD 1480
            +KLYEE   ++EVSKL+ LVNDLR+AG RG  Q QT++VSKRPMKGHGRE IQ GLPIAD
Sbjct: 277  LKLYEEFCADTEVSKLVALVNDLRSAGERGHFQSQTYVVSKRPMKGHGREKIQLGLPIAD 336

Query: 1479 APHEDEAAAGTSKDRKIEPIPGLFQDVIERLMANQVVNVKPDSCIIDIFNEGDHSQPHMW 1300
            AP EDE +AGT KDR+ E IP L QDV ERL++ QV  VKPDSCIID +NEGDHSQPH+W
Sbjct: 337  APVEDEISAGTLKDRRTEAIPPLLQDVAERLVSMQVATVKPDSCIIDFYNEGDHSQPHLW 396

Query: 1299 PHSFGRPVCLLFLTECEMTFGKIIGVDHPGDYRGAFKHSLTPGSLLVLQGRSTDFAKHAI 1120
            P  FGRPVC+LFLTEC+MTFG++  +DHPGDYRGA K SL PGSLL +QG+S DFAKHAI
Sbjct: 397  PSWFGRPVCVLFLTECDMTFGRVFAIDHPGDYRGALKLSLKPGSLLAMQGKSADFAKHAI 456

Query: 1119 PSIRKQRILVTLTKSQPKRMGTADAQRFPSAVGAAPSHWVPPPSRSPNHMRHPVGPKHYG 940
            PS+R+QRILVT TKSQPK+   +D QR PS   A  SHW P PSRSPNH+RHP GPKHY 
Sbjct: 457  PSLRRQRILVTFTKSQPKKSMPSDGQRMPSPGVAPSSHWGPQPSRSPNHIRHP-GPKHYA 515

Query: 939  HVP-TGVLSAPNTRXXXXXXXXXXXIFXXXXXXXXXXXXXXXXXXXXXXASAGWPAATPR 763
             VP TGVL A   R           +F                      +S+GW AA PR
Sbjct: 516  PVPTTGVLQASPVRPQIPPPNGIQPLF---VTAPVAPAMPFPAPVPIPPSSSGWSAAPPR 572

Query: 762  HPSPRLPVPGTGVFLXXXXXXXXXXXXSIATPAIESSITDTSAYSEKDI-KGNIN-GNTD 589
            HP PRLPVPGTGVFL                    +   +T+A  EK+   G +N G T 
Sbjct: 573  HPPPRLPVPGTGVFLPPPGSGGNSSGSQQVLGNDTNHTVETAAPPEKENGSGKLNHGMTA 632

Query: 588  SPRGKVDENLQYQECNGSVDGNGHTEVIPKEEQQHQNSESKGTEKSA 448
            SP+GKVD   Q QECNGS+DG+G    + KEE+Q Q+S++  T KSA
Sbjct: 633  SPKGKVDSKTQKQECNGSLDGSGSVISVTKEERQ-QSSDNTATSKSA 678


>emb|CBI26785.3| unnamed protein product [Vitis vinifera]
          Length = 672

 Score =  615 bits (1587), Expect = e-173
 Identities = 366/730 (50%), Positives = 442/730 (60%), Gaps = 41/730 (5%)
 Frame = -1

Query: 2511 MAMPSGNAVVPEKMQGRGGGGSELMQPQGRGXXXXXXXXXXXXXXXXQMDERDGFISWLR 2332
            MAMPSGN V+ +KMQ  GGGG      +G G                  DERDGFISWLR
Sbjct: 1    MAMPSGNVVISDKMQFPGGGG------RGGGGGAAEIHHHRQWFP----DERDGFISWLR 50

Query: 2331 GEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVNDVLYXXX 2152
            GEFAAANA+ID+LC+HLR++GEPGEYD VIGCIQQRR NW+ VLHMQ YFSV +V+Y   
Sbjct: 51   GEFAAANAIIDSLCNHLRLIGEPGEYDAVIGCIQQRRYNWSSVLHMQQYFSVAEVIYALQ 110

Query: 2151 XXXXXXXXXXXXAFEWGGKMGKEYXXXXXXXXXXGDFFKEGKEG-----GHHMN-SKAVP 1990
                          +  GK  K Y             +++G+ G      H+ N      
Sbjct: 111  QVGWRRQQRHLDPVKGAGKEYKRYGVA----------YRQGQRGETAKDSHNSNFENHSH 160

Query: 1989 NVNGNENLDAG--------DVKGG-KGEA--KVE-----SGEERK---DIVEESGGDG-- 1873
            + N +  L+ G        DVKGG KG+   K+E     + EE+K   D V +   +   
Sbjct: 161  DANSSGTLEKGERVSEIYDDVKGGDKGDVVGKLEDKDLAAAEEKKAGTDAVAKPNANSCS 220

Query: 1872 --SVESQGSREAVSTIKPEHSSENTDDGHLYDSK-------ENDCHSERILHEKQSPIVT 1720
              S  S+GSR  +S    E  + + DDG   + K       EN+ H  +  +EK +P  +
Sbjct: 221  KSSENSEGSRCGIS----ETEANDMDDGGTLNPKGSCNMIMENNAHPVQNQNEKPNPTTS 276

Query: 1719 PKTFVGTEIYDGKSFNVVDGMKLYEELFDNSEVSKLITLVNDLRAAGRRGQLQ-GQTFIV 1543
            PKTFVGTEI+DGK+ NVVDG+KLYEELFD+SEVSK ++LVNDLRAAG+RGQLQ GQTF+V
Sbjct: 277  PKTFVGTEIFDGKAVNVVDGLKLYEELFDDSEVSKFVSLVNDLRAAGKRGQLQAGQTFVV 336

Query: 1542 SKRPMKGHGRETIQFGLPIADAPHEDEAAAGTSKDRKIEPIPGLFQDVIERLMANQVVNV 1363
            SKRPMKGHGRE IQ G+PIADAP EDE+  GTSKDR+ E IP L QDVI  L+ +QV+ V
Sbjct: 337  SKRPMKGHGREMIQLGVPIADAPLEDESVVGTSKDRRTESIPSLLQDVIGHLVGSQVLTV 396

Query: 1362 KPDSCIIDIFNEGDHSQPHMWPHSFGRPVCLLFLTECEMTFGKIIGVDHPGDYRGAFKHS 1183
            KPD+CIID +NEGDHSQPH+WP  FGRPVC+LFLTEC+MTFG++IG DHPGDYRG+ K S
Sbjct: 397  KPDACIIDFYNEGDHSQPHIWPTWFGRPVCILFLTECDMTFGRVIGADHPGDYRGSLKLS 456

Query: 1182 LTPGSLLVLQGRSTDFAKHAIPSIRKQRILVTLTKSQPKRMGTADAQR-FPSAVGAAPSH 1006
            L PGSLLV+QG+S DFAKHAIPS+RKQRILVT TKSQPK+   +D QR  P A  A  SH
Sbjct: 457  LVPGSLLVMQGKSADFAKHAIPSLRKQRILVTFTKSQPKKTMASDGQRLLPPA--AQSSH 514

Query: 1005 WVPPPSRSPNHMRHPVGPKHYGHVP-TGVL--SAPNTRXXXXXXXXXXXIFXXXXXXXXX 835
            WVPPPSRSPNHMRHP+GPKHYG VP TGVL   AP  R           +F         
Sbjct: 515  WVPPPSRSPNHMRHPMGPKHYGAVPTTGVLPAPAPPMRPQLPPPNGMQPLF---VTTAVA 571

Query: 834  XXXXXXXXXXXXXASAGWPAATPRHPSPRLPVPGTGVFLXXXXXXXXXXXXSIATPAIES 655
                          S GWPAA PRHP PRLPVPGTGVFL             I+T A  +
Sbjct: 572  PAMPFPAPVPLPTGSPGWPAAPPRHPPPRLPVPGTGVFLPPPGSGNSSSPQHISTEATST 631

Query: 654  SITDTSAYSEKDIKGNINGNTDSPRGKVDENLQYQECNGSVDGNGHTEVIPKEEQQHQNS 475
            S+ +T+A +EK+                             +G+G +  + KEEQQH N 
Sbjct: 632  SV-ETAAPTEKE-----------------------------NGSGKSSTVTKEEQQH-ND 660

Query: 474  ESKGTEKSAG 445
            E K   K AG
Sbjct: 661  ELKVASKPAG 670


>ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309147 [Fragaria vesca
            subsp. vesca]
          Length = 682

 Score =  590 bits (1520), Expect = e-165
 Identities = 340/714 (47%), Positives = 419/714 (58%), Gaps = 25/714 (3%)
 Frame = -1

Query: 2511 MAMPSGNAVVPEKMQ-----GRGGGGSELMQPQGRGXXXXXXXXXXXXXXXXQMDERDGF 2347
            M MPSGN V+ +KMQ     G    G E+ Q   +                   DERDGF
Sbjct: 1    MTMPSGNVVLSDKMQYPSVAGAAVSGGEIHQQPRQWFP----------------DERDGF 44

Query: 2346 ISWLRGEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVNDV 2167
            ISWLRGEFAAANA+ID+LCHHLR VGEP EYD VIGC+QQRR NW PVLHMQ YFSV +V
Sbjct: 45   ISWLRGEFAAANAIIDSLCHHLRAVGEPSEYDMVIGCVQQRRCNWTPVLHMQQYFSVAEV 104

Query: 2166 LYXXXXXXXXXXXXXXXAFEWGGKMGKEYXXXXXXXXXXGDFFKEGKEGGHHMNSKAVPN 1987
            +Y                 + G K  K               FK   E     ++ +V  
Sbjct: 105  IYALQQVAWRRQQRYYEPVKMGNKDYKRSNSGVG--------FKPRNEPVKEWHTASVEY 156

Query: 1986 VNGNENLDAGDVK--GGKGEAKVESGEERKDIVEESGGDGSV------------ESQGSR 1849
                 + D   ++  G +   +V+ G E   + ++    G+V             S+ S 
Sbjct: 157  ----RSYDGSGLEKVGSEMREEVKPGGEAGKVDDKGSAAGAVTKGVLTKPHEYISSRSSA 212

Query: 1848 EAVSTIKPEHSSENT--DDGHLYDSKENDCHSERILHEKQSPIVTPKTFVGTEIYDGKSF 1675
             +  TI     SE+   ++G     KEN+ +S +I +EKQ+  + PKTFVG E +DGK+ 
Sbjct: 213  NSQGTISGNSESEDAVVNEGCTSSIKENESNSIQIQNEKQNLSLIPKTFVGNETFDGKTV 272

Query: 1674 NVVDGMKLYEELFDNSEVSKLITLVNDLRAAGRRGQLQGQTFIVSKRPMKGHGRETIQFG 1495
            NVVDG+KLYEE   ++EVSKL +LVNDLR  GRRGQLQGQT+++SKRPMKGHGRE IQ G
Sbjct: 273  NVVDGLKLYEEFLGDTEVSKLFSLVNDLRTTGRRGQLQGQTYVLSKRPMKGHGREMIQLG 332

Query: 1494 LPIADAPHEDEAAAGTSKDRKIEPIPGLFQDVIERLMANQVVNVKPDSCIIDIFNEGDHS 1315
            +PIAD P EDE +AG SKDR++E IP L QDVI+RL+  QV+  KPDSCIID FNEGDHS
Sbjct: 333  IPIADGPQEDEISAGISKDRRMEAIPSLLQDVIDRLIGTQVLTDKPDSCIIDFFNEGDHS 392

Query: 1314 QPHMWPHSFGRPVCLLFLTECEMTFGKIIGVDHPGDYRGAFKHSLTPGSLLVLQGRSTDF 1135
             PHMWP  FGRPV +LFLTEC++TFGK++G+DHPGDYRGA + SLTPGSLL+LQG+S D+
Sbjct: 393  HPHMWPPWFGRPVSVLFLTECDLTFGKVLGMDHPGDYRGALRLSLTPGSLLLLQGKSADY 452

Query: 1134 AKHAIPSIRKQRILVTLTKSQPKRMGTADAQRFPSAVGAAPSHWVPPPSRSPNHMRHPVG 955
            AKHAIPSIRKQRILVT TKSQP++    D QR PS   +   +W PPP RSPNH+RHP G
Sbjct: 453  AKHAIPSIRKQRILVTFTKSQPRKSFPTDGQRLPSPGPSQSPYWSPPPGRSPNHIRHPAG 512

Query: 954  PKHYGHVP-TGVLSAPNTRXXXXXXXXXXXIFXXXXXXXXXXXXXXXXXXXXXXASAGWP 778
            PKHY  VP TGVL AP  R           +F                       S GW 
Sbjct: 513  PKHYAAVPTTGVLPAPPNRPQLPPANGIQPLF---VAAPVGPAMPFPAPVVIPPGSPGWV 569

Query: 777  AATPRHPSPRLPVPGTGVFL-XXXXXXXXXXXXSIATPAIESSITDTSAYSEKDIKGNIN 601
            AA PRHP PR+P+PGTGVFL                + A E + +  +A +EKD  G   
Sbjct: 570  AA-PRHPPPRMPLPGTGVFLPPPGSGSSSAPPQQFPSTATEMNPSVETASTEKD-NGTAK 627

Query: 600  GN--TDSPRGKVDENLQYQECNGSVDGNGHTEVIPKEEQQHQNSESKGTEKSAG 445
             +    SP+ K+D   Q Q+CNGSVDG G      K+EQQ QNS +      AG
Sbjct: 628  SSHAIASPKAKLDVKAQRQDCNGSVDGTGSGRGTVKQEQQ-QNSNNAAANNQAG 680


>gb|EOY01300.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2
            [Theobroma cacao] gi|508709405|gb|EOY01302.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 2 [Theobroma cacao]
          Length = 680

 Score =  588 bits (1515), Expect = e-165
 Identities = 343/704 (48%), Positives = 421/704 (59%), Gaps = 20/704 (2%)
 Frame = -1

Query: 2511 MAMPSGNAVVPEKMQ-------GRGGGGSELMQPQGRGXXXXXXXXXXXXXXXXQMDERD 2353
            MAMPSGN V+ +KMQ       G GGGG+      G G                  DERD
Sbjct: 1    MAMPSGNVVLSDKMQFPATAAAGAGGGGAVGAVGGGGGGGGEIHQHHHRQWLP---DERD 57

Query: 2352 GFISWLRGEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVN 2173
            GFI WLRGEFAA+NA+ID+LCHHLR VGE GEY+ VI CIQQRR NWNPVLHMQ YFSV 
Sbjct: 58   GFIYWLRGEFAASNAIIDSLCHHLREVGEVGEYEAVIACIQQRRCNWNPVLHMQQYFSVA 117

Query: 2172 DVLYXXXXXXXXXXXXXXXAFEWGGKMGKEYXXXXXXXXXXGDFFKEGKEGGHHMNSKAV 1993
            +V Y               + + GGK  K             +  KEG+  G        
Sbjct: 118  EVSYALQQVAWRRRQRHYESGKVGGKEFKR--SGMGFKGQRMEVAKEGQNSG-------- 167

Query: 1992 PNVNGNENLDAGDVKGGKGEAKVESGEERKDIVEESGGDGSVESQGSR----EAVSTIKP 1825
             + +GN  + A   +        E G E+++ V+  G  G VE + S     +  +  KP
Sbjct: 168  VDSDGNSTVTAVSERN-------ERGSEKREEVKSCGEVGKVEDKCSTFTEDKKDTGSKP 220

Query: 1824 -----EHSSENTDDGHLYDSKENDCHSERILHEKQSPIVTPKTFVGTEIYDGKSFNVVDG 1660
                 E  +E+ + G     KEND  S +  +EKQ+    PKTFVG E++DGK  NVVDG
Sbjct: 221  HAGDAESVTEDVNGGCTSSYKENDLCSIQNQNEKQNLAAGPKTFVGNEMFDGKMVNVVDG 280

Query: 1659 MKLYEELFDNSEVSKLITLVNDLRAAGRRGQLQGQTFIVSKRPMKGHGRETIQFGLPIAD 1480
            +KLYEELFD+ EV  L++LVNDLRAAG+RGQLQGQT++ +KRPMKGHGRE IQ GLPIAD
Sbjct: 281  LKLYEELFDDKEVLDLVSLVNDLRAAGKRGQLQGQTYVAAKRPMKGHGREMIQLGLPIAD 340

Query: 1479 APHEDEAAAGTSKDRKIEPIPGLFQDVIERLMANQVVNVKPDSCIIDIFNEGDHSQPHMW 1300
            AP +DE AAGTSKDR+IE IP L QD IERL+  QV+ VKPDSCIID++NEGDHSQP MW
Sbjct: 341  APLDDENAAGTSKDRRIEGIPPLLQDTIERLVNLQVMTVKPDSCIIDVYNEGDHSQPRMW 400

Query: 1299 PHSFGRPVCLLFLTECEMTFGKIIGV-DHPGDYRGAFKHSLTPGSLLVLQGRSTDFAKHA 1123
            P  FG+PVC++FLTEC++TFG+++ V DHPGDYRG+ K SL PGSLLV+QG+S DFAKHA
Sbjct: 401  PPWFGKPVCIMFLTECDITFGRVVIVADHPGDYRGSLKLSLAPGSLLVMQGKSADFAKHA 460

Query: 1122 IPSIRKQRILVTLTKSQPKRMGTADAQRFPSAVGAAPSHWVPPPSRSPNHMRHPVGPKHY 943
            +PS+RKQRILVT TK    +  T D QR  S   +  S W PPPSRSPN +RH  GPKHY
Sbjct: 461  LPSVRKQRILVTFTKYCQPKKSTTDNQRLSSPSVSQSSQWGPPPSRSPNRIRHSAGPKHY 520

Query: 942  GHVP-TGVLSAPNTRXXXXXXXXXXXIFXXXXXXXXXXXXXXXXXXXXXXASAGWPAATP 766
              +P TGVL AP  R           +F                       S GWPAA P
Sbjct: 521  AVIPTTGVLPAPPIRPQIPPSSGVQPLF---VPTAVAPAISFPAPVPIPPGSTGWPAA-P 576

Query: 765  RHPSPRLPVPGTGVFLXXXXXXXXXXXXSIATPAIESSITDTSAYSEKDIKGNI--NGNT 592
            RHP PRLPVPGTGVFL               T    + + +T++  EK+  G++  N +T
Sbjct: 577  RHPPPRLPVPGTGVFLPPPGSGNSSSQQLSTTATELNILVETTSPREKE-NGSVKPNHHT 635

Query: 591  DSPRGKVDENLQYQECNGSVDGNGHTEVIPKEEQQHQNSESKGT 460
             SPRG++D     Q+CNGSVDG G    + KEEQ   ++  K T
Sbjct: 636  TSPRGRLDGKSPKQDCNGSVDGAGSGRALMKEEQHCADNSVKQT 679


>ref|XP_006355042.1| PREDICTED: uncharacterized protein LOC102600383 [Solanum tuberosum]
          Length = 638

 Score =  586 bits (1510), Expect = e-164
 Identities = 343/680 (50%), Positives = 417/680 (61%), Gaps = 8/680 (1%)
 Frame = -1

Query: 2505 MPSGNAVV--PEKMQGRGGGGSELMQPQGRGXXXXXXXXXXXXXXXXQMDERDGFISWLR 2332
            M SGNA V  PEKM G G GG  +     R                 Q+DERDGFISWLR
Sbjct: 1    MQSGNAAVAVPEKMNGNGVGGEAVAVALPR-----QHQHQQQWFHPQQVDERDGFISWLR 55

Query: 2331 GEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVNDVLYXXX 2152
            GEFAA+NA+IDALCHHLR+VGEPGEYDGVIGC+QQRR+NWN VLHMQ Y SV +V+Y   
Sbjct: 56   GEFAASNAIIDALCHHLRLVGEPGEYDGVIGCVQQRRANWNSVLHMQQYHSVAEVIYSLH 115

Query: 2151 XXXXXXXXXXXXAFEWG-GKMGKEYXXXXXXXXXXGDFFKEGKEG-GHHMNSKAVPNVNG 1978
                         F+ G  K+ K             +  K+GKE  G + +  A    NG
Sbjct: 116  QVEWMKQQKG---FDGGVKKVEKRNGSRGGGGGWKSEGLKDGKESQGQNFSLDAHSKTNG 172

Query: 1977 NENLDAGDVKGGKGEAKVESGEERKDIVEESGGDGSVESQGSREAVSTIKPEHSSENTDD 1798
             E +D  +VK G          E+K++      + SV+S    EA  +      +++  D
Sbjct: 173  VEKIDVVEVKQG----------EKKELAANPEANSSVKSSVCTEAGDSQGEVDKTDDKRD 222

Query: 1797 GHLYDSK--ENDCHSERILHEKQSPIVTPKTFVGTEIYDGKSFNVVDGMKLYEELFDNSE 1624
             +   S   E++ HS ++  EKQ+  V PKTFV TEIYDGK  NVVDGMKLYEEL  +SE
Sbjct: 223  SNSEGSSNVESESHSIQVPTEKQN--VVPKTFVATEIYDGKPVNVVDGMKLYEELLSSSE 280

Query: 1623 VSKLITLVNDLRAAGRRGQLQGQTFIVSKRPMKGHGRETIQFGLPIADAPHEDEAAAGTS 1444
            VSKL+TLVNDLRAAGRRGQL  Q FIVSKRPMKGHGRE +Q GLPI DAP E+EAA  T 
Sbjct: 281  VSKLLTLVNDLRAAGRRGQLPAQAFIVSKRPMKGHGREMVQLGLPIVDAPPEEEAAISTY 340

Query: 1443 KDRKIEPIPGLFQDVIERLMANQVVNVKPDSCIIDIFNEGDHSQPHMWPHSFGRPVCLLF 1264
            KDRK E IPGLFQDVI++L A Q ++VKPD+C+IDIFNEGDHSQPH+WP+ +GRP+ +LF
Sbjct: 341  KDRKTEAIPGLFQDVIDQLSAMQALSVKPDACVIDIFNEGDHSQPHLWPYWYGRPISMLF 400

Query: 1263 LTECEMTFGKIIGVDHPGDYRGAFKHSLTPGSLLVLQGRSTDFAKHAIPSIRKQRILVTL 1084
            LT+CEMTFGK+IGVDHPGDYRG+ K SL PGS+LV+QGRST+FAK+AIPS RKQRILVT 
Sbjct: 401  LTDCEMTFGKVIGVDHPGDYRGSLKLSLAPGSVLVMQGRSTEFAKYAIPSTRKQRILVTF 460

Query: 1083 TKSQPKRMGTADAQRFPSAVGAAPSHWVPPPSRSPNHMRHPVGPKHYGHV-PTGVLSAPN 907
            TK Q +R+ +AD+QRFPS+ G   S WV PPSRSPNH+R P GPKHYG +  TGVL  P 
Sbjct: 461  TKLQLRRIKSADSQRFPSSAGGPVSQWV-PPSRSPNHIRRPFGPKHYGSMSTTGVLPIPG 519

Query: 906  TRXXXXXXXXXXXIFXXXXXXXXXXXXXXXXXXXXXXASAGWPAATPRHPSPRLPVPGTG 727
             R                                   ASAGW     RHP PRLP+PGTG
Sbjct: 520  VRPQFAPANMQ----PIFVPATVAPAMPFPAPVALPPASAGWAVPPLRHPPPRLPLPGTG 575

Query: 726  VFLXXXXXXXXXXXXSIATPAIESSITDTSAYSEKDIKGNINGNTDSPR-GKVDENLQYQ 550
            VFL                P   +S TD +  +EK   G ++ +T S +       +Q Q
Sbjct: 576  VFL---------------PPGSGTSSTD-NIPAEK--AGPLSDSTVSQKVNSGSSEVQTQ 617

Query: 549  ECNGSVDGNGHTEVIPKEEQ 490
            ECNG  D +   + +  EE+
Sbjct: 618  ECNGKADVSDAEKPVAYEER 637


>ref|XP_006448091.1| hypothetical protein CICLE_v10014588mg [Citrus clementina]
            gi|557550702|gb|ESR61331.1| hypothetical protein
            CICLE_v10014588mg [Citrus clementina]
          Length = 635

 Score =  585 bits (1508), Expect = e-164
 Identities = 318/635 (50%), Positives = 394/635 (62%), Gaps = 7/635 (1%)
 Frame = -1

Query: 2355 DGFISWLRGEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSV 2176
            D F+ WLRGEFAAANA+ID LCHHLRV+GEPGEYD  I CIQQRR NWN VLH+Q YFSV
Sbjct: 14   DPFVMWLRGEFAAANAIIDTLCHHLRVIGEPGEYDFAINCIQQRRCNWNSVLHLQQYFSV 73

Query: 2175 NDVLYXXXXXXXXXXXXXXXAFEWGGKMGKEYXXXXXXXXXXGDFFKEGKEGGHHMNSKA 1996
            ++V+                 F+        +             F   K+  H+ N+  
Sbjct: 74   SEVMLALQQVAWRKQQRS---FDHHHHHHHHHQQQHHLNRTKRSAFV--KKDFHNNNNN- 127

Query: 1995 VPNVNGNENLDAGDVKGGKGEAKVESGEERKDIVEESGGDGSVESQGSREAVSTIKPEHS 1816
              N N N   D+             + +++KD+V ++  DGS +S G+ E       E  
Sbjct: 128  --NNNNNHAFDSNS----------SAFDDKKDVVMKAHDDGSAKSLGNSEITQVGDAEPK 175

Query: 1815 SENTDDGHLYDSKENDCHSERILHEKQSPIVTPKTFVGTEIYDGKSFNVVDGMKLYEELF 1636
            +E  DDG     KEND  S +  +EKQ+  +  K+FVGTE+ DGK  NVVDG+KLYEE+ 
Sbjct: 176  AEALDDGCTPSLKENDSQSVQSQNEKQNQSMAAKSFVGTEMVDGKMVNVVDGLKLYEEVS 235

Query: 1635 DNSEVSKLITLVNDLRAAGRRGQLQGQTFIVSKRPMKGHGRETIQFGLPIADAPHEDEAA 1456
             NSEVSKL++LVNDLR AG+RGQ+QG  ++VSKRP++GHGRE IQ GLPI D P EDE A
Sbjct: 236  GNSEVSKLVSLVNDLRTAGKRGQIQGPAYVVSKRPIRGHGREVIQLGLPIVDGPPEDEIA 295

Query: 1455 AGTSKDRKIEPIPGLFQDVIERLMANQVVNVKPDSCIIDIFNEGDHSQPHMWPHSFGRPV 1276
            AGTS+DR+IEPIP L QDVI+RL+  Q++ VKPDSCI+D+FNEGDHSQPH+ P  FGRPV
Sbjct: 296  AGTSRDRRIEPIPSLLQDVIDRLVGLQIMTVKPDSCIVDVFNEGDHSQPHISPSWFGRPV 355

Query: 1275 CLLFLTECEMTFGKIIGVDHPGDYRGAFKHSLTPGSLLVLQGRSTDFAKHAIPSIRKQRI 1096
            C+LFLTEC+MTFG++IG+DHPGDYRG  + S+ PGSLLV+QG+S D AKHAI SIRKQRI
Sbjct: 356  CILFLTECDMTFGRMIGIDHPGDYRGTLRLSVAPGSLLVMQGKSADIAKHAISSIRKQRI 415

Query: 1095 LVTLTKSQPKRMGTADAQRFPSAVGAAPS-HWVPPPSRSPNHMRHPVGPKHYGHVP-TGV 922
            LVT TKSQPK++   D QR  S  G APS HW PPP R PNH+RHP GPKH+  +P TGV
Sbjct: 416  LVTFTKSQPKKLTPTDGQRLASP-GIAPSPHWGPPPGRPPNHIRHPTGPKHFAPIPTTGV 474

Query: 921  LSAPNTRXXXXXXXXXXXIFXXXXXXXXXXXXXXXXXXXXXXASAGWPAATPRH----PS 754
            L AP  R           IF                       S GW AA PRH    P 
Sbjct: 475  LPAPAIRAQIPPTNGVPPIF---VSPPVTPAMPFPAPVPIPPGSTGWTAAPPRHTPPPPP 531

Query: 753  PRLPVPGTGVFLXXXXXXXXXXXXSIATPAIESSITDTSAYSEKDI-KGNINGNTDSPRG 577
            PRLPVPGTGVFL             +++ A E  I +  + +EK+   G  N  T++P+ 
Sbjct: 532  PRLPVPGTGVFLPPPGSGGSSSPRQVSSAATEHLIPEMGSQAEKENGSGKSNHETNAPKE 591

Query: 576  KVDENLQYQECNGSVDGNGHTEVIPKEEQQHQNSE 472
            K+    Q Q CNGSVDG G  + + KEE QHQ+ E
Sbjct: 592  KLVGETQGQGCNGSVDGTGSVKAVMKEENQHQSVE 626


>gb|EOY01299.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1
            [Theobroma cacao] gi|508709404|gb|EOY01301.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 1 [Theobroma cacao]
          Length = 681

 Score =  583 bits (1503), Expect = e-163
 Identities = 343/705 (48%), Positives = 421/705 (59%), Gaps = 21/705 (2%)
 Frame = -1

Query: 2511 MAMPSGNAVVPEKMQ-------GRGGGGSELMQPQGRGXXXXXXXXXXXXXXXXQMDERD 2353
            MAMPSGN V+ +KMQ       G GGGG+      G G                  DERD
Sbjct: 1    MAMPSGNVVLSDKMQFPATAAAGAGGGGAVGAVGGGGGGGGEIHQHHHRQWLP---DERD 57

Query: 2352 GFISWLRGEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVN 2173
            GFI WLRGEFAA+NA+ID+LCHHLR VGE GEY+ VI CIQQRR NWNPVLHMQ YFSV 
Sbjct: 58   GFIYWLRGEFAASNAIIDSLCHHLREVGEVGEYEAVIACIQQRRCNWNPVLHMQQYFSVA 117

Query: 2172 DVLYXXXXXXXXXXXXXXXAFEWGGKMGKEYXXXXXXXXXXGDFFKEGKEGGHHMNSKAV 1993
            +V Y               + + GGK  K             +  KEG+  G        
Sbjct: 118  EVSYALQQVAWRRRQRHYESGKVGGKEFKR--SGMGFKGQRMEVAKEGQNSG-------- 167

Query: 1992 PNVNGNENLDAGDVKGGKGEAKVESGEERKDIVEESGGDGSVESQGSR----EAVSTIKP 1825
             + +GN  + A   +        E G E+++ V+  G  G VE + S     +  +  KP
Sbjct: 168  VDSDGNSTVTAVSERN-------ERGSEKREEVKSCGEVGKVEDKCSTFTEDKKDTGSKP 220

Query: 1824 -----EHSSENTDDGHLYDSKENDCHSERILHEKQSPIVTPKTFVGTEIYDGKSFNVVDG 1660
                 E  +E+ + G     KEND  S +  +EKQ+    PKTFVG E++DGK  NVVDG
Sbjct: 221  HAGDAESVTEDVNGGCTSSYKENDLCSIQNQNEKQNLAAGPKTFVGNEMFDGKMVNVVDG 280

Query: 1659 MKLYEELFDNSEVSKLITLVNDLRAAGRRGQLQ-GQTFIVSKRPMKGHGRETIQFGLPIA 1483
            +KLYEELFD+ EV  L++LVNDLRAAG+RGQLQ GQT++ +KRPMKGHGRE IQ GLPIA
Sbjct: 281  LKLYEELFDDKEVLDLVSLVNDLRAAGKRGQLQAGQTYVAAKRPMKGHGREMIQLGLPIA 340

Query: 1482 DAPHEDEAAAGTSKDRKIEPIPGLFQDVIERLMANQVVNVKPDSCIIDIFNEGDHSQPHM 1303
            DAP +DE AAGTSKDR+IE IP L QD IERL+  QV+ VKPDSCIID++NEGDHSQP M
Sbjct: 341  DAPLDDENAAGTSKDRRIEGIPPLLQDTIERLVNLQVMTVKPDSCIIDVYNEGDHSQPRM 400

Query: 1302 WPHSFGRPVCLLFLTECEMTFGKIIGV-DHPGDYRGAFKHSLTPGSLLVLQGRSTDFAKH 1126
            WP  FG+PVC++FLTEC++TFG+++ V DHPGDYRG+ K SL PGSLLV+QG+S DFAKH
Sbjct: 401  WPPWFGKPVCIMFLTECDITFGRVVIVADHPGDYRGSLKLSLAPGSLLVMQGKSADFAKH 460

Query: 1125 AIPSIRKQRILVTLTKSQPKRMGTADAQRFPSAVGAAPSHWVPPPSRSPNHMRHPVGPKH 946
            A+PS+RKQRILVT TK    +  T D QR  S   +  S W PPPSRSPN +RH  GPKH
Sbjct: 461  ALPSVRKQRILVTFTKYCQPKKSTTDNQRLSSPSVSQSSQWGPPPSRSPNRIRHSAGPKH 520

Query: 945  YGHVP-TGVLSAPNTRXXXXXXXXXXXIFXXXXXXXXXXXXXXXXXXXXXXASAGWPAAT 769
            Y  +P TGVL AP  R           +F                       S GWPAA 
Sbjct: 521  YAVIPTTGVLPAPPIRPQIPPSSGVQPLF---VPTAVAPAISFPAPVPIPPGSTGWPAA- 576

Query: 768  PRHPSPRLPVPGTGVFLXXXXXXXXXXXXSIATPAIESSITDTSAYSEKDIKGNI--NGN 595
            PRHP PRLPVPGTGVFL               T    + + +T++  EK+  G++  N +
Sbjct: 577  PRHPPPRLPVPGTGVFLPPPGSGNSSSQQLSTTATELNILVETTSPREKE-NGSVKPNHH 635

Query: 594  TDSPRGKVDENLQYQECNGSVDGNGHTEVIPKEEQQHQNSESKGT 460
            T SPRG++D     Q+CNGSVDG G    + KEEQ   ++  K T
Sbjct: 636  TTSPRGRLDGKSPKQDCNGSVDGAGSGRALMKEEQHCADNSVKQT 680


>ref|XP_006469304.1| PREDICTED: uncharacterized protein LOC102618872 [Citrus sinensis]
          Length = 627

 Score =  580 bits (1495), Expect = e-162
 Identities = 318/641 (49%), Positives = 394/641 (61%), Gaps = 13/641 (2%)
 Frame = -1

Query: 2355 DGFISWLRGEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSV 2176
            D F+ WLRGEFAAANA+ID LCHHLRV+GEPGEYD  I CIQQRR NWN VLH+Q YFSV
Sbjct: 14   DPFVMWLRGEFAAANAIIDTLCHHLRVIGEPGEYDFAINCIQQRRCNWNSVLHLQQYFSV 73

Query: 2175 NDVLYXXXXXXXXXXXXXXXAFEWGGKMGKEYXXXXXXXXXXGDFFKEGKEGGHHMNS-- 2002
            ++V+                      K  + +                  +  HH+N   
Sbjct: 74   SEVMLALQQVAWR-------------KQQRSFDHHHHH------------QQQHHLNRTK 108

Query: 2001 -----KAVPNVNGNENLDAGDVKGGKGEAKVESGEERKDIVEESGGDGSVESQGSREAVS 1837
                 K   + N N N  A D       +   + +++KD+V ++  DGS +S G+ E   
Sbjct: 109  RSAFVKKDFHNNNNNNNHAFD-------SNSSAFDDKKDVVMKAHDDGSAKSLGNSEITQ 161

Query: 1836 TIKPEHSSENTDDGHLYDSKENDCHSERILHEKQSPIVTPKTFVGTEIYDGKSFNVVDGM 1657
                E  +E  DDG     KEND  S +  +EKQ+  +  K+FVGTE+ DGK  NVVDG+
Sbjct: 162  VGDAEPKAEALDDGCTPGLKENDSQSVQSQNEKQNQSMAAKSFVGTEMVDGKMVNVVDGL 221

Query: 1656 KLYEELFDNSEVSKLITLVNDLRAAGRRGQLQGQTFIVSKRPMKGHGRETIQFGLPIADA 1477
            KLYEE+  NSEVSKL++LVNDLR AG+RGQ+QG  ++VSKRP++GHGRE IQ GLPI D 
Sbjct: 222  KLYEEVSGNSEVSKLVSLVNDLRTAGKRGQIQGPAYVVSKRPIRGHGREVIQLGLPIVDG 281

Query: 1476 PHEDEAAAGTSKDRKIEPIPGLFQDVIERLMANQVVNVKPDSCIIDIFNEGDHSQPHMWP 1297
            P EDE AAGTS+DR+IEPIP L QDVI+RL+  Q++ VKPDSCI+D+FNEGDHSQPH+ P
Sbjct: 282  PPEDEIAAGTSRDRRIEPIPSLLQDVIDRLVGLQIMTVKPDSCIVDVFNEGDHSQPHISP 341

Query: 1296 HSFGRPVCLLFLTECEMTFGKIIGVDHPGDYRGAFKHSLTPGSLLVLQGRSTDFAKHAIP 1117
              FGRPVC+LFLTEC+MTFG++IG+DHPGDYRG  + S+ PGSLLV+QG+S D AKHAI 
Sbjct: 342  SWFGRPVCILFLTECDMTFGRMIGIDHPGDYRGTLRLSVAPGSLLVMQGKSADIAKHAIS 401

Query: 1116 SIRKQRILVTLTKSQPKRMGTADAQRFPSAVGAAPS-HWVPPPSRSPNHMRHPVGPKHYG 940
            SIRKQRILVT TKSQPK++   D QR  S  G APS HW  PP R PNH+RHP GPKH+ 
Sbjct: 402  SIRKQRILVTFTKSQPKKLTPTDGQRLASP-GIAPSPHWGLPPGRPPNHIRHPTGPKHFA 460

Query: 939  HVP-TGVLSAPNTRXXXXXXXXXXXIFXXXXXXXXXXXXXXXXXXXXXXASAGWPAATPR 763
             +P TGVL AP  R           IF                       S GW AA PR
Sbjct: 461  PIPTTGVLPAPAIRAQIPPTNGVPPIF---VSPPVTPAMPFPAPVPIPPGSTGWTAAPPR 517

Query: 762  H---PSPRLPVPGTGVFLXXXXXXXXXXXXSIATPAIESSITDTSAYSEKDI-KGNINGN 595
            H   P PRLPVPGTGVFL             +++ A E  I +  + +EK+   G  N  
Sbjct: 518  HTPPPPPRLPVPGTGVFLPPPGSGGSSSPRQVSSAATEHLIPEMGSQAEKENGSGKSNHE 577

Query: 594  TDSPRGKVDENLQYQECNGSVDGNGHTEVIPKEEQQHQNSE 472
            T++P+ K+    Q Q CNGSVDG G  + + KEE QHQ+ E
Sbjct: 578  TNAPKEKLVGETQGQGCNGSVDGTGSVKAVMKEENQHQSVE 618


>ref|XP_004236917.1| PREDICTED: uncharacterized protein LOC101261013 [Solanum
            lycopersicum]
          Length = 641

 Score =  579 bits (1492), Expect = e-162
 Identities = 341/691 (49%), Positives = 413/691 (59%), Gaps = 19/691 (2%)
 Frame = -1

Query: 2505 MPSGNAVV------PEKMQGRGGGGSELMQPQGRGXXXXXXXXXXXXXXXXQMDERDGFI 2344
            M SGNA V      PEK    GGGG  +  P+                   Q+DERDGFI
Sbjct: 1    MQSGNAAVAVAVAVPEKKHSNGGGGEAVAVPRQH-------QHQQQWFHPQQVDERDGFI 53

Query: 2343 SWLRGEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVNDVL 2164
            SWLRGEFAA+NA+IDALCHHLR+VGEPGEYDGVIGC+QQRR+NWN VLHMQ Y SV +V+
Sbjct: 54   SWLRGEFAASNAIIDALCHHLRLVGEPGEYDGVIGCVQQRRANWNSVLHMQQYHSVAEVI 113

Query: 2163 YXXXXXXXXXXXXXXXAFEWG-GKMGKEYXXXXXXXXXXG-DFFKEGKEG-GHHMNSKAV 1993
            Y                F+ G  K+GK              +  K+GKE  G + +  A 
Sbjct: 114  YSLHQVEWMKQQKG---FDGGVNKVGKRNGSKGGGGGGWKSEGLKDGKESQGQNFSLDAH 170

Query: 1992 PNVNGNENLDAGDVKGGKGE---AKVESGEERKDIVEESGGDGSVESQGSREAVSTIKPE 1822
               NG E +D  + K G  +   AK E+    K  V    GD    SQG  +     K +
Sbjct: 171  SKTNGVEKIDVVEEKQGDKKELAAKPEANSSVKGSVCTEAGD----SQGEVD-----KTD 221

Query: 1821 HSSENTDDGHLYDSKENDCHSERILHEKQSPIVTPKTFVGTEIYDGKSFNVVDGMKLYEE 1642
               ++  +G    + E++ HS +I  EKQ+  V PKTFV TEIYDGK  NVVDGMKLYEE
Sbjct: 222  DKRDSNSEGS--SNVESESHSFQIPTEKQN--VVPKTFVATEIYDGKPVNVVDGMKLYEE 277

Query: 1641 LFDNSEVSKLITLVNDLRAAGRRGQLQGQTFIVSKRPMKGHGRETIQFGLPIADAPHEDE 1462
            L  +SEVSKL+TLVNDLRAAGRRGQL  Q FIVSKRPMKGHGRE +Q GLPI DAP E+E
Sbjct: 278  LLSSSEVSKLVTLVNDLRAAGRRGQLPAQAFIVSKRPMKGHGREMVQLGLPIVDAPPEEE 337

Query: 1461 AAAGTSKDRKIEPIPGLFQDVIERLMANQVVNVKPDSCIIDIFNEGDHSQPHMWPHSFGR 1282
            +A  T KDRK E IPGL QDVI++L A Q ++VKPD+C+IDIFNEGDHSQPH+WP+ +GR
Sbjct: 338  SAISTYKDRKTEAIPGLLQDVIDQLSAMQALSVKPDACVIDIFNEGDHSQPHLWPYWYGR 397

Query: 1281 PVCLLFLTECEMTFGKIIGVDHPGDYRGAFKHSLTPGSLLVLQGRSTDFAKHAIPSIRKQ 1102
            P+  LFLT+CEMTFGK+IGVDHPGDYRG+ K SL PGS+LV+QGRST+FAK+AIPSIRKQ
Sbjct: 398  PISTLFLTDCEMTFGKVIGVDHPGDYRGSLKLSLAPGSVLVMQGRSTEFAKYAIPSIRKQ 457

Query: 1101 RILVTLTKSQPKRMGTADAQRFPSAVGAAPSHWVPPPSRSPNHMRHPVGPKHYGHVP-TG 925
            R+LVT TK Q +R+ + D+QRFPS+ G   S WV PPSRS NH+R P GPKHYG +P TG
Sbjct: 458  RMLVTFTKLQLRRIKSGDSQRFPSSAGGPVSQWV-PPSRSSNHIRRPFGPKHYGSMPATG 516

Query: 924  VLSAPNTRXXXXXXXXXXXIFXXXXXXXXXXXXXXXXXXXXXXASAGWPAATPRHPSPRL 745
            VL  P  R                                   ASAGW     RHP PRL
Sbjct: 517  VLPIPGVRPQFAPANMQ----PIFVPATVAPAMPFPAPVALPPASAGWAVPPIRHPPPRL 572

Query: 744  PVPGTGVFLXXXXXXXXXXXXSIATPAIESSITD------TSAYSEKDIKGNINGNTDSP 583
            P+PGTGVFL                P   +S TD      T   S+  +   +N ++   
Sbjct: 573  PLPGTGVFL---------------PPGSGTSSTDNIPAENTGPLSDSTVSQKVNSDS--- 614

Query: 582  RGKVDENLQYQECNGSVDGNGHTEVIPKEEQ 490
                   +Q Q+CNG  D +   + +  EEQ
Sbjct: 615  -----SEVQTQDCNGKADVSDAEKAVACEEQ 640


>ref|XP_006585073.1| PREDICTED: uncharacterized protein LOC100794176 [Glycine max]
          Length = 683

 Score =  577 bits (1488), Expect = e-162
 Identities = 333/699 (47%), Positives = 416/699 (59%), Gaps = 19/699 (2%)
 Frame = -1

Query: 2511 MAMPSGNAVVPEKMQ---GRGGGGSELMQPQGRGXXXXXXXXXXXXXXXXQMDERDGFIS 2341
            MAMPSGN V+ +KMQ   G GGGG       G G                 +DERDG I 
Sbjct: 1    MAMPSGNVVIQDKMQFPSGAGGGG-------GGGGAGGEIHQPHHYRPQWFVDERDGLIG 53

Query: 2340 WLRGEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVNDVLY 2161
            WLR EFAAANA+ID+LCHHLRVVG+PGEYD V+G IQQRR NWN VL MQ YFSV DV Y
Sbjct: 54   WLRSEFAAANAIIDSLCHHLRVVGDPGEYDMVVGAIQQRRCNWNQVLMMQQYFSVADVAY 113

Query: 2160 XXXXXXXXXXXXXXXAFEWGGK----MGKEYXXXXXXXXXXGDFFKEGKEGGHHMN---- 2005
                             + G K     G  Y            +    +   H  N    
Sbjct: 114  ALQQVAWRRQQRPLDPMKVGAKEVRKSGSGYRHGQRFESVKEGYNSSVESYSHDANVAVT 173

Query: 2004 ---SKAVPNVNGNENLDAGDVKGGKGEAKVESGEERKDIVEESGGDGSVESQGSREAVST 1834
                K  P V  +E   +G      G+  + S EE+KD +     +GS++S  S E   +
Sbjct: 174  GGTEKGTPVVEKSEEHKSGGKVEKVGDKGLASVEEKKDAITNHQSEGSLKSARSTEG--S 231

Query: 1833 IKPEHSSENTDDGHLYDSKENDCHSERILHEKQSPIVTPKTFVGTEIYDGKSFNVVDGMK 1654
            +    S    +DG + +SK ND HS +   + QS     KTF+G E++DGK+ NVVDG+K
Sbjct: 232  LSNLESEAVVNDGCISNSKGNDLHSVQNQSQSQSLSNIAKTFIGNEMFDGKTVNVVDGLK 291

Query: 1653 LYEELFDNSEVSKLITLVNDLRAAGRRGQLQG-QTFIVSKRPMKGHGRETIQFGLPIADA 1477
            LY++LFD++EV+ L++LVNDLR +G++GQLQG Q +IVS+RPMKGHGRE IQ G+ IADA
Sbjct: 292  LYDDLFDSTEVANLVSLVNDLRVSGKKGQLQGSQAYIVSRRPMKGHGREMIQLGVRIADA 351

Query: 1476 PHEDEAAAGTSKDRKIEPIPGLFQDVIERLMANQVVNVKPDSCIIDIFNEGDHSQPHMWP 1297
            P E E   G SKD  +E IP LFQD+IER++++QV+ VKPD CI+D +NEGDHSQPH WP
Sbjct: 352  PAEGENMTGASKDMNVESIPSLFQDIIERMVSSQVMTVKPDCCIVDFYNEGDHSQPHSWP 411

Query: 1296 HSFGRPVCLLFLTECEMTFGKIIGVDHPGDYRGAFKHSLTPGSLLVLQGRSTDFAKHAIP 1117
              +GRPV +LFLTECEMTFG++I  +HPGDYRG+ K SL PGSLLV+QG+S+DFAKHA+P
Sbjct: 412  SWYGRPVYVLFLTECEMTFGRVIASEHPGDYRGSIKLSLVPGSLLVMQGKSSDFAKHALP 471

Query: 1116 SIRKQRILVTLTKSQPKRMGTADAQRFPSAVGAAPSHWVPPPSRSPNHMRHPVGPKHYGH 937
            S RKQRILVT TKSQP++  ++DAQ+  SAV  A SHW PPPSRSPNH+RH VGPKHY  
Sbjct: 472  STRKQRILVTFTKSQPRKSLSSDAQQLASAV--ASSHWGPPPSRSPNHVRHHVGPKHYAT 529

Query: 936  VP-TGVLSAPNTRXXXXXXXXXXXIFXXXXXXXXXXXXXXXXXXXXXXASAGWPAA-TPR 763
            +P TGVL AP  R           +F                       S GW AA  PR
Sbjct: 530  LPTTGVLPAPPIRPQMAAPVGMQPLF---VAAPVVPPMPFSAPVPIPAGSTGWTAAPPPR 586

Query: 762  HPSPRLPVPGTGVFLXXXXXXXXXXXXSIATPAIESSITDTSAYSEKDIKGNINGNTD-- 589
            HP PR+P PGTGVFL              +T A  +  T+T    EK+  G IN N+   
Sbjct: 587  HPPPRVPAPGTGVFLPPSGSGNSSQQLPASTLAEVNPSTETPTMPEKE-NGKINHNSTSA 645

Query: 588  SPRGKVDENLQYQECNGSVDGNGHTEVIPKEEQQHQNSE 472
            SP+GKV    Q QECNG  DG   T+V P  E +  +++
Sbjct: 646  SPKGKV----QKQECNGHADG---TQVEPALETRLDSND 677


>ref|XP_002527549.1| conserved hypothetical protein [Ricinus communis]
            gi|223533099|gb|EEF34858.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 697

 Score =  577 bits (1488), Expect = e-162
 Identities = 339/718 (47%), Positives = 420/718 (58%), Gaps = 29/718 (4%)
 Frame = -1

Query: 2511 MAMPSGNAVVPEKMQGRGGGGSELMQPQGRGXXXXXXXXXXXXXXXXQMDERDGFISWLR 2332
            MAMP GN V+ +K+Q   GGG       G                   +DERDGFISWLR
Sbjct: 1    MAMPPGNVVISDKIQFPAGGGGVGGGNNGGIGNEIQQQQHHHRHQWFPVDERDGFISWLR 60

Query: 2331 GEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVNDVLYXXX 2152
            GEFAAANA+ID+LCHHLR  GEPGEYD VIGCIQQRR NWNPVLHMQ YFSV +V+    
Sbjct: 61   GEFAAANAIIDSLCHHLRAAGEPGEYDVVIGCIQQRRCNWNPVLHMQQYFSVGEVILALQ 120

Query: 2151 XXXXXXXXXXXXAFEWGGKMGKEYXXXXXXXXXXGDFFKEGKEG---GHHMNSKAVPNVN 1981
                          +      +             DF +    G   GH    + V  VN
Sbjct: 121  QVALRKQQQHQHQHQHQ----QHRYYYDQPKVGGKDFKRNSSMGFNKGHRGGGEVVKEVN 176

Query: 1980 -------------GNENLDAGDVKGGKGEAKVES-----GEERKDIVEESGGDGSVESQG 1855
                         GNE  +  ++K G    ++E+      E++KD   +   D +++S G
Sbjct: 177  YGAESHGLDGNTSGNEKFN--EIKSGGDSGRLENKSLATAEDKKDAASKPHVD-NLKSSG 233

Query: 1854 SREAVSTIKPEHSSENTDDGHLYDSKENDCHSERILHEKQSPIVTPKTFVGTEIYDGKSF 1675
            + E   +   E  +E   +      KE+D H  +    K +   TPKTFVG E+ DGKS 
Sbjct: 234  NSEGSLSGNLETEAEAVHEQS--SPKEHDSHFIQNQIVKLNLTTTPKTFVGAEMVDGKSV 291

Query: 1674 NVVDGMKLYEELFDNSEVSKLITLVNDLRAAGRRGQLQGQTFIVSKRPMKGHGRETIQFG 1495
            NVVDG+KLYE+L D+ EVSKL++LVNDLRAAGR+GQ QGQ ++VSKRPMKGHGRE IQ G
Sbjct: 292  NVVDGLKLYEQLLDDVEVSKLVSLVNDLRAAGRKGQFQGQAYVVSKRPMKGHGREMIQLG 351

Query: 1494 LPIADAPHEDEAAAGTSKDRKIEPIPGLFQDVIERLMANQVVNVKPDSCIIDIFNEGDHS 1315
            LPIADAP E+E AAGTSKDRKIE IP L Q+VIER ++ Q++ +KPDSCIIDI+NEGDHS
Sbjct: 352  LPIADAPAEEENAAGTSKDRKIESIPTLLQEVIERFVSMQIMTMKPDSCIIDIYNEGDHS 411

Query: 1314 QPHMWPHSFGRPVCLLFLTECEMTFGKIIGVDHPGDYRGAFKHSLTPGSLLVLQGRSTDF 1135
            QPHMWP  FG+P+ +LFLTEC++TFG++I  DHPGDYRG+ K  L PGSLLV+QG++TDF
Sbjct: 412  QPHMWPPWFGKPISVLFLTECDLTFGRVITADHPGDYRGSLKLPLAPGSLLVMQGKATDF 471

Query: 1134 AKHAIPSIRKQRILVTLTKSQPKRMGTADAQRFPSAVGAAPSHWVPPPSRSPNHMRHPVG 955
            AKHAIP+IRKQR+L+T TKSQPK+   +D QR  S   +  SHW PPPSRSPNH+RHPV 
Sbjct: 472  AKHAIPAIRKQRVLLTFTKSQPKKFVQSDGQRLTSPAASPSSHWGPPPSRSPNHIRHPVS 531

Query: 954  PKHYGHVP-TGVLSAPNTRXXXXXXXXXXXIFXXXXXXXXXXXXXXXXXXXXXXASAGWP 778
             KHY  +P TGVL AP+ R           +F                       S GWP
Sbjct: 532  -KHYAPIPTTGVLPAPSIRPQIAPPNGVQPLF---VTAPVAAPMPFPAPVPMPPVSTGWP 587

Query: 777  AATPRHPSPRL--PVPGTGVFL-----XXXXXXXXXXXXSIATPAIESSITDTSAYSEKD 619
            AA PRHP  RL  PVPGTGVFL                  I  PA  +S+ D     E  
Sbjct: 588  AA-PRHPPNRLPVPVPGTGVFLPPPGSGNASSPQIPNATEINFPAETASLQD----KENG 642

Query: 618  IKGNINGNTDSPRGKVDENLQYQECNGSVDGNGHTEVIPKEEQQHQNSESKGTEKSAG 445
            +  + +G   SP+ K++   Q Q+CNG  DG   T    KEE Q Q+ +    +KSAG
Sbjct: 643  LGKSNHGTCASPKEKLEAKSQKQDCNGITDGKAGT----KEEHQ-QSVDHTAVDKSAG 695


>ref|XP_006379789.1| hypothetical protein POPTR_0008s13830g [Populus trichocarpa]
            gi|550333016|gb|ERP57586.1| hypothetical protein
            POPTR_0008s13830g [Populus trichocarpa]
          Length = 693

 Score =  573 bits (1476), Expect = e-160
 Identities = 335/719 (46%), Positives = 410/719 (57%), Gaps = 31/719 (4%)
 Frame = -1

Query: 2511 MAMPSGNAVVPEKMQ------GRGGGGSELMQPQGRGXXXXXXXXXXXXXXXXQMDERDG 2350
            MAMP GN V+P+K+Q      G GGGG+E+ Q Q                    +DERDG
Sbjct: 1    MAMPPGNVVIPDKVQFPAGAAGGGGGGNEIHQHQ------------LQRHQWFPVDERDG 48

Query: 2349 FISWLRGEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVND 2170
            FISWLRGEFAAANA+ID+LCHHLR VGE GEYD V+GCIQQRRSNWN VLHMQ YFSV +
Sbjct: 49   FISWLRGEFAAANAIIDSLCHHLRAVGEAGEYDLVVGCIQQRRSNWNHVLHMQQYFSVGE 108

Query: 2169 VL---------------------YXXXXXXXXXXXXXXXAFEWGGKMGKEYXXXXXXXXX 2053
            V+                     +                F+     G            
Sbjct: 109  VIVALQQVVLRRQQQQQQQQQNHHHQQRFYYDHGKVGGRDFKRSSSAGFNRGHRGGGGGG 168

Query: 2052 XGDFFKEGKEGGHHMNSKAVPNVNGNENLDAGDVKGGKGEAKVESGEERKDIVEESGGDG 1873
             GD  KEG       +S    N N +EN+ +   +  K        +++KD   +S  D 
Sbjct: 169  GGDAVKEGVNSSVENHSF---NGNSSENIRSEKFEEVKSGGDGGKSDDKKDATAKSHTDN 225

Query: 1872 SVESQGSREAVSTIKPEHSSENTDDGHLYDSKENDCHSERILHEKQSPIVTPKTFVGTEI 1693
               S G+  A  T      +   DD      +E+D H     +EKQ+  +TPKTFV  E 
Sbjct: 226  HKNSSGN--AQGTFSGNSEAVAVDDRS--SPEESDSHPSNNQNEKQNLAITPKTFVAEEK 281

Query: 1692 YDGKSFNVVDGMKLYEELFDNSEVSKLITLVNDLRAAGRRGQLQGQTFIVSKRPMKGHGR 1513
             DG+  NVVDG+KLYE L D  EVSKL++LVN+LRA GRRGQ QGQT+I+SKRPMKGHGR
Sbjct: 282  IDGQMVNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQGQTYILSKRPMKGHGR 341

Query: 1512 ETIQFGLPIADAPHEDEAAAGTSKDRKIEPIPGLFQDVIERLMANQVVNVKPDSCIIDIF 1333
            E IQ GLPIADAP EDE A GTSK+R++E IP L QDVIE  +A QV+ +KPDSCIIDI+
Sbjct: 342  EMIQLGLPIADAPAEDENATGTSKERRVESIPALLQDVIEHFVAMQVMTMKPDSCIIDIY 401

Query: 1332 NEGDHSQPHMWPHSFGRPVCLLFLTECEMTFGKIIGVDHPGDYRGAFKHSLTPGSLLVLQ 1153
            NEGDHSQPHMWP  FG+PV +LFLTECE+TFGK+I   H GDY+G+ K S+ PGSLLV+Q
Sbjct: 402  NEGDHSQPHMWPPWFGKPVSVLFLTECELTFGKVIDTLHHGDYKGSLKLSVAPGSLLVMQ 461

Query: 1152 GRSTDFAKHAIPSIRKQRILVTLTKSQPKRMGTADAQRFPSAVGAAPSHWVPPPSRSPNH 973
            G+S+D AKHAIP I+KQR+LVT TKSQPK++ + D  R PS   A  SHW PPPSRSPNH
Sbjct: 462  GKSSDLAKHAIPMIKKQRMLVTFTKSQPKKLTSNDGPRLPSHAVAPSSHWGPPPSRSPNH 521

Query: 972  MRHPVGPKHYGHVP-TGVLSAPNTRXXXXXXXXXXXIFXXXXXXXXXXXXXXXXXXXXXX 796
            +RHPV PKHY  +P TGVL  P  R           +F                      
Sbjct: 522  LRHPV-PKHYAAIPTTGVLLVPPIRPQIPPPNGVQPLF---MTTPVAAPMPFPAPVPIPP 577

Query: 795  ASAGWPAATPRHPSPRLPV--PGTGVFLXXXXXXXXXXXXSIATPAIESSITDTSAYSEK 622
             S GWP ++PRHPS RLPV  PGTGVFL             ++  A E +    +   ++
Sbjct: 578  VSTGWPTSSPRHPSARLPVPIPGTGVFLPPPGSGNASSALQLSATATEMNFPTETEKEKE 637

Query: 621  DIKGNINGNTD-SPRGKVDENLQYQECNGSVDGNGHTEVIPKEEQQHQNSESKGTEKSA 448
            +  G  N +T  SP+ K  E  Q Q+ NG VDG      + KEEQQ  +    G    A
Sbjct: 638  NGPGKSNHDTSASPKEKSAEKTQRQDSNGDVDG----IAVKKEEQQSVSHTVAGQSAGA 692


>gb|ABK95394.1| unknown [Populus trichocarpa]
          Length = 694

 Score =  572 bits (1473), Expect = e-160
 Identities = 335/723 (46%), Positives = 413/723 (57%), Gaps = 35/723 (4%)
 Frame = -1

Query: 2511 MAMPSGNAVVPEKMQ------GRGGGGSELMQPQGRGXXXXXXXXXXXXXXXXQMDERDG 2350
            MAMP GN V+P+K+Q      G GGGG+E+ Q Q                    +DERDG
Sbjct: 1    MAMPPGNVVIPDKVQFPAGAAGGGGGGNEIHQHQ------------LQRHQWFPVDERDG 48

Query: 2349 FISWLRGEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVND 2170
            FISWLRGEFAAANA+ID+LCHHLR VGE GEYD V+GCIQQRRSNWN VLHMQ YFSV +
Sbjct: 49   FISWLRGEFAAANAIIDSLCHHLRAVGEAGEYDLVVGCIQQRRSNWNHVLHMQQYFSVGE 108

Query: 2169 VLYXXXXXXXXXXXXXXXA--------------FEWGGKMGKEYXXXXXXXXXXGDFFKE 2032
            V+                               ++ G   G+++             F  
Sbjct: 109  VIVALQQVVLRRQQQQQQQQQQQQNHHHQQRFYYDHGKVGGRDFKRSSSAG------FNR 162

Query: 2031 GKEGG--------HHMNSKAVP---NVNGNENLDAGDVKGGKGEAKVESGEERKDIVEES 1885
            G  GG          +NS       N N +EN+ +   +  K        +++KD   +S
Sbjct: 163  GHRGGGGGGDAVKEGVNSSVENHSFNGNSSENIRSEKFEEVKSGGDGGKSDDKKDATAKS 222

Query: 1884 GGDGSVESQGSREAVSTIKPEHSSENTDDGHLYDSKENDCHSERILHEKQSPIVTPKTFV 1705
              D    S G+  A  T      +   DD      +E+D H     +EKQ+  +TPKTFV
Sbjct: 223  HTDNHKNSSGN--AQGTFSGNSEAVAVDDRS--SPEESDSHPSNNQNEKQNLAITPKTFV 278

Query: 1704 GTEIYDGKSFNVVDGMKLYEELFDNSEVSKLITLVNDLRAAGRRGQLQGQTFIVSKRPMK 1525
              E  DG+  NVVDG+KLYE L D  EVSKL++LVN+LRA GRRGQ QGQT+I+SKRPMK
Sbjct: 279  AEEKIDGQMVNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQGQTYILSKRPMK 338

Query: 1524 GHGRETIQFGLPIADAPHEDEAAAGTSKDRKIEPIPGLFQDVIERLMANQVVNVKPDSCI 1345
            GHGRE IQ GLPIADAP EDE A GTSK+R++E IP L QDVIE  +A QV+ +KPDSCI
Sbjct: 339  GHGREMIQLGLPIADAPAEDENATGTSKERRVESIPALLQDVIEHFVAMQVMTMKPDSCI 398

Query: 1344 IDIFNEGDHSQPHMWPHSFGRPVCLLFLTECEMTFGKIIGVDHPGDYRGAFKHSLTPGSL 1165
            IDI+NEGDHSQPHMWP  FG+PV +LFLTECE+TFGK+I   H GDY+G+ K S+ PGSL
Sbjct: 399  IDIYNEGDHSQPHMWPPWFGKPVSVLFLTECELTFGKVIDTLHHGDYKGSLKLSVAPGSL 458

Query: 1164 LVLQGRSTDFAKHAIPSIRKQRILVTLTKSQPKRMGTADAQRFPSAVGAAPSHWVPPPSR 985
            LV+QG+S+D AKHAIP I+KQR+LVT TKSQPK++ + D  R PS   A  SHW PPPSR
Sbjct: 459  LVMQGKSSDLAKHAIPMIKKQRMLVTFTKSQPKKLTSNDGPRLPSHAVAPSSHWGPPPSR 518

Query: 984  SPNHMRHPVGPKHYGHVP-TGVLSAPNTRXXXXXXXXXXXIFXXXXXXXXXXXXXXXXXX 808
            SPNH+RHPV PKHY  +P TGVL  P  R           +F                  
Sbjct: 519  SPNHLRHPV-PKHYAAIPTTGVLLVPPIRPQIPPPNGVQPLF---MTTPVAAPMPFPAPV 574

Query: 807  XXXXASAGWPAATPRHPSPRLPV--PGTGVFLXXXXXXXXXXXXSIATPAIESSITDTSA 634
                 S GWP ++PRHPS RLPV  PGTGVFL             ++  A E +    + 
Sbjct: 575  PIPPVSTGWPTSSPRHPSARLPVPIPGTGVFLPPPGSGNASSALQLSATATEMNFPTETE 634

Query: 633  YSEKDIKGNINGNTD-SPRGKVDENLQYQECNGSVDGNGHTEVIPKEEQQHQNSESKGTE 457
              +++  G  N +T  SP+ K  E  Q Q+ NG VDG      + KEEQQ  +    G  
Sbjct: 635  KEKENGPGKSNHDTSASPKEKSAEKTQRQDSNGDVDG----IAVKKEEQQSVSHTVAGQS 690

Query: 456  KSA 448
              A
Sbjct: 691  AGA 693


>ref|XP_002311547.2| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|550333015|gb|EEE88914.2| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 675

 Score =  566 bits (1459), Expect = e-158
 Identities = 333/715 (46%), Positives = 409/715 (57%), Gaps = 27/715 (3%)
 Frame = -1

Query: 2511 MAMPSGNAVVPEKMQ------GRGGGGSELMQPQGRGXXXXXXXXXXXXXXXXQMDERDG 2350
            MAMP GN V+P+K+Q      G GGGG+E+ Q Q                    +DERDG
Sbjct: 1    MAMPPGNVVIPDKVQFPAGAAGGGGGGNEIHQHQ------------LQRHQWFPVDERDG 48

Query: 2349 FISWLRGEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVND 2170
            FISWLRGEFAAANA+ID+LCHHLR VGE GEYD V+GCIQQRRSNWN VLHMQ YFSV +
Sbjct: 49   FISWLRGEFAAANAIIDSLCHHLRAVGEAGEYDLVVGCIQQRRSNWNHVLHMQQYFSVGE 108

Query: 2169 VLYXXXXXXXXXXXXXXXAFEWGGKMGKEYXXXXXXXXXXGDFFKEGKEGGHHMNSKAVP 1990
            V+                      +  ++             ++  GK GG      +  
Sbjct: 109  VIVALQQVVLR-------------RQQQQQQQQQNHHHQQRFYYDHGKVGGRDFKRSSSA 155

Query: 1989 NVNGNENLDAGDVKGGKGEAKVE-----------SGEERKDIVEE------SGGDGSVES 1861
              N       G   GG G+A  E           +G   ++I  E      SGGDG    
Sbjct: 156  GFNRGHRGGGG---GGGGDAVKEGVNSSVENHSFNGNSSENIRSEKFEEVKSGGDGG--K 210

Query: 1860 QGSREAVSTIKPEHSSENTDDGHLYDSKENDCHSERILHEKQSPIVTPKTFVGTEIYDGK 1681
               ++A +T K    +     G+   +   +  SE + +EKQ+  +TPKTFV  E  DG+
Sbjct: 211  SDDKKADATAKSHTDNHKNSSGNAQGTFSGN--SEAVANEKQNLAITPKTFVAEEKIDGQ 268

Query: 1680 SFNVVDGMKLYEELFDNSEVSKLITLVNDLRAAGRRGQLQGQTFIVSKRPMKGHGRETIQ 1501
              NVVDG+KLYE L D  EVSKL++LVN+LRA GRRGQ QGQT+I+SKRPMKGHGRE IQ
Sbjct: 269  MVNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQGQTYILSKRPMKGHGREMIQ 328

Query: 1500 FGLPIADAPHEDEAAAGTSKDRKIEPIPGLFQDVIERLMANQVVNVKPDSCIIDIFNEGD 1321
             GLPIADAP EDE A GTSK   +E IP L QDVIE  +A QV+ +KPDSCIIDI+NEGD
Sbjct: 329  LGLPIADAPAEDENATGTSKGT-VESIPALLQDVIEHFVAMQVMTMKPDSCIIDIYNEGD 387

Query: 1320 HSQPHMWPHSFGRPVCLLFLTECEMTFGKIIGVDHPGDYRGAFKHSLTPGSLLVLQGRST 1141
            HSQPHMWP  FG+PV +LFLTECE+TFGK+I   H GDY+G+ K S+ PGSLLV+QG+S+
Sbjct: 388  HSQPHMWPPWFGKPVSVLFLTECELTFGKVIDTLHHGDYKGSLKLSVAPGSLLVMQGKSS 447

Query: 1140 DFAKHAIPSIRKQRILVTLTKSQPKRMGTADAQRFPSAVGAAPSHWVPPPSRSPNHMRHP 961
            D AKHAIP I+KQR+LVT TKSQPK++ + D  R PS   A  SHW PPPSRSPNH+RHP
Sbjct: 448  DLAKHAIPMIKKQRMLVTFTKSQPKKLTSNDGPRLPSHAVAPSSHWGPPPSRSPNHLRHP 507

Query: 960  VGPKHYGHVP-TGVLSAPNTRXXXXXXXXXXXIFXXXXXXXXXXXXXXXXXXXXXXASAG 784
            V PKHY  +P TGVL  P  R           +F                       S G
Sbjct: 508  V-PKHYAAIPTTGVLLVPPIRPQIPPPNGVQPLF---MTTPVAAPMPFPAPVPIPPVSTG 563

Query: 783  WPAATPRHPSPRLPV--PGTGVFLXXXXXXXXXXXXSIATPAIESSITDTSAYSEKDIKG 610
            WP ++PRHPS RLPV  PGTGVFL             ++  A E +    +   +++  G
Sbjct: 564  WPTSSPRHPSARLPVPIPGTGVFLPPPGSGNASSALQLSATATEMNFPTETEKEKENGPG 623

Query: 609  NINGNTD-SPRGKVDENLQYQECNGSVDGNGHTEVIPKEEQQHQNSESKGTEKSA 448
              N +T  SP+ K  E  Q Q+ NG VDG      + KEEQQ  +    G    A
Sbjct: 624  KSNHDTSASPKEKSAEKTQRQDSNGDVDG----IAVKKEEQQSVSHTVAGQSAGA 674


>ref|XP_003524142.1| PREDICTED: uncharacterized protein LOC100809865 isoform X1 [Glycine
            max]
          Length = 681

 Score =  565 bits (1457), Expect = e-158
 Identities = 328/708 (46%), Positives = 419/708 (59%), Gaps = 28/708 (3%)
 Frame = -1

Query: 2511 MAMPSGNAVVPEKMQ------GRGGGGSELMQPQGRGXXXXXXXXXXXXXXXXQMDERDG 2350
            MAMPSGN V+ +KMQ      G GG G E+ QP                     +DERDG
Sbjct: 1    MAMPSGNVVIQDKMQFPSGGAGAGGAGGEIHQPH--------------YCQQWFVDERDG 46

Query: 2349 FISWLRGEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVND 2170
             I WLR EFAAANA+ID+LCHHLRVVG+PGEYD VIG IQQRR NWN VL MQ YFSV D
Sbjct: 47   LIGWLRSEFAAANAIIDSLCHHLRVVGDPGEYDMVIGAIQQRRCNWNQVLMMQQYFSVAD 106

Query: 2169 VLYXXXXXXXXXXXXXXXAFEWGGKMGKEYXXXXXXXXXXGDFFKEGKEGGHHMNSKAVP 1990
            V +                 + G    KE+            F  E  + G++ + ++  
Sbjct: 107  VAHALQQVAWRRQQRPLDPVKVG---AKEFRKSGSGYRHGQRF--EPVKEGYNSSVESYN 161

Query: 1989 NVNGNENLDAGDVK-------------GGK----GEAKVESGEERKDIVEESGGDGSVES 1861
              + N  +  G  K             GGK    G+  + S E++KD + +   DGS++S
Sbjct: 162  QYDANVTVTGGTEKGTPVVEKSEEHKSGGKVEKVGDKGLASAEDKKDAITKHQTDGSLKS 221

Query: 1860 QGSREAVSTIKPEHSSENTDDGHLYDSKENDCHSERILHEKQSPIVTPKTFVGTEIYDGK 1681
              S E   ++    S    +D  + +SK +D HS +  H+ QS     KTF+G E++DGK
Sbjct: 222  TRSTE--GSLSNLESEAVVNDECISNSKGDDSHSVQNQHQSQSLSTKAKTFIGNEMFDGK 279

Query: 1680 SFNVVDGMKLYEELFDNSEVSKLITLVNDLRAAGRRGQLQG-QTFIVSKRPMKGHGRETI 1504
              NVVDG+KLYE+LFD++E++ L++LVNDLR +G++GQLQG Q +IVS+RPMKGHGRE I
Sbjct: 280  MVNVVDGLKLYEDLFDSTEIANLVSLVNDLRVSGKKGQLQGSQAYIVSRRPMKGHGREMI 339

Query: 1503 QFGLPIADAPHEDEAAAGTSKDRKIEPIPGLFQDVIERLMANQVVNVKPDSCIIDIFNEG 1324
            Q G+PIADAP E E   G SKD  +EPIP LFQD+IER++++QV+ VKPD CI+D +NEG
Sbjct: 340  QLGVPIADAPAEGENMTGASKDMNVEPIPSLFQDIIERMVSSQVMTVKPDCCIVDFYNEG 399

Query: 1323 DHSQPHMWPHSFGRPVCLLFLTECEMTFGKIIGVDHPGDYRGAFKHSLTPGSLLVLQGRS 1144
            DHSQPH WP  +GRPV +LFLTECEMTFG++I  +HPGDYRG  K SL PGSLLV++G+S
Sbjct: 400  DHSQPHSWPSWYGRPVYILFLTECEMTFGRVIASEHPGDYRGGIKLSLVPGSLLVMEGKS 459

Query: 1143 TDFAKHAIPSIRKQRILVTLTKSQPKRMGTADAQRFPSAVGAAPSHWVPPPSRSPNHMRH 964
            +DFAKHA+PS+RKQRILVT TKSQP++  ++DAQR  S   A  SHW P PSRSPNH+RH
Sbjct: 460  SDFAKHALPSVRKQRILVTFTKSQPRKSLSSDAQRLAST--ATSSHWGPLPSRSPNHVRH 517

Query: 963  PVGPKHYGHVP-TGVLSAPNTRXXXXXXXXXXXIFXXXXXXXXXXXXXXXXXXXXXXASA 787
             VG KHY  +P TGVL +P  R           +F                       S 
Sbjct: 518  HVGSKHYATLPTTGVLPSPPIRPQMAAPVGMQPLF---VTAPVVPPMPFPAPVAFPPGST 574

Query: 786  GWPAA-TPRHPSPRLPVPGTGVFLXXXXXXXXXXXXSIATPAIESSITDTSAYSEKDI-K 613
            GW  A  PRHP PR+P PGTGVFL               T A  +  T+T    EK+  K
Sbjct: 575  GWTGAPPPRHPPPRVPAPGTGVFLPPPGSGNSSQQLPAGTLAEVNPSTETPTMLEKENGK 634

Query: 612  GNINGNTDSPRGKVDENLQYQECNG-SVDGNGHTEVIPKEEQQHQNSE 472
             N N  + SP+GKV    Q QECNG + DG   T+V P  E +  +++
Sbjct: 635  TNHNSTSASPKGKV----QKQECNGHAADG---TQVEPALETRQDSND 675


>ref|XP_004142291.1| PREDICTED: uncharacterized protein LOC101210274 [Cucumis sativus]
            gi|449481289|ref|XP_004156139.1| PREDICTED:
            uncharacterized LOC101210274 [Cucumis sativus]
          Length = 684

 Score =  560 bits (1442), Expect = e-156
 Identities = 327/705 (46%), Positives = 405/705 (57%), Gaps = 25/705 (3%)
 Frame = -1

Query: 2511 MAMPSGNAVVPEKMQGRGGGGSELMQPQGRGXXXXXXXXXXXXXXXXQMDERDGFISWLR 2332
            MAMPSGN  VP+K+  + GGG  +    G                    DERDGFISWLR
Sbjct: 1    MAMPSGNVGVPDKVSFQSGGGVAVSGGGGE--------IHQHHPRPWFPDERDGFISWLR 52

Query: 2331 GEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVNDVLYXXX 2152
            GEFAA+NA+IDALCHHLR VGEPGEYD VIGCIQQRR NW PVLHMQ YFSV +V+Y   
Sbjct: 53   GEFAASNAIIDALCHHLRAVGEPGEYDMVIGCIQQRRCNWTPVLHMQQYFSVAEVMYALQ 112

Query: 2151 XXXXXXXXXXXXAFEWGGKMGKEYXXXXXXXXXXGDFFKEGKEGGHHMNSKAVPN-VNGN 1975
                          + G K+ +               FK+  + GH   +      +   
Sbjct: 113  QVTSRRQQRYMDPVKVGPKLYRRPGPG----------FKQ--QQGHRAEATVKEETITCA 160

Query: 1974 ENLDAGDVKGGKGEAKVESGEERKDIVEESGGDGSVESQGSREAVSTIKPEH-------- 1819
            E+ + G+        KVE      D  + SG D  +  + S  AV   K  H        
Sbjct: 161  ESCNGGNSSTFVSSRKVEQVSNTCDESKASGEDEKLSEKDSGSAVDN-KDTHGKDQSNCK 219

Query: 1818 --SSENT-------------DDGHLYDSKENDCHSERILHEKQSPIVTPKTFVGTEIYDG 1684
              S+EN              DDG     ++ +  S +  + KQ    TP+TFV +E++DG
Sbjct: 220  TKSAENLEDNAINKDSQVEPDDGCSSSHRDKELQSVQSQNGKQYAATTPRTFVASEMFDG 279

Query: 1683 KSFNVVDGMKLYEELFDNSEVSKLITLVNDLRAAGRRGQLQGQTFIVSKRPMKGHGRETI 1504
            K  NV+DG+KL+EEL D++EVSKL++LVNDLRA+G+RGQ QGQT++VSKRPMKGHGRE I
Sbjct: 280  KMVNVMDGLKLFEELLDDAEVSKLLSLVNDLRASGKRGQFQGQTYVVSKRPMKGHGREMI 339

Query: 1503 QFGLPIADAPHEDEAAAGTSKDRKIEPIPGLFQDVIERLMANQVVNVKPDSCIIDIFNEG 1324
            Q G PIADAPHED+ + G SKDR+IEPIP L QD+I+RL+ +QV+ VKPDSCIID +NEG
Sbjct: 340  QLGFPIADAPHEDDNSLGLSKDRRIEPIPSLLQDLIDRLVGDQVMTVKPDSCIIDFYNEG 399

Query: 1323 DHSQPHMWPHSFGRPVCLLFLTECEMTFGKIIGVDHPGDYRGAFKHSLTPGSLLVLQGRS 1144
            DHSQPH+WP  FGRPV +L LTECE+TFG++IG DH G+YRGA K SLTPG+LLV+QG+S
Sbjct: 400  DHSQPHVWPSWFGRPVGVLLLTECEITFGRVIGTDHSGNYRGAMKLSLTPGNLLVVQGKS 459

Query: 1143 TDFAKHAIPSIRKQRILVTLTKSQPKRMGTADAQRFPSAVGAAPSHWVPPPSRSPNHMRH 964
             DFAKHA+P+IRKQRILVTLTKSQPKR   AD QR    VG   S W PP +RSPN    
Sbjct: 460  ADFAKHALPAIRKQRILVTLTKSQPKRAAPADGQRTSLNVGTF-SGWGPPSARSPNPRLS 518

Query: 963  PVGPKHYGHVP-TGVLSAPNTRXXXXXXXXXXXIFXXXXXXXXXXXXXXXXXXXXXXASA 787
            P G K Y  VP TGVL  P  R           +                         +
Sbjct: 519  P-GQKPYPTVPSTGVLPVPPIRPQMAPPNGIPPLI-----VPPVASPMPFTPVPIPTGPS 572

Query: 786  GWPAATPRHPSPRLPVPGTGVFLXXXXXXXXXXXXSIATPAIESSITDTSAYSEKDIKGN 607
             WP A  RHP PRLPVPGTGVFL                  I +  T + +  E  +  +
Sbjct: 573  AWPTAHTRHPPPRLPVPGTGVFLPPPGSSSAPTPSPQQQLPISNIETGSLSEKENGLTKS 632

Query: 606  INGNTDSPRGKVDENLQYQECNGSVDGNGHTEVIPKEEQQHQNSE 472
             + +   P  K D   Q QECNGS+DG+G+ +V  +E+QQ Q  E
Sbjct: 633  DHSSGTFPGEKPDAKAQRQECNGSIDGSGNDKVKEEEQQQQQEEE 677


>gb|ESW30779.1| hypothetical protein PHAVU_002G181800g [Phaseolus vulgaris]
          Length = 671

 Score =  557 bits (1436), Expect = e-156
 Identities = 325/695 (46%), Positives = 402/695 (57%), Gaps = 22/695 (3%)
 Frame = -1

Query: 2511 MAMPSGNAVVPEKMQGRGGGGS----ELMQPQGRGXXXXXXXXXXXXXXXXQMDERDGFI 2344
            MAMPSGN V+ +KMQ   GGG     E+ Q   R                  +DERDG I
Sbjct: 1    MAMPSGNVVIQDKMQFPNGGGGAGVGEIQQHHYR--------------QQWFVDERDGLI 46

Query: 2343 SWLRGEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVNDVL 2164
             WLR EFAAANA+ID+LCHHLRVVG+PGEYD VIG IQQRR NWN VL MQ YFSV DV 
Sbjct: 47   GWLRSEFAAANAIIDSLCHHLRVVGDPGEYDMVIGAIQQRRCNWNQVLLMQQYFSVADVT 106

Query: 2163 YXXXXXXXXXXXXXXXAFEWGG----KMGKEYXXXXXXXXXXGDFFKEGKEGGHHMNS-- 2002
            Y                 + G     K G  Y            +    +   H  N+  
Sbjct: 107  YTLQQVAWRKQQRPLDPVKVGAKEVRKPGPGYRYGHRFEPSKEGYNSSVESYSHDGNATF 166

Query: 2001 -----KAVPNVNGNENLDAGDVKGGKGEAKVESGEERKDIVEESGGDGSVESQGSREA-V 1840
                 K  P V+ +E   +G      G+  + S EE+KD + +   DG+++S GS E  +
Sbjct: 167  TRGMEKGTPTVDKSEEHKSGSKVEKVGDKGLASPEEKKDAIIKHQTDGNLKSTGSSEGYL 226

Query: 1839 STIKPEHSSENTDDGHLYDSKENDCHSERILHEKQSPIVTPKTFVGTEIYDGKSFNVVDG 1660
            S ++ E    N  D  + +SK ND  S    H+ QS     KTF+G E+ DGK  N+ DG
Sbjct: 227  SNLESEAVVVN--DEFISNSKGNDSDSVESQHQSQSFSTIAKTFIGNEMIDGKMVNLADG 284

Query: 1659 MKLYEELFDNSEVSKLITLVNDLRAAGRRGQLQG-QTFIVSKRPMKGHGRETIQFGLPIA 1483
            +KLYE++FD++EVS L++LVNDLR +G++GQLQG Q ++VS+RPMKGHGRE IQ G+PIA
Sbjct: 285  LKLYEDIFDSTEVSNLVSLVNDLRISGKKGQLQGNQAYVVSRRPMKGHGREMIQLGVPIA 344

Query: 1482 DAPHEDEAAAGTSKDRKIEPIPGLFQDVIERLMANQVVNVKPDSCIIDIFNEGDHSQPHM 1303
            DAP E E   G SK   +EPIP LF+D+IER++++QV+  KPD CI+D +NEGDHSQPH 
Sbjct: 345  DAPVEGENMTGASKVMNVEPIPSLFEDIIERMVSSQVMTTKPDCCIVDFYNEGDHSQPHS 404

Query: 1302 WPHSFGRPVCLLFLTECEMTFGKIIGVDHPGDYRGAFKHSLTPGSLLVLQGRSTDFAKHA 1123
            WP  FGRPV  LFLTECEMTFG++I  +HPGDYRG+ K SL PGSLL +QG+S DFAKHA
Sbjct: 405  WPSWFGRPVYTLFLTECEMTFGRLIASEHPGDYRGSLKLSLVPGSLLAMQGKSCDFAKHA 464

Query: 1122 IPSIRKQRILVTLTKSQPKRMGTADAQRFPSAVGAAPSHWVPPPSRSPNHMRHPVGPKHY 943
            +PSIRKQRILVT TKSQPK+   +DAQR    + AA S W PPPSRSPNH+RH VG KHY
Sbjct: 465  LPSIRKQRILVTFTKSQPKKSVPSDAQRL--YLPAASSQWGPPPSRSPNHVRHSVGSKHY 522

Query: 942  GHVP-TGVLSAPNTRXXXXXXXXXXXIFXXXXXXXXXXXXXXXXXXXXXXASAGWPAA-T 769
              +P TGVL AP  R           +F                       SAGW  A  
Sbjct: 523  AALPTTGVLPAPPIRPQIPAQVGMQPLF---VAAPVVPPMPYPAPVSIPPGSAGWTTAPP 579

Query: 768  PRHPSPRLPVPGTGVFLXXXXXXXXXXXXSIATPA-IESSITDTSAYSEKD--IKGNING 598
            PRHP PR+P PGTGVFL               T A +  SI   +   EK+     + N 
Sbjct: 580  PRHPPPRIPAPGTGVFLPPPGSGNSQQQLPAGTLAEVNPSIETPTTMQEKENGKSNDDNS 639

Query: 597  NTDSPRGKVDENLQYQECNGSVDGNGHTEVIPKEE 493
            ++ SP+GKV    Q QECNG  DG      +   E
Sbjct: 640  SSTSPKGKV----QKQECNGHTDGTRDEAALESRE 670

