BLASTX nr result

ID: Catharanthus22_contig00010376 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00010376
         (1450 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI26785.3| unnamed protein product [Vitis vinifera]              242   3e-61
ref|XP_002281644.1| PREDICTED: uncharacterized protein LOC100252...   241   4e-61
ref|XP_006355042.1| PREDICTED: uncharacterized protein LOC102600...   237   1e-59
ref|XP_004236917.1| PREDICTED: uncharacterized protein LOC101261...   236   2e-59
gb|EXC05040.1| hypothetical protein L484_019288 [Morus notabilis]     229   3e-57
gb|EMJ26321.1| hypothetical protein PRUPE_ppa002630mg [Prunus pe...   228   5e-57
ref|XP_006585073.1| PREDICTED: uncharacterized protein LOC100794...   228   6e-57
ref|XP_003524142.1| PREDICTED: uncharacterized protein LOC100809...   223   2e-55
gb|EOY01300.1| Hydroxyproline-rich glycoprotein family protein, ...   222   4e-55
gb|EOY01299.1| Hydroxyproline-rich glycoprotein family protein, ...   222   4e-55
gb|ESW30779.1| hypothetical protein PHAVU_002G181800g [Phaseolus...   221   5e-55
emb|CAN65462.1| hypothetical protein VITISV_002198 [Vitis vinifera]   221   8e-55
ref|XP_002311547.2| hydroxyproline-rich glycoprotein [Populus tr...   218   4e-54
ref|XP_006379789.1| hypothetical protein POPTR_0008s13830g [Popu...   217   9e-54
ref|XP_002527549.1| conserved hypothetical protein [Ricinus comm...   217   1e-53
ref|XP_006605475.1| PREDICTED: uncharacterized protein LOC100814...   216   1e-53
gb|ESW25182.1| hypothetical protein PHAVU_003G014200g [Phaseolus...   216   1e-53
gb|ABK95394.1| unknown [Populus trichocarpa]                          216   2e-53
ref|XP_006583757.1| PREDICTED: uncharacterized protein LOC100781...   214   7e-53
ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309...   214   7e-53

>emb|CBI26785.3| unnamed protein product [Vitis vinifera]
          Length = 672

 Score =  242 bits (618), Expect = 3e-61
 Identities = 155/356 (43%), Positives = 196/356 (55%), Gaps = 39/356 (10%)
 Frame = +3

Query: 498  MAMPSGNAVVPEKMQGRGGGGSELMQPQGRGXXXXXXXXXXXXXXXXXMDERDGFISWLR 677
            MAMPSGN V+ +KMQ  GGGG      +G G                  DERDGFISWLR
Sbjct: 1    MAMPSGNVVISDKMQFPGGGG------RGGGGGAAEIHHHRQWFP----DERDGFISWLR 50

Query: 678  GEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVNDVLYXXX 857
            GEFAAANA+ID+LC+HLR++GEPGEYD VIGCIQQRR NW+ VLHMQ YFSV +V+Y   
Sbjct: 51   GEFAAANAIIDSLCNHLRLIGEPGEYDAVIGCIQQRRYNWSSVLHMQQYFSVAEVIYALQ 110

Query: 858  XXXXXXXXXXXXXFEWGGKMGKEYXXXXXXXXXXXDFFKEGKEG-----GHHMN------ 1004
                          +  GK  K Y             +++G+ G      H+ N      
Sbjct: 111  QVGWRRQQRHLDPVKGAGKEYKRYGVA----------YRQGQRGETAKDSHNSNFENHSH 160

Query: 1005 --------------SKAVPNVNGNENLDAGDVKGSKGEAKVESGEERK---DIVEE---- 1121
                          S+   +V G    D GDV G   +  + + EE+K   D V +    
Sbjct: 161  DANSSGTLEKGERVSEIYDDVKGG---DKGDVVGKLEDKDLAAAEEKKAGTDAVAKPNAN 217

Query: 1122 SGGDGSVESQGSREAVSTIKPEHSSENTDDGHLYDSK-------ENDCHSERILHEKQSP 1280
            S    S  S+GSR  +S    E  + + DDG   + K       EN+ H  +  +EK +P
Sbjct: 218  SCSKSSENSEGSRCGIS----ETEANDMDDGGTLNPKGSCNMIMENNAHPVQNQNEKPNP 273

Query: 1281 IVTPKTFVGTEIYDGKSFNVVDGMKLYEELFDNSEVSKLITLVNDLRAAGRRGQLQ 1448
              +PKTFVGTEI+DGK+ NVVDG+KLYEELFD+SEVSK ++LVNDLRAAG+RGQLQ
Sbjct: 274  TTSPKTFVGTEIFDGKAVNVVDGLKLYEELFDDSEVSKFVSLVNDLRAAGKRGQLQ 329


>ref|XP_002281644.1| PREDICTED: uncharacterized protein LOC100252594 [Vitis vinifera]
          Length = 698

 Score =  241 bits (616), Expect = 4e-61
 Identities = 154/350 (44%), Positives = 195/350 (55%), Gaps = 33/350 (9%)
 Frame = +3

Query: 498  MAMPSGNAVVPEKMQGRGGGGSELMQPQGRGXXXXXXXXXXXXXXXXXMDERDGFISWLR 677
            MAMPSGN V+ +KMQ  GGGG      +G G                  DERDGFISWLR
Sbjct: 1    MAMPSGNVVISDKMQFPGGGG------RGGGGGAAEIHHHRQWFP----DERDGFISWLR 50

Query: 678  GEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVNDVLYXXX 857
            GEFAAANA+ID+LC+HLR++GEPGEYD VIGCIQQRR NW+ VLHMQ YFSV +V+Y   
Sbjct: 51   GEFAAANAIIDSLCNHLRLIGEPGEYDAVIGCIQQRRYNWSSVLHMQQYFSVAEVIYALQ 110

Query: 858  XXXXXXXXXXXXXFEWGGKMGKEYXXXXXXXXXXXDFFKEGKEG-----GHHMN------ 1004
                          +  GK  K Y             +++G+ G      H+ N      
Sbjct: 111  QVGWRRQQRHLDPVKGAGKEYKRYGVA----------YRQGQRGETAKDSHNSNFENHSH 160

Query: 1005 --------------SKAVPNVNGNENLDAGDVKGSKGEAKVESGEERK---DIVEE---- 1121
                          S+   +V G    D GDV G   +  + + EE+K   D V +    
Sbjct: 161  DANSSGTLEKGERVSEIYDDVKGG---DKGDVVGKLEDKDLAAAEEKKAGTDAVAKPNAN 217

Query: 1122 SGGDGSVESQGSREAVSTIKPEHSSENTDDGHLYDS-KENDCHSERILHEKQSPIVTPKT 1298
            S    S  S+GSR  +S    E  + + DDG   +   EN+ H  +  +EK +P  +PKT
Sbjct: 218  SCSKSSENSEGSRCGIS----ETEANDMDDGGSCNMIMENNAHPVQNQNEKPNPTTSPKT 273

Query: 1299 FVGTEIYDGKSFNVVDGMKLYEELFDNSEVSKLITLVNDLRAAGRRGQLQ 1448
            FVGTEI+DGK+ NVVDG+KLYEELFD+SEVSK ++LVNDLRAAG+RGQLQ
Sbjct: 274  FVGTEIFDGKAVNVVDGLKLYEELFDDSEVSKFVSLVNDLRAAGKRGQLQ 323


>ref|XP_006355042.1| PREDICTED: uncharacterized protein LOC102600383 [Solanum tuberosum]
          Length = 638

 Score =  237 bits (604), Expect = 1e-59
 Identities = 151/319 (47%), Positives = 187/319 (58%), Gaps = 5/319 (1%)
 Frame = +3

Query: 504  MPSGNAVV--PEKMQGRGGGGSELMQPQGRGXXXXXXXXXXXXXXXXXMDERDGFISWLR 677
            M SGNA V  PEKM G G GG  +     R                  +DERDGFISWLR
Sbjct: 1    MQSGNAAVAVPEKMNGNGVGGEAVAVALPR-----QHQHQQQWFHPQQVDERDGFISWLR 55

Query: 678  GEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVNDVLYXXX 857
            GEFAA+NA+IDALCHHLR+VGEPGEYDGVIGC+QQRR+NWN VLHMQ Y SV +V+Y   
Sbjct: 56   GEFAASNAIIDALCHHLRLVGEPGEYDGVIGCVQQRRANWNSVLHMQQYHSVAEVIY--- 112

Query: 858  XXXXXXXXXXXXXFEWG-GKMGKEYXXXXXXXXXXXDFFKEGKEG-GHHMNSKAVPNVNG 1031
                         F+ G  K+ K             +  K+GKE  G + +  A    NG
Sbjct: 113  SLHQVEWMKQQKGFDGGVKKVEKRNGSRGGGGGWKSEGLKDGKESQGQNFSLDAHSKTNG 172

Query: 1032 NENLDAGDVKGSKGEAK-VESGEERKDIVEESGGDGSVESQGSREAVSTIKPEHSSENTD 1208
             E +D  +VK  +GE K + +  E    V+ S    + +SQG  +     K +   ++  
Sbjct: 173  VEKIDVVEVK--QGEKKELAANPEANSSVKSSVCTEAGDSQGEVD-----KTDDKRDSNS 225

Query: 1209 DGHLYDSKENDCHSERILHEKQSPIVTPKTFVGTEIYDGKSFNVVDGMKLYEELFDNSEV 1388
            +G    + E++ HS ++  EKQ+  V PKTFV TEIYDGK  NVVDGMKLYEEL  +SEV
Sbjct: 226  EGS--SNVESESHSIQVPTEKQN--VVPKTFVATEIYDGKPVNVVDGMKLYEELLSSSEV 281

Query: 1389 SKLITLVNDLRAAGRRGQL 1445
            SKL+TLVNDLRAAGRRGQL
Sbjct: 282  SKLLTLVNDLRAAGRRGQL 300


>ref|XP_004236917.1| PREDICTED: uncharacterized protein LOC101261013 [Solanum
            lycopersicum]
          Length = 641

 Score =  236 bits (601), Expect = 2e-59
 Identities = 155/326 (47%), Positives = 186/326 (57%), Gaps = 12/326 (3%)
 Frame = +3

Query: 504  MPSGNAVV------PEKMQGRGGGGSELMQPQGRGXXXXXXXXXXXXXXXXXMDERDGFI 665
            M SGNA V      PEK    GGGG  +  P+                    +DERDGFI
Sbjct: 1    MQSGNAAVAVAVAVPEKKHSNGGGGEAVAVPRQH-------QHQQQWFHPQQVDERDGFI 53

Query: 666  SWLRGEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVNDVL 845
            SWLRGEFAA+NA+IDALCHHLR+VGEPGEYDGVIGC+QQRR+NWN VLHMQ Y SV +V+
Sbjct: 54   SWLRGEFAASNAIIDALCHHLRLVGEPGEYDGVIGCVQQRRANWNSVLHMQQYHSVAEVI 113

Query: 846  YXXXXXXXXXXXXXXXXFEWG-GKMGKEY-XXXXXXXXXXXDFFKEGKEG-GHHMNSKAV 1016
            Y                F+ G  K+GK              +  K+GKE  G + +  A 
Sbjct: 114  Y---SLHQVEWMKQQKGFDGGVNKVGKRNGSKGGGGGGWKSEGLKDGKESQGQNFSLDAH 170

Query: 1017 PNVNGNENLDAGDVK-GSKGE--AKVESGEERKDIVEESGGDGSVESQGSREAVSTIKPE 1187
               NG E +D  + K G K E  AK E+    K  V    GD    SQG  +     K +
Sbjct: 171  SKTNGVEKIDVVEEKQGDKKELAAKPEANSSVKGSVCTEAGD----SQGEVD-----KTD 221

Query: 1188 HSSENTDDGHLYDSKENDCHSERILHEKQSPIVTPKTFVGTEIYDGKSFNVVDGMKLYEE 1367
               ++  +G    + E++ HS +I  EKQ+  V PKTFV TEIYDGK  NVVDGMKLYEE
Sbjct: 222  DKRDSNSEGS--SNVESESHSFQIPTEKQN--VVPKTFVATEIYDGKPVNVVDGMKLYEE 277

Query: 1368 LFDNSEVSKLITLVNDLRAAGRRGQL 1445
            L  +SEVSKL+TLVNDLRAAGRRGQL
Sbjct: 278  LLSSSEVSKLVTLVNDLRAAGRRGQL 303


>gb|EXC05040.1| hypothetical protein L484_019288 [Morus notabilis]
          Length = 681

 Score =  229 bits (583), Expect = 3e-57
 Identities = 141/327 (43%), Positives = 176/327 (53%), Gaps = 10/327 (3%)
 Frame = +3

Query: 498  MAMPSGNAVVPEKMQGRGG--GGSELMQPQGRGXXXXXXXXXXXXXXXXXMDERDGFISW 671
            MAMPSGN V  +KMQ   G  G  E+     R                   DERDGFISW
Sbjct: 1    MAMPSGNVVSSDKMQFPSGTAGAGEISHHNNRQWFP---------------DERDGFISW 45

Query: 672  LRGEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVNDVLYX 851
            LRGEFAAANAMID+LCHHLR VGEPGEYD VI CIQ RR NWNPVLHMQ YFSV +V++ 
Sbjct: 46   LRGEFAAANAMIDSLCHHLRAVGEPGEYDAVIACIQLRRCNWNPVLHMQQYFSVAEVMFA 105

Query: 852  XXXXXXXXXXXXXXXFEWGGKMGKEYXXXXXXXXXXXDFFKEGKEG---GHHMNSKA--- 1013
                            + G K  K             D FK+G+      H ++  +   
Sbjct: 106  LQQVAWRRQQRFYDPVKMGNKEFKR-SGVGFKQWQRNDSFKDGRNSAAESHCLDGNSSFG 164

Query: 1014 -VPNVNGNENLDAGDVKGSKGEAKVESGEERKDIVEESGGDGSVESQGSRE-AVSTIKPE 1187
               +  G  +    +V  S     + + +E+ D   +S  DG+V+S G+ E  VS  +PE
Sbjct: 165  NAASEKGGSDKSGDEVGNSDDRGSMPAAKEKNDSAAKSQEDGNVKSLGNFEGVVSGSEPE 224

Query: 1188 HSSENTDDGHLYDSKENDCHSERILHEKQSPIVTPKTFVGTEIYDGKSFNVVDGMKLYEE 1367
              +   DDG    SKEND HS    +E  +    PKTF G E++DGK  NVV+G+KLYEE
Sbjct: 225  VHA--VDDGCTSSSKENDSHSTPKQNENSNLANVPKTFSGNEMFDGKPVNVVEGLKLYEE 282

Query: 1368 LFDNSEVSKLITLVNDLRAAGRRGQLQ 1448
               ++EVSKL+ LVNDLR+AG RG  Q
Sbjct: 283  FCADTEVSKLVALVNDLRSAGERGHFQ 309


>gb|EMJ26321.1| hypothetical protein PRUPE_ppa002630mg [Prunus persica]
          Length = 650

 Score =  228 bits (581), Expect = 5e-57
 Identities = 141/317 (44%), Positives = 174/317 (54%)
 Frame = +3

Query: 498  MAMPSGNAVVPEKMQGRGGGGSELMQPQGRGXXXXXXXXXXXXXXXXXMDERDGFISWLR 677
            M MPSGN V+ +KMQ   GGG   +   G G                  DERDGFISWLR
Sbjct: 1    MTMPSGNVVLSDKMQFPSGGGGGAV---GGGEIAQHHRQWFP-------DERDGFISWLR 50

Query: 678  GEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVNDVLYXXX 857
            GEFAAANA+ID+LCHHLR VGEPGEYD VIGCIQQRR NWNPVLHMQ YFSV +V+Y   
Sbjct: 51   GEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWNPVLHMQQYFSVAEVIYALQ 110

Query: 858  XXXXXXXXXXXXXFEWGGKMGKEYXXXXXXXXXXXDFFKEGKEGGHHMNSKAVPNVNGNE 1037
                          + G K  K             + FKE    GH+   ++    + N+
Sbjct: 111  HVAWRRQQRYYDPVKAGAKEFKRSGVGFNKGQQRAEAFKE----GHNSTLES----HSND 162

Query: 1038 NLDAGDVKGSKGEAKVESGEERKDIVEESGGDGSVESQGSREAVSTIKPEHSSENTDDGH 1217
               +G V   K E   E GEE    VE  G  G +  +G   A                 
Sbjct: 163  GNSSGVVAPEKFERGSEVGEE----VEPGGEVGKLNDKGLAPA----------------- 201

Query: 1218 LYDSKENDCHSERILHEKQSPIVTPKTFVGTEIYDGKSFNVVDGMKLYEELFDNSEVSKL 1397
              + K N+ HS +I ++KQ+  + PKTF+G EI DGK+ NVVDG+KLYE+   ++EVSKL
Sbjct: 202  -GEKKVNESHSIQIQNQKQNLSIVPKTFIGNEISDGKTVNVVDGLKLYEDFLGDTEVSKL 260

Query: 1398 ITLVNDLRAAGRRGQLQ 1448
            ++LVNDLRAAG+R QLQ
Sbjct: 261  VSLVNDLRAAGKRRQLQ 277


>ref|XP_006585073.1| PREDICTED: uncharacterized protein LOC100794176 [Glycine max]
          Length = 683

 Score =  228 bits (580), Expect = 6e-57
 Identities = 137/331 (41%), Positives = 178/331 (53%), Gaps = 14/331 (4%)
 Frame = +3

Query: 498  MAMPSGNAVVPEKMQ---GRGGGGSELMQPQGRGXXXXXXXXXXXXXXXXXMDERDGFIS 668
            MAMPSGN V+ +KMQ   G GGGG       G G                 +DERDG I 
Sbjct: 1    MAMPSGNVVIQDKMQFPSGAGGGG-------GGGGAGGEIHQPHHYRPQWFVDERDGLIG 53

Query: 669  WLRGEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVNDVLY 848
            WLR EFAAANA+ID+LCHHLRVVG+PGEYD V+G IQQRR NWN VL MQ YFSV DV Y
Sbjct: 54   WLRSEFAAANAIIDSLCHHLRVVGDPGEYDMVVGAIQQRRCNWNQVLMMQQYFSVADVAY 113

Query: 849  XXXXXXXXXXXXXXXXFEWGG----KMGKEYXXXXXXXXXXXDFFKEGKEGGHHMN---- 1004
                             + G     K G  Y            +    +   H  N    
Sbjct: 114  ALQQVAWRRQQRPLDPMKVGAKEVRKSGSGYRHGQRFESVKEGYNSSVESYSHDANVAVT 173

Query: 1005 ---SKAVPNVNGNENLDAGDVKGSKGEAKVESGEERKDIVEESGGDGSVESQGSREAVST 1175
                K  P V  +E   +G      G+  + S EE+KD +     +GS++S  S E   +
Sbjct: 174  GGTEKGTPVVEKSEEHKSGGKVEKVGDKGLASVEEKKDAITNHQSEGSLKSARSTE--GS 231

Query: 1176 IKPEHSSENTDDGHLYDSKENDCHSERILHEKQSPIVTPKTFVGTEIYDGKSFNVVDGMK 1355
            +    S    +DG + +SK ND HS +   + QS     KTF+G E++DGK+ NVVDG+K
Sbjct: 232  LSNLESEAVVNDGCISNSKGNDLHSVQNQSQSQSLSNIAKTFIGNEMFDGKTVNVVDGLK 291

Query: 1356 LYEELFDNSEVSKLITLVNDLRAAGRRGQLQ 1448
            LY++LFD++EV+ L++LVNDLR +G++GQLQ
Sbjct: 292  LYDDLFDSTEVANLVSLVNDLRVSGKKGQLQ 322


>ref|XP_003524142.1| PREDICTED: uncharacterized protein LOC100809865 isoform X1 [Glycine
            max]
          Length = 681

 Score =  223 bits (568), Expect = 2e-55
 Identities = 138/340 (40%), Positives = 185/340 (54%), Gaps = 23/340 (6%)
 Frame = +3

Query: 498  MAMPSGNAVVPEKMQ------GRGGGGSELMQPQGRGXXXXXXXXXXXXXXXXXMDERDG 659
            MAMPSGN V+ +KMQ      G GG G E+ QP                     +DERDG
Sbjct: 1    MAMPSGNVVIQDKMQFPSGGAGAGGAGGEIHQPH--------------YCQQWFVDERDG 46

Query: 660  FISWLRGEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVND 839
             I WLR EFAAANA+ID+LCHHLRVVG+PGEYD VIG IQQRR NWN VL MQ YFSV D
Sbjct: 47   LIGWLRSEFAAANAIIDSLCHHLRVVGDPGEYDMVIGAIQQRRCNWNQVLMMQQYFSVAD 106

Query: 840  VLYXXXXXXXXXXXXXXXXFEWGGKMGKEYXXXXXXXXXXXDFFKEGKEGGHHMNSKAVP 1019
            V +                 + G    KE+            F  E  + G++ + ++  
Sbjct: 107  VAHALQQVAWRRQQRPLDPVKVG---AKEFRKSGSGYRHGQRF--EPVKEGYNSSVESYN 161

Query: 1020 NVNGNENLDAGDVKGS---------KGEAKVE--------SGEERKDIVEESGGDGSVES 1148
              + N  +  G  KG+         K   KVE        S E++KD + +   DGS++S
Sbjct: 162  QYDANVTVTGGTEKGTPVVEKSEEHKSGGKVEKVGDKGLASAEDKKDAITKHQTDGSLKS 221

Query: 1149 QGSREAVSTIKPEHSSENTDDGHLYDSKENDCHSERILHEKQSPIVTPKTFVGTEIYDGK 1328
              S E   ++    S    +D  + +SK +D HS +  H+ QS     KTF+G E++DGK
Sbjct: 222  TRSTE--GSLSNLESEAVVNDECISNSKGDDSHSVQNQHQSQSLSTKAKTFIGNEMFDGK 279

Query: 1329 SFNVVDGMKLYEELFDNSEVSKLITLVNDLRAAGRRGQLQ 1448
              NVVDG+KLYE+LFD++E++ L++LVNDLR +G++GQLQ
Sbjct: 280  MVNVVDGLKLYEDLFDSTEIANLVSLVNDLRVSGKKGQLQ 319


>gb|EOY01300.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2
            [Theobroma cacao] gi|508709405|gb|EOY01302.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 2 [Theobroma cacao]
          Length = 680

 Score =  222 bits (565), Expect = 4e-55
 Identities = 146/334 (43%), Positives = 182/334 (54%), Gaps = 17/334 (5%)
 Frame = +3

Query: 498  MAMPSGNAVVPEKMQ-------GRGGGGSELMQPQGRGXXXXXXXXXXXXXXXXXMDERD 656
            MAMPSGN V+ +KMQ       G GGGG+      G G                  DERD
Sbjct: 1    MAMPSGNVVLSDKMQFPATAAAGAGGGGAVGAVGGGGGGGGEIHQHHHRQWLP---DERD 57

Query: 657  GFISWLRGEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVN 836
            GFI WLRGEFAA+NA+ID+LCHHLR VGE GEY+ VI CIQQRR NWNPVLHMQ YFSV 
Sbjct: 58   GFIYWLRGEFAASNAIIDSLCHHLREVGEVGEYEAVIACIQQRRCNWNPVLHMQQYFSVA 117

Query: 837  DVLYXXXXXXXXXXXXXXXXFEWGGKMGKEY-XXXXXXXXXXXDFFKEGKEGGHHMNSKA 1013
            +V Y                +E G   GKE+            +  KEG+  G       
Sbjct: 118  EVSY---ALQQVAWRRRQRHYESGKVGGKEFKRSGMGFKGQRMEVAKEGQNSG------- 167

Query: 1014 VPNVNGNENLDAGDVKGSKGEAKVESGEERKDIVEESGGDGSVESQGSR----EAVSTIK 1181
              + +GN  + A   +        E G E+++ V+  G  G VE + S     +  +  K
Sbjct: 168  -VDSDGNSTVTAVSERN-------ERGSEKREEVKSCGEVGKVEDKCSTFTEDKKDTGSK 219

Query: 1182 P-----EHSSENTDDGHLYDSKENDCHSERILHEKQSPIVTPKTFVGTEIYDGKSFNVVD 1346
            P     E  +E+ + G     KEND  S +  +EKQ+    PKTFVG E++DGK  NVVD
Sbjct: 220  PHAGDAESVTEDVNGGCTSSYKENDLCSIQNQNEKQNLAAGPKTFVGNEMFDGKMVNVVD 279

Query: 1347 GMKLYEELFDNSEVSKLITLVNDLRAAGRRGQLQ 1448
            G+KLYEELFD+ EV  L++LVNDLRAAG+RGQLQ
Sbjct: 280  GLKLYEELFDDKEVLDLVSLVNDLRAAGKRGQLQ 313


>gb|EOY01299.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1
            [Theobroma cacao] gi|508709404|gb|EOY01301.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 1 [Theobroma cacao]
          Length = 681

 Score =  222 bits (565), Expect = 4e-55
 Identities = 146/334 (43%), Positives = 182/334 (54%), Gaps = 17/334 (5%)
 Frame = +3

Query: 498  MAMPSGNAVVPEKMQ-------GRGGGGSELMQPQGRGXXXXXXXXXXXXXXXXXMDERD 656
            MAMPSGN V+ +KMQ       G GGGG+      G G                  DERD
Sbjct: 1    MAMPSGNVVLSDKMQFPATAAAGAGGGGAVGAVGGGGGGGGEIHQHHHRQWLP---DERD 57

Query: 657  GFISWLRGEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVN 836
            GFI WLRGEFAA+NA+ID+LCHHLR VGE GEY+ VI CIQQRR NWNPVLHMQ YFSV 
Sbjct: 58   GFIYWLRGEFAASNAIIDSLCHHLREVGEVGEYEAVIACIQQRRCNWNPVLHMQQYFSVA 117

Query: 837  DVLYXXXXXXXXXXXXXXXXFEWGGKMGKEY-XXXXXXXXXXXDFFKEGKEGGHHMNSKA 1013
            +V Y                +E G   GKE+            +  KEG+  G       
Sbjct: 118  EVSY---ALQQVAWRRRQRHYESGKVGGKEFKRSGMGFKGQRMEVAKEGQNSG------- 167

Query: 1014 VPNVNGNENLDAGDVKGSKGEAKVESGEERKDIVEESGGDGSVESQGSR----EAVSTIK 1181
              + +GN  + A   +        E G E+++ V+  G  G VE + S     +  +  K
Sbjct: 168  -VDSDGNSTVTAVSERN-------ERGSEKREEVKSCGEVGKVEDKCSTFTEDKKDTGSK 219

Query: 1182 P-----EHSSENTDDGHLYDSKENDCHSERILHEKQSPIVTPKTFVGTEIYDGKSFNVVD 1346
            P     E  +E+ + G     KEND  S +  +EKQ+    PKTFVG E++DGK  NVVD
Sbjct: 220  PHAGDAESVTEDVNGGCTSSYKENDLCSIQNQNEKQNLAAGPKTFVGNEMFDGKMVNVVD 279

Query: 1347 GMKLYEELFDNSEVSKLITLVNDLRAAGRRGQLQ 1448
            G+KLYEELFD+ EV  L++LVNDLRAAG+RGQLQ
Sbjct: 280  GLKLYEELFDDKEVLDLVSLVNDLRAAGKRGQLQ 313


>gb|ESW30779.1| hypothetical protein PHAVU_002G181800g [Phaseolus vulgaris]
          Length = 671

 Score =  221 bits (564), Expect = 5e-55
 Identities = 138/333 (41%), Positives = 179/333 (53%), Gaps = 16/333 (4%)
 Frame = +3

Query: 498  MAMPSGNAVVPEKMQGRGGGGS----ELMQPQGRGXXXXXXXXXXXXXXXXXMDERDGFI 665
            MAMPSGN V+ +KMQ   GGG     E+ Q   R                  +DERDG I
Sbjct: 1    MAMPSGNVVIQDKMQFPNGGGGAGVGEIQQHHYR--------------QQWFVDERDGLI 46

Query: 666  SWLRGEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVNDVL 845
             WLR EFAAANA+ID+LCHHLRVVG+PGEYD VIG IQQRR NWN VL MQ YFSV DV 
Sbjct: 47   GWLRSEFAAANAIIDSLCHHLRVVGDPGEYDMVIGAIQQRRCNWNQVLLMQQYFSVADVT 106

Query: 846  YXXXXXXXXXXXXXXXXFEWGG----KMGKEYXXXXXXXXXXXDFFKEGKEGGHHMNS-- 1007
            Y                 + G     K G  Y            +    +   H  N+  
Sbjct: 107  YTLQQVAWRKQQRPLDPVKVGAKEVRKPGPGYRYGHRFEPSKEGYNSSVESYSHDGNATF 166

Query: 1008 -----KAVPNVNGNENLDAGDVKGSKGEAKVESGEERKDIVEESGGDGSVESQGSREA-V 1169
                 K  P V+ +E   +G      G+  + S EE+KD + +   DG+++S GS E  +
Sbjct: 167  TRGMEKGTPTVDKSEEHKSGSKVEKVGDKGLASPEEKKDAIIKHQTDGNLKSTGSSEGYL 226

Query: 1170 STIKPEHSSENTDDGHLYDSKENDCHSERILHEKQSPIVTPKTFVGTEIYDGKSFNVVDG 1349
            S ++ E    N  D  + +SK ND  S    H+ QS     KTF+G E+ DGK  N+ DG
Sbjct: 227  SNLESEAVVVN--DEFISNSKGNDSDSVESQHQSQSFSTIAKTFIGNEMIDGKMVNLADG 284

Query: 1350 MKLYEELFDNSEVSKLITLVNDLRAAGRRGQLQ 1448
            +KLYE++FD++EVS L++LVNDLR +G++GQLQ
Sbjct: 285  LKLYEDIFDSTEVSNLVSLVNDLRISGKKGQLQ 317


>emb|CAN65462.1| hypothetical protein VITISV_002198 [Vitis vinifera]
          Length = 1145

 Score =  221 bits (562), Expect = 8e-55
 Identities = 147/364 (40%), Positives = 186/364 (51%), Gaps = 49/364 (13%)
 Frame = +3

Query: 504  MPSGNAVVPEKMQGRGGGGSELMQPQGRGXXXXXXXXXXXXXXXXXMDERDGFISWLRGE 683
            MPSGN V+ +KMQ  GGGG       G G                  DERDGFISWLRGE
Sbjct: 1    MPSGNVVISDKMQFPGGGGG------GGGGGAAEIHHHRQWFP----DERDGFISWLRGE 50

Query: 684  FAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVNDVLYXXXXX 863
            FAAANA+ID+LC+HLR++GEPGEYD VIGCIQQRR NW+ VLHMQ YFSV +V+Y     
Sbjct: 51   FAAANAIIDSLCNHLRLIGEPGEYDAVIGCIQQRRYNWSSVLHMQQYFSVAEVIYALQQV 110

Query: 864  XXXXXXXXXXXFEWGGKMGKEYXXXXXXXXXXXDFFKEGKEG-----GHHMN-------- 1004
                        +  GK  K Y             +++G+ G      H+ N        
Sbjct: 111  GWRRQQRHLDPVKGAGKEYKRYGVA----------YRQGQRGETAKDSHNSNFENHSHDA 160

Query: 1005 ------------SKAVPNVNGNENLDAGDVKGSKGEAKVESGEERKDI------------ 1112
                        S+   +V G    D GDV G   +  + +  E+K++            
Sbjct: 161  NSSGTLEKGERVSEIYDDVKGG---DKGDVVGKLEDKDLSAAAEKKEVMNFVIFGQLEQM 217

Query: 1113 ------------VEESGGDGSVESQGSREAVSTIKPEHSSENTDDGHLYDSKENDCHSER 1256
                        V+++  D  V  Q  R    T   E  S N          EN+ H  +
Sbjct: 218  LLQNPMQIAVRRVQKTQKDPDVAFQRLRP--MTWMMEARSCNM-------IMENNAHPVQ 268

Query: 1257 ILHEKQSPIVTPKTFVGTEIYDGKSFNVVDGMKLYEELFDNSEVSKLITLVNDLRAAGRR 1436
              +EK +P  +PKTFVGTEI+DGK+ NVVDG+KLYEELFD+SEVSK ++LVNDLRAAG+R
Sbjct: 269  NQNEKPNPTTSPKTFVGTEIFDGKAVNVVDGLKLYEELFDDSEVSKFVSLVNDLRAAGKR 328

Query: 1437 GQLQ 1448
            GQLQ
Sbjct: 329  GQLQ 332


>ref|XP_002311547.2| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|550333015|gb|EEE88914.2| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 675

 Score =  218 bits (556), Expect = 4e-54
 Identities = 138/337 (40%), Positives = 173/337 (51%), Gaps = 20/337 (5%)
 Frame = +3

Query: 498  MAMPSGNAVVPEKMQ------GRGGGGSELMQPQGRGXXXXXXXXXXXXXXXXXMDERDG 659
            MAMP GN V+P+K+Q      G GGGG+E+ Q Q                    +DERDG
Sbjct: 1    MAMPPGNVVIPDKVQFPAGAAGGGGGGNEIHQHQ------------LQRHQWFPVDERDG 48

Query: 660  FISWLRGEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVND 839
            FISWLRGEFAAANA+ID+LCHHLR VGE GEYD V+GCIQQRRSNWN VLHMQ YFSV +
Sbjct: 49   FISWLRGEFAAANAIIDSLCHHLRAVGEAGEYDLVVGCIQQRRSNWNHVLHMQQYFSVGE 108

Query: 840  VLYXXXXXXXXXXXXXXXXFEWGGKMGKEYXXXXXXXXXXXDFFKEGKEGGHHMNSKAVP 1019
            V+                      +  ++             ++  GK GG      +  
Sbjct: 109  VIVALQQVVLR-------------RQQQQQQQQQNHHHQQRFYYDHGKVGGRDFKRSSSA 155

Query: 1020 NVNGNENLDAGDVKGSKGEAKVESGEE------------RKDIVEE--SGGDGSVESQGS 1157
              N       G   G   +  V S  E            R +  EE  SGGDG       
Sbjct: 156  GFNRGHRGGGGGGGGDAVKEGVNSSVENHSFNGNSSENIRSEKFEEVKSGGDGG--KSDD 213

Query: 1158 REAVSTIKPEHSSENTDDGHLYDSKENDCHSERILHEKQSPIVTPKTFVGTEIYDGKSFN 1337
            ++A +T K    +     G+   +     +SE + +EKQ+  +TPKTFV  E  DG+  N
Sbjct: 214  KKADATAKSHTDNHKNSSGNAQGTFSG--NSEAVANEKQNLAITPKTFVAEEKIDGQMVN 271

Query: 1338 VVDGMKLYEELFDNSEVSKLITLVNDLRAAGRRGQLQ 1448
            VVDG+KLYE L D  EVSKL++LVN+LRA GRRGQ Q
Sbjct: 272  VVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQ 308


>ref|XP_006379789.1| hypothetical protein POPTR_0008s13830g [Populus trichocarpa]
            gi|550333016|gb|ERP57586.1| hypothetical protein
            POPTR_0008s13830g [Populus trichocarpa]
          Length = 693

 Score =  217 bits (553), Expect = 9e-54
 Identities = 139/344 (40%), Positives = 173/344 (50%), Gaps = 27/344 (7%)
 Frame = +3

Query: 498  MAMPSGNAVVPEKMQ------GRGGGGSELMQPQGRGXXXXXXXXXXXXXXXXXMDERDG 659
            MAMP GN V+P+K+Q      G GGGG+E+ Q Q                    +DERDG
Sbjct: 1    MAMPPGNVVIPDKVQFPAGAAGGGGGGNEIHQHQ------------LQRHQWFPVDERDG 48

Query: 660  FISWLRGEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVND 839
            FISWLRGEFAAANA+ID+LCHHLR VGE GEYD V+GCIQQRRSNWN VLHMQ YFSV +
Sbjct: 49   FISWLRGEFAAANAIIDSLCHHLRAVGEAGEYDLVVGCIQQRRSNWNHVLHMQQYFSVGE 108

Query: 840  VL---------------------YXXXXXXXXXXXXXXXXFEWGGKMGKEYXXXXXXXXX 956
            V+                     +                F+     G            
Sbjct: 109  VIVALQQVVLRRQQQQQQQQQNHHHQQRFYYDHGKVGGRDFKRSSSAGFNRGHRGGGGGG 168

Query: 957  XXDFFKEGKEGGHHMNSKAVPNVNGNENLDAGDVKGSKGEAKVESGEERKDIVEESGGDG 1136
              D  KEG       +S    N N +EN+ +   +  K        +++KD   +S  D 
Sbjct: 169  GGDAVKEGVNSSVENHSF---NGNSSENIRSEKFEEVKSGGDGGKSDDKKDATAKSHTDN 225

Query: 1137 SVESQGSREAVSTIKPEHSSENTDDGHLYDSKENDCHSERILHEKQSPIVTPKTFVGTEI 1316
               S G+  A  T      +   DD      +E+D H     +EKQ+  +TPKTFV  E 
Sbjct: 226  HKNSSGN--AQGTFSGNSEAVAVDDRS--SPEESDSHPSNNQNEKQNLAITPKTFVAEEK 281

Query: 1317 YDGKSFNVVDGMKLYEELFDNSEVSKLITLVNDLRAAGRRGQLQ 1448
             DG+  NVVDG+KLYE L D  EVSKL++LVN+LRA GRRGQ Q
Sbjct: 282  IDGQMVNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQ 325


>ref|XP_002527549.1| conserved hypothetical protein [Ricinus communis]
            gi|223533099|gb|EEF34858.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 697

 Score =  217 bits (552), Expect = 1e-53
 Identities = 138/334 (41%), Positives = 176/334 (52%), Gaps = 17/334 (5%)
 Frame = +3

Query: 498  MAMPSGNAVVPEKMQGRGGGGSELMQPQGRGXXXXXXXXXXXXXXXXXMDERDGFISWLR 677
            MAMP GN V+ +K+Q   GGG       G                   +DERDGFISWLR
Sbjct: 1    MAMPPGNVVISDKIQFPAGGGGVGGGNNGGIGNEIQQQQHHHRHQWFPVDERDGFISWLR 60

Query: 678  GEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVNDVLYXXX 857
            GEFAAANA+ID+LCHHLR  GEPGEYD VIGCIQQRR NWNPVLHMQ YFSV +V+    
Sbjct: 61   GEFAAANAIIDSLCHHLRAAGEPGEYDVVIGCIQQRRCNWNPVLHMQQYFSVGEVILALQ 120

Query: 858  XXXXXXXXXXXXXFEWGGKMGKEYXXXXXXXXXXXDFFKEGKEG---GHHMNSKAVPNVN 1028
                          +      +             DF +    G   GH    + V  VN
Sbjct: 121  QVALRKQQQHQHQHQ----HQQHRYYYDQPKVGGKDFKRNSSMGFNKGHRGGGEVVKEVN 176

Query: 1029 ---GNENLDAGDVKGSKGEAKVESGEERKDIVEESGGDG-SVESQGSREAVSTIKPEHSS 1196
                +  LD G+  G++   +++SG +   +  +S       +   S+  V  +K   +S
Sbjct: 177  YGAESHGLD-GNTSGNEKFNEIKSGGDSGRLENKSLATAEDKKDAASKPHVDNLKSSGNS 235

Query: 1197 ENTDDGHL----------YDSKENDCHSERILHEKQSPIVTPKTFVGTEIYDGKSFNVVD 1346
            E +  G+L             KE+D H  +    K +   TPKTFVG E+ DGKS NVVD
Sbjct: 236  EGSLSGNLETEAEAVHEQSSPKEHDSHFIQNQIVKLNLTTTPKTFVGAEMVDGKSVNVVD 295

Query: 1347 GMKLYEELFDNSEVSKLITLVNDLRAAGRRGQLQ 1448
            G+KLYE+L D+ EVSKL++LVNDLRAAGR+GQ Q
Sbjct: 296  GLKLYEQLLDDVEVSKLVSLVNDLRAAGRKGQFQ 329


>ref|XP_006605475.1| PREDICTED: uncharacterized protein LOC100814525 [Glycine max]
          Length = 626

 Score =  216 bits (551), Expect = 1e-53
 Identities = 134/331 (40%), Positives = 174/331 (52%), Gaps = 14/331 (4%)
 Frame = +3

Query: 498  MAMPSGNAVVPEKMQ-GRGGGGSELMQPQGRGXXXXXXXXXXXXXXXXXMDERDGFISWL 674
            MAMPSGNAV+PEK+Q   GGGGSE+   Q                    +DERDGFI WL
Sbjct: 1    MAMPSGNAVMPEKLQFPGGGGGSEIHYRQ-----------------QWFVDERDGFIGWL 43

Query: 675  RGEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVNDVLYXX 854
            R EFAAANA+ID+LCHHLR VGEPGEYD V+G IQQRR NW  VL MQ YFSV++V+   
Sbjct: 44   RSEFAAANAIIDSLCHHLRCVGEPGEYDMVVGAIQQRRCNWTQVLLMQQYFSVSEVVCAL 103

Query: 855  XXXXXXXXXXXXXXFEWGGKMGKEYXXXXXXXXXXXDFFKEGKEGG-----HHMNS---- 1007
                           + G K  +++           +  K+G         H  N+    
Sbjct: 104  QQVSWRRQQRVVDLAKTGAKEFRKFGSGIRQGQHRLEAAKDGYNSSVESFCHGTNAVVVA 163

Query: 1008 ----KAVPNVNGNENLDAGDVKGSKGEAKVESGEERKDIVEESGGDGSVESQGSREAVST 1175
                K  P    N  + +G   G+     + S EERKD +     DG ++  G+ +  S 
Sbjct: 164  GGVEKGTPLTEKNGEIKSGGKVGTMDNKSLASPEERKDTITNHQSDGILKGSGNSQG-SL 222

Query: 1176 IKPEHSSENTDDGHLYDSKENDCHSERILHEKQSPIVTPKTFVGTEIYDGKSFNVVDGMK 1355
               E  +   ++  + +SKEND                 KTF+G E++DGK  NVVDG+K
Sbjct: 223  STSECEAVGVNEECVSNSKENDS-------------TMGKTFIGNEMFDGKMVNVVDGLK 269

Query: 1356 LYEELFDNSEVSKLITLVNDLRAAGRRGQLQ 1448
            LYE+L D +EVSKL++LVNDLR AG+RGQ Q
Sbjct: 270  LYEDLLDRTEVSKLVSLVNDLRVAGKRGQFQ 300


>gb|ESW25182.1| hypothetical protein PHAVU_003G014200g [Phaseolus vulgaris]
          Length = 691

 Score =  216 bits (551), Expect = 1e-53
 Identities = 137/343 (39%), Positives = 184/343 (53%), Gaps = 26/343 (7%)
 Frame = +3

Query: 498  MAMPSGNAVVPEKMQGRGGGGSELMQPQGRGXXXXXXXXXXXXXXXXXMDERDGFISWLR 677
            MAMPSGN  +PEK+Q   GGG+      G G                 +DERDGFI WLR
Sbjct: 1    MAMPSGNGGMPEKLQFPVGGGAA----SGGGEIQYRHQQWF-------VDERDGFIGWLR 49

Query: 678  GEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVNDVLYXXX 857
             EFAAANA+ID+LC HLRVVGEPG YD V+G IQQRR NW  VL MQ YFSV++V+Y   
Sbjct: 50   SEFAAANAIIDSLCQHLRVVGEPGVYDMVVGAIQQRRCNWTQVLLMQQYFSVSEVVYALQ 109

Query: 858  XXXXXXXXXXXXXFEWGGKMGKEYXXXXXXXXXXXDFFKEG---------KEG------- 989
                          + G K  +++           +  KEG         KEG       
Sbjct: 110  QVAWRRQQRFVDPAKAGSKEFRKFGSGFRQGQHRNEASKEGYNNSRNEAAKEGYNSKVES 169

Query: 990  -GHHMNSKAVPN--------VNGNENLDAGDVKGSKGEAKVESGEERKDIVEESGGDGSV 1142
             G  MN+  V          ++ N  L++G   G+     + S EE KD +     DG +
Sbjct: 170  FGREMNAVVVTGGVEKGTRVIDKNGELNSGGKVGTMDNNSIASPEESKDTITNDQLDGIL 229

Query: 1143 ESQGS-REAVSTIKPEHSSENTDDGHLYDSKENDCHSERILHEKQSPIVTPKTFVGTEIY 1319
               G+ + ++S+ + E   EN +     +SK ND HS +  H+ Q+     KTF+G E++
Sbjct: 230  NGSGNFQGSLSSSECEAVGENEE--CTSNSKGNDSHSVQNQHQSQNASTIGKTFIGNEMF 287

Query: 1320 DGKSFNVVDGMKLYEELFDNSEVSKLITLVNDLRAAGRRGQLQ 1448
            +GK  NVVDG+KLYE+L D++EVSKL++LVND+R AG+RGQ Q
Sbjct: 288  EGKMVNVVDGLKLYEDLIDSAEVSKLVSLVNDMRVAGKRGQFQ 330


>gb|ABK95394.1| unknown [Populus trichocarpa]
          Length = 694

 Score =  216 bits (550), Expect = 2e-53
 Identities = 140/348 (40%), Positives = 177/348 (50%), Gaps = 31/348 (8%)
 Frame = +3

Query: 498  MAMPSGNAVVPEKMQ------GRGGGGSELMQPQGRGXXXXXXXXXXXXXXXXXMDERDG 659
            MAMP GN V+P+K+Q      G GGGG+E+ Q Q                    +DERDG
Sbjct: 1    MAMPPGNVVIPDKVQFPAGAAGGGGGGNEIHQHQ------------LQRHQWFPVDERDG 48

Query: 660  FISWLRGEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVND 839
            FISWLRGEFAAANA+ID+LCHHLR VGE GEYD V+GCIQQRRSNWN VLHMQ YFSV +
Sbjct: 49   FISWLRGEFAAANAIIDSLCHHLRAVGEAGEYDLVVGCIQQRRSNWNHVLHMQQYFSVGE 108

Query: 840  VL--------------YXXXXXXXXXXXXXXXXFEWGGKMGKEYXXXXXXXXXXXDFFKE 977
            V+                               ++ G   G+++             F  
Sbjct: 109  VIVALQQVVLRRQQQQQQQQQQQQNHHHQQRFYYDHGKVGGRDFKRSSSAG------FNR 162

Query: 978  GKEGG--------HHMNSKAVP---NVNGNENLDAGDVKGSKGEAKVESGEERKDIVEES 1124
            G  GG          +NS       N N +EN+ +   +  K        +++KD   +S
Sbjct: 163  GHRGGGGGGDAVKEGVNSSVENHSFNGNSSENIRSEKFEEVKSGGDGGKSDDKKDATAKS 222

Query: 1125 GGDGSVESQGSREAVSTIKPEHSSENTDDGHLYDSKENDCHSERILHEKQSPIVTPKTFV 1304
              D    S G+  A  T      +   DD      +E+D H     +EKQ+  +TPKTFV
Sbjct: 223  HTDNHKNSSGN--AQGTFSGNSEAVAVDDRS--SPEESDSHPSNNQNEKQNLAITPKTFV 278

Query: 1305 GTEIYDGKSFNVVDGMKLYEELFDNSEVSKLITLVNDLRAAGRRGQLQ 1448
              E  DG+  NVVDG+KLYE L D  EVSKL++LVN+LRA GRRGQ Q
Sbjct: 279  AEEKIDGQMVNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQ 326


>ref|XP_006583757.1| PREDICTED: uncharacterized protein LOC100781773 [Glycine max]
          Length = 664

 Score =  214 bits (545), Expect = 7e-53
 Identities = 133/331 (40%), Positives = 179/331 (54%), Gaps = 14/331 (4%)
 Frame = +3

Query: 498  MAMPSGNAVVPEKMQGRGGGGSELMQPQGRGXXXXXXXXXXXXXXXXXMDERDGFISWLR 677
            MAMPSGNAV+PEK+Q  GGGG+    P G                   +DERDGFI WLR
Sbjct: 1    MAMPSGNAVMPEKLQFPGGGGA----PGGGSEIHFRQQWF--------VDERDGFIGWLR 48

Query: 678  GEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVNDVLYXXX 857
             EFAAANA+ID+LCHHLR VGEPGEY+ V+G IQQRR NW  VL MQ YFSV++V+Y   
Sbjct: 49   SEFAAANAIIDSLCHHLRDVGEPGEYNMVVGAIQQRRCNWTQVLLMQQYFSVSEVVYALQ 108

Query: 858  XXXXXXXXXXXXXFEWGGKMGKEYXXXXXXXXXXXDFFKEG-----KEGGHHMNSKAVPN 1022
                          + G K  +++           +  K+G     +  GH  N+  V  
Sbjct: 109  QVSWRRQQRVVDPAKTGAKEFRKFGLGFKQGQHRFEAVKDGYNSSVESFGHGTNAVVVAG 168

Query: 1023 --------VNGNENLDAGDVKGSKGEAKVESGEERKDIVEESGGDGSVESQGSREAVSTI 1178
                       N  + +G + G+     + S EERKD +     DG +  +GSR +  ++
Sbjct: 169  GVEKGACVTEKNGEIKSGGMVGTMDNKNLGSPEERKDAITNHQSDGIL--KGSRNSQGSL 226

Query: 1179 -KPEHSSENTDDGHLYDSKENDCHSERILHEKQSPIVTPKTFVGTEIYDGKSFNVVDGMK 1355
               E  +   ++  + +SKEND              +  K F+G E++DGK  NVVDG+K
Sbjct: 227  SSSECEAVGVNEECVSNSKENDS-------------IMGKFFIGNEMFDGKMVNVVDGLK 273

Query: 1356 LYEELFDNSEVSKLITLVNDLRAAGRRGQLQ 1448
            LYE+L D++EVSKL++LVNDLR AG+RGQ Q
Sbjct: 274  LYEDLLDSTEVSKLVSLVNDLRVAGKRGQFQ 304


>ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309147 [Fragaria vesca
            subsp. vesca]
          Length = 682

 Score =  214 bits (545), Expect = 7e-53
 Identities = 138/338 (40%), Positives = 178/338 (52%), Gaps = 21/338 (6%)
 Frame = +3

Query: 498  MAMPSGNAVVPEKMQ-----GRGGGGSELMQPQGRGXXXXXXXXXXXXXXXXXMDERDGF 662
            M MPSGN V+ +KMQ     G    G E+ Q   +                   DERDGF
Sbjct: 1    MTMPSGNVVLSDKMQYPSVAGAAVSGGEIHQQPRQWFP----------------DERDGF 44

Query: 663  ISWLRGEFAAANAMIDALCHHLRVVGEPGEYDGVIGCIQQRRSNWNPVLHMQHYFSVNDV 842
            ISWLRGEFAAANA+ID+LCHHLR VGEP EYD VIGC+QQRR NW PVLHMQ YFSV +V
Sbjct: 45   ISWLRGEFAAANAIIDSLCHHLRAVGEPSEYDMVIGCVQQRRCNWTPVLHMQQYFSVAEV 104

Query: 843  LYXXXXXXXXXXXXXXXXFEWGGKMGKEYXXXXXXXXXXXDFFKEGKEGGHHMNSKAVPN 1022
            +Y                 + G K  K               FK   E     ++ +V  
Sbjct: 105  IYALQQVAWRRQQRYYEPVKMGNKDYKRSNSGVG--------FKPRNEPVKEWHTASVE- 155

Query: 1023 VNGNENLDAGDVK--GSKGEAKVESGEERKDIVEESGGDGSV------------ESQGSR 1160
                 + D   ++  GS+   +V+ G E   + ++    G+V             S+ S 
Sbjct: 156  ---YRSYDGSGLEKVGSEMREEVKPGGEAGKVDDKGSAAGAVTKGVLTKPHEYISSRSSA 212

Query: 1161 EAVSTIKPEHSSEN--TDDGHLYDSKENDCHSERILHEKQSPIVTPKTFVGTEIYDGKSF 1334
             +  TI     SE+   ++G     KEN+ +S +I +EKQ+  + PKTFVG E +DGK+ 
Sbjct: 213  NSQGTISGNSESEDAVVNEGCTSSIKENESNSIQIQNEKQNLSLIPKTFVGNETFDGKTV 272

Query: 1335 NVVDGMKLYEELFDNSEVSKLITLVNDLRAAGRRGQLQ 1448
            NVVDG+KLYEE   ++EVSKL +LVNDLR  GRRGQLQ
Sbjct: 273  NVVDGLKLYEEFLGDTEVSKLFSLVNDLRTTGRRGQLQ 310


Top