BLASTX nr result

ID: Atropa21_contig00022001 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00022001
         (717 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006355042.1| PREDICTED: uncharacterized protein LOC102600...   342   9e-92
ref|XP_004236917.1| PREDICTED: uncharacterized protein LOC101261...   333   3e-89
gb|EMJ26321.1| hypothetical protein PRUPE_ppa002630mg [Prunus pe...   203   5e-50
emb|CBI26785.3| unnamed protein product [Vitis vinifera]              200   3e-49
ref|XP_002281644.1| PREDICTED: uncharacterized protein LOC100252...   200   4e-49
ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309...   192   1e-46
ref|XP_006448091.1| hypothetical protein CICLE_v10014588mg [Citr...   188   2e-45
ref|XP_006469304.1| PREDICTED: uncharacterized protein LOC102618...   187   2e-45
gb|EXC05040.1| hypothetical protein L484_019288 [Morus notabilis]     186   7e-45
gb|EOY01300.1| Hydroxyproline-rich glycoprotein family protein, ...   184   3e-44
emb|CAN65462.1| hypothetical protein VITISV_002198 [Vitis vinifera]   182   1e-43
gb|EOY01299.1| Hydroxyproline-rich glycoprotein family protein, ...   181   3e-43
gb|EOY01303.1| Hydroxyproline-rich glycoprotein family protein, ...   174   2e-41
ref|XP_006379789.1| hypothetical protein POPTR_0008s13830g [Popu...   170   5e-40
ref|XP_002527549.1| conserved hypothetical protein [Ricinus comm...   170   5e-40
gb|ABK95394.1| unknown [Populus trichocarpa]                          169   8e-40
ref|XP_006583757.1| PREDICTED: uncharacterized protein LOC100781...   166   7e-39
ref|XP_004513242.1| PREDICTED: uncharacterized protein LOC101506...   166   7e-39
ref|XP_004513244.1| PREDICTED: uncharacterized protein LOC101507...   165   1e-38
ref|XP_002311547.2| hydroxyproline-rich glycoprotein [Populus tr...   165   2e-38

>ref|XP_006355042.1| PREDICTED: uncharacterized protein LOC102600383 [Solanum tuberosum]
          Length = 638

 Score =  342 bits (876), Expect = 9e-92
 Identities = 185/253 (73%), Positives = 195/253 (77%), Gaps = 15/253 (5%)
 Frame = +3

Query: 3   VLHMQQYFSVAEVIYSLHQVEWRKQQKGFDGG--------------GKMXXXXXXXXXXX 140
           VLHMQQY SVAEVIYSLHQVEW KQQKGFDGG              G             
Sbjct: 98  VLHMQQYHSVAEVIYSLHQVEWMKQQKGFDGGVKKVEKRNGSRGGGGGWKSEGLKDGKES 157

Query: 141 XXXXXXXXXXLKVNGVVKIDVVNKEAAVKQGETKELVGNPEENNSMKSSVI-EVGDSQNE 317
                      K NGV KIDVV     VKQGE KEL  NPE N+S+KSSV  E GDSQ E
Sbjct: 158 QGQNFSLDAHSKTNGVEKIDVVE----VKQGEKKELAANPEANSSVKSSVCTEAGDSQGE 213

Query: 318 VDKTDDKRDSNSDGSSTVENESHSVQVPTEKQNVVPKTFVATEIYDGKPVNVVDGMKLYE 497
           VDKTDDKRDSNS+GSS VE+ESHS+QVPTEKQNVVPKTFVATEIYDGKPVNVVDGMKLYE
Sbjct: 214 VDKTDDKRDSNSEGSSNVESESHSIQVPTEKQNVVPKTFVATEIYDGKPVNVVDGMKLYE 273

Query: 498 ELLSNSEVSKLVTLVNDLRVAGRRGQLPAQTFIVSKRPMKGHGREMIQLGLPIVDAPPED 677
           ELLS+SEVSKL+TLVNDLR AGRRGQLPAQ FIVSKRPMKGHGREM+QLGLPIVDAPPE+
Sbjct: 274 ELLSSSEVSKLLTLVNDLRAAGRRGQLPAQAFIVSKRPMKGHGREMVQLGLPIVDAPPEE 333

Query: 678 EAAIATYKDRKTE 716
           EAAI+TYKDRKTE
Sbjct: 334 EAAISTYKDRKTE 346


>ref|XP_004236917.1| PREDICTED: uncharacterized protein LOC101261013 [Solanum
           lycopersicum]
          Length = 641

 Score =  333 bits (854), Expect = 3e-89
 Identities = 180/254 (70%), Positives = 192/254 (75%), Gaps = 16/254 (6%)
 Frame = +3

Query: 3   VLHMQQYFSVAEVIYSLHQVEWRKQQKGFDGG---------------GKMXXXXXXXXXX 137
           VLHMQQY SVAEVIYSLHQVEW KQQKGFDGG               G            
Sbjct: 100 VLHMQQYHSVAEVIYSLHQVEWMKQQKGFDGGVNKVGKRNGSKGGGGGGWKSEGLKDGKE 159

Query: 138 XXXXXXXXXXXLKVNGVVKIDVVNKEAAVKQGETKELVGNPEENNSMKSSVI-EVGDSQN 314
                       K NGV KIDVV +    KQG+ KEL   PE N+S+K SV  E GDSQ 
Sbjct: 160 SQGQNFSLDAHSKTNGVEKIDVVEE----KQGDKKELAAKPEANSSVKGSVCTEAGDSQG 215

Query: 315 EVDKTDDKRDSNSDGSSTVENESHSVQVPTEKQNVVPKTFVATEIYDGKPVNVVDGMKLY 494
           EVDKTDDKRDSNS+GSS VE+ESHS Q+PTEKQNVVPKTFVATEIYDGKPVNVVDGMKLY
Sbjct: 216 EVDKTDDKRDSNSEGSSNVESESHSFQIPTEKQNVVPKTFVATEIYDGKPVNVVDGMKLY 275

Query: 495 EELLSNSEVSKLVTLVNDLRVAGRRGQLPAQTFIVSKRPMKGHGREMIQLGLPIVDAPPE 674
           EELLS+SEVSKLVTLVNDLR AGRRGQLPAQ FIVSKRPMKGHGREM+QLGLPIVDAPPE
Sbjct: 276 EELLSSSEVSKLVTLVNDLRAAGRRGQLPAQAFIVSKRPMKGHGREMVQLGLPIVDAPPE 335

Query: 675 DEAAIATYKDRKTE 716
           +E+AI+TYKDRKTE
Sbjct: 336 EESAISTYKDRKTE 349


>gb|EMJ26321.1| hypothetical protein PRUPE_ppa002630mg [Prunus persica]
          Length = 650

 Score =  203 bits (516), Expect = 5e-50
 Identities = 124/253 (49%), Positives = 152/253 (60%), Gaps = 15/253 (5%)
 Frame = +3

Query: 3   VLHMQQYFSVAEVIYSLHQVEWRKQQKGFD---GGGKMXXXXXXXXXXXXXXXXXXXXXL 173
           VLHMQQYFSVAEVIY+L  V WR+QQ+ +D    G K                       
Sbjct: 93  VLHMQQYFSVAEVIYALQHVAWRRQQRYYDPVKAGAK---------------------EF 131

Query: 174 KVNGVVKIDVVNKEAAVKQGETKELVGNPEENNSMK----------SSVIEVGDSQNEVD 323
           K +GV       +  A K+G    L  +  + NS            S V E  +   EV 
Sbjct: 132 KRSGVGFNKGQQRAEAFKEGHNSTLESHSNDGNSSGVVAPEKFERGSEVGEEVEPGGEVG 191

Query: 324 KTDDKRDSNSDGSSTVENESHSVQVPTEKQN--VVPKTFVATEIYDGKPVNVVDGMKLYE 497
           K +DK  + + G   V NESHS+Q+  +KQN  +VPKTF+  EI DGK VNVVDG+KLYE
Sbjct: 192 KLNDKGLAPA-GEKKV-NESHSIQIQNQKQNLSIVPKTFIGNEISDGKTVNVVDGLKLYE 249

Query: 498 ELLSNSEVSKLVTLVNDLRVAGRRGQLPAQTFIVSKRPMKGHGREMIQLGLPIVDAPPED 677
           + L ++EVSKLV+LVNDLR AG+R QL  QT++VSKRPMKGHGREMIQLG+PI DAPPED
Sbjct: 250 DFLGDTEVSKLVSLVNDLRAAGKRRQLQGQTYVVSKRPMKGHGREMIQLGIPIADAPPED 309

Query: 678 EAAIATYKDRKTE 716
           E +  T KDRK E
Sbjct: 310 EISAGTSKDRKIE 322


>emb|CBI26785.3| unnamed protein product [Vitis vinifera]
          Length = 672

 Score =  200 bits (509), Expect = 3e-49
 Identities = 133/283 (46%), Positives = 154/283 (54%), Gaps = 45/283 (15%)
 Frame = +3

Query: 3   VLHMQQYFSVAEVIYSLHQVEWRKQQKGFD---GGGKMXXXXXXXXXXXXXXXXXXXXXL 173
           VLHMQQYFSVAEVIY+L QV WR+QQ+  D   G GK                       
Sbjct: 93  VLHMQQYFSVAEVIYALQQVGWRRQQRHLDPVKGAGKEYKRYGVAYRQGQRGETAKDSHN 152

Query: 174 K---------------------------VNGVVKIDVVNK------EAAVKQGETKELVG 254
                                       V G  K DVV K       AA ++    + V 
Sbjct: 153 SNFENHSHDANSSGTLEKGERVSEIYDDVKGGDKGDVVGKLEDKDLAAAEEKKAGTDAVA 212

Query: 255 NPEENNSMKSSVIEVGD----SQNEVDKTDDKRDSNSDGSSTV--ENESHSVQVPTEKQN 416
            P  N+  KSS    G     S+ E +  DD    N  GS  +  EN +H VQ   EK N
Sbjct: 213 KPNANSCSKSSENSEGSRCGISETEANDMDDGGTLNPKGSCNMIMENNAHPVQNQNEKPN 272

Query: 417 VV--PKTFVATEIYDGKPVNVVDGMKLYEELLSNSEVSKLVTLVNDLRVAGRRGQLPA-Q 587
               PKTFV TEI+DGK VNVVDG+KLYEEL  +SEVSK V+LVNDLR AG+RGQL A Q
Sbjct: 273 PTTSPKTFVGTEIFDGKAVNVVDGLKLYEELFDDSEVSKFVSLVNDLRAAGKRGQLQAGQ 332

Query: 588 TFIVSKRPMKGHGREMIQLGLPIVDAPPEDEAAIATYKDRKTE 716
           TF+VSKRPMKGHGREMIQLG+PI DAP EDE+ + T KDR+TE
Sbjct: 333 TFVVSKRPMKGHGREMIQLGVPIADAPLEDESVVGTSKDRRTE 375


>ref|XP_002281644.1| PREDICTED: uncharacterized protein LOC100252594 [Vitis vinifera]
          Length = 698

 Score =  200 bits (508), Expect = 4e-49
 Identities = 130/280 (46%), Positives = 151/280 (53%), Gaps = 42/280 (15%)
 Frame = +3

Query: 3   VLHMQQYFSVAEVIYSLHQVEWRKQQKGFD---GGGKMXXXXXXXXXXXXXXXXXXXXXL 173
           VLHMQQYFSVAEVIY+L QV WR+QQ+  D   G GK                       
Sbjct: 93  VLHMQQYFSVAEVIYALQQVGWRRQQRHLDPVKGAGKEYKRYGVAYRQGQRGETAKDSHN 152

Query: 174 K---------------------------VNGVVKIDVVNK------EAAVKQGETKELVG 254
                                       V G  K DVV K       AA ++    + V 
Sbjct: 153 SNFENHSHDANSSGTLEKGERVSEIYDDVKGGDKGDVVGKLEDKDLAAAEEKKAGTDAVA 212

Query: 255 NPEENNSMKSSVIEVGD----SQNEVDKTDDKRDSNSDGSSTVENESHSVQVPTEKQNVV 422
            P  N+  KSS    G     S+ E +  DD    N      +EN +H VQ   EK N  
Sbjct: 213 KPNANSCSKSSENSEGSRCGISETEANDMDDGGSCNM----IMENNAHPVQNQNEKPNPT 268

Query: 423 --PKTFVATEIYDGKPVNVVDGMKLYEELLSNSEVSKLVTLVNDLRVAGRRGQLPAQTFI 596
             PKTFV TEI+DGK VNVVDG+KLYEEL  +SEVSK V+LVNDLR AG+RGQL  QTF+
Sbjct: 269 TSPKTFVGTEIFDGKAVNVVDGLKLYEELFDDSEVSKFVSLVNDLRAAGKRGQLQGQTFV 328

Query: 597 VSKRPMKGHGREMIQLGLPIVDAPPEDEAAIATYKDRKTE 716
           VSKRPMKGHGREMIQLG+PI DAP EDE+ + T KDR+TE
Sbjct: 329 VSKRPMKGHGREMIQLGVPIADAPLEDESVVGTSKDRRTE 368


>ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309147 [Fragaria vesca
           subsp. vesca]
          Length = 682

 Score =  192 bits (487), Expect = 1e-46
 Identities = 117/265 (44%), Positives = 150/265 (56%), Gaps = 27/265 (10%)
 Frame = +3

Query: 3   VLHMQQYFSVAEVIYSLHQVEWRKQQKGFDG---GGK--------MXXXXXXXXXXXXXX 149
           VLHMQQYFSVAEVIY+L QV WR+QQ+ ++    G K        +              
Sbjct: 92  VLHMQQYFSVAEVIYALQQVAWRRQQRYYEPVKMGNKDYKRSNSGVGFKPRNEPVKEWHT 151

Query: 150 XXXXXXXLKVNGVVKIDVVNKEAAVKQGE--------------TKELVGNPEENNSMKSS 287
                     +G+ K+    +E     GE              TK ++  P E  S +SS
Sbjct: 152 ASVEYRSYDGSGLEKVGSEMREEVKPGGEAGKVDDKGSAAGAVTKGVLTKPHEYISSRSS 211

Query: 288 VIEVGDSQNEVDKTDDKRDSNSDGSSTVENESHSVQVPTEKQNV--VPKTFVATEIYDGK 461
               G       +++D   +    SS  ENES+S+Q+  EKQN+  +PKTFV  E +DGK
Sbjct: 212 ANSQGTISGN-SESEDAVVNEGCTSSIKENESNSIQIQNEKQNLSLIPKTFVGNETFDGK 270

Query: 462 PVNVVDGMKLYEELLSNSEVSKLVTLVNDLRVAGRRGQLPAQTFIVSKRPMKGHGREMIQ 641
            VNVVDG+KLYEE L ++EVSKL +LVNDLR  GRRGQL  QT+++SKRPMKGHGREMIQ
Sbjct: 271 TVNVVDGLKLYEEFLGDTEVSKLFSLVNDLRTTGRRGQLQGQTYVLSKRPMKGHGREMIQ 330

Query: 642 LGLPIVDAPPEDEAAIATYKDRKTE 716
           LG+PI D P EDE +    KDR+ E
Sbjct: 331 LGIPIADGPQEDEISAGISKDRRME 355


>ref|XP_006448091.1| hypothetical protein CICLE_v10014588mg [Citrus clementina]
           gi|557550702|gb|ESR61331.1| hypothetical protein
           CICLE_v10014588mg [Citrus clementina]
          Length = 635

 Score =  188 bits (477), Expect = 2e-45
 Identities = 114/256 (44%), Positives = 147/256 (57%), Gaps = 18/256 (7%)
 Frame = +3

Query: 3   VLHMQQYFSVAEVIYSLHQVEWRKQQKGFDGGGKMXXXXXXXXXXXXXXXXXXXXXLKVN 182
           VLH+QQYFSV+EV+ +L QV WRKQQ+ FD                           K +
Sbjct: 64  VLHLQQYFSVSEVMLALQQVAWRKQQRSFD---------HHHHHHHHHQQQHHLNRTKRS 114

Query: 183 GVVKIDVVNKEAAVKQG------------ETKELVGNPEENNSMKS----SVIEVGDSQN 314
             VK D  N                    + K++V    ++ S KS     + +VGD++ 
Sbjct: 115 AFVKKDFHNNNNNNNNNNHAFDSNSSAFDDKKDVVMKAHDDGSAKSLGNSEITQVGDAEP 174

Query: 315 EVDKTDDKRDSNSDGSSTVENESHSVQVPTEKQN--VVPKTFVATEIYDGKPVNVVDGMK 488
           + +  DD         S  EN+S SVQ   EKQN  +  K+FV TE+ DGK VNVVDG+K
Sbjct: 175 KAEALDD-----GCTPSLKENDSQSVQSQNEKQNQSMAAKSFVGTEMVDGKMVNVVDGLK 229

Query: 489 LYEELLSNSEVSKLVTLVNDLRVAGRRGQLPAQTFIVSKRPMKGHGREMIQLGLPIVDAP 668
           LYEE+  NSEVSKLV+LVNDLR AG+RGQ+    ++VSKRP++GHGRE+IQLGLPIVD P
Sbjct: 230 LYEEVSGNSEVSKLVSLVNDLRTAGKRGQIQGPAYVVSKRPIRGHGREVIQLGLPIVDGP 289

Query: 669 PEDEAAIATYKDRKTE 716
           PEDE A  T +DR+ E
Sbjct: 290 PEDEIAAGTSRDRRIE 305


>ref|XP_006469304.1| PREDICTED: uncharacterized protein LOC102618872 [Citrus sinensis]
          Length = 627

 Score =  187 bits (476), Expect = 2e-45
 Identities = 113/253 (44%), Positives = 146/253 (57%), Gaps = 15/253 (5%)
 Frame = +3

Query: 3   VLHMQQYFSVAEVIYSLHQVEWRKQQKGFDGGGKMXXXXXXXXXXXXXXXXXXXXXLKVN 182
           VLH+QQYFSV+EV+ +L QV WRKQQ+ FD                           K +
Sbjct: 64  VLHLQQYFSVSEVMLALQQVAWRKQQRSFD-------------HHHHHQQQHHLNRTKRS 110

Query: 183 GVVKIDVVNKEAAVKQG---------ETKELVGNPEENNSMKS----SVIEVGDSQNEVD 323
             VK D  N                 + K++V    ++ S KS     + +VGD++ + +
Sbjct: 111 AFVKKDFHNNNNNNNHAFDSNSSAFDDKKDVVMKAHDDGSAKSLGNSEITQVGDAEPKAE 170

Query: 324 KTDDKRDSNSDGSSTVENESHSVQVPTEKQN--VVPKTFVATEIYDGKPVNVVDGMKLYE 497
             DD            EN+S SVQ   EKQN  +  K+FV TE+ DGK VNVVDG+KLYE
Sbjct: 171 ALDD-----GCTPGLKENDSQSVQSQNEKQNQSMAAKSFVGTEMVDGKMVNVVDGLKLYE 225

Query: 498 ELLSNSEVSKLVTLVNDLRVAGRRGQLPAQTFIVSKRPMKGHGREMIQLGLPIVDAPPED 677
           E+  NSEVSKLV+LVNDLR AG+RGQ+    ++VSKRP++GHGRE+IQLGLPIVD PPED
Sbjct: 226 EVSGNSEVSKLVSLVNDLRTAGKRGQIQGPAYVVSKRPIRGHGREVIQLGLPIVDGPPED 285

Query: 678 EAAIATYKDRKTE 716
           E A  T +DR+ E
Sbjct: 286 EIAAGTSRDRRIE 298


>gb|EXC05040.1| hypothetical protein L484_019288 [Morus notabilis]
          Length = 681

 Score =  186 bits (472), Expect = 7e-45
 Identities = 115/268 (42%), Positives = 149/268 (55%), Gaps = 30/268 (11%)
 Frame = +3

Query: 3   VLHMQQYFSVAEVIYSLHQVEWRKQQKGFD-----------GGGKMXXXXXXXXXXXXXX 149
           VLHMQQYFSVAEV+++L QV WR+QQ+ +D            G                 
Sbjct: 90  VLHMQQYFSVAEVMFALQQVAWRRQQRFYDPVKMGNKEFKRSGVGFKQWQRNDSFKDGRN 149

Query: 150 XXXXXXXLKVNGVVKIDVVNKEAAVKQGETKELVGNPEENNSMKSS-------------- 287
                  L  N         K  + K G+    VGN ++  SM ++              
Sbjct: 150 SAAESHCLDGNSSFGNAASEKGGSDKSGDE---VGNSDDRGSMPAAKEKNDSAAKSQEDG 206

Query: 288 -VIEVGDSQNEVDKTDDKRDSNSDG--SSTVENESHSVQVPTEKQNV--VPKTFVATEIY 452
            V  +G+ +  V  ++ +  +  DG  SS+ EN+SHS     E  N+  VPKTF   E++
Sbjct: 207 NVKSLGNFEGVVSGSEPEVHAVDDGCTSSSKENDSHSTPKQNENSNLANVPKTFSGNEMF 266

Query: 453 DGKPVNVVDGMKLYEELLSNSEVSKLVTLVNDLRVAGRRGQLPAQTFIVSKRPMKGHGRE 632
           DGKPVNVV+G+KLYEE  +++EVSKLV LVNDLR AG RG   +QT++VSKRPMKGHGRE
Sbjct: 267 DGKPVNVVEGLKLYEEFCADTEVSKLVALVNDLRSAGERGHFQSQTYVVSKRPMKGHGRE 326

Query: 633 MIQLGLPIVDAPPEDEAAIATYKDRKTE 716
            IQLGLPI DAP EDE +  T KDR+TE
Sbjct: 327 KIQLGLPIADAPVEDEISAGTLKDRRTE 354


>gb|EOY01300.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2
           [Theobroma cacao] gi|508709405|gb|EOY01302.1|
           Hydroxyproline-rich glycoprotein family protein,
           putative isoform 2 [Theobroma cacao]
          Length = 680

 Score =  184 bits (466), Expect = 3e-44
 Identities = 113/252 (44%), Positives = 145/252 (57%), Gaps = 14/252 (5%)
 Frame = +3

Query: 3   VLHMQQYFSVAEVIYSLHQVEWRKQQKGFDGG---GKMXXXXXXXXXXXXXXXXXXXXXL 173
           VLHMQQYFSVAEV Y+L QV WR++Q+ ++ G   GK                       
Sbjct: 107 VLHMQQYFSVAEVSYALQQVAWRRRQRHYESGKVGGKEFKRSGMGFKGQRMEVAKEGQNS 166

Query: 174 KVNG-----VVKIDVVNKEAAVKQGETKEL--VGNPEEN-NSMKSSVIEVGDSQNEVDKT 329
            V+      V  +   N+  + K+ E K    VG  E+  ++      + G   +  D  
Sbjct: 167 GVDSDGNSTVTAVSERNERGSEKREEVKSCGEVGKVEDKCSTFTEDKKDTGSKPHAGDAE 226

Query: 330 DDKRDSNSDGSSTV-ENESHSVQVPTEKQNVV--PKTFVATEIYDGKPVNVVDGMKLYEE 500
               D N   +S+  EN+  S+Q   EKQN+   PKTFV  E++DGK VNVVDG+KLYEE
Sbjct: 227 SVTEDVNGGCTSSYKENDLCSIQNQNEKQNLAAGPKTFVGNEMFDGKMVNVVDGLKLYEE 286

Query: 501 LLSNSEVSKLVTLVNDLRVAGRRGQLPAQTFIVSKRPMKGHGREMIQLGLPIVDAPPEDE 680
           L  + EV  LV+LVNDLR AG+RGQL  QT++ +KRPMKGHGREMIQLGLPI DAP +DE
Sbjct: 287 LFDDKEVLDLVSLVNDLRAAGKRGQLQGQTYVAAKRPMKGHGREMIQLGLPIADAPLDDE 346

Query: 681 AAIATYKDRKTE 716
            A  T KDR+ E
Sbjct: 347 NAAGTSKDRRIE 358


>emb|CAN65462.1| hypothetical protein VITISV_002198 [Vitis vinifera]
          Length = 1145

 Score =  182 bits (461), Expect = 1e-43
 Identities = 127/291 (43%), Positives = 153/291 (52%), Gaps = 53/291 (18%)
 Frame = +3

Query: 3   VLHMQQYFSVAEVIYSLHQVEWRKQQKGFD---GGGKMXXXXXXXXXXXXXXXXXXXXXL 173
           VLHMQQYFSVAEVIY+L QV WR+QQ+  D   G GK                       
Sbjct: 91  VLHMQQYFSVAEVIYALQQVGWRRQQRHLDPVKGAGKEYKRYGVAYRQGQRGETAKDSHN 150

Query: 174 K---------------------------VNGVVKIDVVNK------EAAVKQGETKELV- 251
                                       V G  K DVV K       AA ++ E    V 
Sbjct: 151 SNFENHSHDANSSGTLEKGERVSEIYDDVKGGDKGDVVGKLEDKDLSAAAEKKEVMNFVI 210

Query: 252 -GNPEE---NNSMKSSVIEVGDSQNEVDKTDDKRDSNS------DGSSTVENESHSVQVP 401
            G  E+    N M+ +V  V  +Q + D    +    +        +  +EN +H VQ  
Sbjct: 211 FGQLEQMLLQNPMQIAVRRVQKTQKDPDVAFQRLRPMTWMMEARSCNMIMENNAHPVQNQ 270

Query: 402 TEKQNVV--PKTFVATEIYDGKPVNVVDGMKLYEELLSNSEVSKLVTLVNDLRVAGRRGQ 575
            EK N    PKTFV TEI+DGK VNVVDG+KLYEEL  +SEVSK V+LVNDLR AG+RGQ
Sbjct: 271 NEKPNPTTSPKTFVGTEIFDGKAVNVVDGLKLYEELFDDSEVSKFVSLVNDLRAAGKRGQ 330

Query: 576 LPAQTFIVSKRPMKGHGREMIQLGLPIVDAPPEDEAAIATYK----DRKTE 716
           L  QTF+VSKRPMKGHGREMIQLG+PI DAP EDE+ + T K    +R+TE
Sbjct: 331 LQGQTFVVSKRPMKGHGREMIQLGVPIADAPLEDESVVGTSKGMFHNRRTE 381


>gb|EOY01299.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1
           [Theobroma cacao] gi|508709404|gb|EOY01301.1|
           Hydroxyproline-rich glycoprotein family protein,
           putative isoform 1 [Theobroma cacao]
          Length = 681

 Score =  181 bits (458), Expect = 3e-43
 Identities = 114/253 (45%), Positives = 146/253 (57%), Gaps = 15/253 (5%)
 Frame = +3

Query: 3   VLHMQQYFSVAEVIYSLHQVEWRKQQKGFDGG---GKMXXXXXXXXXXXXXXXXXXXXXL 173
           VLHMQQYFSVAEV Y+L QV WR++Q+ ++ G   GK                       
Sbjct: 107 VLHMQQYFSVAEVSYALQQVAWRRRQRHYESGKVGGKEFKRSGMGFKGQRMEVAKEGQNS 166

Query: 174 KVNG-----VVKIDVVNKEAAVKQGETKEL--VGNPEEN-NSMKSSVIEVGDSQNEVDKT 329
            V+      V  +   N+  + K+ E K    VG  E+  ++      + G   +  D  
Sbjct: 167 GVDSDGNSTVTAVSERNERGSEKREEVKSCGEVGKVEDKCSTFTEDKKDTGSKPHAGDAE 226

Query: 330 DDKRDSNSDGSSTV-ENESHSVQVPTEKQNVV--PKTFVATEIYDGKPVNVVDGMKLYEE 500
               D N   +S+  EN+  S+Q   EKQN+   PKTFV  E++DGK VNVVDG+KLYEE
Sbjct: 227 SVTEDVNGGCTSSYKENDLCSIQNQNEKQNLAAGPKTFVGNEMFDGKMVNVVDGLKLYEE 286

Query: 501 LLSNSEVSKLVTLVNDLRVAGRRGQLPA-QTFIVSKRPMKGHGREMIQLGLPIVDAPPED 677
           L  + EV  LV+LVNDLR AG+RGQL A QT++ +KRPMKGHGREMIQLGLPI DAP +D
Sbjct: 287 LFDDKEVLDLVSLVNDLRAAGKRGQLQAGQTYVAAKRPMKGHGREMIQLGLPIADAPLDD 346

Query: 678 EAAIATYKDRKTE 716
           E A  T KDR+ E
Sbjct: 347 ENAAGTSKDRRIE 359


>gb|EOY01303.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 5
           [Theobroma cacao]
          Length = 572

 Score =  174 bits (442), Expect = 2e-41
 Identities = 111/250 (44%), Positives = 143/250 (57%), Gaps = 15/250 (6%)
 Frame = +3

Query: 12  MQQYFSVAEVIYSLHQVEWRKQQKGFDGG---GKMXXXXXXXXXXXXXXXXXXXXXLKVN 182
           MQQYFSVAEV Y+L QV WR++Q+ ++ G   GK                        V+
Sbjct: 1   MQQYFSVAEVSYALQQVAWRRRQRHYESGKVGGKEFKRSGMGFKGQRMEVAKEGQNSGVD 60

Query: 183 G-----VVKIDVVNKEAAVKQGETKEL--VGNPEEN-NSMKSSVIEVGDSQNEVDKTDDK 338
                 V  +   N+  + K+ E K    VG  E+  ++      + G   +  D     
Sbjct: 61  SDGNSTVTAVSERNERGSEKREEVKSCGEVGKVEDKCSTFTEDKKDTGSKPHAGDAESVT 120

Query: 339 RDSNSDGSSTV-ENESHSVQVPTEKQNVV--PKTFVATEIYDGKPVNVVDGMKLYEELLS 509
            D N   +S+  EN+  S+Q   EKQN+   PKTFV  E++DGK VNVVDG+KLYEEL  
Sbjct: 121 EDVNGGCTSSYKENDLCSIQNQNEKQNLAAGPKTFVGNEMFDGKMVNVVDGLKLYEELFD 180

Query: 510 NSEVSKLVTLVNDLRVAGRRGQLPA-QTFIVSKRPMKGHGREMIQLGLPIVDAPPEDEAA 686
           + EV  LV+LVNDLR AG+RGQL A QT++ +KRPMKGHGREMIQLGLPI DAP +DE A
Sbjct: 181 DKEVLDLVSLVNDLRAAGKRGQLQAGQTYVAAKRPMKGHGREMIQLGLPIADAPLDDENA 240

Query: 687 IATYKDRKTE 716
             T KDR+ E
Sbjct: 241 AGTSKDRRIE 250


>ref|XP_006379789.1| hypothetical protein POPTR_0008s13830g [Populus trichocarpa]
           gi|550333016|gb|ERP57586.1| hypothetical protein
           POPTR_0008s13830g [Populus trichocarpa]
          Length = 693

 Score =  170 bits (430), Expect = 5e-40
 Identities = 119/281 (42%), Positives = 142/281 (50%), Gaps = 43/281 (15%)
 Frame = +3

Query: 3   VLHMQQYFSVAEVIYSLHQVEWRKQQKG-------------FDGGGKMXXXXXXXXXXXX 143
           VLHMQQYFSV EVI +L QV  R+QQ+              +   GK+            
Sbjct: 97  VLHMQQYFSVGEVIVALQQVVLRRQQQQQQQQQNHHHQQRFYYDHGKVGGRDFKRSSSAG 156

Query: 144 XXXXXXXXXLKVNGVVKIDVVNKEAAVKQGETKELVGNPEENNSMKSSVIEVGDSQNEVD 323
                        G    D V KE      E     GN  EN  ++S   E   S  +  
Sbjct: 157 FNRGHRGGGGGGGG----DAV-KEGVNSSVENHSFNGNSSEN--IRSEKFEEVKSGGDGG 209

Query: 324 KTDDKRDSNS----------------------------DGSSTVENESHSVQVPTEKQN- 416
           K+DDK+D+ +                            D SS  E++SH      EKQN 
Sbjct: 210 KSDDKKDATAKSHTDNHKNSSGNAQGTFSGNSEAVAVDDRSSPEESDSHPSNNQNEKQNL 269

Query: 417 -VVPKTFVATEIYDGKPVNVVDGMKLYEELLSNSEVSKLVTLVNDLRVAGRRGQLPAQTF 593
            + PKTFVA E  DG+ VNVVDG+KLYE LL   EVSKLV+LVN+LR  GRRGQ   QT+
Sbjct: 270 AITPKTFVAEEKIDGQMVNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQGQTY 329

Query: 594 IVSKRPMKGHGREMIQLGLPIVDAPPEDEAAIATYKDRKTE 716
           I+SKRPMKGHGREMIQLGLPI DAP EDE A  T K+R+ E
Sbjct: 330 ILSKRPMKGHGREMIQLGLPIADAPAEDENATGTSKERRVE 370


>ref|XP_002527549.1| conserved hypothetical protein [Ricinus communis]
           gi|223533099|gb|EEF34858.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 697

 Score =  170 bits (430), Expect = 5e-40
 Identities = 117/279 (41%), Positives = 148/279 (53%), Gaps = 41/279 (14%)
 Frame = +3

Query: 3   VLHMQQYFSVAEVIYSLHQVEWRKQQ---------------------------------- 80
           VLHMQQYFSV EVI +L QV  RKQQ                                  
Sbjct: 103 VLHMQQYFSVGEVILALQQVALRKQQQHQHQHQHQQHRYYYDQPKVGGKDFKRNSSMGFN 162

Query: 81  KGFDGGGKMXXXXXXXXXXXXXXXXXXXXXLKVNGVVKID----VVNKEAAVKQGETKEL 248
           KG  GGG++                      K N +        + NK  A  + + K+ 
Sbjct: 163 KGHRGGGEVVKEVNYGAESHGLDGNTSGNE-KFNEIKSGGDSGRLENKSLATAE-DKKDA 220

Query: 249 VGNPEENNSMKSSVIEVGDSQNEVD-KTDDKRDSNSDGSSTVENESHSVQVPTEKQNVV- 422
              P  +N +KSS    G+S+  +    + + ++  + SS  E++SH +Q    K N+  
Sbjct: 221 ASKPHVDN-LKSS----GNSEGSLSGNLETEAEAVHEQSSPKEHDSHFIQNQIVKLNLTT 275

Query: 423 -PKTFVATEIYDGKPVNVVDGMKLYEELLSNSEVSKLVTLVNDLRVAGRRGQLPAQTFIV 599
            PKTFV  E+ DGK VNVVDG+KLYE+LL + EVSKLV+LVNDLR AGR+GQ   Q ++V
Sbjct: 276 TPKTFVGAEMVDGKSVNVVDGLKLYEQLLDDVEVSKLVSLVNDLRAAGRKGQFQGQAYVV 335

Query: 600 SKRPMKGHGREMIQLGLPIVDAPPEDEAAIATYKDRKTE 716
           SKRPMKGHGREMIQLGLPI DAP E+E A  T KDRK E
Sbjct: 336 SKRPMKGHGREMIQLGLPIADAPAEEENAAGTSKDRKIE 374


>gb|ABK95394.1| unknown [Populus trichocarpa]
          Length = 694

 Score =  169 bits (428), Expect = 8e-40
 Identities = 117/278 (42%), Positives = 144/278 (51%), Gaps = 40/278 (14%)
 Frame = +3

Query: 3   VLHMQQYFSVAEVIYSLHQVEWRKQQKGFDGGGKMXXXXXXXXXXXXXXXXXXXXXLKVN 182
           VLHMQQYFSV EVI +L QV  R+QQ+      +                       K +
Sbjct: 97  VLHMQQYFSVGEVIVALQQVVLRRQQQQ-QQQQQQQQNHHHQQRFYYDHGKVGGRDFKRS 155

Query: 183 GVVKIDVVNKEA-----AVKQG-----ETKELVGNPEENNSMKSSVIEVGDSQNEVDKTD 332
                +  ++       AVK+G     E     GN  EN  ++S   E   S  +  K+D
Sbjct: 156 SSAGFNRGHRGGGGGGDAVKEGVNSSVENHSFNGNSSEN--IRSEKFEEVKSGGDGGKSD 213

Query: 333 DKRDSNS----------------------------DGSSTVENESHSVQVPTEKQN--VV 422
           DK+D+ +                            D SS  E++SH      EKQN  + 
Sbjct: 214 DKKDATAKSHTDNHKNSSGNAQGTFSGNSEAVAVDDRSSPEESDSHPSNNQNEKQNLAIT 273

Query: 423 PKTFVATEIYDGKPVNVVDGMKLYEELLSNSEVSKLVTLVNDLRVAGRRGQLPAQTFIVS 602
           PKTFVA E  DG+ VNVVDG+KLYE LL   EVSKLV+LVN+LR  GRRGQ   QT+I+S
Sbjct: 274 PKTFVAEEKIDGQMVNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQGQTYILS 333

Query: 603 KRPMKGHGREMIQLGLPIVDAPPEDEAAIATYKDRKTE 716
           KRPMKGHGREMIQLGLPI DAP EDE A  T K+R+ E
Sbjct: 334 KRPMKGHGREMIQLGLPIADAPAEDENATGTSKERRVE 371


>ref|XP_006583757.1| PREDICTED: uncharacterized protein LOC100781773 [Glycine max]
          Length = 664

 Score =  166 bits (420), Expect = 7e-39
 Identities = 115/261 (44%), Positives = 145/261 (55%), Gaps = 23/261 (8%)
 Frame = +3

Query: 3   VLHMQQYFSVAEVIYSLHQVEWRKQQKGFD------------GGGKMXXXXXXXXXXXXX 146
           VL MQQYFSV+EV+Y+L QV WR+QQ+  D            G G               
Sbjct: 91  VLLMQQYFSVSEVVYALQQVSWRRQQRVVDPAKTGAKEFRKFGLGFKQGQHRFEAVKDGY 150

Query: 147 XXXXXXXXLKVNGVVKIDVVNKEAAV--KQGETKE--LVGNPEENNSMKSSVIEVGDSQN 314
                      N VV    V K A V  K GE K   +VG  +  N       +   + +
Sbjct: 151 NSSVESFGHGTNAVVVAGGVEKGACVTEKNGEIKSGGMVGTMDNKNLGSPEERKDAITNH 210

Query: 315 EVDKTDDKRDSNSDGS-STVENESHSVQ---VPTEKQN--VVPKTFVATEIYDGKPVNVV 476
           + D    K   NS GS S+ E E+  V    V   K+N  ++ K F+  E++DGK VNVV
Sbjct: 211 QSDGIL-KGSRNSQGSLSSSECEAVGVNEECVSNSKENDSIMGKFFIGNEMFDGKMVNVV 269

Query: 477 DGMKLYEELLSNSEVSKLVTLVNDLRVAGRRGQLPA-QTFIVSKRPMKGHGREMIQLGLP 653
           DG+KLYE+LL ++EVSKLV+LVNDLRVAG+RGQ    QTF+VSKRPMKGHGREMIQLG+P
Sbjct: 270 DGLKLYEDLLDSTEVSKLVSLVNDLRVAGKRGQFQGNQTFVVSKRPMKGHGREMIQLGVP 329

Query: 654 IVDAPPEDEAAIATYKDRKTE 716
           I DAPP+ +      KD+K E
Sbjct: 330 IADAPPDVDNVTGISKDKKVE 350


>ref|XP_004513242.1| PREDICTED: uncharacterized protein LOC101506929 isoform X1 [Cicer
           arietinum]
          Length = 669

 Score =  166 bits (420), Expect = 7e-39
 Identities = 108/273 (39%), Positives = 147/273 (53%), Gaps = 35/273 (12%)
 Frame = +3

Query: 3   VLHMQQYFSVAEVIYSLHQVEWRKQQK----------------GFDGGGKMXXXXXXXXX 134
           VL MQQY+SV+EV Y+L QV WR+QQ+                 F+GGG +         
Sbjct: 94  VLLMQQYYSVSEVAYALQQVAWRRQQRVVKPVAREFKKVRQWQRFEGGGNVKEGCNSGVE 153

Query: 135 XXXXXXXXXXXXLKVNGVVK-IDVVNKEAAVKQG---------------ETKELVGNPEE 266
                        + N  VK   VV+K   +K G               E K+   N + 
Sbjct: 154 FHRN---------EANSTVKGTRVVDKSEELKSGGKVGVKDDKSSDIAEEKKDTTTNHQS 204

Query: 267 NNSMKSSVIEVGDSQNEVDKTDDKRDSNSDGSSTVENESHSVQVPTEKQN--VVPKTFVA 440
           +  +KS V   G   +   K +D  +  +  S   EN+SHS+Q   + +N     KTF A
Sbjct: 205 DGILKSPVNSQGSLSSAEYKAEDVNEEGASNSG--ENDSHSIQNQHQNENGSFTGKTFTA 262

Query: 441 TEIYDGKPVNVVDGMKLYEELLSNSEVSKLVTLVNDLRVAGRRGQLPA-QTFIVSKRPMK 617
            E++DGK VN V+G+KLYE+L  ++EVSKLV+LVNDLRVAGR+GQL   QT++VSKRPM+
Sbjct: 263 NEMFDGKTVNAVEGLKLYEDLFDSTEVSKLVSLVNDLRVAGRKGQLQGNQTYVVSKRPMR 322

Query: 618 GHGREMIQLGLPIVDAPPEDEAAIATYKDRKTE 716
           G GREMIQLG+PI  A P+ +   A+ KD+  E
Sbjct: 323 GRGREMIQLGVPIAYASPDVDNVTASTKDKNME 355


>ref|XP_004513244.1| PREDICTED: uncharacterized protein LOC101507475 [Cicer arietinum]
          Length = 657

 Score =  165 bits (418), Expect = 1e-38
 Identities = 105/265 (39%), Positives = 154/265 (58%), Gaps = 27/265 (10%)
 Frame = +3

Query: 3   VLHMQQYFSVAEVIYSLHQVEWRKQQKGFDGGGK-MXXXXXXXXXXXXXXXXXXXXXLKV 179
           VL MQQY+SV+EV Y+L QV WR+QQ+      K                       +++
Sbjct: 91  VLLMQQYYSVSEVSYALQQVAWRRQQRVVKPVVKEFRKVRQWQRFEGANVKEGCNSSVEL 150

Query: 180 NG------VVKIDVVNKEAAVKQG---------------ETKELVGNPEENNSMKSSVIE 296
           NG      V +  V++K   +K                 E K+ + N +  N +K S   
Sbjct: 151 NGNKANLSVKETPVIDKIGELKSEGKVGTKDDKSSDIGEEKKDTITNHQSGNILKRS--- 207

Query: 297 VGDSQNEVDKTDDKRDSNSDG--SSTVENESHSVQVPTEKQN--VVPKTFVATEIYDGKP 464
            G+SQ  +  ++ +    ++G  S++ EN+SHS+Q   +K+N   + K F+  EI DGK 
Sbjct: 208 -GNSQGSLSSSECEAVGVNEGITSNSRENDSHSMQNQNQKENNSTMGKAFIGNEIVDGKM 266

Query: 465 VNVVDGMKLYEELLSNSEVSKLVTLVNDLRVAGRRGQLPA-QTFIVSKRPMKGHGREMIQ 641
           VNVVDG+KL+E+L  ++EVSKLV+LVND+R+AG++GQ    QT++VSKRPM+GHGREMIQ
Sbjct: 267 VNVVDGLKLHEDLFDSTEVSKLVSLVNDMRIAGKKGQFQGNQTYVVSKRPMRGHGREMIQ 326

Query: 642 LGLPIVDAPPEDEAAIATYKDRKTE 716
           LGLPIVDAP +++   A+ K +K E
Sbjct: 327 LGLPIVDAPQDEDNMTASTKGKKIE 351


>ref|XP_002311547.2| hydroxyproline-rich glycoprotein [Populus trichocarpa]
           gi|550333015|gb|EEE88914.2| hydroxyproline-rich
           glycoprotein [Populus trichocarpa]
          Length = 675

 Score =  165 bits (417), Expect = 2e-38
 Identities = 115/259 (44%), Positives = 135/259 (52%), Gaps = 26/259 (10%)
 Frame = +3

Query: 3   VLHMQQYFSVAEVIYSLHQVEWRKQQKG-------------FDGGGKMXXXXXXXXXXXX 143
           VLHMQQYFSV EVI +L QV  R+QQ+              +   GK+            
Sbjct: 97  VLHMQQYFSVGEVIVALQQVVLRRQQQQQQQQQNHHHQQRFYYDHGKVGGRDFKRSSSAG 156

Query: 144 XXXXXXXXXLKVNGVVKIDVVNKEAAVKQGETKELVGNPEENNSMKSSVIEVGDSQNEVD 323
                        G    D V KE      E     GN  EN  ++S   E   S  +  
Sbjct: 157 FNRGHRGGGGGGGG----DAV-KEGVNSSVENHSFNGNSSEN--IRSEKFEEVKSGGDGG 209

Query: 324 KTDDKR-----------DSNSDGSSTVENESHSVQVPTEKQN--VVPKTFVATEIYDGKP 464
           K+DDK+             NS G++      +S  V  EKQN  + PKTFVA E  DG+ 
Sbjct: 210 KSDDKKADATAKSHTDNHKNSSGNAQGTFSGNSEAVANEKQNLAITPKTFVAEEKIDGQM 269

Query: 465 VNVVDGMKLYEELLSNSEVSKLVTLVNDLRVAGRRGQLPAQTFIVSKRPMKGHGREMIQL 644
           VNVVDG+KLYE LL   EVSKLV+LVN+LR  GRRGQ   QT+I+SKRPMKGHGREMIQL
Sbjct: 270 VNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQGQTYILSKRPMKGHGREMIQL 329

Query: 645 GLPIVDAPPEDEAAIATYK 701
           GLPI DAP EDE A  T K
Sbjct: 330 GLPIADAPAEDENATGTSK 348

