BLASTX nr result

ID: Rehmannia25_contig00023268

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia25_contig00023268
         (834 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EMJ26321.1| hypothetical protein PRUPE_ppa002630mg [Prunus pe...   166   1e-38
ref|XP_006355042.1| PREDICTED: uncharacterized protein LOC102600...   162   1e-37
gb|ESW30780.1| hypothetical protein PHAVU_002G181800g [Phaseolus...   154   4e-35
ref|XP_006580091.1| PREDICTED: uncharacterized protein LOC100809...   153   6e-35
ref|XP_002281644.1| PREDICTED: uncharacterized protein LOC100252...   127   6e-27
emb|CAN65462.1| hypothetical protein VITISV_002198 [Vitis vinifera]   125   1e-26
ref|XP_002311547.2| hydroxyproline-rich glycoprotein [Populus tr...   123   7e-26
ref|XP_004496294.1| PREDICTED: uncharacterized protein LOC101493...   121   3e-25
emb|CBI26785.3| unnamed protein product [Vitis vinifera]              121   3e-25
ref|XP_006379789.1| hypothetical protein POPTR_0008s13830g [Popu...   120   6e-25
gb|ABK95394.1| unknown [Populus trichocarpa]                          120   6e-25
gb|EOY01300.1| Hydroxyproline-rich glycoprotein family protein, ...   117   5e-24
ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309...   116   1e-23
gb|EOY01303.1| Hydroxyproline-rich glycoprotein family protein, ...   112   1e-22
gb|EOY01299.1| Hydroxyproline-rich glycoprotein family protein, ...   112   1e-22
ref|XP_002527549.1| conserved hypothetical protein [Ricinus comm...   112   1e-22
ref|XP_004142291.1| PREDICTED: uncharacterized protein LOC101210...   111   3e-22
gb|EPS61205.1| hypothetical protein M569_13593, partial [Genlise...   109   1e-21
ref|XP_004236917.1| PREDICTED: uncharacterized protein LOC101261...   109   1e-21
ref|XP_006849434.1| hypothetical protein AMTR_s00024p00040890 [A...   108   3e-21

>gb|EMJ26321.1| hypothetical protein PRUPE_ppa002630mg [Prunus persica]
          Length = 650

 Score =  166 bits (419), Expect = 1e-38
 Identities = 102/264 (38%), Positives = 138/264 (52%), Gaps = 43/264 (16%)
 Frame = +3

Query: 171 KREGVLRRLRGEFATANAIIEALCCHLKAVGEPGEYDSVLGSIEQIRSSWDPIL------ 332
           +R+G +  LRGEFA ANAII++LC HL+AVGEPGEYD V+G I+Q R +W+P+L      
Sbjct: 41  ERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRCNWNPVLHMQQYF 100

Query: 333 ---------------RRQR-----------FSVAEVGYLTRRQQQNAVRMXXXXXXXXXX 434
                          R+QR           F  + VG+   +Q+  A +           
Sbjct: 101 SVAEVIYALQHVAWRRQQRYYDPVKAGAKEFKRSGVGFNKGQQRAEAFKEGHNSTLESHS 160

Query: 435 XXXXKSEA---DNFEIGQ--------GSTQGXXXXXXXXXXXXGSCRVDGSELARDEKQN 581
                S     + FE G         G   G                   S   +++KQN
Sbjct: 161 NDGNSSGVVAPEKFERGSEVGEEVEPGGEVGKLNDKGLAPAGEKKVNESHSIQIQNQKQN 220

Query: 582 LQVSPKTYDATEICGGESINIAEGLKQYENLFDDSEISKLITLVNGLRHSGWRGQLQGQT 761
           L + PKT+   EI  G+++N+ +GLK YE+   D+E+SKL++LVN LR +G R QLQGQT
Sbjct: 221 LSIVPKTFIGNEISDGKTVNVVDGLKLYEDFLGDTEVSKLVSLVNDLRAAGKRRQLQGQT 280

Query: 762 FVTSKRPMNGHGREMIQFGVPIAD 833
           +V SKRPM GHGREMIQ G+PIAD
Sbjct: 281 YVVSKRPMKGHGREMIQLGIPIAD 304


>ref|XP_006355042.1| PREDICTED: uncharacterized protein LOC102600383 [Solanum tuberosum]
          Length = 638

 Score =  162 bits (411), Expect = 1e-37
 Identities = 108/299 (36%), Positives = 151/299 (50%), Gaps = 63/299 (21%)
 Frame = +3

Query: 126 RWYNHQQQPPY-HQMGKREGVLRRLRGEFATANAIIEALCCHLKAVGEPGEYDSVLGSIE 302
           R + HQQQ  +  Q+ +R+G +  LRGEFA +NAII+ALC HL+ VGEPGEYD V+G ++
Sbjct: 30  RQHQHQQQWFHPQQVDERDGFISWLRGEFAASNAIIDALCHHLRLVGEPGEYDGVIGCVQ 89

Query: 303 QIRSSWDPILRRQRF-SVAEV-----------------GYLTRRQQQNAVRMXXXXXXXX 428
           Q R++W+ +L  Q++ SVAEV                 G + + +++N  R         
Sbjct: 90  QRRANWNSVLHMQQYHSVAEVIYSLHQVEWMKQQKGFDGGVKKVEKRNGSRGGGGGWKSE 149

Query: 429 XXXXXXKSEADNF-----------------EIGQGSTQGXXXXXXXXXXXXGSC------ 539
                 +S+  NF                 E+ QG  +              S       
Sbjct: 150 GLKDGKESQGQNFSLDAHSKTNGVEKIDVVEVKQGEKKELAANPEANSSVKSSVCTEAGD 209

Query: 540 ---RVDGSELARD-----------EKQNLQVS-------PKTYDATEICGGESINIAEGL 656
               VD ++  RD           E  ++QV        PKT+ ATEI  G+ +N+ +G+
Sbjct: 210 SQGEVDKTDDKRDSNSEGSSNVESESHSIQVPTEKQNVVPKTFVATEIYDGKPVNVVDGM 269

Query: 657 KQYENLFDDSEISKLITLVNGLRHSGWRGQLQGQTFVTSKRPMNGHGREMIQFGVPIAD 833
           K YE L   SE+SKL+TLVN LR +G RGQL  Q F+ SKRPM GHGREM+Q G+PI D
Sbjct: 270 KLYEELLSSSEVSKLLTLVNDLRAAGRRGQLPAQAFIVSKRPMKGHGREMVQLGLPIVD 328


>gb|ESW30780.1| hypothetical protein PHAVU_002G181800g [Phaseolus vulgaris]
          Length = 630

 Score =  154 bits (389), Expect = 4e-35
 Identities = 106/276 (38%), Positives = 145/276 (52%), Gaps = 46/276 (16%)
 Frame = +3

Query: 144 QQPPYHQ---MGKREGVLRRLRGEFATANAIIEALCCHLKAVGEPGEYDSVLGSIEQIRS 314
           QQ  Y Q   + +R+G++  LR EFA ANAII++LC HL+ VG+PGEYD V+G+I+Q R 
Sbjct: 29  QQHHYRQQWFVDERDGLIGWLRSEFAAANAIIDSLCHHLRVVGDPGEYDMVIGAIQQRRC 88

Query: 315 SWDPILRRQR-FSVAEVGYLT-----RRQQQ---------NAVRMXXXXXXXXXXXXXXK 449
           +W+ +L  Q+ FSVA+V Y       R+QQ+           VR               K
Sbjct: 89  NWNQVLLMQQYFSVADVTYTLQQVAWRKQQRPLDPVKVGAKEVRKPGPGYRYGHRFEPSK 148

Query: 450 ----------SEADNFEIGQGSTQGXXXXXXXXXXXXGSC--RVDGSELARDEK------ 575
                     S   N    +G  +G            GS   +V    LA  E+      
Sbjct: 149 EGYNSSVESYSHDGNATFTRGMEKGTPTVDKSEEHKSGSKVEKVGDKGLASPEEKKGNDS 208

Query: 576 ---------QNLQVSPKTYDATEICGGESINIAEGLKQYENLFDDSEISKLITLVNGLRH 728
                    Q+     KT+   E+  G+ +N+A+GLK YE++FD +E+S L++LVN LR 
Sbjct: 209 DSVESQHQSQSFSTIAKTFIGNEMIDGKMVNLADGLKLYEDIFDSTEVSNLVSLVNDLRI 268

Query: 729 SGWRGQLQG-QTFVTSKRPMNGHGREMIQFGVPIAD 833
           SG +GQLQG Q +V S+RPM GHGREMIQ GVPIAD
Sbjct: 269 SGKKGQLQGNQAYVVSRRPMKGHGREMIQLGVPIAD 304


>ref|XP_006580091.1| PREDICTED: uncharacterized protein LOC100809865 isoform X2 [Glycine
           max]
          Length = 641

 Score =  153 bits (387), Expect = 6e-35
 Identities = 103/277 (37%), Positives = 145/277 (52%), Gaps = 48/277 (17%)
 Frame = +3

Query: 147 QPPYHQ---MGKREGVLRRLRGEFATANAIIEALCCHLKAVGEPGEYDSVLGSIEQIRSS 317
           QP Y Q   + +R+G++  LR EFA ANAII++LC HL+ VG+PGEYD V+G+I+Q R +
Sbjct: 32  QPHYCQQWFVDERDGLIGWLRSEFAAANAIIDSLCHHLRVVGDPGEYDMVIGAIQQRRCN 91

Query: 318 WDPILRRQR-FSVAEVGYLT-----RRQQ--------------------QNAVRMXXXXX 419
           W+ +L  Q+ FSVA+V +       RRQQ                    ++  R      
Sbjct: 92  WNQVLMMQQYFSVADVAHALQQVAWRRQQRPLDPVKVGAKEFRKSGSGYRHGQRFEPVKE 151

Query: 420 XXXXXXXXXKSEADNFEIGQGSTQGXXXXXXXXXXXXGSCRVD-----GSELARDEK--- 575
                         N  +  G+ +G            G  +V+     G   A D+K   
Sbjct: 152 GYNSSVESYNQYDANVTVTGGTEKGTPVVEKSEEHKSGG-KVEKVGDKGLASAEDKKGDD 210

Query: 576 ----------QNLQVSPKTYDATEICGGESINIAEGLKQYENLFDDSEISKLITLVNGLR 725
                     Q+L    KT+   E+  G+ +N+ +GLK YE+LFD +EI+ L++LVN LR
Sbjct: 211 SHSVQNQHQSQSLSTKAKTFIGNEMFDGKMVNVVDGLKLYEDLFDSTEIANLVSLVNDLR 270

Query: 726 HSGWRGQLQG-QTFVTSKRPMNGHGREMIQFGVPIAD 833
            SG +GQLQG Q ++ S+RPM GHGREMIQ GVPIAD
Sbjct: 271 VSGKKGQLQGSQAYIVSRRPMKGHGREMIQLGVPIAD 307


>ref|XP_002281644.1| PREDICTED: uncharacterized protein LOC100252594 [Vitis vinifera]
          Length = 698

 Score =  127 bits (318), Expect = 6e-27
 Identities = 68/127 (53%), Positives = 82/127 (64%), Gaps = 5/127 (3%)
 Frame = +3

Query: 468 EIGQGSTQGXXXXXXXXXXXXGSCRVDGSELA-----RDEKQNLQVSPKTYDATEICGGE 632
           E  +GS  G            GSC +     A     ++EK N   SPKT+  TEI  G+
Sbjct: 224 ENSEGSRCGISETEANDMDDGGSCNMIMENNAHPVQNQNEKPNPTTSPKTFVGTEIFDGK 283

Query: 633 SINIAEGLKQYENLFDDSEISKLITLVNGLRHSGWRGQLQGQTFVTSKRPMNGHGREMIQ 812
           ++N+ +GLK YE LFDDSE+SK ++LVN LR +G RGQLQGQTFV SKRPM GHGREMIQ
Sbjct: 284 AVNVVDGLKLYEELFDDSEVSKFVSLVNDLRAAGKRGQLQGQTFVVSKRPMKGHGREMIQ 343

Query: 813 FGVPIAD 833
            GVPIAD
Sbjct: 344 LGVPIAD 350



 Score = 83.2 bits (204), Expect = 1e-13
 Identities = 46/93 (49%), Positives = 64/93 (68%), Gaps = 6/93 (6%)
 Frame = +3

Query: 132 YNHQQQPPYHQMGKREGVLRRLRGEFATANAIIEALCCHLKAVGEPGEYDSVLGSIEQIR 311
           ++H+Q  P     +R+G +  LRGEFA ANAII++LC HL+ +GEPGEYD+V+G I+Q R
Sbjct: 32  HHHRQWFP----DERDGFISWLRGEFAAANAIIDSLCNHLRLIGEPGEYDAVIGCIQQRR 87

Query: 312 SSWDPILRRQR-FSVAEVGYLT-----RRQQQN 392
            +W  +L  Q+ FSVAEV Y       RRQQ++
Sbjct: 88  YNWSSVLHMQQYFSVAEVIYALQQVGWRRQQRH 120


>emb|CAN65462.1| hypothetical protein VITISV_002198 [Vitis vinifera]
          Length = 1145

 Score =  125 bits (315), Expect = 1e-26
 Identities = 60/90 (66%), Positives = 72/90 (80%)
 Frame = +3

Query: 564 RDEKQNLQVSPKTYDATEICGGESINIAEGLKQYENLFDDSEISKLITLVNGLRHSGWRG 743
           ++EK N   SPKT+  TEI  G+++N+ +GLK YE LFDDSE+SK ++LVN LR +G RG
Sbjct: 270 QNEKPNPTTSPKTFVGTEIFDGKAVNVVDGLKLYEELFDDSEVSKFVSLVNDLRAAGKRG 329

Query: 744 QLQGQTFVTSKRPMNGHGREMIQFGVPIAD 833
           QLQGQTFV SKRPM GHGREMIQ GVPIAD
Sbjct: 330 QLQGQTFVVSKRPMKGHGREMIQLGVPIAD 359



 Score = 83.2 bits (204), Expect = 1e-13
 Identities = 46/93 (49%), Positives = 64/93 (68%), Gaps = 6/93 (6%)
 Frame = +3

Query: 132 YNHQQQPPYHQMGKREGVLRRLRGEFATANAIIEALCCHLKAVGEPGEYDSVLGSIEQIR 311
           ++H+Q  P     +R+G +  LRGEFA ANAII++LC HL+ +GEPGEYD+V+G I+Q R
Sbjct: 30  HHHRQWFP----DERDGFISWLRGEFAAANAIIDSLCNHLRLIGEPGEYDAVIGCIQQRR 85

Query: 312 SSWDPILRRQR-FSVAEVGYLT-----RRQQQN 392
            +W  +L  Q+ FSVAEV Y       RRQQ++
Sbjct: 86  YNWSSVLHMQQYFSVAEVIYALQQVGWRRQQRH 118


>ref|XP_002311547.2| hydroxyproline-rich glycoprotein [Populus trichocarpa]
           gi|550333015|gb|EEE88914.2| hydroxyproline-rich
           glycoprotein [Populus trichocarpa]
          Length = 675

 Score =  123 bits (309), Expect = 7e-26
 Identities = 64/129 (49%), Positives = 81/129 (62%)
 Frame = +3

Query: 447 KSEADNFEIGQGSTQGXXXXXXXXXXXXGSCRVDGSELARDEKQNLQVSPKTYDATEICG 626
           KS  DN +   G+ QG                   SE   +EKQNL ++PKT+ A E   
Sbjct: 221 KSHTDNHKNSSGNAQGTFSG--------------NSEAVANEKQNLAITPKTFVAEEKID 266

Query: 627 GESINIAEGLKQYENLFDDSEISKLITLVNGLRHSGWRGQLQGQTFVTSKRPMNGHGREM 806
           G+ +N+ +GLK YENL D  E+SKL++LVN LR +G RGQ QGQT++ SKRPM GHGREM
Sbjct: 267 GQMVNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQGQTYILSKRPMKGHGREM 326

Query: 807 IQFGVPIAD 833
           IQ G+PIAD
Sbjct: 327 IQLGLPIAD 335



 Score = 84.7 bits (208), Expect = 4e-14
 Identities = 49/93 (52%), Positives = 63/93 (67%), Gaps = 9/93 (9%)
 Frame = +3

Query: 138 HQQQPPYHQ---MGKREGVLRRLRGEFATANAIIEALCCHLKAVGEPGEYDSVLGSIEQI 308
           HQ Q   HQ   + +R+G +  LRGEFA ANAII++LC HL+AVGE GEYD V+G I+Q 
Sbjct: 31  HQHQLQRHQWFPVDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEAGEYDLVVGCIQQR 90

Query: 309 RSSWDPILRRQR-FSVAEV-----GYLTRRQQQ 389
           RS+W+ +L  Q+ FSV EV       + RRQQQ
Sbjct: 91  RSNWNHVLHMQQYFSVGEVIVALQQVVLRRQQQ 123


>ref|XP_004496294.1| PREDICTED: uncharacterized protein LOC101493086 [Cicer arietinum]
          Length = 508

 Score =  121 bits (304), Expect = 3e-25
 Identities = 73/226 (32%), Positives = 110/226 (48%), Gaps = 8/226 (3%)
 Frame = +3

Query: 174 REGVLRRLRGEFATANAIIEALCCHLKAVGEPGEYDSVLGSIEQIRSSWDPILRRQRF-S 350
           ++ +L   RGEFA ANAII+ALC HL  +    +Y SV  +I + R  W P+L+ Q++ S
Sbjct: 33  KDAILAWFRGEFAAANAIIDALCTHLSQLSSAADYSSVFAAIHRRRLHWIPVLQMQKYHS 92

Query: 351 VAEVGYLTRRQQQNAVRMXXXXXXXXXXXXXXKSEADNFEIGQGSTQGXXXXXXXXXXXX 530
           +A+V    R+  +N   +              K+EA   E G    +             
Sbjct: 93  IADVALQLRKVDENKNIVEEVRENDVVVEEERKTEAKVIEAGDEHEE--------YDSPE 144

Query: 531 GSCRVDGSELARDEKQNLQVSP-------KTYDATEICGGESINIAEGLKQYENLFDDSE 689
                 GS+  +D   N+ +         K + A E   G  +N+ +GLK YE++F DSE
Sbjct: 145 SEITDSGSQENQDNSMNIDICSNHEECLTKGFSAKESVKGHMVNVVKGLKLYEDIFTDSE 204

Query: 690 ISKLITLVNGLRHSGWRGQLQGQTFVTSKRPMNGHGREMIQFGVPI 827
           + KL   VN +  +G  G L G+TF+   + M G+ RE+IQ GVPI
Sbjct: 205 LCKLSDFVNEIHTAGQNGDLSGETFILFNKQMKGNKRELIQLGVPI 250


>emb|CBI26785.3| unnamed protein product [Vitis vinifera]
          Length = 672

 Score =  121 bits (303), Expect = 3e-25
 Identities = 60/91 (65%), Positives = 72/91 (79%), Gaps = 1/91 (1%)
 Frame = +3

Query: 564 RDEKQNLQVSPKTYDATEICGGESINIAEGLKQYENLFDDSEISKLITLVNGLRHSGWRG 743
           ++EK N   SPKT+  TEI  G+++N+ +GLK YE LFDDSE+SK ++LVN LR +G RG
Sbjct: 267 QNEKPNPTTSPKTFVGTEIFDGKAVNVVDGLKLYEELFDDSEVSKFVSLVNDLRAAGKRG 326

Query: 744 QLQ-GQTFVTSKRPMNGHGREMIQFGVPIAD 833
           QLQ GQTFV SKRPM GHGREMIQ GVPIAD
Sbjct: 327 QLQAGQTFVVSKRPMKGHGREMIQLGVPIAD 357



 Score = 83.2 bits (204), Expect = 1e-13
 Identities = 46/93 (49%), Positives = 64/93 (68%), Gaps = 6/93 (6%)
 Frame = +3

Query: 132 YNHQQQPPYHQMGKREGVLRRLRGEFATANAIIEALCCHLKAVGEPGEYDSVLGSIEQIR 311
           ++H+Q  P     +R+G +  LRGEFA ANAII++LC HL+ +GEPGEYD+V+G I+Q R
Sbjct: 32  HHHRQWFP----DERDGFISWLRGEFAAANAIIDSLCNHLRLIGEPGEYDAVIGCIQQRR 87

Query: 312 SSWDPILRRQR-FSVAEVGYLT-----RRQQQN 392
            +W  +L  Q+ FSVAEV Y       RRQQ++
Sbjct: 88  YNWSSVLHMQQYFSVAEVIYALQQVGWRRQQRH 120


>ref|XP_006379789.1| hypothetical protein POPTR_0008s13830g [Populus trichocarpa]
           gi|550333016|gb|ERP57586.1| hypothetical protein
           POPTR_0008s13830g [Populus trichocarpa]
          Length = 693

 Score =  120 bits (301), Expect = 6e-25
 Identities = 63/133 (47%), Positives = 83/133 (62%), Gaps = 4/133 (3%)
 Frame = +3

Query: 447 KSEADNFEIGQGSTQGXXXXXXXXXXXXGSCRVDGSELA----RDEKQNLQVSPKTYDAT 614
           KS  DN +   G+ QG                 + S+      ++EKQNL ++PKT+ A 
Sbjct: 220 KSHTDNHKNSSGNAQGTFSGNSEAVAVDDRSSPEESDSHPSNNQNEKQNLAITPKTFVAE 279

Query: 615 EICGGESINIAEGLKQYENLFDDSEISKLITLVNGLRHSGWRGQLQGQTFVTSKRPMNGH 794
           E   G+ +N+ +GLK YENL D  E+SKL++LVN LR +G RGQ QGQT++ SKRPM GH
Sbjct: 280 EKIDGQMVNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQGQTYILSKRPMKGH 339

Query: 795 GREMIQFGVPIAD 833
           GREMIQ G+PIAD
Sbjct: 340 GREMIQLGLPIAD 352



 Score = 84.7 bits (208), Expect = 4e-14
 Identities = 49/93 (52%), Positives = 63/93 (67%), Gaps = 9/93 (9%)
 Frame = +3

Query: 138 HQQQPPYHQ---MGKREGVLRRLRGEFATANAIIEALCCHLKAVGEPGEYDSVLGSIEQI 308
           HQ Q   HQ   + +R+G +  LRGEFA ANAII++LC HL+AVGE GEYD V+G I+Q 
Sbjct: 31  HQHQLQRHQWFPVDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEAGEYDLVVGCIQQR 90

Query: 309 RSSWDPILRRQR-FSVAEV-----GYLTRRQQQ 389
           RS+W+ +L  Q+ FSV EV       + RRQQQ
Sbjct: 91  RSNWNHVLHMQQYFSVGEVIVALQQVVLRRQQQ 123


>gb|ABK95394.1| unknown [Populus trichocarpa]
          Length = 694

 Score =  120 bits (301), Expect = 6e-25
 Identities = 63/133 (47%), Positives = 83/133 (62%), Gaps = 4/133 (3%)
 Frame = +3

Query: 447 KSEADNFEIGQGSTQGXXXXXXXXXXXXGSCRVDGSELA----RDEKQNLQVSPKTYDAT 614
           KS  DN +   G+ QG                 + S+      ++EKQNL ++PKT+ A 
Sbjct: 221 KSHTDNHKNSSGNAQGTFSGNSEAVAVDDRSSPEESDSHPSNNQNEKQNLAITPKTFVAE 280

Query: 615 EICGGESINIAEGLKQYENLFDDSEISKLITLVNGLRHSGWRGQLQGQTFVTSKRPMNGH 794
           E   G+ +N+ +GLK YENL D  E+SKL++LVN LR +G RGQ QGQT++ SKRPM GH
Sbjct: 281 EKIDGQMVNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQGQTYILSKRPMKGH 340

Query: 795 GREMIQFGVPIAD 833
           GREMIQ G+PIAD
Sbjct: 341 GREMIQLGLPIAD 353



 Score = 84.7 bits (208), Expect = 4e-14
 Identities = 49/93 (52%), Positives = 63/93 (67%), Gaps = 9/93 (9%)
 Frame = +3

Query: 138 HQQQPPYHQ---MGKREGVLRRLRGEFATANAIIEALCCHLKAVGEPGEYDSVLGSIEQI 308
           HQ Q   HQ   + +R+G +  LRGEFA ANAII++LC HL+AVGE GEYD V+G I+Q 
Sbjct: 31  HQHQLQRHQWFPVDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEAGEYDLVVGCIQQR 90

Query: 309 RSSWDPILRRQR-FSVAEV-----GYLTRRQQQ 389
           RS+W+ +L  Q+ FSV EV       + RRQQQ
Sbjct: 91  RSNWNHVLHMQQYFSVGEVIVALQQVVLRRQQQ 123


>gb|EOY01300.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2
           [Theobroma cacao] gi|508709405|gb|EOY01302.1|
           Hydroxyproline-rich glycoprotein family protein,
           putative isoform 2 [Theobroma cacao]
          Length = 680

 Score =  117 bits (293), Expect = 5e-24
 Identities = 54/90 (60%), Positives = 69/90 (76%)
 Frame = +3

Query: 564 RDEKQNLQVSPKTYDATEICGGESINIAEGLKQYENLFDDSEISKLITLVNGLRHSGWRG 743
           ++EKQNL   PKT+   E+  G+ +N+ +GLK YE LFDD E+  L++LVN LR +G RG
Sbjct: 251 QNEKQNLAAGPKTFVGNEMFDGKMVNVVDGLKLYEELFDDKEVLDLVSLVNDLRAAGKRG 310

Query: 744 QLQGQTFVTSKRPMNGHGREMIQFGVPIAD 833
           QLQGQT+V +KRPM GHGREMIQ G+PIAD
Sbjct: 311 QLQGQTYVAAKRPMKGHGREMIQLGLPIAD 340



 Score = 79.0 bits (193), Expect = 2e-12
 Identities = 40/80 (50%), Positives = 58/80 (72%), Gaps = 1/80 (1%)
 Frame = +3

Query: 132 YNHQQQPPYHQMGKREGVLRRLRGEFATANAIIEALCCHLKAVGEPGEYDSVLGSIEQIR 311
           ++H+Q  P     +R+G +  LRGEFA +NAII++LC HL+ VGE GEY++V+  I+Q R
Sbjct: 46  HHHRQWLP----DERDGFIYWLRGEFAASNAIIDSLCHHLREVGEVGEYEAVIACIQQRR 101

Query: 312 SSWDPILRRQR-FSVAEVGY 368
            +W+P+L  Q+ FSVAEV Y
Sbjct: 102 CNWNPVLHMQQYFSVAEVSY 121


>ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309147 [Fragaria vesca
           subsp. vesca]
          Length = 682

 Score =  116 bits (290), Expect = 1e-23
 Identities = 54/90 (60%), Positives = 69/90 (76%)
 Frame = +3

Query: 564 RDEKQNLQVSPKTYDATEICGGESINIAEGLKQYENLFDDSEISKLITLVNGLRHSGWRG 743
           ++EKQNL + PKT+   E   G+++N+ +GLK YE    D+E+SKL +LVN LR +G RG
Sbjct: 248 QNEKQNLSLIPKTFVGNETFDGKTVNVVDGLKLYEEFLGDTEVSKLFSLVNDLRTTGRRG 307

Query: 744 QLQGQTFVTSKRPMNGHGREMIQFGVPIAD 833
           QLQGQT+V SKRPM GHGREMIQ G+PIAD
Sbjct: 308 QLQGQTYVLSKRPMKGHGREMIQLGIPIAD 337



 Score = 86.3 bits (212), Expect = 1e-14
 Identities = 47/88 (53%), Positives = 60/88 (68%), Gaps = 6/88 (6%)
 Frame = +3

Query: 144 QQPPYHQMGKREGVLRRLRGEFATANAIIEALCCHLKAVGEPGEYDSVLGSIEQIRSSWD 323
           QQP      +R+G +  LRGEFA ANAII++LC HL+AVGEP EYD V+G ++Q R +W 
Sbjct: 31  QQPRQWFPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPSEYDMVIGCVQQRRCNWT 90

Query: 324 PILRRQR-FSVAEVGYLT-----RRQQQ 389
           P+L  Q+ FSVAEV Y       RRQQ+
Sbjct: 91  PVLHMQQYFSVAEVIYALQQVAWRRQQR 118


>gb|EOY01303.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 5
           [Theobroma cacao]
          Length = 572

 Score =  112 bits (281), Expect = 1e-22
 Identities = 54/91 (59%), Positives = 69/91 (75%), Gaps = 1/91 (1%)
 Frame = +3

Query: 564 RDEKQNLQVSPKTYDATEICGGESINIAEGLKQYENLFDDSEISKLITLVNGLRHSGWRG 743
           ++EKQNL   PKT+   E+  G+ +N+ +GLK YE LFDD E+  L++LVN LR +G RG
Sbjct: 142 QNEKQNLAAGPKTFVGNEMFDGKMVNVVDGLKLYEELFDDKEVLDLVSLVNDLRAAGKRG 201

Query: 744 QLQ-GQTFVTSKRPMNGHGREMIQFGVPIAD 833
           QLQ GQT+V +KRPM GHGREMIQ G+PIAD
Sbjct: 202 QLQAGQTYVAAKRPMKGHGREMIQLGLPIAD 232


>gb|EOY01299.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1
           [Theobroma cacao] gi|508709404|gb|EOY01301.1|
           Hydroxyproline-rich glycoprotein family protein,
           putative isoform 1 [Theobroma cacao]
          Length = 681

 Score =  112 bits (281), Expect = 1e-22
 Identities = 54/91 (59%), Positives = 69/91 (75%), Gaps = 1/91 (1%)
 Frame = +3

Query: 564 RDEKQNLQVSPKTYDATEICGGESINIAEGLKQYENLFDDSEISKLITLVNGLRHSGWRG 743
           ++EKQNL   PKT+   E+  G+ +N+ +GLK YE LFDD E+  L++LVN LR +G RG
Sbjct: 251 QNEKQNLAAGPKTFVGNEMFDGKMVNVVDGLKLYEELFDDKEVLDLVSLVNDLRAAGKRG 310

Query: 744 QLQ-GQTFVTSKRPMNGHGREMIQFGVPIAD 833
           QLQ GQT+V +KRPM GHGREMIQ G+PIAD
Sbjct: 311 QLQAGQTYVAAKRPMKGHGREMIQLGLPIAD 341



 Score = 79.0 bits (193), Expect = 2e-12
 Identities = 40/80 (50%), Positives = 58/80 (72%), Gaps = 1/80 (1%)
 Frame = +3

Query: 132 YNHQQQPPYHQMGKREGVLRRLRGEFATANAIIEALCCHLKAVGEPGEYDSVLGSIEQIR 311
           ++H+Q  P     +R+G +  LRGEFA +NAII++LC HL+ VGE GEY++V+  I+Q R
Sbjct: 46  HHHRQWLP----DERDGFIYWLRGEFAASNAIIDSLCHHLREVGEVGEYEAVIACIQQRR 101

Query: 312 SSWDPILRRQR-FSVAEVGY 368
            +W+P+L  Q+ FSVAEV Y
Sbjct: 102 CNWNPVLHMQQYFSVAEVSY 121


>ref|XP_002527549.1| conserved hypothetical protein [Ricinus communis]
           gi|223533099|gb|EEF34858.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 697

 Score =  112 bits (281), Expect = 1e-22
 Identities = 52/87 (59%), Positives = 66/87 (75%)
 Frame = +3

Query: 573 KQNLQVSPKTYDATEICGGESINIAEGLKQYENLFDDSEISKLITLVNGLRHSGWRGQLQ 752
           K NL  +PKT+   E+  G+S+N+ +GLK YE L DD E+SKL++LVN LR +G +GQ Q
Sbjct: 270 KLNLTTTPKTFVGAEMVDGKSVNVVDGLKLYEQLLDDVEVSKLVSLVNDLRAAGRKGQFQ 329

Query: 753 GQTFVTSKRPMNGHGREMIQFGVPIAD 833
           GQ +V SKRPM GHGREMIQ G+PIAD
Sbjct: 330 GQAYVVSKRPMKGHGREMIQLGLPIAD 356



 Score = 85.1 bits (209), Expect = 3e-14
 Identities = 47/95 (49%), Positives = 64/95 (67%), Gaps = 11/95 (11%)
 Frame = +3

Query: 141 QQQPPYHQ-----MGKREGVLRRLRGEFATANAIIEALCCHLKAVGEPGEYDSVLGSIEQ 305
           QQQ  +H+     + +R+G +  LRGEFA ANAII++LC HL+A GEPGEYD V+G I+Q
Sbjct: 36  QQQQHHHRHQWFPVDERDGFISWLRGEFAAANAIIDSLCHHLRAAGEPGEYDVVIGCIQQ 95

Query: 306 IRSSWDPILRRQR-FSVAEV-----GYLTRRQQQN 392
            R +W+P+L  Q+ FSV EV         R+QQQ+
Sbjct: 96  RRCNWNPVLHMQQYFSVGEVILALQQVALRKQQQH 130


>ref|XP_004142291.1| PREDICTED: uncharacterized protein LOC101210274 [Cucumis sativus]
           gi|449481289|ref|XP_004156139.1| PREDICTED:
           uncharacterized LOC101210274 [Cucumis sativus]
          Length = 684

 Score =  111 bits (278), Expect = 3e-22
 Identities = 52/91 (57%), Positives = 70/91 (76%)
 Frame = +3

Query: 561 ARDEKQNLQVSPKTYDATEICGGESINIAEGLKQYENLFDDSEISKLITLVNGLRHSGWR 740
           +++ KQ    +P+T+ A+E+  G+ +N+ +GLK +E L DD+E+SKL++LVN LR SG R
Sbjct: 257 SQNGKQYAATTPRTFVASEMFDGKMVNVMDGLKLFEELLDDAEVSKLLSLVNDLRASGKR 316

Query: 741 GQLQGQTFVTSKRPMNGHGREMIQFGVPIAD 833
           GQ QGQT+V SKRPM GHGREMIQ G PIAD
Sbjct: 317 GQFQGQTYVVSKRPMKGHGREMIQLGFPIAD 347



 Score = 90.9 bits (224), Expect = 5e-16
 Identities = 49/89 (55%), Positives = 62/89 (69%), Gaps = 5/89 (5%)
 Frame = +3

Query: 138 HQQQPPYHQMGKREGVLRRLRGEFATANAIIEALCCHLKAVGEPGEYDSVLGSIEQIRSS 317
           HQ  P      +R+G +  LRGEFA +NAII+ALC HL+AVGEPGEYD V+G I+Q R +
Sbjct: 32  HQHHPRPWFPDERDGFISWLRGEFAASNAIIDALCHHLRAVGEPGEYDMVIGCIQQRRCN 91

Query: 318 WDPILRRQR-FSVAEVGY----LTRRQQQ 389
           W P+L  Q+ FSVAEV Y    +T R+QQ
Sbjct: 92  WTPVLHMQQYFSVAEVMYALQQVTSRRQQ 120


>gb|EPS61205.1| hypothetical protein M569_13593, partial [Genlisea aurea]
          Length = 275

 Score =  109 bits (273), Expect = 1e-21
 Identities = 60/98 (61%), Positives = 68/98 (69%)
 Frame = +3

Query: 531 GSCRVDGSELARDEKQNLQVSPKTYDATEICGGESINIAEGLKQYENLFDDSEISKLITL 710
           G+   D S  A  E       PKT+ A+EI  G+S+NIAEGLK YE+L DDSEISKLITL
Sbjct: 40  GTMNGDASGFASTENP-----PKTFVASEIHDGKSVNIAEGLKLYEDLCDDSEISKLITL 94

Query: 711 VNGLRHSGWRGQLQGQTFVTSKRPMNGHGREMIQFGVP 824
           V  LR SG +G+LQGQ FV SKRPM GHGR MIQ G P
Sbjct: 95  VKDLRASGRKGELQGQAFVVSKRPMKGHGRVMIQLGTP 132


>ref|XP_004236917.1| PREDICTED: uncharacterized protein LOC101261013 [Solanum
           lycopersicum]
          Length = 641

 Score =  109 bits (272), Expect = 1e-21
 Identities = 53/88 (60%), Positives = 65/88 (73%)
 Frame = +3

Query: 570 EKQNLQVSPKTYDATEICGGESINIAEGLKQYENLFDDSEISKLITLVNGLRHSGWRGQL 749
           EKQN  V PKT+ ATEI  G+ +N+ +G+K YE L   SE+SKL+TLVN LR +G RGQL
Sbjct: 246 EKQN--VVPKTFVATEIYDGKPVNVVDGMKLYEELLSSSEVSKLVTLVNDLRAAGRRGQL 303

Query: 750 QGQTFVTSKRPMNGHGREMIQFGVPIAD 833
             Q F+ SKRPM GHGREM+Q G+PI D
Sbjct: 304 PAQAFIVSKRPMKGHGREMVQLGLPIVD 331



 Score = 86.7 bits (213), Expect = 1e-14
 Identities = 44/83 (53%), Positives = 62/83 (74%), Gaps = 2/83 (2%)
 Frame = +3

Query: 126 RWYNHQQQPPY-HQMGKREGVLRRLRGEFATANAIIEALCCHLKAVGEPGEYDSVLGSIE 302
           R + HQQQ  +  Q+ +R+G +  LRGEFA +NAII+ALC HL+ VGEPGEYD V+G ++
Sbjct: 32  RQHQHQQQWFHPQQVDERDGFISWLRGEFAASNAIIDALCHHLRLVGEPGEYDGVIGCVQ 91

Query: 303 QIRSSWDPILRRQRF-SVAEVGY 368
           Q R++W+ +L  Q++ SVAEV Y
Sbjct: 92  QRRANWNSVLHMQQYHSVAEVIY 114


>ref|XP_006849434.1| hypothetical protein AMTR_s00024p00040890 [Amborella trichopoda]
           gi|548853009|gb|ERN11015.1| hypothetical protein
           AMTR_s00024p00040890 [Amborella trichopoda]
          Length = 655

 Score =  108 bits (269), Expect = 3e-21
 Identities = 52/96 (54%), Positives = 70/96 (72%)
 Frame = +3

Query: 546 DGSELARDEKQNLQVSPKTYDATEICGGESINIAEGLKQYENLFDDSEISKLITLVNGLR 725
           DG +   +E +++  +PKT+ ATE   G+++N+ EGL+ YE LFD +EIS+L+T  N LR
Sbjct: 248 DGVQKEVEENESVP-APKTFVATEYLDGKAVNVLEGLELYEELFDSTEISRLVTFANELR 306

Query: 726 HSGWRGQLQGQTFVTSKRPMNGHGREMIQFGVPIAD 833
            +G RG +QG TFV SKRPM GHGREMIQ G+PI D
Sbjct: 307 AAGRRGDIQGPTFVVSKRPMRGHGREMIQLGIPIYD 342



 Score = 82.4 bits (202), Expect = 2e-13
 Identities = 41/78 (52%), Positives = 55/78 (70%), Gaps = 1/78 (1%)
 Frame = +3

Query: 138 HQQQPPYHQMGKREGVLRRLRGEFATANAIIEALCCHLKAVGEPGEYDSVLGSIEQIRSS 317
           HQ+Q P+    +R+G +  LR EFA ANAII++LC HLKAVG PGEY++ L  I+Q R +
Sbjct: 28  HQRQQPWFP-DERDGFISWLRSEFAAANAIIDSLCYHLKAVGSPGEYETTLAFIQQRRCN 86

Query: 318 WDPILRRQR-FSVAEVGY 368
           W P+L  Q+ F VAE+ Y
Sbjct: 87  WTPVLHMQQYFPVAEIAY 104

