BLASTX nr result

ID: Mentha25_contig00047456 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00047456
         (653 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU38335.1| hypothetical protein MIMGU_mgv1a007082mg [Mimulus...   134   2e-29
ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626...   124   2e-26
ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citr...   124   2e-26
ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prun...   123   6e-26
ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241...   123   6e-26
gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]     119   8e-25
ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Popu...   119   8e-25
ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Popu...   119   8e-25
ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660...   114   2e-23
ref|XP_007040283.1| Hydroxyproline-rich glycoprotein family prot...   114   3e-23
ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254...   112   8e-23
ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309...   107   4e-21
ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309...   107   4e-21
ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264...   103   6e-20
emb|CAN63074.1| hypothetical protein VITISV_026979 [Vitis vinifera]   103   6e-20
emb|CBI22685.3| unnamed protein product [Vitis vinifera]               99   1e-18
ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260...    98   2e-18
ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583...    95   2e-17
ref|XP_002509822.1| conserved hypothetical protein [Ricinus comm...    95   2e-17
ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family prot...    93   8e-17

>gb|EYU38335.1| hypothetical protein MIMGU_mgv1a007082mg [Mimulus guttatus]
          Length = 420

 Score =  134 bits (337), Expect = 2e-29
 Identities = 96/208 (46%), Positives = 118/208 (56%), Gaps = 31/208 (14%)
 Frame = -2

Query: 652 SSPFPE------HPFFLELRTGNYPPKLLELDKIVLREWESRQGSGTATPD----PRSRD 503
           SSPF +      HPFFLE   GN P +          +WES Q SG  TP     PRSRD
Sbjct: 229 SSPFLDRDFAAVHPFFLEFGGGNPPRR---------DQWESCQESGVVTPTDAVGPRSRD 279

Query: 502 N-FLLNRQDSDVAPVSES---------AVSHRVSFEITNEEVVRCVEKKEPNTGIERSLI 353
           +  LLNRQ+SD++P+ ++         A+ HRVSFEIT E+V+RCVEKK   T  E   +
Sbjct: 280 SCVLLNRQNSDISPLPDNCTGLENDVAAIDHRVSFEITAEKVIRCVEKKSLETAQES--V 337

Query: 352 EKTRVGESSNRKQXXXXXXXXXXXXXRHQKNRTITLGSSKEFNFE--NVDE----DSDWF 191
            K  + E  NR++             RHQKNRTITLGS+KEFNFE  N DE     S+W+
Sbjct: 338 GKKPI-ELINREE----DQTEIVNEKRHQKNRTITLGSTKEFNFEGGNCDEPCVDSSEWW 392

Query: 190 VGE-----EGAGHSEKWSFFPMMQTGVS 122
           V E     EG G SE WSFFP++Q GVS
Sbjct: 393 VNEKKVPKEGGGSSENWSFFPILQPGVS 420


>ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626793 [Citrus sinensis]
          Length = 460

 Score =  124 bits (312), Expect = 2e-26
 Identities = 92/219 (42%), Positives = 116/219 (52%), Gaps = 46/219 (21%)
 Frame = -2

Query: 652 SSPFPEHPF------FLELRTGNYPPKLLELDKIVLREWESRQGSGTATPD---PRSRDN 500
           SSPFP+  F      F +   G+ PPKLL LDK+ +REW SRQGSGT TPD      R+ 
Sbjct: 238 SSPFPDGEFATAGPQFPDFHRGD-PPKLLNLDKLSIREWGSRQGSGTLTPDAVGSTPRNG 296

Query: 499 FLLNRQDSDVA--PVSESA------VSHRVSFEITNEEVVRCVEKKEPNT---GIERSLI 353
           F  NRQ S+VA  P SE+       V HRVSFE+T E+VVRCVEKK P T    +  SL 
Sbjct: 297 FFQNRQISEVALRPHSENGLRKDQIVDHRVSFELTTEDVVRCVEKK-PTTLAEAVSESLQ 355

Query: 352 EKTRV------GESSN---------RKQXXXXXXXXXXXXXRHQKNRTITLGSSKEFNFE 218
             T V      GE+ N                         RHQK ++ITLGS+KEFNF+
Sbjct: 356 NGTTVEKEESSGEAENVHHSCAGEAANDEPLKTPVDVEEAPRHQKQQSITLGSTKEFNFD 415

Query: 217 NVDED-------SDWFVGE----EGAGHSEKWSFFPMMQ 134
           + D D       SDW+  E    + +G  + W+FFP++Q
Sbjct: 416 SADGDSHEPTIASDWWANEKVVGKDSGAIKNWAFFPVIQ 454


>ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citrus clementina]
           gi|557541785|gb|ESR52763.1| hypothetical protein
           CICLE_v10020073mg [Citrus clementina]
          Length = 460

 Score =  124 bits (312), Expect = 2e-26
 Identities = 92/219 (42%), Positives = 116/219 (52%), Gaps = 46/219 (21%)
 Frame = -2

Query: 652 SSPFPEHPF------FLELRTGNYPPKLLELDKIVLREWESRQGSGTATPD---PRSRDN 500
           SSPFP+  F      F +   G+ PPKLL LDK+ +REW SRQGSGT TPD      R+ 
Sbjct: 238 SSPFPDGEFATAGPQFPDFHRGD-PPKLLNLDKLSIREWGSRQGSGTLTPDAVRSTPRNG 296

Query: 499 FLLNRQDSDVA--PVSESA------VSHRVSFEITNEEVVRCVEKKEPNT---GIERSLI 353
           F  NRQ S+VA  P SE+       V HRVSFE+T E+VVRCVEKK P T    +  SL 
Sbjct: 297 FFQNRQISEVALRPHSENGLRKDQIVDHRVSFELTTEDVVRCVEKK-PTTLAEAVSESLQ 355

Query: 352 EKTRV------GESSN---------RKQXXXXXXXXXXXXXRHQKNRTITLGSSKEFNFE 218
             T V      GE+ N                         RHQK ++ITLGS+KEFNF+
Sbjct: 356 NGTTVEKEESSGEAENVHHSCAGEAANDEPLKTPVDVEEAPRHQKQQSITLGSTKEFNFD 415

Query: 217 NVDED-------SDWFVGE----EGAGHSEKWSFFPMMQ 134
           + D D       SDW+  E    + +G  + W+FFP++Q
Sbjct: 416 SADGDSHEPTIASDWWANEKVVGKDSGAIKNWAFFPVIQ 454


>ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica]
           gi|462404864|gb|EMJ10328.1| hypothetical protein
           PRUPE_ppa005552mg [Prunus persica]
          Length = 455

 Score =  123 bits (308), Expect = 6e-26
 Identities = 91/228 (39%), Positives = 116/228 (50%), Gaps = 51/228 (22%)
 Frame = -2

Query: 652 SSPFPEHPF------FLELRTGNYPPKLLELDKIVLREWESRQGSGTATPD---PRSRDN 500
           SSPFP+  F      FLE RTG+ PPKLL LD +  R+W SR GSG+ TPD     S D 
Sbjct: 234 SSPFPDLEFAARGHHFLEFRTGD-PPKLLNLDILSTRDWGSRLGSGSVTPDGAKSTSSDG 292

Query: 499 FLLNRQDSDVA--PVSES-------AVSHRVSFEITNEEVVRCVEKK------------- 386
           FLL  Q  +V   P S +       +++HRVSFE+++EEV+RCVEKK             
Sbjct: 293 FLLKPQTPEVVLNPRSNNRGRNNDISINHRVSFELSSEEVIRCVEKKPVALAEAVSTSLE 352

Query: 385 ---------EPNTGIERSLIEKTRVGESSNRKQXXXXXXXXXXXXXRHQKNRTITLGSSK 233
                    +P+  +  S+     VGE+SN                 H K R+ITLGS K
Sbjct: 353 DTEKAQSKEDPSKVVSSSICP---VGETSN--DAAEKAVADGEEAQLHPKQRSITLGSVK 407

Query: 232 EFNFENVDE-------DSDWFVGE----EGAGHSEKWSFFPMMQTGVS 122
           EFNF+N D         SDW+  E    +  G ++ WSFFPMMQ GVS
Sbjct: 408 EFNFDNPDGGDSGNSIGSDWWANEKVDAKENGPTKNWSFFPMMQPGVS 455


>ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241023 [Vitis vinifera]
          Length = 479

 Score =  123 bits (308), Expect = 6e-26
 Identities = 94/245 (38%), Positives = 113/245 (46%), Gaps = 68/245 (27%)
 Frame = -2

Query: 652 SSPFPEHPF-------FLELRTGNYPPKLLELDKIVLREWESRQGSGTATPD---PRSRD 503
           SSPFP+  F       FLE R G  PPKLL LDK+   EW SR GSG+ TPD   P SRD
Sbjct: 236 SSPFPDRDFVCSGSSQFLEFRAGG-PPKLLTLDKLSNHEWGSRIGSGSITPDALGPPSRD 294

Query: 502 NFLLNRQDSDV---------------------------APVSESAVSHRVSFEITNEEVV 404
             +L+RQ SDV                            P +E  V HRVSFE+T E+VV
Sbjct: 295 GSVLDRQVSDVIHPPSGDDSVLDRQISDVASHSLSDSGCPNNEIMVDHRVSFELTAEDVV 354

Query: 403 RCVEK-------------KEPNT------GIERSLIEKTRVGESSNRKQXXXXXXXXXXX 281
           RCVEK             + P T        E  +  + RVGE++N              
Sbjct: 355 RCVEKDSAALVKAVSASLQNPATVEIDENSREVVVDSEGRVGETANNPPEKAPEDANGEE 414

Query: 280 XXRHQKNRTITLGSSKEFNFENVDE--------DSDWFVGE----EGAGHSEKWSFFPMM 137
              H K R+ITLGS+KEFNF+N D          SDW+  E    +  G S+ WS F MM
Sbjct: 415 GQPHHKQRSITLGSAKEFNFDNADGGHSDKPNISSDWWANEKVVGKEVGASKNWSIFHMM 474

Query: 136 QTGVS 122
           Q  VS
Sbjct: 475 QPSVS 479


>gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]
          Length = 455

 Score =  119 bits (298), Expect = 8e-25
 Identities = 84/220 (38%), Positives = 115/220 (52%), Gaps = 43/220 (19%)
 Frame = -2

Query: 652 SSPFPEH------PFFLELRTGNYPPKLLELDKIVLREWESRQGSGTATPD---PRSRDN 500
           SSPFP+       P FLE RTG+ PPKLL LDK+   +W SRQGSG+ TPD   P S   
Sbjct: 239 SSPFPDPEFAARGPHFLEFRTGD-PPKLLNLDKLSKFDWGSRQGSGSLTPDSVKPISTFE 297

Query: 499 FLLNRQDSDVAPVSESAVSHRVSFEITNEEVVRCVEKK------------------EPNT 374
              + + +     +E+    RVSF+++ E+V+R VEKK                  +   
Sbjct: 298 VAPHLKPNGRCRNAENVADRRVSFDVSTEDVIRYVEKKTVPLAEAMLTSLKDTTMGQREE 357

Query: 373 GIERSLIE----KTRVGESSNRKQXXXXXXXXXXXXXRHQKNRTITLGSSKEFNFENVDE 206
             + + +E    + RVGE+SN  +             +HQK+R+ITLGSSKEFNF+N D 
Sbjct: 358 NSDSNKVEEIGCENRVGETSN--EEPDKAPTSGEEVLQHQKHRSITLGSSKEFNFDNADA 415

Query: 205 D--------SDWFVGEEGAGH----SEKWSFFPMMQTGVS 122
                    SDW+  ++ AG     S+ WSFFPM+Q GVS
Sbjct: 416 GDLHKSDSVSDWWANQKVAGKEGAPSQNWSFFPMIQPGVS 455


>ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
           gi|550346902|gb|ERP65330.1| hypothetical protein
           POPTR_0001s09590g [Populus trichocarpa]
          Length = 452

 Score =  119 bits (298), Expect = 8e-25
 Identities = 91/225 (40%), Positives = 114/225 (50%), Gaps = 48/225 (21%)
 Frame = -2

Query: 652 SSPFPEHPF------FLELRTGNYPPKLLELDKIVLREWESRQGSGTATPDP--RSRDNF 497
           SSPFP+  F      F E R G  PPKLL LDK+   EW S QGSG  TP+   R   NF
Sbjct: 233 SSPFPDGEFAVGGAHFPEFRIGE-PPKLLNLDKLSTCEWGSYQGSGALTPESVRRGSPNF 291

Query: 496 LLNRQDSDVAPVSES--------AVSHRVSFEITNEEVVRCVE----------------- 392
           LL+RQ SDV     S         V+HRVSFE+T E+  RCVE                 
Sbjct: 292 LLHRQFSDVPSRPRSGNGHKNGQVVNHRVSFELTAEDASRCVEEKPAFSIKTVPEYVENG 351

Query: 391 ---KKEPNTGIERSLIEKTRVGESSNRKQXXXXXXXXXXXXXRHQKNRTITLGSSKEFNF 221
              K+E N+G      E  RVG +SN                +H+K ++ITLGS KEFNF
Sbjct: 352 TQAKEEKNSGESIQSFE-CRVGVTSN--DSPEMASTDGEAAPQHRKQQSITLGSVKEFNF 408

Query: 220 ENVDE-------DSDWF-----VGEEGAGHSEKWSFFPMMQTGVS 122
           +N DE        S+W+     +G+EG   ++ WSFFPM+Q+GVS
Sbjct: 409 DNADEGDSRKPSSSNWWANGSVIGKEGE-TTKNWSFFPMVQSGVS 452


>ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
           gi|550346901|gb|EEE82832.2| hypothetical protein
           POPTR_0001s09590g [Populus trichocarpa]
          Length = 453

 Score =  119 bits (298), Expect = 8e-25
 Identities = 91/225 (40%), Positives = 114/225 (50%), Gaps = 48/225 (21%)
 Frame = -2

Query: 652 SSPFPEHPF------FLELRTGNYPPKLLELDKIVLREWESRQGSGTATPDP--RSRDNF 497
           SSPFP+  F      F E R G  PPKLL LDK+   EW S QGSG  TP+   R   NF
Sbjct: 234 SSPFPDGEFAVGGAHFPEFRIGE-PPKLLNLDKLSTCEWGSYQGSGALTPESVRRGSPNF 292

Query: 496 LLNRQDSDVAPVSES--------AVSHRVSFEITNEEVVRCVE----------------- 392
           LL+RQ SDV     S         V+HRVSFE+T E+  RCVE                 
Sbjct: 293 LLHRQFSDVPSRPRSGNGHKNGQVVNHRVSFELTAEDASRCVEEKPAFSIKTVPEYVENG 352

Query: 391 ---KKEPNTGIERSLIEKTRVGESSNRKQXXXXXXXXXXXXXRHQKNRTITLGSSKEFNF 221
              K+E N+G      E  RVG +SN                +H+K ++ITLGS KEFNF
Sbjct: 353 TQAKEEKNSGESIQSFE-CRVGVTSN--DSPEMASTDGEAAPQHRKQQSITLGSVKEFNF 409

Query: 220 ENVDE-------DSDWF-----VGEEGAGHSEKWSFFPMMQTGVS 122
           +N DE        S+W+     +G+EG   ++ WSFFPM+Q+GVS
Sbjct: 410 DNADEGDSRKPSSSNWWANGSVIGKEGE-TTKNWSFFPMVQSGVS 453


>ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660-like [Solanum
           tuberosum]
          Length = 443

 Score =  114 bits (286), Expect = 2e-23
 Identities = 84/198 (42%), Positives = 102/198 (51%), Gaps = 40/198 (20%)
 Frame = -2

Query: 595 PKLLELDKIVLREWESRQGSGTATPD---PRSRDNFLLNRQDSDVAPVSE---------S 452
           P+ L L+KI   EW SRQGSGT TP+   P+  DNFLLN Q+S V  + +         +
Sbjct: 247 PQFLNLEKIAPHEWGSRQGSGTLTPEAVNPKYHDNFLLNYQNSGVHRLPKPFNGWKNDLT 306

Query: 451 AVSHRVSFEITNEEVVRCVEKKEP---NTG------IERSLIEKTRVGESSN-------- 323
            V HRVSFEIT E+VVRCVEKK      TG       ERS   +  + E SN        
Sbjct: 307 VVDHRVSFEITAEDVVRCVEKKPTMMMRTGSVSLQDTERSTKRQENLAEMSNGHDHGGHE 366

Query: 322 -RKQXXXXXXXXXXXXXRHQKNRTITLGSSKEFNFENVDE--------DSDWFVGEEGAG 170
             ++             R QK+R+ITLGSSKEFNF+NVD          SDW+  E+  G
Sbjct: 367 PSREIHEGSSTDGEDGQRQQKHRSITLGSSKEFNFDNVDGGYPDKATIGSDWWANEKVLG 426

Query: 169 HS--EKWSFFPMMQTGVS 122
                 W  FPMMQ GVS
Sbjct: 427 KEPCNNW-IFPMMQPGVS 443


>ref|XP_007040283.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao]
           gi|508777528|gb|EOY24784.1| Hydroxyproline-rich
           glycoprotein family protein [Theobroma cacao]
          Length = 458

 Score =  114 bits (285), Expect = 3e-23
 Identities = 90/225 (40%), Positives = 113/225 (50%), Gaps = 49/225 (21%)
 Frame = -2

Query: 652 SSPFPEHPF-----FLELRTGNYPPKLLELDKIVLREWESRQGSGTATPD---PRSRDNF 497
           SSPF +  F     F E R G+ PPKLL LDK    EW S  GSGT TPD      R+ F
Sbjct: 236 SSPFRDGEFAASLHFPEFRMGD-PPKLLNLDKHSSCEWGSHHGSGTLTPDATRSTPRNGF 294

Query: 496 LLNRQDSDV----------APVSESAVSHRVSFEITNEEVVRCVEKK--EPNTGIERSL- 356
           LL+ Q S++              + A +HRVSFE+T EEVVR +E +   P+  +  SL 
Sbjct: 295 LLDHQISEITSHPHLKNKEVQNDQVAHNHRVSFELTTEEVVRSLEMETATPSEAVSGSLQ 354

Query: 355 IEKT----------------RVGESSNRKQXXXXXXXXXXXXXRHQKNRTITLGSSKEFN 224
           IE T                RVGE+SN +              +H K+++ITLGS+KEFN
Sbjct: 355 IEATRESEEHDTKVVDDYECRVGETSNER--PEKALADREGKPQHHKHQSITLGSAKEFN 412

Query: 223 FENVDE--------DSDWF----VGEEGAGHSEKWSFFPMMQTGV 125
           F+NVD          SDW+    V  +G G    WSFFPMMQ GV
Sbjct: 413 FDNVDGGDAHKPILTSDWWANDKVAGKGGGVPRNWSFFPMMQPGV 457


>ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254118 [Solanum
           lycopersicum]
          Length = 443

 Score =  112 bits (281), Expect = 8e-23
 Identities = 89/217 (41%), Positives = 109/217 (50%), Gaps = 40/217 (18%)
 Frame = -2

Query: 652 SSPFPEHPFFLELRTGNYPPKLLELDKIVLREWESRQGSGTATPD---PRSRDNFLLNRQ 482
           SSPF E     E   G   P+ L L+KI   EW SRQGSGT TP+   P+  D+FLLN Q
Sbjct: 234 SSPFLER----EYTPGR--PQFLNLEKIAPHEWGSRQGSGTLTPEAVNPKYHDSFLLNYQ 287

Query: 481 DSDVAPVSE---------SAVSHRVSFEITNEEVVRCVEKKEP---NTG------IERSL 356
           ++ V  + +         + V HRVSFEIT E+VVRCVEKK      TG       ERS 
Sbjct: 288 NTGVHRLPKPFNGWKNDLTVVDHRVSFEITAEDVVRCVEKKPTMMMRTGSVSLQDTERST 347

Query: 355 IEKTRVGESSN---------RKQXXXXXXXXXXXXXRHQKNRTITLGSSKEFNFENVDE- 206
             +  + E SN          ++             R QK+R+ITLGSSKEFNF+NVD  
Sbjct: 348 KRQENLAEMSNAHDHSGHEPSREIHEGSSTDGEDGQRQQKHRSITLGSSKEFNFDNVDGG 407

Query: 205 -------DSDWFVGEEGAGHS--EKWSFFPMMQTGVS 122
                   SDW+  E+  G      W  FPMMQ GVS
Sbjct: 408 YPDKATIGSDWWANEKVLGKEPCNNW-IFPMMQPGVS 443


>ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309729 isoform 2 [Fragaria
           vesca subsp. vesca]
          Length = 422

 Score =  107 bits (266), Expect = 4e-21
 Identities = 81/227 (35%), Positives = 108/227 (47%), Gaps = 50/227 (22%)
 Frame = -2

Query: 652 SSPFPEHPF------FLELRTGNYPPKLLELDKIVLREWESRQGSGTATPDPR---SRDN 500
           SSPF +  F      FLE RTG   PK+L LD +  R+W SR  SG+ TPD     S + 
Sbjct: 197 SSPFLDSEFASGGHHFLEFRTGE-APKVLNLDILFTRDWGSRLCSGSVTPDAAKSTSSEG 255

Query: 499 F---------LLNRQDSDVAPVSESAVSHRVSFEITNEEVVRCVEKK------------- 386
           F         +LN + +       +++ HRVSFE++ EEVVRCVEKK             
Sbjct: 256 FTLKPYTPEGVLNARSNSRRRNDGASIGHRVSFELSAEEVVRCVEKKPVALAEAVSTSLQ 315

Query: 385 -------EPNTGIERSLIEKTRVGESSNRKQXXXXXXXXXXXXXRHQKNRTITLGSSKEF 227
                  E     E S   +  V ++SN                R+QK R+ITLGS+KEF
Sbjct: 316 SAEKAEREEGPNQEVSSSHECPVVDTSNDSSEKAVGGDAEELSYRYQKERSITLGSAKEF 375

Query: 226 NFENVDE--------DSDWFVGEEGA----GHSEKWSFFPMMQTGVS 122
           NF+N D          +DW+  E+      G S+ WSFFPM+Q G+S
Sbjct: 376 NFDNADGGDSGTSSISTDWWANEKVVLKENGESKNWSFFPMIQPGMS 422


>ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309729 isoform 1 [Fragaria
           vesca subsp. vesca]
          Length = 459

 Score =  107 bits (266), Expect = 4e-21
 Identities = 81/227 (35%), Positives = 108/227 (47%), Gaps = 50/227 (22%)
 Frame = -2

Query: 652 SSPFPEHPF------FLELRTGNYPPKLLELDKIVLREWESRQGSGTATPDPR---SRDN 500
           SSPF +  F      FLE RTG   PK+L LD +  R+W SR  SG+ TPD     S + 
Sbjct: 234 SSPFLDSEFASGGHHFLEFRTGE-APKVLNLDILFTRDWGSRLCSGSVTPDAAKSTSSEG 292

Query: 499 F---------LLNRQDSDVAPVSESAVSHRVSFEITNEEVVRCVEKK------------- 386
           F         +LN + +       +++ HRVSFE++ EEVVRCVEKK             
Sbjct: 293 FTLKPYTPEGVLNARSNSRRRNDGASIGHRVSFELSAEEVVRCVEKKPVALAEAVSTSLQ 352

Query: 385 -------EPNTGIERSLIEKTRVGESSNRKQXXXXXXXXXXXXXRHQKNRTITLGSSKEF 227
                  E     E S   +  V ++SN                R+QK R+ITLGS+KEF
Sbjct: 353 SAEKAEREEGPNQEVSSSHECPVVDTSNDSSEKAVGGDAEELSYRYQKERSITLGSAKEF 412

Query: 226 NFENVDE--------DSDWFVGEEGA----GHSEKWSFFPMMQTGVS 122
           NF+N D          +DW+  E+      G S+ WSFFPM+Q G+S
Sbjct: 413 NFDNADGGDSGTSSISTDWWANEKVVLKENGESKNWSFFPMIQPGMS 459


>ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera]
          Length = 448

 Score =  103 bits (256), Expect = 6e-20
 Identities = 76/224 (33%), Positives = 103/224 (45%), Gaps = 47/224 (20%)
 Frame = -2

Query: 652 SSPFPEHPFFLELRTGNYPPKLLELDKIVLREWESRQGSGTATPD---PRSRDNFLLNRQ 482
           SSPFP+    +E       PKLL  +    R W SR GSG+ TPD   P SRD+FLL  Q
Sbjct: 231 SSPFPDRRPIVEA------PKLLGFEHFSTRRWGSRLGSGSLTPDGAGPASRDSFLLENQ 284

Query: 481 DSDVAPVS---------ESAVSHRVSFEITNEEVVRCVEKKEPNTG----------IERS 359
            S+VA ++         E+ + HRVSFE+  E+V  CVEKK   +           +E  
Sbjct: 285 ISEVASLANSESGSQNGETVIDHRVSFELAGEDVAVCVEKKPVASAETVQNTLQDIVEEG 344

Query: 358 LIEKTRVGESSNR------------KQXXXXXXXXXXXXXRHQKNRTITLGSSKEFNFEN 215
            IE+ R G S +             K               H+K+  I  GS KEFNF+N
Sbjct: 345 EIERERDGISESTENCCEFCVGEALKAASEKASAEGEEEQCHKKHPPIRHGSIKEFNFDN 404

Query: 214 VDED---------SDWFVGE----EGAGHSEKWSFFPMMQTGVS 122
              +         S+W+V E    +G G    W+FFP++Q G+S
Sbjct: 405 TKGEVSAKPNIIGSEWWVNEKVVGKGTGPQTNWTFFPLLQPGIS 448


>emb|CAN63074.1| hypothetical protein VITISV_026979 [Vitis vinifera]
          Length = 385

 Score =  103 bits (256), Expect = 6e-20
 Identities = 76/224 (33%), Positives = 103/224 (45%), Gaps = 47/224 (20%)
 Frame = -2

Query: 652 SSPFPEHPFFLELRTGNYPPKLLELDKIVLREWESRQGSGTATPD---PRSRDNFLLNRQ 482
           SSPFP+    +E       PKLL  +    R W SR GSG+ TPD   P SRD+FLL  Q
Sbjct: 168 SSPFPDRRPIVEA------PKLLGFEHFSTRRWGSRLGSGSLTPDGAGPASRDSFLLENQ 221

Query: 481 DSDVAPVS---------ESAVSHRVSFEITNEEVVRCVEKKEPNTG----------IERS 359
            S+VA ++         E+ + HRVSFE+  E+V  CVEKK   +           +E  
Sbjct: 222 ISEVASLANSESGSQNGETVIDHRVSFELAGEDVAVCVEKKPVASAETVQNTLQDIVEEG 281

Query: 358 LIEKTRVGESSNR------------KQXXXXXXXXXXXXXRHQKNRTITLGSSKEFNFEN 215
            IE+ R G S +             K               H+K+  I  GS KEFNF+N
Sbjct: 282 EIERERDGISESTENCCEFCVGEALKAASEKASAEGEEEQCHKKHPPIRHGSIKEFNFDN 341

Query: 214 VDED---------SDWFVGE----EGAGHSEKWSFFPMMQTGVS 122
              +         S+W+V E    +G G    W+FFP++Q G+S
Sbjct: 342 TKGEVSAKPNIIGSEWWVNEKVVGKGTGPQTNWTFFPLLQPGIS 385


>emb|CBI22685.3| unnamed protein product [Vitis vinifera]
          Length = 295

 Score = 99.0 bits (245), Expect = 1e-18
 Identities = 74/193 (38%), Positives = 98/193 (50%), Gaps = 25/193 (12%)
 Frame = -2

Query: 625 FLELRTGNYPPKLLELDKIVLREWESRQGSGTATPD---PRSRDNFLLNRQDSDVAPVS- 458
           FL L T    PKLL  +    R W SR GSG+ TPD   P SRD+FLL  Q S+VA ++ 
Sbjct: 111 FLSL-TALSAPKLLGFEHFSTRRWGSRLGSGSLTPDGAGPASRDSFLLENQISEVASLAN 169

Query: 457 --------ESAVSHRVSFEITNEEVVRCVEKKEPNTGIERSLIEKTRVGESSNRKQXXXX 302
                   E+ + HRVSFE+  E+V  CVEKK P+T    +  E   VGE+   K     
Sbjct: 170 SESGSQNGETVIDHRVSFELAGEDVAVCVEKK-PST---ENCCEFC-VGEA--LKAASEK 222

Query: 301 XXXXXXXXXRHQKNRTITLGSSKEFNFENVDED---------SDWFVGE----EGAGHSE 161
                     H+K+  I  GS KEFNF+N   +         S+W+V E    +G G   
Sbjct: 223 ASAEGEEEQCHKKHPPIRHGSIKEFNFDNTKGEVSAKPNIIGSEWWVNEKVVGKGTGPQT 282

Query: 160 KWSFFPMMQTGVS 122
            W+FFP++Q G+S
Sbjct: 283 NWTFFPLLQPGIS 295


>ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260903 [Solanum
           lycopersicum]
          Length = 470

 Score = 98.2 bits (243), Expect = 2e-18
 Identities = 75/239 (31%), Positives = 106/239 (44%), Gaps = 62/239 (25%)
 Frame = -2

Query: 652 SSPFPEHPFFLELRTGNYPPKLLELDKIVLREWESRQGSGTATP---------------- 521
           SSPFP     +E R G  PPK L  +    R+W SR GSG+ TP                
Sbjct: 233 SSPFPGKCPIIEFRKGE-PPKFLGYEHFSTRKWGSRVGSGSVTPSGWGSRLGSGTLTPNG 291

Query: 520 ---------------DPRSRDNFLLNRQDSDVAP---------VSESAVSHRVSFEITNE 413
                          +P SRD++LL  Q S+VA          + E+ + HRVSFE+T E
Sbjct: 292 GISRLGSGTVTPNGGEPPSRDSYLLENQISEVASLANSDNGSEIGEAVIDHRVSFELTEE 351

Query: 412 EVVRCVEKK------EPNTGIERS--LIEKTRVGES--SNRKQXXXXXXXXXXXXXRHQK 263
           +V  C EK+      +P   ++ S  L  + R G S    +                H+K
Sbjct: 352 DVPSCREKEPVMSHSQPTLPMDVSNLLASEMRSGSSMAEEKTYGSPRKASESGEDECHRK 411

Query: 262 NRTITLGSSKEFNFENV--------DEDSDWFVGEEGA----GHSEKWSFFPMMQTGVS 122
           +R IT GSSK+F+F+NV          D +W+  ++ A    G    W+FFP++Q GVS
Sbjct: 412 HRNITFGSSKDFDFDNVKIEVLEKDSIDCEWWTSDKAAVKESGIQNNWTFFPVLQPGVS 470


>ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583548 [Solanum tuberosum]
          Length = 470

 Score = 95.1 bits (235), Expect = 2e-17
 Identities = 72/239 (30%), Positives = 102/239 (42%), Gaps = 62/239 (25%)
 Frame = -2

Query: 652 SSPFPEHPFFLELRTGNYPPKLLELDKIVLREWESRQGSGTATP---------------- 521
           SSPFP     +E R G  PPK L  +    R+W SR GSG+ TP                
Sbjct: 233 SSPFPGKCPIIEFRKGE-PPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNG 291

Query: 520 ---------------DPRSRDNFLLNRQDSDVAP---------VSESAVSHRVSFEITNE 413
                          +P SRD++LL  Q S+VA          + E  + HRVSFE+T E
Sbjct: 292 GISRLGSGTVTPNGGEPPSRDSYLLEYQISEVASLANSDNGSEIGEGVIDHRVSFELTGE 351

Query: 412 EVVRCVEKKEPNT--------GIERSLIEKTRVGES--SNRKQXXXXXXXXXXXXXRHQK 263
           +V  C EK+   +         +   L  + + G S    +                H+K
Sbjct: 352 DVPSCREKEPVMSHSQQTLPMDVSNLLANEMKSGSSMAEEKTYGSPRKASESGEDQCHRK 411

Query: 262 NRTITLGSSKEFNFENV--------DEDSDWFVGEEGAGH----SEKWSFFPMMQTGVS 122
           +R IT GSSK+F+F+NV          D +W+  ++ AG        W+FFP++Q GVS
Sbjct: 412 HRNITFGSSKDFDFDNVKIEVLEKDSIDCEWWTSDKAAGKESGIQNNWTFFPVLQPGVS 470


>ref|XP_002509822.1| conserved hypothetical protein [Ricinus communis]
           gi|223549721|gb|EEF51209.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 459

 Score = 95.1 bits (235), Expect = 2e-17
 Identities = 77/221 (34%), Positives = 105/221 (47%), Gaps = 45/221 (20%)
 Frame = -2

Query: 652 SSPFPEHPF------FLELRTGNYPPKLLELDKIVLREWESRQGSGTATPDP--RSRDNF 497
           SSPFP+  F      FLE +    PPKLL LDK+ + E  SRQGSGT TPD    +  +F
Sbjct: 241 SSPFPDGEFAAAGPRFLEFQMA-VPPKLLNLDKLSVHECGSRQGSGTLTPDAVRATSCSF 299

Query: 496 LLNRQDSDVAP--------VSESAVSHRVSFEITNEEVVRCVEKKEPN----------TG 371
            L+RQ SD+A           +     RVSF+++ E+ +R  E K  +            
Sbjct: 300 PLDRQCSDIASNRHSDNENKDDQVADLRVSFDLSAEDALRYAEPKPASPVKIMPESMKNE 359

Query: 370 IERSLIEKT---------RVGESSNRKQXXXXXXXXXXXXXRHQKNRTITLGSSKEFNFE 218
           I    ++K+         RVGE+SN                RHQK+RT+TLG+ KEFNF+
Sbjct: 360 IAAEKVQKSSEIRHNFECRVGETSN--GILEQASTGGEKTPRHQKHRTLTLGTFKEFNFD 417

Query: 217 NVD------EDSDWFVGEEGAGH----SEKWSFFPMMQTGV 125
           N D         DW+      G     ++ WSFFP+MQ  +
Sbjct: 418 NADGVPKPSAGPDWWDNGSDVGKEDFTAKNWSFFPVMQPSI 458


>ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family protein isoform 2
           [Theobroma cacao] gi|508776011|gb|EOY23267.1|
           Hydroxyproline-rich glycoprotein family protein isoform
           2 [Theobroma cacao]
          Length = 489

 Score = 92.8 bits (229), Expect = 8e-17
 Identities = 75/253 (29%), Positives = 105/253 (41%), Gaps = 76/253 (30%)
 Frame = -2

Query: 652 SSPFPEHPFFLELRTGNYPPKLLELDKIVLREWESRQGSG----------------TATP 521
           SSPFP+    LE R G   PKLL  +    R+W SR GSG                + TP
Sbjct: 238 SSPFPDRRPILEFRMGE-APKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTP 296

Query: 520 D-------------------PRSRDNFLLNRQDSDVAPVS---------ESAVSHRVSFE 425
           D                   P SRD FL+  Q S+VA ++         E+ V HRVSFE
Sbjct: 297 DGMGLGSRLGSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFE 356

Query: 424 ITNEEVVRCVEKK--------------------EPNTGIERSLIEKTRVGESSNRKQXXX 305
           ++ E+V  C+E K                    +   GI++ L     +       +   
Sbjct: 357 LSGEDVAPCLESKSLLPSRAVSEYPKDLVAEGRKERDGIKKDLESSCELFIRETSNETVE 416

Query: 304 XXXXXXXXXXRHQKNRTITLGSSKEFNFENVDED--------SDWFVGEEGAGHSEK--- 158
                      +QK+R++TLGS KEFNF+N   +        S+W+  E+ AG   +   
Sbjct: 417 KASGEAEEEHSYQKHRSVTLGSIKEFNFDNTKGEASDKPTIRSEWWANEKVAGKEARPGN 476

Query: 157 -WSFFPMMQTGVS 122
            W+FFPM+Q  VS
Sbjct: 477 SWTFFPMLQPEVS 489

