BLASTX nr result

ID: Mentha22_contig00023747 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00023747
         (513 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU19567.1| hypothetical protein MIMGU_mgv1a011273mg [Mimulus...   192   3e-47
ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583...   161   8e-38
ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260...   152   5e-35
ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family prot...   144   1e-32
ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family prot...   144   1e-32
gb|EXB93840.1| hypothetical protein L484_004326 [Morus notabilis]     137   2e-30
ref|XP_004512830.1| PREDICTED: uncharacterized protein LOC101494...   134   1e-29
ref|XP_007219041.1| hypothetical protein PRUPE_ppa004616mg [Prun...   133   3e-29
ref|XP_004157195.1| PREDICTED: uncharacterized protein LOC101225...   131   8e-29
ref|XP_004140832.1| PREDICTED: uncharacterized protein LOC101210...   131   8e-29
ref|XP_007143454.1| hypothetical protein PHAVU_007G073100g [Phas...   129   3e-28
ref|NP_200056.1| hydroxyproline-rich glycoprotein family protein...   127   1e-27
ref|XP_002865912.1| hydroxyproline-rich glycoprotein family prot...   127   2e-27
ref|XP_006280487.1| hypothetical protein CARUB_v10026425mg [Caps...   126   4e-27
gb|AFK46430.1| unknown [Medicago truncatula]                          125   6e-27
ref|XP_002867602.1| hydroxyproline-rich glycoprotein family prot...   125   6e-27
ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264...   125   6e-27
emb|CAN63074.1| hypothetical protein VITISV_026979 [Vitis vinifera]   125   6e-27
ref|XP_006413289.1| hypothetical protein EUTSA_v10025027mg [Eutr...   123   2e-26
ref|XP_002513675.1| conserved hypothetical protein [Ricinus comm...   123   2e-26

>gb|EYU19567.1| hypothetical protein MIMGU_mgv1a011273mg [Mimulus guttatus]
           gi|604299725|gb|EYU19568.1| hypothetical protein
           MIMGU_mgv1a011273mg [Mimulus guttatus]
          Length = 287

 Score =  192 bits (489), Expect = 3e-47
 Identities = 109/199 (54%), Positives = 128/199 (64%), Gaps = 29/199 (14%)
 Frame = +2

Query: 2   FQPYQYPCSPGGHVKSPGSAVSTSGTSSPFPEKRPIVEFRMGEAPKFLGYEHFP---WDS 172
           FQPYQYP SPGGH+KSPGSA+STSGTSSPFP+ R           KF GYEHFP   WDS
Sbjct: 39  FQPYQYPASPGGHIKSPGSAISTSGTSSPFPDNRA----------KFFGYEHFPSYKWDS 88

Query: 173 RVGSGSLTPNGWGSTLGSGALSPNGGEP-------------------LNSDHKSQNDDVV 295
           RVGSGSLTPNGWGS LGSGAL+PNGGEP                    NSD+KSQ+DD V
Sbjct: 89  RVGSGSLTPNGWGSRLGSGALTPNGGEPPSRDSSSILENQIYEVASLANSDNKSQSDDAV 148

Query: 296 VD-HRVSFELFGEDIPTCVVTEPSPSFRNASTDLQEAAVDQT-DKKVSATKNADSFREQL 469
           VD HRVSFELFGEDIPTC V EP+PS + A    +E     T +++   TKN+D+ +++ 
Sbjct: 149 VDHHRVSFELFGEDIPTCTVREPAPSDKEAFIKPREEIRRGTNNEEYFITKNSDNPKKET 208

Query: 470 SRDTA-----NEGEDCHQK 511
             +       +EGED   K
Sbjct: 209 VSEVRGVPLDSEGEDLDLK 227


>ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583548 [Solanum tuberosum]
          Length = 470

 Score =  161 bits (408), Expect = 8e-38
 Identities = 98/210 (46%), Positives = 118/210 (56%), Gaps = 40/210 (19%)
 Frame = +2

Query: 2   FQPYQYPCSPGGHVKSPGSAVSTSGTSSPFPEKRPIVEFRMGEAPKFLGYEHF---PWDS 172
           F PYQ P SPG ++ SPGS VS SGTSSPFP K PI+EFR GE PKFLGYEHF    W S
Sbjct: 207 FVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGKCPIIEFRKGEPPKFLGYEHFSTRKWGS 266

Query: 173 RVGSGSLTPNGWGSTLGSGAL--------------SPNGGEP------------------ 256
           RVGSGSLTP+GWGS LGSG L              +PNGGEP                  
Sbjct: 267 RVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSGTVTPNGGEPPSRDSYLLEYQISEVASL 326

Query: 257 LNSDHKSQNDDVVVDHRVSFELFGEDIPTCVVTEPSPSFRNASTDLQEAAVDQTDKKVSA 436
            NSD+ S+  + V+DHRVSFEL GED+P+C   EP  S        Q   +D ++   + 
Sbjct: 327 ANSDNGSEIGEGVIDHRVSFELTGEDVPSCREKEPVMSHSQ-----QTLPMDVSNLLANE 381

Query: 437 TKNADSFREQLS----RDTANEGED-CHQK 511
            K+  S  E+ +    R  +  GED CH+K
Sbjct: 382 MKSGSSMAEEKTYGSPRKASESGEDQCHRK 411


>ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260903 [Solanum
           lycopersicum]
          Length = 470

 Score =  152 bits (384), Expect = 5e-35
 Identities = 92/210 (43%), Positives = 115/210 (54%), Gaps = 40/210 (19%)
 Frame = +2

Query: 2   FQPYQYPCSPGGHVKSPGSAVSTSGTSSPFPEKRPIVEFRMGEAPKFLGYEHFP------ 163
           F PYQ P SPG ++ SPGS VS SGTSSPFP K PI+EFR GE PKFLGYEHF       
Sbjct: 207 FVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGKCPIIEFRKGEPPKFLGYEHFSTRKWGS 266

Query: 164 -----------WDSRVGSGSLTPNGWGSTLGSGALSPNGGEP------------------ 256
                      W SR+GSG+LTPNG  S LGSG ++PNGGEP                  
Sbjct: 267 RVGSGSVTPSGWGSRLGSGTLTPNGGISRLGSGTVTPNGGEPPSRDSYLLENQISEVASL 326

Query: 257 LNSDHKSQNDDVVVDHRVSFELFGEDIPTCVVTEPSPSFRNASTDLQEAAVDQTDKKVSA 436
            NSD+ S+  + V+DHRVSFEL  ED+P+C   EP  S    +  +     D ++   S 
Sbjct: 327 ANSDNGSEIGEAVIDHRVSFELTEEDVPSCREKEPVMSHSQPTLPM-----DVSNLLASE 381

Query: 437 TKNADSFREQLS----RDTANEGED-CHQK 511
            ++  S  E+ +    R  +  GED CH+K
Sbjct: 382 MRSGSSMAEEKTYGSPRKASESGEDECHRK 411


>ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family protein isoform 2
           [Theobroma cacao] gi|508776011|gb|EOY23267.1|
           Hydroxyproline-rich glycoprotein family protein isoform
           2 [Theobroma cacao]
          Length = 489

 Score =  144 bits (363), Expect = 1e-32
 Identities = 86/203 (42%), Positives = 117/203 (57%), Gaps = 40/203 (19%)
 Frame = +2

Query: 2   FQPYQ-YPCSPGGHVKSPGSAVSTSGTSSPFPEKRPIVEFRMGEAPKFLGYEHF---PWD 169
           FQ YQ YP SPGG++ SPGSA+S SGTSSPFP++RPI+EFRMGEAPK LG+E+F    W 
Sbjct: 211 FQSYQIYPGSPGGNLISPGSAISNSGTSSPFPDRRPILEFRMGEAPKLLGFENFTTRKWG 270

Query: 170 SRVGSGSLTPN------------------GWGSTLGSGALSPNGGEPLNSD--------- 268
           SR+GSGSLTP+                  G GS LGSG+L+P+G  P + D         
Sbjct: 271 SRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGFLVGSQIS 330

Query: 269 ---------HKSQNDDVVVDHRVSFELFGEDIPTCVVTEPSPSFRNASTDLQEAAVDQTD 421
                    +  +ND+ +VDHRVSFEL GED+  C+ ++     R  S   ++   +   
Sbjct: 331 EVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYPKDLVAEGRK 390

Query: 422 KKVSATKNADSFREQLSRDTANE 490
           ++    K+ +S  E   R+T+NE
Sbjct: 391 ERDGIKKDLESSCELFIRETSNE 413


>ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family protein isoform 1
           [Theobroma cacao] gi|508776010|gb|EOY23266.1|
           Hydroxyproline-rich glycoprotein family protein isoform
           1 [Theobroma cacao]
          Length = 485

 Score =  144 bits (363), Expect = 1e-32
 Identities = 86/203 (42%), Positives = 117/203 (57%), Gaps = 40/203 (19%)
 Frame = +2

Query: 2   FQPYQ-YPCSPGGHVKSPGSAVSTSGTSSPFPEKRPIVEFRMGEAPKFLGYEHF---PWD 169
           FQ YQ YP SPGG++ SPGSA+S SGTSSPFP++RPI+EFRMGEAPK LG+E+F    W 
Sbjct: 207 FQSYQIYPGSPGGNLISPGSAISNSGTSSPFPDRRPILEFRMGEAPKLLGFENFTTRKWG 266

Query: 170 SRVGSGSLTPN------------------GWGSTLGSGALSPNGGEPLNSD--------- 268
           SR+GSGSLTP+                  G GS LGSG+L+P+G  P + D         
Sbjct: 267 SRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGFLVGSQIS 326

Query: 269 ---------HKSQNDDVVVDHRVSFELFGEDIPTCVVTEPSPSFRNASTDLQEAAVDQTD 421
                    +  +ND+ +VDHRVSFEL GED+  C+ ++     R  S   ++   +   
Sbjct: 327 EVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYPKDLVAEGRK 386

Query: 422 KKVSATKNADSFREQLSRDTANE 490
           ++    K+ +S  E   R+T+NE
Sbjct: 387 ERDGIKKDLESSCELFIRETSNE 409


>gb|EXB93840.1| hypothetical protein L484_004326 [Morus notabilis]
          Length = 521

 Score =  137 bits (345), Expect = 2e-30
 Identities = 100/251 (39%), Positives = 123/251 (49%), Gaps = 83/251 (33%)
 Frame = +2

Query: 2   FQPYQ-YPCSPGGHVKSPGSAVSTSGTSSPFPEKRPIVEFRMGEAPKFLGYEHFP---WD 169
           FQPYQ YP SPGG++ SPGS VS SGTSSPFP+K PI+ FRMGEAP+ LG+EHF    W 
Sbjct: 208 FQPYQLYPGSPGGNLISPGSVVSNSGTSSPFPDKHPILGFRMGEAPRLLGFEHFTTWKWG 267

Query: 170 SRVGSGSLTPNG----------------------------------WGSTLGSGALSPNG 247
           SR+GSGSLTP+G                                   GS LGSG ++PNG
Sbjct: 268 SRLGSGSLTPDGVGLGSRLGSGSVTPDGVGLGSRLGSGSLTPDGYGLGSRLGSGCMTPNG 327

Query: 248 ---GEPL-------------------------------NSDHKSQNDDVVVDHRVSFELF 325
              G  L                               NSD+  QND  VVDHRVSFEL 
Sbjct: 328 PGLGSRLGSGTLTPDGFLVVSGDSFLLENQISEVASLANSDNGCQNDGSVVDHRVSFELT 387

Query: 326 GEDIPTCVVTE-PSPSFRNASTDLQEAAVDQTDKK--VSA----TKNADSFREQLSRDT- 481
           GED+  C+ ++  S + R  S  L+++  +   KK  +SA    + N  S  E+ S  T 
Sbjct: 388 GEDVARCLASKSASSNGRTTSESLEDSPAECPTKKDGISANNVDSPNDQSCVEETSNKTP 447

Query: 482 ---ANEGEDCH 505
                EGED H
Sbjct: 448 QSDCREGEDDH 458


>ref|XP_004512830.1| PREDICTED: uncharacterized protein LOC101494240 [Cicer arietinum]
          Length = 492

 Score =  134 bits (337), Expect = 1e-29
 Identities = 86/231 (37%), Positives = 112/231 (48%), Gaps = 61/231 (26%)
 Frame = +2

Query: 2   FQPYQ-YPCSPGGHVKSPGSAVSTSGTSSPFPEKRPIVEFRMGEAPKFLGYEHFP---WD 169
           FQPYQ YP SPG  + SPGS +STSGTS+PFP++R  +E   GE PK LG+EHF    W+
Sbjct: 205 FQPYQQYPGSPGAQLVSPGSVISTSGTSTPFPDRRSSLELSRGETPKILGFEHFSTRRWN 264

Query: 170 SRVGSGSLTPNGWG----------------------------------STLGSGALSPNG 247
           SR+GSGSLTP+G G                                  S LGSG+L+P+G
Sbjct: 265 SRIGSGSLTPDGAGQGSRLGSGSLTPDGFAHASRLGSGCTTPDGLGQDSRLGSGSLTPDG 324

Query: 248 GEPL----------------NSDHKSQNDDVVVDHRVSFELFGEDIPTCVVTEPSPSFRN 379
             P                 NS+H SQ++  +VDHRVSFEL GED+  C+  +     RN
Sbjct: 325 AGPTTRESMQNQISEDVSVANSEHGSQSNATLVDHRVSFELTGEDVARCLANKAGALLRN 384

Query: 380 ASTDLQEAAVDQTDKKVSATKNADSFREQLSRDTANE-------GEDCHQK 511
            S+  Q         +    K  +   +  SR T ++       GE C QK
Sbjct: 385 MSSSSQGILAKDPIDRERILKETNGCCDVCSRKTNDKSDNSCAGGEQCCQK 435


>ref|XP_007219041.1| hypothetical protein PRUPE_ppa004616mg [Prunus persica]
           gi|462415503|gb|EMJ20240.1| hypothetical protein
           PRUPE_ppa004616mg [Prunus persica]
          Length = 499

 Score =  133 bits (334), Expect = 3e-29
 Identities = 85/230 (36%), Positives = 115/230 (50%), Gaps = 64/230 (27%)
 Frame = +2

Query: 2   FQPYQ-YPCSPGGHVKSPGSAVSTSGTSSPFPEKRPIVEFRMGEAPKFLGYEHF---PWD 169
           FQPYQ YP SPGG++ SPGSAVS SGTSSPFP++ P++EFRMGEAPK  G++HF    W 
Sbjct: 206 FQPYQQYPGSPGGNLISPGSAVSNSGTSSPFPDRHPVLEFRMGEAPKLFGFDHFTTRKWG 265

Query: 170 SRVGSGSLTPN----------------------------------GWGSTLGSGALSPNG 247
           SR+GSGSLTP+                                  G GS LGSG L+P+G
Sbjct: 266 SRIGSGSLTPDGVGLGSRLGSGSLTPDGNELGSRLGSGCVTPNGAGIGSRLGSGCLTPDG 325

Query: 248 GEPLNSD------------------HKSQNDDVVVDHRVSFELFGEDIPTCVVTEPSPSF 373
             P + D                     Q  + V DHRVSFEL GED+  C+  +   S 
Sbjct: 326 PGPASRDSFLLENQISEVASLANSESGCQTVETVFDHRVSFELTGEDVACCLANKAVASN 385

Query: 374 RNASTDLQEAAVDQTDKKVSATKNADSFRE--------QLSRDTANEGED 499
           R AS   +  A +   ++ + + ++ +  E        ++  + + EGED
Sbjct: 386 RTASGSSKVIASEYPSERDALSSDSSNHCEFSVEESSSRIPENVSGEGED 435


>ref|XP_004157195.1| PREDICTED: uncharacterized protein LOC101225370 [Cucumis sativus]
          Length = 497

 Score =  131 bits (330), Expect = 8e-29
 Identities = 86/231 (37%), Positives = 113/231 (48%), Gaps = 62/231 (26%)
 Frame = +2

Query: 2   FQPYQ-YPCSPGGHVKSPGSAVSTSGTSSPFPEKRPIVEFRMGEAPKFLGYEHF---PWD 169
           FQPYQ YP SPG H+ SPGS +S SGTSSPFP+K PI+EFRM +APK LG EHF    W 
Sbjct: 209 FQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWI 268

Query: 170 SRVGSGSLTPN------------------GWGSTLGSGALSPNG---------------- 247
           SR+GSGSLTP+                  G GS LGSG+++PNG                
Sbjct: 269 SRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMGSRLGSGSVTPNGMRQDSRLGSGTLTPDG 328

Query: 248 ------GEPL------------NSDHKSQNDDVVVDHRVSFELFGEDIPTCVVTEPSPSF 373
                   PL            NS+   QND  V +HRVSFEL GED+  C+  +   S 
Sbjct: 329 LGHGLQDSPLLDNQISEVASLANSETGCQND--VTNHRVSFELTGEDVARCLANKSLTSI 386

Query: 374 RNASTDLQEAAVDQTDKKVSATKNADSFR------EQLSRDTANEGEDCHQ 508
           R  S   ++ +    ++   +++ A++              T  E + C+Q
Sbjct: 387 RTESESPKQTSTSNQNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQ 437


>ref|XP_004140832.1| PREDICTED: uncharacterized protein LOC101210841 [Cucumis sativus]
          Length = 497

 Score =  131 bits (330), Expect = 8e-29
 Identities = 86/231 (37%), Positives = 113/231 (48%), Gaps = 62/231 (26%)
 Frame = +2

Query: 2   FQPYQ-YPCSPGGHVKSPGSAVSTSGTSSPFPEKRPIVEFRMGEAPKFLGYEHF---PWD 169
           FQPYQ YP SPG H+ SPGS +S SGTSSPFP+K PI+EFRM +APK LG EHF    W 
Sbjct: 209 FQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWI 268

Query: 170 SRVGSGSLTPN------------------GWGSTLGSGALSPNG---------------- 247
           SR+GSGSLTP+                  G GS LGSG+++PNG                
Sbjct: 269 SRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMGSRLGSGSVTPNGMRQDSRLGSGTLTPDG 328

Query: 248 ------GEPL------------NSDHKSQNDDVVVDHRVSFELFGEDIPTCVVTEPSPSF 373
                   PL            NS+   QND  V +HRVSFEL GED+  C+  +   S 
Sbjct: 329 LGHGLQDSPLLDNQISEVASLANSETGCQND--VTNHRVSFELTGEDVARCLANKSLTSI 386

Query: 374 RNASTDLQEAAVDQTDKKVSATKNADSFR------EQLSRDTANEGEDCHQ 508
           R  S   ++ +    ++   +++ A++              T  E + C+Q
Sbjct: 387 RTESESPKQTSTSNQNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQ 437


>ref|XP_007143454.1| hypothetical protein PHAVU_007G073100g [Phaseolus vulgaris]
           gi|561016644|gb|ESW15448.1| hypothetical protein
           PHAVU_007G073100g [Phaseolus vulgaris]
          Length = 479

 Score =  129 bits (325), Expect = 3e-28
 Identities = 84/202 (41%), Positives = 112/202 (55%), Gaps = 39/202 (19%)
 Frame = +2

Query: 2   FQPYQ-YPCSPGGHVKSPGSAVSTSGTSSPFPEKRPIVEFRMGEAPKFLGYEHFP---WD 169
           FQ YQ YP SPG  + SP S +STSG+S+PFP+  P++EF  GEA   LG+EHF    W+
Sbjct: 205 FQLYQQYPGSPGPQLISPASIISTSGSSTPFPDTHPLLEFHKGEASNLLGFEHFSTHKWN 264

Query: 170 SRVGSGSLTPN--GWGSTLGSGALSPNGGE------------------------------ 253
           SR+GSGSLTP+  G GS LGSG+L+PN  +                              
Sbjct: 265 SRLGSGSLTPDSTGQGSGLGSGSLTPNAVKLVSSSGCLTPEGVAPTARNGIYVGKQTSEL 324

Query: 254 -PL-NSDHKSQNDDVVVDHRVSFELFGEDIPTCVVTEP-SPSFRNASTDLQEAAVDQTDK 424
            PL NS+++ Q +  +VDHRVSFEL GED+  C+  +  SP   N S   Q A V +   
Sbjct: 325 TPLANSENECQPNAALVDHRVSFELTGEDVARCLANKSGSPLIGNISGSSQGALVGEPVD 384

Query: 425 KVSATKNADSFREQLSRDTANE 490
           +    KN+DS  +  SR T+N+
Sbjct: 385 RERIHKNSDSDCDLCSRKTSND 406


>ref|NP_200056.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis
           thaliana] gi|10177409|dbj|BAB10540.1| unnamed protein
           product [Arabidopsis thaliana]
           gi|40823427|gb|AAR92282.1| At5g52430 [Arabidopsis
           thaliana] gi|56381929|gb|AAV85683.1| At5g52430
           [Arabidopsis thaliana] gi|110738650|dbj|BAF01250.1|
           hypothetical protein [Arabidopsis thaliana]
           gi|332008830|gb|AED96213.1| hydroxyproline-rich
           glycoprotein family protein [Arabidopsis thaliana]
          Length = 438

 Score =  127 bits (320), Expect = 1e-27
 Identities = 82/185 (44%), Positives = 107/185 (57%), Gaps = 28/185 (15%)
 Frame = +2

Query: 20  PCSPGG-HVKSPGSAVSTSGTSSPFPEKRPIVEFRMGEAPKFLGYEHF---PWDSRVGSG 187
           P SPGG ++ SPGS +S SGTSSP+P K P+VEFR+GE PKFLG+EHF    W SR GSG
Sbjct: 215 PGSPGGGNLISPGSVISNSGTSSPYPGKSPMVEFRIGEPPKFLGFEHFTARKWGSRFGSG 274

Query: 188 SLTPNGWGSTLGSGALSPNGGE-------------PL-----------NSDHKSQNDDVV 295
           S+TP G GS L SGAL+PNG E             PL           NSDH S  + +V
Sbjct: 275 SITPVGHGSGLASGALTPNGPEIVSGNLTPNNTTWPLQNQISEVASLANSDHGS--EVMV 332

Query: 296 VDHRVSFELFGEDIPTCVVTEPSPSFRNASTDLQEAAVDQTDKKVSATKNADSFREQLSR 475
            DHRVSFEL GED+  C+ ++ + S    + +      D+ + + S++ +     E+ S 
Sbjct: 333 ADHRVSFELTGEDVARCLASKLNRSHDRMNNN------DRIETEESSSTDIRRNIEKRSG 386

Query: 476 DTANE 490
           D  NE
Sbjct: 387 DRENE 391


>ref|XP_002865912.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis lyrata
           subsp. lyrata] gi|297311747|gb|EFH42171.1|
           hydroxyproline-rich glycoprotein family protein
           [Arabidopsis lyrata subsp. lyrata]
          Length = 437

 Score =  127 bits (318), Expect = 2e-27
 Identities = 83/180 (46%), Positives = 105/180 (58%), Gaps = 31/180 (17%)
 Frame = +2

Query: 20  PCSPGG-HVKSPGSAVSTSGTSSPFPEKRPIVEFRMGEAPKFLGYEHF---PWDSRVGSG 187
           P SPGG ++ SPGS +S SGTSSP+P K P+VEFR+GE PKFLG+EHF    W SR GSG
Sbjct: 214 PGSPGGGNLISPGSVISNSGTSSPYPGKSPMVEFRIGEPPKFLGFEHFTARKWGSRFGSG 273

Query: 188 SLTPNGWGSTLGSGALSPNGGE-------------PL-----------NSDHKSQNDDVV 295
           S+TP G GS L SGAL+PNG E             PL           NSDH S  + +V
Sbjct: 274 SITPVGHGSGLASGALTPNGLEIISGNLTPSNTTWPLHNQISEVASLANSDHGS--EVIV 331

Query: 296 VDHRVSFELFGEDIPTCVVTEPSPSF--RNASTDLQEAAVDQTDKKVSATK-NADSFREQ 466
            DHRVSFEL GED+  C+ ++ + S    N +  ++      TD + +  K +AD   EQ
Sbjct: 332 ADHRVSFELTGEDVARCLASKLNRSHDRMNNNDRIETEESSSTDLRRNMEKRSADRETEQ 391


>ref|XP_006280487.1| hypothetical protein CARUB_v10026425mg [Capsella rubella]
           gi|482549191|gb|EOA13385.1| hypothetical protein
           CARUB_v10026425mg [Capsella rubella]
          Length = 437

 Score =  126 bits (316), Expect = 4e-27
 Identities = 77/189 (40%), Positives = 110/189 (58%), Gaps = 37/189 (19%)
 Frame = +2

Query: 20  PCSPGG-HVKSPGSAVSTSGTSSPFPEKRPIVEFRMGEAPKFLGYEHF---PWDSRVGSG 187
           P SPGG ++ SPGS +S SGTSSP+P K P+VEFR+GE PKFLG+EHF    W SR GSG
Sbjct: 210 PGSPGGGNLISPGSVISNSGTSSPYPGKSPMVEFRIGEPPKFLGFEHFTARKWGSRFGSG 269

Query: 188 SLTPNGWGSTLGSGALSPNGGE-------------PL-----------NSDHKSQNDDVV 295
           S+TP G GS + SGAL+PN  E             PL           NSDH S  + +V
Sbjct: 270 SITPVGHGSGMASGALTPNAPEIISGNLTPSNTTWPLQNQISEVASLANSDHGS--EVIV 327

Query: 296 VDHRVSFELFGEDIPTCVVTEPSPSFRNASTDLQEAAVDQTD---------KKVSATKNA 448
            DHRVSFEL GED+  C+ ++ + S    + + + A  + +          +K+ +T+N 
Sbjct: 328 ADHRVSFELTGEDVARCLASKLNRSHDRMNNNDRIATEESSSTDRGRRNSFQKIESTENR 387

Query: 449 DSFREQLSR 475
           ++ ++++ +
Sbjct: 388 ETEQQRIQK 396


>gb|AFK46430.1| unknown [Medicago truncatula]
          Length = 487

 Score =  125 bits (314), Expect = 6e-27
 Identities = 86/226 (38%), Positives = 108/226 (47%), Gaps = 56/226 (24%)
 Frame = +2

Query: 2   FQPYQ-YPCSPGGHVKSPGSAVSTSGTSSPFPEKRPIVEFRMGEAPKFLGYEHFP---WD 169
           +QPYQ YP SPG  + SPGS +STSGTS+PFP++R  +E R GEAPK LG+EHF    W 
Sbjct: 206 YQPYQQYPGSPGAQLVSPGSVISTSGTSTPFPDRRSSLELRKGEAPKILGFEHFSTRKWM 265

Query: 170 SRVGSGS----------------LTPNGWGST------------------LGSGALSPNG 247
           SR+GSGS                LTP+G   T                  LGSG+L+P+G
Sbjct: 266 SRIGSGSLTPDGTGQGSRLGSGSLTPDGVSHTSRLGSGCATPDGLGQDSRLGSGSLTPDG 325

Query: 248 GEP------------------LNSDHKSQNDDVVVDHRVSFELFGEDIPTCVVTEPSPSF 373
             P                   NSDH SQ +  +VDHRVSFEL GED+  C+  +     
Sbjct: 326 VGPTTRGSIDVQNQIPVGVSVANSDHGSQTNATLVDHRVSFELTGEDVARCLANKTGALL 385

Query: 374 RNASTDLQEAAVDQTDKKVSATKNADSFREQLSRDTANEGEDCHQK 511
           RN S+  Q         +    K  +S  +  S   A  GE C  K
Sbjct: 386 RNMSSSSQGILAKDPIDREKILKETNSCCDVCS-GKAIGGEHCCPK 430


>ref|XP_002867602.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis lyrata
           subsp. lyrata] gi|297313438|gb|EFH43861.1|
           hydroxyproline-rich glycoprotein family protein
           [Arabidopsis lyrata subsp. lyrata]
          Length = 421

 Score =  125 bits (314), Expect = 6e-27
 Identities = 75/142 (52%), Positives = 91/142 (64%), Gaps = 23/142 (16%)
 Frame = +2

Query: 2   FQPYQ-YPCSPGGHVKSPGSAVSTSGTSSPFPEKRPIVEFRMGEAPKFLGYEHF---PWD 169
           F+ +Q YP SPGG++ SPGS     GTSSP+P K  I+EFR+GE PKFLG+EHF    W 
Sbjct: 194 FKSHQVYPGSPGGNLISPGS-----GTSSPYPGKCSIIEFRIGEPPKFLGFEHFTARKWG 248

Query: 170 SRVGSGSLTPNGWGSTLGSGALSPNGGEPL----------------NSDHKS--QNDD-V 292
           SR GSGS+TP G GS LGSGAL+P+G  PL                NSDH S   ND+  
Sbjct: 249 SRFGSGSITPAGQGSRLGSGALTPDGLTPLEGSLLDSQITEVASLANSDHGSSRHNDEAA 308

Query: 293 VVDHRVSFELFGEDIPTCVVTE 358
           VV HRVSFEL GED+  C+ ++
Sbjct: 309 VVPHRVSFELTGEDVARCLASK 330


>ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera]
          Length = 448

 Score =  125 bits (314), Expect = 6e-27
 Identities = 83/190 (43%), Positives = 105/190 (55%), Gaps = 20/190 (10%)
 Frame = +2

Query: 2   FQPYQ-YPCSPGGHVKSPGSAVSTSGTSSPFPEKRPIVEFRMGEAPKFLGYEHFP---WD 169
           FQPYQ YP SP GH+ SP   +S SGTSSPFP++RPIVE     APK LG+EHF    W 
Sbjct: 207 FQPYQLYPESPVGHLISP---ISNSGTSSPFPDRRPIVE-----APKLLGFEHFSTRRWG 258

Query: 170 SRVGSGSLTPNGWGSTLGSGALSPNGGEPL----NSDHKSQNDDVVVDHRVSFELFGEDI 337
           SR+GSGSLTP+G G       L  N    +    NS+  SQN + V+DHRVSFEL GED+
Sbjct: 259 SRLGSGSLTPDGAGPASRDSFLLENQISEVASLANSESGSQNGETVIDHRVSFELAGEDV 318

Query: 338 PTCVVTEPSPSFRNASTDLQEAAVD-----QTDKKVSATKNADSF-----REQLSRDTAN 487
             CV  +P  S       LQ+   +     + D    +T+N   F      +  S   + 
Sbjct: 319 AVCVEKKPVASAETVQNTLQDIVEEGEIERERDGISESTENCCEFCVGEALKAASEKASA 378

Query: 488 EGED--CHQK 511
           EGE+  CH+K
Sbjct: 379 EGEEEQCHKK 388


>emb|CAN63074.1| hypothetical protein VITISV_026979 [Vitis vinifera]
          Length = 385

 Score =  125 bits (314), Expect = 6e-27
 Identities = 83/190 (43%), Positives = 105/190 (55%), Gaps = 20/190 (10%)
 Frame = +2

Query: 2   FQPYQ-YPCSPGGHVKSPGSAVSTSGTSSPFPEKRPIVEFRMGEAPKFLGYEHFP---WD 169
           FQPYQ YP SP GH+ SP   +S SGTSSPFP++RPIVE     APK LG+EHF    W 
Sbjct: 144 FQPYQLYPESPVGHLISP---ISNSGTSSPFPDRRPIVE-----APKLLGFEHFSTRRWG 195

Query: 170 SRVGSGSLTPNGWGSTLGSGALSPNGGEPL----NSDHKSQNDDVVVDHRVSFELFGEDI 337
           SR+GSGSLTP+G G       L  N    +    NS+  SQN + V+DHRVSFEL GED+
Sbjct: 196 SRLGSGSLTPDGAGPASRDSFLLENQISEVASLANSESGSQNGETVIDHRVSFELAGEDV 255

Query: 338 PTCVVTEPSPSFRNASTDLQEAAVD-----QTDKKVSATKNADSF-----REQLSRDTAN 487
             CV  +P  S       LQ+   +     + D    +T+N   F      +  S   + 
Sbjct: 256 AVCVEKKPVASAETVQNTLQDIVEEGEIERERDGISESTENCCEFCVGEALKAASEKASA 315

Query: 488 EGED--CHQK 511
           EGE+  CH+K
Sbjct: 316 EGEEEQCHKK 325


>ref|XP_006413289.1| hypothetical protein EUTSA_v10025027mg [Eutrema salsugineum]
           gi|557114459|gb|ESQ54742.1| hypothetical protein
           EUTSA_v10025027mg [Eutrema salsugineum]
          Length = 489

 Score =  123 bits (309), Expect = 2e-26
 Identities = 74/163 (45%), Positives = 88/163 (53%), Gaps = 49/163 (30%)
 Frame = +2

Query: 17  YPCSPGGHVKSPGSAVSTSGTSSPFPEKRPIVEFRMGEAPKFLGYEHFP---WDSRVGSG 187
           +P SPGG++ SPGS +S SGTSSP+P K  I+EFR+GE PKFLG+EHF    W SR GSG
Sbjct: 216 FPGSPGGNLISPGSVISNSGTSSPYPGKCSIIEFRIGEPPKFLGFEHFTARKWGSRFGSG 275

Query: 188 SLTPNGWGSTLGSGALSPNGG---------------------------EPL--------- 259
           S+TP G GS LGSGAL+P+GG                            PL         
Sbjct: 276 SITPAGQGSRLGSGALTPDGGGLGSKLASGAVTPNGAEMVSRKGSGNVTPLESSLLDCQI 335

Query: 260 -------NSDHKSQNDD---VVVDHRVSFELFGEDIPTCVVTE 358
                  NSDH S   D    VV HRVSFEL GED+  C  ++
Sbjct: 336 SEVASLANSDHGSSRHDEAVAVVSHRVSFELTGEDVARCFASK 378


>ref|XP_002513675.1| conserved hypothetical protein [Ricinus communis]
           gi|223547583|gb|EEF49078.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 510

 Score =  123 bits (309), Expect = 2e-26
 Identities = 83/235 (35%), Positives = 117/235 (49%), Gaps = 65/235 (27%)
 Frame = +2

Query: 2   FQPYQ-YPCSPGGHVKSPGSAVSTSGTSSPFPEKRPIVEFRMGEAPKFLGYEHF---PWD 169
           FQ Y  YP SPGG + SPGS +S SGTSSPFP++ PI+EFRMGEAPK LG+EHF    W 
Sbjct: 220 FQSYPLYPGSPGGQLISPGSVISNSGTSSPFPDRYPILEFRMGEAPKLLGFEHFTTRKWG 279

Query: 170 SRVGSGSLTPNGWG----------------------------------STLGSGALSPNG 247
           SR+GSG++TP+G G                                  S LGSG+L+P+ 
Sbjct: 280 SRLGSGTVTPDGVGLGSRLGSGTVTPDGVGQGSRLGSGTVTPDGVGLRSMLGSGSLTPDA 339

Query: 248 GEP------------------LNSDHKSQNDDVVVDHRVSFELFGEDIPTCVVTEPSPSF 373
             P                   NS++ S+ D+ +VDHRVSFEL GE++  C+ ++   S 
Sbjct: 340 VGPASRDGFFLENQISEVASLANSENGSKTDENIVDHRVSFELSGEEVARCLESKSLASC 399

Query: 374 R---------NASTDLQEAAVDQTDKKVSATKNADSFREQLSRDTANEGEDCHQK 511
           R          A   ++   +  TD+ +   + +    E+ S +   E E C++K
Sbjct: 400 RAFSECPPDSMAEDQIKSGKMLMTDENLPTGETSGETPEKPSGEM--EEEHCYRK 452

