BLASTX nr result

ID: Akebia27_contig00014989 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00014989
         (792 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family prot...   246   9e-63
ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family prot...   246   9e-63
ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264...   243   6e-62
emb|CAN63074.1| hypothetical protein VITISV_026979 [Vitis vinifera]   243   6e-62
ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prun...   241   3e-61
ref|XP_007219041.1| hypothetical protein PRUPE_ppa004616mg [Prun...   231   2e-58
ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241...   225   2e-56
ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626...   217   4e-54
ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citr...   214   4e-53
ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309...   211   2e-52
ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309...   211   2e-52
ref|XP_007040283.1| Hydroxyproline-rich glycoprotein family prot...   196   6e-48
ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260...   196   1e-47
ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Popu...   195   2e-47
ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Popu...   195   2e-47
ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583...   190   4e-46
ref|XP_007038760.1| Hydroxyproline-rich glycoprotein family prot...   181   3e-43
gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]     179   8e-43
ref|XP_007143454.1| hypothetical protein PHAVU_007G073100g [Phas...   174   3e-41
ref|XP_003516706.1| PREDICTED: uncharacterized protein LOC100777...   172   1e-40

>ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao] gi|508776011|gb|EOY23267.1| Hydroxyproline-rich
            glycoprotein family protein isoform 2 [Theobroma cacao]
          Length = 489

 Score =  246 bits (627), Expect = 9e-63
 Identities = 145/301 (48%), Positives = 179/301 (59%), Gaps = 38/301 (12%)
 Frame = +2

Query: 2    LLTSSLDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDRE 181
            LLTSSL+R  + SG  QKF  SHYEFQSYQ+YPGSP G+LISP S IS SGTSSPFPDR 
Sbjct: 186  LLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPFPDR- 244

Query: 182  FSSGGHPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTP------------------- 304
                  P LEFR GE PKL  F+  +TRKW    GSGSLTP                   
Sbjct: 245  -----RPILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGM 299

Query: 305  ------------PDPAGPTSRDSFLVENQISEVASLANSNNGSQNNEIVIDHRVSFELTA 448
                        PD  GP SRD FLV +QISEVA LAN  NG +N+E ++DHRVSFEL+ 
Sbjct: 300  GLGSRLGSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELSG 359

Query: 449  EETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPERDGLSSEAENTC---VGETSSNVSG 619
            E+   C+E + ++ S   ++V+   K  +A    ERDG+  + E++C   + ETS+    
Sbjct: 360  EDVAPCLESKSLLPS---RAVSEYPKDLVAEGRKERDGIKKDLESSCELFIRETSNETVE 416

Query: 620  KAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGGTSD----RSDWWANEKVVITKEAG 787
            KA G+ ++E  H  Q+    T+GS+KEF FDNT G  SD    RS+WWANEKV   KEA 
Sbjct: 417  KASGEAEEE--HSYQKHRSVTLGSIKEFNFDNTKGEASDKPTIRSEWWANEKVA-GKEAR 473

Query: 788  P 790
            P
Sbjct: 474  P 474


>ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao] gi|508776010|gb|EOY23266.1| Hydroxyproline-rich
            glycoprotein family protein isoform 1 [Theobroma cacao]
          Length = 485

 Score =  246 bits (627), Expect = 9e-63
 Identities = 145/301 (48%), Positives = 179/301 (59%), Gaps = 38/301 (12%)
 Frame = +2

Query: 2    LLTSSLDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDRE 181
            LLTSSL+R  + SG  QKF  SHYEFQSYQ+YPGSP G+LISP S IS SGTSSPFPDR 
Sbjct: 182  LLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPFPDR- 240

Query: 182  FSSGGHPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTP------------------- 304
                  P LEFR GE PKL  F+  +TRKW    GSGSLTP                   
Sbjct: 241  -----RPILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGM 295

Query: 305  ------------PDPAGPTSRDSFLVENQISEVASLANSNNGSQNNEIVIDHRVSFELTA 448
                        PD  GP SRD FLV +QISEVA LAN  NG +N+E ++DHRVSFEL+ 
Sbjct: 296  GLGSRLGSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELSG 355

Query: 449  EETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPERDGLSSEAENTC---VGETSSNVSG 619
            E+   C+E + ++ S   ++V+   K  +A    ERDG+  + E++C   + ETS+    
Sbjct: 356  EDVAPCLESKSLLPS---RAVSEYPKDLVAEGRKERDGIKKDLESSCELFIRETSNETVE 412

Query: 620  KAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGGTSD----RSDWWANEKVVITKEAG 787
            KA G+ ++E  H  Q+    T+GS+KEF FDNT G  SD    RS+WWANEKV   KEA 
Sbjct: 413  KASGEAEEE--HSYQKHRSVTLGSIKEFNFDNTKGEASDKPTIRSEWWANEKVA-GKEAR 469

Query: 788  P 790
            P
Sbjct: 470  P 470


>ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera]
          Length = 448

 Score =  243 bits (620), Expect = 6e-62
 Identities = 149/271 (54%), Positives = 170/271 (62%), Gaps = 8/271 (2%)
 Frame = +2

Query: 2   LLTSSLDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDRE 181
           LLTSSLDR+ + SG  QK + S+YEFQ YQLYP SPVGHLISP   IS SGTSSPFPDR 
Sbjct: 182 LLTSSLDRSRRNSGTNQKLSLSNYEFQPYQLYPESPVGHLISP---ISNSGTSSPFPDRR 238

Query: 182 FSSGGHPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQIS 361
                 P +E      PKL  F+  STR+W    GSGSLTP D AGP SRDSFL+ENQIS
Sbjct: 239 ------PIVE-----APKLLGFEHFSTRRWGSRLGSGSLTP-DGAGPASRDSFLLENQIS 286

Query: 362 EVASLANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLAT 541
           EVASLANS +GSQN E VIDHRVSFEL  E+   CVEK+P +AS E    T  D      
Sbjct: 287 EVASLANSESGSQNGETVIDHRVSFELAGEDVAVCVEKKP-VASAETVQNTLQDIVEEGE 345

Query: 542 VTPERDGLSSEAENT---CVGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFD 712
           +  ERDG+S   EN    CVGE     S KA  +G++E  H +  P     GS+KEF FD
Sbjct: 346 IERERDGISESTENCCEFCVGEALKAASEKASAEGEEEQCHKKHPP--IRHGSIKEFNFD 403

Query: 713 NTDGGTSDR-----SDWWANEKVVITKEAGP 790
           NT G  S +     S+WW NEKVV  K  GP
Sbjct: 404 NTKGEVSAKPNIIGSEWWVNEKVV-GKGTGP 433


>emb|CAN63074.1| hypothetical protein VITISV_026979 [Vitis vinifera]
          Length = 385

 Score =  243 bits (620), Expect = 6e-62
 Identities = 149/271 (54%), Positives = 170/271 (62%), Gaps = 8/271 (2%)
 Frame = +2

Query: 2   LLTSSLDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDRE 181
           LLTSSLDR+ + SG  QK + S+YEFQ YQLYP SPVGHLISP   IS SGTSSPFPDR 
Sbjct: 119 LLTSSLDRSRRNSGTNQKLSLSNYEFQPYQLYPESPVGHLISP---ISNSGTSSPFPDRR 175

Query: 182 FSSGGHPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQIS 361
                 P +E      PKL  F+  STR+W    GSGSLTP D AGP SRDSFL+ENQIS
Sbjct: 176 ------PIVE-----APKLLGFEHFSTRRWGSRLGSGSLTP-DGAGPASRDSFLLENQIS 223

Query: 362 EVASLANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLAT 541
           EVASLANS +GSQN E VIDHRVSFEL  E+   CVEK+P +AS E    T  D      
Sbjct: 224 EVASLANSESGSQNGETVIDHRVSFELAGEDVAVCVEKKP-VASAETVQNTLQDIVEEGE 282

Query: 542 VTPERDGLSSEAENT---CVGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFD 712
           +  ERDG+S   EN    CVGE     S KA  +G++E  H +  P     GS+KEF FD
Sbjct: 283 IERERDGISESTENCCEFCVGEALKAASEKASAEGEEEQCHKKHPP--IRHGSIKEFNFD 340

Query: 713 NTDGGTSDR-----SDWWANEKVVITKEAGP 790
           NT G  S +     S+WW NEKVV  K  GP
Sbjct: 341 NTKGEVSAKPNIIGSEWWVNEKVV-GKGTGP 370


>ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica]
           gi|462404864|gb|EMJ10328.1| hypothetical protein
           PRUPE_ppa005552mg [Prunus persica]
          Length = 455

 Score =  241 bits (614), Expect = 3e-61
 Identities = 145/262 (55%), Positives = 174/262 (66%), Gaps = 4/262 (1%)
 Frame = +2

Query: 17  LDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGG 196
           LD + +     Q+F  SHYEFQSYQLYPGSPVG LISPSS IS SGTSSPFPD EF++ G
Sbjct: 187 LDPHFRNGEGGQRFPLSHYEFQSYQLYPGSPVGQLISPSSGISGSGTSSPFPDLEFAARG 246

Query: 197 HPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVASL 376
           H FLEFRTG+PPKL + D LSTR W    GSGS+T PD A  TS D FL++ Q  EV   
Sbjct: 247 HHFLEFRTGDPPKLLNLDILSTRDWGSRLGSGSVT-PDGAKSTSSDGFLLKPQTPEVVLN 305

Query: 377 ANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPER 556
             SNN  +NN+I I+HRVSFEL++EE   CVEK+P +A  EA S TSL+ T  A    + 
Sbjct: 306 PRSNNRGRNNDISINHRVSFELSSEEVIRCVEKKP-VALAEAVS-TSLEDTEKA--QSKE 361

Query: 557 DGLSSEAENTC-VGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGGTS 733
           D     + + C VGETS++ + KA  DG++   H +Q+    T+GSVKEF FDN DGG S
Sbjct: 362 DPSKVVSSSICPVGETSNDAAEKAVADGEEAQLHPKQRS--ITLGSVKEFNFDNPDGGDS 419

Query: 734 DR---SDWWANEKVVITKEAGP 790
                SDWWANEK V  KE GP
Sbjct: 420 GNSIGSDWWANEK-VDAKENGP 440


>ref|XP_007219041.1| hypothetical protein PRUPE_ppa004616mg [Prunus persica]
            gi|462415503|gb|EMJ20240.1| hypothetical protein
            PRUPE_ppa004616mg [Prunus persica]
          Length = 499

 Score =  231 bits (590), Expect = 2e-58
 Identities = 143/317 (45%), Positives = 176/317 (55%), Gaps = 54/317 (17%)
 Frame = +2

Query: 2    LLTSSLDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDRE 181
            LLTSSLDRN + SG  QKF  SHYEFQ YQ YPGSP G+LISP S +S SGTSSPFPDR 
Sbjct: 181  LLTSSLDRNRRNSGTNQKFALSHYEFQPYQQYPGSPGGNLISPGSAVSNSGTSSPFPDR- 239

Query: 182  FSSGGHPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTP------------------- 304
                 HP LEFR GE PKL+ FD  +TRKW    GSGSLTP                   
Sbjct: 240  -----HPVLEFRMGEAPKLFGFDHFTTRKWGSRIGSGSLTPDGVGLGSRLGSGSLTPDGN 294

Query: 305  ----------------------------PDPAGPTSRDSFLVENQISEVASLANSNNGSQ 400
                                        PD  GP SRDSFL+ENQISEVASLANS +G Q
Sbjct: 295  ELGSRLGSGCVTPNGAGIGSRLGSGCLTPDGPGPASRDSFLLENQISEVASLANSESGCQ 354

Query: 401  NNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPERDGLSSEAE 580
              E V DHRVSFELT E+   C+  + + ++   ++ +   K   +    ERD LSS++ 
Sbjct: 355  TVETVFDHRVSFELTGEDVACCLANKAVASN---RTASGSSKVIASEYPSERDALSSDSS 411

Query: 581  NTC---VGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGGTSDR---- 739
            N C   V E+SS +     G+G+D+   +R+  S+ T+GS K+F FDNT     ++    
Sbjct: 412  NHCEFSVEESSSRIPENVSGEGEDQ--GYRKHRSI-TLGSTKDFNFDNTKAEVPNKPNIG 468

Query: 740  SDWWANEKVVITKEAGP 790
            S+WWAN K V  KE+ P
Sbjct: 469  SEWWAN-KNVAAKESKP 484


>ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241023 [Vitis vinifera]
          Length = 479

 Score =  225 bits (573), Expect = 2e-56
 Identities = 141/287 (49%), Positives = 173/287 (60%), Gaps = 31/287 (10%)
 Frame = +2

Query: 20   DRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREF-SSGG 196
            D N +      +F  S YEFQSYQLYPGSPVGHLISPSS IS SGTSSPFPDR+F  SG 
Sbjct: 190  DPNNRNGEAGHRFLLSQYEFQSYQLYPGSPVGHLISPSSGISGSGTSSPFPDRDFVCSGS 249

Query: 197  HPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSR--------------- 331
              FLEFR G PPKL + D LS  +W    GSGS+T PD  GP SR               
Sbjct: 250  SQFLEFRAGGPPKLLTLDKLSNHEWGSRIGSGSIT-PDALGPPSRDGSVLDRQVSDVIHP 308

Query: 332  ---DSFLVENQISEVASLANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKE------PM 484
               D  +++ QIS+VAS + S++G  NNEI++DHRVSFELTAE+   CVEK+       +
Sbjct: 309  PSGDDSVLDRQISDVASHSLSDSGCPNNEIMVDHRVSFELTAEDVVRCVEKDSAALVKAV 368

Query: 485  MASLEAKSVTSLDKTTLATVTPERDGLSSEAENTCVGETSSNVSGKAFGD--GDDEVPHH 658
             ASL+  +   +D+ +   V      + SE     VGET++N   KA  D  G++  PHH
Sbjct: 369  SASLQNPATVEIDENSREVV------VDSEGR---VGETANNPPEKAPEDANGEEGQPHH 419

Query: 659  RQQPSLTTIGSVKEFKFDNTDGGTSDR----SDWWANEKVVITKEAG 787
            +Q+    T+GS KEF FDN DGG SD+    SDWWANEKVV  KE G
Sbjct: 420  KQRS--ITLGSAKEFNFDNADGGHSDKPNISSDWWANEKVV-GKEVG 463


>ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626793 [Citrus sinensis]
          Length = 460

 Score =  217 bits (552), Expect = 4e-54
 Identities = 130/257 (50%), Positives = 167/257 (64%), Gaps = 6/257 (2%)
 Frame = +2

Query: 17  LDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGG 196
           LD + +     QKF  S+YEFQSY L+PGSPVG+LISPSS IS SGTSSPFPD EF++ G
Sbjct: 191 LDPSLRFGEQGQKFPFSYYEFQSYHLHPGSPVGNLISPSSGISGSGTSSPFPDGEFATAG 250

Query: 197 HPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVASL 376
             F +F  G+PPKL + D LS R+W   QGSG+LT PD  G T R+ F    QISEVA  
Sbjct: 251 PQFPDFHRGDPPKLLNLDKLSIREWGSRQGSGTLT-PDAVGSTPRNGFFQNRQISEVALR 309

Query: 377 ANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPER 556
            +S NG + ++IV DHRVSFELT E+   CVEK+P   + EA S +  + TT+     E+
Sbjct: 310 PHSENGLRKDQIV-DHRVSFELTTEDVVRCVEKKPTTLA-EAVSESLQNGTTV-----EK 362

Query: 557 DGLSSEAEN---TCVGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGG 727
           +  S EAEN   +C GE +++   K   D  +E P H++Q S+ T+GS KEF FD+ DG 
Sbjct: 363 EESSGEAENVHHSCAGEAANDEPLKTPVD-VEEAPRHQKQQSI-TLGSTKEFNFDSADGD 420

Query: 728 TSD---RSDWWANEKVV 769
           + +    SDWWANEKVV
Sbjct: 421 SHEPTIASDWWANEKVV 437


>ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citrus clementina]
           gi|557541785|gb|ESR52763.1| hypothetical protein
           CICLE_v10020073mg [Citrus clementina]
          Length = 460

 Score =  214 bits (544), Expect = 4e-53
 Identities = 129/257 (50%), Positives = 166/257 (64%), Gaps = 6/257 (2%)
 Frame = +2

Query: 17  LDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGG 196
           LD + +     QKF  S+YEFQSY L+PGSPVG+LISPSS IS SGTSSPFPD EF++ G
Sbjct: 191 LDPSLRFGEQGQKFPFSYYEFQSYHLHPGSPVGNLISPSSGISGSGTSSPFPDGEFATAG 250

Query: 197 HPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVASL 376
             F +F  G+PPKL + D LS R+W   QGSG+LT PD    T R+ F    QISEVA  
Sbjct: 251 PQFPDFHRGDPPKLLNLDKLSIREWGSRQGSGTLT-PDAVRSTPRNGFFQNRQISEVALR 309

Query: 377 ANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPER 556
            +S NG + ++IV DHRVSFELT E+   CVEK+P   + EA S +  + TT+     E+
Sbjct: 310 PHSENGLRKDQIV-DHRVSFELTTEDVVRCVEKKPTTLA-EAVSESLQNGTTV-----EK 362

Query: 557 DGLSSEAEN---TCVGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGG 727
           +  S EAEN   +C GE +++   K   D  +E P H++Q S+ T+GS KEF FD+ DG 
Sbjct: 363 EESSGEAENVHHSCAGEAANDEPLKTPVD-VEEAPRHQKQQSI-TLGSTKEFNFDSADGD 420

Query: 728 TSD---RSDWWANEKVV 769
           + +    SDWWANEKVV
Sbjct: 421 SHEPTIASDWWANEKVV 437


>ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309729 isoform 2 [Fragaria
           vesca subsp. vesca]
          Length = 422

 Score =  211 bits (538), Expect = 2e-52
 Identities = 132/261 (50%), Positives = 159/261 (60%), Gaps = 4/261 (1%)
 Frame = +2

Query: 17  LDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGG 196
           LD N +     Q++  SHYEFQSYQ YPGSPVG LISPSS IS SGTSSPF D EF+SGG
Sbjct: 150 LDSNFRFGEGGQRYPLSHYEFQSYQWYPGSPVGQLISPSSGISGSGTSSPFLDSEFASGG 209

Query: 197 HPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVASL 376
           H FLEFRTGE PK+ + D L TR W     SGS+T PD A  TS + F ++    E    
Sbjct: 210 HHFLEFRTGEAPKVLNLDILFTRDWGSRLCSGSVT-PDAAKSTSSEGFTLKPYTPEGVLN 268

Query: 377 ANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPER 556
           A SN+  +N+   I HRVSFEL+AEE   CVEK+P +A  EA S TSL     A      
Sbjct: 269 ARSNSRRRNDGASIGHRVSFELSAEEVVRCVEKKP-VALAEAVS-TSLQSAEKAEREEGP 326

Query: 557 DGLSSEAENTCVGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGGTSD 736
           +   S +    V +TS++ S KA G   +E+ +  Q+    T+GS KEF FDN DGG S 
Sbjct: 327 NQEVSSSHECPVVDTSNDSSEKAVGGDAEELSYRYQKERSITLGSAKEFNFDNADGGDSG 386

Query: 737 RS----DWWANEKVVITKEAG 787
            S    DWWANEKVV+ KE G
Sbjct: 387 TSSISTDWWANEKVVL-KENG 406


>ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309729 isoform 1 [Fragaria
           vesca subsp. vesca]
          Length = 459

 Score =  211 bits (538), Expect = 2e-52
 Identities = 132/261 (50%), Positives = 159/261 (60%), Gaps = 4/261 (1%)
 Frame = +2

Query: 17  LDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGG 196
           LD N +     Q++  SHYEFQSYQ YPGSPVG LISPSS IS SGTSSPF D EF+SGG
Sbjct: 187 LDSNFRFGEGGQRYPLSHYEFQSYQWYPGSPVGQLISPSSGISGSGTSSPFLDSEFASGG 246

Query: 197 HPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVASL 376
           H FLEFRTGE PK+ + D L TR W     SGS+T PD A  TS + F ++    E    
Sbjct: 247 HHFLEFRTGEAPKVLNLDILFTRDWGSRLCSGSVT-PDAAKSTSSEGFTLKPYTPEGVLN 305

Query: 377 ANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPER 556
           A SN+  +N+   I HRVSFEL+AEE   CVEK+P +A  EA S TSL     A      
Sbjct: 306 ARSNSRRRNDGASIGHRVSFELSAEEVVRCVEKKP-VALAEAVS-TSLQSAEKAEREEGP 363

Query: 557 DGLSSEAENTCVGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGGTSD 736
           +   S +    V +TS++ S KA G   +E+ +  Q+    T+GS KEF FDN DGG S 
Sbjct: 364 NQEVSSSHECPVVDTSNDSSEKAVGGDAEELSYRYQKERSITLGSAKEFNFDNADGGDSG 423

Query: 737 RS----DWWANEKVVITKEAG 787
            S    DWWANEKVV+ KE G
Sbjct: 424 TSSISTDWWANEKVVL-KENG 443


>ref|XP_007040283.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao]
           gi|508777528|gb|EOY24784.1| Hydroxyproline-rich
           glycoprotein family protein [Theobroma cacao]
          Length = 458

 Score =  196 bits (499), Expect = 6e-48
 Identities = 116/244 (47%), Positives = 148/244 (60%), Gaps = 5/244 (2%)
 Frame = +2

Query: 50  QKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGGHPFLEFRTGEP 229
           Q+F  SHYEFQSYQL+PGSPVG LISPSS IS SGTSSPF D EF++  H F EFR G+P
Sbjct: 200 QRFPISHYEFQSYQLHPGSPVGQLISPSSGISGSGTSSPFRDGEFAASLH-FPEFRMGDP 258

Query: 230 PKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVASLAN-SNNGSQNN 406
           PKL + D  S+ +W  H GSG+LT PD    T R+ FL+++QISE+ S  +  N   QN+
Sbjct: 259 PKLLNLDKHSSCEWGSHHGSGTLT-PDATRSTPRNGFLLDHQISEITSHPHLKNKEVQND 317

Query: 407 EIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPERDGLSSEAENT 586
           ++  +HRVSFELT EE    +E E    S        ++ T     + E D    +    
Sbjct: 318 QVAHNHRVSFELTTEEVVRSLEMETATPSEAVSGSLQIEAT---RESEEHDTKVVDDYEC 374

Query: 587 CVGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGGTSDR----SDWWA 754
            VGETS+    KA  D + +  HH+ Q    T+GS KEF FDN DGG + +    SDWWA
Sbjct: 375 RVGETSNERPEKALADREGKPQHHKHQS--ITLGSAKEFNFDNVDGGDAHKPILTSDWWA 432

Query: 755 NEKV 766
           N+KV
Sbjct: 433 NDKV 436


>ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260903 [Solanum
            lycopersicum]
          Length = 470

 Score =  196 bits (497), Expect = 1e-47
 Identities = 131/293 (44%), Positives = 163/293 (55%), Gaps = 31/293 (10%)
 Frame = +2

Query: 2    LLTSSLDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDRE 181
            LLTSSL RN + SG   KF  S YEF  YQ  PGSP  +LISP SV+S SGTSSPFP   
Sbjct: 182  LLTSSLARNRRYSGSNYKFPLSQYEFVPYQ-DPGSPGSNLISPGSVVSNSGTSSPFP--- 237

Query: 182  FSSGGHPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTP------------------- 304
               G  P +EFR GEPPK   ++  STRKW    GSGS+TP                   
Sbjct: 238  ---GKCPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSVTPSGWGSRLGSGTLTPNGGIS 294

Query: 305  --------PDPAGPTSRDSFLVENQISEVASLANSNNGSQNNEIVIDHRVSFELTAEETP 460
                    P+   P SRDS+L+ENQISEVASLANS+NGS+  E VIDHRVSFELT E+ P
Sbjct: 295  RLGSGTVTPNGGEPPSRDSYLLENQISEVASLANSDNGSEIGEAVIDHRVSFELTEEDVP 354

Query: 461  SCVEKEPMMASLEAKSVTSLDKTTLATVTPERDGLSSEAENTCVGETSSNVSGKAFGDGD 640
            SC EKEP+M+   ++    +D + L  +  E    SS AE    G        KA   G+
Sbjct: 355  SCREKEPVMS--HSQPTLPMDVSNL--LASEMRSGSSMAEEKTYGSPR-----KASESGE 405

Query: 641  DEVPHHRQQPSLTTIGSVKEFKFDNTDGGTSDRS----DWWANEKVVITKEAG 787
            DE   HR+  ++ T GS K+F FDN      ++     +WW ++K  + KE+G
Sbjct: 406  DEC--HRKHRNI-TFGSSKDFDFDNVKIEVLEKDSIDCEWWTSDKAAV-KESG 454


>ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
           gi|550346902|gb|ERP65330.1| hypothetical protein
           POPTR_0001s09590g [Populus trichocarpa]
          Length = 452

 Score =  195 bits (495), Expect = 2e-47
 Identities = 114/236 (48%), Positives = 147/236 (62%), Gaps = 3/236 (1%)
 Frame = +2

Query: 71  YEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGGHPFLEFRTGEPPKLWSFD 250
           ++FQSYQ +PGSPVG LISPSS IS SGTSSPFPD EF+ GG  F EFR GEPPKL + D
Sbjct: 204 FDFQSYQFHPGSPVGQLISPSSGISGSGTSSPFPDGEFAVGGAHFPEFRIGEPPKLLNLD 263

Query: 251 GLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVASLANSNNGSQNNEIVIDHRV 430
            LST +W  +QGSG+LTP          +FL+  Q S+V S   S NG +N + V++HRV
Sbjct: 264 KLSTCEWGSYQGSGALTPESVR--RGSPNFLLHRQFSDVPSRPRSGNGHKNGQ-VVNHRV 320

Query: 431 SFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPERDGLSSEAENTCVGETSSN 610
           SFELTAE+   CVE++P   +   K+V    +        +  G S ++    VG TS++
Sbjct: 321 SFELTAEDASRCVEEKP---AFSIKTVPEYVENGTQAKEEKNSGESIQSFECRVGVTSND 377

Query: 611 VSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGGTSDR---SDWWANEKVV 769
               A  DG +  P HR+Q S+ T+GSVKEF FDN D G S +   S+WWAN  V+
Sbjct: 378 SPEMASTDG-EAAPQHRKQQSI-TLGSVKEFNFDNADEGDSRKPSSSNWWANGSVI 431


>ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
           gi|550346901|gb|EEE82832.2| hypothetical protein
           POPTR_0001s09590g [Populus trichocarpa]
          Length = 453

 Score =  195 bits (495), Expect = 2e-47
 Identities = 114/236 (48%), Positives = 147/236 (62%), Gaps = 3/236 (1%)
 Frame = +2

Query: 71  YEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGGHPFLEFRTGEPPKLWSFD 250
           ++FQSYQ +PGSPVG LISPSS IS SGTSSPFPD EF+ GG  F EFR GEPPKL + D
Sbjct: 205 FDFQSYQFHPGSPVGQLISPSSGISGSGTSSPFPDGEFAVGGAHFPEFRIGEPPKLLNLD 264

Query: 251 GLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVASLANSNNGSQNNEIVIDHRV 430
            LST +W  +QGSG+LTP          +FL+  Q S+V S   S NG +N + V++HRV
Sbjct: 265 KLSTCEWGSYQGSGALTPESVR--RGSPNFLLHRQFSDVPSRPRSGNGHKNGQ-VVNHRV 321

Query: 431 SFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPERDGLSSEAENTCVGETSSN 610
           SFELTAE+   CVE++P   +   K+V    +        +  G S ++    VG TS++
Sbjct: 322 SFELTAEDASRCVEEKP---AFSIKTVPEYVENGTQAKEEKNSGESIQSFECRVGVTSND 378

Query: 611 VSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGGTSDR---SDWWANEKVV 769
               A  DG +  P HR+Q S+ T+GSVKEF FDN D G S +   S+WWAN  V+
Sbjct: 379 SPEMASTDG-EAAPQHRKQQSI-TLGSVKEFNFDNADEGDSRKPSSSNWWANGSVI 432


>ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583548 [Solanum tuberosum]
          Length = 470

 Score =  190 bits (483), Expect = 4e-46
 Identities = 130/293 (44%), Positives = 161/293 (54%), Gaps = 31/293 (10%)
 Frame = +2

Query: 2    LLTSSLDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDRE 181
            LLTSSL RN + SG   KF  S YEF  YQ  PGSP  +LISP SV+S SGTSSPFP   
Sbjct: 182  LLTSSLARNRRYSGSNYKFPLSQYEFVPYQ-DPGSPGSNLISPGSVVSNSGTSSPFP--- 237

Query: 182  FSSGGHPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTP------------------- 304
               G  P +EFR GEPPK   ++  STRKW    GSGSLTP                   
Sbjct: 238  ---GKCPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGIS 294

Query: 305  --------PDPAGPTSRDSFLVENQISEVASLANSNNGSQNNEIVIDHRVSFELTAEETP 460
                    P+   P SRDS+L+E QISEVASLANS+NGS+  E VIDHRVSFELT E+ P
Sbjct: 295  RLGSGTVTPNGGEPPSRDSYLLEYQISEVASLANSDNGSEIGEGVIDHRVSFELTGEDVP 354

Query: 461  SCVEKEPMMASLEAKSVTSLDKTTLATVTPERDGLSSEAENTCVGETSSNVSGKAFGDGD 640
            SC EKEP+M+   ++    +D + L  +  E    SS AE    G        KA   G+
Sbjct: 355  SCREKEPVMS--HSQQTLPMDVSNL--LANEMKSGSSMAEEKTYGSPR-----KASESGE 405

Query: 641  DEVPHHRQQPSLTTIGSVKEFKFDNTDGGTSDRS----DWWANEKVVITKEAG 787
            D+   HR+  ++ T GS K+F FDN      ++     +WW ++K    KE+G
Sbjct: 406  DQC--HRKHRNI-TFGSSKDFDFDNVKIEVLEKDSIDCEWWTSDKAA-GKESG 454


>ref|XP_007038760.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao]
            gi|508776005|gb|EOY23261.1| Hydroxyproline-rich
            glycoprotein family protein [Theobroma cacao]
          Length = 540

 Score =  181 bits (459), Expect = 3e-43
 Identities = 114/275 (41%), Positives = 146/275 (53%), Gaps = 44/275 (16%)
 Frame = +2

Query: 98   PGSPVGHLISPS------SVISESGTSSPFPDREFSSGGHPFLEFRTGEPPKLWSFDGLS 259
            P  P   L++ S        IS SGTSSPFPDR       P LEF  GE PKL  F+ L+
Sbjct: 263  PEVPFAQLLASSLESARRKAISNSGTSSPFPDRR------PILEFHMGEAPKLLGFENLT 316

Query: 260  TRKWVPHQGSGSLTP-------------------------------PDPAGPTSRDSFLV 346
            TRKW    GSGSLTP                               PD  GP SRD FL+
Sbjct: 317  TRKWCSRLGSGSLTPDGLGRGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPPSRDGFLL 376

Query: 347  ENQISEVASLANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTSLDK 526
             +QISEVA L N  NG +N+E ++DHRVSFEL+ E+   C+E + ++ S   ++V+   K
Sbjct: 377  GSQISEVALLTNQANGPKNDETIVDHRVSFELSGEDVARCLESKSLLPS---RTVSEYPK 433

Query: 527  TTLATVTPERDGLSSEAENTC---VGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIGSVK 697
              +A    ERDG+  + E++C   + ETS+    KA G  ++E  H  Q+    T+GS+K
Sbjct: 434  DLVAEGRIERDGIKKDLESSCELFIRETSNETVEKASGKAEEE--HSYQKHRSVTLGSIK 491

Query: 698  EFKFDNTDGGTSD----RSDWWANEKVVITKEAGP 790
            EF FDNT G  SD    RS+WWANEK    KEA P
Sbjct: 492  EFNFDNTKGEASDKPTIRSEWWANEKFA-RKEARP 525


>gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]
          Length = 455

 Score =  179 bits (455), Expect = 8e-43
 Identities = 120/265 (45%), Positives = 144/265 (54%), Gaps = 7/265 (2%)
 Frame = +2

Query: 17  LDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGG 196
           LD N     P Q+F   H EFQSY   PGSP+G LISPSS IS SGTSSPFPD EF++ G
Sbjct: 192 LDPNIHNGEPGQRFPIFHNEFQSYYFQPGSPIGQLISPSSGISGSGTSSPFPDPEFAARG 251

Query: 197 HPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVASL 376
             FLEFRTG+PPKL + D LS   W   QGSGSLT PD   P S           EVA  
Sbjct: 252 PHFLEFRTGDPPKLLNLDKLSKFDWGSRQGSGSLT-PDSVKPIS---------TFEVAPH 301

Query: 377 ANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPER 556
              N   +N E V D RVSF+++ E+    VEK+ +   L    +TSL  TT+       
Sbjct: 302 LKPNGRCRNAENVADRRVSFDVSTEDVIRYVEKKTV--PLAEAMLTSLKDTTMGQREENS 359

Query: 557 DGLSSE---AENTCVGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGG 727
           D    E    EN  VGETS+    KA   G++ + H + +    T+GS KEF FDN D G
Sbjct: 360 DSNKVEEIGCENR-VGETSNEEPDKAPTSGEEVLQHQKHRS--ITLGSSKEFNFDNADAG 416

Query: 728 ----TSDRSDWWANEKVVITKEAGP 790
               +   SDWWAN+KV   KE  P
Sbjct: 417 DLHKSDSVSDWWANQKVA-GKEGAP 440


>ref|XP_007143454.1| hypothetical protein PHAVU_007G073100g [Phaseolus vulgaris]
            gi|561016644|gb|ESW15448.1| hypothetical protein
            PHAVU_007G073100g [Phaseolus vulgaris]
          Length = 479

 Score =  174 bits (442), Expect = 3e-41
 Identities = 112/293 (38%), Positives = 152/293 (51%), Gaps = 37/293 (12%)
 Frame = +2

Query: 2    LLTSSLDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDRE 181
            LLTSSLDR+CK  G  Q+F  S+YEFQ YQ YPGSP   LISP+S+IS SG+S+PFPD  
Sbjct: 180  LLTSSLDRDCKDKGTNQRFALSNYEFQLYQQYPGSPGPQLISPASIISTSGSSTPFPDT- 238

Query: 182  FSSGGHPFLEFRTGEPPKLWSFDGLSTRKWVPHQGSGSLTP------------------- 304
                 HP LEF  GE   L  F+  ST KW    GSGSLTP                   
Sbjct: 239  -----HPLLEFHKGEASNLLGFEHFSTHKWNSRLGSGSLTPDSTGQGSGLGSGSLTPNAV 293

Query: 305  ----------PDPAGPTSRDSFLVENQISEVASLANSNNGSQNNEIVIDHRVSFELTAEE 454
                      P+   PT+R+   V  Q SE+  LANS N  Q N  ++DHRVSFELT E+
Sbjct: 294  KLVSSSGCLTPEGVAPTARNGIYVGKQTSELTPLANSENECQPNAALVDHRVSFELTGED 353

Query: 455  TPSCVEKE---PMMASLEAKSVTSLDKTTLATVTPERDGLSSEAE-NTCVGETSSNVSGK 622
               C+  +   P++ ++   S  +L       V  ER   +S+++ + C  +TS++    
Sbjct: 354  VARCLANKSGSPLIGNISGSSQGAL---VGEPVDRERIHKNSDSDCDLCSRKTSNDKPEN 410

Query: 623  AFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGGTSDR----SDWWANEKVV 769
            + G+G+++        S     S K+F FD+  G  SD     S+WW N+K+V
Sbjct: 411  SPGEGEEQCCLKHNSSS-----SSKDFNFDSRKGVVSDNPANASEWWTNKKIV 458


>ref|XP_003516706.1| PREDICTED: uncharacterized protein LOC100777876 [Glycine max]
          Length = 431

 Score =  172 bits (437), Expect = 1e-40
 Identities = 109/262 (41%), Positives = 146/262 (55%), Gaps = 5/262 (1%)
 Frame = +2

Query: 17  LDRNCKTSGPCQKFTPSHYEFQSYQLYPGSPVGHLISPSSVISESGTSSPFPDREFSSGG 196
           LD N K S   Q+F  S Y+F SYQL+PGSPVG LISP S  S SGTSSPFPD +F+S G
Sbjct: 177 LDPNTKNSETYQRFQISQYDFHSYQLHPGSPVGQLISPRSAFSPSGTSSPFPDTDFNSRG 236

Query: 197 HPFLEFRTGEPPKLWSFDGLSTRK-WVPHQGSGSLTPPDPAGPTSRDSFLVENQISEVAS 373
              L+F+ G+P KL +FD  ST +    HQGSGSLT PD    T++  FL  + +S++  
Sbjct: 237 SLLLDFQIGDPTKLLNFDKPSTNENHKSHQGSGSLT-PDSIRSTTQAGFLPSHWVSDII- 294

Query: 374 LANSNNGSQNNEIVIDHRVSFELTAEETPSCVEKEPMMASLEAKSVTSLDKTTLATVTPE 553
           ++     +  NEI ++HRVS E++A+E   CVE + +  S             L T  P 
Sbjct: 295 MSPRPRKNHPNEISVNHRVSIEVSAQEVLKCVENKAVALS------------KLKTDAPG 342

Query: 554 RDGLSSEAENTCVGETSSNVSGKAFGDGDDEVPHHRQQPSLTTIGSVKEFKFDNTDGGTS 733
            D   +  E   V ET ++   +   +GD E  HH+ +       + KEF FDN +GG S
Sbjct: 343 EDKKDNSIE-VLVSETPNDAPQQTADNGDVERAHHKDE--CIIFSAAKEFNFDNAEGGDS 399

Query: 734 DR----SDWWANEKVVITKEAG 787
                 +DWWANEKV  +KE G
Sbjct: 400 PAPNIVADWWANEKVA-SKEGG 420

