BLASTX nr result

ID: Mentha24_contig00007640 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha24_contig00007640
         (1818 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007225122.1| hypothetical protein PRUPE_ppa002630mg [Prun...   310   1e-81
ref|XP_002281644.1| PREDICTED: uncharacterized protein LOC100252...   298   4e-78
ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309...   296   2e-77
gb|EXC05040.1| hypothetical protein L484_019288 [Morus notabilis]     295   5e-77
emb|CAN65462.1| hypothetical protein VITISV_002198 [Vitis vinifera]   289   3e-75
emb|CBI26785.3| unnamed protein product [Vitis vinifera]              288   6e-75
ref|XP_007045468.1| Hydroxyproline-rich glycoprotein family prot...   285   4e-74
ref|XP_004236917.1| PREDICTED: uncharacterized protein LOC101261...   283   2e-73
ref|XP_002527549.1| conserved hypothetical protein [Ricinus comm...   283   2e-73
ref|XP_006379789.1| hypothetical protein POPTR_0008s13830g [Popu...   282   4e-73
ref|XP_006469304.1| PREDICTED: uncharacterized protein LOC102618...   281   5e-73
ref|XP_006448091.1| hypothetical protein CICLE_v10014588mg [Citr...   281   7e-73
ref|XP_006448090.1| hypothetical protein CICLE_v10014588mg [Citr...   281   7e-73
gb|ABK95394.1| unknown [Populus trichocarpa]                          281   7e-73
ref|XP_007045471.1| Hydroxyproline-rich glycoprotein family prot...   281   9e-73
ref|XP_007045467.1| Hydroxyproline-rich glycoprotein family prot...   281   9e-73
ref|XP_006355042.1| PREDICTED: uncharacterized protein LOC102600...   280   1e-72
ref|XP_002311547.2| hydroxyproline-rich glycoprotein [Populus tr...   271   9e-70
ref|XP_003524142.1| PREDICTED: uncharacterized protein LOC100809...   265   7e-68
ref|XP_004142291.1| PREDICTED: uncharacterized protein LOC101210...   262   4e-67

>ref|XP_007225122.1| hypothetical protein PRUPE_ppa002630mg [Prunus persica]
            gi|462422058|gb|EMJ26321.1| hypothetical protein
            PRUPE_ppa002630mg [Prunus persica]
          Length = 650

 Score =  310 bits (795), Expect = 1e-81
 Identities = 187/425 (44%), Positives = 237/425 (55%), Gaps = 33/425 (7%)
 Frame = -3

Query: 1465 EKSNLEVSPKSFVATEFYDGKSVXXXXXXXXXXX--------KLNNLVSDLRAAGKRGQL 1310
            +K NL + PK+F+  E  DGK+V                   KL +LV+DLRAAGKR QL
Sbjct: 217  QKQNLSIVPKTFIGNEISDGKTVNVVDGLKLYEDFLGDTEVSKLVSLVNDLRAAGKRRQL 276

Query: 1309 QGQTFVALKRPMKGHGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLL 1130
            QGQT+V  KRPMKGHGREMIQLGIPIADAPPEDE++AG S+D KIEPIP  LQDVI+RL+
Sbjct: 277  QGQTYVVSKRPMKGHGREMIQLGIPIADAPPEDEISAGTSKDRKIEPIPSLLQDVIDRLV 336

Query: 1129 TKNVVSIKPDSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEMSFGKVIAADTPGNY 950
              +V+++KPDS IID++NEGDHSQPH WP WFGRPVC + LT C+M+FG+++  D PG+Y
Sbjct: 337  GMHVMTVKPDSCIIDVYNEGDHSQPHTWPSWFGRPVCALYLTECDMTFGRLLLMDHPGDY 396

Query: 949  XXXXXXXXXXXSVISMQGRSADFARHAIPSLQKQRILVTLVKSQSRKIVGGEPHRF---- 782
                       S++ MQG+SADFA+HAIPS++KQRILVTL KSQ +K    +  RF    
Sbjct: 397  RGSLRLSLTPGSILLMQGKSADFAKHAIPSIRKQRILVTLTKSQPKKSTTSDGQRFPAPA 456

Query: 781  XXXXXXXXXXPSRTPGQIR-PAPAKHFXXXXXXXXXXXXXXXXXXXXPNG---MYVATPV 614
                      PSR+P  IR P   KH+                     NG   ++V  PV
Sbjct: 457  PAQSSYWGPPPSRSPNHIRHPTGPKHYAAVPTTGVLPAPPIRSQLPPQNGIQPLFVPAPV 516

Query: 613  APGIAYPAAVXXXXXXXXXXXXXXXXXXXXXXXPGTGVFLPSQGQG--------PGNSTT 458
             P I + AAV                       PGTGVFLP  G G        PG +T 
Sbjct: 517  GPAIPFAAAV-PIPPGSAGWPAAPRHPPPRIPLPGTGVFLPPPGSGNSSAPQQLPGTATE 575

Query: 457  NQP----PSTENSATEDSVGKMNGSRLPLTKDDEEAAQKECNGS-----GGVEILKEGEE 305
              P    PS  +          + S  P  K D +A +++CNGS      G   +KE EE
Sbjct: 576  MSPTVETPSPRDKDNGSGKSNHSTSASPKGKSDGKAQRQDCNGSAEGTGSGRTAVKE-EE 634

Query: 304  NESHD 290
             +++D
Sbjct: 635  QQTYD 639


>ref|XP_002281644.1| PREDICTED: uncharacterized protein LOC100252594 [Vitis vinifera]
          Length = 698

 Score =  298 bits (764), Expect = 4e-78
 Identities = 188/425 (44%), Positives = 236/425 (55%), Gaps = 33/425 (7%)
 Frame = -3

Query: 1465 EKSNLEVSPKSFVATEFYDGKSVXXXXXXXXXXX--------KLNNLVSDLRAAGKRGQL 1310
            EK N   SPK+FV TE +DGK+V                   K  +LV+DLRAAGKRGQL
Sbjct: 263  EKPNPTTSPKTFVGTEIFDGKAVNVVDGLKLYEELFDDSEVSKFVSLVNDLRAAGKRGQL 322

Query: 1309 QGQTFVALKRPMKGHGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLL 1130
            QGQTFV  KRPMKGHGREMIQLG+PIADAP EDE   G S+D + E IP  LQDVI  L+
Sbjct: 323  QGQTFVVSKRPMKGHGREMIQLGVPIADAPLEDESVVGTSKDRRTESIPSLLQDVIGHLV 382

Query: 1129 TKNVVSIKPDSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEMSFGKVIAADTPGNY 950
               V+++KPD+ IID +NEGDHSQPHIWP WFGRPVC++ LT C+M+FG+VI AD PG+Y
Sbjct: 383  GSQVLTVKPDACIIDFYNEGDHSQPHIWPTWFGRPVCILFLTECDMTFGRVIGADHPGDY 442

Query: 949  XXXXXXXXXXXSVISMQGRSADFARHAIPSLQKQRILVTLVKSQSRKIVGGEPHRF---X 779
                       S++ MQG+SADFA+HAIPSL+KQRILVT  KSQ +K +  +  R     
Sbjct: 443  RGSLKLSLVPGSLLVMQGKSADFAKHAIPSLRKQRILVTFTKSQPKKTMASDGQRLLPPA 502

Query: 778  XXXXXXXXXPSRTPGQIR-PAPAKHF--XXXXXXXXXXXXXXXXXXXXPNGM---YVATP 617
                     PSR+P  +R P   KH+                      PNGM   +V T 
Sbjct: 503  AQSSHWVPPPSRSPNHMRHPMGPKHYGAVPTTGVLPAPAPPMRPQLPPPNGMQPLFVTTA 562

Query: 616  VAPGIAYPAAVXXXXXXXXXXXXXXXXXXXXXXXPGTGVFLPSQGQGPGNSTTNQPPSTE 437
            VAP + +PA V                       PGTGVFLP  G   GNS++ Q  STE
Sbjct: 563  VAPAMPFPAPVPLPTGSPGWPAAPPRHPPPRLPVPGTGVFLPPPGS--GNSSSPQHISTE 620

Query: 436  NSAT----------EDSVGKMNGSR---LPLTKDDEEAAQKECNGS---GGVEILKEGEE 305
             ++T          E+  GK + +     P  K D +  ++ECNGS    GV+     +E
Sbjct: 621  ATSTSVETAAPTEKENGSGKSSSNSNTVSPKGKLDGKVHRQECNGSMDETGVDERAVTKE 680

Query: 304  NESHD 290
             + H+
Sbjct: 681  EQQHN 685


>ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309147 [Fragaria vesca
            subsp. vesca]
          Length = 682

 Score =  296 bits (759), Expect = 2e-77
 Identities = 188/470 (40%), Positives = 250/470 (53%), Gaps = 38/470 (8%)
 Frame = -3

Query: 1564 SATRQRSTQGDVTQADVDAEDTGSSSV---DGSGLA---EKSNLEVSPKSFVATEFYDGK 1403
            SA  Q +  G+    D    +  +SS+   + + +    EK NL + PK+FV  E +DGK
Sbjct: 211  SANSQGTISGNSESEDAVVNEGCTSSIKENESNSIQIQNEKQNLSLIPKTFVGNETFDGK 270

Query: 1402 SVXXXXXXXXXXX--------KLNNLVSDLRAAGKRGQLQGQTFVALKRPMKGHGREMIQ 1247
            +V                   KL +LV+DLR  G+RGQLQGQT+V  KRPMKGHGREMIQ
Sbjct: 271  TVNVVDGLKLYEEFLGDTEVSKLFSLVNDLRTTGRRGQLQGQTYVLSKRPMKGHGREMIQ 330

Query: 1246 LGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLLTKNVVSIKPDSAIIDIFNEGD 1067
            LGIPIAD P EDE++AG S+D ++E IP  LQDVI+RL+   V++ KPDS IID FNEGD
Sbjct: 331  LGIPIADGPQEDEISAGISKDRRMEAIPSLLQDVIDRLIGTQVLTDKPDSCIIDFFNEGD 390

Query: 1066 HSQPHIWPQWFGRPVCVISLTVCEMSFGKVIAADTPGNYXXXXXXXXXXXSVISMQGRSA 887
            HS PH+WP WFGRPV V+ LT C+++FGKV+  D PG+Y           S++ +QG+SA
Sbjct: 391  HSHPHMWPPWFGRPVSVLFLTECDLTFGKVLGMDHPGDYRGALRLSLTPGSLLLLQGKSA 450

Query: 886  DFARHAIPSLQKQRILVTLVKSQSRKIVGGEPHRFXXXXXXXXXXPS----RTPGQIR-P 722
            D+A+HAIPS++KQRILVT  KSQ RK    +  R            S    R+P  IR P
Sbjct: 451  DYAKHAIPSIRKQRILVTFTKSQPRKSFPTDGQRLPSPGPSQSPYWSPPPGRSPNHIRHP 510

Query: 721  APAKHFXXXXXXXXXXXXXXXXXXXXPNG---MYVATPVAPGIAYPAAVXXXXXXXXXXX 551
            A  KH+                     NG   ++VA PV P + +PA V           
Sbjct: 511  AGPKHYAAVPTTGVLPAPPNRPQLPPANGIQPLFVAAPVGPAMPFPAPV-VIPPGSPGWV 569

Query: 550  XXXXXXXXXXXXPGTGVFLPSQGQGPGNSTTNQPPST--------ENSATEDSVGKMNGS 395
                        PGTGVFLP  G G  ++   Q PST        E ++TE   G    S
Sbjct: 570  AAPRHPPPRMPLPGTGVFLPPPGSGSSSAPPQQFPSTATEMNPSVETASTEKDNGTAKSS 629

Query: 394  RL---PLTKDDEEAAQKECNGS-----GGVEILKEGEENESHDVAISGGA 269
                 P  K D +A +++CNGS      G   +K+ ++  S++ A +  A
Sbjct: 630  HAIASPKAKLDVKAQRQDCNGSVDGTGSGRGTVKQEQQQNSNNAAANNQA 679


>gb|EXC05040.1| hypothetical protein L484_019288 [Morus notabilis]
          Length = 681

 Score =  295 bits (755), Expect = 5e-77
 Identities = 185/458 (40%), Positives = 242/458 (52%), Gaps = 38/458 (8%)
 Frame = -3

Query: 1528 TQADVDAEDTG--SSSVDGSGLA-----EKSNLEVSPKSFVATEFYDGKSVXXXXXXXXX 1370
            ++ +V A D G  SSS +    +     E SNL   PK+F   E +DGK V         
Sbjct: 221  SEPEVHAVDDGCTSSSKENDSHSTPKQNENSNLANVPKTFSGNEMFDGKPVNVVEGLKLY 280

Query: 1369 XX--------KLNNLVSDLRAAGKRGQLQGQTFVALKRPMKGHGREMIQLGIPIADAPPE 1214
                      KL  LV+DLR+AG+RG  Q QT+V  KRPMKGHGRE IQLG+PIADAP E
Sbjct: 281  EEFCADTEVSKLVALVNDLRSAGERGHFQSQTYVVSKRPMKGHGREKIQLGLPIADAPVE 340

Query: 1213 DEVAAGASRDPKIEPIPVALQDVIERLLTKNVVSIKPDSAIIDIFNEGDHSQPHIWPQWF 1034
            DE++AG  +D + E IP  LQDV ERL++  V ++KPDS IID +NEGDHSQPH+WP WF
Sbjct: 341  DEISAGTLKDRRTEAIPPLLQDVAERLVSMQVATVKPDSCIIDFYNEGDHSQPHLWPSWF 400

Query: 1033 GRPVCVISLTVCEMSFGKVIAADTPGNYXXXXXXXXXXXSVISMQGRSADFARHAIPSLQ 854
            GRPVCV+ LT C+M+FG+V A D PG+Y           S+++MQG+SADFA+HAIPSL+
Sbjct: 401  GRPVCVLFLTECDMTFGRVFAIDHPGDYRGALKLSLKPGSLLAMQGKSADFAKHAIPSLR 460

Query: 853  KQRILVTLVKSQSRKIVGGEPHRF----XXXXXXXXXXPSRTPGQIRPAPAKHFXXXXXX 686
            +QRILVT  KSQ +K +  +  R               PSR+P  IR    KH+      
Sbjct: 461  RQRILVTFTKSQPKKSMPSDGQRMPSPGVAPSSHWGPQPSRSPNHIRHPGPKHYAPVPTT 520

Query: 685  XXXXXXXXXXXXXXPNG---MYVATPVAPGIAYPAAVXXXXXXXXXXXXXXXXXXXXXXX 515
                          PNG   ++V  PVAP + +PA V                       
Sbjct: 521  GVLQASPVRPQIPPPNGIQPLFVTAPVAPAMPFPAPVPIPPSSSGWSAAPPRHPPPRLPV 580

Query: 514  PGTGVFLPSQGQGPGNSTTNQPPSTENSAT---------EDSVGKMNG--SRLPLTKDDE 368
            PGTGVFLP  G G  +S + Q    + + T         E+  GK+N   +  P  K D 
Sbjct: 581  PGTGVFLPPPGSGGNSSGSQQVLGNDTNHTVETAAPPEKENGSGKLNHGMTASPKGKVDS 640

Query: 367  EAAQKECNGS-----GGVEILKEGEENESHDVAISGGA 269
            +  ++ECNGS       + + KE  +  S + A S  A
Sbjct: 641  KTQKQECNGSLDGSGSVISVTKEERQQSSDNTATSKSA 678


>emb|CAN65462.1| hypothetical protein VITISV_002198 [Vitis vinifera]
          Length = 1145

 Score =  289 bits (739), Expect = 3e-75
 Identities = 186/429 (43%), Positives = 235/429 (54%), Gaps = 37/429 (8%)
 Frame = -3

Query: 1465 EKSNLEVSPKSFVATEFYDGKSVXXXXXXXXXXX--------KLNNLVSDLRAAGKRGQL 1310
            EK N   SPK+FV TE +DGK+V                   K  +LV+DLRAAGKRGQL
Sbjct: 272  EKPNPTTSPKTFVGTEIFDGKAVNVVDGLKLYEELFDDSEVSKFVSLVNDLRAAGKRGQL 331

Query: 1309 QGQTFVALKRPMKGHGREMIQLGIPIADAPPEDEVAAGASR----DPKIEPIPVALQDVI 1142
            QGQTFV  KRPMKGHGREMIQLG+PIADAP EDE   G S+    + + E IP  LQDVI
Sbjct: 332  QGQTFVVSKRPMKGHGREMIQLGVPIADAPLEDESVVGTSKGMFHNRRTESIPSLLQDVI 391

Query: 1141 ERLLTKNVVSIKPDSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEMSFGKVIAADT 962
             +L+   V+++KPD+ IID +NEGDHSQPHIWP WFGRPVC++ LT C+M+FG+VI AD 
Sbjct: 392  GQLVGSQVLTVKPDACIIDFYNEGDHSQPHIWPTWFGRPVCILFLTECDMTFGRVIGADH 451

Query: 961  PGNYXXXXXXXXXXXSVISMQGRSADFARHAIPSLQKQRILVTLVKSQSRKIVGGEPHRF 782
            PG+Y           S++ MQG+SADFA+HAIPSL+KQRILVT  KSQ +K    +  R 
Sbjct: 452  PGDYRGSLKLSLVPGSLLVMQGKSADFAKHAIPSLRKQRILVTFTKSQPKKTTASDGQRL 511

Query: 781  ---XXXXXXXXXXPSRTPGQIR-PAPAKHF--XXXXXXXXXXXXXXXXXXXXPNGM---Y 629
                         PSR+P  +R P   KH+                      PNGM   +
Sbjct: 512  LPPAAQSSHWVPPPSRSPNHMRHPMGPKHYGAVPTTGVLPAPAPPMRPQLPPPNGMQPLF 571

Query: 628  VATPVAPGIAYPAAVXXXXXXXXXXXXXXXXXXXXXXXPGTGVFLPSQGQGPGNSTTNQP 449
            V T VAP + +PA                         PGTGVFLP  G   GNS++ Q 
Sbjct: 572  VTTAVAPAMPFPAPXPLPTGSPGWPAAPPRHPPPRLPVPGTGVFLPPPGS--GNSSSPQH 629

Query: 448  PSTENSAT----------EDSVGKMNGSR---LPLTKDDEEAAQKECNGS---GGVEILK 317
             STE ++T          E+  GK + +     P  K D +  ++ECNGS    GV+   
Sbjct: 630  ISTEATSTSVETAAPTEKENGSGKSSSNSNTVSPKGKLDGKVHRQECNGSMDETGVDERA 689

Query: 316  EGEENESHD 290
              +E + H+
Sbjct: 690  VTKEEQQHN 698


>emb|CBI26785.3| unnamed protein product [Vitis vinifera]
          Length = 672

 Score =  288 bits (737), Expect = 6e-75
 Identities = 180/389 (46%), Positives = 218/389 (56%), Gaps = 22/389 (5%)
 Frame = -3

Query: 1465 EKSNLEVSPKSFVATEFYDGKSVXXXXXXXXXXX--------KLNNLVSDLRAAGKRGQL 1310
            EK N   SPK+FV TE +DGK+V                   K  +LV+DLRAAGKRGQL
Sbjct: 269  EKPNPTTSPKTFVGTEIFDGKAVNVVDGLKLYEELFDDSEVSKFVSLVNDLRAAGKRGQL 328

Query: 1309 Q-GQTFVALKRPMKGHGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERL 1133
            Q GQTFV  KRPMKGHGREMIQLG+PIADAP EDE   G S+D + E IP  LQDVI  L
Sbjct: 329  QAGQTFVVSKRPMKGHGREMIQLGVPIADAPLEDESVVGTSKDRRTESIPSLLQDVIGHL 388

Query: 1132 LTKNVVSIKPDSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEMSFGKVIAADTPGN 953
            +   V+++KPD+ IID +NEGDHSQPHIWP WFGRPVC++ LT C+M+FG+VI AD PG+
Sbjct: 389  VGSQVLTVKPDACIIDFYNEGDHSQPHIWPTWFGRPVCILFLTECDMTFGRVIGADHPGD 448

Query: 952  YXXXXXXXXXXXSVISMQGRSADFARHAIPSLQKQRILVTLVKSQSRKIVGGEPHRF--- 782
            Y           S++ MQG+SADFA+HAIPSL+KQRILVT  KSQ +K +  +  R    
Sbjct: 449  YRGSLKLSLVPGSLLVMQGKSADFAKHAIPSLRKQRILVTFTKSQPKKTMASDGQRLLPP 508

Query: 781  XXXXXXXXXXPSRTPGQIR-PAPAKHF--XXXXXXXXXXXXXXXXXXXXPNGM---YVAT 620
                      PSR+P  +R P   KH+                      PNGM   +V T
Sbjct: 509  AAQSSHWVPPPSRSPNHMRHPMGPKHYGAVPTTGVLPAPAPPMRPQLPPPNGMQPLFVTT 568

Query: 619  PVAPGIAYPAAVXXXXXXXXXXXXXXXXXXXXXXXPGTGVFLPSQGQGPGNSTTNQPPST 440
             VAP + +PA V                       PGTGVFLP  G   GNS++ Q  ST
Sbjct: 569  AVAPAMPFPAPVPLPTGSPGWPAAPPRHPPPRLPVPGTGVFLPPPGS--GNSSSPQHIST 626

Query: 439  ENSATEDSVG----KMNGSRLPLTKDDEE 365
            E ++T         K NGS    T   EE
Sbjct: 627  EATSTSVETAAPTEKENGSGKSSTVTKEE 655


>ref|XP_007045468.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2
            [Theobroma cacao] gi|590697545|ref|XP_007045470.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 2 [Theobroma cacao] gi|508709403|gb|EOY01300.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 2 [Theobroma cacao] gi|508709405|gb|EOY01302.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 2 [Theobroma cacao]
          Length = 680

 Score =  285 bits (730), Expect = 4e-74
 Identities = 201/536 (37%), Positives = 268/536 (50%), Gaps = 43/536 (8%)
 Frame = -3

Query: 1765 GKEFRRGG---RGQR--VGLEVQNFGGEMNGKDLNNGYYKSNQKLTXXXXXXXXXXXXXX 1601
            GKEF+R G   +GQR  V  E QN G + +G        + N++ +              
Sbjct: 142  GKEFKRSGMGFKGQRMEVAKEGQNSGVDSDGNSTVTAVSERNERGSEKREEVKSCGEVGK 201

Query: 1600 XXXXXXXXXENGSATRQRSTQGDVTQADVDAEDTGSSSVDGSGLA------EKSNLEVSP 1439
                     E+   T  +   GD      D     +SS   + L       EK NL   P
Sbjct: 202  VEDKCSTFTEDKKDTGSKPHAGDAESVTEDVNGGCTSSYKENDLCSIQNQNEKQNLAAGP 261

Query: 1438 KSFVATEFYDGKSVXXXXXXXXXXX--------KLNNLVSDLRAAGKRGQLQGQTFVALK 1283
            K+FV  E +DGK V                    L +LV+DLRAAGKRGQLQGQT+VA K
Sbjct: 262  KTFVGNEMFDGKMVNVVDGLKLYEELFDDKEVLDLVSLVNDLRAAGKRGQLQGQTYVAAK 321

Query: 1282 RPMKGHGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLLTKNVVSIKP 1103
            RPMKGHGREMIQLG+PIADAP +DE AAG S+D +IE IP  LQD IERL+   V+++KP
Sbjct: 322  RPMKGHGREMIQLGLPIADAPLDDENAAGTSKDRRIEGIPPLLQDTIERLVNLQVMTVKP 381

Query: 1102 DSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEMSFGK-VIAADTPGNYXXXXXXXX 926
            DS IID++NEGDHSQP +WP WFG+PVC++ LT C+++FG+ VI AD PG+Y        
Sbjct: 382  DSCIIDVYNEGDHSQPRMWPPWFGKPVCIMFLTECDITFGRVVIVADHPGDYRGSLKLSL 441

Query: 925  XXXSVISMQGRSADFARHAIPSLQKQRILVTLVKSQSRKIVGGEPHRF----XXXXXXXX 758
               S++ MQG+SADFA+HA+PS++KQRILVT  K    K    +  R             
Sbjct: 442  APGSLLVMQGKSADFAKHALPSVRKQRILVTFTKYCQPKKSTTDNQRLSSPSVSQSSQWG 501

Query: 757  XXPSRTPGQIR-PAPAKHFXXXXXXXXXXXXXXXXXXXXPNG---MYVATPVAPGIAYPA 590
              PSR+P +IR  A  KH+                     +G   ++V T VAP I++PA
Sbjct: 502  PPPSRSPNRIRHSAGPKHYAVIPTTGVLPAPPIRPQIPPSSGVQPLFVPTAVAPAISFPA 561

Query: 589  AVXXXXXXXXXXXXXXXXXXXXXXXPGTGVFLPSQGQGPGNSTTNQPPSTENSATEDSVG 410
             V                       PGTGVFLP  G G  +S      +TE +   ++  
Sbjct: 562  PV-PIPPGSTGWPAAPRHPPPRLPVPGTGVFLPPPGSGNSSSQQLSTTATELNILVETTS 620

Query: 409  ---KMNGS-------RLPLTKDDEEAAQKECNGS-----GGVEILKEGEENESHDV 287
               K NGS         P  + D ++ +++CNGS      G  ++KE +    + V
Sbjct: 621  PREKENGSVKPNHHTTSPRGRLDGKSPKQDCNGSVDGAGSGRALMKEEQHCADNSV 676


>ref|XP_004236917.1| PREDICTED: uncharacterized protein LOC101261013 [Solanum
            lycopersicum]
          Length = 641

 Score =  283 bits (724), Expect = 2e-73
 Identities = 195/519 (37%), Positives = 253/519 (48%), Gaps = 23/519 (4%)
 Frame = -3

Query: 1816 RQQKMVGFEGGARMGGVGKEFRRGGRGQRVGLEVQNFGGEMNGKDLN-NGYYKSNQKLTX 1640
            +QQK  GF+GG    G  +   +GG G     E    G E  G++ + + + K+N     
Sbjct: 123  KQQK--GFDGGVNKVGK-RNGSKGGGGGGWKSEGLKDGKESQGQNFSLDAHSKTNGVEKI 179

Query: 1639 XXXXXXXXXXXXXXXXXXXXXXENGSA-TRQRSTQGDVTQADV--DAEDTGSSSVDGSGL 1469
                                    GS  T    +QG+V + D   D+   GSS+V+    
Sbjct: 180  DVVEEKQGDKKELAAKPEANSSVKGSVCTEAGDSQGEVDKTDDKRDSNSEGSSNVESESH 239

Query: 1468 A-----EKSNLEVSPKSFVATEFYDGKSVXXXXXXXXXXX--------KLNNLVSDLRAA 1328
            +     EK N  V PK+FVATE YDGK V                   KL  LV+DLRAA
Sbjct: 240  SFQIPTEKQN--VVPKTFVATEIYDGKPVNVVDGMKLYEELLSSSEVSKLVTLVNDLRAA 297

Query: 1327 GKRGQLQGQTFVALKRPMKGHGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQD 1148
            G+RGQL  Q F+  KRPMKGHGREM+QLG+PI DAPPE+E A    +D K E IP  LQD
Sbjct: 298  GRRGQLPAQAFIVSKRPMKGHGREMVQLGLPIVDAPPEEESAISTYKDRKTEAIPGLLQD 357

Query: 1147 VIERLLTKNVVSIKPDSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEMSFGKVIAA 968
            VI++L     +S+KPD+ +IDIFNEGDHSQPH+WP W+GRP+  + LT CEM+FGKVI  
Sbjct: 358  VIDQLSAMQALSVKPDACVIDIFNEGDHSQPHLWPYWYGRPISTLFLTDCEMTFGKVIGV 417

Query: 967  DTPGNYXXXXXXXXXXXSVISMQGRSADFARHAIPSLQKQRILVTLVKSQSRKIVGGEPH 788
            D PG+Y           SV+ MQGRS +FA++AIPS++KQR+LVT  K Q R+I  G+  
Sbjct: 418  DHPGDYRGSLKLSLAPGSVLVMQGRSTEFAKYAIPSIRKQRMLVTFTKLQLRRIKSGDSQ 477

Query: 787  RF---XXXXXXXXXXPSRTPGQI-RPAPAKHFXXXXXXXXXXXXXXXXXXXXPN--GMYV 626
            RF             PSR+   I RP   KH+                     N   ++V
Sbjct: 478  RFPSSAGGPVSQWVPPSRSSNHIRRPFGPKHYGSMPATGVLPIPGVRPQFAPANMQPIFV 537

Query: 625  ATPVAPGIAYPAAVXXXXXXXXXXXXXXXXXXXXXXXPGTGVFLPSQGQGPGNSTTNQPP 446
               VAP + +PA V                       PGTGVFLP    G G S+T+  P
Sbjct: 538  PATVAPAMPFPAPVALPPASAGWAVPPIRHPPPRLPLPGTGVFLP---PGSGTSSTDNIP 594

Query: 445  STENSATEDSVGKMNGSRLPLTKDDEEAAQKECNGSGGV 329
            +       DS          +  D  E   ++CNG   V
Sbjct: 595  AENTGPLSDSTVSQK-----VNSDSSEVQTQDCNGKADV 628


>ref|XP_002527549.1| conserved hypothetical protein [Ricinus communis]
            gi|223533099|gb|EEF34858.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 697

 Score =  283 bits (724), Expect = 2e-73
 Identities = 195/538 (36%), Positives = 271/538 (50%), Gaps = 47/538 (8%)
 Frame = -3

Query: 1765 GKEFRRGG-----RGQRVGLEVQ---NFGGEMNGKDLN-NGYYKSNQKLTXXXXXXXXXX 1613
            GK+F+R       +G R G EV    N+G E +G D N +G  K N+  +          
Sbjct: 150  GKDFKRNSSMGFNKGHRGGGEVVKEVNYGAESHGLDGNTSGNEKFNEIKSGGDSGRLENK 209

Query: 1612 XXXXXXXXXXXXXE------NGSATRQRSTQGDV-TQADVDAEDTGSSSVDGSGLAE--- 1463
                         +        S   + S  G++ T+A+   E +     D   +     
Sbjct: 210  SLATAEDKKDAASKPHVDNLKSSGNSEGSLSGNLETEAEAVHEQSSPKEHDSHFIQNQIV 269

Query: 1462 KSNLEVSPKSFVATEFYDGKSVXXXXXXXXXXX--------KLNNLVSDLRAAGKRGQLQ 1307
            K NL  +PK+FV  E  DGKSV                   KL +LV+DLRAAG++GQ Q
Sbjct: 270  KLNLTTTPKTFVGAEMVDGKSVNVVDGLKLYEQLLDDVEVSKLVSLVNDLRAAGRKGQFQ 329

Query: 1306 GQTFVALKRPMKGHGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLLT 1127
            GQ +V  KRPMKGHGREMIQLG+PIADAP E+E AAG S+D KIE IP  LQ+VIER ++
Sbjct: 330  GQAYVVSKRPMKGHGREMIQLGLPIADAPAEEENAAGTSKDRKIESIPTLLQEVIERFVS 389

Query: 1126 KNVVSIKPDSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEMSFGKVIAADTPGNYX 947
              ++++KPDS IIDI+NEGDHSQPH+WP WFG+P+ V+ LT C+++FG+VI AD PG+Y 
Sbjct: 390  MQIMTMKPDSCIIDIYNEGDHSQPHMWPPWFGKPISVLFLTECDLTFGRVITADHPGDYR 449

Query: 946  XXXXXXXXXXSVISMQGRSADFARHAIPSLQKQRILVTLVKSQSRKIVGGEPHRF----X 779
                      S++ MQG++ DFA+HAIP+++KQR+L+T  KSQ +K V  +  R      
Sbjct: 450  GSLKLPLAPGSLLVMQGKATDFAKHAIPAIRKQRVLLTFTKSQPKKFVQSDGQRLTSPAA 509

Query: 778  XXXXXXXXXPSRTPGQIRPAPAKHFXXXXXXXXXXXXXXXXXXXXPNG---MYVATPVAP 608
                     PSR+P  IR   +KH+                    PNG   ++V  PVA 
Sbjct: 510  SPSSHWGPPPSRSPNHIRHPVSKHYAPIPTTGVLPAPSIRPQIAPPNGVQPLFVTAPVAA 569

Query: 607  GIAYPAAV-XXXXXXXXXXXXXXXXXXXXXXXPGTGVFLPSQGQG-------PGNSTTNQ 452
             + +PA V                        PGTGVFLP  G G       P  +  N 
Sbjct: 570  PMPFPAPVPMPPVSTGWPAAPRHPPNRLPVPVPGTGVFLPPPGSGNASSPQIPNATEINF 629

Query: 451  PPSTEN-SATEDSVGKMNGSRL--PLTKDDEEAAQKECNG--SGGVEILKEGEENESH 293
            P  T +    E+ +GK N      P  K + ++ +++CNG   G     +E +++  H
Sbjct: 630  PAETASLQDKENGLGKSNHGTCASPKEKLEAKSQKQDCNGITDGKAGTKEEHQQSVDH 687


>ref|XP_006379789.1| hypothetical protein POPTR_0008s13830g [Populus trichocarpa]
            gi|550333016|gb|ERP57586.1| hypothetical protein
            POPTR_0008s13830g [Populus trichocarpa]
          Length = 693

 Score =  282 bits (721), Expect = 4e-73
 Identities = 193/542 (35%), Positives = 262/542 (48%), Gaps = 31/542 (5%)
 Frame = -3

Query: 1816 RQQKMVGFEGGARMGGVGKEFRRGGRGQRVGLEVQNFGGEMNGKDLNNGYYKSNQKLTXX 1637
            ++    GF  G R GG G     GG   + G+         NG    N   +  +++   
Sbjct: 150  KRSSSAGFNRGHRGGGGGG----GGDAVKEGVNSSVENHSFNGNSSENIRSEKFEEVKSG 205

Query: 1636 XXXXXXXXXXXXXXXXXXXXXENGSATRQRSTQGDVTQADVDAEDTG--SSSVDGSGLAE 1463
                                 +N S   Q +  G+     VD   +   S S   +   E
Sbjct: 206  GDGGKSDDKKDATAKSHTDNHKNSSGNAQGTFSGNSEAVAVDDRSSPEESDSHPSNNQNE 265

Query: 1462 KSNLEVSPKSFVATEFYDGKSVXXXXXXXXXXX--------KLNNLVSDLRAAGKRGQLQ 1307
            K NL ++PK+FVA E  DG+ V                   KL +LV++LRA G+RGQ Q
Sbjct: 266  KQNLAITPKTFVAEEKIDGQMVNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQ 325

Query: 1306 GQTFVALKRPMKGHGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLLT 1127
            GQT++  KRPMKGHGREMIQLG+PIADAP EDE A G S++ ++E IP  LQDVIE  + 
Sbjct: 326  GQTYILSKRPMKGHGREMIQLGLPIADAPAEDENATGTSKERRVESIPALLQDVIEHFVA 385

Query: 1126 KNVVSIKPDSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEMSFGKVIAADTPGNYX 947
              V+++KPDS IIDI+NEGDHSQPH+WP WFG+PV V+ LT CE++FGKVI     G+Y 
Sbjct: 386  MQVMTMKPDSCIIDIYNEGDHSQPHMWPPWFGKPVSVLFLTECELTFGKVIDTLHHGDYK 445

Query: 946  XXXXXXXXXXSVISMQGRSADFARHAIPSLQKQRILVTLVKSQSRKIVGGE----PHRFX 779
                      S++ MQG+S+D A+HAIP ++KQR+LVT  KSQ +K+   +    P    
Sbjct: 446  GSLKLSVAPGSLLVMQGKSSDLAKHAIPMIKKQRMLVTFTKSQPKKLTSNDGPRLPSHAV 505

Query: 778  XXXXXXXXXPSRTPGQIRPAPAKHFXXXXXXXXXXXXXXXXXXXXPNG---MYVATPVAP 608
                     PSR+P  +R    KH+                    PNG   +++ TPVA 
Sbjct: 506  APSSHWGPPPSRSPNHLRHPVPKHYAAIPTTGVLLVPPIRPQIPPPNGVQPLFMTTPVAA 565

Query: 607  GIAYPAAV--XXXXXXXXXXXXXXXXXXXXXXXPGTGVFLPSQGQGPGNST--------- 461
             + +PA V                         PGTGVFLP  G G  +S          
Sbjct: 566  PMPFPAPVPIPPVSTGWPTSSPRHPSARLPVPIPGTGVFLPPPGSGNASSALQLSATATE 625

Query: 460  TNQPPSTENSATEDSVGKMN--GSRLPLTKDDEEAAQKECNGS-GGVEILKEGEENESHD 290
             N P  TE    E+  GK N   S  P  K  E+  +++ NG   G+ + KE +++ SH 
Sbjct: 626  MNFPTETEKE-KENGPGKSNHDTSASPKEKSAEKTQRQDSNGDVDGIAVKKEEQQSVSHT 684

Query: 289  VA 284
            VA
Sbjct: 685  VA 686


>ref|XP_006469304.1| PREDICTED: uncharacterized protein LOC102618872 [Citrus sinensis]
          Length = 627

 Score =  281 bits (720), Expect = 5e-73
 Identities = 180/450 (40%), Positives = 240/450 (53%), Gaps = 39/450 (8%)
 Frame = -3

Query: 1507 EDTGSSSVDGSGLAEKSNLEVSPKSFVATEFYDGKSVXXXXXXXXXXX--------KLNN 1352
            ++  S SV      EK N  ++ KSFV TE  DGK V                   KL +
Sbjct: 181  KENDSQSVQSQN--EKQNQSMAAKSFVGTEMVDGKMVNVVDGLKLYEEVSGNSEVSKLVS 238

Query: 1351 LVSDLRAAGKRGQLQGQTFVALKRPMKGHGREMIQLGIPIADAPPEDEVAAGASRDPKIE 1172
            LV+DLR AGKRGQ+QG  +V  KRP++GHGRE+IQLG+PI D PPEDE+AAG SRD +IE
Sbjct: 239  LVNDLRTAGKRGQIQGPAYVVSKRPIRGHGREVIQLGLPIVDGPPEDEIAAGTSRDRRIE 298

Query: 1171 PIPVALQDVIERLLTKNVVSIKPDSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEM 992
            PIP  LQDVI+RL+   ++++KPDS I+D+FNEGDHSQPHI P WFGRPVC++ LT C+M
Sbjct: 299  PIPSLLQDVIDRLVGLQIMTVKPDSCIVDVFNEGDHSQPHISPSWFGRPVCILFLTECDM 358

Query: 991  SFGKVIAADTPGNYXXXXXXXXXXXSVISMQGRSADFARHAIPSLQKQRILVTLVKSQSR 812
            +FG++I  D PG+Y           S++ MQG+SAD A+HAI S++KQRILVT  KSQ +
Sbjct: 359  TFGRMIGIDHPGDYRGTLRLSVAPGSLLVMQGKSADIAKHAISSIRKQRILVTFTKSQPK 418

Query: 811  KIVGGEPHRF----XXXXXXXXXXPSRTPGQIR-PAPAKHFXXXXXXXXXXXXXXXXXXX 647
            K+   +  R               P R P  IR P   KHF                   
Sbjct: 419  KLTPTDGQRLASPGIAPSPHWGLPPGRPPNHIRHPTGPKHFAPIPTTGVLPAPAIRAQIP 478

Query: 646  XPNG---MYVATPVAPGIAYPAAV---XXXXXXXXXXXXXXXXXXXXXXXPGTGVFLPSQ 485
              NG   ++V+ PV P + +PA V                          PGTGVFLP  
Sbjct: 479  PTNGVPPIFVSPPVTPAMPFPAPVPIPPGSTGWTAAPPRHTPPPPPRLPVPGTGVFLPP- 537

Query: 484  GQGPGNSTTNQPPSTENSATEDSVGKM-------NGS-------RLPLTKDDEEAAQKEC 347
               PG+  ++ P    ++ATE  + +M       NGS         P  K   E   + C
Sbjct: 538  ---PGSGGSSSPRQVSSAATEHLIPEMGSQAEKENGSGKSNHETNAPKEKLVGETQGQGC 594

Query: 346  NGS----GGVE-ILKEGEENES-HDVAISG 275
            NGS    G V+ ++KE  +++S  D +++G
Sbjct: 595  NGSVDGTGSVKAVMKEENQHQSVEDTSVAG 624


>ref|XP_006448091.1| hypothetical protein CICLE_v10014588mg [Citrus clementina]
            gi|557550702|gb|ESR61331.1| hypothetical protein
            CICLE_v10014588mg [Citrus clementina]
          Length = 635

 Score =  281 bits (719), Expect = 7e-73
 Identities = 178/451 (39%), Positives = 239/451 (52%), Gaps = 40/451 (8%)
 Frame = -3

Query: 1507 EDTGSSSVDGSGLAEKSNLEVSPKSFVATEFYDGKSVXXXXXXXXXXX--------KLNN 1352
            ++  S SV      EK N  ++ KSFV TE  DGK V                   KL +
Sbjct: 188  KENDSQSVQSQN--EKQNQSMAAKSFVGTEMVDGKMVNVVDGLKLYEEVSGNSEVSKLVS 245

Query: 1351 LVSDLRAAGKRGQLQGQTFVALKRPMKGHGREMIQLGIPIADAPPEDEVAAGASRDPKIE 1172
            LV+DLR AGKRGQ+QG  +V  KRP++GHGRE+IQLG+PI D PPEDE+AAG SRD +IE
Sbjct: 246  LVNDLRTAGKRGQIQGPAYVVSKRPIRGHGREVIQLGLPIVDGPPEDEIAAGTSRDRRIE 305

Query: 1171 PIPVALQDVIERLLTKNVVSIKPDSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEM 992
            PIP  LQDVI+RL+   ++++KPDS I+D+FNEGDHSQPHI P WFGRPVC++ LT C+M
Sbjct: 306  PIPSLLQDVIDRLVGLQIMTVKPDSCIVDVFNEGDHSQPHISPSWFGRPVCILFLTECDM 365

Query: 991  SFGKVIAADTPGNYXXXXXXXXXXXSVISMQGRSADFARHAIPSLQKQRILVTLVKSQSR 812
            +FG++I  D PG+Y           S++ MQG+SAD A+HAI S++KQRILVT  KSQ +
Sbjct: 366  TFGRMIGIDHPGDYRGTLRLSVAPGSLLVMQGKSADIAKHAISSIRKQRILVTFTKSQPK 425

Query: 811  KIVGGEPHRFXXXXXXXXXXPSRTPGQ----IR-PAPAKHFXXXXXXXXXXXXXXXXXXX 647
            K+   +  R               PG+    IR P   KHF                   
Sbjct: 426  KLTPTDGQRLASPGIAPSPHWGPPPGRPPNHIRHPTGPKHFAPIPTTGVLPAPAIRAQIP 485

Query: 646  XPNGM---YVATPVAPGIAYPAAVXXXXXXXXXXXXXXXXXXXXXXXP----GTGVFLPS 488
              NG+   +V+ PV P + +PA V                            GTGVFLP 
Sbjct: 486  PTNGVPPIFVSPPVTPAMPFPAPVPIPPGSTGWTAAPPRHTPPPPPPRLPVPGTGVFLPP 545

Query: 487  QGQGPGNSTTNQPPSTENSATEDSVGKM-------NGS-------RLPLTKDDEEAAQKE 350
                PG+  ++ P    ++ATE  + +M       NGS         P  K   E   + 
Sbjct: 546  ----PGSGGSSSPRQVSSAATEHLIPEMGSQAEKENGSGKSNHETNAPKEKLVGETQGQG 601

Query: 349  CNGS----GGVE-ILKEGEENES-HDVAISG 275
            CNGS    G V+ ++KE  +++S  D +++G
Sbjct: 602  CNGSVDGTGSVKAVMKEENQHQSVEDTSVAG 632


>ref|XP_006448090.1| hypothetical protein CICLE_v10014588mg [Citrus clementina]
            gi|557550701|gb|ESR61330.1| hypothetical protein
            CICLE_v10014588mg [Citrus clementina]
          Length = 486

 Score =  281 bits (719), Expect = 7e-73
 Identities = 178/451 (39%), Positives = 239/451 (52%), Gaps = 40/451 (8%)
 Frame = -3

Query: 1507 EDTGSSSVDGSGLAEKSNLEVSPKSFVATEFYDGKSVXXXXXXXXXXX--------KLNN 1352
            ++  S SV      EK N  ++ KSFV TE  DGK V                   KL +
Sbjct: 39   KENDSQSVQSQN--EKQNQSMAAKSFVGTEMVDGKMVNVVDGLKLYEEVSGNSEVSKLVS 96

Query: 1351 LVSDLRAAGKRGQLQGQTFVALKRPMKGHGREMIQLGIPIADAPPEDEVAAGASRDPKIE 1172
            LV+DLR AGKRGQ+QG  +V  KRP++GHGRE+IQLG+PI D PPEDE+AAG SRD +IE
Sbjct: 97   LVNDLRTAGKRGQIQGPAYVVSKRPIRGHGREVIQLGLPIVDGPPEDEIAAGTSRDRRIE 156

Query: 1171 PIPVALQDVIERLLTKNVVSIKPDSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEM 992
            PIP  LQDVI+RL+   ++++KPDS I+D+FNEGDHSQPHI P WFGRPVC++ LT C+M
Sbjct: 157  PIPSLLQDVIDRLVGLQIMTVKPDSCIVDVFNEGDHSQPHISPSWFGRPVCILFLTECDM 216

Query: 991  SFGKVIAADTPGNYXXXXXXXXXXXSVISMQGRSADFARHAIPSLQKQRILVTLVKSQSR 812
            +FG++I  D PG+Y           S++ MQG+SAD A+HAI S++KQRILVT  KSQ +
Sbjct: 217  TFGRMIGIDHPGDYRGTLRLSVAPGSLLVMQGKSADIAKHAISSIRKQRILVTFTKSQPK 276

Query: 811  KIVGGEPHRFXXXXXXXXXXPSRTPGQ----IR-PAPAKHFXXXXXXXXXXXXXXXXXXX 647
            K+   +  R               PG+    IR P   KHF                   
Sbjct: 277  KLTPTDGQRLASPGIAPSPHWGPPPGRPPNHIRHPTGPKHFAPIPTTGVLPAPAIRAQIP 336

Query: 646  XPNGM---YVATPVAPGIAYPAAVXXXXXXXXXXXXXXXXXXXXXXXP----GTGVFLPS 488
              NG+   +V+ PV P + +PA V                            GTGVFLP 
Sbjct: 337  PTNGVPPIFVSPPVTPAMPFPAPVPIPPGSTGWTAAPPRHTPPPPPPRLPVPGTGVFLPP 396

Query: 487  QGQGPGNSTTNQPPSTENSATEDSVGKM-------NGS-------RLPLTKDDEEAAQKE 350
                PG+  ++ P    ++ATE  + +M       NGS         P  K   E   + 
Sbjct: 397  ----PGSGGSSSPRQVSSAATEHLIPEMGSQAEKENGSGKSNHETNAPKEKLVGETQGQG 452

Query: 349  CNGS----GGVE-ILKEGEENES-HDVAISG 275
            CNGS    G V+ ++KE  +++S  D +++G
Sbjct: 453  CNGSVDGTGSVKAVMKEENQHQSVEDTSVAG 483


>gb|ABK95394.1| unknown [Populus trichocarpa]
          Length = 694

 Score =  281 bits (719), Expect = 7e-73
 Identities = 194/542 (35%), Positives = 264/542 (48%), Gaps = 31/542 (5%)
 Frame = -3

Query: 1816 RQQKMVGFEGGARMGGVGKEFRRGGRGQRVGLEVQNFGGEMNGKDLNNGYYKSNQKLTXX 1637
            ++    GF  G R GG G +  + G    V   V+N     NG    N   +  +++   
Sbjct: 153  KRSSSAGFNRGHRGGGGGGDAVKEG----VNSSVENHS--FNGNSSENIRSEKFEEVKSG 206

Query: 1636 XXXXXXXXXXXXXXXXXXXXXENGSATRQRSTQGDVTQADVDAEDTG--SSSVDGSGLAE 1463
                                 +N S   Q +  G+     VD   +   S S   +   E
Sbjct: 207  GDGGKSDDKKDATAKSHTDNHKNSSGNAQGTFSGNSEAVAVDDRSSPEESDSHPSNNQNE 266

Query: 1462 KSNLEVSPKSFVATEFYDGKSVXXXXXXXXXXX--------KLNNLVSDLRAAGKRGQLQ 1307
            K NL ++PK+FVA E  DG+ V                   KL +LV++LRA G+RGQ Q
Sbjct: 267  KQNLAITPKTFVAEEKIDGQMVNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQ 326

Query: 1306 GQTFVALKRPMKGHGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLLT 1127
            GQT++  KRPMKGHGREMIQLG+PIADAP EDE A G S++ ++E IP  LQDVIE  + 
Sbjct: 327  GQTYILSKRPMKGHGREMIQLGLPIADAPAEDENATGTSKERRVESIPALLQDVIEHFVA 386

Query: 1126 KNVVSIKPDSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEMSFGKVIAADTPGNYX 947
              V+++KPDS IIDI+NEGDHSQPH+WP WFG+PV V+ LT CE++FGKVI     G+Y 
Sbjct: 387  MQVMTMKPDSCIIDIYNEGDHSQPHMWPPWFGKPVSVLFLTECELTFGKVIDTLHHGDYK 446

Query: 946  XXXXXXXXXXSVISMQGRSADFARHAIPSLQKQRILVTLVKSQSRKIVGGE----PHRFX 779
                      S++ MQG+S+D A+HAIP ++KQR+LVT  KSQ +K+   +    P    
Sbjct: 447  GSLKLSVAPGSLLVMQGKSSDLAKHAIPMIKKQRMLVTFTKSQPKKLTSNDGPRLPSHAV 506

Query: 778  XXXXXXXXXPSRTPGQIRPAPAKHFXXXXXXXXXXXXXXXXXXXXPNG---MYVATPVAP 608
                     PSR+P  +R    KH+                    PNG   +++ TPVA 
Sbjct: 507  APSSHWGPPPSRSPNHLRHPVPKHYAAIPTTGVLLVPPIRPQIPPPNGVQPLFMTTPVAA 566

Query: 607  GIAYPAAV--XXXXXXXXXXXXXXXXXXXXXXXPGTGVFLPSQGQGPGNST--------- 461
             + +PA V                         PGTGVFLP  G G  +S          
Sbjct: 567  PMPFPAPVPIPPVSTGWPTSSPRHPSARLPVPIPGTGVFLPPPGSGNASSALQLSATATE 626

Query: 460  TNQPPSTENSATEDSVGKMN--GSRLPLTKDDEEAAQKECNGS-GGVEILKEGEENESHD 290
             N P  TE    E+  GK N   S  P  K  E+  +++ NG   G+ + KE +++ SH 
Sbjct: 627  MNFPTETEKE-KENGPGKSNHDTSASPKEKSAEKTQRQDSNGDVDGIAVKKEEQQSVSHT 685

Query: 289  VA 284
            VA
Sbjct: 686  VA 687


>ref|XP_007045471.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 5
            [Theobroma cacao] gi|508709406|gb|EOY01303.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 5 [Theobroma cacao]
          Length = 572

 Score =  281 bits (718), Expect = 9e-73
 Identities = 201/537 (37%), Positives = 268/537 (49%), Gaps = 44/537 (8%)
 Frame = -3

Query: 1765 GKEFRRGG---RGQR--VGLEVQNFGGEMNGKDLNNGYYKSNQKLTXXXXXXXXXXXXXX 1601
            GKEF+R G   +GQR  V  E QN G + +G        + N++ +              
Sbjct: 33   GKEFKRSGMGFKGQRMEVAKEGQNSGVDSDGNSTVTAVSERNERGSEKREEVKSCGEVGK 92

Query: 1600 XXXXXXXXXENGSATRQRSTQGDVTQADVDAEDTGSSSVDGSGLA------EKSNLEVSP 1439
                     E+   T  +   GD      D     +SS   + L       EK NL   P
Sbjct: 93   VEDKCSTFTEDKKDTGSKPHAGDAESVTEDVNGGCTSSYKENDLCSIQNQNEKQNLAAGP 152

Query: 1438 KSFVATEFYDGKSVXXXXXXXXXXX--------KLNNLVSDLRAAGKRGQLQ-GQTFVAL 1286
            K+FV  E +DGK V                    L +LV+DLRAAGKRGQLQ GQT+VA 
Sbjct: 153  KTFVGNEMFDGKMVNVVDGLKLYEELFDDKEVLDLVSLVNDLRAAGKRGQLQAGQTYVAA 212

Query: 1285 KRPMKGHGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLLTKNVVSIK 1106
            KRPMKGHGREMIQLG+PIADAP +DE AAG S+D +IE IP  LQD IERL+   V+++K
Sbjct: 213  KRPMKGHGREMIQLGLPIADAPLDDENAAGTSKDRRIEGIPPLLQDTIERLVNLQVMTVK 272

Query: 1105 PDSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEMSFGK-VIAADTPGNYXXXXXXX 929
            PDS IID++NEGDHSQP +WP WFG+PVC++ LT C+++FG+ VI AD PG+Y       
Sbjct: 273  PDSCIIDVYNEGDHSQPRMWPPWFGKPVCIMFLTECDITFGRVVIVADHPGDYRGSLKLS 332

Query: 928  XXXXSVISMQGRSADFARHAIPSLQKQRILVTLVKSQSRKIVGGEPHRF----XXXXXXX 761
                S++ MQG+SADFA+HA+PS++KQRILVT  K    K    +  R            
Sbjct: 333  LAPGSLLVMQGKSADFAKHALPSVRKQRILVTFTKYCQPKKSTTDNQRLSSPSVSQSSQW 392

Query: 760  XXXPSRTPGQIR-PAPAKHFXXXXXXXXXXXXXXXXXXXXPNG---MYVATPVAPGIAYP 593
               PSR+P +IR  A  KH+                     +G   ++V T VAP I++P
Sbjct: 393  GPPPSRSPNRIRHSAGPKHYAVIPTTGVLPAPPIRPQIPPSSGVQPLFVPTAVAPAISFP 452

Query: 592  AAVXXXXXXXXXXXXXXXXXXXXXXXPGTGVFLPSQGQGPGNSTTNQPPSTENSATEDSV 413
            A V                       PGTGVFLP  G G  +S      +TE +   ++ 
Sbjct: 453  APV-PIPPGSTGWPAAPRHPPPRLPVPGTGVFLPPPGSGNSSSQQLSTTATELNILVETT 511

Query: 412  G---KMNGS-------RLPLTKDDEEAAQKECNGS-----GGVEILKEGEENESHDV 287
                K NGS         P  + D ++ +++CNGS      G  ++KE +    + V
Sbjct: 512  SPREKENGSVKPNHHTTSPRGRLDGKSPKQDCNGSVDGAGSGRALMKEEQHCADNSV 568


>ref|XP_007045467.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1
            [Theobroma cacao] gi|590697542|ref|XP_007045469.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 1 [Theobroma cacao] gi|508709402|gb|EOY01299.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 1 [Theobroma cacao] gi|508709404|gb|EOY01301.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 1 [Theobroma cacao]
          Length = 681

 Score =  281 bits (718), Expect = 9e-73
 Identities = 201/537 (37%), Positives = 268/537 (49%), Gaps = 44/537 (8%)
 Frame = -3

Query: 1765 GKEFRRGG---RGQR--VGLEVQNFGGEMNGKDLNNGYYKSNQKLTXXXXXXXXXXXXXX 1601
            GKEF+R G   +GQR  V  E QN G + +G        + N++ +              
Sbjct: 142  GKEFKRSGMGFKGQRMEVAKEGQNSGVDSDGNSTVTAVSERNERGSEKREEVKSCGEVGK 201

Query: 1600 XXXXXXXXXENGSATRQRSTQGDVTQADVDAEDTGSSSVDGSGLA------EKSNLEVSP 1439
                     E+   T  +   GD      D     +SS   + L       EK NL   P
Sbjct: 202  VEDKCSTFTEDKKDTGSKPHAGDAESVTEDVNGGCTSSYKENDLCSIQNQNEKQNLAAGP 261

Query: 1438 KSFVATEFYDGKSVXXXXXXXXXXX--------KLNNLVSDLRAAGKRGQLQ-GQTFVAL 1286
            K+FV  E +DGK V                    L +LV+DLRAAGKRGQLQ GQT+VA 
Sbjct: 262  KTFVGNEMFDGKMVNVVDGLKLYEELFDDKEVLDLVSLVNDLRAAGKRGQLQAGQTYVAA 321

Query: 1285 KRPMKGHGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLLTKNVVSIK 1106
            KRPMKGHGREMIQLG+PIADAP +DE AAG S+D +IE IP  LQD IERL+   V+++K
Sbjct: 322  KRPMKGHGREMIQLGLPIADAPLDDENAAGTSKDRRIEGIPPLLQDTIERLVNLQVMTVK 381

Query: 1105 PDSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEMSFGK-VIAADTPGNYXXXXXXX 929
            PDS IID++NEGDHSQP +WP WFG+PVC++ LT C+++FG+ VI AD PG+Y       
Sbjct: 382  PDSCIIDVYNEGDHSQPRMWPPWFGKPVCIMFLTECDITFGRVVIVADHPGDYRGSLKLS 441

Query: 928  XXXXSVISMQGRSADFARHAIPSLQKQRILVTLVKSQSRKIVGGEPHRF----XXXXXXX 761
                S++ MQG+SADFA+HA+PS++KQRILVT  K    K    +  R            
Sbjct: 442  LAPGSLLVMQGKSADFAKHALPSVRKQRILVTFTKYCQPKKSTTDNQRLSSPSVSQSSQW 501

Query: 760  XXXPSRTPGQIR-PAPAKHFXXXXXXXXXXXXXXXXXXXXPNG---MYVATPVAPGIAYP 593
               PSR+P +IR  A  KH+                     +G   ++V T VAP I++P
Sbjct: 502  GPPPSRSPNRIRHSAGPKHYAVIPTTGVLPAPPIRPQIPPSSGVQPLFVPTAVAPAISFP 561

Query: 592  AAVXXXXXXXXXXXXXXXXXXXXXXXPGTGVFLPSQGQGPGNSTTNQPPSTENSATEDSV 413
            A V                       PGTGVFLP  G G  +S      +TE +   ++ 
Sbjct: 562  APV-PIPPGSTGWPAAPRHPPPRLPVPGTGVFLPPPGSGNSSSQQLSTTATELNILVETT 620

Query: 412  G---KMNGS-------RLPLTKDDEEAAQKECNGS-----GGVEILKEGEENESHDV 287
                K NGS         P  + D ++ +++CNGS      G  ++KE +    + V
Sbjct: 621  SPREKENGSVKPNHHTTSPRGRLDGKSPKQDCNGSVDGAGSGRALMKEEQHCADNSV 677


>ref|XP_006355042.1| PREDICTED: uncharacterized protein LOC102600383 [Solanum tuberosum]
          Length = 638

 Score =  280 bits (717), Expect = 1e-72
 Identities = 195/521 (37%), Positives = 253/521 (48%), Gaps = 25/521 (4%)
 Frame = -3

Query: 1816 RQQKMVGFEGGARMGGVGKEFRRGGRGQRVGLEVQNF--GGEMNGKDLN-NGYYKSNQKL 1646
            +QQK  GF+GG +      E R G RG   G + +    G E  G++ + + + K+N   
Sbjct: 121  KQQK--GFDGGVKK----VEKRNGSRGGGGGWKSEGLKDGKESQGQNFSLDAHSKTNGVE 174

Query: 1645 TXXXXXXXXXXXXXXXXXXXXXXXENGSA-TRQRSTQGDVTQADV--DAEDTGSSSVDGS 1475
                                       S  T    +QG+V + D   D+   GSS+V+  
Sbjct: 175  KIDVVEVKQGEKKELAANPEANSSVKSSVCTEAGDSQGEVDKTDDKRDSNSEGSSNVESE 234

Query: 1474 GLA-----EKSNLEVSPKSFVATEFYDGKSVXXXXXXXXXXX--------KLNNLVSDLR 1334
              +     EK N  V PK+FVATE YDGK V                   KL  LV+DLR
Sbjct: 235  SHSIQVPTEKQN--VVPKTFVATEIYDGKPVNVVDGMKLYEELLSSSEVSKLLTLVNDLR 292

Query: 1333 AAGKRGQLQGQTFVALKRPMKGHGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVAL 1154
            AAG+RGQL  Q F+  KRPMKGHGREM+QLG+PI DAPPE+E A    +D K E IP   
Sbjct: 293  AAGRRGQLPAQAFIVSKRPMKGHGREMVQLGLPIVDAPPEEEAAISTYKDRKTEAIPGLF 352

Query: 1153 QDVIERLLTKNVVSIKPDSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEMSFGKVI 974
            QDVI++L     +S+KPD+ +IDIFNEGDHSQPH+WP W+GRP+ ++ LT CEM+FGKVI
Sbjct: 353  QDVIDQLSAMQALSVKPDACVIDIFNEGDHSQPHLWPYWYGRPISMLFLTDCEMTFGKVI 412

Query: 973  AADTPGNYXXXXXXXXXXXSVISMQGRSADFARHAIPSLQKQRILVTLVKSQSRKIVGGE 794
              D PG+Y           SV+ MQGRS +FA++AIPS +KQRILVT  K Q R+I   +
Sbjct: 413  GVDHPGDYRGSLKLSLAPGSVLVMQGRSTEFAKYAIPSTRKQRILVTFTKLQLRRIKSAD 472

Query: 793  PHRF---XXXXXXXXXXPSRTPGQI-RPAPAKHFXXXXXXXXXXXXXXXXXXXXPN--GM 632
              RF             PSR+P  I RP   KH+                     N   +
Sbjct: 473  SQRFPSSAGGPVSQWVPPSRSPNHIRRPFGPKHYGSMSTTGVLPIPGVRPQFAPANMQPI 532

Query: 631  YVATPVAPGIAYPAAVXXXXXXXXXXXXXXXXXXXXXXXPGTGVFLPSQGQGPGNSTTNQ 452
            +V   VAP + +PA V                       PGTGVFLP    G G S+T+ 
Sbjct: 533  FVPATVAPAMPFPAPVALPPASAGWAVPPLRHPPPRLPLPGTGVFLP---PGSGTSSTDN 589

Query: 451  PPSTENSATEDSVGKMNGSRLPLTKDDEEAAQKECNGSGGV 329
             P+ +     DS          +     E   +ECNG   V
Sbjct: 590  IPAEKAGPLSDSTVSQK-----VNSGSSEVQTQECNGKADV 625


>ref|XP_002311547.2| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|550333015|gb|EEE88914.2| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 675

 Score =  271 bits (692), Expect = 9e-70
 Identities = 172/423 (40%), Positives = 226/423 (53%), Gaps = 29/423 (6%)
 Frame = -3

Query: 1465 EKSNLEVSPKSFVATEFYDGKSVXXXXXXXXXXX--------KLNNLVSDLRAAGKRGQL 1310
            EK NL ++PK+FVA E  DG+ V                   KL +LV++LRA G+RGQ 
Sbjct: 248  EKQNLAITPKTFVAEEKIDGQMVNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQC 307

Query: 1309 QGQTFVALKRPMKGHGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLL 1130
            QGQT++  KRPMKGHGREMIQLG+PIADAP EDE A G S+   +E IP  LQDVIE  +
Sbjct: 308  QGQTYILSKRPMKGHGREMIQLGLPIADAPAEDENATGTSKGT-VESIPALLQDVIEHFV 366

Query: 1129 TKNVVSIKPDSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEMSFGKVIAADTPGNY 950
               V+++KPDS IIDI+NEGDHSQPH+WP WFG+PV V+ LT CE++FGKVI     G+Y
Sbjct: 367  AMQVMTMKPDSCIIDIYNEGDHSQPHMWPPWFGKPVSVLFLTECELTFGKVIDTLHHGDY 426

Query: 949  XXXXXXXXXXXSVISMQGRSADFARHAIPSLQKQRILVTLVKSQSRKIVGGE----PHRF 782
                       S++ MQG+S+D A+HAIP ++KQR+LVT  KSQ +K+   +    P   
Sbjct: 427  KGSLKLSVAPGSLLVMQGKSSDLAKHAIPMIKKQRMLVTFTKSQPKKLTSNDGPRLPSHA 486

Query: 781  XXXXXXXXXXPSRTPGQIRPAPAKHFXXXXXXXXXXXXXXXXXXXXPNG---MYVATPVA 611
                      PSR+P  +R    KH+                    PNG   +++ TPVA
Sbjct: 487  VAPSSHWGPPPSRSPNHLRHPVPKHYAAIPTTGVLLVPPIRPQIPPPNGVQPLFMTTPVA 546

Query: 610  PGIAYPAAV--XXXXXXXXXXXXXXXXXXXXXXXPGTGVFLPSQGQGPGNST-------- 461
              + +PA V                         PGTGVFLP  G G  +S         
Sbjct: 547  APMPFPAPVPIPPVSTGWPTSSPRHPSARLPVPIPGTGVFLPPPGSGNASSALQLSATAT 606

Query: 460  -TNQPPSTENSATEDSVGKMN--GSRLPLTKDDEEAAQKECNGS-GGVEILKEGEENESH 293
              N P  TE    E+  GK N   S  P  K  E+  +++ NG   G+ + KE +++ SH
Sbjct: 607  EMNFPTETEKE-KENGPGKSNHDTSASPKEKSAEKTQRQDSNGDVDGIAVKKEEQQSVSH 665

Query: 292  DVA 284
             VA
Sbjct: 666  TVA 668


>ref|XP_003524142.1| PREDICTED: uncharacterized protein LOC100809865 isoform X1 [Glycine
            max]
          Length = 681

 Score =  265 bits (676), Expect = 7e-68
 Identities = 165/441 (37%), Positives = 228/441 (51%), Gaps = 31/441 (7%)
 Frame = -3

Query: 1570 NGSATRQRSTQGDVTQADVDA--------EDTGSSSVDGSGLAEKSNLEVSPKSFVATEF 1415
            +GS    RST+G ++  + +A           G  S       +  +L    K+F+  E 
Sbjct: 216  DGSLKSTRSTEGSLSNLESEAVVNDECISNSKGDDSHSVQNQHQSQSLSTKAKTFIGNEM 275

Query: 1414 YDGKSVXXXXXXXXXXX--------KLNNLVSDLRAAGKRGQLQG-QTFVALKRPMKGHG 1262
            +DGK V                    L +LV+DLR +GK+GQLQG Q ++  +RPMKGHG
Sbjct: 276  FDGKMVNVVDGLKLYEDLFDSTEIANLVSLVNDLRVSGKKGQLQGSQAYIVSRRPMKGHG 335

Query: 1261 REMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLLTKNVVSIKPDSAIIDI 1082
            REMIQLG+PIADAP E E   GAS+D  +EPIP   QD+IER+++  V+++KPD  I+D 
Sbjct: 336  REMIQLGVPIADAPAEGENMTGASKDMNVEPIPSLFQDIIERMVSSQVMTVKPDCCIVDF 395

Query: 1081 FNEGDHSQPHIWPQWFGRPVCVISLTVCEMSFGKVIAADTPGNYXXXXXXXXXXXSVISM 902
            +NEGDHSQPH WP W+GRPV ++ LT CEM+FG+VIA++ PG+Y           S++ M
Sbjct: 396  YNEGDHSQPHSWPSWYGRPVYILFLTECEMTFGRVIASEHPGDYRGGIKLSLVPGSLLVM 455

Query: 901  QGRSADFARHAIPSLQKQRILVTLVKSQSRKIVGGEPHRF--XXXXXXXXXXPSRTPGQI 728
            +G+S+DFA+HA+PS++KQRILVT  KSQ RK +  +  R             PSR+P  +
Sbjct: 456  EGKSSDFAKHALPSVRKQRILVTFTKSQPRKSLSSDAQRLASTATSSHWGPLPSRSPNHV 515

Query: 727  R-PAPAKHFXXXXXXXXXXXXXXXXXXXXPNGM---YVATPVAPGIAYPAAV-XXXXXXX 563
            R    +KH+                    P GM   +V  PV P + +PA V        
Sbjct: 516  RHHVGSKHYATLPTTGVLPSPPIRPQMAAPVGMQPLFVTAPVVPPMPFPAPVAFPPGSTG 575

Query: 562  XXXXXXXXXXXXXXXXPGTGVFLPSQGQG------PGNSTTNQPPSTEN-SATEDSVGKM 404
                            PGTGVFLP  G G      P  +     PSTE  +  E   GK 
Sbjct: 576  WTGAPPPRHPPPRVPAPGTGVFLPPPGSGNSSQQLPAGTLAEVNPSTETPTMLEKENGKT 635

Query: 403  NGSRLPLTKDDEEAAQKECNG 341
            N +    +    +  ++ECNG
Sbjct: 636  NHNSTSASPKG-KVQKQECNG 655


>ref|XP_004142291.1| PREDICTED: uncharacterized protein LOC101210274 [Cucumis sativus]
            gi|449481289|ref|XP_004156139.1| PREDICTED:
            uncharacterized LOC101210274 [Cucumis sativus]
          Length = 684

 Score =  262 bits (669), Expect = 4e-67
 Identities = 166/417 (39%), Positives = 219/417 (52%), Gaps = 26/417 (6%)
 Frame = -3

Query: 1462 KSNLEVSPKSFVATEFYDGKSVXXXXXXXXXXX--------KLNNLVSDLRAAGKRGQLQ 1307
            K     +P++FVA+E +DGK V                   KL +LV+DLRA+GKRGQ Q
Sbjct: 261  KQYAATTPRTFVASEMFDGKMVNVMDGLKLFEELLDDAEVSKLLSLVNDLRASGKRGQFQ 320

Query: 1306 GQTFVALKRPMKGHGREMIQLGIPIADAPPEDEVAAGASRDPKIEPIPVALQDVIERLLT 1127
            GQT+V  KRPMKGHGREMIQLG PIADAP ED+ + G S+D +IEPIP  LQD+I+RL+ 
Sbjct: 321  GQTYVVSKRPMKGHGREMIQLGFPIADAPHEDDNSLGLSKDRRIEPIPSLLQDLIDRLVG 380

Query: 1126 KNVVSIKPDSAIIDIFNEGDHSQPHIWPQWFGRPVCVISLTVCEMSFGKVIAADTPGNYX 947
              V+++KPDS IID +NEGDHSQPH+WP WFGRPV V+ LT CE++FG+VI  D  GNY 
Sbjct: 381  DQVMTVKPDSCIIDFYNEGDHSQPHVWPSWFGRPVGVLLLTECEITFGRVIGTDHSGNYR 440

Query: 946  XXXXXXXXXXSVISMQGRSADFARHAIPSLQKQRILVTLVKSQSRKIVGGEPHRF---XX 776
                      +++ +QG+SADFA+HA+P+++KQRILVTL KSQ ++    +  R      
Sbjct: 441  GAMKLSLTPGNLLVVQGKSADFAKHALPAIRKQRILVTLTKSQPKRAAPADGQRTSLNVG 500

Query: 775  XXXXXXXXPSRTPGQIRPAPAKHFXXXXXXXXXXXXXXXXXXXXPNGM--YVATPVAPGI 602
                     +R+P        K +                    PNG+   +  PVA  +
Sbjct: 501  TFSGWGPPSARSPNPRLSPGQKPYPTVPSTGVLPVPPIRPQMAPPNGIPPLIVPPVASPM 560

Query: 601  AYPAAVXXXXXXXXXXXXXXXXXXXXXXXPGTGVFLPSQG--QGPGNSTTNQPPSTENSA 428
             +   V                       PGTGVFLP  G    P  S   Q P   N  
Sbjct: 561  PF-TPVPIPTGPSAWPTAHTRHPPPRLPVPGTGVFLPPPGSSSAPTPSPQQQLP-ISNIE 618

Query: 427  TEDSVGKMNG--------SRLPLTKDDEEAAQKECNGS---GGVEILKEGEENESHD 290
            T     K NG           P  K D +A ++ECNGS    G + +KE E+ +  +
Sbjct: 619  TGSLSEKENGLTKSDHSSGTFPGEKPDAKAQRQECNGSIDGSGNDKVKEEEQQQQQE 675

