BLASTX nr result

ID: Angelica27_contig00018652 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica27_contig00018652
         (1461 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_017253568.1 PREDICTED: wiskott-Aldrich syndrome protein homol...   558   0.0  
XP_017253567.1 PREDICTED: wiskott-Aldrich syndrome protein homol...   552   0.0  
XP_007028204.2 PREDICTED: uncharacterized protein LOC18598570 [T...   358   e-115
XP_002283542.1 PREDICTED: E3 ubiquitin-protein ligase Arkadia [V...   358   e-115
EOY08708.1 Zinc finger family protein, putative isoform 3 [Theob...   356   e-115
GAV74415.1 hypothetical protein CFOL_v3_17895 [Cephalotus follic...   355   e-114
EOY08706.1 Zinc finger family protein, putative isoform 1 [Theob...   356   e-114
XP_012089846.1 PREDICTED: uncharacterized protein LOC105648153 [...   350   e-112
XP_018843669.1 PREDICTED: uncharacterized protein LOC109008135 [...   346   e-110
KZM93644.1 hypothetical protein DCAR_016889 [Daucus carota subsp...   351   e-109
OAY44575.1 hypothetical protein MANES_08G162300 [Manihot esculenta]   342   e-109
XP_011471008.1 PREDICTED: uncharacterized protein LOC101292955 i...   337   e-107
XP_004309716.1 PREDICTED: uncharacterized protein LOC101292955 i...   337   e-107
XP_008244087.1 PREDICTED: uncharacterized protein LOC103342253 [...   336   e-107
XP_002323209.2 hypothetical protein POPTR_0016s02890g [Populus t...   335   e-106
KHG24544.1 Filamentous hemagglutinin [Gossypium arboreum]             334   e-106
XP_017631499.1 PREDICTED: uncharacterized protein LOC108474104 [...   332   e-105
XP_007204532.1 hypothetical protein PRUPE_ppa017564mg, partial [...   330   e-105
XP_018856757.1 PREDICTED: uncharacterized protein LOC109019002 i...   331   e-104
XP_018856756.1 PREDICTED: uncharacterized protein LOC109019002 i...   331   e-104

>XP_017253568.1 PREDICTED: wiskott-Aldrich syndrome protein homolog 1 isoform X2
            [Daucus carota subsp. sativus]
          Length = 515

 Score =  558 bits (1438), Expect = 0.0
 Identities = 307/450 (68%), Positives = 320/450 (71%), Gaps = 3/450 (0%)
 Frame = +2

Query: 119  MGKVGEEQPIPTSVSGQNSEQNAENDCRIKGFVGFRCXXXXXXXXXXXXXXXFWLPPFLR 298
            MGKVGEEQPIPTSVS QN    AEN CRIKG  GFRC               FWLPPF  
Sbjct: 1    MGKVGEEQPIPTSVSAQN----AENCCRIKGLFGFRCVFVLLLGVAVLLSAVFWLPPFFS 56

Query: 299  HGDPGDLDLASLFRDHSIVATFDVGKPVSVLEDNIMQLQDDIFDEIGLPATKVEVISLKP 478
            HGD GDLDLASLFRDH IVA F+VGKPVSVLEDNIMQLQDDIFDEIG+PATKVEVISLKP
Sbjct: 57   HGDRGDLDLASLFRDHYIVAAFNVGKPVSVLEDNIMQLQDDIFDEIGVPATKVEVISLKP 116

Query: 479  YAGSNITKVIFAVDPDVKGSKLTSAAKSLIKASFVSLVVNQSSLRLTTSLFGDPSSFDVL 658
            YAGSN+TKVIFAVDPDVKG+KL +AAKSLIKASF SLV+NQSSLRLTTSLFGDPSSFDVL
Sbjct: 117  YAGSNVTKVIFAVDPDVKGAKLNTAAKSLIKASFASLVINQSSLRLTTSLFGDPSSFDVL 176

Query: 659  KFVGGITVIPPQRAFLLQTVQIFFNFTLNFSIDQIKDNFSELTSQLKSGLHLAPYENLYI 838
            KFVGGITVIPPQRAFLLQTVQI FNFTLNFSIDQI+DNFSELTSQLKSGLHLAPYENLYI
Sbjct: 177  KFVGGITVIPPQRAFLLQTVQILFNFTLNFSIDQIQDNFSELTSQLKSGLHLAPYENLYI 236

Query: 839  SLINPKGXXXXXXXXXXXXXLLAVGNTPSIGRLKQLAQTITGSPTKNLGLNNTVFGKVKQ 1018
            SL N KG             LLAVGNTPS+GRLKQLAQTITGSPTKNLGLNNTVFGKVKQ
Sbjct: 237  SLTNQKGSTVVSPTTVQSLVLLAVGNTPSLGRLKQLAQTITGSPTKNLGLNNTVFGKVKQ 296

Query: 1019 VSLSSILQHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAPVPEPY---XXX 1189
            VSLSSILQH                                      APVPEPY      
Sbjct: 297  VSLSSILQH---SLHGSDGSPSPSPALSPQPPDHHHHQHHHYDHSSPAPVPEPYQHHHHG 353

Query: 1190 XXXXXXXXXXXXXXDMHISPAPSPMKXXXXXXXXXXXXXXXXXXNKKSHLADSPGCHNGY 1369
                          D+HISPA SP++                  NKKSH+AD PGC NGY
Sbjct: 354  HRRRHHHHHRRHDHDVHISPALSPVE-GGSTSTTGSPASAPSPANKKSHVADPPGCQNGY 412

Query: 1370 RNRSHRKTDKHGHIISPAAPPTSAHHISPS 1459
            RNR  R  +KH HIISPAAPPTSAHHISPS
Sbjct: 413  RNRPPRNANKHSHIISPAAPPTSAHHISPS 442


>XP_017253567.1 PREDICTED: wiskott-Aldrich syndrome protein homolog 1 isoform X1
            [Daucus carota subsp. sativus]
          Length = 519

 Score =  552 bits (1423), Expect = 0.0
 Identities = 307/454 (67%), Positives = 320/454 (70%), Gaps = 7/454 (1%)
 Frame = +2

Query: 119  MGKVGEEQPIPTSVSGQNSEQNAENDCRIKGFVGFRCXXXXXXXXXXXXXXXFWLPPFLR 298
            MGKVGEEQPIPTSVS QN    AEN CRIKG  GFRC               FWLPPF  
Sbjct: 1    MGKVGEEQPIPTSVSAQN----AENCCRIKGLFGFRCVFVLLLGVAVLLSAVFWLPPFFS 56

Query: 299  HGDPGDLDLASLFR----DHSIVATFDVGKPVSVLEDNIMQLQDDIFDEIGLPATKVEVI 466
            HGD GDLDLASLFR    DH IVA F+VGKPVSVLEDNIMQLQDDIFDEIG+PATKVEVI
Sbjct: 57   HGDRGDLDLASLFRVFYPDHYIVAAFNVGKPVSVLEDNIMQLQDDIFDEIGVPATKVEVI 116

Query: 467  SLKPYAGSNITKVIFAVDPDVKGSKLTSAAKSLIKASFVSLVVNQSSLRLTTSLFGDPSS 646
            SLKPYAGSN+TKVIFAVDPDVKG+KL +AAKSLIKASF SLV+NQSSLRLTTSLFGDPSS
Sbjct: 117  SLKPYAGSNVTKVIFAVDPDVKGAKLNTAAKSLIKASFASLVINQSSLRLTTSLFGDPSS 176

Query: 647  FDVLKFVGGITVIPPQRAFLLQTVQIFFNFTLNFSIDQIKDNFSELTSQLKSGLHLAPYE 826
            FDVLKFVGGITVIPPQRAFLLQTVQI FNFTLNFSIDQI+DNFSELTSQLKSGLHLAPYE
Sbjct: 177  FDVLKFVGGITVIPPQRAFLLQTVQILFNFTLNFSIDQIQDNFSELTSQLKSGLHLAPYE 236

Query: 827  NLYISLINPKGXXXXXXXXXXXXXLLAVGNTPSIGRLKQLAQTITGSPTKNLGLNNTVFG 1006
            NLYISL N KG             LLAVGNTPS+GRLKQLAQTITGSPTKNLGLNNTVFG
Sbjct: 237  NLYISLTNQKGSTVVSPTTVQSLVLLAVGNTPSLGRLKQLAQTITGSPTKNLGLNNTVFG 296

Query: 1007 KVKQVSLSSILQHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAPVPEPY-- 1180
            KVKQVSLSSILQH                                      APVPEPY  
Sbjct: 297  KVKQVSLSSILQH---SLHGSDGSPSPSPALSPQPPDHHHHQHHHYDHSSPAPVPEPYQH 353

Query: 1181 -XXXXXXXXXXXXXXXXXDMHISPAPSPMKXXXXXXXXXXXXXXXXXXNKKSHLADSPGC 1357
                              D+HISPA SP++                  NKKSH+AD PGC
Sbjct: 354  HHHGHRRRHHHHHRRHDHDVHISPALSPVE-GGSTSTTGSPASAPSPANKKSHVADPPGC 412

Query: 1358 HNGYRNRSHRKTDKHGHIISPAAPPTSAHHISPS 1459
             NGYRNR  R  +KH HIISPAAPPTSAHHISPS
Sbjct: 413  QNGYRNRPPRNANKHSHIISPAAPPTSAHHISPS 446


>XP_007028204.2 PREDICTED: uncharacterized protein LOC18598570 [Theobroma cacao]
          Length = 528

 Score =  358 bits (920), Expect = e-115
 Identities = 196/325 (60%), Positives = 226/325 (69%), Gaps = 16/325 (4%)
 Frame = +2

Query: 119  MGKVGEEQPIPTSVSGQNSEQNAEND----------------CRIKGFVGFRCXXXXXXX 250
            MGK  EEQ + TSV+ + S +NA                   C  K   G RC       
Sbjct: 1    MGKGEEEQRLSTSVNSEVSVENAGGPISSLSLLFAPSASACGCGCKSLFGLRCFLVLLLS 60

Query: 251  XXXXXXXXFWLPPFLRHGDPGDLDLASLFRDHSIVATFDVGKPVSVLEDNIMQLQDDIFD 430
                    FWLPPFL   D  DLDL S F+DH IVA FDV KPVS L DNI+QL++DIFD
Sbjct: 61   LALFLSALFWLPPFLNFSDQSDLDLDSRFKDHDIVAGFDVEKPVSFLGDNILQLENDIFD 120

Query: 431  EIGLPATKVEVISLKPYAGSNITKVIFAVDPDVKGSKLTSAAKSLIKASFVSLVVNQSSL 610
            EIG P +KV + SL+P AGSNITKV+FAVDPDV+ SK++S ++SLI+ASF SLV++Q SL
Sbjct: 121  EIGFPTSKVVISSLEPLAGSNITKVVFAVDPDVRYSKISSTSQSLIRASFESLVIHQPSL 180

Query: 611  RLTTSLFGDPSSFDVLKFVGGITVIPPQRAFLLQTVQIFFNFTLNFSIDQIKDNFSELTS 790
            RLT SLFG P  F+VLKF GGITVIPPQ AFLLQ VQI FNFTLNFSIDQI+ NF ++TS
Sbjct: 181  RLTESLFGVPRDFEVLKFPGGITVIPPQSAFLLQKVQILFNFTLNFSIDQIQGNFEKMTS 240

Query: 791  QLKSGLHLAPYENLYISLINPKGXXXXXXXXXXXXXLLAVGNTPSIGRLKQLAQTITGSP 970
            QLK+GL LA YENLYISL N KG             LLAVGNTPS+ RLKQLAQTITGS 
Sbjct: 241  QLKAGLRLATYENLYISLSNSKGSTVAPPTTVQSSVLLAVGNTPSMPRLKQLAQTITGSH 300

Query: 971  TKNLGLNNTVFGKVKQVSLSSILQH 1045
            ++NLGLNN +FG+VKQV LSSILQH
Sbjct: 301  SRNLGLNNNMFGRVKQVRLSSILQH 325


>XP_002283542.1 PREDICTED: E3 ubiquitin-protein ligase Arkadia [Vitis vinifera]
            CBI32839.3 unnamed protein product, partial [Vitis
            vinifera]
          Length = 529

 Score =  358 bits (919), Expect = e-115
 Identities = 195/310 (62%), Positives = 231/310 (74%), Gaps = 1/310 (0%)
 Frame = +2

Query: 119  MGKVGEEQPIPTS-VSGQNSEQNAENDCRIKGFVGFRCXXXXXXXXXXXXXXXFWLPPFL 295
            MGKV EEQP+P++ V  + S+QN  + CRI+G VGFRC               FWLPPFL
Sbjct: 1    MGKVEEEQPLPSAIVVSEPSDQNVGSRCRIRGRVGFRCVLALLLGAAVMLSAIFWLPPFL 60

Query: 296  RHGDPGDLDLASLFRDHSIVATFDVGKPVSVLEDNIMQLQDDIFDEIGLPATKVEVISLK 475
            ++ D  DLDL S FR H IVA+F V K +S+LED ++QL++DIF EI    +KV V+SL+
Sbjct: 61   QYADQRDLDLDSRFRGHDIVASFKVKKSISLLEDYLLQLENDIFVEIEGIESKVVVLSLE 120

Query: 476  PYAGSNITKVIFAVDPDVKGSKLTSAAKSLIKASFVSLVVNQSSLRLTTSLFGDPSSFDV 655
            P AG+NITKV+FAVD D K S++ ++ +SLI+  F SLV  QSSLRLT SLFGDP +F+V
Sbjct: 121  PSAGTNITKVVFAVDLDAKSSRILTS-QSLIRELFESLVTQQSSLRLTASLFGDPFTFEV 179

Query: 656  LKFVGGITVIPPQRAFLLQTVQIFFNFTLNFSIDQIKDNFSELTSQLKSGLHLAPYENLY 835
            LKF GGITV PPQ AFLLQ VQI FNFTLNFSI+QI +NF+ELTSQLKSGLHLA YENLY
Sbjct: 180  LKFPGGITVSPPQSAFLLQKVQILFNFTLNFSIEQILENFNELTSQLKSGLHLASYENLY 239

Query: 836  ISLINPKGXXXXXXXXXXXXXLLAVGNTPSIGRLKQLAQTITGSPTKNLGLNNTVFGKVK 1015
            ISL N KG             LLAVGNTPS+ RLKQLAQTITGS ++NLGLNNTVFG+VK
Sbjct: 240  ISLTNSKGSTVSPPTTVQSSVLLAVGNTPSLPRLKQLAQTITGSHSRNLGLNNTVFGRVK 299

Query: 1016 QVSLSSILQH 1045
            QV LSSILQH
Sbjct: 300  QVRLSSILQH 309


>EOY08708.1 Zinc finger family protein, putative isoform 3 [Theobroma cacao]
          Length = 507

 Score =  356 bits (914), Expect = e-115
 Identities = 195/325 (60%), Positives = 225/325 (69%), Gaps = 16/325 (4%)
 Frame = +2

Query: 119  MGKVGEEQPIPTSVSGQNSEQNAEND----------------CRIKGFVGFRCXXXXXXX 250
            MGK  EEQ + TSV+ + S +NA                   C  K   G RC       
Sbjct: 1    MGKGEEEQRLSTSVNSEVSVENAGGPISSLSLLFAPSASACGCGCKSLFGLRCFLVLLLS 60

Query: 251  XXXXXXXXFWLPPFLRHGDPGDLDLASLFRDHSIVATFDVGKPVSVLEDNIMQLQDDIFD 430
                    FWLPPFL   D  DLDL S F+DH IVA FDV KPVS L DNI+QL++DIFD
Sbjct: 61   LALFLSALFWLPPFLNFSDQSDLDLDSRFKDHDIVAGFDVEKPVSFLGDNILQLENDIFD 120

Query: 431  EIGLPATKVEVISLKPYAGSNITKVIFAVDPDVKGSKLTSAAKSLIKASFVSLVVNQSSL 610
            EIG P +KV + SL+P AGSNITKV+FAVDPDV+ SK++S ++SLI+ASF SLV++Q SL
Sbjct: 121  EIGFPTSKVVISSLEPLAGSNITKVVFAVDPDVRYSKISSTSQSLIRASFESLVIHQPSL 180

Query: 611  RLTTSLFGDPSSFDVLKFVGGITVIPPQRAFLLQTVQIFFNFTLNFSIDQIKDNFSELTS 790
            RLT  LFG P  F+VLKF GGITVIPPQ AFLLQ VQI FNFTLNFSIDQI+ NF ++TS
Sbjct: 181  RLTEFLFGVPRDFEVLKFPGGITVIPPQSAFLLQKVQILFNFTLNFSIDQIQGNFEKMTS 240

Query: 791  QLKSGLHLAPYENLYISLINPKGXXXXXXXXXXXXXLLAVGNTPSIGRLKQLAQTITGSP 970
            QLK+GL LA YENLYISL N KG             LLAVGNTPS+ RLKQLAQTITGS 
Sbjct: 241  QLKAGLRLATYENLYISLSNSKGSTVAPPTTVQSSVLLAVGNTPSMPRLKQLAQTITGSH 300

Query: 971  TKNLGLNNTVFGKVKQVSLSSILQH 1045
            ++NLGLNN +FG+VKQV LSSILQH
Sbjct: 301  SRNLGLNNNMFGRVKQVRLSSILQH 325


>GAV74415.1 hypothetical protein CFOL_v3_17895 [Cephalotus follicularis]
          Length = 504

 Score =  355 bits (912), Expect = e-114
 Identities = 217/451 (48%), Positives = 258/451 (57%), Gaps = 6/451 (1%)
 Frame = +2

Query: 119  MGKVGEEQPIPTSVSGQNSEQNAEND----CR-IKGFVGFRCXXXXXXXXXXXXXXXFWL 283
            MGK  +E  + TS+  +  E N E      C+ I   VG RC               FWL
Sbjct: 1    MGKAEDEPNLHTSLDNEALEHNVEATFGCGCKWIYRLVGLRCLLVLFLSVAVFLSAVFWL 60

Query: 284  PPFLRHGDPGDLDLASLFRDHSIVATFDVGKPVSVLEDNIMQLQDDIFDEIGLPATKVEV 463
            PPFL   D  DLDL S F+ H IVA+F++ KP+S+LEDNI QL+DDIF+EI +P  KV V
Sbjct: 61   PPFLPFADQRDLDLDSKFKGHDIVASFNLEKPLSLLEDNISQLEDDIFNEIVVPNIKVTV 120

Query: 464  ISLKPYAGSNITKVIFAVDPDVKGSKLTSAAKSLIKASFVSLVVNQSSLRLTTSLFGDPS 643
            +SL+P AGSNI KV+FAVDPDVK SK+   ++SLI+ASF  L++NQS+LRLT SLFG+P 
Sbjct: 121  LSLEPSAGSNIIKVVFAVDPDVKYSKMLPTSQSLIRASFEVLLLNQSTLRLTNSLFGEPF 180

Query: 644  SFDVLKFVGGITVIPPQRAFLLQTVQIFFNFTLNFSIDQIKDNFSELTSQLKSGLHLAPY 823
             F VLKF GGITVIPPQ AFLLQ VQI FNFTLNFSI Q++ NF ELTSQLKSGLHLAPY
Sbjct: 181  FFQVLKFPGGITVIPPQSAFLLQRVQILFNFTLNFSIYQVQVNFFELTSQLKSGLHLAPY 240

Query: 824  ENLYISLINPKGXXXXXXXXXXXXXLLAVGNTPSIGRLKQLAQTITGSPTKNLGLNNTVF 1003
            ENLYISL N KG             LL +GNTPS+ RLKQLAQTITGS ++NLGLN+TVF
Sbjct: 241  ENLYISLSNSKGSTVAPPTTVQSSVLLTIGNTPSMPRLKQLAQTITGSHSRNLGLNHTVF 300

Query: 1004 GKVKQVSLSSILQHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAPVPEPYX 1183
            G+VKQV LSSILQH                                      AP P P  
Sbjct: 301  GRVKQVRLSSILQH------------------------------SLHGGNGSAPSPAPLS 330

Query: 1184 XXXXXXXXXXXXXXXXDM-HISPAPSPMKXXXXXXXXXXXXXXXXXXNKKSHLADSPGCH 1360
                             +  +SPAP+ +                    +KS+ A  PGC 
Sbjct: 331  HSHHHHHHHHNHHNANSVPAMSPAPTTLTGAPAPAYGSPARENISPAPQKSYEAKPPGCR 390

Query: 1361 NGYRNRSHRKTDKHGHIISPAAPPTSAHHIS 1453
             G + R   K  K  H ISPA+ P    H S
Sbjct: 391  GGNKRRYKGKAGKRSH-ISPASDPNMPPHPS 420


>EOY08706.1 Zinc finger family protein, putative isoform 1 [Theobroma cacao]
            EOY08707.1 Zinc finger family protein, putative isoform 1
            [Theobroma cacao]
          Length = 527

 Score =  356 bits (914), Expect = e-114
 Identities = 195/325 (60%), Positives = 225/325 (69%), Gaps = 16/325 (4%)
 Frame = +2

Query: 119  MGKVGEEQPIPTSVSGQNSEQNAEND----------------CRIKGFVGFRCXXXXXXX 250
            MGK  EEQ + TSV+ + S +NA                   C  K   G RC       
Sbjct: 1    MGKGEEEQRLSTSVNSEVSVENAGGPISSLSLLFAPSASACGCGCKSLFGLRCFLVLLLS 60

Query: 251  XXXXXXXXFWLPPFLRHGDPGDLDLASLFRDHSIVATFDVGKPVSVLEDNIMQLQDDIFD 430
                    FWLPPFL   D  DLDL S F+DH IVA FDV KPVS L DNI+QL++DIFD
Sbjct: 61   LALFLSALFWLPPFLNFSDQSDLDLDSRFKDHDIVAGFDVEKPVSFLGDNILQLENDIFD 120

Query: 431  EIGLPATKVEVISLKPYAGSNITKVIFAVDPDVKGSKLTSAAKSLIKASFVSLVVNQSSL 610
            EIG P +KV + SL+P AGSNITKV+FAVDPDV+ SK++S ++SLI+ASF SLV++Q SL
Sbjct: 121  EIGFPTSKVVISSLEPLAGSNITKVVFAVDPDVRYSKISSTSQSLIRASFESLVIHQPSL 180

Query: 611  RLTTSLFGDPSSFDVLKFVGGITVIPPQRAFLLQTVQIFFNFTLNFSIDQIKDNFSELTS 790
            RLT  LFG P  F+VLKF GGITVIPPQ AFLLQ VQI FNFTLNFSIDQI+ NF ++TS
Sbjct: 181  RLTEFLFGVPRDFEVLKFPGGITVIPPQSAFLLQKVQILFNFTLNFSIDQIQGNFEKMTS 240

Query: 791  QLKSGLHLAPYENLYISLINPKGXXXXXXXXXXXXXLLAVGNTPSIGRLKQLAQTITGSP 970
            QLK+GL LA YENLYISL N KG             LLAVGNTPS+ RLKQLAQTITGS 
Sbjct: 241  QLKAGLRLATYENLYISLSNSKGSTVAPPTTVQSSVLLAVGNTPSMPRLKQLAQTITGSH 300

Query: 971  TKNLGLNNTVFGKVKQVSLSSILQH 1045
            ++NLGLNN +FG+VKQV LSSILQH
Sbjct: 301  SRNLGLNNNMFGRVKQVRLSSILQH 325


>XP_012089846.1 PREDICTED: uncharacterized protein LOC105648153 [Jatropha curcas]
            KDP22751.1 hypothetical protein JCGZ_01985 [Jatropha
            curcas]
          Length = 512

 Score =  350 bits (897), Expect = e-112
 Identities = 196/317 (61%), Positives = 226/317 (71%), Gaps = 8/317 (2%)
 Frame = +2

Query: 119  MGKVG--EEQPIPTSVSGQNSEQNAEND---CR---IKGFVGFRCXXXXXXXXXXXXXXX 274
            MGKVG  EEQ +PTS     S+Q+ E     C+   I  F+G RC               
Sbjct: 1    MGKVGVEEEQALPTS--DDTSDQDVERGFYGCKFEHIYRFIGVRCILVLLLSVAVFLSAV 58

Query: 275  FWLPPFLRHGDPGDLDLASLFRDHSIVATFDVGKPVSVLEDNIMQLQDDIFDEIGLPATK 454
            FWLPPFL   D G+LDL   F+DH I+A+F V K    LEDNI+QL+DDIFDEI  P+TK
Sbjct: 59   FWLPPFLHFADQGNLDLDPKFKDHDIIASFSVRKSADFLEDNILQLEDDIFDEISFPSTK 118

Query: 455  VEVISLKPYAGSNITKVIFAVDPDVKGSKLTSAAKSLIKASFVSLVVNQSSLRLTTSLFG 634
            V ++SL+P AG N TKV+F VDPD K SKL+S A+SLI+ASF  LVVNQS  RLT SLFG
Sbjct: 119  VVILSLEPSAGPNTTKVVFGVDPDAKYSKLSSTAQSLIRASFEFLVVNQS-FRLTKSLFG 177

Query: 635  DPSSFDVLKFVGGITVIPPQRAFLLQTVQIFFNFTLNFSIDQIKDNFSELTSQLKSGLHL 814
            DP SF+VLKF GGIT+IPPQ AFLLQ VQ+FFNFTLNFSI QI+ NF+ELTSQLKSGLHL
Sbjct: 178  DPFSFEVLKFPGGITIIPPQSAFLLQKVQVFFNFTLNFSIYQIQVNFAELTSQLKSGLHL 237

Query: 815  APYENLYISLINPKGXXXXXXXXXXXXXLLAVGNTPSIGRLKQLAQTITGSPTKNLGLNN 994
            APYENLYI L N +G             +LAVGNTPS  RLKQLAQTI+G  ++NLGLNN
Sbjct: 238  APYENLYIRLSNSQGSTVAPPTTVQSSVVLAVGNTPSRERLKQLAQTISGH-SRNLGLNN 296

Query: 995  TVFGKVKQVSLSSILQH 1045
            TVFGKVKQV LSS+LQH
Sbjct: 297  TVFGKVKQVRLSSVLQH 313


>XP_018843669.1 PREDICTED: uncharacterized protein LOC109008135 [Juglans regia]
          Length = 530

 Score =  346 bits (888), Expect = e-110
 Identities = 213/477 (44%), Positives = 257/477 (53%), Gaps = 31/477 (6%)
 Frame = +2

Query: 119  MGKVGEEQPIPTSVSGQNSEQNAEND-CRIKGF----------VGFRCXXXXXXXXXXXX 265
            MGK  E++ +P  V  Q+ +QNA+   C   GF          +G RC            
Sbjct: 1    MGKTEEDRAVPAGVGPQSQDQNAQTQWCGNCGFGWCCGGVRRLIGLRCLFFLLLSAAVFL 60

Query: 266  XXXFWLPPFLRHGDPGDLDLASLFRDHSIVATFDVGKPVSVLEDNIMQLQDDIFDEIGLP 445
               FWLPPFL+  D  DLDL S F+DH IVA+F + KP+S+LEDNI QL++DI +EIG  
Sbjct: 61   SAIFWLPPFLQFADQRDLDLDSKFKDHDIVASFYLKKPISLLEDNISQLEEDILNEIGFS 120

Query: 446  ATKVEVISLKPYAGSNITKVIFAVDPDVKGSKLTSAAKSLIKASFVSLVVNQSSLRLTTS 625
             TKV ++SL+P A SN TKV+F VDPD K S ++   +S I+A+FVSLV+ Q SLRLTT+
Sbjct: 121  TTKVVILSLEPIARSNTTKVVFGVDPDAKYSVISQTTQSFIRANFVSLVIQQLSLRLTTT 180

Query: 626  LFGDPSSFDVLKFVGGITVIPPQRAFLLQTVQIFFNFTLNFSIDQIKDNFSELTSQLKSG 805
            LFG+P  F+VLKF GGIT+IPPQ AFLLQ VQI FNFTLN  I QI+ NF++LTSQLKSG
Sbjct: 181  LFGEPFFFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNSPIYQIQLNFNKLTSQLKSG 240

Query: 806  LHLAPYENLYISLINPKGXXXXXXXXXXXXXLLAVGNTPSIGRLKQLAQTITGSPTKNLG 985
            L LAPYENLY+SL N KG             LLAVGNTPS  RLKQLAQTIT S ++NLG
Sbjct: 241  LRLAPYENLYVSLSNSKGSTVAAPTVVQSSVLLAVGNTPSTQRLKQLAQTITHSHSRNLG 300

Query: 986  LNNTVFGKVKQVSLSSILQHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAP 1165
            LNNTVFG+VKQV LSSILQH                                      AP
Sbjct: 301  LNNTVFGRVKQVRLSSILQH------------------------SLHGGDGSGPSPSPAP 336

Query: 1166 VPEP--YXXXXXXXXXXXXXXXXXDMHISP------------------APSPMKXXXXXX 1285
            +P P  +                 D H+ P                  AP+P        
Sbjct: 337  LPHPHHHRHHHHHHHHHHHHRHHHDTHLDPAISPAPSIERGVPAAQRGAPAPKDGSPAPK 396

Query: 1286 XXXXXXXXXXXXNKKSHLADSPGCHNGYRNRSHRKTDKHGHIISPAAPPTSAHHISP 1456
                         +KS+ A  PGC  GYR  S     KH H       PT A +ISP
Sbjct: 397  DGSPAPKRSLPAPEKSYEAKPPGCQLGYRRSSKGHARKHSHF-----APTVAPYISP 448


>KZM93644.1 hypothetical protein DCAR_016889 [Daucus carota subsp. sativus]
          Length = 797

 Score =  351 bits (900), Expect = e-109
 Identities = 181/207 (87%), Positives = 189/207 (91%)
 Frame = +2

Query: 404  MQLQDDIFDEIGLPATKVEVISLKPYAGSNITKVIFAVDPDVKGSKLTSAAKSLIKASFV 583
            MQLQDDIFDEIG+PATKVEVISLKPYAGSN+TKVIFAVDPDVKG+KL +AAKSLIKASF 
Sbjct: 1    MQLQDDIFDEIGVPATKVEVISLKPYAGSNVTKVIFAVDPDVKGAKLNTAAKSLIKASFA 60

Query: 584  SLVVNQSSLRLTTSLFGDPSSFDVLKFVGGITVIPPQRAFLLQTVQIFFNFTLNFSIDQI 763
            SLV+NQSSLRLTTSLFGDPSSFDVLKFVGGITVIPPQRAFLLQTVQI FNFTLNFSIDQI
Sbjct: 61   SLVINQSSLRLTTSLFGDPSSFDVLKFVGGITVIPPQRAFLLQTVQILFNFTLNFSIDQI 120

Query: 764  KDNFSELTSQLKSGLHLAPYENLYISLINPKGXXXXXXXXXXXXXLLAVGNTPSIGRLKQ 943
            +DNFSELTSQLKSGLHLAPYENLYISL N KG             LLAVGNTPS+GRLKQ
Sbjct: 121  QDNFSELTSQLKSGLHLAPYENLYISLTNQKGSTVVSPTTVQSLVLLAVGNTPSLGRLKQ 180

Query: 944  LAQTITGSPTKNLGLNNTVFGKVKQVS 1024
            LAQTITGSPTKNLGLNNTVFGKVKQ+S
Sbjct: 181  LAQTITGSPTKNLGLNNTVFGKVKQIS 207


>OAY44575.1 hypothetical protein MANES_08G162300 [Manihot esculenta]
          Length = 514

 Score =  342 bits (876), Expect = e-109
 Identities = 219/459 (47%), Positives = 254/459 (55%), Gaps = 12/459 (2%)
 Frame = +2

Query: 119  MGKVG--EEQPIPTSVSGQNSEQNAEN------DCRIKGF---VGFRCXXXXXXXXXXXX 265
            MGKVG  E+Q +PTS    +S+QNA+        C  KG    +G RC            
Sbjct: 1    MGKVGVEEDQALPTS--DDSSQQNAQRRSFGCCGCGFKGIFSLIGLRCVVVLLLSVALFL 58

Query: 266  XXXFWLPPFLRHGDPGDLDLASLFRDHSIVATFDVGKPVSVLEDNIMQLQDDIFDEIGLP 445
               FWLPPFL   D GDLDL   F+DH IVA+F+V K    LEDNI+QL+DDIFDEI  P
Sbjct: 59   SAVFWLPPFLHFVDQGDLDLDPRFKDHDIVASFNVEKSSPFLEDNILQLEDDIFDEISFP 118

Query: 446  ATKVEVISLKPYAGSNITKVIFAVDPDVKGSKLTSAAKSLIKASFVSLVVNQSSLRLTTS 625
            + KV ++SL+P AG N TKV+F VD D K SKL+S  +SLI+ASF  LVVNQS   LT  
Sbjct: 119  SIKVVILSLEPSAGPNTTKVVFGVDADAKYSKLSSTTESLIRASFEFLVVNQS-FHLTKP 177

Query: 626  LFGDPSSFDVLKFVGGITVIPPQRAFLLQTVQIFFNFTLNFSIDQIKDNFSELTSQLKSG 805
            LFGDP SF+VLKF GGIT+IPPQ AFLLQ  QI FNFTLNFSI QI+ NF+ELTSQLKSG
Sbjct: 178  LFGDPFSFEVLKFPGGITIIPPQSAFLLQKAQIRFNFTLNFSIYQIQVNFAELTSQLKSG 237

Query: 806  LHLAPYENLYISLINPKGXXXXXXXXXXXXXLLAVGNTPSIGRLKQLAQTITGSPTKNLG 985
            LHLAPYENLYISL N KG             +LA+GNTPS+ RLKQLAQTI G  ++NLG
Sbjct: 238  LHLAPYENLYISLSNSKGSTVAPPTTVQSSVVLAIGNTPSMRRLKQLAQTIAGH-SRNLG 296

Query: 986  LNNTVFGKVKQVSLSSILQHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAP 1165
            LNNTVFGKVKQV LSSILQH                                      AP
Sbjct: 297  LNNTVFGKVKQVRLSSILQH-------------------------SLHGGEGSPSPSPAP 331

Query: 1166 VPEP-YXXXXXXXXXXXXXXXXXDMHISPAPSPMKXXXXXXXXXXXXXXXXXXNKKSHLA 1342
            +P P Y                    ISPAP+                        S  A
Sbjct: 332  LPHPQYHHHHHHHHHHHHHNTYMAPSISPAPATQNGAPAPLEHLPGSPKNSPAPHYSK-A 390

Query: 1343 DSPGCHNGYRNRSHRKTDKHGHIISPAAPPTSAHHISPS 1459
              PGC  G   R      K  H ++P  PP  + +ISP+
Sbjct: 391  KPPGCQLGGNRRYPGSGRKRSH-LTPTVPPNISPYISPA 428


>XP_011471008.1 PREDICTED: uncharacterized protein LOC101292955 isoform X2 [Fragaria
            vesca subsp. vesca]
          Length = 507

 Score =  337 bits (863), Expect = e-107
 Identities = 183/313 (58%), Positives = 224/313 (71%), Gaps = 4/313 (1%)
 Frame = +2

Query: 119  MGKVGEEQPIPTSVSGQNSEQNAENDCR-IKGFVGFRCXXXXXXXXXXXXXXXFWLPPFL 295
            MGK   EQ + ++V  + S +NA   C  I+  +G RC               FWLPPFL
Sbjct: 1    MGKTEGEQGLGSTVGSEPSSRNAAACCPWIRTLIGLRCLLFLFLSLALFLSAIFWLPPFL 60

Query: 296  RHGDPGDLDLASLFRDHSIVATFDVGKPVSVLEDNIMQLQDDIFDEIGLPATKVEVISLK 475
            +  D GDLDL  +FRDH IVA+F++ KPVS++EDN++QL+D+IFDEI  P+TKV ++S++
Sbjct: 61   QFADQGDLDLDPVFRDHHIVASFNLFKPVSLVEDNVLQLEDNIFDEIVAPSTKVVILSVE 120

Query: 476  PYAGSN---ITKVIFAVDPDVKGSKLTSAAKSLIKASFVSLVVNQSSLRLTTSLFGDPSS 646
               GSN   +T+V+F VDPD K SKL   ++SLI+ASF  LV +QS L L TSLFG  S 
Sbjct: 121  SLDGSNHSNVTRVVFGVDPDPKSSKLLPTSQSLIRASFEYLVTHQS-LSLNTSLFGSTSF 179

Query: 647  FDVLKFVGGITVIPPQRAFLLQTVQIFFNFTLNFSIDQIKDNFSELTSQLKSGLHLAPYE 826
            F+VLKF GGIT+IPPQ+AFLLQ VQI FNFTLNFSI QI+ NF++L SQLKSGLHLAPYE
Sbjct: 180  FEVLKFPGGITIIPPQKAFLLQKVQILFNFTLNFSIYQIQLNFNDLKSQLKSGLHLAPYE 239

Query: 827  NLYISLINPKGXXXXXXXXXXXXXLLAVGNTPSIGRLKQLAQTITGSPTKNLGLNNTVFG 1006
            NLY+SL N KG             LL +GNTPS+ RLKQLAQTIT S ++NLGLNNTVFG
Sbjct: 240  NLYVSLSNSKGSTVAAPTTVQSSVLLTIGNTPSMQRLKQLAQTITHSHSRNLGLNNTVFG 299

Query: 1007 KVKQVSLSSILQH 1045
            KVKQV LSSILQH
Sbjct: 300  KVKQVRLSSILQH 312


>XP_004309716.1 PREDICTED: uncharacterized protein LOC101292955 isoform X1 [Fragaria
            vesca subsp. vesca]
          Length = 511

 Score =  337 bits (863), Expect = e-107
 Identities = 183/313 (58%), Positives = 224/313 (71%), Gaps = 4/313 (1%)
 Frame = +2

Query: 119  MGKVGEEQPIPTSVSGQNSEQNAENDCR-IKGFVGFRCXXXXXXXXXXXXXXXFWLPPFL 295
            MGK   EQ + ++V  + S +NA   C  I+  +G RC               FWLPPFL
Sbjct: 1    MGKTEGEQGLGSTVGSEPSSRNAAACCPWIRTLIGLRCLLFLFLSLALFLSAIFWLPPFL 60

Query: 296  RHGDPGDLDLASLFRDHSIVATFDVGKPVSVLEDNIMQLQDDIFDEIGLPATKVEVISLK 475
            +  D GDLDL  +FRDH IVA+F++ KPVS++EDN++QL+D+IFDEI  P+TKV ++S++
Sbjct: 61   QFADQGDLDLDPVFRDHHIVASFNLFKPVSLVEDNVLQLEDNIFDEIVAPSTKVVILSVE 120

Query: 476  PYAGSN---ITKVIFAVDPDVKGSKLTSAAKSLIKASFVSLVVNQSSLRLTTSLFGDPSS 646
               GSN   +T+V+F VDPD K SKL   ++SLI+ASF  LV +QS L L TSLFG  S 
Sbjct: 121  SLDGSNHSNVTRVVFGVDPDPKSSKLLPTSQSLIRASFEYLVTHQS-LSLNTSLFGSTSF 179

Query: 647  FDVLKFVGGITVIPPQRAFLLQTVQIFFNFTLNFSIDQIKDNFSELTSQLKSGLHLAPYE 826
            F+VLKF GGIT+IPPQ+AFLLQ VQI FNFTLNFSI QI+ NF++L SQLKSGLHLAPYE
Sbjct: 180  FEVLKFPGGITIIPPQKAFLLQKVQILFNFTLNFSIYQIQLNFNDLKSQLKSGLHLAPYE 239

Query: 827  NLYISLINPKGXXXXXXXXXXXXXLLAVGNTPSIGRLKQLAQTITGSPTKNLGLNNTVFG 1006
            NLY+SL N KG             LL +GNTPS+ RLKQLAQTIT S ++NLGLNNTVFG
Sbjct: 240  NLYVSLSNSKGSTVAAPTTVQSSVLLTIGNTPSMQRLKQLAQTITHSHSRNLGLNNTVFG 299

Query: 1007 KVKQVSLSSILQH 1045
            KVKQV LSSILQH
Sbjct: 300  KVKQVRLSSILQH 312


>XP_008244087.1 PREDICTED: uncharacterized protein LOC103342253 [Prunus mume]
          Length = 509

 Score =  336 bits (862), Expect = e-107
 Identities = 182/311 (58%), Positives = 219/311 (70%), Gaps = 5/311 (1%)
 Frame = +2

Query: 119  MGKVGEEQPIPTSVSGQNSEQNAENDCR-----IKGFVGFRCXXXXXXXXXXXXXXXFWL 283
            MGK  E+Q +P++V+ + S QNAE  C       + F+G RC               FWL
Sbjct: 1    MGKSEEDQALPSNVASEASAQNAEAHCAGCCGGFRRFIGLRCILVLLLSVALFLSAMFWL 60

Query: 284  PPFLRHGDPGDLDLASLFRDHSIVATFDVGKPVSVLEDNIMQLQDDIFDEIGLPATKVEV 463
            PPFL+  D  DLDL S F+DH IVA+FD+ KPVS+LEDNI+QL++DIFDEI  P+ KV +
Sbjct: 61   PPFLQFADQSDLDLDSKFKDHYIVASFDLWKPVSLLEDNILQLENDIFDEIVAPSIKVVI 120

Query: 464  ISLKPYAGSNITKVIFAVDPDVKGSKLTSAAKSLIKASFVSLVVNQSSLRLTTSLFGDPS 643
            +S++   GSN T V+F VDP+ K SKL   ++SLIKASF  LV +QS LRL TSLFG   
Sbjct: 121  LSVESLTGSNTTTVVFGVDPEPKSSKLLPTSQSLIKASFEYLVTHQS-LRLNTSLFGRTF 179

Query: 644  SFDVLKFVGGITVIPPQRAFLLQTVQIFFNFTLNFSIDQIKDNFSELTSQLKSGLHLAPY 823
             F+VLKF GGIT++PPQ AFLLQ VQI FNFTLNFSI QI+ NF EL SQLK+GLHLAPY
Sbjct: 180  LFEVLKFPGGITIVPPQNAFLLQKVQILFNFTLNFSIYQIQLNFDELKSQLKAGLHLAPY 239

Query: 824  ENLYISLINPKGXXXXXXXXXXXXXLLAVGNTPSIGRLKQLAQTITGSPTKNLGLNNTVF 1003
            ENLYISL N +G             LL VGNTPS+ RLKQL+QTI GS ++NLGLNNTVF
Sbjct: 240  ENLYISLSNSRGSTVAAPTTVRASVLLTVGNTPSMQRLKQLSQTIRGSHSRNLGLNNTVF 299

Query: 1004 GKVKQVSLSSI 1036
            G+VKQV LSSI
Sbjct: 300  GRVKQVRLSSI 310


>XP_002323209.2 hypothetical protein POPTR_0016s02890g [Populus trichocarpa]
            EEF04970.2 hypothetical protein POPTR_0016s02890g
            [Populus trichocarpa]
          Length = 516

 Score =  335 bits (858), Expect = e-106
 Identities = 214/466 (45%), Positives = 252/466 (54%), Gaps = 24/466 (5%)
 Frame = +2

Query: 119  MGKVG-------EEQPIPTSVSGQNSEQNAEND-----CR----IKGFVGFRCXXXXXXX 250
            MGKVG       EEQ I TS  G+N EQN E       C+    +  F+GFRC       
Sbjct: 1    MGKVGNSVNGSEEEQGIGTS--GENGEQNVERGFYCFGCKGNFSVTRFIGFRCVFVLLLS 58

Query: 251  XXXXXXXXFWLPPFLRHGDPGDLDLASLFRDHSIVATFDVGKPVSVLEDNIMQLQDDIFD 430
                    FWLPPFL   D GDLDL    +DH IVA+F V KPV +LEDN ++LQ DIFD
Sbjct: 59   VAVFLSAVFWLPPFLHFADQGDLDLDYRIKDHDIVASFLVKKPVFLLEDNKLKLQGDIFD 118

Query: 431  EIGLPATKVEVISLKPYAGSNITKVIFAVDPDVKGSKLTSAAKSLIKASFVSLVVNQSSL 610
            E+ +P TKV ++SL+P AGSN TKV+F VDP    SK++S  +SLI+ SFVSLVVN SSL
Sbjct: 119  EMRVPNTKVVILSLEPLAGSNRTKVVFGVDPLENDSKISSTDQSLIRGSFVSLVVNDSSL 178

Query: 611  RLTTSLFGDPSSFDVLKFVGGITVIPPQRAFLLQTVQIFFNFTLNFSIDQIKDNFSELTS 790
             LT SLFGD SSF+VLKF GGIT+IPPQRAFLLQ VQI FNFTLNFSI QI++ F+EL S
Sbjct: 179  ELTKSLFGDASSFEVLKFPGGITIIPPQRAFLLQKVQIPFNFTLNFSILQIREKFAELKS 238

Query: 791  QLKSGLHLAPYENLYISLINPKGXXXXXXXXXXXXXLLAVGNTPSIGRLKQLAQTITGSP 970
            QLK+GLHL P ENLYI L N +G             LL +GNTP   RLKQLAQTI G+ 
Sbjct: 239  QLKAGLHLTPIENLYIELWNSQGSTVSPPTTVKSSVLLVIGNTP---RLKQLAQTIRGN- 294

Query: 971  TKNLGLNNTVFGKVKQVSLSSILQHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1150
            +KNLGLNNT+FG+VKQV LSSILQH                                   
Sbjct: 295  SKNLGLNNTIFGRVKQVRLSSILQH------------------------------SLHGG 324

Query: 1151 XXXAPVPEP-----YXXXXXXXXXXXXXXXXXDMH---ISPAPSPMKXXXXXXXXXXXXX 1306
               AP P P     +                 D H   ISP P P +             
Sbjct: 325  EGSAPSPSPTSLPHHHHQHHHHHHHQHHHHHHDAHAPAISPIPPPKRSAPAPVDDSPAPL 384

Query: 1307 XXXXXNKKSHLADSPGCHNGYRNRSHRKTDKHGHIISPAAPPTSAH 1444
                    +H A+ PGC  G + R      K  H+    AP +  H
Sbjct: 385  KSSSAPHNNHEANPPGCQFGRKRRFTGNGGKRSHLAPSVAPSSPPH 430


>KHG24544.1 Filamentous hemagglutinin [Gossypium arboreum]
          Length = 509

 Score =  334 bits (856), Expect = e-106
 Identities = 186/317 (58%), Positives = 216/317 (68%), Gaps = 8/317 (2%)
 Frame = +2

Query: 119  MGKVGEEQPIPTSVSGQNSEQNAENDCRIKGFV--------GFRCXXXXXXXXXXXXXXX 274
            MGK  EEQ + ++VS + S   + +    +  V        G RC               
Sbjct: 1    MGKTEEEQRLSSNVSSEVSVVESSSTISTRFVVCGSKSTLFGLRCFFVLLFSLAIFLSAL 60

Query: 275  FWLPPFLRHGDPGDLDLASLFRDHSIVATFDVGKPVSVLEDNIMQLQDDIFDEIGLPATK 454
            FWLPPFL   D  DLDL S F+DH IVA+F V KPVS L DNI+QL++DIFDEIG P +K
Sbjct: 61   FWLPPFLHSSDHSDLDLDSRFKDHDIVASFKVEKPVSFLGDNILQLENDIFDEIGFPTSK 120

Query: 455  VEVISLKPYAGSNITKVIFAVDPDVKGSKLTSAAKSLIKASFVSLVVNQSSLRLTTSLFG 634
            V ++SL+P   SN+TKV+F VDPD + SK++  + SLIK+SF  LV++QSSL LT SLFG
Sbjct: 121  VVILSLEPLTESNVTKVVFGVDPDARYSKISPTSLSLIKSSFEYLVIHQSSLSLTKSLFG 180

Query: 635  DPSSFDVLKFVGGITVIPPQRAFLLQTVQIFFNFTLNFSIDQIKDNFSELTSQLKSGLHL 814
            +   F+VLKF GGITVIPPQ AFLLQ VQI FNFTLNFSI QI+  F EL SQLKSGLHL
Sbjct: 181  ESYFFEVLKFPGGITVIPPQSAFLLQKVQIHFNFTLNFSIYQIQLYFDELRSQLKSGLHL 240

Query: 815  APYENLYISLINPKGXXXXXXXXXXXXXLLAVGNTPSIGRLKQLAQTITGSPTKNLGLNN 994
            APYENLYI L N KG             LLAVGN PS  RLKQLAQTITGS +KNLGLN+
Sbjct: 241  APYENLYIILSNSKGSTVAPPTIVQSKVLLAVGNPPSTPRLKQLAQTITGSHSKNLGLNH 300

Query: 995  TVFGKVKQVSLSSILQH 1045
            TVFGKVKQV LSSILQH
Sbjct: 301  TVFGKVKQVRLSSILQH 317


>XP_017631499.1 PREDICTED: uncharacterized protein LOC108474104 [Gossypium arboreum]
          Length = 512

 Score =  332 bits (851), Expect = e-105
 Identities = 185/317 (58%), Positives = 215/317 (67%), Gaps = 8/317 (2%)
 Frame = +2

Query: 119  MGKVGEEQPIPTSVSGQNSEQNAENDCRIKGFV--------GFRCXXXXXXXXXXXXXXX 274
            MGK  EEQ + ++VS + S   + +    +  V        G RC               
Sbjct: 1    MGKTEEEQRLSSNVSSEVSVVESSSTISTRFVVCGSKSTLFGLRCFFVLLFSLAIFLSAL 60

Query: 275  FWLPPFLRHGDPGDLDLASLFRDHSIVATFDVGKPVSVLEDNIMQLQDDIFDEIGLPATK 454
            FWLPPFL   D  DLDL S F+DH IVA+F V KPVS L DNI+QL++DIFDEIG P +K
Sbjct: 61   FWLPPFLHSSDHSDLDLDSRFKDHDIVASFKVEKPVSFLGDNILQLENDIFDEIGFPTSK 120

Query: 455  VEVISLKPYAGSNITKVIFAVDPDVKGSKLTSAAKSLIKASFVSLVVNQSSLRLTTSLFG 634
            V ++SL+P   SN+T V+F VDPD + SK++  + SLIK+SF  LV++QSSL LT SLFG
Sbjct: 121  VVILSLEPLTESNVTNVVFGVDPDARYSKISPTSLSLIKSSFEYLVIHQSSLSLTKSLFG 180

Query: 635  DPSSFDVLKFVGGITVIPPQRAFLLQTVQIFFNFTLNFSIDQIKDNFSELTSQLKSGLHL 814
            +   F+VLKF GGITVIPPQ AFLLQ VQI FNFTLNFSI QI+  F EL SQLKSGLHL
Sbjct: 181  ESYFFEVLKFPGGITVIPPQSAFLLQKVQIHFNFTLNFSIYQIQLYFDELRSQLKSGLHL 240

Query: 815  APYENLYISLINPKGXXXXXXXXXXXXXLLAVGNTPSIGRLKQLAQTITGSPTKNLGLNN 994
            APYENLYI L N KG             LLAVGN PS  RLKQLAQTITGS +KNLGLN+
Sbjct: 241  APYENLYIILSNSKGSTVAPPTIVQSKVLLAVGNPPSTPRLKQLAQTITGSHSKNLGLNH 300

Query: 995  TVFGKVKQVSLSSILQH 1045
            TVFGKVKQV LSSILQH
Sbjct: 301  TVFGKVKQVRLSSILQH 317


>XP_007204532.1 hypothetical protein PRUPE_ppa017564mg, partial [Prunus persica]
          Length = 456

 Score =  330 bits (845), Expect = e-105
 Identities = 178/311 (57%), Positives = 218/311 (70%), Gaps = 5/311 (1%)
 Frame = +2

Query: 119  MGKVGEEQPIPTSVSGQNSEQNAENDCR-----IKGFVGFRCXXXXXXXXXXXXXXXFWL 283
            MGK  E+Q +P++V+ + S QNAE  C       + F+G RC               FWL
Sbjct: 1    MGKSEEDQALPSNVASEASAQNAEAHCAGCCGGFRRFIGLRCILVLLLSVALFLSAMFWL 60

Query: 284  PPFLRHGDPGDLDLASLFRDHSIVATFDVGKPVSVLEDNIMQLQDDIFDEIGLPATKVEV 463
            PPFL+  D  DLDL S F+DH IVA+F++ KPVS+LEDNI+QL++DIFDEI  P+ KV +
Sbjct: 61   PPFLQFADQSDLDLDSKFKDHYIVASFNLWKPVSLLEDNILQLENDIFDEIVAPSIKVVI 120

Query: 464  ISLKPYAGSNITKVIFAVDPDVKGSKLTSAAKSLIKASFVSLVVNQSSLRLTTSLFGDPS 643
            +S++   GSN T V+F VDP+ K SKL   ++SLIK+SF  LV +QS L L TSLFG   
Sbjct: 121  LSVESLTGSNTTTVVFGVDPEPKSSKLLPTSQSLIKSSFEYLVTHQS-LSLNTSLFGRTF 179

Query: 644  SFDVLKFVGGITVIPPQRAFLLQTVQIFFNFTLNFSIDQIKDNFSELTSQLKSGLHLAPY 823
             F+VLKF GGIT++PPQ AFLLQ VQI FNFTLNFSI QI+ NF+EL SQLK+GLHLAPY
Sbjct: 180  LFEVLKFPGGITIVPPQNAFLLQKVQILFNFTLNFSIYQIQLNFNELKSQLKAGLHLAPY 239

Query: 824  ENLYISLINPKGXXXXXXXXXXXXXLLAVGNTPSIGRLKQLAQTITGSPTKNLGLNNTVF 1003
            ENLYISL N +G              L VGNTPS+ RLKQL+QTI GS ++NLGLNNTVF
Sbjct: 240  ENLYISLSNSRGSTVAAPTTVRASVFLTVGNTPSMQRLKQLSQTIRGSHSRNLGLNNTVF 299

Query: 1004 GKVKQVSLSSI 1036
            G+VKQV LSSI
Sbjct: 300  GRVKQVRLSSI 310


>XP_018856757.1 PREDICTED: uncharacterized protein LOC109019002 isoform X2 [Juglans
            regia]
          Length = 534

 Score =  331 bits (849), Expect = e-104
 Identities = 205/474 (43%), Positives = 255/474 (53%), Gaps = 27/474 (5%)
 Frame = +2

Query: 119  MGKVGEEQPIPTSVSGQNSEQNAEN---------DCR-------IKGFVGFRCXXXXXXX 250
            MGK  +E  +P  V  Q+ +QN +          +C        ++  +G RC       
Sbjct: 1    MGKAEDEHAVPPGVGPQSPDQNEQTQWCGGGGYGNCGFGWCFGGVRRLIGLRCLFVLLFA 60

Query: 251  XXXXXXXXFWLPPFLRHGDPGDLDLASLFRDHSIVATFDVGKPVSVLEDNIMQLQDDIFD 430
                    FWLPPFL+  D  D DL   F+DH IVA+F + KP  +LEDNI QL++DI++
Sbjct: 61   TAAFLSAIFWLPPFLQFADQRDPDLDPKFKDHDIVASFYLKKPFYLLEDNISQLKEDIWN 120

Query: 431  EIGLPATKVEVISLKPYAGSNITKVIFAVDPDVKGSKLTSAAKSLIKASFVSLVVNQSSL 610
            EIG P TKV ++SL+   GSN TKV+F VDPD K S+++   +SLI+ SFVSLV+   SL
Sbjct: 121  EIGFPTTKVVILSLEYKDGSNTTKVVFGVDPDAKYSEISRTIRSLIRESFVSLVLRLYSL 180

Query: 611  RLTTSLFGDPSSFDVLKFVGGITVIPPQRAFLLQTVQIFFNFTLNFSIDQIKDNFSELTS 790
            RLTT+LFG+P  F+VLKF GGIT+IP QRAFLLQ VQI FNFTLNFSI +I+ NF EL S
Sbjct: 181  RLTTTLFGEPFLFEVLKFPGGITIIPYQRAFLLQKVQILFNFTLNFSIYEIQSNFDELRS 240

Query: 791  QLKSGLHLAPYENLYISLINPKGXXXXXXXXXXXXXLLAVGNTPSIGRLKQLAQTITGSP 970
            QLKSGLHLAPYENLY+SL N KG             +LAVGNTPS  RLKQLAQTITGS 
Sbjct: 241  QLKSGLHLAPYENLYVSLSNSKGSTVAAPTVVQSSVVLAVGNTPSTQRLKQLAQTITGSH 300

Query: 971  TKNLGLNNTVFGKVKQVSLSSILQHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1150
            ++NLGLNNTVFG+VKQV LSSILQH                                   
Sbjct: 301  SRNLGLNNTVFGRVKQVRLSSILQH------------------------SLHGGDGSGTS 336

Query: 1151 XXXAPVPEPYXXXXXXXXXXXXXXXXXDMHISPAPSPM-----------KXXXXXXXXXX 1297
               AP+P P+                 D H++PA SP                       
Sbjct: 337  PSPAPLPHPH-------HNHHHHHHHRDTHLAPAISPAPSIERGVRATPNGAPTPRVASP 389

Query: 1298 XXXXXXXXNKKSHLADSPGCHNGYRNRSHRKTDKHGHIISPAAPPTSAHHISPS 1459
                     +KS+ A  PGC  G    S     KH H ++P   P  +H+ + S
Sbjct: 390  DPKRSVPAPEKSYGAKPPGCQLGNGRSSKGYARKHPH-LAPTVAPYISHYPAAS 442


>XP_018856756.1 PREDICTED: uncharacterized protein LOC109019002 isoform X1 [Juglans
            regia]
          Length = 537

 Score =  331 bits (849), Expect = e-104
 Identities = 205/474 (43%), Positives = 255/474 (53%), Gaps = 27/474 (5%)
 Frame = +2

Query: 119  MGKVGEEQPIPTSVSGQNSEQNAEN---------DCR-------IKGFVGFRCXXXXXXX 250
            MGK  +E  +P  V  Q+ +QN +          +C        ++  +G RC       
Sbjct: 1    MGKAEDEHAVPPGVGPQSPDQNEQTQWCGGGGYGNCGFGWCFGGVRRLIGLRCLFVLLFA 60

Query: 251  XXXXXXXXFWLPPFLRHGDPGDLDLASLFRDHSIVATFDVGKPVSVLEDNIMQLQDDIFD 430
                    FWLPPFL+  D  D DL   F+DH IVA+F + KP  +LEDNI QL++DI++
Sbjct: 61   TAAFLSAIFWLPPFLQFADQRDPDLDPKFKDHDIVASFYLKKPFYLLEDNISQLKEDIWN 120

Query: 431  EIGLPATKVEVISLKPYAGSNITKVIFAVDPDVKGSKLTSAAKSLIKASFVSLVVNQSSL 610
            EIG P TKV ++SL+   GSN TKV+F VDPD K S+++   +SLI+ SFVSLV+   SL
Sbjct: 121  EIGFPTTKVVILSLEYKDGSNTTKVVFGVDPDAKYSEISRTIRSLIRESFVSLVLRLYSL 180

Query: 611  RLTTSLFGDPSSFDVLKFVGGITVIPPQRAFLLQTVQIFFNFTLNFSIDQIKDNFSELTS 790
            RLTT+LFG+P  F+VLKF GGIT+IP QRAFLLQ VQI FNFTLNFSI +I+ NF EL S
Sbjct: 181  RLTTTLFGEPFLFEVLKFPGGITIIPYQRAFLLQKVQILFNFTLNFSIYEIQSNFDELRS 240

Query: 791  QLKSGLHLAPYENLYISLINPKGXXXXXXXXXXXXXLLAVGNTPSIGRLKQLAQTITGSP 970
            QLKSGLHLAPYENLY+SL N KG             +LAVGNTPS  RLKQLAQTITGS 
Sbjct: 241  QLKSGLHLAPYENLYVSLSNSKGSTVAAPTVVQSSVVLAVGNTPSTQRLKQLAQTITGSH 300

Query: 971  TKNLGLNNTVFGKVKQVSLSSILQHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1150
            ++NLGLNNTVFG+VKQV LSSILQH                                   
Sbjct: 301  SRNLGLNNTVFGRVKQVRLSSILQH------------------------SLHGGDGSGTS 336

Query: 1151 XXXAPVPEPYXXXXXXXXXXXXXXXXXDMHISPAPSPM-----------KXXXXXXXXXX 1297
               AP+P P+                 D H++PA SP                       
Sbjct: 337  PSPAPLPHPH-------HNHHHHHHHRDTHLAPAISPAPSIERGVRATPNGAPTPRVASP 389

Query: 1298 XXXXXXXXNKKSHLADSPGCHNGYRNRSHRKTDKHGHIISPAAPPTSAHHISPS 1459
                     +KS+ A  PGC  G    S     KH H ++P   P  +H+ + S
Sbjct: 390  DPKRSVPAPEKSYGAKPPGCQLGNGRSSKGYARKHPH-LAPTVAPYISHYPAAS 442

