BLASTX nr result

ID: Acanthopanax23_contig00005227 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Acanthopanax23_contig00005227
         (871 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_017229408.1| PREDICTED: uncharacterized protein LOC108204...   320   2e-98
gb|KZN10686.1| hypothetical protein DCAR_003342 [Daucus carota s...   320   4e-98
ref|XP_017229407.1| PREDICTED: uncharacterized protein LOC108204...   320   4e-98
gb|KZN09286.1| hypothetical protein DCAR_001942 [Daucus carota s...   268   3e-80
gb|KZM90482.1| hypothetical protein DCAR_022153 [Daucus carota s...   270   4e-79
ref|XP_017258368.1| PREDICTED: uncharacterized protein LOC108227...   270   4e-79
ref|XP_017229084.1| PREDICTED: uncharacterized protein LOC108204...   268   7e-79
ref|XP_017229083.1| PREDICTED: uncharacterized protein LOC108204...   268   8e-79
ref|XP_021622741.1| uncharacterized protein LOC110622519 [Maniho...   264   4e-77
ref|XP_023887786.1| uncharacterized protein LOC111999912 [Quercu...   257   1e-74
ref|XP_021658683.1| uncharacterized protein LOC110648673 [Hevea ...   257   2e-74
ref|XP_007020229.2| PREDICTED: uncharacterized protein LOC185931...   256   2e-74
gb|EOY17454.1| Tudor/PWWP/MBT superfamily protein, putative [The...   256   2e-74
ref|XP_021300090.1| LOW QUALITY PROTEIN: uncharacterized protein...   255   4e-74
ref|XP_012089027.1| uncharacterized protein LOC105647517 isoform...   249   1e-71
gb|KHN20237.1| hypothetical protein glysoja_023800 [Glycine soja]     244   2e-71
ref|XP_010103359.1| uncharacterized protein LOC21386111 [Morus n...   247   5e-71
ref|XP_010104924.1| uncharacterized protein LOC21390089 [Morus n...   246   9e-71
ref|XP_003555609.1| PREDICTED: uncharacterized protein LOC100792...   244   2e-70
emb|CBI39497.3| unnamed protein product, partial [Vitis vinifera]     243   6e-70

>ref|XP_017229408.1| PREDICTED: uncharacterized protein LOC108204464 isoform X2 [Daucus
            carota subsp. sativus]
          Length = 1053

 Score =  320 bits (821), Expect = 2e-98
 Identities = 173/269 (64%), Positives = 202/269 (75%), Gaps = 5/269 (1%)
 Frame = -3

Query: 869  DDPTKGGRKRGPSDRQXXXXXXXXXXINDMKVLTADKKAAQKTFALPRGD-IKETSGPTI 693
            DDP+ GGRKRGPSDR           I+DMKVLT++KK  QKT  + R D IKETSGPT+
Sbjct: 686  DDPSTGGRKRGPSDRLEEIVAKKKKKISDMKVLTSEKKTVQKTPIVQRADGIKETSGPTV 745

Query: 692  KSVKQEPLKKPEPP-SRAPEPTMLVMKFPPQGTLPSIMELKARFARFGQMDHSATRIFWK 516
            +S+K  PL+KPE   +RAP+P MLVMKFPPQGTLPSIMELKARFARFGQ+DHSATRIFWK
Sbjct: 746  RSLKPAPLRKPETYYARAPDPVMLVMKFPPQGTLPSIMELKARFARFGQLDHSATRIFWK 805

Query: 515  SLTCRVVYRHKADAQSAYRFAVGSSNLFGNTGVRCFLRE--LGAPETEPGKTQREDSLMG 342
            +LTCR+VYRH+ADA+SA +FA  +S LFGN GV+C+ RE  + A  TEPG+ Q+EDS  G
Sbjct: 806  TLTCRLVYRHRADAESACKFASSNSTLFGNVGVKCYTREVDVAASVTEPGQAQKEDSSKG 865

Query: 341  TSQARDFAVERRXXXXXXXXXXXXXPGVQLKSILKKSTGDETSG-NGGGRGTPRVKFMLG 165
            TSQ+RD AVE+R              G Q KSILKKS GDET G NGGG+GT RVKF+LG
Sbjct: 866  TSQSRDLAVEQR-PATSLTSRTLQQSGSQPKSILKKSNGDETGGTNGGGKGT-RVKFILG 923

Query: 164  GEENSRGGEELMIGNKNFNNNASFADGGA 78
             EE +RGGE L+ GNKN NNNA F DGGA
Sbjct: 924  EEETNRGGEHLITGNKNINNNAVFVDGGA 952


>gb|KZN10686.1| hypothetical protein DCAR_003342 [Daucus carota subsp. sativus]
          Length = 1101

 Score =  320 bits (821), Expect = 4e-98
 Identities = 173/269 (64%), Positives = 202/269 (75%), Gaps = 5/269 (1%)
 Frame = -3

Query: 869  DDPTKGGRKRGPSDRQXXXXXXXXXXINDMKVLTADKKAAQKTFALPRGD-IKETSGPTI 693
            DDP+ GGRKRGPSDR           I+DMKVLT++KK  QKT  + R D IKETSGPT+
Sbjct: 734  DDPSTGGRKRGPSDRLEEIVAKKKKKISDMKVLTSEKKTVQKTPIVQRADGIKETSGPTV 793

Query: 692  KSVKQEPLKKPEPP-SRAPEPTMLVMKFPPQGTLPSIMELKARFARFGQMDHSATRIFWK 516
            +S+K  PL+KPE   +RAP+P MLVMKFPPQGTLPSIMELKARFARFGQ+DHSATRIFWK
Sbjct: 794  RSLKPAPLRKPETYYARAPDPVMLVMKFPPQGTLPSIMELKARFARFGQLDHSATRIFWK 853

Query: 515  SLTCRVVYRHKADAQSAYRFAVGSSNLFGNTGVRCFLRE--LGAPETEPGKTQREDSLMG 342
            +LTCR+VYRH+ADA+SA +FA  +S LFGN GV+C+ RE  + A  TEPG+ Q+EDS  G
Sbjct: 854  TLTCRLVYRHRADAESACKFASSNSTLFGNVGVKCYTREVDVAASVTEPGQAQKEDSSKG 913

Query: 341  TSQARDFAVERRXXXXXXXXXXXXXPGVQLKSILKKSTGDETSG-NGGGRGTPRVKFMLG 165
            TSQ+RD AVE+R              G Q KSILKKS GDET G NGGG+GT RVKF+LG
Sbjct: 914  TSQSRDLAVEQR-PATSLTSRTLQQSGSQPKSILKKSNGDETGGTNGGGKGT-RVKFILG 971

Query: 164  GEENSRGGEELMIGNKNFNNNASFADGGA 78
             EE +RGGE L+ GNKN NNNA F DGGA
Sbjct: 972  EEETNRGGEHLITGNKNINNNAVFVDGGA 1000


>ref|XP_017229407.1| PREDICTED: uncharacterized protein LOC108204464 isoform X1 [Daucus
            carota subsp. sativus]
          Length = 1105

 Score =  320 bits (821), Expect = 4e-98
 Identities = 173/269 (64%), Positives = 202/269 (75%), Gaps = 5/269 (1%)
 Frame = -3

Query: 869  DDPTKGGRKRGPSDRQXXXXXXXXXXINDMKVLTADKKAAQKTFALPRGD-IKETSGPTI 693
            DDP+ GGRKRGPSDR           I+DMKVLT++KK  QKT  + R D IKETSGPT+
Sbjct: 738  DDPSTGGRKRGPSDRLEEIVAKKKKKISDMKVLTSEKKTVQKTPIVQRADGIKETSGPTV 797

Query: 692  KSVKQEPLKKPEPP-SRAPEPTMLVMKFPPQGTLPSIMELKARFARFGQMDHSATRIFWK 516
            +S+K  PL+KPE   +RAP+P MLVMKFPPQGTLPSIMELKARFARFGQ+DHSATRIFWK
Sbjct: 798  RSLKPAPLRKPETYYARAPDPVMLVMKFPPQGTLPSIMELKARFARFGQLDHSATRIFWK 857

Query: 515  SLTCRVVYRHKADAQSAYRFAVGSSNLFGNTGVRCFLRE--LGAPETEPGKTQREDSLMG 342
            +LTCR+VYRH+ADA+SA +FA  +S LFGN GV+C+ RE  + A  TEPG+ Q+EDS  G
Sbjct: 858  TLTCRLVYRHRADAESACKFASSNSTLFGNVGVKCYTREVDVAASVTEPGQAQKEDSSKG 917

Query: 341  TSQARDFAVERRXXXXXXXXXXXXXPGVQLKSILKKSTGDETSG-NGGGRGTPRVKFMLG 165
            TSQ+RD AVE+R              G Q KSILKKS GDET G NGGG+GT RVKF+LG
Sbjct: 918  TSQSRDLAVEQR-PATSLTSRTLQQSGSQPKSILKKSNGDETGGTNGGGKGT-RVKFILG 975

Query: 164  GEENSRGGEELMIGNKNFNNNASFADGGA 78
             EE +RGGE L+ GNKN NNNA F DGGA
Sbjct: 976  EEETNRGGEHLITGNKNINNNAVFVDGGA 1004


>gb|KZN09286.1| hypothetical protein DCAR_001942 [Daucus carota subsp. sativus]
          Length = 829

 Score =  268 bits (685), Expect = 3e-80
 Identities = 142/265 (53%), Positives = 181/265 (68%), Gaps = 2/265 (0%)
 Frame = -3

Query: 869  DDPTKGGRKRGPSDRQXXXXXXXXXXINDMKVLTADKKAAQKTFALPRGDIKETSGPTIK 690
            DDP KGGRK  PSDRQ          ++   ++  D++ A+K+  + RG++KETS   +K
Sbjct: 465  DDPAKGGRKHVPSDRQEEIAAKRRKILDAKPLI--DQRTAEKSSLMQRGEVKETSALKMK 522

Query: 689  SVKQEPLKKPEPPSRAPEPTMLVMKFPPQGTLPSIMELKARFARFGQMDHSATRIFWKSL 510
            SVK E +KK +PPSR  +P MLV+KFP +GTLPSI  LKARF RFGQ+DHSA R+FWKS 
Sbjct: 523  SVKPELVKKVQPPSRESDPAMLVLKFPLRGTLPSINALKARFIRFGQLDHSACRVFWKSS 582

Query: 509  TCRVVYRHKADAQSAYRFAVGSSNLFGNTGVRCFLREL--GAPETEPGKTQREDSLMGTS 336
            TC VV+RHK DAQ+AY +A  S+N+FG TG++C+L+E+   A E + GK  +++ LM TS
Sbjct: 583  TCLVVFRHKVDAQAAYNYAASSTNMFGATGIKCYLQEMEVAASEQKSGKVLQDEVLMSTS 642

Query: 335  QARDFAVERRXXXXXXXXXXXXXPGVQLKSILKKSTGDETSGNGGGRGTPRVKFMLGGEE 156
            Q RD  VERR              G Q+K ILK++TGDETS +GG  G PRVKFML GEE
Sbjct: 643  QLRDSTVERR-YAAPPAFNSVKQSGAQVKPILKRATGDETSSSGGITGPPRVKFMLNGEE 701

Query: 155  NSRGGEELMIGNKNFNNNASFADGG 81
            NS G  +LMIGNKN + N  FADGG
Sbjct: 702  NSEGAGQLMIGNKN-STNTIFADGG 725


>gb|KZM90482.1| hypothetical protein DCAR_022153 [Daucus carota subsp. sativus]
          Length = 1157

 Score =  270 bits (689), Expect = 4e-79
 Identities = 161/297 (54%), Positives = 187/297 (62%), Gaps = 33/297 (11%)
 Frame = -3

Query: 869  DDPTKGGRKRGPSDRQXXXXXXXXXXINDMKVLTADKKAAQKTFALPRGD---------- 720
            +DP+KGG KRGPSDRQ          IND K L    KA+QKT  + +G           
Sbjct: 766  EDPSKGGLKRGPSDRQEEIAAKKKKKINDAKELKT-MKASQKTVVMQQGKEVSGQKMKSI 824

Query: 719  ----IKETSGPTIKSVKQE----------------PLKKPEPPSRAPEPTMLVMKFPPQG 600
                +KE  GPT+KS K +                PLKK EP ++AP P MLVMKFPPQG
Sbjct: 825  KPAPLKEARGPTMKSTKPKALSEASVPTMRSLKPAPLKKTEPSAKAPGPLMLVMKFPPQG 884

Query: 599  TLPSIMELKARFARFGQMDHSATRIFWKSLTCRVVYRHKADAQSAYRFAVGSSNLFGNTG 420
            TLPS+MELKARFARFGQ+DHSATRIFWKS TCR+VYR + DA++A RFA  + NLFGN  
Sbjct: 885  TLPSMMELKARFARFGQLDHSATRIFWKSSTCRLVYRRRVDAEAACRFA-STHNLFGNAD 943

Query: 419  VRCFLR--ELGAPETEPGKTQREDSLMGTSQARDFAVERRXXXXXXXXXXXXXPGVQLKS 246
            VR F R  E+ A   E GK  ++DS +G SQ  D AVE+R                Q KS
Sbjct: 944  VRYFTREVEVAASVAEQGKVHKDDSSVGNSQLTDSAVEQRPARSLPQRTLQQPG--QPKS 1001

Query: 245  ILKKSTGDETSG-NGGGRGTPRVKFMLGGEENSRGGEELMIGNKNFNNNASFADGGA 78
            ILKKS GDETSG NGGG+GT RV+F+LG EE  RGGE+ MIGNKN NNNASF DGGA
Sbjct: 1002 ILKKSNGDETSGTNGGGKGT-RVRFILGEEETDRGGEQSMIGNKNINNNASFVDGGA 1057


>ref|XP_017258368.1| PREDICTED: uncharacterized protein LOC108227632 [Daucus carota subsp.
            sativus]
          Length = 1161

 Score =  270 bits (689), Expect = 4e-79
 Identities = 161/297 (54%), Positives = 187/297 (62%), Gaps = 33/297 (11%)
 Frame = -3

Query: 869  DDPTKGGRKRGPSDRQXXXXXXXXXXINDMKVLTADKKAAQKTFALPRGD---------- 720
            +DP+KGG KRGPSDRQ          IND K L    KA+QKT  + +G           
Sbjct: 770  EDPSKGGLKRGPSDRQEEIAAKKKKKINDAKELKT-MKASQKTVVMQQGKEVSGQKMKSI 828

Query: 719  ----IKETSGPTIKSVKQE----------------PLKKPEPPSRAPEPTMLVMKFPPQG 600
                +KE  GPT+KS K +                PLKK EP ++AP P MLVMKFPPQG
Sbjct: 829  KPAPLKEARGPTMKSTKPKALSEASVPTMRSLKPAPLKKTEPSAKAPGPLMLVMKFPPQG 888

Query: 599  TLPSIMELKARFARFGQMDHSATRIFWKSLTCRVVYRHKADAQSAYRFAVGSSNLFGNTG 420
            TLPS+MELKARFARFGQ+DHSATRIFWKS TCR+VYR + DA++A RFA  + NLFGN  
Sbjct: 889  TLPSMMELKARFARFGQLDHSATRIFWKSSTCRLVYRRRVDAEAACRFA-STHNLFGNAD 947

Query: 419  VRCFLR--ELGAPETEPGKTQREDSLMGTSQARDFAVERRXXXXXXXXXXXXXPGVQLKS 246
            VR F R  E+ A   E GK  ++DS +G SQ  D AVE+R                Q KS
Sbjct: 948  VRYFTREVEVAASVAEQGKVHKDDSSVGNSQLTDSAVEQRPARSLPQRTLQQPG--QPKS 1005

Query: 245  ILKKSTGDETSG-NGGGRGTPRVKFMLGGEENSRGGEELMIGNKNFNNNASFADGGA 78
            ILKKS GDETSG NGGG+GT RV+F+LG EE  RGGE+ MIGNKN NNNASF DGGA
Sbjct: 1006 ILKKSNGDETSGTNGGGKGT-RVRFILGEEETDRGGEQSMIGNKNINNNASFVDGGA 1061


>ref|XP_017229084.1| PREDICTED: uncharacterized protein LOC108204247 isoform X2 [Daucus
            carota subsp. sativus]
          Length = 1082

 Score =  268 bits (685), Expect = 7e-79
 Identities = 142/265 (53%), Positives = 181/265 (68%), Gaps = 2/265 (0%)
 Frame = -3

Query: 869  DDPTKGGRKRGPSDRQXXXXXXXXXXINDMKVLTADKKAAQKTFALPRGDIKETSGPTIK 690
            DDP KGGRK  PSDRQ          ++   ++  D++ A+K+  + RG++KETS   +K
Sbjct: 718  DDPAKGGRKHVPSDRQEEIAAKRRKILDAKPLI--DQRTAEKSSLMQRGEVKETSALKMK 775

Query: 689  SVKQEPLKKPEPPSRAPEPTMLVMKFPPQGTLPSIMELKARFARFGQMDHSATRIFWKSL 510
            SVK E +KK +PPSR  +P MLV+KFP +GTLPSI  LKARF RFGQ+DHSA R+FWKS 
Sbjct: 776  SVKPELVKKVQPPSRESDPAMLVLKFPLRGTLPSINALKARFIRFGQLDHSACRVFWKSS 835

Query: 509  TCRVVYRHKADAQSAYRFAVGSSNLFGNTGVRCFLREL--GAPETEPGKTQREDSLMGTS 336
            TC VV+RHK DAQ+AY +A  S+N+FG TG++C+L+E+   A E + GK  +++ LM TS
Sbjct: 836  TCLVVFRHKVDAQAAYNYAASSTNMFGATGIKCYLQEMEVAASEQKSGKVLQDEVLMSTS 895

Query: 335  QARDFAVERRXXXXXXXXXXXXXPGVQLKSILKKSTGDETSGNGGGRGTPRVKFMLGGEE 156
            Q RD  VERR              G Q+K ILK++TGDETS +GG  G PRVKFML GEE
Sbjct: 896  QLRDSTVERR-YAAPPAFNSVKQSGAQVKPILKRATGDETSSSGGITGPPRVKFMLNGEE 954

Query: 155  NSRGGEELMIGNKNFNNNASFADGG 81
            NS G  +LMIGNKN + N  FADGG
Sbjct: 955  NSEGAGQLMIGNKN-STNTIFADGG 978


>ref|XP_017229083.1| PREDICTED: uncharacterized protein LOC108204247 isoform X1 [Daucus
            carota subsp. sativus]
          Length = 1092

 Score =  268 bits (685), Expect = 8e-79
 Identities = 142/265 (53%), Positives = 181/265 (68%), Gaps = 2/265 (0%)
 Frame = -3

Query: 869  DDPTKGGRKRGPSDRQXXXXXXXXXXINDMKVLTADKKAAQKTFALPRGDIKETSGPTIK 690
            DDP KGGRK  PSDRQ          ++   ++  D++ A+K+  + RG++KETS   +K
Sbjct: 728  DDPAKGGRKHVPSDRQEEIAAKRRKILDAKPLI--DQRTAEKSSLMQRGEVKETSALKMK 785

Query: 689  SVKQEPLKKPEPPSRAPEPTMLVMKFPPQGTLPSIMELKARFARFGQMDHSATRIFWKSL 510
            SVK E +KK +PPSR  +P MLV+KFP +GTLPSI  LKARF RFGQ+DHSA R+FWKS 
Sbjct: 786  SVKPELVKKVQPPSRESDPAMLVLKFPLRGTLPSINALKARFIRFGQLDHSACRVFWKSS 845

Query: 509  TCRVVYRHKADAQSAYRFAVGSSNLFGNTGVRCFLREL--GAPETEPGKTQREDSLMGTS 336
            TC VV+RHK DAQ+AY +A  S+N+FG TG++C+L+E+   A E + GK  +++ LM TS
Sbjct: 846  TCLVVFRHKVDAQAAYNYAASSTNMFGATGIKCYLQEMEVAASEQKSGKVLQDEVLMSTS 905

Query: 335  QARDFAVERRXXXXXXXXXXXXXPGVQLKSILKKSTGDETSGNGGGRGTPRVKFMLGGEE 156
            Q RD  VERR              G Q+K ILK++TGDETS +GG  G PRVKFML GEE
Sbjct: 906  QLRDSTVERR-YAAPPAFNSVKQSGAQVKPILKRATGDETSSSGGITGPPRVKFMLNGEE 964

Query: 155  NSRGGEELMIGNKNFNNNASFADGG 81
            NS G  +LMIGNKN + N  FADGG
Sbjct: 965  NSEGAGQLMIGNKN-STNTIFADGG 988


>ref|XP_021622741.1| uncharacterized protein LOC110622519 [Manihot esculenta]
 gb|OAY42303.1| hypothetical protein MANES_09G169200 [Manihot esculenta]
          Length = 1168

 Score =  264 bits (675), Expect = 4e-77
 Identities = 152/273 (55%), Positives = 185/273 (67%), Gaps = 9/273 (3%)
 Frame = -3

Query: 869  DDPTKGGRKRGPSDRQXXXXXXXXXXINDMKVLTADKKAAQKTFALPRGDIKE--TSGPT 696
            DDPT+GGRKR PSDRQ          I+ +K LTA+KKA Q+T    R + KE  T+ P 
Sbjct: 804  DDPTRGGRKRLPSDRQEEIAARRLKKISQLKSLTAEKKAVQRTLETHRSEGKELATAAPP 863

Query: 695  IKSVKQEPLKKPEPPSRAPEPTMLVMKFPPQGTLPSIMELKARFARFGQMDHSATRIFWK 516
             K  K E  KK EP  RA EPTMLVMKFPP  +LPS+ ELKARFARFG +D SA R+FW+
Sbjct: 864  -KPAKSESSKKIEPQHRAVEPTMLVMKFPPGTSLPSVAELKARFARFGSIDQSAIRVFWQ 922

Query: 515  SLTCRVVYRHKADAQSAYRFAVGSSNLFGN-TGVRCFLRELGAPETEPGKTQR---EDSL 348
            S TCRVV+RHK DAQ+AY++AVG+++LFGN   VR  +RE+GAP  E  ++ +   +D+ 
Sbjct: 923  SSTCRVVFRHKLDAQAAYKYAVGNNSLFGNDVSVRYSVREVGAPAPEAPESDKGRGDDTS 982

Query: 347  MGTSQARDFAVERRXXXXXXXXXXXXXPGVQLKSILKKSTGDE---TSGNGGGRGTPRVK 177
            +   + +D A ER                +QLKSILKK TGDE    +G  GGRGT RVK
Sbjct: 983  LEAPRVKDAANER-----LLMQQLLPQSSIQLKSILKKPTGDEAGQVTGGNGGRGTARVK 1037

Query: 176  FMLGGEENSRGGEELMIGNKNFNNNASFADGGA 78
            FMLGGEE SR GE+LMIGN+NFNNNASFADGGA
Sbjct: 1038 FMLGGEETSR-GEQLMIGNRNFNNNASFADGGA 1069


>ref|XP_023887786.1| uncharacterized protein LOC111999912 [Quercus suber]
 gb|POE66996.1| pwwp domain-containing protein [Quercus suber]
          Length = 1168

 Score =  257 bits (657), Expect = 1e-74
 Identities = 149/277 (53%), Positives = 186/277 (67%), Gaps = 13/277 (4%)
 Frame = -3

Query: 869  DDPTKGGRKRGPSDRQXXXXXXXXXXINDMKVLTADKKAAQKTFALPRGDIKETSGPTI- 693
            DDPT+ GRKRGPSDRQ          IN +K L A+KKA QK     RG+ +E+  P   
Sbjct: 786  DDPTRAGRKRGPSDRQEEIAAKRVKKINALKSLAAEKKAGQKMPESQRGEGRESVAPAPP 845

Query: 692  KSVKQEPLKKPEPPSRAPEPTMLVMKFPPQGTLPSIMELKARFARFGQMDHSATRIFWKS 513
            KS + +P+KK EP ++  +PTMLVMKFPP  +LPS+ ELKARFARFG +D S  R+FWKS
Sbjct: 846  KSSRPDPVKKVEPSAKTVDPTMLVMKFPPFTSLPSVAELKARFARFGPIDQSGLRVFWKS 905

Query: 512  LTCRVVYRHKADAQSAYRFAVGSSNLFGNTGVRCFLREL--GAPE-TEPGKTQREDSLMG 342
             TCRVV+ HK DA++AY++AV +++LFGN  VRC +REL  GAPE TE GK + +D+   
Sbjct: 906  STCRVVFLHKLDAEAAYKYAVANNSLFGNVNVRCHIRELGGGAPEGTESGKVRGDDNSNE 965

Query: 341  TSQARDFAVERRXXXXXXXXXXXXXPGVQLKSILKKSTGDE----TSGNGGG----RGTP 186
            T + +D A   +             P VQLKS LKK +GDE    T G GGG    +GTP
Sbjct: 966  TPRVKDSAAAVQRPASALVNQSPLKPAVQLKSCLKKVSGDESGQVTGGVGGGGGSSKGTP 1025

Query: 185  RVKFMLGGEENSRGGEELMIGNK-NFNNNASFADGGA 78
            RVKFMLGGEE+SR  E+LM+GN+ NFNNNAS ADGGA
Sbjct: 1026 RVKFMLGGEESSR-TEQLMVGNRNNFNNNASNADGGA 1061


>ref|XP_021658683.1| uncharacterized protein LOC110648673 [Hevea brasiliensis]
          Length = 1165

 Score =  257 bits (656), Expect = 2e-74
 Identities = 151/273 (55%), Positives = 182/273 (66%), Gaps = 9/273 (3%)
 Frame = -3

Query: 869  DDPTKGGRKRGPSDRQXXXXXXXXXXINDMKVLTADKKAAQKTFALPRGDIKE--TSGPT 696
            DDPT+ GRKR PSDRQ          I+ +K L A+KKA Q+T    R + KE  T+ P 
Sbjct: 803  DDPTRVGRKRLPSDRQEEIVARRLKKISQLKTLAAEKKAGQRTLETQRSEGKELATAAPP 862

Query: 695  IKSVKQEPLKKPEPPSRAPEPTMLVMKFPPQGTLPSIMELKARFARFGQMDHSATRIFWK 516
             K  K E  KK EP  RA EPTMLVMKFPP  +LPS+ ELKARFARFG +D SA R+FW+
Sbjct: 863  -KPAKSESSKKIEPHHRAVEPTMLVMKFPPGTSLPSVAELKARFARFGSIDQSAIRVFWQ 921

Query: 515  SLTCRVVYRHKADAQSAYRFAVGSSNLFGN-TGVRCFLRELGAP---ETEPGKTQREDSL 348
            S TCRVV+RHK DAQ+AY++AVG+++LFGN   VR  +RE+GAP     E  K + +D+ 
Sbjct: 922  SSTCRVVFRHKLDAQAAYKYAVGNNSLFGNDVNVRYTVREVGAPAPEAPESDKGREDDTS 981

Query: 347  MGTSQARDFAVERRXXXXXXXXXXXXXPGVQLKSILKKSTGDE---TSGNGGGRGTPRVK 177
            +   + +D A ER                +QLKSILKK TGDE    +G  GGRGT RVK
Sbjct: 982  VEAPRLKDPANER-----LLMHQPLPQSTMQLKSILKKPTGDEAGQVTGGNGGRGTARVK 1036

Query: 176  FMLGGEENSRGGEELMIGNKNFNNNASFADGGA 78
            FMLGGEE SR GE+LMIGN+NFNNNASFADGGA
Sbjct: 1037 FMLGGEETSR-GEQLMIGNRNFNNNASFADGGA 1068


>ref|XP_007020229.2| PREDICTED: uncharacterized protein LOC18593109 [Theobroma cacao]
 ref|XP_017981589.1| PREDICTED: uncharacterized protein LOC18593109 [Theobroma cacao]
          Length = 1133

 Score =  256 bits (654), Expect = 2e-74
 Identities = 149/274 (54%), Positives = 181/274 (66%), Gaps = 10/274 (3%)
 Frame = -3

Query: 869  DDPTKGGRKRGPSDRQXXXXXXXXXXINDMKVLTADKKAAQKTFALPRGDIKE--TSGPT 696
            DDPTK GRKR PSDRQ          I+ +K L A+KKA  +T   P+ + KE  T+GP 
Sbjct: 767  DDPTKAGRKRLPSDRQEEIAAKRLKKISQLKSLAAEKKANLRTMEAPKVEGKEQPTAGPP 826

Query: 695  IKSVKQ-EPLKKPEPPSRAPEPTMLVMKFPPQGTLPSIMELKARFARFGQMDHSATRIFW 519
             + +K+ +  +K EPP RA EPTMLVMKFPPQ +LPS+ ELKARF RFG +D SA R+FW
Sbjct: 827  ARPLKKPDSARKTEPPPRAVEPTMLVMKFPPQVSLPSVAELKARFGRFGSLDQSAIRVFW 886

Query: 518  KSLTCRVVYRHKADAQSAYRFAVGSSNLFGNTGVRCFLRELGAPETEP---GKTQREDSL 348
            KS TCRVV+RHK DAQ+AYR+A G+++LFGN  VR  +R + AP  E     K + +D+ 
Sbjct: 887  KSSTCRVVFRHKLDAQAAYRYANGNNSLFGNVNVRYHVRSVEAPAVEVPDFDKARGDDTA 946

Query: 347  MGTSQARDFAVERRXXXXXXXXXXXXXPGVQLKSILKKSTGDET---SGNGGGRGTPRVK 177
              T + +D AVER                V LKS LKK T DE    SG  GGRGT RVK
Sbjct: 947  SETMRVKDPAVERSAPILPHQPLPQST--VLLKSCLKKPTADEAGQGSGGNGGRGTARVK 1004

Query: 176  FMLGGEENSRGGEELMIGNK-NFNNNASFADGGA 78
            FMLGGEE SR GE+LM+GN+ NFNNNASFADGGA
Sbjct: 1005 FMLGGEETSR-GEQLMVGNRNNFNNNASFADGGA 1037


>gb|EOY17454.1| Tudor/PWWP/MBT superfamily protein, putative [Theobroma cacao]
          Length = 1133

 Score =  256 bits (654), Expect = 2e-74
 Identities = 149/274 (54%), Positives = 181/274 (66%), Gaps = 10/274 (3%)
 Frame = -3

Query: 869  DDPTKGGRKRGPSDRQXXXXXXXXXXINDMKVLTADKKAAQKTFALPRGDIKE--TSGPT 696
            DDPTK GRKR PSDRQ          I+ +K L A+KKA  +T   P+ + KE  T+GP 
Sbjct: 767  DDPTKAGRKRLPSDRQEEIAAKRLKKISQLKSLAAEKKANLRTMEAPKVEGKEQPTAGPP 826

Query: 695  IKSVKQ-EPLKKPEPPSRAPEPTMLVMKFPPQGTLPSIMELKARFARFGQMDHSATRIFW 519
             + +K+ +  +K EPP RA EPTMLVMKFPPQ +LPS+ ELKARF RFG +D SA R+FW
Sbjct: 827  ARPLKKPDSARKTEPPPRAVEPTMLVMKFPPQVSLPSVAELKARFGRFGSLDQSAIRVFW 886

Query: 518  KSLTCRVVYRHKADAQSAYRFAVGSSNLFGNTGVRCFLRELGAPETEP---GKTQREDSL 348
            KS TCRVV+RHK DAQ+AYR+A G+++LFGN  VR  +R + AP  E     K + +D+ 
Sbjct: 887  KSSTCRVVFRHKLDAQAAYRYANGNNSLFGNVNVRYHVRSVEAPAVEVPDFDKARGDDTA 946

Query: 347  MGTSQARDFAVERRXXXXXXXXXXXXXPGVQLKSILKKSTGDET---SGNGGGRGTPRVK 177
              T + +D AVER                V LKS LKK T DE    SG  GGRGT RVK
Sbjct: 947  SETMRVKDPAVERSAPILPHQPLPQST--VLLKSCLKKPTADEAGQGSGGNGGRGTARVK 1004

Query: 176  FMLGGEENSRGGEELMIGNK-NFNNNASFADGGA 78
            FMLGGEE SR GE+LM+GN+ NFNNNASFADGGA
Sbjct: 1005 FMLGGEETSR-GEQLMVGNRNNFNNNASFADGGA 1037


>ref|XP_021300090.1| LOW QUALITY PROTEIN: uncharacterized protein LOC110428557 [Herrania
            umbratica]
          Length = 1132

 Score =  255 bits (652), Expect = 4e-74
 Identities = 149/274 (54%), Positives = 182/274 (66%), Gaps = 10/274 (3%)
 Frame = -3

Query: 869  DDPTKGGRKRGPSDRQXXXXXXXXXXINDMKVLTADKKAAQKTFALPRGDIKE--TSGPT 696
            DDPTK GRKR PSDRQ          I+ +K L A++KA  +T   P+ + KE  T+GP 
Sbjct: 765  DDPTKAGRKRLPSDRQEEIAAKRLKKISQLKSLAAERKANSRTMEAPKVEGKEQPTAGPP 824

Query: 695  IKSVKQ-EPLKKPEPPSRAPEPTMLVMKFPPQGTLPSIMELKARFARFGQMDHSATRIFW 519
             + +K+ +  +K EPP RA EPTMLVMKFPPQ +LPS+ ELKARF RFG +D SA R+FW
Sbjct: 825  ARPLKKTDSARKMEPPPRAVEPTMLVMKFPPQVSLPSVAELKARFGRFGSLDQSAIRVFW 884

Query: 518  KSLTCRVVYRHKADAQSAYRFAVGSSNLFGNTGVRCFLRELGAPETEP---GKTQREDSL 348
            KS TCRVV+RHK DAQ+AYR+A G+++LFGN  VR  +R + AP  E     K + +D+ 
Sbjct: 885  KSSTCRVVFRHKLDAQAAYRYANGNNSLFGNVNVRYHVRSVEAPAVEVPDFDKXRGDDTA 944

Query: 347  MGTSQARDFAVERRXXXXXXXXXXXXXPGVQLKSILKKSTGDET---SGNGGGRGTPRVK 177
              T + +D AVER                VQLKS LKK T DE    SG  GGRGT RVK
Sbjct: 945  SETMRVKDPAVER--SAPVLXHQPLAQSAVQLKSCLKKPTADEAGQGSGGNGGRGTARVK 1002

Query: 176  FMLGGEENSRGGEELMIGNK-NFNNNASFADGGA 78
            FMLGGEE SR GE+LM+G + NFNNNASFADGGA
Sbjct: 1003 FMLGGEETSR-GEQLMVGXRNNFNNNASFADGGA 1035


>ref|XP_012089027.1| uncharacterized protein LOC105647517 isoform X1 [Jatropha curcas]
 ref|XP_020540285.1| uncharacterized protein LOC105647517 isoform X2 [Jatropha curcas]
 gb|KDP23492.1| hypothetical protein JCGZ_23325 [Jatropha curcas]
          Length = 1189

 Score =  249 bits (635), Expect = 1e-71
 Identities = 144/272 (52%), Positives = 179/272 (65%), Gaps = 8/272 (2%)
 Frame = -3

Query: 869  DDPTKGGRKRGPSDRQXXXXXXXXXXINDMKVLTADKKAAQKTFALPRGDIKE--TSGPT 696
            DDP +GGRKR PSDRQ          I+ +K L A+KKA  +T    R + KE  T+ P 
Sbjct: 828  DDPMRGGRKRLPSDRQEEIAARKLKKISMLKSLAAEKKAGMRTSETHRTEGKEPATTAPA 887

Query: 695  IKSVKQEPLKKPEPPSRAPEPTMLVMKFPPQGTLPSIMELKARFARFGQMDHSATRIFWK 516
             K VK +  +K E   RA EPTMLVMKFPPQ  LPS  +LKA+FARFG +D SA R+FW+
Sbjct: 888  -KPVKSDSARKMESQPRAVEPTMLVMKFPPQTNLPSAAQLKAKFARFGSIDQSAIRVFWQ 946

Query: 515  SLTCRVVYRHKADAQSAYRFAVGSSNLFGNTGVRCFLRELGAPETEPGKTQR---EDSLM 345
            + TCRVV+RHK DAQ+AY++AV ++ LFGN  VR  +RE+GAP +E  +  +   +D+ +
Sbjct: 947  TSTCRVVFRHKLDAQAAYKYAV-NNTLFGNLNVRYSVREVGAPASEAAEADKGRGDDTTL 1005

Query: 344  GTSQARDFAVERRXXXXXXXXXXXXXPGVQLKSILKKSTGDETS---GNGGGRGTPRVKF 174
               + +D A+ER                VQLKSILKK TGDE     G  GGRGT RVKF
Sbjct: 1006 EAPRVKDPAIER---PPLLHQAVHPQSTVQLKSILKKPTGDEAGQVMGGNGGRGTARVKF 1062

Query: 173  MLGGEENSRGGEELMIGNKNFNNNASFADGGA 78
            MLGGEE SR GE+LM+GN+NFNNNASFADGGA
Sbjct: 1063 MLGGEETSR-GEQLMVGNRNFNNNASFADGGA 1093


>gb|KHN20237.1| hypothetical protein glysoja_023800 [Glycine soja]
          Length = 810

 Score =  244 bits (624), Expect = 2e-71
 Identities = 141/273 (51%), Positives = 183/273 (67%), Gaps = 9/273 (3%)
 Frame = -3

Query: 869  DDPTKGGRKRGPSDRQXXXXXXXXXXINDMKVLTADKKA-AQKTFALPRGDIKETSGPTI 693
            DDPTK GRKR  SDRQ          I ++K L A+KKA +QKT    +GD KE+     
Sbjct: 459  DDPTKAGRKRALSDRQEEISEKRLKKIKNIKALAAEKKAGSQKTSEARQGDGKESMAQAP 518

Query: 692  -KSVKQEPLKKPEPPSRAPEPTMLVMKFPPQGTLPSIMELKARFARFGQMDHSATRIFWK 516
             K VK E  +K E P++A EPT+LV+KFPP+ +LPS+ ELKARFARFG +D S  R+FWK
Sbjct: 519  PKVVKPELTRKVERPAKAVEPTILVIKFPPETSLPSVAELKARFARFGPIDQSGLRVFWK 578

Query: 515  SLTCRVVYRHKADAQSAYRFAVGSSNLFGNTGVRCFLRELG---APETEPGKTQREDSLM 345
            + TCRVV+ HK DAQSAY++A+ + +LFGN G++CFLRE G   +  +E  K + ++   
Sbjct: 579  TSTCRVVFLHKVDAQSAYKYALANQSLFGNVGMKCFLREFGDASSEVSEAAKARGDNGAN 638

Query: 344  GTSQARDFAVERRXXXXXXXXXXXXXPGVQLKSILKKSTGDET---SGNGG-GRGTPRVK 177
             + + +D AV +R               +QLKSILKKSTGDE    +GNGG  +GTPRVK
Sbjct: 639  ESPRVKDPAVVQRQSSVSAQQPLPQPM-IQLKSILKKSTGDELGQGTGNGGSSKGTPRVK 697

Query: 176  FMLGGEENSRGGEELMIGNKNFNNNASFADGGA 78
            FMLGGEE+SR GE+LM+GN+N  N+ SFADGGA
Sbjct: 698  FMLGGEESSR-GEQLMVGNRNSFNSVSFADGGA 729


>ref|XP_010103359.1| uncharacterized protein LOC21386111 [Morus notabilis]
 gb|EXB95528.1| hypothetical protein L484_002543 [Morus notabilis]
          Length = 1196

 Score =  247 bits (631), Expect = 5e-71
 Identities = 141/278 (50%), Positives = 180/278 (64%), Gaps = 14/278 (5%)
 Frame = -3

Query: 869  DDPTKGGRKRGPSDRQXXXXXXXXXXINDMKVLTADKKAAQKTFALPRGDIKETSGPTIK 690
            DDPT  GRKR PSDRQ          ++D++ L A+KKAAQKT   PRG+ +E + P+ +
Sbjct: 822  DDPTIAGRKRAPSDRQEEIAAKKSKKMSDIRSLAAEKKAAQKTSEEPRGEAREAAVPSGR 881

Query: 689  SVKQEPLKKPEPPSRAPEPTMLVMKFPPQGTLPSIMELKARFARFGQMDHSATRIFWKSL 510
             +K   +KK E  +RA EPTMLVMKFPP+ +LPS  ELKARFARFG MD S  R+FWKS 
Sbjct: 882  KIKHVSIKKAEHTARAVEPTMLVMKFPPKTSLPSPAELKARFARFGPMDQSGLRVFWKSS 941

Query: 509  TCRVVYRHKADAQSAYRFAVGSSNLFGNTGVRCFLRELGAPET---EPGKTQREDSLMGT 339
            TCRVV+ HK+DAQ+A RFA  +++LFG  G+RC+ RE+ AP T   E GK Q +D  + T
Sbjct: 942  TCRVVFLHKSDAQAACRFAAANNSLFGTPGMRCYTREVEAPATEAPESGKGQGDDISLDT 1001

Query: 338  SQARDFAVERRXXXXXXXXXXXXXPGVQLKSILKKSTGDET--------SGNGGGRGTPR 183
            ++ +D AV +R               VQLKS LKK+  DE+         G+G  RGTPR
Sbjct: 1002 TRTKDTAVLQR-PSSITTKQPLPQAAVQLKSCLKKAATDESGQQGTGVGGGSGNSRGTPR 1060

Query: 182  VKFMLGGEE-NSRGGEELMIGNKN--FNNNASFADGGA 78
            VKFML GE+ +SR  + LM GN+N   NN+ASF DGGA
Sbjct: 1061 VKFMLDGEDSSSRVEQSLMAGNRNNSSNNSASFPDGGA 1098


>ref|XP_010104924.1| uncharacterized protein LOC21390089 [Morus notabilis]
 gb|EXC02372.1| hypothetical protein L484_006666 [Morus notabilis]
          Length = 1198

 Score =  246 bits (629), Expect = 9e-71
 Identities = 141/278 (50%), Positives = 179/278 (64%), Gaps = 14/278 (5%)
 Frame = -3

Query: 869  DDPTKGGRKRGPSDRQXXXXXXXXXXINDMKVLTADKKAAQKTFALPRGDIKETSGPTIK 690
            DDPT  GRKR PSDRQ          ++D++ L A+KKAAQKT   PRG+ +E + P+ +
Sbjct: 824  DDPTIAGRKRAPSDRQEEIAAKKSKKMSDIRSLAAEKKAAQKTSEEPRGEAREAAVPSGR 883

Query: 689  SVKQEPLKKPEPPSRAPEPTMLVMKFPPQGTLPSIMELKARFARFGQMDHSATRIFWKSL 510
             +K   +KK E  +RA EPTMLVMKFPP+ +LPS  ELKARFARFG MD S  R+FWKS 
Sbjct: 884  KIKHVSIKKAEHTARAVEPTMLVMKFPPKTSLPSPAELKARFARFGPMDQSGLRVFWKSS 943

Query: 509  TCRVVYRHKADAQSAYRFAVGSSNLFGNTGVRCFLRELGAPET---EPGKTQREDSLMGT 339
            TCRVV+ HK+DAQ+A RFA  +++LFG  G+RC+ RE+ AP T   E GK Q +D  + T
Sbjct: 944  TCRVVFLHKSDAQAACRFAAANNSLFGTPGMRCYTREVEAPATEAPESGKGQGDDISLDT 1003

Query: 338  SQARDFAVERRXXXXXXXXXXXXXPGVQLKSILKKSTGDET--------SGNGGGRGTPR 183
             + +D AV +R               VQLKS LKK+  DE+         G+G  RGTPR
Sbjct: 1004 PRTKDTAVLQR-PSSITTKQPLPQAAVQLKSCLKKAATDESGQQGTGVGGGSGNSRGTPR 1062

Query: 182  VKFMLGGEE-NSRGGEELMIGNKN--FNNNASFADGGA 78
            VKFML GE+ +SR  + LM GN+N   NN+ASF DGGA
Sbjct: 1063 VKFMLDGEDSSSRVEQSLMAGNRNNSSNNSASFPDGGA 1100


>ref|XP_003555609.1| PREDICTED: uncharacterized protein LOC100792700 [Glycine max]
 gb|KRG92886.1| hypothetical protein GLYMA_20G235700 [Glycine max]
          Length = 1056

 Score =  244 bits (624), Expect = 2e-70
 Identities = 141/273 (51%), Positives = 183/273 (67%), Gaps = 9/273 (3%)
 Frame = -3

Query: 869  DDPTKGGRKRGPSDRQXXXXXXXXXXINDMKVLTADKKA-AQKTFALPRGDIKETSGPTI 693
            DDPTK GRKR  SDRQ          I ++K L A+KKA +QKT    +GD KE+     
Sbjct: 705  DDPTKAGRKRALSDRQEEISEKRLKKIKNIKALAAEKKAGSQKTSEARQGDGKESMAQAP 764

Query: 692  -KSVKQEPLKKPEPPSRAPEPTMLVMKFPPQGTLPSIMELKARFARFGQMDHSATRIFWK 516
             K VK E  +K E P++A EPT+LV+KFPP+ +LPS+ ELKARFARFG +D S  R+FWK
Sbjct: 765  PKVVKPELTRKVERPAKAVEPTILVIKFPPETSLPSVAELKARFARFGPIDQSGLRVFWK 824

Query: 515  SLTCRVVYRHKADAQSAYRFAVGSSNLFGNTGVRCFLRELG---APETEPGKTQREDSLM 345
            + TCRVV+ HK DAQSAY++A+ + +LFGN G++CFLRE G   +  +E  K + ++   
Sbjct: 825  TSTCRVVFLHKVDAQSAYKYALANQSLFGNVGMKCFLREFGDASSEVSEAAKARGDNGAN 884

Query: 344  GTSQARDFAVERRXXXXXXXXXXXXXPGVQLKSILKKSTGDET---SGNGG-GRGTPRVK 177
             + + +D AV +R               +QLKSILKKSTGDE    +GNGG  +GTPRVK
Sbjct: 885  ESPRVKDPAVVQRQSSVSAQQPLPQPM-IQLKSILKKSTGDELGQGTGNGGSSKGTPRVK 943

Query: 176  FMLGGEENSRGGEELMIGNKNFNNNASFADGGA 78
            FMLGGEE+SR GE+LM+GN+N  N+ SFADGGA
Sbjct: 944  FMLGGEESSR-GEQLMVGNRNSFNSVSFADGGA 975


>emb|CBI39497.3| unnamed protein product, partial [Vitis vinifera]
          Length = 978

 Score =  243 bits (619), Expect = 6e-70
 Identities = 137/262 (52%), Positives = 169/262 (64%), Gaps = 4/262 (1%)
 Frame = -3

Query: 869  DDPTKGGRKRGPSDRQXXXXXXXXXXINDMKVLTADKKAAQKTFALPRGDIKETS-GPTI 693
            +DP K GRKR PSDRQ          IND+K L A+KKA QKT   PRGD KET      
Sbjct: 669  NDPLKAGRKRAPSDRQEGNALKKLKKINDLKSLAAEKKANQKTLETPRGDGKETVVKQDP 728

Query: 692  KSVKQEPLKKPEPPSRAPEPTMLVMKFPPQGTLPSIMELKARFARFGQMDHSATRIFWKS 513
            K  K +P KK EP +R  EPTML+MKFPP+ +LPSI ELKARF RFG +DHS+TR+FWKS
Sbjct: 729  KPFKLDPAKKTEPSARVEEPTMLLMKFPPRTSLPSIAELKARFVRFGPLDHSSTRVFWKS 788

Query: 512  LTCRVVYRHKADAQSAYRFAVGSSNLFGNTGVRCFLRELG--APE-TEPGKTQREDSLMG 342
            LTCRVV+R+K DA++A+R+AV +++LFGN  V+  LREL   APE  + GK + ED+   
Sbjct: 789  LTCRVVFRYKHDAEAAHRYAVKNNSLFGNVSVKYTLRELEVVAPELPDSGKGRGEDTSSE 848

Query: 341  TSQARDFAVERRXXXXXXXXXXXXXPGVQLKSILKKSTGDETSGNGGGRGTPRVKFMLGG 162
            T Q RD A E+R                 LKS LKK + DE     GGRGT RVKF+LG 
Sbjct: 849  TPQPRDAAAEQRVAPTF------------LKSCLKKPSSDEGGTGSGGRGTSRVKFLLGT 896

Query: 161  EENSRGGEELMIGNKNFNNNAS 96
             E    GE+ M+ N+NFNN+A+
Sbjct: 897  GEEGHRGEQTMVANRNFNNHAT 918

