BLASTX nr result

ID: Aconitum23_contig00007908 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Aconitum23_contig00007908
         (1620 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010259666.1| PREDICTED: uncharacterized protein LOC104599...   332   4e-88
gb|KDO56249.1| hypothetical protein CISIN_1g001012mg [Citrus sin...   278   1e-71
gb|KDO56248.1| hypothetical protein CISIN_1g001012mg [Citrus sin...   278   1e-71
gb|KDO56247.1| hypothetical protein CISIN_1g001012mg [Citrus sin...   278   1e-71
gb|KDO56246.1| hypothetical protein CISIN_1g001012mg [Citrus sin...   278   1e-71
ref|XP_006472071.1| PREDICTED: uncharacterized protein LOC102607...   272   7e-70
ref|XP_006433394.1| hypothetical protein CICLE_v10000070mg [Citr...   272   7e-70
ref|XP_011036849.1| PREDICTED: uncharacterized protein LOC105134...   269   5e-69
ref|XP_012089027.1| PREDICTED: uncharacterized protein LOC105647...   266   4e-68
ref|XP_011023667.1| PREDICTED: uncharacterized protein LOC105125...   261   1e-66
ref|XP_010025199.1| PREDICTED: uncharacterized protein LOC104415...   258   1e-65
ref|XP_011023665.1| PREDICTED: uncharacterized protein LOC105125...   257   2e-65
ref|XP_007020229.1| Tudor/PWWP/MBT superfamily protein, putative...   257   2e-65
gb|KHN20237.1| hypothetical protein glysoja_023800 [Glycine soja]     257   2e-65
ref|XP_003555609.1| PREDICTED: uncharacterized protein LOC100792...   257   2e-65
ref|XP_003553721.1| PREDICTED: uncharacterized protein LOC100805...   253   3e-64
ref|XP_012446851.1| PREDICTED: uncharacterized protein LOC105770...   251   1e-63
gb|KJB60047.1| hypothetical protein B456_009G287300 [Gossypium r...   251   1e-63
gb|KJB60046.1| hypothetical protein B456_009G287300 [Gossypium r...   251   1e-63
ref|XP_002319529.1| PWWP domain-containing family protein [Popul...   248   9e-63

>ref|XP_010259666.1| PREDICTED: uncharacterized protein LOC104599005 [Nelumbo nucifera]
          Length = 1278

 Score =  332 bits (852), Expect = 4e-88
 Identities = 191/394 (48%), Positives = 261/394 (66%), Gaps = 7/394 (1%)
 Frame = -1

Query: 1185 IETSPENVHKRLKTNTEEEIPRISAGRSIGIGSTPSGKLQLDLQQVNSGSGSTHPSDSAL 1006
            +ET  ++  KRLKT+ + E  R SAG+SIGIG  P    Q D Q+   G  S  P D+++
Sbjct: 736  LETGTDHPPKRLKTSKDAESLRKSAGKSIGIGLVP----QEDPQKKVDGVSSPFPLDASM 791

Query: 1005 SLQKVDLDNIEVELPQVVHDMLALALDPFYGVEGETHAVVRQVLLRFRSIVYQKSLVL-P 829
            +   +D+ +I+VELPQ+V D+LALALDPFYGVE    A+VR VLLRFRS+VYQKSL+L P
Sbjct: 792  APPVIDIGDIDVELPQLVGDLLALALDPFYGVERNGPAIVRHVLLRFRSLVYQKSLILVP 851

Query: 828  SSSEADTSELRVGKPTTSTSIAEVPSGEDARSL-APKHPRQIPRSDDPTRSGRKRNLSDR 652
             +  A+TS+ R  + ++  +   VP+ ED + L + + P+ + + DDPT++GRKR+LSDR
Sbjct: 852  PTESAETSDFRTNRSSSGGASGTVPN-EDVKDLPSARPPKHLSKVDDPTKAGRKRSLSDR 910

Query: 651  QEEISAKRLKKLHELKSLTERKASNQKIPDGSREQ----KDNNSSLPSKPFNLDSAKNPE 484
            QEEI+ KR+KKL+ELK +TE+KA +QK  +  R +    KD  +++ +K    D  K PE
Sbjct: 911  QEEIAVKRMKKLNELKLMTEKKAGSQKAQEMQRGERKDGKDAGTTILAKQMRPDYEKKPE 970

Query: 483  SPVKVAEPGMLVLKFPQSKSLPSAAELKARFSRFGPLDLPAMRVFYKTNTCRVVFRHKVH 304
             P ++AEP MLV+KFP   SLPS  ELKARF+RFGPLD  A RVF+K++TCRVVF+HK H
Sbjct: 971  PPARIAEPTMLVMKFPPRTSLPSVPELKARFARFGPLDHSATRVFWKSSTCRVVFKHKSH 1030

Query: 303  AKAAYGYAVRNNSLFGHVNVNYQLRDVGGQAPAEAEGAKVQQANGASDETLQLRXXXXXX 124
            A+ A+ YAVRN+SLFG+V VNY LR++    P   +  K  +A   SDE   ++      
Sbjct: 1031 AQVAHSYAVRNSSLFGNVKVNYHLRELEAPTPEMPDSGK-WRAEVTSDE---VQSRTVVA 1086

Query: 123  XXXXXGPRPMTPM-QRSRQLNSQLKSILKKPSGD 25
                  PRP   + Q+  Q + QLKS LKKPSGD
Sbjct: 1087 SDTVNEPRPRAALKQQPTQPSVQLKSCLKKPSGD 1120


>gb|KDO56249.1| hypothetical protein CISIN_1g001012mg [Citrus sinensis]
          Length = 1190

 Score =  278 bits (710), Expect = 1e-71
 Identities = 181/424 (42%), Positives = 245/424 (57%), Gaps = 3/424 (0%)
 Frame = -1

Query: 1287 DGLVKKTKALKRPAVDFNXXXXXXXXXXXXXXSN-IETSPENVHKRLKTNTEEEIPRISA 1111
            DG +KK K+LKRP  D +                 + T P + H++   +        S 
Sbjct: 630  DGKLKKPKSLKRPLGDLSSEKPMVGEQKKKKKKKELGTPPNSDHQKRSASN-------ST 682

Query: 1110 GRSIGIGSTPSGKLQLDLQQVNSGSGSTHPSDSALSLQKVDLDNIEVELPQVVHDMLALA 931
             +S   G  PS   QL+ Q+ + G+ ST    S   L  V   NIEV LPQ++ D+ ALA
Sbjct: 683  KKSAQAGLGPSEDQQLNNQKKDGGA-STSALGSVEILPGVTTVNIEVGLPQLLRDLHALA 741

Query: 930  LDPFYGVEGETHAVVRQVLLRFRSIVYQKSLVLPSSSEADTSELRVGKPTTSTSIAEVPS 751
            LDPF+G E    + +RQ  LRFRS+VY KSLVL   S+ ++ E R  K ++S       S
Sbjct: 742  LDPFHGAERNCPSTIRQCFLRFRSLVYMKSLVLSPLSDTESVEGRAAKSSSSIGT----S 797

Query: 750  GEDARSLAPKHP-RQIPRSDDPTRSGRKRNLSDRQEEISAKRLKKLHELKSLTERKASNQ 574
            GE+ R L    P +Q+ R +DPT++GRKR  SDRQEEI+AKRLKK++++KSLT  K S+Q
Sbjct: 798  GENVRDLPASKPIKQLARPEDPTKAGRKRLPSDRQEEIAAKRLKKINQMKSLTSEKKSSQ 857

Query: 573  KIPDGSREQKDNNSSLP-SKPFNLDSAKNPESPVKVAEPGMLVLKFPQSKSLPSAAELKA 397
            +  DG R +   ++++P ++P     AK  E P +  +P MLV+KFP   SLPSAAELKA
Sbjct: 858  RALDGQRVEGKEHAAVPLARPVKPGFAKKLEPPSRAVQPTMLVMKFPPETSLPSAAELKA 917

Query: 396  RFSRFGPLDLPAMRVFYKTNTCRVVFRHKVHAKAAYGYAVRNNSLFGHVNVNYQLRDVGG 217
            RF RFG LD  A+RVF+K+ TCRVVF+HK  A+AAY YA  NN+LFG+V V Y LR+V  
Sbjct: 918  RFGRFGSLDQSAIRVFWKSFTCRVVFKHKADAQAAYKYANGNNTLFGNVKVRYILREVEA 977

Query: 216  QAPAEAEGAKVQQANGASDETLQLRXXXXXXXXXXXGPRPMTPMQRSRQLNSQLKSILKK 37
             AP   +  KV + + +S ET +++             RP        Q N QLKS LKK
Sbjct: 978  PAPEVPDFDKV-RGDESSYETPRIK--------DPVADRPTPAPGLLPQPNIQLKSCLKK 1028

Query: 36   PSGD 25
            P+ D
Sbjct: 1029 PASD 1032


>gb|KDO56248.1| hypothetical protein CISIN_1g001012mg [Citrus sinensis]
          Length = 1179

 Score =  278 bits (710), Expect = 1e-71
 Identities = 181/424 (42%), Positives = 245/424 (57%), Gaps = 3/424 (0%)
 Frame = -1

Query: 1287 DGLVKKTKALKRPAVDFNXXXXXXXXXXXXXXSN-IETSPENVHKRLKTNTEEEIPRISA 1111
            DG +KK K+LKRP  D +                 + T P + H++   +        S 
Sbjct: 619  DGKLKKPKSLKRPLGDLSSEKPMVGEQKKKKKKKELGTPPNSDHQKRSASN-------ST 671

Query: 1110 GRSIGIGSTPSGKLQLDLQQVNSGSGSTHPSDSALSLQKVDLDNIEVELPQVVHDMLALA 931
             +S   G  PS   QL+ Q+ + G+ ST    S   L  V   NIEV LPQ++ D+ ALA
Sbjct: 672  KKSAQAGLGPSEDQQLNNQKKDGGA-STSALGSVEILPGVTTVNIEVGLPQLLRDLHALA 730

Query: 930  LDPFYGVEGETHAVVRQVLLRFRSIVYQKSLVLPSSSEADTSELRVGKPTTSTSIAEVPS 751
            LDPF+G E    + +RQ  LRFRS+VY KSLVL   S+ ++ E R  K ++S       S
Sbjct: 731  LDPFHGAERNCPSTIRQCFLRFRSLVYMKSLVLSPLSDTESVEGRAAKSSSSIGT----S 786

Query: 750  GEDARSLAPKHP-RQIPRSDDPTRSGRKRNLSDRQEEISAKRLKKLHELKSLTERKASNQ 574
            GE+ R L    P +Q+ R +DPT++GRKR  SDRQEEI+AKRLKK++++KSLT  K S+Q
Sbjct: 787  GENVRDLPASKPIKQLARPEDPTKAGRKRLPSDRQEEIAAKRLKKINQMKSLTSEKKSSQ 846

Query: 573  KIPDGSREQKDNNSSLP-SKPFNLDSAKNPESPVKVAEPGMLVLKFPQSKSLPSAAELKA 397
            +  DG R +   ++++P ++P     AK  E P +  +P MLV+KFP   SLPSAAELKA
Sbjct: 847  RALDGQRVEGKEHAAVPLARPVKPGFAKKLEPPSRAVQPTMLVMKFPPETSLPSAAELKA 906

Query: 396  RFSRFGPLDLPAMRVFYKTNTCRVVFRHKVHAKAAYGYAVRNNSLFGHVNVNYQLRDVGG 217
            RF RFG LD  A+RVF+K+ TCRVVF+HK  A+AAY YA  NN+LFG+V V Y LR+V  
Sbjct: 907  RFGRFGSLDQSAIRVFWKSFTCRVVFKHKADAQAAYKYANGNNTLFGNVKVRYILREVEA 966

Query: 216  QAPAEAEGAKVQQANGASDETLQLRXXXXXXXXXXXGPRPMTPMQRSRQLNSQLKSILKK 37
             AP   +  KV + + +S ET +++             RP        Q N QLKS LKK
Sbjct: 967  PAPEVPDFDKV-RGDESSYETPRIK--------DPVADRPTPAPGLLPQPNIQLKSCLKK 1017

Query: 36   PSGD 25
            P+ D
Sbjct: 1018 PASD 1021


>gb|KDO56247.1| hypothetical protein CISIN_1g001012mg [Citrus sinensis]
          Length = 1143

 Score =  278 bits (710), Expect = 1e-71
 Identities = 181/424 (42%), Positives = 245/424 (57%), Gaps = 3/424 (0%)
 Frame = -1

Query: 1287 DGLVKKTKALKRPAVDFNXXXXXXXXXXXXXXSN-IETSPENVHKRLKTNTEEEIPRISA 1111
            DG +KK K+LKRP  D +                 + T P + H++   +        S 
Sbjct: 619  DGKLKKPKSLKRPLGDLSSEKPMVGEQKKKKKKKELGTPPNSDHQKRSASN-------ST 671

Query: 1110 GRSIGIGSTPSGKLQLDLQQVNSGSGSTHPSDSALSLQKVDLDNIEVELPQVVHDMLALA 931
             +S   G  PS   QL+ Q+ + G+ ST    S   L  V   NIEV LPQ++ D+ ALA
Sbjct: 672  KKSAQAGLGPSEDQQLNNQKKDGGA-STSALGSVEILPGVTTVNIEVGLPQLLRDLHALA 730

Query: 930  LDPFYGVEGETHAVVRQVLLRFRSIVYQKSLVLPSSSEADTSELRVGKPTTSTSIAEVPS 751
            LDPF+G E    + +RQ  LRFRS+VY KSLVL   S+ ++ E R  K ++S       S
Sbjct: 731  LDPFHGAERNCPSTIRQCFLRFRSLVYMKSLVLSPLSDTESVEGRAAKSSSSIGT----S 786

Query: 750  GEDARSLAPKHP-RQIPRSDDPTRSGRKRNLSDRQEEISAKRLKKLHELKSLTERKASNQ 574
            GE+ R L    P +Q+ R +DPT++GRKR  SDRQEEI+AKRLKK++++KSLT  K S+Q
Sbjct: 787  GENVRDLPASKPIKQLARPEDPTKAGRKRLPSDRQEEIAAKRLKKINQMKSLTSEKKSSQ 846

Query: 573  KIPDGSREQKDNNSSLP-SKPFNLDSAKNPESPVKVAEPGMLVLKFPQSKSLPSAAELKA 397
            +  DG R +   ++++P ++P     AK  E P +  +P MLV+KFP   SLPSAAELKA
Sbjct: 847  RALDGQRVEGKEHAAVPLARPVKPGFAKKLEPPSRAVQPTMLVMKFPPETSLPSAAELKA 906

Query: 396  RFSRFGPLDLPAMRVFYKTNTCRVVFRHKVHAKAAYGYAVRNNSLFGHVNVNYQLRDVGG 217
            RF RFG LD  A+RVF+K+ TCRVVF+HK  A+AAY YA  NN+LFG+V V Y LR+V  
Sbjct: 907  RFGRFGSLDQSAIRVFWKSFTCRVVFKHKADAQAAYKYANGNNTLFGNVKVRYILREVEA 966

Query: 216  QAPAEAEGAKVQQANGASDETLQLRXXXXXXXXXXXGPRPMTPMQRSRQLNSQLKSILKK 37
             AP   +  KV + + +S ET +++             RP        Q N QLKS LKK
Sbjct: 967  PAPEVPDFDKV-RGDESSYETPRIK--------DPVADRPTPAPGLLPQPNIQLKSCLKK 1017

Query: 36   PSGD 25
            P+ D
Sbjct: 1018 PASD 1021


>gb|KDO56246.1| hypothetical protein CISIN_1g001012mg [Citrus sinensis]
          Length = 1072

 Score =  278 bits (710), Expect = 1e-71
 Identities = 181/424 (42%), Positives = 245/424 (57%), Gaps = 3/424 (0%)
 Frame = -1

Query: 1287 DGLVKKTKALKRPAVDFNXXXXXXXXXXXXXXSN-IETSPENVHKRLKTNTEEEIPRISA 1111
            DG +KK K+LKRP  D +                 + T P + H++   +        S 
Sbjct: 512  DGKLKKPKSLKRPLGDLSSEKPMVGEQKKKKKKKELGTPPNSDHQKRSASN-------ST 564

Query: 1110 GRSIGIGSTPSGKLQLDLQQVNSGSGSTHPSDSALSLQKVDLDNIEVELPQVVHDMLALA 931
             +S   G  PS   QL+ Q+ + G+ ST    S   L  V   NIEV LPQ++ D+ ALA
Sbjct: 565  KKSAQAGLGPSEDQQLNNQKKDGGA-STSALGSVEILPGVTTVNIEVGLPQLLRDLHALA 623

Query: 930  LDPFYGVEGETHAVVRQVLLRFRSIVYQKSLVLPSSSEADTSELRVGKPTTSTSIAEVPS 751
            LDPF+G E    + +RQ  LRFRS+VY KSLVL   S+ ++ E R  K ++S       S
Sbjct: 624  LDPFHGAERNCPSTIRQCFLRFRSLVYMKSLVLSPLSDTESVEGRAAKSSSSIGT----S 679

Query: 750  GEDARSLAPKHP-RQIPRSDDPTRSGRKRNLSDRQEEISAKRLKKLHELKSLTERKASNQ 574
            GE+ R L    P +Q+ R +DPT++GRKR  SDRQEEI+AKRLKK++++KSLT  K S+Q
Sbjct: 680  GENVRDLPASKPIKQLARPEDPTKAGRKRLPSDRQEEIAAKRLKKINQMKSLTSEKKSSQ 739

Query: 573  KIPDGSREQKDNNSSLP-SKPFNLDSAKNPESPVKVAEPGMLVLKFPQSKSLPSAAELKA 397
            +  DG R +   ++++P ++P     AK  E P +  +P MLV+KFP   SLPSAAELKA
Sbjct: 740  RALDGQRVEGKEHAAVPLARPVKPGFAKKLEPPSRAVQPTMLVMKFPPETSLPSAAELKA 799

Query: 396  RFSRFGPLDLPAMRVFYKTNTCRVVFRHKVHAKAAYGYAVRNNSLFGHVNVNYQLRDVGG 217
            RF RFG LD  A+RVF+K+ TCRVVF+HK  A+AAY YA  NN+LFG+V V Y LR+V  
Sbjct: 800  RFGRFGSLDQSAIRVFWKSFTCRVVFKHKADAQAAYKYANGNNTLFGNVKVRYILREVEA 859

Query: 216  QAPAEAEGAKVQQANGASDETLQLRXXXXXXXXXXXGPRPMTPMQRSRQLNSQLKSILKK 37
             AP   +  KV + + +S ET +++             RP        Q N QLKS LKK
Sbjct: 860  PAPEVPDFDKV-RGDESSYETPRIK--------DPVADRPTPAPGLLPQPNIQLKSCLKK 910

Query: 36   PSGD 25
            P+ D
Sbjct: 911  PASD 914


>ref|XP_006472071.1| PREDICTED: uncharacterized protein LOC102607628 isoform X2 [Citrus
            sinensis]
          Length = 1143

 Score =  272 bits (695), Expect = 7e-70
 Identities = 179/424 (42%), Positives = 241/424 (56%), Gaps = 3/424 (0%)
 Frame = -1

Query: 1287 DGLVKKTKALKRPAVDFNXXXXXXXXXXXXXXSN-IETSPENVHKRLKTNTEEEIPRISA 1111
            DG +KK K+LKRP  D +                 + T P + H++            S 
Sbjct: 619  DGKLKKPKSLKRPLGDLSSEKPMVGEQKKKKKKKELGTQPNSDHQKRSAPN-------ST 671

Query: 1110 GRSIGIGSTPSGKLQLDLQQVNSGSGSTHPSDSALSLQKVDLDNIEVELPQVVHDMLALA 931
             +S   G  PS   QL+ Q+ + G+ ST    S      V   NIEV LPQ++ D+ ALA
Sbjct: 672  KKSAQAGLGPSEDQQLNNQKKDGGA-STSALGSVEISPGVTTVNIEVGLPQLLRDLHALA 730

Query: 930  LDPFYGVEGETHAVVRQVLLRFRSIVYQKSLVLPSSSEADTSELRVGKPTTSTSIAEVPS 751
            LDPF+G E    + +RQ  LRFRS+VY KSLVL   S+ ++ E    K ++S       S
Sbjct: 731  LDPFHGAERNCPSTIRQCFLRFRSLVYMKSLVLSPLSDTESVEGHAAKSSSSIGT----S 786

Query: 750  GEDARSLAPKHP-RQIPRSDDPTRSGRKRNLSDRQEEISAKRLKKLHELKSLTERKASNQ 574
            GE+ R L    P +Q+ R +DPT++GRKR  SDRQEEI+AKRLKK++++KSLT  K S+Q
Sbjct: 787  GENVRDLPASKPIKQLARPEDPTKAGRKRLPSDRQEEIAAKRLKKINQMKSLTSEKKSSQ 846

Query: 573  KIPDGSREQKDNNSSLP-SKPFNLDSAKNPESPVKVAEPGMLVLKFPQSKSLPSAAELKA 397
            +  DG R +   ++++P  +P     AK  E P +  +P MLV+KFP   SLPSAAELKA
Sbjct: 847  RTLDGQRVEGKEHAAVPLPRPVKPGFAKKLEPPSRAVQPTMLVMKFPPETSLPSAAELKA 906

Query: 396  RFSRFGPLDLPAMRVFYKTNTCRVVFRHKVHAKAAYGYAVRNNSLFGHVNVNYQLRDVGG 217
            RF RFG LD  A+RVF+K+ TCRVVF+HK  A+AAY YA  NN+LFG+V V Y LR+V  
Sbjct: 907  RFGRFGSLDQSAIRVFWKSFTCRVVFKHKADAQAAYKYANGNNTLFGNVKVRYILREVEA 966

Query: 216  QAPAEAEGAKVQQANGASDETLQLRXXXXXXXXXXXGPRPMTPMQRSRQLNSQLKSILKK 37
             AP   +  KV + + +S ET +++             RP        Q N QLKS LKK
Sbjct: 967  PAPEVPDFDKV-RGDESSYETPRIK--------DPVADRPTPAPGLLPQPNIQLKSCLKK 1017

Query: 36   PSGD 25
            P+ D
Sbjct: 1018 PASD 1021


>ref|XP_006433394.1| hypothetical protein CICLE_v10000070mg [Citrus clementina]
            gi|568836067|ref|XP_006472070.1| PREDICTED:
            uncharacterized protein LOC102607628 isoform X1 [Citrus
            sinensis] gi|557535516|gb|ESR46634.1| hypothetical
            protein CICLE_v10000070mg [Citrus clementina]
          Length = 1179

 Score =  272 bits (695), Expect = 7e-70
 Identities = 179/424 (42%), Positives = 241/424 (56%), Gaps = 3/424 (0%)
 Frame = -1

Query: 1287 DGLVKKTKALKRPAVDFNXXXXXXXXXXXXXXSN-IETSPENVHKRLKTNTEEEIPRISA 1111
            DG +KK K+LKRP  D +                 + T P + H++            S 
Sbjct: 619  DGKLKKPKSLKRPLGDLSSEKPMVGEQKKKKKKKELGTQPNSDHQKRSAPN-------ST 671

Query: 1110 GRSIGIGSTPSGKLQLDLQQVNSGSGSTHPSDSALSLQKVDLDNIEVELPQVVHDMLALA 931
             +S   G  PS   QL+ Q+ + G+ ST    S      V   NIEV LPQ++ D+ ALA
Sbjct: 672  KKSAQAGLGPSEDQQLNNQKKDGGA-STSALGSVEISPGVTTVNIEVGLPQLLRDLHALA 730

Query: 930  LDPFYGVEGETHAVVRQVLLRFRSIVYQKSLVLPSSSEADTSELRVGKPTTSTSIAEVPS 751
            LDPF+G E    + +RQ  LRFRS+VY KSLVL   S+ ++ E    K ++S       S
Sbjct: 731  LDPFHGAERNCPSTIRQCFLRFRSLVYMKSLVLSPLSDTESVEGHAAKSSSSIGT----S 786

Query: 750  GEDARSLAPKHP-RQIPRSDDPTRSGRKRNLSDRQEEISAKRLKKLHELKSLTERKASNQ 574
            GE+ R L    P +Q+ R +DPT++GRKR  SDRQEEI+AKRLKK++++KSLT  K S+Q
Sbjct: 787  GENVRDLPASKPIKQLARPEDPTKAGRKRLPSDRQEEIAAKRLKKINQMKSLTSEKKSSQ 846

Query: 573  KIPDGSREQKDNNSSLP-SKPFNLDSAKNPESPVKVAEPGMLVLKFPQSKSLPSAAELKA 397
            +  DG R +   ++++P  +P     AK  E P +  +P MLV+KFP   SLPSAAELKA
Sbjct: 847  RTLDGQRVEGKEHAAVPLPRPVKPGFAKKLEPPSRAVQPTMLVMKFPPETSLPSAAELKA 906

Query: 396  RFSRFGPLDLPAMRVFYKTNTCRVVFRHKVHAKAAYGYAVRNNSLFGHVNVNYQLRDVGG 217
            RF RFG LD  A+RVF+K+ TCRVVF+HK  A+AAY YA  NN+LFG+V V Y LR+V  
Sbjct: 907  RFGRFGSLDQSAIRVFWKSFTCRVVFKHKADAQAAYKYANGNNTLFGNVKVRYILREVEA 966

Query: 216  QAPAEAEGAKVQQANGASDETLQLRXXXXXXXXXXXGPRPMTPMQRSRQLNSQLKSILKK 37
             AP   +  KV + + +S ET +++             RP        Q N QLKS LKK
Sbjct: 967  PAPEVPDFDKV-RGDESSYETPRIK--------DPVADRPTPAPGLLPQPNIQLKSCLKK 1017

Query: 36   PSGD 25
            P+ D
Sbjct: 1018 PASD 1021


>ref|XP_011036849.1| PREDICTED: uncharacterized protein LOC105134211 isoform X2 [Populus
            euphratica]
          Length = 1136

 Score =  269 bits (688), Expect = 5e-69
 Identities = 171/416 (41%), Positives = 237/416 (56%), Gaps = 24/416 (5%)
 Frame = -1

Query: 1182 ETSPENVHKRLKTNTEEEIPRISAGRSIGIGSTPSGKLQLDLQQVNSGSGSTHPSDSALS 1003
            ET+P++  KRL T  +  +  IS+G+S  I  +P    QL+ QQ + G+ +T P+     
Sbjct: 597  ETNPDHPKKRLATG-KGGVAGISSGKSTQISMSPGEDFQLNGQQKDVGTSNTLPN----- 650

Query: 1002 LQKVDLDNIEVELPQVVHDMLALALDPFYGVEGETHAVVRQVLLRFRSIVYQKSLVLPSS 823
                   +IE+ELPQ++ D+ ALALDPF+G E  + +V     LRFRS+VYQKSL L S 
Sbjct: 651  -------SIELELPQLLSDLQALALDPFHGAERNSPSVTMSFFLRFRSLVYQKSLALSSP 703

Query: 822  SEADTSELRVGKPTTSTSIAEVPSGEDARSLAPKHP-RQIPRSDDPTRSGRKRNLSDRQE 646
            SE +  E R  K +++   ++  + E++R L    P + + R DDPT++GRKR  SDRQE
Sbjct: 704  SETELVEARGAKSSSNIGASDYSASENSRGLTSSKPAKSLARLDDPTKAGRKRLPSDRQE 763

Query: 645  EISAKRLKKLHELKSLTERKASNQKIPDGSR-----------------------EQKDNN 535
            EI+AKRLKK+  LKSL   K + Q+  D  R                       E K   
Sbjct: 764  EIAAKRLKKITHLKSLASGKKAGQRSLDMQRVEGKEPVATQRAEGKLPATTHRPEGKHPV 823

Query: 534  SSLPSKPFNLDSAKNPESPVKVAEPGMLVLKFPQSKSLPSAAELKARFSRFGPLDLPAMR 355
            +  P K    DS K  E PV+  EP MLV+KFP   SLPSAA+LKA+F+RFG +D  A+R
Sbjct: 824  AQAPRKFVKPDSYKKMEPPVRANEPTMLVMKFPPETSLPSAAQLKAKFARFGSIDQSAIR 883

Query: 354  VFYKTNTCRVVFRHKVHAKAAYGYAVRNNSLFGHVNVNYQLRDVGGQAPAEAEGAKVQQA 175
            VF+K++ CRVVFR K+ A+AA  YAV N SLFG+VNV Y +R+VG  A +EA  ++  + 
Sbjct: 884  VFWKSSQCRVVFRRKLDAQAALRYAVANKSLFGNVNVRYNIREVGAPA-SEAPESEKSRG 942

Query: 174  NGASDETLQLRXXXXXXXXXXXGPRPMTPMQRSRQLNSQLKSILKKPSGDVVVNVP 7
            +  S +  Q +             +P +      Q   QLKSILKKP+G+  V VP
Sbjct: 943  DDTSVDATQAKDPLVERQAAAFAHQPPS------QSAGQLKSILKKPNGEEAVPVP 992


>ref|XP_012089027.1| PREDICTED: uncharacterized protein LOC105647517 isoform X1 [Jatropha
            curcas] gi|802756446|ref|XP_012089028.1| PREDICTED:
            uncharacterized protein LOC105647517 isoform X2 [Jatropha
            curcas] gi|643708576|gb|KDP23492.1| hypothetical protein
            JCGZ_23325 [Jatropha curcas]
          Length = 1189

 Score =  266 bits (680), Expect = 4e-68
 Identities = 177/427 (41%), Positives = 242/427 (56%), Gaps = 4/427 (0%)
 Frame = -1

Query: 1293 SGDGLVKKTKALKRPAVDFNXXXXXXXXXXXXXXSN--IETSPENVHKRLKTNTEEEIPR 1120
            S D  VKK K LKRP  D                 +   E SP++  KRL          
Sbjct: 634  SADVAVKKAKVLKRPLGDLGSENSVTREKKKKKKKDSGTEISPDHPKKRLAGAGV----- 688

Query: 1119 ISAGRSIGIGSTPSGKLQLDLQQVNSGSGSTHPSDSALSLQKVDLDNIEVELPQVVHDML 940
              AG+S  I        + + Q+ + G+ S  P  S   L  V + NIE+ELP ++ D+ 
Sbjct: 689  --AGKSSLINVASREDHRGNQQKKDVGT-SNAPFSSVGPLPMVGMGNIELELPHLLSDLH 745

Query: 939  ALALDPFYGVEGETHAVVRQVLLRFRSIVYQKSLVLPSSSEADTSELRVGKPTTSTSIAE 760
            ALAL+P++G E    ++  Q  LRFRS  YQKSL L   SE +T+E+R  K  +S  ++ 
Sbjct: 746  ALALNPYHGTERNGPSITMQFFLRFRSHFYQKSLALSPPSETETNEIRAAKFPSSAGVSG 805

Query: 759  VPSGEDARSLAPKHP-RQIPRSDDPTRSGRKRNLSDRQEEISAKRLKKLHELKSLTERKA 583
              +GE+ R L    P + + R DDP R GRKR  SDRQEEI+A++LKK+  LKSL   K 
Sbjct: 806  NSAGENVRDLTSSKPVKSLVRPDDPMRGGRKRLPSDRQEEIAARKLKKISMLKSLAAEKK 865

Query: 582  SNQKIPDGSR-EQKDNNSSLPSKPFNLDSAKNPESPVKVAEPGMLVLKFPQSKSLPSAAE 406
            +  +  +  R E K+  ++ P+KP   DSA+  ES  +  EP MLV+KFP   +LPSAA+
Sbjct: 866  AGMRTSETHRTEGKEPATTAPAKPVKSDSARKMESQPRAVEPTMLVMKFPPQTNLPSAAQ 925

Query: 405  LKARFSRFGPLDLPAMRVFYKTNTCRVVFRHKVHAKAAYGYAVRNNSLFGHVNVNYQLRD 226
            LKA+F+RFG +D  A+RVF++T+TCRVVFRHK+ A+AAY YAV NN+LFG++NV Y +R+
Sbjct: 926  LKAKFARFGSIDQSAIRVFWQTSTCRVVFRHKLDAQAAYKYAV-NNTLFGNLNVRYSVRE 984

Query: 225  VGGQAPAEAEGAKVQQANGASDETLQLRXXXXXXXXXXXGPRPMTPMQRSRQLNSQLKSI 46
            VG  APA +E A+  +  G  D TL+                P+       Q   QLKSI
Sbjct: 985  VG--APA-SEAAEADKGRG-DDTTLEAPRVKDPAIERP----PLLHQAVHPQSTVQLKSI 1036

Query: 45   LKKPSGD 25
            LKKP+GD
Sbjct: 1037 LKKPTGD 1043


>ref|XP_011023667.1| PREDICTED: uncharacterized protein LOC105125068 isoform X2 [Populus
            euphratica] gi|743830003|ref|XP_011023668.1| PREDICTED:
            uncharacterized protein LOC105125068 isoform X3 [Populus
            euphratica]
          Length = 1100

 Score =  261 bits (668), Expect = 1e-66
 Identities = 177/439 (40%), Positives = 235/439 (53%), Gaps = 12/439 (2%)
 Frame = -1

Query: 1305 VPLASGDGLVKKTKALKRPAVDFNXXXXXXXXXXXXXXSNIETSPENVHKRLKTNTEEEI 1126
            V  ++G G VK+ K +KRP  D +                 ET+P+   KRL T   EE+
Sbjct: 544  VGTSTGSG-VKRVKVIKRPVGDTSLRKSITGGKKKKEI-GAETNPDGPKKRLATGKGEEV 601

Query: 1125 PRISAGRSIGIGSTPSGKLQLDLQQVNSGSGSTHPSDSALSLQKVDLDNIEVELPQVVHD 946
             RIS G+S  +  +P                     DS L+ QK D    E EL Q++ D
Sbjct: 602  -RISLGKSTHVSVSPG-------------------EDSQLNSQKKD--GTEFELSQLLSD 639

Query: 945  MLALALDPFYGVEGETHAVVRQVLLRFRSIVYQKSLVLPSSSEADTSELRVGKPTTSTSI 766
             LALALDPF+  E  +H+V     LRFRS+V+QKSLVL   SE +  E+   K  +S   
Sbjct: 640  FLALALDPFHVAERNSHSVTMHFFLRFRSLVFQKSLVLSPPSETEVVEVSGTKSLSSIGA 699

Query: 765  AEVPSGEDARSLAPKHPRQI-PRSDDPTRSGRKRNLSDRQEEISAKRLKKLHELKSLTER 589
            ++  + EDAR L P  P ++  R +DPT++GRKR  SDRQEEI+AKRLKK+ +LKSL   
Sbjct: 700  SDYSASEDARGLIPSKPAKLLVRPNDPTKAGRKRLPSDRQEEIAAKRLKKIIQLKSLAAE 759

Query: 588  KASNQKIPDGSREQKDNNSSL-----------PSKPFNLDSAKNPESPVKVAEPGMLVLK 442
            K + + +     E K+  +             P K    DS K  E PV+  EP MLVL+
Sbjct: 760  KKAQRTLDTLGAEGKETVARQRAEGKQPVAQPPRKSVKPDSFKKTEPPVRAIEPTMLVLR 819

Query: 441  FPQSKSLPSAAELKARFSRFGPLDLPAMRVFYKTNTCRVVFRHKVHAKAAYGYAVRNNSL 262
            FP   SLPSAA+LKARF+RFG LD  A+RVF+K++ CRVVFR K+ A+AA  YA+ N SL
Sbjct: 820  FPPETSLPSAAQLKARFARFGSLDQSAIRVFWKSSQCRVVFRRKLDAQAALKYALGNKSL 879

Query: 261  FGHVNVNYQLRDVGGQAPAEAEGAKVQQANGASDETLQLRXXXXXXXXXXXGPRPMTPMQ 82
            FG VNV Y +R+VG  APA       +  +  S +  Q               +P +   
Sbjct: 880  FGDVNVRYNIREVG--APASEAPESDKSRDDTSVDAAQAEDSLADWQAVAFAHQPPS--- 934

Query: 81   RSRQLNSQLKSILKKPSGD 25
               Q   QLKSILK+P+GD
Sbjct: 935  ---QSTVQLKSILKRPNGD 950


>ref|XP_010025199.1| PREDICTED: uncharacterized protein LOC104415567 [Eucalyptus grandis]
            gi|629095812|gb|KCW61807.1| hypothetical protein
            EUGRSUZ_H04503 [Eucalyptus grandis]
          Length = 1157

 Score =  258 bits (658), Expect = 1e-65
 Identities = 176/440 (40%), Positives = 238/440 (54%), Gaps = 9/440 (2%)
 Frame = -1

Query: 1317 SPTVVPLASGDGLV--KKTKALKRPAVDFNXXXXXXXXXXXXXXSNI--ETSPENVHKRL 1150
            SP    + S DG +  KK K L  P  + +                I  ET  ++  KRL
Sbjct: 586  SPLSTDVVSADGAMRKKKKKVLGHPVGEPSSQNVVMREKKKKKRKEIGLETGSDHPRKRL 645

Query: 1149 KTNTEEEIPRISAGRSIGIGSTPSGKLQLDLQQVNSGSGSTHPSDSALSLQKVDLDNIEV 970
             T+         AG+   + S    +   D Q+    S  THP D  + +      N E+
Sbjct: 646  LTSKVGVSVAKVAGKLTQVDSASREESYADKQKKGEAS-RTHPDD--VGMVPTWSGNAEL 702

Query: 969  ELPQVVHDMLALALDPFYGVEGETHAVVRQVLLRFRSIVYQKSLVLPSSSEADTSELRVG 790
            +L Q+++ + ALALDPFYG+E    AV +Q  LRFRS+VYQKSL+L   SE DT E+R  
Sbjct: 703  DLRQLLNGLQALALDPFYGIERSNPAVTKQAFLRFRSLVYQKSLILAPPSETDTVEIRPA 762

Query: 789  KPTTSTSIAEVPSGEDARSLAP-KHPRQIPRSDDPTRSGRKRNLSDRQEEISAKRLKKLH 613
            K       A+  +GE  R L+  K  +   R DDP +SGRKR  SDRQEEI AKRLKK+H
Sbjct: 763  KSPAGVGAADQSTGESVRKLSSSKSTKPTGRFDDPAKSGRKRPPSDRQEEIEAKRLKKIH 822

Query: 612  ELKSLTERKASNQKIPDGSR-EQKDNNSSLP--SKPFNLDSAKNPESPVKVAEPGMLVLK 442
             +KSL   K + QK  D  R E ++  S+ P  +KPF +   ++   P + ++P +LV+K
Sbjct: 823  NIKSLAAEKRAIQKTQDAPRGEGRETVSATPKQAKPFPVKKVES--HPARASDPTILVMK 880

Query: 441  FPQSKSLPSAAELKARFSRFGPLDLPAMRVFYKTNTCRVVFRHKVHAKAAYGYAVRNNSL 262
            FP   SLPS  ELKARF+RFGPLD   +RVF+K++TCRVVF  K+ A+AAY YA  NN+L
Sbjct: 881  FPPGTSLPSVTELKARFARFGPLDYSGIRVFWKSSTCRVVFHRKLDAEAAYKYAAGNNNL 940

Query: 261  FGHVNVNYQLRDVGGQAPAEAEGAKVQQANGASDETLQLRXXXXXXXXXXXGPRPMTPMQ 82
            FG+  V Y LRD    A   +E  K  + N +  +T +L+              P T  +
Sbjct: 941  FGNAGVRYSLRDAEVPASEASESGK-GRGNDSVHDTPRLKD-------------PST--E 984

Query: 81   RSRQLNS-QLKSILKKPSGD 25
            RS   ++ QLKS LKK SGD
Sbjct: 985  RSGPASTVQLKSCLKKSSGD 1004


>ref|XP_011023665.1| PREDICTED: uncharacterized protein LOC105125068 isoform X1 [Populus
            euphratica] gi|743829997|ref|XP_011023666.1| PREDICTED:
            uncharacterized protein LOC105125068 isoform X1 [Populus
            euphratica]
          Length = 1111

 Score =  257 bits (657), Expect = 2e-65
 Identities = 177/450 (39%), Positives = 235/450 (52%), Gaps = 23/450 (5%)
 Frame = -1

Query: 1305 VPLASGDGLVKKTKALKRPAVDFNXXXXXXXXXXXXXXSNIETSPENVHKRLKTNTEEEI 1126
            V  ++G G VK+ K +KRP  D +                 ET+P+   KRL T   EE+
Sbjct: 544  VGTSTGSG-VKRVKVIKRPVGDTSLRKSITGGKKKKEI-GAETNPDGPKKRLATGKGEEV 601

Query: 1125 PRISAGRSIGIGSTPSGKLQLDLQQVNSGSGSTHPSDSALSLQKVDLDNIEVELPQVVHD 946
             RIS G+S  +  +P                     DS L+ QK D    E EL Q++ D
Sbjct: 602  -RISLGKSTHVSVSPG-------------------EDSQLNSQKKD--GTEFELSQLLSD 639

Query: 945  MLALALDPFYGVEGETHAVVRQVLLRFRSIVYQKSLVLPSSSEADTSELRVGKPTTSTSI 766
             LALALDPF+  E  +H+V     LRFRS+V+QKSLVL   SE +  E+   K  +S   
Sbjct: 640  FLALALDPFHVAERNSHSVTMHFFLRFRSLVFQKSLVLSPPSETEVVEVSGTKSLSSIGA 699

Query: 765  AEVPSGEDARSLAPKHPRQI-PRSDDPTRSGRKRNLSDRQEEISAKRLKKLHELKSLTER 589
            ++  + EDAR L P  P ++  R +DPT++GRKR  SDRQEEI+AKRLKK+ +LKSL   
Sbjct: 700  SDYSASEDARGLIPSKPAKLLVRPNDPTKAGRKRLPSDRQEEIAAKRLKKIIQLKSLAAE 759

Query: 588  KASNQKIPDGSREQKDNNSSL----------------------PSKPFNLDSAKNPESPV 475
            K + + +     E K+  +                        P K    DS K  E PV
Sbjct: 760  KKAQRTLDTLGAEGKETVARQRAEVKQTAATQRAEGKQPVAQPPRKSVKPDSFKKTEPPV 819

Query: 474  KVAEPGMLVLKFPQSKSLPSAAELKARFSRFGPLDLPAMRVFYKTNTCRVVFRHKVHAKA 295
            +  EP MLVL+FP   SLPSAA+LKARF+RFG LD  A+RVF+K++ CRVVFR K+ A+A
Sbjct: 820  RAIEPTMLVLRFPPETSLPSAAQLKARFARFGSLDQSAIRVFWKSSQCRVVFRRKLDAQA 879

Query: 294  AYGYAVRNNSLFGHVNVNYQLRDVGGQAPAEAEGAKVQQANGASDETLQLRXXXXXXXXX 115
            A  YA+ N SLFG VNV Y +R+VG  APA       +  +  S +  Q           
Sbjct: 880  ALKYALGNKSLFGDVNVRYNIREVG--APASEAPESDKSRDDTSVDAAQAEDSLADWQAV 937

Query: 114  XXGPRPMTPMQRSRQLNSQLKSILKKPSGD 25
                +P +      Q   QLKSILK+P+GD
Sbjct: 938  AFAHQPPS------QSTVQLKSILKRPNGD 961


>ref|XP_007020229.1| Tudor/PWWP/MBT superfamily protein, putative [Theobroma cacao]
            gi|508725557|gb|EOY17454.1| Tudor/PWWP/MBT superfamily
            protein, putative [Theobroma cacao]
          Length = 1133

 Score =  257 bits (657), Expect = 2e-65
 Identities = 171/435 (39%), Positives = 240/435 (55%), Gaps = 12/435 (2%)
 Frame = -1

Query: 1293 SGDGLVKKTKALKRPAVDFNXXXXXXXXXXXXXXSNIETSPENVHKRLKTNTEEEIPRIS 1114
            S +G VKK K  KRP+VD                   E   +   K     T  + P+  
Sbjct: 574  SSEGGVKKVK--KRPSVDIGSDNSALG----------ERKKKKKKKEAGPETNSDHPQKP 621

Query: 1113 AGRSIGIGSTPSGKLQL---DLQQVNSGSGSTHPSDSALSL----QKVDLDNIEVELPQV 955
                +G G   + ++ L   +  QVN       P++S+ +       + L N  +EL Q+
Sbjct: 622  F--VLGKGGAKAAQISLGPREESQVNHQKKDVGPANSSFNSVGASTTIGLGNSGLELAQL 679

Query: 954  VHDMLALALDPFYGVEGETHAVVRQVLLRFRSIVYQKSLVLPSSSEADTSELRVGKPTTS 775
            + D+ +LALDPF+ VE  +  ++RQ  LRFR++VYQKSLVL   SE + +E+R  KP   
Sbjct: 680  LSDLHSLALDPFHAVERNSPTIIRQFFLRFRALVYQKSLVLSPPSEMEPAEVRGTKPPPF 739

Query: 774  TSIAEVPSGEDARSLAPKHP-RQIPRSDDPTRSGRKRNLSDRQEEISAKRLKKLHELKSL 598
              +++    E+ R   P  P R + R DDPT++GRKR  SDRQEEI+AKRLKK+ +LKSL
Sbjct: 740  VGVSDNLPNENVRDSTPSKPVRPLVRPDDPTKAGRKRLPSDRQEEIAAKRLKKISQLKSL 799

Query: 597  TERKASNQKIPDGSREQ--KDNNSSLPSKPFNL-DSAKNPESPVKVAEPGMLVLKFPQSK 427
               K +N +  +  + +  +   +  P++P    DSA+  E P +  EP MLV+KFP   
Sbjct: 800  AAEKKANLRTMEAPKVEGKEQPTAGPPARPLKKPDSARKTEPPPRAVEPTMLVMKFPPQV 859

Query: 426  SLPSAAELKARFSRFGPLDLPAMRVFYKTNTCRVVFRHKVHAKAAYGYAVRNNSLFGHVN 247
            SLPS AELKARF RFG LD  A+RVF+K++TCRVVFRHK+ A+AAY YA  NNSLFG+VN
Sbjct: 860  SLPSVAELKARFGRFGSLDQSAIRVFWKSSTCRVVFRHKLDAQAAYRYANGNNSLFGNVN 919

Query: 246  VNYQLRDVGGQAPA-EAEGAKVQQANGASDETLQLRXXXXXXXXXXXGPRPMTPMQRSRQ 70
            V Y +R V  +APA E       + +  + ET++++              P+ P Q   Q
Sbjct: 920  VRYHVRSV--EAPAVEVPDFDKARGDDTASETMRVK------DPAVERSAPILPHQPLPQ 971

Query: 69   LNSQLKSILKKPSGD 25
                LKS LKKP+ D
Sbjct: 972  STVLLKSCLKKPTAD 986


>gb|KHN20237.1| hypothetical protein glysoja_023800 [Glycine soja]
          Length = 810

 Score =  257 bits (656), Expect = 2e-65
 Identities = 164/426 (38%), Positives = 233/426 (54%), Gaps = 5/426 (1%)
 Frame = -1

Query: 1287 DGLVKKTKALKRPAVDFNXXXXXXXXXXXXXXSN-IETSPENVHKRLKTNTEEEIPRIS- 1114
            DG+ KK K  KRPA D                 N +   P + H   K +T E+  ++S 
Sbjct: 268  DGVPKKIKVHKRPANDLKSKTSGIEGKRKKKMKNDLNLQPISGHLE-KISTSEKAVQLSG 326

Query: 1113 -AGRSIGIGSTPSGKLQLDLQQVNSGSGSTHPSDSALSLQKVDLDNIEVELPQVVHDMLA 937
             + + + IG      L+ +  QV++ + +  P DS        +  + +ELP ++ D+ A
Sbjct: 327  QSEKPVSIGLASREDLRSEPMQVDASTSNLMPMDS--------IAEVNIELPHLLGDLQA 378

Query: 936  LALDPFYGVEGETHAVVRQVLLRFRSIVYQKSLVLPSSSEADTSELRVGKPTTSTSIAEV 757
            LALDPF+GV+    AV RQ  LRFRS+VYQKSL +      +   +   +P +S   ++ 
Sbjct: 379  LALDPFHGVKRGIPAVTRQFFLRFRSLVYQKSLPVSPPMVTENEAVEDRRPPSSIGTSDS 438

Query: 756  PSGEDARSLAPKHPRQIPRSDDPTRSGRKRNLSDRQEEISAKRLKKLHELKSLT-ERKAS 580
            P      S   K  + I R DDPT++GRKR LSDRQEEIS KRLKK+  +K+L  E+KA 
Sbjct: 439  PDDRARASPLIKPVKHIVRPDDPTKAGRKRALSDRQEEISEKRLKKIKNIKALAAEKKAG 498

Query: 579  NQKIPDGSR-EQKDNNSSLPSKPFNLDSAKNPESPVKVAEPGMLVLKFPQSKSLPSAAEL 403
            +QK  +  + + K++ +  P K    +  +  E P K  EP +LV+KFP   SLPS AEL
Sbjct: 499  SQKTSEARQGDGKESMAQAPPKVVKPELTRKVERPAKAVEPTILVIKFPPETSLPSVAEL 558

Query: 402  KARFSRFGPLDLPAMRVFYKTNTCRVVFRHKVHAKAAYGYAVRNNSLFGHVNVNYQLRDV 223
            KARF+RFGP+D   +RVF+KT+TCRVVF HKV A++AY YA+ N SLFG+V +   LR+ 
Sbjct: 559  KARFARFGPIDQSGLRVFWKTSTCRVVFLHKVDAQSAYKYALANQSLFGNVGMKCFLREF 618

Query: 222  GGQAPAEAEGAKVQQANGASDETLQLRXXXXXXXXXXXGPRPMTPMQRSRQLNSQLKSIL 43
            G  +   +E AK +  NGA++                   +P+       Q   QLKSIL
Sbjct: 619  GDASSEVSEAAKARGDNGANESPRVKDPAVVQRQSSVSAQQPLP------QPMIQLKSIL 672

Query: 42   KKPSGD 25
            KK +GD
Sbjct: 673  KKSTGD 678


>ref|XP_003555609.1| PREDICTED: uncharacterized protein LOC100792700 [Glycine max]
            gi|947043162|gb|KRG92886.1| hypothetical protein
            GLYMA_20G235700 [Glycine max]
          Length = 1056

 Score =  257 bits (656), Expect = 2e-65
 Identities = 164/426 (38%), Positives = 233/426 (54%), Gaps = 5/426 (1%)
 Frame = -1

Query: 1287 DGLVKKTKALKRPAVDFNXXXXXXXXXXXXXXSN-IETSPENVHKRLKTNTEEEIPRIS- 1114
            DG+ KK K  KRPA D                 N +   P + H   K +T E+  ++S 
Sbjct: 514  DGVPKKIKVHKRPANDLKSKTSGIEGKRKKKMKNDLNLQPISGHLE-KISTSEKAVQLSG 572

Query: 1113 -AGRSIGIGSTPSGKLQLDLQQVNSGSGSTHPSDSALSLQKVDLDNIEVELPQVVHDMLA 937
             + + + IG      L+ +  QV++ + +  P DS        +  + +ELP ++ D+ A
Sbjct: 573  QSEKPVSIGLASREDLRSEPMQVDASTSNLMPMDS--------IAEVNIELPHLLGDLQA 624

Query: 936  LALDPFYGVEGETHAVVRQVLLRFRSIVYQKSLVLPSSSEADTSELRVGKPTTSTSIAEV 757
            LALDPF+GV+    AV RQ  LRFRS+VYQKSL +      +   +   +P +S   ++ 
Sbjct: 625  LALDPFHGVKRGIPAVTRQFFLRFRSLVYQKSLPVSPPMVTENEAVEDRRPPSSIGTSDS 684

Query: 756  PSGEDARSLAPKHPRQIPRSDDPTRSGRKRNLSDRQEEISAKRLKKLHELKSLT-ERKAS 580
            P      S   K  + I R DDPT++GRKR LSDRQEEIS KRLKK+  +K+L  E+KA 
Sbjct: 685  PDDRARASPLIKPVKHIVRPDDPTKAGRKRALSDRQEEISEKRLKKIKNIKALAAEKKAG 744

Query: 579  NQKIPDGSR-EQKDNNSSLPSKPFNLDSAKNPESPVKVAEPGMLVLKFPQSKSLPSAAEL 403
            +QK  +  + + K++ +  P K    +  +  E P K  EP +LV+KFP   SLPS AEL
Sbjct: 745  SQKTSEARQGDGKESMAQAPPKVVKPELTRKVERPAKAVEPTILVIKFPPETSLPSVAEL 804

Query: 402  KARFSRFGPLDLPAMRVFYKTNTCRVVFRHKVHAKAAYGYAVRNNSLFGHVNVNYQLRDV 223
            KARF+RFGP+D   +RVF+KT+TCRVVF HKV A++AY YA+ N SLFG+V +   LR+ 
Sbjct: 805  KARFARFGPIDQSGLRVFWKTSTCRVVFLHKVDAQSAYKYALANQSLFGNVGMKCFLREF 864

Query: 222  GGQAPAEAEGAKVQQANGASDETLQLRXXXXXXXXXXXGPRPMTPMQRSRQLNSQLKSIL 43
            G  +   +E AK +  NGA++                   +P+       Q   QLKSIL
Sbjct: 865  GDASSEVSEAAKARGDNGANESPRVKDPAVVQRQSSVSAQQPLP------QPMIQLKSIL 918

Query: 42   KKPSGD 25
            KK +GD
Sbjct: 919  KKSTGD 924


>ref|XP_003553721.1| PREDICTED: uncharacterized protein LOC100805944 [Glycine max]
            gi|734425615|gb|KHN43292.1| DNA mismatch repair protein
            Msh6 [Glycine soja] gi|947047193|gb|KRG96822.1|
            hypothetical protein GLYMA_19G234300 [Glycine max]
          Length = 1075

 Score =  253 bits (647), Expect = 3e-64
 Identities = 163/423 (38%), Positives = 237/423 (56%), Gaps = 5/423 (1%)
 Frame = -1

Query: 1278 VKKTKALKRPAVDFNXXXXXXXXXXXXXXSNIETSPENVHKRLKTNTEEEIPRISAG--- 1108
            VKK K LKRPA + N               N+   P  +  + K +T  ++  +S     
Sbjct: 519  VKKKKGLKRPADELNSETSAVGEEKKKKKKNLNLQP-TLGSQDKHSTFGKMIHLSGKSTE 577

Query: 1107 RSIGIGSTPSGKLQLDLQQVNSGSGSTHPSDSALSLQKVDLDNIEVELPQVVHDMLALAL 928
             ++  G  P      +  +V+  + +  P D+          N   EL Q++ D+ ALAL
Sbjct: 578  NAVSSGLAPREDFPAEQGEVDVNARNLLPMDTT--------GNANFELVQLLGDLQALAL 629

Query: 927  DPFYGVEGETHAVVRQVLLRFRSIVYQKSLVLPSSSEADTSELRVGKPTTSTSIAEVPSG 748
            +PF+G+E +  + V++  LRFRS+VYQKSL +   +E +  ++RV KP +S  I++ P  
Sbjct: 630  NPFHGIERKIPSAVQKFFLRFRSLVYQKSLFVSPPTENEAPDVRVTKPPSSVGISDSPDE 689

Query: 747  EDARSLAPKHPRQIPRSDDPTRSGRKRNLSDRQEEISAKRLKKLHELKSLTERKA-SNQK 571
                S   K  + I   DDPT++GRKR  SDRQEEI+AKRLKK+ ++K+L   KA +NQK
Sbjct: 690  YVKASPVVKPLKHIVWPDDPTKAGRKRAPSDRQEEIAAKRLKKIKDIKALASEKAVTNQK 749

Query: 570  IPDGSREQ-KDNNSSLPSKPFNLDSAKNPESPVKVAEPGMLVLKFPQSKSLPSAAELKAR 394
              +  +E  K++ S  PSK   L+S K  + P K  EP +L++KFP   SLPS AELKAR
Sbjct: 750  TSEAWQEDGKESMSQAPSKLVKLESNKKVDCPAKAVEPTILMIKFPPETSLPSIAELKAR 809

Query: 393  FSRFGPLDLPAMRVFYKTNTCRVVFRHKVHAKAAYGYAVRNNSLFGHVNVNYQLRDVGGQ 214
            F+RFGP+D    RVF+ ++TCRVVF HKV A+AAY Y+V + SLFG V V + LR+ G  
Sbjct: 810  FARFGPMDQSGFRVFWNSSTCRVVFLHKVDAQAAYKYSVGSQSLFGSVGVRFFLREFGDS 869

Query: 213  APAEAEGAKVQQANGASDETLQLRXXXXXXXXXXXGPRPMTPMQRSRQLNSQLKSILKKP 34
            AP  +E AK +  +GA +ET +++             + +   Q+      QLKS LKK 
Sbjct: 870  APEVSEAAKARADDGA-NETPRVKDPAGIHR------QTLVSSQQPLLQPIQLKSCLKKS 922

Query: 33   SGD 25
            +GD
Sbjct: 923  TGD 925


>ref|XP_012446851.1| PREDICTED: uncharacterized protein LOC105770274 [Gossypium raimondii]
            gi|763793052|gb|KJB60048.1| hypothetical protein
            B456_009G287300 [Gossypium raimondii]
          Length = 1115

 Score =  251 bits (641), Expect = 1e-63
 Identities = 179/451 (39%), Positives = 247/451 (54%), Gaps = 15/451 (3%)
 Frame = -1

Query: 1332 PIDSQSPTVVPLASGDGLVKKTKALKRPAVDFNXXXXXXXXXXXXXXSNIETSPENVHKR 1153
            P+D + P  V   S +G VKK K  KR + D                  +E   +   K 
Sbjct: 550  PVDIKRPGGV---SAEGGVKKVK--KRSSADIGVENSAL----------VEKKKKKKKKE 594

Query: 1152 LKTNTEEEIPRISA------GRSIGIGSTPSGKLQLDLQQVNSGSGSTHPSDSALSLQK- 994
              + T  + P+  +       +S  IG  P  + Q++ Q+ +     TH S +++     
Sbjct: 595  TGSETNSDKPKKPSFLGKDGAKSAHIGLGPREESQVNQQKKDVDP--THSSFNSVGASTT 652

Query: 993  VDLDNIEVELPQVVHDMLALALDPFYGVEGETHAVVRQVLLRFRSIVYQKSLVLPSSSEA 814
            + + N   EL Q++ D+ ALALDPF+GVE  +  +VRQ  LR+RS+VYQKSLV+  +SE 
Sbjct: 653  IGVGNSGFELAQLLSDLHALALDPFHGVERNSPTIVRQCFLRYRSLVYQKSLVVLPTSEM 712

Query: 813  DTSELRVGKPTTSTSIAEVPSGEDARSLAPKHP-RQIPRSDDPTRSGRKRNLSDRQEEIS 637
            D++ELR GKP            E+ R   P  P R + R DDPT++G KR  SDR EEI+
Sbjct: 713  DSTELRAGKPPLVGGSDNTK--ENVRDSTPSKPVRPLARPDDPTKAGLKRLPSDRLEEIA 770

Query: 636  AKRLKKLHELKSLTERKASNQKIPDGSREQ--KDNNSSLPSKPFNL-DSAKNPESPVKVA 466
            AKRLKKL +LKSLT  K  N +  +  + +  +   +  P++P    DS +  ES  +  
Sbjct: 771  AKRLKKLSQLKSLTAEKKGNLRASEAPKVEVKEQPTTGPPARPTKKPDSLRKVESLPRAV 830

Query: 465  EPGMLVLKFPQSKSLPSAAELKARFSRFGPLDLPAMRVFYKTNTCRVVFRHKVHAKAAYG 286
            EP MLV+KFP   SLPS AELKARF RFG LD  A+RVF+K++TCRVVFRHK+ A+AAY 
Sbjct: 831  EPTMLVMKFPPQVSLPSVAELKARFGRFGSLDQSAIRVFWKSSTCRVVFRHKIDAQAAYR 890

Query: 285  YAVRNNSLFGHVNVNYQLRDVGGQAP-AEAEGAKVQQANGASDETLQLRXXXXXXXXXXX 109
            YA   NSLFG+VNV Y LR V  +AP AEA  +   + +    ET++++           
Sbjct: 891  YANGTNSLFGNVNVRYHLRSV--EAPTAEALDSDKARGDETGSETIRVK--------DPV 940

Query: 108  GPRPMTPM---QRSRQLNSQLKSILKKPSGD 25
              RP  P+   Q   Q   QLKS LKKP+ +
Sbjct: 941  VERPAAPVVAHQPLPQPTVQLKSCLKKPTSE 971


>gb|KJB60047.1| hypothetical protein B456_009G287300 [Gossypium raimondii]
          Length = 1048

 Score =  251 bits (641), Expect = 1e-63
 Identities = 179/451 (39%), Positives = 247/451 (54%), Gaps = 15/451 (3%)
 Frame = -1

Query: 1332 PIDSQSPTVVPLASGDGLVKKTKALKRPAVDFNXXXXXXXXXXXXXXSNIETSPENVHKR 1153
            P+D + P  V   S +G VKK K  KR + D                  +E   +   K 
Sbjct: 550  PVDIKRPGGV---SAEGGVKKVK--KRSSADIGVENSAL----------VEKKKKKKKKE 594

Query: 1152 LKTNTEEEIPRISA------GRSIGIGSTPSGKLQLDLQQVNSGSGSTHPSDSALSLQK- 994
              + T  + P+  +       +S  IG  P  + Q++ Q+ +     TH S +++     
Sbjct: 595  TGSETNSDKPKKPSFLGKDGAKSAHIGLGPREESQVNQQKKDVDP--THSSFNSVGASTT 652

Query: 993  VDLDNIEVELPQVVHDMLALALDPFYGVEGETHAVVRQVLLRFRSIVYQKSLVLPSSSEA 814
            + + N   EL Q++ D+ ALALDPF+GVE  +  +VRQ  LR+RS+VYQKSLV+  +SE 
Sbjct: 653  IGVGNSGFELAQLLSDLHALALDPFHGVERNSPTIVRQCFLRYRSLVYQKSLVVLPTSEM 712

Query: 813  DTSELRVGKPTTSTSIAEVPSGEDARSLAPKHP-RQIPRSDDPTRSGRKRNLSDRQEEIS 637
            D++ELR GKP            E+ R   P  P R + R DDPT++G KR  SDR EEI+
Sbjct: 713  DSTELRAGKPPLVGGSDNTK--ENVRDSTPSKPVRPLARPDDPTKAGLKRLPSDRLEEIA 770

Query: 636  AKRLKKLHELKSLTERKASNQKIPDGSREQ--KDNNSSLPSKPFNL-DSAKNPESPVKVA 466
            AKRLKKL +LKSLT  K  N +  +  + +  +   +  P++P    DS +  ES  +  
Sbjct: 771  AKRLKKLSQLKSLTAEKKGNLRASEAPKVEVKEQPTTGPPARPTKKPDSLRKVESLPRAV 830

Query: 465  EPGMLVLKFPQSKSLPSAAELKARFSRFGPLDLPAMRVFYKTNTCRVVFRHKVHAKAAYG 286
            EP MLV+KFP   SLPS AELKARF RFG LD  A+RVF+K++TCRVVFRHK+ A+AAY 
Sbjct: 831  EPTMLVMKFPPQVSLPSVAELKARFGRFGSLDQSAIRVFWKSSTCRVVFRHKIDAQAAYR 890

Query: 285  YAVRNNSLFGHVNVNYQLRDVGGQAP-AEAEGAKVQQANGASDETLQLRXXXXXXXXXXX 109
            YA   NSLFG+VNV Y LR V  +AP AEA  +   + +    ET++++           
Sbjct: 891  YANGTNSLFGNVNVRYHLRSV--EAPTAEALDSDKARGDETGSETIRVK--------DPV 940

Query: 108  GPRPMTPM---QRSRQLNSQLKSILKKPSGD 25
              RP  P+   Q   Q   QLKS LKKP+ +
Sbjct: 941  VERPAAPVVAHQPLPQPTVQLKSCLKKPTSE 971


>gb|KJB60046.1| hypothetical protein B456_009G287300 [Gossypium raimondii]
          Length = 752

 Score =  251 bits (641), Expect = 1e-63
 Identities = 179/451 (39%), Positives = 247/451 (54%), Gaps = 15/451 (3%)
 Frame = -1

Query: 1332 PIDSQSPTVVPLASGDGLVKKTKALKRPAVDFNXXXXXXXXXXXXXXSNIETSPENVHKR 1153
            P+D + P  V   S +G VKK K  KR + D                  +E   +   K 
Sbjct: 187  PVDIKRPGGV---SAEGGVKKVK--KRSSADIGVENSAL----------VEKKKKKKKKE 231

Query: 1152 LKTNTEEEIPRISA------GRSIGIGSTPSGKLQLDLQQVNSGSGSTHPSDSALSLQK- 994
              + T  + P+  +       +S  IG  P  + Q++ Q+ +     TH S +++     
Sbjct: 232  TGSETNSDKPKKPSFLGKDGAKSAHIGLGPREESQVNQQKKDVDP--THSSFNSVGASTT 289

Query: 993  VDLDNIEVELPQVVHDMLALALDPFYGVEGETHAVVRQVLLRFRSIVYQKSLVLPSSSEA 814
            + + N   EL Q++ D+ ALALDPF+GVE  +  +VRQ  LR+RS+VYQKSLV+  +SE 
Sbjct: 290  IGVGNSGFELAQLLSDLHALALDPFHGVERNSPTIVRQCFLRYRSLVYQKSLVVLPTSEM 349

Query: 813  DTSELRVGKPTTSTSIAEVPSGEDARSLAPKHP-RQIPRSDDPTRSGRKRNLSDRQEEIS 637
            D++ELR GKP            E+ R   P  P R + R DDPT++G KR  SDR EEI+
Sbjct: 350  DSTELRAGKPPLVGGSDNTK--ENVRDSTPSKPVRPLARPDDPTKAGLKRLPSDRLEEIA 407

Query: 636  AKRLKKLHELKSLTERKASNQKIPDGSREQ--KDNNSSLPSKPFNL-DSAKNPESPVKVA 466
            AKRLKKL +LKSLT  K  N +  +  + +  +   +  P++P    DS +  ES  +  
Sbjct: 408  AKRLKKLSQLKSLTAEKKGNLRASEAPKVEVKEQPTTGPPARPTKKPDSLRKVESLPRAV 467

Query: 465  EPGMLVLKFPQSKSLPSAAELKARFSRFGPLDLPAMRVFYKTNTCRVVFRHKVHAKAAYG 286
            EP MLV+KFP   SLPS AELKARF RFG LD  A+RVF+K++TCRVVFRHK+ A+AAY 
Sbjct: 468  EPTMLVMKFPPQVSLPSVAELKARFGRFGSLDQSAIRVFWKSSTCRVVFRHKIDAQAAYR 527

Query: 285  YAVRNNSLFGHVNVNYQLRDVGGQAP-AEAEGAKVQQANGASDETLQLRXXXXXXXXXXX 109
            YA   NSLFG+VNV Y LR V  +AP AEA  +   + +    ET++++           
Sbjct: 528  YANGTNSLFGNVNVRYHLRSV--EAPTAEALDSDKARGDETGSETIRVK--------DPV 577

Query: 108  GPRPMTPM---QRSRQLNSQLKSILKKPSGD 25
              RP  P+   Q   Q   QLKS LKKP+ +
Sbjct: 578  VERPAAPVVAHQPLPQPTVQLKSCLKKPTSE 608


>ref|XP_002319529.1| PWWP domain-containing family protein [Populus trichocarpa]
            gi|222857905|gb|EEE95452.1| PWWP domain-containing family
            protein [Populus trichocarpa]
          Length = 1024

 Score =  248 bits (634), Expect = 9e-63
 Identities = 173/429 (40%), Positives = 225/429 (52%), Gaps = 2/429 (0%)
 Frame = -1

Query: 1305 VPLASGDGLVKKTKALKRPAVDFNXXXXXXXXXXXXXXSNIETSPENVHKRLKTNTEEEI 1126
            V  ++G G VKK K +KRP  D +                 ET+P+   KRL T   EE+
Sbjct: 499  VGTSTGSG-VKKVKVIKRPVGDTSSQKSIMGGKRKKEI-RAETNPDRPKKRLATGKGEEV 556

Query: 1125 PRISAGRSIGIGSTPSGKLQLDLQQVNSGSGSTHPSDSALSLQKVDLDNIEVELPQVVHD 946
             RIS G+S  I  +P                     DS L+ QK D   IE ELPQ++ D
Sbjct: 557  -RISLGKSTHISFSPG-------------------EDSQLNSQKKD--GIEFELPQLLSD 594

Query: 945  MLALALDPFYGVEGETHAVVRQVLLRFRSIVYQKSLVLPSSSEADTSELRVGKPTTSTSI 766
             LALALDPF+  E  +H+V     LRFRS+V+QKSLVL   SE +               
Sbjct: 595  FLALALDPFHVAERNSHSVTMHFFLRFRSLVFQKSLVLSPPSETEV-------------- 640

Query: 765  AEVPSGEDARSLAPKHPRQI-PRSDDPTRSGRKRNLSDRQEEISAKRLKKLHELKSLTER 589
                   D R L P  P ++  R +DPT++GRKR  SDRQEEI+AKR KK+ +LKSL   
Sbjct: 641  -------DTRGLIPSKPAKLLVRPNDPTKAGRKRLPSDRQEEIAAKRQKKIIQLKSLAAE 693

Query: 588  KASNQKIPDGSREQKDNN-SSLPSKPFNLDSAKNPESPVKVAEPGMLVLKFPQSKSLPSA 412
            K + + +     E K+   +  P K    DS K  E PV+  EP MLVL+FP   SLPSA
Sbjct: 694  KKAQRTLDTLGAEGKETPVAQPPRKSVKPDSFKKMEPPVRAIEPTMLVLRFPPETSLPSA 753

Query: 411  AELKARFSRFGPLDLPAMRVFYKTNTCRVVFRHKVHAKAAYGYAVRNNSLFGHVNVNYQL 232
            A+LKARF+RFG +D  A+RVF+K++ CRVVFR K+ A+AA  YA+ N SLFG VNV Y +
Sbjct: 754  AQLKARFARFGSIDQSAIRVFWKSSQCRVVFRRKLDAQAALKYALGNKSLFGDVNVRYNI 813

Query: 231  RDVGGQAPAEAEGAKVQQANGASDETLQLRXXXXXXXXXXXGPRPMTPMQRSRQLNSQLK 52
            R+VG  APA       +  +    +  Q               +P +      Q   QLK
Sbjct: 814  REVG--APASEPPESDKSRDDTFVDAAQAEDPLADWQAVAFAHQPPS------QSTVQLK 865

Query: 51   SILKKPSGD 25
            SILK+P+GD
Sbjct: 866  SILKRPNGD 874

