BLASTX nr result

ID: Sinomenium22_contig00004332 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00004332
         (1693 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAJ53142.1| JHL05D22.13 [Jatropha curcas]                         208   7e-51
ref|XP_002511755.1| poly(A) polymerase cid, putative [Ricinus co...   200   2e-48
ref|XP_007051995.1| Nucleotidyltransferase family protein isofor...   188   7e-45
ref|XP_007051994.1| Nucleotidyltransferase family protein isofor...   188   7e-45
ref|XP_007051993.1| Nucleotidyltransferase family protein isofor...   188   7e-45
ref|XP_007051992.1| Nucleotidyltransferase family protein isofor...   188   7e-45
ref|XP_007051991.1| Nucleotidyltransferase family protein isofor...   188   7e-45
ref|XP_002301312.2| hypothetical protein POPTR_0002s15230g [Popu...   184   1e-43
ref|XP_002272342.2| PREDICTED: uncharacterized protein LOC100267...   164   9e-38
emb|CAN77386.1| hypothetical protein VITISV_006352 [Vitis vinifera]   164   9e-38
ref|XP_004308428.1| PREDICTED: uncharacterized protein LOC101313...   164   1e-37
ref|XP_006375316.1| hypothetical protein POPTR_0014s06910g, part...   161   1e-36
ref|XP_006490961.1| PREDICTED: uncharacterized protein LOC102611...   160   2e-36
ref|XP_006445207.1| hypothetical protein CICLE_v10023615mg, part...   160   2e-36
ref|XP_006397741.1| hypothetical protein EUTSA_v10001324mg [Eutr...   157   1e-35
ref|NP_566048.1| Nucleotidyltransferase family protein [Arabidop...   156   2e-35
ref|XP_007220905.1| hypothetical protein PRUPE_ppa002004mg [Prun...   155   5e-35
ref|XP_003529982.1| PREDICTED: uncharacterized protein LOC100812...   155   7e-35
ref|XP_006339776.1| PREDICTED: uncharacterized protein LOC102603...   154   9e-35
ref|XP_002880188.1| hypothetical protein ARALYDRAFT_483698 [Arab...   152   4e-34

>dbj|BAJ53142.1| JHL05D22.13 [Jatropha curcas]
          Length = 748

 Score =  208 bits (529), Expect = 7e-51
 Identities = 158/453 (34%), Positives = 210/453 (46%), Gaps = 15/453 (3%)
 Frame = +2

Query: 380  PWPQTVNPLYASTSFLPGFHSSPWAPQGSSLAANQI------FFSDEFQQLGINGKP-DG 538
            PWP  ++       FL GF  + W    + LAA Q          D+ Q LG +G     
Sbjct: 97   PWPHNLSAAPLLPGFL-GFPQNHWPSPANHLAAGQFQGNQQGVLGDDLQILGFSGADVRA 155

Query: 539  WKXXXXXXXXXXXXXNMLMFGSLACEIGSPEELQNRNLLDKAMQFNVQKVLEVVEVNGRV 718
                             L FGS   +I + E L N N      + N  K LEV      +
Sbjct: 156  NNTIHNRVQQKQQLEQKLQFGSFRSDIQNVEALLNVN-----SKLNAAKELEVRLATRNL 210

Query: 719  DGLETRSNSGVEIGSNSGQNREFDRGMSSRSPHQWSRH---GRHEPRRSTVAEPRMVPPG 889
            +GLE+             Q R FD     RS   W +    G + P+     E RM PPG
Sbjct: 211  NGLESDQKF-------DSQLRTFDLREQDRSGGGWRKQPHGGNYRPQ-----ETRMPPPG 258

Query: 890  FPQRPKGPDQWAPMSRRSDLERNVGKEWRARDKL-NHQQTYTSDDKVEILKRQVPKFDGV 1066
            F  +P+G   W  +SRR +L+ NV KE   + +L N    ++S+DK+       P+ DG 
Sbjct: 259  FSNKPRGGGNWDYVSRRRELDYNVNKEKGNQGELSNRNALFSSEDKI-------PR-DG- 309

Query: 1067 EENESSRLGLVKQLDQPGPPMGSDLHSVPASDIEESLASVHGRVEGSGRNGLTREESRNG 1246
              + S  LGL  QLD+PGPP GS+L+SV A+D+E S+ +V   V   G+     +E R  
Sbjct: 310  --DRSRDLGLTGQLDRPGPPAGSNLYSVSAADVELSMLNVEAEVVEDGK-----DEGREL 362

Query: 1247 DRGGGSVYEAHNHLEREDFPEHFADSLVLRDEVGRRRSDSFNQDSRSNRSREP----QAT 1414
            D  G                E   DSL+L  E   +     N+ SR   SR      +  
Sbjct: 363  DEAG----------------EELVDSLLLEGESDGKNDKKQNRHSREKESRSDNRGQRTL 406

Query: 1415 GQWRKNYRRPVRCRMDIEKWTAPFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQ 1594
             Q  +  +R + CR DI++  APFLAIY+SLVP                  V+KEWP+A+
Sbjct: 407  SQRMRMLKRQMECRRDIDRLNAPFLAIYESLVPPEEEKAKQKQLLSLLEKLVNKEWPQAR 466

Query: 1595 LYLYGSCANSFGVSNSDIDVCLAIDDADIDKSE 1693
            LYLYGSCANSFGV  SDIDVCLAI +ADI+KSE
Sbjct: 467  LYLYGSCANSFGVLKSDIDVCLAIQNADINKSE 499


>ref|XP_002511755.1| poly(A) polymerase cid, putative [Ricinus communis]
            gi|223548935|gb|EEF50424.1| poly(A) polymerase cid,
            putative [Ricinus communis]
          Length = 696

 Score =  200 bits (508), Expect = 2e-48
 Identities = 149/446 (33%), Positives = 204/446 (45%), Gaps = 9/446 (2%)
 Frame = +2

Query: 383  WPQTVNPLYASTSFLPGFHSSPWAPQGSSLAANQI--FFSDEFQQLGINGKPDGWKXXXX 556
            WP  ++P       L    + PW  QGS    +    F  D+ Q+LG++    G      
Sbjct: 93   WPYNLSPPNLVPGLLGFPQNHPW--QGSQFQGSDQRGFLGDDLQRLGLSS---GNTRIRN 147

Query: 557  XXXXXXXXXNMLMFGSLACEIGSPEELQNRNLLDKAMQFNVQKVLEVVEVNGRVDGLETR 736
                       L FGS   +I  PE L N N      + N  K L V      ++G+E  
Sbjct: 148  LVQQKQQLEQKLQFGSFRSDIQPPEGLLNLN-----SKLNAAKELGVDLGIRNLNGMERN 202

Query: 737  SNSGVEIGSN---SGQNREFDRGMSSRSPHQWSRHGRHEPRRSTVAEPRMVPPGFPQRPK 907
             +   ++ SN   S    +  RG   + PH  +   +         E RM PPGF  +P+
Sbjct: 203  LHFEPQLMSNLRTSDLREQDQRGGWGKQPHGSNYRSQ---------ETRMPPPGFSNKPR 253

Query: 908  GPDQWAPMSRRSDLERNVGKEWRARDKLNHQQTYTSDDKVEILKRQVPKFDGVEENESSR 1087
            G      +SRR +L+ NV KE     +L+ +  + S +   +        DG   N S  
Sbjct: 254  GGGNMDHVSRRRELDHNVNKEKGNHSELSKRNAFLSSESKSLR-------DG---NGSRD 303

Query: 1088 LGLVKQLDQPGPPMGSDLHSVPASDIEESLASVHGRVEGSGRNGLTREESRNGDRGGGSV 1267
            LGL +QLD PGPP GS+LHSV A DIEESL + +  +   G+N                 
Sbjct: 304  LGLTRQLDHPGPPAGSNLHSVSALDIEESLLNFNAEMVEDGKN----------------- 346

Query: 1268 YEAHNHLEREDFPEHFADSLVLRDEVGRRRSDSFNQDSRSNRSREP----QATGQWRKNY 1435
             + H   + +D  E  AD+L+L  E   +  +  N+ SR   SR      Q   Q  +  
Sbjct: 347  -DGH---DLDDVGEELADTLLLEGESEGKNDNKQNRHSRDKESRSDNRGQQILSQRMRML 402

Query: 1436 RRPVRCRMDIEKWTAPFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQLYLYGSC 1615
            +R + CR DI++    FLAIY+SL+P                  V+KEWP A+LYLYGSC
Sbjct: 403  KRQMECRRDIDRLNVSFLAIYESLIPPEEEKSKQKQLLTLLEKLVNKEWPEARLYLYGSC 462

Query: 1616 ANSFGVSNSDIDVCLAIDDADIDKSE 1693
            ANSFGV  SDIDVCLAI DADI+KSE
Sbjct: 463  ANSFGVRKSDIDVCLAIQDADINKSE 488


>ref|XP_007051995.1| Nucleotidyltransferase family protein isoform 5 [Theobroma cacao]
            gi|508704256|gb|EOX96152.1| Nucleotidyltransferase family
            protein isoform 5 [Theobroma cacao]
          Length = 635

 Score =  188 bits (477), Expect = 7e-45
 Identities = 145/444 (32%), Positives = 206/444 (46%), Gaps = 7/444 (1%)
 Frame = +2

Query: 383  WPQTVNPLYASTSFLPGFHSSPWAPQGSSLAANQIFFSDEFQQLGINGKPDGWKXXXXXX 562
            WPQT++P  A  +FL GF  SPW+  G+  A NQ    D+ ++LG++G  +         
Sbjct: 92   WPQTLSPPLAP-NFL-GFPLSPWSSPGNQFAGNQGALMDDLRRLGLSGIDNNKNHVIQNR 149

Query: 563  XXXXXXXNMLMFGSLACEIGS---PEELQNRNLLDKAMQFNVQKVLEVVEVNGRVDGLET 733
                     L+FGS   +I +   PE   N NLL+ +            ++N     L++
Sbjct: 150  VQQKHQDQKLVFGSFPSDIQTLKTPEGSPNGNLLENS------------KLNLSNQQLDS 197

Query: 734  RSNSGVEIGSNSGQNREF-DRGMSSRSPHQWSRHGRHEPRRSTVAEPRMVPPGFPQRPKG 910
            R NS         Q+R   DRG       Q    G + P  S   E R  PPGF  +P+G
Sbjct: 198  RLNSNPNTSPYVFQHRNSGDRGK------QQQHGGSYRPTPSP--EARRSPPGFLGKPRG 249

Query: 911  PDQWAPM-SRRSDLERNVGKEWRARDKLNHQQTYTSDDKVEILKRQVPKFDGVEENESSR 1087
                    +RR   E NV       DK   + +  S D                    + 
Sbjct: 250  GGGNRDFGNRRRHFEHNV-------DKAKAEYSQPSSD--------------------NE 282

Query: 1088 LGLVKQLDQPGPPMGSDLHSVPASDIEESLASVHGRVEGSGRNGLTREESRNGDRGGGSV 1267
            +GL  QLD+PGPP GS+L SV A+DIEESL  +H      GR+  +R +    + GG   
Sbjct: 283  VGLSGQLDRPGPPAGSNLQSVSATDIEESLLELHS---DGGRDRFSRRDKFRREDGG--- 336

Query: 1268 YEAHNHLEREDFPEHFADSLVLRDEVGRRRSDSFNQDSRSNR--SREPQATGQWRKNYRR 1441
                   E ++  E   +SL++ DE   +     ++  + +R  +R  +   Q  +  +R
Sbjct: 337  -------EVDEVGEQLLESLLIEDESDDKNDKKQHRREKESRIDNRGQRLLSQRMRMLKR 389

Query: 1442 PVRCRMDIEKWTAPFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQLYLYGSCAN 1621
             + CR DI +  APFLA+Y+SL+P                  V KEWP A+LYLYGSCAN
Sbjct: 390  QMECRSDIHRLNAPFLALYESLIPPEEERAKQKQLLALLEKLVCKEWPEARLYLYGSCAN 449

Query: 1622 SFGVSNSDIDVCLAIDDADIDKSE 1693
            SFGVS SDIDVCLA ++ D++KSE
Sbjct: 450  SFGVSKSDIDVCLAFNEMDVNKSE 473


>ref|XP_007051994.1| Nucleotidyltransferase family protein isoform 4, partial [Theobroma
            cacao] gi|508704255|gb|EOX96151.1| Nucleotidyltransferase
            family protein isoform 4, partial [Theobroma cacao]
          Length = 585

 Score =  188 bits (477), Expect = 7e-45
 Identities = 145/444 (32%), Positives = 206/444 (46%), Gaps = 7/444 (1%)
 Frame = +2

Query: 383  WPQTVNPLYASTSFLPGFHSSPWAPQGSSLAANQIFFSDEFQQLGINGKPDGWKXXXXXX 562
            WPQT++P  A  +FL GF  SPW+  G+  A NQ    D+ ++LG++G  +         
Sbjct: 92   WPQTLSPPLAP-NFL-GFPLSPWSSPGNQFAGNQGALMDDLRRLGLSGIDNNKNHVIQNR 149

Query: 563  XXXXXXXNMLMFGSLACEIGS---PEELQNRNLLDKAMQFNVQKVLEVVEVNGRVDGLET 733
                     L+FGS   +I +   PE   N NLL+ +            ++N     L++
Sbjct: 150  VQQKHQDQKLVFGSFPSDIQTLKTPEGSPNGNLLENS------------KLNLSNQQLDS 197

Query: 734  RSNSGVEIGSNSGQNREF-DRGMSSRSPHQWSRHGRHEPRRSTVAEPRMVPPGFPQRPKG 910
            R NS         Q+R   DRG       Q    G + P  S   E R  PPGF  +P+G
Sbjct: 198  RLNSNPNTSPYVFQHRNSGDRGK------QQQHGGSYRPTPSP--EARRSPPGFLGKPRG 249

Query: 911  PDQWAPM-SRRSDLERNVGKEWRARDKLNHQQTYTSDDKVEILKRQVPKFDGVEENESSR 1087
                    +RR   E NV       DK   + +  S D                    + 
Sbjct: 250  GGGNRDFGNRRRHFEHNV-------DKAKAEYSQPSSD--------------------NE 282

Query: 1088 LGLVKQLDQPGPPMGSDLHSVPASDIEESLASVHGRVEGSGRNGLTREESRNGDRGGGSV 1267
            +GL  QLD+PGPP GS+L SV A+DIEESL  +H      GR+  +R +    + GG   
Sbjct: 283  VGLSGQLDRPGPPAGSNLQSVSATDIEESLLELHS---DGGRDRFSRRDKFRREDGG--- 336

Query: 1268 YEAHNHLEREDFPEHFADSLVLRDEVGRRRSDSFNQDSRSNR--SREPQATGQWRKNYRR 1441
                   E ++  E   +SL++ DE   +     ++  + +R  +R  +   Q  +  +R
Sbjct: 337  -------EVDEVGEQLLESLLIEDESDDKNDKKQHRREKESRIDNRGQRLLSQRMRMLKR 389

Query: 1442 PVRCRMDIEKWTAPFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQLYLYGSCAN 1621
             + CR DI +  APFLA+Y+SL+P                  V KEWP A+LYLYGSCAN
Sbjct: 390  QMECRSDIHRLNAPFLALYESLIPPEEERAKQKQLLALLEKLVCKEWPEARLYLYGSCAN 449

Query: 1622 SFGVSNSDIDVCLAIDDADIDKSE 1693
            SFGVS SDIDVCLA ++ D++KSE
Sbjct: 450  SFGVSKSDIDVCLAFNEMDVNKSE 473


>ref|XP_007051993.1| Nucleotidyltransferase family protein isoform 3, partial [Theobroma
            cacao] gi|508704254|gb|EOX96150.1| Nucleotidyltransferase
            family protein isoform 3, partial [Theobroma cacao]
          Length = 584

 Score =  188 bits (477), Expect = 7e-45
 Identities = 145/444 (32%), Positives = 206/444 (46%), Gaps = 7/444 (1%)
 Frame = +2

Query: 383  WPQTVNPLYASTSFLPGFHSSPWAPQGSSLAANQIFFSDEFQQLGINGKPDGWKXXXXXX 562
            WPQT++P  A  +FL GF  SPW+  G+  A NQ    D+ ++LG++G  +         
Sbjct: 92   WPQTLSPPLAP-NFL-GFPLSPWSSPGNQFAGNQGALMDDLRRLGLSGIDNNKNHVIQNR 149

Query: 563  XXXXXXXNMLMFGSLACEIGS---PEELQNRNLLDKAMQFNVQKVLEVVEVNGRVDGLET 733
                     L+FGS   +I +   PE   N NLL+ +            ++N     L++
Sbjct: 150  VQQKHQDQKLVFGSFPSDIQTLKTPEGSPNGNLLENS------------KLNLSNQQLDS 197

Query: 734  RSNSGVEIGSNSGQNREF-DRGMSSRSPHQWSRHGRHEPRRSTVAEPRMVPPGFPQRPKG 910
            R NS         Q+R   DRG       Q    G + P  S   E R  PPGF  +P+G
Sbjct: 198  RLNSNPNTSPYVFQHRNSGDRGK------QQQHGGSYRPTPSP--EARRSPPGFLGKPRG 249

Query: 911  PDQWAPM-SRRSDLERNVGKEWRARDKLNHQQTYTSDDKVEILKRQVPKFDGVEENESSR 1087
                    +RR   E NV       DK   + +  S D                    + 
Sbjct: 250  GGGNRDFGNRRRHFEHNV-------DKAKAEYSQPSSD--------------------NE 282

Query: 1088 LGLVKQLDQPGPPMGSDLHSVPASDIEESLASVHGRVEGSGRNGLTREESRNGDRGGGSV 1267
            +GL  QLD+PGPP GS+L SV A+DIEESL  +H      GR+  +R +    + GG   
Sbjct: 283  VGLSGQLDRPGPPAGSNLQSVSATDIEESLLELHS---DGGRDRFSRRDKFRREDGG--- 336

Query: 1268 YEAHNHLEREDFPEHFADSLVLRDEVGRRRSDSFNQDSRSNR--SREPQATGQWRKNYRR 1441
                   E ++  E   +SL++ DE   +     ++  + +R  +R  +   Q  +  +R
Sbjct: 337  -------EVDEVGEQLLESLLIEDESDDKNDKKQHRREKESRIDNRGQRLLSQRMRMLKR 389

Query: 1442 PVRCRMDIEKWTAPFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQLYLYGSCAN 1621
             + CR DI +  APFLA+Y+SL+P                  V KEWP A+LYLYGSCAN
Sbjct: 390  QMECRSDIHRLNAPFLALYESLIPPEEERAKQKQLLALLEKLVCKEWPEARLYLYGSCAN 449

Query: 1622 SFGVSNSDIDVCLAIDDADIDKSE 1693
            SFGVS SDIDVCLA ++ D++KSE
Sbjct: 450  SFGVSKSDIDVCLAFNEMDVNKSE 473


>ref|XP_007051992.1| Nucleotidyltransferase family protein isoform 2 [Theobroma cacao]
            gi|508704253|gb|EOX96149.1| Nucleotidyltransferase family
            protein isoform 2 [Theobroma cacao]
          Length = 621

 Score =  188 bits (477), Expect = 7e-45
 Identities = 145/444 (32%), Positives = 206/444 (46%), Gaps = 7/444 (1%)
 Frame = +2

Query: 383  WPQTVNPLYASTSFLPGFHSSPWAPQGSSLAANQIFFSDEFQQLGINGKPDGWKXXXXXX 562
            WPQT++P  A  +FL GF  SPW+  G+  A NQ    D+ ++LG++G  +         
Sbjct: 92   WPQTLSPPLAP-NFL-GFPLSPWSSPGNQFAGNQGALMDDLRRLGLSGIDNNKNHVIQNR 149

Query: 563  XXXXXXXNMLMFGSLACEIGS---PEELQNRNLLDKAMQFNVQKVLEVVEVNGRVDGLET 733
                     L+FGS   +I +   PE   N NLL+ +            ++N     L++
Sbjct: 150  VQQKHQDQKLVFGSFPSDIQTLKTPEGSPNGNLLENS------------KLNLSNQQLDS 197

Query: 734  RSNSGVEIGSNSGQNREF-DRGMSSRSPHQWSRHGRHEPRRSTVAEPRMVPPGFPQRPKG 910
            R NS         Q+R   DRG       Q    G + P  S   E R  PPGF  +P+G
Sbjct: 198  RLNSNPNTSPYVFQHRNSGDRGK------QQQHGGSYRPTPSP--EARRSPPGFLGKPRG 249

Query: 911  PDQWAPM-SRRSDLERNVGKEWRARDKLNHQQTYTSDDKVEILKRQVPKFDGVEENESSR 1087
                    +RR   E NV       DK   + +  S D                    + 
Sbjct: 250  GGGNRDFGNRRRHFEHNV-------DKAKAEYSQPSSD--------------------NE 282

Query: 1088 LGLVKQLDQPGPPMGSDLHSVPASDIEESLASVHGRVEGSGRNGLTREESRNGDRGGGSV 1267
            +GL  QLD+PGPP GS+L SV A+DIEESL  +H      GR+  +R +    + GG   
Sbjct: 283  VGLSGQLDRPGPPAGSNLQSVSATDIEESLLELHS---DGGRDRFSRRDKFRREDGG--- 336

Query: 1268 YEAHNHLEREDFPEHFADSLVLRDEVGRRRSDSFNQDSRSNR--SREPQATGQWRKNYRR 1441
                   E ++  E   +SL++ DE   +     ++  + +R  +R  +   Q  +  +R
Sbjct: 337  -------EVDEVGEQLLESLLIEDESDDKNDKKQHRREKESRIDNRGQRLLSQRMRMLKR 389

Query: 1442 PVRCRMDIEKWTAPFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQLYLYGSCAN 1621
             + CR DI +  APFLA+Y+SL+P                  V KEWP A+LYLYGSCAN
Sbjct: 390  QMECRSDIHRLNAPFLALYESLIPPEEERAKQKQLLALLEKLVCKEWPEARLYLYGSCAN 449

Query: 1622 SFGVSNSDIDVCLAIDDADIDKSE 1693
            SFGVS SDIDVCLA ++ D++KSE
Sbjct: 450  SFGVSKSDIDVCLAFNEMDVNKSE 473


>ref|XP_007051991.1| Nucleotidyltransferase family protein isoform 1 [Theobroma cacao]
            gi|508704252|gb|EOX96148.1| Nucleotidyltransferase family
            protein isoform 1 [Theobroma cacao]
          Length = 722

 Score =  188 bits (477), Expect = 7e-45
 Identities = 145/444 (32%), Positives = 206/444 (46%), Gaps = 7/444 (1%)
 Frame = +2

Query: 383  WPQTVNPLYASTSFLPGFHSSPWAPQGSSLAANQIFFSDEFQQLGINGKPDGWKXXXXXX 562
            WPQT++P  A  +FL GF  SPW+  G+  A NQ    D+ ++LG++G  +         
Sbjct: 92   WPQTLSPPLAP-NFL-GFPLSPWSSPGNQFAGNQGALMDDLRRLGLSGIDNNKNHVIQNR 149

Query: 563  XXXXXXXNMLMFGSLACEIGS---PEELQNRNLLDKAMQFNVQKVLEVVEVNGRVDGLET 733
                     L+FGS   +I +   PE   N NLL+ +            ++N     L++
Sbjct: 150  VQQKHQDQKLVFGSFPSDIQTLKTPEGSPNGNLLENS------------KLNLSNQQLDS 197

Query: 734  RSNSGVEIGSNSGQNREF-DRGMSSRSPHQWSRHGRHEPRRSTVAEPRMVPPGFPQRPKG 910
            R NS         Q+R   DRG       Q    G + P  S   E R  PPGF  +P+G
Sbjct: 198  RLNSNPNTSPYVFQHRNSGDRGK------QQQHGGSYRPTPSP--EARRSPPGFLGKPRG 249

Query: 911  PDQWAPM-SRRSDLERNVGKEWRARDKLNHQQTYTSDDKVEILKRQVPKFDGVEENESSR 1087
                    +RR   E NV       DK   + +  S D                    + 
Sbjct: 250  GGGNRDFGNRRRHFEHNV-------DKAKAEYSQPSSD--------------------NE 282

Query: 1088 LGLVKQLDQPGPPMGSDLHSVPASDIEESLASVHGRVEGSGRNGLTREESRNGDRGGGSV 1267
            +GL  QLD+PGPP GS+L SV A+DIEESL  +H      GR+  +R +    + GG   
Sbjct: 283  VGLSGQLDRPGPPAGSNLQSVSATDIEESLLELHS---DGGRDRFSRRDKFRREDGG--- 336

Query: 1268 YEAHNHLEREDFPEHFADSLVLRDEVGRRRSDSFNQDSRSNR--SREPQATGQWRKNYRR 1441
                   E ++  E   +SL++ DE   +     ++  + +R  +R  +   Q  +  +R
Sbjct: 337  -------EVDEVGEQLLESLLIEDESDDKNDKKQHRREKESRIDNRGQRLLSQRMRMLKR 389

Query: 1442 PVRCRMDIEKWTAPFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQLYLYGSCAN 1621
             + CR DI +  APFLA+Y+SL+P                  V KEWP A+LYLYGSCAN
Sbjct: 390  QMECRSDIHRLNAPFLALYESLIPPEEERAKQKQLLALLEKLVCKEWPEARLYLYGSCAN 449

Query: 1622 SFGVSNSDIDVCLAIDDADIDKSE 1693
            SFGVS SDIDVCLA ++ D++KSE
Sbjct: 450  SFGVSKSDIDVCLAFNEMDVNKSE 473


>ref|XP_002301312.2| hypothetical protein POPTR_0002s15230g [Populus trichocarpa]
            gi|550345065|gb|EEE80585.2| hypothetical protein
            POPTR_0002s15230g [Populus trichocarpa]
          Length = 728

 Score =  184 bits (467), Expect = 1e-43
 Identities = 133/372 (35%), Positives = 184/372 (49%), Gaps = 4/372 (1%)
 Frame = +2

Query: 590  LMFGSLACEIGSPEE-LQNRNLLDKAMQFNVQKVLEVVEVNGRVDGLETRSNSGVEIGSN 766
            L FGS + EI SP E L N NL           V EV       +GLE   +   +  SN
Sbjct: 163  LQFGSFSSEIQSPAEVLVNANL-----------VREVGPGGRSFNGLERNRHLEKQANSN 211

Query: 767  SGQNREFDR--GMSSRSPHQWSRHGRHEPRRSTVAEPRMVPPGFPQRPKGPDQWAPMSRR 940
            S +N E  +  G S    +Q      H+ +      P   PPGF  +P+G   W   SRR
Sbjct: 212  SRRNSEVRQPGGSSGGWGNQHRNQHLHQEQHRNYRSP---PPGFSNKPRGGGNWDYGSRR 268

Query: 941  SDLERNVGKEWRARDKLNHQQTYTSDDKVEILKRQVPKFDGVEENESSRLGLVKQLDQPG 1120
             +LE N+ +E     ++N+++   S+  VE                   LGL +QLD+PG
Sbjct: 269  RELELNITRENGDYSEMNNEKVRRSEGSVE-------------------LGLTRQLDRPG 309

Query: 1121 PPMGSDLHSVPASDIEESLASVHGRVEGSGRNGLTREESRNGDRGGGSVYEAHNHLERED 1300
            PP GS+LHSV  S+I ESL ++ G            E   +G   GG         E +D
Sbjct: 310  PPAGSNLHSVLGSEIGESLINLDG------------ENGEDGKDDGG---------ELDD 348

Query: 1301 FPEHFADSLVLRDEV-GRRRSDSFNQDSRSNRSREPQATGQWRKNYRRPVRCRMDIEKWT 1477
              E   DSL+L  +  G++     N++SRS+ +R  +   Q  +  ++  +C +DI++  
Sbjct: 349  LGEELVDSLLLNGQSEGKKDKKQSNKESRSD-NRGKKILSQRMRMLKKQTQCCLDIDRLN 407

Query: 1478 APFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQLYLYGSCANSFGVSNSDIDVC 1657
            A FLAIY+SL+P                  V+KEWP A+LYLYGS ANSFGVS SDIDVC
Sbjct: 408  AAFLAIYESLIPPEEEKMKQELFLMSLEKLVNKEWPEARLYLYGSGANSFGVSKSDIDVC 467

Query: 1658 LAIDDADIDKSE 1693
            LAI+DA+I+KSE
Sbjct: 468  LAIEDAEINKSE 479


>ref|XP_002272342.2| PREDICTED: uncharacterized protein LOC100267790 [Vitis vinifera]
          Length = 679

 Score =  164 bits (416), Expect = 9e-38
 Identities = 141/451 (31%), Positives = 198/451 (43%), Gaps = 13/451 (2%)
 Frame = +2

Query: 380  PWPQTVNPLYASTSFLPGFHSSPWAPQGSSLAANQIFFSDEFQQLG--INGKPDGWKXXX 553
            PW    N L      + G   +PW PQ      ++    ++ ++LG  + GK        
Sbjct: 86   PWANPPNYL------IQGLAQNPWPPQTPQFIGDRELLGEDGRRLGFDVRGKT------- 132

Query: 554  XXXXXXXXXXNMLMFGSLACEIGSPEELQNRNLLDKAMQFNVQKVLEVVEVNGRVDGLET 733
                      + LMFGS  CEI +   L N   L+  +   +++ L      G+ D L+ 
Sbjct: 133  ----VQHQQHHKLMFGSFPCEIQNHGGLVNGKSLENPIPGAIREPLV-----GKFDALKN 183

Query: 734  RSNSGVEIGSNSGQNREFDRGMSSRSPHQWSRHGRHEPRRSTVAEPRMVPPGFPQRPKGP 913
                G++   N   +    +    R    W  H + E  RS        PPGFP + +  
Sbjct: 184  HK-MGLDPIWNLNSHHNASQQEQERRTVGWGTHQQGEFSRSGP------PPGFPSKARAV 236

Query: 914  DQWAPMSRRSDLERNVGKEWRARDKLNHQQTYTSDDKVEILKRQVPKFDGVEENESSRLG 1093
                    R  LE          DK+N +   T++D  E ++R  P+      N S++LG
Sbjct: 237  GNCDSGILRRGLE----------DKVN-KGNVTANDYDEKVRRLSPRHVDNHGNASAQLG 285

Query: 1094 LVKQLDQPGPPMGSDLHSVPASDIEESLASVHGRVEGSG------RNGLTREESRNGDRG 1255
            L  QL+ PGP +        ASDIEE L ++   ++G G      + G+ RE   N D  
Sbjct: 286  LTGQLEHPGPLL--------ASDIEECLLNLGAEIDGVGDRVRHQKQGMRREGQGNLD-- 335

Query: 1256 GGSVYEAHNHLEREDFPEHFADSLVLRD-----EVGRRRSDSFNQDSRSNRSREPQATGQ 1420
                          D  E    SLVL D         +  +S N+D RS+ +R  +   Q
Sbjct: 336  --------------DLSEEMTGSLVLEDGSQDKNDTNQHHNSRNRDFRSD-TRGQRMLSQ 380

Query: 1421 WRKNYRRPVRCRMDIEKWTAPFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQLY 1600
              +N +R + CR DI      FL+IY+SL+P                  VSKEWP+AQL+
Sbjct: 381  RVRNLKRHMECRRDIGTLNFRFLSIYESLIPEEEEKAKQKQLLTLLEKLVSKEWPKAQLF 440

Query: 1601 LYGSCANSFGVSNSDIDVCLAIDDADIDKSE 1693
            LYGSCANSFGVS SDIDVCLAIDDADI+KSE
Sbjct: 441  LYGSCANSFGVSKSDIDVCLAIDDADINKSE 471


>emb|CAN77386.1| hypothetical protein VITISV_006352 [Vitis vinifera]
          Length = 720

 Score =  164 bits (416), Expect = 9e-38
 Identities = 141/451 (31%), Positives = 198/451 (43%), Gaps = 13/451 (2%)
 Frame = +2

Query: 380  PWPQTVNPLYASTSFLPGFHSSPWAPQGSSLAANQIFFSDEFQQLG--INGKPDGWKXXX 553
            PW    N L      + G   +PW PQ      ++    ++ ++LG  + GK        
Sbjct: 86   PWANPPNYL------IQGLAQNPWPPQTPQFIGDRELLGEDGRRLGFDVRGKT------- 132

Query: 554  XXXXXXXXXXNMLMFGSLACEIGSPEELQNRNLLDKAMQFNVQKVLEVVEVNGRVDGLET 733
                      + LMFGS  CEI +   L N   L+  +   +++ L      G+ D L+ 
Sbjct: 133  ----VQHQQHHKLMFGSFPCEIQNHGGLVNGKSLENPIPGAIREPLV-----GKFDALKN 183

Query: 734  RSNSGVEIGSNSGQNREFDRGMSSRSPHQWSRHGRHEPRRSTVAEPRMVPPGFPQRPKGP 913
                G++   N   +    +    R    W  H + E  RS        PPGFP + +  
Sbjct: 184  HK-MGLDPIWNLNSHHNASQQEQERRTVGWGTHQQGEFSRSGP------PPGFPSKARAV 236

Query: 914  DQWAPMSRRSDLERNVGKEWRARDKLNHQQTYTSDDKVEILKRQVPKFDGVEENESSRLG 1093
                    R  LE          DK+N +   T++D  E ++R  P+      N S++LG
Sbjct: 237  GNCDSGILRRGLE----------DKVN-KGNVTANDYDEKVRRLSPRHVDNHGNASAQLG 285

Query: 1094 LVKQLDQPGPPMGSDLHSVPASDIEESLASVHGRVEGSG------RNGLTREESRNGDRG 1255
            L  QL+ PGP +        ASDIEE L ++   ++G G      + G+ RE   N D  
Sbjct: 286  LTGQLEHPGPLL--------ASDIEECLLNLGAEIDGVGDRVRHQKQGMRREGQGNLD-- 335

Query: 1256 GGSVYEAHNHLEREDFPEHFADSLVLRD-----EVGRRRSDSFNQDSRSNRSREPQATGQ 1420
                          D  E    SLVL D         +  +S N+D RS+ +R  +   Q
Sbjct: 336  --------------DLSEEMTGSLVLEDGSQDKNDTNQHHNSRNRDFRSD-TRGQRMLSQ 380

Query: 1421 WRKNYRRPVRCRMDIEKWTAPFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQLY 1600
              +N +R + CR DI      FL+IY+SL+P                  VSKEWP+AQL+
Sbjct: 381  RVRNLKRHMECRRDIGTLNFRFLSIYESLIPEEEEKAKQKQLLTLLEKLVSKEWPKAQLF 440

Query: 1601 LYGSCANSFGVSNSDIDVCLAIDDADIDKSE 1693
            LYGSCANSFGVS SDIDVCLAIDDADI+KSE
Sbjct: 441  LYGSCANSFGVSKSDIDVCLAIDDADINKSE 471


>ref|XP_004308428.1| PREDICTED: uncharacterized protein LOC101313262 [Fragaria vesca
            subsp. vesca]
          Length = 699

 Score =  164 bits (414), Expect = 1e-37
 Identities = 112/314 (35%), Positives = 149/314 (47%), Gaps = 3/314 (0%)
 Frame = +2

Query: 761  SNSGQNREFDRGMSSRSPHQWSRHGRHEPRRSTVAEPRMVPPGFPQRPKGPDQWAPMSRR 940
            SNS  + EF R        +    G  E  R       M PPGF  +P+G   W    RR
Sbjct: 171  SNSSASNEFRRANYGSGEGELRGGGGGE--RGKQVHRTMPPPGFGNKPRGGGNWDSGGRR 228

Query: 941  SDLERNVGKEWRARDKLNHQQTYTSDDKVEILKRQVPKFDGVEENESSRLGLVKQLDQPG 1120
              +E NV +E ++       +  + D+  E ++R   +  G+  N   R GL  QLD+PG
Sbjct: 229  GGMEYNVDRERQSSSGFARNREGSFDN--ERVRRLAGEDGGMRGNGDGRKGLSAQLDRPG 286

Query: 1121 PPMGSDLHSVPASDIEESLASVHGRVEGSGRNGLTREESRNGDRGGGSVYEAHNHLERED 1300
            PP G++LHSV AS+IEES+ +  G            E +R    G   V +     ER+D
Sbjct: 287  PPAGTNLHSVSASEIEESMMNFDGG-----------ERARKDSDGVEDVGQHSLEEERDD 335

Query: 1301 FPE---HFADSLVLRDEVGRRRSDSFNQDSRSNRSREPQATGQWRKNYRRPVRCRMDIEK 1471
              E   H  DS          RSD   Q   S R R          +Y+R   CR DI++
Sbjct: 336  KIEGKQHHKDS----------RSDDRGQHQLSQRMR----------SYKRQTLCRFDIDR 375

Query: 1472 WTAPFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQLYLYGSCANSFGVSNSDID 1651
            + APFL I+DSL+P                  + KEWP A+LY+YGSC NSFGVS SDID
Sbjct: 376  FNAPFLEIFDSLIPTEEDKAKQKQLLTLLENIICKEWPDARLYIYGSCGNSFGVSKSDID 435

Query: 1652 VCLAIDDADIDKSE 1693
            +CL I + DI+KSE
Sbjct: 436  LCLEIGEEDINKSE 449


>ref|XP_006375316.1| hypothetical protein POPTR_0014s06910g, partial [Populus trichocarpa]
            gi|550323667|gb|ERP53113.1| hypothetical protein
            POPTR_0014s06910g, partial [Populus trichocarpa]
          Length = 497

 Score =  161 bits (407), Expect = 1e-36
 Identities = 125/373 (33%), Positives = 174/373 (46%), Gaps = 5/373 (1%)
 Frame = +2

Query: 590  LMFGSLACEIGSPEE-LQNRNLLDKAMQFNVQKVLEVVEVNGRVDGLETRSNSGVEIGSN 766
            L FGS +  I SP + L N NL+            EV   +   +GLE   +   +  S+
Sbjct: 174  LQFGSFSSAIPSPADGLVNANLMR-----------EVGPGSRNFNGLERNRHLEKQANSH 222

Query: 767  SGQNREFDRGMSSRSPHQWSRHGRHEPRRSTVAEPRMVPPGFPQRPKGPD---QWAPMSR 937
            S        G SS       R   H+ +      P   PPGF  +P+G      W    R
Sbjct: 223  STNFEVRQPGASSGG-----RGNLHKEQHQNYKSP---PPGFSNKPRGGGGGGNWDHGGR 274

Query: 938  RSDLERNVGKEWRARDKLNHQQTYTSDDKVEILKRQVPKFDGVEENESSRLGLVKQLDQP 1117
            R +LE  + +E     +LN+++   ++  VE+                      +QLD+P
Sbjct: 275  RRELEHTMYREKGDYSELNNEKARRNEGSVEVR-------------------FTRQLDRP 315

Query: 1118 GPPMGSDLHSVPASDIEESLASVHGRVEGSGRNGLTREESRNGDRGGGSVYEAHNHLERE 1297
            GPP GS+LHSV  S+I+ESL ++ G                     GG +         +
Sbjct: 316  GPPPGSNLHSVLGSEIKESLINLDGE-------------------DGGLL---------D 347

Query: 1298 DFPEHFADSLVLRDEV-GRRRSDSFNQDSRSNRSREPQATGQWRKNYRRPVRCRMDIEKW 1474
            D  E   DSL+L  E  G++     +++SRS+ SR      Q  +  +R ++CR+DI++ 
Sbjct: 348  DLGEELMDSLLLEGESDGKKDKKQSSKESRSD-SRGHNILSQRMRMLKRQMQCRLDIDRL 406

Query: 1475 TAPFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQLYLYGSCANSFGVSNSDIDV 1654
             A FLAIY+SLVP                  VSKEWP A+LYLYGSCANSFGVS SDIDV
Sbjct: 407  NAAFLAIYESLVPPEEETAKQKQFFMLLEKLVSKEWPEARLYLYGSCANSFGVSKSDIDV 466

Query: 1655 CLAIDDADIDKSE 1693
            CL I+DA+I KSE
Sbjct: 467  CLTIEDAEIKKSE 479


>ref|XP_006490961.1| PREDICTED: uncharacterized protein LOC102611932 [Citrus sinensis]
          Length = 699

 Score =  160 bits (404), Expect = 2e-36
 Identities = 153/500 (30%), Positives = 212/500 (42%), Gaps = 21/500 (4%)
 Frame = +2

Query: 257  PHQSHPETQSRPRINGXXXXXXXXXXXXXXXMGPTVHLHNQ----------PWPQTVNPL 406
            PHQ+ P+  S P                   +GPT++   Q           WP+T  PL
Sbjct: 36   PHQTPPQQPSLPN------------DPAVAAVGPTINFQPQWPSNGCDLPPTWPRTPLPL 83

Query: 407  YASTSFLPGFHSSPWAPQGSSLAANQIFFSDEFQQLGINGKPDGWKXXXXXXXXXXXXXN 586
                +FL GF  +PWA   S+    Q    ++F +LG +                    N
Sbjct: 84   ----NFL-GFPQNPWA-SSSTENQQQRLLCEDFGRLGFSNANYAAIHNLIQQPNHQQQQN 137

Query: 587  MLMFGSLACEIGSPEELQNRNLLDKAMQFNVQKVLEVVEVNGRVDGLETRSNSGVEIGSN 766
             L FGS   +   P+ L N N L+  +++N+ +       N + D  + R++S     S 
Sbjct: 138  -LRFGSFQVQ---PDSLLNLNHLEN-LKYNLDR-------NSQFD--QPRASSISNPNSF 183

Query: 767  SGQNREFDRGMSSRSPHQWSRHGRHEPRRSTVAEPRMVPPGFPQRPKGPDQWAPMSRRSD 946
              +N E  R               H+ R          PPGF  + +             
Sbjct: 184  LHRNLENSR--------------EHDLRLGKQHYGSTPPPGFSNKAR------------- 216

Query: 947  LERNVGKEWRARDKLNHQQTYTSDDKVEILKRQVPKFDGVEENESSRLGLVKQLDQPGPP 1126
                VG    +R    H         V+++ R    F        + +GL +QLD+PGPP
Sbjct: 217  ----VGGSGNSRRGFEHN--------VDMINR----FTSSAVEGGNGVGLTRQLDRPGPP 260

Query: 1127 MGSDLHSVPASDIEESLASVHGRVEGSGRN-GLT-REESRNGDRGGGSVYEAHNHLERED 1300
             GS+LHSV A DIEESL  +  R EG  R+ GL  R E+  G   GG         + +D
Sbjct: 261  SGSNLHSVSALDIEESLLDL--RREGRERHLGLDKRRENGPGYSQGGD--------DMDD 310

Query: 1301 FPEHFADSLVLRDEVGRRRSDSFNQDSRSNRSREPQATGQWR---------KNYRRPVRC 1453
            F E   DSL+  DE   +       D +   SR+ +     R         +N +  + C
Sbjct: 311  FGEDLVDSLLPDDESELKNDTHERNDKKHRNSRDKEIRSDNRGKRLLSQRMRNLKWQIEC 370

Query: 1454 RMDIEKWTAPFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQLYLYGSCANSFGV 1633
            R DI +  APFLAIY+SL+P                  V KEWP A+LYLYGSCANSFGV
Sbjct: 371  RADIGRLNAPFLAIYESLIPAEEEKAKQKKLLTLLEKLVCKEWPDARLYLYGSCANSFGV 430

Query: 1634 SNSDIDVCLAIDDADIDKSE 1693
            S SDIDVCLAI+D++I+KSE
Sbjct: 431  SKSDIDVCLAINDSEINKSE 450


>ref|XP_006445207.1| hypothetical protein CICLE_v10023615mg, partial [Citrus clementina]
            gi|557547469|gb|ESR58447.1| hypothetical protein
            CICLE_v10023615mg, partial [Citrus clementina]
          Length = 1046

 Score =  160 bits (404), Expect = 2e-36
 Identities = 153/500 (30%), Positives = 212/500 (42%), Gaps = 21/500 (4%)
 Frame = +2

Query: 257  PHQSHPETQSRPRINGXXXXXXXXXXXXXXXMGPTVHLHNQ----------PWPQTVNPL 406
            PHQ+ P+  S P                   +GPT++   Q           WP+T  PL
Sbjct: 67   PHQTPPQQPSLPN------------DPAVAAVGPTINFQPQWPSNGCDLPPTWPRTPLPL 114

Query: 407  YASTSFLPGFHSSPWAPQGSSLAANQIFFSDEFQQLGINGKPDGWKXXXXXXXXXXXXXN 586
                +FL GF  +PWA   S+    Q    ++F +LG +                    N
Sbjct: 115  ----NFL-GFPQNPWA-SSSTENQQQRLLCEDFGRLGFSNANYAAIHNLIQQPNHQQQQN 168

Query: 587  MLMFGSLACEIGSPEELQNRNLLDKAMQFNVQKVLEVVEVNGRVDGLETRSNSGVEIGSN 766
             L FGS   +   P+ L N N L+  +++N+ +       N + D  + R++S     S 
Sbjct: 169  -LRFGSFQVQ---PDSLLNLNHLEN-LKYNLDR-------NSQFD--QPRASSISNPNSF 214

Query: 767  SGQNREFDRGMSSRSPHQWSRHGRHEPRRSTVAEPRMVPPGFPQRPKGPDQWAPMSRRSD 946
              +N E  R               H+ R          PPGF  + +             
Sbjct: 215  LHRNLENSR--------------EHDLRLGKQHYGSTPPPGFSNKAR------------- 247

Query: 947  LERNVGKEWRARDKLNHQQTYTSDDKVEILKRQVPKFDGVEENESSRLGLVKQLDQPGPP 1126
                VG    +R    H         V+++ R    F        + +GL +QLD+PGPP
Sbjct: 248  ----VGGSGNSRRGFEHN--------VDMINR----FTSSAVEGGNGVGLTRQLDRPGPP 291

Query: 1127 MGSDLHSVPASDIEESLASVHGRVEGSGRN-GLT-REESRNGDRGGGSVYEAHNHLERED 1300
             GS+LHSV A DIEESL  +  R EG  R+ GL  R E+  G   GG         + +D
Sbjct: 292  SGSNLHSVSALDIEESLLDL--RREGRERHLGLDKRRENGPGYSQGGD--------DMDD 341

Query: 1301 FPEHFADSLVLRDEVGRRRSDSFNQDSRSNRSREPQATGQWR---------KNYRRPVRC 1453
            F E   DSL+  DE   +       D +   SR+ +     R         +N +  + C
Sbjct: 342  FGEDLVDSLLPDDESELKNDTHERNDKKHRNSRDKEIRSDNRGKRLLSQRMRNLKWQIEC 401

Query: 1454 RMDIEKWTAPFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQLYLYGSCANSFGV 1633
            R DI +  APFLAIY+SL+P                  V KEWP A+LYLYGSCANSFGV
Sbjct: 402  RADIGRLNAPFLAIYESLIPAEEEKAKQKKLLTLLEKLVCKEWPDARLYLYGSCANSFGV 461

Query: 1634 SNSDIDVCLAIDDADIDKSE 1693
            S SDIDVCLAI+D++I+KSE
Sbjct: 462  SKSDIDVCLAINDSEINKSE 481


>ref|XP_006397741.1| hypothetical protein EUTSA_v10001324mg [Eutrema salsugineum]
            gi|557098814|gb|ESQ39194.1| hypothetical protein
            EUTSA_v10001324mg [Eutrema salsugineum]
          Length = 757

 Score =  157 bits (398), Expect = 1e-35
 Identities = 128/440 (29%), Positives = 190/440 (43%), Gaps = 18/440 (4%)
 Frame = +2

Query: 428  PGFHSSPWAPQGSSLAANQIFFSDEFQQLGINGKPDGWKXXXXXXXXXXXXXNMLMFGSL 607
            PG H+SPWAP  +  + N + FS    Q  +N  P                   L    +
Sbjct: 81   PGTHASPWAPPPNH-SPNLLGFS----QFPLNPFPANQFDGNQRVSAEDAYRLGLTGAGI 135

Query: 608  ACEIGSPEELQNRNLLDKAMQFNVQKVLEVVEVNGRVDG---LETRSNSGVEIGSNSGQN 778
               +   +    + L+  +   +  + L     NG ++G   L++   S      + G N
Sbjct: 136  QSMVQQQQPPPPQKLVFGSFSGDAAQSL-----NGLLNGNLKLDSNIGSANHHPRSVGPN 190

Query: 779  REFDRGMSSRSPHQWSRHGRHEP-----RRSTVAEPRMVPPGFPQRPKGPDQWAPMSRRS 943
               D  +S       SR G   P     R S    P   PPGF    +G D         
Sbjct: 191  PNSDPNLSHDFHEHNSRRGNWGPIGSNGRGSKSTLPPPPPPGFSSNQRGWDMDLGSKGMG 250

Query: 944  DLERNVGKEWRARDKLNHQQTYTSDDKVEILKRQVPKFDGVEENESSRLGLVKQLDQPGP 1123
              + N        DK   + +   D K      +V +   +      R  L +Q+DQPGP
Sbjct: 251  SFQGN------NHDKEKGEHSNLWDHKSVDFIAEVDRLRRLSIQNEGRFDLSQQIDQPGP 304

Query: 1124 PMGSDLHSVPASDIEESLASVHGRVEGSG--RNGLTREESRNGDRGGGSVYEAHNHLERE 1297
            PMG++L+SV A+D E+S++ ++    G G  R     + S+    G G      + +E  
Sbjct: 305  PMGTNLYSVSAADAEDSISMLNKEARGGGVGRKEELGQFSKGKREGNGECGPGDDDIE-- 362

Query: 1298 DFPEHFADSLVLRDEVGRRRSDSFNQDSRSNRSREPQATGQWRKNYRRPVR--------C 1453
             F E   +SL+L DE   + +     +SR++R +E +   + ++  R+  R        C
Sbjct: 363  GFGEDIVESLLLEDETDDKNAKDGKNNSRTSREKESRMDTRGQRLLRQSSRIHRWRYMAC 422

Query: 1454 RMDIEKWTAPFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQLYLYGSCANSFGV 1633
            R DI  + APF+A+Y+SL+P                  V KEWP A+LYLYGSCANSFG 
Sbjct: 423  RYDIHMYDAPFIAVYESLIPAEEELEKQKQLMARLEHLVGKEWPHAKLYLYGSCANSFGF 482

Query: 1634 SNSDIDVCLAIDDADIDKSE 1693
              SDIDVCLAI+D DI+KSE
Sbjct: 483  PKSDIDVCLAIEDDDINKSE 502


>ref|NP_566048.1| Nucleotidyltransferase family protein [Arabidopsis thaliana]
            gi|13430538|gb|AAK25891.1|AF360181_1 unknown protein
            [Arabidopsis thaliana] gi|14532746|gb|AAK64074.1| unknown
            protein [Arabidopsis thaliana] gi|20197056|gb|AAC06161.2|
            expressed protein [Arabidopsis thaliana]
            gi|330255483|gb|AEC10577.1| Nucleotidyltransferase family
            protein [Arabidopsis thaliana]
          Length = 764

 Score =  156 bits (395), Expect = 2e-35
 Identities = 106/333 (31%), Positives = 152/333 (45%), Gaps = 12/333 (3%)
 Frame = +2

Query: 731  TRSNSGVEIGSNSGQNREFDRGMSSRSPH-QWSRHGRHEPRRSTVAEPRMVPPGFPQRPK 907
            T SNS ++   +  +N +        S    W   G +   R   + P   PPGF    +
Sbjct: 193  TLSNSNMDPNLSHHRNHDLHEQRGGHSGRGNWGHIGNNG--RGLKSTPPPPPPGFSSNQR 250

Query: 908  GPDQWAPMSRRSDLERNVGKEWRARDKLNHQQTYTSDDKV----EILKRQVPKFDGVEEN 1075
            G   W       D +R +G+        NH Q      KV         +  +  G+   
Sbjct: 251  G---WDMSLGSKDDDRGMGR--------NHDQAMGEHSKVWNQSVDFSAEANRLRGLSIQ 299

Query: 1076 ESSRLGLVKQLDQPGPPMGSDLHSVPASDIEESLASVHGRVEGSGRNGLTREESRNGDRG 1255
              S+  L +Q+D PGPP G+ LHSV A+D  +S + ++      G       +     R 
Sbjct: 300  NESKFNLSQQIDHPGPPKGASLHSVSAADAADSFSMLNKEARRGGERREELGQLSKAKRE 359

Query: 1256 GGSVYEAHNHLEREDFPEHFADSLVLRDEVGRRRSDSFNQDSRSNRSREPQAT------- 1414
            G +     N  E EDF E    SL+L DE G + ++   +DS+++R +E +         
Sbjct: 360  GNA-----NSDEIEDFGEDIVKSLLLEDETGEKDANDGKKDSKTSREKESRVDNRGQRLL 414

Query: 1415 GQWRKNYRRPVRCRMDIEKWTAPFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQ 1594
            GQ  +  +  + CR DI ++ A F+AIY SL+P                  V+KEWP A+
Sbjct: 415  GQKARMVKMYMACRNDIHRYDATFIAIYKSLIPAEEELEKQRQLMAHLENLVAKEWPHAK 474

Query: 1595 LYLYGSCANSFGVSNSDIDVCLAIDDADIDKSE 1693
            LYLYGSCANSFG   SDIDVCLAI+  DI+KSE
Sbjct: 475  LYLYGSCANSFGFPKSDIDVCLAIEGDDINKSE 507


>ref|XP_007220905.1| hypothetical protein PRUPE_ppa002004mg [Prunus persica]
            gi|462417367|gb|EMJ22104.1| hypothetical protein
            PRUPE_ppa002004mg [Prunus persica]
          Length = 730

 Score =  155 bits (392), Expect = 5e-35
 Identities = 114/377 (30%), Positives = 170/377 (45%), Gaps = 20/377 (5%)
 Frame = +2

Query: 623  SPEELQNRNLLDKAMQFNVQKVLEVVEVNG---RVDGLETRSNSGVEIGS-NSGQNREFD 790
            S   LQ++NL     Q   Q+ L+   +     R       +N+  E+ + ++G +R  +
Sbjct: 145  SNNALQSQNLAQLKQQHQEQQKLKFSYLPSDIIRNPEPPVTANTSSEVSNLSNGFDRSLN 204

Query: 791  RGMSSRSPHQWSRHGRHEPRRSTVAEPR----------------MVPPGFPQRPKGPDQW 922
               ++ S     RHG  +   S   E R                  PPGF    +G   W
Sbjct: 205  LNPNNSSSSNEFRHGNPDTFNSREQERRGGGGGGAGRGKQFQRNTPPPGFGNNSRGGGNW 264

Query: 923  APMSRRSDLERNVGKEWRARDKLNHQQTYTSDDKVEILKRQVPKFDGVEENESSRLGLVK 1102
               SRR D E NV +E ++  +    +  + +D  E ++R   +   +  N +  LG   
Sbjct: 265  DSGSRRRDFEHNVDRERQSSSEFVRNRDASFED--ERVRRLASEDSRIRGNGARGLGFSA 322

Query: 1103 QLDQPGPPMGSDLHSVPASDIEESLASVHGRVEGSGRNGLTREESRNGDRGGGSVYEAHN 1282
            QLD PGPP G++LHS  AS+IE+S+ ++              ++ +N +       + HN
Sbjct: 323  QLDDPGPPTGANLHSASASEIEKSMMNLQ-----------HEKDDKNEEDDKNEAKQHHN 371

Query: 1283 HLEREDFPEHFADSLVLRDEVGRRRSDSFNQDSRSNRSREPQATGQWRKNYRRPVRCRMD 1462
              E++                   RSD+  Q   S R R           ++  ++CR D
Sbjct: 372  SREKDS------------------RSDNRGQHLLSQRMR----------IFKSQMQCRFD 403

Query: 1463 IEKWTAPFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQLYLYGSCANSFGVSNS 1642
            I++  APFLAIYDSL+P                  ++KEWP AQLY+YGSC NSFGVS S
Sbjct: 404  IDRLNAPFLAIYDSLIPTEEEKAKQNQLFTLLETLITKEWPEAQLYVYGSCGNSFGVSKS 463

Query: 1643 DIDVCLAIDDADIDKSE 1693
            DID+CLAID AD +KSE
Sbjct: 464  DIDLCLAIDVADDNKSE 480


>ref|XP_003529982.1| PREDICTED: uncharacterized protein LOC100812787 [Glycine max]
          Length = 732

 Score =  155 bits (391), Expect = 7e-35
 Identities = 128/372 (34%), Positives = 170/372 (45%), Gaps = 21/372 (5%)
 Frame = +2

Query: 641  NRNLLDKAMQFNVQKVLEVVEVNGRVDGLETRSNSGVEIGSNSGQ---NREFDR--GMSS 805
            N N +D  +  + Q+  +  E+  +   L T + S  E+ SN G    N +F+R    +S
Sbjct: 147  NNNKVDGFVHHHHQQQQQQHELKLQFGSLPTVAYSAAEVSSNGGDSLLNLKFNRVDHPTS 206

Query: 806  RSPHQWSRHGRHEP----RR---------STVAEPRMVPPGFPQRPKGPDQWAPMSRRSD 946
             S       G H+     RR         S   E   VPPGF  R +G            
Sbjct: 207  NSSGNVVVQGNHDAVERERRGLGGYRAGGSLPPETSRVPPGFGNRTRGK----------- 255

Query: 947  LERNVGKEWRARDKLNHQQTYTSDDKVEILKRQVPKFDGVEENESSRLGLVKQLDQPGPP 1126
                 G E R      ++  Y   D+ E  +    +   V  N   ++GLV QLD+PGPP
Sbjct: 256  -----GLEGR------NENLY---DRREGGRMVSGERSNVRGNVGHKMGLVDQLDRPGPP 301

Query: 1127 MGSDLHSVPASDIEESLASVHGRVEGSGRNGLTREESRNGDRGGGSVYEAHNHLEREDFP 1306
             GS LHS   +D    +  V GR       G  R E      GGG+  +           
Sbjct: 302  AGSHLHSGSGNDA--GIGEVGGRDGKHKEIGRLRMEGVPESGGGGADVDV--------LG 351

Query: 1307 EHFADSLVLRDEVGRR---RSDSFNQDSRSNRSREPQATGQWRKNYRRPVRCRMDIEKWT 1477
            E  ADSL+++DE   R   R     +D R + SR  Q   Q  + YRR + CR DI+ + 
Sbjct: 352  EQLADSLLVKDESDDRTNLRQRRREKDVRLSDSRGQQIMSQRGRMYRRQMMCRRDIDVFN 411

Query: 1478 APFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQLYLYGSCANSFGVSNSDIDVC 1657
             PFLAIY SL+P                  VSKEWP A+LYLYGSCANSFGVS SDIDVC
Sbjct: 412  VPFLAIYGSLIPPEEEKLKQKKLVALLEKLVSKEWPTAKLYLYGSCANSFGVSKSDIDVC 471

Query: 1658 LAIDDADIDKSE 1693
            LAI++AD++KS+
Sbjct: 472  LAIEEADMEKSK 483


>ref|XP_006339776.1| PREDICTED: uncharacterized protein LOC102603223 [Solanum tuberosum]
          Length = 775

 Score =  154 bits (390), Expect = 9e-35
 Identities = 125/374 (33%), Positives = 170/374 (45%), Gaps = 11/374 (2%)
 Frame = +2

Query: 605  LACEIGSPEELQNRNLLDKAMQFN-VQKVLEVVEVNGR-----VDGLETRSNSGVEIGSN 766
            LAC++G+ E+    + L      N V+   E V  +GR     + GLE ++  G    S 
Sbjct: 189  LACKVGNFEQKNQESRLTNVRMLNGVEGKRENVIGSGRKQLGNLRGLEQQNRGGGGGESE 248

Query: 767  SGQNREFDRGMSSRSPHQWSRHGRHEPRRSTVAEPRMVPPGFPQRPKGPDQWAPMSRRSD 946
            SG       G+           GR     S      + PPGF  +P          R  D
Sbjct: 249  SG-------GL-----------GRGRQFHSGTVRGAVPPPGFSSKP----------RSRD 280

Query: 947  LERNVGKEWRARDKLNHQQTYTSDDKVEILKRQVPKFDGVEENESSRLGLVKQLDQPGPP 1126
             E NV  E     +LNH+    +  K E   + + +        S    + +QLD P PP
Sbjct: 281  FEHNVDNEKNNFVELNHRGIGLNH-KYERESKHLTRNGKNYAIGSDDQRVFRQLDSPVPP 339

Query: 1127 MGSDLHSVPASDIEESLASVHGRVEGSGRNGLTREESRNGDRGGGSVYEAHNHLEREDFP 1306
             GS LHSV  SD+E+S   +HG    SG      EE+ +G R       A    + ++  
Sbjct: 340  AGSKLHSVLGSDVEDSTLELHGEDAESG------EETVSGMRNVLGRSSAQGQSDLDELG 393

Query: 1307 EHFADSLVLRDEVGRRRSD-----SFNQDSRSNRSREPQATGQWRKNYRRPVRCRMDIEK 1471
            EH   SL L DE   R        S ++D RS++ R     GQ  +  +R + CR DI +
Sbjct: 394  EHVISSLGLEDEPDERSDKKKHHASRDKDYRSDK-RGAYILGQRMRMLKRQIACRSDINR 452

Query: 1472 WTAPFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQLYLYGSCANSFGVSNSDID 1651
                FLA ++SL+P                  VSKEWP A+LY+YGSCANSFG S SDID
Sbjct: 453  MNGAFLATFESLIPPEEERTKQKQLLALLDEIVSKEWPDARLYVYGSCANSFGFSKSDID 512

Query: 1652 VCLAIDDADIDKSE 1693
            +CLAI+DA+IDKSE
Sbjct: 513  ICLAIEDANIDKSE 526


>ref|XP_002880188.1| hypothetical protein ARALYDRAFT_483698 [Arabidopsis lyrata subsp.
            lyrata] gi|297326027|gb|EFH56447.1| hypothetical protein
            ARALYDRAFT_483698 [Arabidopsis lyrata subsp. lyrata]
          Length = 757

 Score =  152 bits (384), Expect = 4e-34
 Identities = 110/358 (30%), Positives = 164/358 (45%), Gaps = 28/358 (7%)
 Frame = +2

Query: 704  VNGRVDGLETRSNSGVEIGS---NSGQNREFDRGMSSRSPHQWSRHGRHEPRRSTVAEPR 874
            V G   G  T+S +G+  G+   +S Q+ +  R   S   +       HEPR S      
Sbjct: 152  VFGSFSGDATQSLNGLHNGNLKYDSNQHEQLMRHPQSVLSNSNMDPNLHEPRGSHSGRGN 211

Query: 875  M--------------VPPGFPQRPKGPDQWAPMSRRSDLERNVGKEWRARDKLNHQQTYT 1012
                            PPGF    +G D    ++ + D +R +G   R  D+   + +  
Sbjct: 212  WGHIGNNGRGFKSTPPPPGFSSNQRGRDM--NLTSKDD-DRGMGSFHRNHDQAMGEHSKF 268

Query: 1013 SDDKVEILKRQVPKFDGVEENESSRLGLVKQLDQPGPPMGSDLHSVPASDIEESLASVHG 1192
             D  V     +  +  G+     S+  L +Q+D PG P G+ LHSV A+D  +S + ++ 
Sbjct: 269  WDQSVNF-SAEADRLRGLSIQNDSKFNLSQQIDHPGLPKGTSLHSVSAADAADSFSMLNK 327

Query: 1193 RVEGSGRN----GLTREESRNGDRGGGSVYEAHNHLEREDFPEHFADSLVLRDEVGRRRS 1360
               G        G   +  R G+   G V +     E EDF E    SL+L DE G + +
Sbjct: 328  EARGGSERKEELGRLSKGKREGNANSGPVDD-----EIEDFGEDIVKSLLLEDETGEKDA 382

Query: 1361 DSFNQDSRSNRSREPQAT-------GQWRKNYRRPVRCRMDIEKWTAPFLAIYDSLVPXX 1519
                +DS+++R ++ +         GQ  +  +  + CR DI ++ A F+A+Y SL+P  
Sbjct: 383  KDGKKDSKTSREKDSRMDNRGQRLLGQKARMVKMYMACRNDIHRYDASFIAVYKSLIPAE 442

Query: 1520 XXXXXXXXXXXXXXXXVSKEWPRAQLYLYGSCANSFGVSNSDIDVCLAIDDADIDKSE 1693
                            V+KEWP A+LYLYGSCANSFG   SDIDVCLAI+  DI+KSE
Sbjct: 443  EELEKQRQLMAHLENLVAKEWPHAKLYLYGSCANSFGFPKSDIDVCLAIEGDDINKSE 500

