BLASTX nr result
ID: Sinomenium22_contig00004332
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00004332 (1693 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

dbj|BAJ53142.1| JHL05D22.13 [Jatropha curcas]                         208   7e-51
ref|XP_002511755.1| poly(A) polymerase cid, putative [Ricinus co...   200   2e-48
ref|XP_007051995.1| Nucleotidyltransferase family protein isofor...   188   7e-45
ref|XP_007051994.1| Nucleotidyltransferase family protein isofor...   188   7e-45
ref|XP_007051993.1| Nucleotidyltransferase family protein isofor...   188   7e-45
ref|XP_007051992.1| Nucleotidyltransferase family protein isofor...   188   7e-45
ref|XP_007051991.1| Nucleotidyltransferase family protein isofor...   188   7e-45
ref|XP_002301312.2| hypothetical protein POPTR_0002s15230g [Popu...   184   1e-43
ref|XP_002272342.2| PREDICTED: uncharacterized protein LOC100267...   164   9e-38
emb|CAN77386.1| hypothetical protein VITISV_006352 [Vitis vinifera]   164   9e-38
ref|XP_004308428.1| PREDICTED: uncharacterized protein LOC101313...   164   1e-37
ref|XP_006375316.1| hypothetical protein POPTR_0014s06910g, part...   161   1e-36
ref|XP_006490961.1| PREDICTED: uncharacterized protein LOC102611...   160   2e-36
ref|XP_006445207.1| hypothetical protein CICLE_v10023615mg, part...   160   2e-36
ref|XP_006397741.1| hypothetical protein EUTSA_v10001324mg [Eutr...   157   1e-35
ref|NP_566048.1| Nucleotidyltransferase family protein [Arabidop...   156   2e-35
ref|XP_007220905.1| hypothetical protein PRUPE_ppa002004mg [Prun...   155   5e-35
ref|XP_003529982.1| PREDICTED: uncharacterized protein LOC100812...   155   7e-35
ref|XP_006339776.1| PREDICTED: uncharacterized protein LOC102603...
154 9e-35 ref|XP_002880188.1| hypothetical protein ARALYDRAFT_483698 [Arab... 152 4e-34 >dbj|BAJ53142.1| JHL05D22.13 [Jatropha curcas] Length = 748 Score = 208 bits (529), Expect = 7e-51 Identities = 158/453 (34%), Positives = 210/453 (46%), Gaps = 15/453 (3%) Frame = +2 Query: 380 PWPQTVNPLYASTSFLPGFHSSPWAPQGSSLAANQI------FFSDEFQQLGINGKP-DG 538 PWP ++ FL GF + W + LAA Q D+ Q LG +G Sbjct: 97 PWPHNLSAAPLLPGFL-GFPQNHWPSPANHLAAGQFQGNQQGVLGDDLQILGFSGADVRA 155 Query: 539 WKXXXXXXXXXXXXXNMLMFGSLACEIGSPEELQNRNLLDKAMQFNVQKVLEVVEVNGRV 718 L FGS +I + E L N N + N K LEV + Sbjct: 156 NNTIHNRVQQKQQLEQKLQFGSFRSDIQNVEALLNVN-----SKLNAAKELEVRLATRNL 210 Query: 719 DGLETRSNSGVEIGSNSGQNREFDRGMSSRSPHQWSRH---GRHEPRRSTVAEPRMVPPG 889 +GLE+ Q R FD RS W + G + P+ E RM PPG Sbjct: 211 NGLESDQKF-------DSQLRTFDLREQDRSGGGWRKQPHGGNYRPQ-----ETRMPPPG 258 Query: 890 FPQRPKGPDQWAPMSRRSDLERNVGKEWRARDKL-NHQQTYTSDDKVEILKRQVPKFDGV 1066 F +P+G W +SRR +L+ NV KE + +L N ++S+DK+ P+ DG Sbjct: 259 FSNKPRGGGNWDYVSRRRELDYNVNKEKGNQGELSNRNALFSSEDKI-------PR-DG- 309 Query: 1067 EENESSRLGLVKQLDQPGPPMGSDLHSVPASDIEESLASVHGRVEGSGRNGLTREESRNG 1246 + S LGL QLD+PGPP GS+L+SV A+D+E S+ +V V G+ +E R Sbjct: 310 --DRSRDLGLTGQLDRPGPPAGSNLYSVSAADVELSMLNVEAEVVEDGK-----DEGREL 362 Query: 1247 DRGGGSVYEAHNHLEREDFPEHFADSLVLRDEVGRRRSDSFNQDSRSNRSREP----QAT 1414 D G E DSL+L E + N+ SR SR + Sbjct: 363 DEAG----------------EELVDSLLLEGESDGKNDKKQNRHSREKESRSDNRGQRTL 406 Query: 1415 GQWRKNYRRPVRCRMDIEKWTAPFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQ 1594 Q + +R + CR DI++ APFLAIY+SLVP V+KEWP+A+ Sbjct: 407 SQRMRMLKRQMECRRDIDRLNAPFLAIYESLVPPEEEKAKQKQLLSLLEKLVNKEWPQAR 466 Query: 1595 LYLYGSCANSFGVSNSDIDVCLAIDDADIDKSE 1693 LYLYGSCANSFGV SDIDVCLAI +ADI+KSE Sbjct: 467 LYLYGSCANSFGVLKSDIDVCLAIQNADINKSE 499 >ref|XP_002511755.1| poly(A) polymerase cid, putative [Ricinus communis] gi|223548935|gb|EEF50424.1| poly(A) polymerase cid, putative [Ricinus communis] Length = 696 Score = 200 bits (508), Expect = 2e-48 Identities = 149/446 (33%), Positives = 204/446 (45%), 
Gaps = 9/446 (2%) Frame = +2 Query: 383 WPQTVNPLYASTSFLPGFHSSPWAPQGSSLAANQI--FFSDEFQQLGINGKPDGWKXXXX 556 WP ++P L + PW QGS + F D+ Q+LG++ G Sbjct: 93 WPYNLSPPNLVPGLLGFPQNHPW--QGSQFQGSDQRGFLGDDLQRLGLSS---GNTRIRN 147 Query: 557 XXXXXXXXXNMLMFGSLACEIGSPEELQNRNLLDKAMQFNVQKVLEVVEVNGRVDGLETR 736 L FGS +I PE L N N + N K L V ++G+E Sbjct: 148 LVQQKQQLEQKLQFGSFRSDIQPPEGLLNLN-----SKLNAAKELGVDLGIRNLNGMERN 202 Query: 737 SNSGVEIGSN---SGQNREFDRGMSSRSPHQWSRHGRHEPRRSTVAEPRMVPPGFPQRPK 907 + ++ SN S + RG + PH + + E RM PPGF +P+ Sbjct: 203 LHFEPQLMSNLRTSDLREQDQRGGWGKQPHGSNYRSQ---------ETRMPPPGFSNKPR 253 Query: 908 GPDQWAPMSRRSDLERNVGKEWRARDKLNHQQTYTSDDKVEILKRQVPKFDGVEENESSR 1087 G +SRR +L+ NV KE +L+ + + S + + DG N S Sbjct: 254 GGGNMDHVSRRRELDHNVNKEKGNHSELSKRNAFLSSESKSLR-------DG---NGSRD 303 Query: 1088 LGLVKQLDQPGPPMGSDLHSVPASDIEESLASVHGRVEGSGRNGLTREESRNGDRGGGSV 1267 LGL +QLD PGPP GS+LHSV A DIEESL + + + G+N Sbjct: 304 LGLTRQLDHPGPPAGSNLHSVSALDIEESLLNFNAEMVEDGKN----------------- 346 Query: 1268 YEAHNHLEREDFPEHFADSLVLRDEVGRRRSDSFNQDSRSNRSREP----QATGQWRKNY 1435 + H + +D E AD+L+L E + + N+ SR SR Q Q + Sbjct: 347 -DGH---DLDDVGEELADTLLLEGESEGKNDNKQNRHSRDKESRSDNRGQQILSQRMRML 402 Query: 1436 RRPVRCRMDIEKWTAPFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQLYLYGSC 1615 +R + CR DI++ FLAIY+SL+P V+KEWP A+LYLYGSC Sbjct: 403 KRQMECRRDIDRLNVSFLAIYESLIPPEEEKSKQKQLLTLLEKLVNKEWPEARLYLYGSC 462 Query: 1616 ANSFGVSNSDIDVCLAIDDADIDKSE 1693 ANSFGV SDIDVCLAI DADI+KSE Sbjct: 463 ANSFGVRKSDIDVCLAIQDADINKSE 488 >ref|XP_007051995.1| Nucleotidyltransferase family protein isoform 5 [Theobroma cacao] gi|508704256|gb|EOX96152.1| Nucleotidyltransferase family protein isoform 5 [Theobroma cacao] Length = 635 Score = 188 bits (477), Expect = 7e-45 Identities = 145/444 (32%), Positives = 206/444 (46%), Gaps = 7/444 (1%) Frame = +2 Query: 383 WPQTVNPLYASTSFLPGFHSSPWAPQGSSLAANQIFFSDEFQQLGINGKPDGWKXXXXXX 562 WPQT++P A +FL GF SPW+ G+ A NQ D+ ++LG++G + Sbjct: 92 WPQTLSPPLAP-NFL-GFPLSPWSSPGNQFAGNQGALMDDLRRLGLSGIDNNKNHVIQNR 149 Query: 563 
XXXXXXXNMLMFGSLACEIGS---PEELQNRNLLDKAMQFNVQKVLEVVEVNGRVDGLET 733 L+FGS +I + PE N NLL+ + ++N L++ Sbjct: 150 VQQKHQDQKLVFGSFPSDIQTLKTPEGSPNGNLLENS------------KLNLSNQQLDS 197 Query: 734 RSNSGVEIGSNSGQNREF-DRGMSSRSPHQWSRHGRHEPRRSTVAEPRMVPPGFPQRPKG 910 R NS Q+R DRG Q G + P S E R PPGF +P+G Sbjct: 198 RLNSNPNTSPYVFQHRNSGDRGK------QQQHGGSYRPTPSP--EARRSPPGFLGKPRG 249 Query: 911 PDQWAPM-SRRSDLERNVGKEWRARDKLNHQQTYTSDDKVEILKRQVPKFDGVEENESSR 1087 +RR E NV DK + + S D + Sbjct: 250 GGGNRDFGNRRRHFEHNV-------DKAKAEYSQPSSD--------------------NE 282 Query: 1088 LGLVKQLDQPGPPMGSDLHSVPASDIEESLASVHGRVEGSGRNGLTREESRNGDRGGGSV 1267 +GL QLD+PGPP GS+L SV A+DIEESL +H GR+ +R + + GG Sbjct: 283 VGLSGQLDRPGPPAGSNLQSVSATDIEESLLELHS---DGGRDRFSRRDKFRREDGG--- 336 Query: 1268 YEAHNHLEREDFPEHFADSLVLRDEVGRRRSDSFNQDSRSNR--SREPQATGQWRKNYRR 1441 E ++ E +SL++ DE + ++ + +R +R + Q + +R Sbjct: 337 -------EVDEVGEQLLESLLIEDESDDKNDKKQHRREKESRIDNRGQRLLSQRMRMLKR 389 Query: 1442 PVRCRMDIEKWTAPFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQLYLYGSCAN 1621 + CR DI + APFLA+Y+SL+P V KEWP A+LYLYGSCAN Sbjct: 390 QMECRSDIHRLNAPFLALYESLIPPEEERAKQKQLLALLEKLVCKEWPEARLYLYGSCAN 449 Query: 1622 SFGVSNSDIDVCLAIDDADIDKSE 1693 SFGVS SDIDVCLA ++ D++KSE Sbjct: 450 SFGVSKSDIDVCLAFNEMDVNKSE 473 >ref|XP_007051994.1| Nucleotidyltransferase family protein isoform 4, partial [Theobroma cacao] gi|508704255|gb|EOX96151.1| Nucleotidyltransferase family protein isoform 4, partial [Theobroma cacao] Length = 585 Score = 188 bits (477), Expect = 7e-45 Identities = 145/444 (32%), Positives = 206/444 (46%), Gaps = 7/444 (1%) Frame = +2 Query: 383 WPQTVNPLYASTSFLPGFHSSPWAPQGSSLAANQIFFSDEFQQLGINGKPDGWKXXXXXX 562 WPQT++P A +FL GF SPW+ G+ A NQ D+ ++LG++G + Sbjct: 92 WPQTLSPPLAP-NFL-GFPLSPWSSPGNQFAGNQGALMDDLRRLGLSGIDNNKNHVIQNR 149 Query: 563 XXXXXXXNMLMFGSLACEIGS---PEELQNRNLLDKAMQFNVQKVLEVVEVNGRVDGLET 733 L+FGS +I + PE N NLL+ + ++N L++ Sbjct: 150 VQQKHQDQKLVFGSFPSDIQTLKTPEGSPNGNLLENS------------KLNLSNQQLDS 197 Query: 734 
RSNSGVEIGSNSGQNREF-DRGMSSRSPHQWSRHGRHEPRRSTVAEPRMVPPGFPQRPKG 910 R NS Q+R DRG Q G + P S E R PPGF +P+G Sbjct: 198 RLNSNPNTSPYVFQHRNSGDRGK------QQQHGGSYRPTPSP--EARRSPPGFLGKPRG 249 Query: 911 PDQWAPM-SRRSDLERNVGKEWRARDKLNHQQTYTSDDKVEILKRQVPKFDGVEENESSR 1087 +RR E NV DK + + S D + Sbjct: 250 GGGNRDFGNRRRHFEHNV-------DKAKAEYSQPSSD--------------------NE 282 Query: 1088 LGLVKQLDQPGPPMGSDLHSVPASDIEESLASVHGRVEGSGRNGLTREESRNGDRGGGSV 1267 +GL QLD+PGPP GS+L SV A+DIEESL +H GR+ +R + + GG Sbjct: 283 VGLSGQLDRPGPPAGSNLQSVSATDIEESLLELHS---DGGRDRFSRRDKFRREDGG--- 336 Query: 1268 YEAHNHLEREDFPEHFADSLVLRDEVGRRRSDSFNQDSRSNR--SREPQATGQWRKNYRR 1441 E ++ E +SL++ DE + ++ + +R +R + Q + +R Sbjct: 337 -------EVDEVGEQLLESLLIEDESDDKNDKKQHRREKESRIDNRGQRLLSQRMRMLKR 389 Query: 1442 PVRCRMDIEKWTAPFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQLYLYGSCAN 1621 + CR DI + APFLA+Y+SL+P V KEWP A+LYLYGSCAN Sbjct: 390 QMECRSDIHRLNAPFLALYESLIPPEEERAKQKQLLALLEKLVCKEWPEARLYLYGSCAN 449 Query: 1622 SFGVSNSDIDVCLAIDDADIDKSE 1693 SFGVS SDIDVCLA ++ D++KSE Sbjct: 450 SFGVSKSDIDVCLAFNEMDVNKSE 473 >ref|XP_007051993.1| Nucleotidyltransferase family protein isoform 3, partial [Theobroma cacao] gi|508704254|gb|EOX96150.1| Nucleotidyltransferase family protein isoform 3, partial [Theobroma cacao] Length = 584 Score = 188 bits (477), Expect = 7e-45 Identities = 145/444 (32%), Positives = 206/444 (46%), Gaps = 7/444 (1%) Frame = +2 Query: 383 WPQTVNPLYASTSFLPGFHSSPWAPQGSSLAANQIFFSDEFQQLGINGKPDGWKXXXXXX 562 WPQT++P A +FL GF SPW+ G+ A NQ D+ ++LG++G + Sbjct: 92 WPQTLSPPLAP-NFL-GFPLSPWSSPGNQFAGNQGALMDDLRRLGLSGIDNNKNHVIQNR 149 Query: 563 XXXXXXXNMLMFGSLACEIGS---PEELQNRNLLDKAMQFNVQKVLEVVEVNGRVDGLET 733 L+FGS +I + PE N NLL+ + ++N L++ Sbjct: 150 VQQKHQDQKLVFGSFPSDIQTLKTPEGSPNGNLLENS------------KLNLSNQQLDS 197 Query: 734 RSNSGVEIGSNSGQNREF-DRGMSSRSPHQWSRHGRHEPRRSTVAEPRMVPPGFPQRPKG 910 R NS Q+R DRG Q G + P S E R PPGF +P+G Sbjct: 198 RLNSNPNTSPYVFQHRNSGDRGK------QQQHGGSYRPTPSP--EARRSPPGFLGKPRG 249 Query: 911 
PDQWAPM-SRRSDLERNVGKEWRARDKLNHQQTYTSDDKVEILKRQVPKFDGVEENESSR 1087 +RR E NV DK + + S D + Sbjct: 250 GGGNRDFGNRRRHFEHNV-------DKAKAEYSQPSSD--------------------NE 282 Query: 1088 LGLVKQLDQPGPPMGSDLHSVPASDIEESLASVHGRVEGSGRNGLTREESRNGDRGGGSV 1267 +GL QLD+PGPP GS+L SV A+DIEESL +H GR+ +R + + GG Sbjct: 283 VGLSGQLDRPGPPAGSNLQSVSATDIEESLLELHS---DGGRDRFSRRDKFRREDGG--- 336 Query: 1268 YEAHNHLEREDFPEHFADSLVLRDEVGRRRSDSFNQDSRSNR--SREPQATGQWRKNYRR 1441 E ++ E +SL++ DE + ++ + +R +R + Q + +R Sbjct: 337 -------EVDEVGEQLLESLLIEDESDDKNDKKQHRREKESRIDNRGQRLLSQRMRMLKR 389 Query: 1442 PVRCRMDIEKWTAPFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQLYLYGSCAN 1621 + CR DI + APFLA+Y+SL+P V KEWP A+LYLYGSCAN Sbjct: 390 QMECRSDIHRLNAPFLALYESLIPPEEERAKQKQLLALLEKLVCKEWPEARLYLYGSCAN 449 Query: 1622 SFGVSNSDIDVCLAIDDADIDKSE 1693 SFGVS SDIDVCLA ++ D++KSE Sbjct: 450 SFGVSKSDIDVCLAFNEMDVNKSE 473 >ref|XP_007051992.1| Nucleotidyltransferase family protein isoform 2 [Theobroma cacao] gi|508704253|gb|EOX96149.1| Nucleotidyltransferase family protein isoform 2 [Theobroma cacao] Length = 621 Score = 188 bits (477), Expect = 7e-45 Identities = 145/444 (32%), Positives = 206/444 (46%), Gaps = 7/444 (1%) Frame = +2 Query: 383 WPQTVNPLYASTSFLPGFHSSPWAPQGSSLAANQIFFSDEFQQLGINGKPDGWKXXXXXX 562 WPQT++P A +FL GF SPW+ G+ A NQ D+ ++LG++G + Sbjct: 92 WPQTLSPPLAP-NFL-GFPLSPWSSPGNQFAGNQGALMDDLRRLGLSGIDNNKNHVIQNR 149 Query: 563 XXXXXXXNMLMFGSLACEIGS---PEELQNRNLLDKAMQFNVQKVLEVVEVNGRVDGLET 733 L+FGS +I + PE N NLL+ + ++N L++ Sbjct: 150 VQQKHQDQKLVFGSFPSDIQTLKTPEGSPNGNLLENS------------KLNLSNQQLDS 197 Query: 734 RSNSGVEIGSNSGQNREF-DRGMSSRSPHQWSRHGRHEPRRSTVAEPRMVPPGFPQRPKG 910 R NS Q+R DRG Q G + P S E R PPGF +P+G Sbjct: 198 RLNSNPNTSPYVFQHRNSGDRGK------QQQHGGSYRPTPSP--EARRSPPGFLGKPRG 249 Query: 911 PDQWAPM-SRRSDLERNVGKEWRARDKLNHQQTYTSDDKVEILKRQVPKFDGVEENESSR 1087 +RR E NV DK + + S D + Sbjct: 250 GGGNRDFGNRRRHFEHNV-------DKAKAEYSQPSSD--------------------NE 282 Query: 1088 LGLVKQLDQPGPPMGSDLHSVPASDIEESLASVHGRVEGSGRNGLTREESRNGDRGGGSV 1267 +GL 
QLD+PGPP GS+L SV A+DIEESL +H GR+ +R + + GG Sbjct: 283 VGLSGQLDRPGPPAGSNLQSVSATDIEESLLELHS---DGGRDRFSRRDKFRREDGG--- 336 Query: 1268 YEAHNHLEREDFPEHFADSLVLRDEVGRRRSDSFNQDSRSNR--SREPQATGQWRKNYRR 1441 E ++ E +SL++ DE + ++ + +R +R + Q + +R Sbjct: 337 -------EVDEVGEQLLESLLIEDESDDKNDKKQHRREKESRIDNRGQRLLSQRMRMLKR 389 Query: 1442 PVRCRMDIEKWTAPFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQLYLYGSCAN 1621 + CR DI + APFLA+Y+SL+P V KEWP A+LYLYGSCAN Sbjct: 390 QMECRSDIHRLNAPFLALYESLIPPEEERAKQKQLLALLEKLVCKEWPEARLYLYGSCAN 449 Query: 1622 SFGVSNSDIDVCLAIDDADIDKSE 1693 SFGVS SDIDVCLA ++ D++KSE Sbjct: 450 SFGVSKSDIDVCLAFNEMDVNKSE 473 >ref|XP_007051991.1| Nucleotidyltransferase family protein isoform 1 [Theobroma cacao] gi|508704252|gb|EOX96148.1| Nucleotidyltransferase family protein isoform 1 [Theobroma cacao] Length = 722 Score = 188 bits (477), Expect = 7e-45 Identities = 145/444 (32%), Positives = 206/444 (46%), Gaps = 7/444 (1%) Frame = +2 Query: 383 WPQTVNPLYASTSFLPGFHSSPWAPQGSSLAANQIFFSDEFQQLGINGKPDGWKXXXXXX 562 WPQT++P A +FL GF SPW+ G+ A NQ D+ ++LG++G + Sbjct: 92 WPQTLSPPLAP-NFL-GFPLSPWSSPGNQFAGNQGALMDDLRRLGLSGIDNNKNHVIQNR 149 Query: 563 XXXXXXXNMLMFGSLACEIGS---PEELQNRNLLDKAMQFNVQKVLEVVEVNGRVDGLET 733 L+FGS +I + PE N NLL+ + ++N L++ Sbjct: 150 VQQKHQDQKLVFGSFPSDIQTLKTPEGSPNGNLLENS------------KLNLSNQQLDS 197 Query: 734 RSNSGVEIGSNSGQNREF-DRGMSSRSPHQWSRHGRHEPRRSTVAEPRMVPPGFPQRPKG 910 R NS Q+R DRG Q G + P S E R PPGF +P+G Sbjct: 198 RLNSNPNTSPYVFQHRNSGDRGK------QQQHGGSYRPTPSP--EARRSPPGFLGKPRG 249 Query: 911 PDQWAPM-SRRSDLERNVGKEWRARDKLNHQQTYTSDDKVEILKRQVPKFDGVEENESSR 1087 +RR E NV DK + + S D + Sbjct: 250 GGGNRDFGNRRRHFEHNV-------DKAKAEYSQPSSD--------------------NE 282 Query: 1088 LGLVKQLDQPGPPMGSDLHSVPASDIEESLASVHGRVEGSGRNGLTREESRNGDRGGGSV 1267 +GL QLD+PGPP GS+L SV A+DIEESL +H GR+ +R + + GG Sbjct: 283 VGLSGQLDRPGPPAGSNLQSVSATDIEESLLELHS---DGGRDRFSRRDKFRREDGG--- 336 Query: 1268 YEAHNHLEREDFPEHFADSLVLRDEVGRRRSDSFNQDSRSNR--SREPQATGQWRKNYRR 1441 E ++ E +SL++ DE + ++ + +R +R + Q + +R Sbjct: 337 
-------EVDEVGEQLLESLLIEDESDDKNDKKQHRREKESRIDNRGQRLLSQRMRMLKR 389 Query: 1442 PVRCRMDIEKWTAPFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQLYLYGSCAN 1621 + CR DI + APFLA+Y+SL+P V KEWP A+LYLYGSCAN Sbjct: 390 QMECRSDIHRLNAPFLALYESLIPPEEERAKQKQLLALLEKLVCKEWPEARLYLYGSCAN 449 Query: 1622 SFGVSNSDIDVCLAIDDADIDKSE 1693 SFGVS SDIDVCLA ++ D++KSE Sbjct: 450 SFGVSKSDIDVCLAFNEMDVNKSE 473 >ref|XP_002301312.2| hypothetical protein POPTR_0002s15230g [Populus trichocarpa] gi|550345065|gb|EEE80585.2| hypothetical protein POPTR_0002s15230g [Populus trichocarpa] Length = 728 Score = 184 bits (467), Expect = 1e-43 Identities = 133/372 (35%), Positives = 184/372 (49%), Gaps = 4/372 (1%) Frame = +2 Query: 590 LMFGSLACEIGSPEE-LQNRNLLDKAMQFNVQKVLEVVEVNGRVDGLETRSNSGVEIGSN 766 L FGS + EI SP E L N NL V EV +GLE + + SN Sbjct: 163 LQFGSFSSEIQSPAEVLVNANL-----------VREVGPGGRSFNGLERNRHLEKQANSN 211 Query: 767 SGQNREFDR--GMSSRSPHQWSRHGRHEPRRSTVAEPRMVPPGFPQRPKGPDQWAPMSRR 940 S +N E + G S +Q H+ + P PPGF +P+G W SRR Sbjct: 212 SRRNSEVRQPGGSSGGWGNQHRNQHLHQEQHRNYRSP---PPGFSNKPRGGGNWDYGSRR 268 Query: 941 SDLERNVGKEWRARDKLNHQQTYTSDDKVEILKRQVPKFDGVEENESSRLGLVKQLDQPG 1120 +LE N+ +E ++N+++ S+ VE LGL +QLD+PG Sbjct: 269 RELELNITRENGDYSEMNNEKVRRSEGSVE-------------------LGLTRQLDRPG 309 Query: 1121 PPMGSDLHSVPASDIEESLASVHGRVEGSGRNGLTREESRNGDRGGGSVYEAHNHLERED 1300 PP GS+LHSV S+I ESL ++ G E +G GG E +D Sbjct: 310 PPAGSNLHSVLGSEIGESLINLDG------------ENGEDGKDDGG---------ELDD 348 Query: 1301 FPEHFADSLVLRDEV-GRRRSDSFNQDSRSNRSREPQATGQWRKNYRRPVRCRMDIEKWT 1477 E DSL+L + G++ N++SRS+ +R + Q + ++ +C +DI++ Sbjct: 349 LGEELVDSLLLNGQSEGKKDKKQSNKESRSD-NRGKKILSQRMRMLKKQTQCCLDIDRLN 407 Query: 1478 APFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQLYLYGSCANSFGVSNSDIDVC 1657 A FLAIY+SL+P V+KEWP A+LYLYGS ANSFGVS SDIDVC Sbjct: 408 AAFLAIYESLIPPEEEKMKQELFLMSLEKLVNKEWPEARLYLYGSGANSFGVSKSDIDVC 467 Query: 1658 LAIDDADIDKSE 1693 LAI+DA+I+KSE Sbjct: 468 LAIEDAEINKSE 479 >ref|XP_002272342.2| PREDICTED: uncharacterized protein LOC100267790 [Vitis vinifera] Length = 
679 Score = 164 bits (416), Expect = 9e-38 Identities = 141/451 (31%), Positives = 198/451 (43%), Gaps = 13/451 (2%) Frame = +2 Query: 380 PWPQTVNPLYASTSFLPGFHSSPWAPQGSSLAANQIFFSDEFQQLG--INGKPDGWKXXX 553 PW N L + G +PW PQ ++ ++ ++LG + GK Sbjct: 86 PWANPPNYL------IQGLAQNPWPPQTPQFIGDRELLGEDGRRLGFDVRGKT------- 132 Query: 554 XXXXXXXXXXNMLMFGSLACEIGSPEELQNRNLLDKAMQFNVQKVLEVVEVNGRVDGLET 733 + LMFGS CEI + L N L+ + +++ L G+ D L+ Sbjct: 133 ----VQHQQHHKLMFGSFPCEIQNHGGLVNGKSLENPIPGAIREPLV-----GKFDALKN 183 Query: 734 RSNSGVEIGSNSGQNREFDRGMSSRSPHQWSRHGRHEPRRSTVAEPRMVPPGFPQRPKGP 913 G++ N + + R W H + E RS PPGFP + + Sbjct: 184 HK-MGLDPIWNLNSHHNASQQEQERRTVGWGTHQQGEFSRSGP------PPGFPSKARAV 236 Query: 914 DQWAPMSRRSDLERNVGKEWRARDKLNHQQTYTSDDKVEILKRQVPKFDGVEENESSRLG 1093 R LE DK+N + T++D E ++R P+ N S++LG Sbjct: 237 GNCDSGILRRGLE----------DKVN-KGNVTANDYDEKVRRLSPRHVDNHGNASAQLG 285 Query: 1094 LVKQLDQPGPPMGSDLHSVPASDIEESLASVHGRVEGSG------RNGLTREESRNGDRG 1255 L QL+ PGP + ASDIEE L ++ ++G G + G+ RE N D Sbjct: 286 LTGQLEHPGPLL--------ASDIEECLLNLGAEIDGVGDRVRHQKQGMRREGQGNLD-- 335 Query: 1256 GGSVYEAHNHLEREDFPEHFADSLVLRD-----EVGRRRSDSFNQDSRSNRSREPQATGQ 1420 D E SLVL D + +S N+D RS+ +R + Q Sbjct: 336 --------------DLSEEMTGSLVLEDGSQDKNDTNQHHNSRNRDFRSD-TRGQRMLSQ 380 Query: 1421 WRKNYRRPVRCRMDIEKWTAPFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQLY 1600 +N +R + CR DI FL+IY+SL+P VSKEWP+AQL+ Sbjct: 381 RVRNLKRHMECRRDIGTLNFRFLSIYESLIPEEEEKAKQKQLLTLLEKLVSKEWPKAQLF 440 Query: 1601 LYGSCANSFGVSNSDIDVCLAIDDADIDKSE 1693 LYGSCANSFGVS SDIDVCLAIDDADI+KSE Sbjct: 441 LYGSCANSFGVSKSDIDVCLAIDDADINKSE 471 >emb|CAN77386.1| hypothetical protein VITISV_006352 [Vitis vinifera] Length = 720 Score = 164 bits (416), Expect = 9e-38 Identities = 141/451 (31%), Positives = 198/451 (43%), Gaps = 13/451 (2%) Frame = +2 Query: 380 PWPQTVNPLYASTSFLPGFHSSPWAPQGSSLAANQIFFSDEFQQLG--INGKPDGWKXXX 553 PW N L + G +PW PQ ++ ++ ++LG + GK Sbjct: 86 PWANPPNYL------IQGLAQNPWPPQTPQFIGDRELLGEDGRRLGFDVRGKT------- 132 Query: 554 
XXXXXXXXXXNMLMFGSLACEIGSPEELQNRNLLDKAMQFNVQKVLEVVEVNGRVDGLET 733 + LMFGS CEI + L N L+ + +++ L G+ D L+ Sbjct: 133 ----VQHQQHHKLMFGSFPCEIQNHGGLVNGKSLENPIPGAIREPLV-----GKFDALKN 183 Query: 734 RSNSGVEIGSNSGQNREFDRGMSSRSPHQWSRHGRHEPRRSTVAEPRMVPPGFPQRPKGP 913 G++ N + + R W H + E RS PPGFP + + Sbjct: 184 HK-MGLDPIWNLNSHHNASQQEQERRTVGWGTHQQGEFSRSGP------PPGFPSKARAV 236 Query: 914 DQWAPMSRRSDLERNVGKEWRARDKLNHQQTYTSDDKVEILKRQVPKFDGVEENESSRLG 1093 R LE DK+N + T++D E ++R P+ N S++LG Sbjct: 237 GNCDSGILRRGLE----------DKVN-KGNVTANDYDEKVRRLSPRHVDNHGNASAQLG 285 Query: 1094 LVKQLDQPGPPMGSDLHSVPASDIEESLASVHGRVEGSG------RNGLTREESRNGDRG 1255 L QL+ PGP + ASDIEE L ++ ++G G + G+ RE N D Sbjct: 286 LTGQLEHPGPLL--------ASDIEECLLNLGAEIDGVGDRVRHQKQGMRREGQGNLD-- 335 Query: 1256 GGSVYEAHNHLEREDFPEHFADSLVLRD-----EVGRRRSDSFNQDSRSNRSREPQATGQ 1420 D E SLVL D + +S N+D RS+ +R + Q Sbjct: 336 --------------DLSEEMTGSLVLEDGSQDKNDTNQHHNSRNRDFRSD-TRGQRMLSQ 380 Query: 1421 WRKNYRRPVRCRMDIEKWTAPFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQLY 1600 +N +R + CR DI FL+IY+SL+P VSKEWP+AQL+ Sbjct: 381 RVRNLKRHMECRRDIGTLNFRFLSIYESLIPEEEEKAKQKQLLTLLEKLVSKEWPKAQLF 440 Query: 1601 LYGSCANSFGVSNSDIDVCLAIDDADIDKSE 1693 LYGSCANSFGVS SDIDVCLAIDDADI+KSE Sbjct: 441 LYGSCANSFGVSKSDIDVCLAIDDADINKSE 471 >ref|XP_004308428.1| PREDICTED: uncharacterized protein LOC101313262 [Fragaria vesca subsp. 
vesca] Length = 699 Score = 164 bits (414), Expect = 1e-37 Identities = 112/314 (35%), Positives = 149/314 (47%), Gaps = 3/314 (0%) Frame = +2 Query: 761 SNSGQNREFDRGMSSRSPHQWSRHGRHEPRRSTVAEPRMVPPGFPQRPKGPDQWAPMSRR 940 SNS + EF R + G E R M PPGF +P+G W RR Sbjct: 171 SNSSASNEFRRANYGSGEGELRGGGGGE--RGKQVHRTMPPPGFGNKPRGGGNWDSGGRR 228 Query: 941 SDLERNVGKEWRARDKLNHQQTYTSDDKVEILKRQVPKFDGVEENESSRLGLVKQLDQPG 1120 +E NV +E ++ + + D+ E ++R + G+ N R GL QLD+PG Sbjct: 229 GGMEYNVDRERQSSSGFARNREGSFDN--ERVRRLAGEDGGMRGNGDGRKGLSAQLDRPG 286 Query: 1121 PPMGSDLHSVPASDIEESLASVHGRVEGSGRNGLTREESRNGDRGGGSVYEAHNHLERED 1300 PP G++LHSV AS+IEES+ + G E +R G V + ER+D Sbjct: 287 PPAGTNLHSVSASEIEESMMNFDGG-----------ERARKDSDGVEDVGQHSLEEERDD 335 Query: 1301 FPE---HFADSLVLRDEVGRRRSDSFNQDSRSNRSREPQATGQWRKNYRRPVRCRMDIEK 1471 E H DS RSD Q S R R +Y+R CR DI++ Sbjct: 336 KIEGKQHHKDS----------RSDDRGQHQLSQRMR----------SYKRQTLCRFDIDR 375 Query: 1472 WTAPFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQLYLYGSCANSFGVSNSDID 1651 + APFL I+DSL+P + KEWP A+LY+YGSC NSFGVS SDID Sbjct: 376 FNAPFLEIFDSLIPTEEDKAKQKQLLTLLENIICKEWPDARLYIYGSCGNSFGVSKSDID 435 Query: 1652 VCLAIDDADIDKSE 1693 +CL I + DI+KSE Sbjct: 436 LCLEIGEEDINKSE 449 >ref|XP_006375316.1| hypothetical protein POPTR_0014s06910g, partial [Populus trichocarpa] gi|550323667|gb|ERP53113.1| hypothetical protein POPTR_0014s06910g, partial [Populus trichocarpa] Length = 497 Score = 161 bits (407), Expect = 1e-36 Identities = 125/373 (33%), Positives = 174/373 (46%), Gaps = 5/373 (1%) Frame = +2 Query: 590 LMFGSLACEIGSPEE-LQNRNLLDKAMQFNVQKVLEVVEVNGRVDGLETRSNSGVEIGSN 766 L FGS + I SP + L N NL+ EV + +GLE + + S+ Sbjct: 174 LQFGSFSSAIPSPADGLVNANLMR-----------EVGPGSRNFNGLERNRHLEKQANSH 222 Query: 767 SGQNREFDRGMSSRSPHQWSRHGRHEPRRSTVAEPRMVPPGFPQRPKGPD---QWAPMSR 937 S G SS R H+ + P PPGF +P+G W R Sbjct: 223 STNFEVRQPGASSGG-----RGNLHKEQHQNYKSP---PPGFSNKPRGGGGGGNWDHGGR 274 Query: 938 RSDLERNVGKEWRARDKLNHQQTYTSDDKVEILKRQVPKFDGVEENESSRLGLVKQLDQP 1117 R +LE + +E +LN+++ ++ VE+ +QLD+P Sbjct: 
275 RRELEHTMYREKGDYSELNNEKARRNEGSVEVR-------------------FTRQLDRP 315 Query: 1118 GPPMGSDLHSVPASDIEESLASVHGRVEGSGRNGLTREESRNGDRGGGSVYEAHNHLERE 1297 GPP GS+LHSV S+I+ESL ++ G GG + + Sbjct: 316 GPPPGSNLHSVLGSEIKESLINLDGE-------------------DGGLL---------D 347 Query: 1298 DFPEHFADSLVLRDEV-GRRRSDSFNQDSRSNRSREPQATGQWRKNYRRPVRCRMDIEKW 1474 D E DSL+L E G++ +++SRS+ SR Q + +R ++CR+DI++ Sbjct: 348 DLGEELMDSLLLEGESDGKKDKKQSSKESRSD-SRGHNILSQRMRMLKRQMQCRLDIDRL 406 Query: 1475 TAPFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQLYLYGSCANSFGVSNSDIDV 1654 A FLAIY+SLVP VSKEWP A+LYLYGSCANSFGVS SDIDV Sbjct: 407 NAAFLAIYESLVPPEEETAKQKQFFMLLEKLVSKEWPEARLYLYGSCANSFGVSKSDIDV 466 Query: 1655 CLAIDDADIDKSE 1693 CL I+DA+I KSE Sbjct: 467 CLTIEDAEIKKSE 479 >ref|XP_006490961.1| PREDICTED: uncharacterized protein LOC102611932 [Citrus sinensis] Length = 699 Score = 160 bits (404), Expect = 2e-36 Identities = 153/500 (30%), Positives = 212/500 (42%), Gaps = 21/500 (4%) Frame = +2 Query: 257 PHQSHPETQSRPRINGXXXXXXXXXXXXXXXMGPTVHLHNQ----------PWPQTVNPL 406 PHQ+ P+ S P +GPT++ Q WP+T PL Sbjct: 36 PHQTPPQQPSLPN------------DPAVAAVGPTINFQPQWPSNGCDLPPTWPRTPLPL 83 Query: 407 YASTSFLPGFHSSPWAPQGSSLAANQIFFSDEFQQLGINGKPDGWKXXXXXXXXXXXXXN 586 +FL GF +PWA S+ Q ++F +LG + N Sbjct: 84 ----NFL-GFPQNPWA-SSSTENQQQRLLCEDFGRLGFSNANYAAIHNLIQQPNHQQQQN 137 Query: 587 MLMFGSLACEIGSPEELQNRNLLDKAMQFNVQKVLEVVEVNGRVDGLETRSNSGVEIGSN 766 L FGS + P+ L N N L+ +++N+ + N + D + R++S S Sbjct: 138 -LRFGSFQVQ---PDSLLNLNHLEN-LKYNLDR-------NSQFD--QPRASSISNPNSF 183 Query: 767 SGQNREFDRGMSSRSPHQWSRHGRHEPRRSTVAEPRMVPPGFPQRPKGPDQWAPMSRRSD 946 +N E R H+ R PPGF + + Sbjct: 184 LHRNLENSR--------------EHDLRLGKQHYGSTPPPGFSNKAR------------- 216 Query: 947 LERNVGKEWRARDKLNHQQTYTSDDKVEILKRQVPKFDGVEENESSRLGLVKQLDQPGPP 1126 VG +R H V+++ R F + +GL +QLD+PGPP Sbjct: 217 ----VGGSGNSRRGFEHN--------VDMINR----FTSSAVEGGNGVGLTRQLDRPGPP 260 Query: 1127 MGSDLHSVPASDIEESLASVHGRVEGSGRN-GLT-REESRNGDRGGGSVYEAHNHLERED 1300 GS+LHSV A DIEESL + R EG R+ GL R E+ G GG + +D Sbjct: 261 
SGSNLHSVSALDIEESLLDL--RREGRERHLGLDKRRENGPGYSQGGD--------DMDD 310 Query: 1301 FPEHFADSLVLRDEVGRRRSDSFNQDSRSNRSREPQATGQWR---------KNYRRPVRC 1453 F E DSL+ DE + D + SR+ + R +N + + C Sbjct: 311 FGEDLVDSLLPDDESELKNDTHERNDKKHRNSRDKEIRSDNRGKRLLSQRMRNLKWQIEC 370 Query: 1454 RMDIEKWTAPFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQLYLYGSCANSFGV 1633 R DI + APFLAIY+SL+P V KEWP A+LYLYGSCANSFGV Sbjct: 371 RADIGRLNAPFLAIYESLIPAEEEKAKQKKLLTLLEKLVCKEWPDARLYLYGSCANSFGV 430 Query: 1634 SNSDIDVCLAIDDADIDKSE 1693 S SDIDVCLAI+D++I+KSE Sbjct: 431 SKSDIDVCLAINDSEINKSE 450 >ref|XP_006445207.1| hypothetical protein CICLE_v10023615mg, partial [Citrus clementina] gi|557547469|gb|ESR58447.1| hypothetical protein CICLE_v10023615mg, partial [Citrus clementina] Length = 1046 Score = 160 bits (404), Expect = 2e-36 Identities = 153/500 (30%), Positives = 212/500 (42%), Gaps = 21/500 (4%) Frame = +2 Query: 257 PHQSHPETQSRPRINGXXXXXXXXXXXXXXXMGPTVHLHNQ----------PWPQTVNPL 406 PHQ+ P+ S P +GPT++ Q WP+T PL Sbjct: 67 PHQTPPQQPSLPN------------DPAVAAVGPTINFQPQWPSNGCDLPPTWPRTPLPL 114 Query: 407 YASTSFLPGFHSSPWAPQGSSLAANQIFFSDEFQQLGINGKPDGWKXXXXXXXXXXXXXN 586 +FL GF +PWA S+ Q ++F +LG + N Sbjct: 115 ----NFL-GFPQNPWA-SSSTENQQQRLLCEDFGRLGFSNANYAAIHNLIQQPNHQQQQN 168 Query: 587 MLMFGSLACEIGSPEELQNRNLLDKAMQFNVQKVLEVVEVNGRVDGLETRSNSGVEIGSN 766 L FGS + P+ L N N L+ +++N+ + N + D + R++S S Sbjct: 169 -LRFGSFQVQ---PDSLLNLNHLEN-LKYNLDR-------NSQFD--QPRASSISNPNSF 214 Query: 767 SGQNREFDRGMSSRSPHQWSRHGRHEPRRSTVAEPRMVPPGFPQRPKGPDQWAPMSRRSD 946 +N E R H+ R PPGF + + Sbjct: 215 LHRNLENSR--------------EHDLRLGKQHYGSTPPPGFSNKAR------------- 247 Query: 947 LERNVGKEWRARDKLNHQQTYTSDDKVEILKRQVPKFDGVEENESSRLGLVKQLDQPGPP 1126 VG +R H V+++ R F + +GL +QLD+PGPP Sbjct: 248 ----VGGSGNSRRGFEHN--------VDMINR----FTSSAVEGGNGVGLTRQLDRPGPP 291 Query: 1127 MGSDLHSVPASDIEESLASVHGRVEGSGRN-GLT-REESRNGDRGGGSVYEAHNHLERED 1300 GS+LHSV A DIEESL + R EG R+ GL R E+ G GG + +D Sbjct: 292 SGSNLHSVSALDIEESLLDL--RREGRERHLGLDKRRENGPGYSQGGD--------DMDD 341 Query: 1301 
FPEHFADSLVLRDEVGRRRSDSFNQDSRSNRSREPQATGQWR---------KNYRRPVRC 1453 F E DSL+ DE + D + SR+ + R +N + + C Sbjct: 342 FGEDLVDSLLPDDESELKNDTHERNDKKHRNSRDKEIRSDNRGKRLLSQRMRNLKWQIEC 401 Query: 1454 RMDIEKWTAPFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQLYLYGSCANSFGV 1633 R DI + APFLAIY+SL+P V KEWP A+LYLYGSCANSFGV Sbjct: 402 RADIGRLNAPFLAIYESLIPAEEEKAKQKKLLTLLEKLVCKEWPDARLYLYGSCANSFGV 461 Query: 1634 SNSDIDVCLAIDDADIDKSE 1693 S SDIDVCLAI+D++I+KSE Sbjct: 462 SKSDIDVCLAINDSEINKSE 481 >ref|XP_006397741.1| hypothetical protein EUTSA_v10001324mg [Eutrema salsugineum] gi|557098814|gb|ESQ39194.1| hypothetical protein EUTSA_v10001324mg [Eutrema salsugineum] Length = 757 Score = 157 bits (398), Expect = 1e-35 Identities = 128/440 (29%), Positives = 190/440 (43%), Gaps = 18/440 (4%) Frame = +2 Query: 428 PGFHSSPWAPQGSSLAANQIFFSDEFQQLGINGKPDGWKXXXXXXXXXXXXXNMLMFGSL 607 PG H+SPWAP + + N + FS Q +N P L + Sbjct: 81 PGTHASPWAPPPNH-SPNLLGFS----QFPLNPFPANQFDGNQRVSAEDAYRLGLTGAGI 135 Query: 608 ACEIGSPEELQNRNLLDKAMQFNVQKVLEVVEVNGRVDG---LETRSNSGVEIGSNSGQN 778 + + + L+ + + + L NG ++G L++ S + G N Sbjct: 136 QSMVQQQQPPPPQKLVFGSFSGDAAQSL-----NGLLNGNLKLDSNIGSANHHPRSVGPN 190 Query: 779 REFDRGMSSRSPHQWSRHGRHEP-----RRSTVAEPRMVPPGFPQRPKGPDQWAPMSRRS 943 D +S SR G P R S P PPGF +G D Sbjct: 191 PNSDPNLSHDFHEHNSRRGNWGPIGSNGRGSKSTLPPPPPPGFSSNQRGWDMDLGSKGMG 250 Query: 944 DLERNVGKEWRARDKLNHQQTYTSDDKVEILKRQVPKFDGVEENESSRLGLVKQLDQPGP 1123 + N DK + + D K +V + + R L +Q+DQPGP Sbjct: 251 SFQGN------NHDKEKGEHSNLWDHKSVDFIAEVDRLRRLSIQNEGRFDLSQQIDQPGP 304 Query: 1124 PMGSDLHSVPASDIEESLASVHGRVEGSG--RNGLTREESRNGDRGGGSVYEAHNHLERE 1297 PMG++L+SV A+D E+S++ ++ G G R + S+ G G + +E Sbjct: 305 PMGTNLYSVSAADAEDSISMLNKEARGGGVGRKEELGQFSKGKREGNGECGPGDDDIE-- 362 Query: 1298 DFPEHFADSLVLRDEVGRRRSDSFNQDSRSNRSREPQATGQWRKNYRRPVR--------C 1453 F E +SL+L DE + + +SR++R +E + + ++ R+ R C Sbjct: 363 GFGEDIVESLLLEDETDDKNAKDGKNNSRTSREKESRMDTRGQRLLRQSSRIHRWRYMAC 422 Query: 1454 RMDIEKWTAPFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQLYLYGSCANSFGV 1633 R DI + 
APF+A+Y+SL+P V KEWP A+LYLYGSCANSFG Sbjct: 423 RYDIHMYDAPFIAVYESLIPAEEELEKQKQLMARLEHLVGKEWPHAKLYLYGSCANSFGF 482 Query: 1634 SNSDIDVCLAIDDADIDKSE 1693 SDIDVCLAI+D DI+KSE Sbjct: 483 PKSDIDVCLAIEDDDINKSE 502 >ref|NP_566048.1| Nucleotidyltransferase family protein [Arabidopsis thaliana] gi|13430538|gb|AAK25891.1|AF360181_1 unknown protein [Arabidopsis thaliana] gi|14532746|gb|AAK64074.1| unknown protein [Arabidopsis thaliana] gi|20197056|gb|AAC06161.2| expressed protein [Arabidopsis thaliana] gi|330255483|gb|AEC10577.1| Nucleotidyltransferase family protein [Arabidopsis thaliana] Length = 764 Score = 156 bits (395), Expect = 2e-35 Identities = 106/333 (31%), Positives = 152/333 (45%), Gaps = 12/333 (3%) Frame = +2 Query: 731 TRSNSGVEIGSNSGQNREFDRGMSSRSPH-QWSRHGRHEPRRSTVAEPRMVPPGFPQRPK 907 T SNS ++ + +N + S W G + R + P PPGF + Sbjct: 193 TLSNSNMDPNLSHHRNHDLHEQRGGHSGRGNWGHIGNNG--RGLKSTPPPPPPGFSSNQR 250 Query: 908 GPDQWAPMSRRSDLERNVGKEWRARDKLNHQQTYTSDDKV----EILKRQVPKFDGVEEN 1075 G W D +R +G+ NH Q KV + + G+ Sbjct: 251 G---WDMSLGSKDDDRGMGR--------NHDQAMGEHSKVWNQSVDFSAEANRLRGLSIQ 299 Query: 1076 ESSRLGLVKQLDQPGPPMGSDLHSVPASDIEESLASVHGRVEGSGRNGLTREESRNGDRG 1255 S+ L +Q+D PGPP G+ LHSV A+D +S + ++ G + R Sbjct: 300 NESKFNLSQQIDHPGPPKGASLHSVSAADAADSFSMLNKEARRGGERREELGQLSKAKRE 359 Query: 1256 GGSVYEAHNHLEREDFPEHFADSLVLRDEVGRRRSDSFNQDSRSNRSREPQAT------- 1414 G + N E EDF E SL+L DE G + ++ +DS+++R +E + Sbjct: 360 GNA-----NSDEIEDFGEDIVKSLLLEDETGEKDANDGKKDSKTSREKESRVDNRGQRLL 414 Query: 1415 GQWRKNYRRPVRCRMDIEKWTAPFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQ 1594 GQ + + + CR DI ++ A F+AIY SL+P V+KEWP A+ Sbjct: 415 GQKARMVKMYMACRNDIHRYDATFIAIYKSLIPAEEELEKQRQLMAHLENLVAKEWPHAK 474 Query: 1595 LYLYGSCANSFGVSNSDIDVCLAIDDADIDKSE 1693 LYLYGSCANSFG SDIDVCLAI+ DI+KSE Sbjct: 475 LYLYGSCANSFGFPKSDIDVCLAIEGDDINKSE 507 >ref|XP_007220905.1| hypothetical protein PRUPE_ppa002004mg [Prunus persica] gi|462417367|gb|EMJ22104.1| hypothetical protein PRUPE_ppa002004mg [Prunus persica] Length = 730 Score = 155 bits 
(392), Expect = 5e-35 Identities = 114/377 (30%), Positives = 170/377 (45%), Gaps = 20/377 (5%) Frame = +2 Query: 623 SPEELQNRNLLDKAMQFNVQKVLEVVEVNG---RVDGLETRSNSGVEIGS-NSGQNREFD 790 S LQ++NL Q Q+ L+ + R +N+ E+ + ++G +R + Sbjct: 145 SNNALQSQNLAQLKQQHQEQQKLKFSYLPSDIIRNPEPPVTANTSSEVSNLSNGFDRSLN 204 Query: 791 RGMSSRSPHQWSRHGRHEPRRSTVAEPR----------------MVPPGFPQRPKGPDQW 922 ++ S RHG + S E R PPGF +G W Sbjct: 205 LNPNNSSSSNEFRHGNPDTFNSREQERRGGGGGGAGRGKQFQRNTPPPGFGNNSRGGGNW 264 Query: 923 APMSRRSDLERNVGKEWRARDKLNHQQTYTSDDKVEILKRQVPKFDGVEENESSRLGLVK 1102 SRR D E NV +E ++ + + + +D E ++R + + N + LG Sbjct: 265 DSGSRRRDFEHNVDRERQSSSEFVRNRDASFED--ERVRRLASEDSRIRGNGARGLGFSA 322 Query: 1103 QLDQPGPPMGSDLHSVPASDIEESLASVHGRVEGSGRNGLTREESRNGDRGGGSVYEAHN 1282 QLD PGPP G++LHS AS+IE+S+ ++ ++ +N + + HN Sbjct: 323 QLDDPGPPTGANLHSASASEIEKSMMNLQ-----------HEKDDKNEEDDKNEAKQHHN 371 Query: 1283 HLEREDFPEHFADSLVLRDEVGRRRSDSFNQDSRSNRSREPQATGQWRKNYRRPVRCRMD 1462 E++ RSD+ Q S R R ++ ++CR D Sbjct: 372 SREKDS------------------RSDNRGQHLLSQRMR----------IFKSQMQCRFD 403 Query: 1463 IEKWTAPFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQLYLYGSCANSFGVSNS 1642 I++ APFLAIYDSL+P ++KEWP AQLY+YGSC NSFGVS S Sbjct: 404 IDRLNAPFLAIYDSLIPTEEEKAKQNQLFTLLETLITKEWPEAQLYVYGSCGNSFGVSKS 463 Query: 1643 DIDVCLAIDDADIDKSE 1693 DID+CLAID AD +KSE Sbjct: 464 DIDLCLAIDVADDNKSE 480 >ref|XP_003529982.1| PREDICTED: uncharacterized protein LOC100812787 [Glycine max] Length = 732 Score = 155 bits (391), Expect = 7e-35 Identities = 128/372 (34%), Positives = 170/372 (45%), Gaps = 21/372 (5%) Frame = +2 Query: 641 NRNLLDKAMQFNVQKVLEVVEVNGRVDGLETRSNSGVEIGSNSGQ---NREFDR--GMSS 805 N N +D + + Q+ + E+ + L T + S E+ SN G N +F+R +S Sbjct: 147 NNNKVDGFVHHHHQQQQQQHELKLQFGSLPTVAYSAAEVSSNGGDSLLNLKFNRVDHPTS 206 Query: 806 RSPHQWSRHGRHEP----RR---------STVAEPRMVPPGFPQRPKGPDQWAPMSRRSD 946 S G H+ RR S E VPPGF R +G Sbjct: 207 NSSGNVVVQGNHDAVERERRGLGGYRAGGSLPPETSRVPPGFGNRTRGK----------- 255 Query: 947 LERNVGKEWRARDKLNHQQTYTSDDKVEILKRQVPKFDGVEENESSRLGLVKQLDQPGPP 
1126 G E R ++ Y D+ E + + V N ++GLV QLD+PGPP Sbjct: 256 -----GLEGR------NENLY---DRREGGRMVSGERSNVRGNVGHKMGLVDQLDRPGPP 301 Query: 1127 MGSDLHSVPASDIEESLASVHGRVEGSGRNGLTREESRNGDRGGGSVYEAHNHLEREDFP 1306 GS LHS +D + V GR G R E GGG+ + Sbjct: 302 AGSHLHSGSGNDA--GIGEVGGRDGKHKEIGRLRMEGVPESGGGGADVDV--------LG 351 Query: 1307 EHFADSLVLRDEVGRR---RSDSFNQDSRSNRSREPQATGQWRKNYRRPVRCRMDIEKWT 1477 E ADSL+++DE R R +D R + SR Q Q + YRR + CR DI+ + Sbjct: 352 EQLADSLLVKDESDDRTNLRQRRREKDVRLSDSRGQQIMSQRGRMYRRQMMCRRDIDVFN 411 Query: 1478 APFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQLYLYGSCANSFGVSNSDIDVC 1657 PFLAIY SL+P VSKEWP A+LYLYGSCANSFGVS SDIDVC Sbjct: 412 VPFLAIYGSLIPPEEEKLKQKKLVALLEKLVSKEWPTAKLYLYGSCANSFGVSKSDIDVC 471 Query: 1658 LAIDDADIDKSE 1693 LAI++AD++KS+ Sbjct: 472 LAIEEADMEKSK 483 >ref|XP_006339776.1| PREDICTED: uncharacterized protein LOC102603223 [Solanum tuberosum] Length = 775 Score = 154 bits (390), Expect = 9e-35 Identities = 125/374 (33%), Positives = 170/374 (45%), Gaps = 11/374 (2%) Frame = +2 Query: 605 LACEIGSPEELQNRNLLDKAMQFN-VQKVLEVVEVNGR-----VDGLETRSNSGVEIGSN 766 LAC++G+ E+ + L N V+ E V +GR + GLE ++ G S Sbjct: 189 LACKVGNFEQKNQESRLTNVRMLNGVEGKRENVIGSGRKQLGNLRGLEQQNRGGGGGESE 248 Query: 767 SGQNREFDRGMSSRSPHQWSRHGRHEPRRSTVAEPRMVPPGFPQRPKGPDQWAPMSRRSD 946 SG G+ GR S + PPGF +P R D Sbjct: 249 SG-------GL-----------GRGRQFHSGTVRGAVPPPGFSSKP----------RSRD 280 Query: 947 LERNVGKEWRARDKLNHQQTYTSDDKVEILKRQVPKFDGVEENESSRLGLVKQLDQPGPP 1126 E NV E +LNH+ + K E + + + S + +QLD P PP Sbjct: 281 FEHNVDNEKNNFVELNHRGIGLNH-KYERESKHLTRNGKNYAIGSDDQRVFRQLDSPVPP 339 Query: 1127 MGSDLHSVPASDIEESLASVHGRVEGSGRNGLTREESRNGDRGGGSVYEAHNHLEREDFP 1306 GS LHSV SD+E+S +HG SG EE+ +G R A + ++ Sbjct: 340 AGSKLHSVLGSDVEDSTLELHGEDAESG------EETVSGMRNVLGRSSAQGQSDLDELG 393 Query: 1307 EHFADSLVLRDEVGRRRSD-----SFNQDSRSNRSREPQATGQWRKNYRRPVRCRMDIEK 1471 EH SL L DE R S ++D RS++ R GQ + +R + CR DI + Sbjct: 394 EHVISSLGLEDEPDERSDKKKHHASRDKDYRSDK-RGAYILGQRMRMLKRQIACRSDINR 452 Query: 1472 
WTAPFLAIYDSLVPXXXXXXXXXXXXXXXXXXVSKEWPRAQLYLYGSCANSFGVSNSDID 1651 FLA ++SL+P VSKEWP A+LY+YGSCANSFG S SDID Sbjct: 453 MNGAFLATFESLIPPEEERTKQKQLLALLDEIVSKEWPDARLYVYGSCANSFGFSKSDID 512 Query: 1652 VCLAIDDADIDKSE 1693 +CLAI+DA+IDKSE Sbjct: 513 ICLAIEDANIDKSE 526 >ref|XP_002880188.1| hypothetical protein ARALYDRAFT_483698 [Arabidopsis lyrata subsp. lyrata] gi|297326027|gb|EFH56447.1| hypothetical protein ARALYDRAFT_483698 [Arabidopsis lyrata subsp. lyrata] Length = 757 Score = 152 bits (384), Expect = 4e-34 Identities = 110/358 (30%), Positives = 164/358 (45%), Gaps = 28/358 (7%) Frame = +2 Query: 704 VNGRVDGLETRSNSGVEIGS---NSGQNREFDRGMSSRSPHQWSRHGRHEPRRSTVAEPR 874 V G G T+S +G+ G+ +S Q+ + R S + HEPR S Sbjct: 152 VFGSFSGDATQSLNGLHNGNLKYDSNQHEQLMRHPQSVLSNSNMDPNLHEPRGSHSGRGN 211 Query: 875 M--------------VPPGFPQRPKGPDQWAPMSRRSDLERNVGKEWRARDKLNHQQTYT 1012 PPGF +G D ++ + D +R +G R D+ + + Sbjct: 212 WGHIGNNGRGFKSTPPPPGFSSNQRGRDM--NLTSKDD-DRGMGSFHRNHDQAMGEHSKF 268 Query: 1013 SDDKVEILKRQVPKFDGVEENESSRLGLVKQLDQPGPPMGSDLHSVPASDIEESLASVHG 1192 D V + + G+ S+ L +Q+D PG P G+ LHSV A+D +S + ++ Sbjct: 269 WDQSVNF-SAEADRLRGLSIQNDSKFNLSQQIDHPGLPKGTSLHSVSAADAADSFSMLNK 327 Query: 1193 RVEGSGRN----GLTREESRNGDRGGGSVYEAHNHLEREDFPEHFADSLVLRDEVGRRRS 1360 G G + R G+ G V + E EDF E SL+L DE G + + Sbjct: 328 EARGGSERKEELGRLSKGKREGNANSGPVDD-----EIEDFGEDIVKSLLLEDETGEKDA 382 Query: 1361 DSFNQDSRSNRSREPQAT-------GQWRKNYRRPVRCRMDIEKWTAPFLAIYDSLVPXX 1519 +DS+++R ++ + GQ + + + CR DI ++ A F+A+Y SL+P Sbjct: 383 KDGKKDSKTSREKDSRMDNRGQRLLGQKARMVKMYMACRNDIHRYDASFIAVYKSLIPAE 442 Query: 1520 XXXXXXXXXXXXXXXXVSKEWPRAQLYLYGSCANSFGVSNSDIDVCLAIDDADIDKSE 1693 V+KEWP A+LYLYGSCANSFG SDIDVCLAI+ DI+KSE Sbjct: 443 EELEKQRQLMAHLENLVAKEWPHAKLYLYGSCANSFGFPKSDIDVCLAIEGDDINKSE 500