BLASTX nr result
ID: Mentha24_contig00006728
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha24_contig00006728 (950 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|EYU32028.1| hypothetical protein MIMGU_mgv1a001944mg [Mimulus...   152   2e-34
ref|XP_007051995.1| Nucleotidyltransferase family protein isofor...   140   7e-31
ref|XP_007051994.1| Nucleotidyltransferase family protein isofor...   140   7e-31
ref|XP_007051993.1| Nucleotidyltransferase family protein isofor...   140   7e-31
ref|XP_007051992.1| Nucleotidyltransferase family protein isofor...   140   7e-31
ref|XP_007051991.1| Nucleotidyltransferase family protein isofor...   140   7e-31
ref|XP_006339776.1| PREDICTED: uncharacterized protein LOC102603...   123   1e-25
ref|XP_002511755.1| poly(A) polymerase cid, putative [Ricinus co...   117   7e-24
dbj|BAJ53142.1| JHL05D22.13 [Jatropha curcas]                         114   4e-23
ref|XP_004229872.1| PREDICTED: uncharacterized protein LOC101244...   113   1e-22
gb|EXC11712.1| Poly(A) RNA polymerase cid11 [Morus notabilis]         111   4e-22
ref|XP_006490961.1| PREDICTED: uncharacterized protein LOC102611...   110   1e-21
ref|XP_006445207.1| hypothetical protein CICLE_v10023615mg, part...   110   1e-21
ref|XP_002301312.2| hypothetical protein POPTR_0002s15230g [Popu...   107   5e-21
ref|XP_006375316.1| hypothetical protein POPTR_0014s06910g, part...   104   6e-20
ref|XP_007220905.1| hypothetical protein PRUPE_ppa002004mg [Prun...    92   2e-16
ref|XP_006295859.1| hypothetical protein CARUB_v10024989mg [Caps...    91   7e-16
ref|XP_004308428.1| PREDICTED: uncharacterized protein LOC101313...    89   2e-15
ref|XP_002880188.1| hypothetical protein ARALYDRAFT_483698 [Arab...
82 2e-13 ref|NP_566048.1| Nucleotidyltransferase family protein [Arabidop... 79 3e-12 >gb|EYU32028.1| hypothetical protein MIMGU_mgv1a001944mg [Mimulus guttatus] Length = 735 Score = 152 bits (383), Expect = 2e-34 Identities = 120/381 (31%), Positives = 159/381 (41%), Gaps = 66/381 (17%) Frame = +3 Query: 6 DFAFSPWSHLPPPPFAEHXXXXXXXXXXXXYSSQSRG------FTQSPSQFDNHLRRILP 167 D F W H P PPFA H + +P QF+ RI P Sbjct: 71 DLQFRQWKHSPVPPFAPHQYFQQNPIARPNLNPDFPSPPPPGELNYAPHQFNLQSNRISP 130 Query: 168 PDD------------------------------DARNLSFNSKPSLPN------QPGNLM 239 +D DAR L + + P+ + +L+ Sbjct: 131 GEDARKLAPYGDNSRPSAAAHQQLQSNRIPLGEDARRLGVFGEIATPSVAQHQREQNHLI 190 Query: 240 FGSVSRDIL------------------GPAANANALDYRKNDNRFPNPIEANERNSRTVM 365 FGS++RDIL G + L + NRFP NE N Sbjct: 191 FGSLNRDILQTDAGDVLHQSLHPMDKLGNSYLEEVLGMDRRMNRFP----VNEVNG---- 242 Query: 366 RAQNQERSSTSNDRVKLADGGSKTAVAPPPGFLSNSKDARHREAGYGRRASDVNEDKGKG 545 N +S+ N+R D GS A+APP +N K+ +RE GY R D DKGKG Sbjct: 243 ---NSRGNSSGNERRNQGDNGSHRALAPPGFSSNNMKNVGNREHGYVTRNPDNYVDKGKG 299 Query: 546 NSGQLHKNDRLSNQLDFPGLPAGSSIHSALTFDIEESMKQLHXXXXXXXXXXXXXXXNND 725 NSG +KN +SN ++ PG SM +H NN Sbjct: 300 NSGGSYKNGGVSNPINSPG-----------------SMMGIHVEDGGKGKELRFGGQNNK 342 Query: 726 G------SEMDDLENQVDSLGIEEESGGKNTKKKHHRDKDYRSDDRGKWIMGQRMRIMKR 887 S+M+ +E+Q+ SLGIEEESG + KKK+ DK+YRSD RG+WIMGQRMR +K Sbjct: 343 NQGDRAQSKMNGIEDQMGSLGIEEESGETSDKKKNPHDKEYRSDQRGQWIMGQRMRHVKM 402 Query: 888 QTTCRNDINRLSGPLLALVES 950 QT CR DI+R + L + ES Sbjct: 403 QTACRKDIDRFNSQFLTVFES 423 >ref|XP_007051995.1| Nucleotidyltransferase family protein isoform 5 [Theobroma cacao] gi|508704256|gb|EOX96152.1| Nucleotidyltransferase family protein isoform 5 [Theobroma cacao] Length = 635 Score = 140 bits (353), Expect = 7e-31 Identities = 104/305 (34%), Positives = 152/305 (49%), Gaps = 20/305 (6%) Frame = +3 Query: 96 YSSQSRGFTQSPSQFDNHLRRI-LPPDDDARNLSFNSKPSLPNQPGNLMFGSVSRDILG- 269 +SS F + + LRR+ L D+ +N ++ +Q L+FGS DI Sbjct: 112 
WSSPGNQFAGNQGALMDDLRRLGLSGIDNNKNHVIQNRVQQKHQDQKLVFGSFPSDIQTL 171 Query: 270 ----PAANANALDYRK---NDNRFPNPIEANERNSRTVMRAQNQERSSTSNDRVKLADGG 428 + N N L+ K ++ + + + +N S V + +N S DR K G Sbjct: 172 KTPEGSPNGNLLENSKLNLSNQQLDSRLNSNPNTSPYVFQHRN------SGDRGKQQQHG 225 Query: 429 SKTAVAP-------PPGFLSNSKDAR-HREAGYGRRASDVNEDKGKGNSGQLHKNDR--L 578 P PPGFL + +R+ G RR + N DK K Q ++ L Sbjct: 226 GSYRPTPSPEARRSPPGFLGKPRGGGGNRDFGNRRRHFEHNVDKAKAEYSQPSSDNEVGL 285 Query: 579 SNQLDFPGLPAGSSIHSALTFDIEESMKQLHXXXXXXXXXXXXXXXNNDGSEMDDLENQV 758 S QLD PG PAGS++ S DIEES+ +LH DG E+D++ Q+ Sbjct: 286 SGQLDRPGPPAGSNLQSVSATDIEESLLELHSDGGRDRFSRRDKFRREDGGEVDEVGEQL 345 Query: 759 -DSLGIEEESGGKNTKKKHHRDKDYRSDDRGKWIMGQRMRIMKRQTTCRNDINRLSGPLL 935 +SL IE+ES KN KK+H R+K+ R D+RG+ ++ QRMR++KRQ CR+DI+RL+ P L Sbjct: 346 LESLLIEDESDDKNDKKQHRREKESRIDNRGQRLLSQRMRMLKRQMECRSDIHRLNAPFL 405 Query: 936 ALVES 950 AL ES Sbjct: 406 ALYES 410 >ref|XP_007051994.1| Nucleotidyltransferase family protein isoform 4, partial [Theobroma cacao] gi|508704255|gb|EOX96151.1| Nucleotidyltransferase family protein isoform 4, partial [Theobroma cacao] Length = 585 Score = 140 bits (353), Expect = 7e-31 Identities = 104/305 (34%), Positives = 152/305 (49%), Gaps = 20/305 (6%) Frame = +3 Query: 96 YSSQSRGFTQSPSQFDNHLRRI-LPPDDDARNLSFNSKPSLPNQPGNLMFGSVSRDILG- 269 +SS F + + LRR+ L D+ +N ++ +Q L+FGS DI Sbjct: 112 WSSPGNQFAGNQGALMDDLRRLGLSGIDNNKNHVIQNRVQQKHQDQKLVFGSFPSDIQTL 171 Query: 270 ----PAANANALDYRK---NDNRFPNPIEANERNSRTVMRAQNQERSSTSNDRVKLADGG 428 + N N L+ K ++ + + + +N S V + +N S DR K G Sbjct: 172 KTPEGSPNGNLLENSKLNLSNQQLDSRLNSNPNTSPYVFQHRN------SGDRGKQQQHG 225 Query: 429 SKTAVAP-------PPGFLSNSKDAR-HREAGYGRRASDVNEDKGKGNSGQLHKNDR--L 578 P PPGFL + +R+ G RR + N DK K Q ++ L Sbjct: 226 GSYRPTPSPEARRSPPGFLGKPRGGGGNRDFGNRRRHFEHNVDKAKAEYSQPSSDNEVGL 285 Query: 579 SNQLDFPGLPAGSSIHSALTFDIEESMKQLHXXXXXXXXXXXXXXXNNDGSEMDDLENQV 758 S QLD PG PAGS++ S DIEES+ +LH DG E+D++ Q+ Sbjct: 286 SGQLDRPGPPAGSNLQSVSATDIEESLLELHSDGGRDRFSRRDKFRREDGGEVDEVGEQL 345 
Query: 759 -DSLGIEEESGGKNTKKKHHRDKDYRSDDRGKWIMGQRMRIMKRQTTCRNDINRLSGPLL 935 +SL IE+ES KN KK+H R+K+ R D+RG+ ++ QRMR++KRQ CR+DI+RL+ P L Sbjct: 346 LESLLIEDESDDKNDKKQHRREKESRIDNRGQRLLSQRMRMLKRQMECRSDIHRLNAPFL 405 Query: 936 ALVES 950 AL ES Sbjct: 406 ALYES 410 >ref|XP_007051993.1| Nucleotidyltransferase family protein isoform 3, partial [Theobroma cacao] gi|508704254|gb|EOX96150.1| Nucleotidyltransferase family protein isoform 3, partial [Theobroma cacao] Length = 584 Score = 140 bits (353), Expect = 7e-31 Identities = 104/305 (34%), Positives = 152/305 (49%), Gaps = 20/305 (6%) Frame = +3 Query: 96 YSSQSRGFTQSPSQFDNHLRRI-LPPDDDARNLSFNSKPSLPNQPGNLMFGSVSRDILG- 269 +SS F + + LRR+ L D+ +N ++ +Q L+FGS DI Sbjct: 112 WSSPGNQFAGNQGALMDDLRRLGLSGIDNNKNHVIQNRVQQKHQDQKLVFGSFPSDIQTL 171 Query: 270 ----PAANANALDYRK---NDNRFPNPIEANERNSRTVMRAQNQERSSTSNDRVKLADGG 428 + N N L+ K ++ + + + +N S V + +N S DR K G Sbjct: 172 KTPEGSPNGNLLENSKLNLSNQQLDSRLNSNPNTSPYVFQHRN------SGDRGKQQQHG 225 Query: 429 SKTAVAP-------PPGFLSNSKDAR-HREAGYGRRASDVNEDKGKGNSGQLHKNDR--L 578 P PPGFL + +R+ G RR + N DK K Q ++ L Sbjct: 226 GSYRPTPSPEARRSPPGFLGKPRGGGGNRDFGNRRRHFEHNVDKAKAEYSQPSSDNEVGL 285 Query: 579 SNQLDFPGLPAGSSIHSALTFDIEESMKQLHXXXXXXXXXXXXXXXNNDGSEMDDLENQV 758 S QLD PG PAGS++ S DIEES+ +LH DG E+D++ Q+ Sbjct: 286 SGQLDRPGPPAGSNLQSVSATDIEESLLELHSDGGRDRFSRRDKFRREDGGEVDEVGEQL 345 Query: 759 -DSLGIEEESGGKNTKKKHHRDKDYRSDDRGKWIMGQRMRIMKRQTTCRNDINRLSGPLL 935 +SL IE+ES KN KK+H R+K+ R D+RG+ ++ QRMR++KRQ CR+DI+RL+ P L Sbjct: 346 LESLLIEDESDDKNDKKQHRREKESRIDNRGQRLLSQRMRMLKRQMECRSDIHRLNAPFL 405 Query: 936 ALVES 950 AL ES Sbjct: 406 ALYES 410 >ref|XP_007051992.1| Nucleotidyltransferase family protein isoform 2 [Theobroma cacao] gi|508704253|gb|EOX96149.1| Nucleotidyltransferase family protein isoform 2 [Theobroma cacao] Length = 621 Score = 140 bits (353), Expect = 7e-31 Identities = 104/305 (34%), Positives = 152/305 (49%), Gaps = 20/305 (6%) Frame = +3 Query: 96 
YSSQSRGFTQSPSQFDNHLRRI-LPPDDDARNLSFNSKPSLPNQPGNLMFGSVSRDILG- 269 +SS F + + LRR+ L D+ +N ++ +Q L+FGS DI Sbjct: 112 WSSPGNQFAGNQGALMDDLRRLGLSGIDNNKNHVIQNRVQQKHQDQKLVFGSFPSDIQTL 171 Query: 270 ----PAANANALDYRK---NDNRFPNPIEANERNSRTVMRAQNQERSSTSNDRVKLADGG 428 + N N L+ K ++ + + + +N S V + +N S DR K G Sbjct: 172 KTPEGSPNGNLLENSKLNLSNQQLDSRLNSNPNTSPYVFQHRN------SGDRGKQQQHG 225 Query: 429 SKTAVAP-------PPGFLSNSKDAR-HREAGYGRRASDVNEDKGKGNSGQLHKNDR--L 578 P PPGFL + +R+ G RR + N DK K Q ++ L Sbjct: 226 GSYRPTPSPEARRSPPGFLGKPRGGGGNRDFGNRRRHFEHNVDKAKAEYSQPSSDNEVGL 285 Query: 579 SNQLDFPGLPAGSSIHSALTFDIEESMKQLHXXXXXXXXXXXXXXXNNDGSEMDDLENQV 758 S QLD PG PAGS++ S DIEES+ +LH DG E+D++ Q+ Sbjct: 286 SGQLDRPGPPAGSNLQSVSATDIEESLLELHSDGGRDRFSRRDKFRREDGGEVDEVGEQL 345 Query: 759 -DSLGIEEESGGKNTKKKHHRDKDYRSDDRGKWIMGQRMRIMKRQTTCRNDINRLSGPLL 935 +SL IE+ES KN KK+H R+K+ R D+RG+ ++ QRMR++KRQ CR+DI+RL+ P L Sbjct: 346 LESLLIEDESDDKNDKKQHRREKESRIDNRGQRLLSQRMRMLKRQMECRSDIHRLNAPFL 405 Query: 936 ALVES 950 AL ES Sbjct: 406 ALYES 410 >ref|XP_007051991.1| Nucleotidyltransferase family protein isoform 1 [Theobroma cacao] gi|508704252|gb|EOX96148.1| Nucleotidyltransferase family protein isoform 1 [Theobroma cacao] Length = 722 Score = 140 bits (353), Expect = 7e-31 Identities = 104/305 (34%), Positives = 152/305 (49%), Gaps = 20/305 (6%) Frame = +3 Query: 96 YSSQSRGFTQSPSQFDNHLRRI-LPPDDDARNLSFNSKPSLPNQPGNLMFGSVSRDILG- 269 +SS F + + LRR+ L D+ +N ++ +Q L+FGS DI Sbjct: 112 WSSPGNQFAGNQGALMDDLRRLGLSGIDNNKNHVIQNRVQQKHQDQKLVFGSFPSDIQTL 171 Query: 270 ----PAANANALDYRK---NDNRFPNPIEANERNSRTVMRAQNQERSSTSNDRVKLADGG 428 + N N L+ K ++ + + + +N S V + +N S DR K G Sbjct: 172 KTPEGSPNGNLLENSKLNLSNQQLDSRLNSNPNTSPYVFQHRN------SGDRGKQQQHG 225 Query: 429 SKTAVAP-------PPGFLSNSKDAR-HREAGYGRRASDVNEDKGKGNSGQLHKNDR--L 578 P PPGFL + +R+ G RR + N DK K Q ++ L Sbjct: 226 GSYRPTPSPEARRSPPGFLGKPRGGGGNRDFGNRRRHFEHNVDKAKAEYSQPSSDNEVGL 285 Query: 579 SNQLDFPGLPAGSSIHSALTFDIEESMKQLHXXXXXXXXXXXXXXXNNDGSEMDDLENQV 758 S QLD PG PAGS++ S DIEES+ 
+LH DG E+D++ Q+ Sbjct: 286 SGQLDRPGPPAGSNLQSVSATDIEESLLELHSDGGRDRFSRRDKFRREDGGEVDEVGEQL 345 Query: 759 -DSLGIEEESGGKNTKKKHHRDKDYRSDDRGKWIMGQRMRIMKRQTTCRNDINRLSGPLL 935 +SL IE+ES KN KK+H R+K+ R D+RG+ ++ QRMR++KRQ CR+DI+RL+ P L Sbjct: 346 LESLLIEDESDDKNDKKQHRREKESRIDNRGQRLLSQRMRMLKRQMECRSDIHRLNAPFL 405 Query: 936 ALVES 950 AL ES Sbjct: 406 ALYES 410 >ref|XP_006339776.1| PREDICTED: uncharacterized protein LOC102603223 [Solanum tuberosum] Length = 775 Score = 123 bits (308), Expect = 1e-25 Identities = 104/316 (32%), Positives = 143/316 (45%), Gaps = 65/316 (20%) Frame = +3 Query: 198 NSKPSLPNQP--GNLMFGSVSRDILGPAANANALDYRKNDN------------------- 314 N+K S N NL+FGS+ RDI G N + L+ R +D+ Sbjct: 151 NAKASNSNNEFDHNLIFGSLRRDIQG---NVSMLNDRFSDDLACKVGNFEQKNQESRLTN 207 Query: 315 -RFPNPIEANERN----------SRTVMRAQNQ-----ERSSTSNDRVKLADGGSKTAVA 446 R N +E N + + QN+ E S R + G+ Sbjct: 208 VRMLNGVEGKRENVIGSGRKQLGNLRGLEQQNRGGGGGESESGGLGRGRQFHSGTVRGAV 267 Query: 447 PPPGFLSN--SKDARHR------------EAGYGRRASDVNEDKGKGNSGQLHK----ND 572 PPPGF S S+D H G G E K +G+ + + Sbjct: 268 PPPGFSSKPRSRDFEHNVDNEKNNFVELNHRGIGLNHKYERESKHLTRNGKNYAIGSDDQ 327 Query: 573 RLSNQLDFPGLPAGSSIHSALTFDIEESMKQLHXXXXXXXXXXXXXXXNNDG-------S 731 R+ QLD P PAGS +HS L D+E+S +LH N G S Sbjct: 328 RVFRQLDSPVPPAGSKLHSVLGSDVEDSTLELHGEDAESGEETVSGMRNVLGRSSAQGQS 387 Query: 732 EMDDL-ENQVDSLGIEEESGGKNTKKKHH--RDKDYRSDDRGKWIMGQRMRIMKRQTTCR 902 ++D+L E+ + SLG+E+E ++ KKKHH RDKDYRSD RG +I+GQRMR++KRQ CR Sbjct: 388 DLDELGEHVISSLGLEDEPDERSDKKKHHASRDKDYRSDKRGAYILGQRMRMLKRQIACR 447 Query: 903 NDINRLSGPLLALVES 950 +DINR++G LA ES Sbjct: 448 SDINRMNGAFLATFES 463 >ref|XP_002511755.1| poly(A) polymerase cid, putative [Ricinus communis] gi|223548935|gb|EEF50424.1| poly(A) polymerase cid, putative [Ricinus communis] Length = 696 Score = 117 bits (293), Expect = 7e-24 Identities = 74/190 (38%), Positives = 101/190 (53%), Gaps = 22/190 (11%) Frame = +3 Query: 447 PPPGFLSNSKDARHREAGYGRRASDVNEDKGKGNSGQLHKNDR----------------- 575 PPPGF + + + + RR D N +K KGN +L 
K + Sbjct: 244 PPPGFSNKPRGGGNMDHVSRRRELDHNVNKEKGNHSELSKRNAFLSSESKSLRDGNGSRD 303 Query: 576 --LSNQLDFPGLPAGSSIHSALTFDIEESMKQLHXXXXXXXXXXXXXXXNNDGSEMDDL- 746 L+ QLD PG PAGS++HS DIEES+ + NDG ++DD+ Sbjct: 304 LGLTRQLDHPGPPAGSNLHSVSALDIEESLLNFNAEMVEDG--------KNDGHDLDDVG 355 Query: 747 ENQVDSLGIEEESGGKNTKK--KHHRDKDYRSDDRGKWIMGQRMRIMKRQTTCRNDINRL 920 E D+L +E ES GKN K +H RDK+ RSD+RG+ I+ QRMR++KRQ CR DI+RL Sbjct: 356 EELADTLLLEGESEGKNDNKQNRHSRDKESRSDNRGQQILSQRMRMLKRQMECRRDIDRL 415 Query: 921 SGPLLALVES 950 + LA+ ES Sbjct: 416 NVSFLAIYES 425 >dbj|BAJ53142.1| JHL05D22.13 [Jatropha curcas] Length = 748 Score = 114 bits (286), Expect = 4e-23 Identities = 93/293 (31%), Positives = 134/293 (45%), Gaps = 33/293 (11%) Frame = +3 Query: 171 DDDARNLSFNSKPSLPNQPGNLMFGSVSRDILGPAANANA---LDYRKN-----DNRFPN 326 D A N N L FGS DI A N L+ K R N Sbjct: 152 DVRANNTIHNRVQQKQQLEQKLQFGSFRSDIQNVEALLNVNSKLNAAKELEVRLATRNLN 211 Query: 327 PIEANERNSRTVMRAQNQERSSTSNDRVKLADGGS---KTAVAPPPGFLSNSKDARHREA 497 +E++++ + +E+ + K GG+ + PPPGF + + + + Sbjct: 212 GLESDQKFDSQLRTFDLREQDRSGGGWRKQPHGGNYRPQETRMPPPGFSNKPRGGGNWDY 271 Query: 498 GYGRRASDVNEDKGKGNSGQLHKNDRL-------------------SNQLDFPGLPAGSS 620 RR D N +K KGN G+L + L + QLD PG PAGS+ Sbjct: 272 VSRRRELDYNVNKEKGNQGELSNRNALFSSEDKIPRDGDRSRDLGLTGQLDRPGPPAGSN 331 Query: 621 IHSALTFDIEESMKQLHXXXXXXXXXXXXXXXNNDGSEMDDL-ENQVDSLGIEEESGGKN 797 ++S D+E SM + ++G E+D+ E VDSL +E ES GKN Sbjct: 332 LYSVSAADVELSMLNVEAEVVEDG--------KDEGRELDEAGEELVDSLLLEGESDGKN 383 Query: 798 TKK--KHHRDKDYRSDDRGKWIMGQRMRIMKRQTTCRNDINRLSGPLLALVES 950 KK +H R+K+ RSD+RG+ + QRMR++KRQ CR DI+RL+ P LA+ ES Sbjct: 384 DKKQNRHSREKESRSDNRGQRTLSQRMRMLKRQMECRRDIDRLNAPFLAIYES 436 >ref|XP_004229872.1| PREDICTED: uncharacterized protein LOC101244121 [Solanum lycopersicum] Length = 775 Score = 113 bits (283), Expect = 1e-22 Identities = 75/203 (36%), Positives = 104/203 (51%), Gaps = 28/203 (13%) Frame = +3 Query: 426 GSKTAVAPPPGFLSN--SKDARHR------------EAGYGRRASDVNEDKGKGNSGQLH 563 G+ V PPPGF S S+D H G G E K +G+ + 
Sbjct: 261 GTVRGVVPPPGFSSKPRSRDFEHNVDNEKNNFVELNHRGIGLNHKYERESKHLSRNGKNY 320 Query: 564 K----NDRLSNQLDFPGLPAGSSIHSALTFDIEESMKQLHXXXXXXXXXXXXXXXNNDG- 728 + R+ +LD P PAGS +HS L D+E+S +L + G Sbjct: 321 AIGSDDQRVFRRLDSPVPPAGSKLHSVLASDVEDSTLELRGEDAESGEETVSVMRDVLGR 380 Query: 729 ------SEMDDL-ENQVDSLGIEEESGGKNTKKKHH--RDKDYRSDDRGKWIMGQRMRIM 881 SE+D+L E+ + SLG+E+E ++ KK HH RDKDYRSD RG +I+GQRMR++ Sbjct: 381 SSAQGQSELDELGEHVISSLGLEDEPNERSDKKNHHASRDKDYRSDKRGAYILGQRMRML 440 Query: 882 KRQTTCRNDINRLSGPLLALVES 950 KRQ CR+DINR++G LA +S Sbjct: 441 KRQIACRSDINRMNGAFLATFQS 463 >gb|EXC11712.1| Poly(A) RNA polymerase cid11 [Morus notabilis] Length = 703 Score = 111 bits (278), Expect = 4e-22 Identities = 93/285 (32%), Positives = 126/285 (44%), Gaps = 26/285 (9%) Frame = +3 Query: 174 DDARNLSF----NSKPSLPNQP------------GNLMFGSVSRDILGPAANANALDYRK 305 +D R L F NS P+L P L FGS+ +I+ +D Sbjct: 133 EDLRRLGFSGGVNSNPNLNLNPIHGIVQQKNQLEHKLKFGSLPSEIVIIPEALPKVDASN 192 Query: 306 NDNRFPNPIEANERNSRTVMRAQNQERSSTSNDRVKLADGGSKTAVAPPPGFLSNSKDA- 482 +N + +S +R N E T+ PPPGF S K Sbjct: 193 FNNLVDRSRRLSSNSSSNAVRQGNYEHQRTN----------------PPPGFRSKPKRTG 236 Query: 483 -RHREAGYGRRASDVNEDKGK-----GNSGQLHKNDRLSNQLDFPGLPAGSSIHSALTFD 644 H G + D+ + G G + LS QLD PG P+GS++ S L D Sbjct: 237 LNHSIGGENSVSGDLMRTRDVLAEDIGIRGDGSRGLELSAQLDRPGPPSGSNLRSVLASD 296 Query: 645 IEESMKQLHXXXXXXXXXXXXXXXNNDGSEMDDL-ENQVDSLGIEEESGGKNTKKKHH-- 815 +EESM +L G E+DD+ + VDSL IE+ES KN KKH Sbjct: 297 VEESMMKLESDAVEV----------GGGHEIDDIGQRLVDSLLIEDESDDKNETKKHKNS 346 Query: 816 RDKDYRSDDRGKWIMGQRMRIMKRQTTCRNDINRLSGPLLALVES 950 RDKD RSD RG+ ++ QRMR+ KRQ CR+DI+RL +A+V+S Sbjct: 347 RDKDSRSDSRGQRLLSQRMRVYKRQMRCRSDIDRLDDAFIAIVKS 391 >ref|XP_006490961.1| PREDICTED: uncharacterized protein LOC102611932 [Citrus sinensis] Length = 699 Score = 110 bits (274), Expect = 1e-21 Identities = 107/309 (34%), Positives = 146/309 (47%), Gaps = 30/309 (9%) Frame = +3 Query: 114 GFTQSP---SQFDNHLRRILPPD------DDARNLSFNSKPSLPN--QPGNLMFGS--VS 254 GF Q+P S +N +R+L D +A + 
++ PN Q NL FGS V Sbjct: 87 GFPQNPWASSSTENQQQRLLCEDFGRLGFSNANYAAIHNLIQQPNHQQQQNLRFGSFQVQ 146 Query: 255 RDILGPAANANALDYRKNDN-RFPNPIEANERNSRTVMRAQNQERSSTSNDRVKLADGGS 431 D L + L Y + N +F P ++ N + + +N E S + R+ GS Sbjct: 147 PDSLLNLNHLENLKYNLDRNSQFDQPRASSISNPNSFLH-RNLENSREHDLRLGKQHYGS 205 Query: 432 KTAVAPPPGFLSNSKDARHREAGYGRRASDVNEDK-GKGNSGQLHKNDR--LSNQLDFPG 602 PPPGF S AR +G RR + N D + S + + L+ QLD PG Sbjct: 206 ----TPPPGF---SNKARVGGSGNSRRGFEHNVDMINRFTSSAVEGGNGVGLTRQLDRPG 258 Query: 603 LPAGSSIHSALTFDIEESMKQLHXXXXXXXXXXXXXXXNNDG-----SEMDDL-ENQVDS 764 P+GS++HS DIEES+ L N G +MDD E+ VDS Sbjct: 259 PPSGSNLHSVSALDIEESLLDLRREGRERHLGLDKRRENGPGYSQGGDDMDDFGEDLVDS 318 Query: 765 LGIEEESGGKN-----TKKKHH--RDKDYRSDDRGKWIMGQRMRIMKRQTTCRNDINRLS 923 L ++ES KN KKH RDK+ RSD+RGK ++ QRMR +K Q CR DI RL+ Sbjct: 319 LLPDDESELKNDTHERNDKKHRNSRDKEIRSDNRGKRLLSQRMRNLKWQIECRADIGRLN 378 Query: 924 GPLLALVES 950 P LA+ ES Sbjct: 379 APFLAIYES 387 >ref|XP_006445207.1| hypothetical protein CICLE_v10023615mg, partial [Citrus clementina] gi|557547469|gb|ESR58447.1| hypothetical protein CICLE_v10023615mg, partial [Citrus clementina] Length = 1046 Score = 110 bits (274), Expect = 1e-21 Identities = 107/309 (34%), Positives = 146/309 (47%), Gaps = 30/309 (9%) Frame = +3 Query: 114 GFTQSP---SQFDNHLRRILPPD------DDARNLSFNSKPSLPN--QPGNLMFGS--VS 254 GF Q+P S +N +R+L D +A + ++ PN Q NL FGS V Sbjct: 118 GFPQNPWASSSTENQQQRLLCEDFGRLGFSNANYAAIHNLIQQPNHQQQQNLRFGSFQVQ 177 Query: 255 RDILGPAANANALDYRKNDN-RFPNPIEANERNSRTVMRAQNQERSSTSNDRVKLADGGS 431 D L + L Y + N +F P ++ N + + +N E S + R+ GS Sbjct: 178 PDSLLNLNHLENLKYNLDRNSQFDQPRASSISNPNSFLH-RNLENSREHDLRLGKQHYGS 236 Query: 432 KTAVAPPPGFLSNSKDARHREAGYGRRASDVNEDK-GKGNSGQLHKNDR--LSNQLDFPG 602 PPPGF S AR +G RR + N D + S + + L+ QLD PG Sbjct: 237 ----TPPPGF---SNKARVGGSGNSRRGFEHNVDMINRFTSSAVEGGNGVGLTRQLDRPG 289 Query: 603 LPAGSSIHSALTFDIEESMKQLHXXXXXXXXXXXXXXXNNDG-----SEMDDL-ENQVDS 764 P+GS++HS DIEES+ L N G +MDD E+ VDS Sbjct: 290 
PPSGSNLHSVSALDIEESLLDLRREGRERHLGLDKRRENGPGYSQGGDDMDDFGEDLVDS 349 Query: 765 LGIEEESGGKN-----TKKKHH--RDKDYRSDDRGKWIMGQRMRIMKRQTTCRNDINRLS 923 L ++ES KN KKH RDK+ RSD+RGK ++ QRMR +K Q CR DI RL+ Sbjct: 350 LLPDDESELKNDTHERNDKKHRNSRDKEIRSDNRGKRLLSQRMRNLKWQIECRADIGRLN 409 Query: 924 GPLLALVES 950 P LA+ ES Sbjct: 410 APFLAIYES 418 >ref|XP_002301312.2| hypothetical protein POPTR_0002s15230g [Populus trichocarpa] gi|550345065|gb|EEE80585.2| hypothetical protein POPTR_0002s15230g [Populus trichocarpa] Length = 728 Score = 107 bits (268), Expect = 5e-21 Identities = 86/264 (32%), Positives = 128/264 (48%), Gaps = 25/264 (9%) Frame = +3 Query: 234 LMFGSVSRDILGPA---ANANALDYRKNDNRFPNPIEAN---ERNSRTVMRAQNQERS-- 389 L FGS S +I PA NAN + R N +E N E+ + + R ++ R Sbjct: 163 LQFGSFSSEIQSPAEVLVNANLVREVGPGGRSFNGLERNRHLEKQANSNSRRNSEVRQPG 222 Query: 390 ------STSNDRVKLADGGSKTAVAPPPGFLSNSKDARHREAGYGRRASDVNEDKGKGNS 551 + L + +PPPGF + + + + G RR ++N + G+ Sbjct: 223 GSSGGWGNQHRNQHLHQEQHRNYRSPPPGFSNKPRGGGNWDYGSRRRELELNITRENGDY 282 Query: 552 GQLHKND----------RLSNQLDFPGLPAGSSIHSALTFDIEESMKQLHXXXXXXXXXX 701 +++ L+ QLD PG PAGS++HS L +I ES+ L Sbjct: 283 SEMNNEKVRRSEGSVELGLTRQLDRPGPPAGSNLHSVLGSEIGESLINLDGENGEDG--- 339 Query: 702 XXXXXNNDGSEMDDL-ENQVDSLGIEEESGGKNTKKKHHRDKDYRSDDRGKWIMGQRMRI 878 +DG E+DDL E VDSL + +S GK KK+ + K+ RSD+RGK I+ QRMR+ Sbjct: 340 -----KDDGGELDDLGEELVDSLLLNGQSEGKKDKKQSN--KESRSDNRGKKILSQRMRM 392 Query: 879 MKRQTTCRNDINRLSGPLLALVES 950 +K+QT C DI+RL+ LA+ ES Sbjct: 393 LKKQTQCCLDIDRLNAAFLAIYES 416 >ref|XP_006375316.1| hypothetical protein POPTR_0014s06910g, partial [Populus trichocarpa] gi|550323667|gb|ERP53113.1| hypothetical protein POPTR_0014s06910g, partial [Populus trichocarpa] Length = 497 Score = 104 bits (259), Expect = 6e-20 Identities = 89/263 (33%), Positives = 126/263 (47%), Gaps = 24/263 (9%) Frame = +3 Query: 234 LMFGSVSRDILGPA---ANANALDYRKNDNRFPNPIEAN-----ERNSRTVMRAQNQERS 389 L FGS S I PA NAN + +R N +E N + NS + + ++ Sbjct: 174 
LQFGSFSSAIPSPADGLVNANLMREVGPGSRNFNGLERNRHLEKQANSHST-NFEVRQPG 232 Query: 390 STSNDRVKLADGGSKTAVAPPPGFLSNSKDA----------RHREAGYGRRA-----SDV 524 ++S R L + +PPPGF + + R RE + S++ Sbjct: 233 ASSGGRGNLHKEQHQNYKSPPPGFSNKPRGGGGGGNWDHGGRRRELEHTMYREKGDYSEL 292 Query: 525 NEDKGKGNSGQLHKNDRLSNQLDFPGLPAGSSIHSALTFDIEESMKQLHXXXXXXXXXXX 704 N +K + N G + R + QLD PG P GS++HS L +I+ES+ L Sbjct: 293 NNEKARRNEGSVEV--RFTRQLDRPGPPPGSNLHSVLGSEIKESLINLD----------- 339 Query: 705 XXXXNNDGSEMDDL-ENQVDSLGIEEESGGKNTKKKHHRDKDYRSDDRGKWIMGQRMRIM 881 DG +DDL E +DSL +E ES GK KK+ K+ RSD RG I+ QRMR++ Sbjct: 340 ----GEDGGLLDDLGEELMDSLLLEGESDGKKDKKQS--SKESRSDSRGHNILSQRMRML 393 Query: 882 KRQTTCRNDINRLSGPLLALVES 950 KRQ CR DI+RL+ LA+ ES Sbjct: 394 KRQMQCRLDIDRLNAAFLAIYES 416 >ref|XP_007220905.1| hypothetical protein PRUPE_ppa002004mg [Prunus persica] gi|462417367|gb|EMJ22104.1| hypothetical protein PRUPE_ppa002004mg [Prunus persica] Length = 730 Score = 92.4 bits (228), Expect = 2e-16 Identities = 109/392 (27%), Positives = 154/392 (39%), Gaps = 79/392 (20%) Frame = +3 Query: 12 AFSPWSHLPP-PPFA-----EHXXXXXXXXXXXXYSSQSR-------GFTQSP------- 131 A P PP PP+A +H +S+QS GF Q+P Sbjct: 55 AVGPTLPFPPIPPWASSNGRDHLSQLPNPSSSSLWSTQSPPSPFNFLGFPQNPYPSPSPP 114 Query: 132 ---SQFD-NHLRRILPPDDDARNL-SFNSKPSLPNQPGNLM-------------FGSVSR 257 QF N L DD RNL F S + Q NL F + Sbjct: 115 NPFPQFGGNQFPGNLALTDDLRNLVGFQSPSNNALQSQNLAQLKQQHQEQQKLKFSYLPS 174 Query: 258 DILG---PAANANALDYRKN-DNRFPNPIEANERNSRTVMRAQNQERSSTSNDRVKLADG 425 DI+ P AN N N F + N NS + ++ + ++ + G Sbjct: 175 DIIRNPEPPVTANTSSEVSNLSNGFDRSLNLNPNNSSSSNEFRHGNPDTFNSREQERRGG 234 Query: 426 GSKTAV--------APPPGFLSNSKDARHREAGYGRRASDVNEDKGKGNSGQLHKN---- 569 G A PPPGF +NS+ + ++G RR + N D+ + +S + +N Sbjct: 235 GGGGAGRGKQFQRNTPPPGFGNNSRGGGNWDSGSRRRDFEHNVDRERQSSSEFVRNRDAS 294 Query: 570 ---DRL--------------------SNQLDFPGLPAGSSIHSALTFDIEESMKQLHXXX 680 +R+ S QLD PG P G+++HSA +IE+SM L Sbjct: 295 FEDERVRRLASEDSRIRGNGARGLGFSAQLDDPGPPTGANLHSASASEIEKSMMNLQ--- 351 Query: 681 
XXXXXXXXXXXXNNDGSEMDDLENQVDSLGIEEESGGKNTKKKHH--RDKDYRSDDRGKW 854 E DD + E KN K+HH R+KD RSD+RG+ Sbjct: 352 ----------------HEKDD----------KNEEDDKNEAKQHHNSREKDSRSDNRGQH 385 Query: 855 IMGQRMRIMKRQTTCRNDINRLSGPLLALVES 950 ++ QRMRI K Q CR DI+RL+ P LA+ +S Sbjct: 386 LLSQRMRIFKSQMQCRFDIDRLNAPFLAIYDS 417 >ref|XP_006295859.1| hypothetical protein CARUB_v10024989mg [Capsella rubella] gi|482564567|gb|EOA28757.1| hypothetical protein CARUB_v10024989mg [Capsella rubella] Length = 764 Score = 90.9 bits (224), Expect = 7e-16 Identities = 86/304 (28%), Positives = 122/304 (40%), Gaps = 42/304 (13%) Frame = +3 Query: 165 PPDDDARNLSFNSKPSLPNQP------GNLMFGSVSRDILGPAANANALDYRKNDNRFPN 326 PP D R L F S Q GNL + S+ ++ L + L+ D PN Sbjct: 145 PPQSDYRKLVFGSFSGDATQSLNGLRNGNLKYDSIHQEQLMRNPQSVVLNSNPED---PN 201 Query: 327 PIEANERNSRTVMRAQNQERSSTSNDRVKLADGGSKTAVAPPPGFLSN---------SKD 479 + + + R + R G T PPPGF SN SKD Sbjct: 202 -LSHHRNHDLHEQRGGHNGRGGNWGPIGNNVRGFKSTPTPPPPGFSSNQRGWDMNLGSKD 260 Query: 480 ---------ARHREAGYGRRASDVNEDKGKGNSGQLHKNDRLSNQLDFPGLPAGSSIHSA 632 H A + + D+ +G S Q LS Q+D PG P G+S+HS Sbjct: 261 DDRGIGSFQRNHDRAMWEHSNLNAEADRLRGLSLQNESKFNLSQQIDHPGPPKGTSLHSV 320 Query: 633 LTFDIEESMKQLHXXXXXXXXXXXXXXX-------NNDGS-----EMDDL-ENQVDSLGI 773 T D S L+ N+ S E+DD E+ VDSL + Sbjct: 321 STADAANSFSMLNKEARGGSERKDELGQLSKMKREGNEKSGPGDDEIDDFGEDIVDSLLL 380 Query: 774 EEESGGKNTK-----KKHHRDKDYRSDDRGKWIMGQRMRIMKRQTTCRNDINRLSGPLLA 938 E ++ K+ K K R+K+ R D+RG+W++ QR+R K CRNDI+R P +A Sbjct: 381 EVDTDDKDAKDGKKNSKTSREKESRVDNRGRWLLSQRLRERKMYMACRNDIHRYDAPFMA 440 Query: 939 LVES 950 + +S Sbjct: 441 VYKS 444 >ref|XP_004308428.1| PREDICTED: uncharacterized protein LOC101313262 [Fragaria vesca subsp. 
vesca] Length = 699 Score = 89.4 bits (220), Expect = 2e-15 Identities = 83/282 (29%), Positives = 115/282 (40%), Gaps = 43/282 (15%) Frame = +3 Query: 234 LMFGSVSRDI-----LGPAANANALDYRKNDNRFPNPIEANERNSRTVMRAQNQ-ERSST 395 L FG + D+ L AA + + K N + N NS A N+ R++ Sbjct: 129 LKFGYLPGDVIRNPELSSAAPVTSSEIAKLSNGLDRNLHLNSSNSS----ASNEFRRANY 184 Query: 396 SNDRVKLADGGSKTA------VAPPPGFLSNSKDARHREAGYGRRASDVNEDKGKGNSGQ 557 + +L GG PPPGF + + + ++G R + N D+ + +S Sbjct: 185 GSGEGELRGGGGGERGKQVHRTMPPPGFGNKPRGGGNWDSGGRRGGMEYNVDRERQSSSG 244 Query: 558 LHKNDR---------------------------LSNQLDFPGLPAGSSIHSALTFDIEES 656 +N LS QLD PG PAG+++HS +IEES Sbjct: 245 FARNREGSFDNERVRRLAGEDGGMRGNGDGRKGLSAQLDRPGPPAGTNLHSVSASEIEES 304 Query: 657 MKQLHXXXXXXXXXXXXXXXNNDGSEM----DDLENQVDSLGIEEESGGKNTKKKHHRDK 824 M N DG E D V +EEE K K+HH K Sbjct: 305 MM------------------NFDGGERARKDSDGVEDVGQHSLEEERDDKIEGKQHH--K 344 Query: 825 DYRSDDRGKWIMGQRMRIMKRQTTCRNDINRLSGPLLALVES 950 D RSDDRG+ + QRMR KRQT CR DI+R + P L + +S Sbjct: 345 DSRSDDRGQHQLSQRMRSYKRQTLCRFDIDRFNAPFLEIFDS 386 >ref|XP_002880188.1| hypothetical protein ARALYDRAFT_483698 [Arabidopsis lyrata subsp. lyrata] gi|297326027|gb|EFH56447.1| hypothetical protein ARALYDRAFT_483698 [Arabidopsis lyrata subsp. 
lyrata] Length = 757 Score = 82.4 bits (202), Expect = 2e-13 Identities = 94/351 (26%), Positives = 140/351 (39%), Gaps = 72/351 (20%) Frame = +3 Query: 114 GFTQSP------SQFDNHLRRILPPDDDARNLSF----------------NSKPSLPNQP 227 GF Q P +QFD + +R+ P +DA L F P ++ Sbjct: 92 GFPQFPLNPFPTNQFDGN-QRVSP--EDAFRLGFPGTANHAIQSMVQQQQQQLPPPQSEN 148 Query: 228 GNLMFGSVSRDI---LGPAANANALDYRKNDN----RFPNPIEANERNSRTVMRAQNQER 386 L+FGS S D L N N L Y N + R P + +N + + Sbjct: 149 RKLVFGSFSGDATQSLNGLHNGN-LKYDSNQHEQLMRHPQSVLSNSNMDPNLHEPRGSHS 207 Query: 387 SSTSNDRVKLADGGSKTAVAPPPGFLSN---------SKDA--------RHREAGYGRRA 515 + + G K+ PPPGF SN SKD R+ + G + Sbjct: 208 GRGNWGHIGNNGRGFKST-PPPPGFSSNQRGRDMNLTSKDDDRGMGSFHRNHDQAMGEHS 266 Query: 516 S--------DVNEDKGKGNSGQLHKNDRLSNQLDFPGLPAGSSIHSALTFDIEESMKQLH 671 D+ +G S Q LS Q+D PGLP G+S+HS D +S L+ Sbjct: 267 KFWDQSVNFSAEADRLRGLSIQNDSKFNLSQQIDHPGLPKGTSLHSVSAADAADSFSMLN 326 Query: 672 XXXXXXXXXXXXXXXNNDGS------------EMDDL-ENQVDSLGIEEESGGKNTK--- 803 + G E++D E+ V SL +E+E+G K+ K Sbjct: 327 KEARGGSERKEELGRLSKGKREGNANSGPVDDEIEDFGEDIVKSLLLEDETGEKDAKDGK 386 Query: 804 --KKHHRDKDYRSDDRGKWIMGQRMRIMKRQTTCRNDINRLSGPLLALVES 950 K R+KD R D+RG+ ++GQ+ R++K CRNDI+R +A+ +S Sbjct: 387 KDSKTSREKDSRMDNRGQRLLGQKARMVKMYMACRNDIHRYDASFIAVYKS 437 >ref|NP_566048.1| Nucleotidyltransferase family protein [Arabidopsis thaliana] gi|13430538|gb|AAK25891.1|AF360181_1 unknown protein [Arabidopsis thaliana] gi|14532746|gb|AAK64074.1| unknown protein [Arabidopsis thaliana] gi|20197056|gb|AAC06161.2| expressed protein [Arabidopsis thaliana] gi|330255483|gb|AEC10577.1| Nucleotidyltransferase family protein [Arabidopsis thaliana] Length = 764 Score = 79.0 bits (193), Expect = 3e-12 Identities = 92/354 (25%), Positives = 139/354 (39%), Gaps = 75/354 (21%) Frame = +3 Query: 114 GFTQSP------SQFDNHLRRILPPDDDARNLSF-----------------NSKPSLPNQ 224 GF Q P +QFD + +R+ P +DA L F P ++ Sbjct: 95 GFPQFPPSPFTTNQFDGN-QRVSP--EDAYRLGFPGTTNPAIQSMVQQQQQQQLPPPQSE 151 Query: 225 
PGNLMFGSVSRDI---LGPAANANALDYRKNDN----RFPNPIEANERNSRTVMRAQNQE 383 L+FGS S D L N N L Y N + R P +N + +N + Sbjct: 152 TRKLVFGSFSGDATQSLNGLHNGN-LKYDSNQHEQLMRHPQSTLSNSNMDPNLSHHRNHD 210 Query: 384 RSSTSNDRVKLADGG---------SKTAVAPPPGFLSN---------SKD-----ARHRE 494 + G T PPPGF SN SKD R+ + Sbjct: 211 LHEQRGGHSGRGNWGHIGNNGRGLKSTPPPPPPGFSSNQRGWDMSLGSKDDDRGMGRNHD 270 Query: 495 AGYGRRASDVNE--------DKGKGNSGQLHKNDRLSNQLDFPGLPAGSSIHSALTFDIE 650 G + N+ ++ +G S Q LS Q+D PG P G+S+HS D Sbjct: 271 QAMGEHSKVWNQSVDFSAEANRLRGLSIQNESKFNLSQQIDHPGPPKGASLHSVSAADAA 330 Query: 651 ESMKQLHXXXXXXXXXXXXXXX--------NNDGSEMDDL-ENQVDSLGIEEESGGKNTK 803 +S L+ N + E++D E+ V SL +E+E+G K+ Sbjct: 331 DSFSMLNKEARRGGERREELGQLSKAKREGNANSDEIEDFGEDIVKSLLLEDETGEKDAN 390 Query: 804 -----KKHHRDKDYRSDDRGKWIMGQRMRIMKRQTTCRNDINRLSGPLLALVES 950 K R+K+ R D+RG+ ++GQ+ R++K CRNDI+R +A+ +S Sbjct: 391 DGKKDSKTSREKESRVDNRGQRLLGQKARMVKMYMACRNDIHRYDATFIAIYKS 444