BLASTX nr result
ID: Mentha27_contig00003746
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha27_contig00003746
         (1744 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gb|EYU19942.1| hypothetical protein MIMGU_mgv1a006495mg [Mimulus...   559   e-156
ref|XP_006338856.1| PREDICTED: poly(A) RNA polymerase cid11-like...   503   e-139
ref|XP_004240948.1| PREDICTED: poly(A) RNA polymerase GLD2-like ...   493   e-136
ref|XP_002277771.2| PREDICTED: poly(A) RNA polymerase GLD2-like ...   459   e-126
ref|XP_002316267.2| hypothetical protein POPTR_0010s20710g [Popu...   439   e-120
ref|XP_006486408.1| PREDICTED: poly(A) RNA polymerase GLD2-like ...   429   e-117
ref|XP_006435620.1| hypothetical protein CICLE_v10031537mg [Citr...   428   e-117
ref|XP_004503175.1| PREDICTED: poly(A) RNA polymerase GLD2-like ...   426   e-116
gb|EPS67904.1| hypothetical protein M569_06872 [Genlisea aurea]       426   e-116
ref|XP_006601982.1| PREDICTED: poly(A) RNA polymerase cid11-like...   425   e-116
ref|XP_002524216.1| zinc finger protein, putative [Ricinus commu...   420   e-115
ref|XP_007163461.1| hypothetical protein PHAVU_001G236100g [Phas...   419   e-114
ref|XP_007220456.1| hypothetical protein PRUPE_ppa005171mg [Prun...   417   e-114
ref|XP_006591354.1| PREDICTED: poly(A) RNA polymerase cid11-like...   416   e-113
ref|XP_007009280.1| Zinc finger protein, putative [Theobroma cac...   411   e-112
ref|XP_004150639.1| PREDICTED: poly(A) RNA polymerase GLD2-like ...   408   e-111
ref|XP_006601987.1| PREDICTED: poly(A) RNA polymerase cid11-like...   383   e-103
ref|XP_002879814.1| hypothetical protein ARALYDRAFT_321659 [Arab...   379   e-102
ref|NP_181504.2| HEN1 suppressor 1 [Arabidopsis thaliana] gi|538...
379 e-102 dbj|BAE99845.1| hypothetical protein [Arabidopsis thaliana] 376 e-101 >gb|EYU19942.1| hypothetical protein MIMGU_mgv1a006495mg [Mimulus guttatus] Length = 442 Score = 559 bits (1441), Expect = e-156 Identities = 282/441 (63%), Positives = 352/441 (79%), Gaps = 21/441 (4%) Frame = +2 Query: 122 MNGFNQVELTLQNILQAINPSNEDWAVRFHIINEVRAVVGSVESLRGATVEPYGSFVSNL 301 MN +N ++LT+++IL+ INPSN+DW+ RF +INE+RA+VGS+E+LRGATVEP+GSF SNL Sbjct: 5 MNRYNLLDLTIRDILRVINPSNDDWSFRFQMINEIRAIVGSIENLRGATVEPFGSFASNL 64 Query: 302 FTKWGDLDLSIELQNGSYISIPGKRHKQTLLADVIRALRKKGGYGRLHLISNARVPILKF 481 FTKWGDLD+SIELQNG+YIS PGK+HKQ++L +V++A RKKGG+ +L I+NARVPILKF Sbjct: 65 FTKWGDLDISIELQNGTYISSPGKKHKQSVLQEVLKAFRKKGGFRKLKFIANARVPILKF 124 Query: 482 EGNYKISCDISVSNLCGQMKSKILFWVNGIDGRFRDLVMLVKEWAKAHHINDSKSGTLNS 661 EG+Y ISCDIS++NL GQMKSKILFW+N IDGRFRDLVMLVKEWAKAHHINDSKSGTLNS Sbjct: 125 EGSYNISCDISINNLSGQMKSKILFWINEIDGRFRDLVMLVKEWAKAHHINDSKSGTLNS 184 Query: 662 YSLSLLVLFHFQTLEPAILPPLKEIYPGNMNDDLTGVRATAEKNIEDTCAASIKRILSDK 841 YSLSLLV+FH QTL PAILPPL+EIYPGNM DDLTGVR AEKNIED CAA+I RI SD+ Sbjct: 185 YSLSLLVIFHLQTLVPAILPPLREIYPGNMIDDLTGVRTVAEKNIEDICAANIHRIRSDR 244 Query: 842 SRPINQXXXXXXXXXXXAKFSDICSRASTRGISPFMGQMEDINSNTRWLPKTYALFVEDP 1021 SR IN+ KF+DICSRAST+GI P+ GQ+EDI++N RWLP+TYALFVEDP Sbjct: 245 SRLINRSTLSALFISFLTKFADICSRASTQGICPYSGQLEDIHTNMRWLPRTYALFVEDP 304 Query: 1022 FEQPANTARTVSSNQLIKIAEAIQLTHQMLASSNHNQASLLSVLVGPHVS--WLGP---- 1183 FEQPANTARTVSSNQLI+I++AIQ TH +L ++N ++ L+ VL GPH+S ++ P Sbjct: 305 FEQPANTARTVSSNQLIRISQAIQATHGILVAANQDRTCLIPVLAGPHISCFFMRPSVPA 364 Query: 1184 ---------RTRQSTSI----ASNNRNGNKNRPERKQPPFQTQ-KSQNRAMDK-PQNGET 1318 RT ST + +S++RNG++ P +++ +TQ SQ++ +DK P Sbjct: 365 PPLFNQFPSRTSSSTQLPSRASSSSRNGHRTHPPQRK---KTQDSSQDKRLDKRPTEPSK 421 Query: 1319 NGSKHASTSQKQQVWRPRSET 1381 + + STSQKQQVWRPRSE+ Sbjct: 422 SPTPPPSTSQKQQVWRPRSES 442 >ref|XP_006338856.1| PREDICTED: poly(A) RNA polymerase cid11-like isoform X1 [Solanum tuberosum] gi|565343469|ref|XP_006338857.1| PREDICTED: poly(A) RNA 
polymerase cid11-like isoform X2 [Solanum tuberosum] gi|565343471|ref|XP_006338858.1| PREDICTED: poly(A) RNA polymerase cid11-like isoform X3 [Solanum tuberosum] gi|565343473|ref|XP_006338859.1| PREDICTED: poly(A) RNA polymerase cid11-like isoform X4 [Solanum tuberosum] Length = 453 Score = 503 bits (1295), Expect = e-139 Identities = 264/456 (57%), Positives = 322/456 (70%), Gaps = 37/456 (8%) Frame = +2 Query: 122 MNGFNQVELTLQNILQAINPSNEDWAVRFHIINEVRAVVGSVESLRGATVEPYGSFVSNL 301 MN ++ +E TLQNIL +INP EDW++RF +I+E+RAVV S+E LRGATVEP+GSFVSNL Sbjct: 1 MNCYSLLEHTLQNILHSINPLEEDWSMRFQLIHELRAVVESIEILRGATVEPFGSFVSNL 60 Query: 302 FTKWGDLDLSIELQNGSYISIPGKRHKQTLLADVIRALRKKGGYGRLHLISNARVPILKF 481 FT+WGDLD+SIEL NGS+IS GK++K +LL DV++ALR KGG +L I+NARVPILKF Sbjct: 61 FTRWGDLDISIELPNGSHISAAGKKYKLSLLGDVLKALRAKGGCRKLQFITNARVPILKF 120 Query: 482 EGNYKISCDISVSNLCGQMKSKILFWVNGIDGRFRDLVMLVKEWAKAHHINDSKSGTLNS 661 +GNY ISCDIS++NL GQMKSKIL+W+N IDGRFRD+V+LVKEWAKAH+INDSK+GTLNS Sbjct: 121 QGNYNISCDISINNLSGQMKSKILYWINMIDGRFRDMVLLVKEWAKAHNINDSKTGTLNS 180 Query: 662 YSLSLLVLFHFQTLEPAILPPLKEIYPGNMNDDLTGVRATAEKNIEDTCAASIKRILSDK 841 YSLSLLV+FHFQT PAILPPLKEIYPG+M DDLTGVRA+AEK IE+TCA +I R++S+K Sbjct: 181 YSLSLLVVFHFQTCVPAILPPLKEIYPGSMVDDLTGVRASAEKFIEETCAMNINRLMSNK 240 Query: 842 SRPINQXXXXXXXXXXXAKFSDICSRASTRGISPFMGQMEDINSNTRWLPKTYALFVEDP 1021 SR IN+ AKF DI SRAS +GISPF GQ EDI SN RWLPKTY +FVEDP Sbjct: 241 SRAINRSYLSELFISFIAKFCDISSRASAQGISPFTGQWEDIVSNMRWLPKTYTIFVEDP 300 Query: 1022 FEQPANTARTVSSNQLIKIAEAIQLTHQMLASSNHNQASLLSVLVGPHVSWLGPRTRQST 1201 FEQP N+AR VS+ QL +I EA + TH ML SSN N+ ++S LV PHVS R T Sbjct: 301 FEQPLNSARGVSTKQLTRIEEAFRSTHFMLCSSNQNENEIISTLVKPHVSKFVAR----T 356 Query: 1202 SIASNNRNGNKNRPERK-------------------------------------QPPFQT 1270 S NN + N RP+ + PP Q Sbjct: 357 SGNQNNYSRNGLRPQLQAQRAIKPPFQAHHQRQAQRAIHPPLRAQHQPQAQRPINPPLQA 416 Query: 1271 QKSQNRAMDKPQNGETNGSKHASTSQKQQVWRPRSE 1378 + Q++ M++ QN G A Q Q VWRP+S+ Sbjct: 417 HQLQDKRMNRNQNSTAQGPTQAIRVQTQTVWRPKSD 452 >ref|XP_004240948.1| 
PREDICTED: poly(A) RNA polymerase GLD2-like [Solanum lycopersicum] Length = 453 Score = 493 bits (1269), Expect = e-136 Identities = 260/457 (56%), Positives = 321/457 (70%), Gaps = 39/457 (8%) Frame = +2 Query: 122 MNGFNQVELTLQNILQAINPSNEDWAVRFHIINEVRAVVGSVESLRGATVEPYGSFVSNL 301 MNG++ +E TLQNIL +INPS EDW++RF +I+E+RAVV S+E LRGATVEP+GSFVSNL Sbjct: 1 MNGYSLLEHTLQNILHSINPSEEDWSMRFQLIHELRAVVESIEILRGATVEPFGSFVSNL 60 Query: 302 FTKWGDLDLSIELQNGSYISIPGKRHKQTLLADVIRALRKKGGYGRLHLISNARVPILKF 481 FT+WGD+D+SIEL NG +IS GK++K +LL DV++ALR KGG +L I+NARVPILKF Sbjct: 61 FTRWGDVDISIELPNGLHISAAGKKYKLSLLGDVLKALRAKGGCRKLQFITNARVPILKF 120 Query: 482 EGNYKISCDISVSNLCGQMKSKILFWVNGIDGRFRDLVMLVKEWAKAHHINDSKSGTLNS 661 +GN ISCDIS++NL GQMKSKIL+W+N IDGRFRD+V+LVKEWAKAH+INDSK+GTLNS Sbjct: 121 QGNNNISCDISINNLSGQMKSKILYWINMIDGRFRDMVLLVKEWAKAHNINDSKTGTLNS 180 Query: 662 YSLSLLVLFHFQTLEPAILPPLKEIYPGNMNDDLTGVRATAEKNIEDTCAASIKRILSDK 841 YSLSLLV+FH QT PAILPPLKEIYPG+M DDLTGVRA+AEK IE+TCA +I R++S+K Sbjct: 181 YSLSLLVVFHLQTCVPAILPPLKEIYPGSMVDDLTGVRASAEKFIEETCAMNINRLMSNK 240 Query: 842 SRPINQXXXXXXXXXXXAKFSDICSRASTRGISPFMGQMEDINSNTRWLPKTYALFVEDP 1021 SR IN+ AKF +I SRAS +GISPF GQ EDI SN RWLPKTY +FVEDP Sbjct: 241 SRVINRSSLSELFISFIAKFCNISSRASAQGISPFTGQWEDIVSNMRWLPKTYTIFVEDP 300 Query: 1022 FEQPANTARTVSSNQLIKIAEAIQLTHQMLASSNHNQASLLSVLVGPHVSWLGPRTRQST 1201 FEQP N+AR VS+ QL +I EA + TH ML SSN N+ ++S LV PHVS R Sbjct: 301 FEQPLNSARGVSTKQLTRIEEAFRSTHFMLCSSNLNENEVISTLVKPHVSKFVAR----- 355 Query: 1202 SIASNNRNGNKN--RPERK-------------------------------------QPPF 1264 I+ N N ++N RP+ + PP Sbjct: 356 -ISGNQNNYSRNGLRPQLQGQRAIHPPLQAHHQRQAQRAIHPPLRAQHQPQAQRPINPPL 414 Query: 1265 QTQKSQNRAMDKPQNGETNGSKHASTSQKQQVWRPRS 1375 Q + Q++ M++ QN G A Q Q VWRP+S Sbjct: 415 QAHQLQDKRMNRNQNSTVQGPTQAIRVQTQTVWRPKS 451 >ref|XP_002277771.2| PREDICTED: poly(A) RNA polymerase GLD2-like [Vitis vinifera] gi|296086183|emb|CBI31624.3| unnamed protein product [Vitis vinifera] Length = 453 Score = 459 bits (1181), Expect = e-126 Identities = 247/452 
(54%), Positives = 308/452 (68%), Gaps = 33/452 (7%) Frame = +2 Query: 122 MNGFNQVELTLQNILQAINPSNEDWAVRFHIINEVRAVVGSVESLRGATVEPYGSFVSNL 301 M+ FN +E+ L++IL INPS EDWA+R +I + R V SVESLRGATVEP+GSF+SNL Sbjct: 1 MSTFNVLEIVLKDILLVINPSREDWAIRNQLIADFRTAVDSVESLRGATVEPFGSFLSNL 60 Query: 302 FTKWGDLDLSIELQNGSYISIPGKRHKQTLLADVIRALRKKGGYGRLHLISNARVPILKF 481 +T+WGDLD+SIEL NG+YIS GKRHKQTLL V+ ALR KGG+ +L I NARVPI+KF Sbjct: 61 YTQWGDLDISIELPNGAYISSAGKRHKQTLLGHVLNALRSKGGWRKLQFIPNARVPIIKF 120 Query: 482 EGNY-KISCDISVSNLCGQMKSKILFWVNGIDGRFRDLVMLVKEWAKAHHINDSKSGTLN 658 E + ISCD+S++NL GQMKSK LFW++GIDGRFRDLV+LVKEWA+AH IN+SK+GTLN Sbjct: 121 ESYHPNISCDVSINNLKGQMKSKFLFWISGIDGRFRDLVLLVKEWARAHDINNSKTGTLN 180 Query: 659 SYSLSLLVLFHFQTLEPAILPPLKEIYPGNMNDDLTGVRATAEKNIEDTCAASIKRILSD 838 SYSLSLLV+FH QT PAILPPLKEIYPGN+ DDL GVRA E IE+T AA+I R D Sbjct: 181 SYSLSLLVVFHLQTCRPAILPPLKEIYPGNVADDLIGVRAVVEGQIEETSAANINRFKRD 240 Query: 839 KSRPINQXXXXXXXXXXXAKFSDICSRASTRGISPFMGQMEDINSNTRWLPKTYALFVED 1018 +SR N+ AKF DI SRAS +GI P+ GQ DI+SN RW+P+TY LFVED Sbjct: 241 RSRAPNRSSLSELFISFLAKFVDITSRASEQGICPYTGQWVDIDSNMRWMPRTYELFVED 300 Query: 1019 PFEQPANTARTVSSNQLIKIAEAIQLTHQMLASSNHNQASLLSVLVGPHVSWLGPR--TR 1192 PFEQP NTAR V S QL +I+EA Q THQ L S+N +Q SL+ LV P ++ R +R Sbjct: 301 PFEQPENTARGVRSRQLQRISEAFQTTHQRLTSANQDQHSLIDTLVRPQIAQFIRRAPSR 360 Query: 1193 QSTSIASNN-----------------RNGNKNRPERKQPPFQTQKS------------QN 1285 S++ NN +N +NR + +P +Q+S Q Sbjct: 361 NSSAYGRNNSRTYPSVPNVANSPLQFQNDFQNRRPQSRPNTTSQRSAPVQARPNSVTMQR 420 Query: 1286 RAMDKPQNGETNGS-KHASTSQKQQVWRPRSE 1378 +P + S + A+ SQ Q+VWRPRS+ Sbjct: 421 SMYTRPGSSTVQRSVQQATQSQSQRVWRPRSD 452 >ref|XP_002316267.2| hypothetical protein POPTR_0010s20710g [Populus trichocarpa] gi|566191879|ref|XP_006378690.1| hypothetical protein POPTR_0010s20710g [Populus trichocarpa] gi|566191881|ref|XP_006378691.1| hypothetical protein POPTR_0010s20710g [Populus trichocarpa] gi|566191883|ref|XP_006378692.1| hypothetical protein POPTR_0010s20710g [Populus trichocarpa] 
gi|550330240|gb|EEF02438.2| hypothetical protein POPTR_0010s20710g [Populus trichocarpa] gi|550330241|gb|ERP56487.1| hypothetical protein POPTR_0010s20710g [Populus trichocarpa] gi|550330242|gb|ERP56488.1| hypothetical protein POPTR_0010s20710g [Populus trichocarpa] gi|550330243|gb|ERP56489.1| hypothetical protein POPTR_0010s20710g [Populus trichocarpa] Length = 493 Score = 439 bits (1130), Expect = e-120 Identities = 224/361 (62%), Positives = 270/361 (74%) Frame = +2 Query: 122 MNGFNQVELTLQNILQAINPSNEDWAVRFHIINEVRAVVGSVESLRGATVEPYGSFVSNL 301 MN + +E TL++IL I P EDW VRF +I E+ VV SVESLRG+TVEP+GSFVSNL Sbjct: 1 MNTYRVLEPTLKDILNGIQPLREDWVVRFKVIEELEDVVKSVESLRGSTVEPFGSFVSNL 60 Query: 302 FTKWGDLDLSIELQNGSYISIPGKRHKQTLLADVIRALRKKGGYGRLHLISNARVPILKF 481 FT+WGDLD+SI L NGSYIS GKR KQ LL DV++ALR++GG+ RL I NARVPILKF Sbjct: 61 FTRWGDLDISIVLSNGSYISSAGKRRKQNLLEDVLKALRQRGGWQRLQFIPNARVPILKF 120 Query: 482 EGNYKISCDISVSNLCGQMKSKILFWVNGIDGRFRDLVMLVKEWAKAHHINDSKSGTLNS 661 E N ISCD+S+ N+ G MKSK LFW+N ID RFRD+V+LVKEWAK H+IN+ K+G+LNS Sbjct: 121 E-NASISCDVSIDNMQGLMKSKFLFWINEIDRRFRDMVLLVKEWAKTHNINNPKTGSLNS 179 Query: 662 YSLSLLVLFHFQTLEPAILPPLKEIYPGNMNDDLTGVRATAEKNIEDTCAASIKRILSDK 841 YSLSLLV+FHFQT PAILPPLKEIYP N+ DDLTGVR AE+ I + CAA+I R S+K Sbjct: 180 YSLSLLVIFHFQTCVPAILPPLKEIYPRNVIDDLTGVRTDAERRIGEICAANISRYRSNK 239 Query: 842 SRPINQXXXXXXXXXXXAKFSDICSRASTRGISPFMGQMEDINSNTRWLPKTYALFVEDP 1021 SR IN+ KF DI S+A+ GI PF G+ E+I SNTRWLP+TYALF+EDP Sbjct: 240 SRAINRNSLSELFISFLTKFYDISSKATELGICPFTGKWEEIRSNTRWLPRTYALFIEDP 299 Query: 1022 FEQPANTARTVSSNQLIKIAEAIQLTHQMLASSNHNQASLLSVLVGPHVSWLGPRTRQST 1201 FEQP NTAR VS+ L+KI+EAIQ TH L ++N NQ S L +LV P +S + T S Sbjct: 300 FEQPENTARAVSAANLMKISEAIQTTHHRLVTANQNQISFLGMLVRPRISRIIAGTPASN 359 Query: 1202 S 1204 S Sbjct: 360 S 360 >ref|XP_006486408.1| PREDICTED: poly(A) RNA polymerase GLD2-like isoform X1 [Citrus sinensis] gi|568866114|ref|XP_006486409.1| PREDICTED: poly(A) RNA polymerase GLD2-like isoform X2 [Citrus sinensis] Length = 445 Score = 429 bits (1104), 
Expect = e-117 Identities = 226/431 (52%), Positives = 291/431 (67%), Gaps = 12/431 (2%) Frame = +2 Query: 122 MNGFNQVELTLQNILQAINPSNEDWAVRFHIINEVRAVVGSVESLRGATVEPYGSFVSNL 301 M +N +E L++IL +NP EDW R +I+++R VV SVESLRGATVEP+GSFVSNL Sbjct: 1 MGSYNVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNL 60 Query: 302 FTKWGDLDLSIELQNGSYISIPGKRHKQTLLADVIRALRKKGGYGRLHLISNARVPILKF 481 F++WGDLD+SIEL NGS IS GK+ KQ+LL D++RALR+KGGY RL +++ARVPILKF Sbjct: 61 FSRWGDLDISIELSNGSCISSTGKKLKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKF 120 Query: 482 EGNYK-ISCDISVSNLCGQMKSKILFWVNGIDGRFRDLVMLVKEWAKAHHINDSKSGTLN 658 E ++ ISCDIS+ NLCGQ+KSK LFW++ IDGRFRD+V+LVKEWAKAH IN+ K+GT N Sbjct: 121 ETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFN 180 Query: 659 SYSLSLLVLFHFQTLEPAILPPLKEIYPGNMNDDLTGVRATAEKNIEDTCAASIKRILSD 838 SYSLSLLVLFHFQT PAILPPLK+IYPGN+ DDL GVRA E+ I + CA +I R SD Sbjct: 181 SYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANVERQIAEICAFNIARFSSD 240 Query: 839 KSRPINQXXXXXXXXXXXAKFSDICSRASTRGISPFMGQMEDINSNTRWLPKTYALFVED 1018 K R IN+ KFS + +AS GI PF GQ E I SNTRWLP + LF+ED Sbjct: 241 KYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIED 300 Query: 1019 PFEQPANTARTVSSNQLIKIAEAIQLTHQMLASSNHNQASLLSVLVGPHV-SWLGPRTRQ 1195 PFEQP N+AR VS L KI+ A ++TH L S+N + +LLS L P++ + G + Sbjct: 301 PFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLARPYILQFFGESPVR 360 Query: 1196 STSIASNNRNGNKNRPERKQPPFQTQ-KSQNRAMDKPQN---GETNGSKHAS------TS 1345 + + +R + P Q Q +S N + N + + +H S Sbjct: 361 YANYNNGHRRARPQSHKSVNSPLQAQHQSHNARRENRPNRPMSQQSVQQHQSQPVRQNNG 420 Query: 1346 QKQQVWRPRSE 1378 Q Q++WRP+S+ Sbjct: 421 QVQRIWRPKSD 431 >ref|XP_006435620.1| hypothetical protein CICLE_v10031537mg [Citrus clementina] gi|557537816|gb|ESR48860.1| hypothetical protein CICLE_v10031537mg [Citrus clementina] Length = 445 Score = 428 bits (1101), Expect = e-117 Identities = 225/431 (52%), Positives = 291/431 (67%), Gaps = 12/431 (2%) Frame = +2 Query: 122 MNGFNQVELTLQNILQAINPSNEDWAVRFHIINEVRAVVGSVESLRGATVEPYGSFVSNL 301 M +N +E L++IL +NP EDW 
R +I+++R VV SVESLRGATVEP+GSFVSNL Sbjct: 1 MGSYNVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNL 60 Query: 302 FTKWGDLDLSIELQNGSYISIPGKRHKQTLLADVIRALRKKGGYGRLHLISNARVPILKF 481 F++WGDLD+SIEL NGS IS GK+ KQ+LL D++RALR+KGGY RL +++ARVPILKF Sbjct: 61 FSRWGDLDISIELSNGSCISSTGKKLKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKF 120 Query: 482 EGNYK-ISCDISVSNLCGQMKSKILFWVNGIDGRFRDLVMLVKEWAKAHHINDSKSGTLN 658 E ++ ISCDIS+ NLCGQ+KSK LFW++ IDGRFRD+V+LVKEWAKAH IN+ K+GT N Sbjct: 121 ETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFN 180 Query: 659 SYSLSLLVLFHFQTLEPAILPPLKEIYPGNMNDDLTGVRATAEKNIEDTCAASIKRILSD 838 SYSLSLLVLFHFQT PAILPPLK+IYPGN+ DDL GVRA E+ I + CA +I R SD Sbjct: 181 SYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANVERQIAEICAFNIARFSSD 240 Query: 839 KSRPINQXXXXXXXXXXXAKFSDICSRASTRGISPFMGQMEDINSNTRWLPKTYALFVED 1018 K R IN+ KFS + ++S GI PF GQ E I SNTRWLP + LF+ED Sbjct: 241 KYRKINRSSLAHLFVSFLEKFSGLSLKSSELGICPFTGQWEHIRSNTRWLPNNHPLFIED 300 Query: 1019 PFEQPANTARTVSSNQLIKIAEAIQLTHQMLASSNHNQASLLSVLVGPHV-SWLGPRTRQ 1195 PFEQP N+AR VS L KI+ A ++TH L S+N + +LLS L P++ + G + Sbjct: 301 PFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLARPYILQFFGESPVR 360 Query: 1196 STSIASNNRNGNKNRPERKQPPFQTQ-KSQNRAMDKPQN---GETNGSKHAS------TS 1345 + + +R + P Q Q +S N + N + + +H S Sbjct: 361 YANYNNGHRRARPQSHKSVNSPLQAQHQSHNARRENRPNRPMSQQSVQQHQSQPVRQNNG 420 Query: 1346 QKQQVWRPRSE 1378 Q Q++WRP+S+ Sbjct: 421 QVQRIWRPKSD 431 >ref|XP_004503175.1| PREDICTED: poly(A) RNA polymerase GLD2-like [Cicer arietinum] Length = 491 Score = 426 bits (1096), Expect = e-116 Identities = 231/431 (53%), Positives = 292/431 (67%), Gaps = 5/431 (1%) Frame = +2 Query: 122 MNGFNQVELTLQNILQAINPSNEDWAVRFHIINEVRAVVGSVESLRGATVEPYGSFVSNL 301 M+ N + L +ILQ I PS EDWA+RF IIN++R++ SV+SLRGATVEP+GSFVSNL Sbjct: 1 MSTHNMLGNVLNDILQVITPSQEDWAIRFAIINDLRSIAESVQSLRGATVEPFGSFVSNL 60 Query: 302 FTKWGDLDLSIELQNGSYISIPGKRHKQTLLADVIRALRKKGGYGRLHLISNARVPILKF 481 FT+WGD+D+SIEL NGS+I+ G++ KQTLL D +R LR KGGY + LI NARVPILKF Sbjct: 61 
FTRWGDVDISIELLNGSHIASVGRKQKQTLLGDFLRVLRLKGGYMNMQLILNARVPILKF 120 Query: 482 EGNYK-ISCDISVSNLCGQMKSKILFWVNGIDGRFRDLVMLVKEWAKAHHINDSKSGTLN 658 + ISCD+S++NL G MKSK L W+N IDGRF D+V++VKEWAKAH IN+S++G+ N Sbjct: 121 RSKQQGISCDVSINNLPGLMKSKFLLWINRIDGRFHDMVLVVKEWAKAHRINNSRTGSFN 180 Query: 659 SYSLSLLVLFHFQTLEPAILPPLKEIYPGNMNDDLTGVRATAEKNIEDTCAASIKRILSD 838 SYSLSLLV+FHFQT PAILPPLK+IYP NM D+L GVRA E I +TC A+I R +SD Sbjct: 181 SYSLSLLVIFHFQTCAPAILPPLKDIYPANMVDELRGVRADVENLISETCGANINRFISD 240 Query: 839 KSRPINQXXXXXXXXXXXAKFSDICSRASTRGISPFMGQMEDINSNTRWLPKTYALFVED 1018 KSR IN+ KF+ + S AS GI P+ GQ E I +N RWLPKTYA+FVED Sbjct: 241 KSRTINRKSVPELFIDFLRKFAQMDSWASELGICPYSGQREQIKNNMRWLPKTYAIFVED 300 Query: 1019 PFEQPANTARTVSSNQLIKIAEAIQLTHQMLASSNHNQASLLSVLVGPHVSWL--GPRTR 1192 PFEQP N+AR+VS+ QL KIAEA T+ +L S N NQ SLL+ L PH++ + GP Sbjct: 301 PFEQPENSARSVSAGQLRKIAEAFLKTYSLLTSKNQNQNSLLACLAPPHIARIIGGPAIP 360 Query: 1193 QSTSIASNNRNGNKNRPERKQPPFQTQKS-QNRAMDKPQNGETN-GSKHASTSQKQQVWR 1366 +S + + P Q+Q QN NG T+ S + STS+ + Sbjct: 361 SYSSGYFHPTQPQQQVQRGVLPHPQSQHHFQNVRKGARANGSTSKASTNGSTSRANTI-- 418 Query: 1367 PRSETKAEKNG 1399 S +KA NG Sbjct: 419 -GSTSKASANG 428 >gb|EPS67904.1| hypothetical protein M569_06872 [Genlisea aurea] Length = 413 Score = 426 bits (1094), Expect = e-116 Identities = 231/416 (55%), Positives = 290/416 (69%), Gaps = 11/416 (2%) Frame = +2 Query: 119 EMNGFNQVELTLQNILQAINPSNEDWAVRFHIINEVRAVVGSVESLRGATVEPYGSFVSN 298 E+N ++ + +IL INPS +DW+ RF +I E++ VV SVESLRGA V+PYGSFVSN Sbjct: 5 EINTVRFLDTAIGDILCVINPSKDDWSARFFVIKEIQDVVRSVESLRGALVQPYGSFVSN 64 Query: 299 LFTKWGDLDLSIELQNGSYISIPGKRHKQTLLADVIRALRKKGGYGRLHLISNARVPILK 478 LF+K GDLD+SI+LQ+GS+IS PGK+ KQ+LL D+ ALRKKG + R+ I NARVPILK Sbjct: 65 LFSKEGDLDISIDLQHGSFISSPGKKQKQSLLKDLSTALRKKGQFLRVQCIPNARVPILK 124 Query: 479 FEGNYKISCDISVSNLCGQMKSKILFWVNGIDGRFRDLVMLVKEWAKAHHINDSKSGTLN 658 + + ISCDISV NL G+MKS +L+W+N IDGRFRDLV KEWAK H INDS++G+ N Sbjct: 125 LDTVFNISCDISVCNLSGEMKSIMLYWINEIDGRFRDLV---KEWAKTHQINDSRNGSFN 181 Query: 659 
SYSLSLLVLFHFQTLEPAILPPLKEIYPGNMNDDLTGVRATAEKNIEDTCAASIKRILSD 838 SYSL+LLV+FH QTLEPAILPPLKEIYPGNM++ LTG R A KN+ED CA +IKRI D Sbjct: 182 SYSLTLLVIFHLQTLEPAILPPLKEIYPGNMSETLTGERNVAVKNVEDICAVNIKRIKMD 241 Query: 839 KSRPINQXXXXXXXXXXXAKFSDICSRASTRGISPFMGQMEDINSNTRWLPKTYALFVED 1018 KSR N+ AK S+IC AST+GISPF GQ EDI+SNT W PKTYA+FVED Sbjct: 242 KSRWTNRSSLSHLFISFLAKLSEICCEASTKGISPFAGQSEDISSNTSWQPKTYAVFVED 301 Query: 1019 PFEQPANTARTVSSNQLIKIAEAIQLTHQMLASSNHNQ---------ASLLSVLVGP--H 1165 PFEQPANTARTV+S QL KI EAI+ T ++ S+NH++ +S LSV V P + Sbjct: 302 PFEQPANTARTVNSKQLEKILEAIKSTQAVVLSANHHRDDNKATVTTSSFLSVSVAPNHN 361 Query: 1166 VSWLGPRTRQSTSIASNNRNGNKNRPERKQPPFQTQKSQNRAMDKPQNGETNGSKH 1333 PR R+ +NG+KN P T ++ + K G TN +++ Sbjct: 362 EKEAVPRRRRMMM----PQNGSKNVKAAAAPSANTGVAKTKRWLKVLFGPTNQTRY 413 >ref|XP_006601982.1| PREDICTED: poly(A) RNA polymerase cid11-like isoform X1 [Glycine max] gi|571542766|ref|XP_006601983.1| PREDICTED: poly(A) RNA polymerase cid11-like isoform X2 [Glycine max] gi|571542770|ref|XP_006601984.1| PREDICTED: poly(A) RNA polymerase cid11-like isoform X3 [Glycine max] gi|571542774|ref|XP_006601985.1| PREDICTED: poly(A) RNA polymerase cid11-like isoform X4 [Glycine max] Length = 415 Score = 425 bits (1092), Expect = e-116 Identities = 222/425 (52%), Positives = 283/425 (66%), Gaps = 7/425 (1%) Frame = +2 Query: 122 MNGFNQVELTLQNILQAINPSNEDWAVRFHIINEVRAVVGSVESLRGATVEPYGSFVSNL 301 M+ + +++ + +IL+ + P EDW +RF IIN++R++V SVESLRGATVEP+GSFVSNL Sbjct: 1 MSTHSTLDIVVNDILRVVTPVQEDWEIRFAIINDLRSIVESVESLRGATVEPFGSFVSNL 60 Query: 302 FTKWGDLDLSIELQNGSYISIPGKRHKQTLLADVIRALRKKGGYGRLHLISNARVPILKF 481 FT+WGDLD+SIEL NG +IS GK+ KQT L DV++ALR KGG L ISNARVPILKF Sbjct: 61 FTRWGDLDISIELSNGLHISSAGKKQKQTFLGDVLKALRMKGGGSNLQFISNARVPILKF 120 Query: 482 EGNYK-ISCDISVSNLCGQMKSKILFWVNGIDGRFRDLVMLVKEWAKAHHINDSKSGTLN 658 + + +SCDIS++NL GQMKSKIL W+N IDGRFR +V+LVKEWAKAH IN+SK+GT N Sbjct: 121 KSYRQGVSCDISINNLPGQMKSKILLWINKIDGRFRHMVLLVKEWAKAHKINNSKAGTFN 180 Query: 659 
SYSLSLLVLFHFQTLEPAILPPLKEIYPGNMNDDLTGVRATAEKNIEDTCAASIKRILSD 838 SYSLSLLV+F+FQT PAI PPLK+IYPGNM DDL GVR+ AE I TC A+I R +S+ Sbjct: 181 SYSLSLLVIFYFQTCIPAIFPPLKDIYPGNMVDDLIGVRSDAENLIAQTCDANINRFISN 240 Query: 839 KSRPINQXXXXXXXXXXXAKFSDICSRASTRGISPFMGQMEDINSNTRWLPKTYALFVED 1018 ++R IN+ KF+ + S A GI P+ G+ E I N WLPKTYA+FVED Sbjct: 241 RARSINRKSVAELFVEFIGKFAKMDSMAVKMGICPYSGKWEQIEDNMIWLPKTYAIFVED 300 Query: 1019 PFEQPANTARTVSSNQLIKIAEAIQLTHQMLASSNHNQASLLSVLVGPHVSWLGPRTRQS 1198 PFEQP NTAR+VS+ QL KI EA TH +L S+N NQ SLLS + HV Sbjct: 301 PFEQPQNTARSVSAGQLKKITEAFARTHDLLTSTNQNQISLLSNMAPAHV---------- 350 Query: 1199 TSIASNNRNGNKNRPERKQ------PPFQTQKSQNRAMDKPQNGETNGSKHASTSQKQQV 1360 + G P + Q P Q+Q+ + ++ H + QQ+ Sbjct: 351 IRCITRPYGGGYFHPTQPQVQRAIRPQLQSQRHFQNVSQGTSSNSSSSKGHTLVHRGQQI 410 Query: 1361 WRPRS 1375 WRP+S Sbjct: 411 WRPKS 415 >ref|XP_002524216.1| zinc finger protein, putative [Ricinus communis] gi|223536493|gb|EEF38140.1| zinc finger protein, putative [Ricinus communis] Length = 493 Score = 420 bits (1080), Expect = e-115 Identities = 226/418 (54%), Positives = 281/418 (67%), Gaps = 7/418 (1%) Frame = +2 Query: 122 MNGFNQVELTLQNILQAINPSNEDWAVRFHIINEVRAVVGSVESLRGATVEPYGSFVSNL 301 MN + +E L++ L+ I P EDWAVR II E++ V+ S+ESLRGATVEP+GSFVSNL Sbjct: 1 MNAHSVLEPILRDTLEVIKPLREDWAVRSKIIEELKDVIASIESLRGATVEPFGSFVSNL 60 Query: 302 FTKWGDLDLSIELQNGSYISIPGKRHKQTLLADVIRALRKKGGYGRLHLISNARVPILKF 481 FT+WGDLD+SI L NGSYIS K+ KQ +L + +ALR+KGG+ RL + NARVP+LKF Sbjct: 61 FTRWGDLDISIMLANGSYISSAAKKRKQNVLREFHKALRQKGGWRRLQFVPNARVPLLKF 120 Query: 482 E-GNYKISCDISVSNLCGQMKSKILFWVNGIDGRFRDLVMLVKEWAKAHHINDSKSGTLN 658 E G ISCD+S+ NL GQ+KS LFW+N IDGRFRD+V+LVKEWAKAH+IN+ K+GTLN Sbjct: 121 ESGRQNISCDVSIDNLQGQIKSNFLFWLNQIDGRFRDMVLLVKEWAKAHNINNPKTGTLN 180 Query: 659 SYSLSLLVLFHFQTLEPAILPPLKEIYPGNMNDDLTGVRATAEKNIEDTCAASIKRILSD 838 SYSLSLLV+FHFQT PAILPPLKEIYP N+ DDLTGVR AE+ I++TC A+I R +SD Sbjct: 181 SYSLSLLVIFHFQTCVPAILPPLKEIYPRNVVDDLTGVRTVAEERIKETCNANIARYMSD 240 Query: 839 
KSRPINQXXXXXXXXXXXAKFSDICSRASTRGISPFMGQMEDINSNTRWLPKTYALFVED 1018 K R +N+ AKFS I +A+ GI F GQ DI S RWLPKTYALF+ED Sbjct: 241 KYRAVNRSSLSELFISFFAKFSGISLKAADLGICTFTGQWLDIRSTMRWLPKTYALFIED 300 Query: 1019 PFEQPANTARTVSSNQLIKIAEAIQLTHQMLASSNHNQASLLSVLVGPH----VSWLGPR 1186 PFEQP N AR VS+ L+KIAEA Q T+ L +N N+ SLL LV P ++ R Sbjct: 301 PFEQPENAARAVSAGNLVKIAEAFQTTYHKLVLANQNRTSLLGTLVRPEILNCIAGTPVR 360 Query: 1187 TRQSTSIASNNRNGNKNRPERKQPPFQTQKSQNRAMDKPQNGET--NGSKHASTSQKQ 1354 TS+ + + ++ P Q Q QN +K Q T KH +S Q Sbjct: 361 NLSYTSLHYQSTHPQISKSMYSSPQVQHQ-FQNMRQEKHQKIFTAQRQEKHPHSSNSQ 417 >ref|XP_007163461.1| hypothetical protein PHAVU_001G236100g [Phaseolus vulgaris] gi|561036925|gb|ESW35455.1| hypothetical protein PHAVU_001G236100g [Phaseolus vulgaris] Length = 509 Score = 419 bits (1076), Expect = e-114 Identities = 216/391 (55%), Positives = 280/391 (71%), Gaps = 7/391 (1%) Frame = +2 Query: 122 MNGFNQVELTLQNILQAINPSNEDWAVRFHIINEVRAVVGSVESLRGATVEPYGSFVSNL 301 M+ + +++ L++ILQ + P EDW +RF I+N++R++V SVESLRGATVEP+GSFVSNL Sbjct: 1 MSTHSMLDIVLKDILQVVTPLQEDWQIRFAILNDLRSIVESVESLRGATVEPFGSFVSNL 60 Query: 302 FTKWGDLDLSIELQNGSYISIPGKRHKQTLLADVIRALRKKGGYGRLHLISNARVPILKF 481 FT+WGDLD+SIEL NG +IS GK+ KQTLL +V++ALR KG L IS+ARVPILKF Sbjct: 61 FTRWGDLDISIELSNGLHISSAGKKQKQTLLGEVLKALRMKGAGSHLQFISSARVPILKF 120 Query: 482 EGNYK-ISCDISVSNLCGQMKSKILFWVNGIDGRFRDLVMLVKEWAKAHHINDSKSGTLN 658 + N + +SCDIS++NL GQMKSKIL W+N IDGRF D+V+LVKEWAKAH IN+SK+GT N Sbjct: 121 KSNRQGVSCDISINNLPGQMKSKILLWINKIDGRFHDMVLLVKEWAKAHKINNSKTGTFN 180 Query: 659 SYSLSLLVLFHFQTLEPAILPPLKEIYPGNMNDDLTGVRATAEKNIEDTCAASIKRILSD 838 SYSLSLLV+FHFQT PAILPPLK IYPGNM DDL G+RA AE I +TC A I R +S+ Sbjct: 181 SYSLSLLVIFHFQTCVPAILPPLKYIYPGNMVDDLKGIRADAENLIAETCNAGINRHISN 240 Query: 839 KSRPINQXXXXXXXXXXXAKFSDICSRASTRGISPFMGQMEDINSNTRWLPKTYALFVED 1018 +R IN+ K++ + S AS GI P+ GQ E I +NT WLPKTY++FVED Sbjct: 241 TARSINKKSVPDLFVEFLRKYAQMDSWASELGICPYTGQWEQIENNTIWLPKTYSIFVED 300 Query: 1019 PFEQPANTARTVSSNQLIKIAEAIQLTHQMLASSNHNQASLLSVLVGPHVSWLGPRTRQS 
1198 PFEQP NTAR+V++ QL KI++ T+ L+S++HN SLL++L PHV +S Sbjct: 301 PFEQPQNTARSVNAGQLKKISDTFSKTYAFLSSNHHNLNSLLTMLAPPHVV-------KS 353 Query: 1199 TSIASNNRNGNKNRPER------KQPPFQTQ 1273 + N +G+ P + +PP Q Q Sbjct: 354 ITTPIRNYDGSYFHPTQPKVQRAMRPPLQLQ 384 >ref|XP_007220456.1| hypothetical protein PRUPE_ppa005171mg [Prunus persica] gi|462416918|gb|EMJ21655.1| hypothetical protein PRUPE_ppa005171mg [Prunus persica] Length = 474 Score = 417 bits (1072), Expect = e-114 Identities = 230/473 (48%), Positives = 297/473 (62%), Gaps = 54/473 (11%) Frame = +2 Query: 122 MNGFNQVELTLQNILQAINPSNEDWAVRFHIINEVRAVVGSVESLRGATVEPYGSFVSNL 301 M+ + +E TL+ IL+ + P EDW R II+E+R V SVESLRGATVEP+GSFVS+L Sbjct: 1 MSAQSTLENTLKEILRVVKPLREDWTTRLQIIDELRGAVESVESLRGATVEPFGSFVSDL 60 Query: 302 FTKWGDLDLSIELQNGSYISIPGKRHKQTLLADVIRALRKKGGYGRLHLISNARVPILKF 481 FT+WGDLD+SIE NGS++S GK+ KQ LL DV+RA+R+KGG+ R LI NARVPILK Sbjct: 61 FTRWGDLDVSIEFSNGSFVSPYGKKQKQRLLGDVMRAMRQKGGWRRYQLIPNARVPILKV 120 Query: 482 EGNYK-ISCDISVSNLCGQMKSKILFWVNGIDGRFRDLVMLVKEWAKAHHINDSKSGTLN 658 E N + +SCDIS+ NL QMKS++LFW++ ID RFRD+V+L+KEWAKAH+IN+ K GT N Sbjct: 121 ESNLQNVSCDISIDNLKCQMKSRLLFWISEIDTRFRDMVLLIKEWAKAHNINNPKFGTFN 180 Query: 659 SYSLSLLVLFHFQTLEPAILPPLKEIYPGNMNDDLTGVRATAEKNIEDTCAASIKRILSD 838 SYSL+LLV+FHFQT PAI PPLK+IYPGN+ DDL G+RA E+ IE+TCAA+I+R S Sbjct: 181 SYSLTLLVVFHFQTCAPAIFPPLKDIYPGNLIDDLKGLRADTERRIEETCAANIRRFQSY 240 Query: 839 KSRPINQXXXXXXXXXXXAKFSDICSRASTRGISPFMGQMEDINSNTRWLPKTYALFVED 1018 R N+ KFSDI +AS GI + GQ + I SN RWLP+TYALF+ED Sbjct: 241 NLRAENRSSLSELFISFLGKFSDISLKASELGICTYTGQWQAIKSNMRWLPQTYALFIED 300 Query: 1019 PFEQPANTARTVSSNQLIKIAEAIQLTHQMLASSNHNQASLLSVLVGPHVSWLGPRT--- 1189 PFEQP N+AR VS +L +I+E +++H ML S NH +SLL+ LV P + L RT Sbjct: 301 PFEQPENSARAVSKRELTRISETFEMSHHMLISPNH--SSLLATLVRPQMLSLMVRTPDW 358 Query: 1190 -RQST-----------SIASNNRNGNK--------------------------NRPERKQ 1255 RQ T S +N NG + P Q Sbjct: 359 RRQPTHPQRFRAEGSHSPTPSNNNGPRQPTRPQVHRVVRSPSQVQPQYQTVKPKGPSEVQ 418 Query: 1256 
PPFQTQKSQNRAMDKPQNGETNGSKHASTS------------QKQQVWRPRSE 1378 P +QT K + + +PQ N H + + Q+QQ+WRPRS+ Sbjct: 419 PQYQTVKPKGPSQVQPQFQTMNPKSHPNRATFKKPPLQTYEDQRQQIWRPRSD 471 >ref|XP_006591354.1| PREDICTED: poly(A) RNA polymerase cid11-like isoform X1 [Glycine max] gi|571489968|ref|XP_006591355.1| PREDICTED: poly(A) RNA polymerase cid11-like isoform X2 [Glycine max] Length = 455 Score = 416 bits (1069), Expect = e-113 Identities = 209/350 (59%), Positives = 260/350 (74%), Gaps = 1/350 (0%) Frame = +2 Query: 122 MNGFNQVELTLQNILQAINPSNEDWAVRFHIINEVRAVVGSVESLRGATVEPYGSFVSNL 301 M+ + +++ + +IL+ + P EDW +RF IIN+ R++V SVESLRGATVEPYGSFVSNL Sbjct: 1 MSTHSMLDIVVNDILRVVTPLQEDWEIRFAIINDFRSIVESVESLRGATVEPYGSFVSNL 60 Query: 302 FTKWGDLDLSIELQNGSYISIPGKRHKQTLLADVIRALRKKGGYGRLHLISNARVPILKF 481 FT+WGDLD+SIEL NG +IS GK+ KQTLL +V++ALR KGG L ISNARVPILKF Sbjct: 61 FTRWGDLDISIELSNGLHISSAGKKQKQTLLGEVLKALRMKGGGSNLQFISNARVPILKF 120 Query: 482 EGNYK-ISCDISVSNLCGQMKSKILFWVNGIDGRFRDLVMLVKEWAKAHHINDSKSGTLN 658 + + +SCDIS++NL GQMKSKIL W+N IDGRFR +V+LVKEWAKAH IN+SK+GT N Sbjct: 121 KSYRQGVSCDISINNLPGQMKSKILLWINKIDGRFRHMVLLVKEWAKAHKINNSKAGTFN 180 Query: 659 SYSLSLLVLFHFQTLEPAILPPLKEIYPGNMNDDLTGVRATAEKNIEDTCAASIKRILSD 838 SYSLSLLV+F+FQT PAI PPLK+IYPGNM DDL G+R+ AE I +TC A+I R +S+ Sbjct: 181 SYSLSLLVIFYFQTCIPAIFPPLKDIYPGNMIDDLIGIRSDAENLIAETCDANINRFISN 240 Query: 839 KSRPINQXXXXXXXXXXXAKFSDICSRASTRGISPFMGQMEDINSNTRWLPKTYALFVED 1018 ++R IN+ KF+ + S A GI P+ G+ E I N WLPKTYA+FVED Sbjct: 241 RARSINRKSVAELFVDFVGKFAKMDSMAVEMGICPYTGKWEQIEDNMIWLPKTYAIFVED 300 Query: 1019 PFEQPANTARTVSSNQLIKIAEAIQLTHQMLASSNHNQASLLSVLVGPHV 1168 PFEQP NTAR+VS+ QL KI E TH +L S+N NQ SLLS L HV Sbjct: 301 PFEQPQNTARSVSAGQLKKITETFARTHDLLTSTNQNQISLLSNLAPAHV 350 >ref|XP_007009280.1| Zinc finger protein, putative [Theobroma cacao] gi|508726193|gb|EOY18090.1| Zinc finger protein, putative [Theobroma cacao] Length = 482 Score = 411 bits (1056), Expect = e-112 Identities = 222/413 (53%), Positives = 280/413 (67%), Gaps = 1/413 (0%) Frame = +2 
Query: 122 MNGFNQVELTLQNILQAINPSNEDWAVRFHIINEVRAVVGSVESLRGATVEPYGSFVSNL 301 MN ++QVE TLQ +L+ I P EDW R II+E+R VV S+ESLRGATVEP+GS VSNL Sbjct: 1 MNSYSQVESTLQEVLEVIKPLREDWVTRQKIIDELREVVQSMESLRGATVEPFGSLVSNL 60 Query: 302 FTKWGDLDLSIELQNGSYISIPGKRHKQTLLADVIRALRKKGGYGRLHLISNARVPILKF 481 FT+WGDLD+SIEL GSY+S GK+ KQTLL ++ RAL++K G+ RL I +ARVPILK Sbjct: 61 FTRWGDLDISIELPYGSYVSSAGKKRKQTLLGELQRALKQKDGWQRLQFIPHARVPILKI 120 Query: 482 EGNYK-ISCDISVSNLCGQMKSKILFWVNGIDGRFRDLVMLVKEWAKAHHINDSKSGTLN 658 E ++ ISCDIS+ NL GQ+KSK LFW+N IDGRFR++V+LVKEWA A+ IN+ K+GT N Sbjct: 121 ESRWQNISCDISIDNLQGQIKSKFLFWLNEIDGRFREMVLLVKEWASANGINNPKAGTFN 180 Query: 659 SYSLSLLVLFHFQTLEPAILPPLKEIYPGNMNDDLTGVRATAEKNIEDTCAASIKRILSD 838 SYSL+LLV+FHFQT PAI PPLK+IYP N+ DLTGVRA AE+ I C+++I R S Sbjct: 181 SYSLTLLVIFHFQTCAPAIFPPLKDIYPRNVVTDLTGVRADAERRIAQVCSSNIARFRS- 239 Query: 839 KSRPINQXXXXXXXXXXXAKFSDICSRASTRGISPFMGQMEDINSNTRWLPKTYALFVED 1018 R +N+ AKFSDI S+AS GI F GQ E I SN RWLP+TYA+FVED Sbjct: 240 -GRTVNRSSLSELFISFIAKFSDINSKASDMGICTFTGQWEYITSNMRWLPRTYAIFVED 298 Query: 1019 PFEQPANTARTVSSNQLIKIAEAIQLTHQMLASSNHNQASLLSVLVGPHVSWLGPRTRQS 1198 PFEQP N +R VS QLIKIAEA + T ML S+N Q++LL LVGP S + + Sbjct: 299 PFEQPENASRAVSQKQLIKIAEAFETTRCMLISANLTQSTLLPTLVGPKTSRFIVKQQSV 358 Query: 1199 TSIASNNRNGNKNRPERKQPPFQTQKSQNRAMDKPQNGETNGSKHASTSQKQQ 1357 +S + N + RP Q ++ + + Q+ N AS Q+ Q Sbjct: 359 SSSSYNGGHYPNTRP-------QVHRAVHSPLLMQQHQYRNSRPAASQMQQHQ 404 >ref|XP_004150639.1| PREDICTED: poly(A) RNA polymerase GLD2-like [Cucumis sativus] gi|449516431|ref|XP_004165250.1| PREDICTED: poly(A) RNA polymerase GLD2-like [Cucumis sativus] Length = 464 Score = 408 bits (1049), Expect = e-111 Identities = 224/420 (53%), Positives = 280/420 (66%), Gaps = 11/420 (2%) Frame = +2 Query: 122 MNGFNQVELTLQNILQAINPSNEDWAVRFHIINEVRAVVGSVESLRGATVEPYGSFVSNL 301 MNG ++ +++IL+ + P +DW RF +INE+R VV S+ESLRGAT+EP+GSFVSNL Sbjct: 1 MNGLT-LDRVIKDILRVVEPLQDDWTARFQVINELRNVVQSIESLRGATIEPFGSFVSNL 59 Query: 302 
FTKWGDLDLSIELQNGSYISIPGKRHKQTLLADVIRALRKKGGYGRLHLISNARVPILKF 481 F++WGDLDLS++L NGSY S GK+ KQTLL D+ A RK G + +L LI +ARVPILK Sbjct: 60 FSRWGDLDLSVQLNNGSYTSTAGKKRKQTLLRDIQNASRKNGRWYKLQLIPHARVPILKI 119 Query: 482 EG-NYKISCDISVSNLCGQMKSKILFWVNGIDGRFRDLVMLVKEWAKAHHINDSKSGTLN 658 E + ISCDIS+ NL GQ+KSKIL WVN IDGRF D+V+LVKEWAKAH IN+SK GT N Sbjct: 120 EHIQHNISCDISIDNLVGQIKSKILLWVNEIDGRFHDMVLLVKEWAKAHDINNSKQGTFN 179 Query: 659 SYSLSLLVLFHFQTLEPAILPPLKEIYPGNMNDDLTGVRATAEKNIEDTCAASIKRILSD 838 SYSLSLLV+FHFQT PAI PPL++IYPGN+ D+L GVRA E I TCA +I R Sbjct: 180 SYSLSLLVIFHFQTCSPAIFPPLRDIYPGNVVDNLKGVRAEVENEIARTCATNIARF--- 236 Query: 839 KSRPINQXXXXXXXXXXXAKFSDICSRASTRGISPFMGQMEDINSNTRWLPKTYALFVED 1018 KSR N+ AKFSDI S+AS GI P+ GQ I SN RWLPKTYA+FVED Sbjct: 237 KSRTANRSSLSELFVSFLAKFSDISSKASELGICPYTGQWLKIESNMRWLPKTYAIFVED 296 Query: 1019 PFEQPANTARTVSSNQLIKIAEAIQLTHQMLASSNHNQASLLSVLVGPHVSWLGPRTRQS 1198 PFEQP NTAR +++ QL++I+EA ++TH L S N++S+L+ L P +S L + S Sbjct: 297 PFEQPENTARAINARQLMRISEAFRMTHLRLTSVYQNRSSILNDLARPQISQLIINSSGS 356 Query: 1199 TSI-ASNNRNGNKNRPE-------RKQPPFQTQKSQNRAMDKPQNGETNGSK--HASTSQ 1348 S A N N RP+ + +P Q Q N N S+ HA TSQ Sbjct: 357 ASAPAFNVENYTPIRPQVHQARVMQPRPWIQHQFQNNIPRFNMGNFPAINSQAPHAGTSQ 416 >ref|XP_006601987.1| PREDICTED: poly(A) RNA polymerase cid11-like isoform X6 [Glycine max] Length = 374 Score = 383 bits (984), Expect = e-103 Identities = 204/382 (53%), Positives = 251/382 (65%), Gaps = 7/382 (1%) Frame = +2 Query: 251 SLRGATVEPYGSFVSNLFTKWGDLDLSIELQNGSYISIPGKRHKQTLLADVIRALRKKGG 430 SL GATVEP+GSFVSNLFT+WGDLD+SIEL NG +IS GK+ KQT L DV++ALR KGG Sbjct: 3 SLDGATVEPFGSFVSNLFTRWGDLDISIELSNGLHISSAGKKQKQTFLGDVLKALRMKGG 62 Query: 431 YGRLHLISNARVPILKFEGNYK-ISCDISVSNLCGQMKSKILFWVNGIDGRFRDLVMLVK 607 L ISNARVPILKF+ + +SCDIS++NL GQMKSKIL W+N IDGRFR +V+LVK Sbjct: 63 GSNLQFISNARVPILKFKSYRQGVSCDISINNLPGQMKSKILLWINKIDGRFRHMVLLVK 122 Query: 608 EWAKAHHINDSKSGTLNSYSLSLLVLFHFQTLEPAILPPLKEIYPGNMNDDLTGVRATAE 787 EWAKAH IN+SK+GT NSYSLSLLV+F+FQT PAI PPLK+IYPGNM DDL GVR+ AE Sbjct: 123 
EWAKAHKINNSKAGTFNSYSLSLLVIFYFQTCIPAIFPPLKDIYPGNMVDDLIGVRSDAE 182 Query: 788 KNIEDTCAASIKRILSDKSRPINQXXXXXXXXXXXAKFSDICSRASTRGISPFMGQMEDI 967 I TC A+I R +S+++R IN+ KF+ + S A GI P+ G+ E I Sbjct: 183 NLIAQTCDANINRFISNRARSINRKSVAELFVEFIGKFAKMDSMAVKMGICPYSGKWEQI 242 Query: 968 NSNTRWLPKTYALFVEDPFEQPANTARTVSSNQLIKIAEAIQLTHQMLASSNHNQASLLS 1147 N WLPKTYA+FVEDPFEQP NTAR+VS+ QL KI EA TH +L S+N NQ SLLS Sbjct: 243 EDNMIWLPKTYAIFVEDPFEQPQNTARSVSAGQLKKITEAFARTHDLLTSTNQNQISLLS 302 Query: 1148 VLVGPHVSWLGPRTRQSTSIASNNRNGNKNRPERKQ------PPFQTQKSQNRAMDKPQN 1309 + HV + G P + Q P Q+Q+ + Sbjct: 303 NMAPAHV----------IRCITRPYGGGYFHPTQPQVQRAIRPQLQSQRHFQNVSQGTSS 352 Query: 1310 GETNGSKHASTSQKQQVWRPRS 1375 ++ H + QQ+WRP+S Sbjct: 353 NSSSSKGHTLVHRGQQIWRPKS 374 >ref|XP_002879814.1| hypothetical protein ARALYDRAFT_321659 [Arabidopsis lyrata subsp. lyrata] gi|297325653|gb|EFH56073.1| hypothetical protein ARALYDRAFT_321659 [Arabidopsis lyrata subsp. lyrata] Length = 500 Score = 379 bits (973), Expect = e-102 Identities = 202/417 (48%), Positives = 276/417 (66%), Gaps = 1/417 (0%) Frame = +2 Query: 149 TLQNILQAINPSNEDWAVRFHIINEVRAVVGSVESLRGATVEPYGSFVSNLFTKWGDLDL 328 TLQ ILQ I P+ DW R +I+++R V+ +VE LRGATV+P+GSFVSNLFT+WGDLDL Sbjct: 10 TLQEILQVIKPTRADWDTRIRVIDQLRDVLQTVECLRGATVQPFGSFVSNLFTRWGDLDL 69 Query: 329 SIELQNGSYISIPGKRHKQTLLADVIRALRKKGGYGRLHLISNARVPILKF-EGNYKISC 505 S++L +GS I GK+ KQTLL ++RALR G + +L + +ARVPILK G+ +I+C Sbjct: 70 SVDLFSGSSILFTGKKQKQTLLRHLLRALRASGLWYKLQFVIHARVPILKVVSGHQRIAC 129 Query: 506 DISVSNLCGQMKSKILFWVNGIDGRFRDLVMLVKEWAKAHHINDSKSGTLNSYSLSLLVL 685 DIS+ NL G +KS+ LFW++ IDGRFRDLV+LVKEWAKAH+INDSK+GT NSYSLSLLV+ Sbjct: 130 DISIDNLDGLLKSRFLFWISEIDGRFRDLVLLVKEWAKAHNINDSKNGTFNSYSLSLLVI 189 Query: 686 FHFQTLEPAILPPLKEIYPGNMNDDLTGVRATAEKNIEDTCAASIKRILSDKSRPINQXX 865 FH QT PAILPPL+ IYP + DDLTGVR TAE++I AA+I R + ++ +N+ Sbjct: 190 FHLQTCVPAILPPLRVIYPKSAVDDLTGVRKTAEESIAQVTAANIARFKLNTAKSVNRSS 249 Query: 866 XXXXXXXXXAKFSDICSRASTRGISPFMGQMEDINSNTRWLPKTYALFVEDPFEQPANTA 1045 AKFSDI +A G+ PF 
G+ E+I+SNT WLPKTY+LFVEDPFEQP N A Sbjct: 250 LSELLVSFYAKFSDINLKAQELGVCPFTGRWENISSNTTWLPKTYSLFVEDPFEQPVNAA 309 Query: 1046 RTVSSNQLIKIAEAIQLTHQMLASSNHNQASLLSVLVGPHVSWLGPRTRQSTSIASNNRN 1225 R+VS L +IA+ Q+T + L S+ N+ S++ VL G H+ S +R Sbjct: 310 RSVSRRNLDRIAQVFQITSRRLV-SDCNRNSIIGVLTGQHIQ------------ESLHRT 356 Query: 1226 GNKNRPERKQPPFQTQKSQNRAMDKPQNGETNGSKHASTSQKQQVWRPRSETKAEKN 1396 + + + + +A + Q + N S+ +T Q W P ++++ ++N Sbjct: 357 ISLHSQQHANSMHNVRNLHGQARHQNQQMQQNWSQSYNT-QNPPYWPPPTQSRPQQN 412 >ref|NP_181504.2| HEN1 suppressor 1 [Arabidopsis thaliana] gi|53850481|gb|AAU95417.1| At2g39740 [Arabidopsis thaliana] gi|55733735|gb|AAV59264.1| At2g39740 [Arabidopsis thaliana] gi|330254623|gb|AEC09717.1| HEN1 suppressor 1 [Arabidopsis thaliana] Length = 511 Score = 379 bits (972), Expect = e-102 Identities = 206/417 (49%), Positives = 277/417 (66%), Gaps = 1/417 (0%) Frame = +2 Query: 149 TLQNILQAINPSNEDWAVRFHIINEVRAVVGSVESLRGATVEPYGSFVSNLFTKWGDLDL 328 TLQ ILQ I P+ D R +I+++R V+ SVE LRGATV+P+GSFVSNLFT+WGDLD+ Sbjct: 10 TLQEILQVIKPTRADRDTRITVIDQLRDVLQSVECLRGATVQPFGSFVSNLFTRWGDLDI 69 Query: 329 SIELQNGSYISIPGKRHKQTLLADVIRALRKKGGYGRLHLISNARVPILKF-EGNYKISC 505 S++L +GS I GK+ KQTLL ++RALR G + +L + +ARVPILK G+ +ISC Sbjct: 70 SVDLFSGSSILFTGKKQKQTLLGHLLRALRASGLWYKLQFVIHARVPILKVVSGHQRISC 129 Query: 506 DISVSNLCGQMKSKILFWVNGIDGRFRDLVMLVKEWAKAHHINDSKSGTLNSYSLSLLVL 685 DIS+ NL G +KS+ LFW++ IDGRFRDLV+LVKEWAKAH+INDSK+GT NSYSLSLLV+ Sbjct: 130 DISIDNLDGLLKSRFLFWISEIDGRFRDLVLLVKEWAKAHNINDSKTGTFNSYSLSLLVI 189 Query: 686 FHFQTLEPAILPPLKEIYPGNMNDDLTGVRATAEKNIEDTCAASIKRILSDKSRPINQXX 865 FHFQT PAILPPL+ IYP + DDLTGVR TAE++I AA+I R S++++ +N+ Sbjct: 190 FHFQTCVPAILPPLRVIYPKSAVDDLTGVRKTAEESIAQVTAANIARFKSERAKSVNRSS 249 Query: 866 XXXXXXXXXAKFSDICSRASTRGISPFMGQMEDINSNTRWLPKTYALFVEDPFEQPANTA 1045 AKFSDI +A G+ PF G+ E I+SNT WLPKTY+LFVEDPFEQP N A Sbjct: 250 LSELLVSFFAKFSDINVKAQEFGVCPFTGRWETISSNTTWLPKTYSLFVEDPFEQPVNAA 309 Query: 1046 RTVSSNQLIKIAEAIQLTHQMLASSNHNQASLLSVLVGPHVSWLGPRTRQSTSIASNNRN 1225 R+VS L +IA+ 
Q+T + L S N+ S++ +L G H+ RT S ++ N Sbjct: 310 RSVSRRNLDRIAQVFQITSRRLV-SECNRNSIIGILTGQHIQESLYRTISLPS--QHHAN 366 Query: 1226 GNKNRPERKQPPFQTQKSQNRAMDKPQNGETNGSKHASTSQKQQVWRPRSETKAEKN 1396 G N + +A + Q + N S+ +T W P ++++ ++N Sbjct: 367 GMHN----------VRNLHGQARPQNQQMQQNWSQSYNTPNPPH-WPPLTQSRPQQN 412 >dbj|BAE99845.1| hypothetical protein [Arabidopsis thaliana] Length = 511 Score = 376 bits (966), Expect = e-101 Identities = 205/417 (49%), Positives = 276/417 (66%), Gaps = 1/417 (0%) Frame = +2 Query: 149 TLQNILQAINPSNEDWAVRFHIINEVRAVVGSVESLRGATVEPYGSFVSNLFTKWGDLDL 328 TLQ ILQ I P+ D R +I+++R V+ SVE LRGATV+P+GSFVSNLFT+WGDLD+ Sbjct: 10 TLQEILQVIKPTRADRDTRITVIDQLRDVLQSVECLRGATVQPFGSFVSNLFTRWGDLDI 69 Query: 329 SIELQNGSYISIPGKRHKQTLLADVIRALRKKGGYGRLHLISNARVPILKF-EGNYKISC 505 S++L +GS I GK+ KQ LL ++RALR G + +L + +ARVPILK G+ +ISC Sbjct: 70 SVDLFSGSSILFTGKKQKQILLGHLLRALRASGLWYKLQFVIHARVPILKVVSGHQRISC 129 Query: 506 DISVSNLCGQMKSKILFWVNGIDGRFRDLVMLVKEWAKAHHINDSKSGTLNSYSLSLLVL 685 DIS+ NL G +KS+ LFW++ IDGRFRDLV+LVKEWAKAH+INDSK+GT NSYSLSLLV+ Sbjct: 130 DISIDNLDGLLKSRFLFWISEIDGRFRDLVLLVKEWAKAHNINDSKTGTFNSYSLSLLVI 189 Query: 686 FHFQTLEPAILPPLKEIYPGNMNDDLTGVRATAEKNIEDTCAASIKRILSDKSRPINQXX 865 FHFQT PAILPPL+ IYP + DDLTGVR TAE++I AA+I R S++++ +N+ Sbjct: 190 FHFQTCVPAILPPLRVIYPKSAVDDLTGVRKTAEESIAQVTAANIARFKSERAKSVNRSS 249 Query: 866 XXXXXXXXXAKFSDICSRASTRGISPFMGQMEDINSNTRWLPKTYALFVEDPFEQPANTA 1045 AKFSDI +A G+ PF G+ E I+SNT WLPKTY+LFVEDPFEQP N A Sbjct: 250 LSELLVSFFAKFSDINVKAQEFGVCPFTGRWETISSNTTWLPKTYSLFVEDPFEQPVNAA 309 Query: 1046 RTVSSNQLIKIAEAIQLTHQMLASSNHNQASLLSVLVGPHVSWLGPRTRQSTSIASNNRN 1225 R+VS L +IA+ Q+T + L S N+ S++ +L G H+ RT S ++ N Sbjct: 310 RSVSRRNLDRIAQVFQITSRRLV-SECNRNSIIGILTGQHIQESLYRTISLPS--QHHAN 366 Query: 1226 GNKNRPERKQPPFQTQKSQNRAMDKPQNGETNGSKHASTSQKQQVWRPRSETKAEKN 1396 G N + +A + Q + N S+ +T W P ++++ ++N Sbjct: 367 GMHN----------VRNLHGQARPQNQQMQQNWSQSYNTPNPPH-WPPLTQSRPQQN 412