BLASTX nr result
ID: Akebia24_contig00024511
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia24_contig00024511 (2820 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002277771.2| PREDICTED: poly(A) RNA polymerase GLD2-like ... 424 e-116 ref|XP_002524216.1| zinc finger protein, putative [Ricinus commu... 419 e-114 ref|XP_006486408.1| PREDICTED: poly(A) RNA polymerase GLD2-like ... 409 e-111 ref|XP_006435620.1| hypothetical protein CICLE_v10031537mg [Citr... 408 e-111 ref|XP_007220456.1| hypothetical protein PRUPE_ppa005171mg [Prun... 402 e-109 ref|XP_002316267.2| hypothetical protein POPTR_0010s20710g [Popu... 402 e-109 gb|EYU19942.1| hypothetical protein MIMGU_mgv1a006495mg [Mimulus... 397 e-107 ref|XP_006338856.1| PREDICTED: poly(A) RNA polymerase cid11-like... 393 e-106 ref|XP_004150639.1| PREDICTED: poly(A) RNA polymerase GLD2-like ... 392 e-106 ref|XP_007009280.1| Zinc finger protein, putative [Theobroma cac... 391 e-105 ref|XP_004240948.1| PREDICTED: poly(A) RNA polymerase GLD2-like ... 390 e-105 ref|XP_002879814.1| hypothetical protein ARALYDRAFT_321659 [Arab... 381 e-103 ref|XP_006296116.1| hypothetical protein CARUB_v10025267mg [Caps... 381 e-102 ref|NP_181504.2| HEN1 suppressor 1 [Arabidopsis thaliana] gi|538... 380 e-102 dbj|BAE99845.1| hypothetical protein [Arabidopsis thaliana] 379 e-102 ref|XP_006601982.1| PREDICTED: poly(A) RNA polymerase cid11-like... 370 2e-99 ref|XP_006591354.1| PREDICTED: poly(A) RNA polymerase cid11-like... 366 4e-98 ref|XP_006411208.1| hypothetical protein EUTSA_v10016564mg [Eutr... 363 3e-97 ref|XP_007163461.1| hypothetical protein PHAVU_001G236100g [Phas... 360 2e-96 ref|XP_004503175.1| PREDICTED: poly(A) RNA polymerase GLD2-like ... 356 3e-95 >ref|XP_002277771.2| PREDICTED: poly(A) RNA polymerase GLD2-like [Vitis vinifera] gi|296086183|emb|CBI31624.3| unnamed protein product [Vitis vinifera] Length = 453 Score = 424 bits (1091), Expect = e-116 Identities = 233/454 (51%), Positives = 290/454 (63%), Gaps = 48/454 (10%) Frame = +3 Query: 1425 MSYNGALELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNL 1604 MS LE+ L+DIL I PS +D R +I +FR V+S+E LRGATVEPFGS++SNL Sbjct: 1 MSTFNVLEIVLKDILLVINPSREDWAIRNQLIADFRTAVDSVESLRGATVEPFGSFLSNL 60 Query: 1605 YTKWGDLDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKF 1784 YT+WGDLDISIE+PNG++ISS +R KQ LL + LR +GG R +++IPNARVP++KF Sbjct: 61 YTQWGDLDISIELPNGAYISSAGKRHKQTLLGHVLNALRSKGGWRKLQFIPNARVPIIKF 120 Query: 1785 ESIHQNIFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLN 1964 ES H NI CD+SI NL GQ+KSKF WI+ ID RFRD++LLVKEWA+ +IN+ K GTLN Sbjct: 121 ESYHPNISCDVSINNLKGQMKSKFLFWISGIDGRFRDLVLLVKEWARAHDINNSKTGTLN 180 Query: 1965 SYSLSLLVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRN 2144 SYSLSLLV+FH QTC PAILPPL+EIY GN+ D L GVR + E I+ AANI RFKR+ Sbjct: 181 SYSLSLLVVFHLQTCRPAILPPLKEIYPGNVADDLIGVRAVVEGQIEETSAANINRFKRD 240 Query: 2145 NSINANRSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIE 2324 S NRSSLSEL ISF KF I ASE +C YTGQW + + W RT Y L +E Sbjct: 241 RSRAPNRSSLSELFISFLAKFVDITSRASEQGICPYTGQWVDIDSNMRWMPRT-YELFVE 299 Query: 2325 DPFEQPENAARAVSSVEFFRISEAFAGTHRKL-SSYRDRNSLISSLVRPEIRSSVHRRPE 2501 DPFEQPEN AR V S + RISEAF TH++L S+ +D++SLI +LVRP+I + R P Sbjct: 300 DPFEQPENTARGVRSRQLQRISEAFQTTHQRLTSANQDQHSLIDTLVRPQIAQFIRRAPS 359 Query: 2502 SSSTV---------------------LQNQFRNMRLESNPSPSIGFS------------- 2579 +S+ QN F+N R +S P+ + S Sbjct: 360 RNSSAYGRNNSRTYPSVPNVANSPLQFQNDFQNRRPQSRPNTTSQRSAPVQARPNSVTMQ 419 Query: 2580 ----QRPGSS---------AQSQGQQRWRERYNR 2642 RPGSS QSQ Q+ WR R +R Sbjct: 420 RSMYTRPGSSTVQRSVQQATQSQSQRVWRPRSDR 453 >ref|XP_002524216.1| zinc finger protein, putative [Ricinus communis] gi|223536493|gb|EEF38140.1| zinc finger protein, putative [Ricinus communis] Length = 493 Score = 419 bits (1077), Expect = e-114 Identities = 222/399 (55%), Positives = 279/399 (69%), Gaps = 22/399 (5%) Frame = +3 Query: 1425 MSYNGALELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNL 1604 M+ + LE LRD L IKP +D R +I E + V+ S+E LRGATVEPFGS+VSNL Sbjct: 1 MNAHSVLEPILRDTLEVIKPLREDWAVRSKIIEELKDVIASIESLRGATVEPFGSFVSNL 60 Query: 1605 YTKWGDLDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKF 1784 +T+WGDLDISI + NGS+ISS ++ KQN+LR+ K LR++GG R ++++PNARVPLLKF Sbjct: 61 FTRWGDLDISIMLANGSYISSAAKKRKQNVLREFHKALRQKGGWRRLQFVPNARVPLLKF 120 Query: 1785 ESIHQNIFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLN 1964 ES QNI CD+SI NL GQIKS F W+ +ID RFRDM+LLVKEWAK NIN+PK GTLN Sbjct: 121 ESGRQNISCDVSIDNLQGQIKSNFLFWLNQIDGRFRDMVLLVKEWAKAHNINNPKTGTLN 180 Query: 1965 SYSLSLLVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRN 2144 SYSLSLLVIFHFQTC PAILPPL+EIY N+VD LTGVRT+AE I+ C ANIAR+ + Sbjct: 181 SYSLSLLVIFHFQTCVPAILPPLKEIYPRNVVDDLTGVRTVAEERIKETCNANIARYMSD 240 Query: 2145 NSINANRSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIE 2324 NRSSLSEL ISFF KFS I++ A++ +CT+TGQW + + W +T Y L IE Sbjct: 241 KYRAVNRSSLSELFISFFAKFSGISLKAADLGICTFTGQWLDIRSTMRWLPKT-YALFIE 299 Query: 2325 DPFEQPENAARAVSSVEFFRISEAFAGTHRKL-SSYRDRNSLISSLVRPEIRSSVHRRPE 2501 DPFEQPENAARAVS+ +I+EAF T+ KL + ++R SL+ +LVRPEI + + P Sbjct: 300 DPFEQPENAARAVSAGNLVKIAEAFQTTYHKLVLANQNRTSLLGTLVRPEILNCIAGTPV 359 Query: 2502 S---------------------SSTVLQNQFRNMRLESN 2555 SS +Q+QF+NMR E + Sbjct: 360 RNLSYTSLHYQSTHPQISKSMYSSPQVQHQFQNMRQEKH 398 >ref|XP_006486408.1| PREDICTED: poly(A) RNA polymerase GLD2-like isoform X1 [Citrus sinensis] gi|568866114|ref|XP_006486409.1| PREDICTED: poly(A) RNA polymerase GLD2-like isoform X2 [Citrus sinensis] Length = 445 Score = 409 bits (1052), Expect = e-111 Identities = 225/429 (52%), Positives = 289/429 (67%), Gaps = 27/429 (6%) Frame = +3 Query: 1428 SYNGALELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNLY 1607 SYN LE L+DIL + P +D TR+ VI++ R VVES+E LRGATVEPFGS+VSNL+ Sbjct: 3 SYN-VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLF 61 Query: 1608 TKWGDLDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKFE 1787 ++WGDLDISIE+ NGS ISS ++ KQ+LL D+ + LR++GG R ++++ +ARVP+LKFE Sbjct: 62 SRWGDLDISIELSNGSCISSTGKKLKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE 121 Query: 1788 SIHQNIFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLNS 1967 +IHQNI CDISI NL GQIKSKF WI++ID RFRDM+LLVKEWAK +IN+PK GT NS Sbjct: 122 TIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNS 181 Query: 1968 YSLSLLVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRNN 2147 YSLSLLV+FHFQTC PAILPPL++IY GN+VD L GVR ER I ICA NIARF + Sbjct: 182 YSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANVERQIAEICAFNIARFSSDK 241 Query: 2148 SINANRSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIED 2327 NRSSL+ L +SF +KFS +++ ASE +C +TGQWE + W +PL IED Sbjct: 242 YRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNN-HPLFIED 300 Query: 2328 PFEQPENAARAVSSVEFFRISEAFAGTHRKL-SSYRDRNSLISSLVRPEI---------- 2474 PFEQPEN+ARAVS +IS AF TH +L S+ + R +L+SSL RP I Sbjct: 301 PFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLARPYILQFFGESPVR 360 Query: 2475 ---RSSVHR--RPESSSTV-----LQNQFRNMRLESNPSPSIG------FSQRPGSSAQS 2606 ++ HR RP+S +V Q+Q N R E+ P+ + +P Sbjct: 361 YANYNNGHRRARPQSHKSVNSPLQAQHQSHNARRENRPNRPMSQQSVQQHQSQPVRQNNG 420 Query: 2607 QGQQRWRER 2633 Q Q+ WR + Sbjct: 421 QVQRIWRPK 429 >ref|XP_006435620.1| hypothetical protein CICLE_v10031537mg [Citrus clementina] gi|557537816|gb|ESR48860.1| hypothetical protein CICLE_v10031537mg [Citrus clementina] Length = 445 Score = 408 bits (1049), Expect = e-111 Identities = 224/429 (52%), Positives = 289/429 (67%), Gaps = 27/429 (6%) Frame = +3 Query: 1428 SYNGALELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNLY 1607 SYN LE L+DIL + P +D TR+ VI++ R VVES+E LRGATVEPFGS+VSNL+ Sbjct: 3 SYN-VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLF 61 Query: 1608 TKWGDLDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKFE 1787 ++WGDLDISIE+ NGS ISS ++ KQ+LL D+ + LR++GG R ++++ +ARVP+LKFE Sbjct: 62 SRWGDLDISIELSNGSCISSTGKKLKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE 121 Query: 1788 SIHQNIFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLNS 1967 +IHQNI CDISI NL GQIKSKF WI++ID RFRDM+LLVKEWAK +IN+PK GT NS Sbjct: 122 TIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNS 181 Query: 1968 YSLSLLVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRNN 2147 YSLSLLV+FHFQTC PAILPPL++IY GN+VD L GVR ER I ICA NIARF + Sbjct: 182 YSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANVERQIAEICAFNIARFSSDK 241 Query: 2148 SINANRSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIED 2327 NRSSL+ L +SF +KFS +++ +SE +C +TGQWE + W +PL IED Sbjct: 242 YRKINRSSLAHLFVSFLEKFSGLSLKSSELGICPFTGQWEHIRSNTRWLPNN-HPLFIED 300 Query: 2328 PFEQPENAARAVSSVEFFRISEAFAGTHRKL-SSYRDRNSLISSLVRPEI---------- 2474 PFEQPEN+ARAVS +IS AF TH +L S+ + R +L+SSL RP I Sbjct: 301 PFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLARPYILQFFGESPVR 360 Query: 2475 ---RSSVHR--RPESSSTV-----LQNQFRNMRLESNPSPSIG------FSQRPGSSAQS 2606 ++ HR RP+S +V Q+Q N R E+ P+ + +P Sbjct: 361 YANYNNGHRRARPQSHKSVNSPLQAQHQSHNARRENRPNRPMSQQSVQQHQSQPVRQNNG 420 Query: 2607 QGQQRWRER 2633 Q Q+ WR + Sbjct: 421 QVQRIWRPK 429 >ref|XP_007220456.1| hypothetical protein PRUPE_ppa005171mg [Prunus persica] gi|462416918|gb|EMJ21655.1| hypothetical protein PRUPE_ppa005171mg [Prunus persica] Length = 474 Score = 402 bits (1034), Expect = e-109 Identities = 211/395 (53%), Positives = 273/395 (69%) Frame = +3 Query: 1425 MSYNGALELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNL 1604 MS LE +L++IL +KP +D TR+ +I+E R VES+E LRGATVEPFGS+VS+L Sbjct: 1 MSAQSTLENTLKEILRVVKPLREDWTTRLQIIDELRGAVESVESLRGATVEPFGSFVSDL 60 Query: 1605 YTKWGDLDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKF 1784 +T+WGDLD+SIE NGSF+S ++ KQ LL D+ + +R++GG R + IPNARVP+LK Sbjct: 61 FTRWGDLDVSIEFSNGSFVSPYGKKQKQRLLGDVMRAMRQKGGWRRYQLIPNARVPILKV 120 Query: 1785 ESIHQNIFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLN 1964 ES QN+ CDISI NL Q+KS+ WI+EID RFRDM+LL+KEWAK NIN+PK GT N Sbjct: 121 ESNLQNVSCDISIDNLKCQMKSRLLFWISEIDTRFRDMVLLIKEWAKAHNINNPKFGTFN 180 Query: 1965 SYSLSLLVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRN 2144 SYSL+LLV+FHFQTC PAI PPL++IY GN++D L G+R ER I+ CAANI RF+ Sbjct: 181 SYSLTLLVVFHFQTCAPAIFPPLKDIYPGNLIDDLKGLRADTERRIEETCAANIRRFQSY 240 Query: 2145 NSINANRSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIE 2324 N NRSSLSEL ISF KFS I++ ASE +CTYTGQW+ + + W +T Y L IE Sbjct: 241 NLRAENRSSLSELFISFLGKFSDISLKASELGICTYTGQWQAIKSNMRWLPQT-YALFIE 299 Query: 2325 DPFEQPENAARAVSSVEFFRISEAFAGTHRKLSSYRDRNSLISSLVRPEIRSSVHRRPES 2504 DPFEQPEN+ARAVS E RISE F +H L S + +SL+++LVRP++ S + R P+ Sbjct: 300 DPFEQPENSARAVSKRELTRISETFEMSHHMLIS-PNHSSLLATLVRPQMLSLMVRTPDW 358 Query: 2505 SSTVLQNQFRNMRLESNPSPSIGFSQRPGSSAQSQ 2609 Q R E + SP+ + P + Q Sbjct: 359 RRQPTHPQ--RFRAEGSHSPTPSNNNGPRQPTRPQ 391 >ref|XP_002316267.2| hypothetical protein POPTR_0010s20710g [Populus trichocarpa] gi|566191879|ref|XP_006378690.1| hypothetical protein POPTR_0010s20710g [Populus trichocarpa] gi|566191881|ref|XP_006378691.1| hypothetical protein POPTR_0010s20710g [Populus trichocarpa] gi|566191883|ref|XP_006378692.1| hypothetical protein POPTR_0010s20710g [Populus trichocarpa] gi|550330240|gb|EEF02438.2| hypothetical protein POPTR_0010s20710g [Populus trichocarpa] gi|550330241|gb|ERP56487.1| hypothetical protein POPTR_0010s20710g [Populus trichocarpa] gi|550330242|gb|ERP56488.1| hypothetical protein POPTR_0010s20710g [Populus trichocarpa] gi|550330243|gb|ERP56489.1| hypothetical protein POPTR_0010s20710g [Populus trichocarpa] Length = 493 Score = 402 bits (1033), Expect = e-109 Identities = 209/358 (58%), Positives = 262/358 (73%), Gaps = 1/358 (0%) Frame = +3 Query: 1443 LELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNLYTKWGD 1622 LE +L+DIL I+P +D R VI E VV+S+E LRG+TVEPFGS+VSNL+T+WGD Sbjct: 7 LEPTLKDILNGIQPLREDWVVRFKVIEELEDVVKSVESLRGSTVEPFGSFVSNLFTRWGD 66 Query: 1623 LDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKFESIHQN 1802 LDISI + NGS+ISS +R KQNLL D+ K LR+RGG + +++IPNARVP+LKFE+ + Sbjct: 67 LDISIVLSNGSYISSAGKRRKQNLLEDVLKALRQRGGWQRLQFIPNARVPILKFENA--S 124 Query: 1803 IFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLNSYSLSL 1982 I CD+SI N+ G +KSKF WI EID RFRDM+LLVKEWAK NIN+PK G+LNSYSLSL Sbjct: 125 ISCDVSIDNMQGLMKSKFLFWINEIDRRFRDMVLLVKEWAKTHNINNPKTGSLNSYSLSL 184 Query: 1983 LVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRNNSINAN 2162 LVIFHFQTC PAILPPL+EIY N++D LTGVRT AER I ICAANI+R++ N S N Sbjct: 185 LVIFHFQTCVPAILPPLKEIYPRNVIDDLTGVRTDAERRIGEICAANISRYRSNKSRAIN 244 Query: 2163 RSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIEDPFEQP 2342 R+SLSEL ISF KF I+ A+E +C +TG+WE + W RT Y L IEDPFEQP Sbjct: 245 RNSLSELFISFLTKFYDISSKATELGICPFTGKWEEIRSNTRWLPRT-YALFIEDPFEQP 303 Query: 2343 ENAARAVSSVEFFRISEAFAGTHRKL-SSYRDRNSLISSLVRPEIRSSVHRRPESSST 2513 EN ARAVS+ +ISEA TH +L ++ +++ S + LVRP I + P S+S+ Sbjct: 304 ENTARAVSAANLMKISEAIQTTHHRLVTANQNQISFLGMLVRPRISRIIAGTPASNSS 361 >gb|EYU19942.1| hypothetical protein MIMGU_mgv1a006495mg [Mimulus guttatus] Length = 442 Score = 397 bits (1019), Expect = e-107 Identities = 213/382 (55%), Positives = 267/382 (69%), Gaps = 1/382 (0%) Frame = +3 Query: 1425 MSYNGALELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNL 1604 M+ L+L++RDIL I PS DD R +INE RA+V S+E LRGATVEPFGS+ SNL Sbjct: 5 MNRYNLLDLTIRDILRVINPSNDDWSFRFQMINEIRAIVGSIENLRGATVEPFGSFASNL 64 Query: 1605 YTKWGDLDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKF 1784 +TKWGDLDISIE+ NG++ISSP ++ KQ++L+++ K R++GG R +++I NARVP+LKF Sbjct: 65 FTKWGDLDISIELQNGTYISSPGKKHKQSVLQEVLKAFRKKGGFRKLKFIANARVPILKF 124 Query: 1785 ESIHQNIFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLN 1964 E + NI CDISI NL GQ+KSK WI EID RFRD+++LVKEWAK +IND K+GTLN Sbjct: 125 EGSY-NISCDISINNLSGQMKSKILFWINEIDGRFRDLVMLVKEWAKAHHINDSKSGTLN 183 Query: 1965 SYSLSLLVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRN 2144 SYSLSLLVIFH QT PAILPPL+EIY GN++D LTGVRT+AE+NI+ ICAANI R + + Sbjct: 184 SYSLSLLVIFHLQTLVPAILPPLREIYPGNMIDDLTGVRTVAEKNIEDICAANIHRIRSD 243 Query: 2145 NSINANRSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIE 2324 S NRS+LS L ISF KF+ I AS +C Y+GQ E + + W RT Y L +E Sbjct: 244 RSRLINRSTLSALFISFLTKFADICSRASTQGICPYSGQLEDIHTNMRWLPRT-YALFVE 302 Query: 2325 DPFEQPENAARAVSSVEFFRISEAFAGTHRKL-SSYRDRNSLISSLVRPEIRSSVHRRPE 2501 DPFEQP N AR VSS + RIS+A TH L ++ +DR LI L P I S RP Sbjct: 303 DPFEQPANTARTVSSNQLIRISQAIQATHGILVAANQDRTCLIPVLAGPHI-SCFFMRPS 361 Query: 2502 SSSTVLQNQFRNMRLESNPSPS 2567 + L NQF + S PS Sbjct: 362 VPAPPLFNQFPSRTSSSTQLPS 383 >ref|XP_006338856.1| PREDICTED: poly(A) RNA polymerase cid11-like isoform X1 [Solanum tuberosum] gi|565343469|ref|XP_006338857.1| PREDICTED: poly(A) RNA polymerase cid11-like isoform X2 [Solanum tuberosum] gi|565343471|ref|XP_006338858.1| PREDICTED: poly(A) RNA polymerase cid11-like isoform X3 [Solanum tuberosum] gi|565343473|ref|XP_006338859.1| PREDICTED: poly(A) RNA polymerase cid11-like isoform X4 [Solanum tuberosum] Length = 453 Score = 393 bits (1010), Expect = e-106 Identities = 208/371 (56%), Positives = 261/371 (70%), Gaps = 1/371 (0%) Frame = +3 Query: 1425 MSYNGALELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNL 1604 M+ LE +L++IL +I P E+D R +I+E RAVVES+EILRGATVEPFGS+VSNL Sbjct: 1 MNCYSLLEHTLQNILHSINPLEEDWSMRFQLIHELRAVVESIEILRGATVEPFGSFVSNL 60 Query: 1605 YTKWGDLDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKF 1784 +T+WGDLDISIE+PNGS IS+ ++ K +LL D+ K LR +GG R +++I NARVP+LKF Sbjct: 61 FTRWGDLDISIELPNGSHISAAGKKYKLSLLGDVLKALRAKGGCRKLQFITNARVPILKF 120 Query: 1785 ESIHQNIFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLN 1964 + + NI CDISI NL GQ+KSK WI ID RFRDM+LLVKEWAK NIND K GTLN Sbjct: 121 QG-NYNISCDISINNLSGQMKSKILYWINMIDGRFRDMVLLVKEWAKAHNINDSKTGTLN 179 Query: 1965 SYSLSLLVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRN 2144 SYSLSLLV+FHFQTC PAILPPL+EIY G++VD LTGVR AE+ I+ CA NI R N Sbjct: 180 SYSLSLLVVFHFQTCVPAILPPLKEIYPGSMVDDLTGVRASAEKFIEETCAMNINRLMSN 239 Query: 2145 NSINANRSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIE 2324 S NRS LSEL ISF KF I+ AS + +TGQWE + + W +T Y + +E Sbjct: 240 KSRAINRSYLSELFISFIAKFCDISSRASAQGISPFTGQWEDIVSNMRWLPKT-YTIFVE 298 Query: 2325 DPFEQPENAARAVSSVEFFRISEAFAGTHRKL-SSYRDRNSLISSLVRPEIRSSVHRRPE 2501 DPFEQP N+AR VS+ + RI EAF TH L SS ++ N +IS+LV+P + V R Sbjct: 299 DPFEQPLNSARGVSTKQLTRIEEAFRSTHFMLCSSNQNENEIISTLVKPHVSKFVARTSG 358 Query: 2502 SSSTVLQNQFR 2534 + + +N R Sbjct: 359 NQNNYSRNGLR 369 >ref|XP_004150639.1| PREDICTED: poly(A) RNA polymerase GLD2-like [Cucumis sativus] gi|449516431|ref|XP_004165250.1| PREDICTED: poly(A) RNA polymerase GLD2-like [Cucumis sativus] Length = 464 Score = 392 bits (1006), Expect = e-106 Identities = 198/357 (55%), Positives = 256/357 (71%), Gaps = 1/357 (0%) Frame = +3 Query: 1443 LELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNLYTKWGD 1622 L+ ++DIL ++P +DD R VINE R VV+S+E LRGAT+EPFGS+VSNL+++WGD Sbjct: 6 LDRVIKDILRVVEPLQDDWTARFQVINELRNVVQSIESLRGATIEPFGSFVSNLFSRWGD 65 Query: 1623 LDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKFESIHQN 1802 LD+S+++ NGS+ S+ ++ KQ LLRDI+ R+ G ++ IP+ARVP+LK E I N Sbjct: 66 LDLSVQLNNGSYTSTAGKKRKQTLLRDIQNASRKNGRWYKLQLIPHARVPILKIEHIQHN 125 Query: 1803 IFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLNSYSLSL 1982 I CDISI NLVGQIKSK W+ EID RF DM+LLVKEWAK +IN+ K GT NSYSLSL Sbjct: 126 ISCDISIDNLVGQIKSKILLWVNEIDGRFHDMVLLVKEWAKAHDINNSKQGTFNSYSLSL 185 Query: 1983 LVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRNNSINAN 2162 LVIFHFQTC PAI PPL++IY GN+VD L GVR E I CA NIARFK S AN Sbjct: 186 LVIFHFQTCSPAIFPPLRDIYPGNVVDNLKGVRAEVENEIARTCATNIARFK---SRTAN 242 Query: 2163 RSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIEDPFEQP 2342 RSSLSEL +SF KFS I+ ASE +C YTGQW ++ + W +T Y + +EDPFEQP Sbjct: 243 RSSLSELFVSFLAKFSDISSKASELGICPYTGQWLKIESNMRWLPKT-YAIFVEDPFEQP 301 Query: 2343 ENAARAVSSVEFFRISEAFAGTHRKLSS-YRDRNSLISSLVRPEIRSSVHRRPESSS 2510 EN ARA+++ + RISEAF TH +L+S Y++R+S+++ L RP+I + S+S Sbjct: 302 ENTARAINARQLMRISEAFRMTHLRLTSVYQNRSSILNDLARPQISQLIINSSGSAS 358 >ref|XP_007009280.1| Zinc finger protein, putative [Theobroma cacao] gi|508726193|gb|EOY18090.1| Zinc finger protein, putative [Theobroma cacao] Length = 482 Score = 391 bits (1004), Expect = e-105 Identities = 214/430 (49%), Positives = 284/430 (66%), Gaps = 35/430 (8%) Frame = +3 Query: 1425 MSYNGALELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNL 1604 M+ +E +L+++L IKP +D TR +I+E R VV+SME LRGATVEPFGS VSNL Sbjct: 1 MNSYSQVESTLQEVLEVIKPLREDWVTRQKIIDELREVVQSMESLRGATVEPFGSLVSNL 60 Query: 1605 YTKWGDLDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKF 1784 +T+WGDLDISIE+P GS++SS ++ KQ LL ++++ L+++ G + +++IP+ARVP+LK Sbjct: 61 FTRWGDLDISIELPYGSYVSSAGKKRKQTLLGELQRALKQKDGWQRLQFIPHARVPILKI 120 Query: 1785 ESIHQNIFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLN 1964 ES QNI CDISI NL GQIKSKF W+ EID RFR+M+LLVKEWA IN+PK GT N Sbjct: 121 ESRWQNISCDISIDNLQGQIKSKFLFWLNEIDGRFREMVLLVKEWASANGINNPKAGTFN 180 Query: 1965 SYSLSLLVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRN 2144 SYSL+LLVIFHFQTC PAI PPL++IY N+V LTGVR AER I +C++NIARF+ Sbjct: 181 SYSLTLLVIFHFQTCAPAIFPPLKDIYPRNVVTDLTGVRADAERRIAQVCSSNIARFRSG 240 Query: 2145 NSINANRSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIE 2324 ++ NRSSLSEL ISF KFS IN AS+ +CT+TGQWE +T + W RT Y + +E Sbjct: 241 RTV--NRSSLSELFISFIAKFSDINSKASDMGICTFTGQWEYITSNMRWLPRT-YAIFVE 297 Query: 2325 DPFEQPENAARAVSSVEFFRISEAFAGTHRKL-SSYRDRNSLISSLVRPE---------- 2471 DPFEQPENA+RAVS + +I+EAF T L S+ +++L+ +LV P+ Sbjct: 298 DPFEQPENASRAVSQKQLIKIAEAFETTRCMLISANLTQSTLLPTLVGPKTSRFIVKQQS 357 Query: 2472 -------------IRSSVHRRPESSSTVLQNQFRNMRLESN-----------PSPSIGFS 2579 R VHR S + Q+Q+RN R ++ PSPS Sbjct: 358 VSSSSYNGGHYPNTRPQVHRAVHSPLLMQQHQYRNSRPAASQMQQHQAQMVMPSPSRVQP 417 Query: 2580 QRPGSSAQSQ 2609 Q P + +S+ Sbjct: 418 QFPKTRVESR 427 >ref|XP_004240948.1| PREDICTED: poly(A) RNA polymerase GLD2-like [Solanum lycopersicum] Length = 453 Score = 390 bits (1002), Expect = e-105 Identities = 206/365 (56%), Positives = 258/365 (70%), Gaps = 1/365 (0%) Frame = +3 Query: 1443 LELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNLYTKWGD 1622 LE +L++IL +I PSE+D R +I+E RAVVES+EILRGATVEPFGS+VSNL+T+WGD Sbjct: 7 LEHTLQNILHSINPSEEDWSMRFQLIHELRAVVESIEILRGATVEPFGSFVSNLFTRWGD 66 Query: 1623 LDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKFESIHQN 1802 +DISIE+PNG IS+ ++ K +LL D+ K LR +GG R +++I NARVP+LKF+ + N Sbjct: 67 VDISIELPNGLHISAAGKKYKLSLLGDVLKALRAKGGCRKLQFITNARVPILKFQG-NNN 125 Query: 1803 IFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLNSYSLSL 1982 I CDISI NL GQ+KSK WI ID RFRDM+LLVKEWAK NIND K GTLNSYSLSL Sbjct: 126 ISCDISINNLSGQMKSKILYWINMIDGRFRDMVLLVKEWAKAHNINDSKTGTLNSYSLSL 185 Query: 1983 LVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRNNSINAN 2162 LV+FH QTC PAILPPL+EIY G++VD LTGVR AE+ I+ CA NI R N S N Sbjct: 186 LVVFHLQTCVPAILPPLKEIYPGSMVDDLTGVRASAEKFIEETCAMNINRLMSNKSRVIN 245 Query: 2163 RSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIEDPFEQP 2342 RSSLSEL ISF KF I+ AS + +TGQWE + + W +T Y + +EDPFEQP Sbjct: 246 RSSLSELFISFIAKFCNISSRASAQGISPFTGQWEDIVSNMRWLPKT-YTIFVEDPFEQP 304 Query: 2343 ENAARAVSSVEFFRISEAFAGTHRKL-SSYRDRNSLISSLVRPEIRSSVHRRPESSSTVL 2519 N+AR VS+ + RI EAF TH L SS + N +IS+LV+P + V R + + Sbjct: 305 LNSARGVSTKQLTRIEEAFRSTHFMLCSSNLNENEVISTLVKPHVSKFVARISGNQNNYS 364 Query: 2520 QNQFR 2534 +N R Sbjct: 365 RNGLR 369 >ref|XP_002879814.1| hypothetical protein ARALYDRAFT_321659 [Arabidopsis lyrata subsp. lyrata] gi|297325653|gb|EFH56073.1| hypothetical protein ARALYDRAFT_321659 [Arabidopsis lyrata subsp. lyrata] Length = 500 Score = 381 bits (979), Expect = e-103 Identities = 206/405 (50%), Positives = 268/405 (66%) Frame = +3 Query: 1425 MSYNGALELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNL 1604 MS N L+ +L++IL IKP+ D TR+ VI++ R V++++E LRGATV+PFGS+VSNL Sbjct: 1 MSRNPFLDPTLQEILQVIKPTRADWDTRIRVIDQLRDVLQTVECLRGATVQPFGSFVSNL 60 Query: 1605 YTKWGDLDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKF 1784 +T+WGDLD+S+++ +GS I ++ KQ LLR + + LR G ++++ +ARVP+LK Sbjct: 61 FTRWGDLDLSVDLFSGSSILFTGKKQKQTLLRHLLRALRASGLWYKLQFVIHARVPILKV 120 Query: 1785 ESIHQNIFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLN 1964 S HQ I CDISI NL G +KS+F WI+EID RFRD++LLVKEWAK NIND KNGT N Sbjct: 121 VSGHQRIACDISIDNLDGLLKSRFLFWISEIDGRFRDLVLLVKEWAKAHNINDSKNGTFN 180 Query: 1965 SYSLSLLVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRN 2144 SYSLSLLVIFH QTC PAILPPL+ IY + VD LTGVR AE +I + AANIARFK N Sbjct: 181 SYSLSLLVIFHLQTCVPAILPPLRVIYPKSAVDDLTGVRKTAEESIAQVTAANIARFKLN 240 Query: 2145 NSINANRSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIE 2324 + + NRSSLSELL+SF+ KFS IN+ A E VC +TG+WE ++ W +T Y L +E Sbjct: 241 TAKSVNRSSLSELLVSFYAKFSDINLKAQELGVCPFTGRWENISSNTTWLPKT-YSLFVE 299 Query: 2325 DPFEQPENAARAVSSVEFFRISEAFAGTHRKLSSYRDRNSLISSLVRPEIRSSVHRRPES 2504 DPFEQP NAAR+VS RI++ F T R+L S +RNS+I L I+ S+HR Sbjct: 300 DPFEQPVNAARSVSRRNLDRIAQVFQITSRRLVSDCNRNSIIGVLTGQHIQESLHRTISL 359 Query: 2505 SSTVLQNQFRNMRLESNPSPSIGFSQRPGSSAQSQGQQRWRERYN 2639 S N N+R + Q QQ W + YN Sbjct: 360 HSQQHANSMHNVRNLHGQA----------RHQNQQMQQNWSQSYN 394 >ref|XP_006296116.1| hypothetical protein CARUB_v10025267mg [Capsella rubella] gi|482564824|gb|EOA29014.1| hypothetical protein CARUB_v10025267mg [Capsella rubella] Length = 499 Score = 381 bits (978), Expect = e-102 Identities = 207/402 (51%), Positives = 272/402 (67%) Frame = +3 Query: 1434 NGALELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNLYTK 1613 N L+ +L++IL IKP+ D TR+ VI + R+VV+S+E LRGATV+PFGS+VSNL+T+ Sbjct: 3 NPFLDPTLQEILQVIKPTRADCDTRIGVIEQLRSVVQSVECLRGATVQPFGSFVSNLFTR 62 Query: 1614 WGDLDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKFESI 1793 WGDLDIS+++ +GS I ++ KQ LL + + LR G ++++ +ARVP+LK ES Sbjct: 63 WGDLDISVDLFSGSSILFTGKKQKQKLLGHLLRALRANGLWYKLQFVIHARVPILKVESG 122 Query: 1794 HQNIFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLNSYS 1973 HQ I CDISI NL G +KS+F WI+EID RFRD++LLVKEWAK NIND KNGT NSYS Sbjct: 123 HQRISCDISIDNLEGLLKSRFLLWISEIDGRFRDLVLLVKEWAKAHNINDSKNGTFNSYS 182 Query: 1974 LSLLVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRNNSI 2153 LSLLVIFH QTC PAILPPL+ IY + D LTGVR AE +I I AANIARFK + + Sbjct: 183 LSLLVIFHLQTCVPAILPPLRVIYPKSAADDLTGVRKTAEESIAQITAANIARFKLDTAK 242 Query: 2154 NANRSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIEDPF 2333 + NRSSLSELL+SFF KFS INV A E VC +TG+WE ++ W +T Y L +EDPF Sbjct: 243 SPNRSSLSELLVSFFAKFSDINVKAQELGVCPFTGRWENISSNSRWLPKT-YSLFVEDPF 301 Query: 2334 EQPENAARAVSSVEFFRISEAFAGTHRKLSSYRDRNSLISSLVRPEIRSSVHRRPESSST 2513 EQP+NAAR+VS RI++ F T R+L+S +RNS+I + +I+ S++R + Sbjct: 302 EQPQNAARSVSRRNLDRIAQVFQMTSRRLASDCNRNSIIGVMTGQQIQQSLYRTISLHNQ 361 Query: 2514 VLQNQFRNMRLESNPSPSIGFSQRPGSSAQSQGQQRWRERYN 2639 N N+R ++ RP + Q QQ W + YN Sbjct: 362 HHANGTHNVR-------NLHGQSRPWN---QQLQQNWSQSYN 393 >ref|NP_181504.2| HEN1 suppressor 1 [Arabidopsis thaliana] gi|53850481|gb|AAU95417.1| At2g39740 [Arabidopsis thaliana] gi|55733735|gb|AAV59264.1| At2g39740 [Arabidopsis thaliana] gi|330254623|gb|AEC09717.1| HEN1 suppressor 1 [Arabidopsis thaliana] Length = 511 Score = 380 bits (977), Expect = e-102 Identities = 209/405 (51%), Positives = 269/405 (66%) Frame = +3 Query: 1425 MSYNGALELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNL 1604 MS N L+ +L++IL IKP+ D TR+ VI++ R V++S+E LRGATV+PFGS+VSNL Sbjct: 1 MSRNPFLDPTLQEILQVIKPTRADRDTRITVIDQLRDVLQSVECLRGATVQPFGSFVSNL 60 Query: 1605 YTKWGDLDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKF 1784 +T+WGDLDIS+++ +GS I ++ KQ LL + + LR G ++++ +ARVP+LK Sbjct: 61 FTRWGDLDISVDLFSGSSILFTGKKQKQTLLGHLLRALRASGLWYKLQFVIHARVPILKV 120 Query: 1785 ESIHQNIFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLN 1964 S HQ I CDISI NL G +KS+F WI+EID RFRD++LLVKEWAK NIND K GT N Sbjct: 121 VSGHQRISCDISIDNLDGLLKSRFLFWISEIDGRFRDLVLLVKEWAKAHNINDSKTGTFN 180 Query: 1965 SYSLSLLVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRN 2144 SYSLSLLVIFHFQTC PAILPPL+ IY + VD LTGVR AE +I + AANIARFK Sbjct: 181 SYSLSLLVIFHFQTCVPAILPPLRVIYPKSAVDDLTGVRKTAEESIAQVTAANIARFKSE 240 Query: 2145 NSINANRSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIE 2324 + + NRSSLSELL+SFF KFS INV A E VC +TG+WE ++ W +T Y L +E Sbjct: 241 RAKSVNRSSLSELLVSFFAKFSDINVKAQEFGVCPFTGRWETISSNTTWLPKT-YSLFVE 299 Query: 2325 DPFEQPENAARAVSSVEFFRISEAFAGTHRKLSSYRDRNSLISSLVRPEIRSSVHRRPES 2504 DPFEQP NAAR+VS RI++ F T R+L S +RNS+I L I+ S++R Sbjct: 300 DPFEQPVNAARSVSRRNLDRIAQVFQITSRRLVSECNRNSIIGILTGQHIQESLYRTISL 359 Query: 2505 SSTVLQNQFRNMRLESNPSPSIGFSQRPGSSAQSQGQQRWRERYN 2639 S N N+R ++ RP Q QQ W + YN Sbjct: 360 PSQHHANGMHNVR-------NLHGQARP---QNQQMQQNWSQSYN 394 >dbj|BAE99845.1| hypothetical protein [Arabidopsis thaliana] Length = 511 Score = 379 bits (974), Expect = e-102 Identities = 209/405 (51%), Positives = 269/405 (66%) Frame = +3 Query: 1425 MSYNGALELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNL 1604 MS N L+ +L++IL IKP+ D TR+ VI++ R V++S+E LRGATV+PFGS+VSNL Sbjct: 1 MSRNPFLDPTLQEILQVIKPTRADRDTRITVIDQLRDVLQSVECLRGATVQPFGSFVSNL 60 Query: 1605 YTKWGDLDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKF 1784 +T+WGDLDIS+++ +GS I ++ KQ LL + + LR G ++++ +ARVP+LK Sbjct: 61 FTRWGDLDISVDLFSGSSILFTGKKQKQILLGHLLRALRASGLWYKLQFVIHARVPILKV 120 Query: 1785 ESIHQNIFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLN 1964 S HQ I CDISI NL G +KS+F WI+EID RFRD++LLVKEWAK NIND K GT N Sbjct: 121 VSGHQRISCDISIDNLDGLLKSRFLFWISEIDGRFRDLVLLVKEWAKAHNINDSKTGTFN 180 Query: 1965 SYSLSLLVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRN 2144 SYSLSLLVIFHFQTC PAILPPL+ IY + VD LTGVR AE +I + AANIARFK Sbjct: 181 SYSLSLLVIFHFQTCVPAILPPLRVIYPKSAVDDLTGVRKTAEESIAQVTAANIARFKSE 240 Query: 2145 NSINANRSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIE 2324 + + NRSSLSELL+SFF KFS INV A E VC +TG+WE ++ W +T Y L +E Sbjct: 241 RAKSVNRSSLSELLVSFFAKFSDINVKAQEFGVCPFTGRWETISSNTTWLPKT-YSLFVE 299 Query: 2325 DPFEQPENAARAVSSVEFFRISEAFAGTHRKLSSYRDRNSLISSLVRPEIRSSVHRRPES 2504 DPFEQP NAAR+VS RI++ F T R+L S +RNS+I L I+ S++R Sbjct: 300 DPFEQPVNAARSVSRRNLDRIAQVFQITSRRLVSECNRNSIIGILTGQHIQESLYRTISL 359 Query: 2505 SSTVLQNQFRNMRLESNPSPSIGFSQRPGSSAQSQGQQRWRERYN 2639 S N N+R ++ RP Q QQ W + YN Sbjct: 360 PSQHHANGMHNVR-------NLHGQARP---QNQQMQQNWSQSYN 394 >ref|XP_006601982.1| PREDICTED: poly(A) RNA polymerase cid11-like isoform X1 [Glycine max] gi|571542766|ref|XP_006601983.1| PREDICTED: poly(A) RNA polymerase cid11-like isoform X2 [Glycine max] gi|571542770|ref|XP_006601984.1| PREDICTED: poly(A) RNA polymerase cid11-like isoform X3 [Glycine max] gi|571542774|ref|XP_006601985.1| PREDICTED: poly(A) RNA polymerase cid11-like isoform X4 [Glycine max] Length = 415 Score = 370 bits (950), Expect = 2e-99 Identities = 197/415 (47%), Positives = 272/415 (65%), Gaps = 12/415 (2%) Frame = +3 Query: 1425 MSYNGALELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNL 1604 MS + L++ + DIL + P ++D R +IN+ R++VES+E LRGATVEPFGS+VSNL Sbjct: 1 MSTHSTLDIVVNDILRVVTPVQEDWEIRFAIINDLRSIVESVESLRGATVEPFGSFVSNL 60 Query: 1605 YTKWGDLDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKF 1784 +T+WGDLDISIE+ NG ISS ++ KQ L D+ K LR +GG ++++I NARVP+LKF Sbjct: 61 FTRWGDLDISIELSNGLHISSAGKKQKQTFLGDVLKALRMKGGGSNLQFISNARVPILKF 120 Query: 1785 ESIHQNIFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLN 1964 +S Q + CDISI NL GQ+KSK WI +ID RFR M+LLVKEWAK IN+ K GT N Sbjct: 121 KSYRQGVSCDISINNLPGQMKSKILLWINKIDGRFRHMVLLVKEWAKAHKINNSKAGTFN 180 Query: 1965 SYSLSLLVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRN 2144 SYSLSLLVIF+FQTC PAI PPL++IY GN+VD L GVR+ AE I C ANI RF N Sbjct: 181 SYSLSLLVIFYFQTCIPAIFPPLKDIYPGNMVDDLIGVRSDAENLIAQTCDANINRFISN 240 Query: 2145 NSINANRSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIE 2324 + + NR S++EL + F KF++++ MA + +C Y+G+WE++ + W +T Y + +E Sbjct: 241 RARSINRKSVAELFVEFIGKFAKMDSMAVKMGICPYSGKWEQIEDNMIWLPKT-YAIFVE 299 Query: 2325 DPFEQPENAARAVSSVEFFRISEAFAGTHRKL-SSYRDRNSLISS---------LVRPEI 2474 DPFEQP+N AR+VS+ + +I+EAFA TH L S+ +++ SL+S+ + RP Sbjct: 300 DPFEQPQNTARSVSAGQLKKITEAFARTHDLLTSTNQNQISLLSNMAPAHVIRCITRPYG 359 Query: 2475 RSSVHRRPESSSTVLQNQFRNMRLESNPS--PSIGFSQRPGSSAQSQGQQRWRER 2633 H ++ Q ++ R N S S S G + +GQQ WR + Sbjct: 360 GGYFHPTQPQVQRAIRPQLQSQRHFQNVSQGTSSNSSSSKGHTLVHRGQQIWRPK 414 >ref|XP_006591354.1| PREDICTED: poly(A) RNA polymerase cid11-like isoform X1 [Glycine max] gi|571489968|ref|XP_006591355.1| PREDICTED: poly(A) RNA polymerase cid11-like isoform X2 [Glycine max] Length = 455 Score = 366 bits (939), Expect = 4e-98 Identities = 183/357 (51%), Positives = 254/357 (71%), Gaps = 1/357 (0%) Frame = +3 Query: 1425 MSYNGALELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNL 1604 MS + L++ + DIL + P ++D R +IN+FR++VES+E LRGATVEP+GS+VSNL Sbjct: 1 MSTHSMLDIVVNDILRVVTPLQEDWEIRFAIINDFRSIVESVESLRGATVEPYGSFVSNL 60 Query: 1605 YTKWGDLDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKF 1784 +T+WGDLDISIE+ NG ISS ++ KQ LL ++ K LR +GG ++++I NARVP+LKF Sbjct: 61 FTRWGDLDISIELSNGLHISSAGKKQKQTLLGEVLKALRMKGGGSNLQFISNARVPILKF 120 Query: 1785 ESIHQNIFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLN 1964 +S Q + CDISI NL GQ+KSK WI +ID RFR M+LLVKEWAK IN+ K GT N Sbjct: 121 KSYRQGVSCDISINNLPGQMKSKILLWINKIDGRFRHMVLLVKEWAKAHKINNSKAGTFN 180 Query: 1965 SYSLSLLVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRN 2144 SYSLSLLVIF+FQTC PAI PPL++IY GN++D L G+R+ AE I C ANI RF N Sbjct: 181 SYSLSLLVIFYFQTCIPAIFPPLKDIYPGNMIDDLIGIRSDAENLIAETCDANINRFISN 240 Query: 2145 NSINANRSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIE 2324 + + NR S++EL + F KF++++ MA E +C YTG+WE++ + W +T Y + +E Sbjct: 241 RARSINRKSVAELFVDFVGKFAKMDSMAVEMGICPYTGKWEQIEDNMIWLPKT-YAIFVE 299 Query: 2325 DPFEQPENAARAVSSVEFFRISEAFAGTHRKL-SSYRDRNSLISSLVRPEIRSSVHR 2492 DPFEQP+N AR+VS+ + +I+E FA TH L S+ +++ SL+S+L + + R Sbjct: 300 DPFEQPQNTARSVSAGQLKKITETFARTHDLLTSTNQNQISLLSNLAPAHVIRCITR 356 >ref|XP_006411208.1| hypothetical protein EUTSA_v10016564mg [Eutrema salsugineum] gi|567214704|ref|XP_006411209.1| hypothetical protein EUTSA_v10016564mg [Eutrema salsugineum] gi|557112377|gb|ESQ52661.1| hypothetical protein EUTSA_v10016564mg [Eutrema salsugineum] gi|557112378|gb|ESQ52662.1| hypothetical protein EUTSA_v10016564mg [Eutrema salsugineum] Length = 493 Score = 363 bits (931), Expect = 3e-97 Identities = 205/421 (48%), Positives = 275/421 (65%), Gaps = 15/421 (3%) Frame = +3 Query: 1425 MSYNGALELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNL 1604 MS N + +L+DIL AIKP+ D R+ VI++ R+ ++S+E LRGATV+PFGS+VSNL Sbjct: 1 MSRNPVFDPTLQDILQAIKPTGADWDARMTVIDQLRSALQSVESLRGATVQPFGSFVSNL 60 Query: 1605 YTKWGDLDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKF 1784 +T+WGDLDIS+++ +GS I ++ KQ L + + LR G ++++ +ARVP+LK Sbjct: 61 FTRWGDLDISVDLFSGSSILFTGKKQKQTFLGQLLRALRASGAWYRLQFVAHARVPILKV 120 Query: 1785 ESIHQNIFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLN 1964 S HQ I CDISI NL G +KS+F WI+EID RFRD++LLVKEWAK +IN+PKNGT N Sbjct: 121 VSGHQRISCDISIDNLEGLLKSRFLFWISEIDWRFRDLVLLVKEWAKAHDINNPKNGTFN 180 Query: 1965 SYSLSLLVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRN 2144 SYSLSLLVIFH QTC PAILPPL +IY + VD L IA+ + AANIARF+ Sbjct: 181 SYSLSLLVIFHLQTCVPAILPPLGDIYPRSAVDDLKVAACIAQ-----LSAANIARFRSG 235 Query: 2145 NSINANRSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIE 2324 S NRSSLSELL+SFF KFS INV A E VC +TG+WE ++ W +T Y L +E Sbjct: 236 TSRAVNRSSLSELLVSFFAKFSDINVKAKELGVCPFTGRWENISSNTRWLPKT-YSLFVE 294 Query: 2325 DPFEQPENAARAVSSVEFFRISEAFAGTHRKLSSYRDRNSLISSLVRPEI------RSSV 2486 DPFEQPENAAR+VS RI++ F T R+L++ +RNS++ L P I R+S+ Sbjct: 295 DPFEQPENAARSVSRKSLDRIAQVFEMTSRRLATDCNRNSIVGVLTSPHISQPLCSRTSL 354 Query: 2487 HRRPESSST----VLQNQFR--NMRLESNPSPSIGFSQRP---GSSAQSQGQQRWRERYN 2639 H ++ L Q R N +++ + S S + Q P +SA+S+ QQ W + Sbjct: 355 HNHHHANGVNNGHNLHGQSRPWNHQMQQHWSQS-NYVQNPPYWPASARSRAQQNWSQNNP 413 Query: 2640 R 2642 R Sbjct: 414 R 414 >ref|XP_007163461.1| hypothetical protein PHAVU_001G236100g [Phaseolus vulgaris] gi|561036925|gb|ESW35455.1| hypothetical protein PHAVU_001G236100g [Phaseolus vulgaris] Length = 509 Score = 360 bits (924), Expect = 2e-96 Identities = 179/355 (50%), Positives = 249/355 (70%), Gaps = 1/355 (0%) Frame = +3 Query: 1425 MSYNGALELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNL 1604 MS + L++ L+DIL + P ++D + R ++N+ R++VES+E LRGATVEPFGS+VSNL Sbjct: 1 MSTHSMLDIVLKDILQVVTPLQEDWQIRFAILNDLRSIVESVESLRGATVEPFGSFVSNL 60 Query: 1605 YTKWGDLDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKF 1784 +T+WGDLDISIE+ NG ISS ++ KQ LL ++ K LR +G +++I +ARVP+LKF Sbjct: 61 FTRWGDLDISIELSNGLHISSAGKKQKQTLLGEVLKALRMKGAGSHLQFISSARVPILKF 120 Query: 1785 ESIHQNIFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLN 1964 +S Q + CDISI NL GQ+KSK WI +ID RF DM+LLVKEWAK IN+ K GT N Sbjct: 121 KSNRQGVSCDISINNLPGQMKSKILLWINKIDGRFHDMVLLVKEWAKAHKINNSKTGTFN 180 Query: 1965 SYSLSLLVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRN 2144 SYSLSLLVIFHFQTC PAILPPL+ IY GN+VD L G+R AE I C A I R N Sbjct: 181 SYSLSLLVIFHFQTCVPAILPPLKYIYPGNMVDDLKGIRADAENLIAETCNAGINRHISN 240 Query: 2145 NSINANRSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIE 2324 + + N+ S+ +L + F +K+++++ ASE +C YTGQWE++ W +T Y + +E Sbjct: 241 TARSINKKSVPDLFVEFLRKYAQMDSWASELGICPYTGQWEQIENNTIWLPKT-YSIFVE 299 Query: 2325 DPFEQPENAARAVSSVEFFRISEAFAGTHRKLSS-YRDRNSLISSLVRPEIRSSV 2486 DPFEQP+N AR+V++ + +IS+ F+ T+ LSS + + NSL++ L P + S+ Sbjct: 300 DPFEQPQNTARSVNAGQLKKISDTFSKTYAFLSSNHHNLNSLLTMLAPPHVVKSI 354 >ref|XP_004503175.1| PREDICTED: poly(A) RNA polymerase GLD2-like [Cicer arietinum] Length = 491 Score = 356 bits (914), Expect = 3e-95 Identities = 185/351 (52%), Positives = 245/351 (69%), Gaps = 1/351 (0%) Frame = +3 Query: 1425 MSYNGALELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNL 1604 MS + L L DIL I PS++D R +IN+ R++ ES++ LRGATVEPFGS+VSNL Sbjct: 1 MSTHNMLGNVLNDILQVITPSQEDWAIRFAIINDLRSIAESVQSLRGATVEPFGSFVSNL 60 Query: 1605 YTKWGDLDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKF 1784 +T+WGD+DISIE+ NGS I+S R+ KQ LL D +VLR +GG +++ I NARVP+LKF Sbjct: 61 FTRWGDVDISIELLNGSHIASVGRKQKQTLLGDFLRVLRLKGGYMNMQLILNARVPILKF 120 Query: 1785 ESIHQNIFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLN 1964 S Q I CD+SI NL G +KSKF WI ID RF DM+L+VKEWAK IN+ + G+ N Sbjct: 121 RSKQQGISCDVSINNLPGLMKSKFLLWINRIDGRFHDMVLVVKEWAKAHRINNSRTGSFN 180 Query: 1965 SYSLSLLVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRN 2144 SYSLSLLVIFHFQTC PAILPPL++IY N+VD L GVR E I C ANI RF + Sbjct: 181 SYSLSLLVIFHFQTCAPAILPPLKDIYPANMVDELRGVRADVENLISETCGANINRFISD 240 Query: 2145 NSINANRSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIE 2324 S NR S+ EL I F +KF++++ ASE +C Y+GQ E++ + W +T Y + +E Sbjct: 241 KSRTINRKSVPELFIDFLRKFAQMDSWASELGICPYSGQREQIKNNMRWLPKT-YAIFVE 299 Query: 2325 DPFEQPENAARAVSSVEFFRISEAFAGTHRKLSS-YRDRNSLISSLVRPEI 2474 DPFEQPEN+AR+VS+ + +I+EAF T+ L+S +++NSL++ L P I Sbjct: 300 DPFEQPENSARSVSAGQLRKIAEAFLKTYSLLTSKNQNQNSLLACLAPPHI 350