BLASTX nr result

ID: Akebia24_contig00024511 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00024511
         (2820 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002277771.2| PREDICTED: poly(A) RNA polymerase GLD2-like ...   424   e-116
ref|XP_002524216.1| zinc finger protein, putative [Ricinus commu...   419   e-114
ref|XP_006486408.1| PREDICTED: poly(A) RNA polymerase GLD2-like ...   409   e-111
ref|XP_006435620.1| hypothetical protein CICLE_v10031537mg [Citr...   408   e-111
ref|XP_007220456.1| hypothetical protein PRUPE_ppa005171mg [Prun...   402   e-109
ref|XP_002316267.2| hypothetical protein POPTR_0010s20710g [Popu...   402   e-109
gb|EYU19942.1| hypothetical protein MIMGU_mgv1a006495mg [Mimulus...   397   e-107
ref|XP_006338856.1| PREDICTED: poly(A) RNA polymerase cid11-like...   393   e-106
ref|XP_004150639.1| PREDICTED: poly(A) RNA polymerase GLD2-like ...   392   e-106
ref|XP_007009280.1| Zinc finger protein, putative [Theobroma cac...   391   e-105
ref|XP_004240948.1| PREDICTED: poly(A) RNA polymerase GLD2-like ...   390   e-105
ref|XP_002879814.1| hypothetical protein ARALYDRAFT_321659 [Arab...   381   e-103
ref|XP_006296116.1| hypothetical protein CARUB_v10025267mg [Caps...   381   e-102
ref|NP_181504.2| HEN1 suppressor 1 [Arabidopsis thaliana] gi|538...   380   e-102
dbj|BAE99845.1| hypothetical protein [Arabidopsis thaliana]           379   e-102
ref|XP_006601982.1| PREDICTED: poly(A) RNA polymerase cid11-like...   370   2e-99
ref|XP_006591354.1| PREDICTED: poly(A) RNA polymerase cid11-like...   366   4e-98
ref|XP_006411208.1| hypothetical protein EUTSA_v10016564mg [Eutr...   363   3e-97
ref|XP_007163461.1| hypothetical protein PHAVU_001G236100g [Phas...   360   2e-96
ref|XP_004503175.1| PREDICTED: poly(A) RNA polymerase GLD2-like ...   356   3e-95

>ref|XP_002277771.2| PREDICTED: poly(A) RNA polymerase GLD2-like [Vitis vinifera]
            gi|296086183|emb|CBI31624.3| unnamed protein product
            [Vitis vinifera]
          Length = 453

 Score =  424 bits (1091), Expect = e-116
 Identities = 233/454 (51%), Positives = 290/454 (63%), Gaps = 48/454 (10%)
 Frame = +3

Query: 1425 MSYNGALELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNL 1604
            MS    LE+ L+DIL  I PS +D   R  +I +FR  V+S+E LRGATVEPFGS++SNL
Sbjct: 1    MSTFNVLEIVLKDILLVINPSREDWAIRNQLIADFRTAVDSVESLRGATVEPFGSFLSNL 60

Query: 1605 YTKWGDLDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKF 1784
            YT+WGDLDISIE+PNG++ISS  +R KQ LL  +   LR +GG R +++IPNARVP++KF
Sbjct: 61   YTQWGDLDISIELPNGAYISSAGKRHKQTLLGHVLNALRSKGGWRKLQFIPNARVPIIKF 120

Query: 1785 ESIHQNIFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLN 1964
            ES H NI CD+SI NL GQ+KSKF  WI+ ID RFRD++LLVKEWA+  +IN+ K GTLN
Sbjct: 121  ESYHPNISCDVSINNLKGQMKSKFLFWISGIDGRFRDLVLLVKEWARAHDINNSKTGTLN 180

Query: 1965 SYSLSLLVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRN 2144
            SYSLSLLV+FH QTC PAILPPL+EIY GN+ D L GVR + E  I+   AANI RFKR+
Sbjct: 181  SYSLSLLVVFHLQTCRPAILPPLKEIYPGNVADDLIGVRAVVEGQIEETSAANINRFKRD 240

Query: 2145 NSINANRSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIE 2324
             S   NRSSLSEL ISF  KF  I   ASE  +C YTGQW  +   + W  RT Y L +E
Sbjct: 241  RSRAPNRSSLSELFISFLAKFVDITSRASEQGICPYTGQWVDIDSNMRWMPRT-YELFVE 299

Query: 2325 DPFEQPENAARAVSSVEFFRISEAFAGTHRKL-SSYRDRNSLISSLVRPEIRSSVHRRPE 2501
            DPFEQPEN AR V S +  RISEAF  TH++L S+ +D++SLI +LVRP+I   + R P 
Sbjct: 300  DPFEQPENTARGVRSRQLQRISEAFQTTHQRLTSANQDQHSLIDTLVRPQIAQFIRRAPS 359

Query: 2502 SSSTV---------------------LQNQFRNMRLESNPSPSIGFS------------- 2579
             +S+                       QN F+N R +S P+ +   S             
Sbjct: 360  RNSSAYGRNNSRTYPSVPNVANSPLQFQNDFQNRRPQSRPNTTSQRSAPVQARPNSVTMQ 419

Query: 2580 ----QRPGSS---------AQSQGQQRWRERYNR 2642
                 RPGSS          QSQ Q+ WR R +R
Sbjct: 420  RSMYTRPGSSTVQRSVQQATQSQSQRVWRPRSDR 453


>ref|XP_002524216.1| zinc finger protein, putative [Ricinus communis]
            gi|223536493|gb|EEF38140.1| zinc finger protein, putative
            [Ricinus communis]
          Length = 493

 Score =  419 bits (1077), Expect = e-114
 Identities = 222/399 (55%), Positives = 279/399 (69%), Gaps = 22/399 (5%)
 Frame = +3

Query: 1425 MSYNGALELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNL 1604
            M+ +  LE  LRD L  IKP  +D   R  +I E + V+ S+E LRGATVEPFGS+VSNL
Sbjct: 1    MNAHSVLEPILRDTLEVIKPLREDWAVRSKIIEELKDVIASIESLRGATVEPFGSFVSNL 60

Query: 1605 YTKWGDLDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKF 1784
            +T+WGDLDISI + NGS+ISS  ++ KQN+LR+  K LR++GG R ++++PNARVPLLKF
Sbjct: 61   FTRWGDLDISIMLANGSYISSAAKKRKQNVLREFHKALRQKGGWRRLQFVPNARVPLLKF 120

Query: 1785 ESIHQNIFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLN 1964
            ES  QNI CD+SI NL GQIKS F  W+ +ID RFRDM+LLVKEWAK  NIN+PK GTLN
Sbjct: 121  ESGRQNISCDVSIDNLQGQIKSNFLFWLNQIDGRFRDMVLLVKEWAKAHNINNPKTGTLN 180

Query: 1965 SYSLSLLVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRN 2144
            SYSLSLLVIFHFQTC PAILPPL+EIY  N+VD LTGVRT+AE  I+  C ANIAR+  +
Sbjct: 181  SYSLSLLVIFHFQTCVPAILPPLKEIYPRNVVDDLTGVRTVAEERIKETCNANIARYMSD 240

Query: 2145 NSINANRSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIE 2324
                 NRSSLSEL ISFF KFS I++ A++  +CT+TGQW  +   + W  +T Y L IE
Sbjct: 241  KYRAVNRSSLSELFISFFAKFSGISLKAADLGICTFTGQWLDIRSTMRWLPKT-YALFIE 299

Query: 2325 DPFEQPENAARAVSSVEFFRISEAFAGTHRKL-SSYRDRNSLISSLVRPEIRSSVHRRPE 2501
            DPFEQPENAARAVS+    +I+EAF  T+ KL  + ++R SL+ +LVRPEI + +   P 
Sbjct: 300  DPFEQPENAARAVSAGNLVKIAEAFQTTYHKLVLANQNRTSLLGTLVRPEILNCIAGTPV 359

Query: 2502 S---------------------SSTVLQNQFRNMRLESN 2555
                                  SS  +Q+QF+NMR E +
Sbjct: 360  RNLSYTSLHYQSTHPQISKSMYSSPQVQHQFQNMRQEKH 398


>ref|XP_006486408.1| PREDICTED: poly(A) RNA polymerase GLD2-like isoform X1 [Citrus
            sinensis] gi|568866114|ref|XP_006486409.1| PREDICTED:
            poly(A) RNA polymerase GLD2-like isoform X2 [Citrus
            sinensis]
          Length = 445

 Score =  409 bits (1052), Expect = e-111
 Identities = 225/429 (52%), Positives = 289/429 (67%), Gaps = 27/429 (6%)
 Frame = +3

Query: 1428 SYNGALELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNLY 1607
            SYN  LE  L+DIL  + P  +D  TR+ VI++ R VVES+E LRGATVEPFGS+VSNL+
Sbjct: 3    SYN-VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLF 61

Query: 1608 TKWGDLDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKFE 1787
            ++WGDLDISIE+ NGS ISS  ++ KQ+LL D+ + LR++GG R ++++ +ARVP+LKFE
Sbjct: 62   SRWGDLDISIELSNGSCISSTGKKLKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE 121

Query: 1788 SIHQNIFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLNS 1967
            +IHQNI CDISI NL GQIKSKF  WI++ID RFRDM+LLVKEWAK  +IN+PK GT NS
Sbjct: 122  TIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNS 181

Query: 1968 YSLSLLVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRNN 2147
            YSLSLLV+FHFQTC PAILPPL++IY GN+VD L GVR   ER I  ICA NIARF  + 
Sbjct: 182  YSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANVERQIAEICAFNIARFSSDK 241

Query: 2148 SINANRSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIED 2327
                NRSSL+ L +SF +KFS +++ ASE  +C +TGQWE +     W     +PL IED
Sbjct: 242  YRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNN-HPLFIED 300

Query: 2328 PFEQPENAARAVSSVEFFRISEAFAGTHRKL-SSYRDRNSLISSLVRPEI---------- 2474
            PFEQPEN+ARAVS     +IS AF  TH +L S+ + R +L+SSL RP I          
Sbjct: 301  PFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLARPYILQFFGESPVR 360

Query: 2475 ---RSSVHR--RPESSSTV-----LQNQFRNMRLESNPSPSIG------FSQRPGSSAQS 2606
                ++ HR  RP+S  +V      Q+Q  N R E+ P+  +          +P      
Sbjct: 361  YANYNNGHRRARPQSHKSVNSPLQAQHQSHNARRENRPNRPMSQQSVQQHQSQPVRQNNG 420

Query: 2607 QGQQRWRER 2633
            Q Q+ WR +
Sbjct: 421  QVQRIWRPK 429


>ref|XP_006435620.1| hypothetical protein CICLE_v10031537mg [Citrus clementina]
            gi|557537816|gb|ESR48860.1| hypothetical protein
            CICLE_v10031537mg [Citrus clementina]
          Length = 445

 Score =  408 bits (1049), Expect = e-111
 Identities = 224/429 (52%), Positives = 289/429 (67%), Gaps = 27/429 (6%)
 Frame = +3

Query: 1428 SYNGALELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNLY 1607
            SYN  LE  L+DIL  + P  +D  TR+ VI++ R VVES+E LRGATVEPFGS+VSNL+
Sbjct: 3    SYN-VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLF 61

Query: 1608 TKWGDLDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKFE 1787
            ++WGDLDISIE+ NGS ISS  ++ KQ+LL D+ + LR++GG R ++++ +ARVP+LKFE
Sbjct: 62   SRWGDLDISIELSNGSCISSTGKKLKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE 121

Query: 1788 SIHQNIFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLNS 1967
            +IHQNI CDISI NL GQIKSKF  WI++ID RFRDM+LLVKEWAK  +IN+PK GT NS
Sbjct: 122  TIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNS 181

Query: 1968 YSLSLLVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRNN 2147
            YSLSLLV+FHFQTC PAILPPL++IY GN+VD L GVR   ER I  ICA NIARF  + 
Sbjct: 182  YSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANVERQIAEICAFNIARFSSDK 241

Query: 2148 SINANRSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIED 2327
                NRSSL+ L +SF +KFS +++ +SE  +C +TGQWE +     W     +PL IED
Sbjct: 242  YRKINRSSLAHLFVSFLEKFSGLSLKSSELGICPFTGQWEHIRSNTRWLPNN-HPLFIED 300

Query: 2328 PFEQPENAARAVSSVEFFRISEAFAGTHRKL-SSYRDRNSLISSLVRPEI---------- 2474
            PFEQPEN+ARAVS     +IS AF  TH +L S+ + R +L+SSL RP I          
Sbjct: 301  PFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLARPYILQFFGESPVR 360

Query: 2475 ---RSSVHR--RPESSSTV-----LQNQFRNMRLESNPSPSIG------FSQRPGSSAQS 2606
                ++ HR  RP+S  +V      Q+Q  N R E+ P+  +          +P      
Sbjct: 361  YANYNNGHRRARPQSHKSVNSPLQAQHQSHNARRENRPNRPMSQQSVQQHQSQPVRQNNG 420

Query: 2607 QGQQRWRER 2633
            Q Q+ WR +
Sbjct: 421  QVQRIWRPK 429


>ref|XP_007220456.1| hypothetical protein PRUPE_ppa005171mg [Prunus persica]
            gi|462416918|gb|EMJ21655.1| hypothetical protein
            PRUPE_ppa005171mg [Prunus persica]
          Length = 474

 Score =  402 bits (1034), Expect = e-109
 Identities = 211/395 (53%), Positives = 273/395 (69%)
 Frame = +3

Query: 1425 MSYNGALELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNL 1604
            MS    LE +L++IL  +KP  +D  TR+ +I+E R  VES+E LRGATVEPFGS+VS+L
Sbjct: 1    MSAQSTLENTLKEILRVVKPLREDWTTRLQIIDELRGAVESVESLRGATVEPFGSFVSDL 60

Query: 1605 YTKWGDLDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKF 1784
            +T+WGDLD+SIE  NGSF+S   ++ KQ LL D+ + +R++GG R  + IPNARVP+LK 
Sbjct: 61   FTRWGDLDVSIEFSNGSFVSPYGKKQKQRLLGDVMRAMRQKGGWRRYQLIPNARVPILKV 120

Query: 1785 ESIHQNIFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLN 1964
            ES  QN+ CDISI NL  Q+KS+   WI+EID RFRDM+LL+KEWAK  NIN+PK GT N
Sbjct: 121  ESNLQNVSCDISIDNLKCQMKSRLLFWISEIDTRFRDMVLLIKEWAKAHNINNPKFGTFN 180

Query: 1965 SYSLSLLVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRN 2144
            SYSL+LLV+FHFQTC PAI PPL++IY GN++D L G+R   ER I+  CAANI RF+  
Sbjct: 181  SYSLTLLVVFHFQTCAPAIFPPLKDIYPGNLIDDLKGLRADTERRIEETCAANIRRFQSY 240

Query: 2145 NSINANRSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIE 2324
            N    NRSSLSEL ISF  KFS I++ ASE  +CTYTGQW+ +   + W  +T Y L IE
Sbjct: 241  NLRAENRSSLSELFISFLGKFSDISLKASELGICTYTGQWQAIKSNMRWLPQT-YALFIE 299

Query: 2325 DPFEQPENAARAVSSVEFFRISEAFAGTHRKLSSYRDRNSLISSLVRPEIRSSVHRRPES 2504
            DPFEQPEN+ARAVS  E  RISE F  +H  L S  + +SL+++LVRP++ S + R P+ 
Sbjct: 300  DPFEQPENSARAVSKRELTRISETFEMSHHMLIS-PNHSSLLATLVRPQMLSLMVRTPDW 358

Query: 2505 SSTVLQNQFRNMRLESNPSPSIGFSQRPGSSAQSQ 2609
                   Q    R E + SP+   +  P    + Q
Sbjct: 359  RRQPTHPQ--RFRAEGSHSPTPSNNNGPRQPTRPQ 391


>ref|XP_002316267.2| hypothetical protein POPTR_0010s20710g [Populus trichocarpa]
            gi|566191879|ref|XP_006378690.1| hypothetical protein
            POPTR_0010s20710g [Populus trichocarpa]
            gi|566191881|ref|XP_006378691.1| hypothetical protein
            POPTR_0010s20710g [Populus trichocarpa]
            gi|566191883|ref|XP_006378692.1| hypothetical protein
            POPTR_0010s20710g [Populus trichocarpa]
            gi|550330240|gb|EEF02438.2| hypothetical protein
            POPTR_0010s20710g [Populus trichocarpa]
            gi|550330241|gb|ERP56487.1| hypothetical protein
            POPTR_0010s20710g [Populus trichocarpa]
            gi|550330242|gb|ERP56488.1| hypothetical protein
            POPTR_0010s20710g [Populus trichocarpa]
            gi|550330243|gb|ERP56489.1| hypothetical protein
            POPTR_0010s20710g [Populus trichocarpa]
          Length = 493

 Score =  402 bits (1033), Expect = e-109
 Identities = 209/358 (58%), Positives = 262/358 (73%), Gaps = 1/358 (0%)
 Frame = +3

Query: 1443 LELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNLYTKWGD 1622
            LE +L+DIL  I+P  +D   R  VI E   VV+S+E LRG+TVEPFGS+VSNL+T+WGD
Sbjct: 7    LEPTLKDILNGIQPLREDWVVRFKVIEELEDVVKSVESLRGSTVEPFGSFVSNLFTRWGD 66

Query: 1623 LDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKFESIHQN 1802
            LDISI + NGS+ISS  +R KQNLL D+ K LR+RGG + +++IPNARVP+LKFE+   +
Sbjct: 67   LDISIVLSNGSYISSAGKRRKQNLLEDVLKALRQRGGWQRLQFIPNARVPILKFENA--S 124

Query: 1803 IFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLNSYSLSL 1982
            I CD+SI N+ G +KSKF  WI EID RFRDM+LLVKEWAK  NIN+PK G+LNSYSLSL
Sbjct: 125  ISCDVSIDNMQGLMKSKFLFWINEIDRRFRDMVLLVKEWAKTHNINNPKTGSLNSYSLSL 184

Query: 1983 LVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRNNSINAN 2162
            LVIFHFQTC PAILPPL+EIY  N++D LTGVRT AER I  ICAANI+R++ N S   N
Sbjct: 185  LVIFHFQTCVPAILPPLKEIYPRNVIDDLTGVRTDAERRIGEICAANISRYRSNKSRAIN 244

Query: 2163 RSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIEDPFEQP 2342
            R+SLSEL ISF  KF  I+  A+E  +C +TG+WE +     W  RT Y L IEDPFEQP
Sbjct: 245  RNSLSELFISFLTKFYDISSKATELGICPFTGKWEEIRSNTRWLPRT-YALFIEDPFEQP 303

Query: 2343 ENAARAVSSVEFFRISEAFAGTHRKL-SSYRDRNSLISSLVRPEIRSSVHRRPESSST 2513
            EN ARAVS+    +ISEA   TH +L ++ +++ S +  LVRP I   +   P S+S+
Sbjct: 304  ENTARAVSAANLMKISEAIQTTHHRLVTANQNQISFLGMLVRPRISRIIAGTPASNSS 361


>gb|EYU19942.1| hypothetical protein MIMGU_mgv1a006495mg [Mimulus guttatus]
          Length = 442

 Score =  397 bits (1019), Expect = e-107
 Identities = 213/382 (55%), Positives = 267/382 (69%), Gaps = 1/382 (0%)
 Frame = +3

Query: 1425 MSYNGALELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNL 1604
            M+    L+L++RDIL  I PS DD   R  +INE RA+V S+E LRGATVEPFGS+ SNL
Sbjct: 5    MNRYNLLDLTIRDILRVINPSNDDWSFRFQMINEIRAIVGSIENLRGATVEPFGSFASNL 64

Query: 1605 YTKWGDLDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKF 1784
            +TKWGDLDISIE+ NG++ISSP ++ KQ++L+++ K  R++GG R +++I NARVP+LKF
Sbjct: 65   FTKWGDLDISIELQNGTYISSPGKKHKQSVLQEVLKAFRKKGGFRKLKFIANARVPILKF 124

Query: 1785 ESIHQNIFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLN 1964
            E  + NI CDISI NL GQ+KSK   WI EID RFRD+++LVKEWAK  +IND K+GTLN
Sbjct: 125  EGSY-NISCDISINNLSGQMKSKILFWINEIDGRFRDLVMLVKEWAKAHHINDSKSGTLN 183

Query: 1965 SYSLSLLVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRN 2144
            SYSLSLLVIFH QT  PAILPPL+EIY GN++D LTGVRT+AE+NI+ ICAANI R + +
Sbjct: 184  SYSLSLLVIFHLQTLVPAILPPLREIYPGNMIDDLTGVRTVAEKNIEDICAANIHRIRSD 243

Query: 2145 NSINANRSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIE 2324
             S   NRS+LS L ISF  KF+ I   AS   +C Y+GQ E +   + W  RT Y L +E
Sbjct: 244  RSRLINRSTLSALFISFLTKFADICSRASTQGICPYSGQLEDIHTNMRWLPRT-YALFVE 302

Query: 2325 DPFEQPENAARAVSSVEFFRISEAFAGTHRKL-SSYRDRNSLISSLVRPEIRSSVHRRPE 2501
            DPFEQP N AR VSS +  RIS+A   TH  L ++ +DR  LI  L  P I S    RP 
Sbjct: 303  DPFEQPANTARTVSSNQLIRISQAIQATHGILVAANQDRTCLIPVLAGPHI-SCFFMRPS 361

Query: 2502 SSSTVLQNQFRNMRLESNPSPS 2567
              +  L NQF +    S   PS
Sbjct: 362  VPAPPLFNQFPSRTSSSTQLPS 383


>ref|XP_006338856.1| PREDICTED: poly(A) RNA polymerase cid11-like isoform X1 [Solanum
            tuberosum] gi|565343469|ref|XP_006338857.1| PREDICTED:
            poly(A) RNA polymerase cid11-like isoform X2 [Solanum
            tuberosum] gi|565343471|ref|XP_006338858.1| PREDICTED:
            poly(A) RNA polymerase cid11-like isoform X3 [Solanum
            tuberosum] gi|565343473|ref|XP_006338859.1| PREDICTED:
            poly(A) RNA polymerase cid11-like isoform X4 [Solanum
            tuberosum]
          Length = 453

 Score =  393 bits (1010), Expect = e-106
 Identities = 208/371 (56%), Positives = 261/371 (70%), Gaps = 1/371 (0%)
 Frame = +3

Query: 1425 MSYNGALELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNL 1604
            M+    LE +L++IL +I P E+D   R  +I+E RAVVES+EILRGATVEPFGS+VSNL
Sbjct: 1    MNCYSLLEHTLQNILHSINPLEEDWSMRFQLIHELRAVVESIEILRGATVEPFGSFVSNL 60

Query: 1605 YTKWGDLDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKF 1784
            +T+WGDLDISIE+PNGS IS+  ++ K +LL D+ K LR +GG R +++I NARVP+LKF
Sbjct: 61   FTRWGDLDISIELPNGSHISAAGKKYKLSLLGDVLKALRAKGGCRKLQFITNARVPILKF 120

Query: 1785 ESIHQNIFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLN 1964
            +  + NI CDISI NL GQ+KSK   WI  ID RFRDM+LLVKEWAK  NIND K GTLN
Sbjct: 121  QG-NYNISCDISINNLSGQMKSKILYWINMIDGRFRDMVLLVKEWAKAHNINDSKTGTLN 179

Query: 1965 SYSLSLLVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRN 2144
            SYSLSLLV+FHFQTC PAILPPL+EIY G++VD LTGVR  AE+ I+  CA NI R   N
Sbjct: 180  SYSLSLLVVFHFQTCVPAILPPLKEIYPGSMVDDLTGVRASAEKFIEETCAMNINRLMSN 239

Query: 2145 NSINANRSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIE 2324
             S   NRS LSEL ISF  KF  I+  AS   +  +TGQWE +   + W  +T Y + +E
Sbjct: 240  KSRAINRSYLSELFISFIAKFCDISSRASAQGISPFTGQWEDIVSNMRWLPKT-YTIFVE 298

Query: 2325 DPFEQPENAARAVSSVEFFRISEAFAGTHRKL-SSYRDRNSLISSLVRPEIRSSVHRRPE 2501
            DPFEQP N+AR VS+ +  RI EAF  TH  L SS ++ N +IS+LV+P +   V R   
Sbjct: 299  DPFEQPLNSARGVSTKQLTRIEEAFRSTHFMLCSSNQNENEIISTLVKPHVSKFVARTSG 358

Query: 2502 SSSTVLQNQFR 2534
            + +   +N  R
Sbjct: 359  NQNNYSRNGLR 369


>ref|XP_004150639.1| PREDICTED: poly(A) RNA polymerase GLD2-like [Cucumis sativus]
            gi|449516431|ref|XP_004165250.1| PREDICTED: poly(A) RNA
            polymerase GLD2-like [Cucumis sativus]
          Length = 464

 Score =  392 bits (1006), Expect = e-106
 Identities = 198/357 (55%), Positives = 256/357 (71%), Gaps = 1/357 (0%)
 Frame = +3

Query: 1443 LELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNLYTKWGD 1622
            L+  ++DIL  ++P +DD   R  VINE R VV+S+E LRGAT+EPFGS+VSNL+++WGD
Sbjct: 6    LDRVIKDILRVVEPLQDDWTARFQVINELRNVVQSIESLRGATIEPFGSFVSNLFSRWGD 65

Query: 1623 LDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKFESIHQN 1802
            LD+S+++ NGS+ S+  ++ KQ LLRDI+   R+ G    ++ IP+ARVP+LK E I  N
Sbjct: 66   LDLSVQLNNGSYTSTAGKKRKQTLLRDIQNASRKNGRWYKLQLIPHARVPILKIEHIQHN 125

Query: 1803 IFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLNSYSLSL 1982
            I CDISI NLVGQIKSK   W+ EID RF DM+LLVKEWAK  +IN+ K GT NSYSLSL
Sbjct: 126  ISCDISIDNLVGQIKSKILLWVNEIDGRFHDMVLLVKEWAKAHDINNSKQGTFNSYSLSL 185

Query: 1983 LVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRNNSINAN 2162
            LVIFHFQTC PAI PPL++IY GN+VD L GVR   E  I   CA NIARFK   S  AN
Sbjct: 186  LVIFHFQTCSPAIFPPLRDIYPGNVVDNLKGVRAEVENEIARTCATNIARFK---SRTAN 242

Query: 2163 RSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIEDPFEQP 2342
            RSSLSEL +SF  KFS I+  ASE  +C YTGQW ++   + W  +T Y + +EDPFEQP
Sbjct: 243  RSSLSELFVSFLAKFSDISSKASELGICPYTGQWLKIESNMRWLPKT-YAIFVEDPFEQP 301

Query: 2343 ENAARAVSSVEFFRISEAFAGTHRKLSS-YRDRNSLISSLVRPEIRSSVHRRPESSS 2510
            EN ARA+++ +  RISEAF  TH +L+S Y++R+S+++ L RP+I   +     S+S
Sbjct: 302  ENTARAINARQLMRISEAFRMTHLRLTSVYQNRSSILNDLARPQISQLIINSSGSAS 358


>ref|XP_007009280.1| Zinc finger protein, putative [Theobroma cacao]
            gi|508726193|gb|EOY18090.1| Zinc finger protein, putative
            [Theobroma cacao]
          Length = 482

 Score =  391 bits (1004), Expect = e-105
 Identities = 214/430 (49%), Positives = 284/430 (66%), Gaps = 35/430 (8%)
 Frame = +3

Query: 1425 MSYNGALELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNL 1604
            M+    +E +L+++L  IKP  +D  TR  +I+E R VV+SME LRGATVEPFGS VSNL
Sbjct: 1    MNSYSQVESTLQEVLEVIKPLREDWVTRQKIIDELREVVQSMESLRGATVEPFGSLVSNL 60

Query: 1605 YTKWGDLDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKF 1784
            +T+WGDLDISIE+P GS++SS  ++ KQ LL ++++ L+++ G + +++IP+ARVP+LK 
Sbjct: 61   FTRWGDLDISIELPYGSYVSSAGKKRKQTLLGELQRALKQKDGWQRLQFIPHARVPILKI 120

Query: 1785 ESIHQNIFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLN 1964
            ES  QNI CDISI NL GQIKSKF  W+ EID RFR+M+LLVKEWA    IN+PK GT N
Sbjct: 121  ESRWQNISCDISIDNLQGQIKSKFLFWLNEIDGRFREMVLLVKEWASANGINNPKAGTFN 180

Query: 1965 SYSLSLLVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRN 2144
            SYSL+LLVIFHFQTC PAI PPL++IY  N+V  LTGVR  AER I  +C++NIARF+  
Sbjct: 181  SYSLTLLVIFHFQTCAPAIFPPLKDIYPRNVVTDLTGVRADAERRIAQVCSSNIARFRSG 240

Query: 2145 NSINANRSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIE 2324
             ++  NRSSLSEL ISF  KFS IN  AS+  +CT+TGQWE +T  + W  RT Y + +E
Sbjct: 241  RTV--NRSSLSELFISFIAKFSDINSKASDMGICTFTGQWEYITSNMRWLPRT-YAIFVE 297

Query: 2325 DPFEQPENAARAVSSVEFFRISEAFAGTHRKL-SSYRDRNSLISSLVRPE---------- 2471
            DPFEQPENA+RAVS  +  +I+EAF  T   L S+   +++L+ +LV P+          
Sbjct: 298  DPFEQPENASRAVSQKQLIKIAEAFETTRCMLISANLTQSTLLPTLVGPKTSRFIVKQQS 357

Query: 2472 -------------IRSSVHRRPESSSTVLQNQFRNMRLESN-----------PSPSIGFS 2579
                          R  VHR   S   + Q+Q+RN R  ++           PSPS    
Sbjct: 358  VSSSSYNGGHYPNTRPQVHRAVHSPLLMQQHQYRNSRPAASQMQQHQAQMVMPSPSRVQP 417

Query: 2580 QRPGSSAQSQ 2609
            Q P +  +S+
Sbjct: 418  QFPKTRVESR 427


>ref|XP_004240948.1| PREDICTED: poly(A) RNA polymerase GLD2-like [Solanum lycopersicum]
          Length = 453

 Score =  390 bits (1002), Expect = e-105
 Identities = 206/365 (56%), Positives = 258/365 (70%), Gaps = 1/365 (0%)
 Frame = +3

Query: 1443 LELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNLYTKWGD 1622
            LE +L++IL +I PSE+D   R  +I+E RAVVES+EILRGATVEPFGS+VSNL+T+WGD
Sbjct: 7    LEHTLQNILHSINPSEEDWSMRFQLIHELRAVVESIEILRGATVEPFGSFVSNLFTRWGD 66

Query: 1623 LDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKFESIHQN 1802
            +DISIE+PNG  IS+  ++ K +LL D+ K LR +GG R +++I NARVP+LKF+  + N
Sbjct: 67   VDISIELPNGLHISAAGKKYKLSLLGDVLKALRAKGGCRKLQFITNARVPILKFQG-NNN 125

Query: 1803 IFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLNSYSLSL 1982
            I CDISI NL GQ+KSK   WI  ID RFRDM+LLVKEWAK  NIND K GTLNSYSLSL
Sbjct: 126  ISCDISINNLSGQMKSKILYWINMIDGRFRDMVLLVKEWAKAHNINDSKTGTLNSYSLSL 185

Query: 1983 LVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRNNSINAN 2162
            LV+FH QTC PAILPPL+EIY G++VD LTGVR  AE+ I+  CA NI R   N S   N
Sbjct: 186  LVVFHLQTCVPAILPPLKEIYPGSMVDDLTGVRASAEKFIEETCAMNINRLMSNKSRVIN 245

Query: 2163 RSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIEDPFEQP 2342
            RSSLSEL ISF  KF  I+  AS   +  +TGQWE +   + W  +T Y + +EDPFEQP
Sbjct: 246  RSSLSELFISFIAKFCNISSRASAQGISPFTGQWEDIVSNMRWLPKT-YTIFVEDPFEQP 304

Query: 2343 ENAARAVSSVEFFRISEAFAGTHRKL-SSYRDRNSLISSLVRPEIRSSVHRRPESSSTVL 2519
             N+AR VS+ +  RI EAF  TH  L SS  + N +IS+LV+P +   V R   + +   
Sbjct: 305  LNSARGVSTKQLTRIEEAFRSTHFMLCSSNLNENEVISTLVKPHVSKFVARISGNQNNYS 364

Query: 2520 QNQFR 2534
            +N  R
Sbjct: 365  RNGLR 369


>ref|XP_002879814.1| hypothetical protein ARALYDRAFT_321659 [Arabidopsis lyrata subsp.
            lyrata] gi|297325653|gb|EFH56073.1| hypothetical protein
            ARALYDRAFT_321659 [Arabidopsis lyrata subsp. lyrata]
          Length = 500

 Score =  381 bits (979), Expect = e-103
 Identities = 206/405 (50%), Positives = 268/405 (66%)
 Frame = +3

Query: 1425 MSYNGALELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNL 1604
            MS N  L+ +L++IL  IKP+  D  TR+ VI++ R V++++E LRGATV+PFGS+VSNL
Sbjct: 1    MSRNPFLDPTLQEILQVIKPTRADWDTRIRVIDQLRDVLQTVECLRGATVQPFGSFVSNL 60

Query: 1605 YTKWGDLDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKF 1784
            +T+WGDLD+S+++ +GS I    ++ KQ LLR + + LR  G    ++++ +ARVP+LK 
Sbjct: 61   FTRWGDLDLSVDLFSGSSILFTGKKQKQTLLRHLLRALRASGLWYKLQFVIHARVPILKV 120

Query: 1785 ESIHQNIFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLN 1964
             S HQ I CDISI NL G +KS+F  WI+EID RFRD++LLVKEWAK  NIND KNGT N
Sbjct: 121  VSGHQRIACDISIDNLDGLLKSRFLFWISEIDGRFRDLVLLVKEWAKAHNINDSKNGTFN 180

Query: 1965 SYSLSLLVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRN 2144
            SYSLSLLVIFH QTC PAILPPL+ IY  + VD LTGVR  AE +I  + AANIARFK N
Sbjct: 181  SYSLSLLVIFHLQTCVPAILPPLRVIYPKSAVDDLTGVRKTAEESIAQVTAANIARFKLN 240

Query: 2145 NSINANRSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIE 2324
             + + NRSSLSELL+SF+ KFS IN+ A E  VC +TG+WE ++    W  +T Y L +E
Sbjct: 241  TAKSVNRSSLSELLVSFYAKFSDINLKAQELGVCPFTGRWENISSNTTWLPKT-YSLFVE 299

Query: 2325 DPFEQPENAARAVSSVEFFRISEAFAGTHRKLSSYRDRNSLISSLVRPEIRSSVHRRPES 2504
            DPFEQP NAAR+VS     RI++ F  T R+L S  +RNS+I  L    I+ S+HR    
Sbjct: 300  DPFEQPVNAARSVSRRNLDRIAQVFQITSRRLVSDCNRNSIIGVLTGQHIQESLHRTISL 359

Query: 2505 SSTVLQNQFRNMRLESNPSPSIGFSQRPGSSAQSQGQQRWRERYN 2639
             S    N   N+R     +               Q QQ W + YN
Sbjct: 360  HSQQHANSMHNVRNLHGQA----------RHQNQQMQQNWSQSYN 394


>ref|XP_006296116.1| hypothetical protein CARUB_v10025267mg [Capsella rubella]
            gi|482564824|gb|EOA29014.1| hypothetical protein
            CARUB_v10025267mg [Capsella rubella]
          Length = 499

 Score =  381 bits (978), Expect = e-102
 Identities = 207/402 (51%), Positives = 272/402 (67%)
 Frame = +3

Query: 1434 NGALELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNLYTK 1613
            N  L+ +L++IL  IKP+  D  TR+ VI + R+VV+S+E LRGATV+PFGS+VSNL+T+
Sbjct: 3    NPFLDPTLQEILQVIKPTRADCDTRIGVIEQLRSVVQSVECLRGATVQPFGSFVSNLFTR 62

Query: 1614 WGDLDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKFESI 1793
            WGDLDIS+++ +GS I    ++ KQ LL  + + LR  G    ++++ +ARVP+LK ES 
Sbjct: 63   WGDLDISVDLFSGSSILFTGKKQKQKLLGHLLRALRANGLWYKLQFVIHARVPILKVESG 122

Query: 1794 HQNIFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLNSYS 1973
            HQ I CDISI NL G +KS+F  WI+EID RFRD++LLVKEWAK  NIND KNGT NSYS
Sbjct: 123  HQRISCDISIDNLEGLLKSRFLLWISEIDGRFRDLVLLVKEWAKAHNINDSKNGTFNSYS 182

Query: 1974 LSLLVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRNNSI 2153
            LSLLVIFH QTC PAILPPL+ IY  +  D LTGVR  AE +I  I AANIARFK + + 
Sbjct: 183  LSLLVIFHLQTCVPAILPPLRVIYPKSAADDLTGVRKTAEESIAQITAANIARFKLDTAK 242

Query: 2154 NANRSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIEDPF 2333
            + NRSSLSELL+SFF KFS INV A E  VC +TG+WE ++    W  +T Y L +EDPF
Sbjct: 243  SPNRSSLSELLVSFFAKFSDINVKAQELGVCPFTGRWENISSNSRWLPKT-YSLFVEDPF 301

Query: 2334 EQPENAARAVSSVEFFRISEAFAGTHRKLSSYRDRNSLISSLVRPEIRSSVHRRPESSST 2513
            EQP+NAAR+VS     RI++ F  T R+L+S  +RNS+I  +   +I+ S++R     + 
Sbjct: 302  EQPQNAARSVSRRNLDRIAQVFQMTSRRLASDCNRNSIIGVMTGQQIQQSLYRTISLHNQ 361

Query: 2514 VLQNQFRNMRLESNPSPSIGFSQRPGSSAQSQGQQRWRERYN 2639
               N   N+R       ++    RP +    Q QQ W + YN
Sbjct: 362  HHANGTHNVR-------NLHGQSRPWN---QQLQQNWSQSYN 393


>ref|NP_181504.2| HEN1 suppressor 1 [Arabidopsis thaliana] gi|53850481|gb|AAU95417.1|
            At2g39740 [Arabidopsis thaliana]
            gi|55733735|gb|AAV59264.1| At2g39740 [Arabidopsis
            thaliana] gi|330254623|gb|AEC09717.1| HEN1 suppressor 1
            [Arabidopsis thaliana]
          Length = 511

 Score =  380 bits (977), Expect = e-102
 Identities = 209/405 (51%), Positives = 269/405 (66%)
 Frame = +3

Query: 1425 MSYNGALELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNL 1604
            MS N  L+ +L++IL  IKP+  D  TR+ VI++ R V++S+E LRGATV+PFGS+VSNL
Sbjct: 1    MSRNPFLDPTLQEILQVIKPTRADRDTRITVIDQLRDVLQSVECLRGATVQPFGSFVSNL 60

Query: 1605 YTKWGDLDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKF 1784
            +T+WGDLDIS+++ +GS I    ++ KQ LL  + + LR  G    ++++ +ARVP+LK 
Sbjct: 61   FTRWGDLDISVDLFSGSSILFTGKKQKQTLLGHLLRALRASGLWYKLQFVIHARVPILKV 120

Query: 1785 ESIHQNIFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLN 1964
             S HQ I CDISI NL G +KS+F  WI+EID RFRD++LLVKEWAK  NIND K GT N
Sbjct: 121  VSGHQRISCDISIDNLDGLLKSRFLFWISEIDGRFRDLVLLVKEWAKAHNINDSKTGTFN 180

Query: 1965 SYSLSLLVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRN 2144
            SYSLSLLVIFHFQTC PAILPPL+ IY  + VD LTGVR  AE +I  + AANIARFK  
Sbjct: 181  SYSLSLLVIFHFQTCVPAILPPLRVIYPKSAVDDLTGVRKTAEESIAQVTAANIARFKSE 240

Query: 2145 NSINANRSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIE 2324
             + + NRSSLSELL+SFF KFS INV A E  VC +TG+WE ++    W  +T Y L +E
Sbjct: 241  RAKSVNRSSLSELLVSFFAKFSDINVKAQEFGVCPFTGRWETISSNTTWLPKT-YSLFVE 299

Query: 2325 DPFEQPENAARAVSSVEFFRISEAFAGTHRKLSSYRDRNSLISSLVRPEIRSSVHRRPES 2504
            DPFEQP NAAR+VS     RI++ F  T R+L S  +RNS+I  L    I+ S++R    
Sbjct: 300  DPFEQPVNAARSVSRRNLDRIAQVFQITSRRLVSECNRNSIIGILTGQHIQESLYRTISL 359

Query: 2505 SSTVLQNQFRNMRLESNPSPSIGFSQRPGSSAQSQGQQRWRERYN 2639
             S    N   N+R       ++    RP      Q QQ W + YN
Sbjct: 360  PSQHHANGMHNVR-------NLHGQARP---QNQQMQQNWSQSYN 394


>dbj|BAE99845.1| hypothetical protein [Arabidopsis thaliana]
          Length = 511

 Score =  379 bits (974), Expect = e-102
 Identities = 209/405 (51%), Positives = 269/405 (66%)
 Frame = +3

Query: 1425 MSYNGALELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNL 1604
            MS N  L+ +L++IL  IKP+  D  TR+ VI++ R V++S+E LRGATV+PFGS+VSNL
Sbjct: 1    MSRNPFLDPTLQEILQVIKPTRADRDTRITVIDQLRDVLQSVECLRGATVQPFGSFVSNL 60

Query: 1605 YTKWGDLDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKF 1784
            +T+WGDLDIS+++ +GS I    ++ KQ LL  + + LR  G    ++++ +ARVP+LK 
Sbjct: 61   FTRWGDLDISVDLFSGSSILFTGKKQKQILLGHLLRALRASGLWYKLQFVIHARVPILKV 120

Query: 1785 ESIHQNIFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLN 1964
             S HQ I CDISI NL G +KS+F  WI+EID RFRD++LLVKEWAK  NIND K GT N
Sbjct: 121  VSGHQRISCDISIDNLDGLLKSRFLFWISEIDGRFRDLVLLVKEWAKAHNINDSKTGTFN 180

Query: 1965 SYSLSLLVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRN 2144
            SYSLSLLVIFHFQTC PAILPPL+ IY  + VD LTGVR  AE +I  + AANIARFK  
Sbjct: 181  SYSLSLLVIFHFQTCVPAILPPLRVIYPKSAVDDLTGVRKTAEESIAQVTAANIARFKSE 240

Query: 2145 NSINANRSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIE 2324
             + + NRSSLSELL+SFF KFS INV A E  VC +TG+WE ++    W  +T Y L +E
Sbjct: 241  RAKSVNRSSLSELLVSFFAKFSDINVKAQEFGVCPFTGRWETISSNTTWLPKT-YSLFVE 299

Query: 2325 DPFEQPENAARAVSSVEFFRISEAFAGTHRKLSSYRDRNSLISSLVRPEIRSSVHRRPES 2504
            DPFEQP NAAR+VS     RI++ F  T R+L S  +RNS+I  L    I+ S++R    
Sbjct: 300  DPFEQPVNAARSVSRRNLDRIAQVFQITSRRLVSECNRNSIIGILTGQHIQESLYRTISL 359

Query: 2505 SSTVLQNQFRNMRLESNPSPSIGFSQRPGSSAQSQGQQRWRERYN 2639
             S    N   N+R       ++    RP      Q QQ W + YN
Sbjct: 360  PSQHHANGMHNVR-------NLHGQARP---QNQQMQQNWSQSYN 394


>ref|XP_006601982.1| PREDICTED: poly(A) RNA polymerase cid11-like isoform X1 [Glycine max]
            gi|571542766|ref|XP_006601983.1| PREDICTED: poly(A) RNA
            polymerase cid11-like isoform X2 [Glycine max]
            gi|571542770|ref|XP_006601984.1| PREDICTED: poly(A) RNA
            polymerase cid11-like isoform X3 [Glycine max]
            gi|571542774|ref|XP_006601985.1| PREDICTED: poly(A) RNA
            polymerase cid11-like isoform X4 [Glycine max]
          Length = 415

 Score =  370 bits (950), Expect = 2e-99
 Identities = 197/415 (47%), Positives = 272/415 (65%), Gaps = 12/415 (2%)
 Frame = +3

Query: 1425 MSYNGALELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNL 1604
            MS +  L++ + DIL  + P ++D   R  +IN+ R++VES+E LRGATVEPFGS+VSNL
Sbjct: 1    MSTHSTLDIVVNDILRVVTPVQEDWEIRFAIINDLRSIVESVESLRGATVEPFGSFVSNL 60

Query: 1605 YTKWGDLDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKF 1784
            +T+WGDLDISIE+ NG  ISS  ++ KQ  L D+ K LR +GG  ++++I NARVP+LKF
Sbjct: 61   FTRWGDLDISIELSNGLHISSAGKKQKQTFLGDVLKALRMKGGGSNLQFISNARVPILKF 120

Query: 1785 ESIHQNIFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLN 1964
            +S  Q + CDISI NL GQ+KSK   WI +ID RFR M+LLVKEWAK   IN+ K GT N
Sbjct: 121  KSYRQGVSCDISINNLPGQMKSKILLWINKIDGRFRHMVLLVKEWAKAHKINNSKAGTFN 180

Query: 1965 SYSLSLLVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRN 2144
            SYSLSLLVIF+FQTC PAI PPL++IY GN+VD L GVR+ AE  I   C ANI RF  N
Sbjct: 181  SYSLSLLVIFYFQTCIPAIFPPLKDIYPGNMVDDLIGVRSDAENLIAQTCDANINRFISN 240

Query: 2145 NSINANRSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIE 2324
             + + NR S++EL + F  KF++++ MA +  +C Y+G+WE++   + W  +T Y + +E
Sbjct: 241  RARSINRKSVAELFVEFIGKFAKMDSMAVKMGICPYSGKWEQIEDNMIWLPKT-YAIFVE 299

Query: 2325 DPFEQPENAARAVSSVEFFRISEAFAGTHRKL-SSYRDRNSLISS---------LVRPEI 2474
            DPFEQP+N AR+VS+ +  +I+EAFA TH  L S+ +++ SL+S+         + RP  
Sbjct: 300  DPFEQPQNTARSVSAGQLKKITEAFARTHDLLTSTNQNQISLLSNMAPAHVIRCITRPYG 359

Query: 2475 RSSVHRRPESSSTVLQNQFRNMRLESNPS--PSIGFSQRPGSSAQSQGQQRWRER 2633
                H         ++ Q ++ R   N S   S   S   G +   +GQQ WR +
Sbjct: 360  GGYFHPTQPQVQRAIRPQLQSQRHFQNVSQGTSSNSSSSKGHTLVHRGQQIWRPK 414


>ref|XP_006591354.1| PREDICTED: poly(A) RNA polymerase cid11-like isoform X1 [Glycine max]
            gi|571489968|ref|XP_006591355.1| PREDICTED: poly(A) RNA
            polymerase cid11-like isoform X2 [Glycine max]
          Length = 455

 Score =  366 bits (939), Expect = 4e-98
 Identities = 183/357 (51%), Positives = 254/357 (71%), Gaps = 1/357 (0%)
 Frame = +3

Query: 1425 MSYNGALELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNL 1604
            MS +  L++ + DIL  + P ++D   R  +IN+FR++VES+E LRGATVEP+GS+VSNL
Sbjct: 1    MSTHSMLDIVVNDILRVVTPLQEDWEIRFAIINDFRSIVESVESLRGATVEPYGSFVSNL 60

Query: 1605 YTKWGDLDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKF 1784
            +T+WGDLDISIE+ NG  ISS  ++ KQ LL ++ K LR +GG  ++++I NARVP+LKF
Sbjct: 61   FTRWGDLDISIELSNGLHISSAGKKQKQTLLGEVLKALRMKGGGSNLQFISNARVPILKF 120

Query: 1785 ESIHQNIFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLN 1964
            +S  Q + CDISI NL GQ+KSK   WI +ID RFR M+LLVKEWAK   IN+ K GT N
Sbjct: 121  KSYRQGVSCDISINNLPGQMKSKILLWINKIDGRFRHMVLLVKEWAKAHKINNSKAGTFN 180

Query: 1965 SYSLSLLVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRN 2144
            SYSLSLLVIF+FQTC PAI PPL++IY GN++D L G+R+ AE  I   C ANI RF  N
Sbjct: 181  SYSLSLLVIFYFQTCIPAIFPPLKDIYPGNMIDDLIGIRSDAENLIAETCDANINRFISN 240

Query: 2145 NSINANRSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIE 2324
             + + NR S++EL + F  KF++++ MA E  +C YTG+WE++   + W  +T Y + +E
Sbjct: 241  RARSINRKSVAELFVDFVGKFAKMDSMAVEMGICPYTGKWEQIEDNMIWLPKT-YAIFVE 299

Query: 2325 DPFEQPENAARAVSSVEFFRISEAFAGTHRKL-SSYRDRNSLISSLVRPEIRSSVHR 2492
            DPFEQP+N AR+VS+ +  +I+E FA TH  L S+ +++ SL+S+L    +   + R
Sbjct: 300  DPFEQPQNTARSVSAGQLKKITETFARTHDLLTSTNQNQISLLSNLAPAHVIRCITR 356


>ref|XP_006411208.1| hypothetical protein EUTSA_v10016564mg [Eutrema salsugineum]
            gi|567214704|ref|XP_006411209.1| hypothetical protein
            EUTSA_v10016564mg [Eutrema salsugineum]
            gi|557112377|gb|ESQ52661.1| hypothetical protein
            EUTSA_v10016564mg [Eutrema salsugineum]
            gi|557112378|gb|ESQ52662.1| hypothetical protein
            EUTSA_v10016564mg [Eutrema salsugineum]
          Length = 493

 Score =  363 bits (931), Expect = 3e-97
 Identities = 205/421 (48%), Positives = 275/421 (65%), Gaps = 15/421 (3%)
 Frame = +3

Query: 1425 MSYNGALELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNL 1604
            MS N   + +L+DIL AIKP+  D   R+ VI++ R+ ++S+E LRGATV+PFGS+VSNL
Sbjct: 1    MSRNPVFDPTLQDILQAIKPTGADWDARMTVIDQLRSALQSVESLRGATVQPFGSFVSNL 60

Query: 1605 YTKWGDLDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKF 1784
            +T+WGDLDIS+++ +GS I    ++ KQ  L  + + LR  G    ++++ +ARVP+LK 
Sbjct: 61   FTRWGDLDISVDLFSGSSILFTGKKQKQTFLGQLLRALRASGAWYRLQFVAHARVPILKV 120

Query: 1785 ESIHQNIFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLN 1964
             S HQ I CDISI NL G +KS+F  WI+EID RFRD++LLVKEWAK  +IN+PKNGT N
Sbjct: 121  VSGHQRISCDISIDNLEGLLKSRFLFWISEIDWRFRDLVLLVKEWAKAHDINNPKNGTFN 180

Query: 1965 SYSLSLLVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRN 2144
            SYSLSLLVIFH QTC PAILPPL +IY  + VD L     IA+     + AANIARF+  
Sbjct: 181  SYSLSLLVIFHLQTCVPAILPPLGDIYPRSAVDDLKVAACIAQ-----LSAANIARFRSG 235

Query: 2145 NSINANRSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIE 2324
             S   NRSSLSELL+SFF KFS INV A E  VC +TG+WE ++    W  +T Y L +E
Sbjct: 236  TSRAVNRSSLSELLVSFFAKFSDINVKAKELGVCPFTGRWENISSNTRWLPKT-YSLFVE 294

Query: 2325 DPFEQPENAARAVSSVEFFRISEAFAGTHRKLSSYRDRNSLISSLVRPEI------RSSV 2486
            DPFEQPENAAR+VS     RI++ F  T R+L++  +RNS++  L  P I      R+S+
Sbjct: 295  DPFEQPENAARSVSRKSLDRIAQVFEMTSRRLATDCNRNSIVGVLTSPHISQPLCSRTSL 354

Query: 2487 HRRPESSST----VLQNQFR--NMRLESNPSPSIGFSQRP---GSSAQSQGQQRWRERYN 2639
            H    ++       L  Q R  N +++ + S S  + Q P    +SA+S+ QQ W +   
Sbjct: 355  HNHHHANGVNNGHNLHGQSRPWNHQMQQHWSQS-NYVQNPPYWPASARSRAQQNWSQNNP 413

Query: 2640 R 2642
            R
Sbjct: 414  R 414


>ref|XP_007163461.1| hypothetical protein PHAVU_001G236100g [Phaseolus vulgaris]
            gi|561036925|gb|ESW35455.1| hypothetical protein
            PHAVU_001G236100g [Phaseolus vulgaris]
          Length = 509

 Score =  360 bits (924), Expect = 2e-96
 Identities = 179/355 (50%), Positives = 249/355 (70%), Gaps = 1/355 (0%)
 Frame = +3

Query: 1425 MSYNGALELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNL 1604
            MS +  L++ L+DIL  + P ++D + R  ++N+ R++VES+E LRGATVEPFGS+VSNL
Sbjct: 1    MSTHSMLDIVLKDILQVVTPLQEDWQIRFAILNDLRSIVESVESLRGATVEPFGSFVSNL 60

Query: 1605 YTKWGDLDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKF 1784
            +T+WGDLDISIE+ NG  ISS  ++ KQ LL ++ K LR +G    +++I +ARVP+LKF
Sbjct: 61   FTRWGDLDISIELSNGLHISSAGKKQKQTLLGEVLKALRMKGAGSHLQFISSARVPILKF 120

Query: 1785 ESIHQNIFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLN 1964
            +S  Q + CDISI NL GQ+KSK   WI +ID RF DM+LLVKEWAK   IN+ K GT N
Sbjct: 121  KSNRQGVSCDISINNLPGQMKSKILLWINKIDGRFHDMVLLVKEWAKAHKINNSKTGTFN 180

Query: 1965 SYSLSLLVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRN 2144
            SYSLSLLVIFHFQTC PAILPPL+ IY GN+VD L G+R  AE  I   C A I R   N
Sbjct: 181  SYSLSLLVIFHFQTCVPAILPPLKYIYPGNMVDDLKGIRADAENLIAETCNAGINRHISN 240

Query: 2145 NSINANRSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIE 2324
             + + N+ S+ +L + F +K+++++  ASE  +C YTGQWE++     W  +T Y + +E
Sbjct: 241  TARSINKKSVPDLFVEFLRKYAQMDSWASELGICPYTGQWEQIENNTIWLPKT-YSIFVE 299

Query: 2325 DPFEQPENAARAVSSVEFFRISEAFAGTHRKLSS-YRDRNSLISSLVRPEIRSSV 2486
            DPFEQP+N AR+V++ +  +IS+ F+ T+  LSS + + NSL++ L  P +  S+
Sbjct: 300  DPFEQPQNTARSVNAGQLKKISDTFSKTYAFLSSNHHNLNSLLTMLAPPHVVKSI 354


>ref|XP_004503175.1| PREDICTED: poly(A) RNA polymerase GLD2-like [Cicer arietinum]
          Length = 491

 Score =  356 bits (914), Expect = 3e-95
 Identities = 185/351 (52%), Positives = 245/351 (69%), Gaps = 1/351 (0%)
 Frame = +3

Query: 1425 MSYNGALELSLRDILFAIKPSEDDERTRVHVINEFRAVVESMEILRGATVEPFGSYVSNL 1604
            MS +  L   L DIL  I PS++D   R  +IN+ R++ ES++ LRGATVEPFGS+VSNL
Sbjct: 1    MSTHNMLGNVLNDILQVITPSQEDWAIRFAIINDLRSIAESVQSLRGATVEPFGSFVSNL 60

Query: 1605 YTKWGDLDISIEIPNGSFISSPTRRSKQNLLRDIRKVLRRRGGARSIEYIPNARVPLLKF 1784
            +T+WGD+DISIE+ NGS I+S  R+ KQ LL D  +VLR +GG  +++ I NARVP+LKF
Sbjct: 61   FTRWGDVDISIELLNGSHIASVGRKQKQTLLGDFLRVLRLKGGYMNMQLILNARVPILKF 120

Query: 1785 ESIHQNIFCDISIGNLVGQIKSKFFRWITEIDERFRDMILLVKEWAKLQNINDPKNGTLN 1964
             S  Q I CD+SI NL G +KSKF  WI  ID RF DM+L+VKEWAK   IN+ + G+ N
Sbjct: 121  RSKQQGISCDVSINNLPGLMKSKFLLWINRIDGRFHDMVLVVKEWAKAHRINNSRTGSFN 180

Query: 1965 SYSLSLLVIFHFQTCEPAILPPLQEIYEGNIVDALTGVRTIAERNIQGICAANIARFKRN 2144
            SYSLSLLVIFHFQTC PAILPPL++IY  N+VD L GVR   E  I   C ANI RF  +
Sbjct: 181  SYSLSLLVIFHFQTCAPAILPPLKDIYPANMVDELRGVRADVENLISETCGANINRFISD 240

Query: 2145 NSINANRSSLSELLISFFQKFSRINVMASENAVCTYTGQWERLTKKVGWTSRTYYPLLIE 2324
             S   NR S+ EL I F +KF++++  ASE  +C Y+GQ E++   + W  +T Y + +E
Sbjct: 241  KSRTINRKSVPELFIDFLRKFAQMDSWASELGICPYSGQREQIKNNMRWLPKT-YAIFVE 299

Query: 2325 DPFEQPENAARAVSSVEFFRISEAFAGTHRKLSS-YRDRNSLISSLVRPEI 2474
            DPFEQPEN+AR+VS+ +  +I+EAF  T+  L+S  +++NSL++ L  P I
Sbjct: 300  DPFEQPENSARSVSAGQLRKIAEAFLKTYSLLTSKNQNQNSLLACLAPPHI 350


Top