BLASTX nr result

ID: Atropa21_contig00035012

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00035012
         (699 letters)

Database: nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004230024.1| PREDICTED: LOW QUALITY PROTEIN: DNA-directed...   192   1e-46
gb|AAY89359.1| RNA polymerase IV largest subunit [Solanum lycope...   167   3e-39
gb|AAX12374.1| DNA-directed RNA polymerase alpha subunit [Spinac...   102   2e-19
ref|XP_002299207.2| hypothetical protein POPTR_0001s06460g [Popu...    98   3e-18
ref|XP_002303926.2| hypothetical protein POPTR_0003s19630g [Popu...    98   3e-18
ref|XP_006286895.1| hypothetical protein CARUB_v10000039mg [Caps...    89   2e-15
gb|AAY89362.1| RNA polymerase IV largest subunit [Arabidopsis th...    87   5e-15
ref|NP_181532.2| nuclear RNA polymerase D1B [Arabidopsis thalian...    87   5e-15
gb|AAB95289.1| unknown protein [Arabidopsis thaliana]                  87   5e-15
ref|XP_002871085.1| hypothetical protein ARALYDRAFT_487210 [Arab...    84   6e-14
ref|NP_196049.1| kow domain-containing transcription factor 1 [A...    83   7e-14
gb|EMJ20080.1| hypothetical protein PRUPE_ppa000088mg [Prunus pe...    83   1e-13
ref|XP_002879839.1| NRPD1b [Arabidopsis lyrata subsp. lyrata] gi...    81   3e-13
ref|XP_006420718.1| hypothetical protein CICLE_v10004129mg [Citr...    80   5e-13
ref|XP_006436520.1| hypothetical protein CICLE_v10030480mg [Citr...    79   1e-12
ref|XP_003627850.1| Protein DCL [Medicago truncatula] gi|3555218...    79   1e-12
ref|XP_003627838.1| DNA-directed RNA polymerase subunit [Medicag...    79   1e-12
ref|XP_001436180.1| hypothetical protein [Paramecium tetraurelia...    79   1e-12
emb|CAI45859.1| NOWA1 protein [Paramecium tetraurelia]                 79   1e-12
ref|XP_004308588.1| PREDICTED: DNA-directed RNA polymerase E sub...    79   2e-12

>ref|XP_004230024.1| PREDICTED: LOW QUALITY PROTEIN: DNA-directed RNA polymerase E subunit
            1 [Solanum lycopersicum]
          Length = 1632

 Score =  192 bits (487), Expect = 1e-46
 Identities = 115/238 (48%), Positives = 132/238 (55%), Gaps = 7/238 (2%)
 Frame = +2

Query: 5    RDGGSSWGQKVD---KDGGSWGEKVETKNSPQLSGKEQSGSWSSWAKPVEKDGGSSWGKK 175
            +  GS+W +      K GGSW    +T+N  +  G  QS SWS+W            GKK
Sbjct: 1316 KPSGSAWEKASSGSVKSGGSWDMAGKTQNGAE-EGVNQSDSWSAW------------GKK 1362

Query: 176  VDEPENNPHQS----QSGSWSSLGKKVEKDGGSWDGPKQSNSDSSWGKATKGGGLGSTTA 343
            VDEPENN  QS    QSGSWSS GKKVEKDGGSWD PKQ NS+SSWGKA  GGGLGS TA
Sbjct: 1363 VDEPENNRQQSGSGEQSGSWSSWGKKVEKDGGSWDEPKQLNSESSWGKAPNGGGLGSATA 1422

Query: 344  EGNRRLDQSINDWNGSSSGDGQLNESTRDDSTNIGGWNSSTVGGWDSQKVGVEESDKQPQ 523
            EGN+RLDQS+NDW+ S S DGQ   +T         W       W               
Sbjct: 1423 EGNKRLDQSVNDWSSSVSRDGQKXTNTXXLYKK---WWLEFFKRW--------------- 1464

Query: 524  WGQRRRNSRGDFKENSRGWGSASGGDWKSNRPPRSADDSNRGVNLTATRQKLDIFTAE 697
                       + E S GW       WK+NRP RSADDSNRG + TATRQK+D+FTAE
Sbjct: 1465 -----------WLELSEGW------QWKNNRPARSADDSNRGGHFTATRQKIDLFTAE 1505


>gb|AAY89359.1| RNA polymerase IV largest subunit [Solanum lycopersicum]
          Length = 1127

 Score =  167 bits (423), Expect = 3e-39
 Identities = 106/238 (44%), Positives = 124/238 (52%), Gaps = 7/238 (2%)
 Frame = +2

Query: 5    RDGGSSWGQKVD---KDGGSWGEKVETKNSPQLSGKEQSGSWSSWAKPVEKDGGSSWGKK 175
            +  GS+W +      K GGSW    +T+N  +  G  QS SWSSW            GKK
Sbjct: 867  KPSGSAWEEASSGSVKSGGSWDMAGKTQNGAE-EGVNQSDSWSSW------------GKK 913

Query: 176  VDEPENNPHQS----QSGSWSSLGKKVEKDGGSWDGPKQSNSDSSWGKATKGGGLGSTTA 343
            VDEPENN  QS    QSGSWS  G++ +K     D PKQ NS+SSWGKA  GGGLGS TA
Sbjct: 914  VDEPENNRQQSGSGEQSGSWSPWGRRWKKMVVLGDEPKQLNSESSWGKAPNGGGLGSATA 973

Query: 344  EGNRRLDQSINDWNGSSSGDGQLNESTRDDSTNIGGWNSSTVGGWDSQKVGVEESDKQPQ 523
            EGNRRLDQS+NDW+ S S DGQ  +           W       W               
Sbjct: 974  EGNRRLDQSVNDWSSSVSRDGQYKK-----------WWLEFFKRW--------------- 1007

Query: 524  WGQRRRNSRGDFKENSRGWGSASGGDWKSNRPPRSADDSNRGVNLTATRQKLDIFTAE 697
                       + E S GW       WK+NRP RSADDSNRG + TATRQK+D+FTAE
Sbjct: 1008 -----------WLELSGGW------QWKNNRPARSADDSNRGGHFTATRQKIDLFTAE 1048


>gb|AAX12374.1| DNA-directed RNA polymerase alpha subunit [Spinacia oleracea]
          Length = 1902

 Score =  102 bits (253), Expect = 2e-19
 Identities = 76/247 (30%), Positives = 117/247 (47%), Gaps = 20/247 (8%)
 Frame = +2

Query: 17   SSWGQKVDKDGGSWGEKVETKNSPQLSGKEQSGSWSSWAKPVEKDGGSSWG-KKVDEPEN 193
            SSW Q+ D    +W +  E ++S + +    SG WS+  K +     SSWG +K D P+ 
Sbjct: 1517 SSWAQQGDS---TWKDSKEARSSVKANNSTNSGGWST-GKALVDGVSSSWGSQKEDRPQP 1572

Query: 194  NPHQSQSGSWSSLGKKVEKDG-GSWDGPK-QSNSDSSWGKATKGGGLGSTTAE--GNRRL 361
              +    G   +  K  +++G  SWD  K +  + SSWG+ ++      ++A+  G+ + 
Sbjct: 1573 KSNDRSVGD-GNFDKDAKEEGLSSWDAKKVERKTQSSWGQPSESKNSAQSSADHWGSDKS 1631

Query: 362  DQSINDWNGSSSGDGQLNESTRDDSTNIGGWNSSTVGGW--DSQKVGVEESDKQPQWGQR 535
            +Q      G SSG G  + +   DS     W  S V  W  +S +      D Q  WGQ 
Sbjct: 1632 NQP-----GKSSGWGSEDTNAGKDSEKQDSWGKSNVSTWKKESGEKLHGSDDSQSPWGQP 1686

Query: 536  RRNSRGDFK-ENSRGWGSASGGDWKS------------NRPPRSADDSNRGVNLTATRQK 676
              +     + E  RGWGS++ G+WKS            NRPPR  +D +  V LTATR++
Sbjct: 1687 GGSGWNKKQPEGGRGWGSSNTGEWKSRKNQNQNQNQNQNRPPRGPNDDSPRVALTATRKR 1746

Query: 677  LDIFTAE 697
            +D F  E
Sbjct: 1747 MDEFPTE 1753



 Score = 63.2 bits (152), Expect = 8e-08
 Identities = 66/238 (27%), Positives = 98/238 (41%), Gaps = 28/238 (11%)
 Frame = +2

Query: 44   DGGSWGEKVETKNSPQLSGKEQSGSWSSWAKPVEKDGGSSWGKKVDEPENNPHQSQSGSW 223
            D  SW +   +  +   + K  +GS S+      + G SSWG K D+  N    S++G W
Sbjct: 1109 DESSW-DAFPSSGTGWNANKIDTGSGSA------EGGWSSWGSKKDQA-NPEDSSKTGGW 1160

Query: 224  SSLGKKVE-------KDGG-----SWDGPKQSNSDSSWGKATKGGGLGSTTAEGNRRLDQ 367
            SS G K +       K GG     SW G  Q +    WG+  K               D 
Sbjct: 1161 SSGGSKQKPQPEDSSKSGGWDASKSWGGSNQGDPSPVWGQPVKATN------------DI 1208

Query: 368  SINDWNGSSS--GDGQLNESTRDDSTNIGGWNSSTVGGWDSQK--VGVEESDKQPQWGQ- 532
            SI + +GS S  G G  N   + D +     NSST GGWD+ K   G +  D    WG  
Sbjct: 1209 SIENDHGSGSAEGGGWANSGMKKDLSK--QENSSTAGGWDASKSWSGSKPKDPSSAWGAG 1266

Query: 533  ---------RRRNSRGDFKENSRGWGSASGGDWKSN--RPPRSADDSNRGVNLTATRQ 673
                     ++ +S+ D    S   G  SG   K +  +P  SA ++  G + + +++
Sbjct: 1267 KKTDDNNGWKKSDSKKDLASGSVEDGGCSGWGPKKDLLQPEDSAGENGWGASKSKSKE 1324



 Score = 63.2 bits (152), Expect = 8e-08
 Identities = 60/227 (26%), Positives = 84/227 (37%), Gaps = 18/227 (7%)
 Frame = +2

Query: 11   GGSSWGQKVDKDG-------GSWGEKVETKNSPQLSGKEQSGSWS---SWAKPVEKDGGS 160
            G SSWG K D+         G W     +K  PQ     +SG W    SW    + D   
Sbjct: 1138 GWSSWGSKKDQANPEDSSKTGGWSSG-GSKQKPQPEDSSKSGGWDASKSWGGSNQGDPSP 1196

Query: 161  SWGKKVDEPENNPHQSQSGSWSSLGKKVEKDGGSWDGPKQSNSDSSWGKATKGGGLGSTT 340
             WG+ V    +   ++  GS S+ G      G   D  KQ NS      +T GG   S +
Sbjct: 1197 VWGQPVKATNDISIENDHGSGSAEGGGWANSGMKKDLSKQENS------STAGGWDASKS 1250

Query: 341  AEGNRRLDQSINDWNGSSSGDGQLNESTRDDSTNI--GGWNSSTVGGWDSQK--VGVEES 508
              G++  D S + W      D        D   ++  G        GW  +K  +  E+S
Sbjct: 1251 WSGSKPKDPS-SAWGAGKKTDDNNGWKKSDSKKDLASGSVEDGGCSGWGPKKDLLQPEDS 1309

Query: 509  DKQPQWGQRRRNSRGDFKENSRGWG----SASGGDWKSNRPPRSADD 637
              +  WG  +  S    KE S  WG          WK N P R +++
Sbjct: 1310 AGENGWGASKSKS----KEPSSAWGKPAQETDNIGWKKNNPQRDSEN 1352


>ref|XP_002299207.2| hypothetical protein POPTR_0001s06460g [Populus trichocarpa]
            gi|550346662|gb|EEE84012.2| hypothetical protein
            POPTR_0001s06460g [Populus trichocarpa]
          Length = 888

 Score = 97.8 bits (242), Expect = 3e-18
 Identities = 77/248 (31%), Positives = 108/248 (43%), Gaps = 16/248 (6%)
 Frame = +2

Query: 2    ERDGGSSWGQK--------VDKDGGSWGEKV--ETKNSPQLSGKEQSGSWSSWAKPVEKD 151
            + D  S WG+         V   GG  G ++  +T+N  QL G ++SG W +       D
Sbjct: 603  QADTASGWGKSKSLDRGWGVSNSGGGNGNEMNNKTENQSQLEGGKESGGWGA----KNTD 658

Query: 152  GGSSWGKKVDEPENNPHQSQSGSWSSLGKKVEKDGGSWDGPKQSNSDSSWGKATKGGGLG 331
                WG KV     N +Q+ + S              W  PK  + D  WG +  GGG G
Sbjct: 659  ADKPWGNKV-----NSNQADTAS-------------CWGKPK--SPDLGWGVSNSGGGNG 698

Query: 332  STTAEGNRRLDQSINDWNGSSSGDGQLNESTRDDSTNIGGWNSS-----TVGGWD-SQKV 493
            S   +     +QS+ D    S G G+    ++       GW SS      V GW      
Sbjct: 699  SEMEDKTE--NQSLLDRGKESGGWGKPKSISQ-------GWGSSKDSVKAVDGWGVPNSA 749

Query: 494  GVEESDKQPQWGQRRRNSRGDFKENSRGWGSASGGDWKSNRPPRSADDSNRGVNLTATRQ 673
            G   S++  QWGQ+    + +  E SRGWGS +G   K NRP +  +DS+     T TRQ
Sbjct: 750  GSNGSERDQQWGQQSGEFKKNRTEGSRGWGSNNGHWKKRNRPSKPHEDSSSSGLFTMTRQ 809

Query: 674  KLDIFTAE 697
             LDIFT++
Sbjct: 810  WLDIFTSQ 817



 Score = 57.0 bits (136), Expect = 6e-06
 Identities = 58/234 (24%), Positives = 92/234 (39%), Gaps = 34/234 (14%)
 Frame = +2

Query: 41   KDGGSWGEKVETKNSPQLSGKEQSGSWSS--------WAKPV---EKDGGSSWGKKVDEP 187
            K+   W +K  +  +    G + + SW +        W + V   + D   SWG+    P
Sbjct: 490  KESSDWNKKSNSNQTDAACGSKAASSWGAKNTDADKRWGRKVDLNQADTSCSWGRS-KTP 548

Query: 188  ENNPHQSQSGSWSSLGKKVE------------KDGGSWDGPKQSNSDSSWGKATKGGGLG 331
            +     S SG   S+G ++E            K+   W G K +++D  W  + K     
Sbjct: 549  DRGWGLSNSG--GSIGSEMENKTENQSLLDRGKESVGW-GTKNTDADKPW--SNKVNSNQ 603

Query: 332  STTAEGNRRLDQSINDWNGSSSGDG---QLNESTRDDSTNIGGWNSSTVGGWDSQKVGVE 502
            + TA G  +       W  S+SG G   ++N  T + S   GG  S   GGW     G +
Sbjct: 604  ADTASGWGKSKSLDRGWGVSNSGGGNGNEMNNKTENQSQLEGGKES---GGW-----GAK 655

Query: 503  ESDKQPQWGQRRRNSRGDF-------KENSRGWG-SASGGDWKSNRPPRSADDS 640
             +D    WG +  +++ D        K    GWG S SGG   S    ++ + S
Sbjct: 656  NTDADKPWGNKVNSNQADTASCWGKPKSPDLGWGVSNSGGGNGSEMEDKTENQS 709


>ref|XP_002303926.2| hypothetical protein POPTR_0003s19630g [Populus trichocarpa]
            gi|550343552|gb|EEE78905.2| hypothetical protein
            POPTR_0003s19630g [Populus trichocarpa]
          Length = 1920

 Score = 97.8 bits (242), Expect = 3e-18
 Identities = 81/271 (29%), Positives = 115/271 (42%), Gaps = 44/271 (16%)
 Frame = +2

Query: 17   SSWGQKVDKDGGSWGEKVETKNSPQLSG------KEQSGSWSSWAKPVEKDGG----SSW 166
            S+WG +       WG++V +  +   SG       E S  W S  + V+ D G    SS 
Sbjct: 1519 STWGAENTDGDKLWGKEVSSNQADTASGWGKPKSPEISLGWGSTKESVKSDRGWGVSSSG 1578

Query: 167  GKKVDEPENNPHQSQ---SGSWSSLGKKVEKDGGS-WDGPKQSNSDSSWGKATKGG---- 322
            G +  + EN     Q   SG W +     + D  S W  PK S +   WG + + G    
Sbjct: 1579 GGRDKKTENQSLAGQGKESGGWGNKVTSNQADTASGWGKPKSSENSQGWGLSKESGKEVH 1638

Query: 323  ---------GLGSTTAEGNRRLDQSINDWNGSSSGDGQLNESTRDDSTNIG--------- 448
                     G GS T   N   +QS+ +    S  D + + +    ++  G         
Sbjct: 1639 EWGVPNSAGGNGSETNNNNE--NQSLVEQGKESGWDNKASSNQEGTASGWGKPKSPALSE 1696

Query: 449  GWNS-----STVGGWD-SQKVGVEESDKQPQWGQRRRNSRGDFKENSRGWGSASGGDWKS 610
            GW S       V GW      G   S +  QWGQ+ R  + D  E SRGWGS + GDWK+
Sbjct: 1697 GWGSPREPVKAVHGWGVPNSGGGNGSGRDQQWGQQSREFKKDRFEGSRGWGS-NNGDWKN 1755

Query: 611  --NRPPRSADDSNRGVNLTATRQKLDIFTAE 697
              NRP +  +D N     T TRQ+LD+FT++
Sbjct: 1756 KRNRPSKPHEDLNASGIFTTTRQRLDVFTSQ 1786



 Score = 57.8 bits (138), Expect = 3e-06
 Identities = 57/250 (22%), Positives = 92/250 (36%), Gaps = 44/250 (17%)
 Frame = +2

Query: 17   SSWGQKVDKDGGSWGEK-------------VETKNSPQLSGKEQSGSWSSWAKPVEKDGG 157
            SS    VDK+ G+  EK             V +      + +  + SW+S     + +  
Sbjct: 1331 SSGNWDVDKNDGAVKEKPWSLGMNTAEANDVASSGWDTAAARTTNNSWNSENNVAQSNSF 1390

Query: 158  SSWGKKVDEPEN------NPHQSQSGSWSSLGKKVEKDGGSWDGPKQSN-------SDSS 298
            S W  K  EP N          + S  W +        G +W    + N       S S 
Sbjct: 1391 SGWATKKPEPHNGFATKVQEEPTTSNDWDA--------GAAWGRKDRDNKFAETNASKSW 1442

Query: 299  WGKATKGGGLGSTTAEGNRRLDQSI--NDWNGSSSGD----GQLNESTRDDSTNIGGWNS 460
            WGK T G   G   ++  R  DQ +  + W+   S D    G  +++T++ +T   GW+S
Sbjct: 1443 WGKVTDGDESGQNKSKNKRPEDQDVGTHGWDDKMSQDQSISGWASKTTQEATTESLGWDS 1502

Query: 461  -------STVGGWDSQKV-GVEESDKQPQWGQRRRNSRGDFKENSRGWGSASGGD----W 604
                       GW +    G E +D    WG+   +++ D    + GWG     +    W
Sbjct: 1503 KGNSNPGDAACGWKAASTWGAENTDGDKLWGKEVSSNQAD---TASGWGKPKSPEISLGW 1559

Query: 605  KSNRPPRSAD 634
             S +    +D
Sbjct: 1560 GSTKESVKSD 1569


>ref|XP_006286895.1| hypothetical protein CARUB_v10000039mg [Capsella rubella]
            gi|482555601|gb|EOA19793.1| hypothetical protein
            CARUB_v10000039mg [Capsella rubella]
          Length = 1437

 Score = 88.6 bits (218), Expect = 2e-15
 Identities = 79/237 (33%), Positives = 105/237 (44%), Gaps = 23/237 (9%)
 Frame = +2

Query: 8    DGGSSWGQKVD--KD--GGSWGEKVETKNSPQLSGKEQSGSWSSWAKPVEKDGGSSWGKK 175
            DGGSSWG+K D  KD  G SWG KV+  +S    GK+  G  SSWAK  + DGGSSWGKK
Sbjct: 1011 DGGSSWGKKDDGHKDDRGSSWGIKVDGGSS---WGKKDDGG-SSWAK--KDDGGSSWGKK 1064

Query: 176  VDEPENNPHQSQSG-SWSSLGK------KVEKDGGSWDGPKQSNSDSSWGKATKGGGLGS 334
             D P +   +   G SW+          K++  G SW   K+ +  SSWGK   GG    
Sbjct: 1065 DDGPSSWGKKDDGGPSWAKKADGGASWGKMDDGGSSWG--KKDDGGSSWGKKDDGGSSWG 1122

Query: 335  TTAEGNRRLDQSINDWNGSSSG---DGQLNESTRDDSTNIGGWNSSTVGGWDSQKV---- 493
               +G     +   D  GSS G   DG  +   +DD  +   W+++  GG+  Q      
Sbjct: 1123 KKDDGGSSWGK--KDDGGSSWGKKDDGGSSWGKKDDGGS--SWDNNDDGGYTEQTYDRGG 1178

Query: 494  ----GVEESDKQPQWGQRRRNSRGDFKENSRGWGSASGG-DWKSNRPPRSADDSNRG 649
                G     ++  W Q  R       E+   W   SGG +W S       +D   G
Sbjct: 1179 RGFGGRRGGGRRGGWDQSGRGRSLSNSEDLGPWNKPSGGSNWGSGSAWGQQNDGGGG 1235



 Score = 87.0 bits (214), Expect = 5e-15
 Identities = 73/228 (32%), Positives = 102/228 (44%), Gaps = 28/228 (12%)
 Frame = +2

Query: 8    DGGSSWGQKVDKDGGSWGEKVETKNSPQLSGKEQSGSWSSWAKPVE--------KDGGSS 163
            DGGSSW +K D  G SWG+K    + P   GK+  G   SWAK  +         DGGSS
Sbjct: 1046 DGGSSWAKK-DDGGSSWGKK---DDGPSSWGKKDDGG-PSWAKKADGGASWGKMDDGGSS 1100

Query: 164  WGKKVDEPENNPHQSQSG-SW-------SSLGKKVEKDGGSWDGPKQSNSDSSWGKATKG 319
            WGKK D   +   +   G SW       SS GKK +  G SW   K+ +  SSWGK   G
Sbjct: 1101 WGKKDDGGSSWGKKDDGGSSWGKKDDGGSSWGKK-DDGGSSWG--KKDDGGSSWGKKDDG 1157

Query: 320  GGLGSTTAEG---NRRLDQSINDWNGSSSGD--GQLNESTR----DDSTNIGGWNSSTVG 472
            G       +G    +  D+    + G   G   G  ++S R     +S ++G WN  + G
Sbjct: 1158 GSSWDNNDDGGYTEQTYDRGGRGFGGRRGGGRRGGWDQSGRGRSLSNSEDLGPWNKPSGG 1217

Query: 473  -GWDSQKVGVEESD--KQPQWGQRRRNSRGDFKENSRGWGSASGGDWK 607
              W S     +++D      WG++  N R  + E+  G     GG ++
Sbjct: 1218 SNWGSGSAWGQQNDGGGGSSWGRQNDNGRKPWNEHGNGGRGFGGGGFR 1265



 Score = 78.6 bits (192), Expect = 2e-12
 Identities = 80/255 (31%), Positives = 110/255 (43%), Gaps = 37/255 (14%)
 Frame = +2

Query: 8    DGGSSWGQKVD--KD--GGSWGEKVETKNS--PQLSGKEQSGSWSSWAKPVEKDGGSSWG 169
            DGGSSWG+K D  KD  G SWG+K +  +S   +  G +  G   SW K  + DGGSSWG
Sbjct: 961  DGGSSWGKKDDGHKDDGGSSWGKKDDGGSSWVKKDDGHKDDGV-LSWGK--KDDGGSSWG 1017

Query: 170  KKVDEPENNPHQSQSGSW-------SSLGKK-------VEKDGGSWDGPKQSNSDSSWGK 307
            KK D  +++    +  SW       SS GKK        +KD G     K+ +  SSWGK
Sbjct: 1018 KKDDGHKDD----RGSSWGIKVDGGSSWGKKDDGGSSWAKKDDGGSSWGKKDDGPSSWGK 1073

Query: 308  ATKGGGLGSTTAEGNR---RLDQSINDW-----NGSSSG---DGQLNESTRDDSTNIGGW 454
               GG   +  A+G     ++D   + W      GSS G   DG  +   +DD  +  G 
Sbjct: 1074 KDDGGPSWAKKADGGASWGKMDDGGSSWGKKDDGGSSWGKKDDGGSSWGKKDDGGSSWGK 1133

Query: 455  NSSTVGGWDSQKVGVEESDKQPQWGQR-RRNSRGDFKENS-----RGWGSASGGDWKSNR 616
                   W  +  G     K+   G     N  G + E +     RG+G   GG  +   
Sbjct: 1134 KDDGGSSWGKKDDGGSSWGKKDDGGSSWDNNDDGGYTEQTYDRGGRGFGGRRGGGRRG-- 1191

Query: 617  PPRSADDSNRGVNLT 661
                 D S RG +L+
Sbjct: 1192 ---GWDQSGRGRSLS 1203



 Score = 62.0 bits (149), Expect = 2e-07
 Identities = 61/209 (29%), Positives = 81/209 (38%), Gaps = 13/209 (6%)
 Frame = +2

Query: 11   GGSSWGQKVDKDGGSWGEKVETKNSPQLSGKEQSGSWSSWAKPVEKDGGSSWGKKVDEPE 190
            G   WG+    DG SWG +           K  S   +SW K  + DGGSSWGKK D   
Sbjct: 929  GAPGWGKP--DDGPSWGNQ----------DKGGSTFVASWGK--KDDGGSSWGKKDD--- 971

Query: 191  NNPHQSQSGSWSSLGKKVEKDGGSWDGPKQSNSDS---SWGKATKGGGLGSTTAEGNRRL 361
               H+   G  SS GKK +  G SW      + D    SWGK   GG       +G++  
Sbjct: 972  --GHKDDGG--SSWGKK-DDGGSSWVKKDDGHKDDGVLSWGKKDDGGSSWGKKDDGHK-- 1024

Query: 362  DQSINDW-----NGSSSG---DGQLNESTRDDSTNIGGWNSSTVGGWDSQKVGVEESDKQ 517
            D   + W      GSS G   DG  + + +DD  +  G        W       ++ D  
Sbjct: 1025 DDRGSSWGIKVDGGSSWGKKDDGGSSWAKKDDGGSSWGKKDDGPSSWG------KKDDGG 1078

Query: 518  PQWGQRRRN--SRGDFKENSRGWGSASGG 598
            P W ++     S G   +    WG    G
Sbjct: 1079 PSWAKKADGGASWGKMDDGGSSWGKKDDG 1107


>gb|AAY89362.1| RNA polymerase IV largest subunit [Arabidopsis thaliana]
          Length = 1976

 Score = 87.0 bits (214), Expect = 5e-15
 Identities = 73/264 (27%), Positives = 107/264 (40%), Gaps = 39/264 (14%)
 Frame = +2

Query: 23   WGQK---VDKDGGSWGEKVETKNSPQLSGKEQSGSWSSWAKP-VEKDGG----SSWGKKV 178
            W +K    + +G +WG   +TK+         + +W+SW K  +E D       S GKK 
Sbjct: 1484 WNKKSSETESNGATWGSSDKTKSG--------AAAWNSWDKKNIETDSEPAAWGSQGKKN 1535

Query: 179  DEPENNPHQSQSGSWSSLGKKVEKDGGSWDGPKQSNSDSSWGKATKGG------------ 322
             E E+ P  +  G+W     + E     W    + NS++  G A  G             
Sbjct: 1536 SETESGP--AAWGAWDKKKSETEPGPAGWGMGDKKNSETELGPAAMGNWDKKKSDTKSGP 1593

Query: 323  -GLGSTTAEGNRRLDQSINDWNGSSSGDGQLNESTRDDSTNIGGWNSSTVGGWDSQKVGV 499
               GST A      D++ ++    ++  G  N+ T +  +  G W S       ++    
Sbjct: 1594 AAWGSTDAAAWGSSDKNNSETESDAAAWGSRNKKTSEIESGAGAWGSWGQPSPTAEDKDT 1653

Query: 500  EESDKQPQWGQRRRNSR-GDFKENSR------------GWGSASGGDWKSN-----RPPR 625
             E D+ P    +   SR  D KE S+            GW +  G DWK N     RPPR
Sbjct: 1654 NEDDRNPWVSLKETKSREKDDKERSQWGNPAKKFPSSGGWSNGGGADWKGNRNHTPRPPR 1713

Query: 626  SADDSNRGVNLTATRQKLDIFTAE 697
            S D  N     TATRQ+LD FT+E
Sbjct: 1714 SED--NLAPMFTATRQRLDSFTSE 1735


>ref|NP_181532.2| nuclear RNA polymerase D1B [Arabidopsis thaliana]
            gi|75320513|sp|Q5D869.1|NRPE1_ARATH RecName:
            Full=DNA-directed RNA polymerase V subunit 1; AltName:
            Full=DNA-directed RNA polymerase D subunit 1b;
            Short=AtNRPD1b; Short=Nuclear RNA polymerase D 1b;
            AltName: Full=DNA-directed RNA polymerase E subunit 1;
            Short=Nuclear RNA polymerase E 1; AltName: Full=Protein
            DEFECTIVE IN MERISTEM SILENCING 5; AltName: Full=Protein
            DEFECTIVE IN RNA-DIRECTED DNA METHYLATION 3; AltName:
            Full=Protein RNA-DIRECTED DNA METHYLATION DEFECTIVE 1;
            AltName: Full=RNA polymerase IV subunit 1; Short=POL IV 1
            gi|59939210|gb|AAX12373.1| DNA-directed RNA polymerase
            alpha subunit [Arabidopsis thaliana]
            gi|62822917|gb|AAY15198.1| DNA-dependent RNA polymerase
            large subunit [Arabidopsis thaliana]
            gi|330254673|gb|AEC09767.1| nuclear RNA polymerase D1B
            [Arabidopsis thaliana]
          Length = 1976

 Score = 87.0 bits (214), Expect = 5e-15
 Identities = 73/264 (27%), Positives = 107/264 (40%), Gaps = 39/264 (14%)
 Frame = +2

Query: 23   WGQK---VDKDGGSWGEKVETKNSPQLSGKEQSGSWSSWAKP-VEKDGG----SSWGKKV 178
            W +K    + +G +WG   +TK+         + +W+SW K  +E D       S GKK 
Sbjct: 1484 WNKKSSETESNGATWGSSDKTKSG--------AAAWNSWDKKNIETDSEPAAWGSQGKKN 1535

Query: 179  DEPENNPHQSQSGSWSSLGKKVEKDGGSWDGPKQSNSDSSWGKATKGG------------ 322
             E E+ P  +  G+W     + E     W    + NS++  G A  G             
Sbjct: 1536 SETESGP--AAWGAWDKKKSETEPGPAGWGMGDKKNSETELGPAAMGNWDKKKSDTKSGP 1593

Query: 323  -GLGSTTAEGNRRLDQSINDWNGSSSGDGQLNESTRDDSTNIGGWNSSTVGGWDSQKVGV 499
               GST A      D++ ++    ++  G  N+ T +  +  G W S       ++    
Sbjct: 1594 AAWGSTDAAAWGSSDKNNSETESDAAAWGSRNKKTSEIESGAGAWGSWGQPSPTAEDKDT 1653

Query: 500  EESDKQPQWGQRRRNSR-GDFKENSR------------GWGSASGGDWKSN-----RPPR 625
             E D+ P    +   SR  D KE S+            GW +  G DWK N     RPPR
Sbjct: 1654 NEDDRNPWVSLKETKSREKDDKERSQWGNPAKKFPSSGGWSNGGGADWKGNRNHTPRPPR 1713

Query: 626  SADDSNRGVNLTATRQKLDIFTAE 697
            S D  N     TATRQ+LD FT+E
Sbjct: 1714 SED--NLAPMFTATRQRLDSFTSE 1735


>gb|AAB95289.1| unknown protein [Arabidopsis thaliana]
          Length = 839

 Score = 87.0 bits (214), Expect = 5e-15
 Identities = 73/264 (27%), Positives = 107/264 (40%), Gaps = 39/264 (14%)
 Frame = +2

Query: 23   WGQK---VDKDGGSWGEKVETKNSPQLSGKEQSGSWSSWAKP-VEKDGG----SSWGKKV 178
            W +K    + +G +WG   +TK+         + +W+SW K  +E D       S GKK 
Sbjct: 347  WNKKSSETESNGATWGSSDKTKSG--------AAAWNSWDKKNIETDSEPAAWGSQGKKN 398

Query: 179  DEPENNPHQSQSGSWSSLGKKVEKDGGSWDGPKQSNSDSSWGKATKGG------------ 322
             E E+ P  +  G+W     + E     W    + NS++  G A  G             
Sbjct: 399  SETESGP--AAWGAWDKKKSETEPGPAGWGMGDKKNSETELGPAAMGNWDKKKSDTKSGP 456

Query: 323  -GLGSTTAEGNRRLDQSINDWNGSSSGDGQLNESTRDDSTNIGGWNSSTVGGWDSQKVGV 499
               GST A      D++ ++    ++  G  N+ T +  +  G W S       ++    
Sbjct: 457  AAWGSTDAAAWGSSDKNNSETESDAAAWGSRNKKTSEIESGAGAWGSWGQPSPTAEDKDT 516

Query: 500  EESDKQPQWGQRRRNSR-GDFKENSR------------GWGSASGGDWKSN-----RPPR 625
             E D+ P    +   SR  D KE S+            GW +  G DWK N     RPPR
Sbjct: 517  NEDDRNPWVSLKETKSREKDDKERSQWGNPAKKFPSSGGWSNGGGADWKGNRNHTPRPPR 576

Query: 626  SADDSNRGVNLTATRQKLDIFTAE 697
            S D  N     TATRQ+LD FT+E
Sbjct: 577  SED--NLAPMFTATRQRLDSFTSE 598


>ref|XP_002871085.1| hypothetical protein ARALYDRAFT_487210 [Arabidopsis lyrata subsp.
            lyrata] gi|297316922|gb|EFH47344.1| hypothetical protein
            ARALYDRAFT_487210 [Arabidopsis lyrata subsp. lyrata]
          Length = 1476

 Score = 83.6 bits (205), Expect = 6e-14
 Identities = 66/201 (32%), Positives = 81/201 (40%), Gaps = 5/201 (2%)
 Frame = +2

Query: 11   GGSSWGQKVDKDGG--SWGEKVETKNSPQLSGKEQSGSWSSWAKPVEKDGGSSWGKKVDE 184
            GGSSWGQ+ D DGG  SWG++ +T            G  S W K     GGSSWGK+ D 
Sbjct: 1102 GGSSWGQQ-DSDGGGSSWGKENDT------------GGGSGWGKQDSGGGGSSWGKQND- 1147

Query: 185  PENNPHQSQSGSWSSLGKKVEKDGGSWDGPKQSNSD-SSWGKATKGGGLGSTTAEGNRRL 361
                     SGS SS GK+    GGS  G + +  D SSWGK   GG  GS   + N   
Sbjct: 1148 --------ASGSGSSWGKQNNAGGGSSWGKQDTGGDGSSWGKQDGGGSSGSGWGKQNNAS 1199

Query: 362  DQSINDWNGSSSGDGQLNESTRDDSTNIGGWNSSTVGGWDSQKVGVEE--SDKQPQWGQR 535
              S   W   S   G  +   +D       W     GG      G +   S     WG++
Sbjct: 1200 GGS--SWGKQSDAGGGSSWDKQDGGGGGSSWGKQDGGGGSGSAWGKQNDTSGGSSSWGKQ 1257

Query: 536  RRNSRGDFKENSRGWGSASGG 598
              +  G        WG   GG
Sbjct: 1258 NDSGGGS------SWGKQDGG 1272



 Score = 79.0 bits (193), Expect = 1e-12
 Identities = 68/212 (32%), Positives = 88/212 (41%), Gaps = 4/212 (1%)
 Frame = +2

Query: 11   GGSSWGQKVDKDGGS---WGEKVETKNSPQLSGKEQ-SGSWSSWAKPVEKDGGSSWGKKV 178
            GGSSWG K D  GGS   WG++ +T       GK+  SG  SSW K     GGSSWGK  
Sbjct: 1224 GGSSWG-KQDGGGGSGSAWGKQNDTSGGSSSWGKQNDSGGGSSWGKQDGGGGGSSWGK-- 1280

Query: 179  DEPENNPHQSQSGSWSSLGKKVEKDGGSWDGPKQSNSDSSWGKATKGGGLGSTTAEGNRR 358
                  P     G  SS GK  + DGGS    +QS     +G +  GGG       G  +
Sbjct: 1281 ------PDNDGGGGGSSWGK--QGDGGSKPWNEQSGGGRGFGGSRGGGGFRGGFRGGRNQ 1332

Query: 359  LDQSINDWNGSSSGDGQLNESTRDDSTNIGGWNSSTVGGWDSQKVGVEESDKQPQWGQRR 538
                     G  S DG  + S + D+     W S   GG          SD +  WG+  
Sbjct: 1333 ------SARGGRSFDGDQSSSWKTDNQE-NTWKSDQSGG----------SDWKKGWGENS 1375

Query: 539  RNSRGDFKENSRGWGSASGGDWKSNRPPRSAD 634
             NS+     +S G G+ +   W +N    + D
Sbjct: 1376 NNSKP--SGSSSGGGAGNWPSWDTNSKRETND 1405



 Score = 75.1 bits (183), Expect = 2e-11
 Identities = 71/212 (33%), Positives = 89/212 (41%), Gaps = 16/212 (7%)
 Frame = +2

Query: 8    DGGSSWGQKVDK-----------DGGSWGEKVETKNS--PQLSGKEQSGSWSSWAKPVEK 148
            DGGSSWG K DK           DGGSWG K +  +S   +  G++  G  SSW K  + 
Sbjct: 946  DGGSSWG-KQDKQEGVASWGKKDDGGSWGNKDDGVSSWGKKDDGQKDDGG-SSWGK--KD 1001

Query: 149  DGGSSWGKKVDEPENNPHQSQSGSWSSLGKKVEKDGGSWDGPKQSNSDSSWGKATKGGGL 328
            DGGSSWGKK D   +   +   G   SL  K +  G SW   K+ +  SSWGK   GG  
Sbjct: 1002 DGGSSWGKKDDGGYSWGKKDDGG---SLWGKKDDGGSSWG--KKDDGGSSWGKKDDGGYS 1056

Query: 329  GSTTAEGNRRLDQSINDWNGSSSGDGQLNE-STRDDSTNIGGWNSSTVGGWDSQKVGVEE 505
              T   G R          G   G  Q    S+  +S ++  WN  + G           
Sbjct: 1057 EQTFDMGGRGFGG--RRGGGRRGGRDQFGRGSSFSNSEDLAPWNKPSGGS---------- 1104

Query: 506  SDKQPQWGQRRRNSRGDF--KENSRGWGSASG 595
                  WGQ+  +  G    KEN  G GS  G
Sbjct: 1105 -----SWGQQDSDGGGSSWGKENDTGGGSGWG 1131



 Score = 74.3 bits (181), Expect = 3e-11
 Identities = 67/223 (30%), Positives = 95/223 (42%), Gaps = 24/223 (10%)
 Frame = +2

Query: 14   GSSWGQKVDKDGGS-WGEKVETKNSPQLSGKEQSGSWSSWAKPVEKDG-GSSWGKKVDEP 187
            GS WG++ +  GGS WG++ +         ++  G  SSW K     G GS+WGK+ D  
Sbjct: 1189 GSGWGKQNNASGGSSWGKQSDAGGGSSWDKQDGGGGGSSWGKQDGGGGSGSAWGKQND-- 1246

Query: 188  ENNPHQSQSGSWSSLGKKVEKDGG-SWDGPKQSNSDSSWGKATKGGGLGSTTAEGNRRLD 364
                    SG  SS GK+ +  GG SW         SSWGK    GG G ++    ++ D
Sbjct: 1247 -------TSGGSSSWGKQNDSGGGSSWGKQDGGGGGSSWGKPDNDGGGGGSS--WGKQGD 1297

Query: 365  QSINDWN---------GSSSGDGQLNESTRDDSTNIGGWNSSTVGG--WDSQKVGVEESD 511
                 WN         G S G G      R      GG N S  GG  +D  +    ++D
Sbjct: 1298 GGSKPWNEQSGGGRGFGGSRGGGGFRGGFR------GGRNQSARGGRSFDGDQSSSWKTD 1351

Query: 512  KQPQWGQRRRNSRGDFKE-------NSRGWGSASG---GDWKS 610
             Q    +  ++   D+K+       NS+  GS+SG   G+W S
Sbjct: 1352 NQENTWKSDQSGGSDWKKGWGENSNNSKPSGSSSGGGAGNWPS 1394



 Score = 73.2 bits (178), Expect = 8e-11
 Identities = 68/220 (30%), Positives = 93/220 (42%), Gaps = 23/220 (10%)
 Frame = +2

Query: 8    DGGSSWGQKVDKDGGSWGEKVETKNSPQLSGKEQSGSWSSWAKPVEKDGGSSWGKKVDEP 187
            DGGSSWG+K D  G SWG+K    +   L GK+  G  SSW K  + DGGSSWGKK    
Sbjct: 1002 DGGSSWGKK-DDGGYSWGKK---DDGGSLWGKKDDGG-SSWGK--KDDGGSSWGKK---- 1050

Query: 188  ENNPHQSQSGSWSSLGKKVEKDGGSWDGPKQ-------SNSD--SSWGKATKGGGLGSTT 340
            ++  +  Q+      G    + GG   G  Q       SNS+  + W K + G   G   
Sbjct: 1051 DDGGYSEQTFDMGGRGFGGRRGGGRRGGRDQFGRGSSFSNSEDLAPWNKPSGGSSWGQQD 1110

Query: 341  AEGNRRLDQSIND------WNGSSSGDGQLNESTRDDSTNIG-GWNSSTVGG----WDSQ 487
            ++G        ND      W    SG G  +   ++D++  G  W      G    W  Q
Sbjct: 1111 SDGGGSSWGKENDTGGGSGWGKQDSGGGGSSWGKQNDASGSGSSWGKQNNAGGGSSWGKQ 1170

Query: 488  KVGVEESDKQPQWGQRRRNSRGDFKENSRGWG---SASGG 598
              G + S     WG++          +  GWG   +ASGG
Sbjct: 1171 DTGGDGS----SWGKQDGGG-----SSGSGWGKQNNASGG 1201


>ref|NP_196049.1| kow domain-containing transcription factor 1 [Arabidopsis thaliana]
            gi|332003341|gb|AED90724.1| kow domain-containing
            transcription factor 1 [Arabidopsis thaliana]
          Length = 1493

 Score = 83.2 bits (204), Expect = 7e-14
 Identities = 75/237 (31%), Positives = 98/237 (41%), Gaps = 29/237 (12%)
 Frame = +2

Query: 11   GGSSWGQKVDKDGG--SWGEKVETKNSPQLSGKEQSG---SW----------SSWAKPVE 145
            GGSSWG K D DGG  SWG++ +        GK+ +G   SW          SSW K  +
Sbjct: 1128 GGSSWG-KQDGDGGGSSWGKENDAGGGSSW-GKQDNGVGSSWGKQNDGSGGGSSWGKQND 1185

Query: 146  KDGGSSWGKKVDEPENNPHQSQSG---SWSSLGKKVEKDGG-SWDGPKQSNSDSSWGKAT 313
              GGSSWGK+    + +    Q G   S S+ GK+    GG SW     +   SSWGK  
Sbjct: 1186 AGGGSSWGKQDSGGDGSSWGKQDGGGDSGSAWGKQNNTSGGSSWGKQSDAGGGSSWGKQD 1245

Query: 314  KG----------GGLGSTTAEGNRRLDQSINDWNGSSSGDGQLNESTRDDSTNIGGWNSS 463
             G          GG GS +A G +    + + W   +   G  +   +D       W   
Sbjct: 1246 GGGGGSSWGKQDGGGGSGSAWGKQNETSNGSSWGKQNDSGGGSSWGKQDGGGGGSSWGKQ 1305

Query: 464  TVGGWDSQKVGVEESDKQPQWGQRRRNSRGDFKENSRGWGSASGGDWKSNRPPRSAD 634
              GG  S      +   +P W +     RG F E  RG G   GG  +S R  RS D
Sbjct: 1306 NDGGGGSSWGKQGDGGSKP-WNEHSGGGRG-FGER-RGGGGFRGGRNQSGRGGRSFD 1359



 Score = 80.1 bits (196), Expect = 6e-13
 Identities = 66/233 (28%), Positives = 84/233 (36%), Gaps = 42/233 (18%)
 Frame = +2

Query: 11   GGSSWGQKVDKDGGS---WGEKVETKNSPQLSGKEQSGSWSSWAKPVEKDGGSSWGKKVD 181
            GGSSWG K D  GGS   WG++ ET N      +  SG  SSW K     GGSSWGK   
Sbjct: 1249 GGSSWG-KQDGGGGSGSAWGKQNETSNGSSWGKQNDSGGGSSWGKQDGGGGGSSWGK--- 1304

Query: 182  EPENNPHQSQSGSWSSLGKKVEKDGGSWDGPKQSNSDSSWGKATKGGGL---------GS 334
                   Q+  G  SS GK  + DGGS    + S     +G+   GGG          G 
Sbjct: 1305 -------QNDGGGGSSWGK--QGDGGSKPWNEHSGGGRGFGERRGGGGFRGGRNQSGRGG 1355

Query: 335  TTAEGNR----RLDQSINDWNGSSSGDGQLNESTRDDSTN-----------IGGWNS--- 460
             + +G R    + D   N W    SG     +   +DS N            G W S   
Sbjct: 1356 RSFDGGRSSSWKTDNQENTWKSDQSGGSDWKKGWGEDSNNSKPSGSSAGGCAGNWPSWDT 1415

Query: 461  ------------STVGGWDSQKVGVEESDKQPQWGQRRRNSRGDFKENSRGWG 583
                         +   W +    V   +    W ++  N  G   E    WG
Sbjct: 1416 NSKKETNDKPGDDSKSAWGTSNDQVNTDNNNDSWNKKPNNDVGTSGEADNAWG 1468



 Score = 79.3 bits (194), Expect = 1e-12
 Identities = 67/227 (29%), Positives = 98/227 (43%), Gaps = 18/227 (7%)
 Frame = +2

Query: 8    DGGSSWGQKVDKDGG-SWGEKVETKNSPQLSGKEQSGSWSSWAKPVEKDG-GSSWGKKVD 181
            D GS+WG++ +  GG SWG++ +         ++  G  SSW K     G GS+WGK   
Sbjct: 1212 DSGSAWGKQNNTSGGSSWGKQSDAGGGSSWGKQDGGGGGSSWGKQDGGGGSGSAWGK--- 1268

Query: 182  EPENNPHQSQSGSWSSLGKKVEKDGG-SWDGPKQSNSDSSWGKATKGGGLGSTTAEGNRR 358
                   Q+++ + SS GK+ +  GG SW         SSWGK   GGG  S   +G   
Sbjct: 1269 -------QNETSNGSSWGKQNDSGGGSSWGKQDGGGGGSSWGKQNDGGGGSSWGKQG--- 1318

Query: 359  LDQSINDWNGSSSGDGQLNESTRDDSTNIGGWNSSTVGG--WDSQKVGVEESDKQPQWGQ 532
             D     WN  S G     E  R      GG N S  GG  +D  +    ++D Q    +
Sbjct: 1319 -DGGSKPWNEHSGGGRGFGE-RRGGGGFRGGRNQSGRGGRSFDGGRSSSWKTDNQENTWK 1376

Query: 533  RRRNSRGDFKE-------NSRGWGSASGG------DWKSNRPPRSAD 634
              ++   D+K+       NS+  GS++GG       W +N    + D
Sbjct: 1377 SDQSGGSDWKKGWGEDSNNSKPSGSSAGGCAGNWPSWDTNSKKETND 1423



 Score = 77.0 bits (188), Expect = 5e-12
 Identities = 68/208 (32%), Positives = 87/208 (41%), Gaps = 11/208 (5%)
 Frame = +2

Query: 8    DGGSSWGQKVDKDGGSWGEKVETKNSPQLSGKEQSGSWSSWAKP---VEKDGGSSWGKKV 178
            DG +SWG+K   DGGSWG+K +         K+  G  SSW K     + DGGSSW KK 
Sbjct: 949  DGAASWGKK--DDGGSWGKKDD-------GNKDDGG--SSWGKKDDGQKDDGGSSWEKKF 997

Query: 179  DEPENNPHQSQSGSWSSLGKKVEKDGGSWDGPKQSNSDSSWGKATKGGGLGSTTAEGNRR 358
            D   +   +   G  SS GKK   DGGS  G K+ +  SSWGK   GG L     +G   
Sbjct: 998  DGGSSWGKKDDGG--SSWGKK--DDGGSLWG-KKDDGGSSWGKEDDGGSLWGKKDDGE-- 1050

Query: 359  LDQSINDWNGSSSGDGQLNESTRDDSTNIGGWNSSTVGGWDSQKVGVEESDKQPQWGQRR 538
                 + W      DG+ +   +DD  +   W     GG+  Q           + G  R
Sbjct: 1051 -----SSW--GKKDDGESSWGKKDDGGS--SWGKKDEGGYSEQTFDRGGRGFGGRRGGGR 1101

Query: 539  RNSRGDF--------KENSRGWGSASGG 598
            R  R  F         E+   W   SGG
Sbjct: 1102 RGGRDQFGRGSSFGNSEDPAPWSKPSGG 1129



 Score = 76.6 bits (187), Expect = 7e-12
 Identities = 71/248 (28%), Positives = 92/248 (37%), Gaps = 51/248 (20%)
 Frame = +2

Query: 8    DGGSSWGQKVDKDGGSWGEKVETKNSPQL-----------------SGKEQSG------- 115
            DG SSWG+K D  G SWG+K E   S Q                   G++Q G       
Sbjct: 1058 DGESSWGKK-DDGGSSWGKKDEGGYSEQTFDRGGRGFGGRRGGGRRGGRDQFGRGSSFGN 1116

Query: 116  --------------SW---------SSWAKPVEKDGGSSWGKKVDEPENN--PHQSQSGS 220
                          SW         SSW K  +  GGSSWGK+ +   ++       SG 
Sbjct: 1117 SEDPAPWSKPSGGSSWGKQDGDGGGSSWGKENDAGGGSSWGKQDNGVGSSWGKQNDGSGG 1176

Query: 221  WSSLGKKVEKDGGSWDGPKQSNSD-SSWGKATKGGGLGSTTAEGNRRLDQSINDWNGSSS 397
             SS GK+ +  GGS  G + S  D SSWGK   GG  GS  A G +      + W   S 
Sbjct: 1177 GSSWGKQNDAGGGSSWGKQDSGGDGSSWGKQDGGGDSGS--AWGKQNNTSGGSSWGKQSD 1234

Query: 398  GDGQLNESTRDDSTNIGGWNSSTVGGWDSQKVGVE-ESDKQPQWGQRRRNSRGDFKENSR 574
              G  +   +D       W     GG      G + E+     WG++  +  G       
Sbjct: 1235 AGGGSSWGKQDGGGGGSSWGKQDGGGGSGSAWGKQNETSNGSSWGKQNDSGGGS------ 1288

Query: 575  GWGSASGG 598
             WG   GG
Sbjct: 1289 SWGKQDGG 1296



 Score = 73.6 bits (179), Expect = 6e-11
 Identities = 81/245 (33%), Positives = 99/245 (40%), Gaps = 44/245 (17%)
 Frame = +2

Query: 8    DGGSSWGQKVD--KD--GGSWGEKVETKNSPQLSGKEQSGSWSSWAKPVEKDGGSSWGKK 175
            DGGSSWG+K D  KD  G SW +K +  +S    GK+  G  SSW K  + DGGS WGKK
Sbjct: 973  DGGSSWGKKDDGQKDDGGSSWEKKFDGGSS---WGKKDDGG-SSWGK--KDDGGSLWGKK 1026

Query: 176  VDEPENNPHQSQSGS-W-------SSLGKKVEKDGGSWDGPKQSNSDSSWGKATKG---- 319
             D   +   +   GS W       SS GK   KD G     K+ +  SSWGK  +G    
Sbjct: 1027 DDGGSSWGKEDDGGSLWGKKDDGESSWGK---KDDGESSWGKKDDGGSSWGKKDEGGYSE 1083

Query: 320  -----GGLGSTTAEGNRRLDQSINDWNGSSSGD-----------GQLNESTRDDSTNIGG 451
                 GG G     G  R         GSS G+           G  +   +D       
Sbjct: 1084 QTFDRGGRGFGGRRGGGRRGGRDQFGRGSSFGNSEDPAPWSKPSGGSSWGKQDGDGGGSS 1143

Query: 452  WNSSTVGG----WDSQKVGVEESDKQPQWGQRRRNSRGDF---KENSRGWGSA-----SG 595
            W      G    W  Q  GV  S     WG++   S G     K+N  G GS+     SG
Sbjct: 1144 WGKENDAGGGSSWGKQDNGVGSS-----WGKQNDGSGGGSSWGKQNDAGGGSSWGKQDSG 1198

Query: 596  GDWKS 610
            GD  S
Sbjct: 1199 GDGSS 1203



 Score = 62.8 bits (151), Expect = 1e-07
 Identities = 59/211 (27%), Positives = 79/211 (37%), Gaps = 15/211 (7%)
 Frame = +2

Query: 11   GGSSWGQKVDKDGGSWGEKVETKNSPQLSGKEQSGSWSSWAKPVEKDGGSSWGKKVDEPE 190
            GGSS G K D+D   WG+  E   S Q   KE+S    SW K    DG SSWG K     
Sbjct: 822  GGSSGGNKQDEDS-VWGKLCEASESSQK--KEES----SWGKKGGSDGESSWGNKDGNSS 874

Query: 191  NNPHQSQSGSWSSLGKKVEKDGGSWDGPKQSNSDSSWGKATKGGGLGSTTAEGNRRLDQS 370
             +     S      G    K G +W        D   GK   G    + +AE +    + 
Sbjct: 875  ASKKDGVSWGQQDKGSDESKGGSAW---SNQCGDFGSGKKKDGSSGWNKSAEDSNANSKG 931

Query: 371  INDW----NGSS---SGDGQLNESTRDDSTNIGGWNSSTVGGWD------SQKVGVEESD 511
            + DW    +GSS    GDG  +   +DD    G W     G  D       +K   ++ D
Sbjct: 932  VPDWGQPNDGSSWGKKGDGAASWGKKDDG---GSWGKKDDGNKDDGGSSWGKKDDGQKDD 988

Query: 512  KQPQWGQR--RRNSRGDFKENSRGWGSASGG 598
                W ++    +S G   +    WG    G
Sbjct: 989  GGSSWEKKFDGGSSWGKKDDGGSSWGKKDDG 1019


>gb|EMJ20080.1| hypothetical protein PRUPE_ppa000088mg [Prunus persica]
          Length = 1855

 Score = 82.8 bits (203), Expect = 1e-13
 Identities = 77/257 (29%), Positives = 114/257 (44%), Gaps = 30/257 (11%)
 Frame = +2

Query: 17   SSWGQKVDKDGGSWGEKVETKNSPQLSGKEQS--GSWSSWA-KPVEKDGGSSWGKKVDEP 187
            S+WG     +    G +V   +S  LS K+ S   + S+WA     +D  S+WGK   + 
Sbjct: 1466 STWGTTRANENDWCGREVGQDDSASLSVKKSSVLDTSSAWATNTAREDAASAWGKHPAKE 1525

Query: 188  ENNPH----QSQSGSWSSLGKKVEKDGGSWDGPKQS--NSDSSWGKATKGGGLGSTTAEG 349
                      +    W   G   + D  S  G K S  N+ S W  AT      +T+A G
Sbjct: 1526 NTTSTWGTTTASENDWCGRGVGHD-DSASLSGKKSSVLNTSSVW--ATNTAREDATSAWG 1582

Query: 350  NRRLDQ-----------SINDWNGSSSGDGQLNE----STRDDSTNIGGWNSSTVGGWDS 484
                 +           S NDW G  +G  +  +      +DDS ++ GW+S T  G   
Sbjct: 1583 KNPAKENTTSTWGTTTASENDWCGREAGKVEPVDLQPTKPQDDSASLSGWDSPTGDG--- 1639

Query: 485  QKVGVEESDKQPQWGQRRRN-SRGDFKENSRGWGSASGGDWKS-NRPPRSA----DDSNR 646
                    ++  QWGQ R + ++ +  E +R W S S G+WK+ NRPP+S     D+S  
Sbjct: 1640 -----NSGERNHQWGQHRGDQTKKNRFEGARNWVS-SPGEWKNKNRPPKSPGMVNDNSTM 1693

Query: 647  GVNLTATRQKLDIFTAE 697
            G   T TRQ+LD+FT+E
Sbjct: 1694 GALYTVTRQRLDMFTSE 1710


>ref|XP_002879839.1| NRPD1b [Arabidopsis lyrata subsp. lyrata] gi|297325678|gb|EFH56098.1|
            NRPD1b [Arabidopsis lyrata subsp. lyrata]
          Length = 1947

 Score = 81.3 bits (199), Expect = 3e-13
 Identities = 69/256 (26%), Positives = 104/256 (40%), Gaps = 34/256 (13%)
 Frame = +2

Query: 32   KVDKDGGSWGEKVETKNSPQLSGKEQSGSWSSWAKP-VEKDGG-SSWGKKV-DEPENNPH 202
            K + DG +WG   +TK+         + +WSSW K  +E D   ++WG +  ++PE    
Sbjct: 1490 KTESDGATWGSSDKTKSG--------AAAWSSWDKKNMETDSEPAAWGSQSKNKPETESG 1541

Query: 203  QSQSGSWSSLGKKVEKDGGSWDGPKQSNSDSSWGKATKG-------------GGLGSTTA 343
             S  G+W +   + E     W    + NS++  G A  G                GST A
Sbjct: 1542 PSTWGAWDTKKSETESGPAGWGIVDKKNSETESGPAAMGNWDKKKSNTESGPAAWGSTDA 1601

Query: 344  EGNRRLDQSINDWNGSSSGDGQLNESTRDDSTNIGGWNSSTVGGWDSQKVGVEESDKQPQ 523
                  D++ ++    ++  G  ++ T +  +    W S       +      E D+ P 
Sbjct: 1602 AVWGFSDKNNSETESDAAAWGSRDKKTSETESGAAAWGSWGQPTPTAANEDANEDDENPW 1661

Query: 524  WGQRRRNSRG-DFKE------------NSRGWGSASGGDWKSN-----RPPRSADDSNRG 649
               +   SR  D KE            +S GW +  G DWK       RPPRS D  N  
Sbjct: 1662 VSLKETKSRDKDDKERIQWGNPAKKFPSSGGWSNGGGADWKGKRNHTPRPPRSED--NLA 1719

Query: 650  VNLTATRQKLDIFTAE 697
               TATRQ+LD FT+E
Sbjct: 1720 PMFTATRQRLDSFTSE 1735


>ref|XP_006420718.1| hypothetical protein CICLE_v10004129mg [Citrus clementina]
            gi|557522591|gb|ESR33958.1| hypothetical protein
            CICLE_v10004129mg [Citrus clementina]
          Length = 1867

 Score = 80.5 bits (197), Expect = 5e-13
 Identities = 84/264 (31%), Positives = 111/264 (42%), Gaps = 37/264 (14%)
 Frame = +2

Query: 17   SSWGQKV--DKDGGSWGEKVETKNSPQLSG-------KEQSGSWSSWAKPVEKDGGSSWG 169
            S+WG +   DK      EKV       LSG         +S  WS W      +  +SWG
Sbjct: 1481 SAWGTEASWDKSSEVTLEKVAAPAENPLSGWGTEAQDSGKSSDWSEWKD--HANATASWG 1538

Query: 170  KKVDEPENNPHQSQSGSWSSLGKKVEKDGGSWDGPKQSNSDSSWGK----ATKGGGLGST 337
            +  +  E N       SW++       D GS       NS S WG     +TKG    S 
Sbjct: 1539 R--NGSEENSGWDTKASWNTKALDKLDDVGS----AVENSSSVWGAREDFSTKGWEDSSK 1592

Query: 338  TAEGNRRLDQSINDWN--------GSSSGDGQLNESTR--DDS-------TNIGGWNSST 466
             +   + +   I  WN         SS G  +L E+ +  DDS       T       ++
Sbjct: 1593 PSANEKSIVHQIGGWNVPDAKGTDDSSWGKQKLTENAKGTDDSSWGKQKHTENESSQPAS 1652

Query: 467  VGGWD-SQKVGVEESDKQPQWGQRRRNSRGDFKENSRGWGSASGGDWKS--NRPPRSA-- 631
               WD     G  E++ Q  WGQ R+     FK+N RGW S+SG +WK   NRPPRS   
Sbjct: 1653 SNAWDLPDATGGSETEMQV-WGQSRKEP---FKKN-RGWASSSG-EWKGKKNRPPRSPGV 1706

Query: 632  --DDSNRGVNLTATRQKLDIFTAE 697
              DDS      T TRQ+LD+FT+E
Sbjct: 1707 VNDDSTVNAMYTVTRQRLDMFTSE 1730


>ref|XP_006436520.1| hypothetical protein CICLE_v10030480mg [Citrus clementina]
            gi|557538716|gb|ESR49760.1| hypothetical protein
            CICLE_v10030480mg [Citrus clementina]
          Length = 1807

 Score = 79.3 bits (194), Expect = 1e-12
 Identities = 75/239 (31%), Positives = 104/239 (43%), Gaps = 21/239 (8%)
 Frame = +2

Query: 2    ERDGGSSWGQKVDKDGGS-WGEK----VETKNSPQLSGKEQSGSWSSWAKPVEKDGGSSW 166
            ++DGGSSWG++   DGGS WG++    + +K   Q   K    SW +      +DGGSSW
Sbjct: 1157 KQDGGSSWGKQ---DGGSSWGKQDGGSLWSKEPDQQHRKNGGSSWGN------RDGGSSW 1207

Query: 167  GKKVDEPENNPHQSQSGSWSSLGKKVEKDGGSWDGPKQSN----SDSSWGKATKGGGLGS 334
             K+ D+ +N     +S      G +  + GG   G + S+         G     GG+G 
Sbjct: 1208 SKQADQQDNQEKPLESDGGRGSGGRWGQGGGRGGGQEVSDQYGRGSFDQGSEKGTGGMGD 1267

Query: 335  TTAEGNRRLDQSINDWN-------GSSSGDGQLNESTRDDSTNIGGWNSSTVGGWDSQKV 493
                 NRR D+ I DWN       GSS GDG         +   GGW   +   W+S   
Sbjct: 1268 QGNGCNRR-DKGI-DWNKKFNWNSGSSDGDG---------NNGSGGWGKKS--NWNSGSS 1314

Query: 494  GVEESDKQPQWGQRRR----NSRGDFKENSRGWGSASGGDWKSNRPPRSAD-DSNRGVN 655
            G  ES K   W ++      +S GD   NS GW   S  +  S+    S D D N+  N
Sbjct: 1315 GAGES-KDTDWNKKSNLNCGSSDGD-GNNSSGWDKKSNWNAGSSGDGESKDTDWNKKCN 1371



 Score = 78.2 bits (191), Expect = 2e-12
 Identities = 83/266 (31%), Positives = 110/266 (41%), Gaps = 47/266 (17%)
 Frame = +2

Query: 2    ERDGGSSWGQKVDKDGGSWGEKVETKNSPQLSGKEQSG-SW------SSWAKP------- 139
            +RDGGSSWG+   +DG SWG+    ++S    GK+  G SW      SSWAK        
Sbjct: 1077 KRDGGSSWGK---QDGSSWGK----QDSGSSLGKQDGGSSWSKQDGGSSWAKQDGGSSWA 1129

Query: 140  --------VEKDGGSSWGKKVDEPENNPHQSQSGSW------SSLGKKVEKDGGS-W--- 265
                     ++DGGSSWGK+ D   +   Q    SW      SS GK   +DGGS W   
Sbjct: 1130 KQDGGSSWAKQDGGSSWGKQ-DGGSSWGKQDGGSSWGKQDGGSSWGK---QDGGSLWSKE 1185

Query: 266  -DGPKQSNSDSSWGKATKGGGLGSTTAEGNRRLDQSINDWNGSSS------GDGQLNEST 424
             D   + N  SSWG    GG   S  A+     ++ +    G  S      G G+     
Sbjct: 1186 PDQQHRKNGGSSWGN-RDGGSSWSKQADQQDNQEKPLESDGGRGSGGRWGQGGGRGGGQE 1244

Query: 425  RDDSTNIGGWNSST---VGGWDSQKVGVEESDKQPQWGQR----RRNSRGDFKENSRGWG 583
              D    G ++  +    GG   Q  G    DK   W ++      +S GD    S GWG
Sbjct: 1245 VSDQYGRGSFDQGSEKGTGGMGDQGNGCNRRDKGIDWNKKFNWNSGSSDGDGNNGSGGWG 1304

Query: 584  SASGGDWKSNRPPRSAD-DSNRGVNL 658
              S  +  S+    S D D N+  NL
Sbjct: 1305 KKSNWNSGSSGAGESKDTDWNKKSNL 1330



 Score = 71.6 bits (174), Expect = 2e-10
 Identities = 67/229 (29%), Positives = 90/229 (39%), Gaps = 24/229 (10%)
 Frame = +2

Query: 5    RDGGSSWGQKVDKDGGS-WGEKVETKNSPQLSGKEQSGS-W---------------SSWA 133
            +DGGSSW ++   DGGS WG+K    N   L GK+  GS W               SSW 
Sbjct: 997  QDGGSSWAKQ---DGGSSWGKK----NGGSLMGKQDGGSSWGKQDGGSSLGKQDGGSSWG 1049

Query: 134  KP------VEKDGGSSWGKKVDEPENNPHQSQSGSWSSLGKKVEKDGGSWDGPKQSNSDS 295
            K        ++DGGSSWGK          Q +  SWS      ++DGGS  G +     S
Sbjct: 1050 KQDGGSSLAKQDGGSSWGK----------QDEGSSWS------KRDGGSSWGKQDG---S 1090

Query: 296  SWGKATKGGGLGSTTAEGNRRLDQSINDWNGSSSGDGQLNESTRDDSTNIGGWNSSTVG- 472
            SWGK   G  LG      +       + W   +  DG  + + +D  ++   W     G 
Sbjct: 1091 SWGKQDSGSSLGKQDGGSSWSKQDGGSSW---AKQDGGSSWAKQDGGSS---WAKQDGGS 1144

Query: 473  GWDSQKVGVEESDKQPQWGQRRRNSRGDFKENSRGWGSASGGDWKSNRP 619
             W  Q       D    WG++   S    ++    WG   GG   S  P
Sbjct: 1145 SWGKQ-------DGGSSWGKQDGGSSWGKQDGGSSWGKQDGGSLWSKEP 1186



 Score = 65.5 bits (158), Expect = 2e-08
 Identities = 65/240 (27%), Positives = 94/240 (39%), Gaps = 46/240 (19%)
 Frame = +2

Query: 17   SSWGQKVDK-DGGSWG-EKVETKNSPQLSG------KEQSGSWSSWAKPVEKDGGSS--- 163
            S+WG KV+     SWG    E KN    +       +  +G++  W K   +D GSS   
Sbjct: 801  SAWGSKVNAIQNSSWGLAAAEGKNEDCWNKAAVKNIESNNGAYGGWGK---EDAGSSLQD 857

Query: 164  ----WGKKVDEPENNPHQSQSGSWSSLGKKVEKDGGSW-DGPKQSNSDSSWGKATKGGGL 328
                WGK  D  +N  +  +S SW    K +     SW D   + N   SWGK  K G  
Sbjct: 858  SQDNWGKNKDACDNQANWKKSDSWDKGKKIIGNSTSSWGDKTAEKNEPDSWGKG-KDGSS 916

Query: 329  GSTTAEGNRRL--DQSINDWNGSSSG-----DGQLNEST---RDDSTN---IGGWNSSTV 469
            GS +   +  L  +     W  +S G      G ++E +   +DDS N     GWN    
Sbjct: 917  GSKSDWNSSALATENPTVSWGNASGGWTQQKGGNMDERSGWKKDDSGNQDQRSGWNKPKT 976

Query: 470  GGWD-----SQKVGVEESDKQ------------PQWGQRRRNSRGDFKENSRGWGSASGG 598
             G D     +++ G+  SD Q              WG++   S    ++    WG   GG
Sbjct: 977  FGADVGSSWNKQDGICSSDVQDGGSSWAKQDGGSSWGKKNGGSLMGKQDGGSSWGKQDGG 1036



 Score = 65.5 bits (158), Expect = 2e-08
 Identities = 72/285 (25%), Positives = 100/285 (35%), Gaps = 70/285 (24%)
 Frame = +2

Query: 5    RDGGSSWGQKVDKDGGSWGEKVETKNSPQLS-GKEQSGSWSSWAKPVEKDGGSSWGKKVD 181
            RD G  W      D  S+      KN  + S   + +GSWS         GG +W     
Sbjct: 1541 RDQGGGWNNNDSGDYKSFDSSQGVKNGGEWSRSNDGAGSWSQ--------GGGTW----- 1587

Query: 182  EPENNPHQSQSGSWSSLG------------KKVEKDGGSWD-GPKQSNSDSSWGKATKGG 322
            +  N+   SQ G WSS G            K +   GG W+ G   S     WG  ++G 
Sbjct: 1588 KSGNSGASSQDGGWSSQGSGWNNSNTTNEVKGLSDQGGGWNKGAGGSAQAGGWG--SQGS 1645

Query: 323  GLGSTTAEGNRRLDQSI-----------------------------NDWNGSSSGDGQLN 415
            G  S T+ GNR  + S                              N  +G SSG G  N
Sbjct: 1646 GWSSGTSTGNRGSNDSSIANDVEGPNDQVVGRNKGSNGSAQSGGWGNQGSGWSSGTGSGN 1705

Query: 416  ESTRDDSTNI-------GGWN-----SSTVGGWDSQKVG----------VEESDKQPQWG 529
            + + D + +        GGWN     S+  G W +Q  G             SD+   W 
Sbjct: 1706 KGSNDSNISNKGPNDQGGGWNKGSGGSAQSGAWGNQGSGWNGGTDSGNRGSNSDQPKSWN 1765

Query: 530  QRR-----RNSRGDFKENSRGWGSASGGDWKSNRPPRSADDSNRG 649
            Q         S+   + +SRGWG  +G  W+     +  D S +G
Sbjct: 1766 QSSVATDGGRSKDAGEGSSRGWGKTAGSSWE-----KGNDGSGKG 1805



 Score = 56.6 bits (135), Expect = 7e-06
 Identities = 57/238 (23%), Positives = 95/238 (39%), Gaps = 39/238 (16%)
 Frame = +2

Query: 8    DGGSSWGQKVDKDGGSWGEKVETKNSPQLSGKEQSGSWSSWAKPVEKDGGSSWGKK---- 175
            +  S W +K + + GS G+  E+K++       +  +W+S +   + + GS WGKK    
Sbjct: 1340 NNSSGWDKKSNWNAGSSGDG-ESKDTDW----NKKCNWNSGSNDGDGNNGSGWGKKSNWN 1394

Query: 176  ----VDEPENNPHQSQSGSWSSLGKKVEKDG------GSWD-GPKQSNSDSSWGKAT--- 313
                V    N+ + ++ G+W+S      ++       G+W+ G +  + +SSWGK +   
Sbjct: 1395 SGSNVAGESNDSNWAKKGNWNSGSDDANQESSWGKKQGNWNSGSRDGHQESSWGKKSDWN 1454

Query: 314  ----------KGGGLGSTTAEGNRRLDQSINDWNGSSSGDGQLNEST-----RDDSTNIG 448
                         G G+    G  R  +  +D  G   G G+ N        R D    G
Sbjct: 1455 SRSEDQPEPFNNRGSGNFRGRGGFR-GRGDSD-RGGFGGRGRTNRGGYGGRGRFDREGFG 1512

Query: 449  GWNSSTVGGW------DSQKVGVEESDKQPQWGQRRRNSRGDFKENSRGWGSASGGDW 604
            G   S  GG+      D    G     ++ Q G    N  GD+K      G  +GG+W
Sbjct: 1513 GRGGSDRGGFGGRGSSDRGGFGGRGRGRRDQGGGWNNNDSGDYKSFDSSQGVKNGGEW 1570


>ref|XP_003627850.1| Protein DCL [Medicago truncatula] gi|355521872|gb|AET02326.1|
           Protein DCL [Medicago truncatula]
          Length = 481

 Score = 79.3 bits (194), Expect = 1e-12
 Identities = 73/233 (31%), Positives = 104/233 (44%), Gaps = 3/233 (1%)
 Frame = +2

Query: 8   DGGSSWGQKVDKDGGSWGEKVETKNSPQLSGKEQSGS--WSSWAKPVEKDGGSSWGKKVD 181
           D  SSWGQK D+        V  ++S + +  EQ       SW   V     SSWGK   
Sbjct: 93  DNRSSWGQKKDEI------HVMPEDSSRSNAWEQKPENVKDSWVAKVPV-ANSSWGK-AK 144

Query: 182 EPENNPHQSQSGSWSSLGKKVEKDGGSWDGPKQSNSDSSWGKATKGGGLGSTTAEGNRRL 361
            PEN P  S++   +S GK   ++   WD   ++ SDSSWGK        S  ++     
Sbjct: 145 SPENRPWDSKNEPNNSFGKPNSQENEPWDS--KNESDSSWGKPK------SQESQPWDSK 196

Query: 362 DQSINDWNGSSSGDGQLNESTRDDSTNIGGWNSSTVGGWDSQKVGVE-ESDKQPQWGQRR 538
           ++S + W    S +    +S  + +   G        GWDSQ      ESDK  QWG++ 
Sbjct: 197 NESNSSWGKPKSQENHPWDSKNESNQTAGS------RGWDSQVASANSESDKSFQWGKQG 250

Query: 539 RNSRGDFKENSRGWGSASGGDWKSNRPPRSADDSNRGVNLTATRQKLDIFTAE 697
           R+S   FK+N R  GS SGG       P + D  NR   +    Q+ +++T E
Sbjct: 251 RDS---FKKN-RFEGSQSGG-------PNAGDWKNRSRPVRPPGQRFELYTPE 292


>ref|XP_003627838.1| DNA-directed RNA polymerase subunit [Medicago truncatula]
            gi|355521860|gb|AET02314.1| DNA-directed RNA polymerase
            subunit [Medicago truncatula]
          Length = 2032

 Score = 79.3 bits (194), Expect = 1e-12
 Identities = 73/233 (31%), Positives = 104/233 (44%), Gaps = 3/233 (1%)
 Frame = +2

Query: 8    DGGSSWGQKVDKDGGSWGEKVETKNSPQLSGKEQSGS--WSSWAKPVEKDGGSSWGKKVD 181
            D  SSWGQK D+        V  ++S + +  EQ       SW   V     SSWGK   
Sbjct: 1644 DNRSSWGQKKDEI------HVMPEDSSRSNAWEQKPENVKDSWVAKVPV-ANSSWGK-AK 1695

Query: 182  EPENNPHQSQSGSWSSLGKKVEKDGGSWDGPKQSNSDSSWGKATKGGGLGSTTAEGNRRL 361
             PEN P  S++   +S GK   ++   WD   ++ SDSSWGK        S  ++     
Sbjct: 1696 SPENRPWDSKNEPNNSFGKPNSQENEPWDS--KNESDSSWGKPK------SQESQPWDSK 1747

Query: 362  DQSINDWNGSSSGDGQLNESTRDDSTNIGGWNSSTVGGWDSQKVGVE-ESDKQPQWGQRR 538
            ++S + W    S +    +S  + +   G        GWDSQ      ESDK  QWG++ 
Sbjct: 1748 NESNSSWGKPKSQENHPWDSKNESNQTAGS------RGWDSQVASANSESDKSFQWGKQG 1801

Query: 539  RNSRGDFKENSRGWGSASGGDWKSNRPPRSADDSNRGVNLTATRQKLDIFTAE 697
            R+S   FK+N R  GS SGG       P + D  NR   +    Q+ +++T E
Sbjct: 1802 RDS---FKKN-RFEGSQSGG-------PNAGDWKNRSRPVRPPGQRFELYTPE 1843



 Score = 59.3 bits (142), Expect = 1e-06
 Identities = 56/230 (24%), Positives = 90/230 (39%), Gaps = 22/230 (9%)
 Frame = +2

Query: 20   SWG----QKVDKDGGSWGEKVETKNSPQLSGKEQSGSWSSWAKPVEKDGGSSWGKKVDEP 187
            SWG    QK D+   +WG+ V  ++S       +SG+W +    V++D   S       P
Sbjct: 1496 SWGAATNQKSDQSASAWGKAVVQEDS------SKSGAWGNAKSVVQEDSSKSGA-----P 1544

Query: 188  ENNPHQSQSGSWSSLGKKVEKDGGSWDGPKQSNSDSSWGKATKGGG---LGSTTAEGNRR 358
             N  H S    W  +    E+  G   G K+  +D S   +T  GG    GS+  E +  
Sbjct: 1545 ANTNHSSDQSCWGQITGGEERAQGESGGTKKWKADVSQEDSTNSGGWKAWGSSKPEVHEG 1604

Query: 359  LDQSIND-WNGSSSGDGQLNESTRDDSTNIGGW------NSSTVGGWDSQK----VGVEE 505
                + D WN      G+  + ++ DS     W      ++     W  +K    V  E+
Sbjct: 1605 ESTKVQDSWNSQKWKAGE--DVSQKDSQKSSAWGATKPKSNDNRSSWGQKKDEIHVMPED 1662

Query: 506  SDKQPQWGQRRRNSRGDFKEN----SRGWGSASGGDWKSNRPPRSADDSN 643
            S +   W Q+  N +  +       +  WG A   +   NRP  S ++ N
Sbjct: 1663 SSRSNAWEQKPENVKDSWVAKVPVANSSWGKAKSPE---NRPWDSKNEPN 1709


>ref|XP_001436180.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
           gi|124403319|emb|CAK68783.1| unnamed protein product
           [Paramecium tetraurelia]
          Length = 1015

 Score = 79.0 bits (193), Expect = 1e-12
 Identities = 67/245 (27%), Positives = 91/245 (37%), Gaps = 33/245 (13%)
 Frame = +2

Query: 14  GSSWGQKVDKD--GGSWGEKVETKNSPQLSG---------KEQSGSWSSWAKPVEKDGGS 160
           GSSWG   +KD  GG WG    T   P  SG           QSG W +  +   +    
Sbjct: 225 GSSWGNSDNKDNAGGGWGS-TSTNEQPAQSGGWGSTTTEQPAQSGGWGNSTEQPVQQASE 283

Query: 161 SWGKKVDEPENNPHQSQSGSWSSLGKKVEKDGGSWDG-----PKQSNSDSSWGKATK--- 316
            WG K ++P      +Q G  ++  +  ++ GG W        KQSN    WG +T+   
Sbjct: 284 GWGSKTEQPPQQAESNQGGWGTTTEQSNQQSGGGWGSTTEQPQKQSN---GWGNSTQEQQ 340

Query: 317 -----GGGLGSTTAEGNRRLDQSINDWNGSSSGDGQLN---ESTRDDSTNIGGWNSST-- 466
                 GG GS+  E   +  QS   W  S++          ST + +T  GGW S+T  
Sbjct: 341 PQQSGAGGWGSSNTE---QPAQSSGGWGASTTEQPATTGGWGSTTEQATTSGGWGSTTDQ 397

Query: 467 ----VGGWDSQKVGVEESDKQPQWGQRRRNSRGDFKENSRGWGSASGGDWKSNRPPRSAD 634
                GGW         SD+Q                N   WG    GD +  R     D
Sbjct: 398 AASSGGGWGG------SSDQQ----------------NGNSWGGGGSGDGQRGRGRGRGD 435

Query: 635 DSNRG 649
             +RG
Sbjct: 436 RGDRG 440



 Score = 68.6 bits (166), Expect = 2e-09
 Identities = 57/216 (26%), Positives = 79/216 (36%), Gaps = 13/216 (6%)
 Frame = +2

Query: 8    DGGSSWGQKVDKDGGSWGEKVETKNSPQLSGKEQSGSWSSWAKPVEKDGGSSWGKKVDEP 187
            +GGS  G   D+ GG WG    T + P       +G W S A    +  G  WG    E 
Sbjct: 506  NGGSWGGSSNDQGGGGWGS--STTDQP-----ASNGGWGSTATEQPQSNG-GWGSTATE- 556

Query: 188  ENNPHQSQSGSWSSLGKKVEKDGGSW--DGPKQSNSDSSWGKAT-----KGGGLGSTTAE 346
                  +Q+G W S   +     G W     +Q      WG  T       GG GST  E
Sbjct: 557  ----QPAQTGGWGSTATEKPAQNGGWGSTATEQPQQSGGWGSTTTEQPQASGGWGSTATE 612

Query: 347  GNRRLDQSINDWNGSSSGDGQLNESTRDDSTNIGGWNSSTV-----GGWDSQKVGVEESD 511
               +     +      + +G    +  +     GGW SS       GGW S     ++S+
Sbjct: 613  QPAQNGGWGSTTTEQPAQNGGWGSTATEQPAQTGGWGSSDAPQQSNGGWGSSNNDQQQSN 672

Query: 512  KQPQWGQRRRNSRGDFKE-NSRGWGSASGGDWKSNR 616
                WG   + S G+ +    RG G   GGD   +R
Sbjct: 673  ---GWGSSNQQSNGNGERGRGRGRGRGRGGDRGGDR 705



 Score = 63.9 bits (154), Expect = 5e-08
 Identities = 57/229 (24%), Positives = 83/229 (36%), Gaps = 18/229 (7%)
 Frame = +2

Query: 11   GGSSWGQKVDKDGGSWGEKVETKNSPQLSGKEQSGSWSSWAKPVEKDGGSSWGKK-VDEP 187
            G    G + D+ GG  G +            + SG+  SW       GG  WG    D+P
Sbjct: 481  GRGDRGDRGDRRGGFRGGR----------DNDNSGNGGSWGGSSNDQGGGGWGSSTTDQP 530

Query: 188  ENNPHQSQSGSWSSLGKKVEKDGGSW--DGPKQSNSDSSWG-----KATKGGGLGSTTAE 346
             +N      G W S   +  +  G W     +Q      WG     K  + GG GST  E
Sbjct: 531  ASN------GGWGSTATEQPQSNGGWGSTATEQPAQTGGWGSTATEKPAQNGGWGSTATE 584

Query: 347  GNRRLDQSINDWNGSSS----GDGQLNESTRDDSTNIGGWNSSTV------GGWDSQKVG 496
                  Q    W  +++      G    +  +     GGW S+T       GGW S    
Sbjct: 585  ----QPQQSGGWGSTTTEQPQASGGWGSTATEQPAQNGGWGSTTTEQPAQNGGWGS--TA 638

Query: 497  VEESDKQPQWGQRRRNSRGDFKENSRGWGSASGGDWKSNRPPRSADDSN 643
             E+  +   WG     S    ++++ GWGS++    +SN    S   SN
Sbjct: 639  TEQPAQTGGWG-----SSDAPQQSNGGWGSSNNDQQQSNGWGSSNQQSN 682



 Score = 57.8 bits (138), Expect = 3e-06
 Identities = 55/217 (25%), Positives = 86/217 (39%), Gaps = 18/217 (8%)
 Frame = +2

Query: 59  GEKVETKNSPQLSGKEQSGSWSSWAKPVEKDGGSSWG---KKVDEPEN--NPHQSQSGSW 223
           GE     ++P   G++ +G        V+K     WG   KK ++  N  N   ++SG W
Sbjct: 11  GEVKPVDSTPVWGGEQNAGGQQ--IDEVKKTQEPQWGQDEKKENQETNAQNGTAAESGGW 68

Query: 224 SSLGKKVEKDGGSWDGPKQSNSDSSWGKATKGGGLGSTTAEGNRRLDQSINDW-NGSSSG 400
              G + ++   SW   K  N+D++WG  T G G  ST   G          W NG SS 
Sbjct: 69  ---GNQPQQQSSSWGETKTDNNDNAWGSGTTGFGSTSTGDNGGSSWGGGSTSWGNGGSSD 125

Query: 401 DGQLNESTR-----DDSTNIGGWNSSTVGGWDSQKVGVE----ESDKQPQWGQRRRNSRG 553
           +   ++  R     D     GG+      G +  +   +      D+  + G R R  RG
Sbjct: 126 NNFQSDRPRGRGRGDRGDRGGGYRGRGDRGGEGFRGRGDGFRGRGDRGDRGGFRGRGDRG 185

Query: 554 DFKENSRG---WGSASGGDWKSNRPPRSADDSNRGVN 655
             + + RG    G  +GG W  N    +   ++ G N
Sbjct: 186 GDRGDRRGGFRGGRDNGGSWGGNSSNNNGGGNSWGGN 222


>emb|CAI45859.1| NOWA1 protein [Paramecium tetraurelia]
          Length = 1024

 Score = 79.0 bits (193), Expect = 1e-12
 Identities = 67/245 (27%), Positives = 91/245 (37%), Gaps = 33/245 (13%)
 Frame = +2

Query: 14  GSSWGQKVDKD--GGSWGEKVETKNSPQLSG---------KEQSGSWSSWAKPVEKDGGS 160
           GSSWG   +KD  GG WG    T   P  SG           QSG W +  +   +    
Sbjct: 225 GSSWGNSDNKDNAGGGWGS-TSTNEQPAQSGGWGSTTTEQPAQSGGWGNSTEQPVQQASE 283

Query: 161 SWGKKVDEPENNPHQSQSGSWSSLGKKVEKDGGSWDG-----PKQSNSDSSWGKATK--- 316
            WG K ++P      +Q G  ++  +  ++ GG W        KQSN    WG +T+   
Sbjct: 284 GWGSKTEQPPQQAESNQGGWGTTTEQSNQQSGGGWGSTTEQPQKQSN---GWGNSTQEQQ 340

Query: 317 -----GGGLGSTTAEGNRRLDQSINDWNGSSSGDGQLN---ESTRDDSTNIGGWNSST-- 466
                 GG GS+  E   +  QS   W  S++          ST + +T  GGW S+T  
Sbjct: 341 PQQSGAGGWGSSNTE---QPAQSSGGWGASTTEQPATTGGWGSTTEQATTSGGWGSTTDQ 397

Query: 467 ----VGGWDSQKVGVEESDKQPQWGQRRRNSRGDFKENSRGWGSASGGDWKSNRPPRSAD 634
                GGW         SD+Q                N   WG    GD +  R     D
Sbjct: 398 AASSGGGWGG------SSDQQ----------------NGNSWGGGGSGDGQRGRGRGRGD 435

Query: 635 DSNRG 649
             +RG
Sbjct: 436 RGDRG 440



 Score = 68.6 bits (166), Expect = 2e-09
 Identities = 57/216 (26%), Positives = 79/216 (36%), Gaps = 13/216 (6%)
 Frame = +2

Query: 8    DGGSSWGQKVDKDGGSWGEKVETKNSPQLSGKEQSGSWSSWAKPVEKDGGSSWGKKVDEP 187
            +GGS  G   D+ GG WG    T + P       +G W S A    +  G  WG    E 
Sbjct: 506  NGGSWGGSSNDQGGGGWGS--STTDQP-----ASNGGWGSTATEQPQSNG-GWGSTATE- 556

Query: 188  ENNPHQSQSGSWSSLGKKVEKDGGSW--DGPKQSNSDSSWGKAT-----KGGGLGSTTAE 346
                  +Q+G W S   +     G W     +Q      WG  T       GG GST  E
Sbjct: 557  ----QPAQTGGWGSTATEKPAQNGGWGSTATEQPQQSGGWGSTTTEQPQASGGWGSTATE 612

Query: 347  GNRRLDQSINDWNGSSSGDGQLNESTRDDSTNIGGWNSSTV-----GGWDSQKVGVEESD 511
               +     +      + +G    +  +     GGW SS       GGW S     ++S+
Sbjct: 613  QPAQNGGWGSTTTEQPAQNGGWGSTATEQPAQTGGWGSSDAPQQSNGGWGSSNNDQQQSN 672

Query: 512  KQPQWGQRRRNSRGDFKE-NSRGWGSASGGDWKSNR 616
                WG   + S G+ +    RG G   GGD   +R
Sbjct: 673  ---GWGSSNQQSNGNGERGRGRGRGRGRGGDRGGDR 705



 Score = 63.9 bits (154), Expect = 5e-08
 Identities = 57/229 (24%), Positives = 83/229 (36%), Gaps = 18/229 (7%)
 Frame = +2

Query: 11   GGSSWGQKVDKDGGSWGEKVETKNSPQLSGKEQSGSWSSWAKPVEKDGGSSWGKK-VDEP 187
            G    G + D+ GG  G +            + SG+  SW       GG  WG    D+P
Sbjct: 481  GRGDRGDRGDRRGGFRGGR----------DNDNSGNGGSWGGSSNDQGGGGWGSSTTDQP 530

Query: 188  ENNPHQSQSGSWSSLGKKVEKDGGSW--DGPKQSNSDSSWG-----KATKGGGLGSTTAE 346
             +N      G W S   +  +  G W     +Q      WG     K  + GG GST  E
Sbjct: 531  ASN------GGWGSTATEQPQSNGGWGSTATEQPAQTGGWGSTATEKPAQNGGWGSTATE 584

Query: 347  GNRRLDQSINDWNGSSS----GDGQLNESTRDDSTNIGGWNSSTV------GGWDSQKVG 496
                  Q    W  +++      G    +  +     GGW S+T       GGW S    
Sbjct: 585  ----QPQQSGGWGSTTTEQPQASGGWGSTATEQPAQNGGWGSTTTEQPAQNGGWGS--TA 638

Query: 497  VEESDKQPQWGQRRRNSRGDFKENSRGWGSASGGDWKSNRPPRSADDSN 643
             E+  +   WG     S    ++++ GWGS++    +SN    S   SN
Sbjct: 639  TEQPAQTGGWG-----SSDAPQQSNGGWGSSNNDQQQSNGWGSSNQQSN 682



 Score = 57.8 bits (138), Expect = 3e-06
 Identities = 55/217 (25%), Positives = 86/217 (39%), Gaps = 18/217 (8%)
 Frame = +2

Query: 59  GEKVETKNSPQLSGKEQSGSWSSWAKPVEKDGGSSWG---KKVDEPEN--NPHQSQSGSW 223
           GE     ++P   G++ +G        V+K     WG   KK ++  N  N   ++SG W
Sbjct: 11  GEVKPVDSTPVWGGEQNAGGQQ--IDEVKKTQEPQWGQDEKKENQETNAQNGTAAESGGW 68

Query: 224 SSLGKKVEKDGGSWDGPKQSNSDSSWGKATKGGGLGSTTAEGNRRLDQSINDW-NGSSSG 400
              G + ++   SW   K  N+D++WG  T G G  ST   G          W NG SS 
Sbjct: 69  ---GNQPQQQSSSWGETKTDNNDNAWGSGTTGFGSTSTGDNGGSSWGGGSTSWGNGGSSD 125

Query: 401 DGQLNESTR-----DDSTNIGGWNSSTVGGWDSQKVGVE----ESDKQPQWGQRRRNSRG 553
           +   ++  R     D     GG+      G +  +   +      D+  + G R R  RG
Sbjct: 126 NNFQSDRPRGRGRGDRGDRGGGYRGRGDRGGEGFRGRGDGFRGRGDRGDRGGFRGRGDRG 185

Query: 554 DFKENSRG---WGSASGGDWKSNRPPRSADDSNRGVN 655
             + + RG    G  +GG W  N    +   ++ G N
Sbjct: 186 GDRGDRRGGFRGGRDNGGSWGGNSSNNNGGGNSWGGN 222


>ref|XP_004308588.1| PREDICTED: DNA-directed RNA polymerase E subunit 1-like [Fragaria
            vesca subsp. vesca]
          Length = 1991

 Score = 78.6 bits (192), Expect = 2e-12
 Identities = 76/285 (26%), Positives = 121/285 (42%), Gaps = 58/285 (20%)
 Frame = +2

Query: 17   SSWGQKVDKDGGSW----GEKVETKNSPQLSGKEQSGSWSSWA----KPVEKDGGSSWGK 172
            S+W    + + G W    G KVE+ +       E    W+ ++    KP  ++ GS WG 
Sbjct: 1579 STWRTSTESENG-WSGRGGSKVESTDVQSQKAVENPKGWNDFSAGVRKPQTENAGSGWGM 1637

Query: 173  K---------VDEPENNPHQ--------SQSGSWSSLGKKVEKDGGSWDGPKQSN-SDSS 298
            K         +++ E+  H         S  G+W    K  +   G+W+ PK +  S  +
Sbjct: 1638 KGSEKKGDIELEQDESTRHSWKQKSADASSQGAWERQ-KSPDTSKGTWEQPKSAEMSHGA 1696

Query: 299  WGKATKGG------GL---GSTTAEGNRRLDQSIN----DWNGSSSGDGQLNEST----R 427
            WG+           GL    ST ++G+    +S      +W   +S D  +++ T    +
Sbjct: 1697 WGQQKSPDVSQGVWGLEKPASTNSQGSWGQQKSPEIPQGNWGQQTSPD--ISQGTWGQQK 1754

Query: 428  DDSTNIGGWNS---------STVGGWDSQKVGVEESDKQPQWGQRRRNSRGDFKENSRGW 580
                + G W           +TV  WDSQ     E  +  QWG    +++    E  R W
Sbjct: 1755 SPEMSQGSWGQQKPSDTSQPATVNQWDSQSEAAVE--RHQQWGHNGDSNKRKRFEGGRSW 1812

Query: 581  GSASGGDWK--SNRPPRSA----DDSNRGVNLTATRQKLDIFTAE 697
            G  + G+WK  ++RP +S     DDS+     TATRQ+LDIFT+E
Sbjct: 1813 GP-NAGEWKGKNSRPAKSPGMVNDDSSVAAIYTATRQRLDIFTSE 1856

