BLASTX nr result

ID: Zingiber25_contig00004665 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zingiber25_contig00004665
         (1827 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EOY11663.1| Tetratricopeptide repeat (TPR)-like superfamily p...   381   e-103
gb|EOY11661.1| Tetratricopeptide repeat (TPR)-like superfamily p...   380   e-103
ref|XP_006472205.1| PREDICTED: RNA polymerase II-associated prot...   377   e-102
ref|XP_006433540.1| hypothetical protein CICLE_v10003914mg [Citr...   373   e-100
ref|XP_002330255.1| predicted protein [Populus trichocarpa]           370   1e-99
gb|EOY11660.1| Tetratricopeptide repeat-like superfamily protein...   367   1e-98
ref|XP_004302236.1| PREDICTED: RNA polymerase II-associated prot...   363   1e-97
gb|EXB53029.1| RNA polymerase II-associated protein 3 [Morus not...   363   2e-97
ref|XP_006840434.1| hypothetical protein AMTR_s00045p00163960 [A...   361   7e-97
ref|XP_006339932.1| PREDICTED: RNA polymerase II-associated prot...   360   1e-96
gb|EOY11664.1| Tetratricopeptide repeat (TPR)-like superfamily p...   360   1e-96
ref|XP_006339935.1| PREDICTED: RNA polymerase II-associated prot...   356   2e-95
ref|XP_004248819.1| PREDICTED: RNA polymerase II-associated prot...   356   2e-95
ref|XP_006339933.1| PREDICTED: RNA polymerase II-associated prot...   353   1e-94
ref|XP_006654957.1| PREDICTED: RNA polymerase II-associated prot...   353   2e-94
ref|NP_001054548.1| Os05g0129900 [Oryza sativa Japonica Group] g...   350   9e-94
gb|EMJ06489.1| hypothetical protein PRUPE_ppa006661mg [Prunus pe...   350   1e-93
ref|XP_006588434.1| PREDICTED: uncharacterized protein LOC100784...   347   1e-92
gb|ESW16998.1| hypothetical protein PHAVU_007G201600g [Phaseolus...   345   5e-92
ref|NP_001242466.1| uncharacterized protein LOC100784528 [Glycin...   344   8e-92

>gb|EOY11663.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative
            isoform 4 [Theobroma cacao]
          Length = 421

 Score =  381 bits (978), Expect = e-103
 Identities = 205/428 (47%), Positives = 280/428 (65%), Gaps = 13/428 (3%)
 Frame = +2

Query: 395  SKKKVRDRSVEFKGFLNDLQDWDYLRDGKD-------------INQKGRDRQNNKLIVDG 535
            + K  RD++++F+GFLN+LQDW+     KD              N+KGR    + LI   
Sbjct: 2    ASKHSRDQALDFQGFLNNLQDWELSLKEKDKIMKSQASDKEQLTNEKGRPTGKSSLI--- 58

Query: 536  NKASSSDRSAQFDYLKYADPIGQISGINYNDQVPPDASSEKELGNECFKQKKFSEAIDCY 715
               SS+  S Q+DYL+  D    +S     ++  PDA+SEKELGNE FKQKKF EAIDCY
Sbjct: 59   --DSSTTSSRQYDYLQNYDKFNSLSSSFVTEENMPDAASEKELGNEYFKQKKFKEAIDCY 116

Query: 716  SRSIALSPTSVAFANRAMAYLKLKRYEEAESDCTEALNLDDRYVKAYSRRATARKELGKL 895
            SRSI LSPT+VA ANRAMAYLK+K+++EAE DCTEALNLDDRY+KAYSRRATARKELGKL
Sbjct: 117  SRSIGLSPTAVAHANRAMAYLKIKKFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKL 176

Query: 896  KAALEDADFAVRLEPNNNEVRKQYSETKALYEKEITKRNSESLKSISKGSEPPDSFQTKS 1075
            K ++ED +FA+RLEPNN E++KQ++E K+LYEKEI ++ S  L+   + ++     +TK 
Sbjct: 177  KESIEDTEFALRLEPNNQEIKKQHAEFKSLYEKEILQKASGVLRKSMQEAQEVGKSETKE 236

Query: 1076 KIVKDDYLVSSKTQKVATNEQNSGSVQMIQKRSEGAQTKYELKEPLQDVXXXXXXXXXXX 1255
              +   +  S+ TQ+         +VQ  Q +      K ELK  +Q++           
Sbjct: 237  NGL-GMHSASNSTQRTGV-----ATVQGYQTKKNNRTRKPELKASVQELASLAATRAMAE 290

Query: 1256 XXVNLTAPKSAYEFEVSWRALSDDSARQMQLLKMIPPATLPHIFKNALSAPILVDIIKCV 1435
               N++ P +AY+FEVSWRALS D A Q  LLK+  P+ LP IFKNALSA +LVDIIKCV
Sbjct: 291  AAKNISPPNTAYQFEVSWRALSGDRALQAHLLKVTSPSALPQIFKNALSASMLVDIIKCV 350

Query: 1436 TTFFKEDSELAIGILDNMTRVPRFDMLMMCISARDRSEINRMWGNISSCTTIPASHREAV 1615
             TFF+E+ +LAI  L+N+T+VPRFDML+MC+S+ +++++ ++W ++      P    E +
Sbjct: 351  ATFFREEVDLAIKYLENLTKVPRFDMLIMCLSSTEKADLLKVWDDVFCNEATPIEWAEIL 410

Query: 1616 AQLRTKYC 1639
              LR+ YC
Sbjct: 411  DNLRSVYC 418


>gb|EOY11661.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative
            isoform 2 [Theobroma cacao]
          Length = 422

 Score =  380 bits (977), Expect = e-103
 Identities = 205/429 (47%), Positives = 280/429 (65%), Gaps = 14/429 (3%)
 Frame = +2

Query: 395  SKKKVRDRSVEFKGFLNDLQDWDYLRDGKD--------------INQKGRDRQNNKLIVD 532
            + K  RD++++F+GFLN+LQDW+     KD               N+KGR    + LI  
Sbjct: 2    ASKHSRDQALDFQGFLNNLQDWELSLKEKDKIMKSQASDKEQLKTNEKGRPTGKSSLI-- 59

Query: 533  GNKASSSDRSAQFDYLKYADPIGQISGINYNDQVPPDASSEKELGNECFKQKKFSEAIDC 712
                SS+  S Q+DYL+  D    +S     ++  PDA+SEKELGNE FKQKKF EAIDC
Sbjct: 60   ---DSSTTSSRQYDYLQNYDKFNSLSSSFVTEENMPDAASEKELGNEYFKQKKFKEAIDC 116

Query: 713  YSRSIALSPTSVAFANRAMAYLKLKRYEEAESDCTEALNLDDRYVKAYSRRATARKELGK 892
            YSRSI LSPT+VA ANRAMAYLK+K+++EAE DCTEALNLDDRY+KAYSRRATARKELGK
Sbjct: 117  YSRSIGLSPTAVAHANRAMAYLKIKKFQEAEDDCTEALNLDDRYIKAYSRRATARKELGK 176

Query: 893  LKAALEDADFAVRLEPNNNEVRKQYSETKALYEKEITKRNSESLKSISKGSEPPDSFQTK 1072
            LK ++ED +FA+RLEPNN E++KQ++E K+LYEKEI ++ S  L+   + ++     +TK
Sbjct: 177  LKESIEDTEFALRLEPNNQEIKKQHAEFKSLYEKEILQKASGVLRKSMQEAQEVGKSETK 236

Query: 1073 SKIVKDDYLVSSKTQKVATNEQNSGSVQMIQKRSEGAQTKYELKEPLQDVXXXXXXXXXX 1252
               +   +  S+ TQ+         +VQ  Q +      K ELK  +Q++          
Sbjct: 237  ENGL-GMHSASNSTQRTGV-----ATVQGYQTKKNNRTRKPELKASVQELASLAATRAMA 290

Query: 1253 XXXVNLTAPKSAYEFEVSWRALSDDSARQMQLLKMIPPATLPHIFKNALSAPILVDIIKC 1432
                N++ P +AY+FEVSWRALS D A Q  LLK+  P+ LP IFKNALSA +LVDIIKC
Sbjct: 291  EAAKNISPPNTAYQFEVSWRALSGDRALQAHLLKVTSPSALPQIFKNALSASMLVDIIKC 350

Query: 1433 VTTFFKEDSELAIGILDNMTRVPRFDMLMMCISARDRSEINRMWGNISSCTTIPASHREA 1612
            V TFF+E+ +LAI  L+N+T+VPRFDML+MC+S+ +++++ ++W ++      P    E 
Sbjct: 351  VATFFREEVDLAIKYLENLTKVPRFDMLIMCLSSTEKADLLKVWDDVFCNEATPIEWAEI 410

Query: 1613 VAQLRTKYC 1639
            +  LR+ YC
Sbjct: 411  LDNLRSVYC 419


>ref|XP_006472205.1| PREDICTED: RNA polymerase II-associated protein 3-like [Citrus
            sinensis]
          Length = 438

 Score =  377 bits (969), Expect = e-102
 Identities = 202/434 (46%), Positives = 278/434 (64%), Gaps = 21/434 (4%)
 Frame = +2

Query: 401  KKVRDRSVEFKGFLNDLQDWDYLRDGKDINQKGRDRQNNKLIVDGNKASSSDRSAQFDYL 580
            K  RD++++F+GFLNDLQDWD   + KD   K +    + L+    K++     +   Y 
Sbjct: 3    KHNRDQALDFQGFLNDLQDWDLSLNEKDKKMKHKASSKDNLVSSSLKSAKKPSPSGNSYS 62

Query: 581  KYADPIGQISGINYNDQVPPDASSEKELGNECFKQKKFSEAIDCYSRSIALSPTSVAFAN 760
            +  DP+  IS    N++  PDA+SEKELGNECFKQKKF EAIDCYSRSIALSPT+VA+AN
Sbjct: 63   RNYDPVSHISSSLMNEESTPDATSEKELGNECFKQKKFKEAIDCYSRSIALSPTAVAYAN 122

Query: 761  RAMAYLKLKRYEEAESDCTEALNLDDRYVKAYSRRATARKELGKLKAALEDADFAVRLEP 940
            RAMAYLKL+R++EAE DCTEALNLDDRY+KAYSRRATARKELGKLK ++ED++FA+RLEP
Sbjct: 123  RAMAYLKLRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKLKESIEDSEFALRLEP 182

Query: 941  NNNEVRKQYSETKALYEKEITKRNSESLKSISKGSEPPDSFQT--------KSKIVK-DD 1093
             N E++KQ +E K+LYEKE+ ++ S++L+   K     +  +         K+ + +  D
Sbjct: 183  QNQEIKKQLAEVKSLYEKEVFQKASKTLEKYGKSGMKVNGHEVRAVRNTTQKTGVAEIQD 242

Query: 1094 YLVSSKTQ------KVATNEQNSGS------VQMIQKRSEGAQTKYELKEPLQDVXXXXX 1237
              +S KT+      +  T  Q  GS      +  + KR+   + K  L   +Q++     
Sbjct: 243  LTISKKTENKNLRDESKTEGQRDGSGANATHISGLDKRNHRTK-KAVLDASVQELATRAT 301

Query: 1238 XXXXXXXXVNLTAPKSAYEFEVSWRALSDDSARQMQLLKMIPPATLPHIFKNALSAPILV 1417
                     N+T PKSAYEFEVSWR  + D A Q +LLK I P  LP IFKNALSA IL+
Sbjct: 302  SRAVAEAAKNITPPKSAYEFEVSWRGFAGDHALQARLLKAISPNALPQIFKNALSASILI 361

Query: 1418 DIIKCVTTFFKEDSELAIGILDNMTRVPRFDMLMMCISARDRSEINRMWGNISSCTTIPA 1597
            DI+K V TFF  + +LAI  L+ +T VPRFD+++MC+S  D++++ ++W       + P 
Sbjct: 362  DIVKVVATFFTGEVDLAIKYLEYLTMVPRFDLVIMCLSLADKADLRKVWDETFCNESTPI 421

Query: 1598 SHREAVAQLRTKYC 1639
             + E +  LR+KYC
Sbjct: 422  EYAEILDNLRSKYC 435


>ref|XP_006433540.1| hypothetical protein CICLE_v10003914mg [Citrus clementina]
            gi|557535662|gb|ESR46780.1| hypothetical protein
            CICLE_v10003914mg [Citrus clementina]
          Length = 977

 Score =  373 bits (957), Expect = e-100
 Identities = 201/431 (46%), Positives = 273/431 (63%), Gaps = 21/431 (4%)
 Frame = +2

Query: 410  RDRSVEFKGFLNDLQDWDYLRDGKDINQKGRDRQNNKLIVDGNKASSSDRSAQFDYLKYA 589
            RD++++F+GFLNDLQDWD     KD   K +    + L+    K+      +   Y +  
Sbjct: 545  RDQALDFQGFLNDLQDWDLSLHEKDKKMKHKASSKDNLVSSSLKSGEKPSPSGNSYSRNY 604

Query: 590  DPIGQISGINYNDQVPPDASSEKELGNECFKQKKFSEAIDCYSRSIALSPTSVAFANRAM 769
            DP+ +IS    N++  PDA+SEKELGNECFKQKKF EAIDCYSRSIALSPT+VA+ANRAM
Sbjct: 605  DPVSRISSSLMNEESTPDATSEKELGNECFKQKKFKEAIDCYSRSIALSPTAVAYANRAM 664

Query: 770  AYLKLKRYEEAESDCTEALNLDDRYVKAYSRRATARKELGKLKAALEDADFAVRLEPNNN 949
            AYLKL+R++EAE DCTEALNLDDRY+KAYSRRATARKELGKLK ++ED++FA+RLEP N 
Sbjct: 665  AYLKLRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKLKESIEDSEFALRLEPQNQ 724

Query: 950  EVRKQYSETKALYEKEITKRNSESLKSISKGSEPPDSFQ---TKSKIVK------DDYLV 1102
            E++KQ +E K+LYEKE+ ++ S++L+   K     +  +    ++ I K       D  +
Sbjct: 725  EIKKQLAEVKSLYEKEVFQKASKTLEKYGKSGMKVNGHEVRAVRNTIQKTGVAEIQDLTI 784

Query: 1103 SSKTQ------KVATNEQNSGS------VQMIQKRSEGAQTKYELKEPLQDVXXXXXXXX 1246
            S KT+      +  T  Q  GS      +  + KR+   + K  L   +Q++        
Sbjct: 785  SKKTENKNLRDESKTEGQRDGSGANATHISGLDKRNHRTK-KAVLDASVQELATRATSRA 843

Query: 1247 XXXXXVNLTAPKSAYEFEVSWRALSDDSARQMQLLKMIPPATLPHIFKNALSAPILVDII 1426
                  N+T PKSAYEFEVSWR  + D A Q +LLK I P  LP IFKNALSA IL+DI+
Sbjct: 844  VAEAAKNITPPKSAYEFEVSWRGFAGDHALQARLLKAISPNALPQIFKNALSASILIDIV 903

Query: 1427 KCVTTFFKEDSELAIGILDNMTRVPRFDMLMMCISARDRSEINRMWGNISSCTTIPASHR 1606
            K V  FF  + +LAI  L+ +T VPRFD ++MC+S  D++++ ++W         P  + 
Sbjct: 904  KVVAMFFPGEVDLAIKYLEYLTMVPRFDFVIMCLSLADKADLRKVWDETFCNELTPIEYA 963

Query: 1607 EAVAQLRTKYC 1639
            E +  LR+KYC
Sbjct: 964  EILDNLRSKYC 974


>ref|XP_002330255.1| predicted protein [Populus trichocarpa]
          Length = 434

 Score =  370 bits (950), Expect = 1e-99
 Identities = 201/439 (45%), Positives = 282/439 (64%), Gaps = 20/439 (4%)
 Frame = +2

Query: 383  MSETSKKKVRDRSVEFKGFLNDLQDWDYLRDGKDINQKGRDRQNNKLIVDGN---KASSS 553
            M+    K  RD++++F+GFLNDLQDW+ L+D     +K     + K+  DG    K S++
Sbjct: 1    MARVPGKHGRDQALDFQGFLNDLQDWELLKDTDKKMKKKSRASDVKIGEDGRSKGKTSAA 60

Query: 554  DRS----AQFDYLKYADPIGQISGINYNDQVPPDASSEKELGNECFKQKKFSEAIDCYSR 721
            D S     Q++Y +    I ++S     D++  DA++EKELGNE FKQKKF+EAI+CYSR
Sbjct: 61   DSSRSGSGQYEYSRNFGAINRLSSSFTTDEITVDATTEKELGNEYFKQKKFNEAIECYSR 120

Query: 722  SIALSPTSVAFANRAMAYLKLKR----YEEAESDCTEALNLDDRYVKAYSRRATARKELG 889
            SIALSPT+VA+ANRAMAYLK+KR    + EAE DCTEALNLDDRY+KAYSRRATARKELG
Sbjct: 121  SIALSPTAVAYANRAMAYLKIKRQFFLFREAEDDCTEALNLDDRYIKAYSRRATARKELG 180

Query: 890  KLKAALEDADFAVRLEPNNNEVRKQYSETKALYEK-------EITKRNSESLKSISKGSE 1048
            KLK ++ED++FA++LEPNN E++KQY+E K+LYEK       EI ++ S +L+S  +G++
Sbjct: 181  KLKESIEDSEFALKLEPNNQEIKKQYAEVKSLYEKASDYLMLEILQKASGTLRSSLQGTQ 240

Query: 1049 PPDSFQTK--SKIVKDDYLVSSKTQKVATNEQNSGSVQMIQKRSEGAQTKYELKEPLQDV 1222
                 +       V    + + KT   A+ + N+        +      + ELK  + ++
Sbjct: 241  QGGRSEASVNGHAVHPVSIATQKTGVSASKKDNT--------KKNNRTRRQELKTSVIEL 292

Query: 1223 XXXXXXXXXXXXXVNLTAPKSAYEFEVSWRALSDDSARQMQLLKMIPPATLPHIFKNALS 1402
                          N+T P SAY+FEVSW+  S D A Q  LLK+  P+ LP IFKNALS
Sbjct: 293  ASQAASRAMAEAAKNITPPNSAYQFEVSWQGFSGDRALQAHLLKVTSPSALPQIFKNALS 352

Query: 1403 APILVDIIKCVTTFFKEDSELAIGILDNMTRVPRFDMLMMCISARDRSEINRMWGNISSC 1582
             PIL+DIIKCV +FF +D + A+  L+N+T+VPRFDML+MC+S+ D S++ +MW  +   
Sbjct: 353  VPILIDIIKCVASFFIDDMDFAVKYLENLTKVPRFDMLIMCLSSTDTSDLLKMWDGVFCS 412

Query: 1583 TTIPASHREAVAQLRTKYC 1639
             + P  + E +  LR+KYC
Sbjct: 413  ASTPIEYAEILDNLRSKYC 431


>gb|EOY11660.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1
            [Theobroma cacao]
          Length = 468

 Score =  367 bits (941), Expect = 1e-98
 Identities = 211/470 (44%), Positives = 287/470 (61%), Gaps = 55/470 (11%)
 Frame = +2

Query: 395  SKKKVRDRSVEFKGFLNDLQDWDYLRDGKD-------------INQKGRDRQNNKLIVDG 535
            + K  RD++++F+GFLN+LQDW+     KD              N+KGR    + LI   
Sbjct: 2    ASKHSRDQALDFQGFLNNLQDWELSLKEKDKIMKSQASDKEQLTNEKGRPTGKSSLI--- 58

Query: 536  NKASSSDRSAQFDYLKYADPIGQISGINYNDQVPPDASSEKELGNECFKQKKFSEAIDCY 715
               SS+  S Q+DYL+  D    +S     ++  PDA+SEKELGNE FKQKKF EAIDCY
Sbjct: 59   --DSSTTSSRQYDYLQNYDKFNSLSSSFVTEENMPDAASEKELGNEYFKQKKFKEAIDCY 116

Query: 716  SRSIALSPTSVAFANRAMAYLKLKRYEEAESDCTEALNLDDRYVKAYSRRATARKELGKL 895
            SRSI LSPT+VA ANRAMAYLK+K+++EAE DCTEALNLDDRY+KAYSRRATARKELGKL
Sbjct: 117  SRSIGLSPTAVAHANRAMAYLKIKKFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKL 176

Query: 896  KAALEDADFAVRLEPNNNEVRKQYSETKALYEKEITKRNSESLKSISKGSEPPDSFQTKS 1075
            K ++ED +FA+RLEPNN E++KQ++E K+LYEKEI ++ S  L+   + ++     +TK 
Sbjct: 177  KESIEDTEFALRLEPNNQEIKKQHAEFKSLYEKEILQKASGVLRKSMQEAQEVGKSETKE 236

Query: 1076 KIVKDDYLVSSKTQK--VAT--------------NEQNSGSVQ---------MIQKRSEG 1180
              +   +  S+ TQ+  VAT               +   GSV          +   R +G
Sbjct: 237  NGL-GMHSASNSTQRTGVATVQGYQTKVSEYDKQKKPEKGSVTSEGIGDRNTLAGSRKDG 295

Query: 1181 AQT-----------------KYELKEPLQDVXXXXXXXXXXXXXVNLTAPKSAYEFEVSW 1309
             Q                  K ELK  +Q++              N++ P +AY+FEVSW
Sbjct: 296  TQLDSGIVGLESIKKNNRTRKPELKASVQELASLAATRAMAEAAKNISPPNTAYQFEVSW 355

Query: 1310 RALSDDSARQMQLLKMIPPATLPHIFKNALSAPILVDIIKCVTTFFKEDSELAIGILDNM 1489
            RALS D A Q  LLK+  P+ LP IFKNALSA +LVDIIKCV TFF+E+ +LAI  L+N+
Sbjct: 356  RALSGDRALQAHLLKVTSPSALPQIFKNALSASMLVDIIKCVATFFREEVDLAIKYLENL 415

Query: 1490 TRVPRFDMLMMCISARDRSEINRMWGNISSCTTIPASHREAVAQLRTKYC 1639
            T+VPRFDML+MC+S+ +++++ ++W ++      P    E +  LR+ YC
Sbjct: 416  TKVPRFDMLIMCLSSTEKADLLKVWDDVFCNEATPIEWAEILDNLRSVYC 465


>ref|XP_004302236.1| PREDICTED: RNA polymerase II-associated protein 3-like [Fragaria
            vesca subsp. vesca]
          Length = 407

 Score =  363 bits (932), Expect = 1e-97
 Identities = 196/421 (46%), Positives = 274/421 (65%), Gaps = 2/421 (0%)
 Frame = +2

Query: 383  MSETSKKKVRDRSVEFKGFLNDLQDWDYLRDGKDINQKGRDRQNNKLIVDGNKASSSDRS 562
            M+    K  RD++++F+GFL+DLQDW+     KD  +K R +Q NK         +S  S
Sbjct: 1    MARAPSKHGRDQALDFQGFLSDLQDWELSLKDKD--KKMRPQQPNKEAPKSRDFGTSSYS 58

Query: 563  AQFDYLKYADPIGQISGINYNDQVPPDASSEKELGNECFKQKKFSEAIDCYSRSIALSPT 742
              ++      P+  +S    ++   PDA+SEK+LGNE FKQKKF EAIDCYSRSIAL+PT
Sbjct: 59   TNYE------PMNTVSSSFTSEDGLPDAASEKDLGNEYFKQKKFKEAIDCYSRSIALTPT 112

Query: 743  SVAFANRAMAYLKLKRYEEAESDCTEALNLDDRYVKAYSRRATARKELGKLKAALEDADF 922
            +VAFANRAM+Y+K+KR++EAE+DCTEALNLDDRY+KAYSRRATARKELGKLK ++EDA+F
Sbjct: 113  AVAFANRAMSYIKIKRFQEAENDCTEALNLDDRYIKAYSRRATARKELGKLKESIEDAEF 172

Query: 923  AVRLEPNNNEVRKQYSETKALYEKEITKRNSESLKSISKGSEPPDSFQTKSKIVKDDYLV 1102
            A+RLEP+N E++KQY+E K+LYEK I ++ S ++K          S Q K K+ K    V
Sbjct: 173  ALRLEPHNQEIKKQYAEAKSLYEKGILQKVSGAIKI---------SEQDKQKVEKSGTTV 223

Query: 1103 SSKT-QKVATNEQNSGSVQM-IQKRSEGAQTKYELKEPLQDVXXXXXXXXXXXXXVNLTA 1276
            +  + Q V++  Q + +  +    +      K   K  +Q++              N+T 
Sbjct: 224  NGHSIQPVSSTTQRTETTAVGDHTKKINTNGKQASKLSVQELASRAASRAKALAAENITP 283

Query: 1277 PKSAYEFEVSWRALSDDSARQMQLLKMIPPATLPHIFKNALSAPILVDIIKCVTTFFKED 1456
            P SAY+FE SWR LS D A Q +LLK I P+ LP IFKNAL+  ILVDI+KCVTTFF ++
Sbjct: 284  PSSAYQFEASWRGLSGDRALQAKLLKAISPSALPQIFKNALTVHILVDILKCVTTFFIDE 343

Query: 1457 SELAIGILDNMTRVPRFDMLMMCISARDRSEINRMWGNISSCTTIPASHREAVAQLRTKY 1636
             +LA+ +L+N+T+VPRFD L+M +S+ D++++ ++W  +      P    E +  LR KY
Sbjct: 344  MDLAVSVLENLTKVPRFDTLIMFLSSNDKADLAKIWDEVFYNEATPIEFAEKLDNLRAKY 403

Query: 1637 C 1639
            C
Sbjct: 404  C 404


>gb|EXB53029.1| RNA polymerase II-associated protein 3 [Morus notabilis]
          Length = 450

 Score =  363 bits (931), Expect = 2e-97
 Identities = 201/456 (44%), Positives = 283/456 (62%), Gaps = 37/456 (8%)
 Frame = +2

Query: 383  MSETSKKKVRDRSVEFKGFLNDLQDWDYLRDGKDINQKGRDRQNNKLIV---------DG 535
            M+    K  RD ++ F+GFLNDLQDW++  + KD ++K + + ++K I           G
Sbjct: 1    MARAPTKHGRDEALAFQGFLNDLQDWEFSLEDKDKDKKMKAQASDKGISVSSSKKIGEAG 60

Query: 536  NKASSSDRSAQFDYLKYA---------DPIGQISGINYNDQVPPDASSEKELGNECFKQK 688
                ++ +S+ F+YL  +         D I Q+S  + ++    DA+SEKELGNE FKQK
Sbjct: 61   KDRKAAGKSSTFEYLSSSMPYDYSRKYDAINQVSSSSISEDSYTDAASEKELGNEYFKQK 120

Query: 689  KFSEAIDCYSRSIALSPTSVAFANRAMAYLKLKR-----------------YEEAESDCT 817
            KF EAIDCYSRSIALS T+VA+ANRAMAYLKLKR                 ++EAE DCT
Sbjct: 121  KFKEAIDCYSRSIALSSTAVAYANRAMAYLKLKRQLLPYLIFFCKSIFLIRFQEAEGDCT 180

Query: 818  EALNLDDRYVKAYSRRATARKELGKLKAALEDADFAVRLEPNNNEVRKQYSETKALYEKE 997
            EALN+DDRY+KAYSRRATARKELGKLK  +EDA+FA+RLEPNN E++KQYSE K+L EK 
Sbjct: 181  EALNMDDRYIKAYSRRATARKELGKLKECIEDAEFALRLEPNNQEIKKQYSEAKSLCEKV 240

Query: 998  ITKRNSESLKSISKGSEPPDSFQTKSKIVKDDYL--VSSKTQKVATNEQNSGSVQMIQKR 1171
            I ++ S +L++  +  +  +   TK   V+++ +  V S TQK         +V     +
Sbjct: 241  ILQKASVALENTVQKMQKAEKKDTK---VQNNGIQPVESATQKT------EAAVAEDYTK 291

Query: 1172 SEGAQTKYELKEPLQDVXXXXXXXXXXXXXVNLTAPKSAYEFEVSWRALSDDSARQMQLL 1351
                  K E K  +Q++              N+ +P SAY+FEVSWR LS D A Q  LL
Sbjct: 292  INQTAKKQEPKASVQELASRAASRAMNGTAKNIRSPTSAYQFEVSWRGLSGDRALQASLL 351

Query: 1352 KMIPPATLPHIFKNALSAPILVDIIKCVTTFFKEDSELAIGILDNMTRVPRFDMLMMCIS 1531
            K + P  LP IFKN+L+ PILVDI+KC+ TFF E+ ++ +  L+N+T+VPRFD+L+MC++
Sbjct: 352  KTVSPGALPQIFKNSLTVPILVDIVKCIATFFIEEMDVTVTFLENLTKVPRFDILVMCLT 411

Query: 1532 ARDRSEINRMWGNISSCTTIPASHREAVAQLRTKYC 1639
            ++DR+++ ++W  +      P  H E +  LR+KYC
Sbjct: 412  SKDRADLVKIWNEVFCKEATPIEHAEKLDNLRSKYC 447


>ref|XP_006840434.1| hypothetical protein AMTR_s00045p00163960 [Amborella trichopoda]
            gi|548842152|gb|ERN02109.1| hypothetical protein
            AMTR_s00045p00163960 [Amborella trichopoda]
          Length = 466

 Score =  361 bits (926), Expect = 7e-97
 Identities = 203/464 (43%), Positives = 283/464 (60%), Gaps = 45/464 (9%)
 Frame = +2

Query: 383  MSETSKKKVRDRSVEFKGFLNDLQDWDYLRDGKDINQKGRDRQNNKLIVDGNKASSSD-- 556
            M++ S K VRD  +EF+G L+DLQ W+ +    D N KG+  +  K      K       
Sbjct: 1    MAQASNKNVRDHDLEFQGLLSDLQSWERVVKESDKNPKGQVDKIGKPDFGARKTGKGTER 60

Query: 557  --RSAQFDYLKYADPIGQISGINYN---------DQVPPDASSEKELGNECFKQKKFSEA 703
              +S +   +   D +G     NY+          +  P+A+SEKELGN+ FKQKK++ A
Sbjct: 61   KGQSMKSATVDQKDILGCSDFTNYSYFSNSRSLSSEDTPNATSEKELGNDYFKQKKYAHA 120

Query: 704  IDCYSRSIALSPTSVAFANRAMAYLKLKRYEEAESDCTEALNLDDRYVKAYSRRATARKE 883
            I+CYSRSIALSP++VA+ANRAMAYLK +RYEEAE+DCTEALNLDDRYVKAYSRRATARKE
Sbjct: 121  IECYSRSIALSPSAVAYANRAMAYLKTRRYEEAENDCTEALNLDDRYVKAYSRRATARKE 180

Query: 884  LGKLKAALEDADFAVRLEPNNNEVRKQYSETKALYEKEITKRNSESLKSISKGS------ 1045
            LGKL A++EDA+FA+RLEPNN E++KQY+E K ++ K   K+ S + KS ++ S      
Sbjct: 181  LGKLHASIEDAEFALRLEPNNQELKKQYAEIKEIFAKIAMKKISGAGKSSTQDSGNKRDS 240

Query: 1046 -----------EPPDS-FQTKSKIVKDDYLVSSKTQKVAT---------NEQNSGSVQM- 1159
                       +P  S   +  KI K D  V  + Q   T         +++  GS    
Sbjct: 241  VSEIKIDVQDAQPQRSQMDSNGKISKKDPSVMKEVQSRRTPEDLNVRLGSQETHGSTPAK 300

Query: 1160 ----IQKRSEGAQTKYELKEPLQDVXXXXXXXXXXXXXVNLTAPKSAYEFEVSWRALSDD 1327
                + K +     K++ KE + +V              N+  PKSAYEFEV+WR LS+D
Sbjct: 301  SQLDVSKDNHKEFMKHQSKESILEVASRAASRAKAAAAQNIATPKSAYEFEVAWRRLSED 360

Query: 1328 SARQMQLLKMIPPATLPHIFKNALSAPILVDIIKCVTTFFKEDSELAIGILDNMTRVPRF 1507
             A Q  LLK I P +LP +FKNALSAP+L++IIKCV  FF+E++ LA+ ILDN+TR+ RF
Sbjct: 361  RASQCLLLKTILPESLPQLFKNALSAPMLIEIIKCVAEFFREETNLAVNILDNLTRIGRF 420

Query: 1508 DMLMMCISARDRSEINRMWGNISSCTTIPASHREAVAQLRTKYC 1639
            DM++MC+S++D++++ R+W  + S   +   H E + +L +KYC
Sbjct: 421  DMIIMCLSSKDKADLQRIWDEVVSSCAVTMEHAETLERLHSKYC 464


>ref|XP_006339932.1| PREDICTED: RNA polymerase II-associated protein 3-like isoform X1
            [Solanum tuberosum]
          Length = 468

 Score =  360 bits (923), Expect = 1e-96
 Identities = 204/449 (45%), Positives = 279/449 (62%), Gaps = 52/449 (11%)
 Frame = +2

Query: 383  MSETSKKKVRDRSVEFKGFLNDLQDWDYLRDGKDINQKGRDRQNNKLIVDGNKAS----- 547
            M++   K  RD+  + +G LN+LQDW+    GKD   K +      L  D ++ S     
Sbjct: 1    MAKVPSKHSRDQFQDMQGLLNNLQDWELSLKGKDKKMKSQAHGKETLREDWSRTSELLTS 60

Query: 548  -----------SSDRSAQ--FDYLKYADPIGQISGINYNDQVPPDASSEKELGNECFKQK 688
                       +S RSA   ++Y K  +PI  +S    +++   +A+SEKELGNECFKQK
Sbjct: 61   PQVNGTRVGKSTSIRSAAGPYNYSKNYNPISHLSSELISEESNINANSEKELGNECFKQK 120

Query: 689  KFSEAIDCYSRSIALSPTSVAFANRAMAYLKLKRYEEAESDCTEALNLDDRYVKAYSRRA 868
            KF+EAIDCYSRSIALSPT+V++ANRAMAYLK+KR++EAE+DCTEALNLDDRY+KAYSRR+
Sbjct: 121  KFNEAIDCYSRSIALSPTAVSYANRAMAYLKIKRFQEAENDCTEALNLDDRYIKAYSRRS 180

Query: 869  TARKELGKLKAALEDADFAVRLEPNNNEVRKQYSETKALYEKEITKRNSESLKSISKGSE 1048
            T+RKELGKLK ++EDA+FA+RLEP N E++KQY E KALYEKEI KR S +    ++ ++
Sbjct: 181  TSRKELGKLKESIEDAEFALRLEPQNPEIKKQYGEVKALYEKEIRKRVSGATDVSAQRAQ 240

Query: 1049 PPDSFQTKSKIVKDDYLVSSKTQKVA----------------TNEQNSGSVQMIQKRSEG 1180
                      +++    VSS +QK+A                T +     +Q+  K S+ 
Sbjct: 241  KSGKTIKSGPVIQS---VSSSSQKMAEVWTIPAKENNRDVPGTAKVEDTHMQINNKDSDA 297

Query: 1181 AQT------------------KYELKEPLQDVXXXXXXXXXXXXXVNLTAPKSAYEFEVS 1306
            + T                  K EL+E +Q++              N+ AP SAY+FEVS
Sbjct: 298  SPTVPTLNPAFGTAKKTHKISKQELEESVQELAARAAGLAKTEAAKNIAAPNSAYQFEVS 357

Query: 1307 WRALSDDSARQMQLLKMIPPATLPHIFKNALSAPILVDIIKCVTTFFKEDSELAIGILDN 1486
            WR LS D   Q QLLK+  PA LP IFKNALSAP+L+DI++CV TFF ED  LAI  L++
Sbjct: 358  WRGLSGDRNLQTQLLKVTSPAMLPRIFKNALSAPMLMDIVRCVATFFIEDMNLAIRYLED 417

Query: 1487 MTRVPRFDMLMMCISARDRSEINRMWGNI 1573
            +T+VPRFDM++MC+S+ D+SE+ ++W  I
Sbjct: 418  LTKVPRFDMIIMCLSSTDKSELLKIWEEI 446


>gb|EOY11664.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative
            isoform 5 [Theobroma cacao]
          Length = 389

 Score =  360 bits (923), Expect = 1e-96
 Identities = 191/384 (49%), Positives = 258/384 (67%)
 Frame = +2

Query: 488  NQKGRDRQNNKLIVDGNKASSSDRSAQFDYLKYADPIGQISGINYNDQVPPDASSEKELG 667
            N+KGR    + LI      SS+  S Q+DYL+  D    +S     ++  PDA+SEKELG
Sbjct: 14   NEKGRPTGKSSLI-----DSSTTSSRQYDYLQNYDKFNSLSSSFVTEENMPDAASEKELG 68

Query: 668  NECFKQKKFSEAIDCYSRSIALSPTSVAFANRAMAYLKLKRYEEAESDCTEALNLDDRYV 847
            NE FKQKKF EAIDCYSRSI LSPT+VA ANRAMAYLK+K+++EAE DCTEALNLDDRY+
Sbjct: 69   NEYFKQKKFKEAIDCYSRSIGLSPTAVAHANRAMAYLKIKKFQEAEDDCTEALNLDDRYI 128

Query: 848  KAYSRRATARKELGKLKAALEDADFAVRLEPNNNEVRKQYSETKALYEKEITKRNSESLK 1027
            KAYSRRATARKELGKLK ++ED +FA+RLEPNN E++KQ++E K+LYEKEI ++ S  L+
Sbjct: 129  KAYSRRATARKELGKLKESIEDTEFALRLEPNNQEIKKQHAEFKSLYEKEILQKASGVLR 188

Query: 1028 SISKGSEPPDSFQTKSKIVKDDYLVSSKTQKVATNEQNSGSVQMIQKRSEGAQTKYELKE 1207
               + ++     +TK   +   +  S+ TQ+         +VQ  Q +      K ELK 
Sbjct: 189  KSMQEAQEVGKSETKENGL-GMHSASNSTQRTGV-----ATVQGYQTKKNNRTRKPELKA 242

Query: 1208 PLQDVXXXXXXXXXXXXXVNLTAPKSAYEFEVSWRALSDDSARQMQLLKMIPPATLPHIF 1387
             +Q++              N++ P +AY+FEVSWRALS D A Q  LLK+  P+ LP IF
Sbjct: 243  SVQELASLAATRAMAEAAKNISPPNTAYQFEVSWRALSGDRALQAHLLKVTSPSALPQIF 302

Query: 1388 KNALSAPILVDIIKCVTTFFKEDSELAIGILDNMTRVPRFDMLMMCISARDRSEINRMWG 1567
            KNALSA +LVDIIKCV TFF+E+ +LAI  L+N+T+VPRFDML+MC+S+ +++++ ++W 
Sbjct: 303  KNALSASMLVDIIKCVATFFREEVDLAIKYLENLTKVPRFDMLIMCLSSTEKADLLKVWD 362

Query: 1568 NISSCTTIPASHREAVAQLRTKYC 1639
            ++      P    E +  LR+ YC
Sbjct: 363  DVFCNEATPIEWAEILDNLRSVYC 386


>ref|XP_006339935.1| PREDICTED: RNA polymerase II-associated protein 3-like isoform X4
            [Solanum tuberosum]
          Length = 419

 Score =  356 bits (914), Expect = 2e-95
 Identities = 198/415 (47%), Positives = 265/415 (63%), Gaps = 18/415 (4%)
 Frame = +2

Query: 383  MSETSKKKVRDRSVEFKGFLNDLQDWDYLRDGKDINQKGRDRQNNKLIVDGNKAS----- 547
            M++   K  RD+  + +G LN+LQDW+    GKD   K +      L  D ++ S     
Sbjct: 1    MAKVPSKHSRDQFQDMQGLLNNLQDWELSLKGKDKKMKSQAHGKETLREDWSRTSELLTS 60

Query: 548  -----------SSDRSAQ--FDYLKYADPIGQISGINYNDQVPPDASSEKELGNECFKQK 688
                       +S RSA   ++Y K  +PI  +S    +++   +A+SEKELGNECFKQK
Sbjct: 61   PQVNGTRVGKSTSIRSAAGPYNYSKNYNPISHLSSELISEESNINANSEKELGNECFKQK 120

Query: 689  KFSEAIDCYSRSIALSPTSVAFANRAMAYLKLKRYEEAESDCTEALNLDDRYVKAYSRRA 868
            KF+EAIDCYSRSIALSPT+V++ANRAMAYLK+KR++EAE+DCTEALNLDDRY+KAYSRR+
Sbjct: 121  KFNEAIDCYSRSIALSPTAVSYANRAMAYLKIKRFQEAENDCTEALNLDDRYIKAYSRRS 180

Query: 869  TARKELGKLKAALEDADFAVRLEPNNNEVRKQYSETKALYEKEITKRNSESLKSISKGSE 1048
            T+RKELGKLK ++EDA+FA+RLEP N E++KQY E KALYEKE    N+  +   +K  +
Sbjct: 181  TSRKELGKLKESIEDAEFALRLEPQNPEIKKQYGEVKALYEKE----NNRDVPGTAKVED 236

Query: 1049 PPDSFQTKSKIVKDDYLVSSKTQKVATNEQNSGSVQMIQKRSEGAQTKYELKEPLQDVXX 1228
                   K          S  +  V T     G+ +   K S     K EL+E +Q++  
Sbjct: 237  THMQINNKD---------SDASPTVPTLNPAFGTAKKTHKIS-----KQELEESVQELAA 282

Query: 1229 XXXXXXXXXXXVNLTAPKSAYEFEVSWRALSDDSARQMQLLKMIPPATLPHIFKNALSAP 1408
                        N+ AP SAY+FEVSWR LS D   Q QLLK+  PA LP IFKNALSAP
Sbjct: 283  RAAGLAKTEAAKNIAAPNSAYQFEVSWRGLSGDRNLQTQLLKVTSPAMLPRIFKNALSAP 342

Query: 1409 ILVDIIKCVTTFFKEDSELAIGILDNMTRVPRFDMLMMCISARDRSEINRMWGNI 1573
            +L+DI++CV TFF ED  LAI  L+++T+VPRFDM++MC+S+ D+SE+ ++W  I
Sbjct: 343  MLMDIVRCVATFFIEDMNLAIRYLEDLTKVPRFDMIIMCLSSTDKSELLKIWEEI 397


>ref|XP_004248819.1| PREDICTED: RNA polymerase II-associated protein 3-like [Solanum
            lycopersicum]
          Length = 470

 Score =  356 bits (913), Expect = 2e-95
 Identities = 207/471 (43%), Positives = 276/471 (58%), Gaps = 53/471 (11%)
 Frame = +2

Query: 383  MSETSKKKVRDRSVEFKGFLNDLQDWDYLRDGKDINQKGRDRQNNKLIVDGNKASSSDRS 562
            M+       RD+  + +G  N+LQDW+    GKD   K +      L  D ++ S    S
Sbjct: 1    MARVPSNHSRDQFQDMQGLFNNLQDWELALKGKDKKMKSQAGGKETLKEDWSRTSEPLTS 60

Query: 563  AQ-------------------FDYLKYADPIGQISGINYNDQVPPDASSEKELGNECFKQ 685
             Q                   + Y K  +PI  +S    +++   +A+SEKELGNECFKQ
Sbjct: 61   PQANGTQQVGKSTSIRNAAGPYSYSKNYNPISHLSSELISEESNINANSEKELGNECFKQ 120

Query: 686  KKFSEAIDCYSRSIALSPTSVAFANRAMAYLKLKRYEEAESDCTEALNLDDRYVKAYSRR 865
            KKF+EAIDCYSRSIALSPT+V++ANRAMAYLK+KR++EAE+DCTEALNLDDRY+KAYSRR
Sbjct: 121  KKFNEAIDCYSRSIALSPTAVSYANRAMAYLKIKRFQEAENDCTEALNLDDRYIKAYSRR 180

Query: 866  ATARKELGKLKAALEDADFAVRLEPNNNEVRKQYSETKALYEKEITKRNSESLKSISKGS 1045
            +T+RKELGKLK ++EDA+FA+ LEP N E++KQY E KALYEKEI KR S +    ++G 
Sbjct: 181  STSRKELGKLKESIEDAEFALWLEPRNPEIKKQYGEVKALYEKEILKRVSGATDVSAQG- 239

Query: 1046 EPPDSFQTKSKIVKDDYLVSSKTQKVA----------------TNEQNSGSVQMIQKRSE 1177
              P       KI      VSS +QKVA                T +     +Q+  K S+
Sbjct: 240  --PQKSGKTIKIGPVIQSVSSSSQKVAEVRTIPAKENNRDVLGTAKVEDTHMQISNKDSD 297

Query: 1178 GAQT------------------KYELKEPLQDVXXXXXXXXXXXXXVNLTAPKSAYEFEV 1303
             + T                  K EL+E +Q++              N+ AP SAY+FEV
Sbjct: 298  ASPTVPTLNLAFGTAKKTHKISKQELEESVQELAARAAGLAKTEAAKNIAAPNSAYQFEV 357

Query: 1304 SWRALSDDSARQMQLLKMIPPATLPHIFKNALSAPILVDIIKCVTTFFKEDSELAIGILD 1483
            SWR LS D   Q QLLK+  PA LP IFKNALSAP+L+DI++C+ TFF ED  LAI  L+
Sbjct: 358  SWRGLSGDRNLQTQLLKVTSPAMLPRIFKNALSAPMLMDIVRCIATFFIEDMNLAIRYLE 417

Query: 1484 NMTRVPRFDMLMMCISARDRSEINRMWGNISSCTTIPASHREAVAQLRTKY 1636
            ++T+VPRFDM++MC+S+ D+SE+ ++W  I     +   H   +  LR  Y
Sbjct: 418  DLTKVPRFDMIIMCLSSADKSELLKIWEEI--FCKVAEEHSATLGALRVSY 466


>ref|XP_006339933.1| PREDICTED: RNA polymerase II-associated protein 3-like isoform X2
            [Solanum tuberosum]
          Length = 467

 Score =  353 bits (906), Expect = 1e-94
 Identities = 203/449 (45%), Positives = 278/449 (61%), Gaps = 52/449 (11%)
 Frame = +2

Query: 383  MSETSKKKVRDRSVEFKGFLNDLQDWDYLRDGKDINQKGRDRQNNKLIVDGNKAS----- 547
            M++   K  RD+  + +G LN+LQDW+    GKD   K +      L  D ++ S     
Sbjct: 1    MAKVPSKHSRDQFQDMQGLLNNLQDWELSLKGKDKKMKSQAHGKETLREDWSRTSELLTS 60

Query: 548  -----------SSDRSAQ--FDYLKYADPIGQISGINYNDQVPPDASSEKELGNECFKQK 688
                       +S RSA   ++Y K  +PI  +S    +++   +A+SEKELGNECFKQK
Sbjct: 61   PQVNGTRVGKSTSIRSAAGPYNYSKNYNPISHLSSELISEESNINANSEKELGNECFKQK 120

Query: 689  KFSEAIDCYSRSIALSPTSVAFANRAMAYLKLKRYEEAESDCTEALNLDDRYVKAYSRRA 868
            KF+EAIDCYSRSIALSPT+V++ANRAMAYLK+KR++EAE+DCTEALNLDDRY+KAYSRR+
Sbjct: 121  KFNEAIDCYSRSIALSPTAVSYANRAMAYLKIKRFQEAENDCTEALNLDDRYIKAYSRRS 180

Query: 869  TARKELGKLKAALEDADFAVRLEPNNNEVRKQYSETKALYEKEITKRNSESLKSISKGSE 1048
            T+RKELGKLK ++EDA+FA+RLEP N E++KQY E KALYEK I KR S +    ++ ++
Sbjct: 181  TSRKELGKLKESIEDAEFALRLEPQNPEIKKQYGEVKALYEK-IRKRVSGATDVSAQRAQ 239

Query: 1049 PPDSFQTKSKIVKDDYLVSSKTQKVA----------------TNEQNSGSVQMIQKRSEG 1180
                      +++    VSS +QK+A                T +     +Q+  K S+ 
Sbjct: 240  KSGKTIKSGPVIQS---VSSSSQKMAEVWTIPAKENNRDVPGTAKVEDTHMQINNKDSDA 296

Query: 1181 AQT------------------KYELKEPLQDVXXXXXXXXXXXXXVNLTAPKSAYEFEVS 1306
            + T                  K EL+E +Q++              N+ AP SAY+FEVS
Sbjct: 297  SPTVPTLNPAFGTAKKTHKISKQELEESVQELAARAAGLAKTEAAKNIAAPNSAYQFEVS 356

Query: 1307 WRALSDDSARQMQLLKMIPPATLPHIFKNALSAPILVDIIKCVTTFFKEDSELAIGILDN 1486
            WR LS D   Q QLLK+  PA LP IFKNALSAP+L+DI++CV TFF ED  LAI  L++
Sbjct: 357  WRGLSGDRNLQTQLLKVTSPAMLPRIFKNALSAPMLMDIVRCVATFFIEDMNLAIRYLED 416

Query: 1487 MTRVPRFDMLMMCISARDRSEINRMWGNI 1573
            +T+VPRFDM++MC+S+ D+SE+ ++W  I
Sbjct: 417  LTKVPRFDMIIMCLSSTDKSELLKIWEEI 445


>ref|XP_006654957.1| PREDICTED: RNA polymerase II-associated protein 3-like [Oryza
            brachyantha]
          Length = 466

 Score =  353 bits (905), Expect = 2e-94
 Identities = 190/392 (48%), Positives = 259/392 (66%), Gaps = 14/392 (3%)
 Frame = +2

Query: 542  ASSSDRSAQFDYLKYADPIGQISGINYNDQVPPDASSEKELGNECFKQKKFSEAIDCYSR 721
            AS  +    ++Y  Y+  +        ND+  PDA+SEKE GNE FKQKKF++AI+CYSR
Sbjct: 83   ASRGNLGDMYNYKSYSSYL--------NDEPMPDAASEKEQGNEYFKQKKFTQAIECYSR 134

Query: 722  SIALSPTSVAFANRAMAYLKLKRYEEAESDCTEALNLDDRYVKAYSRRATARKELGKLKA 901
            SI LSPT+VAFANRAMAYLKL+R+EEAE+DCTEALNLDDRYVKAYSRR TARKELGKLK 
Sbjct: 135  SIGLSPTAVAFANRAMAYLKLRRFEEAENDCTEALNLDDRYVKAYSRRITARKELGKLKE 194

Query: 902  ALEDADFAVRLEPNNNEVRKQYSETKALYEKEITKRNSESLKSISKGSEPPDSFQT---K 1072
            A++DA+FAV ++PNN E+RKQYSE KAL+ +++ KR + + +++S+  E  D   T    
Sbjct: 195  AMDDAEFAVSIDPNNPELRKQYSELKALHLEKVAKRTTPTKRTVSEFGESGDKKGTSDLS 254

Query: 1073 SKIVKDDYLVSSKTQKVA---------TNEQNSGSV--QMIQKRSEGAQTKYELKEPLQD 1219
            S   KD ++      +V          T++  SG V      + S  A+ K   +  +QD
Sbjct: 255  STSQKDSFMEVDPPSRVPVEITEKADDTSKGGSGVVFKDSTMQPSRDAKQKPGPEASIQD 314

Query: 1220 VXXXXXXXXXXXXXVNLTAPKSAYEFEVSWRALSDDSARQMQLLKMIPPATLPHIFKNAL 1399
            +              ++  PK+AY+FEVSWRALSDD+A+Q+QLLK IPPA+LP IFKNAL
Sbjct: 315  LASRAASRYMASTVKSVKTPKTAYDFEVSWRALSDDTAQQIQLLKSIPPASLPEIFKNAL 374

Query: 1400 SAPILVDIIKCVTTFFKEDSELAIGILDNMTRVPRFDMLMMCISARDRSEINRMWGNISS 1579
            SA  L+DI+KC  + F+ED+ LA+ IL+N+ +VPRFD+++MC+S+  +SE+ ++W  I  
Sbjct: 375  SAAFLIDIVKCTASIFREDTMLAVSILENLAKVPRFDLIIMCLSSMHKSELRKVWDQIFL 434

Query: 1580 CTTIPASHREAVAQLRTKYCNGEDHMYVSNGW 1675
              T PA   EA+ +LR K        Y+  GW
Sbjct: 435  AETAPADQVEALGKLRAK--------YIQEGW 458


>ref|NP_001054548.1| Os05g0129900 [Oryza sativa Japonica Group]
            gi|113578099|dbj|BAF16462.1| Os05g0129900 [Oryza sativa
            Japonica Group] gi|215734871|dbj|BAG95593.1| unnamed
            protein product [Oryza sativa Japonica Group]
            gi|215765748|dbj|BAG87445.1| unnamed protein product
            [Oryza sativa Japonica Group]
          Length = 397

 Score =  350 bits (899), Expect = 9e-94
 Identities = 184/365 (50%), Positives = 250/365 (68%), Gaps = 16/365 (4%)
 Frame = +2

Query: 623  NDQVPPDASSEKELGNECFKQKKFSEAIDCYSRSIALSPTSVAFANRAMAYLKLKRYEEA 802
            ND+  PDA+SEKE GNE FKQKKF++AI+CYSRSI LSP++VAFANRAMAYLKL+R+EEA
Sbjct: 33   NDEPMPDAASEKEQGNEYFKQKKFAQAIECYSRSIGLSPSAVAFANRAMAYLKLRRFEEA 92

Query: 803  ESDCTEALNLDDRYVKAYSRRATARKELGKLKAALEDADFAVRLEPNNNEVRKQYSETKA 982
            E+DCTEALNLDDRYVKAYSRR TARKELGKLK A++DA+FAV ++PNN E+RKQYSE K 
Sbjct: 93   ENDCTEALNLDDRYVKAYSRRITARKELGKLKEAMDDAEFAVSIDPNNPELRKQYSEIKE 152

Query: 983  LYEKEITKRNSESLKSI---SKGSEPPDSFQTKSKIVKDDYLVSSKTQKVA--------- 1126
            L+ KE+  R+  +  ++    K  +  D+    S   KD ++      +VA         
Sbjct: 153  LHMKEVANRSKPTKHTVFKFDKSGDKKDTSHAPSSSQKDSFMEVDPPSRVAVEIREKADG 212

Query: 1127 TNEQNSGSV--QMIQKRSEGAQTKYELKEPLQDVXXXXXXXXXXXXXVNLTAPKSAYEFE 1300
            T++  SG +      + S  A+ K   +  +QD+              ++  PK+AY+FE
Sbjct: 213  TSKGGSGVIFKDSTVQPSRDAKQKPGPEASIQDLASRAASRYMASTVKSVKTPKTAYDFE 272

Query: 1301 VSWRALSDDSARQMQLLKMIPPATLPHIFKNALSAPILVDIIKCVTTFFKEDSELAIGIL 1480
            VSWRALS+D+A+Q QLLK IPP++LP IFKNALSA  L+DI+KC T+ F+ED+ LA+ IL
Sbjct: 273  VSWRALSNDTAKQTQLLKSIPPSSLPEIFKNALSAAFLIDIVKCTTSIFREDTMLAVSIL 332

Query: 1481 DNMTRVPRFDMLMMCISARDRSEINRMWGNISSCTTIPASHREAVAQLRTKYCNG--EDH 1654
            +N+ +VPRFD+++MC+S+  +SE+ ++W  I    T  A   EA+ QLR KY     +D+
Sbjct: 333  ENLAKVPRFDLIIMCLSSMHKSELRKVWDQIFLAETASADQVEALRQLRAKYIQEGLQDN 392

Query: 1655 MYVSN 1669
            M+ SN
Sbjct: 393  MFTSN 397


>gb|EMJ06489.1| hypothetical protein PRUPE_ppa006661mg [Prunus persica]
          Length = 401

 Score =  350 bits (898), Expect = 1e-93
 Identities = 187/397 (47%), Positives = 255/397 (64%), Gaps = 2/397 (0%)
 Frame = +2

Query: 455  DWDYLRDGKDINQKGRDRQNNKLIVDGNKASSSDRSAQFDYLKYADPIGQISGINYNDQV 634
            DW+     KD   + +D    KL       SS +    +DY +  D I  +S    ++  
Sbjct: 15   DWELSLKDKDKKMRPKDSHQEKLKTRDLGTSSGN----YDYSRNLDSINTMSSSFISEDS 70

Query: 635  PPDASSEKELGNECFKQKKFSEAIDCYSRSIALSPTSVAFANRAMAYLKLKRYEEAESDC 814
             PDA+SEKELGNE FKQKKF EAIDCYSRSIALSP++VA+ANRAMAY+K+K ++EAE DC
Sbjct: 71   LPDAASEKELGNEYFKQKKFREAIDCYSRSIALSPSAVAYANRAMAYIKIKSFQEAEDDC 130

Query: 815  TEALNLDDRYVKAYSRRATARKELGKLKAALEDADFAVRLEPNNNEVRKQYSETKALYEK 994
            TEALNLDDRY+KAYSRRATARKELGKLK ++EDA+FA+RLEP N E++KQY+E K+LY+K
Sbjct: 131  TEALNLDDRYIKAYSRRATARKELGKLKESIEDAEFALRLEPQNQEIKKQYTEAKSLYDK 190

Query: 995  EITKRNSESLKSISKGSEPPDSFQTKSKIVKDDYLVSSKTQKVATNEQNSGSVQMIQKRS 1174
             I ++ S + K         +S Q   K+ K D  V+ ++ + A++      +  +Q  +
Sbjct: 191  TILQKASGAQK---------NSVQEMRKVGKLDTKVNGQSIQPASSSAQITEMTAVQDHT 241

Query: 1175 EGAQT--KYELKEPLQDVXXXXXXXXXXXXXVNLTAPKSAYEFEVSWRALSDDSARQMQL 1348
            +   T    E+K  +Q++               +  P SAY+FEVSWR  S D+ARQ  L
Sbjct: 242  KRNNTTRNPEVKASVQELASRAASRVKAVAAEKIKPPNSAYQFEVSWRGFSGDNARQTSL 301

Query: 1349 LKMIPPATLPHIFKNALSAPILVDIIKCVTTFFKEDSELAIGILDNMTRVPRFDMLMMCI 1528
            LK I P+ LP IFKNAL+ PIL+DIIKCV TFF E+ +LA+  L+N+TRVPRFD L+M +
Sbjct: 302  LKAISPSALPQIFKNALTVPILLDIIKCVATFFVEEMDLAVNYLENLTRVPRFDTLIMFL 361

Query: 1529 SARDRSEINRMWGNISSCTTIPASHREAVAQLRTKYC 1639
            S+ D +++ ++W  +      P  + E +  LRTKYC
Sbjct: 362  SSSDNADLVKIWDEVFDNEATPIEYAEKLDNLRTKYC 398


>ref|XP_006588434.1| PREDICTED: uncharacterized protein LOC100784528 isoform X1 [Glycine
            max]
          Length = 459

 Score =  347 bits (890), Expect = 1e-92
 Identities = 200/462 (43%), Positives = 283/462 (61%), Gaps = 53/462 (11%)
 Frame = +2

Query: 422  VEFKGFLNDLQDWDYLRDGKDINQKGRDRQNNKLI--VDGNKASSSDRSA---------Q 568
            ++F+GFLNDLQDW+  R  K   QK  +  +++L   V   KAS  D  +         Q
Sbjct: 1    MDFQGFLNDLQDWELSRKDKTRAQK-ENASSSQLTGSVGVEKASKGDTISFDRARNSPGQ 59

Query: 569  FDYLKYADPIGQISGINYNDQVPPDASSEKELGNECFKQKKFSEAIDCYSRSIALSPTSV 748
            +D  +  DP  ++      + VP DA SEK+LGNE FKQKKF EA DCYSRSIALSPT+V
Sbjct: 60   YDLSRINDPFNRVHSSFVPEDVP-DAVSEKDLGNEFFKQKKFKEARDCYSRSIALSPTAV 118

Query: 749  AFANRAMAYLKLKRYEEAESDCTEALNLDDRYVKAYSRRATARKELGKLKAALEDADFAV 928
            A+ANRAMA +KL+R++EAE DCTEALNLDDRY+KAYSRRATARKELGK+K +++DA FA+
Sbjct: 119  AYANRAMANIKLRRFQEAEDDCTEALNLDDRYIKAYSRRATARKELGKIKESMDDAAFAL 178

Query: 929  RLEPNNNEVRKQYSETKALYEKEITKRNSESLKS--------------ISKGSEPPDSFQ 1066
            RLEPNN E++KQY++ K+LYEK+I ++ S +L+S              I+ GS  P S  
Sbjct: 179  RLEPNNQEIKKQYADAKSLYEKDILQKASGALRSTVQGTQKSQKSEEKINGGSIQPISHS 238

Query: 1067 TKSK----------------IVKDDYL---VSSKTQKVATNEQNSG---------SVQMI 1162
            T+                  +VK+  L   V S+  K  +  Q+ G         +   +
Sbjct: 239  TQKSGLAEVNHHKKDNEQQILVKESLLTEDVDSRETKARSRPQSQGGDGSKEGLSASNSL 298

Query: 1163 QKRSEGAQTKYELKEPLQDVXXXXXXXXXXXXXVNLTAPKSAYEFEVSWRALSDDSARQM 1342
            ++R+    TK E+K  +Q +              N+T P +AY+FEVSWRA S D A Q 
Sbjct: 299  EQRNHSI-TKLEMKASVQQLASRAASRVVAEAAKNVTPPTTAYQFEVSWRAFSGDLALQA 357

Query: 1343 QLLKMIPPATLPHIFKNALSAPILVDIIKCVTTFFKEDSELAIGILDNMTRVPRFDMLMM 1522
            +LLK I P  LP IFKNALS+ IL++IIKC+ +FF ED +L +  L+++T+VPRFD+++M
Sbjct: 358  RLLKAISPHELPKIFKNALSSAILIEIIKCLASFFTEDMDLVVSYLEHLTKVPRFDVIVM 417

Query: 1523 CISARDRSEINRMWGNISSCTTIPASHREAVAQLRTKYCNGE 1648
            C+S+ ++ +I ++W  + S    P  + E +  LR+K+  G+
Sbjct: 418  CLSSTNKDDIRKIWDEVFSSEATPIEYAEILDNLRSKFGLGQ 459


>gb|ESW16998.1| hypothetical protein PHAVU_007G201600g [Phaseolus vulgaris]
          Length = 465

 Score =  345 bits (884), Expect = 5e-92
 Identities = 198/469 (42%), Positives = 279/469 (59%), Gaps = 60/469 (12%)
 Frame = +2

Query: 422  VEFKGFLNDLQDWDYLRDGKDINQKGRDRQNNKLI----------VDGNKASSSD----- 556
            ++F+GFLNDLQDW+  R  KD  Q  + ++ N+            V   KAS +D     
Sbjct: 1    MDFQGFLNDLQDWELSR--KDKTQTLKSQKENQFTKASSSRLTGSVGVEKASKADAISFD 58

Query: 557  --RSAQ--FDYLKYADPIGQISGINYNDQVPPDASSEKELGNECFKQKKFSEAIDCYSRS 724
              R++Q  +D  K  DP+ ++ G    + VP DA+SEK+LGNE FKQKKF EA DCYSRS
Sbjct: 59   RARNSQGLYDLSKINDPLNRLHGSFVPEDVP-DAASEKDLGNEFFKQKKFKEARDCYSRS 117

Query: 725  IALSPTSVAFANRAMAYLKLKRYEEAESDCTEALNLDDRYVKAYSRRATARKELGKLKAA 904
            IALSPT+VA+ANRAMA +KL+R++EAE DCTEAL+LDDRY+KAYSRRATARKELGK+K +
Sbjct: 118  IALSPTAVAYANRAMANIKLRRFQEAEDDCTEALDLDDRYIKAYSRRATARKELGKIKES 177

Query: 905  LEDADFAVRLEPNNNEVRKQYSETKALYEKEIT----------------------KRNSE 1018
            +EDA+FA+RLEPNN E++KQY++ K+LYEK+I                       K N  
Sbjct: 178  MEDAEFALRLEPNNQEIKKQYADAKSLYEKDILHKASGALRRTVQGTNKVGKSDEKVNGG 237

Query: 1019 SLKSISKGSEPPDSFQTKSKIVKDDYLVSSKTQKVATNEQNSGSVQMIQKRSEGAQT--- 1189
            S+  IS G++     +   K V +   V  K   V     +  ++   + +++G      
Sbjct: 238  SIHPISHGAQKSGPAEVNHKKVNEQQ-VPIKESLVTEEVDSRDTITRKRPQAQGGDDSKK 296

Query: 1190 ----------------KYELKEPLQDVXXXXXXXXXXXXXVNLTAPKSAYEFEVSWRALS 1321
                            K E K  +Q +              N+T P +AYEFEVSWRALS
Sbjct: 297  SLSASNSLEQRNHRIIKPEFKASVQQLASRAASRAMAEAAKNITPPTTAYEFEVSWRALS 356

Query: 1322 DDSARQMQLLKMIPPATLPHIFKNALSAPILVDIIKCVTTFFKEDSELAIGILDNMTRVP 1501
             D A Q +LLK I P  LP IFKNALS+ ILVDIIKC+++FF ED +L +  ++++ +VP
Sbjct: 357  GDLALQARLLKAISPRELPKIFKNALSSTILVDIIKCLSSFFTEDMDLVVSYMEHLIKVP 416

Query: 1502 RFDMLMMCISARDRSEINRMWGNISSCTTIPASHREAVAQLRTKYCNGE 1648
            RFDM+++C+S+ ++ +I ++W  +      P  + E +  LR+K+C G+
Sbjct: 417  RFDMIVLCLSSTNKDDIRKIWDEVFRSKATPIEYAEILDNLRSKFCLGQ 465


>ref|NP_001242466.1| uncharacterized protein LOC100784528 [Glycine max]
            gi|255641877|gb|ACU21207.1| unknown [Glycine max]
          Length = 454

 Score =  344 bits (882), Expect = 8e-92
 Identities = 198/460 (43%), Positives = 279/460 (60%), Gaps = 51/460 (11%)
 Frame = +2

Query: 422  VEFKGFLNDLQDWDYLRDGKDINQKGRDRQNNKLIVDGNKASSSDRSA---------QFD 574
            ++F+GFLNDLQDW+  R  K   QK    +N    V   KAS  D  +         Q+D
Sbjct: 1    MDFQGFLNDLQDWELSRKDKTRAQK----ENLTGSVGVEKASKGDTISFDRARNSPGQYD 56

Query: 575  YLKYADPIGQISGINYNDQVPPDASSEKELGNECFKQKKFSEAIDCYSRSIALSPTSVAF 754
              +  DP  ++      + VP DA SEK+LGNE FKQKKF EA DCYSRSIALSPT+VA+
Sbjct: 57   LSRINDPFNRVHSSFVPEDVP-DAVSEKDLGNEFFKQKKFKEARDCYSRSIALSPTAVAY 115

Query: 755  ANRAMAYLKLKRYEEAESDCTEALNLDDRYVKAYSRRATARKELGKLKAALEDADFAVRL 934
            ANRAMA +KL+R++EAE DCTEALNLDDRY+KAYSR ATARKELGK+K +++DA FA+RL
Sbjct: 116  ANRAMANIKLRRFQEAEDDCTEALNLDDRYIKAYSRGATARKELGKIKESMDDAAFALRL 175

Query: 935  EPNNNEVRKQYSETKALYEKEITKRNSESLKSISKGSEPP---------DSFQTKSK--- 1078
            EPNN E++KQY++ K+LYEK+I ++ S +L+S  +G++           DS Q  S    
Sbjct: 176  EPNNQEIKKQYADAKSLYEKDILQKASGALRSTVQGTQKSQKSEEKINGDSIQPISHSTQ 235

Query: 1079 ------------------IVKDDYL---VSSKTQKVATNEQNSG---------SVQMIQK 1168
                              +VK+  L   V S+  K  +  Q+ G         +   +++
Sbjct: 236  KSGLAEVNHHKKDNEQQILVKESLLTEDVDSRETKARSRPQSQGGDGSKEGLSASNSLEQ 295

Query: 1169 RSEGAQTKYELKEPLQDVXXXXXXXXXXXXXVNLTAPKSAYEFEVSWRALSDDSARQMQL 1348
            R+    TK E+K  +Q +              N+T P +AY+FEVSWRA S D A Q +L
Sbjct: 296  RNHSI-TKLEMKASVQQLASRAASRVVAEAAKNVTPPTTAYQFEVSWRAFSGDLALQARL 354

Query: 1349 LKMIPPATLPHIFKNALSAPILVDIIKCVTTFFKEDSELAIGILDNMTRVPRFDMLMMCI 1528
            LK I P  LP IFKNALS+ IL++IIKC+ +FF ED +L +  L+++T+VPRFD+++MC+
Sbjct: 355  LKAISPHELPKIFKNALSSAILIEIIKCLASFFTEDMDLVVSYLEHLTKVPRFDVIVMCL 414

Query: 1529 SARDRSEINRMWGNISSCTTIPASHREAVAQLRTKYCNGE 1648
            S+ ++ +I ++W  + S    P  + E +  LR+K+  G+
Sbjct: 415  SSTNKDDIRKIWDEVFSSEATPIEYAEILDNLRSKFGLGQ 454

