BLASTX nr result

ID: Cephaelis21_contig00005775

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00005775
         (2019 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI39598.3| unnamed protein product [Vitis vinifera]              454   e-125
ref|XP_002277910.2| PREDICTED: RNA polymerase II-associated prot...   453   e-125
ref|XP_002330255.1| predicted protein [Populus trichocarpa] gi|2...   414   e-113
ref|XP_004144746.1| PREDICTED: RNA polymerase II-associated prot...   394   e-107
ref|NP_176039.2| tetratricopeptide repeat domain-containing prot...   385   e-104

>emb|CBI39598.3| unnamed protein product [Vitis vinifera]
          Length = 1097

 Score =  454 bits (1169), Expect = e-125
 Identities = 261/488 (53%), Positives = 319/488 (65%), Gaps = 4/488 (0%)
 Frame = +1

Query: 103  SSSSRVPAKHSRDQAYQDFEGLLNNLQDWELSFXXXXXXXXXXXQGKEKPDLPAQRYNIG 282
            S ++R P+KH+RDQA  DF+G L +LQDWELS            Q +EK D+P  R N+ 
Sbjct: 623  SMATRFPSKHARDQAL-DFQGFLTDLQDWELSLKEKDKKMKA--QAEEK-DVPTARGNVK 678

Query: 283  NVSQLSNNSRVGETAADQRSPYAHNVVSSSQHDYLREYDKLSKLSSGFMAEESSVDANSE 462
            + S+LS++  V       RS        + QH+Y R +D +S++SS FM EES  DA SE
Sbjct: 679  HSSKLSSSPGVSLRLGQSRS-------DTRQHEYSRNHDAISRISSSFMTEESLPDAASE 731

Query: 463  KELGNEYFKKKNFNEAIDCYSRSIVLSPTAVAYANRAMAYIKIKRFQEAESDCTEALNLD 642
            KELGNEYFK++ F EAIDCYSRSI L PTAVAYANRAMAYIKIKRF+EAE DC EALNLD
Sbjct: 732  KELGNEYFKQRKFKEAIDCYSRSIALLPTAVAYANRAMAYIKIKRFREAEDDCMEALNLD 791

Query: 643  DRYTKAYSRRSTARKELGNFKESLEDADFALRLEPQNQEIKKQYXXXXXXXXXXXXXXXX 822
            DRY KAYSRR+TARKELG FKE+ EDA+FALRLEPQNQEIKKQY                
Sbjct: 792  DRYIKAYSRRATARKELGKFKEATEDAEFALRLEPQNQEIKKQYAEAKSLYEKEILQKAS 851

Query: 823  XXXXXXTRGVKKAVKPKVEVDKSVESVQSVTSNSGSRMVAEIQDDRSKGNNGSVPSKTSI 1002
                   +G++K  K  VEV+   + V+S++S+S     A IQD         VP+ TS 
Sbjct: 852  GALKSSVQGLQKVGKSVVEVNADTQGVRSISSSSQGAGEAAIQD------RFMVPANTST 905

Query: 1003 GHDGT-SPWTKDQNGE-ASREIATQSFGLDTTKRNTTNGKQELKPSVQELXXXXXXXXXX 1176
              + T +  T +++ E    E A Q+ GL+    N   G++E+K S+QEL          
Sbjct: 906  SMEETENKGTGNRSKENGYLENAVQNSGLEDVMSNHKTGQREMKSSLQELASRAASRAMV 965

Query: 1177 XXXKNISPPNSAYQFEVSWRGLSGDRNLQARLLKVTSPTSLPHIFKNALSAPLLVDIVRC 1356
               KNI+ PNSAYQFEVSWRGL GD  LQA  LK  SP +LP IFKNALSAP+L+DI++C
Sbjct: 966  EAAKNITAPNSAYQFEVSWRGLLGDHALQASYLKAISPNALPQIFKNALSAPILIDIIKC 1025

Query: 1357 IGTFFVEEMDLAVKYLQNLTKIPRFDMIIMCLSSTDKADLARLWDQTF-GKATPE-YDEI 1530
            I TFFV EMDLAVK+L NLTKI RFDMIIMCLSSTDK DL ++WD+ F  KATP  Y + 
Sbjct: 1026 IATFFVTEMDLAVKFLDNLTKISRFDMIIMCLSSTDKTDLLKIWDEVFCNKATPSGYADT 1085

Query: 1531 LGHLRSKY 1554
            LG LR +Y
Sbjct: 1086 LGKLRPRY 1093


>ref|XP_002277910.2| PREDICTED: RNA polymerase II-associated protein 3-like [Vitis
            vinifera]
          Length = 474

 Score =  453 bits (1166), Expect = e-125
 Identities = 260/486 (53%), Positives = 318/486 (65%), Gaps = 4/486 (0%)
 Frame = +1

Query: 109  SSRVPAKHSRDQAYQDFEGLLNNLQDWELSFXXXXXXXXXXXQGKEKPDLPAQRYNIGNV 288
            ++R P+KH+RDQA  DF+G L +LQDWELS            Q +EK D+P  R N+ + 
Sbjct: 2    ATRFPSKHARDQAL-DFQGFLTDLQDWELSLKEKDKKMKA--QAEEK-DVPTARGNVKHS 57

Query: 289  SQLSNNSRVGETAADQRSPYAHNVVSSSQHDYLREYDKLSKLSSGFMAEESSVDANSEKE 468
            S+LS++  V       RS        + QH+Y R +D +S++SS FM EES  DA SEKE
Sbjct: 58   SKLSSSPGVSLRLGQSRS-------DTRQHEYSRNHDAISRISSSFMTEESLPDAASEKE 110

Query: 469  LGNEYFKKKNFNEAIDCYSRSIVLSPTAVAYANRAMAYIKIKRFQEAESDCTEALNLDDR 648
            LGNEYFK++ F EAIDCYSRSI L PTAVAYANRAMAYIKIKRF+EAE DC EALNLDDR
Sbjct: 111  LGNEYFKQRKFKEAIDCYSRSIALLPTAVAYANRAMAYIKIKRFREAEDDCMEALNLDDR 170

Query: 649  YTKAYSRRSTARKELGNFKESLEDADFALRLEPQNQEIKKQYXXXXXXXXXXXXXXXXXX 828
            Y KAYSRR+TARKELG FKE+ EDA+FALRLEPQNQEIKKQY                  
Sbjct: 171  YIKAYSRRATARKELGKFKEATEDAEFALRLEPQNQEIKKQYAEAKSLYEKEILQKASGA 230

Query: 829  XXXXTRGVKKAVKPKVEVDKSVESVQSVTSNSGSRMVAEIQDDRSKGNNGSVPSKTSIGH 1008
                 +G++K  K  VEV+   + V+S++S+S     A IQD         VP+ TS   
Sbjct: 231  LKSSVQGLQKVGKSVVEVNADTQGVRSISSSSQGAGEAAIQD------RFMVPANTSTSM 284

Query: 1009 DGT-SPWTKDQNGE-ASREIATQSFGLDTTKRNTTNGKQELKPSVQELXXXXXXXXXXXX 1182
            + T +  T +++ E    E A Q+ GL+    N   G++E+K S+QEL            
Sbjct: 285  EETENKGTGNRSKENGYLENAVQNSGLEDVMSNHKTGQREMKSSLQELASRAASRAMVEA 344

Query: 1183 XKNISPPNSAYQFEVSWRGLSGDRNLQARLLKVTSPTSLPHIFKNALSAPLLVDIVRCIG 1362
             KNI+ PNSAYQFEVSWRGL GD  LQA  LK  SP +LP IFKNALSAP+L+DI++CI 
Sbjct: 345  AKNITAPNSAYQFEVSWRGLLGDHALQASYLKAISPNALPQIFKNALSAPILIDIIKCIA 404

Query: 1363 TFFVEEMDLAVKYLQNLTKIPRFDMIIMCLSSTDKADLARLWDQTF-GKATPE-YDEILG 1536
            TFFV EMDLAVK+L NLTKI RFDMIIMCLSSTDK DL ++WD+ F  KATP  Y + LG
Sbjct: 405  TFFVTEMDLAVKFLDNLTKISRFDMIIMCLSSTDKTDLLKIWDEVFCNKATPSGYADTLG 464

Query: 1537 HLRSKY 1554
             LR +Y
Sbjct: 465  KLRPRY 470


>ref|XP_002330255.1| predicted protein [Populus trichocarpa] gi|222871711|gb|EEF08842.1|
            predicted protein [Populus trichocarpa]
          Length = 434

 Score =  414 bits (1065), Expect = e-113
 Identities = 243/488 (49%), Positives = 300/488 (61%), Gaps = 7/488 (1%)
 Frame = +1

Query: 112  SRVPAKHSRDQAYQDFEGLLNNLQDWELSFXXXXXXXXXXXQGKEKPDLPAQRYNIGNVS 291
            +RVP KH RDQA  DF+G LN+LQDWEL               K K    A    IG   
Sbjct: 2    ARVPGKHGRDQAL-DFQGFLNDLQDWEL---------LKDTDKKMKKKSRASDVKIGE-- 49

Query: 292  QLSNNSRVGETAADQRSPYAHNVVSSSQHDYLREYDKLSKLSSGFMAEESSVDANSEKEL 471
                 S+   +AAD           S Q++Y R +  +++LSS F  +E +VDA +EKEL
Sbjct: 50   --DGRSKGKTSAADSSRS------GSGQYEYSRNFGAINRLSSSFTTDEITVDATTEKEL 101

Query: 472  GNEYFKKKNFNEAIDCYSRSIVLSPTAVAYANRAMAYIKIKR----FQEAESDCTEALNL 639
            GNEYFK+K FNEAI+CYSRSI LSPTAVAYANRAMAY+KIKR    F+EAE DCTEALNL
Sbjct: 102  GNEYFKQKKFNEAIECYSRSIALSPTAVAYANRAMAYLKIKRQFFLFREAEDDCTEALNL 161

Query: 640  DDRYTKAYSRRSTARKELGNFKESLEDADFALRLEPQNQEIKKQYXXXXXXXXXXXXXXX 819
            DDRY KAYSRR+TARKELG  KES+ED++FAL+LEP NQEIKKQY               
Sbjct: 162  DDRYIKAYSRRATARKELGKLKESIEDSEFALKLEPNNQEIKKQYAEV------------ 209

Query: 820  XXXXXXXTRGVKKAVKPKVEVDKSVESVQSVTSNSGSRMVAEIQDDRSKGN-NGSVPSKT 996
                        K++  K      +E +Q  +    S +    Q  RS+ + NG      
Sbjct: 210  ------------KSLYEKASDYLMLEILQKASGTLRSSLQGTQQGGRSEASVNGHAVHPV 257

Query: 997  SIGHDGTSPWTKDQNGEASREIATQSFGLDTTKRNTTNGKQELKPSVQELXXXXXXXXXX 1176
            SI               A+++    +   D TK+N    +QELK SV EL          
Sbjct: 258  SI---------------ATQKTGVSASKKDNTKKNNRTRRQELKTSVIELASQAASRAMA 302

Query: 1177 XXXKNISPPNSAYQFEVSWRGLSGDRNLQARLLKVTSPTSLPHIFKNALSAPLLVDIVRC 1356
               KNI+PPNSAYQFEVSW+G SGDR LQA LLKVTSP++LP IFKNALS P+L+DI++C
Sbjct: 303  EAAKNITPPNSAYQFEVSWQGFSGDRALQAHLLKVTSPSALPQIFKNALSVPILIDIIKC 362

Query: 1357 IGTFFVEEMDLAVKYLQNLTKIPRFDMIIMCLSSTDKADLARLWDQTFGKA-TP-EYDEI 1530
            + +FF+++MD AVKYL+NLTK+PRFDM+IMCLSSTD +DL ++WD  F  A TP EY EI
Sbjct: 363  VASFFIDDMDFAVKYLENLTKVPRFDMLIMCLSSTDTSDLLKMWDGVFCSASTPIEYAEI 422

Query: 1531 LGHLRSKY 1554
            L +LRSKY
Sbjct: 423  LDNLRSKY 430


>ref|XP_004144746.1| PREDICTED: RNA polymerase II-associated protein 3-like [Cucumis
            sativus] gi|449517788|ref|XP_004165926.1| PREDICTED: RNA
            polymerase II-associated protein 3-like [Cucumis sativus]
          Length = 458

 Score =  394 bits (1012), Expect = e-107
 Identities = 239/495 (48%), Positives = 302/495 (61%), Gaps = 8/495 (1%)
 Frame = +1

Query: 94   MASSSSSRVPAKHSRDQAYQDFEGLLNNLQDWELSFXXXXXXXXXXXQGKEKPDLPAQRY 273
            MA SS     AKH RDQ   DF+G LN+LQDWE+SF           +GK+K   P    
Sbjct: 1    MADSS-----AKHGRDQLL-DFQGFLNDLQDWEVSF-----------KGKDKKLKP---- 39

Query: 274  NIGNVSQLSNNSRVGETAADQRSPYAHNVVSSSQHDYLREYDKLSKLSSGFMAEESSVDA 453
                         +G+   D+R         +S  DY+++YD +++LS  F  E S VDA
Sbjct: 40   -----------QAIGKEKEDRRQ-----TEKASAADYMKQYDAVNRLSRNFQTEGSFVDA 83

Query: 454  NSEKELGNEYFKKKNFNEAIDCYSRSIVLSPTAVAYANRAMAYIKIKRFQEAESDCTEAL 633
             SEKE GNEYFK+K F EAIDCYSRSI LSPTAVA+ANRAMAY+KI+RFQEAE DCTEAL
Sbjct: 84   ASEKEQGNEYFKQKKFKEAIDCYSRSIALSPTAVAFANRAMAYLKIRRFQEAEDDCTEAL 143

Query: 634  NLDDRYTKAYSRRSTARKELGNFKESLEDADFALRLEPQNQEIKKQYXXXXXXXXXXXXX 813
            NLDDRY KAYSRR+TARKELG  KE+LEDA+FA RLEP NQEIKKQ+             
Sbjct: 144  NLDDRYIKAYSRRATARKELGKAKEALEDAEFAQRLEPNNQEIKKQHADLRAFVGKAILE 203

Query: 814  XXXXXXXXXTRGVKKAVKPKVEVDKSVESVQSVTSNSGSRMVAEIQDDRSKGNNGSVPSK 993
                     T+  KK +K K + D  ++ +  V S+S SR       +R + N G    K
Sbjct: 204  KASGASRSSTKN-KKTLK-KSDSDAKIQDIPPV-SSSTSRTGLLAARERVEENGGGNAVK 260

Query: 994  TSIGHDGTSPWTKDQNGEASREIATQSFGLDTT------KRNTTNGKQELKPSVQELXXX 1155
            TS   +  S  T       S+++AT  F  D++      +R+    KQELK SV EL   
Sbjct: 261  TSARLE-ESEDTSSGAEITSKKVATNGFHKDSSSYLSALERDHLPRKQELKASVYELASQ 319

Query: 1156 XXXXXXXXXXKNISPPNSAYQFEVSWRGLSGDRNLQARLLKVTSPTSLPHIFKNALSAPL 1335
                      KNI  P +AYQFEVSWRG SGD+ LQARLLK  SP  LP IFK+AL+AP+
Sbjct: 320  AASRSMVEAAKNIIAPTTAYQFEVSWRGFSGDQALQARLLKTISPAKLPQIFKDALTAPI 379

Query: 1336 LVDIVRCIGTFFVEEMDLAVKYLQNLTKIPRFDMIIMCLSSTDKADLARLWDQTF-GKAT 1512
            L+DIV+C+ TFF+EE  LA+ +L+NL  +PRF +++MCLSS++K DL ++WD+ F  +A 
Sbjct: 380  LIDIVKCVATFFIEEPALAISFLENLVNVPRFSILMMCLSSSEKFDLLKIWDEVFCDEAV 439

Query: 1513 P-EYDEILGHLRSKY 1554
            P EY E+L  LRSKY
Sbjct: 440  PIEYAEMLDSLRSKY 454


>ref|NP_176039.2| tetratricopeptide repeat domain-containing protein [Arabidopsis
            thaliana] gi|53828529|gb|AAU94374.1| At1g56440
            [Arabidopsis thaliana] gi|59958350|gb|AAX12885.1|
            At1g56440 [Arabidopsis thaliana]
            gi|110743110|dbj|BAE99447.1| hypothetical protein
            [Arabidopsis thaliana] gi|332195274|gb|AEE33395.1|
            tetratricopeptide repeat domain-containing protein
            [Arabidopsis thaliana]
          Length = 476

 Score =  385 bits (988), Expect = e-104
 Identities = 227/508 (44%), Positives = 307/508 (60%), Gaps = 27/508 (5%)
 Frame = +1

Query: 112  SRVPAKHSRDQAYQDFEGLLNNLQDWELSFXXXXXXXXXXXQGKEKPDLPAQRYNIGNVS 291
            +R P+KH RDQ  QDF+G  N+LQDWELS              K+K              
Sbjct: 2    ARSPSKHGRDQT-QDFQGFFNDLQDWELSL-------------KDK-------------- 33

Query: 292  QLSNNSRVGETAADQRSPYAHNV--VSSSQHDYLREYDKLSKLSSGFMAEESSVDANSEK 465
                + ++ +  A+  +P +       S ++D+ ++Y  +  LSS  + E S +D++SEK
Sbjct: 34   ----DKKIKQQPANSSNPSSETFRPSGSGKYDFAKKYRSIRDLSSSLIGE-SLLDSSSEK 88

Query: 466  ELGNEYFKKKNFNEAIDCYSRSIVLSPTAVAYANRAMAYIKIKRFQEAESDCTEALNLDD 645
            E GNE+FK+K FNEAIDCYSRSI LSP AV YANRAMAY+KIKR++EAE DCTEALNLDD
Sbjct: 89   EQGNEFFKQKKFNEAIDCYSRSIALSPNAVTYANRAMAYLKIKRYREAEVDCTEALNLDD 148

Query: 646  RYTKAYSRRSTARKELGNFKESLEDADFALRLEPQNQEIKKQYXXXXXXXXXXXXXXXXX 825
            RY KAYSRR+TARKELG  KE+ EDA+FALRLEP++QE+KKQY                 
Sbjct: 149  RYIKAYSRRATARKELGMIKEAKEDAEFALRLEPESQELKKQYADIKSLLEKEIIEKATG 208

Query: 826  XXXXXTRGV-------KKAVKPKVEVDKS------------VESVQSVTSNSGSRMVAEI 948
                  + +       KK  KPK E+               V+ V     +SG +++  I
Sbjct: 209  AMQSTAQELLKTSGLDKKIQKPKTEMTSKPVTLVAKTNRDIVQPVLGSNESSGKKLIENI 268

Query: 949  Q-DDRSKGNNGSVPSKTSI-GHDGTSPWTKDQNGEA--SREIATQSFGLDTTKRNTTNGK 1116
            Q +++SK  +  +P+ T I      +P ++    EA  S    TQ  G      N  + +
Sbjct: 269  QPEEKSKEGSMKIPAITEILDSKKVTPGSQSYEKEAKPSDRNGTQPSG----PENQVSKQ 324

Query: 1117 QELKPSVQELXXXXXXXXXXXXXKNISPPNSAYQFEVSWRGLSGDRNLQARLLKVTSPTS 1296
             ELKPSVQEL             KNI  P SAY+FE SWR  SGD  L+++LLKVT+P+S
Sbjct: 325  LELKPSVQELAAHAASLAMTEASKNIKTPKSAYEFENSWRSFSGDSALRSQLLKVTTPSS 384

Query: 1297 LPHIFKNALSAPLLVDIVRCIGTFFVEEMDLAVKYLQNLTKIPRFDMIIMCLSSTDKADL 1476
            LP IFKNAL++P+LVDI++C+ +FF E+MDLAVKY++NLTK+PRF+M++MCL+ST+K +L
Sbjct: 385  LPQIFKNALTSPVLVDIIKCVASFFTEDMDLAVKYIENLTKVPRFNMLVMCLTSTEKNEL 444

Query: 1477 ARLWDQTF-GKATP-EYDEILGHLRSKY 1554
             ++W+  F  KATP EY E+L  LRS+Y
Sbjct: 445  LKIWEDVFCNKATPMEYAEVLDKLRSRY 472

