BLASTX nr result

ID: Cephaelis21_contig00018343 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00018343
         (1095 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN70994.1| hypothetical protein VITISV_038698 [Vitis vinifera]   435   e-120
ref|XP_002326871.1| predicted protein [Populus trichocarpa] gi|2...   424   e-116
ref|XP_002511599.1| pentatricopeptide repeat-containing protein,...   412   e-112
ref|XP_002271824.1| PREDICTED: pentatricopeptide repeat-containi...   391   e-106
ref|XP_003551717.1| PREDICTED: pentatricopeptide repeat-containi...   370   e-100

>emb|CAN70994.1| hypothetical protein VITISV_038698 [Vitis vinifera]
          Length = 751

 Score =  435 bits (1119), Expect = e-120
 Identities = 224/365 (61%), Positives = 271/365 (74%), Gaps = 7/365 (1%)
 Frame = +2

Query: 20   RRGETEWLWSPREKKCLFLLQQRNTRATLLQIHAFMIQNALQTNINILTKLIDAFASS-- 193
            R  + + LWSP E+KCL LLQQ  TRA LLQIHAFM++NAL+TN N+ TK I   +S   
Sbjct: 140  RGNQQQSLWSPIERKCLSLLQQSKTRANLLQIHAFMLRNALETNPNLFTKFIATCSSIAL 199

Query: 194  -----DPLACISHARRIFDFSSRKDDTFLCNTMIKSHLNARQFTEATLLYRYLLRNAGFK 358
                 DPLA I HARR+FD    +DD FLCN+MIK+++  RQ++E+  LYR L RN  F 
Sbjct: 200  LAPLYDPLAGIVHARRMFDHRPHRDDAFLCNSMIKAYVGMRQYSESFALYRDLRRNTSFT 259

Query: 359  PDNYTFVSLAKCCGLNTACFEGLGIHNHSLKSGFGSNLYVATALVDMYGKFGEMGFARKL 538
            PD++TF  LAK C LN A +EG  IH+H +  GF  +LY ATALVDMY KFG+M  ARKL
Sbjct: 260  PDSFTFSVLAKSCALNMAIWEGQEIHSHVVAVGFCLDLYAATALVDMYAKFGKMDCARKL 319

Query: 539  FDEMTERSSVSWTALIDGYVKTGDMGSAMGLFYFMPEKDVAAYNVMIDAHVKMGEMGLAR 718
            FDEM +RS VSWTALI GYV++GDM +A  LF  M EKD AA+N MIDA+VK+G+M  AR
Sbjct: 320  FDEMIDRSQVSWTALIGGYVRSGDMDNAGKLFDQMIEKDSAAFNTMIDAYVKLGDMCSAR 379

Query: 719  SLFETMPERNVVSWTSMIDGYCSAGNVAEARLLFDAMPVRNLCSWNAIIGGYSQNKQPHE 898
             LF+ MPER+VVSWT MI GY S GN+  AR LFDAMP +NL SWNA+I GY QNKQP+E
Sbjct: 380  KLFDEMPERSVVSWTIMIYGYSSNGNLDSARSLFDAMPEKNLFSWNAMISGYXQNKQPYE 439

Query: 899  ALSLFHQLQMMTIFQPDNVTLVSVLPAIADLGALELGNWVYHYASRKKLDRYSNVCTAII 1078
            AL LFH++Q  T  +PD VT+VSVLPAIADLGAL+LG WV+ +  RKKLDR +NV TA+I
Sbjct: 440  ALKLFHEMQSTTSLEPDEVTIVSVLPAIADLGALDLGGWVHRFVRRKKLDRATNVGTALI 499

Query: 1079 DMYAK 1093
            DMYAK
Sbjct: 500  DMYAK 504



 Score = 80.5 bits (197), Expect = 7e-13
 Identities = 64/216 (29%), Positives = 93/216 (43%), Gaps = 8/216 (3%)
 Frame = +2

Query: 209  ISHARRIFDFSSRKDDTFLCNTMIKSHLNARQFTEATLLYRYLLRNAGFKPDNYTFVSLA 388
            +  AR +FD    K+  F  N MI  +   +Q  EA  L+  +      +PD  T VS+ 
Sbjct: 406  LDSARSLFDAMPEKN-LFSWNAMISGYXQNKQPYEALKLFHEMQSTTSLEPDEVTIVSVL 464

Query: 389  KCCGLNTACFEGLGIHNHSLKSGFGSNLYVATALVDMYGKFGEMGFARKLFDEMTERSSV 568
                   A   G  +H    +        V TAL+DMY K GE+  +R +FD M E+ + 
Sbjct: 465  PAIADLGALDLGGWVHRFVRRKKLDRATNVGTALIDMYAKCGEIVKSRGVFDNMPEKETA 524

Query: 569  SWTALIDGYVKTGDMGSAMGLFYFMPEKDVAAYNV-MIDAHVKMGEMGL---ARSLFETM 736
            SW ALI+ +   G    A+GLF  M  K      + MI         GL    +  F+ M
Sbjct: 525  SWNALINAFAINGRAKEALGLFMEMNHKGFMPNEITMIGVLSACNHSGLVEEGKRWFKAM 584

Query: 737  PE----RNVVSWTSMIDGYCSAGNVAEARLLFDAMP 832
             E      +  +  M+D    AG + EA  L ++MP
Sbjct: 585  EEFGLTPKIEHYGCMVDLLGRAGCLQEAEKLMESMP 620


>ref|XP_002326871.1| predicted protein [Populus trichocarpa] gi|222835186|gb|EEE73621.1|
            predicted protein [Populus trichocarpa]
          Length = 581

 Score =  424 bits (1091), Expect = e-116
 Identities = 207/346 (59%), Positives = 260/346 (75%)
 Frame = +2

Query: 56   EKKCLFLLQQRNTRATLLQIHAFMIQNALQTNINILTKLIDAFASSDPLACISHARRIFD 235
            E++CLFLLQ+  TR TLLQIHA +++NA+  N+NILTK I    +   L+   HAR +FD
Sbjct: 2    ERECLFLLQRCRTRKTLLQIHALILRNAIDANVNILTKFI---TTCGQLSSTRHARHLFD 58

Query: 236  FSSRKDDTFLCNTMIKSHLNARQFTEATLLYRYLLRNAGFKPDNYTFVSLAKCCGLNTAC 415
              S + DTFLCN+MIKSH+  RQ  +A  LY+ L R   F PDN+TF  LAKCC L  A 
Sbjct: 59   NRSHRGDTFLCNSMIKSHVVMRQLADAFTLYKDLRRETCFVPDNFTFTVLAKCCALRMAV 118

Query: 416  FEGLGIHNHSLKSGFGSNLYVATALVDMYGKFGEMGFARKLFDEMTERSSVSWTALIDGY 595
            +EGL  H H +K GF  ++YV+TALVDMY KFG +G ARK+F++M +RS VSWTALI GY
Sbjct: 119  WEGLETHGHVVKIGFCFDMYVSTALVDMYAKFGNLGLARKVFNDMPDRSLVSWTALIGGY 178

Query: 596  VKTGDMGSAMGLFYFMPEKDVAAYNVMIDAHVKMGEMGLARSLFETMPERNVVSWTSMID 775
            V+ GDMG+A  LF  MP +D AA+N++ID +VK+G+M  ARSLF+ MPERNV+SWTSMI 
Sbjct: 179  VRRGDMGNAWFLFKLMPGRDSAAFNLLIDGYVKVGDMESARSLFDEMPERNVISWTSMIY 238

Query: 776  GYCSAGNVAEARLLFDAMPVRNLCSWNAIIGGYSQNKQPHEALSLFHQLQMMTIFQPDNV 955
            GYC+ G+V  AR LFDAMP +NL SWNA+IGGY QNKQPHEAL LF +LQ  T+F+P+ V
Sbjct: 239  GYCNNGDVLSARFLFDAMPEKNLVSWNAMIGGYCQNKQPHEALKLFRELQSSTVFEPNEV 298

Query: 956  TLVSVLPAIADLGALELGNWVYHYASRKKLDRYSNVCTAIIDMYAK 1093
            T+VS+LPAIA LGALELG WV+ +  RKKLD   NVCT+++DMY K
Sbjct: 299  TVVSILPAIATLGALELGEWVHRFVQRKKLDAAVNVCTSLVDMYLK 344



 Score = 75.5 bits (184), Expect = 2e-11
 Identities = 58/213 (27%), Positives = 100/213 (46%), Gaps = 8/213 (3%)
 Frame = +2

Query: 218 ARRIFDFSSRKDDTFLCNTMIKSHLNARQFTEATLLYRYLLRNAGFKPDNYTFVSLAKCC 397
           AR +FD    K+     N MI  +   +Q  EA  L+R L  +  F+P+  T VS+    
Sbjct: 249 ARFLFDAMPEKN-LVSWNAMIGGYCQNKQPHEALKLFRELQSSTVFEPNEVTVVSILPAI 307

Query: 398 GLNTACFEGLGIHNHSLKSGFGSNLYVATALVDMYGKFGEMGFARKLFDEMTERSSVSWT 577
               A   G  +H    +    + + V T+LVDMY K GE+  ARK+F E+ ++ + +W 
Sbjct: 308 ATLGALELGEWVHRFVQRKKLDAAVNVCTSLVDMYLKCGEISKARKVFSEIPKKETATWN 367

Query: 578 ALIDGYVKTGDMGSAMGLFYFMPEKDVAAYNVMID------AHVKMGEMGLA--RSLFET 733
           ALI+G+   G    A+  F  M ++ +   ++ +       +H  + E G    +++ E+
Sbjct: 368 ALINGFAMNGLASEALEAFSEMQQEGIKPNDITMTGVLSACSHGGLVEEGKGQFKAMIES 427

Query: 734 MPERNVVSWTSMIDGYCSAGNVAEARLLFDAMP 832
                +  +  ++D    AG + EA  L  +MP
Sbjct: 428 GLSPKIEHYGCLVDLLGRAGCLDEAENLIKSMP 460


>ref|XP_002511599.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223548779|gb|EEF50268.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 429

 Score =  412 bits (1058), Expect = e-112
 Identities = 204/330 (61%), Positives = 253/330 (76%), Gaps = 7/330 (2%)
 Frame = +2

Query: 125  MIQNALQTNINILTKLID-----AFASS--DPLACISHARRIFDFSSRKDDTFLCNTMIK 283
            M+++A+++N+NIL K I      A   S  + LA I HAR++FD    KDDTFLCN+MIK
Sbjct: 1    MLRSAVESNVNILAKFITISGCLALIPSVYESLAIIQHARQVFDNRPHKDDTFLCNSMIK 60

Query: 284  SHLNARQFTEATLLYRYLLRNAGFKPDNYTFVSLAKCCGLNTACFEGLGIHNHSLKSGFG 463
            +H+  RQF E+  LY+ L +  GF PDN+TF +LAK CGLN A +EG  IHNH LK GFG
Sbjct: 61   AHVGMRQFYESFTLYQDLRKGTGFLPDNFTFTALAKSCGLNMAVWEGFEIHNHVLKMGFG 120

Query: 464  SNLYVATALVDMYGKFGEMGFARKLFDEMTERSSVSWTALIDGYVKTGDMGSAMGLFYFM 643
             +LYV+TALVDMY KFGE+  ARK+FDEM ER  VSWTALI G +++GDMG+A  LF  M
Sbjct: 121  LDLYVSTALVDMYAKFGELCMARKMFDEMAERGVVSWTALIGGCMRSGDMGNARILFDQM 180

Query: 644  PEKDVAAYNVMIDAHVKMGEMGLARSLFETMPERNVVSWTSMIDGYCSAGNVAEARLLFD 823
            PEKD AAYN M+D +VK G+M  A+SLF+ MP RNV+SWTSMI GYCS G+V  AR LFD
Sbjct: 181  PEKDSAAYNAMLDGYVKAGDMESAQSLFDKMPARNVISWTSMIYGYCSGGDVLTARSLFD 240

Query: 824  AMPVRNLCSWNAIIGGYSQNKQPHEALSLFHQLQMMTIFQPDNVTLVSVLPAIADLGALE 1003
            AMP RNL SWNA+IGGYSQN + HEAL LFH++Q  T+F+PD VT+VSVLPAIADLGAL+
Sbjct: 241  AMPERNLFSWNAMIGGYSQNNKSHEALKLFHEMQSRTLFEPDKVTVVSVLPAIADLGALD 300

Query: 1004 LGNWVYHYASRKKLDRYSNVCTAIIDMYAK 1093
            LG+W++ +A  KK+DR  NVCTA++DMYAK
Sbjct: 301  LGSWIHQFARLKKIDRSINVCTALVDMYAK 330



 Score = 73.6 bits (179), Expect = 8e-11
 Identities = 50/152 (32%), Positives = 71/152 (46%)
 Frame = +2

Query: 218 ARRIFDFSSRKDDTFLCNTMIKSHLNARQFTEATLLYRYLLRNAGFKPDNYTFVSLAKCC 397
           AR +FD    ++  F  N MI  +    +  EA  L+  +     F+PD  T VS+    
Sbjct: 235 ARSLFDAMPERN-LFSWNAMIGGYSQNNKSHEALKLFHEMQSRTLFEPDKVTVVSVLPAI 293

Query: 398 GLNTACFEGLGIHNHSLKSGFGSNLYVATALVDMYGKFGEMGFARKLFDEMTERSSVSWT 577
               A   G  IH  +       ++ V TALVDMY K GEM  AR++FD M ++   SW 
Sbjct: 294 ADLGALDLGSWIHQFARLKKIDRSINVCTALVDMYAKCGEMLKARRVFDSMPKKEEASWN 353

Query: 578 ALIDGYVKTGDMGSAMGLFYFMPEKDVAAYNV 673
           ALI+G+   G    A+  F  M  + V   +V
Sbjct: 354 ALINGFAVNGCADEALTAFSEMKREGVKPNDV 385


>ref|XP_002271824.1| PREDICTED: pentatricopeptide repeat-containing protein At2g44880
            [Vitis vinifera] gi|297734603|emb|CBI16654.3| unnamed
            protein product [Vitis vinifera]
          Length = 577

 Score =  391 bits (1004), Expect = e-106
 Identities = 201/330 (60%), Positives = 245/330 (74%), Gaps = 7/330 (2%)
 Frame = +2

Query: 125  MIQNALQTNINILTKLIDAFASS-------DPLACISHARRIFDFSSRKDDTFLCNTMIK 283
            M++NAL+TN N+ TK I   +S        DPLA I HARR+FD    +DD FLCN+MIK
Sbjct: 1    MLRNALETNPNLFTKFIATCSSIALLAPLYDPLAGIVHARRMFDHRPHRDDAFLCNSMIK 60

Query: 284  SHLNARQFTEATLLYRYLLRNAGFKPDNYTFVSLAKCCGLNTACFEGLGIHNHSLKSGFG 463
            +++  RQ++E+  LYR L RN  F PD++TF  LAK C LN A +EG  IH+H +  GF 
Sbjct: 61   AYVGMRQYSESFALYRDLRRNTSFTPDSFTFSVLAKSCALNMAIWEGQEIHSHVVAVGFC 120

Query: 464  SNLYVATALVDMYGKFGEMGFARKLFDEMTERSSVSWTALIDGYVKTGDMGSAMGLFYFM 643
             +LY ATALVDMY KFG+M  ARKLFDEM +RS VSWTALI GYV++GDM +A  LF  M
Sbjct: 121  LDLYAATALVDMYAKFGKMDCARKLFDEMIDRSQVSWTALIGGYVRSGDMDNAGKLFDQM 180

Query: 644  PEKDVAAYNVMIDAHVKMGEMGLARSLFETMPERNVVSWTSMIDGYCSAGNVAEARLLFD 823
             EKD AA+N MIDA+VK+G+M  AR LF+ MPER+VVSWT MI GY S GN+  AR LFD
Sbjct: 181  IEKDSAAFNTMIDAYVKLGDMCSARKLFDEMPERSVVSWTIMIYGYSSNGNLDSARSLFD 240

Query: 824  AMPVRNLCSWNAIIGGYSQNKQPHEALSLFHQLQMMTIFQPDNVTLVSVLPAIADLGALE 1003
            AMP +NL SWNA+I GY QNKQP+EAL LFH++Q  T  +PD VT+VSVLPAIADLGAL+
Sbjct: 241  AMPEKNLFSWNAMISGYRQNKQPYEALKLFHEMQSTTSLEPDEVTIVSVLPAIADLGALD 300

Query: 1004 LGNWVYHYASRKKLDRYSNVCTAIIDMYAK 1093
            LG WV+ +  RKKLDR +NV TA+IDMYAK
Sbjct: 301  LGGWVHRFVRRKKLDRATNVGTALIDMYAK 330



 Score = 80.1 bits (196), Expect = 9e-13
 Identities = 64/216 (29%), Positives = 93/216 (43%), Gaps = 8/216 (3%)
 Frame = +2

Query: 209 ISHARRIFDFSSRKDDTFLCNTMIKSHLNARQFTEATLLYRYLLRNAGFKPDNYTFVSLA 388
           +  AR +FD    K+  F  N MI  +   +Q  EA  L+  +      +PD  T VS+ 
Sbjct: 232 LDSARSLFDAMPEKN-LFSWNAMISGYRQNKQPYEALKLFHEMQSTTSLEPDEVTIVSVL 290

Query: 389 KCCGLNTACFEGLGIHNHSLKSGFGSNLYVATALVDMYGKFGEMGFARKLFDEMTERSSV 568
                  A   G  +H    +        V TAL+DMY K GE+  +R +FD M E+ + 
Sbjct: 291 PAIADLGALDLGGWVHRFVRRKKLDRATNVGTALIDMYAKCGEIVKSRGVFDNMPEKETA 350

Query: 569 SWTALIDGYVKTGDMGSAMGLFYFMPEKDVAAYNV-MIDAHVKMGEMGL---ARSLFETM 736
           SW ALI+ +   G    A+GLF  M  K      + MI         GL    +  F+ M
Sbjct: 351 SWNALINAFAINGRAKEALGLFMEMNHKGFMPNEITMIGVLSACNHSGLVEEGKRWFKAM 410

Query: 737 PE----RNVVSWTSMIDGYCSAGNVAEARLLFDAMP 832
            E      +  +  M+D    AG + EA  L ++MP
Sbjct: 411 EEFGLTPKIEHYGCMVDLLGRAGCLQEAEKLMESMP 446


>ref|XP_003551717.1| PREDICTED: pentatricopeptide repeat-containing protein At2g44880-like
            [Glycine max]
          Length = 599

 Score =  370 bits (949), Expect = e-100
 Identities = 192/358 (53%), Positives = 253/358 (70%), Gaps = 7/358 (1%)
 Frame = +2

Query: 41   LWSPREKKCLFLLQQRNTRA-TLLQIHAFMIQNALQTNINILTKLIDAFAS-----SDPL 202
            LWS  E+ CL +LQ R     TLLQIHAF+++++L +N+N+LT  +   AS       PL
Sbjct: 11   LWSNAERTCLHILQCRTKSIPTLLQIHAFILRHSLHSNLNLLTAFVTTCASLAASAKRPL 70

Query: 203  ACISHARRIFDFSSRKDDTFLCNTMIKSHLNARQFTEATLLYRYLLRNAG-FKPDNYTFV 379
            A I+HARR F+ +  +D TFLCN+MI +H  ARQF++   L+R L R A  F PD YTF 
Sbjct: 71   AIINHARRFFNATHTRD-TFLCNSMIAAHFAARQFSQPFTLFRDLRRQAPPFTPDGYTFT 129

Query: 380  SLAKCCGLNTACFEGLGIHNHSLKSGFGSNLYVATALVDMYGKFGEMGFARKLFDEMTER 559
            +L K C    A  EG  +H   LK+G   +LYVATALVDMY KFG +G ARK+FDEM+ R
Sbjct: 130  ALVKGCATRVATGEGTLLHGMVLKNGVCFDLYVATALVDMYVKFGVLGSARKVFDEMSVR 189

Query: 560  SSVSWTALIDGYVKTGDMGSAMGLFYFMPEKDVAAYNVMIDAHVKMGEMGLARSLFETMP 739
            S VSWTA+I GY + GDM  A  LF  M ++D+ A+N MID +VKMG +GLAR LF  M 
Sbjct: 190  SKVSWTAVIVGYARCGDMSEARRLFDEMEDRDIVAFNAMIDGYVKMGCVGLARELFNEMR 249

Query: 740  ERNVVSWTSMIDGYCSAGNVAEARLLFDAMPVRNLCSWNAIIGGYSQNKQPHEALSLFHQ 919
            ERNVVSWTSM+ GYC  G+V  A+L+FD MP +N+ +WNA+IGGY QN++ H+AL LF +
Sbjct: 250  ERNVVSWTSMVSGYCGNGDVENAKLMFDLMPEKNVFTWNAMIGGYCQNRRSHDALELFRE 309

Query: 920  LQMMTIFQPDNVTLVSVLPAIADLGALELGNWVYHYASRKKLDRYSNVCTAIIDMYAK 1093
            +Q  ++ +P+ VT+V VLPA+ADLGAL+LG W++ +A RKKLDR + + TA+IDMYAK
Sbjct: 310  MQTASV-EPNEVTVVCVLPAVADLGALDLGRWIHRFALRKKLDRSARIGTALIDMYAK 366



 Score = 77.4 bits (189), Expect = 6e-12
 Identities = 67/238 (28%), Positives = 104/238 (43%), Gaps = 8/238 (3%)
 Frame = +2

Query: 143 QTNINILTKLIDAFASSDPLACISHARRIFDFSSRKDDTFLCNTMIKSHLNARQFTEATL 322
           + N+   T ++  +  +     + +A+ +FD    K+  F  N MI  +   R+  +A  
Sbjct: 250 ERNVVSWTSMVSGYCGNGD---VENAKLMFDLMPEKN-VFTWNAMIGGYCQNRRSHDALE 305

Query: 323 LYRYLLRNAGFKPDNYTFVSLAKCCGLNTACFEGLGIHNHSLKSGFGSNLYVATALVDMY 502
           L+R + + A  +P+  T V +        A   G  IH  +L+     +  + TAL+DMY
Sbjct: 306 LFREM-QTASVEPNEVTVVCVLPAVADLGALDLGRWIHRFALRKKLDRSARIGTALIDMY 364

Query: 503 GKFGEMGFARKLFDEMTERSSVSWTALIDGYVKTGDMGSAMGLFYFMPEKDVAAYNV-MI 679
            K GE+  A+  F+ MTER + SW ALI+G+   G    A+ +F  M E+      V MI
Sbjct: 365 AKCGEITKAKLAFEGMTERETASWNALINGFAVNGCAKEALEVFARMIEEGFGPNEVTMI 424

Query: 680 DAHVKMGEMGL---ARSLFETMPE----RNVVSWTSMIDGYCSAGNVAEARLLFDAMP 832
                    GL    R  F  M        V  +  M+D    AG + EA  L   MP
Sbjct: 425 GVLSACNHCGLVEEGRRWFNAMERFGIAPQVEHYGCMVDLLGRAGCLDEAENLIQTMP 482

