BLASTX nr result

ID: Cephaelis21_contig00021490 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00021490
         (1929 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002527905.1| pentatricopeptide repeat-containing protein,...   591   e-166
ref|XP_002267979.1| PREDICTED: pentatricopeptide repeat-containi...   588   e-165
ref|XP_002331159.1| predicted protein [Populus trichocarpa] gi|1...   582   e-163
ref|XP_003531991.1| PREDICTED: pentatricopeptide repeat-containi...   561   e-157
ref|XP_002886593.1| pentatricopeptide repeat-containing protein ...   548   e-153

>ref|XP_002527905.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223532680|gb|EEF34462.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 490

 Score =  591 bits (1523), Expect = e-166
 Identities = 303/489 (61%), Positives = 352/489 (71%)
 Frame = -1

Query: 1824 MQQFTRKTKNVGKRSTXXXXXXXXXXXXXXXXXXXXKVRQQLNEFLKSHKSAFKWEVDRS 1645
            M Q   +TKNV KRS                      VR+QLN+FLKS K  +KWEV  +
Sbjct: 3    MPQLFGRTKNVAKRSKKYLEEALYFRLFKEGSSEVK-VREQLNQFLKSSKRVYKWEVGDT 61

Query: 1644 LKLLRQRKLYGPALKLSENMKKRGMNKTTSDEAIHLDLIAKAIGIASAETYFVTLPDTSK 1465
            LK LR R LY PALKLSE M KRGMNKT SD+AIHLDL+AK  GI +AE +F+ LP+TSK
Sbjct: 62   LKKLRSRGLYYPALKLSEAMSKRGMNKTVSDQAIHLDLVAKTRGIPAAEIFFIDLPETSK 121

Query: 1464 NHLIYGALLNCYCKALMIXXXXXXXXXXXXXXXXLTSMPYNSLMTLYMKTGQPEKVPAII 1285
            NHL YGALLNCYC  LM                 L+SM YNSLMTLY K GQPE++P II
Sbjct: 122  NHLTYGALLNCYCNQLMTEEAEALMEKMKELNLGLSSMSYNSLMTLYTKIGQPERIPGII 181

Query: 1284 QEMKDSNIMPDSYTYNVWMRAFAAANDITGVERLIDEMKRDGRVSADWTTYSNLASIYAD 1105
            QEMK  +IMPDSYTYNVWMRA AA NDI+G ER+I+EMKRDGRV+ADWTTYSNLASIY D
Sbjct: 182  QEMKSDSIMPDSYTYNVWMRALAAVNDISGAERVIEEMKRDGRVAADWTTYSNLASIYVD 241

Query: 1104 VGXXXXXXXXXXXXXXKNDRRDLSAYQYLITLYGRTGNLLEVYRVWRSLRLAFRKTANIS 925
                            +N  RD SA+Q+LITLYGR GNL E+YR+WRSLRLAF KT+NIS
Sbjct: 242  AQLFEKAEKTLKELEKRNVHRDHSAFQFLITLYGRIGNLHELYRIWRSLRLAFPKTSNIS 301

Query: 924  YLNMIQVLVNLNDLPGAEKCFREWESGQPTYDIRIANVLIRAYTKQGXXXXXXXXXXXXX 745
            YLNMIQVLVNL DLPGAEKCFREWES    YDIR+ANVLI+AY K+G             
Sbjct: 302  YLNMIQVLVNLKDLPGAEKCFREWESNCSGYDIRVANVLIKAYAKKGLLEKAEELKERAI 361

Query: 744  KCGGKANAKTWEIFLEYYLKNGEIKSAVECAQNAISTGRGDGSKWVPSSAVVTEIMEHFE 565
              G K NAKTWEIF +YY +NG+IK +VEC  NAIS GRGDG KW+PS  VV   M HFE
Sbjct: 362  GRGAKPNAKTWEIFSDYYFENGDIKLSVECLANAISKGRGDGQKWIPSPEVVASFMAHFE 421

Query: 564  QKKEVDGAEGFVEILKQAGEGLEVNVFESLLKTYAAAGRTSPIMRQRMKMENVELSEEGK 385
            Q+K+VDGAEGF+EILK+A + +E NVFESL++TYAAAGRTS ++R+R+KMENVE+SE G+
Sbjct: 422  QQKDVDGAEGFIEILKKATDDVEANVFESLIRTYAAAGRTSQVLRRRLKMENVEVSEAGQ 481

Query: 384  KLLDVICVE 358
            KLL++ICVE
Sbjct: 482  KLLEMICVE 490


>ref|XP_002267979.1| PREDICTED: pentatricopeptide repeat-containing protein At1g60770
            [Vitis vinifera]
          Length = 489

 Score =  588 bits (1517), Expect = e-165
 Identities = 305/489 (62%), Positives = 352/489 (71%)
 Frame = -1

Query: 1824 MQQFTRKTKNVGKRSTXXXXXXXXXXXXXXXXXXXXKVRQQLNEFLKSHKSAFKWEVDRS 1645
            M Q +R TKN+ KRS                      VRQQLN FLKS K  FKWEV  +
Sbjct: 3    MPQLSR-TKNIAKRSKKYLEEALYDRLFKDGSSEVS-VRQQLNHFLKSSKRVFKWEVGDT 60

Query: 1644 LKLLRQRKLYGPALKLSENMKKRGMNKTTSDEAIHLDLIAKAIGIASAETYFVTLPDTSK 1465
            +K LR RK + PALKLSE M KRGMN T SD+AI+LDLI K  G+A+AE YF+ LP+TSK
Sbjct: 61   VKKLRDRKRFYPALKLSETMAKRGMNMTISDQAIYLDLITKTRGVAAAENYFIDLPETSK 120

Query: 1464 NHLIYGALLNCYCKALMIXXXXXXXXXXXXXXXXLTSMPYNSLMTLYMKTGQPEKVPAII 1285
            NHL YGALLNCYCK L+                 L+SMPYNSLMTLY K GQPEK+P II
Sbjct: 121  NHLTYGALLNCYCKELLTEKAEALMERMKELKLGLSSMPYNSLMTLYTKIGQPEKIPTII 180

Query: 1284 QEMKDSNIMPDSYTYNVWMRAFAAANDITGVERLIDEMKRDGRVSADWTTYSNLASIYAD 1105
            QE+K  +IMPDSYTYN+WMRA AA NDI+GVER+I+EMKRDGRV++DWTTYSNLASIY D
Sbjct: 181  QELKSLDIMPDSYTYNIWMRALAAVNDISGVERVIEEMKRDGRVASDWTTYSNLASIYVD 240

Query: 1104 VGXXXXXXXXXXXXXXKNDRRDLSAYQYLITLYGRTGNLLEVYRVWRSLRLAFRKTANIS 925
             G              +N  RDL+A+Q+LITLYGR GNLLEVYRVWRSLRLAF KTAN+S
Sbjct: 241  AGVFEKAEKALKELEKRNACRDLTAFQFLITLYGRIGNLLEVYRVWRSLRLAFPKTANVS 300

Query: 924  YLNMIQVLVNLNDLPGAEKCFREWESGQPTYDIRIANVLIRAYTKQGXXXXXXXXXXXXX 745
            YLNMIQVLVNL DLPGAEKCFREWESG   YDIR+AN LI AY K G             
Sbjct: 301  YLNMIQVLVNLKDLPGAEKCFREWESGCSIYDIRVANALIGAYAKDGLLEKAEELKEHAR 360

Query: 744  KCGGKANAKTWEIFLEYYLKNGEIKSAVECAQNAISTGRGDGSKWVPSSAVVTEIMEHFE 565
            + G K NAKTWEIFL Y+LKN E+K AV+C  NAISTGRGDG KWVPS  ++   M+HFE
Sbjct: 361  RRGAKPNAKTWEIFLAYHLKNREMKQAVDCVANAISTGRGDGQKWVPSPEIIGVFMQHFE 420

Query: 564  QKKEVDGAEGFVEILKQAGEGLEVNVFESLLKTYAAAGRTSPIMRQRMKMENVELSEEGK 385
            Q+K+VDGAEGF+EILK   E L V VFESL++ YAAAGRTSP+MR+R+KMENVE+S+  K
Sbjct: 421  QEKDVDGAEGFLEILKSTVEDLGVEVFESLIRIYAAAGRTSPVMRRRLKMENVEVSDSCK 480

Query: 384  KLLDVICVE 358
            KLL+ + VE
Sbjct: 481  KLLEEVSVE 489


>ref|XP_002331159.1| predicted protein [Populus trichocarpa] gi|118487894|gb|ABK95769.1|
            unknown [Populus trichocarpa] gi|222873242|gb|EEF10373.1|
            predicted protein [Populus trichocarpa]
          Length = 490

 Score =  582 bits (1499), Expect = e-163
 Identities = 296/489 (60%), Positives = 351/489 (71%)
 Frame = -1

Query: 1824 MQQFTRKTKNVGKRSTXXXXXXXXXXXXXXXXXXXXKVRQQLNEFLKSHKSAFKWEVDRS 1645
            M Q   +TK+V KRS                      VRQQLN+FLKS K  FKWEV  +
Sbjct: 3    MPQLYGRTKSVTKRSKKYLEEALYVRLFKEGSSEVS-VRQQLNQFLKSSKRVFKWEVGDT 61

Query: 1644 LKLLRQRKLYGPALKLSENMKKRGMNKTTSDEAIHLDLIAKAIGIASAETYFVTLPDTSK 1465
            +K LR R LY PA+KLSE M  RGMNKT SD+AIHLDL+AK  GI +AE YF+ LP+TSK
Sbjct: 62   IKKLRSRNLYYPAVKLSETMSSRGMNKTVSDQAIHLDLVAKTRGIPAAENYFIDLPETSK 121

Query: 1464 NHLIYGALLNCYCKALMIXXXXXXXXXXXXXXXXLTSMPYNSLMTLYMKTGQPEKVPAII 1285
            N   YGALLNCYCK LM                 L+SM YNSLMTLY K GQPE++PAII
Sbjct: 122  NLRTYGALLNCYCKELMTEEAEALIEKMKELNLGLSSMSYNSLMTLYTKVGQPERIPAII 181

Query: 1284 QEMKDSNIMPDSYTYNVWMRAFAAANDITGVERLIDEMKRDGRVSADWTTYSNLASIYAD 1105
            QEMK  N+MPDSYTYNVWMRA AA NDI+GVER+I+EMKRDGRV+A+WTTYSNLASIY D
Sbjct: 182  QEMKADNVMPDSYTYNVWMRALAAVNDISGVERVIEEMKRDGRVAANWTTYSNLASIYVD 241

Query: 1104 VGXXXXXXXXXXXXXXKNDRRDLSAYQYLITLYGRTGNLLEVYRVWRSLRLAFRKTANIS 925
             G               N  +DL A+Q+LITLYGRTG L+EVYR+WRSLRLAF KTANIS
Sbjct: 242  AGYFDKAEKALKELEKINANKDLFAFQFLITLYGRTGKLIEVYRIWRSLRLAFPKTANIS 301

Query: 924  YLNMIQVLVNLNDLPGAEKCFREWESGQPTYDIRIANVLIRAYTKQGXXXXXXXXXXXXX 745
            YLNMIQVLVNL D+PGAEKCFREWESG  TYDIR+ANV+I AY K+G             
Sbjct: 302  YLNMIQVLVNLKDVPGAEKCFREWESGCSTYDIRVANVVISAYAKEGLVDKAEELKERAR 361

Query: 744  KCGGKANAKTWEIFLEYYLKNGEIKSAVECAQNAISTGRGDGSKWVPSSAVVTEIMEHFE 565
            + G K N+KTWEIF +YYLKNG++K  V+C  NA+S GRG+G KWVPS  +V  +M HFE
Sbjct: 362  RRGAKPNSKTWEIFCDYYLKNGDVKLGVDCIANAVSAGRGNGQKWVPSPVIVGSLMAHFE 421

Query: 564  QKKEVDGAEGFVEILKQAGEGLEVNVFESLLKTYAAAGRTSPIMRQRMKMENVELSEEGK 385
            Q+K+VDGAE  +EILK+A + + V VFESL++TYAAAGR S +MR+R+KMENVE+S++ +
Sbjct: 422  QQKDVDGAEDLIEILKKAVDDVAVEVFESLIRTYAAAGRKSQLMRRRLKMENVEVSDDCQ 481

Query: 384  KLLDVICVE 358
            KLL+ ICVE
Sbjct: 482  KLLEAICVE 490


>ref|XP_003531991.1| PREDICTED: pentatricopeptide repeat-containing protein At1g60770-like
            [Glycine max]
          Length = 490

 Score =  561 bits (1447), Expect = e-157
 Identities = 290/452 (64%), Positives = 335/452 (74%)
 Frame = -1

Query: 1713 VRQQLNEFLKSHKSAFKWEVDRSLKLLRQRKLYGPALKLSENMKKRGMNKTTSDEAIHLD 1534
            VRQ LN F+KS K  +KWEV  +LK LR RKLY PALKLSE M KR M KT SD AIHLD
Sbjct: 39   VRQSLNNFVKSRKRVYKWEVGDTLKKLRDRKLYQPALKLSETMAKRNMIKTVSDHAIHLD 98

Query: 1533 LIAKAIGIASAETYFVTLPDTSKNHLIYGALLNCYCKALMIXXXXXXXXXXXXXXXXLTS 1354
            L+AKA GI +AE YFV+LP+ SKNHL YGALLNCYCK LM                 L+S
Sbjct: 99   LLAKARGITAAENYFVSLPEPSKNHLCYGALLNCYCKELMTEKSEGLMEKMKELSLPLSS 158

Query: 1353 MPYNSLMTLYMKTGQPEKVPAIIQEMKDSNIMPDSYTYNVWMRAFAAANDITGVERLIDE 1174
            MPYNSLMTLY K GQPEK+P++IQEMK SN+M DSYTYNVWMRA AA NDI+GVER+ DE
Sbjct: 159  MPYNSLMTLYTKVGQPEKIPSLIQEMKASNVMLDSYTYNVWMRALAAVNDISGVERVHDE 218

Query: 1173 MKRDGRVSADWTTYSNLASIYADVGXXXXXXXXXXXXXXKNDRRDLSAYQYLITLYGRTG 994
            MKR G+V+ DWTTYSNLASI+ D G              +N  +DL+AYQ+LITLYGRTG
Sbjct: 219  MKRGGQVTGDWTTYSNLASIFVDAGLFDKAEVALKELEKRNAFKDLTAYQFLITLYGRTG 278

Query: 993  NLLEVYRVWRSLRLAFRKTANISYLNMIQVLVNLNDLPGAEKCFREWESGQPTYDIRIAN 814
            NL EVYRVWRSLRLAF KTANISYLNMIQVLVNL DLPGAEKCFREWE G PTYDIR+AN
Sbjct: 279  NLYEVYRVWRSLRLAFPKTANISYLNMIQVLVNLKDLPGAEKCFREWECGCPTYDIRVAN 338

Query: 813  VLIRAYTKQGXXXXXXXXXXXXXKCGGKANAKTWEIFLEYYLKNGEIKSAVECAQNAIST 634
            VLIRAY K               + G K NAKT EIF++YYL  G+ K AV+    AIS 
Sbjct: 339  VLIRAYVKLDMLEKAEELKERARRRGAKPNAKTLEIFMDYYLLKGDFKLAVDYLNEAISM 398

Query: 633  GRGDGSKWVPSSAVVTEIMEHFEQKKEVDGAEGFVEILKQAGEGLEVNVFESLLKTYAAA 454
            GRG+G KWVPSS +++ +M HFEQ+K+VDGAE F+EILK++ E   V VFESL++TYAAA
Sbjct: 399  GRGNGEKWVPSSRIISIMMRHFEQEKDVDGAEEFLEILKKSVESPGVEVFESLIRTYAAA 458

Query: 453  GRTSPIMRQRMKMENVELSEEGKKLLDVICVE 358
            GR S  M++R+KMENVE+SE  +KLL+ I VE
Sbjct: 459  GRISSAMQRRLKMENVEVSEGTQKLLEAISVE 490


>ref|XP_002886593.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297332434|gb|EFH62852.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 491

 Score =  548 bits (1411), Expect = e-153
 Identities = 276/483 (57%), Positives = 339/483 (70%)
 Frame = -1

Query: 1806 KTKNVGKRSTXXXXXXXXXXXXXXXXXXXXKVRQQLNEFLKSHKSAFKWEVDRSLKLLRQ 1627
            ++++V KRST                    KVRQQLN+FLK  K  FKWEV  ++K LR 
Sbjct: 8    RSRDVTKRSTKKYIEEPLYHRLFKDGGTEVKVRQQLNQFLKGTKHVFKWEVGDTIKKLRN 67

Query: 1626 RKLYGPALKLSENMKKRGMNKTTSDEAIHLDLIAKAIGIASAETYFVTLPDTSKNHLIYG 1447
            R LY PALKLSE M++RGMNKT SD+AIHLDL+AKA GI + E YFV LP+TSK  L YG
Sbjct: 68   RGLYYPALKLSEVMEERGMNKTVSDQAIHLDLVAKARGITAGENYFVDLPETSKTELTYG 127

Query: 1446 ALLNCYCKALMIXXXXXXXXXXXXXXXXLTSMPYNSLMTLYMKTGQPEKVPAIIQEMKDS 1267
            +LLNCYCK L+                  +SM YNSLMTLY KTGQ EKVPA+IQE+K  
Sbjct: 128  SLLNCYCKELLTEKAEGLLNKMKELNITPSSMSYNSLMTLYTKTGQTEKVPAMIQELKAE 187

Query: 1266 NIMPDSYTYNVWMRAFAAANDITGVERLIDEMKRDGRVSADWTTYSNLASIYADVGXXXX 1087
            N+MPDSYTYNVWMRA AA NDI+GVER+I+EM RDGRV+ DWTTYSN+ASIY D G    
Sbjct: 188  NVMPDSYTYNVWMRALAATNDISGVERVIEEMNRDGRVAPDWTTYSNMASIYVDAGLSQK 247

Query: 1086 XXXXXXXXXXKNDRRDLSAYQYLITLYGRTGNLLEVYRVWRSLRLAFRKTANISYLNMIQ 907
                      KN +RD +AYQ+LITLYGR G L EVYR+WRSLRLA  KT+N++YLNMIQ
Sbjct: 248  AEKALQELEMKNTQRDFTAYQFLITLYGRLGKLTEVYRIWRSLRLAMPKTSNVAYLNMIQ 307

Query: 906  VLVNLNDLPGAEKCFREWESGQPTYDIRIANVLIRAYTKQGXXXXXXXXXXXXXKCGGKA 727
            VLV LNDLPGAE  F+EW++   TYDIRI NVLI AY K+G             + GGKA
Sbjct: 308  VLVKLNDLPGAETLFKEWQANCSTYDIRIVNVLIGAYAKEGLIEKAKELKEKAPRRGGKA 367

Query: 726  NAKTWEIFLEYYLKNGEIKSAVECAQNAISTGRGDGSKWVPSSAVVTEIMEHFEQKKEVD 547
            NAKTWEIF++YY+K+G++  A+EC   A+S G+GDG KW+PS   V  +M +FEQKK+V+
Sbjct: 368  NAKTWEIFMDYYVKSGDMAHALECMSKAVSIGKGDGGKWIPSQETVRTLMSYFEQKKDVN 427

Query: 546  GAEGFVEILKQAGEGLEVNVFESLLKTYAAAGRTSPIMRQRMKMENVELSEEGKKLLDVI 367
            GAE  +EILK   + +   +FESL++TYAAAG++ P MR+R+KMENVE++E  KKLLD +
Sbjct: 428  GAENLLEILKNGTDNIGAEIFESLIRTYAAAGKSHPAMRRRLKMENVEVNEVTKKLLDEV 487

Query: 366  CVE 358
              E
Sbjct: 488  SQE 490


Top