BLASTX nr result

ID: Cephaelis21_contig00019358 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00019358
         (2090 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002268109.1| PREDICTED: pentatricopeptide repeat-containi...   484   e-134
ref|XP_002516403.1| pentatricopeptide repeat-containing protein,...   444   e-122
ref|XP_003552730.1| PREDICTED: pentatricopeptide repeat-containi...   440   e-121
ref|XP_002321108.1| predicted protein [Populus trichocarpa] gi|2...   437   e-120
ref|XP_003538531.1| PREDICTED: pentatricopeptide repeat-containi...   425   e-116

>ref|XP_002268109.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190,
            chloroplastic-like [Vitis vinifera]
          Length = 581

 Score =  484 bits (1247), Expect = e-134
 Identities = 241/433 (55%), Positives = 320/433 (73%), Gaps = 3/433 (0%)
 Frame = +3

Query: 309  VELKQSNHRLKKAPFAEKVKEEKQLDSLEKLKEDGDWSKEEFWGVIKSLYLSSRTNEILQ 488
            VE    N RL      +K+   K    L+ L++DGDW+K+ FW VI+ L  +SR++EIL 
Sbjct: 94   VETLHENERL--GVLIQKLSN-KASSPLQLLRDDGDWNKQHFWAVIRFLKDASRSSEILP 150

Query: 489  VFDSWKNVDKSRINVENYEKIICYMFRDRLTDAAILAFQQLKSHGIQPSLEIYNLVINGF 668
            VF  WK++DKSRIN  NY KII  + ++ L + ++LA + +K+HG++PSLEIYNLVI+ F
Sbjct: 151  VFHLWKDMDKSRINEFNYAKIIGLLSQEDLAEESVLALEGMKTHGLKPSLEIYNLVIHCF 210

Query: 669  ARIGRFDDALFYLREMKNTGLQPDTEIYDGLIQAFGNFKMYDEMGRCLREMELDGCPPDH 848
            AR G FD AL++L E+K   L  DTE YDGLIQ++G +KMYDE+  C+++ME DGC PDH
Sbjct: 211  ARKGEFDRALYFLNELKANNLIADTETYDGLIQSYGKYKMYDELDECVKKMESDGCLPDH 270

Query: 849  VTYNLLIREFAKAGLLNMMERTHQALLTKKMDLQTSTIVAMLETYANFGIWAKMEKVFRR 1028
            +TYNLLI+EF++ GLL  MER  Q +L+KKM LQ+ST+V MLE YANFGI  KME  +RR
Sbjct: 271  ITYNLLIQEFSRGGLLKRMERVFQTVLSKKMGLQSSTLVVMLEAYANFGIIEKMENAYRR 330

Query: 1029 ALRSKAFVEEDLVQKLARVYIENLMFSKLDDLGLDFSSKTGSTDLVWSLRLLSHACSLSK 1208
             L SK  +++DL++KLA VYIEN  FS+L D+GL+ +S T  TDLVW LRLLSHAC LS+
Sbjct: 331  VLNSKTSLKDDLIRKLAEVYIENYKFSRLADMGLNLASVTSRTDLVWCLRLLSHACLLSR 390

Query: 1209 KGMYSIIEEMESKEVPWNVTIANIMALAFLKMKDFKQLQVLLSELPSRCVKPDVITVGVL 1388
            KG+ SI++EME+K VPWN T+AN + LA+LKMKDF +L++LL EL +R VKPD++TVG+L
Sbjct: 391  KGLDSIVKEMEAKNVPWNATVANTILLAYLKMKDFTRLRILLLELSTRHVKPDIVTVGIL 450

Query: 1389 FDACMSGFHGTLVKRTWARAGFFDDTVELNTDDLVLVAFAKGQFLRDFEELF---EAKTK 1559
            FDA   GF+GT+   TW R GF D+ VE+NTD LVL AF KG FL+  EE++   E + +
Sbjct: 451  FDANRIGFNGTMALNTWRRTGFLDEAVEMNTDPLVLSAFGKGNFLQSCEEMYSSLEPEAR 510

Query: 1560 KKGPWTYRHLIDL 1598
            KK  WTY++LIDL
Sbjct: 511  KKKIWTYQNLIDL 523


>ref|XP_002516403.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223544501|gb|EEF46020.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 502

 Score =  444 bits (1143), Expect = e-122
 Identities = 220/409 (53%), Positives = 295/409 (72%), Gaps = 3/409 (0%)
 Frame = +3

Query: 390  LEKLKEDGDWSKEEFWGVIKSLYLSSRTNEILQVFDSWKNVDKSRINVENYEKIICYMFR 569
            L+ L++D DWSK+ FW VI+ L  SSR++EILQVFD WK+++KSRIN  NYEK+I  +  
Sbjct: 86   LQMLQDDADWSKDHFWAVIRFLRHSSRSDEILQVFDMWKDIEKSRINEFNYEKVIEILGE 145

Query: 570  DRLTDAAILAFQQLKSHGIQPSLEIYNLVINGFARIGRFDDALFYLREMKNTGLQPDTEI 749
            + L + A  AF ++K+  + PSL++YN +I+G+AR G+FDDA+FYL  +K   L P ++ 
Sbjct: 146  EGLIEDAYSAFIEMKTLCLSPSLQVYNSLIHGYARNGKFDDAVFYLNHLKEINLSPVSDT 205

Query: 750  YDGLIQAFGNFKMYDEMGRCLREMELDGCPPDHVTYNLLIREFAKAGLLNMMERTHQALL 929
            Y+GLIQA+G +KMYDEMG CL++ME++GC PDHVTYNLLI+E A+AGLL  ME+ +Q   
Sbjct: 206  YNGLIQAYGKYKMYDEMGMCLKKMEMEGCSPDHVTYNLLIQELAEAGLLTRMEKVYQTTR 265

Query: 930  TKKMDLQTSTIVAMLETYANFGIWAKMEKVFRRALRSKAFVEEDLVQKLARVYIENLMFS 1109
              +MDL+++T+ AMLE YANFGI  KME + +R   SKA ++EDL++K+A VYIEN MFS
Sbjct: 266  MNRMDLKSTTLTAMLEAYANFGIVEKMELILKRTRNSKALLKEDLIKKIALVYIENFMFS 325

Query: 1110 KLDDLGLDFSSKTGSTDLVWSLRLLSHACSLSKKGMYSIIEEMESKEVPWNVTIANIMAL 1289
            +L+ LG   S ++G  D+VW L LLS+AC LS+KGM S++ EM+  +V WNVT  NI+ L
Sbjct: 326  RLEKLGHYLSKRSGQNDMVWCLLLLSNACMLSQKGMDSVVREMKVAKVSWNVTFINIILL 385

Query: 1290 AFLKMKDFKQLQVLLSELPSRCVKPDVITVGVLFDACMSGFHGTLVKRTWARAGFFDDTV 1469
            A+LKMKD  +L +LLS L +  VKPD++TVGVLFDA   GFHG  +  TW R G     V
Sbjct: 386  AYLKMKDSMRLGILLSTLTNHIVKPDIVTVGVLFDANNIGFHGNGILETWRRTGILYRCV 445

Query: 1470 ELNTDDLVLVAFAKGQFLRDFEELF---EAKTKKKGPWTYRHLIDLVKT 1607
            E  TD LVL AF KGQFL+  EE +   E   ++K  WTY +LIDLV T
Sbjct: 446  ETETDPLVLAAFGKGQFLKKCEEAYSSLEPVARQKEKWTYCNLIDLVAT 494


>ref|XP_003552730.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190,
            chloroplastic-like [Glycine max]
          Length = 509

 Score =  440 bits (1132), Expect = e-121
 Identities = 215/427 (50%), Positives = 304/427 (71%), Gaps = 3/427 (0%)
 Frame = +3

Query: 330  HRLKKAPFAEKVKEEKQLDSLEKLKEDGDWSKEEFWGVIKSLYLSSRTNEILQVFDSWKN 509
            H   +A  A+  KE+   + L  L EDGDWSK+ FW V++ L  +SR  +ILQVFD WKN
Sbjct: 72   HDSLRALLAKLQKED--CNPLHVLAEDGDWSKDHFWAVVRFLKSASRFTQILQVFDMWKN 129

Query: 510  VDKSRINVENYEKIICYMFRDRLTDAAILAFQQLKSHGIQPSLEIYNLVINGFARIGRFD 689
            ++KSRI+  NY KII  +      + A+ A + +K  GI+PSL+ YN +I+G +R G+F 
Sbjct: 130  IEKSRISEFNYNKIIGLLCEGGKMEDALSALRDMKVQGIKPSLDTYNPIIHGLSREGKFS 189

Query: 690  DALFYLREMKNTGLQPDTEIYDGLIQAFGNFKMYDEMGRCLREMELDGCPPDHVTYNLLI 869
            DAL ++ EMK +GL+ D+E YDGL+ A+G F+MYDEMG C+++MEL+GC PDH+TYN+LI
Sbjct: 190  DALRFIDEMKESGLELDSETYDGLLGAYGKFQMYDEMGECVKKMELEGCSPDHITYNILI 249

Query: 870  REFAKAGLLNMMERTHQALLTKKMDLQTSTIVAMLETYANFGIWAKMEKVFRRALRSKAF 1049
            +E+A+AGLL  ME+ +Q +++K+M +Q+ST+VAMLE Y  FG+  KME  +R+ L SK  
Sbjct: 250  QEYARAGLLQRMEKLYQRMVSKRMHVQSSTLVAMLEAYTTFGMVEKMENFYRKILSSKTC 309

Query: 1050 VEEDLVQKLARVYIENLMFSKLDDLGLDFSSKTGSTDLVWSLRLLSHACSLSKKGMYSII 1229
            +E+DL++K+A VYI+N MFS+L+DL LD     G ++LVW LRLLS+AC LSKKGM  ++
Sbjct: 310  LEDDLIRKVAEVYIKNYMFSRLEDLALDLCPAFGESNLVWCLRLLSYACPLSKKGMDIVV 369

Query: 1230 EEMESKEVPWNVTIANIMALAFLKMKDFKQLQVLLSELPSRCVKPDVITVGVLFDACMSG 1409
             EM   +V WNVT+ANI+ LA++KMKDF+ L++LLS+LP   V+PD+IT+G+LFDA   G
Sbjct: 370  REMRDAKVNWNVTVANIIMLAYVKMKDFRHLKILLSQLPIYRVQPDIITIGILFDATRIG 429

Query: 1410 FHGTLVKRTWARAGFFDDTVELNTDDLVLVAFAKGQFLRDFEELFEA---KTKKKGPWTY 1580
            F G+    TW R G+    VE+ TD LVL AF KG FL+  EE++ +   + +K+  WTY
Sbjct: 430  FDGSGALETWRRMGYLYRVVEIKTDSLVLTAFGKGHFLKSCEEVYSSLHPEDRKRKTWTY 489

Query: 1581 RHLIDLV 1601
              LI L+
Sbjct: 490  HDLIALL 496


>ref|XP_002321108.1| predicted protein [Populus trichocarpa] gi|222861881|gb|EEE99423.1|
            predicted protein [Populus trichocarpa]
          Length = 419

 Score =  437 bits (1125), Expect = e-120
 Identities = 220/419 (52%), Positives = 294/419 (70%), Gaps = 20/419 (4%)
 Frame = +3

Query: 405  EDGDWSKEEFWGVIKSLYLSSRTNEILQV-----------------FDSWKNVDKSRINV 533
            +DGDWSK++FW VIK L LS+R+N+ILQV                 F  W++V+K+RIN 
Sbjct: 1    QDGDWSKDDFWSVIKFLKLSARSNQILQVHSLAHLFFLAARKIEFVFHMWRDVEKTRINE 60

Query: 534  ENYEKIICYMFRDRLTDAAILAFQQLKSHGIQPSLEIYNLVINGFARIGRFDDALFYLRE 713
             NYEKII  +  + L + A+ AF ++KS G+  SLE+YN +I+G+AR G+FDDALFYL +
Sbjct: 61   FNYEKIIGLLGEEGLMEDAVTAFMEMKSFGLCLSLEVYNSIIHGYARNGKFDDALFYLNQ 120

Query: 714  MKNTGLQPDTEIYDGLIQAFGNFKMYDEMGRCLREMELDGCPPDHVTYNLLIREFAKAGL 893
            M    L P+++ YDGLI+A+G ++MYDEM  CL++MELDGC PD  TYNLLI++FA+ GL
Sbjct: 121  MNEMNLSPESDTYDGLIEAYGTYRMYDEMAMCLKKMELDGCSPDRYTYNLLIQKFAQGGL 180

Query: 894  LNMMERTHQALLTKKMDLQTSTIVAMLETYANFGIWAKMEKVFRRALRSKAFVEEDLVQK 1073
            L  MER +Q++ TK+M LQ+ST+++MLE YANFGI  KMEK+ R A  SK  V+EDLV+K
Sbjct: 181  LTRMERVYQSMRTKRMKLQSSTLISMLEAYANFGIVEKMEKILRWAWNSKITVKEDLVRK 240

Query: 1074 LARVYIENLMFSKLDDLGLDFSSKTGSTDLVWSLRLLSHACSLSKKGMYSIIEEMESKEV 1253
            LA VYI N MFS+L DL +D +S TG TD+VW L LLSHAC LS++GM +++ EME  + 
Sbjct: 241  LAGVYIANYMFSRLHDLAVDLTSITGRTDIVWCLHLLSHACLLSRRGMDAVVREMEDAKA 300

Query: 1254 PWNVTIANIMALAFLKMKDFKQLQVLLSELPSRCVKPDVITVGVLFDACMSGFHGTLVKR 1433
             WN+T+ANI+ LA+LKMKDF +L++LLS+LP   V+PD++T G+LFDA   GF G     
Sbjct: 301  CWNITVANIILLAYLKMKDFTRLRILLSKLPEIRVEPDIVTFGILFDAEEIGFDGKECLE 360

Query: 1434 TWARAGFFDDTVELNTDDLVLVAFAKGQFLRDFEELF---EAKTKKKGPWTYRHLIDLV 1601
             W + G     VE+NTD L L AF KG FLR  EE +   E   ++K  WTY   I+LV
Sbjct: 361  MWRKMGLLYRRVEMNTDPLALSAFGKGSFLRSCEEGYSSLEPNAREKKRWTYVDFINLV 419


>ref|XP_003538531.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190,
            chloroplastic-like [Glycine max]
          Length = 506

 Score =  425 bits (1092), Expect = e-116
 Identities = 207/414 (50%), Positives = 290/414 (70%), Gaps = 3/414 (0%)
 Frame = +3

Query: 369  EEKQLDSLEKLKEDGDWSKEEFWGVIKSLYLSSRTNEILQVFDSWKNVDKSRINVENYEK 548
            E +  + L  L ED DWSK+ FW V++ L  SS    ILQVFD WKN++KSRI+  NY K
Sbjct: 81   ENEYSNPLHMLAEDADWSKDHFWAVVRFLKSSSNFTHILQVFDMWKNIEKSRISEFNYNK 140

Query: 549  IICYMFRDRLTDAAILAFQQLKSHGIQPSLEIYNLVINGFARIGRFDDALFYLREMKNTG 728
            II  +        A+ A Q +K  GI+PSL+ YN +I+G +R G+F DAL ++ EMK +G
Sbjct: 141  IIGLLCEGGKMKDALSALQDMKVQGIKPSLDTYNPIIHGLSREGKFSDALRFIDEMKESG 200

Query: 729  LQPDTEIYDGLIQAFGNFKMYDEMGRCLREMELDGCPPDHVTYNLLIREFAKAGLLNMME 908
            L+ D+E YDGLI A+G F+MYDEMG C+++MEL+GC PD +TYN+LI+E+A  GLL  ME
Sbjct: 201  LELDSETYDGLIGAYGKFQMYDEMGECVKKMELEGCSPDPITYNILIQEYAGGGLLQRME 260

Query: 909  RTHQALLTKKMDLQTSTIVAMLETYANFGIWAKMEKVFRRALRSKAFVEEDLVQKLARVY 1088
            + +Q +L+K+M +++ST+VAMLE Y  FG+  KMEK +R+ L SK  +E+DL++K+A VY
Sbjct: 261  KLYQRMLSKRMHVKSSTLVAMLEAYTTFGMVEKMEKFYRKILNSKTCIEDDLIRKVAEVY 320

Query: 1089 IENLMFSKLDDLGLDFSSKTGSTDLVWSLRLLSHACSLSKKGMYSIIEEMESKEVPWNVT 1268
            I N MFS+L+DL LD     G ++L W  RLLS+AC LSKKGM  +++EM+  +V WNVT
Sbjct: 321  INNFMFSRLEDLALDLCPAFGESNLEWCFRLLSYACLLSKKGMDIVVQEMQDAKVSWNVT 380

Query: 1269 IANIMALAFLKMKDFKQLQVLLSELPSRCVKPDVITVGVLFDACMSGFHGTLVKRTWARA 1448
            +ANI+ LA++KMK+F+ L++LLS+LP   V+PD+IT+G+LFDA   GF G+    TW R 
Sbjct: 381  VANIIMLAYVKMKEFRHLRILLSQLPIYRVQPDIITIGILFDATRIGFDGSGALETWRRM 440

Query: 1449 GFFDDTVELNTDDLVLVAFAKGQFLRDFEELFEA---KTKKKGPWTYRHLIDLV 1601
            G+    VE+ TD LVL AF KG FL+  EE++ +   + +K+   TY  LI L+
Sbjct: 441  GYLYRVVEMKTDSLVLTAFGKGHFLKSCEEVYSSLHPEDRKRKTCTYHDLIPLL 494


Top