BLASTX nr result

ID: Cephaelis21_contig00012120 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00012120
         (3077 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002514778.1| pentatricopeptide repeat-containing protein,...   517   e-144
ref|XP_003546486.1| PREDICTED: pentatricopeptide repeat-containi...   472   e-130
ref|XP_003542095.1| PREDICTED: pentatricopeptide repeat-containi...   466   e-128
ref|NP_001143372.1| uncharacterized protein LOC100276004 [Zea ma...   367   1e-98
gb|ACN34333.1| unknown [Zea mays] gi|414879211|tpg|DAA56342.1| T...   362   4e-97

>ref|XP_002514778.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223545829|gb|EEF47332.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 584

 Score =  517 bits (1332), Expect = e-144
 Identities = 289/604 (47%), Positives = 384/604 (63%), Gaps = 3/604 (0%)
 Frame = -2

Query: 2989 FNSWGTLKLPSYPKLRFYSSDPTPENDEFYREAPINYVRNEFDGSPFGANEGFADDSVVS 2810
            F++W    +P    LRF S+     ND     +     +N +DG     +EG   D+ + 
Sbjct: 31   FSNW----VPCQQNLRFLSNLSVNTNDTEIDHSSHGSAQNNYDGD----DEGKVTDTHLH 82

Query: 2809 VDSRLGXXXXXXXXXXXEKAGNLLGGFNEVASQNGDIDET---GVSEGEGDNVVGADELE 2639
              S              E     +  F + +    +   T   G  EG  D V+ AD   
Sbjct: 83   NFSPQADINEVSHHYSSENGDTHMDNFVQRSPDLAEEANTQIHGEVEGHVDYVIDAD--- 139

Query: 2638 KTDKIEGMAQKLEDVLSLLQSSGYGKSIEPSLEDMGLALNEKYVLRVLETPFIPGENLIG 2459
                      KLE+VLSLLQSS    S+E SL++M L L+E ++++VLETP I G+NLI 
Sbjct: 140  ----------KLENVLSLLQSST-DASLESSLDNMDLHLHEDFIVKVLETPLIVGDNLIK 188

Query: 2458 FFKWVFRKNEILVTKEALDALVTAISNEFRARNAYALWDLVKEAGEKDIGVVSTETFNEL 2279
            FF W  ++ +I VT   +  LV AI +E R ++AYALWDLVK+ GE++  V++ +  N+L
Sbjct: 189  FFNWAIKQPDINVTTRLVHPLVRAICSELRKKDAYALWDLVKDIGEEENTVLNVDLLNQL 248

Query: 2278 LSLFSRLGKGKVAFEVFNKFEDFGCVPDADTYYFTIDALCKRMIFGWACSVCEKMVSADK 2099
            ++LFS+LGKGK AFEVFNKF DFGCVPD++TY++TI+ALC+R IF WA SV EKM+ A+ 
Sbjct: 249  IALFSKLGKGKAAFEVFNKFGDFGCVPDSETYHYTIEALCRRSIFDWASSVREKMLRAEA 308

Query: 2098 VPESKKVGKIVSHLCKGKKFRDAHTVYLWAKERKIYPSRSSVNFLIRSLCGKDKLMGEDK 1919
            +P+++K+GKI+   CKG K  DA+ VYL AKE+  YP + SVNFLI  LC K+       
Sbjct: 309  LPDTEKIGKIICWFCKGDKANDAYLVYLLAKEKNKYPPQPSVNFLIGLLCQKN------- 361

Query: 1918 GRTKNETYSPSQAEVIAQEEKENVYLALKMLDDFSEEERKHAIKPFSFVIKGLLRIRDFG 1739
                                 E V LAL+MLD FS  +RK+AIKPFS VI+ L RI+D  
Sbjct: 362  ---------------------ETVKLALEMLDAFSGPKRKYAIKPFSSVIRALCRIKDLD 400

Query: 1738 GAKKLLFQMIEAGPPPGNMIFNTIIGSLSKAGDMQEAMVTLKIMEDRGLKPDVYSYTVVI 1559
            GAK LL +M++ GPPPGN +FN+II   SK GDM+EA+   ++M  RGLKPD+++Y V++
Sbjct: 401  GAKMLLSKMVDEGPPPGNAVFNSIINGYSKCGDMKEAIKMKQLMVRRGLKPDLFTYAVIM 460

Query: 1558 SGYVEGGAMEEACQVFGEAKKRHSKLSHVTYHSMIRGYCKLEQFEKALELMEEMKNSCLQ 1379
            SGY  GG MEEAC+V  EAKK+HSKLS V YH++IRGYCKLEQF+KAL+L+ EMK   +Q
Sbjct: 461  SGYASGGQMEEACKVLSEAKKKHSKLSPVMYHTVIRGYCKLEQFDKALDLLAEMKTFGVQ 520

Query: 1378 PNADEYNKMIKSLCVKALDWETAAKLLEEMSESGLHVNEKKRCLVRAVKELQDEIVESEA 1199
             NADEYNK+I+SLC+KALDWE A KLLE+M E GLH+N   R L+RAVKEL+DE +E E 
Sbjct: 521  ANADEYNKLIQSLCLKALDWERAEKLLEKMKEDGLHLNGITRGLIRAVKELEDEGIEKEV 580

Query: 1198 VAAA 1187
             A A
Sbjct: 581  GAEA 584


>ref|XP_003546486.1| PREDICTED: pentatricopeptide repeat-containing protein At3g02650,
            mitochondrial-like isoform 1 [Glycine max]
            gi|356556346|ref|XP_003546487.1| PREDICTED:
            pentatricopeptide repeat-containing protein At3g02650,
            mitochondrial-like isoform 2 [Glycine max]
          Length = 538

 Score =  472 bits (1215), Expect = e-130
 Identities = 248/506 (49%), Positives = 340/506 (67%), Gaps = 2/506 (0%)
 Frame = -2

Query: 2698 DETGVSEGEGDNVVGADELEKTDKIEGMAQKLEDVLSLLQSSGYGKSIEPSLEDMGLALN 2519
            D     EGEG    G       D+ E  + KLE VL LLQ+S  G S+E  L+D+ L L+
Sbjct: 68   DRQLAEEGEGAGGGG-------DRYEVDSDKLESVLRLLQTSADG-SLESCLDDIDLTLH 119

Query: 2518 EKYVLRVLETPFIPGENLIGFFKWVFRKNEILVTKEALDALVTAI-SNEFRARNA-YALW 2345
            ++ V ++ ETPF+  ENLI FF W + +  + VT   +++LV AI  N+ R +   Y+LW
Sbjct: 120  QQLVTKITETPFVLSENLIRFFWWAWSERSLGVTTPMVESLVLAICGNDVRKKEVVYSLW 179

Query: 2344 DLVKEAGEKDIGVVSTETFNELLSLFSRLGKGKVAFEVFNKFEDFGCVPDADTYYFTIDA 2165
            DLVKE GEK+ G+++ +  NEL+S F RLGKGK A EVFNKFE F CVPDADTYYFTI+A
Sbjct: 180  DLVKEIGEKESGILNVKILNELISSFLRLGKGKAALEVFNKFEAFHCVPDADTYYFTIEA 239

Query: 2164 LCKRMIFGWACSVCEKMVSADKVPESKKVGKIVSHLCKGKKFRDAHTVYLWAKERKIYPS 1985
            LC+R    WAC VC+KMV A  +P+ +KVG I+S LCKGKK ++AH VY+ A E+   P 
Sbjct: 240  LCRRRALDWACGVCQKMVDAQILPDGEKVGAILSWLCKGKKAKEAHGVYVVATEKGKQPP 299

Query: 1984 RSSVNFLIRSLCGKDKLMGEDKGRTKNETYSPSQAEVIAQEEKENVYLALKMLDDFSEEE 1805
             + V+FL+  LCG+D                            E V  AL+ML+D  EE+
Sbjct: 300  VNVVSFLVVKLCGED----------------------------ETVKFALEMLEDIPEEK 331

Query: 1804 RKHAIKPFSFVIKGLLRIRDFGGAKKLLFQMIEAGPPPGNMIFNTIIGSLSKAGDMQEAM 1625
            R+ AIKPF  V++ L RI++   AK+L+ +MIE GPPPGN +FN ++ + SKAG+M +A+
Sbjct: 332  RERAIKPFLAVVRALCRIKEVDKAKELVLKMIEDGPPPGNAVFNFVVTAYSKAGEMGKAV 391

Query: 1624 VTLKIMEDRGLKPDVYSYTVVISGYVEGGAMEEACQVFGEAKKRHSKLSHVTYHSMIRGY 1445
              +++ME RGL+PDVY+YTV+ S Y  GG MEEA ++  E KK+H+KL  V +H++IRGY
Sbjct: 392  EMMRLMESRGLRPDVYTYTVLASAYSNGGEMEEAQKILAEVKKKHAKLGPVMFHTLIRGY 451

Query: 1444 CKLEQFEKALELMEEMKNSCLQPNADEYNKMIKSLCVKALDWETAAKLLEEMSESGLHVN 1265
            CKLEQF++AL+L+ EMK+  + P+ DEY+K+I+SLC+KALDWE A KL EEM ESGLH+ 
Sbjct: 452  CKLEQFDEALKLLAEMKDYGVHPSVDEYDKLIQSLCLKALDWEMAEKLHEEMKESGLHLK 511

Query: 1264 EKKRCLVRAVKELQDEIVESEAVAAA 1187
               R L+RAVKE++ E+VE+ ++ AA
Sbjct: 512  GITRGLIRAVKEMEKEVVEAGSITAA 537


>ref|XP_003542095.1| PREDICTED: pentatricopeptide repeat-containing protein At3g02650,
            mitochondrial-like [Glycine max]
          Length = 539

 Score =  466 bits (1200), Expect = e-128
 Identities = 241/485 (49%), Positives = 331/485 (68%), Gaps = 4/485 (0%)
 Frame = -2

Query: 2632 DKIEGMAQKLEDVLSLLQSSGYGKSIEPSLEDMGLALNEKYVLRVLETPFIPGENLIGFF 2453
            D  E  +  LE VL LLQ+S  G S+E  L+DM L L+++ V ++ ETPF+  ENLI FF
Sbjct: 82   DTYEVDSDTLESVLRLLQTSADG-SLESCLDDMDLTLHQQLVTKITETPFVLSENLIRFF 140

Query: 2452 KWVFRKNEILVTKEALDALVTAISNEFRARN----AYALWDLVKEAGEKDIGVVSTETFN 2285
             W + +  + VT   +++LV AI      R      Y+LWDLVKE GEK+ G+++    N
Sbjct: 141  WWAWSERSLEVTTPMVESLVLAICGNDDVRKKKEVVYSLWDLVKEIGEKESGLLNVRILN 200

Query: 2284 ELLSLFSRLGKGKVAFEVFNKFEDFGCVPDADTYYFTIDALCKRMIFGWACSVCEKMVSA 2105
            EL+S FSRL KGK A EVF+KFE F CVPDADTYYFTI+ALC+R  F WAC VC+KMV A
Sbjct: 201  ELISSFSRLRKGKAALEVFDKFEAFHCVPDADTYYFTIEALCRRRAFDWACGVCQKMVDA 260

Query: 2104 DKVPESKKVGKIVSHLCKGKKFRDAHTVYLWAKERKIYPSRSSVNFLIRSLCGKDKLMGE 1925
              +P+++KVG I+S LCKGKK ++AH VY+ A E+   P  + V+FL+  LCG+D     
Sbjct: 261  RTLPDAEKVGAILSWLCKGKKAKEAHGVYVVATEKGKLPPVNVVSFLVLKLCGED----- 315

Query: 1924 DKGRTKNETYSPSQAEVIAQEEKENVYLALKMLDDFSEEERKHAIKPFSFVIKGLLRIRD 1745
                                   E V  AL++L+D  EE+R+ AIKPF  V++ L RI++
Sbjct: 316  -----------------------ETVKSALEILEDIPEEKRERAIKPFLAVVRALCRIKE 352

Query: 1744 FGGAKKLLFQMIEAGPPPGNMIFNTIIGSLSKAGDMQEAMVTLKIMEDRGLKPDVYSYTV 1565
               AK+LL +MIE GPPPGN +FN ++ + SKAG+M +A+  +++ME RGL+PDVY+YTV
Sbjct: 353  VDKAKELLLKMIENGPPPGNAVFNFVVTAYSKAGEMGKAVEMMRLMESRGLRPDVYTYTV 412

Query: 1564 VISGYVEGGAMEEACQVFGEAKKRHSKLSHVTYHSMIRGYCKLEQFEKALELMEEMKNSC 1385
            + S Y  GG MEEA ++  EAKK+H KL  V +H++IRGYCKLEQF++AL+L+ EMK+  
Sbjct: 413  LASAYSNGGEMEEAQKILAEAKKKHVKLGPVMFHTLIRGYCKLEQFDEALKLLAEMKDYG 472

Query: 1384 LQPNADEYNKMIKSLCVKALDWETAAKLLEEMSESGLHVNEKKRCLVRAVKELQDEIVES 1205
            ++P+ DEY+K+I+SLC+KALDW+ A KL EEM ESGLH+    R L+RAVKE++ E+VE+
Sbjct: 473  VRPSVDEYDKLIQSLCLKALDWKMAEKLQEEMKESGLHLKGITRGLIRAVKEMEKEVVEA 532

Query: 1204 EAVAA 1190
            E++ A
Sbjct: 533  ESITA 537


>ref|NP_001143372.1| uncharacterized protein LOC100276004 [Zea mays]
            gi|195619158|gb|ACG31409.1| hypothetical protein [Zea
            mays]
          Length = 597

 Score =  367 bits (941), Expect = 1e-98
 Identities = 229/635 (36%), Positives = 347/635 (54%), Gaps = 16/635 (2%)
 Frame = -2

Query: 3055 VLQSRETLPPPLALFRYRNFWRFNSWGTLKLPSYPKLRFYSSDPTPENDEFYREAPINYV 2876
            +L+S  T PPP     Y          T  L   P  RF SS P P  D     A     
Sbjct: 11   LLRSTITRPPPPPPQPYP---------TRTLTRVPPPRFLSSSPDPIPDSSSAAA----- 56

Query: 2875 RNEFDGSPFGANEGFADDSVVSVDSRLGXXXXXXXXXXXEKAGNLLGGFNEVASQNGDID 2696
                   PF   E F+  +  S D+               +AG      + +  + GD D
Sbjct: 57   -----ADPFP--EAFSSPTKASQDAA--------------EAGE--DNLSSMWEEAGDAD 93

Query: 2695 ETGVSEGEGDNVVGADELEKTDKIEGMAQKLEDVLSLLQSSGYGKSIEPSLEDMGLALNE 2516
            +   S G  D V   +E+ +             + + ++S+   + I  +L DM +  NE
Sbjct: 94   DIFASPGSADAVADDEEVAR-------------ICAAVESTPEDE-IASTLADMTVDFNE 139

Query: 2515 KYVLRVL-ETPFIPGENLIGFFKWVFRKNEILVTKEALDALVTAISN--EFRARNAYALW 2345
              +  VL        + LI  F +  + N    +   L+ LV+ +++  E    +AY LW
Sbjct: 140  PLLAAVLLAADQCSCKKLISLFNYAAKNNPTSKSLSNLEVLVSKLADSAEIDKADAYLLW 199

Query: 2344 DLVKEAGEKDIGVVSTETFNELLSLFSRLGKGKVAFEVFNKFEDFGCVPDADTYYFTIDA 2165
            D +KE G    G VST   NE++++F +L K K A EVF+KF++FGC PD+D+YY  I+A
Sbjct: 200  DSIKEIGSVP-GSVSTPLLNEMIAIFWKLEKSKAALEVFSKFDEFGCTPDSDSYYLVIEA 258

Query: 2164 LCKRMIFGWACSVCEKMVSADKVPESKKVGKIVSHLCKGKKFRDAHTVYLWAKERKIYPS 1985
              K+ +F  AC VCEKM+ +   P  +KVG+I+ +LC+GKK + AH++YL  KE+KI   
Sbjct: 259  ARKKSLFRSACEVCEKMIGSACFPNGEKVGRILIYLCEGKKVKMAHSLYLAVKEKKIPVP 318

Query: 1984 RSSVNFLIRSLCGKDKLMG-------EDKGRTKNE------TYSPSQAEVIAQEEKENVY 1844
            + +++FL+ +L   D+ +G       E +G +         T   +   +   E+  N+ 
Sbjct: 319  KLALDFLVGALARNDETIGTALELLEEYQGESLKHAGKSFATVVHALCRLSKMEDANNLL 378

Query: 1843 LALKMLDDFSEEERKHAIKPFSFVIKGLLRIRDFGGAKKLLFQMIEAGPPPGNMIFNTII 1664
            + +  L+++  E  K+A K F+ VI GL R +    AK LL +M+  GP PGN +FN +I
Sbjct: 379  MRMVQLEEYKGESLKNAGKTFATVIHGLCRKKKLEDAKALLMRMVNVGPAPGNAVFNFVI 438

Query: 1663 GSLSKAGDMQEAMVTLKIMEDRGLKPDVYSYTVVISGYVEGGAMEEACQVFGEAKKRHSK 1484
             +LSK G+M++A   +++ME +G+ PD+Y+Y+V++SGYV+GG ++EA  +  EAKK H K
Sbjct: 439  TALSKQGEMEDAKGLMRMMESQGISPDIYTYSVLMSGYVKGGMIDEAHDLLREAKKIHPK 498

Query: 1483 LSHVTYHSMIRGYCKLEQFEKALELMEEMKNSCLQPNADEYNKMIKSLCVKALDWETAAK 1304
            L+ V YH +IRGYCK+E FEKA E ++EMK   LQPN DEY+K+I+SLC+KA+DW  A K
Sbjct: 499  LNRVAYHILIRGYCKMEDFEKANECLKEMKKDGLQPNVDEYDKLIQSLCLKAMDWRRAEK 558

Query: 1303 LLEEMSESGLHVNEKKRCLVRAVKELQDEIVESEA 1199
            LLEEM +SGL +    R L+ AVKEL+ E ++S+A
Sbjct: 559  LLEEMEDSGLCLRGISRSLITAVKELEGEEMQSKA 593


>gb|ACN34333.1| unknown [Zea mays] gi|414879211|tpg|DAA56342.1| TPA: hypothetical
            protein ZEAMMB73_618544 [Zea mays]
          Length = 598

 Score =  362 bits (928), Expect = 4e-97
 Identities = 228/635 (35%), Positives = 344/635 (54%), Gaps = 16/635 (2%)
 Frame = -2

Query: 3055 VLQSRETLPPPLALFRYRNFWRFNSWGTLKLPSYPKLRFYSSDPTPENDEFYREAPINYV 2876
            +L+S  T PPP     Y          T  L   P  RF SS P P  D     A     
Sbjct: 12   LLRSTITRPPPPPPQPYP---------TRTLTRVPPPRFLSSSPDPIPDSSSAAA----- 57

Query: 2875 RNEFDGSPFGANEGFADDSVVSVDSRLGXXXXXXXXXXXEKAGNLLGGFNEVASQNGDID 2696
                   PF   E F+  +  S D+               +AG      + +  + G  D
Sbjct: 58   -----ADPFP--EAFSSPTKASQDAA--------------EAGE--DNLSSMWEEAGHAD 94

Query: 2695 ETGVSEGEGDNVVGADELEKTDKIEGMAQKLEDVLSLLQSSGYGKSIEPSLEDMGLALNE 2516
            +   S G  D V   +E+ +             V + ++S+   + I  +L DM +  NE
Sbjct: 95   DIFASPGSADAVADDEEVAR-------------VCAAVESTPEDE-IASTLADMTVDFNE 140

Query: 2515 KYVLRVL-ETPFIPGENLIGFFKWVFRKNEILVTKEALDALVTAISN--EFRARNAYALW 2345
              +  VL        + LI  F +  + N    +   L+ LV+ +++  E    +AY LW
Sbjct: 141  PLLAAVLLAAEQCSCKKLISLFNYAAKNNPASKSLSNLEVLVSKLADSAEIDKADAYLLW 200

Query: 2344 DLVKEAGEKDIGVVSTETFNELLSLFSRLGKGKVAFEVFNKFEDFGCVPDADTYYFTIDA 2165
            D +KE G    G VST   NE++++F ++ K K A EVF+KF++FGC PD+D+YY  I+A
Sbjct: 201  DSIKEIGSVS-GSVSTPLLNEMIAIFWKVEKSKAALEVFSKFDEFGCTPDSDSYYLVIEA 259

Query: 2164 LCKRMIFGWACSVCEKMVSADKVPESKKVGKIVSHLCKGKKFRDAHTVYLWAKERKIYPS 1985
              K+ +F  AC VCEKM+ +   P  +KVG+I+ +LC+GKK + AH++YL  KE+KI   
Sbjct: 260  ARKKSLFRSACEVCEKMIGSACFPNGEKVGRILIYLCEGKKVKMAHSLYLAVKEKKIPVP 319

Query: 1984 RSSVNFLIRSLCGKDKLMG-------EDKGRTKNETYSPSQAEVIA------QEEKENVY 1844
            + +++FL+ +L   D+ +G       E +G +           V A       E+  N+ 
Sbjct: 320  KLALDFLVGALARNDETIGTALELLEEYQGESLKHAGKSFATVVHALCRLNKMEDANNLL 379

Query: 1843 LALKMLDDFSEEERKHAIKPFSFVIKGLLRIRDFGGAKKLLFQMIEAGPPPGNMIFNTII 1664
            + +  L+++  E  K+A K F+ VI GL R +    AK LL +M+  GP PGN +FN +I
Sbjct: 380  MRMVQLEEYKGESLKNAGKTFATVIHGLCRKKKLEDAKALLMRMVNVGPAPGNAVFNFVI 439

Query: 1663 GSLSKAGDMQEAMVTLKIMEDRGLKPDVYSYTVVISGYVEGGAMEEACQVFGEAKKRHSK 1484
             +LSK G+M++A   +++ME +G+ PD+Y+Y+V++SGY +GG ++EA  +  EAKK H K
Sbjct: 440  TALSKQGEMEDAKGLMRMMESQGISPDIYTYSVLMSGYAKGGMIDEAHDLLREAKKIHPK 499

Query: 1483 LSHVTYHSMIRGYCKLEQFEKALELMEEMKNSCLQPNADEYNKMIKSLCVKALDWETAAK 1304
            L+ V YH +IRGYCK+E FEKA E ++EMK   LQPN DEY+K+I+SLC+KA+DW  A K
Sbjct: 500  LNRVAYHILIRGYCKMEDFEKANECLKEMKKDGLQPNVDEYDKLIQSLCLKAMDWRRAEK 559

Query: 1303 LLEEMSESGLHVNEKKRCLVRAVKELQDEIVESEA 1199
            LLEEM +SGL +    R L+ AVKEL+ E ++S+A
Sbjct: 560  LLEEMEDSGLCLRGISRSLITAVKELEGEEMQSKA 594

