BLASTX nr result

ID: Cephaelis21_contig00027800

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00027800
         (1757 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN83162.1| hypothetical protein VITISV_022557 [Vitis vinifera]   615   e-175
ref|XP_002280974.1| PREDICTED: pentatricopeptide repeat-containi...   615   e-174
dbj|BAJ53121.1| JHL07K02.11 [Jatropha curcas]                         603   e-171
ref|XP_003534842.1| PREDICTED: pentatricopeptide repeat-containi...   590   e-169
ref|NP_199912.2| pentatricopeptide repeat-containing protein [Ar...   563   e-158

>emb|CAN83162.1| hypothetical protein VITISV_022557 [Vitis vinifera]
          Length = 562

 Score =  615 bits (1585), Expect(2) = e-175
 Identities = 289/440 (65%), Positives = 357/440 (81%)
 Frame = -3

Query: 1470 WNVDIVSANSMISNFLKIGEISIAKKIFHNVPSRDVVTWNSLIGGLVKNACFQEALGAFG 1291
            W  D+++AN +I++ +K+GE   AK++F  +  RDVVTWNS+IGG V+N  F+EAL  F 
Sbjct: 123  WGFDLITANLIIASLMKVGEFDFAKRVFRKMLRRDVVTWNSMIGGCVRNERFEEALRFFR 182

Query: 1290 EMLKTKIEPDRYTFASTLTAVARLGVLDHAKWIHGMMVEKRIELNYILASALIDMYAKCG 1111
            EML + +EPD +TFAS +   ARLG   HA+ +HG+M+EK+I+LN+IL+SALID+Y+KCG
Sbjct: 183  EMLNSNVEPDGFTFASVINGCARLGSSHHAELVHGLMIEKKIQLNFILSSALIDLYSKCG 242

Query: 1110 KINAAKEIFNSVHRADVSIWNAMINGLAIHGLAFDAIAIFSMMKAENVSPDSITFTGILT 931
            +IN AK++FNS+   DVS+WN+MINGLAIHGLA DAI +FS M+ E+VSPDSITF GILT
Sbjct: 243  RINTAKKVFNSIQHDDVSVWNSMINGLAIHGLALDAIGVFSQMEMESVSPDSITFIGILT 302

Query: 930  ACSHCGLVEQGRELYGLMRTCYLIQPQLEHYGAMVVLLSRAGLLEEAYAVIREMTVEPDI 751
            ACSHCGLVEQGR  + LMR  Y IQPQLEHYGAMV LL RAGL+EEAYA+I+ M +EPDI
Sbjct: 303  ACSHCGLVEQGRRYFDLMRRHYSIQPQLEHYGAMVDLLGRAGLVEEAYAMIKAMPMEPDI 362

Query: 750  IIWRALLGACRMHKNSELAEIAISKIQHLGCGDYTLLSNTYCSVNRWESSEKVRYTMKEK 571
            +IWRALL ACR  KN EL E+AI+KI HL  GDY LLSN YCS+ +W+S+E+VR  MK  
Sbjct: 363  VIWRALLSACRNFKNPELGEVAIAKISHLNSGDYILLSNMYCSLEKWDSAERVREMMKRD 422

Query: 570  SVRKRSGKSWLEIGGIIHQFSAGGKSHIEASKIYKVLEALIQRTRREGFVSDTDLVLMDV 391
             VRK  G+SW+E+GG+IHQF AG +SH E   IYKVLE LI+RT+ EGF+  TDLVLMDV
Sbjct: 423  GVRKNRGRSWVELGGVIHQFKAGDRSHPETGAIYKVLEGLIRRTKLEGFMPATDLVLMDV 482

Query: 390  SEEEKEENLNYHSEKWALAYGILQTSPGTEILISKNLRTCSDCHSWMKIVSKVLNRVITV 211
            S+EE+EENLN HSEK ALAY IL+TSPGTEI +SKNLRTC DCH WMKI+S++L+RVI V
Sbjct: 483  SDEEREENLNSHSEKLALAYVILKTSPGTEIRVSKNLRTCHDCHCWMKILSRLLSRVIIV 542

Query: 210  RDRIRFHRFEGGTCTCRDYW 151
            RDRIRFH+FEGG C+CRDYW
Sbjct: 543  RDRIRFHQFEGGLCSCRDYW 562



 Score = 27.7 bits (60), Expect(2) = e-175
 Identities = 17/51 (33%), Positives = 26/51 (50%), Gaps = 2/51 (3%)
 Frame = -1

Query: 1613 SNPRVHNTTSKSHH--SDAGYQGLLRVLEACKVVPNLGTATAVHAKIVIHG 1467
            S+P  H+ +  +     +  +Q L  +LEACK   +  TA   HAKI+  G
Sbjct: 39   SSPTCHDFSGTTDMVIQERDHQKLNCILEACKFSSDFRTAFQSHAKIIKFG 89


>ref|XP_002280974.1| PREDICTED: pentatricopeptide repeat-containing protein At5g50990
            [Vitis vinifera]
          Length = 523

 Score =  615 bits (1585), Expect(2) = e-174
 Identities = 289/440 (65%), Positives = 357/440 (81%)
 Frame = -3

Query: 1470 WNVDIVSANSMISNFLKIGEISIAKKIFHNVPSRDVVTWNSLIGGLVKNACFQEALGAFG 1291
            W  D+++AN +I++ +K+GE   AK++F  +  RDVVTWNS+IGG V+N  F+EAL  F 
Sbjct: 84   WGFDLITANLIIASLMKVGEFDFAKRVFRKMLRRDVVTWNSMIGGCVRNERFEEALRFFR 143

Query: 1290 EMLKTKIEPDRYTFASTLTAVARLGVLDHAKWIHGMMVEKRIELNYILASALIDMYAKCG 1111
            EML + +EPD +TFAS +   ARLG   HA+ +HG+M+EK+I+LN+IL+SALID+Y+KCG
Sbjct: 144  EMLNSNVEPDGFTFASVINGCARLGSSHHAELVHGLMIEKKIQLNFILSSALIDLYSKCG 203

Query: 1110 KINAAKEIFNSVHRADVSIWNAMINGLAIHGLAFDAIAIFSMMKAENVSPDSITFTGILT 931
            +IN AK++FNS+   DVS+WN+MINGLAIHGLA DAI +FS M+ E+VSPDSITF GILT
Sbjct: 204  RINTAKKVFNSIQHDDVSVWNSMINGLAIHGLALDAIGVFSQMEMESVSPDSITFIGILT 263

Query: 930  ACSHCGLVEQGRELYGLMRTCYLIQPQLEHYGAMVVLLSRAGLLEEAYAVIREMTVEPDI 751
            ACSHCGLVEQGR  + LMR  Y IQPQLEHYGAMV LL RAGL+EEAYA+I+ M +EPDI
Sbjct: 264  ACSHCGLVEQGRRYFDLMRRHYSIQPQLEHYGAMVDLLGRAGLVEEAYAMIKAMPMEPDI 323

Query: 750  IIWRALLGACRMHKNSELAEIAISKIQHLGCGDYTLLSNTYCSVNRWESSEKVRYTMKEK 571
            +IWRALL ACR  KN EL E+AI+KI HL  GDY LLSN YCS+ +W+S+E+VR  MK  
Sbjct: 324  VIWRALLSACRNFKNPELGEVAIAKISHLNSGDYILLSNMYCSLEKWDSAERVREMMKRD 383

Query: 570  SVRKRSGKSWLEIGGIIHQFSAGGKSHIEASKIYKVLEALIQRTRREGFVSDTDLVLMDV 391
             VRK  G+SW+E+GG+IHQF AG +SH E   IYKVLE LI+RT+ EGF+  TDLVLMDV
Sbjct: 384  GVRKNRGRSWVELGGVIHQFKAGDRSHPETGAIYKVLEGLIRRTKLEGFMPATDLVLMDV 443

Query: 390  SEEEKEENLNYHSEKWALAYGILQTSPGTEILISKNLRTCSDCHSWMKIVSKVLNRVITV 211
            S+EE+EENLN HSEK ALAY IL+TSPGTEI +SKNLRTC DCH WMKI+S++L+RVI V
Sbjct: 444  SDEEREENLNSHSEKLALAYVILKTSPGTEIRVSKNLRTCHDCHCWMKILSRLLSRVIIV 503

Query: 210  RDRIRFHRFEGGTCTCRDYW 151
            RDRIRFH+FEGG C+CRDYW
Sbjct: 504  RDRIRFHQFEGGLCSCRDYW 523



 Score = 26.9 bits (58), Expect(2) = e-174
 Identities = 14/31 (45%), Positives = 18/31 (58%)
 Frame = -1

Query: 1559 YQGLLRVLEACKVVPNLGTATAVHAKIVIHG 1467
            +Q L  +LEACK   +  TA   HAKI+  G
Sbjct: 20   HQKLNCILEACKFSSDFRTAFQSHAKIIKFG 50


>dbj|BAJ53121.1| JHL07K02.11 [Jatropha curcas]
          Length = 514

 Score =  603 bits (1555), Expect(2) = e-171
 Identities = 280/442 (63%), Positives = 356/442 (80%)
 Frame = -3

Query: 1476 YTWNVDIVSANSMISNFLKIGEISIAKKIFHNVPSRDVVTWNSLIGGLVKNACFQEALGA 1297
            ++W V +   N +I +F++IGE  IAKK+F  +P RDVVTWNS+IGG V+N  F+EAL +
Sbjct: 73   FSWTVSLAGLNLVIDSFMRIGEYEIAKKVFCKMPDRDVVTWNSMIGGYVRNGKFEEALRS 132

Query: 1296 FGEMLKTKIEPDRYTFASTLTAVARLGVLDHAKWIHGMMVEKRIELNYILASALIDMYAK 1117
            F  ML + +EPD++TFAS +TA ARLG L+HA+W+H +MV+KRIE+N+IL+SALIDMY+K
Sbjct: 133  FQAMLSSNVEPDKFTFASVITACARLGALNHAQWLHDLMVQKRIEVNFILSSALIDMYSK 192

Query: 1116 CGKINAAKEIFNSVHRADVSIWNAMINGLAIHGLAFDAIAIFSMMKAENVSPDSITFTGI 937
            CG+I  AKE+F SV R DVS+WN++INGLA+HGLA DA+ +FS M+AENV PDS+TF GI
Sbjct: 193  CGRIETAKEVFESVERNDVSVWNSLINGLAVHGLALDAMMVFSKMEAENVLPDSLTFLGI 252

Query: 936  LTACSHCGLVEQGRELYGLMRTCYLIQPQLEHYGAMVVLLSRAGLLEEAYAVIREMTVEP 757
            L ACSHCGLV++GR+ + LM   Y I+PQLEHYGAMV LL RAGLL+EAYA+I  M +EP
Sbjct: 253  LKACSHCGLVKEGRKYFDLMENYYSIKPQLEHYGAMVDLLGRAGLLDEAYAMITAMPMEP 312

Query: 756  DIIIWRALLGACRMHKNSELAEIAISKIQHLGCGDYTLLSNTYCSVNRWESSEKVRYTMK 577
            D+I+WR LL ACR H+N+EL E+A++ I     GDY LLSN YCS NRW+++++VR  MK
Sbjct: 313  DVIVWRILLSACRTHRNTELGEVAVANISGPKSGDYVLLSNIYCSQNRWDNAQEVREMMK 372

Query: 576  EKSVRKRSGKSWLEIGGIIHQFSAGGKSHIEASKIYKVLEALIQRTRREGFVSDTDLVLM 397
            E+ VRKR GKSW E   ++H+F AG KSH E   +YK+LE LIQRT+ EGFV  T+LV+M
Sbjct: 373  EEGVRKRRGKSWFEWEDVVHRFRAGDKSHPETEALYKILEGLIQRTKLEGFVPSTELVMM 432

Query: 396  DVSEEEKEENLNYHSEKWALAYGILQTSPGTEILISKNLRTCSDCHSWMKIVSKVLNRVI 217
            DVSEEEKEENL +HSEK ALAYGIL+TSPGTEI I KNLR C DCH+W+K+VS +L+RVI
Sbjct: 433  DVSEEEKEENLYHHSEKLALAYGILKTSPGTEIRIYKNLRICYDCHNWIKMVSGLLSRVI 492

Query: 216  TVRDRIRFHRFEGGTCTCRDYW 151
             +RDRIRFHRFEGG+C+C DYW
Sbjct: 493  IIRDRIRFHRFEGGSCSCGDYW 514



 Score = 27.3 bits (59), Expect(2) = e-171
 Identities = 11/25 (44%), Positives = 17/25 (68%)
 Frame = -1

Query: 1541 VLEACKVVPNLGTATAVHAKIVIHG 1467
            +LEACK+  ++ TAT  H +I+  G
Sbjct: 17   LLEACKLSQDIRTATETHTRIIRFG 41


>ref|XP_003534842.1| PREDICTED: pentatricopeptide repeat-containing protein At5g50990-like
            [Glycine max]
          Length = 569

 Score =  590 bits (1521), Expect(2) = e-169
 Identities = 280/438 (63%), Positives = 349/438 (79%)
 Frame = -3

Query: 1464 VDIVSANSMISNFLKIGEISIAKKIFHNVPSRDVVTWNSLIGGLVKNACFQEALGAFGEM 1285
            +D+ S N +I + +K G+  IAKK+F  +  RDVVTWNS+IGG V+N  F +AL  F  M
Sbjct: 132  LDLFSMNLVIESLVKGGQCDIAKKVFGKMSVRDVVTWNSMIGGYVRNLRFFDALSIFRRM 191

Query: 1284 LKTKIEPDRYTFASTLTAVARLGVLDHAKWIHGMMVEKRIELNYILASALIDMYAKCGKI 1105
            L  K+EPD +TFAS +TA ARLG L +AKW+HG+MVEKR+ELNYIL++ALIDMYAKCG+I
Sbjct: 192  LSAKVEPDGFTFASVVTACARLGALGNAKWVHGLMVEKRVELNYILSAALIDMYAKCGRI 251

Query: 1104 NAAKEIFNSVHRADVSIWNAMINGLAIHGLAFDAIAIFSMMKAENVSPDSITFTGILTAC 925
            + ++++F  V R  VS+WNAMI+GLAIHGLA DA  +FS M+ E+V PDSITF GILTAC
Sbjct: 252  DVSRQVFEEVARDHVSVWNAMISGLAIHGLAMDATLVFSRMEMEHVLPDSITFIGILTAC 311

Query: 924  SHCGLVEQGRELYGLMRTCYLIQPQLEHYGAMVVLLSRAGLLEEAYAVIREMTVEPDIII 745
            SHCGLVE+GR+ +G+M+  ++IQPQLEHYG MV LL RAGL+EEAYAVI+EM +EPDI+I
Sbjct: 312  SHCGLVEEGRKYFGMMQNRFMIQPQLEHYGTMVDLLGRAGLMEEAYAVIKEMRMEPDIVI 371

Query: 744  WRALLGACRMHKNSELAEIAISKIQHLGCGDYTLLSNTYCSVNRWESSEKVRYTMKEKSV 565
            WRALL ACR+H+  EL E+AI+ I  L  GD+ LLSN YCS+N W+ +E+VR  MK + V
Sbjct: 372  WRALLSACRIHRKKELGEVAIANISRLESGDFVLLSNMYCSLNNWDGAERVRRMMKTRGV 431

Query: 564  RKRSGKSWLEIGGIIHQFSAGGKSHIEASKIYKVLEALIQRTRREGFVSDTDLVLMDVSE 385
            RK  GKSW+E+G  IHQF+A  +SH E   IY+VLE LIQR + EGF   TDLVLMDVSE
Sbjct: 432  RKSRGKSWVELGDGIHQFNAAYQSHPEMKSIYRVLEGLIQRAKLEGFTPLTDLVLMDVSE 491

Query: 384  EEKEENLNYHSEKWALAYGILQTSPGTEILISKNLRTCSDCHSWMKIVSKVLNRVITVRD 205
            EEKEENL +HSEK A+AY +L+TSPGT+I ISKNLR C DCH+W+KIVSK+LNR I VRD
Sbjct: 492  EEKEENLMFHSEKLAMAYAVLKTSPGTKIRISKNLRICLDCHNWIKIVSKILNRKIIVRD 551

Query: 204  RIRFHRFEGGTCTCRDYW 151
            RIRFH+FEGG C+C+DYW
Sbjct: 552  RIRFHQFEGGVCSCKDYW 569



 Score = 32.7 bits (73), Expect(2) = e-169
 Identities = 15/28 (53%), Positives = 20/28 (71%)
 Frame = -1

Query: 1550 LLRVLEACKVVPNLGTATAVHAKIVIHG 1467
            L RVLE C+V  +L TAT  HA++V+ G
Sbjct: 73   LHRVLERCRVSTDLKTATKTHARVVVLG 100


>ref|NP_199912.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|223635749|sp|Q9FI49.2|PP428_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At5g50990 gi|332008636|gb|AED96019.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 534

 Score =  563 bits (1450), Expect = e-158
 Identities = 269/437 (61%), Positives = 339/437 (77%), Gaps = 1/437 (0%)
 Frame = -3

Query: 1458 IVSANSMISNFLKIGEISIAKKIFHNVPSRDVVTWNSLIGGLVKNACFQEALGAFGEMLK 1279
            + + N +I + +KIGE  +AKK+  N   ++V+TWN +IGG V+N  ++EAL A   ML 
Sbjct: 98   VCNINLIIESLMKIGESGLAKKVLRNASDQNVITWNLMIGGYVRNVQYEEALKALKNMLS 157

Query: 1278 -TKIEPDRYTFASTLTAVARLGVLDHAKWIHGMMVEKRIELNYILASALIDMYAKCGKIN 1102
             T I+P++++FAS+L A ARLG L HAKW+H +M++  IELN IL+SAL+D+YAKCG I 
Sbjct: 158  FTDIKPNKFSFASSLAACARLGDLHHAKWVHSLMIDSGIELNAILSSALVDVYAKCGDIG 217

Query: 1101 AAKEIFNSVHRADVSIWNAMINGLAIHGLAFDAIAIFSMMKAENVSPDSITFTGILTACS 922
             ++E+F SV R DVSIWNAMI G A HGLA +AI +FS M+AE+VSPDSITF G+LT CS
Sbjct: 218  TSREVFYSVKRNDVSIWNAMITGFATHGLATEAIRVFSEMEAEHVSPDSITFLGLLTTCS 277

Query: 921  HCGLVEQGRELYGLMRTCYLIQPQLEHYGAMVVLLSRAGLLEEAYAVIREMTVEPDIIIW 742
            HCGL+E+G+E +GLM   + IQP+LEHYGAMV LL RAG ++EAY +I  M +EPD++IW
Sbjct: 278  HCGLLEEGKEYFGLMSRRFSIQPKLEHYGAMVDLLGRAGRVKEAYELIESMPIEPDVVIW 337

Query: 741  RALLGACRMHKNSELAEIAISKIQHLGCGDYTLLSNTYCSVNRWESSEKVRYTMKEKSVR 562
            R+LL + R +KN EL EIAI  +     GDY LLSN Y S  +WES++KVR  M ++ +R
Sbjct: 338  RSLLSSSRTYKNPELGEIAIQNLSKAKSGDYVLLSNIYSSTKKWESAQKVRELMSKEGIR 397

Query: 561  KRSGKSWLEIGGIIHQFSAGGKSHIEASKIYKVLEALIQRTRREGFVSDTDLVLMDVSEE 382
            K  GKSWLE GG+IH+F AG  SHIE   IYKVLE LIQ+T+ +GFVSDTDLVLMDVSEE
Sbjct: 398  KAKGKSWLEFGGMIHRFKAGDTSHIETKAIYKVLEGLIQKTKSQGFVSDTDLVLMDVSEE 457

Query: 381  EKEENLNYHSEKWALAYGILQTSPGTEILISKNLRTCSDCHSWMKIVSKVLNRVITVRDR 202
            EKEENLNYHSEK ALAY IL++SPGTEI I KN+R CSDCH+W+K VSK+LNRVI +RDR
Sbjct: 458  EKEENLNYHSEKLALAYVILKSSPGTEIRIQKNIRMCSDCHNWIKAVSKLLNRVIIMRDR 517

Query: 201  IRFHRFEGGTCTCRDYW 151
            IRFHRFE G C+CRDYW
Sbjct: 518  IRFHRFEDGLCSCRDYW 534



 Score = 79.3 bits (194), Expect = 3e-12
 Identities = 49/195 (25%), Positives = 98/195 (50%), Gaps = 3/195 (1%)
 Frame = -3

Query: 1464 VDIVSANSMISNFLKIGEISIAKKIFHNVPSRDVVTWNSLIGGLVKNACFQEALGAFGEM 1285
            ++ + +++++  + K G+I  ++++F++V   DV  WN++I G   +    EA+  F EM
Sbjct: 198  LNAILSSALVDVYAKCGDIGTSREVFYSVKRNDVSIWNAMITGFATHGLATEAIRVFSEM 257

Query: 1284 LKTKIEPDRYTFASTLTAVARLGVLDHAKWIHGMMVEK-RIELNYILASALIDMYAKCGK 1108
                + PD  TF   LT  +  G+L+  K   G+M  +  I+       A++D+  + G+
Sbjct: 258  EAEHVSPDSITFLGLLTTCSHCGLLEEGKEYFGLMSRRFSIQPKLEHYGAMVDLLGRAGR 317

Query: 1107 INAAKEIFNSVH-RADVSIWNAMINGLAIH-GLAFDAIAIFSMMKAENVSPDSITFTGIL 934
            +  A E+  S+    DV IW ++++    +       IAI ++ KA+  S D +  + I 
Sbjct: 318  VKEAYELIESMPIEPDVVIWRSLLSSSRTYKNPELGEIAIQNLSKAK--SGDYVLLSNIY 375

Query: 933  TACSHCGLVEQGREL 889
            ++       ++ REL
Sbjct: 376  SSTKKWESAQKVREL 390

