BLASTX nr result
ID: Cephaelis21_contig00006319
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00006319
         (1019 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                        (bits) Value

ref|XP_002518527.1| pentatricopeptide repeat-containing protein,...   284   2e-74
ref|XP_002305605.1| predicted protein [Populus trichocarpa] gi|2...   262   1e-67
ref|XP_002264956.1| PREDICTED: pentatricopeptide repeat-containi...   260   4e-67
ref|NP_193849.2| pentatricopeptide repeat-containing protein [Ar...   199   7e-49
sp|O49558.2|PP331_ARATH RecName: Full=Pentatricopeptide repeat-c...   199   7e-49

>ref|XP_002518527.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis]
 gi|223542372|gb|EEF43914.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis]
          Length = 599

 Score = 284 bits (727), Expect = 2e-74
 Identities = 158/342 (46%), Positives = 212/342 (61%), Gaps = 3/342 (0%)
 Frame = +3

Query: 3   KNSKQSSWPPTNWRLQVQETQLISQVSKILLQRQTKYWEXXXXXXXXXXXXXXXXFLRIL 182
           + + +S +WR ++Q+ QL+S++S ILLQR W F +IL
Sbjct: 31  RKTYSTSTSKISWRTRIQQNQLVSEISTILLQRNN--WIPLLQNLNLSSKLTPFLFFQIL 88

Query: 183 QKTQPSPQISLDFFNWAKKNLRFQPDLKSHCKLTHLLFESGLSGLAKPVLASIIQSYPSD 362
           KTQ QISL+FFNWAK NL F PDLKS C + L S L AK +L S+I++YPS+
Sbjct: 89  HKTQTHAQISLNFFNWAKTNLNFNPDLKSQCHVIQLSLGSDLPRAAKKILDSLIKTYPSN 148

Query: 363 EIVSSFSKVANFDTYSSVLCS---VLKGYCNRGLYWEALKVYVKAKESGHGVISVHVFNS 533
           + + + SS+LC+ VL+ Y ++G + E L+VY K + G SVH N
Sbjct: 149 LFLETMVQACRGK--SSLLCTLNFVLEFYSHKGSFLEGLEVYKKMRVIGC-TPSVHACNV 205

Query: 534 LLHLLQVQNETRLAWCVYSSMIRHVVSENQFTWPIIARILCKDAKFERIGRILDMGINSS 713
           LL LQ ++E RLAWC Y +MIR V ++FTW ++A ILCKD FERI ++LDMGI +S
Sbjct: 206 LLDALQRESEIRLAWCFYCAMIRVGVLPDKFTWSLVAHILCKDGNFERIVKLLDMGICNS 265

Query: 714 SLYDLIVQNYSERGDFDAAFGYLGKMYDKKLDPSFSIYSSILDGACKCGDTEVIEMVTRT 893
           +Y+ +V YS+ GDF AAF L +MYD+K++P FS YSSILDGACKC + +VIE V
Sbjct: 266 VMYNAVVDYYSKNGDFKAAFCRLNEMYDRKVEPGFSTYSSILDGACKCRNLQVIERVVAI 325

Query: 894 MIEKGYIPKGVTSKYDSVIQKLSDLGKTYAAKLFLSRACVEK 1019
           M+ K + K +S YDS+IQKL DLGK AA LF RAC E+
Sbjct: 326 MVGKQLLSKCPSSDYDSIIQKLCDLGKVSAATLFFKRACDER 367

>ref|XP_002305605.1| predicted protein [Populus trichocarpa]
 gi|222848569|gb|EEE86116.1| predicted protein [Populus trichocarpa]
          Length = 564

 Score = 262 bits (669), Expect = 1e-67
 Identities = 143/328 (43%), Positives = 206/328 (62%), Gaps = 1/328 (0%)
 Frame = +3

Query: 39  WRLQVQETQLISQVSKILLQRQTKYWEXXXXXXXXXXXXXXXXFLRILQKTQPSPQISLD 218
           WR+Q+++ QL+ Q+S ILLQR W F +IL KTQ +PQISL
Sbjct: 14  WRIQIRQNQLVFQISSILLQRHN--WVSLLQNFNLSTKLTPPLFNQILHKTQTNPQISLR 71

Query: 219 FFNWAKKNLRFQPDLKSHCKLTHLLFESGLSGLAKPVLASIIQSYPSDEIVSSFSKVANF 398
           FFNW + NL+ +PDLKS C + ++ SGL+ +P++ S+++++ + +
Sbjct: 72  FFNWVQTNLKLKPDLKSQCHIINICVNSGLTLPVRPIMDSLVKTHHVSVLGEAMVDSCRG 131

Query: 399 DTYSSVLCS-VLKGYCNRGLYWEALKVYVKAKESGHGVISVHVFNSLLHLLQVQNETRLA 575
           + S S VL+ Y ++GL+ E+L+++ K + +G + S NS+L +LQ +NE +LA
Sbjct: 132 KSLKSDAFSFVLECYSHKGLFMESLEMFRKMRGNGF-IASGTACNSVLDVLQRENEIKLA 190

Query: 576 WCVYSSMIRHVVSENQFTWPIIARILCKDAKFERIGRILDMGINSSSLYDLIVQNYSERG 755
           WC Y +MI+ V ++ TW +IA+ILCKD FERI + LDMG+ +S LY+ ++ S+RG
Sbjct: 191 WCFYCAMIKDGVLPDKLTWSLIAQILCKDGNFERIVKFLDMGVYNSVLYNGVIDCCSKRG 250

Query: 756 DFDAAFGYLGKMYDKKLDPSFSIYSSILDGACKCGDTEVIEMVTRTMIEKGYIPKGVTSK 935
           DF+AAF L +M ++KLDP FS YS+ILDGACK G+ EVIE V M EKG +PK S+
Sbjct: 251 DFEAAFERLNQMCERKLDPGFSTYSAILDGACKHGNEEVIERVMDIMAEKGLLPKCPLSQ 310

Query: 936 YDSVIQKLSDLGKTYAAKLFLSRACVEK 1019
           DSVIQK SDL K A +F RAC EK
Sbjct: 311 CDSVIQKFSDLCKMNVATMFFRRACDEK 338

>ref|XP_002264956.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21170-like [Vitis vinifera]
          Length = 569

 Score = 260 bits (664), Expect = 4e-67
 Identities = 152/340 (44%), Positives = 212/340 (62%), Gaps = 2/340 (0%)
 Frame = +3

Query: 6   NSKQSSWPPTNWRLQVQETQLISQVSKILLQRQTKYWEXXXXXXXXXXXXXXXXFLRILQ 185
           N S P NWR Q+++ QLISQ+S ILLQR W F +IL
Sbjct: 11  NQFSKSTTPLNWRAQIKQNQLISQISSILLQRHN--WVTLLRNFNLSSKLTPSLFHQILL 68

Query: 186 KTQPSPQISLDFFNWAKKNLRFQPDLKSHCKLTHLLFESGLSGLAKPVLASIIQSYPSDE 365
           KTQ +PQ SL FFNW + NL FQPDL +H ++ + +SGL AK +L S+I++
Sbjct: 69  KTQKNPQSSLSFFNWVRTNLGFQPDLAAHSQIIRISIQSGLFQPAKGILDSLIETQKVSV 128

Query: 366 IVSSFSKVANF-DTYSSVLCSVLKGYCNRGLYWEALKVYVKAKESGHGVISVHVFNSLLH 542
           +V S + D+ S VL VL+ Y ++GL+ EAL+V+ + G+ V SV N+LL
Sbjct: 129 LVDSVIQACRGKDSESPVLGFVLECYSSKGLFIEALEVFRRITIHGY-VPSVRSCNALLD 187

Query: 543 LLQVQNETRLAWCVYSSMIRHVVSENQFTWPIIARILCKDAKFERIGRILDMGINSSSL- 719
           LQ +NE +LAWCV ++IR+ V + IA ILCK+ K ER+ R+LDM I ++L
Sbjct: 188 SLQRENEIKLAWCVCGALIRNGVLPDYVR---IALILCKNGKLERVVRLLDMSIVCNALI 244

Query: 720 YDLIVQNYSERGDFDAAFGYLGKMYDKKLDPSFSIYSSILDGACKCGDTEVIEMVTRTMI 899
           Y L++ Y ERG+F AAF YL +M ++K DP F Y+SILDGACK + EVI++V +M+
Sbjct: 245 YKLVIDCYCERGNFSAAFHYLNEMCNRKFDPGFCAYNSILDGACKYENDEVIQIVMGSMV 304

Query: 900 EKGYIPKGVTSKYDSVIQKLSDLGKTYAAKLFLSRACVEK 1019
           EKG +PK + S+YDS+IQK+ +LGKT+AA++F RA EK
Sbjct: 305 EKGLLPKLLLSEYDSIIQKICNLGKTHAAQMFFKRARNEK 344

>ref|NP_193849.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
 gi|332659015|gb|AEE84415.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
          Length = 551

 Score = 199 bits (507), Expect = 7e-49
 Identities = 109/328 (33%), Positives = 192/328 (58%), Gaps = 2/328 (0%)
 Frame = +3

Query: 33  TNWRLQVQETQLISQVSKILLQRQTKYWEXXXXXXXXXXXXXXXX-FLRILQKTQPSPQI 209
           ++W+ Q ++ +++S ILLQR+ FL+IL++T+ P+
Sbjct: 26  SDWKTQQTLFRVATEISSILLQRRNWITHLQYVKSKLPRSTLTSPVFLQILRETRKCPKT 85

Query: 210 SLDFFNWAKKNLRFQPDLKSHCKLTHLLFESGLSGLAKPVLASIIQSYPSDEIVSSFSKV 389
           +LDFF++AK +LRF+PDLKSHC++ + ESGL A+ +L ++++ +V +
Sbjct: 86  TLDFFDFAKTHLRFEPDLKSHCRVIEVAAESGLLERAEMLLRPLVETNSVSLVVGEMHRW 145

Query: 390 ANFDTYSSVLCS-VLKGYCNRGLYWEALKVYVKAKESGHGVISVHVFNSLLHLLQVQNET 566
           + SV S VL+ Y +G + L+V+ + S +NSLL L +N+
Sbjct: 146 FEGEVSLSVSLSLVLEYYALKGSHHNGLEVFGFMRRLRLSP-SQSAYNSLLGSLVKENQF 204

Query: 567 RLAWCVYSSMIRHVVSENQFTWPIIARILCKDAKFERIGRILDMGINSSSLYDLIVQNYS 746
           R+A C+YS+M+R+ + ++ TW +IA+ILC+ + + + ++++ G+ S +Y +V+ YS
Sbjct: 205 RVALCLYSAMVRNGIVSDELTWDLIAQILCEQGRSKSVFKLMETGVESCKIYTNLVECYS 264

Query: 747 ERGDFDAAFGYLGKMYDKKLDPSFSIYSSILDGACKCGDTEVIEMVTRTMIEKGYIPKGV 926
           G+FDA F + +M DKKL+ SF Y +LD AC+ GD E I+ V M+EK ++ G
Sbjct: 265 RNGEFDAVFSLIHEMDDKKLELSFCSYGCVLDDACRLGDAEFIDKVLCLMVEKKFVTLGD 324

Query: 927 TSKYDSVIQKLSDLGKTYAAKLFLSRAC 1010
           ++ D +I++L D+GKT+A+++ +AC
Sbjct: 325 SAVNDKIIERLCDMGKTFASEMLFRKAC 352

>sp|O49558.2|PP331_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g21170
          Length = 585

 Score = 199 bits (507), Expect = 7e-49
 Identities = 109/328 (33%), Positives = 192/328 (58%), Gaps = 2/328 (0%)
 Frame = +3

Query: 33  TNWRLQVQETQLISQVSKILLQRQTKYWEXXXXXXXXXXXXXXXX-FLRILQKTQPSPQI 209
           ++W+ Q ++ +++S ILLQR+ FL+IL++T+ P+
Sbjct: 26  SDWKTQQTLFRVATEISSILLQRRNWITHLQYVKSKLPRSTLTSPVFLQILRETRKCPKT 85

Query: 210 SLDFFNWAKKNLRFQPDLKSHCKLTHLLFESGLSGLAKPVLASIIQSYPSDEIVSSFSKV 389
           +LDFF++AK +LRF+PDLKSHC++ + ESGL A+ +L ++++ +V +
Sbjct: 86  TLDFFDFAKTHLRFEPDLKSHCRVIEVAAESGLLERAEMLLRPLVETNSVSLVVGEMHRW 145

Query: 390 ANFDTYSSVLCS-VLKGYCNRGLYWEALKVYVKAKESGHGVISVHVFNSLLHLLQVQNET 566
           + SV S VL+ Y +G + L+V+ + S +NSLL L +N+
Sbjct: 146 FEGEVSLSVSLSLVLEYYALKGSHHNGLEVFGFMRRLRLSP-SQSAYNSLLGSLVKENQF 204

Query: 567 RLAWCVYSSMIRHVVSENQFTWPIIARILCKDAKFERIGRILDMGINSSSLYDLIVQNYS 746
           R+A C+YS+M+R+ + ++ TW +IA+ILC+ + + + ++++ G+ S +Y +V+ YS
Sbjct: 205 RVALCLYSAMVRNGIVSDELTWDLIAQILCEQGRSKSVFKLMETGVESCKIYTNLVECYS 264

Query: 747 ERGDFDAAFGYLGKMYDKKLDPSFSIYSSILDGACKCGDTEVIEMVTRTMIEKGYIPKGV 926
           G+FDA F + +M DKKL+ SF Y +LD AC+ GD E I+ V M+EK ++ G
Sbjct: 265 RNGEFDAVFSLIHEMDDKKLELSFCSYGCVLDDACRLGDAEFIDKVLCLMVEKKFVTLGD 324

Query: 927 TSKYDSVIQKLSDLGKTYAAKLFLSRAC 1010
           ++ D +I++L D+GKT+A+++ +AC
Sbjct: 325 SAVNDKIIERLCDMGKTFASEMLFRKAC 352