BLASTX nr result
ID: Cephaelis21_contig00029057
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cephaelis21_contig00029057 (1632 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CAN78573.1| hypothetical protein VITISV_020581 [Vitis vinifera] 324 4e-86 ref|XP_002283311.1| PREDICTED: pentatricopeptide repeat-containi... 323 1e-85 ref|XP_002868835.1| pentatricopeptide repeat-containing protein ... 320 5e-85 ref|XP_003537572.1| PREDICTED: pentatricopeptide repeat-containi... 318 2e-84 ref|NP_195528.1| pentatricopeptide repeat-containing protein [Ar... 312 2e-82 >emb|CAN78573.1| hypothetical protein VITISV_020581 [Vitis vinifera] Length = 381 Score = 324 bits (831), Expect = 4e-86 Identities = 172/315 (54%), Positives = 218/315 (69%), Gaps = 1/315 (0%) Frame = +2 Query: 416 KTSASEHNDNNYDNPPEPIPNRPLRGERRTPINPXXXXXXXXXXXXXFGEDESHARQQNE 595 K+S+S + NPP PIPNRPLRGE+R P G D + Q Sbjct: 83 KSSSSCGGGGSSSNPPNPIPNRPLRGEQRMNRPPPHIPQRKLGLPKDEGVDRA---SQAS 139 Query: 596 QLNRPRFGENAQRETGALEDSDFLEKFKLGFDRN-KGENPNTRQPSYNKQRGGEKYDQAA 772 N+P E + GA + FLE+FKLG + + + QPS + K Sbjct: 140 PFNQPSPAE----KVGATLEDGFLERFKLGVQKKERPQESAAAQPSREQDANHGK----- 190 Query: 773 ETPPPSQPPEDADEIFRKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMRERGTIPEV 952 QPP++ADEIFRKMKE+GLIPNAVAMLDGLCKDGLVQEAMKLFGLMRE+GTIPEV Sbjct: 191 -----EQPPQNADEIFRKMKESGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEV 245 Query: 953 VIYTAVVEGFCKAHKVDDAVRIFKKMQSIGITPNAFSYGILIQGLLRVKRLDDAREFCLE 1132 VIYTAVVEGFCKA ++DDAVRIF+KMQ+ GI+PNAFSY +LI+G+ + RLD A +FC+E Sbjct: 246 VIYTAVVEGFCKARQLDDAVRIFRKMQNNGISPNAFSYTVLIRGMYKGNRLDIAVDFCVE 305 Query: 1133 MLEGGHSPNVVTFIGLVDAYCREKGLEEAQTMLSSLREKGFYLNEKAVREHLEKKGPFLP 1312 MLE GHSPNV T + L+ +C+EKG+EEA+ ++ +L++KG ++++KAVRE+L+KKGP P Sbjct: 306 MLEAGHSPNVATLVDLIHEFCKEKGVEEAKNVIVTLKQKGLFVDDKAVREYLDKKGPQSP 365 Query: 1313 LVWEAIFGKKSSERS 1357 LVWEA FGKKS +RS Sbjct: 366 LVWEAFFGKKSPQRS 380 >ref|XP_002283311.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like [Vitis vinifera] Length = 380 Score = 323 bits (827), Expect = 1e-85 Identities = 170/317 (53%), Positives = 218/317 (68%), Gaps = 1/317 (0%) Frame = +2 Query: 410 FSKTSASEHNDNNYDNPPEPIPNRPLRGERRTPINPXXXXXXXXXXXXXFGEDESHARQQ 589 + + S+S + NPP PIPNRPLRGE+R P G D + Q Sbjct: 80 YGRKSSSSCGGGSSSNPPNPIPNRPLRGEQRMNRPPPHIPQRKLGLPKDEGVDRA---SQ 136 Query: 590 NEQLNRPRFGENAQRETGALEDSDFLEKFKLGFDRN-KGENPNTRQPSYNKQRGGEKYDQ 766 N+P E + GA + FLE+FKLG + + + QPS + K Sbjct: 137 ASPFNQPSPAE----KVGATLEDGFLERFKLGVQKKERPQESAAAQPSREQDANHGK--- 189 Query: 767 AAETPPPSQPPEDADEIFRKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMRERGTIP 946 QPP++ADEIFRKMKE+GLIPNAVAMLDGLCKDGLVQEAMKLFGLMRE+GTIP Sbjct: 190 -------EQPPQNADEIFRKMKESGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIP 242 Query: 947 EVVIYTAVVEGFCKAHKVDDAVRIFKKMQSIGITPNAFSYGILIQGLLRVKRLDDAREFC 1126 EVVIYTAVVEGFCKA +++DAVRIF+KMQ+ GI+PNAFSY +LI+G+ + RLD A +FC Sbjct: 243 EVVIYTAVVEGFCKARQLNDAVRIFRKMQNNGISPNAFSYTVLIRGMYKGNRLDIAVDFC 302 Query: 1127 LEMLEGGHSPNVVTFIGLVDAYCREKGLEEAQTMLSSLREKGFYLNEKAVREHLEKKGPF 1306 +EMLE GHSPNV T + L+ +C+EKG+EEA+ ++ +L++KG ++++KAVRE+L+KKGP Sbjct: 303 VEMLEAGHSPNVATLVDLIHEFCKEKGVEEAKNVIVTLKQKGLFVDDKAVREYLDKKGPQ 362 Query: 1307 LPLVWEAIFGKKSSERS 1357 PLVWEA FGKKS +RS Sbjct: 363 SPLVWEAFFGKKSPQRS 379 >ref|XP_002868835.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297314671|gb|EFH45094.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 301 Score = 320 bits (821), Expect = 5e-85 Identities = 170/311 (54%), Positives = 205/311 (65%) Frame = +2 Query: 422 SASEHNDNNYDNPPEPIPNRPLRGERRTPINPXXXXXXXXXXXXXFGEDESHARQQNEQL 601 S + NPPEP+PNRPLRGER + E ARQ ++ Sbjct: 32 STGDKGQEKQQNPPEPLPNRPLRGERSSN-----------------SHREPPARQAHD-- 72 Query: 602 NRPRFGENAQRETGALEDSDFLEKFKLGFDRNKGENPNTRQPSYNKQRGGEKYDQAAETP 781 + L D FLE+FKLG +++ E P E+Y Q Sbjct: 73 --------LGKIDNTLSDDGFLEQFKLGVNQDSQETPKP-----------EQYPQ----- 108 Query: 782 PPSQPPEDADEIFRKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMRERGTIPEVVIY 961 P PPED+DEIF+KMKE GLIPNAVAMLDGLCKDGLVQEAMKLFGLMR++GTIPEVVIY Sbjct: 109 DPLLPPEDSDEIFKKMKEGGLIPNAVAMLDGLCKDGLVQEAMKLFGLMRDKGTIPEVVIY 168 Query: 962 TAVVEGFCKAHKVDDAVRIFKKMQSIGITPNAFSYGILIQGLLRVKRLDDAREFCLEMLE 1141 TAVVEGFCKAHK++DA RIF+KMQ+ GITPNAFSYG+L+QGL LDDA FC EMLE Sbjct: 169 TAVVEGFCKAHKIEDAKRIFRKMQTNGITPNAFSYGVLVQGLYNCNMLDDAVTFCCEMLE 228 Query: 1142 GGHSPNVVTFIGLVDAYCREKGLEEAQTMLSSLREKGFYLNEKAVREHLEKKGPFLPLVW 1321 GHSPN+ TF+GLVDA CREKG+E+AQ+ + L +KGF LN KAV+E ++K+ PF L W Sbjct: 229 SGHSPNIPTFVGLVDALCREKGVEQAQSAIDGLNQKGFALNVKAVKEFMDKRAPFPSLAW 288 Query: 1322 EAIFGKKSSER 1354 EAIF KK +++ Sbjct: 289 EAIFKKKPTDK 299 >ref|XP_003537572.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like [Glycine max] Length = 388 Score = 318 bits (816), Expect = 2e-84 Identities = 193/391 (49%), Positives = 241/391 (61%), Gaps = 25/391 (6%) Frame = +2 Query: 257 HSLYSSHFYRKTESYLHELLSNVVPSLLSMKIMRPFSTIRDVDS-----LHTHHSNFSKT 421 H L S K S++H +P LL + +R FS D + F + Sbjct: 12 HKLVSFSQIEKLVSFVH--CKQYLPPLL--ETVRHFSFTDDCSGRSKQPVGESDDFFLQQ 67 Query: 422 SASEHNDNNYDNPP--EPIPNRPLRGERRTPIN-PXXXXXXXXXXXXXF---------GE 565 S S DN + EPIP+RPLR R P+N P F G Sbjct: 68 SDSSFKDNGESDQSLSEPIPSRPLRS--RKPVNQPPPRFQEYDRGSHSFPPRFYDNHGGP 125 Query: 566 DESHARQQNEQLNRPRFGENA---QRETGALEDSDFLEKFKLGFDRNKGENPNTRQPSYN 736 DE ++ +++ N R+ G DS FL KFKLGFD + N + + + Sbjct: 126 DELDQTNKSSKIDLAFQNTNVAKTNRDAGQSGDS-FLNKFKLGFD---DKTVNLSEVAAS 181 Query: 737 KQRGGEKYDQAAETPPPSQP-----PEDADEIFRKMKETGLIPNAVAMLDGLCKDGLVQE 901 KQ + A+ P+QP P+DADEIF+KMKETGLIPNAVAMLDGLCKDGLVQE Sbjct: 182 KQ------SEEAKRSNPNQPAQESMPQDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQE 235 Query: 902 AMKLFGLMRERGTIPEVVIYTAVVEGFCKAHKVDDAVRIFKKMQSIGITPNAFSYGILIQ 1081 A+KLFGLMRE+GTIPE+VIYTAVVEG+ KAHK DDA RIF+KMQS G++PNAFSY +LIQ Sbjct: 236 ALKLFGLMREKGTIPEIVIYTAVVEGYTKAHKADDAKRIFRKMQSSGVSPNAFSYMVLIQ 295 Query: 1082 GLLRVKRLDDAREFCLEMLEGGHSPNVVTFIGLVDAYCREKGLEEAQTMLSSLREKGFYL 1261 GL + RL DA EFC+EMLE GHSPNV TF+GLVD +C EKG+EEA++ + +L +KGF + Sbjct: 296 GLYKCSRLHDAFEFCVEMLEAGHSPNVTTFVGLVDGFCNEKGVEEAKSAIKTLTDKGFVV 355 Query: 1262 NEKAVREHLEKKGPFLPLVWEAIFGKKSSER 1354 NEKAVR+ L+KK PF P VWEAIFGKK+ +R Sbjct: 356 NEKAVRQFLDKKAPFSPSVWEAIFGKKAPQR 386 >ref|NP_195528.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|79326453|ref|NP_001031806.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75266764|sp|Q9SZL5.1|PP356_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g38150 gi|4467121|emb|CAB37555.1| putative protein [Arabidopsis thaliana] gi|7270799|emb|CAB80480.1| putative protein [Arabidopsis thaliana] gi|26453272|dbj|BAC43709.1| unknown protein [Arabidopsis thaliana] gi|332661484|gb|AEE86884.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|332661485|gb|AEE86885.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 302 Score = 312 bits (799), Expect = 2e-82 Identities = 175/344 (50%), Positives = 215/344 (62%) Frame = +2 Query: 323 VVPSLLSMKIMRPFSTIRDVDSLHTHHSNFSKTSASEHNDNNYDNPPEPIPNRPLRGERR 502 +VPS ++ R + V + + F T + D NPPEP+PNRPLRGER Sbjct: 1 MVPSSKAVVFARQMAKQIRVTTPSMSATRFLSTGDNGQVDEQ-QNPPEPLPNRPLRGERS 59 Query: 503 TPINPXXXXXXXXXXXXXFGEDESHARQQNEQLNRPRFGENAQRETGALEDSDFLEKFKL 682 + E ARQ + N + L D FLE+FKL Sbjct: 60 SN-----------------SHREPPARQAH----------NLGKSDTTLSDDGFLEQFKL 92 Query: 683 GFDRNKGENPNTRQPSYNKQRGGEKYDQAAETPPPSQPPEDADEIFRKMKETGLIPNAVA 862 G +++ E P E+Y Q P PPED+DEIF+KMKE GLIPNAVA Sbjct: 93 GVNQDSRETPKP-----------EQYPQE-----PLPPPEDSDEIFKKMKEGGLIPNAVA 136 Query: 863 MLDGLCKDGLVQEAMKLFGLMRERGTIPEVVIYTAVVEGFCKAHKVDDAVRIFKKMQSIG 1042 MLDGLCKDGLVQEAMKLFGLMR++GTIPEVVIYTAVVE FCKAHK++DA RIF+KMQ+ G Sbjct: 137 MLDGLCKDGLVQEAMKLFGLMRDKGTIPEVVIYTAVVEAFCKAHKIEDAKRIFRKMQNNG 196 Query: 1043 ITPNAFSYGILIQGLLRVKRLDDAREFCLEMLEGGHSPNVVTFIGLVDAYCREKGLEEAQ 1222 I PNAFSYG+L+QGL LDDA FC EMLE GHSPNV TF+ LVDA CR KG+E+AQ Sbjct: 197 IAPNAFSYGVLVQGLYNCNMLDDAVAFCSEMLESGHSPNVPTFVELVDALCRVKGVEQAQ 256 Query: 1223 TMLSSLREKGFYLNEKAVREHLEKKGPFLPLVWEAIFGKKSSER 1354 + + +L +KGF +N KAV+E ++K+ PF L WEAIF KK +E+ Sbjct: 257 SAIDTLNQKGFAVNVKAVKEFMDKRAPFPSLAWEAIFKKKPTEK 300