BLASTX nr result
ID: Cephaelis21_contig00052561
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cephaelis21_contig00052561 (381 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003632994.1| PREDICTED: pentatricopeptide repeat-containi... 199 1e-49 ref|XP_002531058.1| pentatricopeptide repeat-containing protein,... 182 2e-44 ref|XP_003541672.1| PREDICTED: pentatricopeptide repeat-containi... 181 4e-44 ref|XP_003547196.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope... 171 6e-41 emb|CBI30968.3| unnamed protein product [Vitis vinifera] 169 2e-40 >ref|XP_003632994.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Vitis vinifera] Length = 613 Score = 199 bits (507), Expect = 1e-49 Identities = 96/126 (76%), Positives = 109/126 (86%) Frame = +2 Query: 2 RQMLANSVQPDTHTYPFLLKAIAKLLAVREGEKVHCIAMKNGFESLVFVQNALVHFYGAC 181 RQM + ++PDTHTYPFLLKAIAKL+ VREGEKVH IA++NGFESLVFVQN LVH Y AC Sbjct: 129 RQMHVSCIEPDTHTYPFLLKAIAKLMDVREGEKVHSIAIRNGFESLVFVQNTLVHMYAAC 188 Query: 182 GQAESAHKLFEFMFEKNLVGWNSVINGYALSSRPNETLTLYRKMELEDVKPDGFTLVSLL 361 G AESAHKLFE M E+NLV WNSVINGYAL+ RPNE LTL+R+M L V+PDGFT+VSLL Sbjct: 189 GHAESAHKLFELMAERNLVTWNSVINGYALNGRPNEALTLFREMGLRGVEPDGFTMVSLL 248 Query: 362 TACAEL 379 +ACAEL Sbjct: 249 SACAEL 254 Score = 85.9 bits (211), Expect = 3e-15 Identities = 45/124 (36%), Positives = 69/124 (55%) Frame = +2 Query: 2 RQMLANSVQPDTHTYPFLLKAIAKLLAVREGEKVHCIAMKNGFESLVFVQNALVHFYGAC 181 R+M V+PD T LL A A+L A+ G + H +K G + + NAL+ Y C Sbjct: 230 REMGLRGVEPDGFTMVSLLSACAELGALALGRRAHVYMVKVGLDGNLHAGNALLDLYAKC 289 Query: 182 GQAESAHKLFEFMFEKNLVGWNSVINGYALSSRPNETLTLYRKMELEDVKPDGFTLVSLL 361 G AHK+F+ M EK++V W S+I G A++ E L L++++E + + P T V +L Sbjct: 290 GSIRQAHKVFDEMEEKSVVSWTSLIVGLAVNGFGKEALELFKELERKGLMPSEITFVGVL 349 Query: 362 TACA 373 AC+ Sbjct: 350 YACS 353 Score = 56.6 bits (135), Expect = 2e-06 Identities = 40/135 (29%), Positives = 66/135 (48%), Gaps = 12/135 (8%) Frame = +2 Query: 11 LANSVQPDTHTYPFLLKAIAKLLAVREGE----KVHCIAMKNGF--------ESLVFVQN 154 ++ S P++ L K IA LL+ + ++H ++++G + L+F Sbjct: 25 ISTSTCPESPKSYILKKCIALLLSCASSKFKFRQIHAFSIRHGVPLTNPDMGKYLIFT-- 82 Query: 155 ALVHFYGACGQAESAHKLFEFMFEKNLVGWNSVINGYALSSRPNETLTLYRKMELEDVKP 334 L+ F C AH++F + N+ WN++I GYA S P L LYR+M + ++P Sbjct: 83 -LLSF---CSPMSYAHQIFSQIQNPNIFTWNTMIRGYAESENPMPALELYRQMHVSCIEP 138 Query: 335 DGFTLVSLLTACAEL 379 D T LL A A+L Sbjct: 139 DTHTYPFLLKAIAKL 153 >ref|XP_002531058.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223529353|gb|EEF31319.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 341 Score = 182 bits (462), Expect = 2e-44 Identities = 83/121 (68%), Positives = 106/121 (87%) Frame = +2 Query: 17 NSVQPDTHTYPFLLKAIAKLLAVREGEKVHCIAMKNGFESLVFVQNALVHFYGACGQAES 196 NS++PDTHTYPFLLKA++K++ VR GEK+H I+++NGFESLVFVQN+L+H Y ACGQ ES Sbjct: 134 NSIEPDTHTYPFLLKAVSKMVNVRVGEKIHSISIRNGFESLVFVQNSLMHMYAACGQYES 193 Query: 197 AHKLFEFMFEKNLVGWNSVINGYALSSRPNETLTLYRKMELEDVKPDGFTLVSLLTACAE 376 AHKLFEFM +++LV WN+ I+G+AL+ +PNE L LY +M LE V+PDGFTLVSLL+ACAE Sbjct: 194 AHKLFEFMPDRDLVAWNTAISGFALNGKPNEALKLYMEMGLEGVEPDGFTLVSLLSACAE 253 Query: 377 L 379 L Sbjct: 254 L 254 Score = 67.0 bits (162), Expect = 2e-09 Identities = 34/110 (30%), Positives = 60/110 (54%) Frame = +2 Query: 5 QMLANSVQPDTHTYPFLLKAIAKLLAVREGEKVHCIAMKNGFESLVFVQNALVHFYGACG 184 +M V+PD T LL A A+L A+ G ++H +K G + + N+L+ Y CG Sbjct: 231 EMGLEGVEPDGFTLVSLLSACAELGALALGRRIHAYMVKVGLDENLHANNSLIDLYAKCG 290 Query: 185 QAESAHKLFEFMFEKNLVGWNSVINGYALSSRPNETLTLYRKMELEDVKP 334 + A ++F+ M +N+V W S+I G A++ E + +++ME + + P Sbjct: 291 RIRDAQQVFDEMELRNVVSWTSLIVGLAVNGFGMEAIEHFKEMEKQGLVP 340 >ref|XP_003541672.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Glycine max] Length = 591 Score = 181 bits (460), Expect = 4e-44 Identities = 84/126 (66%), Positives = 107/126 (84%) Frame = +2 Query: 2 RQMLANSVQPDTHTYPFLLKAIAKLLAVREGEKVHCIAMKNGFESLVFVQNALVHFYGAC 181 RQM+ + V+PDTHTYPFLLKAI+K L VREGE +H + ++NGFESLVFVQN+L+H Y AC Sbjct: 107 RQMVVSCVEPDTHTYPFLLKAISKSLNVREGEAIHSVTIRNGFESLVFVQNSLLHIYAAC 166 Query: 182 GQAESAHKLFEFMFEKNLVGWNSVINGYALSSRPNETLTLYRKMELEDVKPDGFTLVSLL 361 G ESA+K+FE M E++LV WNS+ING+AL+ RPNE LTL+R+M +E V+PDGFT+VSLL Sbjct: 167 GDTESAYKVFELMKERDLVAWNSMINGFALNGRPNEALTLFREMSVEGVEPDGFTVVSLL 226 Query: 362 TACAEL 379 +A AEL Sbjct: 227 SASAEL 232 Score = 83.2 bits (204), Expect = 2e-14 Identities = 45/124 (36%), Positives = 66/124 (53%) Frame = +2 Query: 2 RQMLANSVQPDTHTYPFLLKAIAKLLAVREGEKVHCIAMKNGFESLVFVQNALVHFYGAC 181 R+M V+PD T LL A A+L A+ G +VH +K G V N+L+ Y C Sbjct: 208 REMSVEGVEPDGFTVVSLLSASAELGALELGRRVHVYLLKVGLSKNSHVTNSLLDLYAKC 267 Query: 182 GQAESAHKLFEFMFEKNLVGWNSVINGYALSSRPNETLTLYRKMELEDVKPDGFTLVSLL 361 G A ++F M E+N V W S+I G A++ E L L+++ME + + P T V +L Sbjct: 268 GAIREAQRVFSEMSERNAVSWTSLIVGLAVNGFGEEALELFKEMEGQGLVPSEITFVGVL 327 Query: 362 TACA 373 AC+ Sbjct: 328 YACS 331 >ref|XP_003547196.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At4g21065-like [Glycine max] Length = 661 Score = 171 bits (433), Expect = 6e-41 Identities = 80/126 (63%), Positives = 103/126 (81%) Frame = +2 Query: 2 RQMLANSVQPDTHTYPFLLKAIAKLLAVREGEKVHCIAMKNGFESLVFVQNALVHFYGAC 181 RQM+ + ++PDTHTYPFLLKAI+K L VREGE +H + ++NGFESLVFVQN+L+H Y AC Sbjct: 119 RQMIVSRIEPDTHTYPFLLKAISKSLNVREGEAIHSVTIRNGFESLVFVQNSLLHIYAAC 178 Query: 182 GQAESAHKLFEFMFEKNLVGWNSVINGYALSSRPNETLTLYRKMELEDVKPDGFTLVSLL 361 G ESAH +FE M +++LV SVING+AL+ RP+E LTL+R+M E V+PDGFT+VSLL Sbjct: 179 GDTESAHNVFELMRDRDLVAXISVINGFALNGRPSEALTLFREMSAEGVEPDGFTVVSLL 238 Query: 362 TACAEL 379 +A AEL Sbjct: 239 SASAEL 244 Score = 70.1 bits (170), Expect = 2e-10 Identities = 43/124 (34%), Positives = 62/124 (50%) Frame = +2 Query: 2 RQMLANSVQPDTHTYPFLLKAIAKLLAVREGEKVHCIAMKNGFESLVFVQNALVHFYGAC 181 R+M A V+PD T LL A A+L A+ G +VH +K G V N+L+ Y C Sbjct: 220 REMSAEGVEPDGFTVVSLLSASAELGALELGRRVHVYLLKVGLRENSHVTNSLLDLYAKC 279 Query: 182 GQAESAHKLFEFMFEKNLVGWNSVINGYALSSRPNETLTLYRKMELEDVKPDGFTLVSLL 361 + E+N V W S+I G A++ E L L+R+ME + + P T V +L Sbjct: 280 DAI--------WEXERNAVSWTSLIVGLAVNGFGEEALELFREMEGQGLVPSEITFVGVL 331 Query: 362 TACA 373 AC+ Sbjct: 332 YACS 335 >emb|CBI30968.3| unnamed protein product [Vitis vinifera] Length = 1434 Score = 169 bits (428), Expect = 2e-40 Identities = 82/107 (76%), Positives = 93/107 (86%) Frame = +2 Query: 59 KAIAKLLAVREGEKVHCIAMKNGFESLVFVQNALVHFYGACGQAESAHKLFEFMFEKNLV 238 +AIAKL+ VREGEKVH IA++NGFESLVFVQN LVH Y ACG AESAHKLFE M E+NLV Sbjct: 3 RAIAKLMDVREGEKVHSIAIRNGFESLVFVQNTLVHMYAACGHAESAHKLFELMAERNLV 62 Query: 239 GWNSVINGYALSSRPNETLTLYRKMELEDVKPDGFTLVSLLTACAEL 379 WNSVINGYAL+ RPNE LTL+R+M L V+PDGFT+VSLL+ACAEL Sbjct: 63 TWNSVINGYALNGRPNEALTLFREMGLRGVEPDGFTMVSLLSACAEL 109 Score = 85.9 bits (211), Expect = 3e-15 Identities = 45/124 (36%), Positives = 69/124 (55%) Frame = +2 Query: 2 RQMLANSVQPDTHTYPFLLKAIAKLLAVREGEKVHCIAMKNGFESLVFVQNALVHFYGAC 181 R+M V+PD T LL A A+L A+ G + H +K G + + NAL+ Y C Sbjct: 85 REMGLRGVEPDGFTMVSLLSACAELGALALGRRAHVYMVKVGLDGNLHAGNALLDLYAKC 144 Query: 182 GQAESAHKLFEFMFEKNLVGWNSVINGYALSSRPNETLTLYRKMELEDVKPDGFTLVSLL 361 G AHK+F+ M EK++V W S+I G A++ E L L++++E + + P T V +L Sbjct: 145 GSIRQAHKVFDEMEEKSVVSWTSLIVGLAVNGFGKEALELFKELERKGLMPSEITFVGVL 204 Query: 362 TACA 373 AC+ Sbjct: 205 YACS 208