BLASTX nr result
ID: Angelica23_contig00018423
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Angelica23_contig00018423 (1389 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containi... 520 e-145 ref|XP_002521239.1| pentatricopeptide repeat-containing protein,... 478 e-132 dbj|BAF01049.1| hypothetical protein [Arabidopsis thaliana] 464 e-128 ref|NP_565402.1| pentatricopeptide repeat-containing protein [Ar... 464 e-128 ref|NP_849962.1| pentatricopeptide repeat-containing protein [Ar... 464 e-128 >ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033 [Vitis vinifera] gi|297744557|emb|CBI37819.3| unnamed protein product [Vitis vinifera] Length = 435 Score = 520 bits (1338), Expect = e-145 Identities = 262/423 (61%), Positives = 332/423 (78%) Frame = +3 Query: 93 LLPSCALSKQGQRFFTSLATSVTGDPLVTDRLIRKFIASSSKSVALNALSHLLTPNSSHS 272 LL CALSKQGQ F +S+A DP ++RLI KFIASSSKS+ALNALSHLL+P ++H Sbjct: 19 LLIQCALSKQGQLFLSSVAR----DPSASNRLICKFIASSSKSIALNALSHLLSPTTTHP 74 Query: 273 HLSSLAFPFYSLLTRAPWFSWNSKLVADVIALLYKQERFREADALISETVSKLDIREREL 452 +LSSLA P YS ++ A WFSWN KL+ADVIALLYKQ + +EA+ L+SET+ KL RER+L Sbjct: 75 YLSSLALPLYSRISEASWFSWNPKLIADVIALLYKQGQLKEAETLVSETLIKLGSRERDL 134 Query: 453 CTFYCCLIEFHSKHKSKQGVFDSYTSLKQILSCSSSVYLKRRAYESMICSLCVIDLPREA 632 +FYC LI+ HSKH S QGVFD + L +I+S SSSVY+K RAY+SMI SLC + LP EA Sbjct: 135 VSFYCNLIDSHSKHSSNQGVFDVISRLSRIVSESSSVYVKERAYKSMISSLCAVGLPLEA 194 Query: 633 ENMMEEMKVQGITSKPSLFELRSIVYAYGRVGLFKDMKRLLFEMESEGIKMDTVCFNMVL 812 EN++EEM+V+G+ KPS+FE RS+VY YGRVGL +DM+R+L +M +EG ++DTV NMVL Sbjct: 195 ENLIEEMRVKGL--KPSVFEFRSVVYGYGRVGLSEDMQRILLQMGNEGFELDTVVSNMVL 252 Query: 813 SSLGAHAELLEMVSWLQRMKYLNVPLSIRTYNTVLNSCPSIMSMLQDPKSIPLSLNELLA 992 SS GA+ + EMVSWLQRMK ++P SIRTYN+VLNSCP IMS+LQD K+ P +++EL+ Sbjct: 253 SSYGAYNKQSEMVSWLQRMKNSSIPFSIRTYNSVLNSCPMIMSILQDLKTFPPTIDELME 312 Query: 993 DLCDNESMLVQELIVSSVLVEAMGWTSLELKLDLHGMHLSSAYLIILQWFDELRTRFEHV 1172 L +E++LV+ELI S VL E M W E KLDLHGMHL SAYLI+LQW +ELR R Sbjct: 313 TLKGDEALLVKELIGSMVLAELMEWDCSEGKLDLHGMHLGSAYLIMLQWREELRYRLNAA 372 Query: 1173 ETDIPAEVRIICGSGKHSTVRGDSPVKHLVKELMVRTESPLRIDRKNIGCFIAKGKVCKS 1352 E +P E+ ++CGSGKHS+VRG+SPVK +V+E+M RT SP++IDRKNIGCF+AK KV K+ Sbjct: 373 EYVMPVEITVVCGSGKHSSVRGESPVKRMVREMMTRTRSPMKIDRKNIGCFVAKAKVVKN 432 Query: 1353 WLC 1361 WLC Sbjct: 433 WLC 435 >ref|XP_002521239.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223539507|gb|EEF41095.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 460 Score = 478 bits (1231), Expect = e-132 Identities = 236/428 (55%), Positives = 319/428 (74%), Gaps = 2/428 (0%) Frame = +3 Query: 84 PLRLLPSCA-LSKQGQRFFTSLATSVT-GDPLVTDRLIRKFIASSSKSVALNALSHLLTP 257 P++ + CA LSKQGQRF +SLA + T GD + T+RLI+KF+A+S KS+AL+ALSHLL P Sbjct: 35 PMKWVFRCAALSKQGQRFLSSLAIATTKGDTVATNRLIKKFVAASPKSIALDALSHLLNP 94 Query: 258 NSSHSHLSSLAFPFYSLLTRAPWFSWNSKLVADVIALLYKQERFREADALISETVSKLDI 437 +SSHSHLSSLAF Y + A WF WN KLVADV+A L KQ R+ E+ L+S+++SKL + Sbjct: 95 HSSHSHLSSLAFTLYLKIAEARWFQWNPKLVADVVAFLDKQGRYDESATLVSDSISKLQV 154 Query: 438 RERELCTFYCCLIEFHSKHKSKQGVFDSYTSLKQILSCSSSVYLKRRAYESMICSLCVID 617 +ER+L FYC L+E SK S +G +S SL Q++ S+SVY+KR+ Y+SM+ LC + Sbjct: 155 KERDLARFYCNLVESQSKQNSIRGFDNSVASLMQLVCNSNSVYVKRQGYKSMVNGLCEMG 214 Query: 618 LPREAENMMEEMKVQGITSKPSLFELRSIVYAYGRVGLFKDMKRLLFEMESEGIKMDTVC 797 PREAE ++EEM +G+ +PS+FE + +VYAYG +G F++M + L +ME G ++DTVC Sbjct: 215 RPREAETLIEEMGKEGV--RPSMFEFKCVVYAYGSLGSFEEMNKCLHQMERAGFRVDTVC 272 Query: 798 FNMVLSSLGAHAELLEMVSWLQRMKYLNVPLSIRTYNTVLNSCPSIMSMLQDPKSIPLSL 977 NM+L+S GAH L EMV WLQ+MK L +P S+RT N+ LNSCP+IMSM+Q+ P+S+ Sbjct: 273 SNMILASYGAHNALPEMVLWLQKMKDLGIPFSLRTCNSALNSCPTIMSMMQNSNDFPISI 332 Query: 978 NELLADLCDNESMLVQELIVSSVLVEAMGWTSLELKLDLHGMHLSSAYLIILQWFDELRT 1157 ++L+ L ++E++LV+E++ SSVL EAM W E KLDLHG HL SAYLIIL W +E+R Sbjct: 333 HDLMKILSEDEALLVKEIVTSSVLDEAMKWDVAEAKLDLHGTHLCSAYLIILLWIEEMRK 392 Query: 1158 RFEHVETDIPAEVRIICGSGKHSTVRGDSPVKHLVKELMVRTESPLRIDRKNIGCFIAKG 1337 RF+ V P E+ ++CGSG HS VRG+SPVK +VK+ MVR SP+RIDR+NIGCFIAKG Sbjct: 393 RFKSVNYVNPTEITVVCGSGNHSIVRGESPVKCMVKDFMVRARSPMRIDRRNIGCFIAKG 452 Query: 1338 KVCKSWLC 1361 KV + WLC Sbjct: 453 KVVEEWLC 460 >dbj|BAF01049.1| hypothetical protein [Arabidopsis thaliana] Length = 501 Score = 464 bits (1194), Expect = e-128 Identities = 237/418 (56%), Positives = 310/418 (74%), Gaps = 1/418 (0%) Frame = +3 Query: 111 LSKQGQRFFTSLAT-SVTGDPLVTDRLIRKFIASSSKSVALNALSHLLTPNSSHSHLSSL 287 L K G RF +SL++ ++ GDP +R I+KF+A+S KSVALN LSHLL+ +SH HLS Sbjct: 85 LMKHGDRFLSSLSSPALAGDPSAINRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFF 144 Query: 288 AFPFYSLLTRAPWFSWNSKLVADVIALLYKQERFREADALISETVSKLDIRERELCTFYC 467 A YS +T A WF WN KL+A++IALL KQERF E++ L+S VS+L ER+ F C Sbjct: 145 ALSLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLC 204 Query: 468 CLIEFHSKHKSKQGVFDSYTSLKQILSCSSSVYLKRRAYESMICSLCVIDLPREAENMME 647 L+E +SK S QG ++ L++I+ SSSVY+K +AY+SM+ LC +D P +AE ++E Sbjct: 205 NLVESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIE 264 Query: 648 EMKVQGITSKPSLFELRSIVYAYGRVGLFKDMKRLLFEMESEGIKMDTVCFNMVLSSLGA 827 EM+++ I KP LFE +S++Y YGR+GLF DM R++ M +EG K+DTVC NMVLSS GA Sbjct: 265 EMRMEKI--KPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGA 322 Query: 828 HAELLEMVSWLQRMKYLNVPLSIRTYNTVLNSCPSIMSMLQDPKSIPLSLNELLADLCDN 1007 H L +M SWLQ++K NVP SIRTYN+VLNSCP+I+SML+D S P+SL+EL L ++ Sbjct: 323 HDALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLNED 382 Query: 1008 ESMLVQELIVSSVLVEAMGWTSLELKLDLHGMHLSSAYLIILQWFDELRTRFEHVETDIP 1187 E++LV EL SSVL EA+ W ++E KLDLHGMHLSS+YLI+LQW DE R RF + IP Sbjct: 383 EALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSEEKCVIP 442 Query: 1188 AEVRIICGSGKHSTVRGDSPVKHLVKELMVRTESPLRIDRKNIGCFIAKGKVCKSWLC 1361 AE+ ++ GSGKHS VRG+SPVK LVK++MVRT SP+RIDRKN+G FIAKGK K WLC Sbjct: 443 AEIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWLC 500 >ref|NP_565402.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|13877877|gb|AAK44016.1|AF370201_1 unknown protein [Arabidopsis thaliana] gi|21280879|gb|AAM44931.1| unknown protein [Arabidopsis thaliana] gi|330251481|gb|AEC06575.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 504 Score = 464 bits (1194), Expect = e-128 Identities = 237/418 (56%), Positives = 310/418 (74%), Gaps = 1/418 (0%) Frame = +3 Query: 111 LSKQGQRFFTSLAT-SVTGDPLVTDRLIRKFIASSSKSVALNALSHLLTPNSSHSHLSSL 287 L K G RF +SL++ ++ GDP +R I+KF+A+S KSVALN LSHLL+ +SH HLS Sbjct: 88 LMKHGDRFLSSLSSPALAGDPSAINRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFF 147 Query: 288 AFPFYSLLTRAPWFSWNSKLVADVIALLYKQERFREADALISETVSKLDIRERELCTFYC 467 A YS +T A WF WN KL+A++IALL KQERF E++ L+S VS+L ER+ F C Sbjct: 148 ALSLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLC 207 Query: 468 CLIEFHSKHKSKQGVFDSYTSLKQILSCSSSVYLKRRAYESMICSLCVIDLPREAENMME 647 L+E +SK S QG ++ L++I+ SSSVY+K +AY+SM+ LC +D P +AE ++E Sbjct: 208 NLVESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIE 267 Query: 648 EMKVQGITSKPSLFELRSIVYAYGRVGLFKDMKRLLFEMESEGIKMDTVCFNMVLSSLGA 827 EM+++ I KP LFE +S++Y YGR+GLF DM R++ M +EG K+DTVC NMVLSS GA Sbjct: 268 EMRMEKI--KPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGA 325 Query: 828 HAELLEMVSWLQRMKYLNVPLSIRTYNTVLNSCPSIMSMLQDPKSIPLSLNELLADLCDN 1007 H L +M SWLQ++K NVP SIRTYN+VLNSCP+I+SML+D S P+SL+EL L ++ Sbjct: 326 HDALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLNED 385 Query: 1008 ESMLVQELIVSSVLVEAMGWTSLELKLDLHGMHLSSAYLIILQWFDELRTRFEHVETDIP 1187 E++LV EL SSVL EA+ W ++E KLDLHGMHLSS+YLI+LQW DE R RF + IP Sbjct: 386 EALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSEEKCVIP 445 Query: 1188 AEVRIICGSGKHSTVRGDSPVKHLVKELMVRTESPLRIDRKNIGCFIAKGKVCKSWLC 1361 AE+ ++ GSGKHS VRG+SPVK LVK++MVRT SP+RIDRKN+G FIAKGK K WLC Sbjct: 446 AEIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWLC 503 >ref|NP_849962.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75244359|sp|Q8GWA9.1|PP157_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At2g17033 gi|26452937|dbj|BAC43545.1| unknown protein [Arabidopsis thaliana] gi|330251482|gb|AEC06576.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 505 Score = 464 bits (1194), Expect = e-128 Identities = 237/418 (56%), Positives = 310/418 (74%), Gaps = 1/418 (0%) Frame = +3 Query: 111 LSKQGQRFFTSLAT-SVTGDPLVTDRLIRKFIASSSKSVALNALSHLLTPNSSHSHLSSL 287 L K G RF +SL++ ++ GDP +R I+KF+A+S KSVALN LSHLL+ +SH HLS Sbjct: 89 LMKHGDRFLSSLSSPALAGDPSAINRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFF 148 Query: 288 AFPFYSLLTRAPWFSWNSKLVADVIALLYKQERFREADALISETVSKLDIRERELCTFYC 467 A YS +T A WF WN KL+A++IALL KQERF E++ L+S VS+L ER+ F C Sbjct: 149 ALSLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLC 208 Query: 468 CLIEFHSKHKSKQGVFDSYTSLKQILSCSSSVYLKRRAYESMICSLCVIDLPREAENMME 647 L+E +SK S QG ++ L++I+ SSSVY+K +AY+SM+ LC +D P +AE ++E Sbjct: 209 NLVESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIE 268 Query: 648 EMKVQGITSKPSLFELRSIVYAYGRVGLFKDMKRLLFEMESEGIKMDTVCFNMVLSSLGA 827 EM+++ I KP LFE +S++Y YGR+GLF DM R++ M +EG K+DTVC NMVLSS GA Sbjct: 269 EMRMEKI--KPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGA 326 Query: 828 HAELLEMVSWLQRMKYLNVPLSIRTYNTVLNSCPSIMSMLQDPKSIPLSLNELLADLCDN 1007 H L +M SWLQ++K NVP SIRTYN+VLNSCP+I+SML+D S P+SL+EL L ++ Sbjct: 327 HDALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLNED 386 Query: 1008 ESMLVQELIVSSVLVEAMGWTSLELKLDLHGMHLSSAYLIILQWFDELRTRFEHVETDIP 1187 E++LV EL SSVL EA+ W ++E KLDLHGMHLSS+YLI+LQW DE R RF + IP Sbjct: 387 EALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSEEKCVIP 446 Query: 1188 AEVRIICGSGKHSTVRGDSPVKHLVKELMVRTESPLRIDRKNIGCFIAKGKVCKSWLC 1361 AE+ ++ GSGKHS VRG+SPVK LVK++MVRT SP+RIDRKN+G FIAKGK K WLC Sbjct: 447 AEIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWLC 504