BLASTX nr result
ID: Aconitum21_contig00023072
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Aconitum21_contig00023072
         (1817 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002277549.1| PREDICTED: pentatricopeptide repeat-containi...   632   e-179
ref|XP_002528283.1| pentatricopeptide repeat-containing protein,...   610   e-172
ref|XP_004172296.1| PREDICTED: pentatricopeptide repeat-containi...   561   e-157
ref|XP_004137053.1| PREDICTED: pentatricopeptide repeat-containi...   561   e-157
ref|NP_179705.1| pentatricopeptide repeat-containing protein [Ar...   554   e-155

>ref|XP_002277549.1| PREDICTED: pentatricopeptide repeat-containing protein
           At2g21090 [Vitis vinifera]
          Length = 612

 Score = 632 bits (1630), Expect = e-179
 Identities = 304/461 (65%), Positives = 380/461 (82%)
 Frame = +2

Query: 2    FDEMPERDVVSWNTMVIMFAKNGFFKDALLFYNQFRKLNIGFNEFSFAGVLIACVKLEEF 181
            FD+MPE+DVVSWNTMVI A+ G++ +AL FY++FR+L I N FSFAGVL CVKL+E
Sbjct: 149  FDKMPEKDVVSWNTMVIAHAQCGYWDEALRFYSEFRQLGIQCNGFSFAGVLTVCVKLKE- 207

Query: 182  WLPLTKQVHGQVLVSGFLSNLVLSSSIVDAYAKSGVMGDAQRVFDQMRVKDVLAWTTMVS 361
            + LT+QVHGQ+LV+GFLSN+VLSSS++DAY K G+MGDA+++FD+M +DVLAWTTMVS
Sbjct: 208  -VGLTRQVHGQILVAGFLSNVVLSSSVLDAYVKCGLMGDARKLFDEMSARDVLAWTTMVS 266

Query: 362  GYAKLGDMETARRLFDEIPEPNPVSWTSLISGYARTGLGFEALKLFSDMMTRGIKPDQFT 541
            GYAK GDM++A LF E+PE NPVSWT+LISGYAR G+G +AL+LF+ MM ++PDQFT
Sbjct: 267  GYAKWGDMKSANELFVEMPEKNPVSWTALISGYARNGMGHKALELFTKMMLFHVRPDQFT 326

Query: 542  FXXXXXXXXXXXXXKHGKSIHGHLIRTLFKPNAIVVSSLIDMYSKCGSLGVGRSVFDIIG 721
            F KHGK IH +L+R F+PN IVVS+LIDMYSKCGSLG+GR VFD++G
Sbjct: 327  FSSCLCACASIASLKHGKQIHAYLLRINFQPNTIVVSALIDMYSKCGSLGIGRKVFDLMG 386

Query: 722  DKLDVILWNTMISALAQHGLGEESIKLFHDMTRVGTKPDRTTLVVVLNACSHSGLVEEGI 901
            +KLDV+LWNT+ISALAQHG GEE+I++ DM R G KPD+ T VV+LNACSHSGLV++G+
Sbjct: 387  NKLDVVLWNTIISALAQHGCGEEAIQMLDDMVRSGAKPDKITFVVILNACSHSGLVQQGL 446

Query: 902  HLFESITQDYGIAADQEHYACLVDLLGRAGKFKELKDLLKKMPYKPDGKVWNALLGACKI 1081
            + FES++ DYGI QEHYACL+DLLGRAG F+E+ D L+KMPYKPD +VWNALLG C+I
Sbjct: 447  NFFESMSCDYGIVPSQEHYACLIDLLGRAGCFEEVMDQLEKMPYKPDDRVWNALLGVCRI 506

Query: 1082 HGNIELGSIAAEHLIDQEPQSSAAYVLLSNMYALTGKWESVEKVRHLMNERQVKKEQAIS 1261
            HG+IELG AAE LI+ EPQSS AYVLLS++YA+ G+WESV+KVR LMNERQVKKE+AIS
Sbjct: 507  HGHIELGRKAAERLIELEPQSSTAYVLLSSIYAVLGRWESVQKVRQLMNERQVKKERAIS 566

Query: 1262 WVEVESKVHSFSVFDHLHPSKDEIYLALEQLTGQMDDETSL 1384
            W+E+E+KVHSFSV D HP K++IY LEQL GQM+++ SL
Sbjct: 567  WLEIENKVHSFSVSDSSHPLKEQIYSVLEQLAGQMEEDASL 607

 Score = 145 bits (366), Expect = 3e-32
 Identities = 90/329 (27%), Positives = 162/329 (49%), Gaps = 32/329 (9%)
 Frame = +2

Query: 197  KQVHGQVLVSGFLS-NLVLSSSIVDAYAKSGVMGDAQRVFDQMRVKDVLAWTTMVSGYAK 373
            K+VH + ++G LS+ +++ YAK G +A++VFD+M +++ +W M+SGYAK
Sbjct: 79   KRVHLHLKLTGLKRPGTFLSNHLINMYAKCGKEVEARKVFDKMSARNLYSWNNMLSGYAK 138

Query: 374  LGDMETARRLFDEIPEPNPVSWTSLISGYARTGLGFEALKLFSDMMTRGIKPDQFTFXXX 553
            LG ++ AR+LFD++PE + VSW +++ +A+ G EAL+ +S+ GI+ + F+F
Sbjct: 139  LGMIKPARKLFDKMPEKDVVSWNTMVIAHAQCGYWDEALRFYSEFRQLGIQCNGFSFAGV 198

Query: 554  XXXXXXXXXXKHGKSIHGHLIRTLFKPNAIVVSSLIDMYSKCGSLGVGRSVFDIIGDKLD 733
            + +HG ++ F N ++ SS++D Y KCG +G R +FD + + D
Sbjct: 199  LTVCVKLKEVGLTRQVHGQILVAGFLSNVVLSSSVLDAYVKCGLMGDARKLFDEMSAR-D 257

Query: 734  VILWNTM-------------------------------ISALAQHGLGEESIKLFHDMTR 820
            V+ W TM IS A++G+G ++++LF M
Sbjct: 258  VLAWTTMVSGYAKWGDMKSANELFVEMPEKNPVSWTALISGYARNGMGHKALELFTKMML 317

Query: 821  VGTKPDRTTLVVVLNACSHSGLVEEGIHLFESITQDYGIAADQEHYACLVDLLGRAGKFK 1000
            +PD+ T L AC+ ++ G + + + + + L+D+ + G
Sbjct: 318  FHVRPDQFTFSSCLCACASIASLKHGKQIHAYLLR-INFQPNTIVVSALIDMYSKCGSLG 376

Query: 1001 ELKDLLKKMPYKPDGKVWNALLGACKIHG 1087
            + + M K D +WN ++ A HG
Sbjct: 377  IGRKVFDLMGNKLDVVLWNTIISALAQHG 405

 Score = 58.2 bits (139), Expect = 7e-06
 Identities = 42/161 (26%), Positives = 69/161 (42%), Gaps = 31/161 (19%)
 Frame = +2

Query: 482  EALKLFSDMMTRGIKPDQFTFXXXXXXXXXXXXXKHGKSIHGHLIRT-LFKPNAIVVSSL 658
            EA+ ++ RG++ D T + GK +H HL T L +P + + L
Sbjct: 42   EAVSSLENLARRGLRLDSRTLASLLQHCADSRALREGKRVHLHLKLTGLKRPGTFLSNHL 101

Query: 659  IDMYSKCGSLGVGRSVFDIIG---------------------------DKL---DVILWN 748
            I+MY+KCG R VFD + DK+ DV+ WN
Sbjct: 102  INMYAKCGKEVEARKVFDKMSARNLYSWNNMLSGYAKLGMIKPARKLFDKMPEKDVVSWN 161

Query: 749  TMISALAQHGLGEESIKLFHDMTRVGTKPDRTTLVVVLNAC 871
            TM+ A AQ G +E+++ + + ++G + + + VL C
Sbjct: 162  TMVIAHAQCGYWDEALRFYSEFRQLGIQCNGFSFAGVLTVC 202

>ref|XP_002528283.1| pentatricopeptide repeat-containing protein, putative
           [Ricinus communis]
 gi|223532320|gb|EEF34121.1| pentatricopeptide repeat-containing protein,
           putative [Ricinus communis]
          Length = 602

 Score = 610 bits (1573), Expect = e-172
 Identities = 294/460 (63%), Positives = 370/460 (80%)
 Frame = +2

Query: 2    FDEMPERDVVSWNTMVIMFAKNGFFKDALLFYNQFRKLNIGFNEFSFAGVLIACVKLEEF 181
            FD+MPE+DVVSWNTMVI +AK+GF DAL FY + R+L IG+NE+SFAG+L CVK++E
Sbjct: 140  FDKMPEKDVVSWNTMVIAYAKSGFCNDALRFYRELRRLGIGYNEYSFAGLLNICVKVKE- 198

Query: 182  WLPLTKQVHGQVLVSGFLSNLVLSSSIVDAYAKSGVMGDAQRVFDQMRVKDVLAWTTMVS 361
            L L+KQ HGQVLV+GFLSNLV+SSS++DAYAK MGDA+R+FD+M ++DVLAWTTMVS
Sbjct: 199  -LELSKQAHGQVLVAGFLSNLVISSSVLDAYAKCSEMGDARRLFDEMIIRDVLAWTTMVS 257

Query: 362  GYAKLGDMETARRLFDEIPEPNPVSWTSLISGYARTGLGFEALKLFSDMMTRGIKPDQFT 541
            GYA+ GD+E AR LFD +PE NPV+WTSLI+GYAR LG +AL+LF+ MM I+PDQFT
Sbjct: 258  GYAQWGDVEAARELFDLMPEKNPVAWTSLIAGYARHDLGHKALELFTKMMALNIRPDQFT 317

Query: 542  FXXXXXXXXXXXXXKHGKSIHGHLIRTLFKPNAIVVSSLIDMYSKCGSLGVGRSVFDIIG 721
            F HGK IHG+LIRT +PN IVVSSLIDMYSKCG L VGR VFD++G
Sbjct: 318  FSSCLCASASIASLNHGKQIHGYLIRTNIRPNTIVVSSLIDMYSKCGCLEVGRLVFDLMG 377

Query: 722  DKLDVILWNTMISALAQHGLGEESIKLFHDMTRVGTKPDRTTLVVVLNACSHSGLVEEGI 901
            DK DV+LWNT+IS+LAQHG G+E+I++F DM R+G KPDR TL+V+LNACSHSGLV+EG+
Sbjct: 378  DKWDVVLWNTIISSLAQHGRGQEAIQMFDDMVRLGMKPDRITLIVLLNACSHSGLVQEGL 437

Query: 902  HLFESITQDYGIAADQEHYACLVDLLGRAGKFKELKDLLKKMPYKPDGKVWNALLGACKI 1081
            L+ESIT +G+ +QEHYACL+DLLGRAG F L + L+KMP KP+ ++WNALLG C++
Sbjct: 438  RLYESITSCHGVIPNQEHYACLIDLLGRAGHFDTLMNQLEKMPCKPNDEIWNALLGVCRM 497

Query: 1082 HGNIELGSIAAEHLIDQEPQSSAAYVLLSNMYALTGKWESVEKVRHLMNERQVKKEQAIS 1261
            HGNIE G AE +I+ +PQSSAAYVLLS+++A G+WE VE VR LMNER V+K++AIS
Sbjct: 498  HGNIEFGREVAEKIIELDPQSSAAYVLLSSIHAAVGRWELVENVRQLMNERHVRKDRAIS 557

Query: 1262 WVEVESKVHSFSVFDHLHPSKDEIYLALEQLTGQMDDETS 1381
            W+E+E+KVHSF+ D LHP K+ IYLAL+QL G M++ S
Sbjct: 558  WIEIENKVHSFTASDRLHPLKEVIYLALKQLAGHMEEVLS 597

 Score = 142 bits (359), Expect = 2e-31
 Identities = 89/332 (26%), Positives = 164/332 (49%), Gaps = 31/332 (9%)
 Frame = +2

Query: 185  LPLTKQVHGQVLVSGFLS-NLVLSSSIVDAYAKSGVMGDAQRVFDQMRVKDVLAWTTMVS 361
            L L K VH + V+G N L++ +++ Y+K G A +VFD+M +++ +W M+S
Sbjct: 66   LKLGKWVHLHLKVTGLKRPNTFLANHLINMYSKCGDYPSAYKVFDEMSTRNLYSWNGMLS 125

Query: 362  GYAKLGDMETARRLFDEIPEPNPVSWTSLISGYARTGLGFEALKLFSDMMTRGIKPDQFT 541
            GYAKLG ++ AR+LFD++PE + VSW +++ YA++G +AL+ + ++ GI ++++
Sbjct: 126  GYAKLGKIKPARKLFDKMPEKDVVSWNTMVIAYAKSGFCNDALRFYRELRRLGIGYNEYS 185

Query: 542  FXXXXXXXXXXXXXKHGKSIHGHLIRTLFKPNAIVVSSLIDMYSKCGSLGVGRSVFD--I 715
            F + K HG ++ F N ++ SS++D Y+KC +G R +FD I
Sbjct: 186  FAGLLNICVKVKELELSKQAHGQVLVAGFLSNLVISSSVLDAYAKCSEMGDARRLFDEMI 245

Query: 716  IGDKL----------------------------DVILWNTMISALAQHGLGEESIKLFHD 811
            I D L + + W ++I+ A+H LG ++++LF
Sbjct: 246  IRDVLAWTTMVSGYAQWGDVEAARELFDLMPEKNPVAWTSLIAGYARHDLGHKALELFTK 305

Query: 812  MTRVGTKPDRTTLVVVLNACSHSGLVEEGIHLFESITQDYGIAADQEHYACLVDLLGRAG 991
            M + +PD+ T L A + + G + + + I + + L+D+ + G
Sbjct: 306  MMALNIRPDQFTFSSCLCASASIASLNHGKQIHGYLIRT-NIRPNTIVVSSLIDMYSKCG 364

Query: 992  KFKELKDLLKKMPYKPDGKVWNALLGACKIHG 1087
            + + + M K D +WN ++ + HG
Sbjct: 365  CLEVGRLVFDLMGDKWDVVLWNTIISSLAQHG 396

>ref|XP_004172296.1| PREDICTED: pentatricopeptide repeat-containing protein
           At2g21090-like [Cucumis sativus]
          Length = 611

 Score = 561 bits (1447), Expect = e-157
 Identities = 276/458 (60%), Positives = 350/458 (76%)
 Frame = +2

Query: 2    FDEMPERDVVSWNTMVIMFAKNGFFKDALLFYNQFRKLNIGFNEFSFAGVLIACVKLEEF 181
            FD M E+DVVSWNT+V+ +AK G F +A+ Y FR+L++GFN FSFAGVLI CVKL+E
Sbjct: 152  FDRMMEKDVVSWNTIVLAYAKQGCFNEAIGLYRDFRRLDMGFNAFSFAGVLILCVKLKE- 210

Query: 182  WLPLTKQVHGQVLVSGFLSNLVLSSSIVDAYAKSGVMGDAQRVFDQMRVKDVLAWTTMVS 361
            L L KQVHGQVLV+GFLSNLVLSSSIVDAY+K G M A+ +FD+M VKD+ AWTT+VS
Sbjct: 211  -LQLAKQVHGQVLVAGFLSNLVLSSSIVDAYSKCGEMRCARTLFDEMLVKDIHAWTTIVS 269

Query: 362  GYAKLGDMETARRLFDEIPEPNPVSWTSLISGYARTGLGFEALKLFSDMMTRGIKPDQFT 541
            GYAK GDM +A LF ++PE NPVSW++LISGYAR LG EAL F+ MM GI P+Q+T
Sbjct: 270  GYAKWGDMNSASELFHQMPEKNPVSWSALISGYARNSLGHEALDYFTKMMKFGINPEQYT 329

Query: 542  FXXXXXXXXXXXXXKHGKSIHGHLIRTLFKPNAIVVSSLIDMYSKCGSLGVGRSVFDIIG 721
            F KHGK +HG+LIRT F+ N IVVSSLIDMYSKCG L VF ++G
Sbjct: 330  FSSCLCACASIAALKHGKQVHGYLIRTYFRCNTIVVSSLIDMYSKCGMLEASCCVFHLMG 389

Query: 722  DKLDVILWNTMISALAQHGLGEESIKLFHDMTRVGTKPDRTTLVVVLNACSHSGLVEEGI 901
            +K DV++WNTMISALAQ+G GE+++++F+DM G KPDR T +V+L+ACSHSGLV+EG+
Sbjct: 390  NKQDVVVWNTMISALAQNGHGEKAMQMFNDMVESGLKPDRITFIVILSACSHSGLVQEGL 449

Query: 902  HLFESITQDYGIAADQEHYACLVDLLGRAGKFKELKDLLKKMPYKPDGKVWNALLGACKI 1081
            F+++T D+G+ DQEHYACL+DLLGRAG F EL + L+ M KPD +VW+ALLG C+I
Sbjct: 450  RFFKAMTYDHGVFPDQEHYACLIDLLGRAGCFVELVNELENMSCKPDDRVWSALLGVCRI 509

Query: 1082 HGNIELGSIAAEHLIDQEPQSSAAYVLLSNMYALTGKWESVEKVRHLMNERQVKKEQAIS 1261
            H NIELG AE +I+ +PQSSAAYV L+++YA GKWESVEKVR LM+E+ ++KE+ IS
Sbjct: 510  HNNIELGRKVAERVIELKPQSSAAYVSLASLYAFLGKWESVEKVRELMDEKFIRKERGIS 569

Query: 1262 WVEVESKVHSFSVFDHLHPSKDEIYLALEQLTGQMDDE 1375
            W++V +K HSF D LHP K+EIYL LEQL +++
Sbjct: 570  WIDVGNKTHSFIASDRLHPLKEEIYLLLEQLARHTEED 607

 Score = 141 bits (355), Expect = 7e-31
 Identities = 96/377 (25%), Positives = 177/377 (46%), Gaps = 31/377 (8%)
 Frame = +2

Query: 59   AKNGFFKDALLFYNQFRKLNIGFNEFSFAGVLIACVKLEEFWLPLTKQVHGQVLVSGFLS 238
            + G +AL + ++ + I F +L C K + F K VH + +GF
Sbjct: 38   SSQGRLPEALSYLDRLAQRGIRLPTGIFVDLLRLCAKAKYF--KGGKCVHLHLKHTGFKR 95

Query: 239  -NLVLSSSIVDAYAKSGVMGDAQRVFDQMRVKDVLAWTTMVSGYAKLGDMETARRLFDEI 415
            ++++ ++ Y + G +A++VFD+M V+++ +W M++GYAKLGD+ AR+LFD +
Sbjct: 96   PTTIVANHLIGMYFECGRDVEARKVFDKMSVRNLYSWNHMLAGYAKLGDVNNARKLFDRM 155

Query: 416  PEPNPVSWTSLISGYARTGLGFEALKLFSDMMTRGIKPDQFTFXXXXXXXXXXXXXKHGK 595
            E + VSW +++ YA+ G EA+ L+ D + + F+F + K
Sbjct: 156  MEKDVVSWNTIVLAYAKQGCFNEAIGLYRDFRRLDMGFNAFSFAGVLILCVKLKELQLAK 215

Query: 596  SIHGHLIRTLFKPNAIVVSSLIDMYSKCGSLGVGRSVFDII------------------G 721
            +HG ++ F N ++ SS++D YSKCG + R++FD + G
Sbjct: 216  QVHGQVLVAGFLSNLVLSSSIVDAYSKCGEMRCARTLFDEMLVKDIHAWTTIVSGYAKWG 275

Query: 722  D------------KLDVILWNTMISALAQHGLGEESIKLFHDMTRVGTKPDRTTLVVVLN 865
            D + + + W+ +IS A++ LG E++ F M + G P++ T L
Sbjct: 276  DMNSASELFHQMPEKNPVSWSALISGYARNSLGHEALDYFTKMMKFGINPEQYTFSSCLC 335

Query: 866  ACSHSGLVEEGIHLFESITQDYGIAADQEHYACLVDLLGRAGKFKELKDLLKKMPYKPDG 1045
            AC+ ++ G + + + Y + + L+D+ + G + + M K D
Sbjct: 336  ACASIAALKHGKQVHGYLIRTY-FRCNTIVVSSLIDMYSKCGMLEASCCVFHLMGNKQDV 394

Query: 1046 KVWNALLGACKIHGNIE 1096
            VWN ++ A +G+ E
Sbjct: 395  VVWNTMISALAQNGHGE 411

>ref|XP_004137053.1| PREDICTED: pentatricopeptide repeat-containing protein
           At2g21090-like [Cucumis sativus]
          Length = 611

 Score = 561 bits (1447), Expect = e-157
 Identities = 276/458 (60%), Positives = 350/458 (76%)
 Frame = +2

Query: 2    FDEMPERDVVSWNTMVIMFAKNGFFKDALLFYNQFRKLNIGFNEFSFAGVLIACVKLEEF 181
            FD M E+DVVSWNT+V+ +AK G F +A+ Y FR+L++GFN FSFAGVLI CVKL+E
Sbjct: 152  FDRMMEKDVVSWNTIVLAYAKQGCFNEAIGLYRDFRRLDMGFNAFSFAGVLILCVKLKE- 210

Query: 182  WLPLTKQVHGQVLVSGFLSNLVLSSSIVDAYAKSGVMGDAQRVFDQMRVKDVLAWTTMVS 361
            L L KQVHGQVLV+GFLSNLVLSSSIVDAYAK G M A+ +FD+M VKD+ AWTT+VS
Sbjct: 211  -LQLAKQVHGQVLVAGFLSNLVLSSSIVDAYAKCGEMRCARTLFDEMLVKDIHAWTTIVS 269

Query: 362  GYAKLGDMETARRLFDEIPEPNPVSWTSLISGYARTGLGFEALKLFSDMMTRGIKPDQFT 541
            GYAK GDM +A LF ++PE NPVSW++LISGYAR LG EAL F+ MM GI P+Q+T
Sbjct: 270  GYAKWGDMNSASELFHQMPEKNPVSWSALISGYARNSLGHEALDYFTKMMKFGINPEQYT 329

Query: 542  FXXXXXXXXXXXXXKHGKSIHGHLIRTLFKPNAIVVSSLIDMYSKCGSLGVGRSVFDIIG 721
            F KHGK +HG+LIRT F+ N IVVSSLIDMYSKCG L VF ++G
Sbjct: 330  FSSCLCACASIAALKHGKQVHGYLIRTYFRCNTIVVSSLIDMYSKCGMLEASCCVFHLMG 389

Query: 722  DKLDVILWNTMISALAQHGLGEESIKLFHDMTRVGTKPDRTTLVVVLNACSHSGLVEEGI 901
            +K DV++WNTMISALAQ+G GE+++++F+DM G KPDR T +V+L+ACSHSGLV+EG+
Sbjct: 390  NKQDVVVWNTMISALAQNGHGEKAMQMFNDMVESGLKPDRITFIVILSACSHSGLVQEGL 449

Query: 902  HLFESITQDYGIAADQEHYACLVDLLGRAGKFKELKDLLKKMPYKPDGKVWNALLGACKI 1081
            F+++T D+G+ DQEHY+CL+DLLGRAG F EL + L+ M KPD +VW+ALLG C+I
Sbjct: 450  RFFKAMTYDHGVFPDQEHYSCLIDLLGRAGCFVELVNELENMSCKPDDRVWSALLGVCRI 509

Query: 1082 HGNIELGSIAAEHLIDQEPQSSAAYVLLSNMYALTGKWESVEKVRHLMNERQVKKEQAIS 1261
            H NIELG AE +I+ +PQSSAAYV L+++YA GKWESVEKVR LM+E+ ++KE+ IS
Sbjct: 510  HNNIELGRKVAERVIELKPQSSAAYVSLASLYAFLGKWESVEKVRELMDEKFIRKERGIS 569

Query: 1262 WVEVESKVHSFSVFDHLHPSKDEIYLALEQLTGQMDDE 1375
            W++V +K HSF D LHP K+EIYL LEQL +++
Sbjct: 570  WIDVGNKTHSFIASDRLHPLKEEIYLLLEQLARHTEED 607

 Score = 139 bits (351), Expect = 2e-30
 Identities = 94/377 (24%), Positives = 177/377 (46%), Gaps = 31/377 (8%)
 Frame = +2

Query: 59   AKNGFFKDALLFYNQFRKLNIGFNEFSFAGVLIACVKLEEFWLPLTKQVHGQVLVSGFLS 238
            + G +AL + ++ + + F +L C K + F K VH + +GF
Sbjct: 38   SSQGRLPEALSYLDRLAQRGVRLPTGIFVDLLRLCAKAKYF--KGGKCVHLHLKHTGFKR 95

Query: 239  -NLVLSSSIVDAYAKSGVMGDAQRVFDQMRVKDVLAWTTMVSGYAKLGDMETARRLFDEI 415
            ++++ ++ Y + G +A++VFD+M V+++ +W M++GYAKLGD+ AR+LFD +
Sbjct: 96   PTTIVANHLIGMYFECGRDVEARKVFDKMSVRNLYSWNHMLAGYAKLGDVNNARKLFDRM 155

Query: 416  PEPNPVSWTSLISGYARTGLGFEALKLFSDMMTRGIKPDQFTFXXXXXXXXXXXXXKHGK 595
            E + VSW +++ YA+ G EA+ L+ D + + F+F + K
Sbjct: 156  MEKDVVSWNTIVLAYAKQGCFNEAIGLYRDFRRLDMGFNAFSFAGVLILCVKLKELQLAK 215

Query: 596  SIHGHLIRTLFKPNAIVVSSLIDMYSKCGSLGVGRSVFDII------------------G 721
            +HG ++ F N ++ SS++D Y+KCG + R++FD + G
Sbjct: 216  QVHGQVLVAGFLSNLVLSSSIVDAYAKCGEMRCARTLFDEMLVKDIHAWTTIVSGYAKWG 275

Query: 722  D------------KLDVILWNTMISALAQHGLGEESIKLFHDMTRVGTKPDRTTLVVVLN 865
            D + + + W+ +IS A++ LG E++ F M + G P++ T L
Sbjct: 276  DMNSASELFHQMPEKNPVSWSALISGYARNSLGHEALDYFTKMMKFGINPEQYTFSSCLC 335

Query: 866  ACSHSGLVEEGIHLFESITQDYGIAADQEHYACLVDLLGRAGKFKELKDLLKKMPYKPDG 1045
            AC+ ++ G + + + Y + + L+D+ + G + + M K D
Sbjct: 336  ACASIAALKHGKQVHGYLIRTY-FRCNTIVVSSLIDMYSKCGMLEASCCVFHLMGNKQDV 394

Query: 1046 KVWNALLGACKIHGNIE 1096
            VWN ++ A +G+ E
Sbjct: 395  VVWNTMISALAQNGHGE 411

>ref|NP_179705.1| pentatricopeptide repeat-containing protein [Arabidopsis
           thaliana]
 gi|75206523|sp|Q9SKQ4.1|PP167_ARATH RecName: Full=Pentatricopeptide
           repeat-containing protein At2g21090
 gi|4803934|gb|AAD29807.1| unknown protein [Arabidopsis thaliana]
 gi|330252028|gb|AEC07122.1| pentatricopeptide repeat-containing protein
           [Arabidopsis thaliana]
          Length = 597

 Score = 554 bits (1427), Expect = e-155
 Identities = 273/462 (59%), Positives = 344/462 (74%), Gaps = 2/462 (0%)
 Frame = +2

Query: 2    FDEMPERDVVSWNTMVIMFAKNGFFKDALLFYNQFRKLNIGFNEFSFAGVLIACVKLEEF 181
            FD MPERDVVSWNTMVI +A++G +AL FY +FR+ I FNEFSFAG+L ACVK +
Sbjct: 136  FDSMPERDVVSWNTMVIGYAQDGNLHEALWFYKEFRRSGIKFNEFSFAGLLTACVKSRQ- 194

Query: 182  WLPLTKQVHGQVLVSGFLSNLVLSSSIVDAYAKSGVMGDAQRVFDQMRVKDVLAWTTMVS 361
            L L +Q HGQVLV+GFLSN+VLS SI+DAYAK G M A+R FD+M VKD+ WTT++S
Sbjct: 195  -LQLNRQAHGQVLVAGFLSNVVLSCSIIDAYAKCGQMESAKRCFDEMTVKDIHIWTTLIS 253

Query: 362  GYAKLGDMETARRLFDEIPEPNPVSWTSLISGYARTGLGFEALKLFSDMMTRGIKPDQFT 541
            GYAKLGDME A +LF E+PE NPVSWT+LI+GY R G G AL LF M+ G+KP+QFT
Sbjct: 254  GYAKLGDMEAAEKLFCEMPEKNPVSWTALIAGYVRQGSGNRALDLFRKMIALGVKPEQFT 313

Query: 542  FXXXXXXXXXXXXXKHGKSIHGHLIRTLFKPNAIVVSSLIDMYSKCGSLGVGRSVFDIIG 721
            F +HGK IHG++IRT +PNAIV+SSLIDMYSK GSL VF I
Sbjct: 314  FSSCLCASASIASLRHGKEIHGYMIRTNVRPNAIVISSLIDMYSKSGSLEASERVFRICD 373

Query: 722  DKLDVILWNTMISALAQHGLGEESIKLFHDMTRVGTKPDRTTLVVVLNACSHSGLVEEGI 901
            DK D + WNTMISALAQHGLG +++++ DM + +P+RTTLVV+LNACSHSGLVEEG+
Sbjct: 374  DKHDCVFWNTMISALAQHGLGHKALRMLDDMIKFRVQPNRTTLVVILNACSHSGLVEEGL 433

Query: 902  HLFESITQDYGIAADQEHYACLVDLLGRAGKFKELKDLLKKMPYKPDGKVWNALLGACKI 1081
            FES+T +GI DQEHYACL+DLLGRAG FKEL +++MP++PD +WNA+LG C+I
Sbjct: 434  RWFESMTVQHGIVPDQEHYACLIDLLGRAGCFKELMRKIEEMPFEPDKHIWNAILGVCRI 493

Query: 1082 HGNIELGSIAAEHLIDQEPQSSAAYVLLSNMYALTGKWESVEKVRHLMNERQVKKEQAIS 1261
            HGN ELG AA+ LI +P+SSA Y+LLS++YA GKWE VEK+R +M +R+V KE+A+S
Sbjct: 494  HGNEELGKKAADELIKLDPESSAPYILLSSIYADHGKWELVEKLRGVMKKRRVNKEKAVS 553

Query: 1262 WVEVESKVHSFSVFD--HLHPSKDEIYLALEQLTGQMDDETS 1381
            W+E+E KV +F+V D H H K+EIY L L +++E S
Sbjct: 554  WIEIEKKVEAFTVSDGSHAHARKEEIYFILHNLAAVIEEEAS 595

 Score = 138 bits (347), Expect = 6e-30
 Identities = 99/363 (27%), Positives = 166/363 (45%), Gaps = 39/363 (10%)
 Frame = +2

Query: 197  KQVHGQVLVSGFLS-NLVLSSSIVDAYAKSGVMGDAQRVFDQMRVKDVLAWTTMVSGYAK 373
            K +H + ++GF N +LS+ ++ Y K G DA +VFDQM ++++ +W MVSGY K
Sbjct: 66   KWIHRHLKITGFKRPNTLLSNHLIGMYMKCGKPIDACKVFDQMHLRNLYSWNNMVSGYVK 125

Query: 374  LGDMETARRLFDEIPEPNPVSWTSLISGYARTGLGFEALKLFSDMMTRGIKPDQFTFXXX 553
            G + AR +FD +PE + VSW +++ GYA+ G EAL + + GIK ++F+F
Sbjct: 126  SGMLVRARVVFDSMPERDVVSWNTMVIGYAQDGNLHEALWFYKEFRRSGIKFNEFSFAGL 185

Query: 554  XXXXXXXXXXKHGKSIHGHLIRTLFKPNAIVVSSLIDMYSKCGSLGVGRSVFDIIGDKLD 733
            + + HG ++ F N ++ S+ID Y+KCG + + FD + K D
Sbjct: 186  LTACVKSRQLQLNRQAHGQVLVAGFLSNVVLSCSIIDAYAKCGQMESAKRCFDEMTVK-D 244

Query: 734  VILWNTMISALA-------------------------------QHGLGEESIKLFHDMTR 820
            + +W T+IS A + G G ++ LF M
Sbjct: 245  IHIWTTLISGYAKLGDMEAAEKLFCEMPEKNPVSWTALIAGYVRQGSGNRALDLFRKMIA 304

Query: 821  VGTKPDRTTLVVVLNACSHSGLVEEG--IHLFESITQDYGIAADQEHYACLVDLLGRAGK 994
            +G KP++ T L A + + G IH + T + + + L+D+ ++G
Sbjct: 305  LGVKPEQFTFSSCLCASASIASLRHGKEIHGYMIRT---NVRPNAIVISSLIDMYSKSGS 361

Query: 995  FKELKDLLKKMPYKPDGKVWNALLGACKIHGNIELGSIAAEHLIDQ-----EPQSSAAYV 1159
            + + + + K D WN ++ A HG LG A L D +P + V
Sbjct: 362  LEASERVFRICDDKHDCVFWNTMISALAQHG---LGHKALRMLDDMIKFRVQPNRTTLVV 418

Query: 1160 LLS 1168
            +L+
Sbjct: 419  ILN 421