BLASTX nr result
ID: Scutellaria23_contig00000601
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Scutellaria23_contig00000601 (2965 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002280968.2| PREDICTED: pentatricopeptide repeat-containi... 971 0.0 ref|XP_004152769.1| PREDICTED: pentatricopeptide repeat-containi... 905 0.0 ref|XP_004168675.1| PREDICTED: pentatricopeptide repeat-containi... 875 0.0 ref|XP_003615696.1| Pentatricopeptide repeat-containing protein ... 874 0.0 ref|NP_173402.2| pentatricopeptide repeat-containing protein [Ar... 871 0.0 >ref|XP_002280968.2| PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like [Vitis vinifera] Length = 1545 Score = 971 bits (2511), Expect = 0.0 Identities = 495/886 (55%), Positives = 637/886 (71%), Gaps = 21/886 (2%) Frame = -2 Query: 2934 MESIVLPCKT----ILPSK-----------FQEKPQTAKLYTRFTP---PKPTNDDYLKR 2809 ME+++LPCK+ PSK Q + K++ TP PK T D +L Sbjct: 1 MENLILPCKSRPPLATPSKQGTSFECSSRIIQPRVSFTKIHQPLTPKLKPKVT-DAHLNH 59 Query: 2808 LCKLGRLSEAISALDSCA---STVRPSTLSYLIESCIYSSSLDLCYKIHARITKLIKQPD 2638 LCK GRL++AI+ LD+ A S V+P+T L++SCI S +L K+HARI L+++ + Sbjct: 60 LCKNGRLADAIACLDAIAQGGSNVKPNTYMQLLQSCIDQGSAELGRKLHARIG-LLEEMN 118 Query: 2637 PFIETKLVGMFAKCGSLDDAFAVFDEMRQRDLYAWSAIIGACSREKMWGDVVELFYWMMD 2458 PF+ETKLV M+AKCGSL +A VF EMR+R+LYAWSA+IGA SRE+MW +VV+ F++MM+ Sbjct: 119 PFVETKLVSMYAKCGSLGEARKVFGEMRERNLYAWSAMIGAYSREQMWREVVQHFFFMME 178 Query: 2457 DGGVIPDSFLLPKILQACANLGDVETGKMIHGIVIKCGMSGELRVNNSILSVYAKCGRLS 2278 DG ++PD FLLPKILQAC N GD ETGK+IH +VI+CGM+ +RV+NSIL+VYAKCGRLS Sbjct: 179 DG-IVPDEFLLPKILQACGNCGDAETGKLIHSLVIRCGMNFNIRVSNSILAVYAKCGRLS 237 Query: 2277 SAKRFFEGMGVNDKVSWNTMINGYCEANKMTEAQKLFELMQEVDVEPDVVTWNVMISSYS 2098 A+RFFE M D+VSWN++I GYC+ ++ ++ +LFE MQE +EP +VTWN++I+SYS Sbjct: 238 CARRFFENMDYRDRVSWNSIITGYCQKGELEKSHQLFEKMQEEGIEPGLVTWNILINSYS 297 Query: 2097 QSGKCDVAMKLLSEMGIRGIKADVFTWTSMISGFAHNNKGLDALNLFKEMLLGGVNPNGI 1918 QSGKCD AM+L+ +M I DVFTWTSMISGFA NN+ AL LF+EMLL G+ PNG+ Sbjct: 298 QSGKCDDAMELMKKMESFRIVPDVFTWTSMISGFAQNNRRSQALELFREMLLAGIEPNGV 357 Query: 1917 TLVSAISACSLVKDFRKGKEVHLAAIKLGYSEDVLVCNSLIDMYCKCGKLEVAREVFDMI 1738 T+ S ISAC+ +K +KG E+H A+K+G ED+LV NSLIDMY K G+LE AR VFDMI Sbjct: 358 TVTSGISACASLKALKKGMELHSVAVKIGCVEDLLVGNSLIDMYSKSGELEDARRVFDMI 417 Query: 1737 SEKDVYTWNSMIAGYCQAGYCGVAHDLFKQMQKSDVLPNIITWNVMITGYIQKGDEDQAM 1558 +KDVYTWNSMI GYCQAGYCG A+DLF +M +SDV PN++TWN MI+GYIQ GDEDQAM Sbjct: 418 LKKDVYTWNSMIGGYCQAGYCGKAYDLFIKMHESDVPPNVVTWNAMISGYIQNGDEDQAM 477 Query: 1557 DLFHMMEKNGGIKRDIASWNALIAGYLDYGQKNKALQIFRKMQSVGFKPNYITILSILPA 1378 DLFH MEK+G IKRD ASWN+LIAGYL G KNKAL IFR+MQS +PN +T+LSILPA Sbjct: 478 DLFHRMEKDGLIKRDTASWNSLIAGYLQNGHKNKALGIFRQMQSFCIRPNSVTMLSILPA 537 Query: 1377 CANLIAMKKVKEIHCCXXXXXXXXXXXVTNSLLDTYAKSGNIKYSKAIFDGMPSIDLITW 1198 CANL+A KKVKEIH C V N L+DTYAKSGNI Y++ IF G+ S D+I+W Sbjct: 538 CANLVAAKKVKEIHGCILRRNLGSELSVANCLIDTYAKSGNIVYAQTIFQGISSKDIISW 597 Query: 1197 NTLTTAYALHGCFNEVIELFERMRKQDYRPNRSTFASVISAYALAKKVDEGRHVFSIMTE 1018 N+L Y LHGC + ++LF++M K +P+R TF S+I A++L+ VD+G+ VFS M E Sbjct: 598 NSLIAGYVLHGCSDSALDLFDQMTKMGVKPSRGTFLSIIYAFSLSGMVDKGKQVFSSMME 657 Query: 1017 DYNILPCLDHYIAMVNLFGRSGKLDEALEFIRGLPVEPDVSIWSVFLTACRRHGNVKLAI 838 DY ILP L+H+ AM++L GRSGKL EA+EFI + +EPD IW+ LTA + HGN+ LAI Sbjct: 658 DYQILPGLEHHSAMIDLLGRSGKLGEAIEFIEDMAIEPDSCIWAALLTASKIHGNIGLAI 717 Query: 837 HAGERLLELEPDNAYIRGLMLQLYDLRRISKEPLKIKRHGIRKVSSEQIGWSWIEEKNRI 658 AGE LLELEP N I +LQ+Y L ++ K+++ R + + +G SWIE KN + Sbjct: 718 RAGECLLELEPSNFSIHQQILQMYALSGKFEDVSKLRKSEKRSETKQPLGCSWIEAKNIV 777 Query: 657 HAFVTGNLCQFDGRSLHSWIKSIEVNTMESKYNDTLNIIEEEKEETAGIHSEKLALAFAI 478 H FV + + LHSWI+++ +D L I EEEKEE G+HSEKLALAFA+ Sbjct: 778 HTFVADDRSRPYFDFLHSWIENVARKVKAPDQHDRLFIEEEEKEEIGGVHSEKLALAFAL 837 Query: 477 IGSPHASRTIRIVKNLRMCDHCHRFAKLVSKTHGCEIQISDSKCLH 340 I A R++RIVKNLRMC CH AK +S + CEI +SDSKCLH Sbjct: 838 IDPSCAPRSVRIVKNLRMCGDCHGTAKFLSMLYSCEIYLSDSKCLH 883 >ref|XP_004152769.1| PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like [Cucumis sativus] Length = 1463 Score = 905 bits (2339), Expect = 0.0 Identities = 460/888 (51%), Positives = 616/888 (69%), Gaps = 18/888 (2%) Frame = -2 Query: 2934 MESIVLPCKTI------------LPSKFQEKPQTAKLYTRFTPPKPTNDDYLKRLCKLGR 2791 ME + +PC+T P KF KP ++ + NDD+L LC G Sbjct: 1 MEKLAIPCQTNPPISGPASVIKPRPLKFSSKPIKTSIFFTYKLTSKFNDDHLSYLCSNGL 60 Query: 2790 LSEAISALDSCA---STVRPSTLSYLIESCIYSSSLDLCYKIHARITKLIKQPDPFIETK 2620 L EAI+A+DS + S + +T L+++CI S++L ++H R+ L+ + +PF+ETK Sbjct: 61 LREAITAIDSISKRGSKLSTNTYINLLQTCIDVGSIELGRELHVRMG-LVHRVNPFVETK 119 Query: 2619 LVGMFAKCGSLDDAFAVFDEMRQRDLYAWSAIIGACSREKMWGDVVELFYWMMDDGGVIP 2440 LV M+AKCG L DA VFD M++R+LY WSA+IGA SRE+ W +VVELF+ MM DG V+P Sbjct: 120 LVSMYAKCGCLKDARKVFDGMQERNLYTWSAMIGAYSREQRWKEVVELFFLMMGDG-VLP 178 Query: 2439 DSFLLPKILQACANLGDVETGKMIHGIVIKCGMSGELRVNNSILSVYAKCGRLSSAKRFF 2260 D+FL PKILQAC N D+ET K+IH +VI+CG+S +R++NSIL+ + KCG+LS A++FF Sbjct: 179 DAFLFPKILQACGNCEDLETVKLIHSLVIRCGLSCYMRLSNSILTAFVKCGKLSLARKFF 238 Query: 2259 EGMGVNDKVSWNTMINGYCEANKMTEAQKLFELMQEVDVEPDVVTWNVMISSYSQSGKCD 2080 M D VSWN MI GYC+ EA++L + M +P +VT+N+MI+SYSQ G CD Sbjct: 239 GNMDERDGVSWNVMIAGYCQKGNGDEARRLLDTMSNQGFKPGLVTYNIMIASYSQLGDCD 298 Query: 2079 VAMKLLSEMGIRGIKADVFTWTSMISGFAHNNKGLDALNLFKEMLLGGVNPNGITLVSAI 1900 + + L +M G+ DV+TWTSMISGF+ +++ AL+ FK+M+L GV PN IT+ SA Sbjct: 299 LVIDLKKKMESVGLAPDVYTWTSMISGFSQSSRISQALDFFKKMILAGVEPNTITIASAT 358 Query: 1899 SACSLVKDFRKGKEVHLAAIKLGYSEDVLVCNSLIDMYCKCGKLEVAREVFDMISEKDVY 1720 SAC+ +K + G E+H AIK+G + + LV NSLIDMY KCGKLE AR VFD I EKDVY Sbjct: 359 SACASLKSLQNGLEIHCFAIKMGIARETLVGNSLIDMYSKCGKLEAARHVFDTILEKDVY 418 Query: 1719 TWNSMIAGYCQAGYCGVAHDLFKQMQKSDVLPNIITWNVMITGYIQKGDEDQAMDLFHMM 1540 TWNSMI GYCQAGY G A++LF ++++S V+PN++TWN MI+G IQ GDEDQAMDLF +M Sbjct: 419 TWNSMIGGYCQAGYGGKAYELFMRLRESTVMPNVVTWNAMISGCIQNGDEDQAMDLFQIM 478 Query: 1539 EKNGGIKRDIASWNALIAGYLDYGQKNKALQIFRKMQSVGFKPNYITILSILPACANLIA 1360 EK+GG+KR+ ASWN+LIAGY G+KNKAL IFR+MQS+ F PN +TILSILPACAN++A Sbjct: 479 EKDGGVKRNTASWNSLIAGYHQLGEKNKALAIFRQMQSLNFSPNSVTILSILPACANVMA 538 Query: 1359 MKKVKEIHCCXXXXXXXXXXXVTNSLLDTYAKSGNIKYSKAIFDGMPSIDLITWNTLTTA 1180 KK+KEIH C V NSL+DTYAKSGNIKYS+ +F+GM S D+ITWN++ Sbjct: 539 EKKIKEIHGCVLRRNLESELAVANSLVDTYAKSGNIKYSRTVFNGMSSKDIITWNSIIAG 598 Query: 1179 YALHGCFNEVIELFERMRKQDYRPNRSTFASVISAYALAKKVDEGRHVFSIMTEDYNILP 1000 Y LHGC + +LF++MR RPNR T AS+I AY +A VD+GRHVFS +TE++ ILP Sbjct: 599 YILHGCSDSAFQLFDQMRNLGIRPNRGTLASIIHAYGIAGMVDKGRHVFSSITEEHQILP 658 Query: 999 CLDHYIAMVNLFGRSGKLDEALEFIRGLPVEPDVSIWSVFLTACRRHGNVKLAIHAGERL 820 LDHY+AMV+L+GRSG+L +A+EFI +P+EPDVSIW+ LTACR HGN+ LA+ A +RL Sbjct: 659 TLDHYLAMVDLYGRSGRLADAIEFIEDMPIEPDVSIWTSLLTACRFHGNLNLAVLAAKRL 718 Query: 819 LELEPDNAYIRGLMLQLYDLRRISKEPLKIKRHGIRKVSSEQIGWSWIEEKNRIHAFVTG 640 ELEPDN I L++Q Y L ++ LK+++ G + W+E +N++H FVTG Sbjct: 719 HELEPDNHVIYRLLVQAYALYGKFEQTLKVRKLGKESAMKKCTAQCWVEVRNKVHLFVTG 778 Query: 639 NLCQFDGRSLHSWIKSIEVNTMESKYNDTLNIIEEEKEE-TAGIHSEKLALAFAIIGSPH 463 + + D L++WIKSIE + + L+I EEEKEE G H EK A AF +IGS H Sbjct: 779 DQSKLD--VLNTWIKSIEGKVKKFNNHHQLSIEEEEKEEKIGGFHCEKFAFAFGLIGSSH 836 Query: 462 ASRTIRIVKNLRMCDHCHRFAKLVSKTHGC-EIQ-ISDSKCLHHIKHG 325 ++I+IVKNLRMC CH+ AK +S + C E+Q DS+ + I G Sbjct: 837 TRKSIKIVKNLRMCVDCHQMAKYISAAYECIEVQSFDDSRKIKPIMAG 884 >ref|XP_004168675.1| PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like, partial [Cucumis sativus] Length = 1090 Score = 875 bits (2260), Expect = 0.0 Identities = 441/847 (52%), Positives = 591/847 (69%), Gaps = 16/847 (1%) Frame = -2 Query: 2934 MESIVLPCKTI------------LPSKFQEKPQTAKLYTRFTPPKPTNDDYLKRLCKLGR 2791 ME + +PC+T P KF KP ++ + NDD+L LC G Sbjct: 1 MEKLAIPCQTNPPISGPASVIKPRPLKFSSKPIKTSIFFTYKLTSKFNDDHLSYLCSNGL 60 Query: 2790 LSEAISALDSCA---STVRPSTLSYLIESCIYSSSLDLCYKIHARITKLIKQPDPFIETK 2620 L EAI+A+DS + S + +T L+++CI S++L ++H R+ L+ + +PF+ETK Sbjct: 61 LREAITAIDSISKRGSKLSTNTYINLLQTCIDVGSIELGRELHVRMG-LVHRVNPFVETK 119 Query: 2619 LVGMFAKCGSLDDAFAVFDEMRQRDLYAWSAIIGACSREKMWGDVVELFYWMMDDGGVIP 2440 LV M+AKCG L DA VFD M++R+LY WSA+IGA SRE+ W +VVELF+ MM DG V+P Sbjct: 120 LVSMYAKCGCLKDARKVFDGMQERNLYTWSAMIGAYSREQRWKEVVELFFLMMGDG-VLP 178 Query: 2439 DSFLLPKILQACANLGDVETGKMIHGIVIKCGMSGELRVNNSILSVYAKCGRLSSAKRFF 2260 D+FL PKILQAC N D+ET K+IH +VI+CG+S +R++NSIL+ + KCG+LS A++FF Sbjct: 179 DAFLFPKILQACGNCEDLETVKLIHSLVIRCGLSCYMRLSNSILTAFVKCGKLSLARKFF 238 Query: 2259 EGMGVNDKVSWNTMINGYCEANKMTEAQKLFELMQEVDVEPDVVTWNVMISSYSQSGKCD 2080 M D VSWN MI GYC+ EA++L + M +P +VT+N+MI+SYSQ G CD Sbjct: 239 GNMDERDGVSWNVMIAGYCQKGNGDEARRLLDTMSNQGFKPGLVTYNIMIASYSQLGDCD 298 Query: 2079 VAMKLLSEMGIRGIKADVFTWTSMISGFAHNNKGLDALNLFKEMLLGGVNPNGITLVSAI 1900 + + L +M G+ DV+TWTSMISGF+ +++ AL+ FK+M+L GV PN IT+ SA Sbjct: 299 LVIDLKKKMESVGLAPDVYTWTSMISGFSQSSRISQALDFFKKMILAGVEPNTITIASAT 358 Query: 1899 SACSLVKDFRKGKEVHLAAIKLGYSEDVLVCNSLIDMYCKCGKLEVAREVFDMISEKDVY 1720 SAC+ +K + G E+H AIK+G + + LV NSLIDMY KCGKLE AR VFD I EKDVY Sbjct: 359 SACASLKSLQNGLEIHCFAIKMGIARETLVGNSLIDMYSKCGKLEAARHVFDTILEKDVY 418 Query: 1719 TWNSMIAGYCQAGYCGVAHDLFKQMQKSDVLPNIITWNVMITGYIQKGDEDQAMDLFHMM 1540 TWNSMI GYCQAGY G A++LF ++++S V+PN++TWN MI+G IQ GDEDQAMDLF +M Sbjct: 419 TWNSMIGGYCQAGYGGKAYELFMRLRESTVMPNVVTWNAMISGCIQNGDEDQAMDLFQIM 478 Query: 1539 EKNGGIKRDIASWNALIAGYLDYGQKNKALQIFRKMQSVGFKPNYITILSILPACANLIA 1360 EK+GG+KR+ ASWN+LIAGY G+KNKAL IFR+MQS+ F PN +TILSILPACAN++A Sbjct: 479 EKDGGVKRNTASWNSLIAGYHQLGEKNKALAIFRQMQSLNFSPNSVTILSILPACANVMA 538 Query: 1359 MKKVKEIHCCXXXXXXXXXXXVTNSLLDTYAKSGNIKYSKAIFDGMPSIDLITWNTLTTA 1180 KK+KEIH C V NSL+DTYAKSGNIKYS+ +F+GM S D+ITWN++ Sbjct: 539 EKKIKEIHGCVLRRNLESELAVANSLVDTYAKSGNIKYSRTVFNGMSSKDIITWNSIIAG 598 Query: 1179 YALHGCFNEVIELFERMRKQDYRPNRSTFASVISAYALAKKVDEGRHVFSIMTEDYNILP 1000 Y LHGC + +LF++MR RPNR T AS+I AY +A VD+GRHVFS +TE++ ILP Sbjct: 599 YILHGCSDSAFQLFDQMRNLGIRPNRGTLASIIHAYGIAGMVDKGRHVFSSITEEHQILP 658 Query: 999 CLDHYIAMVNLFGRSGKLDEALEFIRGLPVEPDVSIWSVFLTACRRHGNVKLAIHAGERL 820 LDHY+AMV+L+GRSG+L +A+EFI +P+EPDVSIW+ LTACR HGN+ LA+ A +RL Sbjct: 659 TLDHYLAMVDLYGRSGRLADAIEFIEDMPIEPDVSIWTSLLTACRFHGNLNLAVLAAKRL 718 Query: 819 LELEPDNAYIRGLMLQLYDLRRISKEPLKIKRHGIRKVSSEQIGWSWIEEKNRIHAFVTG 640 ELEPDN I L++Q Y L ++ LK+++ G + W+E +N++H FVTG Sbjct: 719 HELEPDNHVIYRLLVQAYALYGKFEQTLKVRKLGKESAMKKCTAQCWVEVRNKVHLFVTG 778 Query: 639 NLCQFDGRSLHSWIKSIEVNTMESKYNDTLNIIEEEKEE-TAGIHSEKLALAFAIIGSPH 463 + + D L++WIKSIE + + L+I EEEKEE G H EK A AF +IGS H Sbjct: 779 DQSKLD--VLNTWIKSIEGKVKKFNNHHQLSIEEEEKEEKIGGFHCEKFAFAFGLIGSSH 836 Query: 462 ASRTIRI 442 ++I+I Sbjct: 837 TRKSIKI 843 >ref|XP_003615696.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355517031|gb|AES98654.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 887 Score = 874 bits (2257), Expect = 0.0 Identities = 449/891 (50%), Positives = 593/891 (66%), Gaps = 16/891 (1%) Frame = -2 Query: 2925 IVLPCKTILPSKFQEKPQTAKLYT---------RFTPPKPTN----DDYLKRLCKLGRLS 2785 I +P K+I P F KP + + KP+ D L +LC G LS Sbjct: 6 IFIPNKSITPLSFPNKPTKFDCISSKRVNANSNNVSTTKPSIRKLIDSQLNQLCINGSLS 65 Query: 2784 EAISALDSCAST---VRPSTLSYLIESCIYSSSLDLCYKIHARITKLIKQPDPFIETKLV 2614 EA++ LDS A V+P T L++SCI + + ++H+RI L++ +PF+ETKLV Sbjct: 66 EAVTILDSLAEQGCRVKPITYMNLLQSCIDKDCIFIGKELHSRIG-LVENVNPFVETKLV 124 Query: 2613 GMFAKCGSLDDAFAVFDEMRQRDLYAWSAIIGACSREKMWGDVVELFYWMMDDGGVIPDS 2434 M+AKCG L A VF+EM R+L+ WSA+IG CSR K WG+VV LFY MM DG V+PD Sbjct: 125 SMYAKCGLLGMARKVFNEMSVRNLFTWSAMIGGCSRNKSWGEVVGLFYAMMRDG-VLPDE 183 Query: 2433 FLLPKILQACANLGDVETGKMIHGIVIKCGMSGELRVNNSILSVYAKCGRLSSAKRFFEG 2254 FLLPK+LQAC D+ETG++IH +VI+ GM + NSI++VYAKCG + AK+ F+ Sbjct: 184 FLLPKVLQACGKCRDLETGRLIHSMVIRRGMRWSKHLRNSIMAVYAKCGEMDCAKKIFDC 243 Query: 2253 MGVNDKVSWNTMINGYCEANKMTEAQKLFELMQEVDVEPDVVTWNVMISSYSQSGKCDVA 2074 M D V+WN MI+G+C+ ++ +AQK F+ MQ+ VEP +VTWN++IS Y+Q G CD+A Sbjct: 244 MDERDSVAWNAMISGFCQNGEIGQAQKYFDAMQKDGVEPSLVTWNILISCYNQLGHCDLA 303 Query: 2073 MKLLSEMGIRGIKADVFTWTSMISGFAHNNKGLDALNLFKEMLLGGVNPNGITLVSAISA 1894 + L+ +M GI DV+TWTSMISGF + AL+L KEM L GV N IT+ SA SA Sbjct: 304 IDLMRKMEWFGIAPDVYTWTSMISGFTQKGRISHALDLLKEMFLAGVEANNITIASAASA 363 Query: 1893 CSLVKDFRKGKEVHLAAIKLGYSEDVLVCNSLIDMYCKCGKLEVAREVFDMISEKDVYTW 1714 C+ +K G E+H A+K+ ++VLV NSLIDMYCKCG L+ A+ +FDM+SE+DVY+W Sbjct: 364 CAALKSLSMGLEIHSIAVKMNLVDNVLVGNSLIDMYCKCGDLKAAQHIFDMMSERDVYSW 423 Query: 1713 NSMIAGYCQAGYCGVAHDLFKQMQKSDVLPNIITWNVMITGYIQKGDEDQAMDLFHMMEK 1534 NS+I GY QAG+CG AH+LF +MQ+SD PNIITWN+MITGY+Q G EDQA+DLF +EK Sbjct: 424 NSIIGGYFQAGFCGKAHELFMKMQESDSPPNIITWNIMITGYMQSGAEDQALDLFKSIEK 483 Query: 1533 NGGIKRDIASWNALIAGYLDYGQKNKALQIFRKMQSVGFKPNYITILSILPACANLIAMK 1354 +G KR+ ASWN+LI+G++ GQK+KALQIFR MQ PN +TILSILP CANL+A K Sbjct: 484 DGKTKRNAASWNSLISGFVQSGQKDKALQIFRNMQFCHILPNSVTILSILPVCANLVASK 543 Query: 1353 KVKEIHCCXXXXXXXXXXXVTNSLLDTYAKSGNIKYSKAIFDGMPSIDLITWNTLTTAYA 1174 KVKEIHC V+N L+D+YAKSGN+ YSK IF+ + D ++WN++ ++Y Sbjct: 544 KVKEIHCFAVRRILVSELSVSNLLIDSYAKSGNLMYSKNIFNELSWKDAVSWNSMLSSYV 603 Query: 1173 LHGCFNEVIELFERMRKQDYRPNRSTFASVISAYALAKKVDEGRHVFSIMTEDYNILPCL 994 LHGC ++LF +MRKQ +PNR TFAS++ AY A VDEG+ VFS +T+DY + + Sbjct: 604 LHGCSESALDLFYQMRKQGLQPNRGTFASILLAYGHAGMVDEGKSVFSCITKDYLVRQGM 663 Query: 993 DHYIAMVNLFGRSGKLDEALEFIRGLPVEPDVSIWSVFLTACRRHGNVKLAIHAGERLLE 814 +HY AMV L GRSGKL EAL+FI+ +P+EP+ S+W LTACR H N +A+ AG+R+LE Sbjct: 664 EHYSAMVYLLGRSGKLAEALDFIQSMPIEPNSSVWGALLTACRIHRNFGVAVLAGKRMLE 723 Query: 813 LEPDNAYIRGLMLQLYDLRRISKEPLKIKRHGIRKVSSEQIGWSWIEEKNRIHAFVTGNL 634 EP N R L+ Q Y L K + G K ++ IG SWIE N +H FV G+ Sbjct: 724 FEPGNNITRHLLSQAYSLCG------KFEPEG-EKAVNKPIGQSWIERNNVVHTFVVGDQ 776 Query: 633 CQFDGRSLHSWIKSIEVNTMESKYNDTLNIIEEEKEETAGIHSEKLALAFAIIGSPHASR 454 LHSW+K + VN ++ L I EEEKE T+ +HSEKLA AFA+I + + Sbjct: 777 SNPYLDKLHSWLKRVAVNVKTHVSDNELYIEEEEKENTSSVHSEKLAFAFALIDPHNKPQ 836 Query: 453 TIRIVKNLRMCDHCHRFAKLVSKTHGCEIQISDSKCLHHIKHGICSCGDYW 301 +RIVK LRMC CH AK +S +GCEI +SDS CLHH K G CSC DYW Sbjct: 837 ILRIVKKLRMCRDCHDTAKYISMAYGCEIYLSDSNCLHHFKGGHCSCRDYW 887 >ref|NP_173402.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75263158|sp|Q9FXH1.1|PPR52_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At1g19720; AltName: Full=Protein DYW7 gi|10086495|gb|AAG12555.1|AC007797_15 Unknown Protein [Arabidopsis thaliana] gi|332191770|gb|AEE29891.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 894 Score = 871 bits (2251), Expect = 0.0 Identities = 449/896 (50%), Positives = 598/896 (66%), Gaps = 18/896 (2%) Frame = -2 Query: 2934 MESIVLPC--KTIL----PSKFQEKPQ----TAKLYTRFTPPKPTN---DDYLKRLCKLG 2794 ME + +P KT L P+K + P+ + K FT K N D+ LC+ G Sbjct: 1 MEKLFVPSFPKTFLNYQTPAKVENSPELHPKSRKKNLSFTKKKEPNIIPDEQFDYLCRNG 60 Query: 2793 RLSEAISALDSC---ASTVRPSTLSYLIESCIYSSSLDLCYKIHARITKLIKQPDPFIET 2623 L EA ALDS S V+ ST L+ESCI S S+ L +HAR L +PD F+ET Sbjct: 61 SLLEAEKALDSLFQQGSKVKRSTYLKLLESCIDSGSIHLGRILHARFG-LFTEPDVFVET 119 Query: 2622 KLVGMFAKCGSLDDAFAVFDEMRQRDLYAWSAIIGACSREKMWGDVVELFYWMMDDGGVI 2443 KL+ M+AKCG + DA VFD MR+R+L+ WSA+IGA SRE W +V +LF MM DG V+ Sbjct: 120 KLLSMYAKCGCIADARKVFDSMRERNLFTWSAMIGAYSRENRWREVAKLFRLMMKDG-VL 178 Query: 2442 PDSFLLPKILQACANLGDVETGKMIHGIVIKCGMSGELRVNNSILSVYAKCGRLSSAKRF 2263 PD FL PKILQ CAN GDVE GK+IH +VIK GMS LRV+NSIL+VYAKCG L A +F Sbjct: 179 PDDFLFPKILQGCANCGDVEAGKVIHSVVIKLGMSSCLRVSNSILAVYAKCGELDFATKF 238 Query: 2262 FEGMGVNDKVSWNTMINGYCEANKMTEAQKLFELMQEVDVEPDVVTWNVMISSYSQSGKC 2083 F M D ++WN+++ YC+ K EA +L + M++ + P +VTWN++I Y+Q GKC Sbjct: 239 FRRMRERDVIAWNSVLLAYCQNGKHEEAVELVKEMEKEGISPGLVTWNILIGGYNQLGKC 298 Query: 2082 DVAMKLLSEMGIRGIKADVFTWTSMISGFAHNNKGLDALNLFKEMLLGGVNPNGITLVSA 1903 D AM L+ +M GI ADVFTWT+MISG HN AL++F++M L GV PN +T++SA Sbjct: 299 DAAMDLMQKMETFGITADVFTWTAMISGLIHNGMRYQALDMFRKMFLAGVVPNAVTIMSA 358 Query: 1902 ISACSLVKDFRKGKEVHLAAIKLGYSEDVLVCNSLIDMYCKCGKLEVAREVFDMISEKDV 1723 +SACS +K +G EVH A+K+G+ +DVLV NSL+DMY KCGKLE AR+VFD + KDV Sbjct: 359 VSACSCLKVINQGSEVHSIAVKMGFIDDVLVGNSLVDMYSKCGKLEDARKVFDSVKNKDV 418 Query: 1722 YTWNSMIAGYCQAGYCGVAHDLFKQMQKSDVLPNIITWNVMITGYIQKGDEDQAMDLFHM 1543 YTWNSMI GYCQAGYCG A++LF +MQ +++ PNIITWN MI+GYI+ GDE +AMDLF Sbjct: 419 YTWNSMITGYCQAGYCGKAYELFTRMQDANLRPNIITWNTMISGYIKNGDEGEAMDLFQR 478 Query: 1542 MEKNGGIKRDIASWNALIAGYLDYGQKNKALQIFRKMQSVGFKPNYITILSILPACANLI 1363 MEK+G ++R+ A+WN +IAGY+ G+K++AL++FRKMQ F PN +TILS+LPACANL+ Sbjct: 479 MEKDGKVQRNTATWNLIIAGYIQNGKKDEALELFRKMQFSRFMPNSVTILSLLPACANLL 538 Query: 1362 AMKKVKEIHCCXXXXXXXXXXXVTNSLLDTYAKSGNIKYSKAIFDGMPSIDLITWNTLTT 1183 K V+EIH C V N+L DTYAKSG+I+YS+ IF GM + D+ITWN+L Sbjct: 539 GAKMVREIHGCVLRRNLDAIHAVKNALTDTYAKSGDIEYSRTIFLGMETKDIITWNSLIG 598 Query: 1182 AYALHGCFNEVIELFERMRKQDYRPNRSTFASVISAYALAKKVDEGRHVFSIMTEDYNIL 1003 Y LHG + + LF +M+ Q PNR T +S+I A+ L VDEG+ VF + DY+I+ Sbjct: 599 GYVLHGSYGPALALFNQMKTQGITPNRGTLSSIILAHGLMGNVDEGKKVFYSIANDYHII 658 Query: 1002 PCLDHYIAMVNLFGRSGKLDEALEFIRGLPVEPDVSIWSVFLTACRRHGNVKLAIHAGER 823 P L+H AMV L+GR+ +L+EAL+FI+ + ++ + IW FLT CR HG++ +AIHA E Sbjct: 659 PALEHCSAMVYLYGRANRLEEALQFIQEMNIQSETPIWESFLTGCRIHGDIDMAIHAAEN 718 Query: 822 LLELEPDNAYIRGLMLQLYDLRRISKEPLKIKRHGIRKVSSEQIGWSWIEEKNRIHAFVT 643 L LEP+N ++ Q+Y L L+ + + + +G SWIE +N IH F T Sbjct: 719 LFSLEPENTATESIVSQIYALGAKLGRSLEGNKPRRDNLLKKPLGQSWIEVRNLIHTFTT 778 Query: 642 GNLCQFDGRSLHSWI-KSIEVNTMESKYNDTLNIIEEEKEETAGIHSEKLALAFAIIGSP 466 G+ + L+ + K ++ +YN L I EE +EET GIHSEK A+AF +I S Sbjct: 779 GDQSKLCTDVLYPLVEKMSRLDNRSDQYNGELWIEEEGREETCGIHSEKFAMAFGLISSS 838 Query: 465 HASR-TIRIVKNLRMCDHCHRFAKLVSKTHGCEIQISDSKCLHHIKHGICSCGDYW 301 AS+ TIRI+KNLRMC CH AK VSK +GC+I + D++CLHH K+G CSC DYW Sbjct: 839 GASKTTIRILKNLRMCRDCHDTAKYVSKRYGCDILLEDTRCLHHFKNGDCSCKDYW 894