BLASTX nr result
ID: Angelica23_contig00001498
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Angelica23_contig00001498 (3210 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002271180.1| PREDICTED: pentatricopeptide repeat-containi... 1141 0.0 ref|XP_002515260.1| pentatricopeptide repeat-containing protein,... 1098 0.0 ref|XP_004135985.1| PREDICTED: pentatricopeptide repeat-containi... 1056 0.0 ref|XP_004166285.1| PREDICTED: pentatricopeptide repeat-containi... 1055 0.0 ref|NP_180698.1| genomes uncoupled 1 protein [Arabidopsis thalia... 1015 0.0 >ref|XP_002271180.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400, chloroplastic [Vitis vinifera] Length = 867 Score = 1141 bits (2952), Expect = 0.0 Identities = 579/845 (68%), Positives = 685/845 (81%), Gaps = 7/845 (0%) Frame = +1 Query: 340 TTPNHQNPIFTAPKVSINHHNQRFQANPRNVSAP--APPHIQXXXXXXXXXXXXXXXXXX 513 T +H N +++ KVS+ + +PRN + P A P Sbjct: 27 TKNHHNNHHWSSHKVSLTNP----LPSPRNAAKPGAASPATATNRNSNFPSLSPLPPSKS 82 Query: 514 XXXXDFSGRRSTRYVSKMHFGRQKTANASRHTSIAEEALQEAIRLSGDENCIESVLKRFE 693 DFSGRRSTR+VSKMHFGR KTA A+RHTS AEEAL+ AIR + D+ I+SVL FE Sbjct: 83 ELTADFSGRRSTRFVSKMHFGRPKTAAAARHTSTAEEALRHAIRFASDDKGIDSVLLNFE 142 Query: 694 NKFCGTDDYTYLLRELGNRGEYSMAIKCFEFAVKREKRRNDQGKLASSMISILGRLGKVD 873 ++ CG+DDYT+LLRELGNRGE++ AI+CFEFAV+RE+RRN+QGKLAS+MISILGRLG+V+ Sbjct: 143 SRLCGSDDYTFLLRELGNRGEWAKAIRCFEFAVRREQRRNEQGKLASAMISILGRLGQVE 202 Query: 874 LAEKVFEIALNEGYGNTVYAYSALISAYGKSGYCDEAIRLFETMKKSGLKPNLVTYNALI 1053 LA+ VFE ALNEGYGNTVYA+SALISAYG+SGYCDEAI++FETMK SGLKPNLVTYNA+I Sbjct: 203 LAKNVFETALNEGYGNTVYAFSALISAYGRSGYCDEAIKVFETMKSSGLKPNLVTYNAVI 262 Query: 1054 NACGKGGADFKRASEIFDDMWSNGVQPDRITYNSLLAACSGGGLWDTAKKLFSEMIDRGI 1233 +ACGKGG DF RA+EIFD+M NGVQPDRIT+NSLLA C GGLW+ A+ LFSEM+ RGI Sbjct: 263 DACGKGGVDFNRAAEIFDEMLRNGVQPDRITFNSLLAVCGRGGLWEAARNLFSEMLYRGI 322 Query: 1234 DQDIYTYNTLLDVACNGGHLDMAFGIMSEMPAKNIFPNEVTYSTIIRGCAKAGRLDRALS 1413 +QDI+TYNTLLD C GG +D+AF IMSEMP K+I PN VTYST+I G AKAGRLD AL+ Sbjct: 323 EQDIFTYNTLLDAVCKGGQMDLAFQIMSEMPRKHIMPNVVTYSTVIDGYAKAGRLDEALN 382 Query: 1414 LFNEMKYAGIKLDRVSYNTLLAIYASLGRFEEALCVGNEMESMGIKKDVVTYNALLDGFG 1593 LFNEMK+A I LDRVSYNTLL+IYA LGRFEEAL V EMES GIKKD VTYNALL G+G Sbjct: 383 LFNEMKFASIGLDRVSYNTLLSIYAKLGRFEEALNVCKEMESSGIKKDAVTYNALLGGYG 442 Query: 1594 KQGMFDKVKELFQMMRADKLSPNLLTYSTLISVYSKGGLYQEAMEVYKEFKKDGLKADVV 1773 KQG +++VK +F+ M+A+++ PNLLTYSTLI VYSKGGLYQEAMEV++EFKK GLKADVV Sbjct: 443 KQGKYEEVKRVFEEMKAERIFPNLLTYSTLIDVYSKGGLYQEAMEVFREFKKAGLKADVV 502 Query: 1774 FYSKLIDALCKKGLVESSALLLDEMTKKGIQPNVVTYNSIINAFGQSASTEICLD----- 1938 YS LIDALCK GLVES+ LDEMTK+GI+PNVVTYNSII+AFG+S S E +D Sbjct: 503 LYSALIDALCKNGLVESAVSFLDEMTKEGIRPNVVTYNSIIDAFGRSGSAECVIDPPYET 562 Query: 1939 GDSQVESSALMALDNSSNFKGEVVEEDRIVRVFEQLAGGESRQGTKDNKGSNEEILCVLG 2118 S++ SS+L +++++ + E+++I+++F QLA ++ K+N+G +EILC+L Sbjct: 563 NVSKMSSSSLKVVEDATESEVGDKEDNQIIKIFGQLAAEKTCHAKKENRG-RQEILCILA 621 Query: 2119 VFQKMHELEIKPNVVTFSAILNACSHCSSFKEASLLLEELRLFDNHVYGVAHGLLMGNRE 2298 VF KMHEL+IKPNVVTFSAILNACS C+SF++AS+LLEELRLFDN VYGVAHGLLMG + Sbjct: 622 VFHKMHELDIKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNQVYGVAHGLLMGYGD 681 Query: 2299 NVWVQALSLFDDVMQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRQVWENTWST 2478 NVWVQA SLFD+V QMDSSTASAFYNALTDMLWHFGQ+RGAQLVVLEGKRR VWEN WS Sbjct: 682 NVWVQAQSLFDEVKQMDSSTASAFYNALTDMLWHFGQRRGAQLVVLEGKRRHVWENMWSN 741 Query: 2479 SCLDLHLMSSGAARAMVHAWLLNIRSIVFEGHELPKLLSILTGWGKHSKVQGDATLKPAI 2658 SCLDLHLMSSGAARAMVHAWLLNIRSIVFEGHELP+LLSILTGWGKHSKV GD L+ AI Sbjct: 742 SCLDLHLMSSGAARAMVHAWLLNIRSIVFEGHELPQLLSILTGWGKHSKVVGDGALRRAI 801 Query: 2659 EALLVGIGAPFQLAKCNIGRYISTGPVVAAWLRESGTLQVLVLQDDRNQLESTRSYNVPI 2838 EALL G+GAPF++AKCN+GR+ISTG VVAAWLRESGTL+VLVL DDR + R + Sbjct: 802 EALLTGMGAPFRVAKCNLGRFISTGAVVAAWLRESGTLKVLVLHDDRTNPDRARCSQISN 861 Query: 2839 LSALP 2853 L LP Sbjct: 862 LQTLP 866 >ref|XP_002515260.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223545740|gb|EEF47244.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 878 Score = 1098 bits (2840), Expect = 0.0 Identities = 545/780 (69%), Positives = 653/780 (83%), Gaps = 5/780 (0%) Frame = +1 Query: 526 DFSGRRSTRYVSKMHFGRQKTANASRHTSIAEEALQEAIRLSGDENCIESVLKRFENKFC 705 DFSGRRSTR+VSK+HFGR KT N +RHTS+A EALQ+ I+ D+ +E+VL FE++ C Sbjct: 93 DFSGRRSTRFVSKLHFGRPKT-NMNRHTSVALEALQQVIQYGKDDKALENVLLNFESRLC 151 Query: 706 GTDDYTYLLRELGNRGEYSMAIKCFEFAVKREKRRNDQGKLASSMISILGRLGKVDLAEK 885 G DDYT+LLRELGNRG+ + A++CFEFAV+RE +N+QGKLAS+MIS LGRLGKV+LA+ Sbjct: 152 GPDDYTFLLRELGNRGDSAKAVRCFEFAVRRESGKNEQGKLASAMISTLGRLGKVELAKA 211 Query: 886 VFEIALNEGYGNTVYAYSALISAYGKSGYCDEAIRLFETMKKSGLKPNLVTYNALINACG 1065 VF+ AL EGYG TVYA+SALISAYG+SGYC+EAI++F++MK +GL PNLVTYNA+I+ACG Sbjct: 212 VFDTALKEGYGKTVYAFSALISAYGRSGYCNEAIKVFDSMKSNGLMPNLVTYNAVIDACG 271 Query: 1066 KGGADFKRASEIFDDMWSNGVQPDRITYNSLLAACSGGGLWDTAKKLFSEMIDRGIDQDI 1245 KGG +FK+ EIFD M SNGVQPDRIT+NSLLA CS GGLW+ A++LFS M+D+GIDQDI Sbjct: 272 KGGVEFKKVVEIFDGMLSNGVQPDRITFNSLLAVCSRGGLWEAARRLFSAMVDKGIDQDI 331 Query: 1246 YTYNTLLDVACNGGHLDMAFGIMSEMPAKNIFPNEVTYSTIIRGCAKAGRLDRALSLFNE 1425 +TYNTLLD C GG +D+AF IMSEMP KNI PN VTYST+I G AK GRLD AL++FNE Sbjct: 332 FTYNTLLDAVCKGGQMDLAFEIMSEMPTKNILPNVVTYSTMIDGYAKVGRLDDALNMFNE 391 Query: 1426 MKYAGIKLDRVSYNTLLAIYASLGRFEEALCVGNEMESMGIKKDVVTYNALLDGFGKQGM 1605 MK+ G+ LDRVSYNTLL++YA LGRFE+AL V EME+ GI+KDVVTYNALL G+GKQ Sbjct: 392 MKFLGVGLDRVSYNTLLSVYAKLGRFEQALDVCKEMENAGIRKDVVTYNALLAGYGKQYR 451 Query: 1606 FDKVKELFQMMRADKLSPNLLTYSTLISVYSKGGLYQEAMEVYKEFKKDGLKADVVFYSK 1785 +D+V+ +F+ M+ ++SPNLLTYSTLI VYSKGGLY+EAMEV++EFK+ GLKADVV YS Sbjct: 452 YDEVRRVFEEMKRGRVSPNLLTYSTLIDVYSKGGLYKEAMEVFREFKQAGLKADVVLYSA 511 Query: 1786 LIDALCKKGLVESSALLLDEMTKKGIQPNVVTYNSIINAFGQSASTEICLDGDS-----Q 1950 LIDALCK GLVESS LLDEMTK+GI+PNVVTYNSII+AFG+SAS + +D Q Sbjct: 512 LIDALCKNGLVESSVTLLDEMTKEGIRPNVVTYNSIIDAFGRSASAQCVVDDSGETTALQ 571 Query: 1951 VESSALMALDNSSNFKGEVVEEDRIVRVFEQLAGGESRQGTKDNKGSNEEILCVLGVFQK 2130 VES + + + + + E++RI+ +F +LA ++ + K +EILC+LGVFQK Sbjct: 572 VESLSSIVVQEAIESQAADKEDNRIIEIFGKLAAEKACEAKNSGK---QEILCILGVFQK 628 Query: 2131 MHELEIKPNVVTFSAILNACSHCSSFKEASLLLEELRLFDNHVYGVAHGLLMGNRENVWV 2310 MHEL+IKPNVVTFSAILNACS C SF++AS+LLEELRLFDN VYGVAHGLLMG RENVW+ Sbjct: 629 MHELKIKPNVVTFSAILNACSRCDSFEDASMLLEELRLFDNQVYGVAHGLLMGYRENVWL 688 Query: 2311 QALSLFDDVMQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRQVWENTWSTSCLD 2490 QA SLFD+V MDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRQVWEN WS SCLD Sbjct: 689 QAQSLFDEVKLMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRQVWENIWSDSCLD 748 Query: 2491 LHLMSSGAARAMVHAWLLNIRSIVFEGHELPKLLSILTGWGKHSKVQGDATLKPAIEALL 2670 LHLMSSGAARAMVHAWLLNIRSIVFEGHELPKLLSILTGWGKHSKV GD+ L+ A+EALL Sbjct: 749 LHLMSSGAARAMVHAWLLNIRSIVFEGHELPKLLSILTGWGKHSKVVGDSALRRAVEALL 808 Query: 2671 VGIGAPFQLAKCNIGRYISTGPVVAAWLRESGTLQVLVLQDDRNQLESTRSYNVPILSAL 2850 +G+GAPF+LAKCN+GR+ISTG VVAAWL+ESGTL+VLVL DDR E+ +++ L L Sbjct: 809 IGMGAPFRLAKCNLGRFISTGSVVAAWLKESGTLEVLVLHDDRTHPENKDLFSLSPLPGL 868 >ref|XP_004135985.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400, chloroplastic-like [Cucumis sativus] Length = 868 Score = 1056 bits (2732), Expect = 0.0 Identities = 530/780 (67%), Positives = 637/780 (81%), Gaps = 5/780 (0%) Frame = +1 Query: 526 DFSGRRSTRYVSKMHFGRQKTANASRHTSIAEEALQEAIRLSGDENCIESVLKRFENKFC 705 +FSGRRSTR+VSK HFGR K++ +RH++IAEE L + ++ D+ ++++L FE+K C Sbjct: 89 NFSGRRSTRFVSKFHFGRPKSSMTTRHSAIAEEVLHQVLQFGKDDASLDNILLNFESKLC 148 Query: 706 GTDDYTYLLRELGNRGEYSMAIKCFEFAVKREKRRNDQGKLASSMISILGRLGKVDLAEK 885 G++DYT+LLRELGNRGE AI+CF+FA+ RE R+N++GKLAS+MIS LGRLGKV+LA+ Sbjct: 149 GSEDYTFLLRELGNRGECWKAIRCFDFALVREGRKNERGKLASAMISTLGRLGKVELAKG 208 Query: 886 VFEIALNEGYGNTVYAYSALISAYGKSGYCDEAIRLFETMKKSGLKPNLVTYNALINACG 1065 VFE AL+EGYGNTV+A+SALISAYGKSGY DEAI++FE+MK SGLKPNLVTYNA+I+ACG Sbjct: 209 VFETALSEGYGNTVFAFSALISAYGKSGYFDEAIKVFESMKVSGLKPNLVTYNAVIDACG 268 Query: 1066 KGGADFKRASEIFDDMWSNGVQPDRITYNSLLAACSGGGLWDTAKKLFSEMIDRGIDQDI 1245 KGG +FKR EIF++M NGVQPDRITYNSLLA CS GGLW+ A+ LF+EMIDRGIDQD+ Sbjct: 269 KGGVEFKRVVEIFEEMLRNGVQPDRITYNSLLAVCSRGGLWEAARNLFNEMIDRGIDQDV 328 Query: 1246 YTYNTLLDVACNGGHLDMAFGIMSEMPAKNIFPNEVTYSTIIRGCAKAGRLDRALSLFNE 1425 +TYNTLLD C GG +D+A+ IM EMP K I PN VTYST+ G AKAGRL+ AL+L+NE Sbjct: 329 FTYNTLLDAVCKGGQMDLAYEIMLEMPGKKILPNVVTYSTMADGYAKAGRLEDALNLYNE 388 Query: 1426 MKYAGIKLDRVSYNTLLAIYASLGRFEEALCVGNEMESMGIKKDVVTYNALLDGFGKQGM 1605 MK+ GI LDRVSYNTLL+IYA LGRFE+AL V EM S G+KKDVVTYNALLDG+GKQG Sbjct: 389 MKFLGIGLDRVSYNTLLSIYAKLGRFEDALKVCKEMGSSGVKKDVVTYNALLDGYGKQGK 448 Query: 1606 FDKVKELFQMMRADKLSPNLLTYSTLISVYSKGGLYQEAMEVYKEFKKDGLKADVVFYSK 1785 F++V +F+ M+ D++ PNLLTYSTLI VYSKG LY+EAMEV++EFK+ GLKADVV YS+ Sbjct: 449 FNEVTRVFKEMKKDRVFPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQAGLKADVVLYSE 508 Query: 1786 LIDALCKKGLVESSALLLDEMTKKGIQPNVVTYNSIINAFGQSASTEICLDG-----DSQ 1950 LI+ALCK GLV+S+ LLLDEMTK+GI+PNVVTYNSII+AFG+S + E +DG + Q Sbjct: 509 LINALCKNGLVDSAVLLLDEMTKEGIRPNVVTYNSIIDAFGRSTTAEFLVDGVGASNERQ 568 Query: 1951 VESSALMALDNSSNFKGEVVEEDRIVRVFEQLAGGESRQGTKDNKGSNEEILCVLGVFQK 2130 ES + M ++ + E+ +D V F Q E K + EEI +L VF+K Sbjct: 569 SESPSFMLIEGVD--ESEINWDDGHVFKFYQQLVSEKEGPAKKERLGKEEIRSILSVFKK 626 Query: 2131 MHELEIKPNVVTFSAILNACSHCSSFKEASLLLEELRLFDNHVYGVAHGLLMGNRENVWV 2310 MHELEIKPNVVTFSAILNACS C S ++AS+LLEELRLFDN VYGVAHGLLMG ENVW+ Sbjct: 627 MHELEIKPNVVTFSAILNACSRCKSIEDASMLLEELRLFDNQVYGVAHGLLMGFSENVWI 686 Query: 2311 QALSLFDDVMQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRQVWENTWSTSCLD 2490 QA LFD+V QMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRR+VWE WS SCLD Sbjct: 687 QAQYLFDEVKQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLD 746 Query: 2491 LHLMSSGAARAMVHAWLLNIRSIVFEGHELPKLLSILTGWGKHSKVQGDATLKPAIEALL 2670 LHLMSSGAARAMVHAWLL I S+VFEGH+LPKLLSILTGWGKHSKV GD L+ AIEALL Sbjct: 747 LHLMSSGAARAMVHAWLLGIHSVVFEGHQLPKLLSILTGWGKHSKVVGDGALRRAIEALL 806 Query: 2671 VGIGAPFQLAKCNIGRYISTGPVVAAWLRESGTLQVLVLQDDRNQLESTRSYNVPILSAL 2850 +GAPF++AKCNIGRY+STG VVAAWL+ESGTL++LVL DDR +S N+ ++S L Sbjct: 807 TSMGAPFRVAKCNIGRYVSTGSVVAAWLKESGTLKLLVLHDDRTHPDSE---NMDLISKL 863 >ref|XP_004166285.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400, chloroplastic-like [Cucumis sativus] Length = 868 Score = 1055 bits (2728), Expect = 0.0 Identities = 529/780 (67%), Positives = 636/780 (81%), Gaps = 5/780 (0%) Frame = +1 Query: 526 DFSGRRSTRYVSKMHFGRQKTANASRHTSIAEEALQEAIRLSGDENCIESVLKRFENKFC 705 +FSGRRSTR+VSK HFGR K++ +RH++IAEE L + ++ D+ ++++L FE+K C Sbjct: 89 NFSGRRSTRFVSKFHFGRPKSSMTTRHSAIAEEVLHQVLQFGKDDASLDNILLNFESKLC 148 Query: 706 GTDDYTYLLRELGNRGEYSMAIKCFEFAVKREKRRNDQGKLASSMISILGRLGKVDLAEK 885 G++DYT+LLRELGNRGE AI+CF+FA+ RE R+N++GKLAS+MIS LGRLGKV+LA+ Sbjct: 149 GSEDYTFLLRELGNRGECWKAIRCFDFALVREGRKNERGKLASAMISTLGRLGKVELAKG 208 Query: 886 VFEIALNEGYGNTVYAYSALISAYGKSGYCDEAIRLFETMKKSGLKPNLVTYNALINACG 1065 VFE AL+EGYGNTV+A+SALISAYGKSGY DEAI++FE+MK SGLKPNLVTYNA+I+ACG Sbjct: 209 VFETALSEGYGNTVFAFSALISAYGKSGYFDEAIKVFESMKVSGLKPNLVTYNAVIDACG 268 Query: 1066 KGGADFKRASEIFDDMWSNGVQPDRITYNSLLAACSGGGLWDTAKKLFSEMIDRGIDQDI 1245 KGG +FKR EIF++M NGVQPDRITYNSLLA CS GGLW+ A+ LF+EMIDRGIDQD+ Sbjct: 269 KGGVEFKRVVEIFEEMLRNGVQPDRITYNSLLAVCSRGGLWEAARNLFNEMIDRGIDQDV 328 Query: 1246 YTYNTLLDVACNGGHLDMAFGIMSEMPAKNIFPNEVTYSTIIRGCAKAGRLDRALSLFNE 1425 +TYNTLLD C GG +D+A+ IM EMP K I PN VTYST+ G AKAGRL+ AL+L+NE Sbjct: 329 FTYNTLLDAVCKGGQMDLAYEIMLEMPGKKILPNVVTYSTMADGYAKAGRLEDALNLYNE 388 Query: 1426 MKYAGIKLDRVSYNTLLAIYASLGRFEEALCVGNEMESMGIKKDVVTYNALLDGFGKQGM 1605 MK+ GI LDRVSYNTLL+IYA LGRFE+AL V EM S G+KKDVVTYNALLDG+GKQG Sbjct: 389 MKFLGIGLDRVSYNTLLSIYAKLGRFEDALKVCKEMGSSGVKKDVVTYNALLDGYGKQGK 448 Query: 1606 FDKVKELFQMMRADKLSPNLLTYSTLISVYSKGGLYQEAMEVYKEFKKDGLKADVVFYSK 1785 F++V +F+ M+ D++ PNLLTYSTLI VYSKG LY+EAMEV++EFK+ GLKADVV YS+ Sbjct: 449 FNEVTRVFKEMKKDRVFPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQAGLKADVVLYSE 508 Query: 1786 LIDALCKKGLVESSALLLDEMTKKGIQPNVVTYNSIINAFGQSASTEICLDG-----DSQ 1950 LI+ALCK GLV+S+ LLLDEMTK+GI+PNVVTYNSII+AFG+S + E +DG + Q Sbjct: 509 LINALCKNGLVDSAVLLLDEMTKEGIRPNVVTYNSIIDAFGRSTTAEFLVDGVGASNERQ 568 Query: 1951 VESSALMALDNSSNFKGEVVEEDRIVRVFEQLAGGESRQGTKDNKGSNEEILCVLGVFQK 2130 ES M ++ + E+ +D V F Q E K + EEI +L VF+K Sbjct: 569 SESPTFMLIEGVD--ESEINWDDGHVFKFYQQLVSEKEGPAKKERLGKEEIRSILSVFKK 626 Query: 2131 MHELEIKPNVVTFSAILNACSHCSSFKEASLLLEELRLFDNHVYGVAHGLLMGNRENVWV 2310 MHELEIKPNVVTFSAILNACS C S ++AS+LLEELRLFDN VYGVAHGLLMG ENVW+ Sbjct: 627 MHELEIKPNVVTFSAILNACSRCKSIEDASMLLEELRLFDNQVYGVAHGLLMGFSENVWI 686 Query: 2311 QALSLFDDVMQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRQVWENTWSTSCLD 2490 QA LFD+V QMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRR+VWE WS SCLD Sbjct: 687 QAQYLFDEVKQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLD 746 Query: 2491 LHLMSSGAARAMVHAWLLNIRSIVFEGHELPKLLSILTGWGKHSKVQGDATLKPAIEALL 2670 LHLMSSGAARAMVHAWLL I S+VFEGH+LPKLLSILTGWGKHSKV GD L+ AIEALL Sbjct: 747 LHLMSSGAARAMVHAWLLGIHSVVFEGHQLPKLLSILTGWGKHSKVVGDGALRRAIEALL 806 Query: 2671 VGIGAPFQLAKCNIGRYISTGPVVAAWLRESGTLQVLVLQDDRNQLESTRSYNVPILSAL 2850 +GAPF++AKCNIGRY+STG VVAAWL+ESGTL++LVL DDR ++ N+ ++S L Sbjct: 807 TSMGAPFRVAKCNIGRYVSTGSVVAAWLKESGTLKLLVLHDDRTHPDTE---NMDLISKL 863 >ref|NP_180698.1| genomes uncoupled 1 protein [Arabidopsis thaliana] gi|75206083|sp|Q9SIC9.1|PP178_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At2g31400, chloroplastic; Flags: Precursor gi|4589961|gb|AAD26479.1| unknown protein [Arabidopsis thaliana] gi|330253448|gb|AEC08542.1| genomes uncoupled 1 protein [Arabidopsis thaliana] Length = 918 Score = 1015 bits (2624), Expect = 0.0 Identities = 514/826 (62%), Positives = 629/826 (76%), Gaps = 7/826 (0%) Frame = +1 Query: 337 FTTPNHQNPIFTAPKVSINHHNQRFQANPRNVSAPAPPHIQXXXXXXXXXXXXXXXXXXX 516 F +P N T + NH ++PR SAP P + Sbjct: 70 FVSPATNNHRQTRQNPNYNHRPYGASSSPRG-SAPPPSSVATVAPAQLSQPPNFSPLQTP 128 Query: 517 XXX---DFSGRRSTRYVSKMHFGRQKTANASRHTSIAEEALQEAIRLSGDENCIESVLKR 687 DFSGRRSTR+VSKMHFGRQKT A+RH+S AE+ALQ AI SGD+ S++ Sbjct: 129 KSDLSSDFSGRRSTRFVSKMHFGRQKTTMATRHSSAAEDALQNAIDFSGDDEMFHSLMLS 188 Query: 688 FENKFCGTDDYTYLLRELGNRGEYSMAIKCFEFAVKREKRRNDQGKLASSMISILGRLGK 867 FE+K CG+DD TY++RELGNR E A+ +EFAVKRE+R+N+QGKLAS+MIS LGR GK Sbjct: 189 FESKLCGSDDCTYIIRELGNRNECDKAVGFYEFAVKRERRKNEQGKLASAMISTLGRYGK 248 Query: 868 VDLAEKVFEIALNEGYGNTVYAYSALISAYGKSGYCDEAIRLFETMKKSGLKPNLVTYNA 1047 V +A+++FE A GYGNTVYA+SALISAYG+SG +EAI +F +MK+ GL+PNLVTYNA Sbjct: 249 VTIAKRIFETAFAGGYGNTVYAFSALISAYGRSGLHEEAISVFNSMKEYGLRPNLVTYNA 308 Query: 1048 LINACGKGGADFKRASEIFDDMWSNGVQPDRITYNSLLAACSGGGLWDTAKKLFSEMIDR 1227 +I+ACGKGG +FK+ ++ FD+M NGVQPDRIT+NSLLA CS GGLW+ A+ LF EM +R Sbjct: 309 VIDACGKGGMEFKQVAKFFDEMQRNGVQPDRITFNSLLAVCSRGGLWEAARNLFDEMTNR 368 Query: 1228 GIDQDIYTYNTLLDVACNGGHLDMAFGIMSEMPAKNIFPNEVTYSTIIRGCAKAGRLDRA 1407 I+QD+++YNTLLD C GG +D+AF I+++MP K I PN V+YST+I G AKAGR D A Sbjct: 369 RIEQDVFSYNTLLDAICKGGQMDLAFEILAQMPVKRIMPNVVSYSTVIDGFAKAGRFDEA 428 Query: 1408 LSLFNEMKYAGIKLDRVSYNTLLAIYASLGRFEEALCVGNEMESMGIKKDVVTYNALLDG 1587 L+LF EM+Y GI LDRVSYNTLL+IY +GR EEAL + EM S+GIKKDVVTYNALL G Sbjct: 429 LNLFGEMRYLGIALDRVSYNTLLSIYTKVGRSEEALDILREMASVGIKKDVVTYNALLGG 488 Query: 1588 FGKQGMFDKVKELFQMMRADKLSPNLLTYSTLISVYSKGGLYQEAMEVYKEFKKDGLKAD 1767 +GKQG +D+VK++F M+ + + PNLLTYSTLI YSKGGLY+EAME+++EFK GL+AD Sbjct: 489 YGKQGKYDEVKKVFTEMKREHVLPNLLTYSTLIDGYSKGGLYKEAMEIFREFKSAGLRAD 548 Query: 1768 VVFYSKLIDALCKKGLVESSALLLDEMTKKGIQPNVVTYNSIINAFGQSASTEICLD--- 1938 VV YS LIDALCK GLV S+ L+DEMTK+GI PNVVTYNSII+AFG+SA+ + D Sbjct: 549 VVLYSALIDALCKNGLVGSAVSLIDEMTKEGISPNVVTYNSIIDAFGRSATMDRSADYSN 608 Query: 1939 -GDSQVESSALMALDNSSNFKGEVVEEDRIVRVFEQLAGGESRQGTKDNKGSNEEILCVL 2115 G SSAL AL + E +R++++F QL + + TKD + +E+ C+L Sbjct: 609 GGSLPFSSSALSALTET--------EGNRVIQLFGQLTTESNNRTTKDCEEGMQELSCIL 660 Query: 2116 GVFQKMHELEIKPNVVTFSAILNACSHCSSFKEASLLLEELRLFDNHVYGVAHGLLMGNR 2295 VF+KMH+LEIKPNVVTFSAILNACS C+SF++AS+LLEELRLFDN VYGV HGLLMG R Sbjct: 661 EVFRKMHQLEIKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNKVYGVVHGLLMGQR 720 Query: 2296 ENVWVQALSLFDDVMQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRQVWENTWS 2475 ENVW+QA SLFD V +MD STASAFYNALTDMLWHFGQKRGA+LV LEG+ RQVWEN WS Sbjct: 721 ENVWLQAQSLFDKVNEMDGSTASAFYNALTDMLWHFGQKRGAELVALEGRSRQVWENVWS 780 Query: 2476 TSCLDLHLMSSGAARAMVHAWLLNIRSIVFEGHELPKLLSILTGWGKHSKVQGDATLKPA 2655 SCLDLHLMSSGAARAMVHAWLLNIRSIV+EGHELPK+LSILTGWGKHSKV GD L+ A Sbjct: 781 DSCLDLHLMSSGAARAMVHAWLLNIRSIVYEGHELPKVLSILTGWGKHSKVVGDGALRRA 840 Query: 2656 IEALLVGIGAPFQLAKCNIGRYISTGPVVAAWLRESGTLQVLVLQD 2793 +E LL G+ APF L+KCN+GR+ S+G VVA WLRES TL++L+L D Sbjct: 841 VEVLLRGMDAPFHLSKCNMGRFTSSGSVVATWLRESATLKLLILHD 886