BLASTX nr result
ID: Paeonia23_contig00003016
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Paeonia23_contig00003016
         (2996 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_002284744.1| PREDICTED: pentatricopeptide repeat-containi...  1170   0.0
ref|XP_004292932.1| PREDICTED: pentatricopeptide repeat-containi...  1093   0.0
emb|CAN63659.1| hypothetical protein VITISV_008415 [Vitis vinifera]  1078   0.0
ref|XP_004250454.1| PREDICTED: pentatricopeptide repeat-containi...  1029   0.0
ref|XP_007227203.1| hypothetical protein PRUPE_ppa019251mg [Prun...  1027   0.0
ref|XP_007017888.1| Pentatricopeptide repeat (PPR) superfamily p...  1009   0.0
gb|EXB68664.1| hypothetical protein L484_024678 [Morus notabilis]     979   0.0
ref|XP_002301973.2| pentatricopeptide repeat-containing family p...   979   0.0
gb|EYU24286.1| hypothetical protein MIMGU_mgv1a025107mg [Mimulus...   979   0.0
ref|XP_003549191.1| PREDICTED: pentatricopeptide repeat-containi...   978   0.0
gb|EYU18955.1| hypothetical protein MIMGU_mgv1a022111mg [Mimulus...   969   0.0
ref|XP_004515286.1| PREDICTED: pentatricopeptide repeat-containi...   944   0.0
ref|XP_002890375.1| pentatricopeptide repeat-containing protein ...   941   0.0
ref|XP_006306841.1| hypothetical protein CARUB_v10008385mg [Caps...   939   0.0
ref|XP_006416418.1| hypothetical protein EUTSA_v10009574mg [Eutr...   939   0.0
ref|NP_173449.1| pentatricopeptide repeat-containing protein [Ar...   925   0.0
gb|EPS74339.1| hypothetical protein M569_00411 [Genlisea aurea]       911   0.0
ref|XP_007152607.1| hypothetical protein PHAVU_004G144300g [Phas...   862   0.0
ref|XP_006587447.1| PREDICTED: pentatricopeptide repeat-containi...
855 0.0 gb|AAF79892.1|AC022472_1 Contains similarity to an unknown prote... 831 0.0 >ref|XP_002284744.1| PREDICTED: pentatricopeptide repeat-containing protein At1g20230 [Vitis vinifera] Length = 758 Score = 1170 bits (3028), Expect = 0.0 Identities = 558/757 (73%), Positives = 655/757 (86%) Frame = +3 Query: 90 AMTRQALHLLNSSHHITLNTFISTSASLPQARQAHGHILKTGLSNETHFVTKLLSLYANH 269 +++ QAL LL+S H N ST+ASL Q RQAH HILKTGL N+TH TKLLS YAN+ Sbjct: 2 SLSAQALALLDSVQHTIFNCLNSTTASLSQTRQAHAHILKTGLFNDTHLATKLLSHYANN 61 Query: 270 LCFDDASHVLDSILQPDIFSFSTLIHASTKHHRYNHALRIFSRMLSHGLAPDCHVVPSAI 449 +CF DA+ VLD + +P++FSFSTLI+A +K H+++HAL FS+ML+ GL PD V+PSA+ Sbjct: 62 MCFADATLVLDLVPEPNVFSFSTLIYAFSKFHQFHHALSTFSQMLTRGLMPDNRVLPSAV 121 Query: 450 KACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQPDVV 629 KACAGLSAL+ ++VHG+ SVSG SD FVQ+SLVH+Y+KC +IR+AH+VFD M +PDVV Sbjct: 122 KACAGLSALKPARQVHGIASVSGFDSDSFVQSSLVHMYIKCNQIRDAHRVFDRMFEPDVV 181 Query: 630 TWSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMFKKM 809 +WSALVA +AR G V EA RLF MGDSG++ N +SWNGMIAGFNHS LY+EAVLMF M Sbjct: 182 SWSALVAAYARQGCVDEAKRLFSEMGDSGVQPNLISWNGMIAGFNHSGLYSEAVLMFLDM 241 Query: 810 HSQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGKCACA 989 H +GF+PD T+ SSVLPAVGDLEDL +GI IH Y+IK GL SDKCV SALIDMYGKC+C Sbjct: 242 HLRGFEPDGTTISSVLPAVGDLEDLVMGILIHGYVIKQGLVSDKCVSSALIDMYGKCSCT 301 Query: 990 KEMSQVFDEMDKIDIGACNALVAGLSRNGFAENALAVFKKIEGQGVELNVVSWTSMIACC 1169 EMSQVFD+MD +D+G+CNA + GLSRNG E++L +F++++ QG+ELNVVSWTSMIACC Sbjct: 302 SEMSQVFDQMDHMDVGSCNAFIFGLSRNGQVESSLRLFRQLKDQGMELNVVSWTSMIACC 361 Query: 1170 SQNGKDIEALELFRDMQIAGVMPNSVTIPCLLPACGNMAALMHGKAAHCFSLRRGFSNDV 1349 SQNG+DIEALELFR+MQIAGV PNSVTIPCLLPACGN+AALMHGKAAHCFSLRRG S DV Sbjct: 362 SQNGRDIEALELFREMQIAGVKPNSVTIPCLLPACGNIAALMHGKAAHCFSLRRGISTDV 421 Query: 1350 YVGSALIDMYAKCGQIRSSQCCFDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRS 1529 YVGSALIDMYAKCG+I++S+ CFDG+P +NLVCWNA+I GYAMHGK KEA+EIF +MQRS Sbjct: 422 YVGSALIDMYAKCGRIQASRICFDGIPTKNLVCWNAVIAGYAMHGKAKEAMEIFDLMQRS 481 Query: 
1530 GQKPDLITFTSVLSACSQSGLTEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKE 1709 GQKPD+I+FT VLSACSQSGLTEEG Y+FNSMS ++GI+ARVEHYACMV LL RAGKL++ Sbjct: 482 GQKPDIISFTCVLSACSQSGLTEEGSYYFNSMSSKYGIEARVEHYACMVTLLSRAGKLEQ 541 Query: 1710 AYSMIQKMPFEPDACVWGALLSSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYAS 1889 AY+MI++MP PDACVWGALLSSCRVHNN+ LGE+AA++LFELEP NPGNYILLSNIYAS Sbjct: 542 AYAMIRRMPVNPDACVWGALLSSCRVHNNVSLGEVAAEKLFELEPSNPGNYILLSNIYAS 601 Query: 1890 NGMWNDVDIVRGMMKSMGLRKNPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLE 2069 GMWN+V+ VR MMK+ GLRKNPGCSWIE+KNKVHMLLAGDKSHP M QIIEKL+KLS+E Sbjct: 602 KGMWNEVNRVRDMMKNKGLRKNPGCSWIEVKNKVHMLLAGDKSHPQMTQIIEKLDKLSME 661 Query: 2070 MKKSGFFPVTNFVLQDVEEQDKEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGDC 2249 MKK G+FP NFVLQDVEEQDKEQ LCGHSEKLAVVFGLLNT G PL+VIKNLRICGDC Sbjct: 662 MKKLGYFPEINFVLQDVEEQDKEQILCGHSEKLAVVFGLLNTPPGYPLQVIKNLRICGDC 721 Query: 2250 HAFIKFLSSYEGREIFVRDTNLFHHFKDGACSCGDFW 2360 H IKF+SS+E REIFVRDTN FHHFK+GACSCGD+W Sbjct: 722 HVVIKFISSFERREIFVRDTNRFHHFKEGACSCGDYW 758 >ref|XP_004292932.1| PREDICTED: pentatricopeptide repeat-containing protein At1g20230-like [Fragaria vesca subsp. 
vesca] Length = 755 Score = 1093 bits (2826), Expect = 0.0 Identities = 533/756 (70%), Positives = 620/756 (82%) Frame = +3 Query: 93 MTRQALHLLNSSHHITLNTFISTSASLPQARQAHGHILKTGLSNETHFVTKLLSLYANHL 272 MTRQ L+L + H L+ F++ S+SL QA QAH ILKTGLSN T+ TKLLSLYAN L Sbjct: 1 MTRQVLNLSDHLLHKLLS-FLNPSSSLSQAHQAHAQILKTGLSNHTNLTTKLLSLYANSL 59 Query: 273 CFDDASHVLDSILQPDIFSFSTLIHASTKHHRYNHALRIFSRMLSHGLAPDCHVVPSAIK 452 CF +A VL SI P++FSFSTLIHA K + + +AL +FS+MLS GLAPD + PS +K Sbjct: 60 CFVEAKLVLHSIPHPNLFSFSTLIHAFAKLNSFGNALSLFSQMLSRGLAPDSFLFPSVVK 119 Query: 453 ACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQPDVVT 632 ACAGL + ++ ++VH + SG A D FVQ+SLVH+Y+KC +I +A KVFD +P+ DV+ Sbjct: 120 ACAGLQSSQSARQVHAISFSSGFALDSFVQSSLVHMYIKCDRIGDARKVFDRVPERDVII 179 Query: 633 WSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMFKKMH 812 +SAL++G++R G V EA RL M G N V WNGMIAGF+ S+LYA V +F+KMH Sbjct: 180 YSALISGYSRRGCVDEAMRLLGEMRGLGFVPNVVLWNGMIAGFSQSKLYASTVGVFQKMH 239 Query: 813 SQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGKCACAK 992 SQGF+PD +S SSVLPAVG+LEDL +G+QIH +IK GL SDKCVVSAL+DMYGKCAC Sbjct: 240 SQGFEPDGSSISSVLPAVGELEDLDIGVQIHGQVIKRGLKSDKCVVSALVDMYGKCACTL 299 Query: 993 EMSQVFDEMDKIDIGACNALVAGLSRNGFAENALAVFKKIEGQGVELNVVSWTSMIACCS 1172 EMS+V EMD++D+GACNALV GL+RNG +NAL VF + +GQGVELN VSWTS+IA CS Sbjct: 300 EMSRVVGEMDELDVGACNALVTGLARNGLVDNALEVFMQFKGQGVELNTVSWTSIIASCS 359 Query: 1173 QNGKDIEALELFRDMQIAGVMPNSVTIPCLLPACGNMAALMHGKAAHCFSLRRGFSNDVY 1352 QNGKD+EALELFR+MQI GV PNS+TI CLLPACGN+AAL HGKAAHCF+ RRG +DVY Sbjct: 360 QNGKDMEALELFREMQIEGVEPNSMTISCLLPACGNIAALTHGKAAHCFAFRRGMLSDVY 419 Query: 1353 VGSALIDMYAKCGQIRSSQCCFDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRSG 1532 VGSALIDMYAKCG+I+ S+ CFD MP RNLVCWNA++ GYAMHGK KE +EIFHMMQRSG Sbjct: 420 VGSALIDMYAKCGKIQLSRLCFDKMPTRNLVCWNAVMSGYAMHGKAKETMEIFHMMQRSG 479 Query: 1533 QKPDLITFTSVLSACSQSGLTEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKEA 1712 KPD+I+FT VLSACSQ+GLTEEGWY+FNSMS EHGI+AR+EHYACMV LLGRAGKL EA Sbjct: 480 
LKPDIISFTCVLSACSQNGLTEEGWYYFNSMSKEHGIEARIEHYACMVTLLGRAGKLDEA 539 Query: 1713 YSMIQKMPFEPDACVWGALLSSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYASN 1892 YSMI+KMPFEPDACVWGALLSSCRVHNN+ LGE AK+LF LEP NPGNYILLSNIYAS Sbjct: 540 YSMIKKMPFEPDACVWGALLSSCRVHNNVTLGESTAKKLFNLEPGNPGNYILLSNIYASK 599 Query: 1893 GMWNDVDIVRGMMKSMGLRKNPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLEM 2072 GMW +VD VR MKS+GLRKNPGCSWIE KN VHMLLAGDK+HP M +I EKLN LS EM Sbjct: 600 GMWTEVDRVRDTMKSLGLRKNPGCSWIEFKNNVHMLLAGDKTHPQMNKITEKLNTLSSEM 659 Query: 2073 KKSGFFPVTNFVLQDVEEQDKEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGDCH 2252 KKSG+ P T+FVLQDVEEQ+KEQ LCGHSEKLAVV GLLNT GS LRVIKNLRICGDCH Sbjct: 660 KKSGYLPSTHFVLQDVEEQEKEQILCGHSEKLAVVLGLLNTPPGSSLRVIKNLRICGDCH 719 Query: 2253 AFIKFLSSYEGREIFVRDTNLFHHFKDGACSCGDFW 2360 + IKF+SS EGREI VRDTN FHHFKDG CSCGD+W Sbjct: 720 SVIKFISSLEGREISVRDTNRFHHFKDGVCSCGDYW 755 >emb|CAN63659.1| hypothetical protein VITISV_008415 [Vitis vinifera] Length = 760 Score = 1078 bits (2787), Expect = 0.0 Identities = 517/709 (72%), Positives = 611/709 (86%) Frame = +3 Query: 90 AMTRQALHLLNSSHHITLNTFISTSASLPQARQAHGHILKTGLSNETHFVTKLLSLYANH 269 +++ QAL LL+S H LN ST+ASL Q RQAH HILKTGL N+TH TKLLS YAN+ Sbjct: 2 SLSAQALALLDSVQHTILNCLNSTTASLSQTRQAHAHILKTGLFNDTHLATKLLSHYANN 61 Query: 270 LCFDDASHVLDSILQPDIFSFSTLIHASTKHHRYNHALRIFSRMLSHGLAPDCHVVPSAI 449 +CF DA+ VLD + +P++FSFSTLI+A +K H+++HAL FS+ML+ GL PD V+PSA+ Sbjct: 62 MCFADATLVLDLVPEPNVFSFSTLIYAFSKFHQFHHALSTFSQMLTRGLMPDNRVLPSAV 121 Query: 450 KACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQPDVV 629 KACAGLSAL+ ++VHG+ SVSG SD FVQ+SLVH+Y+KC +IR+AH+VFD M +PDVV Sbjct: 122 KACAGLSALKPARQVHGIASVSGFDSDSFVQSSLVHMYIKCNQIRDAHRVFDRMFEPDVV 181 Query: 630 TWSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMFKKM 809 +WSALVA +AR G V EA RLF MGDSG++ N +SWNGMIAGFNHS LY+EAVLMF M Sbjct: 182 SWSALVAAYARQGCVDEAKRLFSEMGDSGVQPNLISWNGMIAGFNHSGLYSEAVLMFLDM 241 Query: 810 HSQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGKCACA 989 H +GF+PD T+ SSVLPAVGDLEDL +GI IH Y+IK GL SDKCV 
SALIDMYGKC+C Sbjct: 242 HLRGFEPDGTTISSVLPAVGDLEDLVMGILIHGYVIKQGLVSDKCVSSALIDMYGKCSCT 301 Query: 990 KEMSQVFDEMDKIDIGACNALVAGLSRNGFAENALAVFKKIEGQGVELNVVSWTSMIACC 1169 EMSQVFD+MD +D+G+CNA + GLSRNG E++L +F++++ QG+ELNVVSWTSMIACC Sbjct: 302 SEMSQVFDQMDHMDVGSCNAFIFGLSRNGQVESSLRLFRQLKDQGMELNVVSWTSMIACC 361 Query: 1170 SQNGKDIEALELFRDMQIAGVMPNSVTIPCLLPACGNMAALMHGKAAHCFSLRRGFSNDV 1349 SQNG+D+EALELFR+MQIAGV PNSVTIPCLLPACGN+AALMHGKAAHCFSLRRG S DV Sbjct: 362 SQNGRDMEALELFREMQIAGVKPNSVTIPCLLPACGNIAALMHGKAAHCFSLRRGISTDV 421 Query: 1350 YVGSALIDMYAKCGQIRSSQCCFDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRS 1529 YVGSALIDMYAKCG+I++S+ CFDG+P +NLVCWNA+I GYAMHGK KEA+EIF +MQRS Sbjct: 422 YVGSALIDMYAKCGRIQASRICFDGIPTKNLVCWNAVIAGYAMHGKAKEAMEIFDLMQRS 481 Query: 1530 GQKPDLITFTSVLSACSQSGLTEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKE 1709 GQKPD+I+FT VLSACSQSGLTEEG Y+FNSMS ++GI+ARVEHYACMV LL RAGKL++ Sbjct: 482 GQKPDIISFTCVLSACSQSGLTEEGSYYFNSMSSKYGIEARVEHYACMVTLLSRAGKLEQ 541 Query: 1710 AYSMIQKMPFEPDACVWGALLSSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYAS 1889 AY+MI++MP PDACVWGALLSSCRVHNN+ LGE+AA++LFELEP NPGNYILLSNIYAS Sbjct: 542 AYAMIRRMPVNPDACVWGALLSSCRVHNNVSLGEVAAEKLFELEPSNPGNYILLSNIYAS 601 Query: 1890 NGMWNDVDIVRGMMKSMGLRKNPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLE 2069 GMWN+V+ VR MMK+ GLRKNPGCSWIE+KNKVHMLLAGDKSHP M QIIE L+KLS+E Sbjct: 602 KGMWNEVNRVRDMMKNKGLRKNPGCSWIEVKNKVHMLLAGDKSHPQMTQIIENLDKLSME 661 Query: 2070 MKKSGFFPVTNFVLQDVEEQDKEQFLCGHSEKLAVVFGLLNTHQGSPLR 2216 MKK G+FP NFVLQDVEEQDKEQ LCGHSEKLAVVFGLLNT G PL+ Sbjct: 662 MKKLGYFPEINFVLQDVEEQDKEQILCGHSEKLAVVFGLLNTPPGYPLQ 710 >ref|XP_004250454.1| PREDICTED: pentatricopeptide repeat-containing protein At1g20230-like [Solanum lycopersicum] Length = 828 Score = 1029 bits (2661), Expect = 0.0 Identities = 502/764 (65%), Positives = 605/764 (79%) Frame = +3 Query: 69 KLLPQSQAMTRQALHLLNSSHHITLNTFISTSASLPQARQAHGHILKTGLSNETHFVTKL 248 +LL A Q+L +L+S T+ + I+ S+SL Q +Q H HILKTG S++THF K+ Sbjct: 65 ELLNSMNARQAQSLRVLDSLMPNTILSLIARSSSLSQTQQVHAHILKTGHSSDTHFTNKV 124 Query: 249 
LSLYANHLCFDDASHVLDSILQPDIFSFSTLIHASTKHHRYNHALRIFSRMLSHGLAPDC 428 LSLYAN CF +A +L S+ P+IFSF +LIHAS+K + +++ L +FSR+LS + PD Sbjct: 125 LSLYANFNCFANAESLLHSLPNPNIFSFKSLIHASSKSNLFSYTLVLFSRLLSKCILPDV 184 Query: 429 HVVPSAIKACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDT 608 HV+PSAIKACAGLSA GK+VHG +GLA D FV+ SLVH+YVKC +++ A K+FD Sbjct: 185 HVLPSAIKACAGLSASEVGKQVHGYGLTTGLALDSFVEASLVHMYVKCDQLKCARKMFDK 244 Query: 609 MPQPDVVTWSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEA 788 M +PDVV+WSAL G+A+ G V A +FD G G+E N VSWNGMIAGFN S Y EA Sbjct: 245 MREPDVVSWSALSGGYAKKGDVFNAKMVFDEGGKLGIEPNLVSWNGMIAGFNQSGCYLEA 304 Query: 789 VLMFKKMHSQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALIDM 968 VLMF++M+S GF+ D TS SSVLPAV DLEDL +G+Q+H+++IK G SD C++SAL+DM Sbjct: 305 VLMFQRMNSDGFRSDGTSISSVLPAVSDLEDLKMGVQVHSHVIKTGFESDNCIISALVDM 364 Query: 969 YGKCACAKEMSQVFDEMDKIDIGACNALVAGLSRNGFAENALAVFKKIEGQGVELNVVSW 1148 YGKC C EMS+VF+ ++ID+G NALVAGLSRNG + A VFKK + + ELNVVSW Sbjct: 365 YGKCRCTSEMSRVFEGAEEIDLGGFNALVAGLSRNGLVDEAFKVFKKFKLKVKELNVVSW 424 Query: 1149 TSMIACCSQNGKDIEALELFRDMQIAGVMPNSVTIPCLLPACGNMAALMHGKAAHCFSLR 1328 TSMI+ CSQ+GKD+EALE+FR+MQ+A V PNSVTI CLLPACGN+AAL+HGKA HCFSLR Sbjct: 425 TSMISSCSQHGKDLEALEIFREMQLAKVRPNSVTISCLLPACGNIAALVHGKATHCFSLR 484 Query: 1329 RGFSNDVYVGSALIDMYAKCGQIRSSQCCFDGMPARNLVCWNAIIGGYAMHGKVKEALEI 1508 FS+DVYV SALIDMYA CG+I+ ++ FD MP RNLVCWNA+ GYAMHGK KEA+EI Sbjct: 485 NWFSDDVYVSSALIDMYANCGRIQLARVIFDRMPVRNLVCWNAMTSGYAMHGKAKEAIEI 544 Query: 1509 FHMMQRSGQKPDLITFTSVLSACSQSGLTEEGWYHFNSMSVEHGIKARVEHYACMVNLLG 1688 F M+RSGQKPD I+FTSVLSACSQ+GLTE+G ++F+ MS HG++ARVEHYACMV+LLG Sbjct: 545 FDSMRRSGQKPDFISFTSVLSACSQAGLTEQGQHYFDCMSRIHGLEARVEHYACMVSLLG 604 Query: 1689 RAGKLKEAYSMIQKMPFEPDACVWGALLSSCRVHNNLRLGEIAAKELFELEPKNPGNYIL 1868 R GKLKEAY MI MP EPDACVWGALLSSCR H N+ LGEIAA +LFELEPKNPGNYIL Sbjct: 605 RTGKLKEAYDMISTMPIEPDACVWGALLSSCRTHRNMSLGEIAADKLFELEPKNPGNYIL 664 Query: 1869 LSNIYASNGMWNDVDIVRGMMKSMGLRKNPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEK 2048 LSNIYASN WN+VD VR MMK +GL 
KNPGCSWIEIKNKVHMLLAGD HP M QI+EK Sbjct: 665 LSNIYASNNRWNEVDKVRDMMKHVGLSKNPGCSWIEIKNKVHMLLAGDDLHPQMPQIMEK 724 Query: 2049 LNKLSLEMKKSGFFPVTNFVLQDVEEQDKEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKN 2228 L KLS++MK +G T VLQDVEEQDKE LCGHSEKLAVV G+LNT+ G+ LRVIKN Sbjct: 725 LRKLSMDMKNTGVSHDTELVLQDVEEQDKELILCGHSEKLAVVLGILNTNPGTSLRVIKN 784 Query: 2229 LRICGDCHAFIKFLSSYEGREIFVRDTNLFHHFKDGACSCGDFW 2360 LRICGDCH FIKF+SS+EGREI+VRD N +HHF +G CSCGD+W Sbjct: 785 LRICGDCHTFIKFISSFEGREIYVRDANRYHHFNEGICSCGDYW 828 >ref|XP_007227203.1| hypothetical protein PRUPE_ppa019251mg [Prunus persica] gi|462424139|gb|EMJ28402.1| hypothetical protein PRUPE_ppa019251mg [Prunus persica] Length = 654 Score = 1027 bits (2655), Expect = 0.0 Identities = 488/654 (74%), Positives = 562/654 (85%) Frame = +3 Query: 399 MLSHGLAPDCHVVPSAIKACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGK 578 MLS GL PD + PS +KACAGL A + GK+VH + SVSGLASD FVQ+SLVH+Y+KC + Sbjct: 1 MLSRGLVPDSFLFPSVVKACAGLPASKAGKQVHAIASVSGLASDSFVQSSLVHMYIKCDQ 60 Query: 579 IREAHKVFDTMPQPDVVTWSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAG 758 IR+A K+FD +PQ DV+ SAL++G++R G V EA +L M LE N V WNGMIAG Sbjct: 61 IRDARKLFDRVPQRDVIICSALISGYSRRGCVDEAMQLLSEMRGMCLEPNVVLWNGMIAG 120 Query: 759 FNHSRLYAEAVLMFKKMHSQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSD 938 FN S+LYA+ V + +KMHS+GFQPD +S SS LPAVG LEDL +GIQIH Y++K GLGSD Sbjct: 121 FNQSKLYADTVAVLQKMHSEGFQPDGSSISSALPAVGHLEDLGMGIQIHGYVVKQGLGSD 180 Query: 939 KCVVSALIDMYGKCACAKEMSQVFDEMDKIDIGACNALVAGLSRNGFAENALAVFKKIEG 1118 KCVVSALIDMYGKCAC+ E SQVF EMD++D+GACNALV GLSRNG +NAL VF++ + Sbjct: 181 KCVVSALIDMYGKCACSFETSQVFHEMDQMDVGACNALVTGLSRNGLVDNALKVFRQFKD 240 Query: 1119 QGVELNVVSWTSMIACCSQNGKDIEALELFRDMQIAGVMPNSVTIPCLLPACGNMAALMH 1298 QG+ELN+VSWTS+IA CSQNGKD+EALELFR+MQ+ GV PNSVTIPCLLPACGN+AALMH Sbjct: 241 QGMELNIVSWTSIIASCSQNGKDMEALELFREMQVEGVEPNSVTIPCLLPACGNIAALMH 300 Query: 1299 GKAAHCFSLRRGFSNDVYVGSALIDMYAKCGQIRSSQCCFDGMPARNLVCWNAIIGGYAM 1478 GKAAHCFSLRRG SNDVYVGS+LIDMYAKCG+IR S+ CFD MP RNLVCWNA++GGYAM Sbjct: 301 
GKAAHCFSLRRGISNDVYVGSSLIDMYAKCGKIRLSRLCFDEMPTRNLVCWNAVMGGYAM 360 Query: 1479 HGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGLTEEGWYHFNSMSVEHGIKARVE 1658 HGK E +E+F +MQRSGQKPD I+FT VLSACSQ GLT+EGWY+FNSMS EHG++ARVE Sbjct: 361 HGKANETMEVFRLMQRSGQKPDFISFTCVLSACSQKGLTDEGWYYFNSMSKEHGLEARVE 420 Query: 1659 HYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGALLSSCRVHNNLRLGEIAAKELFEL 1838 HYACMV LL R+GKL+EAYSMI++MPFEPDACVWGALLSSCRVH+N+ LG+ AK+LF L Sbjct: 421 HYACMVTLLSRSGKLEEAYSMIKQMPFEPDACVWGALLSSCRVHSNVTLGKYVAKKLFNL 480 Query: 1839 EPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSMGLRKNPGCSWIEIKNKVHMLLAGDKS 2018 EPKNPGNYILLSNIYAS GMW++VD VR MKS+GLRKNPGCSWIE+KNKVHMLLAGDK+ Sbjct: 481 EPKNPGNYILLSNIYASKGMWSEVDKVRDKMKSLGLRKNPGCSWIEVKNKVHMLLAGDKA 540 Query: 2019 HPHMAQIIEKLNKLSLEMKKSGFFPVTNFVLQDVEEQDKEQFLCGHSEKLAVVFGLLNTH 2198 HP M QIIEKLNKLS EMKK G+FP T+FVLQDVEEQDKEQ LCGHSEKLAVV GLLN+ Sbjct: 541 HPQMNQIIEKLNKLSSEMKKLGYFPNTHFVLQDVEEQDKEQILCGHSEKLAVVLGLLNSP 600 Query: 2199 QGSPLRVIKNLRICGDCHAFIKFLSSYEGREIFVRDTNLFHHFKDGACSCGDFW 2360 GS LRVIKNLRICGDCHA IKF+SS+EGREI VRDTNLFHHFKDG CSC D+W Sbjct: 601 PGSSLRVIKNLRICGDCHAVIKFISSFEGREISVRDTNLFHHFKDGVCSCEDYW 654 Score = 182 bits (461), Expect = 1e-42 Identities = 105/328 (32%), Positives = 173/328 (52%), Gaps = 4/328 (1%) Frame = +3 Query: 246 LLSLYANHLCFDDASHVLDSI----LQPDIFSFSTLIHASTKHHRYNHALRIFSRMLSHG 413 L+S Y+ C D+A +L + L+P++ ++ +I + Y + + +M S G Sbjct: 82 LISGYSRRGCVDEAMQLLSEMRGMCLEPNVVLWNGMIAGFNQSKLYADTVAVLQKMHSEG 141 Query: 414 LAPDCHVVPSAIKACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAH 593 PD + SA+ A L L G ++HG V GL SD V ++L+ +Y KC E Sbjct: 142 FQPDGSSISSALPAVGHLEDLGMGIQIHGYVVKQGLGSDKCVVSALIDMYGKCACSFETS 201 Query: 594 KVFDTMPQPDVVTWSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSR 773 +VF M Q DV +ALV G +R+G V A ++F D G+ELN VSW +IA + + Sbjct: 202 QVFHEMDQMDVGACNALVTGLSRNGLVDNALKVFRQFKDQGMELNIVSWTSIIASCSQNG 261 Query: 774 LYAEAVLMFKKMHSQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVS 953 EA+ +F++M +G +P+ + +LPA G++ L G H + ++ G+ +D V S Sbjct: 262 
KDMEALELFREMQVEGVEPNSVTIPCLLPACGNIAALMHGKAAHCFSLRRGISNDVYVGS 321 Query: 954 ALIDMYGKCACAKEMSQVFDEMDKIDIGACNALVAGLSRNGFAENALAVFKKIEGQGVEL 1133 +LIDMY KC + FDEM ++ NA++ G + +G A + VF+ ++ G + Sbjct: 322 SLIDMYAKCGKIRLSRLCFDEMPTRNLVCWNAVMGGYAMHGKANETMEVFRLMQRSGQKP 381 Query: 1134 NVVSWTSMIACCSQNGKDIEALELFRDM 1217 + +S+T +++ CSQ G E F M Sbjct: 382 DFISFTCVLSACSQKGLTDEGWYYFNSM 409 Score = 104 bits (260), Expect = 2e-19 Identities = 64/246 (26%), Positives = 114/246 (46%), Gaps = 35/246 (14%) Frame = +3 Query: 186 QAHGHILKTGLSNETHFVTKLLSLYANHLCFDDASHVLDSILQPD--------------- 320 Q HG+++K GL ++ V+ L+ +Y C + S V + Q D Sbjct: 167 QIHGYVVKQGLGSDKCVVSALIDMYGKCACSFETSQVFHEMDQMDVGACNALVTGLSRNG 226 Query: 321 --------------------IFSFSTLIHASTKHHRYNHALRIFSRMLSHGLAPDCHVVP 440 I S++++I + +++ + AL +F M G+ P+ +P Sbjct: 227 LVDNALKVFRQFKDQGMELNIVSWTSIIASCSQNGKDMEALELFREMQVEGVEPNSVTIP 286 Query: 441 SAIKACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQP 620 + AC ++AL GK H G+++D +V +SL+ +Y KCGKIR + FD MP Sbjct: 287 CLLPACGNIAALMHGKAAHCFSLRRGISNDVYVGSSLIDMYAKCGKIRLSRLCFDEMPTR 346 Query: 621 DVVTWSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMF 800 ++V W+A++ G+A HG E +F M SG + + +S+ +++ + L E F Sbjct: 347 NLVCWNAVMGGYAMHGKANETMEVFRLMQRSGQKPDFISFTCVLSACSQKGLTDEGWYYF 406 Query: 801 KKMHSQ 818 M + Sbjct: 407 NSMSKE 412 >ref|XP_007017888.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao] gi|508723216|gb|EOY15113.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao] Length = 758 Score = 1009 bits (2610), Expect = 0.0 Identities = 491/758 (64%), Positives = 603/758 (79%), Gaps = 2/758 (0%) Frame = +3 Query: 93 MTRQALHLLNSSHHITLNTFISTSASLPQARQAHGHILKTGLSNETHFVTKLLSLYANHL 272 MT QAL + L S ASL Q QAH +ILK+G+ +T TKL+S YAN Sbjct: 1 MTVQALPFFEILNRSILPCLNSAVASLSQTSQAHAYILKSGVCIDTLISTKLISQYANRH 60 Query: 273 CFDDASHVLDSILQPDIFSFSTLIHASTKHHRYNHALRIFSRMLSHGLAPDCHVVPSAIK 452 CF +A VL+SI +P + SFS LI+A K++ + +L +FSRMLS G+ PD V+P+ +K Sbjct: 61 
CFAEAELVLNSISEPLVSSFSALIYALNKYNLFTQSLYVFSRMLSRGILPDNRVLPNVVK 120 Query: 453 ACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQPDVVT 632 AC LSA + GKEVHG+V G SD VQ SLVHLY+K +I++A VF+ +P+ DVVT Sbjct: 121 ACGKLSAFKLGKEVHGIVVKYGFDSDSVVQASLVHLYLKGDRIQDAKNVFERLPERDVVT 180 Query: 633 WSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMFKKMH 812 AL++ +AR G V EA +F GM G+ N VSWNGMI GFN S Y EAV+MFK+MH Sbjct: 181 CGALLSAYARKGCVNEAKEIFYGMQSFGVGPNLVSWNGMITGFNQSEQYNEAVVMFKEMH 240 Query: 813 SQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGKCACAK 992 S+GF PD+ + SSV AVGDLE L +GIQ+ Y+IK GL K V+SAL+DM+GKCACA Sbjct: 241 SEGFLPDDITISSVFSAVGDLERLNIGIQVLCYVIKLGLLHCKFVISALMDMFGKCACAG 300 Query: 993 EMSQVFDEMDK--IDIGACNALVAGLSRNGFAENALAVFKKIEGQGVELNVVSWTSMIAC 1166 E+ + F+E+D+ +D GA NAL+ GLSRNG + AL F++ QG ELNVVSWTS+IA Sbjct: 301 ELMKAFEEVDEEIMDTGALNALITGLSRNGLVDVALETFQRFRVQGRELNVVSWTSIIAG 360 Query: 1167 CSQNGKDIEALELFRDMQIAGVMPNSVTIPCLLPACGNMAALMHGKAAHCFSLRRGFSND 1346 CSQNGKDIEALELFR+MQ A + PNSVTIPCLLPACGN+AAL+HGKAAH F++R G +ND Sbjct: 361 CSQNGKDIEALELFREMQSARLKPNSVTIPCLLPACGNIAALIHGKAAHGFAIRTGIAND 420 Query: 1347 VYVGSALIDMYAKCGQIRSSQCCFDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQR 1526 V+VGSAL+DMYAKCG+I S+ CFD +P++N VCWNAI+GGYAMHGK KEA++IFHMMQR Sbjct: 421 VHVGSALVDMYAKCGRIHLSRLCFDRIPSKNSVCWNAIMGGYAMHGKAKEAIDIFHMMQR 480 Query: 1527 SGQKPDLITFTSVLSACSQSGLTEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLK 1706 GQKPD I+F+ VLSACSQ GLTEEGW+ FNSMS +HG+KA++EHY+CMVNLLGR+GKL+ Sbjct: 481 RGQKPDFISFSCVLSACSQGGLTEEGWHFFNSMSRDHGVKAKMEHYSCMVNLLGRSGKLE 540 Query: 1707 EAYSMIQKMPFEPDACVWGALLSSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYA 1886 +AY++IQ+MPFEPDACVWGALLSSCR+HNN+ LGEIAA+ LF+LEP NPGNYILLSNIYA Sbjct: 541 QAYALIQQMPFEPDACVWGALLSSCRLHNNISLGEIAAQNLFKLEPSNPGNYILLSNIYA 600 Query: 1887 SNGMWNDVDIVRGMMKSMGLRKNPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSL 2066 S GMW++VD VR +M+S G++KNPGCSWIEIKN+VHMLLAGDKSHP M +IIEK+ KLS+ Sbjct: 601 SKGMWDEVDAVRDVMRSRGMKKNPGCSWIEIKNQVHMLLAGDKSHPQMTEIIEKIYKLSM 660 Query: 2067 
EMKKSGFFPVTNFVLQDVEEQDKEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGD 2246 +MKK+G+ P T+FVLQDV+EQDKEQ LCGHSEKLAV FGLLNT GSPL++IKNLRICGD Sbjct: 661 DMKKAGYLPNTDFVLQDVDEQDKEQILCGHSEKLAVAFGLLNTPPGSPLQIIKNLRICGD 720 Query: 2247 CHAFIKFLSSYEGREIFVRDTNLFHHFKDGACSCGDFW 2360 CHA IKF+S +EGREI+VRDTN FHHFKDG CSC D+W Sbjct: 721 CHAVIKFISGFEGREIYVRDTNRFHHFKDGVCSCRDYW 758 >gb|EXB68664.1| hypothetical protein L484_024678 [Morus notabilis] Length = 728 Score = 979 bits (2532), Expect = 0.0 Identities = 478/737 (64%), Positives = 577/737 (78%), Gaps = 2/737 (0%) Frame = +3 Query: 156 STSASLPQARQAHGHILKTGLSNETHFVTKLLSLYANHLCFDDASHVLDSILQPDIFSFS 335 ST SL RQ H ++LK+ S + TKLLSLYAN+LCF +A+ VLDSI PD+F FS Sbjct: 20 STPPSL--TRQLHAYLLKSN-SAQLSTTTKLLSLYANNLCFFEANLVLDSIPNPDLFCFS 76 Query: 336 TLIHASTKHHRYNHALRIFSRMLSHGLAPDCHVVPSAIKACAGLSALRTGKEVHGVVSVS 515 TLIHAS+K R++ +LR+FSRMLS + PD + PS +KA +GL +L GK++H + Sbjct: 77 TLIHASSKLGRFSFSLRLFSRMLSRQIFPDAFLFPSLVKASSGLPSLEVGKQLHSFAFLF 136 Query: 516 GLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQPDVVTWSALVAGFARHGYVKEANRLF 695 G SD FVQ+SL+H+Y+KC I +A K+FD MPQ D+V WSAL++G++ G V+EA LF Sbjct: 137 GFCSDSFVQSSLLHMYLKCDHIWDARKLFDGMPQRDLVAWSALISGYSSRGLVEEAKGLF 196 Query: 696 DGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMFKKMHSQGFQPDETSTSSVLPAVGDL 875 MG GLE N V+WNGMI+GF+ S +EAV MF++MHS+G PD +S SSVLPA+GDL Sbjct: 197 YDMGMGGLEPNVVTWNGMISGFSRSGSCSEAVDMFRRMHSEGVPPDGSSVSSVLPAIGDL 256 Query: 876 EDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGKCACAKEMSQVFDEMDKIDIGACNALV 1055 EDL +GIQ+H Y++K G GSDKCV SALIDMYGK + Sbjct: 257 EDLNVGIQVHGYVVKRGFGSDKCVTSALIDMYGKSSW----------------------- 293 Query: 1056 AGLSRNGFAENALAVFKKI--EGQGVELNVVSWTSMIACCSQNGKDIEALELFRDMQIAG 1229 LSRNGF E+AL VF+K + Q ++LN+VSWTS+IACCSQNGKD++ALELFR+MQ+ G Sbjct: 294 --LSRNGFVEDALEVFRKFKRQQQAMQLNIVSWTSVIACCSQNGKDMDALELFREMQLEG 351 Query: 1230 VMPNSVTIPCLLPACGNMAALMHGKAAHCFSLRRGFSNDVYVGSALIDMYAKCGQIRSSQ 1409 PNSVTIPC+LPACGN+AAL +GKAAHCFSLR G +++YVGSALIDMY CG++ S+ Sbjct: 352 FKPNSVTIPCMLPACGNIAALTYGKAAHCFSLRMGIFDNLYVGSALIDMYGNCGKLHLSR 411 Query: 1410 
CCFDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSG 1589 CFD +P RNLVCWNAI+ GYAMHGK +E +EIF MMQ+SGQKPD I+FT VLSACSQ+G Sbjct: 412 LCFDQLPVRNLVCWNAIMSGYAMHGKARETIEIFQMMQKSGQKPDFISFTCVLSACSQNG 471 Query: 1590 LTEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGAL 1769 LT+EGW++F+SMS EHGI+AR+EHYACMV LLGR+GKL+EAYS+I KMP EPDACVWG+L Sbjct: 472 LTDEGWHYFSSMSKEHGIEARLEHYACMVTLLGRSGKLEEAYSLINKMPMEPDACVWGSL 531 Query: 1770 LSSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSMGLR 1949 LSSCRVHNN+ LGE+AA++LFELEP+NPGNY++LSNIY S GMW+ VD VR MM GLR Sbjct: 532 LSSCRVHNNVSLGEVAAEKLFELEPRNPGNYVILSNIYGSKGMWSQVDRVRDMMNQKGLR 591 Query: 1950 KNPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLEMKKSGFFPVTNFVLQDVEEQ 2129 KNPGCSWIE+KN+VHMLLAGDKSHP QII KLNKLS+EMK SG+FP FVLQDVEEQ Sbjct: 592 KNPGCSWIEVKNEVHMLLAGDKSHPQRIQIIGKLNKLSMEMKNSGYFPNFTFVLQDVEEQ 651 Query: 2130 DKEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGDCHAFIKFLSSYEGREIFVRDT 2309 DK LCGHSEKLAV FGLLNT GS LRVIKNLRICGDCH IKF+SS+E REIFVRDT Sbjct: 652 DKVHILCGHSEKLAVAFGLLNTPPGSSLRVIKNLRICGDCHVVIKFISSFEQREIFVRDT 711 Query: 2310 NLFHHFKDGACSCGDFW 2360 N FHHFKDG CSCGD+W Sbjct: 712 NRFHHFKDGHCSCGDYW 728 >ref|XP_002301973.2| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|550344115|gb|EEE81246.2| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 724 Score = 979 bits (2531), Expect = 0.0 Identities = 493/756 (65%), Positives = 580/756 (76%) Frame = +3 Query: 93 MTRQALHLLNSSHHITLNTFISTSASLPQARQAHGHILKTGLSNETHFVTKLLSLYANHL 272 M RQAL L + H + +T ASL QA H HILKTG+S Sbjct: 13 MARQALPLFENFSHCLCS---ATKASLSQA---HAHILKTGIS----------------- 49 Query: 273 CFDDASHVLDSILQPDIFSFSTLIHASTKHHRYNHALRIFSRMLSHGLAPDCHVVPSAIK 452 L I FS L H + H +R+FS ML+ G+ PD V+P+ IK Sbjct: 50 ------------LPETIQIFSKLNH-------FGHVIRVFSYMLTQGIVPDSRVLPTVIK 90 Query: 453 ACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQPDVVT 632 CA LSAL+TGK++H VSGL D V +SL+H+YV+ +++A VFD +PQP VVT Sbjct: 91 
TCAALSALQTGKQMHCFALVSGLGLDSVVLSSLLHMYVQFDHLKDARNVFDKLPQPGVVT 150 Query: 633 WSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMFKKMH 812 SAL++ FAR G VKE LF D G+ELN VSWNGMI+GFN S Y +AVLMF+ MH Sbjct: 151 SSALISRFARKGRVKETKELFYQTRDLGVELNLVSWNGMISGFNRSGSYLDAVLMFQNMH 210 Query: 813 SQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGKCACAK 992 +G +PD TS SSVLPAVGDL+ +GIQIH Y+IK GLG DK VVSALIDMYGKCACA Sbjct: 211 LEGLKPDGTSVSSVLPAVGDLDMPLMGIQIHCYVIKQGLGPDKFVVSALIDMYGKCACAS 270 Query: 993 EMSQVFDEMDKIDIGACNALVAGLSRNGFAENALAVFKKIEGQGVELNVVSWTSMIACCS 1172 EMS VF+EMD++D+GACNALV GLSRNG +NAL VFK+ +G ++LNVVSWTSMIA CS Sbjct: 271 EMSGVFNEMDEVDVGACNALVTGLSRNGLVDNALEVFKQFKG--MDLNVVSWTSMIASCS 328 Query: 1173 QNGKDIEALELFRDMQIAGVMPNSVTIPCLLPACGNMAALMHGKAAHCFSLRRGFSNDVY 1352 QNGKD+EALELFR+MQI GV PNSVTIPCLLPACGN+AAL+HGKAAHCFSLR G NDVY Sbjct: 329 QNGKDMEALELFREMQIEGVKPNSVTIPCLLPACGNIAALLHGKAAHCFSLRNGIFNDVY 388 Query: 1353 VGSALIDMYAKCGQIRSSQCCFDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRSG 1532 VGSALIDMYAKCG++ +S+ CFD MP RNLV WN+++ GYAMHGK EA+ IF +MQR G Sbjct: 389 VGSALIDMYAKCGRMLASRLCFDMMPNRNLVSWNSLMAGYAMHGKTFEAINIFELMQRCG 448 Query: 1533 QKPDLITFTSVLSACSQSGLTEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKEA 1712 QKPD ++FT VLSAC+Q GLTEEGW++F+SMS HG++AR+EHY+CMV LLGR+G+L+EA Sbjct: 449 QKPDHVSFTCVLSACTQGGLTEEGWFYFDSMSRNHGVEARMEHYSCMVTLLGRSGRLEEA 508 Query: 1713 YSMIQKMPFEPDACVWGALLSSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYASN 1892 Y+MI++MPFEPD+CVWGALLSSCRVHN + LGEIAAK +FELEP+NPGNYILLSNIYAS Sbjct: 509 YAMIKQMPFEPDSCVWGALLSSCRVHNRVDLGEIAAKRVFELEPRNPGNYILLSNIYASK 568 Query: 1893 GMWNDVDIVRGMMKSMGLRKNPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLEM 2072 MW +VD+VR MM+S GL+KNPG SWIEIKNKVHMLLAGD SHP M QIIEKL KL++EM Sbjct: 569 AMWVEVDMVRDMMRSRGLKKNPGYSWIEIKNKVHMLLAGDSSHPQMPQIIEKLAKLTVEM 628 Query: 2073 KKSGFFPVTNFVLQDVEEQDKEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGDCH 2252 KKSG+ P T+FVLQDVEEQDKEQ LCGHSEKLAVV GLLNT G PL+VIKNLRIC DCH Sbjct: 629 KKSGYVPHTDFVLQDVEEQDKEQILCGHSEKLAVVLGLLNTKPGFPLQVIKNLRICRDCH 688 Query: 2253 
AFIKFLSSYEGREIFVRDTNLFHHFKDGACSCGDFW 2360 A IKF+S +E REIFVRDTN FH FK G CSCGD+W Sbjct: 689 AVIKFISDFEKREIFVRDTNRFHQFKGGVCSCGDYW 724 >gb|EYU24286.1| hypothetical protein MIMGU_mgv1a025107mg [Mimulus guttatus] Length = 654 Score = 979 bits (2530), Expect = 0.0 Identities = 459/654 (70%), Positives = 552/654 (84%) Frame = +3 Query: 399 MLSHGLAPDCHVVPSAIKACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGK 578 ML HGL PD HV+PS IKACAGL A+ GK+VHG SG++ D FVQ+SLVH YVKC + Sbjct: 1 MLKHGLFPDAHVLPSVIKACAGLLAVNIGKQVHGFSLASGISLDSFVQSSLVHFYVKCDE 60 Query: 579 IREAHKVFDTMPQPDVVTWSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAG 758 + +AHK+FD M + DVV+WSAL AG+AR G A ++F+ + + G + N VSWNGMIAG Sbjct: 61 LVDAHKLFDNMVERDVVSWSALAAGYARKGDRVNARKVFNEVKNLGFQPNTVSWNGMIAG 120 Query: 759 FNHSRLYAEAVLMFKKMHSQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSD 938 FN S + +AVLMF++MH GF+ D TS SSVLPA+GDL L G Q+H Y+IK+G D Sbjct: 121 FNQSGCFLDAVLMFQQMHKHGFKSDGTSISSVLPAIGDLGYLSTGTQVHGYVIKNGFAVD 180 Query: 939 KCVVSALIDMYGKCACAKEMSQVFDEMDKIDIGACNALVAGLSRNGFAENALAVFKKIEG 1118 KC+VSALIDMYGKC CA EMSQV ++M ++++GACNAL+ GL+R+G + AL VFK+++G Sbjct: 181 KCIVSALIDMYGKCGCALEMSQVLEDMGQVEVGACNALITGLARHGLVDKALRVFKELQG 240 Query: 1119 QGVELNVVSWTSMIACCSQNGKDIEALELFRDMQIAGVMPNSVTIPCLLPACGNMAALMH 1298 Q +ELNVVSWTS+IACCSQ+GKDIEALELFR+MQ AGV PN+VTIPCLLPACGN+AALMH Sbjct: 241 QQMELNVVSWTSVIACCSQHGKDIEALELFREMQSAGVKPNAVTIPCLLPACGNIAALMH 300 Query: 1299 GKAAHCFSLRRGFSNDVYVGSALIDMYAKCGQIRSSQCCFDGMPARNLVCWNAIIGGYAM 1478 GKAAHCFSLRRG S DVYVGSALIDMYA CG+I+ ++CCFD MP RNLVCWNA++GGYAM Sbjct: 301 GKAAHCFSLRRGISGDVYVGSALIDMYANCGKIQLARCCFDRMPVRNLVCWNAMLGGYAM 360 Query: 1479 HGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGLTEEGWYHFNSMSVEHGIKARVE 1658 HGK EA+E F +MQRSGQKPD ++ TS+LSACSQSGLTEEG +F+ M+ +HGIK RVE Sbjct: 361 HGKANEAIEFFLLMQRSGQKPDSVSLTSLLSACSQSGLTEEGHRYFDRMTTDHGIKPRVE 420 Query: 1659 HYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGALLSSCRVHNNLRLGEIAAKELFEL 1838 HYAC+V+LLGRAGKL+EAYSMI+KMPFEPDACVWGALLSSCRVH+N+ LGE+AA++LFEL Sbjct: 421 
HYACVVSLLGRAGKLEEAYSMIEKMPFEPDACVWGALLSSCRVHHNMSLGEVAARKLFEL 480 Query: 1839 EPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSMGLRKNPGCSWIEIKNKVHMLLAGDKS 2018 EP NPGNYIL+SNIYAS G + +VD +R +M+ GLRKNPGCSWIE+KNKVHMLLAGDKS Sbjct: 481 EPMNPGNYILMSNIYASKGRYKEVDKIRDIMRDKGLRKNPGCSWIEVKNKVHMLLAGDKS 540 Query: 2019 HPHMAQIIEKLNKLSLEMKKSGFFPVTNFVLQDVEEQDKEQFLCGHSEKLAVVFGLLNTH 2198 P MAQI++KLN+LS+EMKK+G+ P T++VLQDVEEQ+KE LCGHSEKLAVVFG+LNT Sbjct: 541 LPQMAQIMDKLNRLSIEMKKAGYSPNTDYVLQDVEEQEKEHILCGHSEKLAVVFGILNTS 600 Query: 2199 QGSPLRVIKNLRICGDCHAFIKFLSSYEGREIFVRDTNLFHHFKDGACSCGDFW 2360 GSPLRV KNLRICGDCHA IKF+S +E REIFVRDTN +HHFKDG CSCGD+W Sbjct: 601 PGSPLRVTKNLRICGDCHAVIKFISRFERREIFVRDTNRYHHFKDGDCSCGDYW 654 Score = 175 bits (443), Expect = 1e-40 Identities = 136/492 (27%), Positives = 222/492 (45%), Gaps = 42/492 (8%) Frame = +3 Query: 183 RQAHGHILKTGLSNETHFVTKLLSLYANHLCFDDASHVLDSILQPDIFSFSTLIHA-STK 359 +Q HG L +G+S ++ + L+ Y DA + D++++ D+ S+S L + K Sbjct: 30 KQVHGFSLASGISLDSFVQSSLVHFYVKCDELVDAHKLFDNMVERDVVSWSALAAGYARK 89 Query: 360 HHRYN----------------------------------HALRIFSRMLSHGLAPDCHVV 437 R N A+ +F +M HG D + Sbjct: 90 GDRVNARKVFNEVKNLGFQPNTVSWNGMIAGFNQSGCFLDAVLMFQQMHKHGFKSDGTSI 149 Query: 438 PSAIKACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQ 617 S + A L L TG +VHG V +G A D + ++L+ +Y KCG E +V + M Q Sbjct: 150 SSVLPAIGDLGYLSTGTQVHGYVIKNGFAVDKCIVSALIDMYGKCGCALEMSQVLEDMGQ 209 Query: 618 PDVVTWSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLM 797 +V +AL+ G ARHG V +A R+F + +ELN VSW +IA + EA+ + Sbjct: 210 VEVGACNALITGLARHGLVDKALRVFKELQGQQMELNVVSWTSVIACCSQHGKDIEALEL 269 Query: 798 FKKMHSQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGK 977 F++M S G +P+ + +LPA G++ L G H + ++ G+ D V SALIDMY Sbjct: 270 FREMQSAGVKPNAVTIPCLLPACGNIAALMHGKAAHCFSLRRGISGDVYVGSALIDMYAN 329 Query: 978 CACAKEMSQVFDEMDKIDIGACNALVAGLSRNGFAENALAVFKKIEGQGVELNVVSWTSM 1157 C + FD M ++ NA++ G + +G A A+ F ++ G + + VS TS+ Sbjct: 330 CGKIQLARCCFDRMPVRNLVCWNAMLGGYAMHGKANEAIEFFLLMQRSGQKPDSVSLTSL 389 Query: 1158 
IACCSQNGKDIEALELFRDMQI-AGVMPNSVTIPCLLPACGNMAALMHGKAAHCFSLRRG 1334 ++ CSQ+G E F M G+ P C++ G L + A+ + Sbjct: 390 LSACSQSGLTEEGHRYFDRMTTDHGIKPRVEHYACVVSLLGRAGKL---EEAYSMIEKMP 446 Query: 1335 FSNDVYVGSALIDM-----YAKCGQIRSSQCC-FDGMPARNLVCWNAIIGGYAMHGKVKE 1496 F D V AL+ G++ + + + M N + + I YA G+ KE Sbjct: 447 FEPDACVWGALLSSCRVHHNMSLGEVAARKLFELEPMNPGNYILMSNI---YASKGRYKE 503 Query: 1497 ALEIFHMMQRSG 1532 +I +M+ G Sbjct: 504 VDKIRDIMRDKG 515 >ref|XP_003549191.1| PREDICTED: pentatricopeptide repeat-containing protein At1g20230-like isoform X1 [Glycine max] Length = 748 Score = 978 bits (2529), Expect = 0.0 Identities = 478/746 (64%), Positives = 580/746 (77%), Gaps = 3/746 (0%) Frame = +3 Query: 132 HITLNTFISTSASLPQARQAHGHILKTGLSNETHFVTKLLSLYANHLCFD--DASHVLDS 305 H S++ASL QARQAH IL+ L ++T T LLS YAN L S L S Sbjct: 3 HALSQCLSSSTASLSQARQAHALILRLNLFSDTQLTTSLLSFYANALSLSTPQLSLTLSS 62 Query: 306 IL-QPDIFSFSTLIHASTKHHRYNHALRIFSRMLSHGLAPDCHVVPSAIKACAGLSALRT 482 L P +FSFS+LIHA + H + H L FS + L PD ++PSAIK+CA L AL Sbjct: 63 HLPHPTLFSFSSLIHAFARSHHFPHVLTTFSHLHPLRLIPDAFLLPSAIKSCASLRALDP 122 Query: 483 GKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQPDVVTWSALVAGFAR 662 G+++H + SG +D V +SL H+Y+KC +I +A K+FD MP DVV WSA++AG++R Sbjct: 123 GQQLHAFAAASGFLTDSIVASSLTHMYLKCDRILDARKLFDRMPDRDVVVWSAMIAGYSR 182 Query: 663 HGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMFKKMHSQGFQPDETS 842 G V+EA LF M G+E N VSWNGM+AGF ++ Y EAV MF+ M QGF PD ++ Sbjct: 183 LGLVEEAKELFGEMRSGGVEPNLVSWNGMLAGFGNNGFYDEAVGMFRMMLVQGFWPDGST 242 Query: 843 TSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGKCACAKEMSQVFDEMD 1022 S VLPAVG LED+ +G Q+H Y+IK GLGSDK VVSA++DMYGKC C KEMS+VFDE++ Sbjct: 243 VSCVLPAVGCLEDVVVGAQVHGYVIKQGLGSDKFVVSAMLDMYGKCGCVKEMSRVFDEVE 302 Query: 1023 KIDIGACNALVAGLSRNGFAENALAVFKKIEGQGVELNVVSWTSMIACCSQNGKDIEALE 1202 +++IG+ NA + GLSRNG + AL VF K + Q +ELNVV+WTS+IA CSQNGKD+EALE Sbjct: 303 EMEIGSLNAFLTGLSRNGMVDTALEVFNKFKDQKMELNVVTWTSIIASCSQNGKDLEALE 362 Query: 1203 LFRDMQIAGVMPNSVTIPCLLPACGNMAALMHGKAAHCFSLRRGFSNDVYVGSALIDMYA 
1382 LFRDMQ GV PN+VTIP L+PACGN++ALMHGK HCFSLRRG +DVYVGSALIDMYA Sbjct: 363 LFRDMQAYGVEPNAVTIPSLIPACGNISALMHGKEIHCFSLRRGIFDDVYVGSALIDMYA 422 Query: 1383 KCGQIRSSQCCFDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTS 1562 KCG+I+ ++ CFD M A NLV WNA++ GYAMHGK KE +E+FHMM +SGQKPDL+TFT Sbjct: 423 KCGRIQLARRCFDKMSALNLVSWNAVMKGYAMHGKAKETMEMFHMMLQSGQKPDLVTFTC 482 Query: 1563 VLSACSQSGLTEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFE 1742 VLSAC+Q+GLTEEGW +NSMS EHGI+ ++EHYAC+V LL R GKL+EAYS+I++MPFE Sbjct: 483 VLSACAQNGLTEEGWRCYNSMSEEHGIEPKMEHYACLVTLLSRVGKLEEAYSIIKEMPFE 542 Query: 1743 PDACVWGALLSSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYASNGMWNDVDIVR 1922 PDACVWGALLSSCRVHNNL LGEIAA++LF LEP NPGNYILLSNIYAS G+W++ + +R Sbjct: 543 PDACVWGALLSSCRVHNNLSLGEIAAEKLFFLEPTNPGNYILLSNIYASKGLWDEENRIR 602 Query: 1923 GMMKSMGLRKNPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLEMKKSGFFPVTN 2102 +MKS GLRKNPG SWIE+ +KVHMLLAGD+SHP M I+EKL+KL+++MKKSG+ P TN Sbjct: 603 EVMKSKGLRKNPGYSWIEVGHKVHMLLAGDQSHPQMKDILEKLDKLNMQMKKSGYLPKTN 662 Query: 2103 FVLQDVEEQDKEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGDCHAFIKFLSSYE 2282 FVLQDVEEQDKEQ LCGHSEKLAVV GLLNT G PL+VIKNLRIC DCHA IK +S E Sbjct: 663 FVLQDVEEQDKEQILCGHSEKLAVVLGLLNTSPGQPLQVIKNLRICDDCHAVIKVISRLE 722 Query: 2283 GREIFVRDTNLFHHFKDGACSCGDFW 2360 GREI+VRDTN FHHFKDG CSCGDFW Sbjct: 723 GREIYVRDTNRFHHFKDGVCSCGDFW 748 >gb|EYU18955.1| hypothetical protein MIMGU_mgv1a022111mg [Mimulus guttatus] Length = 654 Score = 969 bits (2505), Expect = 0.0 Identities = 457/654 (69%), Positives = 549/654 (83%) Frame = +3 Query: 399 MLSHGLAPDCHVVPSAIKACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGK 578 ML GL PD HV+PS IKACAGL A++ GK+VHG SG++ D FVQ+SLVH YVKC + Sbjct: 1 MLKQGLFPDAHVLPSVIKACAGLLAVKIGKQVHGFSLASGISLDSFVQSSLVHFYVKCDE 60 Query: 579 IREAHKVFDTMPQPDVVTWSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAG 758 + +AHK+FD M + DVV+WSAL AG+AR G A ++F+ + + G + N VSWNGMIAG Sbjct: 61 LVDAHKLFDNMVERDVVSWSALAAGYARKGDAVNARKVFNEVKNLGFQPNTVSWNGMIAG 120 Query: 759 FNHSRLYAEAVLMFKKMHSQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSD 
938 FN S + +AVLMF++MH GF+ D TS SSVLPA+GDL L G Q+H Y+IK+G D Sbjct: 121 FNRSGCFLDAVLMFQQMHKHGFKSDGTSISSVLPAIGDLGYLSSGTQVHGYVIKNGFAVD 180 Query: 939 KCVVSALIDMYGKCACAKEMSQVFDEMDKIDIGACNALVAGLSRNGFAENALAVFKKIEG 1118 KC+VSALIDMYGKC A EMSQV ++M ++++GACNAL+ GL+R+G + AL VFK+++G Sbjct: 181 KCIVSALIDMYGKCGYALEMSQVLEDMGQVEVGACNALITGLARHGLVDKALGVFKELQG 240 Query: 1119 QGVELNVVSWTSMIACCSQNGKDIEALELFRDMQIAGVMPNSVTIPCLLPACGNMAALMH 1298 Q +ELNVVSWTS+IACCSQ+GKDIEALELFR+MQ +GV PN+VTIPCLLPACGN+AALMH Sbjct: 241 QQMELNVVSWTSVIACCSQHGKDIEALELFREMQASGVKPNAVTIPCLLPACGNIAALMH 300 Query: 1299 GKAAHCFSLRRGFSNDVYVGSALIDMYAKCGQIRSSQCCFDGMPARNLVCWNAIIGGYAM 1478 GKAAHCFSLRRG S DVYVGSALIDMYA CG+I+ ++CCFD M RNLVCWNA++GGYAM Sbjct: 301 GKAAHCFSLRRGISGDVYVGSALIDMYANCGKIQLARCCFDRMSVRNLVCWNAMLGGYAM 360 Query: 1479 HGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGLTEEGWYHFNSMSVEHGIKARVE 1658 HGK KEA+E F +MQRSGQKPD ++ TS+LSACSQSGLTEEG +F+ M+ +HGIK RVE Sbjct: 361 HGKAKEAIEFFLLMQRSGQKPDSVSLTSLLSACSQSGLTEEGHRYFDRMTTDHGIKPRVE 420 Query: 1659 HYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGALLSSCRVHNNLRLGEIAAKELFEL 1838 HYAC+V+LLGRAGKL+EAYSMI+KMPFEPDACVWGALLSSCRVH+N+ LG +AA++LFEL Sbjct: 421 HYACVVSLLGRAGKLEEAYSMIEKMPFEPDACVWGALLSSCRVHHNMSLGGVAARKLFEL 480 Query: 1839 EPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSMGLRKNPGCSWIEIKNKVHMLLAGDKS 2018 EPKNPGNYILLSNIYAS G + +VD +R +M GLRKNPGCSWIE+KNKVHMLLAGDKS Sbjct: 481 EPKNPGNYILLSNIYASKGRYKEVDKIRDIMGDKGLRKNPGCSWIEVKNKVHMLLAGDKS 540 Query: 2019 HPHMAQIIEKLNKLSLEMKKSGFFPVTNFVLQDVEEQDKEQFLCGHSEKLAVVFGLLNTH 2198 P MAQI+EKLN+LS+EMKK+G+ P T++VLQDVEEQ+KE LCGHSEKLAVVFG+LN Sbjct: 541 LPQMAQIMEKLNRLSIEMKKAGYSPNTDYVLQDVEEQEKEHILCGHSEKLAVVFGILNMS 600 Query: 2199 QGSPLRVIKNLRICGDCHAFIKFLSSYEGREIFVRDTNLFHHFKDGACSCGDFW 2360 GSPLRV KNLRICGDCHA IKF+S +E REIFVRDTN +HHFKDG CSCGD+W Sbjct: 601 PGSPLRVTKNLRICGDCHAVIKFISRFERREIFVRDTNRYHHFKDGDCSCGDYW 654 Score = 169 bits (429), Expect = 5e-39 Identities = 122/432 (28%), Positives = 199/432 (46%), Gaps = 36/432 (8%) Frame = +3 Query: 183 
RQAHGHILKTGLSNET-------HFVTKLLSLYANHLCFDD------------------- 284 +Q HG L +G+S ++ HF K L H FD+ Sbjct: 30 KQVHGFSLASGISLDSFVQSSLVHFYVKCDELVDAHKLFDNMVERDVVSWSALAAGYARK 89 Query: 285 -----ASHVLDSI----LQPDIFSFSTLIHASTKHHRYNHALRIFSRMLSHGLAPDCHVV 437 A V + + QP+ S++ +I + + A+ +F +M HG D + Sbjct: 90 GDAVNARKVFNEVKNLGFQPNTVSWNGMIAGFNRSGCFLDAVLMFQQMHKHGFKSDGTSI 149 Query: 438 PSAIKACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQ 617 S + A L L +G +VHG V +G A D + ++L+ +Y KCG E +V + M Q Sbjct: 150 SSVLPAIGDLGYLSSGTQVHGYVIKNGFAVDKCIVSALIDMYGKCGYALEMSQVLEDMGQ 209 Query: 618 PDVVTWSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLM 797 +V +AL+ G ARHG V +A +F + +ELN VSW +IA + EA+ + Sbjct: 210 VEVGACNALITGLARHGLVDKALGVFKELQGQQMELNVVSWTSVIACCSQHGKDIEALEL 269 Query: 798 FKKMHSQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGK 977 F++M + G +P+ + +LPA G++ L G H + ++ G+ D V SALIDMY Sbjct: 270 FREMQASGVKPNAVTIPCLLPACGNIAALMHGKAAHCFSLRRGISGDVYVGSALIDMYAN 329 Query: 978 CACAKEMSQVFDEMDKIDIGACNALVAGLSRNGFAENALAVFKKIEGQGVELNVVSWTSM 1157 C + FD M ++ NA++ G + +G A+ A+ F ++ G + + VS TS+ Sbjct: 330 CGKIQLARCCFDRMSVRNLVCWNAMLGGYAMHGKAKEAIEFFLLMQRSGQKPDSVSLTSL 389 Query: 1158 IACCSQNGKDIEALELFRDMQI-AGVMPNSVTIPCLLPACGNMAALMHGKAAHCFSLRRG 1334 ++ CSQ+G E F M G+ P C++ G L + A+ + Sbjct: 390 LSACSQSGLTEEGHRYFDRMTTDHGIKPRVEHYACVVSLLGRAGKL---EEAYSMIEKMP 446 Query: 1335 FSNDVYVGSALI 1370 F D V AL+ Sbjct: 447 FEPDACVWGALL 458 >ref|XP_004515286.1| PREDICTED: pentatricopeptide repeat-containing protein At1g20230-like [Cicer arietinum] Length = 730 Score = 944 bits (2440), Expect = 0.0 Identities = 451/735 (61%), Positives = 568/735 (77%) Frame = +3 Query: 156 STSASLPQARQAHGHILKTGLSNETHFVTKLLSLYANHLCFDDASHVLDSILQPDIFSFS 335 ST+++L ARQAH H LK GL +T T LLSLY+++L F VL S+ QP +FSFS Sbjct: 11 STTSTLFHARQAHAHFLKFGLFFDTQLTTSLLSLYSHYLPFTQLKLVLSSLPQPTLFSFS 70 Query: 336 TLIHASTKHHRYNHALRIFSRMLSHGLAPDCHVVPSAIKACAGLSALRTGKEVHGVVSVS 515 ++I++ + +NH L +FS+M S GL PD +++PSAIKAC+ L AL+ G++VHG VS Sbjct: 71 
SIINSFARSRHFNHVLGVFSQMGSLGLVPDSYLLPSAIKACSALKALKLGRQVHGFAYVS 130 Query: 516 GLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQPDVVTWSALVAGFARHGYVKEANRLF 695 G SD + +SLVH+Y+KC I +A K+FD+M + DVV WSA++AG++R G V A LF Sbjct: 131 GFGSDSILISSLVHMYLKCKTIEDAQKLFDSMSERDVVVWSAMIAGYSRLGLVDRAKELF 190 Query: 696 DGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMFKKMHSQGFQPDETSTSSVLPAVGDL 875 M + G+E N VSWNGMIAGF ++ Y EA ++F+ M S+GF PD ++ S VLP +G+L Sbjct: 191 SEMRNEGVEPNLVSWNGMIAGFGNAGSYGEAAMLFRGMISEGFLPDGSAVSCVLPGIGNL 250 Query: 876 EDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGKCACAKEMSQVFDEMDKIDIGACNALV 1055 ED+ +G Q+H Y+IK GL SD V+SAL+DMYGKC C EMS+VFDE+D+ +IG+ NA + Sbjct: 251 EDVLMGKQVHGYVIKQGLDSDNFVISALLDMYGKCGCTSEMSRVFDEIDQTEIGSLNAFL 310 Query: 1056 AGLSRNGFAENALAVFKKIEGQGVELNVVSWTSMIACCSQNGKDIEALELFRDMQIAGVM 1235 GLSRNG + AL +FKK + Q +ELNVV+WTS+IA C+Q+GKD+EALE FRDMQ GV Sbjct: 311 TGLSRNGLVDTALEMFKKFKAQEIELNVVTWTSIIASCTQHGKDMEALEFFRDMQADGVE 370 Query: 1236 PNSVTIPCLLPACGNMAALMHGKAAHCFSLRRGFSNDVYVGSALIDMYAKCGQIRSSQCC 1415 P +VTIP L+PACGN++AL HGK HCFSLR+G +DVYVGSALIDMYAKCG+I+ S+ C Sbjct: 371 PTAVTIPSLIPACGNVSALTHGKEIHCFSLRKGIFDDVYVGSALIDMYAKCGRIQLSRHC 430 Query: 1416 FDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGLT 1595 FD MPA+NLV WN+++ GYAMHGK +E +E+F+MM +SGQKPDLITFT VLSAC+Q+GL Sbjct: 431 FDIMPAKNLVSWNSVMSGYAMHGKARETIEMFNMMLQSGQKPDLITFTCVLSACTQNGLI 490 Query: 1596 EEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGALLS 1775 EEGW +FNSMS EH ++ R+EHY AYS++++MPFEPDACVWG+LLS Sbjct: 491 EEGWNYFNSMSKEHDVEPRMEHY---------------AYSIVKEMPFEPDACVWGSLLS 535 Query: 1776 SCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSMGLRKN 1955 SCRVH NL LGEIAA++LF LEP NPGNY+LLSNIYAS GMW + + +R MMK+ GLRKN Sbjct: 536 SCRVHKNLSLGEIAAEKLFVLEPDNPGNYVLLSNIYASKGMWGEENRIRNMMKNKGLRKN 595 Query: 1956 PGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLEMKKSGFFPVTNFVLQDVEEQDK 2135 PGCSWIEI +VH LL+GDKSHP M +I+EK +KLS+E+KKSG+ P+TN VLQDVEEQDK Sbjct: 596 PGCSWIEIGRRVHTLLSGDKSHPQMKEILEKSDKLSIEIKKSGYLPMTNTVLQDVEEQDK 655 Query: 2136 
EQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGDCHAFIKFLSSYEGREIFVRDTNL 2315 EQ LCGHSEKLAVV GLLNT G PL+VIKNLRIC DCHA IK +S E REI+VRDTN Sbjct: 656 EQILCGHSEKLAVVLGLLNTSPGQPLQVIKNLRICDDCHAVIKVISRLEAREIYVRDTNR 715 Query: 2316 FHHFKDGACSCGDFW 2360 FHHFKDG CSC DFW Sbjct: 716 FHHFKDGVCSCEDFW 730 >ref|XP_002890375.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297336217|gb|EFH66634.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 760 Score = 941 bits (2433), Expect = 0.0 Identities = 450/760 (59%), Positives = 583/760 (76%), Gaps = 4/760 (0%) Frame = +3 Query: 93 MTRQALHLLNSSHHITLNTFISTSA----SLPQARQAHGHILKTGLSNETHFVTKLLSLY 260 MT+Q L L+ L S+S+ SL + QAH ILK+G N+ + KL++ Y Sbjct: 1 MTKQVLPLIEKIPQTILGILESSSSLWSSSLSKTTQAHARILKSGAQNDGYISAKLIASY 60 Query: 261 ANHLCFDDASHVLDSILQPDIFSFSTLIHASTKHHRYNHALRIFSRMLSHGLAPDCHVVP 440 +N+ CF+DA +L SI P ++SFS+LI+A TK ++ ++ +FSRM SHGL PD HV+P Sbjct: 61 SNYNCFNDADLILQSIPDPTVYSFSSLIYALTKAKLFSQSIGVFSRMFSHGLIPDTHVLP 120 Query: 441 SAIKACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQP 620 + K CA LSA + GK++H V VSGL D FVQ SL H+Y++CG++ +A KVFD M + Sbjct: 121 NLFKVCAELSAFKAGKQIHCVACVSGLDMDAFVQGSLFHMYMRCGRMGDARKVFDRMSEK 180 Query: 621 DVVTWSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMF 800 DVVT SAL+ G+AR G ++E R+ M SG+E N VSWNG+++GFN S + EAV+MF Sbjct: 181 DVVTCSALLCGYARKGCLEEVVRILSEMEKSGIEPNIVSWNGILSGFNRSGYHKEAVIMF 240 Query: 801 KKMHSQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGKC 980 +KMH GF PD+ + SSVLP+VGD E+L +G QIH Y+IK GL DKCV+SA++DMYGK Sbjct: 241 QKMHHLGFCPDQVTVSSVLPSVGDSENLNMGRQIHGYVIKQGLLKDKCVISAMLDMYGKS 300 Query: 981 ACAKEMSQVFDEMDKIDIGACNALVAGLSRNGFAENALAVFKKIEGQGVELNVVSWTSMI 1160 + ++FDE + ++ G CNA + GLSRNG + AL +F + Q +ELNVVSWTS+I Sbjct: 301 GHVYGIIKLFDEFEMMETGVCNAYITGLSRNGLVDKALEMFGLFKEQKMELNVVSWTSII 360 Query: 1161 ACCSQNGKDIEALELFRDMQIAGVMPNSVTIPCLLPACGNMAALMHGKAAHCFSLRRGFS 1340 A C+QNGKDIEALELFR+MQ+AGV PN VTIP +LPACGN+AAL HG++ H F++R Sbjct: 
361 AGCAQNGKDIEALELFREMQVAGVKPNRVTIPSMLPACGNIAALGHGRSTHGFAVRVHLL 420 Query: 1341 NDVYVGSALIDMYAKCGQIRSSQCCFDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMM 1520 +DV+VGSALIDMYAKCG+I+ SQ F+ MP +NLVCWN+++ GY+MHGK KE + IF + Sbjct: 421 DDVHVGSALIDMYAKCGRIKMSQIVFNMMPTKNLVCWNSLMNGYSMHGKAKEVMSIFESL 480 Query: 1521 QRSGQKPDLITFTSVLSACSQSGLTEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGK 1700 R+ KPD I+FTS+LSAC Q GLT+EGW +FN MS E+GIK R+EHY+CMVNLLGRAGK Sbjct: 481 MRTRLKPDFISFTSLLSACGQVGLTDEGWKYFNMMSEEYGIKPRLEHYSCMVNLLGRAGK 540 Query: 1701 LKEAYSMIQKMPFEPDACVWGALLSSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNI 1880 L+EAY +I+++PFEPD+CVWGALL+SCR+ NN+ L EIAA++LF LEP+NPG Y+L+SNI Sbjct: 541 LQEAYDLIKEIPFEPDSCVWGALLNSCRLQNNVDLAEIAAQKLFHLEPENPGTYVLMSNI 600 Query: 1881 YASNGMWNDVDIVRGMMKSMGLRKNPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKL 2060 YA+ GMW +VD +R M+S+GL+KNPGCSWI++KNKV+ LLA DKSHP + QI EK++++ Sbjct: 601 YAAKGMWTEVDSIRNKMESLGLKKNPGCSWIQVKNKVYTLLACDKSHPQIDQITEKMDEI 660 Query: 2061 SLEMKKSGFFPVTNFVLQDVEEQDKEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRIC 2240 S EM+KSG P +F LQDVEEQ++EQ L GHSEKLAVVFGLLNT G+PL+VIKNLRIC Sbjct: 661 SEEMRKSGHRPNLDFALQDVEEQEQEQMLWGHSEKLAVVFGLLNTPDGTPLQVIKNLRIC 720 Query: 2241 GDCHAFIKFLSSYEGREIFVRDTNLFHHFKDGACSCGDFW 2360 GDCHA IKF+SSY GREIF+RDTN FHHFKDG CSCGDFW Sbjct: 721 GDCHAVIKFISSYAGREIFIRDTNRFHHFKDGICSCGDFW 760 >ref|XP_006306841.1| hypothetical protein CARUB_v10008385mg [Capsella rubella] gi|482575552|gb|EOA39739.1| hypothetical protein CARUB_v10008385mg [Capsella rubella] Length = 760 Score = 939 bits (2428), Expect = 0.0 Identities = 451/760 (59%), Positives = 585/760 (76%), Gaps = 4/760 (0%) Frame = +3 Query: 93 MTRQALHLLNSSHHITLNTFISTSA----SLPQARQAHGHILKTGLSNETHFVTKLLSLY 260 MT+Q L L+ + S+S+ SL + QAH ILK+G N+ + KL++ Y Sbjct: 1 MTKQVLPLIVQIPQSIVGFLESSSSIWSSSLSKTTQAHARILKSGAQNDGYISAKLIASY 60 Query: 261 ANHLCFDDASHVLDSILQPDIFSFSTLIHASTKHHRYNHALRIFSRMLSHGLAPDCHVVP 440 +N+ CFDDA VL SI P ++SFS+LI+A TK ++ ++ +FSRM SHGL PD HV+P Sbjct: 61 SNYSCFDDADLVLQSIPDPTVYSFSSLIYALTKAKLFSQSIGVFSRMFSHGLIPDSHVLP 120 Query: 
441 SAIKACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQP 620 + K CA LSA + GK++H V VSGL D FVQ SL H+Y++CG++ +A KVFD M + Sbjct: 121 NLFKVCAELSAFKVGKQIHCVSCVSGLDMDAFVQGSLFHMYMRCGRMGDARKVFDRMFEK 180 Query: 621 DVVTWSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMF 800 DVVT SAL+ G+AR G ++E R+ GM +SG+E N VSWNG+++GFN S + EAV+MF Sbjct: 181 DVVTCSALLCGYARKGCLEEVVRILSGMENSGIEPNIVSWNGILSGFNRSGYHREAVIMF 240 Query: 801 KKMHSQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGKC 980 +KMH GF PD+ + SSVLP+VGD E L +G QIH Y+IK GL DKCV+SA++DMYGK Sbjct: 241 QKMHLCGFSPDQVTVSSVLPSVGDSEMLNMGRQIHGYVIKQGLLKDKCVISAMLDMYGKS 300 Query: 981 ACAKEMSQVFDEMDKIDIGACNALVAGLSRNGFAENALAVFKKIEGQGVELNVVSWTSMI 1160 + ++FDE + ++ G CNA + GLSRNG + AL +F+ + Q VELNVVSWTS+I Sbjct: 301 GHVYGIIKLFDEFEMMETGVCNAYITGLSRNGLVDKALEMFELFKEQKVELNVVSWTSII 360 Query: 1161 ACCSQNGKDIEALELFRDMQIAGVMPNSVTIPCLLPACGNMAALMHGKAAHCFSLRRGFS 1340 A C+QNGKDIEALELFR+MQ+AGV PN VTIP +LPACGN+AAL HG++ H F++R Sbjct: 361 AGCAQNGKDIEALELFREMQVAGVKPNRVTIPSMLPACGNIAALGHGRSTHGFAVRVHLW 420 Query: 1341 NDVYVGSALIDMYAKCGQIRSSQCCFDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMM 1520 +DV+VGSALIDMYAKCG+I SQ F+ MP +NLVCWN+++ GY+MHGK KE + IF + Sbjct: 421 DDVHVGSALIDMYAKCGRINMSQFVFNMMPTKNLVCWNSLMNGYSMHGKAKEVMSIFESL 480 Query: 1521 QRSGQKPDLITFTSVLSACSQSGLTEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGK 1700 R+ KPD I+FTS+L++C Q GLT+EGW +F+ MS E+GIK R+EHY+CMVNLLGRAGK Sbjct: 481 LRTRLKPDFISFTSLLASCGQVGLTDEGWKYFSMMSEEYGIKPRLEHYSCMVNLLGRAGK 540 Query: 1701 LKEAYSMIQKMPFEPDACVWGALLSSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNI 1880 L+EAY +I++MPFEPD+CVWGALL+SCR+ +N+ L EIAA +LF+LEP+NPG Y+LLSNI Sbjct: 541 LQEAYELIKEMPFEPDSCVWGALLNSCRLQSNVDLAEIAADKLFDLEPENPGTYVLLSNI 600 Query: 1881 YASNGMWNDVDIVRGMMKSMGLRKNPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKL 2060 YA+ GMW +VD +R M+S+GL+KNPGCSWI++KN+V+ LLAGDKSHP + QI EK++++ Sbjct: 601 YAAKGMWTEVDSIRNKMESLGLKKNPGCSWIQVKNRVYTLLAGDKSHPQIDQITEKMDEI 660 Query: 2061 SLEMKKSGFFPVTNFVLQDVEEQDKEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRIC 2240 S EM+KSG P +F LQDVEEQ++EQ L 
GHSEKLAVVFGLLNT G+PL+VIKNLRIC Sbjct: 661 SEEMRKSGHRPNLDFALQDVEEQEQEQMLWGHSEKLAVVFGLLNTPDGTPLQVIKNLRIC 720 Query: 2241 GDCHAFIKFLSSYEGREIFVRDTNLFHHFKDGACSCGDFW 2360 GDCH+ IKF+SSY GREIFVRDTN FHHFKDG CSCGDFW Sbjct: 721 GDCHSVIKFISSYAGREIFVRDTNRFHHFKDGICSCGDFW 760 >ref|XP_006416418.1| hypothetical protein EUTSA_v10009574mg [Eutrema salsugineum] gi|557094189|gb|ESQ34771.1| hypothetical protein EUTSA_v10009574mg [Eutrema salsugineum] Length = 760 Score = 939 bits (2426), Expect = 0.0 Identities = 451/760 (59%), Positives = 583/760 (76%), Gaps = 4/760 (0%) Frame = +3 Query: 93 MTRQALHLLNSSHHITLNTFIST----SASLPQARQAHGHILKTGLSNETHFVTKLLSLY 260 MT+Q L L+ L + S+SL + QAH ILK+G N+ + +KL++ Y Sbjct: 1 MTKQVLPLIEKIPQSILGFLEFSPSCWSSSLTKTTQAHARILKSGAQNDGYISSKLIASY 60 Query: 261 ANHLCFDDASHVLDSILQPDIFSFSTLIHASTKHHRYNHALRIFSRMLSHGLAPDCHVVP 440 +N+ CFDDA+ +L SI P ++SFS+LI+A TK ++ +L +FSRM SHGL PD HV+P Sbjct: 61 SNYSCFDDANLILQSIPDPSVYSFSSLIYALTKAKLFSQSLGVFSRMFSHGLIPDTHVLP 120 Query: 441 SAIKACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQP 620 + K CA LSA + GK++H V GL D FVQ SL H+Y++CG++ +A KVFD M + Sbjct: 121 NLFKVCAELSAFKAGKQIHCVSCTLGLDEDAFVQGSLFHMYMRCGRMGDARKVFDRMSEK 180 Query: 621 DVVTWSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMF 800 DVVT SAL+ G+AR G +++ R+ M SG+E N VSWNG+++GFN S + EAV+MF Sbjct: 181 DVVTCSALLCGYARKGCLEDVVRILSEMEKSGIEPNIVSWNGILSGFNRSGYHEEAVIMF 240 Query: 801 KKMHSQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGKC 980 +KMH GF PDE + SSVLP+VGD E L +G QIH Y+IK GL DKCV SA+IDMYGK Sbjct: 241 QKMHHLGFFPDEVAVSSVLPSVGDSEKLDMGRQIHGYVIKQGLLKDKCVTSAMIDMYGKS 300 Query: 981 ACAKEMSQVFDEMDKIDIGACNALVAGLSRNGFAENALAVFKKIEGQGVELNVVSWTSMI 1160 + ++F++++ ++ G CNA + GLSRNG + AL +F+ + Q +ELNVVSWTS+I Sbjct: 301 GQVYGIIKLFEQVELMETGVCNACITGLSRNGLIDKALEMFELFKEQNIELNVVSWTSII 360 Query: 1161 ACCSQNGKDIEALELFRDMQIAGVMPNSVTIPCLLPACGNMAALMHGKAAHCFSLRRGFS 1340 A C+QNGKDIEALELFR+MQ+A V PN VTIP +LPACGN+AAL+HG++AH F++R Sbjct: 361 
AGCAQNGKDIEALELFREMQVARVKPNRVTIPSMLPACGNIAALVHGRSAHGFAVRVHLL 420 Query: 1341 NDVYVGSALIDMYAKCGQIRSSQCCFDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMM 1520 +DV+VGSALIDMYAKCG+I SQ FD MP RNLVCWN+++ GY+MHGK KE + IF + Sbjct: 421 DDVHVGSALIDMYAKCGRINMSQMVFDMMPTRNLVCWNSLMSGYSMHGKAKEVMSIFDSL 480 Query: 1521 QRSGQKPDLITFTSVLSACSQSGLTEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGK 1700 R+ KPD I+FTS+LSACSQ GLT+EGW +F M+ E+GIK R+EHY+CMV+LLGRAGK Sbjct: 481 VRTRLKPDFISFTSLLSACSQVGLTDEGWKYFGMMTEEYGIKPRLEHYSCMVSLLGRAGK 540 Query: 1701 LKEAYSMIQKMPFEPDACVWGALLSSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNI 1880 L+EAY +I+++PFEPD+CVWGALL+SCR+ NN+ L EIAA++LF+LEP+NPG Y+LLSNI Sbjct: 541 LQEAYDLIKEIPFEPDSCVWGALLNSCRLQNNVDLAEIAAEKLFDLEPENPGTYVLLSNI 600 Query: 1881 YASNGMWNDVDIVRGMMKSMGLRKNPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKL 2060 YA+ GMW +VD VR M+S+GL+KNPGCSWI++KNKV+ LLAGDKSHP + QI EK++++ Sbjct: 601 YAAKGMWAEVDSVRNKMESLGLKKNPGCSWIQVKNKVYTLLAGDKSHPQIEQITEKMDEI 660 Query: 2061 SLEMKKSGFFPVTNFVLQDVEEQDKEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRIC 2240 S EM+KSG P +F LQDVEEQ+KEQ L GHSEKLAVVFGLLNT G+PL+VIKNLRIC Sbjct: 661 SKEMRKSGHRPNLDFALQDVEEQEKEQILLGHSEKLAVVFGLLNTPDGTPLQVIKNLRIC 720 Query: 2241 GDCHAFIKFLSSYEGREIFVRDTNLFHHFKDGACSCGDFW 2360 GDCH+ IKF+S Y GREIFVRDTN FHHFKDG CSCGDFW Sbjct: 721 GDCHSVIKFISGYAGREIFVRDTNRFHHFKDGICSCGDFW 760 >ref|NP_173449.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|193806503|sp|Q9LNU6.2|PPR53_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At1g20230 gi|332191832|gb|AEE29953.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 760 Score = 925 bits (2390), Expect = 0.0 Identities = 444/760 (58%), Positives = 576/760 (75%), Gaps = 4/760 (0%) Frame = +3 Query: 93 MTRQALHLLNSSHHITLNTFISTS----ASLPQARQAHGHILKTGLSNETHFVTKLLSLY 260 MT+Q L L+ + S+S +SL + QAH ILK+G N+ + KL++ Y Sbjct: 1 MTKQVLPLIEKIPQSIVGFLESSSYHWSSSLSKTTQAHARILKSGAQNDGYISAKLIASY 60 Query: 261 ANHLCFDDASHVLDSILQPDIFSFSTLIHASTKHHRYNHALRIFSRMLSHGLAPDCHVVP 440 +N+ CF+DA VL SI P 
I+SFS+LI+A TK + ++ +FSRM SHGL PD HV+P Sbjct: 61 SNYNCFNDADLVLQSIPDPTIYSFSSLIYALTKAKLFTQSIGVFSRMFSHGLIPDSHVLP 120 Query: 441 SAIKACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQP 620 + K CA LSA + GK++H V VSGL D FVQ S+ H+Y++CG++ +A KVFD M Sbjct: 121 NLFKVCAELSAFKVGKQIHCVSCVSGLDMDAFVQGSMFHMYMRCGRMGDARKVFDRMSDK 180 Query: 621 DVVTWSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMF 800 DVVT SAL+ +AR G ++E R+ M SG+E N VSWNG+++GFN S + EAV+MF Sbjct: 181 DVVTCSALLCAYARKGCLEEVVRILSEMESSGIEANIVSWNGILSGFNRSGYHKEAVVMF 240 Query: 801 KKMHSQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGKC 980 +K+H GF PD+ + SSVLP+VGD E L +G IH Y+IK GL DKCV+SA+IDMYGK Sbjct: 241 QKIHHLGFCPDQVTVSSVLPSVGDSEMLNMGRLIHGYVIKQGLLKDKCVISAMIDMYGKS 300 Query: 981 ACAKEMSQVFDEMDKIDIGACNALVAGLSRNGFAENALAVFKKIEGQGVELNVVSWTSMI 1160 + +F++ + ++ G CNA + GLSRNG + AL +F+ + Q +ELNVVSWTS+I Sbjct: 301 GHVYGIISLFNQFEMMEAGVCNAYITGLSRNGLVDKALEMFELFKEQTMELNVVSWTSII 360 Query: 1161 ACCSQNGKDIEALELFRDMQIAGVMPNSVTIPCLLPACGNMAALMHGKAAHCFSLRRGFS 1340 A C+QNGKDIEALELFR+MQ+AGV PN VTIP +LPACGN+AAL HG++ H F++R Sbjct: 361 AGCAQNGKDIEALELFREMQVAGVKPNHVTIPSMLPACGNIAALGHGRSTHGFAVRVHLL 420 Query: 1341 NDVYVGSALIDMYAKCGQIRSSQCCFDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMM 1520 ++V+VGSALIDMYAKCG+I SQ F+ MP +NLVCWN+++ G++MHGK KE + IF + Sbjct: 421 DNVHVGSALIDMYAKCGRINLSQIVFNMMPTKNLVCWNSLMNGFSMHGKAKEVMSIFESL 480 Query: 1521 QRSGQKPDLITFTSVLSACSQSGLTEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGK 1700 R+ KPD I+FTS+LSAC Q GLT+EGW +F MS E+GIK R+EHY+CMVNLLGRAGK Sbjct: 481 MRTRLKPDFISFTSLLSACGQVGLTDEGWKYFKMMSEEYGIKPRLEHYSCMVNLLGRAGK 540 Query: 1701 LKEAYSMIQKMPFEPDACVWGALLSSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNI 1880 L+EAY +I++MPFEPD+CVWGALL+SCR+ NN+ L EIAA++LF LEP+NPG Y+LLSNI Sbjct: 541 LQEAYDLIKEMPFEPDSCVWGALLNSCRLQNNVDLAEIAAEKLFHLEPENPGTYVLLSNI 600 Query: 1881 YASNGMWNDVDIVRGMMKSMGLRKNPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKL 2060 YA+ GMW +VD +R M+S+GL+KNPGCSWI++KN+V+ LLAGDKSHP + QI EK++++ Sbjct: 601 
YAAKGMWTEVDSIRNKMESLGLKKNPGCSWIQVKNRVYTLLAGDKSHPQIDQITEKMDEI 660 Query: 2061 SLEMKKSGFFPVTNFVLQDVEEQDKEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRIC 2240 S EM+KSG P +F L DVEEQ++EQ L GHSEKLAVVFGLLNT G+PL+VIKNLRIC Sbjct: 661 SKEMRKSGHRPNLDFALHDVEEQEQEQMLWGHSEKLAVVFGLLNTPDGTPLQVIKNLRIC 720 Query: 2241 GDCHAFIKFLSSYEGREIFVRDTNLFHHFKDGACSCGDFW 2360 GDCHA IKF+SSY GREIF+RDTN FHHFKDG CSCGDFW Sbjct: 721 GDCHAVIKFISSYAGREIFIRDTNRFHHFKDGICSCGDFW 760 >gb|EPS74339.1| hypothetical protein M569_00411 [Genlisea aurea] Length = 1063 Score = 911 bits (2355), Expect = 0.0 Identities = 441/742 (59%), Positives = 561/742 (75%), Gaps = 2/742 (0%) Frame = +3 Query: 141 LNTFISTSASLPQARQAHGHILKTGLSNETHFVTKLLSLYANHLCFDDASHVLDSILQPD 320 L+ ASL Q RQAH +L+TGL + + +LSLYA H DA +L S+L PD Sbjct: 323 LSNLSKIGASLSQIRQAHAQLLRTGLFELSQYSNNILSLYARHQYLSDAKRLLRSLLTPD 382 Query: 321 IFSFSTLIHASTKHHRYNHALRIFSRMLSHGLAPDCHVVPSAIKACAGLSALRTGKEVHG 500 +F+ LI A +K L + S L GL PD +V+PS I+ACAGL A + GK+ HG Sbjct: 383 SAAFTVLITACSKSSDLKSTLILVSEFLRSGLTPDVYVLPSIIRACAGLFAFKIGKQAHG 442 Query: 501 VVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQPDVVTWSALVAGFARHGYVKE 680 VSG DPF+++SLVH Y+KCG++ A KVF +M + D+V+WSAL A +AR G V Sbjct: 443 FSIVSGFVLDPFIESSLVHFYLKCGELAGARKVFYSMDEKDIVSWSALSAAYARKGDVLN 502 Query: 681 ANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMFKKMHSQGFQPDETSTSSVLP 860 A +LF + G E N VSWNGMIAGFN S+ + +AVLMF++MHS GF D + SS LP Sbjct: 503 AKKLFFSVRGFGFEPNAVSWNGMIAGFNQSKHFLDAVLMFQQMHSCGFPSDGINISSALP 562 Query: 861 AVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGKCACAKEMSQVFDEMDKIDIGA 1040 AV DL L LG Q+H ++IK G DKC+VSALIDMYGK A E+ VF++M ++D+ Sbjct: 563 AVSDLGSLKLGTQVHGHVIKIGFAGDKCIVSALIDMYGKLGNASEILLVFEDMHQLDVVV 622 Query: 1041 CNALVAGLSRNGFAENALAVFKKIEGQGVELNVVSWTSMIACCSQNGKDIEALELFRDMQ 1220 CNAL++GLSR+G + +L++F+K+ G+E N+VSWTS I+CCSQ+G+D+EAL LFR+MQ Sbjct: 623 CNALISGLSRHGLVDESLSMFEKLRSSGIE-NLVSWTSAISCCSQHGRDMEALGLFREMQ 681 Query: 1221 IAGVMPNSVTIPCLLPACGNMAALMHGKAAHCFSLRRGFSNDVYVGSALIDMYAKCGQIR 1400 +GV PN+VTIP LLPACGN+AAL +GKA HCFSLR NDVYVGSALIDMYA 
CG+I+ Sbjct: 682 FSGVKPNAVTIPSLLPACGNIAALSYGKAVHCFSLRNNICNDVYVGSALIDMYANCGKIK 741 Query: 1401 SSQCCFDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACS 1580 +++C F+ MP RNLVCWNA++G Y+MHG+ KEA+ +F MQR GQKPD ++FTS+LSACS Sbjct: 742 AARCLFERMPVRNLVCWNAMLGAYSMHGEAKEAIGLFQSMQRCGQKPDSVSFTSLLSACS 801 Query: 1581 QSGLTEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVW 1760 QSGL EEG +F SM +HG++ R+EHYAC+V LLGRAGKL EAY+ I++MPFE DACVW Sbjct: 802 QSGLAEEGRRYFESMFEDHGLEPRLEHYACIVGLLGRAGKLDEAYAKIKRMPFEADACVW 861 Query: 1761 GALLSSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSM 1940 GALLSSC +HNN LGE+AA++LFELE N GNYILLSNIYAS+ W +V +R MM Sbjct: 862 GALLSSCALHNNEFLGEVAAEKLFELELGNSGNYILLSNIYASSRKWKEVRRIRDMMSLK 921 Query: 1941 GLRKNPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKLSLEMK-KSGFFPVTNFVLQD 2117 G++KNPGCSWIE+KNKVHM+LAGDK+ P +++I+E+L +L+ EMK G+FP TN+VLQD Sbjct: 922 GMKKNPGCSWIEVKNKVHMILAGDKALPQVSKIMERLKRLNQEMKGAGGYFPNTNYVLQD 981 Query: 2118 VEEQ-DKEQFLCGHSEKLAVVFGLLNTHQGSPLRVIKNLRICGDCHAFIKFLSSYEGREI 2294 VEEQ ++E LCGHSEKLAVVFG+LNT +GSP+RV KNLRICGDCHA IKF+S +EGREI Sbjct: 982 VEEQEEREGILCGHSEKLAVVFGILNTSRGSPIRVTKNLRICGDCHAVIKFISGFEGREI 1041 Query: 2295 FVRDTNLFHHFKDGACSCGDFW 2360 VRDTN +HHFKDG CSCGD+W Sbjct: 1042 SVRDTNRYHHFKDGICSCGDYW 1063 >ref|XP_007152607.1| hypothetical protein PHAVU_004G144300g [Phaseolus vulgaris] gi|561025916|gb|ESW24601.1| hypothetical protein PHAVU_004G144300g [Phaseolus vulgaris] Length = 601 Score = 862 bits (2227), Expect = 0.0 Identities = 410/601 (68%), Positives = 494/601 (82%) Frame = +3 Query: 558 LYVKCGKIREAHKVFDTMPQPDVVTWSALVAGFARHGYVKEANRLFDGMGDSGLELNKVS 737 +Y+KC +I A K+FD MP+ DVV WSA++AG++R G V EA LF M G+E N V+ Sbjct: 1 MYLKCDRIVGARKLFDRMPERDVVVWSAMIAGYSRLGLVDEARGLFGEMRSCGVEPNLVT 60 Query: 738 WNGMIAGFNHSRLYAEAVLMFKKMHSQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLI 917 WNGM+AGF ++ LY EAV MF+ M +GF PD ++ S VLP+VG LED+ +G Q+H Y+ Sbjct: 61 WNGMLAGFGNNGLYDEAVGMFRVMLLEGFWPDGSTVSCVLPSVGCLEDVVMGAQVHGYVT 120 Query: 918 
KHGLGSDKCVVSALIDMYGKCACAKEMSQVFDEMDKIDIGACNALVAGLSRNGFAENALA 1097 K GL DK VVSAL+DMYGKC KEMS+VFDE+++++IG+ NA + GLSRNG + AL Sbjct: 121 KQGLICDKFVVSALLDMYGKCGFVKEMSRVFDEVEEMEIGSLNAFLTGLSRNGMVDAALE 180 Query: 1098 VFKKIEGQGVELNVVSWTSMIACCSQNGKDIEALELFRDMQIAGVMPNSVTIPCLLPACG 1277 VF +++ Q VELNVV+WTS+IA CSQNGKD EALELFRDMQ GV PN+VTIP L+PACG Sbjct: 181 VFNRLKDQRVELNVVTWTSVIASCSQNGKDFEALELFRDMQAYGVEPNAVTIPSLIPACG 240 Query: 1278 NMAALMHGKAAHCFSLRRGFSNDVYVGSALIDMYAKCGQIRSSQCCFDGMPARNLVCWNA 1457 N++AL HGK HCFSLR+G +DVYVGSALIDMYAKCG+I+ S+ CFD M A NLV WNA Sbjct: 241 NISALTHGKEIHCFSLRKGIFDDVYVGSALIDMYAKCGRIQLSRRCFDNMLAPNLVSWNA 300 Query: 1458 IIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGLTEEGWYHFNSMSVEH 1637 +I GYAMHGK KE +E+FHMMQ+SGQKPD ITFT +LSAC+Q+GLTEEGW+++NSMS EH Sbjct: 301 VISGYAMHGKAKETMEMFHMMQQSGQKPDSITFTCILSACAQNGLTEEGWHYYNSMSKEH 360 Query: 1638 GIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGALLSSCRVHNNLRLGEIA 1817 GI+ ++EHYACMV LL R GKL+EAYS+I++MPFEPDACVWGALLSSCRVHNNL LGEIA Sbjct: 361 GIEPKMEHYACMVTLLSRVGKLEEAYSIIKEMPFEPDACVWGALLSSCRVHNNLSLGEIA 420 Query: 1818 AKELFELEPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSMGLRKNPGCSWIEIKNKVHM 1997 A++LF LEP NPGNY+LLSNIYAS G+W++ + +R MMKS GLRKNPG SWIE+ +KVHM Sbjct: 421 AEKLFPLEPANPGNYVLLSNIYASKGLWDEENRIREMMKSKGLRKNPGYSWIEVGHKVHM 480 Query: 1998 LLAGDKSHPHMAQIIEKLNKLSLEMKKSGFFPVTNFVLQDVEEQDKEQFLCGHSEKLAVV 2177 LLAGD+SHP M I+EKL+KL++EMKKSG+ P TNFVLQDVEEQDKEQ LCGHSEKLAVV Sbjct: 481 LLAGDQSHPQMKDILEKLDKLNMEMKKSGYLPKTNFVLQDVEEQDKEQILCGHSEKLAVV 540 Query: 2178 FGLLNTHQGSPLRVIKNLRICGDCHAFIKFLSSYEGREIFVRDTNLFHHFKDGACSCGDF 2357 GLLNT G PL+VIKNLRIC DCHA IK +S EGREI++RDTN FHH KDG CSCGDF Sbjct: 541 LGLLNTSPGQPLQVIKNLRICDDCHAVIKAISRLEGREIYIRDTNRFHHIKDGVCSCGDF 600 Query: 2358 W 2360 W Sbjct: 601 W 601 Score = 171 bits (433), Expect = 2e-39 Identities = 98/355 (27%), Positives = 189/355 (53%), Gaps = 1/355 (0%) Frame = +3 Query: 309 LQPDIFSFSTLIHASTKHHRYNHALRIFSRMLSHGLAPDCHVVPSAIKACAGLSALRTGK 488 ++P++ +++ ++ + Y+ A+ +F ML G PD V + + L + G Sbjct: 54 
VEPNLVTWNGMLAGFGNNGLYDEAVGMFRVMLLEGFWPDGSTVSCVLPSVGCLEDVVMGA 113 Query: 489 EVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQPDVVTWSALVAGFARHG 668 +VHG V+ GL D FV ++L+ +Y KCG ++E +VFD + + ++ + +A + G +R+G Sbjct: 114 QVHGYVTKQGLICDKFVVSALLDMYGKCGFVKEMSRVFDEVEEMEIGSLNAFLTGLSRNG 173 Query: 669 YVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMFKKMHSQGFQPDETSTS 848 V A +F+ + D +ELN V+W +IA + + EA+ +F+ M + G +P+ + Sbjct: 174 MVDAALEVFNRLKDQRVELNVVTWTSVIASCSQNGKDFEALELFRDMQAYGVEPNAVTIP 233 Query: 849 SVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGKCACAKEMSQVFDEMDKI 1028 S++PA G++ L G +IH + ++ G+ D V SALIDMY KC + + FD M Sbjct: 234 SLIPACGNISALTHGKEIHCFSLRKGIFDDVYVGSALIDMYAKCGRIQLSRRCFDNMLAP 293 Query: 1029 DIGACNALVAGLSRNGFAENALAVFKKIEGQGVELNVVSWTSMIACCSQNGKDIEALELF 1208 ++ + NA+++G + +G A+ + +F ++ G + + +++T +++ C+QNG E + Sbjct: 294 NLVSWNAVISGYAMHGKAKETMEMFHMMQQSGQKPDSITFTCILSACAQNGLTEEGWHYY 353 Query: 1209 RDM-QIAGVMPNSVTIPCLLPACGNMAALMHGKAAHCFSLRRGFSNDVYVGSALI 1370 M + G+ P C++ + L + A+ F D V AL+ Sbjct: 354 NSMSKEHGIEPKMEHYACMVTLLSRVGKL---EEAYSIIKEMPFEPDACVWGALL 405 Score = 114 bits (286), Expect = 2e-22 Identities = 76/298 (25%), Positives = 139/298 (46%), Gaps = 39/298 (13%) Frame = +3 Query: 186 QAHGHILKTGLSNETHFVTKLLSLYANHLCFDDASHVLDSILQPDIFSFSTLIHASTKHH 365 Q HG++ K GL + V+ LL +Y + S V D + + +I S + + +++ Sbjct: 114 QVHGYVTKQGLICDKFVVSALLDMYGKCGFVKEMSRVFDEVEEMEIGSLNAFLTGLSRNG 173 Query: 366 RYNHALRIFSR-----------------------------------MLSHGLAPDCHVVP 440 + AL +F+R M ++G+ P+ +P Sbjct: 174 MVDAALEVFNRLKDQRVELNVVTWTSVIASCSQNGKDFEALELFRDMQAYGVEPNAVTIP 233 Query: 441 SAIKACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQP 620 S I AC +SAL GKE+H G+ D +V ++L+ +Y KCG+I+ + + FD M P Sbjct: 234 SLIPACGNISALTHGKEIHCFSLRKGIFDDVYVGSALIDMYAKCGRIQLSRRCFDNMLAP 293 Query: 621 DVVTWSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMF 800 ++V+W+A+++G+A HG KE +F M SG + + +++ +++ + L E + Sbjct: 294 NLVSWNAVISGYAMHGKAKETMEMFHMMQQSGQKPDSITFTCILSACAQNGLTEEGWHYY 353 Query: 801 
KKMHSQ-GFQPDETSTS---SVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALI 962 M + G +P + ++L VG LE+ Y + + + D CV AL+ Sbjct: 354 NSMSKEHGIEPKMEHYACMVTLLSRVGKLEEAY------SIIKEMPFEPDACVWGALL 405 Score = 59.7 bits (143), Expect = 8e-06 Identities = 38/169 (22%), Positives = 73/169 (43%), Gaps = 2/169 (1%) Frame = +3 Query: 165 ASLPQARQAHGHILKTGLSNETHFVTKLLSLYANHLCFDDASHVLDSILQPDIFSFSTLI 344 ++L ++ H L+ G+ ++ + + L+ +YA + D++L P++ S++ +I Sbjct: 243 SALTHGKEIHCFSLRKGIFDDVYVGSALIDMYAKCGRIQLSRRCFDNMLAPNLVSWNAVI 302 Query: 345 HASTKHHRYNHALRIFSRMLSHGLAPDCHVVPSAIKACAGLSALRTGKEVHGVVSVS-GL 521 H + + +F M G PD + ACA G + +S G+ Sbjct: 303 SGYAMHGKAKETMEMFHMMQQSGQKPDSITFTCILSACAQNGLTEEGWHYYNSMSKEHGI 362 Query: 522 ASDPFVQTSLVHLYVKCGKIREAHKVFDTMP-QPDVVTWSALVAGFARH 665 +V L + GK+ EA+ + MP +PD W AL++ H Sbjct: 363 EPKMEHYACMVTLLSRVGKLEEAYSIIKEMPFEPDACVWGALLSSCRVH 411 >ref|XP_006587447.1| PREDICTED: pentatricopeptide repeat-containing protein At1g20230-like [Glycine max] Length = 601 Score = 855 bits (2208), Expect = 0.0 Identities = 404/601 (67%), Positives = 490/601 (81%) Frame = +3 Query: 558 LYVKCGKIREAHKVFDTMPQPDVVTWSALVAGFARHGYVKEANRLFDGMGDSGLELNKVS 737 +Y+KC +IR+A K+FD MP+ DVV WSA+VAG++R G V EA F M G+ N VS Sbjct: 1 MYLKCDRIRDARKLFDMMPERDVVVWSAMVAGYSRLGLVDEAKEFFGEMRSGGMAPNLVS 60 Query: 738 WNGMIAGFNHSRLYAEAVLMFKKMHSQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLI 917 WNGM+AGF ++ LY A+ MF+ M GF PD ++ S VLP+VG LED +G Q+H Y+I Sbjct: 61 WNGMLAGFGNNGLYDVALGMFRMMLVDGFWPDGSTVSCVLPSVGCLEDAVVGAQVHGYVI 120 Query: 918 KHGLGSDKCVVSALIDMYGKCACAKEMSQVFDEMDKIDIGACNALVAGLSRNGFAENALA 1097 K GLG DK VVSA++DMYGKC C KEMS+VFDE+++++IG+ NA + GLSRNG + AL Sbjct: 121 KQGLGCDKFVVSAMLDMYGKCGCVKEMSRVFDEVEEMEIGSLNAFLTGLSRNGMVDAALE 180 Query: 1098 VFKKIEGQGVELNVVSWTSMIACCSQNGKDIEALELFRDMQIAGVMPNSVTIPCLLPACG 1277 VF K + + +ELNVV+WTS+IA CSQNGKD+EALELFRDMQ GV PN+VTIP L+PACG Sbjct: 181 VFNKFKDRKMELNVVTWTSIIASCSQNGKDLEALELFRDMQADGVEPNAVTIPSLIPACG 240 Query: 1278 NMAALMHGKAAHCFSLRRGFSNDVYVGSALIDMYAKCGQIRSSQCCFDGMPARNLVCWNA 1457 N++ALMHGK HCFSLRRG 
+DVYVGSALIDMYAKCG+I+ S+CCFD M A NLV WNA Sbjct: 241 NISALMHGKEIHCFSLRRGIFDDVYVGSALIDMYAKCGRIQLSRCCFDKMSAPNLVSWNA 300 Query: 1458 IIGGYAMHGKVKEALEIFHMMQRSGQKPDLITFTSVLSACSQSGLTEEGWYHFNSMSVEH 1637 ++ GYAMHGK KE +E+FHMM +SGQKP+L+TFT VLSAC+Q+GLTEEGW ++NSMS EH Sbjct: 301 VMSGYAMHGKAKETMEMFHMMLQSGQKPNLVTFTCVLSACAQNGLTEEGWRYYNSMSEEH 360 Query: 1638 GIKARVEHYACMVNLLGRAGKLKEAYSMIQKMPFEPDACVWGALLSSCRVHNNLRLGEIA 1817 G + ++EHYACMV LL R GKL+EAYS+I++MPFEPDACV GALLSSCRVHNNL LGEI Sbjct: 361 GFEPKMEHYACMVTLLSRVGKLEEAYSIIKEMPFEPDACVRGALLSSCRVHNNLSLGEIT 420 Query: 1818 AKELFELEPKNPGNYILLSNIYASNGMWNDVDIVRGMMKSMGLRKNPGCSWIEIKNKVHM 1997 A++LF LEP NPGNYI+LSNIYAS G+W++ + +R +MKS GLRKNPG SWIE+ +K+HM Sbjct: 421 AEKLFLLEPTNPGNYIILSNIYASKGLWDEENRIREVMKSKGLRKNPGYSWIEVGHKIHM 480 Query: 1998 LLAGDKSHPHMAQIIEKLNKLSLEMKKSGFFPVTNFVLQDVEEQDKEQFLCGHSEKLAVV 2177 LLAGD+SHP M I+EKL+KL++EMKKSG+ P +NFV QDVEE DKEQ LCGHSEKLAVV Sbjct: 481 LLAGDQSHPQMKDILEKLDKLNMEMKKSGYLPKSNFVWQDVEEHDKEQILCGHSEKLAVV 540 Query: 2178 FGLLNTHQGSPLRVIKNLRICGDCHAFIKFLSSYEGREIFVRDTNLFHHFKDGACSCGDF 2357 GLLNT G PL+VIKNLRIC DCHA IK +S EGREI+VRDTN HHFKDG CSCGDF Sbjct: 541 LGLLNTSPGQPLQVIKNLRICDDCHAVIKVISRLEGREIYVRDTNRLHHFKDGVCSCGDF 600 Query: 2358 W 2360 W Sbjct: 601 W 601 Score = 171 bits (432), Expect = 2e-39 Identities = 101/355 (28%), Positives = 184/355 (51%), Gaps = 1/355 (0%) Frame = +3 Query: 309 LQPDIFSFSTLIHASTKHHRYNHALRIFSRMLSHGLAPDCHVVPSAIKACAGLSALRTGK 488 + P++ S++ ++ + Y+ AL +F ML G PD V + + L G Sbjct: 54 MAPNLVSWNGMLAGFGNNGLYDVALGMFRMMLVDGFWPDGSTVSCVLPSVGCLEDAVVGA 113 Query: 489 EVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQPDVVTWSALVAGFARHG 668 +VHG V GL D FV ++++ +Y KCG ++E +VFD + + ++ + +A + G +R+G Sbjct: 114 QVHGYVIKQGLGCDKFVVSAMLDMYGKCGCVKEMSRVFDEVEEMEIGSLNAFLTGLSRNG 173 Query: 669 YVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMFKKMHSQGFQPDETSTS 848 V A +F+ D +ELN V+W +IA + + EA+ +F+ M + G +P+ + Sbjct: 174 MVDAALEVFNKFKDRKMELNVVTWTSIIASCSQNGKDLEALELFRDMQADGVEPNAVTIP 233 Query: 849 
SVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGKCACAKEMSQVFDEMDKI 1028 S++PA G++ L G +IH + ++ G+ D V SALIDMY KC + FD+M Sbjct: 234 SLIPACGNISALMHGKEIHCFSLRRGIFDDVYVGSALIDMYAKCGRIQLSRCCFDKMSAP 293 Query: 1029 DIGACNALVAGLSRNGFAENALAVFKKIEGQGVELNVVSWTSMIACCSQNGKDIEALELF 1208 ++ + NA+++G + +G A+ + +F + G + N+V++T +++ C+QNG E + Sbjct: 294 NLVSWNAVMSGYAMHGKAKETMEMFHMMLQSGQKPNLVTFTCVLSACAQNGLTEEGWRYY 353 Query: 1209 RDM-QIAGVMPNSVTIPCLLPACGNMAALMHGKAAHCFSLRRGFSNDVYVGSALI 1370 M + G P C++ + L + A+ F D V AL+ Sbjct: 354 NSMSEEHGFEPKMEHYACMVTLLSRVGKL---EEAYSIIKEMPFEPDACVRGALL 405 Score = 118 bits (296), Expect = 1e-23 Identities = 78/298 (26%), Positives = 140/298 (46%), Gaps = 39/298 (13%) Frame = +3 Query: 186 QAHGHILKTGLSNETHFVTKLLSLYANHLCFDDASHVLDSILQPDIFSFSTLIHASTKHH 365 Q HG+++K GL + V+ +L +Y C + S V D + + +I S + + +++ Sbjct: 114 QVHGYVIKQGLGCDKFVVSAMLDMYGKCGCVKEMSRVFDEVEEMEIGSLNAFLTGLSRNG 173 Query: 366 RYNHALRIFSR-----------------------------------MLSHGLAPDCHVVP 440 + AL +F++ M + G+ P+ +P Sbjct: 174 MVDAALEVFNKFKDRKMELNVVTWTSIIASCSQNGKDLEALELFRDMQADGVEPNAVTIP 233 Query: 441 SAIKACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQP 620 S I AC +SAL GKE+H G+ D +V ++L+ +Y KCG+I+ + FD M P Sbjct: 234 SLIPACGNISALMHGKEIHCFSLRRGIFDDVYVGSALIDMYAKCGRIQLSRCCFDKMSAP 293 Query: 621 DVVTWSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMF 800 ++V+W+A+++G+A HG KE +F M SG + N V++ +++ + L E + Sbjct: 294 NLVSWNAVMSGYAMHGKAKETMEMFHMMLQSGQKPNLVTFTCVLSACAQNGLTEEGWRYY 353 Query: 801 KKMHSQ-GFQPDETSTS---SVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALI 962 M + GF+P + ++L VG LE+ Y + + + D CV AL+ Sbjct: 354 NSMSEEHGFEPKMEHYACMVTLLSRVGKLEEAY------SIIKEMPFEPDACVRGALL 405 >gb|AAF79892.1|AC022472_1 Contains similarity to an unknown protein F28A21.160 gi|7486269 from Arabidopsis thaliana BAC F28A21 gi|T04867 and contains multiple PPR PF|01535 repeats. EST gb|AI999742 comes from this gene. 
This gene may be cut off, partial [Arabidopsis thaliana] Length = 757 Score = 831 bits (2146), Expect(2) = 0.0 Identities = 403/713 (56%), Positives = 533/713 (74%), Gaps = 4/713 (0%) Frame = +3 Query: 93 MTRQALHLLNSSHHITLNTFISTS----ASLPQARQAHGHILKTGLSNETHFVTKLLSLY 260 MT+Q L L+ + S+S +SL + QAH ILK+G N+ + KL++ Y Sbjct: 1 MTKQVLPLIEKIPQSIVGFLESSSYHWSSSLSKTTQAHARILKSGAQNDGYISAKLIASY 60 Query: 261 ANHLCFDDASHVLDSILQPDIFSFSTLIHASTKHHRYNHALRIFSRMLSHGLAPDCHVVP 440 +N+ CF+DA VL SI P I+SFS+LI+A TK + ++ +FSRM SHGL PD HV+P Sbjct: 61 SNYNCFNDADLVLQSIPDPTIYSFSSLIYALTKAKLFTQSIGVFSRMFSHGLIPDSHVLP 120 Query: 441 SAIKACAGLSALRTGKEVHGVVSVSGLASDPFVQTSLVHLYVKCGKIREAHKVFDTMPQP 620 + K CA LSA + GK++H V VSGL D FVQ S+ H+Y++CG++ +A KVFD M Sbjct: 121 NLFKVCAELSAFKVGKQIHCVSCVSGLDMDAFVQGSMFHMYMRCGRMGDARKVFDRMSDK 180 Query: 621 DVVTWSALVAGFARHGYVKEANRLFDGMGDSGLELNKVSWNGMIAGFNHSRLYAEAVLMF 800 DVVT SAL+ +AR G ++E R+ M SG+E N VSWNG+++GFN S + EAV+MF Sbjct: 181 DVVTCSALLCAYARKGCLEEVVRILSEMESSGIEANIVSWNGILSGFNRSGYHKEAVVMF 240 Query: 801 KKMHSQGFQPDETSTSSVLPAVGDLEDLYLGIQIHAYLIKHGLGSDKCVVSALIDMYGKC 980 +K+H GF PD+ + SSVLP+VGD E L +G IH Y+IK GL DKCV+SA+IDMYGK Sbjct: 241 QKIHHLGFCPDQVTVSSVLPSVGDSEMLNMGRLIHGYVIKQGLLKDKCVISAMIDMYGKS 300 Query: 981 ACAKEMSQVFDEMDKIDIGACNALVAGLSRNGFAENALAVFKKIEGQGVELNVVSWTSMI 1160 + +F++ + ++ G CNA + GLSRNG + AL +F+ + Q +ELNVVSWTS+I Sbjct: 301 GHVYGIISLFNQFEMMEAGVCNAYITGLSRNGLVDKALEMFELFKEQTMELNVVSWTSII 360 Query: 1161 ACCSQNGKDIEALELFRDMQIAGVMPNSVTIPCLLPACGNMAALMHGKAAHCFSLRRGFS 1340 A C+QNGKDIEALELFR+MQ+AGV PN VTIP +LPACGN+AAL HG++ H F++R Sbjct: 361 AGCAQNGKDIEALELFREMQVAGVKPNHVTIPSMLPACGNIAALGHGRSTHGFAVRVHLL 420 Query: 1341 NDVYVGSALIDMYAKCGQIRSSQCCFDGMPARNLVCWNAIIGGYAMHGKVKEALEIFHMM 1520 ++V+VGSALIDMYAKCG+I SQ F+ MP +NLVCWN+++ G++MHGK KE + IF + Sbjct: 421 DNVHVGSALIDMYAKCGRINLSQIVFNMMPTKNLVCWNSLMNGFSMHGKAKEVMSIFESL 480 Query: 1521 QRSGQKPDLITFTSVLSACSQSGLTEEGWYHFNSMSVEHGIKARVEHYACMVNLLGRAGK 1700 R+ KPD I+FTS+LSAC Q GLT+EGW +F MS E+GIK R+EHY+CMVNLLGRAGK Sbjct: 481 
MRTRLKPDFISFTSLLSACGQVGLTDEGWKYFKMMSEEYGIKPRLEHYSCMVNLLGRAGK 540 Query: 1701 LKEAYSMIQKMPFEPDACVWGALLSSCRVHNNLRLGEIAAKELFELEPKNPGNYILLSNI 1880 L+EAY +I++MPFEPD+CVWGALL+SCR+ NN+ L EIAA++LF LEP+NPG Y+LLSNI Sbjct: 541 LQEAYDLIKEMPFEPDSCVWGALLNSCRLQNNVDLAEIAAEKLFHLEPENPGTYVLLSNI 600 Query: 1881 YASNGMWNDVDIVRGMMKSMGLRKNPGCSWIEIKNKVHMLLAGDKSHPHMAQIIEKLNKL 2060 YA+ GMW +VD +R M+S+GL+KNPGCSWI++KN+V+ LLAGDKSHP + QI EK++++ Sbjct: 601 YAAKGMWTEVDSIRNKMESLGLKKNPGCSWIQVKNRVYTLLAGDKSHPQIDQITEKMDEI 660 Query: 2061 SLEMKKSGFFPVTNFVLQDVEEQDKEQFLCGHSEKLAVVFGLLNTHQGSPLRV 2219 S EM+KSG P +F L DVEEQ++EQ L GHSEKLAVVFGLLNT G+PL+V Sbjct: 661 SKEMRKSGHRPNLDFALHDVEEQEQEQMLWGHSEKLAVVFGLLNTPDGTPLQV 713 Score = 29.3 bits (64), Expect(2) = 0.0 Identities = 15/28 (53%), Positives = 18/28 (64%) Frame = +1 Query: 2287 EKFLSETQISFTILKTELVLVGIFGELR 2370 E+F E QI F ILKTE V V I G+ + Sbjct: 714 ERFSLEIQIGFIILKTEFVPVEISGDTK 741