BLASTX nr result
ID: Catharanthus23_contig00014786
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00014786
         (2009 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_006429524.1| hypothetical protein CICLE_v10013613mg [Citr...   494   e-137
ref|XP_004246310.1| PREDICTED: pentatricopeptide repeat-containi...   474   e-131
ref|XP_002265961.1| PREDICTED: pentatricopeptide repeat-containi...   445   e-122
ref|XP_004305097.1| PREDICTED: pentatricopeptide repeat-containi...   441   e-121
ref|XP_002309173.2| pentatricopeptide repeat-containing family p...   437   e-119
gb|EOY07026.1| Pentatricopeptide repeat superfamily protein, put...   436   e-119
gb|ESW12162.1| hypothetical protein PHAVU_008G089500g [Phaseolus...   429   e-117
gb|AHB18409.1| pentatricopeptide repeat-containing protein [Goss...   421   e-115
ref|XP_003533674.1| PREDICTED: pentatricopeptide repeat-containi...   420   e-114
gb|EXB42398.1| hypothetical protein L484_021993 [Morus notabilis]     419   e-114
ref|XP_003623530.1| Pentatricopeptide repeat-containing protein ...   418   e-114
gb|ESW10362.1| hypothetical protein PHAVU_009G202600g [Phaseolus...   416   e-113
ref|XP_004137893.1| PREDICTED: pentatricopeptide repeat-containi...   399   e-108
ref|NP_181376.3| pentatricopeptide repeat-containing protein [Ar...   361   7e-97
ref|XP_002879744.1| pentatricopeptide repeat-containing protein ...   360   1e-96
gb|AAM98219.1| unknown protein [Arabidopsis thaliana] gi|3137637...   358   4e-96
ref|XP_006411054.1| hypothetical protein EUTSA_v10017948mg [Eutr...   357   8e-96
ref|XP_006294146.1| hypothetical protein CARUB_v10023139mg [Caps...
348 4e-93 ref|XP_002533822.1| pentatricopeptide repeat-containing protein,... 322 3e-85 emb|CAN63706.1| hypothetical protein VITISV_013107 [Vitis vinifera] 317 9e-84 >ref|XP_006429524.1| hypothetical protein CICLE_v10013613mg [Citrus clementina] gi|557531581|gb|ESR42764.1| hypothetical protein CICLE_v10013613mg [Citrus clementina] Length = 506 Score = 494 bits (1273), Expect = e-137 Identities = 251/468 (53%), Positives = 341/468 (72%), Gaps = 11/468 (2%) Frame = +2 Query: 209 NCYLRKRRKWPLSPYKTQWHLTFAHQQAMQNLKQSVKSSKT----------HLLNSLIDS 358 N +LRK RKWPLSPYK +WH T QQA QN+KQS+ + T H+L+SL+ S Sbjct: 11 NLHLRKHRKWPLSPYKAKWHQTLDQQQAKQNVKQSLTTPPTKQQQQIPKQPHILSSLLHS 70 Query: 359 FAMYECEPFPKAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFETPEFILIDLIKFYGDS 538 F++Y CEP P+AYHFVIK LA N S++ I VLDHI+K E FETPEFI IDLIK Y D+ Sbjct: 71 FSIYNCEPPPEAYHFVIKTLAEN-SQFCDISSVLDHIEKRENFETPEFIFIDLIKTYADA 129 Query: 539 NRIQGAIELFYRIPNFRCNPSVHSLKALLLVLCE-KGVLEIIPQVLIKAQLMNIRIEESC 715 +R Q ++ LFY+IP FRC PSV+SL ALL VLC K ++++PQ+L+K+QLMNIRIEES Sbjct: 130 HRFQDSVNLFYKIPKFRCVPSVYSLNALLSVLCRNKEWVKMVPQILLKSQLMNIRIEESS 189 Query: 716 FGILIRALCRIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGAEIMCFLEE 895 F ILI LCRI +V +A E+LN M NDGF +DG+ S+IL ++ EQ++ S E++ F++E Sbjct: 190 FRILISTLCRINRVGFAIEILNCMINDGFCVDGKTCSWILSSVCEQRDLSSDELLGFVQE 249 Query: 896 MCKLGFSPNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILDRLVLEED 1075 M KLGF D+ +VI+ L KKE DA+ L +MK D I PDIVCY ++L+ ++++ED Sbjct: 250 MKKLGFCFGMVDYTNVIRSLVKKEKVFDALGILNQMKSDGIKPDIVCYTMVLNGVIVQED 309 Query: 1076 YVNADKVFDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRPDLVTYST 1255 YV A+++FDELLVLGLVP+V+TYN +INGLCKQN VE GIKM+ CMEELG +PD++TY+T Sbjct: 310 YVKAEELFDELLVLGLVPDVYTYNVYINGLCKQNNVEAGIKMIACMEELGSKPDVITYNT 369 Query: 1256 ILPSLVADGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDLLQEMINKF 1435 +L +L +LN REL+ +M+ KG+ LN TY ++I GL + + +A LL+E +NK Sbjct: 370 LLQALCKVRELNRLRELVKEMKWKGIVLNLQTYSIMIDGLASKGDIIEACGLLEEALNKG 429 Query: 1436 SVSDSVLLFDKIICGLCQNGMLNQAFEVLRKMSEKNVGPGVESWEALL 1579 + S 
+FD+ ICGLCQ G++ +A E+L++M++K+V PG WEALL Sbjct: 430 LCTQS-SMFDETICGLCQRGLVRKALELLKQMADKDVSPGARVWEALL 476 Score = 73.9 bits (180), Expect = 2e-10 Identities = 79/349 (22%), Positives = 148/349 (42%), Gaps = 7/349 (2%) Frame = +2 Query: 368 YECEPFPKAYHFVIKVLANNPSRWDQIPQVL--DHIQKVETFETPEFILIDLIKFYGDSN 541 + C P + + ++ VL N +PQ+L + + E+ ILI + N Sbjct: 145 FRCVPSVYSLNALLSVLCRNKEWVKMVPQILLKSQLMNIRIEESSFRILISTLCRI---N 201 Query: 542 RIQGAIELFYRIPNFRCNPSVHSLKALLLVLCEKGVLEIIPQVLIKAQLMNIRIEESCFG 721 R+ AIE+ + N + +L +CE+ L + ++ + CFG Sbjct: 202 RVGFAIEILNCMINDGFCVDGKTCSWILSSVCEQRDLSSDELLGFVQEMKKLGF---CFG 258 Query: 722 IL-----IRALCRIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGAEIMCF 886 ++ IR+L + KV A +LN M +DG D ++ +L + Q++ AE + Sbjct: 259 MVDYTNVIRSLVKKEKVFDALGILNQMKSDGIKPDIVCYTMVLNGVIVQEDYVKAEEL-- 316 Query: 887 LEEMCKLGFSPNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILDRLVL 1066 +E+ LG P+ + I L K+ N IK + M+ PD++ YN +L L Sbjct: 317 FDELLVLGLVPDVYTYNVYINGLCKQNNVEAGIKMIACMEELGSKPDVITYNTLLQALCK 376 Query: 1067 EEDYVNADKVFDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRPDLVT 1246 + ++ E+ G+V N+ TY+ I+GL + + E +L G Sbjct: 377 VRELNRLRELVKEMKWKGIVLNLQTYSIMIDGLASKGDIIEACGLLEEALNKGLCTQSSM 436 Query: 1247 YSTILPSLVADGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEF 1393 + + L G + A EL+ +M K + + +E ++ ++ +F Sbjct: 437 FDETICGLCQRGLVRKALELLKQMADKDVSPGARVWEALLLSSVSKLDF 485 >ref|XP_004246310.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420, mitochondrial-like [Solanum lycopersicum] Length = 496 Score = 474 bits (1220), Expect = e-131 Identities = 244/482 (50%), Positives = 332/482 (68%), Gaps = 7/482 (1%) Frame = +2 Query: 161 LKSSLHLSTPSWSP---VHNCYLRKRRKWPLSPYKTQWHLT-FAHQQAMQNLKQSV--KS 322 L LH S+ S+S ++N +LRKRRKWPLS YKT+W HQ +MQ L +S +S Sbjct: 10 LHKKLHSSSHSYSARSSMNNYFLRKRRKWPLSLYKTKWQEEKLTHQLSMQKLVESTPNRS 69 Query: 323 SKTHLLNSLIDSFAMYECEPFPKAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFETPEF 502 KTHLL+ L+DSF+ YEC+P P AY+F++K L NPS WD+IP +LD+I+K E FETPE+ Sbjct: 70 
PKTHLLSILLDSFSAYECDPTPNAYYFILKTLTQNPSTWDEIPLILDYIRKFENFETPEY 129 Query: 503 ILIDLIKFYGDSNRIQGAIELFYRIPNFRCNPSVHSLKALLLVLCEKGV-LEIIPQVLIK 679 I LIKFYGDSN A E+F+ +P +RCNPSV SL L+ VLC+ L I+ QVL+K Sbjct: 130 IFTYLIKFYGDSNMTHLAYEMFFTMPAYRCNPSVKSLNCLIWVLCKNNYDLRIVLQVLVK 189 Query: 680 AQLMNIRIEESCFGILIRALCRIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKE 859 +QL+NI +EES F ILIRALCRIGK N A +LL M + GFNLD I S IL M + K+ Sbjct: 190 SQLLNIWVEESTFKILIRALCRIGKTNNAVDLLKLMVDSGFNLDANICSLILSTMPDVKD 249 Query: 860 CSGAEIMCFLEEMCKLGFSPNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCY 1039 C G EI LEEM KLG+SP R D +VI+F DA++ L +MK+ + PD+VCY Sbjct: 250 CVGVEIWGVLEEMRKLGYSPKRVDLCNVIRFYVNNGKGIDALEVLNKMKMCGMVPDVVCY 309 Query: 1040 NLILDRLVLEEDYVNADKVFDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEE 1219 NL+L+ L+ E +Y NAD++FDELLVLGL P++ TYN +INGLCKQ+K+ E +++L CME+ Sbjct: 310 NLVLNGLIFEGEYSNADELFDELLVLGLNPDIVTYNVYINGLCKQDKMVEALRVLGCMED 369 Query: 1220 LGCRPDLVTYSTILPSLVADGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQ 1399 LGC+P++ TY TIL L G L+ +E++ +M+ KG+QL+S Y ++I ++ N E ++ Sbjct: 370 LGCKPEMNTYHTILDGLCRCGMLSSVKEVLGQMKSKGLQLSSHIYGVIINCMIRNGEVDE 429 Query: 1400 AFDLLQEMINKFSVSDSVLLFDKIICGLCQNGMLNQAFEVLRKMSEKNVGPGVESWEALL 1579 A++LL EM++ V S+ FD +I LC G + E+L MS KN+ PG+ SWEA + Sbjct: 430 AYNLLHEMVDMGFVPQSI-TFDGLIGLLCNKGSFYEVMELLSIMSTKNLVPGIRSWEAFV 488 Query: 1580 QV 1585 QV Sbjct: 489 QV 490 >ref|XP_002265961.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420, mitochondrial-like [Vitis vinifera] Length = 505 Score = 445 bits (1145), Expect = e-122 Identities = 230/476 (48%), Positives = 321/476 (67%), Gaps = 9/476 (1%) Frame = +2 Query: 179 LSTPSWSPVHNCYLRKRRKWPLSPYKTQWHLTFAHQQAMQNLKQSVKS--------SKTH 334 L PS+S + +LRKRRKWPLSPYK WH TF H+QAMQ LK ++ + S + Sbjct: 2 LRPPSFSKTN--FLRKRRKWPLSPYKATWHETFHHRQAMQTLKNTIANQSPSPQSPSNSQ 59 Query: 335 LLNSLIDSFAMYECEPFPKAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFETPEFILID 514 L+ LIDSF +Y +P P AY FVI L ++ +P +L ++KVE FETPEFI + Sbjct: 60 
FLSILIDSFRIYNSDPTPNAYRFVISTLTRC-RQFHHLPPLLHRLEKVEKFETPEFIFTN 118 Query: 515 LIKFYGDSNRIQGAIELFYRIPNFRCNPSVHSLKALLLVLCEKGV-LEIIPQVLIKAQLM 691 LIK YG++N + A++LF+RIPNFRC PSV+SL ALL VLC++ L ++PQ+L+K+Q M Sbjct: 119 LIKVYGNANMFEDAVDLFFRIPNFRCVPSVYSLNALLYVLCKRREGLVMVPQILLKSQAM 178 Query: 692 NIRIEESCFGILIRALCRIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGA 871 NIR+EES F IL+ ALCRI K NYA +LN+M NDG+ +D ++ S IL ++ EQK SG Sbjct: 179 NIRLEESSFRILVAALCRIKKHNYAIRILNYMLNDGYAVDAKMCSIILSSLCEQKGLSGD 238 Query: 872 EIMCFLEEMCKLGFSPNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLIL 1051 E++ F+EEM KLGF P R D +VI+FL K+ DA+ +MK D I PD V Y +IL Sbjct: 239 EVLRFMEEMRKLGFYPGRVDCNNVIRFLVKEGMVMDALGVFDQMKTDGIKPDTVSYTMIL 298 Query: 1052 DRLVLEEDYVNADKVFDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCR 1231 + + + DY AD +FDE+LVLG+VP++ YN +IN LCKQN +EEG++ML M ELGC+ Sbjct: 299 NGVTADGDYEKADDLFDEMLVLGVVPDIHAYNVYINSLCKQNNIEEGVRMLASMRELGCK 358 Query: 1232 PDLVTYSTILPSLVADGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDL 1411 PD VTY+ +L + L REL +ME++G+Q N TY +++ GL+ E +++ L Sbjct: 359 PDYVTYNMLLEGMSKVRDLGGMRELAREMELEGVQWNWETYRIMLDGLVGKGEIDESCSL 418 Query: 1412 LQEMINKFSVSDSVLLFDKIICGLCQNGMLNQAFEVLRKMSEKNVGPGVESWEALL 1579 L+EM++K+ S FD+IIC LCQ G++ +A +++ KM K + PG +WEALL Sbjct: 419 LEEMLDKY-FSCWCSTFDEIICELCQRGLVCKALQLVNKMVRKTIAPGARAWEALL 473 >ref|XP_004305097.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420, mitochondrial-like [Fragaria vesca subsp. 
vesca] Length = 491 Score = 441 bits (1134), Expect = e-121 Identities = 221/466 (47%), Positives = 322/466 (69%), Gaps = 1/466 (0%) Frame = +2 Query: 185 TPSWSPVHNCYLRKRRKWPLSPYKTQWHLTFAHQQAMQNLKQSVKSSKTHLLNSLIDSFA 364 T S +P ++RK RKWP+SPY T+WH F QA+Q LK S + LL++LI SF Sbjct: 4 TSSLTPRSKFFVRKHRKWPVSPYNTKWHKLFNQHQALQTLKHSPLNPPQTLLSTLIHSFN 63 Query: 365 MYECEPFPKAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFETPEFILIDLIKFYGDSNR 544 + C+P P+AY+FV+K L S+ IP VLD ++ +E F PE I +LI+FYG +NR Sbjct: 64 TFNCDPTPEAYNFVLKTLFKT-SQLSHIPSVLDRLESIEKFHPPESIFANLIRFYGSANR 122 Query: 545 IQGAIELFYRIPNFRCNPSVHSLKALLLVLCEKGV-LEIIPQVLIKAQLMNIRIEESCFG 721 ++ AI++F RIP FRC+PS SL +LL VLC L+++PQVL+ ++ M IR+EES F Sbjct: 123 VEDAIDVFCRIPKFRCDPSAVSLNSLLYVLCGSSEGLKMVPQVLMNSRAMGIRLEESSFR 182 Query: 722 ILIRALCRIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGAEIMCFLEEMC 901 ILI ALCRIG V YA E++ M ++G++LD +I S +L ++ EQK G E++ F+EEM Sbjct: 183 ILISALCRIGSVGYAIEIMKCMISNGYDLDVKICSLVLSSLCEQKGVGGLEVVGFVEEMK 242 Query: 902 KLGFSPNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILDRLVLEEDYV 1081 K+GF P D+ +VI+ L K+ DA++ L +MK++ + PDIVCY ++L ++ DY Sbjct: 243 KVGFCPGMLDYSNVIRCLVKQGKGLDALRVLCKMKVEGMKPDIVCYTMVLYGVIANGDYK 302 Query: 1082 NADKVFDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRPDLVTYSTIL 1261 NADKVFDELLVLGLVP+V+TYN +INGLC QN VE GIKM+ CM+ELGCRP+L+TY+ +L Sbjct: 303 NADKVFDELLVLGLVPDVYTYNVYINGLCNQNNVEAGIKMITCMDELGCRPNLITYNLLL 362 Query: 1262 PSLVADGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDLLQEMINKFSV 1441 +L + +L+ AREL+S+M + G+ +N T+ +++ GL + ++A ++EM++KF + Sbjct: 363 KALCKNEELSRARELVSEMTLNGVGVNLQTHIIMLDGLFCKGDVDEACIFMEEMLDKF-M 421 Query: 1442 SDSVLLFDKIICGLCQNGMLNQAFEVLRKMSEKNVGPGVESWEALL 1579 +D +I GLCQ G++ +A ++L KM +KNV PG +WEALL Sbjct: 422 CRRCSAYDDVIYGLCQRGLVCKAMDLLLKMVDKNVVPGARAWEALL 467 >ref|XP_002309173.2| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|550335936|gb|EEE92696.2| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 490 Score = 437 bits (1123), Expect = 
e-119 Identities = 221/447 (49%), Positives = 310/447 (69%), Gaps = 8/447 (1%) Frame = +2 Query: 215 YLRKRRKWPLSPYKTQWHLTFAHQQAMQNLKQSV-------KSSKTHLLNSLIDSFAMYE 373 +LRK RKWP SPYK +WH F QQAMQ+LKQS +K HLL+SLI SF++Y+ Sbjct: 13 FLRKHRKWPYSPYKARWHRIFNQQQAMQSLKQSALKPPQQESPNKPHLLSSLIHSFSIYD 72 Query: 374 CEPFPKAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFETPEFILIDLIKFYGDSNRIQG 553 EP PKA+ F+ K L S++ IP VLDH++KVE+FE PE LI+ YG +N+ Sbjct: 73 VEPAPKAFDFIFKTLVKT-SQFHHIPSVLDHLEKVESFEPPESTFAYLIEVYGRTNKTHE 131 Query: 554 AIELFYRIPNFRCNPSVHSLKALLLVLCEKGV-LEIIPQVLIKAQLMNIRIEESCFGILI 730 AIELFYRIP FRC PSV+SL L+ VLC L+++P++L+K+Q+MNIR+EES F +LI Sbjct: 132 AIELFYRIPKFRCVPSVYSLNTLISVLCRNSKGLKLVPEILLKSQVMNIRVEESTFQVLI 191 Query: 731 RALCRIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGAEIMCFLEEMCKLG 910 ALCRI KV +A E+LN M NDGF ++ I+S +L + EQK+ + E++ FLE++ KLG Sbjct: 192 TALCRIRKVGFAIEMLNCMVNDGFIVNAEIYSLLLSCLCEQKDATKFEVIGFLEQLRKLG 251 Query: 911 FSPNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILDRLVLEEDYVNAD 1090 F P D+ +VI+FL K + DA+ L MK D I PDI CY ++L ++ ++DY+ AD Sbjct: 252 FFPGMVDYSNVIRFLVKGKRGLDALHVLNHMKSDRIKPDIFCYTMVLHGVIEDKDYLKAD 311 Query: 1091 KVFDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRPDLVTYSTILPSL 1270 ++FDELLV GLVP+ +TYN +INGLCKQN V+ GIKM+ MEELGC+P+L+TY+ ++ L Sbjct: 312 ELFDELLVFGLVPDAYTYNVYINGLCKQNNVQAGIKMVASMEELGCKPNLITYNMLVKQL 371 Query: 1271 VADGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDLLQEMINKFSVSDS 1450 G+L+ A EL+ +M +KG+ LN TY ++I GL +N + +A L +E ++K + S Sbjct: 372 CKVGELSKAGELVREMGLKGIGLNMQTYRIMIDGLASNGKIVEACGLFEEALDKGLCTQS 431 Query: 1451 VLLFDKIICGLCQNGMLNQAFEVLRKM 1531 L+FD+IICGLC + +A ++L KM Sbjct: 432 -LMFDEIICGLCHRDLSCKALKLLEKM 457 >gb|EOY07026.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma cacao] Length = 542 Score = 436 bits (1120), Expect = e-119 Identities = 222/484 (45%), Positives = 330/484 (68%), Gaps = 11/484 (2%) Frame = +2 Query: 161 LKSSLHLSTPSWS-----PVHNCYLRKRRKWPLSPYKTQWHLTFAHQQAMQNLKQSVKSS 325 LKS L S +W N +LRK R+WP YKT+W+ TF +QAM + KQ V + 
Sbjct: 33 LKSELAWSDLAWPLKEMVRCRNLFLRKHRRWPHFAYKTKWNQTFTQKQAMLSFKQLVAVA 92 Query: 326 KTHL-----LNSLIDSFAMYECEPFPKAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFE 490 + +L L++L+ SF++Y P P+AYHF+IK L N ++ IP VL H++ VE F+ Sbjct: 93 QDNLPPPILLSTLVRSFSLYNVHPTPQAYHFLIKTLIQN-LHFNHIPSVLHHLEHVEKFQ 151 Query: 491 TPEFILIDLIKFYGDSNRIQGAIELFYRIPNFRCNPSVHSLKALLLVLCEKGV-LEIIPQ 667 TPE+I DLI YG +NRIQ A+++FYRIP FRC PS +SL +LL +LC L+++PQ Sbjct: 152 TPEYIFADLITTYGIANRIQDAVDIFYRIPKFRCVPSAYSLNSLLALLCRNQYSLKLVPQ 211 Query: 668 VLIKAQLMNIRIEESCFGILIRALCRIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMG 847 VL+K+ LMNIR+EES IL+ ALCR+ KV+YA ++L M ++G ++ ++ S+IL ++ Sbjct: 212 VLLKSLLMNIRVEESTLRILVSALCRMNKVSYAIDILQRMIDEGLGVNDKVCSFILSSIC 271 Query: 848 EQKECSGAEIMCFLEEMCKLGFSPNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPD 1027 + + G ++M E+ KLGF P D+ +I+FL KK DA+ L +MK I P Sbjct: 272 AKADLDGEDVMGLWRELGKLGFCPAMSDYNCLIRFLVKKGRGLDALDFLNQMKSVGIKPG 331 Query: 1028 IVCYNLILDRLVLEEDYVNADKVFDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLR 1207 IV Y + L+ ++ E DY+ AD++FDELL+LGLVP+V+TYN++I+ LCKQNKVEEGIKM+ Sbjct: 332 IVSYTMALNGVIAEGDYMLADELFDELLMLGLVPDVYTYNAYIDALCKQNKVEEGIKMVA 391 Query: 1208 CMEELGCRPDLVTYSTILPSLVADGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNS 1387 CMEEL C+P+++TY+ +L ++ G+++ A EL+ +M+ KG+++N V+Y ++I GL++ Sbjct: 392 CMEELRCKPNVLTYNMLLEAICKVGEISRAMELVKEMKYKGIEMNLVSYTVIIDGLVSKG 451 Query: 1388 EFEQAFDLLQEMINKFSVSDSVLLFDKIICGLCQNGMLNQAFEVLRKMSEKNVGPGVESW 1567 E +A L++E+++K S L FD++ICGLCQ G++ +A E+LRKM KNV PG W Sbjct: 452 EILEAHGLVEEVLHKCFCHQS-LAFDEVICGLCQRGLVCEALELLRKMVAKNVSPGARGW 510 Query: 1568 EALL 1579 EALL Sbjct: 511 EALL 514 >gb|ESW12162.1| hypothetical protein PHAVU_008G089500g [Phaseolus vulgaris] Length = 514 Score = 429 bits (1104), Expect = e-117 Identities = 221/467 (47%), Positives = 312/467 (66%), Gaps = 10/467 (2%) Frame = +2 Query: 209 NCYLRKRRKWPLSPYKTQWHLTFAHQQAMQNLKQSVKSS---------KTHLLNSLIDSF 361 N YLRK RKWP SPYKT WH F QQAM LKQ+ LL++L+D+F Sbjct: 11 NKYLRKFRKWPHSPYKTSWHHNFGEQQAMHKLKQATLEMGCPQTPNLPHPFLLSTLLDAF 70 Query: 362 
AMYECEPFPKAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFETPEFILIDLIKFYGDSN 541 Y C+P PKAY+FVIK L + S IP VLDH++++ETFETPEFIL+ LI+FYG S+ Sbjct: 71 KAYSCDPTPKAYYFVIKTLTST-SHLQDIPPVLDHLEQLETFETPEFILVYLIRFYGLSD 129 Query: 542 RIQGAIELFYRIPNFRCNPSVHSLKALLLVLCEKG-VLEIIPQVLIKAQLMNIRIEESCF 718 R+Q A++LF RIP FRC P+V SL +L +LC K L+++P++L+K+Q MNIR+EES F Sbjct: 130 RVQDAVDLFLRIPRFRCTPTVWSLNLVLSLLCRKRECLKMVPEILLKSQHMNIRVEESTF 189 Query: 719 GILIRALCRIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGAEIMCFLEEM 898 +LI ALCRI +V YA ++LN+M G+ LD I S I+ ++ EQ++ + E + +M Sbjct: 190 QVLIEALCRIKRVGYAIKMLNYMIEGGYGLDETICSLIISSLCEQEDMTSVEALVIWRDM 249 Query: 899 CKLGFSPNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILDRLVLEEDY 1078 KLGF P D+ ++I+FL K+ TDA+ L + K D I PD+VCY ++L +V E +Y Sbjct: 250 RKLGFCPGVMDYTNMIRFLVKEGKGTDALDILNQQKKDGIKPDVVCYTMVLSGIVAEGEY 309 Query: 1079 VNADKVFDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRPDLVTYSTI 1258 V +++FDE+LV GLVP+V+TYN +INGLCKQN V+E +K++ MEEL CRP++VT +T+ Sbjct: 310 VKLEELFDEILVFGLVPDVYTYNVYINGLCKQNNVDEALKIVASMEELECRPNVVTCNTL 369 Query: 1259 LPSLVADGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDLLQEMINKFS 1438 L +L G L AR +M +M KG+ LN +Y +++ GL+ E +A LL+EM+ K Sbjct: 370 LGALCVAGDLRKARGVMKEMGWKGVGLNLHSYRIMLDGLVGKGEIGEACFLLEEMLEKCF 429 Query: 1439 VSDSVLLFDKIICGLCQNGMLNQAFEVLRKMSEKNVGPGVESWEALL 1579 S FD II +CQ G++ +A E+ +K+ K+ PG +WEALL Sbjct: 430 FPRS-STFDHIIFQMCQKGLIAEAIELTKKIVAKSFVPGARAWEALL 475 >gb|AHB18409.1| pentatricopeptide repeat-containing protein [Gossypium hirsutum] Length = 480 Score = 421 bits (1082), Expect = e-115 Identities = 213/463 (46%), Positives = 322/463 (69%), Gaps = 6/463 (1%) Frame = +2 Query: 209 NCYLRKRRKWPL-SPYKTQWHLTFAHQQAMQNLKQSVKSS---KTHLLNSLIDSFAMYEC 376 N +LRK RKWPL S +KT+W F Q M + KQ V + + SL+ S ++Y Sbjct: 7 NFFLRKHRKWPLISSHKTKWRQAFTQNQPMVSFKQLVARHNPLQPDFVPSLLQSLSLYNL 66 Query: 377 EPFPKAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFETPEFILIDLIKFYGDSNRIQGA 556 P+AYHF+IK L +N ++ IP +L H+Q ++ F+TPE+I L+KFYG +NRIQ A Sbjct: 67 
HQSPQAYHFLIKTLLHN-RQFHHIPSLLHHLQ-LQHFQTPEYIFTHLVKFYGKANRIQDA 124 Query: 557 IELFYRIPNFRCNPSVHSLKALLLVLC--EKGVLEIIPQVLIKAQLMNIRIEESCFGILI 730 +++FYRIP FRC PS +SL ALL +LC ++G L+++PQVL+ + MNIR+EES F +L+ Sbjct: 125 VDIFYRIPQFRCFPSAYSLNALLALLCRSQRG-LKLLPQVLLNSLHMNIRLEESTFRLLV 183 Query: 731 RALCRIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGAEIMCFLEEMCKLG 910 LCR+ KV YA E+L M +DG ++ ++FS++L ++ + + G +++ F + KLG Sbjct: 184 CTLCRMNKVAYAIEILQRMLDDGLGVNDKVFSFVLSSVCAEGDLDGEDVIGFWRGLRKLG 243 Query: 911 FSPNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILDRLVLEEDYVNAD 1090 FSP GD+ V++FL KK DA L +MK D I P I+ Y ++L+ + E DY+ AD Sbjct: 244 FSPAMGDYDGVVRFLVKKGRGLDAWDVLNQMKSDGIMPGIISYTMVLNGVTAEGDYILAD 303 Query: 1091 KVFDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRPDLVTYSTILPSL 1270 ++FDELL+LGLVPNV+TY ++I+ LCKQNKVEEGIKM+ CMEELGC+P+++ Y+T+L ++ Sbjct: 304 ELFDELLMLGLVPNVYTYKAYIDALCKQNKVEEGIKMVACMEELGCKPNVLIYNTLLRTI 363 Query: 1271 VADGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDLLQEMINKFSVSDS 1450 G+++ AREL+ +M+ KG+++N V+Y ++I GL++N E +A L++E+++K S Sbjct: 364 SKAGEISRARELVKEMKYKGIEMNWVSYTIIIDGLVSNGEILEACALVEEVLHKCIFIKS 423 Query: 1451 VLLFDKIICGLCQNGMLNQAFEVLRKMSEKNVGPGVESWEALL 1579 L FD++ICGLCQ G++ +A E+L KM E+++ PG WEALL Sbjct: 424 -LTFDEVICGLCQRGLVCKARELLGKMVERSISPGARVWEALL 465 >ref|XP_003533674.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420, mitochondrial-like [Glycine max] Length = 499 Score = 420 bits (1079), Expect = e-114 Identities = 220/470 (46%), Positives = 311/470 (66%), Gaps = 13/470 (2%) Frame = +2 Query: 209 NCYLRKRRKWPLSPYKTQWHLTFAHQQAMQNLKQSV--KSSKTH----------LLNSLI 352 N YLRK +KWP SPYKT WH F +QAM+NLKQ+ S H LL++L+ Sbjct: 11 NKYLRKFKKWPHSPYKTSWHHNFGEEQAMKNLKQATLEMDSSQHPQRPNLPCPFLLSTLL 70 Query: 353 DSFAMYECEPFPKAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFETPEFILIDLIKFYG 532 DSF Y +P PKAY FV+K L + S+ IP VL H++ +E FETPE IL+ LI+FYG Sbjct: 71 DSFKAYSIDPTPKAYFFVLKTLTST-SQLQDIPPVLYHLEHLEKFETPESILVYLIRFYG 129 Query: 533 
DSNRIQGAIELFYRIPNFRCNPSVHSLKALLLVLCEK-GVLEIIPQVLIKAQLMNIRIEE 709 S+R+Q A++LF+RIP FRC P+V SL +L +LC K LE++P++L+K+Q MNIR+EE Sbjct: 130 LSDRVQDAVDLFFRIPRFRCTPTVCSLNLVLSLLCRKRDCLEMVPEILLKSQHMNIRVEE 189 Query: 710 SCFGILIRALCRIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGAEIMCFL 889 S F +LIRALCRI +V YA ++LN M DG+ LD +I S ++ A+ EQK+ + AE + Sbjct: 190 STFRVLIRALCRIKRVGYAIKMLNFMVEDGYGLDEKICSLVISALCEQKDLTSAEALVVW 249 Query: 890 EEMCKLGFSPNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILDRLVLE 1069 +M KLGF P D+ ++I+FL K+ DA+ L + K D I D+V Y ++L +V E Sbjct: 250 RDMRKLGFCPGVMDYTNMIRFLVKEGRGMDALDILNQQKQDGIKLDVVSYTMVLSGIVAE 309 Query: 1070 EDYVNADKVFDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRPDLVTY 1249 +YV D++FDE+LV+GL+P+ +TYN +INGLCKQN V E ++++ MEELGC+P++VTY Sbjct: 310 GEYVMLDELFDEMLVIGLIPDAYTYNVYINGLCKQNNVAEALQIVASMEELGCKPNVVTY 369 Query: 1250 STILPSLVADGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDLLQEMIN 1429 +T+L +L G ARELM +M KG+ LN TY +V+ GL+ E ++ LL+EM+ Sbjct: 370 NTLLGALSVAGDFVKARELMKEMGWKGVGLNLHTYRIVLDGLVGKGEIGESCLLLEEMLE 429 Query: 1430 KFSVSDSVLLFDKIICGLCQNGMLNQAFEVLRKMSEKNVGPGVESWEALL 1579 K S FD II +CQ + +A E+ +K+ K+ PG +WEALL Sbjct: 430 KCLFPRS-STFDNIIFQMCQKDLFTEAMELTKKVVAKSFLPGASTWEALL 478 >gb|EXB42398.1| hypothetical protein L484_021993 [Morus notabilis] Length = 494 Score = 419 bits (1076), Expect = e-114 Identities = 215/461 (46%), Positives = 319/461 (69%), Gaps = 4/461 (0%) Frame = +2 Query: 209 NCYLRKRRKWPLSPYKTQWHLTFAHQQAMQNLKQSVKSSKTHLLNSLIDSFAMYECEPFP 388 N +LRK R++P+SPYKT+WH TF QA+Q LK+ + LL+ L++SF Y+C P P Sbjct: 10 NKFLRKHREFPISPYKTKWHETFNQTQALQTLKRHQNENPNRLLSLLLNSFNSYDCNPTP 69 Query: 389 KAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFETPEFILIDLIKFYGDSNRIQGAIELF 568 +AYHFV+K L S++D I VLD I+ VE FETPE+ +I FYG +RI+ AI++F Sbjct: 70 EAYHFVLKTLIKT-SQFDHIHSVLDRIEFVEKFETPEYFFAQIIGFYGFLDRIEDAIDIF 128 Query: 569 YRIPNFRCNPSVHSLKALLLVLCEKGV-LEIIPQVLIKAQLMNIRIEESCFGILIRALCR 745 +RIP FRC PS +SL +LL VLC + L +P+VLIK++ MNIR+EE+ F ILI ALC+ Sbjct: 129 
WRIPKFRCVPSSYSLNSLLYVLCRRNEGLRFVPEVLIKSRDMNIRLEEASFRILITALCK 188 Query: 746 IGKVNYAAELLNHMANDGFNLDGRIFSYILKAM-GEQKEC--SGAEIMCFLEEMCKLGFS 916 IGKV YA E+L+ M +DG+++D RI S IL + G+ KE +G +++ L++M K+GF Sbjct: 189 IGKVGYAIEILDCMISDGYDIDARICSLILSFLCGKNKELDLAGFDVLELLQKMEKMGFC 248 Query: 917 PNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILDRLVLEEDYVNADKV 1096 P GD+ VI+ L +++ +A+ L +MK D + PD+VCY ++L +V E +Y AD++ Sbjct: 249 PRMGDYSKVIRILVREKRGLEALDILGQMKADGMKPDVVCYTMVLHGIVAEGEYSKADEM 308 Query: 1097 FDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRPDLVTYSTILPSLVA 1276 FDE+LVLGLVP+V+TYN++INGLCKQN V+ + + MEELGC+P+L+TY+ IL +L Sbjct: 309 FDEMLVLGLVPDVYTYNAYINGLCKQNDVDGALDTILRMEELGCKPNLITYNLILRALCK 368 Query: 1277 DGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDLLQEMINKFSVSDSVL 1456 +G+ A+EL+++M +KG + TY +++ LL E +A L++EM++K + Sbjct: 369 NGEFGRAKELVAEMSLKGFEDYLQTYIIMLDVLLGKGEIVEACGLMEEMLDKL-LCRRCS 427 Query: 1457 LFDKIICGLCQNGMLNQAFEVLRKMSEKNVGPGVESWEALL 1579 ++D+II GLC+ G+ +A E+L KM KNV PG +W+ALL Sbjct: 428 MYDEIIFGLCRRGLDCKASEMLGKMVGKNVAPGARAWDALL 468 >ref|XP_003623530.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355498545|gb|AES79748.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 653 Score = 418 bits (1074), Expect = e-114 Identities = 215/467 (46%), Positives = 312/467 (66%), Gaps = 6/467 (1%) Frame = +2 Query: 197 SPVHNCYLRKRRKWPLSPYKTQWHLTFAHQQAMQNL----KQSVKSSKTHLLNSLIDSFA 364 S N YLRK RKWP SPYKT WH F QQA+Q L Q+ ++ LL++LI SF Sbjct: 7 SKTANKYLRKFRKWPHSPYKTSWHHNFGEQQAIQILINAKTQTQNNNDPFLLSTLIHSFK 66 Query: 365 MYECEPFPKAYHFVIKVLAN-NPSRWDQIPQVLDHIQKVETFETPEFILIDLIKFYGDSN 541 Y +P PKAY F+IK + N N S +IP +L+H++ E FETPEFI + LI+FYG ++ Sbjct: 67 AYHTDPSPKAYFFLIKTITNINTSHLHEIPHILNHLEHNEKFETPEFIFMYLIRFYGFND 126 Query: 542 RIQGAIELFYRIPNFRCNPSVHSLKALLLVLCEKG-VLEIIPQVLIKAQLMNIRIEESCF 718 R+Q A++LF+RIP FRC P+V SL LL +LC K L ++P +L+K++ M IR+EES F Sbjct: 127 RVQDAVDLFFRIPRFRCTPTVCSLNLLLSLLCGKRECLRMVPDILLKSRDMKIRLEESSF 186 Query: 719 
GILIRALCRIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGAEIMCFLEEM 898 +LI+ALCRI +V+YA +++N M DG+ LD +I S I+ ++ EQ + + E + M Sbjct: 187 WVLIKALCRIKRVDYAIKMMNCMVEDGYCLDDKICSLIISSLCEQNDLTSVEALVVWGNM 246 Query: 899 CKLGFSPNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILDRLVLEEDY 1078 KLGF P D ++I+FL K+ DA++ L ++K D I PDIVCY ++L +V E DY Sbjct: 247 RKLGFCPGVMDCTNMIRFLVKEGKGMDALEILNQLKEDGIKPDIVCYTIVLSGIVKEGDY 306 Query: 1079 VNADKVFDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRPDLVTYSTI 1258 V D++FDE+LVLGLVP+V+TYN +INGLCKQN +E +K++ ME+LGC+P++VTY+T+ Sbjct: 307 VKLDELFDEILVLGLVPDVYTYNVYINGLCKQNNFDEALKIVVSMEKLGCKPNVVTYNTL 366 Query: 1259 LPSLVADGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDLLQEMINKFS 1438 L +L G L A+ +M +M +KG++LN TY +++ GL+ E +A LL+EM+ K Sbjct: 367 LGALCMSGDLGKAKRVMKEMRLKGVELNLHTYRIMLDGLVGKGEIGEACVLLEEMLEKCF 426 Query: 1439 VSDSVLLFDKIICGLCQNGMLNQAFEVLRKMSEKNVGPGVESWEALL 1579 S FD I+ +CQ G+++ A ++ K+ K+ PG + WEALL Sbjct: 427 YPRS-STFDSIVHQMCQKGLISDALVLMNKIVAKSFDPGAKVWEALL 472 >gb|ESW10362.1| hypothetical protein PHAVU_009G202600g [Phaseolus vulgaris] Length = 513 Score = 416 bits (1069), Expect = e-113 Identities = 213/468 (45%), Positives = 311/468 (66%), Gaps = 10/468 (2%) Frame = +2 Query: 209 NCYLRKRRKWPLSPYKTQWHLTFAHQQAMQNLKQSVKSS---------KTHLLNSLIDSF 361 N YLRK RKWP SPYKT WH F QQAM LKQ+ LL++LIDSF Sbjct: 11 NKYLRKFRKWPHSPYKTSWHHNFGEQQAMHKLKQATLEMGCPQTPNLPHPFLLSTLIDSF 70 Query: 362 AMYECEPFPKAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFETPEFILIDLIKFYGDSN 541 Y C+P PKAY+F+IK L S++ IP VLDH++ +E FETPEF L+ LI+FYG S+ Sbjct: 71 KSYSCDPTPKAYYFLIKTLTCT-SQFQDIPPVLDHLEHLEKFETPEFNLVYLIRFYGLSD 129 Query: 542 RIQGAIELFYRIPNFRCNPSVHSLKALLLVLCEKG-VLEIIPQVLIKAQLMNIRIEESCF 718 ++Q A++LF RIP FRC P+V SL +L +LC K L+++P++L+K+Q MNIR+EES F Sbjct: 130 KVQDAVDLFLRIPRFRCTPTVCSLNLVLSLLCRKRECLKMVPEILLKSQHMNIRVEESTF 189 Query: 719 GILIRALCRIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGAEIMCFLEEM 898 +LI+ALCRI +V YA ++LN+M G+ LD + S I+ ++ EQ++ + E + +M Sbjct: 190 
QVLIKALCRIKRVGYAIKMLNYMIEGGYGLDETMCSLIISSLCEQEDMTSVEALVIWRDM 249 Query: 899 CKLGFSPNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILDRLVLEEDY 1078 KLGF P D+ ++I+FL K+ DA+ L + K D I PD+VCY ++L ++ E +Y Sbjct: 250 RKLGFCPGIMDYTNMIRFLVKEGKGMDALDILNQQKKDGIKPDVVCYTMVLSGIIAEGEY 309 Query: 1079 VNADKVFDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRPDLVTYSTI 1258 V +++FDE+LV GLVP+V+TYN +INGLCKQN V+E +K++ MEEL C+P++VT + + Sbjct: 310 VKLEELFDEILVFGLVPDVYTYNVYINGLCKQNNVDEALKIVASMEELECKPNVVTCNIL 369 Query: 1259 LPSLVADGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDLLQEMINKFS 1438 L +L G L AR +M +M KG++L+ +Y +++ GL+ E +A LL+EM+ K S Sbjct: 370 LGALCVAGDLRKARGVMKEMGWKGVRLDLHSYRIMLDGLVGKGEIGEACFLLEEMLEK-S 428 Query: 1439 VSDSVLLFDKIICGLCQNGMLNQAFEVLRKMSEKNVGPGVESWEALLQ 1582 FD II +CQ G++ +A E+ +K+ K+ PG +WEALL+ Sbjct: 429 FFPRSSTFDHIIFQMCQKGLIVEAIELTKKIVAKSFVPGARAWEALLK 476 >ref|XP_004137893.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420, mitochondrial-like [Cucumis sativus] gi|449483740|ref|XP_004156675.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420, mitochondrial-like [Cucumis sativus] Length = 491 Score = 399 bits (1024), Expect = e-108 Identities = 210/474 (44%), Positives = 308/474 (64%), Gaps = 2/474 (0%) Frame = +2 Query: 209 NCYLRKRRKWPLSPYKTQWHLTFAHQQAMQNLKQSVKSSKTHLL-NSLIDSFAMYECEPF 385 N +LRK RKWPLS +KT+WH TF +A++ LKQ+ + HLL ++L+ SF Y C P Sbjct: 12 NNFLRKHRKWPLSSHKTKWHQTFDQDEALRILKQAANPDQPHLLLSALVTSFTAYSCHPT 71 Query: 386 PKAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFETPEFILIDLIKFYGDSNRIQGAIEL 565 P AY+FV+K LA S++ IP VL +Q +E F+TPE+I +DLIK YG NRIQ A+ L Sbjct: 72 PNAYYFVLKTLART-SQFHHIPPVLHRLQFLENFQTPEYIFVDLIKLYGRMNRIQDAVTL 130 Query: 566 FYRIPNFRCNPSVHSLKALLLVLCEKGV-LEIIPQVLIKAQLMNIRIEESCFGILIRALC 742 F RIP FRC PS SL +LL L L IIP +++ + M IR+E S F ILI ALC Sbjct: 131 FRRIPMFRCVPSTLSLNSLLSQLSRNAQGLPIIPDIILNSHSMGIRLEHSTFQILITALC 190 Query: 743 RIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGAEIMCFLEEMCKLGFSPN 922 ++ KV +A EL N+M +G+ L+ +I S IL ++ +QK+ SG ++ FLEEM + 
GF P Sbjct: 191 KVNKVGHAMELFNYMITEGYGLNPQICSLILASLCQQKKSSGDVVLGFLEEMRQKGFCPA 250 Query: 923 RGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILDRLVLEEDYVNADKVFD 1102 D+ +VI+F + +DA+ L +MK D PDIVCY ++L+ ++ + DY AD++FD Sbjct: 251 VVDYSNVIKFFVTRGMGSDAVDLLNKMKADGFKPDIVCYTMVLNGVIADGDYKMADELFD 310 Query: 1103 ELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRPDLVTYSTILPSLVADG 1282 ELL+ GLVP+++TYN +I+GLCKQ G++M+ ME LGC+P+++TY+ IL SL G Sbjct: 311 ELLLFGLVPDIYTYNVYIHGLCKQGDSVAGLQMIPHMEALGCQPNVITYNVILKSLCKTG 370 Query: 1283 KLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDLLQEMINKFSVSDSVLLF 1462 +L+ AR+L SKM++KG+ N T+ ++I GL N E +A LL+EM+ + F Sbjct: 371 ELDEARKLRSKMQLKGLAENLRTFRIMIDGLFHNGEVIEACVLLEEMLGS-RFPPQISTF 429 Query: 1463 DKIICGLCQNGMLNQAFEVLRKMSEKNVGPGVESWEALLQVFEFIHSPNEISAI 1624 +I+ LC+ M+ +A E+L M KN PG ++WE LL + S +E++++ Sbjct: 430 SEILSWLCKRHMVGKAVELLALMVGKNFSPGPKAWEILL-----LSSESELTSV 478 >ref|NP_181376.3| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|218546769|sp|Q8L6Y7.2|PP193_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At2g38420, mitochondrial; Flags: Precursor gi|3395430|gb|AAC28762.1| hypothetical protein [Arabidopsis thaliana] gi|330254441|gb|AEC09535.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 453 Score = 361 bits (926), Expect = 7e-97 Identities = 188/447 (42%), Positives = 294/447 (65%), Gaps = 3/447 (0%) Frame = +2 Query: 191 SWSPVHNCYLRKRRKWPLSPYKTQWHLTFAHQQAMQNLKQSV--KSSKTHLLNSLIDSFA 364 SW + N ++RK RK P S +KT+W+ + AM+ L+ ++ S ++ +L+ SF Sbjct: 6 SWHRMSN-FMRKYRKIPHSSFKTKWNENLKQKYAMEELRSNLLTDSENASVMRTLLSSFQ 64 Query: 365 MYECEPFPKAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFETPEFILIDLIKFYGDSNR 544 ++ CEP P+AY FVIK LA + S+ + I VL H++ E F+TPE I D+I YG S R Sbjct: 65 LHNCEPTPQAYRFVIKTLAKS-SQLENISSVLYHLEVSEKFDTPESIFRDVIAAYGFSGR 123 Query: 545 IQGAIELFYRIPNFRCNPSVHSLKALLLVLCEKGV-LEIIPQVLIKAQLMNIRIEESCFG 721 I+ AIE+F++IPNFRC PS ++L ALLLVL K LE++P++L+KA M +R+EES FG Sbjct: 124 
IEEAIEVFFKIPNFRCVPSAYTLNALLLVLVRKRQSLELVPEILVKACRMGVRLEESTFG 183 Query: 722 ILIRALCRIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGAEIMCFLEEMC 901 ILI ALCRIG+V+ A EL+ +M+ D +D R++S +L ++ + K+ S +++ +LE++ Sbjct: 184 ILIDALCRIGEVDCATELVRYMSQDSVIVDPRLYSRLLSSVCKHKDSSCFDVIGYLEDLR 243 Query: 902 KLGFSPNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILDRLVLEEDYV 1081 K FSP D+ V++FL + + + L +MK D ++PD+VCY ++L ++ +EDY Sbjct: 244 KTRFSPGLRDYTVVMRFLVEGGRGKEVVSVLNQMKCDRVEPDLVCYTIVLQGVIADEDYP 303 Query: 1082 NADKVFDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRPDLVTYSTIL 1261 ADK+FDELL+LGL P+V+TYN +INGLCKQN +E +KM+ M +LG P++VTY+ ++ Sbjct: 304 KADKLFDELLLLGLAPDVYTYNVYINGLCKQNDIEGALKMMSSMNKLGSEPNVVTYNILI 363 Query: 1262 PSLVADGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDLLQEMINKFSV 1441 +LV G L+ A+ L +ME G+ NS T++++I + E A LL+E N +V Sbjct: 364 KALVKAGDLSRAKTLWKEMETNGVNRNSHTFDIMISAYIEVDEVVCAHGLLEEAFN-MNV 422 Query: 1442 SDSVLLFDKIICGLCQNGMLNQAFEVL 1522 +++I LC+ G+++QA E+L Sbjct: 423 FVKSSRIEEVISRLCEKGLMDQAVELL 449 >ref|XP_002879744.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297325583|gb|EFH56003.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. 
lyrata] Length = 444 Score = 360 bits (925), Expect = 1e-96 Identities = 188/441 (42%), Positives = 291/441 (65%), Gaps = 5/441 (1%) Frame = +2 Query: 215 YLRKRRKWPLSPYKTQWHLTFAHQQAMQNLKQSV--KSSKTHLLNSLIDSFAMYECEPFP 388 ++RK RK P S +KT+W+ + AM+ L+ ++ S ++ +L+ SF ++ CEP P Sbjct: 4 FMRKYRKIPQSSFKTKWNENLKQKYAMEELRSNLLADSENGSVMRTLVSSFQLHNCEPTP 63 Query: 389 KAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFETPEFILIDLIKFYGDSNRIQGAIELF 568 +AY FVI+ LA S+ + I VLDH++ E F+TPE I D+I YG S RI+ AI++F Sbjct: 64 QAYRFVIETLAKT-SQLENIASVLDHLEVSEKFDTPESIFRDVIAAYGFSGRIEEAIDVF 122 Query: 569 YRIPNFRCNPSVHSLKALLLVLCEKGV-LEIIPQVLIKAQLMNIRIEESCFGILIRALCR 745 ++IPNFRC PS ++L ALLLVL K LE++P++L+KA M +R+EES FGILI ALCR Sbjct: 123 FKIPNFRCVPSAYTLNALLLVLVRKRQSLELVPEILVKASRMGVRLEESTFGILINALCR 182 Query: 746 IGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGAEIMCFLEEMCKLGFSPNR 925 IG+V+ A EL+ +M+ D +D R++S +L ++ + K+ S +++ +LE++ K F P Sbjct: 183 IGEVDCATELVRYMSEDSVIVDPRLYSLLLSSVCKHKDSSCFDVIGYLEDLRKTRFLPGL 242 Query: 926 GDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILDRLVLEEDYVNADKVFDE 1105 D+ V++FL + + + L +MK D IDPD+VCY ++L ++ +EDY ADK+FDE Sbjct: 243 RDYTVVMRFLVEGGRGKEVVSVLNQMKCDRIDPDVVCYTIVLLGVIADEDYPKADKLFDE 302 Query: 1106 LLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRPDLVTYSTILPSLVADGK 1285 LL+LGL P+V+TYN +INGLCKQN +E IKM+ M +LG P++VTY+ ++ LV G Sbjct: 303 LLLLGLDPDVYTYNVYINGLCKQNDIEGAIKMMSSMNKLGSEPNVVTYNIVIKGLVKAGD 362 Query: 1286 LNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDLLQEM--INKFSVSDSVLL 1459 L+ A+ L +MEM G+ NS TY+++I + E A LL+E +N F S + Sbjct: 363 LSRAKTLWKEMEMNGVNRNSHTYDIMISAYIEVDEVVCAQGLLEEAFNMNLFVKSSKI-- 420 Query: 1460 FDKIICGLCQNGMLNQAFEVL 1522 +++I LC+ G++++A E+L Sbjct: 421 -EEVISRLCEKGLMDKAVELL 440 >gb|AAM98219.1| unknown protein [Arabidopsis thaliana] gi|31376375|gb|AAP49514.1| At2g38420 [Arabidopsis thaliana] Length = 444 Score = 358 bits (920), Expect = 4e-96 Identities = 185/439 (42%), Positives = 290/439 (66%), Gaps = 3/439 (0%) Frame = +2 Query: 215 YLRKRRKWPLSPYKTQWHLTFAHQQAMQNLKQSV--KSSKTHLLNSLIDSFAMYECEPFP 
388 ++RK RK P S +KT+W+ + AM+ L+ ++ S ++ +L+ SF ++ CEP P Sbjct: 4 FMRKYRKIPHSSFKTKWNENLKQKYAMEELRSNLLTDSENASVMRTLLSSFQLHNCEPTP 63 Query: 389 KAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFETPEFILIDLIKFYGDSNRIQGAIELF 568 +AY FVIK LA + S+ + I VL H++ E F+TPE I D+I YG S RI+ AIE+F Sbjct: 64 QAYRFVIKTLAKS-SQLENISSVLYHLEVSEKFDTPESIFRDVIAAYGFSGRIEEAIEVF 122 Query: 569 YRIPNFRCNPSVHSLKALLLVLCEKGV-LEIIPQVLIKAQLMNIRIEESCFGILIRALCR 745 ++IPNFRC PS ++L ALLLVL K LE++P++L+KA M +R+EES FGILI ALCR Sbjct: 123 FKIPNFRCVPSAYTLNALLLVLVRKRQSLELVPEILVKACRMGVRLEESTFGILIDALCR 182 Query: 746 IGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGAEIMCFLEEMCKLGFSPNR 925 IG+V+ A EL+ +M+ D +D R++S +L ++ + K+ S +++ +LE++ K FSP Sbjct: 183 IGEVDCATELVRYMSQDSVIVDPRLYSRLLSSVCKHKDSSCFDVIGYLEDLRKTRFSPGL 242 Query: 926 GDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILDRLVLEEDYVNADKVFDE 1105 D+ V++FL + + + L +MK D ++PD+VCY ++L ++ +EDY ADK+FDE Sbjct: 243 RDYTVVMRFLVEGGRGKEVVSVLNQMKCDRVEPDLVCYTIVLQGVIADEDYPKADKLFDE 302 Query: 1106 LLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRPDLVTYSTILPSLVADGK 1285 LL+LGL P+V+TYN +INGLCKQN +E +KM+ M +LG P++VTY+ ++ +LV G Sbjct: 303 LLLLGLAPDVYTYNVYINGLCKQNDIEGALKMMSSMNKLGSEPNVVTYNILIKALVKAGD 362 Query: 1286 LNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDLLQEMINKFSVSDSVLLFD 1465 L+ A+ L +ME G+ NS T++++I + E A LL+E N +V + Sbjct: 363 LSRAKTLWKEMETNGVNRNSHTFDIMISAYIEVDEVVCAHGLLEEAFN-MNVFVKSSRIE 421 Query: 1466 KIICGLCQNGMLNQAFEVL 1522 ++I LC+ G+++QA E+L Sbjct: 422 EVISRLCEKGLMDQAVELL 440 >ref|XP_006411054.1| hypothetical protein EUTSA_v10017948mg [Eutrema salsugineum] gi|557112223|gb|ESQ52507.1| hypothetical protein EUTSA_v10017948mg [Eutrema salsugineum] Length = 456 Score = 357 bits (917), Expect = 8e-96 Identities = 189/450 (42%), Positives = 296/450 (65%), Gaps = 6/450 (1%) Frame = +2 Query: 191 SWSPVHNCYLRKRRKWPLSPYKTQWHLTFAHQQAMQNLKQSV-----KSSKTHLLNSLID 355 SW + N + RK RK P S +KT+W+ + AM+ L+ + + +L +LI Sbjct: 6 SWHRMSN-FFRKYRKIPHSSFKTKWNENLKQKYAMEELRSGLIADSGSNENDGVLRTLIS 64 Query: 356 
SFAMYECEPFPKAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFETPEFILIDLIKFYGD 535 SF ++ CEP P+AY FVIK LA S+ + I VL+HI+ E F+TPE I D+I YG Sbjct: 65 SFRLHNCEPTPQAYKFVIKTLAKT-SQLENIASVLNHIEISEKFDTPESIFRDVIFAYGF 123 Query: 536 SNRIQGAIELFYRIPNFRCNPSVHSLKALLLVLCEKGV-LEIIPQVLIKAQLMNIRIEES 712 S RI+ AI++F++IPNFRC PS ++L ALL VL K L+++P+VL+KA + +R+EES Sbjct: 124 SGRIEEAIDVFFKIPNFRCVPSAYTLNALLSVLVRKRQGLKMVPEVLLKASKLGVRLEES 183 Query: 713 CFGILIRALCRIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGAEIMCFLE 892 GILI ALCRIG+V+ A +L+ M++D + +D R++S +L ++ + K+ S +++ +LE Sbjct: 184 TLGILIDALCRIGEVDCATDLVKDMSDDCYIVDPRLYSLLLSSVCKHKDSSCFDVIGYLE 243 Query: 893 EMCKLGFSPNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILDRLVLEE 1072 + K FSP+ D+ V++FL + + + L +MK D I+PDIVCY +IL ++ +E Sbjct: 244 GLRKTRFSPDLRDYTAVMRFLVEGGRGKEVVSVLNQMKCDRIEPDIVCYTIILQGVIADE 303 Query: 1073 DYVNADKVFDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRPDLVTYS 1252 DY ADK+FDELL+LGLVP+V+TYN +INGLCKQ+ +E GIKM+ CME+LG P++VTY+ Sbjct: 304 DYKKADKLFDELLLLGLVPDVYTYNVYINGLCKQSDIECGIKMMSCMEKLGSEPNVVTYN 363 Query: 1253 TILPSLVADGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDLLQEMINK 1432 ++ +LV G ++ A+ + +ME G+ NS +Y++++ + E A LL+E ++ Sbjct: 364 ILIKALVKAGDMSRAKIIWEEMETNGVDRNSHSYDIMVNASIEADEVVCAHGLLEEAFSR 423 Query: 1433 FSVSDSVLLFDKIICGLCQNGMLNQAFEVL 1522 V S +++IC LC G++++A E+L Sbjct: 424 SLVVKSSRT-EEVICRLCDKGLMDKAVELL 452 >ref|XP_006294146.1| hypothetical protein CARUB_v10023139mg [Capsella rubella] gi|482562854|gb|EOA27044.1| hypothetical protein CARUB_v10023139mg [Capsella rubella] Length = 470 Score = 348 bits (894), Expect = 4e-93 Identities = 183/447 (40%), Positives = 287/447 (64%), Gaps = 3/447 (0%) Frame = +2 Query: 191 SWSPVHNCYLRKRRKWPLSPYKTQWHLTFAHQQAMQNLKQS--VKSSKTHLLNSLIDSFA 364 SW + N +LRK RK P SP+KT+W+ + AM+ L+ S S ++ +L+ SF Sbjct: 23 SWHRMSN-FLRKYRKIPHSPFKTKWNENLKQKYAMEELRSSPVADSEDGGVIRTLVSSFR 81 Query: 365 MYECEPFPKAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFETPEFILIDLIKFYGDSNR 544 ++ CEP P+AY FVIK LA S+ + I VL H++ E F+TPE I D+I YG + R Sbjct: 82 
LHNCEPTPQAYRFVIKTLAKT-SQLENIASVLSHLEVSEKFDTPESIFRDVIAAYGFAGR 140 Query: 545 IQGAIELFYRIPNFRCNPSVHSLKALLLVLCEKGV-LEIIPQVLIKAQLMNIRIEESCFG 721 I AI++F++IPNFRC PS ++L ALLLVL K LE++P++L+KA M +R+EES FG Sbjct: 141 IGEAIDVFFKIPNFRCVPSAYTLNALLLVLVRKRESLELVPEILVKASRMGVRLEESTFG 200 Query: 722 ILIRALCRIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGAEIMCFLEEMC 901 ILI ALC+IG+V+ A EL+ +M+ D +D R++S +L ++ + K+ S +++ +LE++ Sbjct: 201 ILIDALCKIGEVDCATELVRYMSIDCVIVDPRLYSQLLSSVCKHKDSSCFDVVGYLEDLR 260 Query: 902 KLGFSPNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILDRLVLEEDYV 1081 K FSP D+ V+ FL + + + L +MK D I+PDIVCY ++L ++ + +Y Sbjct: 261 KTRFSPGLRDYTVVMSFLVEGGRGKEVVSVLNQMKCDRIEPDIVCYTIVLQGVIADAEYS 320 Query: 1082 NADKVFDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRPDLVTYSTIL 1261 ADK FDELL+LGL P+V+TYN ++NGLCKQN +E +KM+ M +LG P+++TY+ ++ Sbjct: 321 KADKFFDELLLLGLAPDVYTYNVYMNGLCKQNDIEGALKMMSSMNKLGSEPNVITYNILI 380 Query: 1262 PSLVADGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDLLQEMINKFSV 1441 +LV G L+ A+ L +M + G+ NS TY+++I + + A L+E N +V Sbjct: 381 KALVNAGDLSQAKTLWEEMGINGVNRNSHTYDIMISAFIEVGDVVSAQGFLEEAFN-MNV 439 Query: 1442 SDSVLLFDKIICGLCQNGMLNQAFEVL 1522 +++I LC G++++A E+L Sbjct: 440 FAKSSRTEEVISRLCDKGLMDKAVELL 466 >ref|XP_002533822.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223526239|gb|EEF28557.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 373 Score = 322 bits (826), Expect = 3e-85 Identities = 168/346 (48%), Positives = 231/346 (66%), Gaps = 1/346 (0%) Frame = +2 Query: 545 IQGAIELFYRIPNFRCNPSVHSLKALLLVLCEKGV-LEIIPQVLIKAQLMNIRIEESCFG 721 +Q AI LFYR PNFRC PSV+ L LL VLC L +P+VL+K+Q MNIR+EES F Sbjct: 1 MQNAIHLFYRTPNFRCVPSVYLLNTLLSVLCRTNEGLNFVPEVLLKSQDMNIRMEESSFR 60 Query: 722 ILIRALCRIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGAEIMCFLEEMC 901 +LI ALC I KV YA E+ N M NDGF++D +I S +L ++ Q + S +E+M FL E+ Sbjct: 61 LLINALCSINKVGYAVEMFNCMINDGFSVDSKICSLLLSSLCYQADISSSEVMRFLGELR 120 Query: 902 
KLGFSPNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILDRLVLEEDYV 1081 K GF P D+ VI FL ++ +A+ L +MKLD I PDIVCY +L+ ++ Y Sbjct: 121 KFGFCPGIKDYSKVINFLVRRGMGMEALNVLNQMKLDGIKPDIVCYTTVLNGVIANGVYS 180 Query: 1082 NADKVFDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRPDLVTYSTIL 1261 AD++FDELLV GLVP+V+TYN +I GLCKQN VE GI+M+ MEELGC+P+L+TY+ +L Sbjct: 181 KADELFDELLVFGLVPDVYTYNVYIYGLCKQNNVEAGIEMVTSMEELGCKPNLITYNILL 240 Query: 1262 PSLVADGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDLLQEMINKFSV 1441 L +G+ + AR+L+ M KG+ L TY+++I GL + + +A LL+E ++K + Sbjct: 241 EDLCKNGEDSRARDLVRDMGSKGIGLGMQTYKVMIHGLTSGGKIVKACSLLEEALDK-GL 299 Query: 1442 SDSVLLFDKIICGLCQNGMLNQAFEVLRKMSEKNVGPGVESWEALL 1579 L FD++I GLCQ G + +A E+L K+ KNV PGV WE LL Sbjct: 300 CPRGLRFDEVIYGLCQTGSICKALELLEKVVNKNVSPGVRVWETLL 345 >emb|CAN63706.1| hypothetical protein VITISV_013107 [Vitis vinifera] Length = 390 Score = 317 bits (813), Expect = 9e-84 Identities = 177/420 (42%), Positives = 244/420 (58%), Gaps = 8/420 (1%) Frame = +2 Query: 179 LSTPSWSPVHNCYLRKRRKWPLSPYKTQWHLTFAHQQAMQNLKQSVKS--------SKTH 334 L PS+S + +LRKRRKWPLSPYK WH TF H+QAMQ LK ++ + S + Sbjct: 4 LRPPSFSKTN--FLRKRRKWPLSPYKATWHETFHHRQAMQTLKNTIANQSPSPQSPSNSQ 61 Query: 335 LLNSLIDSFAMYECEPFPKAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFETPEFILID 514 L+ LIDSF +Y +P P AY FVI L ++ +P +L ++KVE FETPEFI + Sbjct: 62 FLSILIDSFRIYNSDPTPNAYRFVISTLTRC-RQFHHLPPLLHRLEKVEKFETPEFIFTN 120 Query: 515 LIKFYGDSNRIQGAIELFYRIPNFRCNPSVHSLKALLLVLCEKGVLEIIPQVLIKAQLMN 694 LIK +L+K+Q MN Sbjct: 121 LIK------------------------------------------------ILLKSQAMN 132 Query: 695 IRIEESCFGILIRALCRIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGAE 874 IR+EES F IL+ ALCRI K NYA +LN+M NDG+ +D ++ S IL ++ EQK SG E Sbjct: 133 IRLEESSFRILVAALCRIKKHNYAIRILNYMLNDGYAVDAKMCSIILSSLCEQKGLSGDE 192 Query: 875 IMCFLEEMCKLGFSPNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILD 1054 ++ F+EEM KLGF P R D +VI FL K+ DA+ +MK D I PD V Y +IL+ Sbjct: 193 VLRFMEEMRKLGFYPGRVDCNNVIXFLVKEGMVMDALGVFDQMKTDGIKPDTVSYTMILN 252 Query: 1055 
RLVLEEDYVNADKVFDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRP 1234 + + DY AD +FDE+LVLG+VP++ YN +IN LCKQN +EEG++ML M ELGC+P Sbjct: 253 GVTADGDYEKADDLFDEMLVLGVVPDIHAYNVYINSLCKQNNIEEGVRMLASMRELGCKP 312 Query: 1235 DLVTYSTILPSLVADGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDLL 1414 D V Y+ +L + L REL +ME++G+Q N TY +++ GL+ E +++ L+ Sbjct: 313 DYVXYNMLLEGMSKVRDLGGMRELAREMELEGVQWNWETYRIMLDGLVGKGEIDESCSLV 372