BLASTX nr result
ID: Catharanthus22_contig00019994
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00019994
         (1055 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_006493995.1| PREDICTED: pentatricopeptide repeat-containi...  393   e-107
ref|XP_006338375.1| PREDICTED: pentatricopeptide repeat-containi...  378   e-102
gb|EOX94498.1| Basic helix-loop-helix DNA-binding superfamily pr...  370   e-100
ref|XP_006420414.1| hypothetical protein CICLE_v10006642mg [Citr...  365   2e-98
ref|XP_002301860.2| pentatricopeptide repeat-containing family p...  360   7e-97
gb|EMJ01920.1| hypothetical protein PRUPE_ppa025321mg [Prunus pe...  355   2e-95
ref|XP_003530855.2| PREDICTED: pentatricopeptide repeat-containi...  353   5e-95
gb|EXB75130.1| hypothetical protein L484_025905 [Morus notabilis]    352   1e-94
ref|XP_004152039.1| PREDICTED: pentatricopeptide repeat-containi...  345   2e-92
gb|ESW06293.1| hypothetical protein PHAVU_010G035600g [Phaseolus...  345   2e-92
ref|XP_004165913.1| PREDICTED: pentatricopeptide repeat-containi...  342   1e-91
emb|CBI20254.3| unnamed protein product [Vitis vinifera]             338   2e-90
ref|XP_004511192.1| PREDICTED: pentatricopeptide repeat-containi...  331   3e-88
ref|XP_002892322.1| hypothetical protein ARALYDRAFT_311694 [Arab...  330   6e-88
ref|XP_004292199.1| PREDICTED: pentatricopeptide repeat-containi...  330   7e-88
sp|Q56X05.2|PPR15_ARATH RecName: Full=Pentatricopeptide repeat-c...  325   2e-86
ref|NP_172105.4| transcription factor EMB1444 [Arabidopsis thali...  325   2e-86
dbj|BAD94184.1| hypothetical protein [Arabidopsis thaliana]          321   3e-85
ref|XP_006409153.1| hypothetical protein EUTSA_v10022616mg [Eutr...
318 2e-84 ref|XP_004233665.1| PREDICTED: pentatricopeptide repeat-containi... 318 3e-84 >ref|XP_006493995.1| PREDICTED: pentatricopeptide repeat-containing protein At1g06145-like [Citrus sinensis] Length = 578 Score = 393 bits (1010), Expect = e-107 Identities = 184/302 (60%), Positives = 240/302 (79%), Gaps = 1/302 (0%) Frame = +3 Query: 153 SEITHLANYLKRCSSLKELESLYAFMTKSNSTRDCFLMNQFITAC-SKFHHVEWAIAAFT 329 + I +AN LK+CSS+KELE +YA M K+N+ +DCFL NQF++ C S+FH ++AI AFT Sbjct: 24 TRIHTMANQLKKCSSVKELECVYATMVKTNANQDCFLANQFVSFCTSRFHRTDYAILAFT 83 Query: 330 QMRNPNIFVYNALIGALLRCSQPDQALQIYSDMLRTEINPSSYTFPSIIKSCTLVPAVKL 509 QM+ PN+FVYNALI L+ C P QA+ Y MLR E+ P+SYTF S+IK+C+L+ + Sbjct: 84 QMQEPNVFVYNALIRGLVHCGHPHQAIIFYLHMLRAEVLPTSYTFSSLIKACSLLLDICS 143 Query: 510 GESIHGQVWKYGFGLHIHVQTALMDFYSNFDRVLDSRLVFDNMPERDSFAWTTMVAAHVR 689 GE++HGQVWK GFG H+ VQTAL+D+YSN ++ +SR VFD MP+RD F+WTTMV AH R Sbjct: 144 GEAVHGQVWKNGFGSHVFVQTALVDYYSNSNKFFESRSVFDEMPQRDIFSWTTMVLAHAR 203 Query: 690 FRDLNSARKLFDKMPEKNTASWNTMIHGFARMRDVESAKELFNKMPEKDLISWTTMIHCY 869 DL SAR+LFD+MPE+N A+WNTMI +AR+ +V +A+ LFNKMP +D+ISWTTMI CY Sbjct: 204 AGDLCSARRLFDEMPERNIATWNTMIDAYARLGNVRAAELLFNKMPARDIISWTTMITCY 263 Query: 870 SQNKYYKEALDIFREMKDHGISPDEVTMSTIISACAHLGLLDEGKDIHLYVIQSQFDLDV 1049 SQNK ++EALD F EMK+ GISPD+VTM+T++SACAHLG LD G++IHLYV+Q FD+DV Sbjct: 264 SQNKQFREALDAFNEMKNSGISPDQVTMATVLSACAHLGALDLGREIHLYVMQIGFDIDV 323 Query: 1050 YI 1055 YI Sbjct: 324 YI 325 Score = 77.4 bits (189), Expect = 9e-12 Identities = 56/251 (22%), Positives = 113/251 (45%), Gaps = 9/251 (3%) Frame = +3 Query: 249 RDCFLMNQFITACSKFHHVEWAIAAFTQMRNPNIFVYNALIGALLRCSQPDQALQIYSDM 428 R+ N I A ++ +V A F +M +I + +I + Q +AL +++M Sbjct: 220 RNIATWNTMIDAYARLGNVRAAELLFNKMPARDIISWTTMITCYSQNKQFREALDAFNEM 279 Query: 429 LRTEINPSSYTFPSIIKSCTLVPAVKLGESIHGQVWKYGFGLHIHVQTALMDFYSNFDRV 608 + I+P T +++ +C + A+ LG IH V + GF + +++ +AL+D Y+ + Sbjct: 280 KNSGISPDQVTMATVLSACAHLGALDLGREIHLYVMQIGFDIDVYIGSALVDMYAKCGSL 339 Query: 609 
LDSRLVFDNMPERDSFAWTTMVAAHVRFRDLNSARKLFDKM----PEKNTASWNTMIHGF 776 S LVF + E++ F W +++ + A +FD+M E N ++ +++ Sbjct: 340 DRSLLVFFKLREKNLFCWNSIIEGLAVHGFAHEALAMFDRMIYENVEPNGVTFISVLSAC 399 Query: 777 ARMRDVESAKELFNKMP-----EKDLISWTTMIHCYSQNKYYKEALDIFREMKDHGISPD 941 VE + F M ++ + M+ S+ ++AL++ R K P+ Sbjct: 400 THAGLVEEGRRRFLSMTCGYSITPEVEHYGCMVDLLSKAGLLEDALELIRSSK---FQPN 456 Query: 942 EVTMSTIISAC 974 V ++ C Sbjct: 457 AVIWGALLGGC 467 Score = 70.5 bits (171), Expect = 1e-09 Identities = 62/318 (19%), Positives = 131/318 (41%), Gaps = 43/318 (13%) Frame = +3 Query: 180 LKRCSSLKELESLYAFMTKSNSTRDCFLMNQFITACSKFHHVEWAIAAFTQMRN----PN 347 L C + Y M ++ + + I ACS + A Q+ + Sbjct: 100 LVHCGHPHQAIIFYLHMLRAEVLPTSYTFSSLIKACSLLLDICSGEAVHGQVWKNGFGSH 159 Query: 348 IFVYNALIGALLRCSQPDQALQIYSDMLRTEINPSSYTFPSIIKSCTLVPAVKLGESIHG 527 +FV AL+ ++ ++ ++ +M + +I + + ++ L A +L + + Sbjct: 160 VFVQTALVDYYSNSNKFFESRSVFDEMPQRDIFSWTTMVLAHARAGDLCSARRLFDEMPE 219 Query: 528 QVWKYGFGLHIHVQTALMDFYSNFDRVLDSRLVFDNMPERDSFAWTTMVAAHVRFRDLNS 707 + +I ++D Y+ V + L+F+ MP RD +WTTM+ + + + Sbjct: 220 R--------NIATWNTMIDAYARLGNVRAAELLFNKMPARDIISWTTMITCYSQNKQFRE 271 Query: 708 ARKLFDKM------PEKNTASW---------------------------------NTMIH 770 A F++M P++ T + + ++ Sbjct: 272 ALDAFNEMKNSGISPDQVTMATVLSACAHLGALDLGREIHLYVMQIGFDIDVYIGSALVD 331 Query: 771 GFARMRDVESAKELFNKMPEKDLISWTTMIHCYSQNKYYKEALDIFREMKDHGISPDEVT 950 +A+ ++ + +F K+ EK+L W ++I + + + EAL +F M + P+ VT Sbjct: 332 MYAKCGSLDRSLLVFFKLREKNLFCWNSIIEGLAVHGFAHEALAMFDRMIYENVEPNGVT 391 Query: 951 MSTIISACAHLGLLDEGK 1004 +++SAC H GL++EG+ Sbjct: 392 FISVLSACTHAGLVEEGR 409 >ref|XP_006338375.1| PREDICTED: pentatricopeptide repeat-containing protein At1g06145-like isoform X1 [Solanum tuberosum] gi|565342486|ref|XP_006338376.1| PREDICTED: pentatricopeptide repeat-containing protein At1g06145-like isoform X2 [Solanum tuberosum] Length = 558 Score = 378 bits (970), Expect = e-102 Identities = 181/301 (60%), Positives = 233/301 (77%) Frame = +3 Query: 153 
SEITHLANYLKRCSSLKELESLYAFMTKSNSTRDCFLMNQFITACSKFHHVEWAIAAFTQ 332 + I + N LK CSS K+LESLY+ M K+ +T+D FLMNQFI CS ++ ++A AF+Q Sbjct: 5 NSILSIVNQLKICSSRKQLESLYSLMLKNGATKDSFLMNQFIATCSALNNPDFASFAFSQ 64 Query: 333 MRNPNIFVYNALIGALLRCSQPDQALQIYSDMLRTEINPSSYTFPSIIKSCTLVPAVKLG 512 M NPN+FVYNALI A + C P +AL +Y DMLRT+ PSSYTF S++K CTL+ ++LG Sbjct: 65 MENPNVFVYNALIRAFVHCHSPHKALLLYIDMLRTQNIPSSYTFSSVVKGCTLMCGLRLG 124 Query: 513 ESIHGQVWKYGFGLHIHVQTALMDFYSNFDRVLDSRLVFDNMPERDSFAWTTMVAAHVRF 692 E IHGQ+W+YGFG H+ VQT L+DFYSN RV +RLVFD MPERD+FAW MV+AH Sbjct: 125 ECIHGQIWEYGFGTHVFVQTGLIDFYSNLGRVDLARLVFDEMPERDNFAWAAMVSAHAGA 184 Query: 693 RDLNSARKLFDKMPEKNTASWNTMIHGFARMRDVESAKELFNKMPEKDLISWTTMIHCYS 872 DL SARKLFD+MPEK T + N MI+GFA+ DVESA+ LF +M KDLI+WTTMI+CYS Sbjct: 185 GDLGSARKLFDEMPEKITVACNAMINGFAKTGDVESAELLFKEMSRKDLIAWTTMINCYS 244 Query: 873 QNKYYKEALDIFREMKDHGISPDEVTMSTIISACAHLGLLDEGKDIHLYVIQSQFDLDVY 1052 QN+ Y A+++F +MK + I+PDEVTM+T+ISACAHLG+LD+GK++HLYV+Q FDL V+ Sbjct: 245 QNRKYGLAIEVFYDMKSNLITPDEVTMTTVISACAHLGVLDQGKEMHLYVMQKGFDLGVH 304 Query: 1053 I 1055 I Sbjct: 305 I 305 Score = 76.6 bits (187), Expect = 1e-11 Identities = 54/245 (22%), Positives = 110/245 (44%), Gaps = 9/245 (3%) Frame = +3 Query: 267 NQFITACSKFHHVEWAIAAFTQMRNPNIFVYNALIGALLRCSQPDQALQIYSDMLRTEIN 446 N I +K VE A F +M ++ + +I + + A++++ DM I Sbjct: 206 NAMINGFAKTGDVESAELLFKEMSRKDLIAWTTMINCYSQNRKYGLAIEVFYDMKSNLIT 265 Query: 447 PSSYTFPSIIKSCTLVPAVKLGESIHGQVWKYGFGLHIHVQTALMDFYSNFDRVLDSRLV 626 P T ++I +C + + G+ +H V + GF L +H+ +AL+D Y+ + S LV Sbjct: 266 PDEVTMTTVISACAHLGVLDQGKEMHLYVMQKGFDLGVHIGSALIDMYAKCGSLERSLLV 325 Query: 627 FDNMPERDSFAWTTMVAAHVRFRDLNSARKLFDKMPEK----NTASWNTMIHGFARMRDV 794 F + E++ F W +++ A LF +M ++ N ++ +++ V Sbjct: 326 FYKLREKNLFCWNSVIDGLAVHGYAEEALALFSRMEKEKVKPNGITFVSVLTACTHGGLV 385 Query: 795 ESAKELFNKMPE-----KDLISWTTMIHCYSQNKYYKEALDIFREMKDHGISPDEVTMST 959 E ++ F +M + ++ + M+ + +EAL+I R M+ + P+ V Sbjct: 386 EKGRKNFLRMTQDYGIVPEMEHYGCMVDLLCKAGLLEEALEIIRSMR---VEPNAVIWGA 442 Query: 960 IISAC 974 ++ C 
Sbjct: 443 LLGGC 447 >gb|EOX94498.1| Basic helix-loop-helix DNA-binding superfamily protein [Theobroma cacao] Length = 600 Score = 370 bits (951), Expect = e-100 Identities = 170/300 (56%), Positives = 236/300 (78%) Frame = +3 Query: 156 EITHLANYLKRCSSLKELESLYAFMTKSNSTRDCFLMNQFITACSKFHHVEWAIAAFTQM 335 +I + + +K+CS+L +LE++YA M K+N+ +DCFL NQF++AC+ F +++AI AFTQM Sbjct: 47 QIQTIVDQIKKCSNLNQLETIYATMIKTNANQDCFLTNQFVSACATFCRMDYAILAFTQM 106 Query: 336 RNPNIFVYNALIGALLRCSQPDQALQIYSDMLRTEINPSSYTFPSIIKSCTLVPAVKLGE 515 + PN+FVYNALI L+ C P QAL + MLR + PSS+TF S++K+C LV + GE Sbjct: 107 QKPNVFVYNALIKGLVHCHNPFQALDYHKHMLRAGVWPSSFTFSSLVKACGLVSELGFGE 166 Query: 516 SIHGQVWKYGFGLHIHVQTALMDFYSNFDRVLDSRLVFDNMPERDSFAWTTMVAAHVRFR 695 S+HGQVWK+GF H+ VQTAL+DFY+N + +S+ VFD MP+RD FAWTTMV+ ++ Sbjct: 167 SVHGQVWKHGFESHVFVQTALVDFYANVGKFAESKRVFDEMPDRDVFAWTTMVSGFLKAG 226 Query: 696 DLNSARKLFDKMPEKNTASWNTMIHGFARMRDVESAKELFNKMPEKDLISWTTMIHCYSQ 875 DL S+R+LFD+MPE+NTA+WN MI G+AR+ DVESA+ FN+MP KD+ISWT+MI+CYS+ Sbjct: 227 DLVSSRRLFDEMPERNTATWNAMIDGYARVGDVESAELFFNQMPVKDIISWTSMINCYSK 286 Query: 876 NKYYKEALDIFREMKDHGISPDEVTMSTIISACAHLGLLDEGKDIHLYVIQSQFDLDVYI 1055 NK ++EAL +F EM+ + +SPDEVTM+++ISACAHLG L+ GK+IH YV+Q+ F LDVYI Sbjct: 287 NKQFREALAVFEEMRRNKVSPDEVTMASVISACAHLGALNTGKEIHHYVMQNGFYLDVYI 346 Score = 81.6 bits (200), Expect = 5e-13 Identities = 58/251 (23%), Positives = 113/251 (45%), Gaps = 9/251 (3%) Frame = +3 Query: 249 RDCFLMNQFITACSKFHHVEWAIAAFTQMRNPNIFVYNALIGALLRCSQPDQALQIYSDM 428 R+ N I ++ VE A F QM +I + ++I + Q +AL ++ +M Sbjct: 241 RNTATWNAMIDGYARVGDVESAELFFNQMPVKDIISWTSMINCYSKNKQFREALAVFEEM 300 Query: 429 LRTEINPSSYTFPSIIKSCTLVPAVKLGESIHGQVWKYGFGLHIHVQTALMDFYSNFDRV 608 R +++P T S+I +C + A+ G+ IH V + GF L +++ +AL+D Y+ + Sbjct: 301 RRNKVSPDEVTMASVISACAHLGALNTGKEIHHYVMQNGFYLDVYIGSALVDMYAKCGSL 360 Query: 609 LDSRLVFDNMPERDSFAWTTMVAAHVRFRDLNSARKLFDKMP----EKNTASWNTMIHGF 776 S L F + E++ F W +++ A +FD M + N ++ +++ Sbjct: 361 ERSLLAFFKLREKNLFCWNSVIEGLAVHGYAQEALAMFDSMERHHVKPNGVTFVSVLSAC 420 Query: 777 
ARMRDVESAKELFNKMPE-----KDLISWTTMIHCYSQNKYYKEALDIFREMKDHGISPD 941 VE ++ F M ++ + M+ S+ ++AL + R MK + P+ Sbjct: 421 THAGLVEVGRQRFLSMTRDYSIPPEVEHYGCMVDLLSKAGLLEDALFLIRSMK---LEPN 477 Query: 942 EVTMSTIISAC 974 V ++ C Sbjct: 478 PVIWGALLGGC 488 >ref|XP_006420414.1| hypothetical protein CICLE_v10006642mg [Citrus clementina] gi|557522287|gb|ESR33654.1| hypothetical protein CICLE_v10006642mg [Citrus clementina] Length = 530 Score = 365 bits (936), Expect = 2e-98 Identities = 169/277 (61%), Positives = 220/277 (79%), Gaps = 1/277 (0%) Frame = +3 Query: 228 MTKSNSTRDCFLMNQFITAC-SKFHHVEWAIAAFTQMRNPNIFVYNALIGALLRCSQPDQ 404 M K+N+ +DCFL NQF++ C S+FH ++AI AFTQM+ PN+FVYNALI L+ C P Q Sbjct: 1 MVKTNANQDCFLANQFVSFCTSRFHRTDYAILAFTQMQEPNVFVYNALIRGLVHCGHPHQ 60 Query: 405 ALQIYSDMLRTEINPSSYTFPSIIKSCTLVPAVKLGESIHGQVWKYGFGLHIHVQTALMD 584 A+ Y MLR E+ P+SYTF S+IK+C+L+ + GE++HGQVWK GFG H+ VQTAL+D Sbjct: 61 AIIFYLHMLRAEVLPTSYTFSSLIKACSLLLDICSGEAVHGQVWKNGFGSHVFVQTALVD 120 Query: 585 FYSNFDRVLDSRLVFDNMPERDSFAWTTMVAAHVRFRDLNSARKLFDKMPEKNTASWNTM 764 +YSN ++ +SR VFD MP+RD F+WTTMV AH R DL SAR+LFD+MPE+N A+WNTM Sbjct: 121 YYSNSNKFFESRSVFDEMPQRDIFSWTTMVLAHARAGDLCSARRLFDEMPERNIATWNTM 180 Query: 765 IHGFARMRDVESAKELFNKMPEKDLISWTTMIHCYSQNKYYKEALDIFREMKDHGISPDE 944 I +AR+ +V++A+ LFNKMP +D+ISWTTMI CYSQN ++EALD F EMK GISPD+ Sbjct: 181 IDAYARLGNVQAAELLFNKMPARDIISWTTMITCYSQNNQFREALDAFNEMKKSGISPDQ 240 Query: 945 VTMSTIISACAHLGLLDEGKDIHLYVIQSQFDLDVYI 1055 VTM+T++SACAHLG LD G++IHLYV+Q FD+DVYI Sbjct: 241 VTMATVLSACAHLGALDLGREIHLYVMQIGFDIDVYI 277 Score = 80.1 bits (196), Expect = 1e-12 Identities = 56/251 (22%), Positives = 116/251 (46%), Gaps = 9/251 (3%) Frame = +3 Query: 249 RDCFLMNQFITACSKFHHVEWAIAAFTQMRNPNIFVYNALIGALLRCSQPDQALQIYSDM 428 R+ N I A ++ +V+ A F +M +I + +I + +Q +AL +++M Sbjct: 172 RNIATWNTMIDAYARLGNVQAAELLFNKMPARDIISWTTMITCYSQNNQFREALDAFNEM 231 Query: 429 LRTEINPSSYTFPSIIKSCTLVPAVKLGESIHGQVWKYGFGLHIHVQTALMDFYSNFDRV 608 ++ I+P T +++ +C + A+ LG IH V + GF + +++ +AL+D Y+ + Sbjct: 232 
KKSGISPDQVTMATVLSACAHLGALDLGREIHLYVMQIGFDIDVYIGSALIDMYAKCGSL 291 Query: 609 LDSRLVFDNMPERDSFAWTTMVAAHVRFRDLNSARKLFDKM----PEKNTASWNTMIHGF 776 S LVF + E++ F W +++ + A +FD+M E N ++ +++ Sbjct: 292 DRSLLVFFKLREKNLFCWNSIIEGLAAHGFAHEALAMFDRMIYENVEPNGVTFISVLSAC 351 Query: 777 ARMRDVESAKELFNKMP-----EKDLISWTTMIHCYSQNKYYKEALDIFREMKDHGISPD 941 VE + F M ++ + M+ S+ ++AL++ R K P+ Sbjct: 352 THAGLVEEGRRRFLSMTCGYSITPEVEHYGCMVDLLSKAGLLEDALELIRSSK---FQPN 408 Query: 942 EVTMSTIISAC 974 V ++ C Sbjct: 409 AVIWGALLGGC 419 Score = 70.5 bits (171), Expect = 1e-09 Identities = 63/318 (19%), Positives = 130/318 (40%), Gaps = 43/318 (13%) Frame = +3 Query: 180 LKRCSSLKELESLYAFMTKSNSTRDCFLMNQFITACSKFHHVEWAIAAFTQMRN----PN 347 L C + Y M ++ + + I ACS + A Q+ + Sbjct: 52 LVHCGHPHQAIIFYLHMLRAEVLPTSYTFSSLIKACSLLLDICSGEAVHGQVWKNGFGSH 111 Query: 348 IFVYNALIGALLRCSQPDQALQIYSDMLRTEINPSSYTFPSIIKSCTLVPAVKLGESIHG 527 +FV AL+ ++ ++ ++ +M + +I + + ++ L A +L + + Sbjct: 112 VFVQTALVDYYSNSNKFFESRSVFDEMPQRDIFSWTTMVLAHARAGDLCSARRLFDEMPE 171 Query: 528 QVWKYGFGLHIHVQTALMDFYSNFDRVLDSRLVFDNMPERDSFAWTTMVAAHVRFRDLNS 707 + +I ++D Y+ V + L+F+ MP RD +WTTM+ + + Sbjct: 172 R--------NIATWNTMIDAYARLGNVQAAELLFNKMPARDIISWTTMITCYSQNNQFRE 223 Query: 708 ARKLFDKM------PEKNTASW---------------------------------NTMIH 770 A F++M P++ T + + +I Sbjct: 224 ALDAFNEMKKSGISPDQVTMATVLSACAHLGALDLGREIHLYVMQIGFDIDVYIGSALID 283 Query: 771 GFARMRDVESAKELFNKMPEKDLISWTTMIHCYSQNKYYKEALDIFREMKDHGISPDEVT 950 +A+ ++ + +F K+ EK+L W ++I + + + EAL +F M + P+ VT Sbjct: 284 MYAKCGSLDRSLLVFFKLREKNLFCWNSIIEGLAAHGFAHEALAMFDRMIYENVEPNGVT 343 Query: 951 MSTIISACAHLGLLDEGK 1004 +++SAC H GL++EG+ Sbjct: 344 FISVLSACTHAGLVEEGR 361 >ref|XP_002301860.2| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|550345843|gb|EEE81133.2| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 933 Score = 360 bits (923), Expect = 7e-97 Identities = 163/280 (58%), Positives = 223/280 (79%) Frame = +3 Query: 216 
LYAFMTKSNSTRDCFLMNQFITACSKFHHVEWAIAAFTQMRNPNIFVYNALIGALLRCSQ 395 +YA M K+N+ +DC+LMNQFI+A S F+ +++A+ A+TQM PN+FVYNA+I ++ Q Sbjct: 1 MYAVMVKTNTNQDCYLMNQFISALSTFNRMDYAVLAYTQMEIPNVFVYNAMIKGFVQSYQ 60 Query: 396 PDQALQIYSDMLRTEINPSSYTFPSIIKSCTLVPAVKLGESIHGQVWKYGFGLHIHVQTA 575 P QAL++Y MLR ++P+SYTFPS+IK+C LV ++ E++HG VW+ GF H+ VQT+ Sbjct: 61 PVQALELYVQMLRANVSPTSYTFPSLIKACGLVSQLRFAEAVHGHVWRNGFDSHVFVQTS 120 Query: 576 LMDFYSNFDRVLDSRLVFDNMPERDSFAWTTMVAAHVRFRDLNSARKLFDKMPEKNTASW 755 L+DFYS+ R+ +S VFD MPERD FAWTTMV+ VR D++SA +LFD MP++N A+W Sbjct: 121 LVDFYSSMGRIEESVRVFDEMPERDVFAWTTMVSGLVRVGDMSSAGRLFDMMPDRNLATW 180 Query: 756 NTMIHGFARMRDVESAKELFNKMPEKDLISWTTMIHCYSQNKYYKEALDIFREMKDHGIS 935 NT+I G+AR+R+V+ A+ LFN+MP +D+ISWTTMI+CYSQNK ++EAL +F EM HGIS Sbjct: 181 NTLIDGYARLREVDVAELLFNQMPARDIISWTTMINCYSQNKRFREALGVFNEMAKHGIS 240 Query: 936 PDEVTMSTIISACAHLGLLDEGKDIHLYVIQSQFDLDVYI 1055 PDEVTM+T+ISACAHLG LD GK+IH Y++Q F+LDVYI Sbjct: 241 PDEVTMATVISACAHLGALDLGKEIHYYIMQHGFNLDVYI 280 Score = 85.9 bits (211), Expect = 2e-14 Identities = 58/251 (23%), Positives = 116/251 (46%), Gaps = 9/251 (3%) Frame = +3 Query: 249 RDCFLMNQFITACSKFHHVEWAIAAFTQMRNPNIFVYNALIGALLRCSQPDQALQIYSDM 428 R+ N I ++ V+ A F QM +I + +I + + +AL ++++M Sbjct: 175 RNLATWNTLIDGYARLREVDVAELLFNQMPARDIISWTTMINCYSQNKRFREALGVFNEM 234 Query: 429 LRTEINPSSYTFPSIIKSCTLVPAVKLGESIHGQVWKYGFGLHIHVQTALMDFYSNFDRV 608 + I+P T ++I +C + A+ LG+ IH + ++GF L +++ +AL+D Y+ + Sbjct: 235 AKHGISPDEVTMATVISACAHLGALDLGKEIHYYIMQHGFNLDVYIGSALIDMYAKCGSL 294 Query: 609 LDSRLVFDNMPERDSFAWTTMVAAHVRFRDLNSARKLFDKMPEK----NTASWNTMIHGF 776 S L+F + E++ F W +++ A +FDKM + N ++ +++ Sbjct: 295 DRSLLMFFKLREKNLFCWNSVIEGLAVHGYAEEALAMFDKMEREKIKPNGVTFVSVLSAC 354 Query: 777 ARMRDVESAKELFNKMPEKDLI-----SWTTMIHCYSQNKYYKEALDIFREMKDHGISPD 941 +E ++ F M I + M+ S+ +EAL + R MK + P+ Sbjct: 355 NHAGLIEEGRKRFASMTRDHSIPPGVEHYGCMVDLLSKAGLLEEALQLIRTMK---LEPN 411 Query: 942 EVTMSTIISAC 974 V ++S C Sbjct: 412 AVIWGALLSGC 422 Score = 82.4 bits (202), Expect = 3e-13 Identities = 68/307 
(22%), Positives = 142/307 (46%), Gaps = 44/307 (14%) Frame = +3 Query: 216 LYAFMTKSNSTRDCFLMNQFITACSKFHHVEWAIAAFTQM-RN---PNIFVYNALIGALL 383 LY M ++N + + I AC + +A A + RN ++FV +L+ Sbjct: 67 LYVQMLRANVSPTSYTFPSLIKACGLVSQLRFAEAVHGHVWRNGFDSHVFVQTSLVDFYS 126 Query: 384 RCSQPDQALQIYSDMLRTEINPSSYTFPSIIKSCTLVPAVKLGE-SIHGQVWKYGFGLHI 560 + +++++++ +M ++ + + +++ V++G+ S G+++ ++ Sbjct: 127 SMGRIEESVRVFDEMPERDV----FAWTTMVSGL-----VRVGDMSSAGRLFDMMPDRNL 177 Query: 561 HVQTALMDFYSNFDRVLDSRLVFDNMPERDSFAWTTMVAAHVRFRDLNSARKLFDKM--- 731 L+D Y+ V + L+F+ MP RD +WTTM+ + + + A +F++M Sbjct: 178 ATWNTLIDGYARLREVDVAELLFNQMPARDIISWTTMINCYSQNKRFREALGVFNEMAKH 237 Query: 732 ---PEKNTAS-------------------WNTMIHGF--------------ARMRDVESA 803 P++ T + + M HGF A+ ++ + Sbjct: 238 GISPDEVTMATVISACAHLGALDLGKEIHYYIMQHGFNLDVYIGSALIDMYAKCGSLDRS 297 Query: 804 KELFNKMPEKDLISWTTMIHCYSQNKYYKEALDIFREMKDHGISPDEVTMSTIISACAHL 983 +F K+ EK+L W ++I + + Y +EAL +F +M+ I P+ VT +++SAC H Sbjct: 298 LLMFFKLREKNLFCWNSVIEGLAVHGYAEEALAMFDKMEREKIKPNGVTFVSVLSACNHA 357 Query: 984 GLLDEGK 1004 GL++EG+ Sbjct: 358 GLIEEGR 364 >gb|EMJ01920.1| hypothetical protein PRUPE_ppa025321mg [Prunus persica] Length = 529 Score = 355 bits (910), Expect = 2e-95 Identities = 165/276 (59%), Positives = 217/276 (78%) Frame = +3 Query: 228 MTKSNSTRDCFLMNQFITACSKFHHVEWAIAAFTQMRNPNIFVYNALIGALLRCSQPDQA 407 M K+N+T+D F MNQ ITACS +++A+ AFTQ+ +PN+FVYNA+I + C P QA Sbjct: 1 MIKTNATQDSFFMNQLITACSTLSRIDYAVLAFTQIESPNVFVYNAMIKGFVCCGHPCQA 60 Query: 408 LQIYSDMLRTEINPSSYTFPSIIKSCTLVPAVKLGESIHGQVWKYGFGLHIHVQTALMDF 587 L Y +MLR + P+SYTF S+IK+CT + A+ +GE++ G +WK GFG H+ VQT+L+DF Sbjct: 61 LGCYINMLRGMVLPTSYTFSSLIKACTSLSALGVGEAVQGHIWKNGFGSHVFVQTSLIDF 120 Query: 588 YSNFDRVLDSRLVFDNMPERDSFAWTTMVAAHVRFRDLNSARKLFDKMPEKNTASWNTMI 767 YS R+ +SR VFD MPERD+FAWTTMV++HVR D++SAR LFD+M E+N +WNTMI Sbjct: 121 YSKLRRISESRKVFDEMPERDAFAWTTMVSSHVRVGDMSSARILFDEMEERNITTWNTMI 180 Query: 768 HGFARMRDVESAKELFNKMPEKDLISWTTMIHCYSQNKYYKEALDIFREMKDHGISPDEV 947 G+AR+ +VESA+ LFN MP +D+ISWTTMI 
CYSQNK + EAL +F +M+ GISPDEV Sbjct: 181 DGYARLGNVESAELLFNHMPTRDIISWTTMIDCYSQNKKFGEALAVFSDMRMKGISPDEV 240 Query: 948 TMSTIISACAHLGLLDEGKDIHLYVIQSQFDLDVYI 1055 TM+T+ISACAHLG LD GK+IHLY++Q+ FDLDVYI Sbjct: 241 TMATVISACAHLGALDLGKEIHLYILQNGFDLDVYI 276 Score = 84.0 bits (206), Expect = 9e-14 Identities = 60/252 (23%), Positives = 107/252 (42%) Frame = +3 Query: 249 RDCFLMNQFITACSKFHHVEWAIAAFTQMRNPNIFVYNALIGALLRCSQPDQALQIYSDM 428 R+ N I ++ +VE A F M +I + +I + + +AL ++SDM Sbjct: 171 RNITTWNTMIDGYARLGNVESAELLFNHMPTRDIISWTTMIDCYSQNKKFGEALAVFSDM 230 Query: 429 LRTEINPSSYTFPSIIKSCTLVPAVKLGESIHGQVWKYGFGLHIHVQTALMDFYSNFDRV 608 I+P T ++I +C + A+ LG+ IH + + GF L +++ +AL+D Y+ + Sbjct: 231 RMKGISPDEVTMATVISACAHLGALDLGKEIHLYILQNGFDLDVYIGSALIDMYAKCGAL 290 Query: 609 LDSRLVFDNMPERDSFAWTTMVAAHVRFRDLNSARKLFDKMPEKNTASWNTMIHGFARMR 788 S LVF + +++ F W N+A +HGFA+ Sbjct: 291 DRSLLVFFKLQDKNLFCW--------------------------NSAIEGLAVHGFAK-- 322 Query: 789 DVESAKELFNKMPEKDLISWTTMIHCYSQNKYYKEALDIFREMKDHGISPDEVTMSTIIS 968 EAL +F +M+ I+P+ VT +++S Sbjct: 323 ----------------------------------EALAMFSKMEREKINPNGVTFVSVLS 348 Query: 969 ACAHLGLLDEGK 1004 +C H GL++EG+ Sbjct: 349 SCTHAGLVEEGR 360 >ref|XP_003530855.2| PREDICTED: pentatricopeptide repeat-containing protein At1g06145-like [Glycine max] Length = 585 Score = 353 bits (907), Expect = 5e-95 Identities = 164/296 (55%), Positives = 222/296 (75%) Frame = +3 Query: 168 LANYLKRCSSLKELESLYAFMTKSNSTRDCFLMNQFITACSKFHHVEWAIAAFTQMRNPN 347 + ++KRC S K LES+YA M K+N+T+DCFL+NQFI+ACS + A +AF ++NPN Sbjct: 37 ILGHIKRCFSPKSLESVYASMIKTNTTQDCFLVNQFISACSNLSCINLAASAFANVQNPN 96 Query: 348 IFVYNALIGALLRCSQPDQALQIYSDMLRTEINPSSYTFPSIIKSCTLVPAVKLGESIHG 527 + V+NALI + C +QAL Y MLR + P+SY+F S+IK+CTL+ GE++HG Sbjct: 97 VLVFNALIRGCVHCCYSEQALVHYMHMLRNNVMPTSYSFSSLIKACTLLVDSAFGEAVHG 156 Query: 528 QVWKYGFGLHIHVQTALMDFYSNFDRVLDSRLVFDNMPERDSFAWTTMVAAHVRFRDLNS 707 VWK+GF H+ VQT L++FYS F V SR VFD+MPERD FAWTTM++AHVR D+ S Sbjct: 157 
HVWKHGFDSHVFVQTTLIEFYSTFGDVGGSRRVFDDMPERDVFAWTTMISAHVRDGDMAS 216 Query: 708 ARKLFDKMPEKNTASWNTMIHGFARMRDVESAKELFNKMPEKDLISWTTMIHCYSQNKYY 887 A +LFD+MPEKN A+WN MI G+ ++ + ESA+ LFN+MP +D+ISWTTM++CYS+NK Y Sbjct: 217 AGRLFDEMPEKNVATWNAMIDGYGKLGNAESAEFLFNQMPARDIISWTTMMNCYSRNKRY 276 Query: 888 KEALDIFREMKDHGISPDEVTMSTIISACAHLGLLDEGKDIHLYVIQSQFDLDVYI 1055 KE + +F ++ D G+ PDEVTM+T+ISACAHLG L GK++HLY++ FDLDVYI Sbjct: 277 KEVIALFHDVIDKGMIPDEVTMTTVISACAHLGALALGKEVHLYLVLQGFDLDVYI 332 Score = 69.3 bits (168), Expect = 2e-09 Identities = 62/313 (19%), Positives = 128/313 (40%), Gaps = 43/313 (13%) Frame = +3 Query: 219 YAFMTKSNSTRDCFLMNQFITACSKFHHVEWAIAAFTQMR----NPNIFVYNALIGALLR 386 Y M ++N + + I AC+ + A + + ++FV LI Sbjct: 120 YMHMLRNNVMPTSYSFSSLIKACTLLVDSAFGEAVHGHVWKHGFDSHVFVQTTLIEFYST 179 Query: 387 CSQPDQALQIYSDMLRTEINPSSYTFPSIIKSCTLVPAVKLGESIHGQVWKYGFGLHIHV 566 + +++ DM ++ + + ++ + A +L + + + ++ Sbjct: 180 FGDVGGSRRVFDDMPERDVFAWTTMISAHVRDGDMASAGRLFDEMPEK--------NVAT 231 Query: 567 QTALMDFYSNFDRVLDSRLVFDNMPERDSFAWTTMVAAHVRFRDLNSARKLF----DK-- 728 A++D Y + +F+ MP RD +WTTM+ + R + LF DK Sbjct: 232 WNAMIDGYGKLGNAESAEFLFNQMPARDIISWTTMMNCYSRNKRYKEVIALFHDVIDKGM 291 Query: 729 MPEKNTASW---------------------------------NTMIHGFARMRDVESAKE 809 +P++ T + +++I +A+ ++ A Sbjct: 292 IPDEVTMTTVISACAHLGALALGKEVHLYLVLQGFDLDVYIGSSLIDMYAKCGSIDMALL 351 Query: 810 LFNKMPEKDLISWTTMIHCYSQNKYYKEALDIFREMKDHGISPDEVTMSTIISACAHLGL 989 +F K+ K+L W +I + + Y +EAL +F EM+ I P+ VT +I++AC H G Sbjct: 352 VFYKLQTKNLFCWNCIIDGLATHGYVEEALRMFGEMERKRIRPNAVTFISILTACTHAGF 411 Query: 990 LDEGKDIHLYVIQ 1028 ++EG+ + ++Q Sbjct: 412 IEEGRRWFMSMVQ 424 Score = 68.6 bits (166), Expect = 4e-09 Identities = 47/245 (19%), Positives = 107/245 (43%), Gaps = 9/245 (3%) Frame = +3 Query: 267 NQFITACSKFHHVEWAIAAFTQMRNPNIFVYNALIGALLRCSQPDQALQIYSDMLRTEIN 446 N I K + E A F QM +I + ++ R + + + ++ D++ + Sbjct: 233 NAMIDGYGKLGNAESAEFLFNQMPARDIISWTTMMNCYSRNKRYKEVIALFHDVIDKGMI 292 Query: 447 
PSSYTFPSIIKSCTLVPAVKLGESIHGQVWKYGFGLHIHVQTALMDFYSNFDRVLDSRLV 626 P T ++I +C + A+ LG+ +H + GF L +++ ++L+D Y+ + + LV Sbjct: 293 PDEVTMTTVISACAHLGALALGKEVHLYLVLQGFDLDVYIGSSLIDMYAKCGSIDMALLV 352 Query: 627 FDNMPERDSFAWTTMVAAHVRFRDLNSARKLFDKMPEK----NTASWNTMIHGFARMRDV 794 F + ++ F W ++ + A ++F +M K N ++ +++ + Sbjct: 353 FYKLQTKNLFCWNCIIDGLATHGYVEEALRMFGEMERKRIRPNAVTFISILTACTHAGFI 412 Query: 795 ESAKELFNKMPEKDLIS-----WTTMIHCYSQNKYYKEALDIFREMKDHGISPDEVTMST 959 E + F M + I+ + M+ S+ ++AL++ R M + P+ Sbjct: 413 EEGRRWFMSMVQDYCIAPQVEHYGCMVDLLSKAGLLEDALEMIRNMT---VEPNSFIWGA 469 Query: 960 IISAC 974 +++ C Sbjct: 470 LLNGC 474 >gb|EXB75130.1| hypothetical protein L484_025905 [Morus notabilis] Length = 554 Score = 352 bits (903), Expect = 1e-94 Identities = 163/296 (55%), Positives = 225/296 (76%) Frame = +3 Query: 168 LANYLKRCSSLKELESLYAFMTKSNSTRDCFLMNQFITACSKFHHVEWAIAAFTQMRNPN 347 +A +K+CS L ELE +YA M K+ +T+D L NQFI+A S F V++A+ AF Q+ NPN Sbjct: 9 VAERIKKCSKLTELEHVYASMIKTGATQDPLLTNQFISASSNFSRVDYAVLAFKQIENPN 68 Query: 348 IFVYNALIGALLRCSQPDQALQIYSDMLRTEINPSSYTFPSIIKSCTLVPAVKLGESIHG 527 +FVYNA+I + P QAL+ Y DM+R +++P+SYTFPS+I++CTL+ GE++HG Sbjct: 69 VFVYNAMIRGYVNDGYPYQALECYVDMMRAKVSPTSYTFPSLIRACTLLFVPGFGEAVHG 128 Query: 528 QVWKYGFGLHIHVQTALMDFYSNFDRVLDSRLVFDNMPERDSFAWTTMVAAHVRFRDLNS 707 +W+ G H++VQTA++DFYS R+ DSR VFD M ERD+FAWTTM++AH R D++ Sbjct: 129 HIWRNGLDSHVYVQTAMVDFYSKLSRIKDSRRVFDEMSERDAFAWTTMISAHARAGDMDC 188 Query: 708 ARKLFDKMPEKNTASWNTMIHGFARMRDVESAKELFNKMPEKDLISWTTMIHCYSQNKYY 887 A KLF++M EKNT +WN+MI GFAR+ ++ESA+ LF++MP +D ISWTTMI CYS NK + Sbjct: 189 AAKLFERMSEKNTTTWNSMIDGFARLGNLESAELLFHQMPARDTISWTTMITCYSHNKKH 248 Query: 888 KEALDIFREMKDHGISPDEVTMSTIISACAHLGLLDEGKDIHLYVIQSQFDLDVYI 1055 +EAL F EM +GISPD VTM+T++SACAHLG L+ GK++HLYV+Q+ F LDV+I Sbjct: 249 REALAAFEEMTMNGISPDGVTMATVVSACAHLGALELGKEMHLYVMQNGFHLDVFI 304 Score = 77.0 bits (188), Expect = 1e-11 Identities = 56/262 (21%), Positives = 117/262 (44%), Gaps = 9/262 (3%) Frame = +3 Query: 216 
LYAFMTKSNSTRDCFLMNQFITACSKFHHVEWAIAAFTQMRNPNIFVYNALIGALLRCSQ 395 L+ M++ N+T N I ++ ++E A F QM + + +I + Sbjct: 192 LFERMSEKNTTT----WNSMIDGFARLGNLESAELLFHQMPARDTISWTTMITCYSHNKK 247 Query: 396 PDQALQIYSDMLRTEINPSSYTFPSIIKSCTLVPAVKLGESIHGQVWKYGFGLHIHVQTA 575 +AL + +M I+P T +++ +C + A++LG+ +H V + GF L + + +A Sbjct: 248 HREALAAFEEMTMNGISPDGVTMATVVSACAHLGALELGKEMHLYVMQNGFHLDVFIGSA 307 Query: 576 LMDFYSNFDRVLDSRLVFDNMPERDSFAWTTMVAAHVRFRDLNSARKLFDKMPEKNT--- 746 L+D Y+ + + LVF + +++ F W +++ + KM EKN Sbjct: 308 LIDMYAKCGALDRALLVFFKLRDKNLFCWNSIIEGLAAHGYAEETLAMLSKMEEKNIKPN 367 Query: 747 -ASWNTMIHGFARMRDVESAKELFNKMPEKDLIS-----WTTMIHCYSQNKYYKEALDIF 908 ++ +++ V+ ++ F M I+ + M+ S+ +EALD+ Sbjct: 368 GVTFVSVLSACTHAGLVQEGRKRFLSMTNDYSITPGVEHYGCMVDLLSKAGLLEEALDLI 427 Query: 909 REMKDHGISPDEVTMSTIISAC 974 R MK ++P+ + ++ C Sbjct: 428 RSMK---VTPNSIIWGALLGGC 446 >ref|XP_004152039.1| PREDICTED: pentatricopeptide repeat-containing protein At1g06145-like [Cucumis sativus] Length = 697 Score = 345 bits (885), Expect = 2e-92 Identities = 166/297 (55%), Positives = 224/297 (75%), Gaps = 1/297 (0%) Frame = +3 Query: 168 LANYLKRCSSLKELESLYAFMTKSNSTRDCFLMNQFITACSKFHHVEWAIAAFTQMRNPN 347 L N +K CS++ EL L A M K+N+ +DCFL++QFI+A + V + + AFTQM NPN Sbjct: 139 LLNRIKNCSTINELHGLCASMIKTNAIQDCFLVHQFISASFALNSVHYPVFAFTQMENPN 198 Query: 348 IFVYNALIGALLRCSQPDQALQIYSDMLR-TEINPSSYTFPSIIKSCTLVPAVKLGESIH 524 +FVYNA+I + C P +ALQ Y ML + + P+SYTF S++K+CT + AV+LG+ +H Sbjct: 199 VFVYNAMIKGFVYCGYPFRALQCYVHMLEESNVLPTSYTFSSLVKACTFMCAVELGQMVH 258 Query: 525 GQVWKYGFGLHIHVQTALMDFYSNFDRVLDSRLVFDNMPERDSFAWTTMVAAHVRFRDLN 704 +WK GF H+ VQTAL+DFYS + + ++R VFD M ERD+FAWT MV+A R D++ Sbjct: 259 CHIWKKGFESHLFVQTALVDFYSKLEILSEARKVFDEMCERDAFAWTAMVSALARVGDMD 318 Query: 705 SARKLFDKMPEKNTASWNTMIHGFARMRDVESAKELFNKMPEKDLISWTTMIHCYSQNKY 884 SARKLF++MPE+NTA+WNTMI G+AR+ +VESA+ LFN+MP KD+ISWTTMI CYSQNK Sbjct: 319 SARKLFEEMPERNTATWNTMIDGYARLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQ 378 Query: 885 
YKEALDIFREMKDHGISPDEVTMSTIISACAHLGLLDEGKDIHLYVIQSQFDLDVYI 1055 Y++AL I+ EM+ +GI PDEVTMST+ SACAH+G L+ GK+IH YV+ +LDVYI Sbjct: 379 YQDALAIYSEMRLNGIIPDEVTMSTVASACAHIGALELGKEIHHYVMSQGLNLDVYI 435 Score = 86.3 bits (212), Expect = 2e-14 Identities = 70/281 (24%), Positives = 113/281 (40%) Frame = +3 Query: 162 THLANYLKRCSSLKELESLYAFMTKSNSTRDCFLMNQFITACSKFHHVEWAIAAFTQMRN 341 T + + L R + L+ M + N+ N I ++ +VE A F QM Sbjct: 305 TAMVSALARVGDMDSARKLFEEMPERNTAT----WNTMIDGYARLGNVESAELLFNQMPT 360 Query: 342 PNIFVYNALIGALLRCSQPDQALQIYSDMLRTEINPSSYTFPSIIKSCTLVPAVKLGESI 521 +I + +I + Q AL IYS+M I P T ++ +C + A++LG+ I Sbjct: 361 KDIISWTTMITCYSQNKQYQDALAIYSEMRLNGIIPDEVTMSTVASACAHIGALELGKEI 420 Query: 522 HGQVWKYGFGLHIHVQTALMDFYSNFDRVLDSRLVFDNMPERDSFAWTTMVAAHVRFRDL 701 H V G L +++ +AL+D Y+ + S L+F Sbjct: 421 HHYVMSQGLNLDVYIGSALVDMYAKCGSLDLSLLIF------------------------ 456 Query: 702 NSARKLFDKMPEKNTASWNTMIHGFARMRDVESAKELFNKMPEKDLISWTTMIHCYSQNK 881 K+ +KN WN +I G A +H Y++ Sbjct: 457 -------FKLTDKNLYCWNAVIEGLA--------------------------VHGYAE-- 481 Query: 882 YYKEALDIFREMKDHGISPDEVTMSTIISACAHLGLLDEGK 1004 +AL +F M+ I P+ VT +I+SAC H GL+DEG+ Sbjct: 482 ---KALRMFAIMEREKIMPNGVTFISILSACTHAGLVDEGR 519 >gb|ESW06293.1| hypothetical protein PHAVU_010G035600g [Phaseolus vulgaris] Length = 558 Score = 345 bits (884), Expect = 2e-92 Identities = 163/296 (55%), Positives = 217/296 (73%) Frame = +3 Query: 168 LANYLKRCSSLKELESLYAFMTKSNSTRDCFLMNQFITACSKFHHVEWAIAAFTQMRNPN 347 + ++KRC + K LES+YA M K+N+T+DCFLMNQFI++CS +V+ A + F M NPN Sbjct: 10 IRGHIKRCMTQKSLESVYACMIKTNTTQDCFLMNQFISSCSALSYVDLASSTFAHMENPN 69 Query: 348 IFVYNALIGALLRCSQPDQALQIYSDMLRTEINPSSYTFPSIIKSCTLVPAVKLGESIHG 527 FVYNALI C PD+AL Y MLR + P+SY+F S+IK+CTL+ G+++HG Sbjct: 70 AFVYNALIRGCGHCCYPDRALGFYIHMLRNNVMPNSYSFSSLIKACTLLMDSAFGKAVHG 129 Query: 528 QVWKYGFGLHIHVQTALMDFYSNFDRVLDSRLVFDNMPERDSFAWTTMVAAHVRFRDLNS 707 +WK GF H+ VQT L++FYS V SR VFD+MPERD FAWTTM++A VR D+ S Sbjct: 130 HIWKNGFDSHMFVQTTLIEFYSTLGDVSGSRRVFDDMPERDVFAWTTMISALVRDGDMAS 189 
Query: 708 ARKLFDKMPEKNTASWNTMIHGFARMRDVESAKELFNKMPEKDLISWTTMIHCYSQNKYY 887 A LFD+MPEKN A+WN MI G A++ + ESA+ LFN+M +D+ISWTTM+ C+S+NK Y Sbjct: 190 AGNLFDEMPEKNIATWNAMIDGHAKLGNAESAEFLFNQMLARDIISWTTMMSCFSRNKRY 249 Query: 888 KEALDIFREMKDHGISPDEVTMSTIISACAHLGLLDEGKDIHLYVIQSQFDLDVYI 1055 + + +F +M D G+ PDEVTMST+ISACAHLG LD GK++HLY++ +FDLDVYI Sbjct: 250 MDVVRLFHDMIDKGMIPDEVTMSTVISACAHLGALDLGKEVHLYLMLHEFDLDVYI 305 Score = 68.2 bits (165), Expect = 5e-09 Identities = 55/245 (22%), Positives = 98/245 (40%) Frame = +3 Query: 267 NQFITACSKFHHVEWAIAAFTQMRNPNIFVYNALIGALLRCSQPDQALQIYSDMLRTEIN 446 N I +K + E A F QM +I + ++ R + ++++ DM+ + Sbjct: 206 NAMIDGHAKLGNAESAEFLFNQMLARDIISWTTMMSCFSRNKRYMDVVRLFHDMIDKGMI 265 Query: 447 PSSYTFPSIIKSCTLVPAVKLGESIHGQVWKYGFGLHIHVQTALMDFYSNFDRVLDSRLV 626 P T ++I +C + A+ LG+ +H + + F L +++ ++L+D Y+ + + LV Sbjct: 266 PDEVTMSTVISACAHLGALDLGKEVHLYLMLHEFDLDVYIGSSLIDMYAKCGSIDRALLV 325 Query: 627 FDNMPERDSFAWTTMVAAHVRFRDLNSARKLFDKMPEKNTASWNTMIHGFARMRDVESAK 806 F K+ KN WN++I G A Sbjct: 326 F-------------------------------YKLQNKNLYCWNSIIDGLA--------- 345 Query: 807 ELFNKMPEKDLISWTTMIHCYSQNKYYKEALDIFREMKDHGISPDEVTMSTIISACAHLG 986 H Y++ EAL +F M+ I P+ VT +I+SAC H G Sbjct: 346 -----------------THGYAK-----EALRMFGAMESKRIRPNAVTFISILSACTHTG 383 Query: 987 LLDEG 1001 ++EG Sbjct: 384 FVEEG 388 >ref|XP_004165913.1| PREDICTED: pentatricopeptide repeat-containing protein At1g06145-like [Cucumis sativus] Length = 600 Score = 342 bits (878), Expect = 1e-91 Identities = 164/297 (55%), Positives = 223/297 (75%), Gaps = 1/297 (0%) Frame = +3 Query: 168 LANYLKRCSSLKELESLYAFMTKSNSTRDCFLMNQFITACSKFHHVEWAIAAFTQMRNPN 347 L N +K CS++ EL L A M K+N+ +DCFL++QFI+A + V + + AFTQM NPN Sbjct: 42 LLNRIKNCSTINELHGLCASMIKTNAIQDCFLVHQFISASFALNSVHYPVFAFTQMENPN 101 Query: 348 IFVYNALIGALLRCSQPDQALQIYSDMLR-TEINPSSYTFPSIIKSCTLVPAVKLGESIH 524 +FVYNA+I + C P +ALQ Y ML + + P+SYTF S++K+CT + AV+LG+ +H Sbjct: 102 VFVYNAMIKGFVYCGYPFRALQCYVHMLEESNVLPTSYTFSSLVKACTFMCAVELGQMVH 161 Query: 525 
GQVWKYGFGLHIHVQTALMDFYSNFDRVLDSRLVFDNMPERDSFAWTTMVAAHVRFRDLN 704 +WK GF H+ VQTAL+DFYS + + ++R VFD M ERD+FAWT M++A R D++ Sbjct: 162 CHIWKKGFESHLFVQTALVDFYSKLEILSEARKVFDEMCERDAFAWTAMLSALARVGDMD 221 Query: 705 SARKLFDKMPEKNTASWNTMIHGFARMRDVESAKELFNKMPEKDLISWTTMIHCYSQNKY 884 SARKLF++MPE+NTA+WNTMI G+ R+ +VESA+ LFN+MP KD+ISWTTMI CYSQNK Sbjct: 222 SARKLFEEMPERNTATWNTMIDGYTRLGNVESAELLFNQMPTKDIISWTTMITCYSQNKQ 281 Query: 885 YKEALDIFREMKDHGISPDEVTMSTIISACAHLGLLDEGKDIHLYVIQSQFDLDVYI 1055 Y++AL I+ EM+ +GI PDEVTMST+ SACAH+G L+ GK+IH YV+ +LDVYI Sbjct: 282 YQDALAIYSEMRLNGIIPDEVTMSTVASACAHIGALELGKEIHHYVMSQGLNLDVYI 338 Score = 85.9 bits (211), Expect = 2e-14 Identities = 70/281 (24%), Positives = 113/281 (40%) Frame = +3 Query: 162 THLANYLKRCSSLKELESLYAFMTKSNSTRDCFLMNQFITACSKFHHVEWAIAAFTQMRN 341 T + + L R + L+ M + N+ N I ++ +VE A F QM Sbjct: 208 TAMLSALARVGDMDSARKLFEEMPERNTAT----WNTMIDGYTRLGNVESAELLFNQMPT 263 Query: 342 PNIFVYNALIGALLRCSQPDQALQIYSDMLRTEINPSSYTFPSIIKSCTLVPAVKLGESI 521 +I + +I + Q AL IYS+M I P T ++ +C + A++LG+ I Sbjct: 264 KDIISWTTMITCYSQNKQYQDALAIYSEMRLNGIIPDEVTMSTVASACAHIGALELGKEI 323 Query: 522 HGQVWKYGFGLHIHVQTALMDFYSNFDRVLDSRLVFDNMPERDSFAWTTMVAAHVRFRDL 701 H V G L +++ +AL+D Y+ + S L+F Sbjct: 324 HHYVMSQGLNLDVYIGSALVDMYAKCGSLDLSLLIF------------------------ 359 Query: 702 NSARKLFDKMPEKNTASWNTMIHGFARMRDVESAKELFNKMPEKDLISWTTMIHCYSQNK 881 K+ +KN WN +I G A +H Y++ Sbjct: 360 -------FKLTDKNLYCWNAVIEGLA--------------------------VHGYAE-- 384 Query: 882 YYKEALDIFREMKDHGISPDEVTMSTIISACAHLGLLDEGK 1004 +AL +F M+ I P+ VT +I+SAC H GL+DEG+ Sbjct: 385 ---KALRMFAIMEREKIMPNGVTFISILSACTHAGLVDEGR 422 >emb|CBI20254.3| unnamed protein product [Vitis vinifera] Length = 494 Score = 338 bits (868), Expect = 2e-90 Identities = 157/315 (49%), Positives = 221/315 (70%), Gaps = 31/315 (9%) Frame = +3 Query: 153 SEITHLANYLKRCSSLKELESLYAFMTKSNSTRDCFLMNQFITACSKFHHVEWAIAAFTQ 332 + HLA +K+CS +KELES+YA M K+N+ +DCFLMNQFI ACS FH +++AI AFT Sbjct: 10 
TNFNHLAQQIKKCSMVKELESVYASMIKANANQDCFLMNQFIAACSIFHRIDYAILAFTH 69 Query: 333 MRNPNIFVYNALIGALLRCSQPDQALQIYSDMLRTEINPSSYTFPSIIKSCTLVPAVKLG 512 M+ PN+FVYNA+I AL++C P QAL Y DM++ +++P+S+TF S++K+C+LV + G Sbjct: 70 MQEPNVFVYNAMIRALVQCYHPVQALDCYLDMVQAQVSPTSFTFSSLVKACSLVSELGFG 129 Query: 513 ESIHGQVWKYGFGLHIHVQTALMDFYSNFDRVLDSRLVFDNMPERDSFAWTTMVAAHVRF 692 E++HG +WKYGF H+ VQTAL+DFY N +++++R VFD M ERD FAWTTM++ H R Sbjct: 130 EAVHGHIWKYGFDSHVFVQTALVDFYGNAGKIVEARRVFDEMSERDVFAWTTMISVHART 189 Query: 693 RDLNSARKLFDKMPEKNTASWNTMIHGFARMRDVESAKELFNKMPEKDLISWTTMIHCYS 872 D++SAR+LFD+MP +NTASWN MI G++R+R+VESA+ LF++MP +D+ISWTTMI CYS Sbjct: 190 GDMSSARQLFDEMPVRNTASWNAMIDGYSRLRNVESAELLFSQMPNRDIISWTTMIACYS 249 Query: 873 QNK-------------------------------YYKEALDIFREMKDHGISPDEVTMST 959 QNK Y +EAL +F M+ I P+ VT + Sbjct: 250 QNKHLDKSLVVFFKLRKKNLFCWNSIIEGLAVHGYAEEALAMFSRMQREKIKPNGVTFIS 309 Query: 960 IISACAHLGLLDEGK 1004 ++ AC H GL++EG+ Sbjct: 310 VLGACTHAGLVEEGR 324 >ref|XP_004511192.1| PREDICTED: pentatricopeptide repeat-containing protein At1g06145-like [Cicer arietinum] Length = 1049 Score = 331 bits (849), Expect = 3e-88 Identities = 164/316 (51%), Positives = 225/316 (71%), Gaps = 3/316 (0%) Frame = +3 Query: 117 LQHVGHRLIPQHSEITHLANYLKRCSSLKEL-ESLYAFMTKSNSTRDCFLMNQFITACSK 293 +QH + + HS + +++K+CS K L ES+YA M K+N +DCFLMNQFITA S Sbjct: 482 VQHYNNNNV--HSLKDTILSHIKQCSGAKPLLESIYATMIKTNFNQDCFLMNQFITASSI 539 Query: 294 FHHVEWAIAAFTQMRNPNIFVYNALIGALLRCSQPDQALQIYSDMLRTEINPSSYTFPSI 473 H+ A + FTQ++ PN VYNALI A + C +AL Y ML+ + PSSY+F S+ Sbjct: 540 SSHINLATSTFTQIKKPNTLVYNALIKACVHCHSSHKALLHYIHMLQNGVVPSSYSFSSL 599 Query: 474 IKSCTLVPAVKLGESIHGQVWKYGFGLHIHVQTALMDFYSNFDRVLDSRLVFDNMPERDS 653 IK+CTL+ G+++HG VWK GF H+ VQT L++FYSN +V DSR VFD M ERD Sbjct: 600 IKACTLLTDHVNGKTLHGHVWKNGFSTHVFVQTTLVEFYSNLGQVCDSRKVFDEMSERDV 659 Query: 654 FAWTTMVAAHVRFRDLNSARKLFDKMPE-KNTASWNTMIHGFARMRDVESAKELFNKMPE 830 +AWTTM++AHVR D+ SA KLFD+MPE KNTA+WN +I G+A++ D+E + LF+K+P Sbjct: 660 
YAWTTMISAHVRNNDVESAEKLFDEMPERKNTATWNVVIDGYAKLGDIERVEVLFSKIPS 719 Query: 831 KDLISWTTMIHCYSQNKYYKEALDIFREMKDHG-ISPDEVTMSTIISACAHLGLLDEGKD 1007 KD+ISWTT+++CYS+NK Y E + +F EM + G + PDEVT++T+ISACAHLG L GK+ Sbjct: 720 KDIISWTTLMNCYSKNKRYGEVVKLFHEMVNEGMVFPDEVTITTVISACAHLGALGLGKE 779 Query: 1008 IHLYVIQSQFDLDVYI 1055 +H Y++ + F LDVYI Sbjct: 780 VHFYLMVNGFGLDVYI 795 >ref|XP_002892322.1| hypothetical protein ARALYDRAFT_311694 [Arabidopsis lyrata subsp. lyrata] gi|297338164|gb|EFH68581.1| hypothetical protein ARALYDRAFT_311694 [Arabidopsis lyrata subsp. lyrata] Length = 1329 Score = 330 bits (846), Expect = 6e-88 Identities = 153/296 (51%), Positives = 219/296 (73%) Frame = +3 Query: 168 LANYLKRCSSLKELESLYAFMTKSNSTRDCFLMNQFITACSKFHHVEWAIAAFTQMRNPN 347 L +K+CS+ K LES A M K++ T++C+LMNQFITACS F+ ++ A++ TQM+ PN Sbjct: 783 LKQIIKQCSTPKLLESALAAMIKTSQTQNCYLMNQFITACSSFNRLDLAVSFMTQMQKPN 842 Query: 348 IFVYNALIGALLRCSQPDQALQIYSDMLRTEINPSSYTFPSIIKSCTLVPAVKLGESIHG 527 +FVYNALI + CS P ++L+ Y MLR ++PSSYT+ S++++ A GES+ Sbjct: 843 VFVYNALIKGFVTCSHPIRSLEFYVRMLRDSVSPSSYTYSSLVQASAF--ASGFGESLQA 900 Query: 528 QVWKYGFGLHIHVQTALMDFYSNFDRVLDSRLVFDNMPERDSFAWTTMVAAHVRFRDLNS 707 +WK+GFG H+ +QT L+ FYS R+ ++R VFD MPERD WTTMV+A+ + D++S Sbjct: 901 HIWKFGFGFHVQIQTTLIGFYSASGRIREARKVFDEMPERDDVTWTTMVSAYRQVLDMDS 960 Query: 708 ARKLFDKMPEKNTASWNTMIHGFARMRDVESAKELFNKMPEKDLISWTTMIHCYSQNKYY 887 A L ++MPEKN A+WN +I G+ R+ ++E A+ LFN+MP KD+ISWTTMI+ YS+NK Y Sbjct: 961 ANSLANQMPEKNEATWNCLIDGYTRLGNLELAESLFNQMPVKDIISWTTMINGYSRNKRY 1020 Query: 888 KEALDIFREMKDHGISPDEVTMSTIISACAHLGLLDEGKDIHLYVIQSQFDLDVYI 1055 +EA+ +F +M + GI PDEVTMST+ISACAHLG+L+ GK++H+Y +Q+ F LDVYI Sbjct: 1021 REAIAVFYKMMEEGIIPDEVTMSTVISACAHLGVLEIGKEVHMYTVQNGFVLDVYI 1076 Score = 84.3 bits (207), Expect = 7e-14 Identities = 54/245 (22%), Positives = 114/245 (46%), Gaps = 9/245 (3%) Frame = +3 Query: 267 NQFITACSKFHHVEWAIAAFTQMRNPNIFVYNALIGALLRCSQPDQALQIYSDMLRTEIN 446 N I ++ ++E A + F QM +I + +I R + +A+ ++ M+ I Sbjct: 977 
NCLIDGYTRLGNLELAESLFNQMPVKDIISWTTMINGYSRNKRYREAIAVFYKMMEEGII 1036 Query: 447 PSSYTFPSIIKSCTLVPAVKLGESIHGQVWKYGFGLHIHVQTALMDFYSNFDRVLDSRLV 626 P T ++I +C + +++G+ +H + GF L +++ +AL+D YS + + LV Sbjct: 1037 PDEVTMSTVISACAHLGVLEIGKEVHMYTVQNGFVLDVYIGSALVDMYSKCGSLERALLV 1096 Query: 627 FDNMPERDSFAWTTMVAAHVRFRDLNSARKLFDKMP----EKNTASWNTMIHGFARMRDV 794 F N+P+++ F W +++ A K+F KM + NT ++ ++ V Sbjct: 1097 FFNLPKKNLFCWNSIIEGLAAHGFAQEALKMFAKMEMESVKPNTVTFVSVFTACTHAGLV 1156 Query: 795 ESAKELFNKMPE-----KDLISWTTMIHCYSQNKYYKEALDIFREMKDHGISPDEVTMST 959 E + ++ M + ++ + M+H +S+ EAL++ M+ P+ V Sbjct: 1157 EEGRRIYRSMIDDYSIVSNVEHYGCMVHLFSKAGLIYEALELIGSME---FEPNAVIWGA 1213 Query: 960 IISAC 974 ++ C Sbjct: 1214 LLDGC 1218 >ref|XP_004292199.1| PREDICTED: pentatricopeptide repeat-containing protein At1g06145-like [Fragaria vesca subsp. vesca] Length = 532 Score = 330 bits (845), Expect = 7e-88 Identities = 159/279 (56%), Positives = 206/279 (73%), Gaps = 3/279 (1%) Frame = +3 Query: 228 MTKSNSTRDCFLMNQFITACSKFHHVEWAIAAFTQMRNPNIFVYNALIGALLRCSQPDQA 407 M K+N+ +D F NQ ITA S + A AF+ ++NPN FVYNA+I A + C P Q Sbjct: 1 MIKTNTIQDSFFTNQLITASSSLSRLNHAALAFSHIQNPNAFVYNAMIKASVHCGHPFQG 60 Query: 408 LQIYSDMLRTEINPSSYTFPSIIKSCTLVPAVKLGESIHGQVWKYGFGLHIHVQTALMDF 587 L + +MLR + P+SYT+PS+IK+C V + GE +HG+VWK GF H++VQTAL+D Sbjct: 61 LLCFINMLRNRVFPTSYTYPSLIKACASVSVMGFGEGVHGRVWKTGFDSHVYVQTALIDL 120 Query: 588 YSNFDRVLDSRLVFDNMPERDSFAWTTMVAAHVRFRDLNSARKLFDKMPEK---NTASWN 758 YS RV D+R VFD MP+RD FAWTTMVA+HVR D++SAR LFD+M E+ N A+WN Sbjct: 121 YSKLGRVGDARKVFDEMPDRDGFAWTTMVASHVRVGDMSSARVLFDEMLERCIANAATWN 180 Query: 759 TMIHGFARMRDVESAKELFNKMPEKDLISWTTMIHCYSQNKYYKEALDIFREMKDHGISP 938 TMI G+AR+ DVESA LF++MP +DLISWT MI+CY QNK + EAL +F EM+ +G+SP Sbjct: 181 TMIDGYARLGDVESAGMLFDQMPARDLISWTAMINCYCQNKRFGEALAVFDEMRINGVSP 240 Query: 939 DEVTMSTIISACAHLGLLDEGKDIHLYVIQSQFDLDVYI 1055 D VTMST++SACAHLG LD GK+IH YV+++ FDLDVYI Sbjct: 241 DAVTMSTVVSACAHLGALDLGKEIHYYVMRNGFDLDVYI 279 Score = 88.2 bits (217), Expect = 5e-15 Identities = 57/245 
(23%), Positives = 115/245 (46%), Gaps = 9/245 (3%) Frame = +3 Query: 267 NQFITACSKFHHVEWAIAAFTQMRNPNIFVYNALIGALLRCSQPDQALQIYSDMLRTEIN 446 N I ++ VE A F QM ++ + A+I + + +AL ++ +M ++ Sbjct: 180 NTMIDGYARLGDVESAGMLFDQMPARDLISWTAMINCYCQNKRFGEALAVFDEMRINGVS 239 Query: 447 PSSYTFPSIIKSCTLVPAVKLGESIHGQVWKYGFGLHIHVQTALMDFYSNFDRVLDSRLV 626 P + T +++ +C + A+ LG+ IH V + GF L +++ +AL+D Y+ + + +V Sbjct: 240 PDAVTMSTVVSACAHLGALDLGKEIHYYVMRNGFDLDVYIGSALIDMYAKCGALDRALVV 299 Query: 627 FDNMPERDSFAWTTMVAAHVRFRDLNSARKLFDKMPEK----NTASWNTMIHGFARMRDV 794 F N+ E++ F W +++ D A +F KM + N ++ +++ V Sbjct: 300 FFNLREKNLFCWNSVIEGLAAHGDAEKALAMFSKMAREKIKPNGVTFVSVLSACTHAGLV 359 Query: 795 ESAKELFNKMPEKDLIS-----WTTMIHCYSQNKYYKEALDIFREMKDHGISPDEVTMST 959 E + F+ M + IS + M+ S+ +AL++ R MK + P+ V Sbjct: 360 EEGRRRFSSMTQDYSISPGAEHYGCMVDLLSRAGLLDDALELIRSMK---LKPNSVIWGA 416 Query: 960 IISAC 974 ++ C Sbjct: 417 LLGGC 421 >sp|Q56X05.2|PPR15_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At1g06145; AltName: Full=Protein EMBRYO DEFECTIVE 1444 Length = 577 Score = 325 bits (833), Expect = 2e-86 Identities = 153/297 (51%), Positives = 218/297 (73%) Frame = +3 Query: 165 HLANYLKRCSSLKELESLYAFMTKSNSTRDCFLMNQFITACSKFHHVEWAIAAFTQMRNP 344 +L +K+CS+ K LES A M K++ +DC LMNQFITAC+ F ++ A++ TQM+ P Sbjct: 30 NLKKIIKQCSTPKLLESALAAMIKTSLNQDCRLMNQFITACTSFKRLDLAVSTMTQMQEP 89 Query: 345 NIFVYNALIGALLRCSQPDQALQIYSDMLRTEINPSSYTFPSIIKSCTLVPAVKLGESIH 524 N+FVYNAL + CS P ++L++Y MLR ++PSSYT+ S++K+ + A + GES+ Sbjct: 90 NVFVYNALFKGFVTCSHPIRSLELYVRMLRDSVSPSSYTYSSLVKASSF--ASRFGESLQ 147 Query: 525 GQVWKYGFGLHIHVQTALMDFYSNFDRVLDSRLVFDNMPERDSFAWTTMVAAHVRFRDLN 704 +WK+GFG H+ +QT L+DFYS R+ ++R VFD MPERD AWTTMV+A+ R D++ Sbjct: 148 AHIWKFGFGFHVKIQTTLIDFYSATGRIREARKVFDEMPERDDIAWTTMVSAYRRVLDMD 207 Query: 705 SARKLFDKMPEKNTASWNTMIHGFARMRDVESAKELFNKMPEKDLISWTTMIHCYSQNKY 884 SA L ++M EKN A+ N +I+G+ + ++E A+ LFN+MP KD+ISWTTMI YSQNK Sbjct: 208 SANSLANQMSEKNEATSNCLINGYMGLGNLEQAESLFNQMPVKDIISWTTMIKGYSQNKR 267 Query: 885 
YKEALDIFREMKDHGISPDEVTMSTIISACAHLGLLDEGKDIHLYVIQSQFDLDVYI 1055 Y+EA+ +F +M + GI PDEVTMST+ISACAHLG+L+ GK++H+Y +Q+ F LDVYI Sbjct: 268 YREAIAVFYKMMEEGIIPDEVTMSTVISACAHLGVLEIGKEVHMYTLQNGFVLDVYI 324 Score = 81.6 bits (200), Expect = 5e-13 Identities = 59/289 (20%), Positives = 131/289 (45%), Gaps = 11/289 (3%) Frame = +3 Query: 141 IPQHSEI--THLANYLKRCSSLKELESLYAFMTKSNSTRDCFLMNQFITACSKFHHVEWA 314 +P+ +I T + + +R + SL M++ N L+N ++ ++E A Sbjct: 185 MPERDDIAWTTMVSAYRRVLDMDSANSLANQMSEKNEATSNCLINGYMG----LGNLEQA 240 Query: 315 IAAFTQMRNPNIFVYNALIGALLRCSQPDQALQIYSDMLRTEINPSSYTFPSIIKSCTLV 494 + F QM +I + +I + + +A+ ++ M+ I P T ++I +C + Sbjct: 241 ESLFNQMPVKDIISWTTMIKGYSQNKRYREAIAVFYKMMEEGIIPDEVTMSTVISACAHL 300 Query: 495 PAVKLGESIHGQVWKYGFGLHIHVQTALMDFYSNFDRVLDSRLVFDNMPERDSFAWTTMV 674 +++G+ +H + GF L +++ +AL+D YS + + LVF N+P+++ F W +++ Sbjct: 301 GVLEIGKEVHMYTLQNGFVLDVYIGSALVDMYSKCGSLERALLVFFNLPKKNLFCWNSII 360 Query: 675 AAHVRFRDLNSARKLFDKMP----EKNTASWNTMIHGFARMRDVESAKELFNKMPE---- 830 A K+F KM + N ++ ++ V+ + ++ M + Sbjct: 361 EGLAAHGFAQEALKMFAKMEMESVKPNAVTFVSVFTACTHAGLVDEGRRIYRSMIDDYSI 420 Query: 831 -KDLISWTTMIHCYSQNKYYKEALDIFREMKDHGISPDEVTMSTIISAC 974 ++ + M+H +S+ EAL++ M+ P+ V ++ C Sbjct: 421 VSNVEHYGGMVHLFSKAGLIYEALELIGNME---FEPNAVIWGALLDGC 466 >ref|NP_172105.4| transcription factor EMB1444 [Arabidopsis thaliana] gi|8810477|gb|AAF80138.1|AC024174_20 Contains similarity to an unknown protein T5J8.5 gi|4263522 from Arabidopsis thaliana BAC T5J8 gb|AC004044 and contains multiple PPR PF|01535 repeats. 
ESTs gb|AV565358, gb|AV558710, gb|AV524184 come from this gene [Arabidopsis thaliana] gi|332189826|gb|AEE27947.1| bHLH transcription factor LHL1 [Arabidopsis thaliana] Length = 1322 Score = 325 bits (833), Expect = 2e-86 Identities = 153/297 (51%), Positives = 218/297 (73%) Frame = +3 Query: 165 HLANYLKRCSSLKELESLYAFMTKSNSTRDCFLMNQFITACSKFHHVEWAIAAFTQMRNP 344 +L +K+CS+ K LES A M K++ +DC LMNQFITAC+ F ++ A++ TQM+ P Sbjct: 775 NLKKIIKQCSTPKLLESALAAMIKTSLNQDCRLMNQFITACTSFKRLDLAVSTMTQMQEP 834 Query: 345 NIFVYNALIGALLRCSQPDQALQIYSDMLRTEINPSSYTFPSIIKSCTLVPAVKLGESIH 524 N+FVYNAL + CS P ++L++Y MLR ++PSSYT+ S++K+ + A + GES+ Sbjct: 835 NVFVYNALFKGFVTCSHPIRSLELYVRMLRDSVSPSSYTYSSLVKASSF--ASRFGESLQ 892 Query: 525 GQVWKYGFGLHIHVQTALMDFYSNFDRVLDSRLVFDNMPERDSFAWTTMVAAHVRFRDLN 704 +WK+GFG H+ +QT L+DFYS R+ ++R VFD MPERD AWTTMV+A+ R D++ Sbjct: 893 AHIWKFGFGFHVKIQTTLIDFYSATGRIREARKVFDEMPERDDIAWTTMVSAYRRVLDMD 952 Query: 705 SARKLFDKMPEKNTASWNTMIHGFARMRDVESAKELFNKMPEKDLISWTTMIHCYSQNKY 884 SA L ++M EKN A+ N +I+G+ + ++E A+ LFN+MP KD+ISWTTMI YSQNK Sbjct: 953 SANSLANQMSEKNEATSNCLINGYMGLGNLEQAESLFNQMPVKDIISWTTMIKGYSQNKR 1012 Query: 885 YKEALDIFREMKDHGISPDEVTMSTIISACAHLGLLDEGKDIHLYVIQSQFDLDVYI 1055 Y+EA+ +F +M + GI PDEVTMST+ISACAHLG+L+ GK++H+Y +Q+ F LDVYI Sbjct: 1013 YREAIAVFYKMMEEGIIPDEVTMSTVISACAHLGVLEIGKEVHMYTLQNGFVLDVYI 1069 Score = 81.6 bits (200), Expect = 5e-13 Identities = 59/289 (20%), Positives = 131/289 (45%), Gaps = 11/289 (3%) Frame = +3 Query: 141 IPQHSEI--THLANYLKRCSSLKELESLYAFMTKSNSTRDCFLMNQFITACSKFHHVEWA 314 +P+ +I T + + +R + SL M++ N L+N ++ ++E A Sbjct: 930 MPERDDIAWTTMVSAYRRVLDMDSANSLANQMSEKNEATSNCLINGYMG----LGNLEQA 985 Query: 315 IAAFTQMRNPNIFVYNALIGALLRCSQPDQALQIYSDMLRTEINPSSYTFPSIIKSCTLV 494 + F QM +I + +I + + +A+ ++ M+ I P T ++I +C + Sbjct: 986 ESLFNQMPVKDIISWTTMIKGYSQNKRYREAIAVFYKMMEEGIIPDEVTMSTVISACAHL 1045 Query: 495 PAVKLGESIHGQVWKYGFGLHIHVQTALMDFYSNFDRVLDSRLVFDNMPERDSFAWTTMV 674 +++G+ +H + GF L +++ +AL+D YS + + LVF N+P+++ F W +++ Sbjct: 1046 
GVLEIGKEVHMYTLQNGFVLDVYIGSALVDMYSKCGSLERALLVFFNLPKKNLFCWNSII 1105 Query: 675 AAHVRFRDLNSARKLFDKMP----EKNTASWNTMIHGFARMRDVESAKELFNKMPE---- 830 A K+F KM + N ++ ++ V+ + ++ M + Sbjct: 1106 EGLAAHGFAQEALKMFAKMEMESVKPNAVTFVSVFTACTHAGLVDEGRRIYRSMIDDYSI 1165 Query: 831 -KDLISWTTMIHCYSQNKYYKEALDIFREMKDHGISPDEVTMSTIISAC 974 ++ + M+H +S+ EAL++ M+ P+ V ++ C Sbjct: 1166 VSNVEHYGGMVHLFSKAGLIYEALELIGNME---FEPNAVIWGALLDGC 1211 >dbj|BAD94184.1| hypothetical protein [Arabidopsis thaliana] Length = 577 Score = 321 bits (823), Expect = 3e-85 Identities = 152/297 (51%), Positives = 217/297 (73%) Frame = +3 Query: 165 HLANYLKRCSSLKELESLYAFMTKSNSTRDCFLMNQFITACSKFHHVEWAIAAFTQMRNP 344 +L +K+CS+ K LES A M K++ +DC LMNQFITAC+ F ++ A++ TQM+ P Sbjct: 30 NLKKIIKQCSTPKLLESALAAMIKTSLNQDCRLMNQFITACTSFKCLDLAVSTMTQMQEP 89 Query: 345 NIFVYNALIGALLRCSQPDQALQIYSDMLRTEINPSSYTFPSIIKSCTLVPAVKLGESIH 524 N+FVYNAL + CS P ++L++Y MLR ++PSSYT+ S++K+ + A + GES+ Sbjct: 90 NVFVYNALFKGFVTCSHPIRSLELYVRMLRDSVSPSSYTYSSLVKASSF--ASRFGESLQ 147 Query: 525 GQVWKYGFGLHIHVQTALMDFYSNFDRVLDSRLVFDNMPERDSFAWTTMVAAHVRFRDLN 704 +WK+GFG H+ +QT L+DFYS + ++R VFD MPERD AWTTMV+A+ R D++ Sbjct: 148 AHIWKFGFGFHVKIQTTLIDFYSATGGIREARKVFDEMPERDDIAWTTMVSAYRRVLDMD 207 Query: 705 SARKLFDKMPEKNTASWNTMIHGFARMRDVESAKELFNKMPEKDLISWTTMIHCYSQNKY 884 SA L ++M EKN A+ N +I+G+ + ++E A+ LFN+MP KD+ISWTTMI YSQNK Sbjct: 208 SANSLANQMSEKNEATSNCLINGYMGLGNLEQAESLFNQMPVKDIISWTTMIKGYSQNKR 267 Query: 885 YKEALDIFREMKDHGISPDEVTMSTIISACAHLGLLDEGKDIHLYVIQSQFDLDVYI 1055 Y+EA+ +F +M + GI PDEVTMST+ISACAHLG+L+ GK++H+Y +Q+ F LDVYI Sbjct: 268 YREAIAVFYKMMEEGIIPDEVTMSTVISACAHLGVLEIGKEVHMYTLQNGFVLDVYI 324 Score = 81.6 bits (200), Expect = 5e-13 Identities = 59/289 (20%), Positives = 131/289 (45%), Gaps = 11/289 (3%) Frame = +3 Query: 141 IPQHSEI--THLANYLKRCSSLKELESLYAFMTKSNSTRDCFLMNQFITACSKFHHVEWA 314 +P+ +I T + + +R + SL M++ N L+N ++ ++E A Sbjct: 185 MPERDDIAWTTMVSAYRRVLDMDSANSLANQMSEKNEATSNCLINGYMG----LGNLEQA 240 Query: 315 
IAAFTQMRNPNIFVYNALIGALLRCSQPDQALQIYSDMLRTEINPSSYTFPSIIKSCTLV 494 + F QM +I + +I + + +A+ ++ M+ I P T ++I +C + Sbjct: 241 ESLFNQMPVKDIISWTTMIKGYSQNKRYREAIAVFYKMMEEGIIPDEVTMSTVISACAHL 300 Query: 495 PAVKLGESIHGQVWKYGFGLHIHVQTALMDFYSNFDRVLDSRLVFDNMPERDSFAWTTMV 674 +++G+ +H + GF L +++ +AL+D YS + + LVF N+P+++ F W +++ Sbjct: 301 GVLEIGKEVHMYTLQNGFVLDVYIGSALVDMYSKCGSLERALLVFFNLPKKNLFCWNSII 360 Query: 675 AAHVRFRDLNSARKLFDKMP----EKNTASWNTMIHGFARMRDVESAKELFNKMPE---- 830 A K+F KM + N ++ ++ V+ + ++ M + Sbjct: 361 EGLAAHGFAQEALKMFAKMEMESVKPNAVTFVSVFTACTHAGLVDEGRRIYRSMIDDYSI 420 Query: 831 -KDLISWTTMIHCYSQNKYYKEALDIFREMKDHGISPDEVTMSTIISAC 974 ++ + M+H +S+ EAL++ M+ P+ V ++ C Sbjct: 421 VSNVEHYGGMVHLFSKAGLIYEALELIGNME---FEPNAVIWGALLDGC 466 >ref|XP_006409153.1| hypothetical protein EUTSA_v10022616mg [Eutrema salsugineum] gi|557110315|gb|ESQ50606.1| hypothetical protein EUTSA_v10022616mg [Eutrema salsugineum] Length = 578 Score = 318 bits (816), Expect = 2e-84 Identities = 147/296 (49%), Positives = 213/296 (71%) Frame = +3 Query: 168 LANYLKRCSSLKELESLYAFMTKSNSTRDCFLMNQFITACSKFHHVEWAIAAFTQMRNPN 347 + ++K+CS K LES A M K++ +DC +MN FIT+C+ F+ ++ A+++ TQM+ PN Sbjct: 33 ILQFIKQCSIPKLLESALAAMIKTSQNQDCHVMNHFITSCTSFNRLDLAVSSMTQMQEPN 92 Query: 348 IFVYNALIGALLRCSQPDQALQIYSDMLRTEINPSSYTFPSIIKSCTLVPAVKLGESIHG 527 +FVYNALI L+ CS P +AL +Y MLR ++PSSYT+ S++K+C GE + Sbjct: 93 VFVYNALIKGLVICSYPIRALGLYVRMLRYSVSPSSYTYSSLVKACAFDSV--FGELVQA 150 Query: 528 QVWKYGFGLHIHVQTALMDFYSNFDRVLDSRLVFDNMPERDSFAWTTMVAAHVRFRDLNS 707 +WK+GF H+ + T L+ FYS R+ ++R VFD MPERD F WTTM++A+ D++S Sbjct: 151 HIWKFGFCFHVQITTTLIWFYSALGRIREARKVFDEMPERDGFTWTTMISAYRHVLDMDS 210 Query: 708 ARKLFDKMPEKNTASWNTMIHGFARMRDVESAKELFNKMPEKDLISWTTMIHCYSQNKYY 887 A L ++MPEKN A+WN +I G+ ++ +VE A+ LFN+MP KD+ISWTTMI+ YS+NK Y Sbjct: 211 ANHLANQMPEKNVATWNCLIDGYTKLGNVEIAESLFNQMPVKDIISWTTMINGYSRNKRY 270 Query: 888 KEALDIFREMKDHGISPDEVTMSTIISACAHLGLLDEGKDIHLYVIQSQFDLDVYI 1055 KE++ +F +M + GI PDEVTMST+ISACAHLG+LD G ++H+Y +Q+ F DVYI Sbjct: 271 
KESIAVFYKMTEEGIIPDEVTMSTVISACAHLGVLDIGNEVHMYTVQNGFLHDVYI 326 Score = 72.0 bits (175), Expect = 4e-10 Identities = 51/245 (20%), Positives = 107/245 (43%), Gaps = 9/245 (3%) Frame = +3 Query: 267 NQFITACSKFHHVEWAIAAFTQMRNPNIFVYNALIGALLRCSQPDQALQIYSDMLRTEIN 446 N I +K +VE A + F QM +I + +I R + +++ ++ M I Sbjct: 227 NCLIDGYTKLGNVEIAESLFNQMPVKDIISWTTMINGYSRNKRYKESIAVFYKMTEEGII 286 Query: 447 PSSYTFPSIIKSCTLVPAVKLGESIHGQVWKYGFGLHIHVQTALMDFYSNFDRVLDSRLV 626 P T ++I +C + + +G +H + GF +++ +AL+D YS + + LV Sbjct: 287 PDEVTMSTVISACAHLGVLDIGNEVHMYTVQNGFLHDVYIGSALVDMYSKCGSLNRALLV 346 Query: 627 FDNMPERDSFAWTTMVAAHVRFRDLNSARKLFDKMP----EKNTASWNTMIHGFARMRDV 794 F N+P+++ F W +++ A ++F KM + N ++ +++ V Sbjct: 347 FFNLPKKNLFCWNSIIEGLAAHGYAQEALRMFAKMEMESVKPNAVTFVSVLTACTHAGLV 406 Query: 795 ESAKELFNKMPE-----KDLISWTTMIHCYSQNKYYKEALDIFREMKDHGISPDEVTMST 959 E ++ M + ++ + M+ S+ EAL++ M+ P+ V Sbjct: 407 EEGWRIYRSMIDDYSIVSNIKHYGCMVDLLSKAGLIHEALELIESME---FEPNVVIWGA 463 Query: 960 IISAC 974 ++ C Sbjct: 464 LLDGC 468 >ref|XP_004233665.1| PREDICTED: pentatricopeptide repeat-containing protein At1g06145-like [Solanum lycopersicum] Length = 494 Score = 318 bits (814), Expect = 3e-84 Identities = 150/241 (62%), Positives = 193/241 (80%) Frame = +3 Query: 333 MRNPNIFVYNALIGALLRCSQPDQALQIYSDMLRTEINPSSYTFPSIIKSCTLVPAVKLG 512 M NPN+FVYNALI A + C P +AL +Y DMLRT+ PSSYTF S++K CTL+ ++LG Sbjct: 1 MENPNVFVYNALIRAFVHCHIPHKALLLYIDMLRTQNIPSSYTFSSVVKGCTLMCGLRLG 60 Query: 513 ESIHGQVWKYGFGLHIHVQTALMDFYSNFDRVLDSRLVFDNMPERDSFAWTTMVAAHVRF 692 E IHG++W+YGFG H+ VQT+L+DFYSN RV +RLVFD MPERD+FAW MV+AH Sbjct: 61 ECIHGKIWEYGFGSHVFVQTSLIDFYSNLARVDLARLVFDEMPERDNFAWAAMVSAHAGT 120 Query: 693 RDLNSARKLFDKMPEKNTASWNTMIHGFARMRDVESAKELFNKMPEKDLISWTTMIHCYS 872 DL SARKLFD+MPEK T + N MI+G+A+ DVESA+ LF +M KDLI+WTTMI+CYS Sbjct: 121 GDLGSARKLFDEMPEKITVACNAMINGYAKTGDVESAELLFKEMSRKDLIAWTTMINCYS 180 Query: 873 QNKYYKEALDIFREMKDHGISPDEVTMSTIISACAHLGLLDEGKDIHLYVIQSQFDLDVY 1052 QN+ Y +A+++F EMK + I+PDEVTM+T+ISACAHLG+LD+GK++HLYV+Q FDL V+ Sbjct: 
181 QNRKYGQAIEVFYEMKSNLITPDEVTMTTVISACAHLGVLDQGKEMHLYVMQKGFDLGVH 240 Query: 1053 I 1055 I Sbjct: 241 I 241 Score = 77.8 bits (190), Expect = 7e-12 Identities = 55/254 (21%), Positives = 103/254 (40%) Frame = +3 Query: 267 NQFITACSKFHHVEWAIAAFTQMRNPNIFVYNALIGALLRCSQPDQALQIYSDMLRTEIN 446 N I +K VE A F +M ++ + +I + + QA++++ +M I Sbjct: 142 NAMINGYAKTGDVESAELLFKEMSRKDLIAWTTMINCYSQNRKYGQAIEVFYEMKSNLIT 201 Query: 447 PSSYTFPSIIKSCTLVPAVKLGESIHGQVWKYGFGLHIHVQTALMDFYSNFDRVLDSRLV 626 P T ++I +C + + G+ +H V + GF L +H+ +AL+D Y+ + S LV Sbjct: 202 PDEVTMTTVISACAHLGVLDQGKEMHLYVMQKGFDLGVHIGSALIDMYAKCGSLERSLLV 261 Query: 627 FDNMPERDSFAWTTMVAAHVRFRDLNSARKLFDKMPEKNTASWNTMIHGFARMRDVESAK 806 F + E++ F W N+A +HG+A Sbjct: 262 FYKLREKNLFCW--------------------------NSAIDGLAVHGYA--------- 286 Query: 807 ELFNKMPEKDLISWTTMIHCYSQNKYYKEALDIFREMKDHGISPDEVTMSTIISACAHLG 986 +EAL +F M+ + P+ +T ++++AC H G Sbjct: 287 ---------------------------EEALALFSRMEKEKVKPNGITFVSVLTACTHAG 319 Query: 987 LLDEGKDIHLYVIQ 1028 L+++G+ L + Q Sbjct: 320 LVEKGRKNFLSMTQ 333