BLASTX nr result
ID: Mentha27_contig00005856
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Mentha27_contig00005856 (2460 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EYU44833.1| hypothetical protein MIMGU_mgv1a017808mg, partial... 899 0.0 ref|XP_006447217.1| hypothetical protein CICLE_v10014357mg [Citr... 852 0.0 ref|XP_006338641.1| PREDICTED: pentatricopeptide repeat-containi... 848 0.0 ref|XP_002268821.1| PREDICTED: pentatricopeptide repeat-containi... 848 0.0 ref|XP_007031692.1| Pentatricopeptide repeat (PPR-like) superfam... 845 0.0 ref|XP_004231824.1| PREDICTED: pentatricopeptide repeat-containi... 842 0.0 ref|XP_002526948.1| pentatricopeptide repeat-containing protein,... 795 0.0 gb|EPS70491.1| hypothetical protein M569_04265, partial [Genlise... 786 0.0 ref|XP_002324000.1| pentatricopeptide repeat-containing family p... 784 0.0 ref|XP_007140836.1| hypothetical protein PHAVU_008G145600g [Phas... 774 0.0 gb|EXC31403.1| hypothetical protein L484_017686 [Morus notabilis] 773 0.0 ref|XP_003548551.1| PREDICTED: pentatricopeptide repeat-containi... 764 0.0 ref|XP_006399946.1| hypothetical protein EUTSA_v10015672mg [Eutr... 756 0.0 ref|XP_007220233.1| hypothetical protein PRUPE_ppa001979mg [Prun... 752 0.0 ref|XP_006292935.1| hypothetical protein CARUB_v10019206mg [Caps... 738 0.0 ref|XP_002873660.1| pentatricopeptide repeat-containing protein ... 733 0.0 ref|NP_190245.1| pentatricopeptide repeat-containing protein [Ar... 732 0.0 ref|XP_004308618.1| PREDICTED: pentatricopeptide repeat-containi... 725 0.0 ref|XP_006852234.1| hypothetical protein AMTR_s00049p00149530 [A... 706 0.0 gb|EAY82798.1| hypothetical protein OsI_38004 [Oryza sativa Indi... 654 0.0 >gb|EYU44833.1| hypothetical protein MIMGU_mgv1a017808mg, partial [Mimulus guttatus] Length = 659 Score = 899 bits (2323), Expect = 0.0 Identities = 452/660 (68%), Positives = 536/660 (81%), Gaps = 5/660 (0%) Frame = +2 Query: 491 RKGSLGSAFAVSWALDEPTVGKDDS-VAELEQLDEVERDDDGAKNRXXXXXXXXXXXXXX 667 +K SLG+AFA++WALDEPT G DDS + E +QL+ D+DGA N+ Sbjct: 2 KKPSLGAAFALTWALDEPTTGNDDSPIQESDQLN----DNDGANNKDGGDVQKRGIYRRQ 57 Query: 668 XXXXXXXXXXXXXXXXXXXXXALGWRLSSAKTADDVEEVLKAESILPLQVYSTIIRGFGK 847 AL RL SA ADDVE +LK LPLQVYSTIIRGFGK Sbjct: 58 KLQNGRIDVR-----------ALALRLHSATNADDVETILKDMGNLPLQVYSTIIRGFGK 106 Query: 848 EKKLESAMALFEWLKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGV 1027 +KK++SAMALFEWLKRKS E IQPNL+IYNSLLGA+K+A FDF++ +M+DMA G+ Sbjct: 107 DKKVDSAMALFEWLKRKSNEADSPIQPNLYIYNSLLGALKQAESFDFVDDVMSDMAAKGL 166 Query: 1028 HPNVVTYNTLMGIYIEEGKESKALQLFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALA 1207 PNVVTYNTLMGIYIE KE+K +LFEEMP+KGI PSPAS+SIVL YRRLEDGFGAL Sbjct: 167 LPNVVTYNTLMGIYIEHRKEAKVFELFEEMPTKGIFPSPASYSIVLLAYRRLEDGFGALT 226 Query: 1208 FYVQTRNRYEQGEIGRDDD---REDWDHEFSKLENFIITLCYQVMRRWLVKSENRSNDVL 1378 F+V+ R+++++GEIG+D+D EDW EF+KLENF I +CYQVMRRWLV S+N S +VL Sbjct: 227 FFVEIRDKFQKGEIGKDNDGEEEEDWVDEFAKLENFTIRICYQVMRRWLVNSKNLSTEVL 286 Query: 1379 RLLQEMDNACLKHGREEHERLIWACTREEHCVVAKELYTRIREVDD-EISVSVCNHLIWL 1555 RLL+EMD A L+ G EEHERLIWACTREEH +V KELY RIRE+ EIS+SVCNH+IWL Sbjct: 287 RLLKEMDKAGLQPGHEEHERLIWACTREEHYIVVKELYARIREMTSTEISLSVCNHVIWL 346 Query: 1556 LGKAKKWWAALEIYEDMLDKGPKPNNMSHELIISHFNILLSAARKKGIWRWGVRLLNKME 1735 +GKAKKWWAALEIYED+LDKGPKPNNMS+ELI+SHF+ILL+AARKKGIW+WGVRLLNKME Sbjct: 347 MGKAKKWWAALEIYEDLLDKGPKPNNMSYELIVSHFSILLTAARKKGIWKWGVRLLNKME 406 Query: 1736 EKGLKPGSREWNSVLVACSKASETSAAIEIFKRMVEQGEKPTIISYGALLSSLEKGKLYE 1915 EKGLKPGSREWN+VLVACSKASETSAAIEIFKRMV+QGEKPTIISYGALLS+LEKGKLY+ Sbjct: 407 EKGLKPGSREWNAVLVACSKASETSAAIEIFKRMVDQGEKPTIISYGALLSALEKGKLYD 466 Query: 1916 QAFQVWQHMVRVGVEPNLHAYTIMASIYAGQGEFENLNLIVKEMASLGIPPTVITFNAII 2095 +A QVW+HM+++G+EPNL+AYTIMASIYAGQ +F+ ++ I++EM ++ I PTV+TFNAII Sbjct: 467 EALQVWKHMLKMGLEPNLYAYTIMASIYAGQQKFDIVDSIIQEMVTVNIEPTVVTFNAII 526 Query: 2096 SSCGRNGHGGTAYEWFERMKVDDVVPNEVTYEMLIEALARDGKPRLAYDLHLRAKSEGFE 2275 SSCGR+ G AYE+F+RM+V ++ PNEVTY++LIEALA DGKPRLAY+LHLRA +EG Sbjct: 527 SSCGRSNLGSVAYEYFQRMRVLNIAPNEVTYDVLIEALASDGKPRLAYELHLRANNEGLV 586 Query: 2276 LSTKAYDAVVESVNLYGATVDIGALGSRPPERKKKVTIRKDLSEFCKLADVPRRSRPFVR 2455 LSTKAYDAVVES YGAT+D+ ALG RPPERKKKV RK LSEFC LADVPRRS+PF R Sbjct: 587 LSTKAYDAVVESSESYGATIDVSALGPRPPERKKKVQTRKKLSEFCDLADVPRRSKPFDR 646 >ref|XP_006447217.1| hypothetical protein CICLE_v10014357mg [Citrus clementina] gi|568831365|ref|XP_006469938.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610-like [Citrus sinensis] gi|557549828|gb|ESR60457.1| hypothetical protein CICLE_v10014357mg [Citrus clementina] Length = 768 Score = 852 bits (2201), Expect = 0.0 Identities = 441/762 (57%), Positives = 545/762 (71%), Gaps = 10/762 (1%) Frame = +2 Query: 203 MQALTIWPSKTESWNSCLVPQFDLELISFCTPVKWGGRRKRLDF----CDVHSHGLLRFS 370 MQ L++WP K VPQ +++S RRK+ C + G L S Sbjct: 1 MQPLSVWPLKG---GFAAVPQLHFDVVSSSFLSTRNRRRKKWSLVESVCHSRNTGFLLVS 57 Query: 371 RYSYYKGCRNGVCLASRSYDXXXXXXXXXXESTINGFSKPRKGSLGSAFAVSWALDEPTV 550 S + C GVC S D + F +P+K G++ +W++++ + Sbjct: 58 SNSTFSCC--GVCCRSIKLDSKCEFLSGFSSHKLVLFCEPKKSYFGASVMFAWSMEQQEI 115 Query: 551 GKDDSVAELEQLD----EVERD--DDGAKNRXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 712 G V E D E E D D + +R Sbjct: 116 GNGLLVEEPNSADGLLVETESDIVDYRSVHRVEDTGDNGNQVESEEVEIIGERGVGKQKS 175 Query: 713 XXXXXXALGWRLSSAKTADDVEEVLKAESILPLQVYSTIIRGFGKEKKLESAMALFEWLK 892 AL L KTADDVEEVLK LP QV+S++IRGFGKEK+ + AMAL EWLK Sbjct: 176 GRVDVKALAQSLWHTKTADDVEEVLKDMGELPPQVHSSMIRGFGKEKRTDCAMALVEWLK 235 Query: 893 RKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGVHPNVVTYNTLMGIYI 1072 RK ETGG I PNLF+YNSLLGAVK+++KF+ +++IMNDMA GV+PNVVTYNTLM IYI Sbjct: 236 RKKRETGGFIGPNLFVYNSLLGAVKQSQKFEEMDRIMNDMAEEGVNPNVVTYNTLMAIYI 295 Query: 1073 EEGKESKALQLFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALAFYVQTRNRYEQGEIG 1252 E+G+ +KAL + EE+ KG+TPS S+S L YRR+EDG GAL F+V+ R +Y +GEIG Sbjct: 296 EQGEGTKALNVLEEIKKKGLTPSAVSYSQALLAYRRMEDGNGALKFFVELREKYLKGEIG 355 Query: 1253 RDDDREDWDHEFSKLENFIITLCYQVMRRWLVKSENRSNDVLRLLQEMDNACLKHGREEH 1432 + DD E+W++EF KL++FII +CYQVMRRWLVK EN S +VL+LL EMD A L+ + E+ Sbjct: 356 KGDD-ENWENEFVKLKDFIIRICYQVMRRWLVKDENLSTNVLKLLIEMDKAGLRPVKAEY 414 Query: 1433 ERLIWACTREEHCVVAKELYTRIREVDDEISVSVCNHLIWLLGKAKKWWAALEIYEDMLD 1612 ERL+WACTREEH VVAKE Y RIRE DEIS+SVCNHLIWL+GKAKKWWAALE+YED+LD Sbjct: 415 ERLVWACTREEHYVVAKEFYARIRERHDEISLSVCNHLIWLMGKAKKWWAALEVYEDLLD 474 Query: 1613 KGPKPNNMSHELIISHFNILLSAARKKGIWRWGVRLLNKMEEKGLKPGSREWNSVLVACS 1792 KGPKPNNMS+ELI+SHFNILLSAARK+GIWRWGVRLLNKMEEKGLKPGSREWN+VLVACS Sbjct: 475 KGPKPNNMSYELIVSHFNILLSAARKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVACS 534 Query: 1793 KASETSAAIEIFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAFQVWQHMVRVGVEPNLH 1972 KASE +AA++IFKRMVE+GEKPTIISYGALLS+LEKGKLY++A +VWQHM+ VG EPNL+ Sbjct: 535 KASEYNAAVQIFKRMVEKGEKPTIISYGALLSALEKGKLYDEASRVWQHMLNVGAEPNLY 594 Query: 1973 AYTIMASIYAGQGEFENLNLIVKEMASLGIPPTVITFNAIISSCGRNGHGGTAYEWFERM 2152 AYTIMASI+ QG+F + LI +EMAS I PTV+T+NAIIS+CG+NG AYEWF RM Sbjct: 595 AYTIMASIFTAQGKFNLVELIFREMASSRIEPTVVTYNAIISACGQNGMSSAAYEWFHRM 654 Query: 2153 KVDDVVPNEVTYEMLIEALARDGKPRLAYDLHLRAKSEGFELSTKAYDAVVESVNLYGAT 2332 KV ++ PNE+TYEMLIEALA+DGKPRLAYDL+LRA++E LS+KAYDA++E +YGAT Sbjct: 655 KVQNISPNEITYEMLIEALAKDGKPRLAYDLYLRARNEELNLSSKAYDAILEFSQVYGAT 714 Query: 2333 VDIGALGSRPPERKKKVTIRKDLSEFCKLADVPRRSRPFVRK 2458 +D+ LG RPP++KKKV IRK+LS FC ADVPRRS+PF +K Sbjct: 715 IDLTVLGPRPPDKKKKVVIRKNLSNFCHFADVPRRSKPFDKK 756 >ref|XP_006338641.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610-like [Solanum tuberosum] Length = 740 Score = 848 bits (2192), Expect = 0.0 Identities = 404/576 (70%), Positives = 491/576 (85%) Frame = +2 Query: 731 ALGWRLSSAKTADDVEEVLKAESILPLQVYSTIIRGFGKEKKLESAMALFEWLKRKSEET 910 AL L KTAD+V+EVLK + LPLQVYS++IRGFGK+KKL SAMAL EWL+R+S++ Sbjct: 156 ALAQSLHFVKTADEVDEVLKDKIELPLQVYSSMIRGFGKDKKLNSAMALVEWLRRRSKDN 215 Query: 911 GGLIQPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGVHPNVVTYNTLMGIYIEEGKES 1090 G I N+FIYNSLLGA+KEA K+DF++K+M+DM GV PNVVTYNTLM IYIE+G+E Sbjct: 216 IGSISLNVFIYNSLLGAIKEAGKYDFVDKVMDDMVSEGVQPNVVTYNTLMRIYIEQGREL 275 Query: 1091 KALQLFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALAFYVQTRNRYEQGEIGRDDDRE 1270 +AL LF MP KG++PSPAS+S LF YRRLEDGFGA+ F+V+TR +Y+ GEIG ++ E Sbjct: 276 EALNLFRLMPKKGLSPSPASYSTALFAYRRLEDGFGAITFFVETREKYQNGEIGNIEE-E 334 Query: 1271 DWDHEFSKLENFIITLCYQVMRRWLVKSENRSNDVLRLLQEMDNACLKHGREEHERLIWA 1450 +W+ EF+KLENFI+ +CYQVMR+WLVK EN + +VL+LL +MD A L+ R E+ERL+WA Sbjct: 335 NWEDEFAKLENFIVRICYQVMRQWLVKGENANTNVLKLLTDMDRARLQLSRAEYERLVWA 394 Query: 1451 CTREEHCVVAKELYTRIREVDDEISVSVCNHLIWLLGKAKKWWAALEIYEDMLDKGPKPN 1630 CTREEH VVAKELY RIRE D EIS+SVCNH+IWL+GKAKKWWAALEIYED+LDKGPKPN Sbjct: 395 CTREEHHVVAKELYNRIRERDTEISLSVCNHIIWLMGKAKKWWAALEIYEDLLDKGPKPN 454 Query: 1631 NMSHELIISHFNILLSAARKKGIWRWGVRLLNKMEEKGLKPGSREWNSVLVACSKASETS 1810 NMS+ELI+SHFNILLSAARK+GIWRWGVRLLNKMEEKGLKP SREWN+VLVACSKASETS Sbjct: 455 NMSYELIVSHFNILLSAARKRGIWRWGVRLLNKMEEKGLKPSSREWNAVLVACSKASETS 514 Query: 1811 AAIEIFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAFQVWQHMVRVGVEPNLHAYTIMA 1990 AA++IF+RMVE+GEKPT+ISYGALLS+LEKGKLY++A QVW+HM++VG+EPNL+AYTIMA Sbjct: 515 AAVQIFRRMVEKGEKPTVISYGALLSALEKGKLYDEALQVWKHMIKVGIEPNLYAYTIMA 574 Query: 1991 SIYAGQGEFENLNLIVKEMASLGIPPTVITFNAIISSCGRNGHGGTAYEWFERMKVDDVV 2170 SIY QG+F ++ I+KEM + G+ PTV+TFNAIIS C RNG AYEWF+RMK ++ Sbjct: 575 SIYTAQGKFNIVDSIIKEMVTTGVEPTVVTFNAIISGCARNGMESVAYEWFQRMKTQNIT 634 Query: 2171 PNEVTYEMLIEALARDGKPRLAYDLHLRAKSEGFELSTKAYDAVVESVNLYGATVDIGAL 2350 PNEV+YEMLIEALA DGKPRLAY+L++RA +EG LSTKAYDAV+ S YGA++D+ L Sbjct: 635 PNEVSYEMLIEALANDGKPRLAYELYVRALTEGLSLSTKAYDAVISSTQAYGASIDLSIL 694 Query: 2351 GSRPPERKKKVTIRKDLSEFCKLADVPRRSRPFVRK 2458 G RPPE+KK+V IRK LSEFC +ADVPRRSRPF R+ Sbjct: 695 GPRPPEKKKRVQIRKSLSEFCNIADVPRRSRPFDRE 730 >ref|XP_002268821.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610 [Vitis vinifera] Length = 763 Score = 848 bits (2190), Expect = 0.0 Identities = 444/766 (57%), Positives = 551/766 (71%), Gaps = 14/766 (1%) Frame = +2 Query: 203 MQALTIWPSKTESWNSCLVPQFDLELISFCTPVKWGGRRKRLD----FCDVHSHGLLRFS 370 MQAL++WPSK W VPQ D L S P + GRRK + C S L S Sbjct: 1 MQALSVWPSKGVFW---AVPQLDYNLGSSSIPSRRRGRRKLWNPEDPVCQYRSLAFLWVS 57 Query: 371 RYSYYKGCRNGVCLASRSYDXXXXXXXXXXESTINGFSKPRKGSLGSAFAVSWALDEPTV 550 S + R GV S +D + I + ++GS G++FA++WAL++ + Sbjct: 58 SSS--RSDRVGVYCGSPKFDFGCGLLSGYSKLKIFLLCERKRGSFGASFALAWALEQQAI 115 Query: 551 G----KDDSVAELEQLDEVERDD------DGAKNRXXXXXXXXXXXXXXXXXXXXXXXXX 700 G K+DS + E D DGA++ Sbjct: 116 GNEFVKEDSNSIHSLAGNTETVDIDCLKVDGARD-------GDENDNEEEKEAEKNGEVI 168 Query: 701 XXXXXXXXXXALGWRLSSAKTADDVEEVLKAESILPLQVYSTIIRGFGKEKKLESAMALF 880 AL L A TADDVEEVLK + LPLQVYST+IRGFG +K+L++AMAL Sbjct: 169 EEKSRNVDVRALAHGLEFATTADDVEEVLKDKVELPLQVYSTMIRGFGTDKRLDAAMALV 228 Query: 881 EWLKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGVHPNVVTYNTLM 1060 EWLKRK +ET G PNLF+YNSLLGAVK++ KF +EK+MNDMA G+ PNVVTYNTLM Sbjct: 229 EWLKRK-KETNGSKGPNLFVYNSLLGAVKQSEKFALVEKVMNDMAREGILPNVVTYNTLM 287 Query: 1061 GIYIEEGKESKALQLFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALAFYVQTRNRYEQ 1240 IY+E+G+ +AL + EE+ G+ PSP S+S L YRR+EDG GAL F+++ R Y + Sbjct: 288 SIYLEQGRSVEALNILEEIQKNGLCPSPVSYSTALLVYRRMEDGHGALKFFIELRENYLK 347 Query: 1241 GEIGRDDDREDWDHEFSKLENFIITLCYQVMRRWLVKSENRSNDVLRLLQEMDNACLKHG 1420 GEIG+D D EDW++EF KL+NF I +CYQVMRRWLVK N+S +L+LL +MDNA L+ G Sbjct: 348 GEIGKDAD-EDWENEFVKLKNFTIRICYQVMRRWLVKEGNQSPILLKLLADMDNAGLQPG 406 Query: 1421 REEHERLIWACTREEHCVVAKELYTRIREVDDEISVSVCNHLIWLLGKAKKWWAALEIYE 1600 R E+ERL+WACTREEH VVAKELYTRIRE EIS+SVCNH+IWL+GKAKKWWAALEIYE Sbjct: 407 RAEYERLVWACTREEHYVVAKELYTRIRERHTEISLSVCNHIIWLMGKAKKWWAALEIYE 466 Query: 1601 DMLDKGPKPNNMSHELIISHFNILLSAARKKGIWRWGVRLLNKMEEKGLKPGSREWNSVL 1780 D+LDKGPKPNN+S+EL++SHFNILL+AARKKGIWRWGVRLLNKME+KGLKPGSREWN+VL Sbjct: 467 DLLDKGPKPNNLSYELVVSHFNILLTAARKKGIWRWGVRLLNKMEDKGLKPGSREWNAVL 526 Query: 1781 VACSKASETSAAIEIFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAFQVWQHMVRVGVE 1960 VACSKA+ETSAA+EIF+RMVEQGEKPTIISYGALLS+LEKGKLY++A +VW+HMV++GVE Sbjct: 527 VACSKAAETSAAVEIFRRMVEQGEKPTIISYGALLSALEKGKLYDEASRVWEHMVKMGVE 586 Query: 1961 PNLHAYTIMASIYAGQGEFENLNLIVKEMASLGIPPTVITFNAIISSCGRNGHGGTAYEW 2140 PNL+AYTIMASI GQG+ + ++ I++EM +LGI TV+T+NAIIS C RNG A+EW Sbjct: 587 PNLYAYTIMASICVGQGKLQRVDSILREMETLGIDATVVTYNAIISGCARNGLSSAAFEW 646 Query: 2141 FERMKVDDVVPNEVTYEMLIEALARDGKPRLAYDLHLRAKSEGFELSTKAYDAVVESVNL 2320 F RMKV + PNE+TYEMLIEALA+DGKPRLA++L+ RA++EG LSTKAYDAVV S + Sbjct: 647 FHRMKVGKIQPNEITYEMLIEALAKDGKPRLAFELYSRAQNEGLNLSTKAYDAVVLSSQV 706 Query: 2321 YGATVDIGALGSRPPERKKKVTIRKDLSEFCKLADVPRRSRPFVRK 2458 + AT+D+ LG RPPE+KKK+ RK LS FC LADVPRR++PF RK Sbjct: 707 HSATIDVSLLGPRPPEKKKKLLARKTLSAFCNLADVPRRAKPFDRK 752 >ref|XP_007031692.1| Pentatricopeptide repeat (PPR-like) superfamily protein, putative [Theobroma cacao] gi|508710721|gb|EOY02618.1| Pentatricopeptide repeat (PPR-like) superfamily protein, putative [Theobroma cacao] Length = 741 Score = 845 bits (2182), Expect = 0.0 Identities = 424/752 (56%), Positives = 540/752 (71%) Frame = +2 Query: 203 MQALTIWPSKTESWNSCLVPQFDLELISFCTPVKWGGRRKRLDFCDVHSHGLLRFSRYSY 382 MQAL+IWP S +VP D EL S C RK + L S YS Sbjct: 1 MQALSIWPLNV---GSLVVPHLDFELGSSCFASTKPSSRKTWSLAESRGPSFLLLSSYSR 57 Query: 383 YKGCRNGVCLASRSYDXXXXXXXXXXESTINGFSKPRKGSLGSAFAVSWALDEPTVGKDD 562 + R+G C + + E + F +P++GS A++WAL++ +G Sbjct: 58 FS--RSGTCYRNLNCSLRCGFLCWYSELKVVLFCEPKRGSSRGLVALAWALEQQEIGN-- 113 Query: 563 SVAELEQLDEVERDDDGAKNRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXALGW 742 ELE+ + RD D AL Sbjct: 114 ---ELEREESHSRDGDNGNE-----------DKNEEMDASSEGEVELEESARLDVRALAS 159 Query: 743 RLSSAKTADDVEEVLKAESILPLQVYSTIIRGFGKEKKLESAMALFEWLKRKSEETGGLI 922 L AKTADD+E+VLK LPLQV+S++I+GFG++ +++AMAL EWLKRK ++GG + Sbjct: 160 SLQFAKTADDIEKVLKDMDELPLQVHSSMIKGFGRDNYMDAAMALVEWLKRKKNDSGGSV 219 Query: 923 QPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGVHPNVVTYNTLMGIYIEEGKESKALQ 1102 PNLFIYNSLLGAVK +++F +EKI+ DM GV PN+VTYN LM IY+E+G+ +KAL Sbjct: 220 GPNLFIYNSLLGAVKHSKQFREMEKILKDMEEEGVIPNIVTYNVLMAIYLEQGEATKALN 279 Query: 1103 LFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALAFYVQTRNRYEQGEIGRDDDREDWDH 1282 + EE+ KG +PSP S+S L YRR+EDG GAL F+++ R +Y +G++G+D D E+W++ Sbjct: 280 VLEEIQEKGFSPSPVSYSTALLAYRRMEDGNGALKFFIELREKYVKGDLGKDAD-ENWEY 338 Query: 1283 EFSKLENFIITLCYQVMRRWLVKSENRSNDVLRLLQEMDNACLKHGREEHERLIWACTRE 1462 EF KLENF + +C QVMRRWLVK EN S +VL+LL++MDNA LK +E++ER+IWACT E Sbjct: 339 EFVKLENFTVRICQQVMRRWLVKDENLSTNVLKLLRDMDNAGLKLSKEDYERIIWACTCE 398 Query: 1463 EHCVVAKELYTRIREVDDEISVSVCNHLIWLLGKAKKWWAALEIYEDMLDKGPKPNNMSH 1642 EH VVAKELY+RIRE EIS+SVCNHLIWL+GKAKKWWAALE+YE++LDKGP PNN+S+ Sbjct: 399 EHYVVAKELYSRIRERHSEISLSVCNHLIWLMGKAKKWWAALEVYEELLDKGPSPNNLSY 458 Query: 1643 ELIISHFNILLSAARKKGIWRWGVRLLNKMEEKGLKPGSREWNSVLVACSKASETSAAIE 1822 EL++SHFNILL+AARK+GIWRWGVRLLNKME+KGLKPGSREWN+VLVACSKASET+AA++ Sbjct: 459 ELVMSHFNILLTAARKRGIWRWGVRLLNKMEDKGLKPGSREWNAVLVACSKASETTAAVQ 518 Query: 1823 IFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAFQVWQHMVRVGVEPNLHAYTIMASIYA 2002 IF+RMVEQGEKPTIISYGALLS+LEKGKLY++A +VW HM++VGV+PNL+AYTIMASI Sbjct: 519 IFRRMVEQGEKPTIISYGALLSALEKGKLYDEALRVWDHMIKVGVKPNLYAYTIMASIVT 578 Query: 2003 GQGEFENLNLIVKEMASLGIPPTVITFNAIISSCGRNGHGGTAYEWFERMKVDDVVPNEV 2182 G+G F +N + +EMAS GI PTV+T+NAIIS C RNG AYEWF RMKV ++ PNE+ Sbjct: 579 GKGNFRMVNAVFQEMASSGIEPTVVTYNAIISGCARNGMSSAAYEWFHRMKVQNISPNEI 638 Query: 2183 TYEMLIEALARDGKPRLAYDLHLRAKSEGFELSTKAYDAVVESVNLYGATVDIGALGSRP 2362 TY+MLIEALA+DGKPRLAY+L+LRA +EG LS+KAYDAVV+S +YGAT D+ LG RP Sbjct: 639 TYQMLIEALAKDGKPRLAYELYLRAHNEGLNLSSKAYDAVVQSSQVYGATTDLSVLGPRP 698 Query: 2363 PERKKKVTIRKDLSEFCKLADVPRRSRPFVRK 2458 P++K KV IRK L+EFC LADVPRRS+PF RK Sbjct: 699 PDKKMKVQIRKTLTEFCNLADVPRRSKPFDRK 730 >ref|XP_004231824.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610-like [Solanum lycopersicum] Length = 742 Score = 842 bits (2176), Expect = 0.0 Identities = 402/577 (69%), Positives = 491/577 (85%), Gaps = 1/577 (0%) Frame = +2 Query: 731 ALGWRLSSAKTADDVEEVLKAESILPLQVYSTIIRGFGKEKKLESAMALFEWLKRK-SEE 907 AL L KTAD+V+EVLK + LPLQVYS++IRGFGK+KKL SAMAL EWL+R+ ++ Sbjct: 157 ALAQSLHFVKTADEVDEVLKDKVELPLQVYSSMIRGFGKDKKLNSAMALVEWLRRRRGKD 216 Query: 908 TGGLIQPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGVHPNVVTYNTLMGIYIEEGKE 1087 G I N+FIYNSLLGA+KEA K+DF++K+M+DM GV PNVVTYNTLM YIE+G+E Sbjct: 217 NIGSISLNVFIYNSLLGAIKEAGKYDFVDKVMDDMVSEGVQPNVVTYNTLMRTYIEQGRE 276 Query: 1088 SKALQLFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALAFYVQTRNRYEQGEIGRDDDR 1267 +AL+LF EMP KG+TPSPAS+S LF YRRLEDGFGA+ F+V+TR RY+ GEIG ++ Sbjct: 277 LEALKLFREMPKKGLTPSPASYSTALFAYRRLEDGFGAITFFVETRERYQNGEIGNIEE- 335 Query: 1268 EDWDHEFSKLENFIITLCYQVMRRWLVKSENRSNDVLRLLQEMDNACLKHGREEHERLIW 1447 E+W+ EF+KLENFI+ +CYQVMR+WLVK EN + +VL+LL +MD A L+ R E+ERL+W Sbjct: 336 ENWEDEFAKLENFIVRICYQVMRQWLVKGENANTNVLKLLTDMDRARLQLSRAEYERLVW 395 Query: 1448 ACTREEHCVVAKELYTRIREVDDEISVSVCNHLIWLLGKAKKWWAALEIYEDMLDKGPKP 1627 ACTREEH VVAKELY RIRE D +IS+SVCNH+IWL+GKAKKWWAALEIYED+LDKGP+P Sbjct: 396 ACTREEHYVVAKELYNRIRERDTDISLSVCNHIIWLMGKAKKWWAALEIYEDLLDKGPQP 455 Query: 1628 NNMSHELIISHFNILLSAARKKGIWRWGVRLLNKMEEKGLKPGSREWNSVLVACSKASET 1807 NNMS+ELI+SHFNILLSAARK+GIWRWGVRLLNKMEEKGLKP SREWN+VLVACSKASET Sbjct: 456 NNMSYELIVSHFNILLSAARKRGIWRWGVRLLNKMEEKGLKPSSREWNAVLVACSKASET 515 Query: 1808 SAAIEIFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAFQVWQHMVRVGVEPNLHAYTIM 1987 SAA++IF+RMVE+GEKPT+ISYGALLS+LEKGKLY++A QVW+HM++VG+EPNL+AYTIM Sbjct: 516 SAAVQIFRRMVEKGEKPTVISYGALLSALEKGKLYDEALQVWKHMIKVGIEPNLYAYTIM 575 Query: 1988 ASIYAGQGEFENLNLIVKEMASLGIPPTVITFNAIISSCGRNGHGGTAYEWFERMKVDDV 2167 ASIY QG+F ++ I+KEM + G+ PTV+TFNAIIS C RNG AYEWF+RMK ++ Sbjct: 576 ASIYTAQGKFNIVDSIIKEMVTTGVEPTVVTFNAIISGCARNGMESVAYEWFQRMKTQNI 635 Query: 2168 VPNEVTYEMLIEALARDGKPRLAYDLHLRAKSEGFELSTKAYDAVVESVNLYGATVDIGA 2347 PNEV+YE+LIEALA DGKPRLAY+L++RA +EG LSTKAYDAV+ S YGA++D+ Sbjct: 636 TPNEVSYEVLIEALANDGKPRLAYELYVRALTEGLSLSTKAYDAVISSTQAYGASIDLSI 695 Query: 2348 LGSRPPERKKKVTIRKDLSEFCKLADVPRRSRPFVRK 2458 LG RPPE+KK+V IRK LSEFC +ADVPRRSRPF R+ Sbjct: 696 LGPRPPEKKKRVQIRKSLSEFCHIADVPRRSRPFDRE 732 >ref|XP_002526948.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223533700|gb|EEF35435.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 671 Score = 795 bits (2053), Expect = 0.0 Identities = 381/575 (66%), Positives = 479/575 (83%) Frame = +2 Query: 731 ALGWRLSSAKTADDVEEVLKAESILPLQVYSTIIRGFGKEKKLESAMALFEWLKRKSEET 910 +L L SA+TADDVEEVLK + LPLQVYS++I+ FG + K+ESA+AL EWLKR+ +E Sbjct: 87 SLARSLHSAQTADDVEEVLKDKGELPLQVYSSMIKAFGWDNKMESALALVEWLKRR-KEI 145 Query: 911 GGLIQPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGVHPNVVTYNTLMGIYIEEGKES 1090 G I PNLFIYNSLL AVK+++ F+ EKI+NDM G+ PNVVTYNTLMGIY+E+G+ + Sbjct: 146 GSSIGPNLFIYNSLLSAVKKSKLFEEAEKILNDMTQEGIAPNVVTYNTLMGIYVEKGQAT 205 Query: 1091 KALQLFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALAFYVQTRNRYEQGEIGRDDDRE 1270 KAL + E+M KG P+ AS+S L YR +EDG GALAF+V +++Y +G+IG++ D E Sbjct: 206 KALNILEQMHEKGFIPTAASYSTALLAYRGMEDGHGALAFFVDIKDKYLKGKIGKNSD-E 264 Query: 1271 DWDHEFSKLENFIITLCYQVMRRWLVKSENRSNDVLRLLQEMDNACLKHGREEHERLIWA 1450 +W++EF KLE FII +CYQVMRRWLV+ +N S DVL+LL +MD A L+ + E+ERL+WA Sbjct: 265 NWENEFVKLETFIIRICYQVMRRWLVRHDNFSTDVLKLLTDMDKAGLQPSQAEYERLVWA 324 Query: 1451 CTREEHCVVAKELYTRIREVDDEISVSVCNHLIWLLGKAKKWWAALEIYEDMLDKGPKPN 1630 CTRE+H V KELY RIRE +IS+SVCNHLIWL+GKAKKWWAALEIYED+LDKGP PN Sbjct: 325 CTREDHYAVGKELYIRIRERHSKISLSVCNHLIWLMGKAKKWWAALEIYEDLLDKGPNPN 384 Query: 1631 NMSHELIISHFNILLSAARKKGIWRWGVRLLNKMEEKGLKPGSREWNSVLVACSKASETS 1810 NMS+ELI+SHFNILL+AARK+GIWRWGVRLLNKME+KGLKPGSREWN+VLVACSKASET+ Sbjct: 385 NMSYELIVSHFNILLTAARKRGIWRWGVRLLNKMEDKGLKPGSREWNAVLVACSKASETT 444 Query: 1811 AAIEIFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAFQVWQHMVRVGVEPNLHAYTIMA 1990 AA++IF+RM+EQGEKPTI+SYGALLS+LEKGKLY++A +VW+HM++V V+PNL+AYTIMA Sbjct: 445 AAVQIFRRMIEQGEKPTIVSYGALLSALEKGKLYDEAVRVWEHMLKVDVKPNLYAYTIMA 504 Query: 1991 SIYAGQGEFENLNLIVKEMASLGIPPTVITFNAIISSCGRNGHGGTAYEWFERMKVDDVV 2170 S++AGQG+F ++ I+++M S GI PT+IT+NAIIS C N AYEWF RMKV ++ Sbjct: 505 SVFAGQGKFTYVDAIIQKMVSSGIEPTIITYNAIISGCTHNNLSSAAYEWFHRMKVQNMP 564 Query: 2171 PNEVTYEMLIEALARDGKPRLAYDLHLRAKSEGFELSTKAYDAVVESVNLYGATVDIGAL 2350 PN++TYEMLIEALA+DGKPRLAY+L+LRAK EG +LS K YDAV+ S +YGAT+DI L Sbjct: 565 PNKITYEMLIEALAKDGKPRLAYELYLRAKYEGLDLSAKVYDAVLRSSQVYGATIDINVL 624 Query: 2351 GSRPPERKKKVTIRKDLSEFCKLADVPRRSRPFVR 2455 G RPP++KK+V IRK L+EFC LADVPRRS+PF R Sbjct: 625 GPRPPDKKKRVKIRKTLTEFCDLADVPRRSKPFER 659 >gb|EPS70491.1| hypothetical protein M569_04265, partial [Genlisea aurea] Length = 557 Score = 786 bits (2031), Expect = 0.0 Identities = 375/549 (68%), Positives = 462/549 (84%) Frame = +2 Query: 731 ALGWRLSSAKTADDVEEVLKAESILPLQVYSTIIRGFGKEKKLESAMALFEWLKRKSEET 910 AL +L A TADDVE++LK + LPLQVYST+IRG GKEK+++SAMALFEWL+RKS+E+ Sbjct: 10 ALALKLQLATTADDVEQLLKGKENLPLQVYSTVIRGLGKEKRIQSAMALFEWLQRKSKES 69 Query: 911 GGLIQPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGVHPNVVTYNTLMGIYIEEGKES 1090 G ++ NLF+YNSLLGA+K+A FD +E++M M GVHPNVVT+N LMGI+IE+G E Sbjct: 70 GSKLKLNLFVYNSLLGAMKQAEAFDLVEEVMTKMGAEGVHPNVVTFNALMGIHIEQGNEL 129 Query: 1091 KALQLFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALAFYVQTRNRYEQGEIGRDDDRE 1270 +AL+LF EM GI+PSPAS+S VL YRR+E+G GA++F+++TRN+Y G++ DDD E Sbjct: 130 RALELFREMLMMGISPSPASYSTVLNAYRRMENGSGAVSFFIETRNKYRNGDMANDDD-E 188 Query: 1271 DWDHEFSKLENFIITLCYQVMRRWLVKSENRSNDVLRLLQEMDNACLKHGREEHERLIWA 1450 DW+ E SKLENF + +CYQVMRRWLVK N S +VL+LL+EMDNA L E E+LIWA Sbjct: 189 DWELEISKLENFTLRICYQVMRRWLVKRGNFSTEVLKLLKEMDNAGLNCDPENLEKLIWA 248 Query: 1451 CTREEHCVVAKELYTRIREVDDEISVSVCNHLIWLLGKAKKWWAALEIYEDMLDKGPKPN 1630 CTRE+HC VAKELYTR+RE+ +IS+SVCNH+IWL+GKAKKWWAALEIYE++LD GPKPN Sbjct: 249 CTREDHCAVAKELYTRVREMGADISLSVCNHIIWLMGKAKKWWAALEIYEELLDTGPKPN 308 Query: 1631 NMSHELIISHFNILLSAARKKGIWRWGVRLLNKMEEKGLKPGSREWNSVLVACSKASETS 1810 NMS+ELI+SHFNILL+AARKKGIWRWGVRL+NKM+EKGLKPGSREWNSVLVACSKA ETS Sbjct: 309 NMSYELIVSHFNILLTAARKKGIWRWGVRLINKMKEKGLKPGSREWNSVLVACSKAGETS 368 Query: 1811 AAIEIFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAFQVWQHMVRVGVEPNLHAYTIMA 1990 AIEIFKRMVE G+KPTIISYGALLS+LEKGKLY++A QVW+HMV+VGVE NL+AYTIMA Sbjct: 369 TAIEIFKRMVENGDKPTIISYGALLSALEKGKLYDEAIQVWKHMVKVGVEANLYAYTIMA 428 Query: 1991 SIYAGQGEFENLNLIVKEMASLGIPPTVITFNAIISSCGRNGHGGTAYEWFERMKVDDVV 2170 SI+A QG+ + ++LI++EM G+ PTV+TFNA+IS +N AYEWF RMK+ +V Sbjct: 429 SIHASQGKIDLVDLIIREMVGAGVEPTVVTFNAVISGFVKNNLSSAAYEWFRRMKLQNVT 488 Query: 2171 PNEVTYEMLIEALARDGKPRLAYDLHLRAKSEGFELSTKAYDAVVESVNLYGATVDIGAL 2350 PNE+TYE LIEALA+DGKPRLA +LHLRA++EG LSTKAYDA+++S + YGAT+D GAL Sbjct: 489 PNEITYETLIEALAKDGKPRLASELHLRAQNEGLMLSTKAYDAIIQSSDAYGATIDYGAL 548 Query: 2351 GSRPPERKK 2377 G RPPE KK Sbjct: 549 GPRPPEGKK 557 >ref|XP_002324000.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|222867002|gb|EEF04133.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 709 Score = 784 bits (2025), Expect = 0.0 Identities = 406/752 (53%), Positives = 527/752 (70%) Frame = +2 Query: 203 MQALTIWPSKTESWNSCLVPQFDLELISFCTPVKWGGRRKRLDFCDVHSHGLLRFSRYSY 382 MQ L++WP S SC VP + E S C G KR D + Sbjct: 1 MQTLSVWPL---SGGSCAVPHLEFEEDSSCFLSTRRGI-KRWGLVD------------NV 44 Query: 383 YKGCRNGVCLASRSYDXXXXXXXXXXESTINGFSKPRKGSLGSAFAVSWALDEPTVGKDD 562 ++G +G + S F + ++GS GS+ A++ AL++ +G + Sbjct: 45 FQGASSGFPMVSGDLRFLSNHSKIKYVC----FRETKEGSFGSSLALASALEQQKIGNEF 100 Query: 563 SVAELEQLDEVERDDDGAKNRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXALGW 742 E LD+ + G + AL Sbjct: 101 HRVE-SSLDDRSLGEAGEER-----------------------------DEKIDVPALAQ 130 Query: 743 RLSSAKTADDVEEVLKAESILPLQVYSTIIRGFGKEKKLESAMALFEWLKRKSEETGGLI 922 L AKT DD+EEVLK + LP+QVY ++I+GFG +KK+E A+AL +WLK K +ET G I Sbjct: 131 SLYFAKTVDDIEEVLKDKGELPVQVYLSMIKGFGWDKKMEPAIALVDWLKIK-KETDGTI 189 Query: 923 QPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGVHPNVVTYNTLMGIYIEEGKESKALQ 1102 PNLFIYNSLL AVK++ +++ EKI+ M GV PNVVTYN LM IY+++G+ KAL Sbjct: 190 VPNLFIYNSLLSAVKQSEQYEETEKILERMTQEGVAPNVVTYNILMVIYVKQGQAKKALD 249 Query: 1103 LFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALAFYVQTRNRYEQGEIGRDDDREDWDH 1282 + EEM G TPS AS+S L YR++EDG GAL F+V+ +++Y +GEIG+D D EDW+ Sbjct: 250 VLEEMRRNGFTPSAASYSSALLAYRKMEDGDGALKFFVEIKDKYMKGEIGKDAD-EDWER 308 Query: 1283 EFSKLENFIITLCYQVMRRWLVKSENRSNDVLRLLQEMDNACLKHGREEHERLIWACTRE 1462 E+ KLENF I +CYQVMRRWLV+ EN + +VL+LL +MD A L+ GR ++ERL+WACTRE Sbjct: 309 EYVKLENFTIRVCYQVMRRWLVRLENLNTNVLKLLTDMDKAELQPGRSDYERLVWACTRE 368 Query: 1463 EHCVVAKELYTRIREVDDEISVSVCNHLIWLLGKAKKWWAALEIYEDMLDKGPKPNNMSH 1642 EH VVAKELY RIRE +IS+SVCNH+IWL+GKAKKWWAALE+YED+LDKGPKPNN+S+ Sbjct: 369 EHYVVAKELYIRIRERCSDISLSVCNHVIWLMGKAKKWWAALEVYEDLLDKGPKPNNLSY 428 Query: 1643 ELIISHFNILLSAARKKGIWRWGVRLLNKMEEKGLKPGSREWNSVLVACSKASETSAAIE 1822 ELI+S+FN+LL+AA+K+GIWRWGVRLLNKMEEKGLKPGS+EWN+VLVACSKASET+AA++ Sbjct: 429 ELIVSYFNVLLTAAKKRGIWRWGVRLLNKMEEKGLKPGSKEWNAVLVACSKASETAAAVQ 488 Query: 1823 IFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAFQVWQHMVRVGVEPNLHAYTIMASIYA 2002 IF+RMVEQGEKPT+ISYGALLS+LEKG+LY++A +VW+HM++VGV+PN++AYTIMAS++ Sbjct: 489 IFRRMVEQGEKPTVISYGALLSALEKGRLYDEAVRVWEHMLKVGVKPNVYAYTIMASVFT 548 Query: 2003 GQGEFENLNLIVKEMASLGIPPTVITFNAIISSCGRNGHGGTAYEWFERMKVDDVVPNEV 2182 QG F ++ I+ EM S GI PTV+T+NAIIS C RN AYEWF RMKV ++ PNE+ Sbjct: 549 RQGNFRLVDAIINEMVSTGIEPTVVTYNAIISGCARNNLSSAAYEWFHRMKVQNISPNEI 608 Query: 2183 TYEMLIEALARDGKPRLAYDLHLRAKSEGFELSTKAYDAVVESVNLYGATVDIGALGSRP 2362 TY+MLIEALA+ GKPRLAY+L+LRA++E +LS KAYDAV+ S YGAT+D LG RP Sbjct: 609 TYDMLIEALAKSGKPRLAYELYLRAQNEDLQLSPKAYDAVMHSSEAYGATIDTSVLGPRP 668 Query: 2363 PERKKKVTIRKDLSEFCKLADVPRRSRPFVRK 2458 P++KKKV IRK L+EFC LADVPRRS+PF +K Sbjct: 669 PDKKKKVQIRKTLTEFCNLADVPRRSKPFNKK 700 >ref|XP_007140836.1| hypothetical protein PHAVU_008G145600g [Phaseolus vulgaris] gi|561013969|gb|ESW12830.1| hypothetical protein PHAVU_008G145600g [Phaseolus vulgaris] Length = 752 Score = 774 bits (1999), Expect = 0.0 Identities = 375/573 (65%), Positives = 453/573 (79%) Frame = +2 Query: 731 ALGWRLSSAKTADDVEEVLKAESILPLQVYSTIIRGFGKEKKLESAMALFEWLKRKSEET 910 AL RL +A T DDV E+L + LPLQV+STII FGKEK+++SA+ LFEW+K++ ET Sbjct: 166 ALALRLQTALTVDDVREILVDKRDLPLQVFSTIINSFGKEKRMDSALILFEWMKKRKIET 225 Query: 911 GGLIQPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGVHPNVVTYNTLMGIYIEEGKES 1090 G PNLFIYN LLG VK++ +F +E I+N+MA +G+ NVVTYNTLM IYIE+G+ Sbjct: 226 NGSFGPNLFIYNGLLGVVKQSGQFAQMETILNEMAKDGISYNVVTYNTLMAIYIEKGEFD 285 Query: 1091 KALQLFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALAFYVQTRNRYEQGEIGRDDDRE 1270 +AL + EE+ G TPSP S+S L YRR+ED GAL F+V+ R Y +GEIG DDD E Sbjct: 286 RALNVLEEIHGNGFTPSPVSYSQALLAYRRMEDCNGALNFFVELRENYHRGEIGEDDDGE 345 Query: 1271 DWDHEFSKLENFIITLCYQVMRRWLVKSENRSNDVLRLLQEMDNACLKHGREEHERLIWA 1450 DW+ E KLE F I +CYQVMR WLV S+N S +VL+ L +MDNA + R + ERL+WA Sbjct: 346 DWEEELMKLEKFTIRICYQVMRCWLVSSDNLSKNVLKFLVDMDNAGIPLTRADLERLVWA 405 Query: 1451 CTREEHCVVAKELYTRIREVDDEISVSVCNHLIWLLGKAKKWWAALEIYEDMLDKGPKPN 1630 CTRE+H +V KELYTRIRE D+IS+SVCNH IWL+GKAKKWWAALEIYED+LDKGPKPN Sbjct: 406 CTREDHYIVVKELYTRIRERYDKISLSVCNHAIWLMGKAKKWWAALEIYEDLLDKGPKPN 465 Query: 1631 NMSHELIISHFNILLSAARKKGIWRWGVRLLNKMEEKGLKPGSREWNSVLVACSKASETS 1810 N+S+ELI+SHFN LL+AA++KGIWRWGVRLLNKMEEKGLKPGSREWN+VLVACSKASET+ Sbjct: 466 NLSYELIVSHFNFLLNAAKRKGIWRWGVRLLNKMEEKGLKPGSREWNAVLVACSKASETT 525 Query: 1811 AAIEIFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAFQVWQHMVRVGVEPNLHAYTIMA 1990 AA++IFKRMVE GEKPT+ISYGALLS+LEKGKLY+ A +VW HMV+VGVEPN +AYTIMA Sbjct: 526 AAVQIFKRMVENGEKPTVISYGALLSALEKGKLYDDALRVWNHMVKVGVEPNAYAYTIMA 585 Query: 1991 SIYAGQGEFENLNLIVKEMASLGIPPTVITFNAIISSCGRNGHGGTAYEWFERMKVDDVV 2170 SIY QG F ++ IV+EM ++GI TV+T+NAIIS C RNG AYEWF RMKV ++ Sbjct: 586 SIYTAQGNFNRVDAIVQEMVTIGIEVTVVTYNAIISGCARNGMSSAAYEWFHRMKVQNIT 645 Query: 2171 PNEVTYEMLIEALARDGKPRLAYDLHLRAKSEGFELSTKAYDAVVESVNLYGATVDIGAL 2350 PNE+TYEMLIEALA DGKPRLAY L+ RAK+EG LS+KAYD VV S GAT ++G L Sbjct: 646 PNEITYEMLIEALANDGKPRLAYQLYTRAKNEGLTLSSKAYDVVVHSSQANGATTELGLL 705 Query: 2351 GSRPPERKKKVTIRKDLSEFCKLADVPRRSRPF 2449 G RP ++KKKV IRK L+EF LA VPRRS F Sbjct: 706 GPRPADKKKKVQIRKTLTEFYNLAGVPRRSNQF 738 >gb|EXC31403.1| hypothetical protein L484_017686 [Morus notabilis] Length = 737 Score = 773 bits (1996), Expect = 0.0 Identities = 404/749 (53%), Positives = 513/749 (68%), Gaps = 22/749 (2%) Frame = +2 Query: 203 MQALTIWPSKTESWNSCLVPQFDLELISFCTPVKWGGRRKRLDFCDVHSHGLLRFSRYSY 382 MQAL+ WP K + W +VPQ E S +K RR+R + D H + R + Sbjct: 1 MQALSTWPLKGDLW---IVPQLSSEKSS---SLKTSSRRRRKNVLDFGFHFPVCHGRITG 54 Query: 383 Y-------KGCRNGVCLASRSYDXXXXXXXXXXESTINGFSKPRK-GSLGSAFAVSWALD 538 + +G G +D + + F KP+K SLG++ A++ AL+ Sbjct: 55 FVLSTRNSRGVGYGGFCDRPKFDLGCGFLFGFSKLKVARFCKPKKKSSLGASVALAGALE 114 Query: 539 EPTVGKDDSVAELEQ--------------LDEVERDDDGAKNRXXXXXXXXXXXXXXXXX 676 E VG + EL+ L +E DD + Sbjct: 115 EQAVGSAIRIEELDSECSLSGKLSDGHLLLGRIESGDDNNGDEEQENKVIEDVGSEEKSR 174 Query: 677 XXXXXXXXXXXXXXXXXXALGWRLSSAKTADDVEEVLKAESILPLQVYSTIIRGFGKEKK 856 L L AKTADDV+EVLK + LP QV+ST+IRG G+EK Sbjct: 175 EEKGGKVDVRE--------LASSLRFAKTADDVDEVLKDKGELPPQVFSTMIRGLGREKL 226 Query: 857 LESAMALFEWLKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGVHPN 1036 L+ A AL EWLKRK EE GLI NLFIYNSLLGAVK++ +F +EK++N MA GV PN Sbjct: 227 LDPAFALLEWLKRKKEENNGLISLNLFIYNSLLGAVKQSEQFGEMEKVLNYMAQEGVVPN 286 Query: 1037 VVTYNTLMGIYIEEGKESKALQLFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALAFYV 1216 VVTYNT+M I++E G+ +KAL + EE+ KG+TPSP S+S L YRR+EDG GAL F+V Sbjct: 287 VVTYNTMMAIHLENGEGTKALSVLEEIRKKGLTPSPVSYSTALLAYRRMEDGHGALKFFV 346 Query: 1217 QTRNRYEQGEIGRDDDREDWDHEFSKLENFIITLCYQVMRRWLVKSENRSNDVLRLLQEM 1396 + R +Y++GE+G+DDD EDW++EF KLENF I +CYQVMR WLV +N S +VL+LL +M Sbjct: 347 EIREKYQKGEMGKDDD-EDWENEFVKLENFTIRVCYQVMRHWLVNEDNLSTNVLKLLTKM 405 Query: 1397 DNACLKHGREEHERLIWACTREEHCVVAKELYTRIREVDDEISVSVCNHLIWLLGKAKKW 1576 D A + R EHERL+WACTREEH +VAKELY RIRE +IS+SVCNH IWL+GKAK+W Sbjct: 406 DIAGIPPSRSEHERLLWACTREEHHLVAKELYDRIREGYSDISLSVCNHTIWLMGKAKRW 465 Query: 1577 WAALEIYEDMLDKGPKPNNMSHELIISHFNILLSAARKKGIWRWGVRLLNKMEEKGLKPG 1756 W ALEIYED+LDKGP+PNNMS+E+I+SHFNILL+AARK+GIW+WGVRLLNKMEEKGLKPG Sbjct: 466 WTALEIYEDLLDKGPQPNNMSYEIIVSHFNILLTAARKRGIWKWGVRLLNKMEEKGLKPG 525 Query: 1757 SREWNSVLVACSKASETSAAIEIFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAFQVWQ 1936 S+EWN+VL+ACSKASETSAA++IFKRMVEQG+KPT +SYGALLS+LEKGKLY++A QVW+ Sbjct: 526 SKEWNAVLIACSKASETSAAVKIFKRMVEQGQKPTFLSYGALLSALEKGKLYDEARQVWE 585 Query: 1937 HMVRVGVEPNLHAYTIMASIYAGQGEFENLNLIVKEMASLGIPPTVITFNAIISSCGRNG 2116 HM++VG+ PN++AYTIMAS++AG G+F ++ ++ EM S GI PTV+T+NAIIS C RN Sbjct: 586 HMLKVGIRPNVYAYTIMASVFAGHGKFNMVDTVIHEMVSSGIEPTVVTYNAIISGCARND 645 Query: 2117 HGGTAYEWFERMKVDDVVPNEVTYEMLIEALARDGKPRLAYDLHLRAKSEGFELSTKAYD 2296 A+EWF RMK + PN VTYEMLIEALA D KPRLAY+L+LRA++EG L+ KAYD Sbjct: 646 MIDMAFEWFHRMKAQSITPNNVTYEMLIEALANDCKPRLAYELYLRAQNEGLRLAPKAYD 705 Query: 2297 AVVESVNLYGATVDIGALGSRPPERKKKV 2383 VVES +GAT+D+ LG RPPERK KV Sbjct: 706 IVVESSQYHGATIDLRLLGPRPPERKGKV 734 >ref|XP_003548551.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610-like [Glycine max] Length = 808 Score = 764 bits (1973), Expect = 0.0 Identities = 368/575 (64%), Positives = 455/575 (79%) Frame = +2 Query: 731 ALGWRLSSAKTADDVEEVLKAESILPLQVYSTIIRGFGKEKKLESAMALFEWLKRKSEET 910 AL L + KT +DV +LK + LPLQV+STII GFGKEK+++SA+ LF W+K++ ET Sbjct: 222 ALALSLQTVKTVEDVGGILKDKGDLPLQVFSTIISGFGKEKRMDSALILFNWMKKRKIET 281 Query: 911 GGLIQPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGVHPNVVTYNTLMGIYIEEGKES 1090 G PNLFIYN LLG VK++ +F +E I+N+MA +G+ NVVTYNTLM IYIE+G+ Sbjct: 282 NGSFGPNLFIYNGLLGVVKQSGQFAEMEVILNEMAEDGIAYNVVTYNTLMAIYIEKGECD 341 Query: 1091 KALQLFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALAFYVQTRNRYEQGEIGRDDDRE 1270 KAL + EE+ G+TPSP S+S L YRR+EDG+GAL F+V+ R +Y QGEIG+DDD E Sbjct: 342 KALNMLEEIRRNGLTPSPVSYSQALLAYRRMEDGYGALNFFVEFREKYRQGEIGKDDDGE 401 Query: 1271 DWDHEFSKLENFIITLCYQVMRRWLVKSENRSNDVLRLLQEMDNACLKHGREEHERLIWA 1450 DW+ E KLE F I +CYQVMR WLV +N S +VL+ L +MDN + R + ERL WA Sbjct: 402 DWEKECLKLEKFTIRVCYQVMRCWLVSRDNLSKNVLKFLVDMDNVGIPLPRADLERLAWA 461 Query: 1451 CTREEHCVVAKELYTRIREVDDEISVSVCNHLIWLLGKAKKWWAALEIYEDMLDKGPKPN 1630 CTRE+H +V KELY RIRE D+IS+SVCNH IWL+GKAKKWWAALEIYED+LDKGPKPN Sbjct: 462 CTREDHYIVVKELYNRIRERYDKISLSVCNHAIWLMGKAKKWWAALEIYEDLLDKGPKPN 521 Query: 1631 NMSHELIISHFNILLSAARKKGIWRWGVRLLNKMEEKGLKPGSREWNSVLVACSKASETS 1810 N+S+ELI+SHFN LLSAA++KGIWRWGV+LLNKME+KGLKPG REWN+VLVACSKASET+ Sbjct: 522 NLSYELIVSHFNFLLSAAKRKGIWRWGVKLLNKMEDKGLKPGCREWNAVLVACSKASETT 581 Query: 1811 AAIEIFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAFQVWQHMVRVGVEPNLHAYTIMA 1990 AA++IFKRMVE GEKPTIISYGALLS+LEKGKLY+ A +VW HM++VGVEPN +AYTIMA Sbjct: 582 AAVQIFKRMVENGEKPTIISYGALLSALEKGKLYDDALRVWNHMIKVGVEPNAYAYTIMA 641 Query: 1991 SIYAGQGEFENLNLIVKEMASLGIPPTVITFNAIISSCGRNGHGGTAYEWFERMKVDDVV 2170 SI+ QG F ++ I++EM +LGI TV+T+NAII+ C NG AYEWF RMKV ++ Sbjct: 642 SIHTAQGNFNRVDAIIQEMVTLGIEVTVVTYNAIITGCAHNGMSSVAYEWFHRMKVQNIS 701 Query: 2171 PNEVTYEMLIEALARDGKPRLAYDLHLRAKSEGFELSTKAYDAVVESVNLYGATVDIGAL 2350 PNE+TYEMLI ALA DGKPRLAY L+ RAK+EG LS+KAYDAVV+S AT+++G L Sbjct: 702 PNEITYEMLIVALANDGKPRLAYQLYTRAKNEGLTLSSKAYDAVVQSSQANNATIELGLL 761 Query: 2351 GSRPPERKKKVTIRKDLSEFCKLADVPRRSRPFVR 2455 G RP ++KKKV IRK L+EF LA VP+RS+PF R Sbjct: 762 GPRPVDKKKKVQIRKTLNEFYNLAGVPKRSQPFDR 796 >ref|XP_006399946.1| hypothetical protein EUTSA_v10015672mg [Eutrema salsugineum] gi|557101036|gb|ESQ41399.1| hypothetical protein EUTSA_v10015672mg [Eutrema salsugineum] Length = 688 Score = 756 bits (1952), Expect = 0.0 Identities = 399/726 (54%), Positives = 502/726 (69%) Frame = +2 Query: 203 MQALTIWPSKTESWNSCLVPQFDLELISFCTPVKWGGRRKRLDFCDVHSHGLLRFSRYSY 382 MQAL+IWP K + + + + EL C V RKR F + G + S + Sbjct: 1 MQALSIWPLK---FGLLVGSRLEFELDCSCYVVS-PKTRKRQYFVEQACFGSI--SSFLL 54 Query: 383 YKGCRNGVCLASRSYDXXXXXXXXXXESTINGFSKPRKGSLGSAFAVSWALDEPTVGKDD 562 R LA + + +P+K GS+ V WA ++ +G++ Sbjct: 55 VSSNRKFEGLAINP------------STKVLFLCEPKKSLSGSSVGVGWATEQRELGEE- 101 Query: 563 SVAELEQLDEVERDDDGAKNRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXALGW 742 V+ + D D +K++ L + Sbjct: 102 -VSREDSSSVTASDSDHSKSQAVTGGEKTNARVDVRE--------------------LAY 140 Query: 743 RLSSAKTADDVEEVLKAESILPLQVYSTIIRGFGKEKKLESAMALFEWLKRKSEETGGLI 922 L +AKTADDV+ VLK + LPLQVY +IRGFGK+K+L+ AMA+ +WLKRK E+GGLI Sbjct: 141 SLRAAKTADDVDVVLKEKGELPLQVYCAMIRGFGKDKRLKPAMAVVDWLKRKKIESGGLI 200 Query: 923 QPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGVHPNVVTYNTLMGIYIEEGKESKALQ 1102 PNLFIYNSLLGA+KE+R F EKI++DM G+ PN+VTYNTLM IY+EEG+ KAL Sbjct: 201 GPNLFIYNSLLGAMKESRGFGETEKILSDMEEEGIVPNIVTYNTLMVIYMEEGEFHKALG 260 Query: 1103 LFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALAFYVQTRNRYEQGEIGRDDDREDWDH 1282 + + + KG PSP ++S L YRRLEDG GAL F+ + R +Y + EIG D D DW+ Sbjct: 261 ILDLVKEKGFEPSPVTYSTALLVYRRLEDGMGALEFFAELREKYSKREIGNDADY-DWEF 319 Query: 1283 EFSKLENFIITLCYQVMRRWLVKSENRSNDVLRLLQEMDNACLKHGREEHERLIWACTRE 1462 EF KLENFI +CYQVMRRWLVK EN + +L+LL MDNA LK REEHERLIWACTRE Sbjct: 320 EFVKLENFIGRICYQVMRRWLVKDENLTTKMLKLLNAMDNAGLKPSREEHERLIWACTRE 379 Query: 1463 EHCVVAKELYTRIREVDDEISVSVCNHLIWLLGKAKKWWAALEIYEDMLDKGPKPNNMSH 1642 EH VV KELY RIRE EIS+SVCNHLIWL+GKAKKWWAALEIYED+LD+GP+PNN+S+ Sbjct: 380 EHYVVGKELYKRIRERFPEISLSVCNHLIWLMGKAKKWWAALEIYEDLLDQGPEPNNLSY 439 Query: 1643 ELIISHFNILLSAARKKGIWRWGVRLLNKMEEKGLKPGSREWNSVLVACSKASETSAAIE 1822 EL++SHFNILLSAA ++GIWRWGVRLLNKME+KGLKP SR WN+VLVACSKASET+AAI+ Sbjct: 440 ELVVSHFNILLSAASRRGIWRWGVRLLNKMEDKGLKPQSRHWNAVLVACSKASETAAAIQ 499 Query: 1823 IFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAFQVWQHMVRVGVEPNLHAYTIMASIYA 2002 IFK MVE GEKPT+ISYGALLS+LEKGKLY++AF+VW HM++VG+EPN+HAYTIMAS+ Sbjct: 500 IFKAMVENGEKPTVISYGALLSALEKGKLYDEAFRVWNHMIKVGIEPNVHAYTIMASVLT 559 Query: 2003 GQGEFENLNLIVKEMASLGIPPTVITFNAIISSCGRNGHGGTAYEWFERMKVDDVVPNEV 2182 GQ +F L+ ++KEM+S GI P+V+T+NAIIS C RN G AYEWF RM+ ++V PNE+ Sbjct: 560 GQQKFNLLDTLLKEMSSKGIEPSVVTYNAIISGCARNELSGVAYEWFHRMRGENVEPNEI 619 Query: 2183 TYEMLIEALARDGKPRLAYDLHLRAKSEGFELSTKAYDAVVESVNLYGATVDIGALGSRP 2362 TYEMLIEALA D KPRLAY+LHL+A++EG +LS+K YDAVV+S YGAT+D+ LG RP Sbjct: 620 TYEMLIEALANDAKPRLAYELHLKAQNEGLKLSSKPYDAVVKSAESYGATIDLNLLGPRP 679 Query: 2363 PERKKK 2380 KK+ Sbjct: 680 VTPKKE 685 >ref|XP_007220233.1| hypothetical protein PRUPE_ppa001979mg [Prunus persica] gi|462416695|gb|EMJ21432.1| hypothetical protein PRUPE_ppa001979mg [Prunus persica] Length = 734 Score = 752 bits (1942), Expect = 0.0 Identities = 389/740 (52%), Positives = 507/740 (68%), Gaps = 5/740 (0%) Frame = +2 Query: 203 MQALTIWPSKTESWNSCLVPQFDLELISFCTPVKWGGRRKRL-----DFCDVHSHGLLRF 367 MQAL WPS+ E+W VPQ EL S C RRK++ C S +L Sbjct: 1 MQALVTWPSRAETW---AVPQLGFELGSSCK-FSTRIRRKKMWSLGFPVCYGRSGAVLLL 56 Query: 368 SRYSYYKGCRNGVCLASRSYDXXXXXXXXXXESTINGFSKPRKGSLGSAFAVSWALDEPT 547 S S G S +D + + +K S G++F V+WAL+E Sbjct: 57 SSNSGAIGAE--AFSGSPKFDFGCGCFSGYSKLKPARICQSKKRSFGASFVVAWALEEQA 114 Query: 548 VGKDDSVAELEQLDEVERDDDGAKNRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 727 +G D + E + + + Sbjct: 115 IGNDIVIEESTSEHRLSGEGESKGVDHLIVDEAEGGEDKNEVDVRNGGANWEQKNEKIDV 174 Query: 728 XALGWRLSSAKTADDVEEVLKAESILPLQVYSTIIRGFGKEKKLESAMALFEWLKRKSEE 907 AL L AKTADDVE VLK + LPLQV+S++IRGFG+++ ++SA A+ EWLKRKSEE Sbjct: 175 RALALSLQFAKTADDVEVVLKDKGDLPLQVFSSMIRGFGRDRLMDSAFAVVEWLKRKSEE 234 Query: 908 TGGLIQPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGVHPNVVTYNTLMGIYIEEGKE 1087 T G I PNLFIYNSLLGAVK++++F ++K+++ M GV NVVTYNT M IYIE+G Sbjct: 235 TNGSITPNLFIYNSLLGAVKQSKQFGEMDKVLSAMTEEGVELNVVTYNTKMAIYIEQGLS 294 Query: 1088 SKALQLFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALAFYVQTRNRYEQGEIGRDDDR 1267 +KAL + E++ KG+ PS S+S L Y+R+EDG GAL F+++ R +Y +G+I ++ Sbjct: 295 TKALDVLEDIEKKGLIPSSVSYSTALLAYQRMEDGNGALQFFIEFREKYHKGDISKESV- 353 Query: 1268 EDWDHEFSKLENFIITLCYQVMRRWLVKSENRSNDVLRLLQEMDNACLKHGREEHERLIW 1447 EDW+HEF +LENF +CYQVMRRWLVK +N S +VL+LL +MD A + R EHERL+W Sbjct: 354 EDWEHEFIQLENFTKRVCYQVMRRWLVKDDNLSTNVLKLLAQMDIAGVPLSRAEHERLLW 413 Query: 1448 ACTREEHCVVAKELYTRIREVDDEISVSVCNHLIWLLGKAKKWWAALEIYEDMLDKGPKP 1627 ACTREEH VAKELY RIRE EI +SVCNH+IWL+GKAKKWWAALEIYEDMLD+GPKP Sbjct: 414 ACTREEHYTVAKELYNRIRERHTEIGISVCNHVIWLMGKAKKWWAALEIYEDMLDRGPKP 473 Query: 1628 NNMSHELIISHFNILLSAARKKGIWRWGVRLLNKMEEKGLKPGSREWNSVLVACSKASET 1807 NNMS+ELI+SHFN+LL+AARK+GIWRWG+RLLNKMEEKGLKP S+EWN+VLVACSKA+ET Sbjct: 474 NNMSYELIVSHFNVLLTAARKRGIWRWGIRLLNKMEEKGLKPRSKEWNAVLVACSKAAET 533 Query: 1808 SAAIEIFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAFQVWQHMVRVGVEPNLHAYTIM 1987 SAA++IFKRMVEQG+KPT++SYGALLS+LEKGKLY++A QVW+HM++VGV+PNL+AYTIM Sbjct: 534 SAAVKIFKRMVEQGQKPTVLSYGALLSALEKGKLYDEARQVWEHMLKVGVKPNLYAYTIM 593 Query: 1988 ASIYAGQGEFENLNLIVKEMASLGIPPTVITFNAIISSCGRNGHGGTAYEWFERMKVDDV 2167 AS+++G G+ ++ I+ EM S GI PTV+T+NAIIS RNG AYEWF+RMK ++ Sbjct: 594 ASVFSGHGKLNMVDTIIHEMVSSGIEPTVVTYNAIISGFARNGSTNAAYEWFQRMKDQNI 653 Query: 2168 VPNEVTYEMLIEALARDGKPRLAYDLHLRAKSEGFELSTKAYDAVVESVNLYGATVDIGA 2347 PN VTYEM+IE LA GKPRLAYDL+L A+++G +LS K+YD VV+S G ++ G Sbjct: 654 SPNNVTYEMMIEGLANGGKPRLAYDLYLTAQNQGLDLSPKSYDIVVQSSLASGVAIE-GF 712 Query: 2348 LGSRPPERKKKVTIRKDLSE 2407 LG+RPP++K++V RK ++ Sbjct: 713 LGARPPDKKEEVQGRKSSTQ 732 >ref|XP_006292935.1| hypothetical protein CARUB_v10019206mg [Capsella rubella] gi|482561642|gb|EOA25833.1| hypothetical protein CARUB_v10019206mg [Capsella rubella] Length = 673 Score = 738 bits (1905), Expect = 0.0 Identities = 368/632 (58%), Positives = 465/632 (73%), Gaps = 1/632 (0%) Frame = +2 Query: 485 KPRKGSLGSAFAVSWALD-EPTVGKDDSVAELEQLDEVERDDDGAKNRXXXXXXXXXXXX 661 +P++ LGS+ V WA + V +DS + E + + G KN Sbjct: 71 EPKRSFLGSSVGVRWATELGEEVSTEDSSSSSVDHSEPQAVNGGEKNNSRVNVRE----- 125 Query: 662 XXXXXXXXXXXXXXXXXXXXXXXALGWRLSSAKTADDVEEVLKAESILPLQVYSTIIRGF 841 L + L +AKTADDV+ VLK + LPLQV+ +I GF Sbjct: 126 ------------------------LAFSLRAAKTADDVDAVLKEKGELPLQVFCAMISGF 161 Query: 842 GKEKKLESAMALFEWLKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLEKIMNDMAIN 1021 GK+K+LE A+A+ +WLKRK E+G +I PNLFIYNSLLGA+K+ F EK+++DM Sbjct: 162 GKDKRLEPAVAVVDWLKRKKSESGSVIGPNLFIYNSLLGAMKQLSAFGEAEKVLSDMEEE 221 Query: 1022 GVHPNVVTYNTLMGIYIEEGKESKALQLFEEMPSKGITPSPASFSIVLFGYRRLEDGFGA 1201 G+ PN+VTYNTLM IY+EEG+ KAL + + + KG P+P ++S L YRR+EDG GA Sbjct: 222 GIVPNIVTYNTLMVIYMEEGEFLKALGILDLVKEKGFEPNPITYSTALLVYRRMEDGMGA 281 Query: 1202 LAFYVQTRNRYEQGEIGRDDDREDWDHEFSKLENFIITLCYQVMRRWLVKSENRSNDVLR 1381 L F+V+ R +Y + EIG D D DW EF KLENFI +CYQVMRRWLVK+EN + VL+ Sbjct: 282 LEFFVELREKYSKREIGNDPDY-DWKFEFFKLENFIGRICYQVMRRWLVKNENWTTRVLK 340 Query: 1382 LLQEMDNACLKHGREEHERLIWACTREEHCVVAKELYTRIREVDDEISVSVCNHLIWLLG 1561 LL MD+A LK REEHERLIWACTREEH +V KELY RIRE EIS+SVCNHLIWL+G Sbjct: 341 LLNAMDSAGLKPSREEHERLIWACTREEHYIVGKELYKRIRERFPEISLSVCNHLIWLMG 400 Query: 1562 KAKKWWAALEIYEDMLDKGPKPNNMSHELIISHFNILLSAARKKGIWRWGVRLLNKMEEK 1741 KAKKWWAALEIYED+LD+GP+PNN+S+EL++SHF+ILLSAA ++GIWRWGVRLLNKME+K Sbjct: 401 KAKKWWAALEIYEDLLDEGPEPNNLSYELVVSHFSILLSAASRRGIWRWGVRLLNKMEDK 460 Query: 1742 GLKPGSREWNSVLVACSKASETSAAIEIFKRMVEQGEKPTIISYGALLSSLEKGKLYEQA 1921 LKP SR WN+VLVACSKASET+AAI+IFK MV+ GEKPT+ISYGALLS+LEKGKLY++A Sbjct: 461 NLKPQSRHWNAVLVACSKASETTAAIQIFKAMVDNGEKPTVISYGALLSALEKGKLYDEA 520 Query: 1922 FQVWQHMVRVGVEPNLHAYTIMASIYAGQGEFENLNLIVKEMASLGIPPTVITFNAIISS 2101 F+VW HMV+VG+EPNL+AYT MAS+ GQ +F L+ ++KEMAS GI P+V+T+NA+IS Sbjct: 521 FRVWNHMVKVGIEPNLYAYTTMASVLTGQQKFNLLDTLLKEMASKGIEPSVVTYNAVISG 580 Query: 2102 CGRNGHGGTAYEWFERMKVDDVVPNEVTYEMLIEALARDGKPRLAYDLHLRAKSEGFELS 2281 C +NG G AYEWF RMK ++V PNE+TYEMLIEALA D KPRLAY+LHL+A++EG +LS Sbjct: 581 CAKNGLSGVAYEWFHRMKSENVEPNEITYEMLIEALANDAKPRLAYELHLKAQNEGLKLS 640 Query: 2282 TKAYDAVVESVNLYGATVDIGALGSRPPERKK 2377 +K YDAVV+S YGAT+D+ LG RP +K+ Sbjct: 641 SKPYDAVVKSAETYGATIDLNLLGPRPDTKKR 672 >ref|XP_002873660.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297319497|gb|EFH49919.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 674 Score = 733 bits (1893), Expect = 0.0 Identities = 383/726 (52%), Positives = 495/726 (68%), Gaps = 1/726 (0%) Frame = +2 Query: 203 MQALTIWPSKTESWNSCLVPQFDLELISFCTPVKWGGRRKRLDFCDVHSHGLLRFSRYSY 382 MQAL+IWP K+ + + + EL C V R++ S Sbjct: 1 MQALSIWPLKS---GLLVGSRLEFELDCSCFVVSHKSRKRHC----------------SA 41 Query: 383 YKGCRNGVC-LASRSYDXXXXXXXXXXESTINGFSKPRKGSLGSAFAVSWALDEPTVGKD 559 +GC + L S + S + +P++ GS+ V WA ++ +G++ Sbjct: 42 QQGCFGRISSLILVSSNRKFEGLAVNPTSKVLFLCEPKRNLSGSSVGVGWATEQRELGEE 101 Query: 560 DSVAELEQLDEVERDDDGAKNRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXALG 739 S + V + G K L Sbjct: 102 VSTEDSSYPQTV---NGGEKTNSRVDVRE-----------------------------LA 129 Query: 740 WRLSSAKTADDVEEVLKAESILPLQVYSTIIRGFGKEKKLESAMALFEWLKRKSEETGGL 919 + L +AKTADDV+ V+K LPLQVY +IRGFGK+K+L+ A+A+ +WL+RK E+GG+ Sbjct: 130 YSLRAAKTADDVDIVIKEMGELPLQVYCAMIRGFGKDKRLKPAIAVVDWLRRKKSESGGV 189 Query: 920 IQPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGVHPNVVTYNTLMGIYIEEGKESKAL 1099 I PNLFIYNSLLGA+K++ + EKI++DM G+ PN+VTYNTLM IY+E+G+ KAL Sbjct: 190 IGPNLFIYNSLLGAMKQSSVGE-AEKILSDMEEEGIVPNIVTYNTLMVIYMEKGEFHKAL 248 Query: 1100 QLFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALAFYVQTRNRYEQGEIGRDDDREDWD 1279 + + + KG P+P ++S L YRR+EDG GAL F+V+ R +Y + EIG D D DW+ Sbjct: 249 GILDLVKEKGFEPNPITYSTALLVYRRMEDGMGALEFFVELREKYSKREIGNDADY-DWE 307 Query: 1280 HEFSKLENFIITLCYQVMRRWLVKSENRSNDVLRLLQEMDNACLKHGREEHERLIWACTR 1459 EF KLENFI +CYQVMRRWLVK EN + VL+LL MDNA K REEHERLIWACTR Sbjct: 308 FEFVKLENFIGRICYQVMRRWLVKDENWTTRVLKLLNAMDNAGPKPSREEHERLIWACTR 367 Query: 1460 EEHCVVAKELYTRIREVDDEISVSVCNHLIWLLGKAKKWWAALEIYEDMLDKGPKPNNMS 1639 EEH +V KELY RIRE EIS+SVCNHLIWL+GKAKKWWAALEIYED+LD+GP+PNN+S Sbjct: 368 EEHYIVGKELYKRIRERFPEISLSVCNHLIWLMGKAKKWWAALEIYEDLLDEGPEPNNLS 427 Query: 1640 HELIISHFNILLSAARKKGIWRWGVRLLNKMEEKGLKPGSREWNSVLVACSKASETSAAI 1819 +EL++SHFNILLSAA ++GIWRWGVRLLNKME+KGLKP SR WN+VLVACSKASET+AAI Sbjct: 428 YELVVSHFNILLSAASRRGIWRWGVRLLNKMEDKGLKPQSRHWNAVLVACSKASETTAAI 487 Query: 1820 EIFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAFQVWQHMVRVGVEPNLHAYTIMASIY 1999 +IFK MV+ GEKPT+ISYGALLS+LEKGKLY++AF+VW HM++VG+EPNL+AYT MAS+ Sbjct: 488 QIFKAMVDNGEKPTVISYGALLSALEKGKLYDEAFRVWNHMIKVGIEPNLYAYTTMASVL 547 Query: 2000 AGQGEFENLNLIVKEMASLGIPPTVITFNAIISSCGRNGHGGTAYEWFERMKVDDVVPNE 2179 GQ +F L+ ++KEMAS GI P+V+T+NA+IS C RNG G AYEWF RM+ + V PNE Sbjct: 548 TGQQKFNLLDTLLKEMASKGIEPSVVTYNAVISGCARNGLSGVAYEWFHRMRGEKVEPNE 607 Query: 2180 VTYEMLIEALARDGKPRLAYDLHLRAKSEGFELSTKAYDAVVESVNLYGATVDIGALGSR 2359 +TYEMLIEALA D KPRLAY+LHL+A+++G +LS+K YDAVV+S YGAT+D+ LG R Sbjct: 608 ITYEMLIEALANDAKPRLAYELHLKAQNDGLKLSSKPYDAVVKSAETYGATIDLNLLGPR 667 Query: 2360 PPERKK 2377 P + K+ Sbjct: 668 PHKEKR 673 >ref|NP_190245.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75206903|sp|Q9SNB7.1|PP264_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At3g46610 gi|6523064|emb|CAB62331.1| hypothetical protein [Arabidopsis thaliana] gi|332644660|gb|AEE78181.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 665 Score = 732 bits (1889), Expect = 0.0 Identities = 365/631 (57%), Positives = 462/631 (73%) Frame = +2 Query: 485 KPRKGSLGSAFAVSWALDEPTVGKDDSVAELEQLDEVERDDDGAKNRXXXXXXXXXXXXX 664 +P++ LGS+F V WA ++ + + E L + G KN Sbjct: 70 EPKRSLLGSSFGVGWATEQRELELGEEEVSTEDLSSA---NGGEKNNLRVDVRE------ 120 Query: 665 XXXXXXXXXXXXXXXXXXXXXXALGWRLSSAKTADDVEEVLKAESILPLQVYSTIIRGFG 844 L + L +AKTADDV+ VLK + LPLQV+ +I+GFG Sbjct: 121 -----------------------LAFSLRAAKTADDVDAVLKDKGELPLQVFCAMIKGFG 157 Query: 845 KEKKLESAMALFEWLKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLEKIMNDMAING 1024 K+K+L+ A+A+ +WLKRK E+GG+I PNLFIYNSLLGA+ R F EKI+ DM G Sbjct: 158 KDKRLKPAVAVVDWLKRKKSESGGVIGPNLFIYNSLLGAM---RGFGEAEKILKDMEEEG 214 Query: 1025 VHPNVVTYNTLMGIYIEEGKESKALQLFEEMPSKGITPSPASFSIVLFGYRRLEDGFGAL 1204 + PN+VTYNTLM IY+EEG+ KAL + + KG P+P ++S L YRR+EDG GAL Sbjct: 215 IVPNIVTYNTLMVIYMEEGEFLKALGILDLTKEKGFEPNPITYSTALLVYRRMEDGMGAL 274 Query: 1205 AFYVQTRNRYEQGEIGRDDDREDWDHEFSKLENFIITLCYQVMRRWLVKSENRSNDVLRL 1384 F+V+ R +Y + EIG D DW+ EF KLENFI +CYQVMRRWLVK +N + VL+L Sbjct: 275 EFFVELREKYAKREIGNDVGY-DWEFEFVKLENFIGRICYQVMRRWLVKDDNWTTRVLKL 333 Query: 1385 LQEMDNACLKHGREEHERLIWACTREEHCVVAKELYTRIREVDDEISVSVCNHLIWLLGK 1564 L MD+A ++ REEHERLIWACTREEH +V KELY RIRE EIS+SVCNHLIWL+GK Sbjct: 334 LNAMDSAGVRPSREEHERLIWACTREEHYIVGKELYKRIRERFSEISLSVCNHLIWLMGK 393 Query: 1565 AKKWWAALEIYEDMLDKGPKPNNMSHELIISHFNILLSAARKKGIWRWGVRLLNKMEEKG 1744 AKKWWAALEIYED+LD+GP+PNN+S+EL++SHFNILLSAA K+GIWRWGVRLLNKME+KG Sbjct: 394 AKKWWAALEIYEDLLDEGPEPNNLSYELVVSHFNILLSAASKRGIWRWGVRLLNKMEDKG 453 Query: 1745 LKPGSREWNSVLVACSKASETSAAIEIFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAF 1924 LKP R WN+VLVACSKASET+AAI+IFK MV+ GEKPT+ISYGALLS+LEKGKLY++AF Sbjct: 454 LKPQRRHWNAVLVACSKASETTAAIQIFKAMVDNGEKPTVISYGALLSALEKGKLYDEAF 513 Query: 1925 QVWQHMVRVGVEPNLHAYTIMASIYAGQGEFENLNLIVKEMASLGIPPTVITFNAIISSC 2104 +VW HM++VG+EPNL+AYT MAS+ GQ +F L+ ++KEMAS GI P+V+TFNA+IS C Sbjct: 514 RVWNHMIKVGIEPNLYAYTTMASVLTGQQKFNLLDTLLKEMASKGIEPSVVTFNAVISGC 573 Query: 2105 GRNGHGGTAYEWFERMKVDDVVPNEVTYEMLIEALARDGKPRLAYDLHLRAKSEGFELST 2284 RNG G AYEWF RMK ++V PNE+TYEMLIEALA D KPRLAY+LH++A++EG +LS+ Sbjct: 574 ARNGLSGVAYEWFHRMKSENVEPNEITYEMLIEALANDAKPRLAYELHVKAQNEGLKLSS 633 Query: 2285 KAYDAVVESVNLYGATVDIGALGSRPPERKK 2377 K YDAVV+S YGAT+D+ LG RP ++ + Sbjct: 634 KPYDAVVKSAETYGATIDLNLLGPRPDKKNR 664 >ref|XP_004308618.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610-like [Fragaria vesca subsp. vesca] Length = 657 Score = 725 bits (1871), Expect = 0.0 Identities = 346/551 (62%), Positives = 446/551 (80%), Gaps = 1/551 (0%) Frame = +2 Query: 731 ALGWRLSSAKTADDVEEVLKAESILPLQVYSTIIRGFGKEKKLESAMALFEWLKRKSEET 910 AL RL AKTADDVEEVLK LPLQV+S++IRGFG++K ++SA A+ EWLKR+ EET Sbjct: 103 ALASRLQFAKTADDVEEVLKEMGDLPLQVFSSMIRGFGRDKLMDSAFAVVEWLKRRGEET 162 Query: 911 GGLIQPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGVHPNVVTYNTLMGIYIEEGKES 1090 G++ PNLFI+NSLLGAVK+ ++F ++K++ DM GV PN+VTYNT M IY+E+G + Sbjct: 163 NGMVAPNLFIFNSLLGAVKQCKQFGEMDKVLADMTQEGVEPNIVTYNTKMAIYVEQGLST 222 Query: 1091 KALQLFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALAFYVQTRNRYEQGEIGRDDDRE 1270 KAL + EE+ KG+ SP ++S L Y+R++DG GAL F+V+ R +Y G+I + E Sbjct: 223 KALDVLEEIQKKGMIASPVTYSTALQAYQRMQDGIGALEFFVEFREKYRNGDICNVSE-E 281 Query: 1271 DWDHEFSKLENFIITLCYQVMRRWLVKSENRSNDVLRLLQEMDNACLKHGREEHERLIWA 1450 DW+ EF KLE+F +CYQVMR WLV ++ S +VL+LL MDNA + GR EHERL+WA Sbjct: 282 DWESEFLKLESFTKRVCYQVMRWWLVMDDDLSINVLKLLVNMDNAGIPLGRAEHERLLWA 341 Query: 1451 CTREEHCVVAKELYTRIREVDDEISVSVCNHLIWLLGKAKKWWAALEIYEDMLDKGPKPN 1630 CTRE+H VAKELY RIRE EIS+SVCNH+IW++GKAKKWWAALEIYEDMLDKGPKPN Sbjct: 342 CTREDHYNVAKELYCRIRERHSEISLSVCNHVIWVMGKAKKWWAALEIYEDMLDKGPKPN 401 Query: 1631 NMSHELIISHFNILLSAARKKGIWRWGVRLLNKMEEKGLKPGSREWNSVLVACSKASETS 1810 NMS+EL++SHFN+LL+AARKKGIWRWGVRLLNKMEEKGLKP S+EWN+VLVACSKA+ETS Sbjct: 402 NMSYELVVSHFNVLLTAARKKGIWRWGVRLLNKMEEKGLKPRSKEWNAVLVACSKAAETS 461 Query: 1811 AAIEIFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAFQVWQHMVRVGVEPNLHAYTIMA 1990 AA++IF+RMVEQG+KPTI+SYGALLS+LEKGKLY++A QVW+HM++VGV+PNL+AYTIMA Sbjct: 462 AAVKIFRRMVEQGQKPTILSYGALLSALEKGKLYDEARQVWEHMIKVGVKPNLYAYTIMA 521 Query: 1991 SIYAGQGEFENLNLIVKEMASLGIPPTVITFNAIISSCGRN-GHGGTAYEWFERMKVDDV 2167 S+++G G+F + I++EM S GI PTV+T+NAIIS C RN AY+WF+RMK +++ Sbjct: 522 SVFSGHGKFNLVETILQEMVSSGIEPTVVTYNAIISGCARNDSSSADAYDWFDRMKANNI 581 Query: 2168 VPNEVTYEMLIEALARDGKPRLAYDLHLRAKSEGFELSTKAYDAVVESVNLYGATVDIGA 2347 PN VTYEM+IEALA++GKPRLAY+L+LRA+++G LS+KAYD +V+S +G + D+ Sbjct: 582 PPNNVTYEMMIEALAKEGKPRLAYELYLRAQNQGIHLSSKAYDILVQSSIDFGDSFDLNL 641 Query: 2348 LGSRPPERKKK 2380 LG RPP K+ Sbjct: 642 LGPRPPPHAKE 652 >ref|XP_006852234.1| hypothetical protein AMTR_s00049p00149530 [Amborella trichopoda] gi|548855838|gb|ERN13701.1| hypothetical protein AMTR_s00049p00149530 [Amborella trichopoda] Length = 754 Score = 706 bits (1821), Expect = 0.0 Identities = 344/573 (60%), Positives = 447/573 (78%), Gaps = 1/573 (0%) Frame = +2 Query: 731 ALGWRLSSAKTADDVEEVLKAESILPLQVYSTIIRGFGKEKKLESAMALFEWLKRKSEET 910 AL L A+ ADDVEEVL + LP VYS++IRGFG ++L+ A+AL EWLKR + T Sbjct: 164 ALAMSLQFAERADDVEEVL-GDMDLPPSVYSSMIRGFGMAERLKPAIALVEWLKRGKKST 222 Query: 911 GGLIQPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGVHPNVVTYNTLMGIYIEEGKES 1090 G NL+IYNSLLGA K + ++ + KI+ DM G+ PN+VT NTLM +Y+E+GK Sbjct: 223 NGGAILNLYIYNSLLGAAKASHSYEKVGKIIEDMEKQGILPNIVTLNTLMSVYLEQGKTQ 282 Query: 1091 KALQLFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALAFYVQTRNRYEQGEIGRDDDRE 1270 +A +F E+P G++PSP ++S VL YR++ED GAL F+V++R +Y++GEI +D E Sbjct: 283 EARDIFSEIPRNGLSPSPVTYSTVLQIYRKMEDAKGALEFFVESREKYKKGEI-ENDSCE 341 Query: 1271 DWDHEFSKLENFIITLCYQVMRRWLVKSENR-SNDVLRLLQEMDNACLKHGREEHERLIW 1447 DW++EF+KLENF I +CYQVMR WLVK R + DVL+LL E+D A LK GR +ERLIW Sbjct: 342 DWENEFAKLENFTIRICYQVMRGWLVKGGGREATDVLKLLIELDKAGLKPGRAIYERLIW 401 Query: 1448 ACTREEHCVVAKELYTRIREVDDEISVSVCNHLIWLLGKAKKWWAALEIYEDMLDKGPKP 1627 ACT E H +VAKELY RIRE + EIS+SVCNH+IWL+GKAKKWWA+LE+YE+MLDKGPKP Sbjct: 402 ACTNEGHYIVAKELYQRIRENNTEISLSVCNHVIWLMGKAKKWWASLEVYEEMLDKGPKP 461 Query: 1628 NNMSHELIISHFNILLSAARKKGIWRWGVRLLNKMEEKGLKPGSREWNSVLVACSKASET 1807 NN+S+EL++S FNILLSAA ++GIW W +RLLNKM+EKG+KP +REWN+ LVACS+ASE Sbjct: 462 NNLSYELMVSQFNILLSAASRRGIWNWAIRLLNKMQEKGIKPRTREWNAALVACSRASEA 521 Query: 1808 SAAIEIFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAFQVWQHMVRVGVEPNLHAYTIM 1987 +AA++IF RMVEQGEKPTI+SYGALLS+LEKGKLY++A QVW+HM++VGV+PNL+AYT M Sbjct: 522 AAAVQIFMRMVEQGEKPTILSYGALLSALEKGKLYDKAHQVWEHMIKVGVQPNLYAYTTM 581 Query: 1988 ASIYAGQGEFENLNLIVKEMASLGIPPTVITFNAIISSCGRNGHGGTAYEWFERMKVDDV 2167 SIY QG + ++++++EM SLGI PTV+TFNAIIS C G GG A+EWF RMK ++ Sbjct: 582 LSIYIKQGRLKAVDIVIREMNSLGIEPTVVTFNAIISGCAYKGMGGAAFEWFHRMKAKNI 641 Query: 2168 VPNEVTYEMLIEALARDGKPRLAYDLHLRAKSEGFELSTKAYDAVVESVNLYGATVDIGA 2347 PNE+TYEMLIEALA DGKPRLAY+++LRA++E LS KAYD+V+ S Y A++D+ Sbjct: 642 EPNEITYEMLIEALANDGKPRLAYEVYLRARNEDLLLSPKAYDSVLRSSYQYKASIDMSR 701 Query: 2348 LGSRPPERKKKVTIRKDLSEFCKLADVPRRSRP 2446 LG RPPE+ KK T K +EFC+L D+ RR +P Sbjct: 702 LGPRPPEKTKKRT--KVSAEFCRLPDMSRREKP 732 >gb|EAY82798.1| hypothetical protein OsI_38004 [Oryza sativa Indica Group] Length = 669 Score = 654 bits (1688), Expect = 0.0 Identities = 314/582 (53%), Positives = 432/582 (74%), Gaps = 9/582 (1%) Frame = +2 Query: 731 ALGWRLSSAKTADDVEEVLKA--------ESILPLQVYSTIIRGFGKEKKLESAMALFEW 886 A+G L A+TAD+VE ++K E LPLQVY+++IRG GKE++L++A A+ E Sbjct: 75 AVGAALRDARTADEVETLVKGFLDDGGGGEEHLPLQVYTSVIRGLGKERRLDAAFAVVEH 134 Query: 887 LKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLEKIMNDMAINGVHPNVVTYNTLMGI 1066 LKR S GG+ N F+YN LLGAVK + +F + ++ DM G+ PNVVT+NTLM I Sbjct: 135 LKRGSGSGGGV---NQFVYNCLLGAVKNSGEFGRIHDVLADMEAQGIPPNVVTFNTLMSI 191 Query: 1067 YIEEGKESKALQLFEEMPSKGITPSPASFSIVLFGYRRLEDGFGALAFYVQTRNRYEQGE 1246 Y+E+GK + ++F+ + G+ P+ A++S V+ Y++ D F AL F + R Y +GE Sbjct: 192 YVEQGKIDEVFRVFDTIEGSGLVPTAATYSTVMSSYKKAGDAFAALKFLTKLREMYNKGE 251 Query: 1247 IGRDDDREDWDHEFSKLENFIITLCYQVMRRWLVKSENRSNDVLRLLQEMDNACLKHGRE 1426 + +REDWD EF K E + +CY MRR LV EN +VL++L MD A +K R Sbjct: 252 LA--GNREDWDREFVKFEKLTVRVCYMAMRRSLVGGENPVGEVLKVLLGMDEAGVKPDRR 309 Query: 1427 EHERLIWACTREEHCVVAKELYTRIREVDDE-ISVSVCNHLIWLLGKAKKWWAALEIYED 1603 ++ERL+WACT EEH +AKELY RIRE D IS+SVCNHLIWL+GKAKKWWAALEIYED Sbjct: 310 DYERLVWACTGEEHYTIAKELYQRIRERGDGVISLSVCNHLIWLMGKAKKWWAALEIYED 369 Query: 1604 MLDKGPKPNNMSHELIISHFNILLSAARKKGIWRWGVRLLNKMEEKGLKPGSREWNSVLV 1783 +LDKGPKPNN+S+ELI+SHFNILL+AA+++GIWRWGVRLL+KM++KGLKPGSREWN+VL+ Sbjct: 370 LLDKGPKPNNLSYELIMSHFNILLNAAKRRGIWRWGVRLLDKMQQKGLKPGSREWNAVLL 429 Query: 1784 ACSKASETSAAIEIFKRMVEQGEKPTIISYGALLSSLEKGKLYEQAFQVWQHMVRVGVEP 1963 ACS+A+ETSAA++IFKRM++QG P ++SYGALLS+LEKGKLY++A +VW+HM +VGV+P Sbjct: 430 ACSRAAETSAAVDIFKRMIDQGLTPDVVSYGALLSALEKGKLYDEALRVWEHMCKVGVKP 489 Query: 1964 NLHAYTIMASIYAGQGEFENLNLIVKEMASLGIPPTVITFNAIISSCGRNGHGGTAYEWF 2143 NLHAYTI+ SIY G+G ++ +++ M S + PTV+TFNAIIS+C RN GG+A+EWF Sbjct: 490 NLHAYTILVSIYIGKGNHAMVDSVLRGMLSAKVEPTVVTFNAIISACVRNNKGGSAFEWF 549 Query: 2144 ERMKVDDVVPNEVTYEMLIEALARDGKPRLAYDLHLRAKSEGFELSTKAYDAVVESVNLY 2323 RMKV ++ PNE+TY+MLIEAL +DGKPRLAY++++RA ++G EL K+YD V+E+ Y Sbjct: 550 HRMKVQNIEPNEITYQMLIEALVQDGKPRLAYEMYMRACNQGLELPAKSYDTVMEACQDY 609 Query: 2324 GATVDIGALGSRPPERKKKVTIRKDLSEFCKLADVPRRSRPF 2449 G+ +D+ +LG RP ++ + + I S + D+P ++ F Sbjct: 610 GSLIDLNSLGPRPVKKVEPIRIENKFSSSYYVGDLPSSTKHF 651