BLASTX nr result
ID: Catharanthus22_contig00025697
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00025697
         (2605 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_006340539.1| PREDICTED: pentatricopeptide repeat-containi...   988   0.0
ref|XP_004231485.1| PREDICTED: pentatricopeptide repeat-containi...   974   0.0
ref|XP_006419949.1| hypothetical protein CICLE_v10006593mg [Citr...   945   0.0
ref|XP_006489403.1| PREDICTED: pentatricopeptide repeat-containi...   942   0.0
ref|XP_002282622.1| PREDICTED: pentatricopeptide repeat-containi...   939   0.0
gb|EXB63452.1| hypothetical protein L484_005415 [Morus notabilis]     934   0.0
gb|EMJ26873.1| hypothetical protein PRUPE_ppa002602mg [Prunus pe...   934   0.0
ref|XP_004303188.1| PREDICTED: pentatricopeptide repeat-containi...   929   0.0
gb|EOY05698.1| Pentatricopeptide repeat (PPR) superfamily protei...   914   0.0
gb|ESW15257.1| hypothetical protein PHAVU_007G057700g [Phaseolus...   912   0.0
gb|EOY05700.1| Pentatricopeptide repeat (PPR) superfamily protei...   909   0.0
ref|XP_004496720.1| PREDICTED: pentatricopeptide repeat-containi...   902   0.0
ref|XP_003617141.1| Pentatricopeptide repeat-containing protein ...   896   0.0
ref|XP_003536531.1| PREDICTED: pentatricopeptide repeat-containi...   894   0.0
ref|XP_002314694.1| hypothetical protein POPTR_0010s09690g [Popu...   889   0.0
gb|EPS73044.1| hypothetical protein M569_01710 [Genlisea aurea]       885   0.0
ref|NP_193221.3| pentatricopeptide repeat-containing protein LOI...   865   0.0
ref|XP_002870277.1| hypothetical protein ARALYDRAFT_493409 [Arab...   862   0.0
ref|XP_006414633.1| hypothetical protein EUTSA_v10024593mg [Eutr...
858 0.0 ref|XP_006283247.1| hypothetical protein CARUB_v10004282mg [Caps... 853 0.0 >ref|XP_006340539.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14850-like [Solanum tuberosum] Length = 687 Score = 988 bits (2554), Expect = 0.0 Identities = 474/668 (70%), Positives = 564/668 (84%) Frame = -1 Query: 2491 SLRLGKAVHARIIRTLGQYDLPPFLSNHLINMYSKLDRLNSAQLVLSLTRPSSLSVVTWT 2312 SL LG+AVHA IIRT+ + PPFLSNHLIN YSKLD NSAQL+LSLT P SVVTWT Sbjct: 21 SLLLGRAVHAHIIRTI-ESPFPPFLSNHLINFYSKLDSPNSAQLLLSLTPPRFRSVVTWT 79 Query: 2311 ALISGSVQNGHFTSAFSYFSGMRRDNIFPNDFTFPCLFKASASLNSPFLGQQLHGLAVKL 2132 ALI+GSVQNGHFTSA +FS MRR ++ PNDFTFPCLFKASA L+ P +GQQLH LA+K Sbjct: 80 ALIAGSVQNGHFTSALLHFSDMRRQSVQPNDFTFPCLFKASAFLHYPLMGQQLHALALKG 139 Query: 2131 KLIYDVFVACSAFDMYSKTRMNQDANKVFDRMSRRNIATWNAKISNAVTNGRPREAIVNF 1952 I DVFV CSAFDMY K + + A K+FD M RNIATWNA ISN+V +GRP +A + F Sbjct: 140 SFINDVFVGCSAFDMYCKNGLREYAQKMFDEMPHRNIATWNACISNSVLDGRPYDASLKF 199 Query: 1951 IELLQSGEEAPNSITFCAFFNACADCLNLKLGQQLHGFVIRFGCENEVALLNGLIDFYGK 1772 +ELL+ GEE PNSITFC F NAC+D L LKLGQQLHG+VIRFG ++V++LNG++DFYGK Sbjct: 200 VELLRLGEEPPNSITFCVFLNACSDGLYLKLGQQLHGYVIRFGFGSDVSVLNGMVDFYGK 259 Query: 1771 CKEVEFAEMIFNGMRERNIVSWCSMLAVYEQNNMGEKACQLFVETRKKGMGPTDFLVSSV 1592 C +V+++E++FN + RN VSWC+MLAVYEQN++ + A LF++ RK+G+ PT+F++SSV Sbjct: 260 CHQVKYSELVFNEINVRNGVSWCTMLAVYEQNDIWDNAFMLFLKARKEGIKPTEFMLSSV 319 Query: 1591 ISACAGLARLELGRAVHGIAVKGCIEGNLFVGSALVDMYGKCGCIEDCELAFYEMPERNL 1412 +SACAG+A LELGR++HG+AVK CIE N+FVGSALVDMYGKCG I++CE +FYEMPERNL Sbjct: 320 LSACAGMAVLELGRSIHGLAVKACIEHNVFVGSALVDMYGKCGSIDNCESSFYEMPERNL 379 Query: 1411 ICWNALIGGYAHQGHAERALELSEEMTRESSRMMPNYVTFVSVLTACSRGGMVNQGVDFF 1232 I WNA++GGYAHQG A+ AL L EEMT ES ++P+YVTFV VLTACSR G V G+D F Sbjct: 380 ITWNAVMGGYAHQGCADMALSLFEEMTSESHNVVPSYVTFVCVLTACSRAGAVKIGMDIF 439 Query: 1231 ESMRGKYGILPGAEHYACVVDMLGRAGLVDRAYEFVKNMPMQPTVSIWGALLGACKVYGR 1052 ESM+ KYGI PG EHYACVVD+LGRAGLV+RAY+F+K MP+ PTVS+WGALLGAC+V+G+ Sbjct: 440 
ESMQKKYGIEPGPEHYACVVDILGRAGLVERAYDFIKKMPVPPTVSVWGALLGACRVHGK 499 Query: 1051 PDIGKIAAEKLFELDPNDSGNHVILSNMFAAAGRWEDANLVRKEMKDVGMKKGTGYSWIT 872 P++GK+AA+ LF LDP DSGNHVILSNMFAAAGRW++ANLVRKEMKDVG+ KG G SWI+ Sbjct: 500 PELGKVAADNLFRLDPLDSGNHVILSNMFAAAGRWDEANLVRKEMKDVGITKGAGISWIS 559 Query: 871 VKNTVHVFQAKDESHERNSEIQAMLGKLKKDMKAAGYIPDTNVALYDLQEEEKESEVWYH 692 KN++H+FQAKD +HER EIQAML KL++DMKA GYI DTN ALYDL+EEEKESEVW+H Sbjct: 560 AKNSIHIFQAKDTTHERYPEIQAMLAKLRRDMKAEGYIADTNSALYDLEEEEKESEVWHH 619 Query: 691 SEKIALAFGLIALPPGVPIRINKNLRVCVDCHSAIKFISGIVGREIIVRDNRRFHHFKGN 512 SEKIALAFGLIA+PPGVPIRI KNLRVCVDCHSAIKFISGI GREIIVRDN RFH FK Sbjct: 620 SEKIALAFGLIAIPPGVPIRITKNLRVCVDCHSAIKFISGITGREIIVRDNNRFHSFKDY 679 Query: 511 ECSCRDYW 488 +CSCRDYW Sbjct: 680 QCSCRDYW 687 >ref|XP_004231485.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14850-like [Solanum lycopersicum] Length = 687 Score = 974 bits (2519), Expect = 0.0 Identities = 471/668 (70%), Positives = 557/668 (83%) Frame = -1 Query: 2491 SLRLGKAVHARIIRTLGQYDLPPFLSNHLINMYSKLDRLNSAQLVLSLTRPSSLSVVTWT 2312 SL LG+A+HA IIRT+ + PPFLSNHLIN YSKLD LNSAQL+LSLT P SVVTWT Sbjct: 21 SLLLGRAIHAHIIRTI-EPPFPPFLSNHLINFYSKLDSLNSAQLLLSLTPPPFRSVVTWT 79 Query: 2311 ALISGSVQNGHFTSAFSYFSGMRRDNIFPNDFTFPCLFKASASLNSPFLGQQLHGLAVKL 2132 ALI+GSVQNGHFTSA +FS MR ++ PNDFTFPCLFKASA L+ P +G QLH LA+K Sbjct: 80 ALIAGSVQNGHFTSALLHFSDMRCQSVQPNDFTFPCLFKASAFLHYPLMGLQLHALALKG 139 Query: 2131 KLIYDVFVACSAFDMYSKTRMNQDANKVFDRMSRRNIATWNAKISNAVTNGRPREAIVNF 1952 I D FV CSAFDMY KT + + A KVFD M RNIATWNA ISN+V +GRP +A + F Sbjct: 140 SFINDAFVGCSAFDMYCKTGLREYAQKVFDEMPHRNIATWNACISNSVLDGRPYDASLKF 199 Query: 1951 IELLQSGEEAPNSITFCAFFNACADCLNLKLGQQLHGFVIRFGCENEVALLNGLIDFYGK 1772 +ELL+ GEE PNSITF F NAC+D L LKLGQQLHG+VIR G ++V++LNG++DFYGK Sbjct: 200 VELLRLGEEPPNSITFSVFLNACSDGLYLKLGQQLHGYVIRLGFGSDVSVLNGMVDFYGK 259 Query: 1771 CKEVEFAEMIFNGMRERNIVSWCSMLAVYEQNNMGEKACQLFVETRKKGMGPTDFLVSSV 1592 C +V+++E++FN + N VSW +MLAVYEQN++ +KA LF++ RK+G+ PT+F+VSSV Sbjct: 260 
CHQVKYSELVFNEINVCNGVSWSTMLAVYEQNDIWDKAFMLFLKARKEGIKPTEFMVSSV 319 Query: 1591 ISACAGLARLELGRAVHGIAVKGCIEGNLFVGSALVDMYGKCGCIEDCELAFYEMPERNL 1412 +SACAG A LELGR++HG+AVK CIE N+FVGSALVDMYGKCG IE+CE AFYEMPERNL Sbjct: 320 LSACAGTAVLELGRSIHGLAVKACIEHNVFVGSALVDMYGKCGSIENCESAFYEMPERNL 379 Query: 1411 ICWNALIGGYAHQGHAERALELSEEMTRESSRMMPNYVTFVSVLTACSRGGMVNQGVDFF 1232 I WNA++GGYAHQG A+ AL L EEMT ES ++P+YVTF+ VLTACSR G V G+D F Sbjct: 380 ITWNAVMGGYAHQGCADMALRLFEEMTSESHDVVPSYVTFICVLTACSRAGAVKIGMDIF 439 Query: 1231 ESMRGKYGILPGAEHYACVVDMLGRAGLVDRAYEFVKNMPMQPTVSIWGALLGACKVYGR 1052 ESMR KYGI PG EHYACVVD+LGRAGLV+RAY+F+K MP+ PTVS+WGALLGAC+V+G+ Sbjct: 440 ESMRKKYGIEPGPEHYACVVDILGRAGLVERAYDFIKKMPVPPTVSVWGALLGACRVHGK 499 Query: 1051 PDIGKIAAEKLFELDPNDSGNHVILSNMFAAAGRWEDANLVRKEMKDVGMKKGTGYSWIT 872 P++GK+AA+ LF LDP DSGNHV+LSNMFAAAGRW +ANLVRKEMKDVG+ KG G SWI+ Sbjct: 500 PELGKVAADNLFRLDPLDSGNHVVLSNMFAAAGRWHEANLVRKEMKDVGITKGAGISWIS 559 Query: 871 VKNTVHVFQAKDESHERNSEIQAMLGKLKKDMKAAGYIPDTNVALYDLQEEEKESEVWYH 692 KN++HVFQAKD +HER EIQAML KL++DMKA GYI DTN ALYDL+EEEKESEVW+H Sbjct: 560 AKNSIHVFQAKDTTHERYPEIQAMLAKLRRDMKAEGYIADTNSALYDLEEEEKESEVWHH 619 Query: 691 SEKIALAFGLIALPPGVPIRINKNLRVCVDCHSAIKFISGIVGREIIVRDNRRFHHFKGN 512 SEKIALAFGLI +PPGVPIRI KNLRVCVDCHSAIKFISGI GREI+VRDN RFH FK Sbjct: 620 SEKIALAFGLITIPPGVPIRITKNLRVCVDCHSAIKFISGITGREIVVRDNNRFHSFKDY 679 Query: 511 ECSCRDYW 488 +CSCRDYW Sbjct: 680 QCSCRDYW 687 >ref|XP_006419949.1| hypothetical protein CICLE_v10006593mg [Citrus clementina] gi|557521822|gb|ESR33189.1| hypothetical protein CICLE_v10006593mg [Citrus clementina] Length = 686 Score = 945 bits (2442), Expect = 0.0 Identities = 460/689 (66%), Positives = 553/689 (80%) Frame = -1 Query: 2554 MPFXXXXXXXXXXXXXXSTHSSLRLGKAVHARIIRTLGQYDLPPFLSNHLINMYSKLDRL 2375 MPF ST SS LG+ VHA +IRTL + +P LSN+LINMYSKLD Sbjct: 1 MPFHAPDSLGTLLETAVSTRSS-SLGRVVHAYVIRTLANH-VPSTLSNYLINMYSKLDLP 58 Query: 2374 NSAQLVLSLTRPSSLSVVTWTALISGSVQNGHFTSAFSYFSGMRRDNIFPNDFTFPCLFK 2195 N AQLVL LT S +VV+WTALISG VQNGHFTSAF +F+ MR 
+ I PNDFTFPCLFK Sbjct: 59 NPAQLVLQLTPVRSRTVVSWTALISGLVQNGHFTSAFLHFTNMRLECISPNDFTFPCLFK 118 Query: 2194 ASASLNSPFLGQQLHGLAVKLKLIYDVFVACSAFDMYSKTRMNQDANKVFDRMSRRNIAT 2015 AS++L+ P G+QLH LA+K I+DVFV CSAFDMYSKT + DA+K+FD M RN+AT Sbjct: 119 ASSALHIPVTGKQLHALALKSGQIHDVFVGCSAFDMYSKTGLKDDADKMFDEMPERNLAT 178 Query: 2014 WNAKISNAVTNGRPREAIVNFIELLQSGEEAPNSITFCAFFNACADCLNLKLGQQLHGFV 1835 WNA ISNAV GRP+ AI FI L ++G E P+ ITFCAF NAC+DCL L+LG+QLHGF+ Sbjct: 179 WNAYISNAVLGGRPKNAIDAFINLRRTGGE-PDLITFCAFLNACSDCLLLQLGRQLHGFL 237 Query: 1834 IRFGCENEVALLNGLIDFYGKCKEVEFAEMIFNGMRERNIVSWCSMLAVYEQNNMGEKAC 1655 +R G + V++ NGL+DFYGKC EV A+ +F+G+ ++N VSWCSMLAVY QN E C Sbjct: 238 VRSGFDGNVSVCNGLVDFYGKCNEVGLAKAVFDGIIDKNDVSWCSMLAVYVQNYEEENGC 297 Query: 1654 QLFVETRKKGMGPTDFLVSSVISACAGLARLELGRAVHGIAVKGCIEGNLFVGSALVDMY 1475 ++F+ R++G+ P DF++SSV+SACA +A LELGR+VH +AVK C+EGN+FVGSALVDMY Sbjct: 298 RMFLTARREGVEPKDFMISSVLSACARIAGLELGRSVHAVAVKACVEGNIFVGSALVDMY 357 Query: 1474 GKCGCIEDCELAFYEMPERNLICWNALIGGYAHQGHAERALELSEEMTRESSRMMPNYVT 1295 GKCG IED E+AF +MPERNL+CWNA+IGGYAHQGHA+ AL EEMT +PNYVT Sbjct: 358 GKCGSIEDAEIAFNKMPERNLVCWNAIIGGYAHQGHADMALSSFEEMTSMRCEAVPNYVT 417 Query: 1294 FVSVLTACSRGGMVNQGVDFFESMRGKYGILPGAEHYACVVDMLGRAGLVDRAYEFVKNM 1115 V VL+ACSR G V +G++ F SM KYGI PGAEHYACVVD+LGRAGLVDRAYE +K M Sbjct: 418 LVCVLSACSRAGAVEKGMEIFYSMTLKYGIKPGAEHYACVVDLLGRAGLVDRAYEIIKEM 477 Query: 1114 PMQPTVSIWGALLGACKVYGRPDIGKIAAEKLFELDPNDSGNHVILSNMFAAAGRWEDAN 935 PM+PT+S+WGALL AC+VYG+P++G+IAA+ LF+LDPNDSGNHV+LSNMFAA GRWE+A+ Sbjct: 478 PMRPTISVWGALLNACRVYGKPELGRIAADNLFKLDPNDSGNHVLLSNMFAATGRWEEAD 537 Query: 934 LVRKEMKDVGMKKGTGYSWITVKNTVHVFQAKDESHERNSEIQAMLGKLKKDMKAAGYIP 755 LVRKEMKDVG+KKG G SWI+VKN +H+FQAKD SHERN+EIQAML KL+++MKAAGYIP Sbjct: 538 LVRKEMKDVGIKKGAGCSWISVKNRIHIFQAKDTSHERNTEIQAMLTKLREEMKAAGYIP 597 Query: 754 DTNVALYDLQEEEKESEVWYHSEKIALAFGLIALPPGVPIRINKNLRVCVDCHSAIKFIS 575 DTN ALYD++EEEK +EV +HSEKIALAFGLIA+PPGVPIRI KNLR+C DCHSA KFIS Sbjct: 598 DTNFALYDVEEEEKMTEVGHHSEKIALAFGLIAIPPGVPIRITKNLRICGDCHSAFKFIS 657 
Query: 574 GIVGREIIVRDNRRFHHFKGNECSCRDYW 488 GIVGRE+IVRDN RFH F CSC DYW Sbjct: 658 GIVGREVIVRDNNRFHRFWDGYCSCSDYW 686 >ref|XP_006489403.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14850-like isoform X1 [Citrus sinensis] gi|568872496|ref|XP_006489404.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14850-like isoform X2 [Citrus sinensis] Length = 686 Score = 942 bits (2435), Expect = 0.0 Identities = 459/689 (66%), Positives = 551/689 (79%) Frame = -1 Query: 2554 MPFXXXXXXXXXXXXXXSTHSSLRLGKAVHARIIRTLGQYDLPPFLSNHLINMYSKLDRL 2375 MPF ST SS LG+ VHA +IRTL + +P LSN+LINMYSK D Sbjct: 1 MPFHAPDSLGTLLETAVSTRSS-SLGRVVHAYVIRTLANH-VPSTLSNYLINMYSKFDLP 58 Query: 2374 NSAQLVLSLTRPSSLSVVTWTALISGSVQNGHFTSAFSYFSGMRRDNIFPNDFTFPCLFK 2195 N AQLVL LT S +VV+WTALISG VQNGHFTSAF +F+ MR + I PNDFTFPCLFK Sbjct: 59 NPAQLVLQLTPVRSRTVVSWTALISGLVQNGHFTSAFLHFTNMRLECISPNDFTFPCLFK 118 Query: 2194 ASASLNSPFLGQQLHGLAVKLKLIYDVFVACSAFDMYSKTRMNQDANKVFDRMSRRNIAT 2015 AS++L+ P G+QLH LA+K I+DVFV CSAFDMYSKT + DA+K+FD M RN+AT Sbjct: 119 ASSALHIPVTGKQLHALALKSGQIHDVFVGCSAFDMYSKTGLKDDADKMFDEMPERNLAT 178 Query: 2014 WNAKISNAVTNGRPREAIVNFIELLQSGEEAPNSITFCAFFNACADCLNLKLGQQLHGFV 1835 WNA ISNAV GRP+ AI FI L ++G E P+ ITFCAF NAC+DCL L+LG+QLHGF+ Sbjct: 179 WNAYISNAVLGGRPKNAIDAFINLRRTGGE-PDLITFCAFLNACSDCLLLQLGRQLHGFL 237 Query: 1834 IRFGCENEVALLNGLIDFYGKCKEVEFAEMIFNGMRERNIVSWCSMLAVYEQNNMGEKAC 1655 +R G + V++ NGL+DFYGKC EV A+ +F+G+ ++N VSWCSMLAVY QN E C Sbjct: 238 VRSGFDGNVSVCNGLVDFYGKCNEVGLAKAVFDGIIDKNDVSWCSMLAVYVQNYEEENGC 297 Query: 1654 QLFVETRKKGMGPTDFLVSSVISACAGLARLELGRAVHGIAVKGCIEGNLFVGSALVDMY 1475 ++F+ R++G+ P DF++SSV+SACA +A LELGR+VH +AVK C+EGN+FVGSALVDMY Sbjct: 298 RMFLTARREGVEPKDFMISSVLSACARIAGLELGRSVHAVAVKACVEGNIFVGSALVDMY 357 Query: 1474 GKCGCIEDCELAFYEMPERNLICWNALIGGYAHQGHAERALELSEEMTRESSRMMPNYVT 1295 GKCG IED E+AF +MPERNL+CWNA+IGGYAHQGHA+ AL EEMT +PNYVT Sbjct: 358 GKCGSIEDAEIAFNKMPERNLVCWNAIIGGYAHQGHADMALSSFEEMTSMRCEAVPNYVT 417 Query: 1294 
FVSVLTACSRGGMVNQGVDFFESMRGKYGILPGAEHYACVVDMLGRAGLVDRAYEFVKNM 1115 V VL+ACSR G V +G+ F SM KYGI PGAEHYACVVD+LGRAGLVDRAYE +K M Sbjct: 418 LVCVLSACSRAGAVEKGMKIFYSMTLKYGIKPGAEHYACVVDLLGRAGLVDRAYEIIKEM 477 Query: 1114 PMQPTVSIWGALLGACKVYGRPDIGKIAAEKLFELDPNDSGNHVILSNMFAAAGRWEDAN 935 PM+PT+S+WGALL AC+VYG+P++G+IAA+ LF+LDPNDSGNHV+LSNMFAA GRWE+A+ Sbjct: 478 PMRPTISVWGALLNACRVYGKPELGRIAADNLFKLDPNDSGNHVLLSNMFAATGRWEEAD 537 Query: 934 LVRKEMKDVGMKKGTGYSWITVKNTVHVFQAKDESHERNSEIQAMLGKLKKDMKAAGYIP 755 LVRKEMKDVG+KKG G SWI+VKN +H+FQAKD SHERN+EIQAML KL+++MKAAGYIP Sbjct: 538 LVRKEMKDVGIKKGAGCSWISVKNRIHIFQAKDTSHERNTEIQAMLTKLREEMKAAGYIP 597 Query: 754 DTNVALYDLQEEEKESEVWYHSEKIALAFGLIALPPGVPIRINKNLRVCVDCHSAIKFIS 575 DTN ALYD++EEEK +EV +HSEKIALAFGLIA+PPGVPIRI KNLR+C DCHSA KFIS Sbjct: 598 DTNFALYDVEEEEKMTEVGHHSEKIALAFGLIAIPPGVPIRITKNLRICGDCHSAFKFIS 657 Query: 574 GIVGREIIVRDNRRFHHFKGNECSCRDYW 488 GIVGRE+IVRDN RFH F CSC DYW Sbjct: 658 GIVGREVIVRDNNRFHRFWDGYCSCSDYW 686 >ref|XP_002282622.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14850-like [Vitis vinifera] Length = 684 Score = 939 bits (2426), Expect = 0.0 Identities = 458/666 (68%), Positives = 547/666 (82%) Frame = -1 Query: 2485 RLGKAVHARIIRTLGQYDLPPFLSNHLINMYSKLDRLNSAQLVLSLTRPSSLSVVTWTAL 2306 RLG+A HA+II+TL LP F+ NHL+NMYSKLDR NSAQL+LSLT + SVVTWTAL Sbjct: 23 RLGRAAHAQIIKTLDN-PLPSFIYNHLVNMYSKLDRPNSAQLLLSLT--PNRSVVTWTAL 79 Query: 2305 ISGSVQNGHFTSAFSYFSGMRRDNIFPNDFTFPCLFKASASLNSPFLGQQLHGLAVKLKL 2126 I+GSVQNG FTSA +FS MRRD+I PNDFTFPC FKAS SL SP +G+Q+H LAVK Sbjct: 80 IAGSVQNGRFTSALFHFSNMRRDSIQPNDFTFPCAFKASGSLRSPLVGKQVHALAVKAGQ 139 Query: 2125 IYDVFVACSAFDMYSKTRMNQDANKVFDRMSRRNIATWNAKISNAVTNGRPREAIVNFIE 1946 I DVFV CSAFDMYSK + ++A K+FD M RNIATWNA +SN+V GR +A+ FIE Sbjct: 140 ISDVFVGCSAFDMYSKAGLTEEARKMFDEMPERNIATWNAYLSNSVLEGRYDDALTAFIE 199 Query: 1945 LLQSGEEAPNSITFCAFFNACADCLNLKLGQQLHGFVIRFGCENEVALLNGLIDFYGKCK 1766 G E PN ITFCAF NACA L+LG+QLHGFV++ G E +V++ NGLIDFYGKC Sbjct: 200 
FRHEGWE-PNLITFCAFLNACAGASYLRLGRQLHGFVLQSGFEADVSVANGLIDFYGKCH 258 Query: 1765 EVEFAEMIFNGMRERNIVSWCSMLAVYEQNNMGEKACQLFVETRKKGMGPTDFLVSSVIS 1586 +V +E+IF+G+ + N VSWCSM+ Y QN+ EKAC +F+ RK+G+ PTDF+VSSV+S Sbjct: 259 QVGCSEIIFSGISKPNDVSWCSMIVSYVQNDEEEKACLVFLRARKEGIEPTDFMVSSVLS 318 Query: 1585 ACAGLARLELGRAVHGIAVKGCIEGNLFVGSALVDMYGKCGCIEDCELAFYEMPERNLIC 1406 ACAGL+ LE+G++VH +AVK C+ GN+FVGSALVDMYGKCG IED E AF EMPERNL+ Sbjct: 319 ACAGLSVLEVGKSVHTLAVKACVVGNIFVGSALVDMYGKCGSIEDAERAFDEMPERNLVT 378 Query: 1405 WNALIGGYAHQGHAERALELSEEMTRESSRMMPNYVTFVSVLTACSRGGMVNQGVDFFES 1226 WNA+IGGYAHQG A+ A+ L +EMT S R+ PNYVTFV VL+ACSR G VN G++ FES Sbjct: 379 WNAMIGGYAHQGQADMAVTLFDEMTCGSHRVAPNYVTFVCVLSACSRAGSVNVGMEIFES 438 Query: 1225 MRGKYGILPGAEHYACVVDMLGRAGLVDRAYEFVKNMPMQPTVSIWGALLGACKVYGRPD 1046 MRG+YGI PGAEHYACVVD+LGRAG+V++AY+F+K MP++PTVS+WGALLGA K++G+ + Sbjct: 439 MRGRYGIEPGAEHYACVVDLLGRAGMVEQAYQFIKKMPIRPTVSVWGALLGASKMFGKSE 498 Query: 1045 IGKIAAEKLFELDPNDSGNHVILSNMFAAAGRWEDANLVRKEMKDVGMKKGTGYSWITVK 866 +GK+AA+ LFELDP DSGNHV+LSNMFAAAGRWE+A LVRKEMKDVG+KKG G SWIT Sbjct: 499 LGKVAADNLFELDPLDSGNHVLLSNMFAAAGRWEEATLVRKEMKDVGIKKGAGCSWITAG 558 Query: 865 NTVHVFQAKDESHERNSEIQAMLGKLKKDMKAAGYIPDTNVALYDLQEEEKESEVWYHSE 686 N VHVFQAKD SHERNSEIQAML KL+ +M+AAGYIPDT+ AL+DL+EEEK EVWYHSE Sbjct: 559 NAVHVFQAKDTSHERNSEIQAMLAKLRGEMEAAGYIPDTSFALFDLEEEEKAMEVWYHSE 618 Query: 685 KIALAFGLIALPPGVPIRINKNLRVCVDCHSAIKFISGIVGREIIVRDNRRFHHFKGNEC 506 KIALAFGLI++P GVPIRI KNLR+C DCHSAIKFISGIVGREIIVRDN FH F+ N+C Sbjct: 619 KIALAFGLISIPAGVPIRITKNLRICGDCHSAIKFISGIVGREIIVRDNNLFHRFRDNQC 678 Query: 505 SCRDYW 488 SCRDYW Sbjct: 679 SCRDYW 684 >gb|EXB63452.1| hypothetical protein L484_005415 [Morus notabilis] Length = 678 Score = 934 bits (2414), Expect = 0.0 Identities = 451/668 (67%), Positives = 541/668 (80%) Frame = -1 Query: 2491 SLRLGKAVHARIIRTLGQYDLPPFLSNHLINMYSKLDRLNSAQLVLSLTRPSSLSVVTWT 2312 S RLG+ VHA+IIR LG LP FL NHL++MYSKLD +SAQLVLSLT S SVVTW+ Sbjct: 15 SARLGRVVHAQIIRNLGS-SLPAFLCNHLVHMYSKLDLPDSAQLVLSLT--PSRSVVTWS 71 
Query: 2311 ALISGSVQNGHFTSAFSYFSGMRRDNIFPNDFTFPCLFKASASLNSPFLGQQLHGLAVKL 2132 +LI+G V NGHF SA +FSGMR D I PNDFTFPC+FKASASL F+G+Q+H +A K+ Sbjct: 72 SLIAGCVHNGHFASALHHFSGMRLDCIQPNDFTFPCIFKASASLGMSFVGRQVHAVAFKI 131 Query: 2131 KLIYDVFVACSAFDMYSKTRMNQDANKVFDRMSRRNIATWNAKISNAVTNGRPREAIVNF 1952 I+DVFV C AFDMY KT + DA KVFD M RN TWNA ISNAV +GRP I F Sbjct: 132 GQIHDVFVGCGAFDMYCKTGLWDDACKVFDEMPERNSTTWNAYISNAVLSGRPIYGIKKF 191 Query: 1951 IELLQSGEEAPNSITFCAFFNACADCLNLKLGQQLHGFVIRFGCENEVALLNGLIDFYGK 1772 IE L+ G E P+SITFC+F NAC+D +L+LG+QLHGFVIR G V ++NGLIDFYGK Sbjct: 192 IEFLRVGGE-PDSITFCSFLNACSDMSDLELGRQLHGFVIRCGYGKYVKVMNGLIDFYGK 250 Query: 1771 CKEVEFAEMIFNGMRERNIVSWCSMLAVYEQNNMGEKACQLFVETRKKGMGPTDFLVSSV 1592 C+EVE +EM+F+ + RN VSWCSM+AVY QN+ E AC++F++ RK+G+ P DF++S+ Sbjct: 251 CQEVESSEMVFDRIHLRNDVSWCSMMAVYVQNDEEENACEVFLKARKEGLVPNDFMISTF 310 Query: 1591 ISACAGLARLELGRAVHGIAVKGCIEGNLFVGSALVDMYGKCGCIEDCELAFYEMPERNL 1412 +SACAGL+ +LGR+ H +AVK C+EGN+FVGSALVDMYGKCG I D E F EMP RN Sbjct: 311 LSACAGLSDFDLGRSGHTLAVKACVEGNIFVGSALVDMYGKCGSINDAEREFNEMPHRNS 370 Query: 1411 ICWNALIGGYAHQGHAERALELSEEMTRESSRMMPNYVTFVSVLTACSRGGMVNQGVDFF 1232 I WNA+I GYAHQGHA+ AL L E+MT + ++PNYVT VS+L+ACS+ G V +G++ F Sbjct: 371 ITWNAMINGYAHQGHADMALALCEKMTSSNCEVLPNYVTLVSILSACSKAGAVEKGMEIF 430 Query: 1231 ESMRGKYGILPGAEHYACVVDMLGRAGLVDRAYEFVKNMPMQPTVSIWGALLGACKVYGR 1052 ESMR +YG+ PG EHYACVVD+LGRAGLV+RAYEF+K MP+ PT S+WGALLGACK+Y + Sbjct: 431 ESMRARYGVEPGVEHYACVVDLLGRAGLVERAYEFIKKMPILPTTSVWGALLGACKMYRK 490 Query: 1051 PDIGKIAAEKLFELDPNDSGNHVILSNMFAAAGRWEDANLVRKEMKDVGMKKGTGYSWIT 872 ++G+IAA+ LF+LDP DSGNHV+LSNMFAAAGRWE+A LVRKEMKDVG+KKG GYSWIT Sbjct: 491 SELGEIAADNLFKLDPKDSGNHVVLSNMFAAAGRWEEATLVRKEMKDVGIKKGAGYSWIT 550 Query: 871 VKNTVHVFQAKDESHERNSEIQAMLGKLKKDMKAAGYIPDTNVALYDLQEEEKESEVWYH 692 VKNTVH+FQAKD SHERNSEIQ ML KL++ +K AGY PDTN AL+DL+EEEK SEVWYH Sbjct: 551 VKNTVHIFQAKDTSHERNSEIQEMLTKLRRMVKEAGYFPDTNYALFDLEEEEKTSEVWYH 610 Query: 691 SEKIALAFGLIALPPGVPIRINKNLRVCVDCHSAIKFISGIVGREIIVRDNRRFHHFKGN 512 
SEK+ALAFGL+A+PPGVPIRI KNLR+C DCHSAIKFISGIVGREIIVRDN RFH FK Sbjct: 611 SEKLALAFGLVAIPPGVPIRITKNLRICGDCHSAIKFISGIVGREIIVRDNNRFHQFKDG 670 Query: 511 ECSCRDYW 488 +CSCRDYW Sbjct: 671 KCSCRDYW 678 >gb|EMJ26873.1| hypothetical protein PRUPE_ppa002602mg [Prunus persica] Length = 653 Score = 934 bits (2413), Expect = 0.0 Identities = 457/657 (69%), Positives = 534/657 (81%) Frame = -1 Query: 2458 IIRTLGQYDLPPFLSNHLINMYSKLDRLNSAQLVLSLTRPSSLSVVTWTALISGSVQNGH 2279 +IRTL LP FLSNHL+NMYSKLD +SAQLVL L S SVVTWTALI+GSVQNGH Sbjct: 1 MIRTLDA-PLPSFLSNHLVNMYSKLDLPDSAQLVLQLN--PSRSVVTWTALIAGSVQNGH 57 Query: 2278 FTSAFSYFSGMRRDNIFPNDFTFPCLFKASASLNSPFLGQQLHGLAVKLKLIYDVFVACS 2099 F SA +F+ M R+++ PNDFTFPC FKAS SL P G+Q+H LAVK I DVFV CS Sbjct: 58 FASAILHFANMLRESVQPNDFTFPCAFKASGSLRLPATGKQVHALAVKAGQICDVFVGCS 117 Query: 2098 AFDMYSKTRMNQDANKVFDRMSRRNIATWNAKISNAVTNGRPREAIVNFIELLQSGEEAP 1919 AFDMY KT + +A KVFD M RN+ATWNA +SNAV +GRP+ A+ FIE L++G E P Sbjct: 118 AFDMYCKTGLRDEARKVFDEMPERNLATWNAYMSNAVLDGRPQNAVYKFIEFLRAGGE-P 176 Query: 1918 NSITFCAFFNACADCLNLKLGQQLHGFVIRFGCENEVALLNGLIDFYGKCKEVEFAEMIF 1739 NSITFCAF NAC+D NL+LG+QLHGFV+R G +V++LNGLIDFYGKC+EV + M+F Sbjct: 177 NSITFCAFLNACSDTSNLELGRQLHGFVMRCGFGKDVSVLNGLIDFYGKCREVGSSMMVF 236 Query: 1738 NGMRERNIVSWCSMLAVYEQNNMGEKACQLFVETRKKGMGPTDFLVSSVISACAGLARLE 1559 + + +RN VSWCS++A QN+ E AC+LF+ RK+G+ PTDF+VSSV+SAC+GLA LE Sbjct: 237 DTIDKRNDVSWCSLVAACVQNDEEEMACELFLRARKEGVEPTDFMVSSVLSACSGLAWLE 296 Query: 1558 LGRAVHGIAVKGCIEGNLFVGSALVDMYGKCGCIEDCELAFYEMPERNLICWNALIGGYA 1379 GR+VH IAVK C+EGNLFVGSALVDMYGKCG IED + AF MP RNLI WNA++GGYA Sbjct: 297 QGRSVHAIAVKACVEGNLFVGSALVDMYGKCGSIEDAKCAFNGMPSRNLISWNAMVGGYA 356 Query: 1378 HQGHAERALELSEEMTRESSRMMPNYVTFVSVLTACSRGGMVNQGVDFFESMRGKYGILP 1199 HQGHA AL L EEMT S + PNYVT V VL+ACSR G V G+ FESM+ KYGI P Sbjct: 357 HQGHANMALVLFEEMTVRSHEVKPNYVTLVCVLSACSRAGAVETGMQIFESMKAKYGIEP 416 Query: 1198 GAEHYACVVDMLGRAGLVDRAYEFVKNMPMQPTVSIWGALLGACKVYGRPDIGKIAAEKL 1019 GAEHYACVVD+LGRAG+V+RAYEF+ MP++PT+SIWGALLGACK+Y + ++G++AA+KL Sbjct: 417 
GAEHYACVVDLLGRAGMVERAYEFITKMPIRPTISIWGALLGACKMYRKTELGRVAADKL 476 Query: 1018 FELDPNDSGNHVILSNMFAAAGRWEDANLVRKEMKDVGMKKGTGYSWITVKNTVHVFQAK 839 FELDP DSGNHVILSNMFAAAGRWE+A LVRK MKDVG+KKG GYSWI VKN VHVFQAK Sbjct: 477 FELDPKDSGNHVILSNMFAAAGRWEEATLVRKGMKDVGIKKGAGYSWIAVKNAVHVFQAK 536 Query: 838 DESHERNSEIQAMLGKLKKDMKAAGYIPDTNVALYDLQEEEKESEVWYHSEKIALAFGLI 659 D SHERNSEIQAML KL+++M+ AGYI DTN AL+DL+EEEK SEVWYHSEKIALAFGLI Sbjct: 537 DTSHERNSEIQAMLTKLRREMEKAGYIADTNFALFDLEEEEKVSEVWYHSEKIALAFGLI 596 Query: 658 ALPPGVPIRINKNLRVCVDCHSAIKFISGIVGREIIVRDNRRFHHFKGNECSCRDYW 488 A+PPGVPIRI KNLR+C DCH AIKFISGIVGREIIVRDN RFH F+ CSCRDYW Sbjct: 597 AIPPGVPIRITKNLRICGDCHGAIKFISGIVGREIIVRDNNRFHRFRDGHCSCRDYW 653 >ref|XP_004303188.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14850-like [Fragaria vesca subsp. vesca] Length = 684 Score = 929 bits (2400), Expect = 0.0 Identities = 452/671 (67%), Positives = 535/671 (79%) Frame = -1 Query: 2500 THSSLRLGKAVHARIIRTLGQYDLPPFLSNHLINMYSKLDRLNSAQLVLSLTRPSSLSVV 2321 T SSL G+A HA IIRTL Q P FLSNHLINMYSKLD NSAQL+L LT S SVV Sbjct: 19 TRSSLT-GRAAHAHIIRTL-QPPHPSFLSNHLINMYSKLDLPNSAQLLLHLT--PSRSVV 74 Query: 2320 TWTALISGSVQNGHFTSAFSYFSGMRRDNIFPNDFTFPCLFKASASLNSPFLGQQLHGLA 2141 TWTALI+G VQN HF SA F MRRD++ PNDFTFPC FKAS L P +G+Q+H LA Sbjct: 75 TWTALIAGLVQNRHFASALLNFINMRRDSVVPNDFTFPCAFKASGLLRRPVIGKQVHALA 134 Query: 2140 VKLKLIYDVFVACSAFDMYSKTRMNQDANKVFDRMSRRNIATWNAKISNAVTNGRPREAI 1961 VK I DVFV CSAFDMY KT + DA KVFD M RN+ATWNA +SNAV + RP A+ Sbjct: 135 VKAGQICDVFVGCSAFDMYCKTGLGDDAGKVFDEMPERNLATWNAYMSNAVLDRRPVSAV 194 Query: 1960 VNFIELLQSGEEAPNSITFCAFFNACADCLNLKLGQQLHGFVIRFGCENEVALLNGLIDF 1781 F+E +++G E PNSITFCAF NAC+D L+LG+QLHGFV+RFG +V+++NGL+DF Sbjct: 195 EKFVEFVRAGGE-PNSITFCAFLNACSDLSALELGRQLHGFVMRFGFGRDVSVMNGLVDF 253 Query: 1780 YGKCKEVEFAEMIFNGMRERNIVSWCSMLAVYEQNNMGEKACQLFVETRKKGMGPTDFLV 1601 YGKC++V A M+F + + N VSWCSM+A Y QNN EKAC+LF+ R++G+ PTDF+V Sbjct: 254 YGKCRDVGLARMVFERIGQANHVSWCSMVAAYVQNNEEEKACELFLRARREGVEPTDFMV 313 Query: 1600 
SSVISACAGLARLELGRAVHGIAVKGCIEGNLFVGSALVDMYGKCGCIEDCELAFYEMPE 1421 SSV+SAC+GLA LE GR++H +AVK C++GN+FVGSALVDMYGKCG IED E AF MP Sbjct: 314 SSVLSACSGLAWLEQGRSIHALAVKACVDGNVFVGSALVDMYGKCGSIEDAECAFDMMPS 373 Query: 1420 RNLICWNALIGGYAHQGHAERALELSEEMTRESSRMMPNYVTFVSVLTACSRGGMVNQGV 1241 RNLI WNA++GGY HQGHA AL L EEM+ S + PNYVT V VL+ACSR G V +G+ Sbjct: 374 RNLISWNAMVGGYTHQGHANTALALFEEMSDRSHELKPNYVTLVCVLSACSRAGDVQKGM 433 Query: 1240 DFFESMRGKYGILPGAEHYACVVDMLGRAGLVDRAYEFVKNMPMQPTVSIWGALLGACKV 1061 F+SM+ +YG+ PGAEHYACVVD+LGRAG+V+RAYEF+ MP++PT+SIWGALLGACK+ Sbjct: 434 QIFDSMKSRYGVEPGAEHYACVVDLLGRAGMVERAYEFITKMPIRPTISIWGALLGACKM 493 Query: 1060 YGRPDIGKIAAEKLFELDPNDSGNHVILSNMFAAAGRWEDANLVRKEMKDVGMKKGTGYS 881 Y +P++GKIAA+KLFELDP DSGNHV+LSN+ AA GRWE+A LVRKEMKDVG+KKG GYS Sbjct: 494 YKKPELGKIAADKLFELDPKDSGNHVVLSNLLAATGRWEEATLVRKEMKDVGIKKGAGYS 553 Query: 880 WITVKNTVHVFQAKDESHERNSEIQAMLGKLKKDMKAAGYIPDTNVALYDLQEEEKESEV 701 WI VKN VH+FQAKD SHE NSEIQAML L+ M+ AGY+PDTN AL+DL+EEEK SEV Sbjct: 554 WIAVKNAVHIFQAKDTSHEMNSEIQAMLIYLRTKMEEAGYVPDTNFALFDLEEEEKVSEV 613 Query: 700 WYHSEKIALAFGLIALPPGVPIRINKNLRVCVDCHSAIKFISGIVGREIIVRDNRRFHHF 521 WYHSEKIALAFGLIA+P G+PIRINKNLR+C DCH AIKFISGIV REIIVRDN RFH F Sbjct: 614 WYHSEKIALAFGLIAIPSGLPIRINKNLRICGDCHGAIKFISGIVDREIIVRDNNRFHRF 673 Query: 520 KGNECSCRDYW 488 + CSCRDYW Sbjct: 674 REGHCSCRDYW 684 >gb|EOY05698.1| Pentatricopeptide repeat (PPR) superfamily protein isoform 1 [Theobroma cacao] gi|508713802|gb|EOY05699.1| Pentatricopeptide repeat (PPR) superfamily protein isoform 1 [Theobroma cacao] gi|508713804|gb|EOY05701.1| Pentatricopeptide repeat (PPR) superfamily protein isoform 1 [Theobroma cacao] Length = 684 Score = 914 bits (2361), Expect = 0.0 Identities = 447/689 (64%), Positives = 541/689 (78%) Frame = -1 Query: 2554 MPFXXXXXXXXXXXXXXSTHSSLRLGKAVHARIIRTLGQYDLPPFLSNHLINMYSKLDRL 2375 MPF STHS L G+A HA I+++L Q P FLSNHLINMYSK + Sbjct: 1 MPFLTANELGSLLVSAISTHSLL-FGRATHAHILKSL-QIPFPSFLSNHLINMYSKFNLP 58 Query: 2374 
NSAQLVLSLTRPSSLSVVTWTALISGSVQNGHFTSAFSYFSGMRRDNIFPNDFTFPCLFK 2195 NSA LVL T P S SVVTWTALISG VQNGHF SA +FS MR+D I PNDFTFPC FK Sbjct: 59 NSAHLVLLQTPPESRSVVTWTALISGHVQNGHFASALIHFSHMRKDLISPNDFTFPCAFK 118 Query: 2194 ASASLNSPFLGQQLHGLAVKLKLIYDVFVACSAFDMYSKTRMNQDANKVFDRMSRRNIAT 2015 ASA+L SP +G+QLH LA+K I+D FV CS FDMY KT + +A +FD M R++A Sbjct: 119 ASAALRSPVVGKQLHALALKSAQIFDSFVGCSCFDMYLKTGLRGEARNMFDEMPDRSVAM 178 Query: 2014 WNAKISNAVTNGRPREAIVNFIELLQSGEEAPNSITFCAFFNACADCLNLKLGQQLHGFV 1835 WNA ISNAV +G+P A+ FI+ + G E P+ ITFC F NAC+D L+LG+QLHG V Sbjct: 179 WNANISNAVLDGKPSIAVDVFIKFRRVGGE-PDPITFCVFLNACSDAFYLELGRQLHGCV 237 Query: 1834 IRFGCENEVALLNGLIDFYGKCKEVEFAEMIFNGMRERNIVSWCSMLAVYEQNNMGEKAC 1655 IR G + +++ NGL+DFYGKCKEVE A+M+F+GM +RN VSWCS+++ YEQN E AC Sbjct: 238 IRSGFDGNLSVCNGLVDFYGKCKEVESAKMVFDGMEKRNAVSWCSLVSAYEQNYEEENAC 297 Query: 1654 QLFVETRKKGMGPTDFLVSSVISACAGLARLELGRAVHGIAVKGCIEGNLFVGSALVDMY 1475 ++F+ RK+G+ PTDF+VSSVISACAG++ LE GR+VHG+AVK C++GN+FVGSAL+DMY Sbjct: 298 EVFLAARKEGVEPTDFMVSSVISACAGMSGLEFGRSVHGLAVKACVKGNVFVGSALIDMY 357 Query: 1474 GKCGCIEDCELAFYEMPERNLICWNALIGGYAHQGHAERALELSEEMTRESSRMMPNYVT 1295 GKCG I+D E AF+EMPERNL+ WNA+IGGYAHQG A+ AL L ++M S ++PNYVT Sbjct: 358 GKCGSIKDAEQAFHEMPERNLVTWNAMIGGYAHQGCADMALALFQDMM--SCGVVPNYVT 415 Query: 1294 FVSVLTACSRGGMVNQGVDFFESMRGKYGILPGAEHYACVVDMLGRAGLVDRAYEFVKNM 1115 V VL+ACSRGG V GV FESM ++ I PGAEHYACVVD+LGRAG+V+RAY+F+K M Sbjct: 416 LVCVLSACSRGGAVKLGVKIFESMNERFHIEPGAEHYACVVDLLGRAGMVERAYDFIKKM 475 Query: 1114 PMQPTVSIWGALLGACKVYGRPDIGKIAAEKLFELDPNDSGNHVILSNMFAAAGRWEDAN 935 P+ PT+S+WGALL AC+VY +P++G+IAA KLFELDP DSGNHV+LSN+FA+ GRWE+A+ Sbjct: 476 PIAPTISVWGALLNACRVYKKPELGRIAAYKLFELDPKDSGNHVLLSNLFASTGRWEEAD 535 Query: 934 LVRKEMKDVGMKKGTGYSWITVKNTVHVFQAKDESHERNSEIQAMLGKLKKDMKAAGYIP 755 LVRKEMKDVG+KKG G SWITVKN VH FQAKD SHE NS+IQ ML KL+++MK+AGYI Sbjct: 536 LVRKEMKDVGIKKGAGCSWITVKNEVHTFQAKDTSHEMNSKIQEMLAKLRREMKSAGYIA 595 Query: 754 DTNVALYDLQEEEKESEVWYHSEKIALAFGLIALPPGVPIRINKNLRVCVDCHSAIKFIS 575 DTN ALYDL+EEEK SEV 
YHSEKIALAFGLI +PPGVPIRI KNLR+C DCHSA KF+S Sbjct: 596 DTNFALYDLEEEEKISEVGYHSEKIALAFGLIVIPPGVPIRITKNLRICGDCHSAFKFMS 655 Query: 574 GIVGREIIVRDNRRFHHFKGNECSCRDYW 488 GIVGREIIVRDN RFH F+ +CSCRDYW Sbjct: 656 GIVGREIIVRDNNRFHRFRDGQCSCRDYW 684 >gb|ESW15257.1| hypothetical protein PHAVU_007G057700g [Phaseolus vulgaris] Length = 685 Score = 912 bits (2357), Expect = 0.0 Identities = 446/675 (66%), Positives = 542/675 (80%), Gaps = 4/675 (0%) Frame = -1 Query: 2500 THSSLRLGKAVHARIIRTLGQYDLP--PFLSNHLINMYSKLDRLNSAQLVLSLTRPSSLS 2327 THSSL LG+AVHA I+RT +D P FL NHL+NMYSKLDRLNSA+L+LSLT P + Sbjct: 19 THSSL-LGRAVHALILRT---HDTPLSSFLCNHLVNMYSKLDRLNSAELLLSLTNPRT-- 72 Query: 2326 VVTWTALISGSVQNGHFTSAFSYFSGMRRDNIFPNDFTFPCLFKASASLNSPFLGQQLHG 2147 VVTWT+LISG V N FTSAF +FS MRR+++ PNDFTFPC+FKAS SL+ PF G+QLH Sbjct: 73 VVTWTSLISGCVHNRRFTSAFLHFSNMRRESVLPNDFTFPCVFKASGSLHMPFTGKQLHA 132 Query: 2146 LAVKLKLIYDVFVACSAFDMYSKTRMNQDANKVFDRMSRRNIATWNAKISNAVTNGRPRE 1967 LA+K I DVFV CSAFDMYSKT + +A +F+ M RN+ATWNA ISNAV +GR + Sbjct: 133 LALKGGNILDVFVGCSAFDMYSKTGLRVEARNMFEEMPHRNLATWNAYISNAVQDGRCLD 192 Query: 1966 AIVNFIELLQSGEEAPNSITFCAFFNACADCLNLKLGQQLHGFVIRFGCENEVALLNGLI 1787 A+ F + L G E PN+ITFC F NACAD ++L+LG Q+HGF++R +V++ NGLI Sbjct: 193 AVAAFKKFLCEGGE-PNAITFCVFLNACADMVSLELGIQVHGFIVRSRYREDVSVSNGLI 251 Query: 1786 DFYGKCKEVEFAEMIFN--GMRERNIVSWCSMLAVYEQNNMGEKACQLFVETRKKGMGPT 1613 DFYGKC ++ +EM+F+ G RN+VSWCSMLA QN+ E+AC +F++ RK+ + PT Sbjct: 252 DFYGKCGDIVSSEMVFSTIGGGRRNVVSWCSMLAALVQNHEEERACTVFLKARKE-VEPT 310 Query: 1612 DFLVSSVISACAGLARLELGRAVHGIAVKGCIEGNLFVGSALVDMYGKCGCIEDCELAFY 1433 DF++SSV+SACA L LELGR+VH +AVK C+E N++VGSALVD+YGKCG IE E F Sbjct: 311 DFMISSVLSACAELGGLELGRSVHALAVKACVEENIYVGSALVDLYGKCGSIEKAEQVFR 370 Query: 1432 EMPERNLICWNALIGGYAHQGHAERALELSEEMTRESSRMMPNYVTFVSVLTACSRGGMV 1253 EMPE+NL+ WNA+IGGYAH G + AL L EEMT S + PNYVT VSVL+ACSR G V Sbjct: 371 EMPEKNLVTWNAMIGGYAHLGDVDMALSLFEEMTLSSFGITPNYVTLVSVLSACSRAGAV 430 Query: 1252 NQGVDFFESMRGKYGILPGAEHYACVVDMLGRAGLVDRAYEFVKNMPMQPTVSIWGALLG 1073 +G+ 
FESMRG+YGI PGAEHYAC+VD+LGR+GLVDRAYEF+K MP+ PT+S+WGALLG Sbjct: 431 ERGLHIFESMRGRYGIEPGAEHYACIVDLLGRSGLVDRAYEFIKRMPILPTISVWGALLG 490 Query: 1072 ACKVYGRPDIGKIAAEKLFELDPNDSGNHVILSNMFAAAGRWEDANLVRKEMKDVGMKKG 893 +CK++G+ +GKIAAEKLF+LDP+DSGNHV+ SNM A+AGRWE+A +VRKEM+DVG+KK Sbjct: 491 SCKMHGKTKLGKIAAEKLFQLDPDDSGNHVVFSNMLASAGRWEEATIVRKEMRDVGIKKN 550 Query: 892 TGYSWITVKNTVHVFQAKDESHERNSEIQAMLGKLKKDMKAAGYIPDTNVALYDLQEEEK 713 GYSW+ VKN VHVFQAKD SHE NSEIQAML KL+ +MK AGY+PDTN++L+DL+EEEK Sbjct: 551 VGYSWVAVKNRVHVFQAKDSSHENNSEIQAMLAKLRGEMKKAGYVPDTNLSLFDLEEEEK 610 Query: 712 ESEVWYHSEKIALAFGLIALPPGVPIRINKNLRVCVDCHSAIKFISGIVGREIIVRDNRR 533 SEVWYHSEKIALAFGLIALP GVPIRI KNLR+C DCHSAIKFIS IVGREIIVRDN R Sbjct: 611 ASEVWYHSEKIALAFGLIALPHGVPIRITKNLRICADCHSAIKFISKIVGREIIVRDNNR 670 Query: 532 FHHFKGNECSCRDYW 488 FHHFK CSC+DYW Sbjct: 671 FHHFKNGWCSCKDYW 685 >gb|EOY05700.1| Pentatricopeptide repeat (PPR) superfamily protein isoform 3 [Theobroma cacao] Length = 683 Score = 909 bits (2350), Expect = 0.0 Identities = 446/688 (64%), Positives = 540/688 (78%) Frame = -1 Query: 2554 MPFXXXXXXXXXXXXXXSTHSSLRLGKAVHARIIRTLGQYDLPPFLSNHLINMYSKLDRL 2375 MPF STHS L G+A HA I+++L Q P FLSNHLINMYSK + Sbjct: 1 MPFLTANELGSLLVSAISTHSLL-FGRATHAHILKSL-QIPFPSFLSNHLINMYSKFNLP 58 Query: 2374 NSAQLVLSLTRPSSLSVVTWTALISGSVQNGHFTSAFSYFSGMRRDNIFPNDFTFPCLFK 2195 NSA LVL T P S SVVTWTALISG VQNGHF SA +FS MR+D I PNDFTFPC FK Sbjct: 59 NSAHLVLLQTPPESRSVVTWTALISGHVQNGHFASALIHFSHMRKDLISPNDFTFPCAFK 118 Query: 2194 ASASLNSPFLGQQLHGLAVKLKLIYDVFVACSAFDMYSKTRMNQDANKVFDRMSRRNIAT 2015 ASA+L SP +G+QLH LA+K I+D FV CS FDMY KT + +A +FD M R++A Sbjct: 119 ASAALRSPVVGKQLHALALKSAQIFDSFVGCSCFDMYLKTGLRGEARNMFDEMPDRSVAM 178 Query: 2014 WNAKISNAVTNGRPREAIVNFIELLQSGEEAPNSITFCAFFNACADCLNLKLGQQLHGFV 1835 WNA ISNAV +G+P A+ FI+ + G E P+ ITFC F NAC+D L+LG+QLHG V Sbjct: 179 WNANISNAVLDGKPSIAVDVFIKFRRVGGE-PDPITFCVFLNACSDAFYLELGRQLHGCV 237 Query: 1834 IRFGCENEVALLNGLIDFYGKCKEVEFAEMIFNGMRERNIVSWCSMLAVYEQNNMGEKAC 1655 IR G + +++ NGL+DFYGKCKEVE A+M+F+GM +RN 
VSWCS+++ YEQN E AC Sbjct: 238 IRSGFDGNLSVCNGLVDFYGKCKEVESAKMVFDGMEKRNAVSWCSLVSAYEQNYEEENAC 297 Query: 1654 QLFVETRKKGMGPTDFLVSSVISACAGLARLELGRAVHGIAVKGCIEGNLFVGSALVDMY 1475 ++F+ RK+G+ PTDF+VSSVISACAG++ LE GR+VHG+AVK C++GN+FVGSAL+DMY Sbjct: 298 EVFLAARKEGVEPTDFMVSSVISACAGMSGLEFGRSVHGLAVKACVKGNVFVGSALIDMY 357 Query: 1474 GKCGCIEDCELAFYEMPERNLICWNALIGGYAHQGHAERALELSEEMTRESSRMMPNYVT 1295 GKCG I+D E AF+EMPERNL+ WNA+IGGYAHQG A+ AL L ++M S ++PNYVT Sbjct: 358 GKCGSIKDAEQAFHEMPERNLVTWNAMIGGYAHQGCADMALALFQDMM--SCGVVPNYVT 415 Query: 1294 FVSVLTACSRGGMVNQGVDFFESMRGKYGILPGAEHYACVVDMLGRAGLVDRAYEFVKNM 1115 V VL+ACSRGG V GV FESM ++ I PGAEHYACVVD+LGRAG+V+RAY+F+K M Sbjct: 416 LVCVLSACSRGGAVKLGVKIFESMNERFHIEPGAEHYACVVDLLGRAGMVERAYDFIKKM 475 Query: 1114 PMQPTVSIWGALLGACKVYGRPDIGKIAAEKLFELDPNDSGNHVILSNMFAAAGRWEDAN 935 P+ PT+S+WGALL AC+VY +P++G+IAA KLFELDP DSGNHV+LSN+FA+ GRWE+A+ Sbjct: 476 PIAPTISVWGALLNACRVYKKPELGRIAAYKLFELDPKDSGNHVLLSNLFASTGRWEEAD 535 Query: 934 LVRKEMKDVGMKKGTGYSWITVKNTVHVFQAKDESHERNSEIQAMLGKLKKDMKAAGYIP 755 LVRKEMKDVG+KKG G SWITVKN VH FQAKD SHE NS+IQ ML KL+++MK+AGYI Sbjct: 536 LVRKEMKDVGIKKGAGCSWITVKNEVHTFQAKDTSHEMNSKIQEMLAKLRREMKSAGYIA 595 Query: 754 DTNVALYDLQEEEKESEVWYHSEKIALAFGLIALPPGVPIRINKNLRVCVDCHSAIKFIS 575 DTN ALYDL+EEEK SEV YHSEKIALAFGLI +PPGVPIRI KNLR+C DCHSA KF+S Sbjct: 596 DTNFALYDLEEEEKISEVGYHSEKIALAFGLIVIPPGVPIRITKNLRICGDCHSAFKFMS 655 Query: 574 GIVGREIIVRDNRRFHHFKGNECSCRDY 491 GIVGREIIVRDN RFH F+ +CSCRDY Sbjct: 656 GIVGREIIVRDNNRFHRFRDGQCSCRDY 683 >ref|XP_004496720.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14850-like [Cicer arietinum] Length = 684 Score = 902 bits (2332), Expect = 0.0 Identities = 446/672 (66%), Positives = 537/672 (79%), Gaps = 1/672 (0%) Frame = -1 Query: 2500 THSSLRLGKAVHARIIRTLGQYDLPPFLSNHLINMYSKLDRLNSAQLVLSLTRPSSLSVV 2321 T+SS+ LG+AVHA IIRT LP FLSNHL+NMYSKLD LNSAQLVLSLT + VV Sbjct: 19 TNSSI-LGRAVHAHIIRT-HDTPLPSFLSNHLVNMYSKLDLLNSAQLVLSLTHLPT--VV 74 Query: 2320 
TWTALISGSVQNGHFTSAFSYFSGMRRDNIFPNDFTFPCLFKASASLNSPFLGQQLHGLA 2141 TWT+LISG V N F +AF +F+ MRRD++ PNDFTFP +FKASASL+ P G+Q+H LA Sbjct: 75 TWTSLISGCVHNRRFVTAFLHFTNMRRDSVHPNDFTFPGVFKASASLHMPMTGKQVHALA 134 Query: 2140 VKLKLIYDVFVACSAFDMYSKTRMNQDANKVFDRMSRRNIATWNAKISNAVTNGRPREAI 1961 +K IYDVFV CSAFDMY KT + +A +FD M RN ATWNA ISNAV +GR +AI Sbjct: 135 LKGGQIYDVFVGCSAFDMYCKTGLRVEARNMFDEMPHRNSATWNAYISNAVQDGRSLDAI 194 Query: 1960 VNFIELLQSGEEAPNSITFCAFFNACADCLNLKLGQQLHGFVIRFGCENEVALLNGLIDF 1781 F E L PNSITFCAF NAC D L LG+QLH F++R G + +V++ NGLIDF Sbjct: 195 AAFKEFL-CVHGHPNSITFCAFLNACVDTLRSNLGRQLHAFIVRCGYKEDVSVANGLIDF 253 Query: 1780 YGKCKEVEFAEMIFNGM-RERNIVSWCSMLAVYEQNNMGEKACQLFVETRKKGMGPTDFL 1604 YGKC ++ +E++F+ + R+RN+VSWCSMLA QN+ E+AC +F+E RK+ + PTDF+ Sbjct: 254 YGKCGDIVSSELVFSRIGRKRNVVSWCSMLAALVQNHEEERACMVFLEARKE-VEPTDFM 312 Query: 1603 VSSVISACAGLARLELGRAVHGIAVKGCIEGNLFVGSALVDMYGKCGCIEDCELAFYEMP 1424 +SS++SACA L LELGR+VH +AVK C+E N+FVGSALVD+YGKCG IE+ E F EMP Sbjct: 313 ISSMLSACAELGGLELGRSVHALAVKACVEDNIFVGSALVDLYGKCGSIENAEQVFTEMP 372 Query: 1423 ERNLICWNALIGGYAHQGHAERALELSEEMTRESSRMMPNYVTFVSVLTACSRGGMVNQG 1244 ERNL+ WNALIGGYAHQG AL L EEMT S M P+YVT VSVL+ACSR G V +G Sbjct: 373 ERNLVTWNALIGGYAHQGDVGMALRLFEEMTLGSRGMTPSYVTLVSVLSACSRAGAVERG 432 Query: 1243 VDFFESMRGKYGILPGAEHYACVVDMLGRAGLVDRAYEFVKNMPMQPTVSIWGALLGACK 1064 + FESMR YGI PGAEHYAC+VD+LGR+GLVDRAYEF++NMPM+PT+S+WGALLGAC+ Sbjct: 433 MQIFESMRLNYGIEPGAEHYACIVDLLGRSGLVDRAYEFIQNMPMEPTISVWGALLGACR 492 Query: 1063 VYGRPDIGKIAAEKLFELDPNDSGNHVILSNMFAAAGRWEDANLVRKEMKDVGMKKGTGY 884 ++G+ +GKIAAEKLFELD DSGNHV+LSNM A+AGRWE+A ++RKEMKD+G+KK GY Sbjct: 493 MHGKTKLGKIAAEKLFELDHVDSGNHVVLSNMLASAGRWEEATIIRKEMKDIGIKKNVGY 552 Query: 883 SWITVKNTVHVFQAKDESHERNSEIQAMLGKLKKDMKAAGYIPDTNVALYDLQEEEKESE 704 SWI VKN +HVFQAKD SHERN+EIQAMLGKL+++MK AGY+PDTN++L+DL++EEK SE Sbjct: 553 SWIAVKNRIHVFQAKDSSHERNTEIQAMLGKLRREMKEAGYVPDTNLSLFDLEDEEKASE 612 Query: 703 VWYHSEKIALAFGLIALPPGVPIRINKNLRVCVDCHSAIKFISGIVGREIIVRDNRRFHH 524 VWYHSEKIALAFGLIALP VPIRI KNLR+C 
DCHSAIKFIS IVGREIIVRDN RFH Sbjct: 613 VWYHSEKIALAFGLIALPQVVPIRITKNLRICGDCHSAIKFISRIVGREIIVRDNHRFHR 672 Query: 523 FKGNECSCRDYW 488 FK CSC+DYW Sbjct: 673 FKDGCCSCKDYW 684 >ref|XP_003617141.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355518476|gb|AET00100.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 684 Score = 896 bits (2315), Expect = 0.0 Identities = 442/672 (65%), Positives = 532/672 (79%), Gaps = 1/672 (0%) Frame = -1 Query: 2500 THSSLRLGKAVHARIIRTLGQYDLPPFLSNHLINMYSKLDRLNSAQLVLSLTRPSSLSVV 2321 TH S+ LG+ +HA IIRT LP FLSNHL+NMYSKLD LNSAQ VLSLT + VV Sbjct: 19 THCSI-LGRTIHAHIIRT-HVTPLPSFLSNHLVNMYSKLDLLNSAQHVLSLTHLRT--VV 74 Query: 2320 TWTALISGSVQNGHFTSAFSYFSGMRRDNIFPNDFTFPCLFKASASLNSPFLGQQLHGLA 2141 TWT+LISG V N F A +F+ MRRDN+ PNDFTFPC+FKASA + P G+Q+HGLA Sbjct: 75 TWTSLISGCVHNRRFLPALLHFTNMRRDNVQPNDFTFPCVFKASAFVQIPMTGKQIHGLA 134 Query: 2140 VKLKLIYDVFVACSAFDMYSKTRMNQDANKVFDRMSRRNIATWNAKISNAVTNGRPREAI 1961 +K +IYDVFV CS FDMY KT DA +FD M +RN+ATWNA ISNAV + R +AI Sbjct: 135 LKGGMIYDVFVGCSCFDMYCKTGFRGDACNMFDEMPQRNLATWNAYISNAVQDRRSLDAI 194 Query: 1960 VNFIELLQSGEEAPNSITFCAFFNACADCLNLKLGQQLHGFVIRFGCENEVALLNGLIDF 1781 V F E L E PNSITFCAF NAC D + L LG+QLH F++R G + +V++ NGLIDF Sbjct: 195 VAFKEFLCVHGE-PNSITFCAFLNACVDMVRLNLGRQLHAFIVRCGYKEDVSVANGLIDF 253 Query: 1780 YGKCKEVEFAEMIFNGMRER-NIVSWCSMLAVYEQNNMGEKACQLFVETRKKGMGPTDFL 1604 YGKC ++ AEM+FN + R N+VSWCSMLA QN+ E+AC +F++ RK+ + PTDF+ Sbjct: 254 YGKCGDIVSAEMVFNRIGNRKNVVSWCSMLAALVQNHEEERACMVFLQARKE-VEPTDFM 312 Query: 1603 VSSVISACAGLARLELGRAVHGIAVKGCIEGNLFVGSALVDMYGKCGCIEDCELAFYEMP 1424 +SSV+SACA L LELGR+VH +AVK C+E N+FVGSALVDMYGKCG IE+ E F E+P Sbjct: 313 ISSVLSACAELGGLELGRSVHALAVKACVEDNIFVGSALVDMYGKCGSIENAEQVFSELP 372 Query: 1423 ERNLICWNALIGGYAHQGHAERALELSEEMTRESSRMMPNYVTFVSVLTACSRGGMVNQG 1244 ERNL+ WNA+IGGYAHQG + AL L EEMT S + P+YVT +S+L+ CSR G V +G Sbjct: 373 ERNLVTWNAMIGGYAHQGDIDMALRLFEEMTLGSHGIRPSYVTLISILSVCSRVGAVERG 432 Query: 1243 
VDFFESMRGKYGILPGAEHYACVVDMLGRAGLVDRAYEFVKNMPMQPTVSIWGALLGACK 1064 + FESMR YGI PGAEH+ACVVD+LGR+GLVDRAYEF++NM +QPT+S+WGALLGAC+ Sbjct: 433 IQIFESMRLNYGIEPGAEHFACVVDLLGRSGLVDRAYEFIQNMAIQPTISVWGALLGACR 492 Query: 1063 VYGRPDIGKIAAEKLFELDPNDSGNHVILSNMFAAAGRWEDANLVRKEMKDVGMKKGTGY 884 ++G+ ++GKIAAEKLFELD DSGNHV+LSNM A+AGRWE+A +VRKEMKD+G+KK GY Sbjct: 493 MHGKTELGKIAAEKLFELDHVDSGNHVVLSNMLASAGRWEEATVVRKEMKDIGIKKNVGY 552 Query: 883 SWITVKNTVHVFQAKDESHERNSEIQAMLGKLKKDMKAAGYIPDTNVALYDLQEEEKESE 704 SWI VKN +HVFQAKD SH+RNSEIQAMLGKL+ MK AGY+PDTN++L+DL++EEK SE Sbjct: 553 SWIAVKNRIHVFQAKDSSHDRNSEIQAMLGKLRGGMKEAGYVPDTNLSLFDLEDEEKASE 612 Query: 703 VWYHSEKIALAFGLIALPPGVPIRINKNLRVCVDCHSAIKFISGIVGREIIVRDNRRFHH 524 VWYHSEKIALAFGLIALP GVPIRI KNLR+C DCHSAIKFIS IVGREIIVRDN RFH Sbjct: 613 VWYHSEKIALAFGLIALPQGVPIRITKNLRICGDCHSAIKFISRIVGREIIVRDNHRFHR 672 Query: 523 FKGNECSCRDYW 488 FK CSC+DYW Sbjct: 673 FKDGCCSCKDYW 684 >ref|XP_003536531.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14850-like [Glycine max] Length = 686 Score = 894 bits (2309), Expect = 0.0 Identities = 439/673 (65%), Positives = 534/673 (79%), Gaps = 2/673 (0%) Frame = -1 Query: 2500 THSSLRLGKAVHARIIRTLGQYDLPPFLSNHLINMYSKLDRLNSAQLVLSLTRPSSLSVV 2321 + SSL LG+AVHA I+RT LP FL NHL+NMYSKLD NSAQLVLSLT P + VV Sbjct: 20 SRSSL-LGRAVHAHILRT-HDTPLPSFLCNHLVNMYSKLDLPNSAQLVLSLTNPRT--VV 75 Query: 2320 TWTALISGSVQNGHFTSAFSYFSGMRRDNIFPNDFTFPCLFKASASLNSPFLGQQLHGLA 2141 TWT+LISG V N FTSA +FS MRR+ + PNDFTFPC+FKASASL+ P G+QLH LA Sbjct: 76 TWTSLISGCVHNRRFTSALLHFSNMRRECVLPNDFTFPCVFKASASLHMPVTGKQLHALA 135 Query: 2140 VKLKLIYDVFVACSAFDMYSKTRMNQDANKVFDRMSRRNIATWNAKISNAVTNGRPREAI 1961 +K I DVFV CSAFDMYSKT + +A +FD M RN+ATWNA +SNAV +GR +AI Sbjct: 136 LKGGNILDVFVGCSAFDMYSKTGLRPEARNMFDEMPHRNLATWNAYMSNAVQDGRCLDAI 195 Query: 1960 VNFIELLQSGEEAPNSITFCAFFNACADCLNLKLGQQLHGFVIRFGCENEVALLNGLIDF 1781 F + L E PN+ITFCAF NACAD ++L+LG+QLHGF++R +V++ NGLIDF Sbjct: 196 AAFKKFLCVDGE-PNAITFCAFLNACADIVSLELGRQLHGFIVRSRYREDVSVFNGLIDF 254 Query: 1780 
YGKCKEVEFAEMIFN--GMRERNIVSWCSMLAVYEQNNMGEKACQLFVETRKKGMGPTDF 1607 YGKC ++ +E++F+ G RN+VSWCS+LA QN+ E+AC +F++ RK+ + PTDF Sbjct: 255 YGKCGDIVSSELVFSRIGSGRRNVVSWCSLLAALVQNHEEERACMVFLQARKE-VEPTDF 313 Query: 1606 LVSSVISACAGLARLELGRAVHGIAVKGCIEGNLFVGSALVDMYGKCGCIEDCELAFYEM 1427 ++SSV+SACA L LELGR+VH +A+K C+E N+FVGSALVD+YGKCG IE E F EM Sbjct: 314 MISSVLSACAELGGLELGRSVHALALKACVEENIFVGSALVDLYGKCGSIEYAEQVFREM 373 Query: 1426 PERNLICWNALIGGYAHQGHAERALELSEEMTRESSRMMPNYVTFVSVLTACSRGGMVNQ 1247 PERNL+ WNA+IGGYAH G + AL L +EMT S + +YVT VSVL+ACSR G V + Sbjct: 374 PERNLVTWNAMIGGYAHLGDVDMALSLFQEMTSGSCGIALSYVTLVSVLSACSRAGAVER 433 Query: 1246 GVDFFESMRGKYGILPGAEHYACVVDMLGRAGLVDRAYEFVKNMPMQPTVSIWGALLGAC 1067 G+ FESMRG+YGI PGAEHYACVVD+LGR+GLVDRAYEF+K MP+ PT+S+WGALLGAC Sbjct: 434 GLQIFESMRGRYGIEPGAEHYACVVDLLGRSGLVDRAYEFIKRMPILPTISVWGALLGAC 493 Query: 1066 KVYGRPDIGKIAAEKLFELDPNDSGNHVILSNMFAAAGRWEDANLVRKEMKDVGMKKGTG 887 K++G+ +GKIAAEKLFELDP+DSGNHV+ SNM A+AGRWE+A +VRKEM+D+G+KK G Sbjct: 494 KMHGKTKLGKIAAEKLFELDPDDSGNHVVFSNMLASAGRWEEATIVRKEMRDIGIKKNVG 553 Query: 886 YSWITVKNTVHVFQAKDESHERNSEIQAMLGKLKKDMKAAGYIPDTNVALYDLQEEEKES 707 YSW+ VKN VHVFQAKD HE+NSEIQAML KL+ +MK AGY+PD N++L+DL+EEEK S Sbjct: 554 YSWVAVKNRVHVFQAKDSFHEKNSEIQAMLAKLRGEMKKAGYVPDANLSLFDLEEEEKAS 613 Query: 706 EVWYHSEKIALAFGLIALPPGVPIRINKNLRVCVDCHSAIKFISGIVGREIIVRDNRRFH 527 EVWYHSEKIALAFGLI LP GVPIRI KNLR+C+DCHSAIKFIS IVGREIIVRDN RFH Sbjct: 614 EVWYHSEKIALAFGLITLPRGVPIRITKNLRICIDCHSAIKFISKIVGREIIVRDNNRFH 673 Query: 526 HFKGNECSCRDYW 488 FK CSC+DYW Sbjct: 674 RFKDGWCSCKDYW 686 >ref|XP_002314694.1| hypothetical protein POPTR_0010s09690g [Populus trichocarpa] gi|222863734|gb|EEF00865.1| hypothetical protein POPTR_0010s09690g [Populus trichocarpa] Length = 631 Score = 889 bits (2297), Expect = 0.0 Identities = 435/637 (68%), Positives = 516/637 (81%) Frame = -1 Query: 2398 MYSKLDRLNSAQLVLSLTRPSSLSVVTWTALISGSVQNGHFTSAFSYFSGMRRDNIFPND 2219 MYSKLD N AQL+L LT + VVTWTALISGSVQNG+F+SA YFS MRR+NI PND Sbjct: 1 
MYSKLDLPNPAQLLLQLT--PTRCVVTWTALISGSVQNGYFSSALLYFSKMRRENIKPND 58 Query: 2218 FTFPCLFKASASLNSPFLGQQLHGLAVKLKLIYDVFVACSAFDMYSKTRMNQDANKVFDR 2039 FTFPC FKAS +L PF G+Q+H +A+KL I D FV CSAFDMYSKT + +A ++FD Sbjct: 59 FTFPCAFKASTALCLPFAGKQIHAIALKLGQINDKFVGCSAFDMYSKTGLKFEAQRLFDE 118 Query: 2038 MSRRNIATWNAKISNAVTNGRPREAIVNFIELLQSGEEAPNSITFCAFFNACADCLNLKL 1859 M RN+A WNA ISNAV +GRP +AI FIE + G E P+ ITFCAF NACAD L L Sbjct: 119 MPPRNVAVWNAYISNAVLDGRPGKAIDKFIEFRRVGGE-PDLITFCAFLNACADARCLDL 177 Query: 1858 GQQLHGFVIRFGCENEVALLNGLIDFYGKCKEVEFAEMIFNGMRERNIVSWCSMLAVYEQ 1679 G+QLHG VIR G E +V++ NG+ID YGKCKEVE AEM+FNGM RN VSWC+M+A EQ Sbjct: 178 GRQLHGLVIRSGFEGDVSVANGIIDVYGKCKEVELAEMVFNGMGRRNSVSWCTMVAACEQ 237 Query: 1678 NNMGEKACQLFVETRKKGMGPTDFLVSSVISACAGLARLELGRAVHGIAVKGCIEGNLFV 1499 N+ EKAC +F+ RK+G+ TD++VSSVISA AG++ LE GR+VH +AVK C+EG++FV Sbjct: 238 NDEKEKACVVFLMGRKEGIELTDYMVSSVISAYAGISGLEFGRSVHALAVKACVEGDIFV 297 Query: 1498 GSALVDMYGKCGCIEDCELAFYEMPERNLICWNALIGGYAHQGHAERALELSEEMTRESS 1319 GSALVDMYGKCG IEDCE F+EMPERNL+ WNA+I GYAHQG + A+ L EEM E+ Sbjct: 298 GSALVDMYGKCGSIEDCEQVFHEMPERNLVSWNAMISGYAHQGDVDMAMTLFEEMQSEA- 356 Query: 1318 RMMPNYVTFVSVLTACSRGGMVNQGVDFFESMRGKYGILPGAEHYACVVDMLGRAGLVDR 1139 + NYVT + VL+ACSRGG V G + FESMR +Y I PGAEHYAC+ DMLGRAG+V+R Sbjct: 357 --VANYVTLICVLSACSRGGAVKLGNEIFESMRDRYRIEPGAEHYACIADMLGRAGMVER 414 Query: 1138 AYEFVKNMPMQPTVSIWGALLGACKVYGRPDIGKIAAEKLFELDPNDSGNHVILSNMFAA 959 AYEFV+ MP++PT+S+WGALL AC+VYG P++GKIAA+ LF+LDP DSGNHV+LSNMFAA Sbjct: 415 AYEFVQKMPIRPTISVWGALLNACRVYGEPELGKIAADNLFKLDPKDSGNHVLLSNMFAA 474 Query: 958 AGRWEDANLVRKEMKDVGMKKGTGYSWITVKNTVHVFQAKDESHERNSEIQAMLGKLKKD 779 AGRW++A LVRKEMKDVG+KKG G SW+T KN VHVFQAKD SHERNSEIQAML KL+ + Sbjct: 475 AGRWDEATLVRKEMKDVGIKKGAGCSWVTAKNKVHVFQAKDTSHERNSEIQAMLVKLRTE 534 Query: 778 MKAAGYIPDTNVALYDLQEEEKESEVWYHSEKIALAFGLIALPPGVPIRINKNLRVCVDC 599 M+AAGY+PDTN ALYDL+EEEK +EV YHSEKIALAFGLIALPPGVPIRI KNLR+C DC Sbjct: 535 MQAAGYMPDTNYALYDLEEEEKMTEVGYHSEKIALAFGLIALPPGVPIRITKNLRICGDC 594 Query: 598 
HSAIKFISGIVGREIIVRDNRRFHHFKGNECSCRDYW 488 HSA KFISGIVGREIIVRDN RFH F+ ++CSCRD+W Sbjct: 595 HSAFKFISGIVGREIIVRDNNRFHRFRDSQCSCRDFW 631 >gb|EPS73044.1| hypothetical protein M569_01710 [Genlisea aurea] Length = 684 Score = 885 bits (2286), Expect = 0.0 Identities = 427/673 (63%), Positives = 533/673 (79%), Gaps = 2/673 (0%) Frame = -1 Query: 2500 THSSLRLGKAVHARIIRTLGQYDLPPFLSNHLINMYSKLDRLNSAQLVLSLTRPSSLSVV 2321 + SSL LG+A H ++++ LG+ PFLS HL+NMYSKLDR +A+++L LT S SVV Sbjct: 19 SRSSL-LGRAAHGQVVKKLGRAP-DPFLSAHLVNMYSKLDRPRTAEVLLFLTPSDSRSVV 76 Query: 2320 TWTALISGSVQNGHFTSAFSYFSGMRRDNIFPNDFTFPCLFKASASLNSPFLGQQLHGLA 2141 WTALI+G++QNGH SA S + MRRD I PNDFT PCLFKA+A+L SP LGQQLH L+ Sbjct: 77 IWTALIAGNIQNGHSASAISNLADMRRDGIQPNDFTLPCLFKAAAALRSPLLGQQLHDLS 136 Query: 2140 VKLKLIYDVFVACSAFDMYSKTRMNQDANKVFDRMSRRNIATWNAKISNAVTNGRPREAI 1961 +KL LI+D FVACSAFDMYSKT + QDA K+FD M RRNIATWNA ISNA P E+I Sbjct: 137 IKLLLIHDAFVACSAFDMYSKTGLLQDAGKMFDEMPRRNIATWNAAISNAAD---PPESI 193 Query: 1960 VNFIELLQSGEEAPNSITFCAFFNACADCLNLKLGQQLHGFVIRFGCENEVALLNGLIDF 1781 +I LL++G+ +PNSI+ CA AC+ L+ GQQLHG +++ G + + ++LN L+DF Sbjct: 194 SRYIALLRTGDASPNSISLCASLTACSAAGFLQEGQQLHGHLVKQGHDADTSVLNTLVDF 253 Query: 1780 YGKCKEVEFAEMIFNGMRERNIVSWCSMLAVYEQNNMGEKACQLFVETRKKGMGPTDFLV 1601 YGKC+ V+ +E +F+ +R+R VSW +MLA+YEQN+MGEKAC+LF+E + G PT+F++ Sbjct: 254 YGKCRHVDHSERVFHSIRDRTTVSWSTMLAIYEQNHMGEKACELFLEATRAGFEPTEFML 313 Query: 1600 SSVISACAGLARLELGRAVHGIAVKGCIEGNLFVGSALVDMYGKCGCIEDCELAFYEMPE 1421 S+ ISACAGLA LE G+A H +AVK +EG+++VGSALVDMYGKCG ++DCE AF +M Sbjct: 314 SAAISACAGLAALESGKAAHALAVKARVEGSVYVGSALVDMYGKCGSVDDCERAFQQMRS 373 Query: 1420 RNLICWNALIGGYAHQGHAERALELSEEMTRESSRMMPNYVTFVSVLTACSRGGMVNQGV 1241 RN +CWNA+IG YAHQG AE AL L M E +R PNYVTFVSVL CSR GMV++G+ Sbjct: 374 RNSVCWNAMIGAYAHQGRAESALRLFRRMGGEGAR--PNYVTFVSVLAGCSRSGMVDEGM 431 Query: 1240 DFFESMRGKYGILPGAEHYACVVDMLGRAGLVDRAYEFVKNM--PMQPTVSIWGALLGAC 1067 F M YGI PGAEHYAC+VDMLGRAG V+RA+ ++ M + PT+SIWGALLGAC Sbjct: 432 
AIFSEMTPVYGIRPGAEHYACIVDMLGRAGQVERAHRIIEEMMPDIPPTISIWGALLGAC 491 Query: 1066 KVYGRPDIGKIAAEKLFELDPNDSGNHVILSNMFAAAGRWEDANLVRKEMKDVGMKKGTG 887 K++G+P++GK+AAE LF LDP DSGNHV+LSNM AA GRW++A+LVR+EMKDVG+KKGTG Sbjct: 492 KMHGKPELGKVAAENLFRLDPMDSGNHVLLSNMLAAEGRWDEASLVREEMKDVGIKKGTG 551 Query: 886 YSWITVKNTVHVFQAKDESHERNSEIQAMLGKLKKDMKAAGYIPDTNVALYDLQEEEKES 707 SWI+V++ +HVFQAKD SH RNSEIQ ML KL++DMKAAGY+PDT VALYDL++EEKES Sbjct: 552 CSWISVRDAIHVFQAKDTSHPRNSEIQTMLTKLRRDMKAAGYVPDTKVALYDLEDEEKES 611 Query: 706 EVWYHSEKIALAFGLIALPPGVPIRINKNLRVCVDCHSAIKFISGIVGREIIVRDNRRFH 527 EVW HSEKIALAFGL+ALP G+PIRI KNLRVC DCHSAIKF+SGIV REI+VRDN R+H Sbjct: 612 EVWSHSEKIALAFGLVALPTGIPIRITKNLRVCNDCHSAIKFVSGIVEREIVVRDNNRYH 671 Query: 526 HFKGNECSCRDYW 488 HF+ N CSC DYW Sbjct: 672 HFRDNRCSCGDYW 684 >ref|NP_193221.3| pentatricopeptide repeat-containing protein LOI1 [Arabidopsis thaliana] gi|122236284|sp|Q0WSH6.1|PP312_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g14850; AltName: Full=Protein LOVASTATIN INSENSITIVE 1 gi|110735893|dbj|BAE99922.1| hypothetical protein [Arabidopsis thaliana] gi|332658109|gb|AEE83509.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 684 Score = 865 bits (2236), Expect = 0.0 Identities = 421/669 (62%), Positives = 517/669 (77%) Frame = -1 Query: 2494 SSLRLGKAVHARIIRTLGQYDLPPFLSNHLINMYSKLDRLNSAQLVLSLTRPSSLSVVTW 2315 SS+RLG+ VHARI++TL PPFL+N+LINMYSKLD SA+LVL LT + +VV+W Sbjct: 20 SSMRLGRVVHARIVKTLDSPP-PPFLANYLINMYSKLDHPESARLVLRLT--PARNVVSW 76 Query: 2314 TALISGSVQNGHFTSAFSYFSGMRRDNIFPNDFTFPCLFKASASLNSPFLGQQLHGLAVK 2135 T+LISG QNGHF++A F MRR+ + PNDFTFPC FKA ASL P G+Q+H LAVK Sbjct: 77 TSLISGLAQNGHFSTALVEFFEMRREGVVPNDFTFPCAFKAVASLRLPVTGKQIHALAVK 136 Query: 2134 LKLIYDVFVACSAFDMYSKTRMNQDANKVFDRMSRRNIATWNAKISNAVTNGRPREAIVN 1955 I DVFV CSAFDMY KTR+ DA K+FD + RN+ TWNA ISN+VT+GRPREAI Sbjct: 137 CGRILDVFVGCSAFDMYCKTRLRDDARKLFDEIPERNLETWNAFISNSVTDGRPREAIEA 196 Query: 1954 FIELLQSGEEAPNSITFCAFFNACADCLNLKLGQQLHGFVIRFGCENEVALLNGLIDFYG 
1775 FIE + + PNSITFCAF NAC+D L+L LG QLHG V+R G + +V++ NGLIDFYG Sbjct: 197 FIEFRRI-DGHPNSITFCAFLNACSDWLHLNLGMQLHGLVLRSGFDTDVSVCNGLIDFYG 255 Query: 1774 KCKEVEFAEMIFNGMRERNIVSWCSMLAVYEQNNMGEKACQLFVETRKKGMGPTDFLVSS 1595 KCK++ +E+IF M +N VSWCS++A Y QN+ EKA L++ +RK + +DF++SS Sbjct: 256 KCKQIRSSEIIFTEMGTKNAVSWCSLVAAYVQNHEDEKASVLYLRSRKDIVETSDFMISS 315 Query: 1594 VISACAGLARLELGRAVHGIAVKGCIEGNLFVGSALVDMYGKCGCIEDCELAFYEMPERN 1415 V+SACAG+A LELGR++H AVK C+E +FVGSALVDMYGKCGCIED E AF EMPE+N Sbjct: 316 VLSACAGMAGLELGRSIHAHAVKACVERTIFVGSALVDMYGKCGCIEDSEQAFDEMPEKN 375 Query: 1414 LICWNALIGGYAHQGHAERALELSEEMTRESSRMMPNYVTFVSVLTACSRGGMVNQGVDF 1235 L+ N+LIGGYAHQG + AL L EEM PNY+TFVS+L+ACSR G V G+ Sbjct: 376 LVTRNSLIGGYAHQGQVDMALALFEEMAPRGCGPTPNYMTFVSLLSACSRAGAVENGMKI 435 Query: 1234 FESMRGKYGILPGAEHYACVVDMLGRAGLVDRAYEFVKNMPMQPTVSIWGALLGACKVYG 1055 F+SMR YGI PGAEHY+C+VDMLGRAG+V+RAYEF+K MP+QPT+S+WGAL AC+++G Sbjct: 436 FDSMRSTYGIEPGAEHYSCIVDMLGRAGMVERAYEFIKKMPIQPTISVWGALQNACRMHG 495 Query: 1054 RPDIGKIAAEKLFELDPNDSGNHVILSNMFAAAGRWEDANLVRKEMKDVGMKKGTGYSWI 875 +P +G +AAE LF+LDP DSGNHV+LSN FAAAGRW +AN VR+E+K VG+KKG GYSWI Sbjct: 496 KPQLGLLAAENLFKLDPKDSGNHVLLSNTFAAAGRWAEANTVREELKGVGIKKGAGYSWI 555 Query: 874 TVKNTVHVFQAKDESHERNSEIQAMLGKLKKDMKAAGYIPDTNVALYDLQEEEKESEVWY 695 TVKN VH FQAKD SH N EIQ L KL+ +M+AAGY PD ++LYDL+EEEK +EV + Sbjct: 556 TVKNQVHAFQAKDRSHILNKEIQTTLAKLRNEMEAAGYKPDLKLSLYDLEEEEKAAEVSH 615 Query: 694 HSEKIALAFGLIALPPGVPIRINKNLRVCVDCHSAIKFISGIVGREIIVRDNRRFHHFKG 515 HSEK+ALAFGL++LP VPIRI KNLR+C DCHS KF+SG V REIIVRDN RFH FK Sbjct: 616 HSEKLALAFGLLSLPLSVPIRITKNLRICGDCHSFFKFVSGSVKREIIVRDNNRFHRFKD 675 Query: 514 NECSCRDYW 488 CSC+DYW Sbjct: 676 GICSCKDYW 684 >ref|XP_002870277.1| hypothetical protein ARALYDRAFT_493409 [Arabidopsis lyrata subsp. lyrata] gi|297316113|gb|EFH46536.1| hypothetical protein ARALYDRAFT_493409 [Arabidopsis lyrata subsp. 
lyrata] Length = 684 Score = 862 bits (2227), Expect = 0.0 Identities = 417/669 (62%), Positives = 519/669 (77%) Frame = -1 Query: 2494 SSLRLGKAVHARIIRTLGQYDLPPFLSNHLINMYSKLDRLNSAQLVLSLTRPSSLSVVTW 2315 SS+RLG+ VHARI++TL PPFL+N+LINMYSKLD SA+LVL LT + +VV+W Sbjct: 20 SSMRLGRVVHARIVKTLDSPP-PPFLANYLINMYSKLDHPESARLVLRLT--PARNVVSW 76 Query: 2314 TALISGSVQNGHFTSAFSYFSGMRRDNIFPNDFTFPCLFKASASLNSPFLGQQLHGLAVK 2135 T+L+SG QNGHF++A F MRR+ + PNDFTFPC+FKA ASL P G+Q+H LAVK Sbjct: 77 TSLVSGLAQNGHFSTALFEFFEMRREGVAPNDFTFPCVFKAVASLRLPVTGKQIHALAVK 136 Query: 2134 LKLIYDVFVACSAFDMYSKTRMNQDANKVFDRMSRRNIATWNAKISNAVTNGRPREAIVN 1955 I DVFV CSAFDMY KTR+ DA K+FD + RN+ TWNA ISN+VT+GRP+EAI Sbjct: 137 CGRILDVFVGCSAFDMYCKTRLRDDARKLFDEIPERNLETWNAYISNSVTDGRPKEAIEA 196 Query: 1954 FIELLQSGEEAPNSITFCAFFNACADCLNLKLGQQLHGFVIRFGCENEVALLNGLIDFYG 1775 FIE + G + PNSITFC F NAC+D L L LG Q+HG V R G + +V++ NGLIDFYG Sbjct: 197 FIEFRRIGGQ-PNSITFCGFLNACSDGLLLDLGMQMHGLVFRSGFDTDVSVYNGLIDFYG 255 Query: 1774 KCKEVEFAEMIFNGMRERNIVSWCSMLAVYEQNNMGEKACQLFVETRKKGMGPTDFLVSS 1595 KCK++ +E+IF M +N VSWCS++A Y QN+ EKA L++ +RK+ + +DF++SS Sbjct: 256 KCKQIRSSEIIFAEMGMKNAVSWCSLVAAYVQNHEDEKASVLYLRSRKEIVETSDFMISS 315 Query: 1594 VISACAGLARLELGRAVHGIAVKGCIEGNLFVGSALVDMYGKCGCIEDCELAFYEMPERN 1415 V+SACAG+A LELGR++H AVK C+E N+FVGSALVDMYGKCGCIED E AF EMPE+N Sbjct: 316 VLSACAGMAGLELGRSIHAHAVKACVERNIFVGSALVDMYGKCGCIEDSEQAFDEMPEKN 375 Query: 1414 LICWNALIGGYAHQGHAERALELSEEMTRESSRMMPNYVTFVSVLTACSRGGMVNQGVDF 1235 L+ N+LIGGYAHQG + AL L E+M PNY+TFVS+L+ACSR G V G+ Sbjct: 376 LVTLNSLIGGYAHQGQVDMALALFEDMAPRGCGPAPNYMTFVSLLSACSRAGAVENGMKI 435 Query: 1234 FESMRGKYGILPGAEHYACVVDMLGRAGLVDRAYEFVKNMPMQPTVSIWGALLGACKVYG 1055 F+SM+ YGI PGAEHY+C+VDMLGRAG+V++A+EF+K MP++PT+S+WGAL AC+++G Sbjct: 436 FDSMKSTYGIEPGAEHYSCIVDMLGRAGMVEQAFEFIKKMPIKPTISVWGALQNACRMHG 495 Query: 1054 RPDIGKIAAEKLFELDPNDSGNHVILSNMFAAAGRWEDANLVRKEMKDVGMKKGTGYSWI 875 +P +G +AAE LF+LDP DSGNHV+LSN FAAAGRW +AN VR+EMK VG+KKG GYSWI Sbjct: 496 
KPHLGILAAENLFKLDPKDSGNHVLLSNTFAAAGRWAEANTVREEMKGVGIKKGAGYSWI 555 Query: 874 TVKNTVHVFQAKDESHERNSEIQAMLGKLKKDMKAAGYIPDTNVALYDLQEEEKESEVWY 695 TVKN VH FQAKD SH+ N EIQ ML KL+ M+AAGY PD ++LYDL+EEEK +EV + Sbjct: 556 TVKNQVHAFQAKDRSHKMNKEIQTMLTKLRNKMEAAGYKPDLKLSLYDLEEEEKAAEVSH 615 Query: 694 HSEKIALAFGLIALPPGVPIRINKNLRVCVDCHSAIKFISGIVGREIIVRDNRRFHHFKG 515 HSEK+ALAFGL+ALP VPIRI KNLR+C DCHS KF+SG V REIIVRDN RFH FK Sbjct: 616 HSEKLALAFGLVALPLSVPIRITKNLRICGDCHSFFKFVSGSVKREIIVRDNNRFHRFKD 675 Query: 514 NECSCRDYW 488 CSC+DYW Sbjct: 676 GICSCKDYW 684 >ref|XP_006414633.1| hypothetical protein EUTSA_v10024593mg [Eutrema salsugineum] gi|557115803|gb|ESQ56086.1| hypothetical protein EUTSA_v10024593mg [Eutrema salsugineum] Length = 680 Score = 858 bits (2216), Expect = 0.0 Identities = 420/671 (62%), Positives = 521/671 (77%), Gaps = 2/671 (0%) Frame = -1 Query: 2494 SSLRLGKAVHARIIRTLGQYDLPP--FLSNHLINMYSKLDRLNSAQLVLSLTRPSSLSVV 2321 SSLR G+ VHARI++TL D PP FL+N+L+NMYSKLD+ SA+LVL L S +VV Sbjct: 20 SSLRSGRVVHARIVKTL---DSPPPLFLTNYLVNMYSKLDQPESARLVLHLE--PSRNVV 74 Query: 2320 TWTALISGSVQNGHFTSAFSYFSGMRRDNIFPNDFTFPCLFKASASLNSPFLGQQLHGLA 2141 +WT+L+SG VQNGH+ SA F MRR+ +FPNDFTFPC+FK++A L P G+Q+H LA Sbjct: 75 SWTSLVSGLVQNGHYYSALFEFLEMRREGVFPNDFTFPCVFKSAALLRLPVTGKQIHALA 134 Query: 2140 VKLKLIYDVFVACSAFDMYSKTRMNQDANKVFDRMSRRNIATWNAKISNAVTNGRPREAI 1961 VK + DVFV CSAFDMY KTR+ DA KVFD M +RN+ TWNA +SN+V +GRP+EAI Sbjct: 135 VKCGRVMDVFVGCSAFDMYCKTRLRDDARKVFDEMPKRNLETWNAYMSNSVIDGRPKEAI 194 Query: 1960 VNFIELLQSGEEAPNSITFCAFFNACADCLNLKLGQQLHGFVIRFGCENEVALLNGLIDF 1781 FIE + G PNSITFCAF NAC+D L L LG+QLHG V R G + +V++ NGLIDF Sbjct: 195 EAFIEFRKIGGH-PNSITFCAFLNACSDKLLLSLGEQLHGLVFRSGFDRDVSVCNGLIDF 253 Query: 1780 YGKCKEVEFAEMIFNGMRERNIVSWCSMLAVYEQNNMGEKACQLFVETRKKGMGPTDFLV 1601 YGKCK+V +E +F M ERN+VSWCS++A + QN+ EKA L++ +RK + ++F++ Sbjct: 254 YGKCKKVRCSEFVFGEMGERNVVSWCSLVAAFVQNHEDEKASLLYLRSRKDIVETSEFMI 313 Query: 1600 SSVISACAGLARLELGRAVHGIAVKGCIEGNLFVGSALVDMYGKCGCIEDCELAFYEMPE 1421 SS +SACAG+A LELGR+VH AVK 
CIE +LFVGSALVDMYGKCGCIED E AF EMPE Sbjct: 314 SSTLSACAGMAGLELGRSVHAHAVKACIERSLFVGSALVDMYGKCGCIEDSEQAFDEMPE 373 Query: 1420 RNLICWNALIGGYAHQGHAERALELSEEMTRESSRMMPNYVTFVSVLTACSRGGMVNQGV 1241 +NL+ N+LIGGYAHQG + AL L EEM + + PNY+TFVS+L+ACSR G V G+ Sbjct: 374 KNLVTLNSLIGGYAHQGQVDMALALFEEM----APLTPNYMTFVSLLSACSRAGNVENGM 429 Query: 1240 DFFESMRGKYGILPGAEHYACVVDMLGRAGLVDRAYEFVKNMPMQPTVSIWGALLGACKV 1061 F+SM+ YG+ PGAEHY+CVVDMLGRAG+V+RAYEF+K MP++PT+S+WGAL AC++ Sbjct: 430 KIFDSMKSSYGVEPGAEHYSCVVDMLGRAGMVERAYEFIKKMPIKPTISVWGALQNACRM 489 Query: 1060 YGRPDIGKIAAEKLFELDPNDSGNHVILSNMFAAAGRWEDANLVRKEMKDVGMKKGTGYS 881 + +PD+G IAAE LF+LDP DSGNHV+LSN AAGRW +AN VR+EMK VG+KKGTGYS Sbjct: 490 HSKPDLGIIAAENLFKLDPKDSGNHVLLSNTLVAAGRWVEANTVREEMKGVGIKKGTGYS 549 Query: 880 WITVKNTVHVFQAKDESHERNSEIQAMLGKLKKDMKAAGYIPDTNVALYDLQEEEKESEV 701 WITVKN VH FQAKD SH+ + EIQ L KLK +M+AAGY PD ++LYD++EEEK +EV Sbjct: 550 WITVKNQVHTFQAKDRSHKMSKEIQRTLSKLKNEMEAAGYKPDLKLSLYDVEEEEKAAEV 609 Query: 700 WYHSEKIALAFGLIALPPGVPIRINKNLRVCVDCHSAIKFISGIVGREIIVRDNRRFHHF 521 +HSEK+ALAFGL+ALP GVPIRI KNLR+C DCHS KF+SG V REIIVRDN RFH F Sbjct: 610 AHHSEKLALAFGLVALPLGVPIRITKNLRICEDCHSFFKFVSGSVKREIIVRDNNRFHRF 669 Query: 520 KGNECSCRDYW 488 CSC+DYW Sbjct: 670 LDGFCSCKDYW 680 >ref|XP_006283247.1| hypothetical protein CARUB_v10004282mg [Capsella rubella] gi|482551952|gb|EOA16145.1| hypothetical protein CARUB_v10004282mg [Capsella rubella] Length = 684 Score = 853 bits (2203), Expect = 0.0 Identities = 413/669 (61%), Positives = 515/669 (76%) Frame = -1 Query: 2494 SSLRLGKAVHARIIRTLGQYDLPPFLSNHLINMYSKLDRLNSAQLVLSLTRPSSLSVVTW 2315 SS+RLG+ VH RI++TL PPFL+N+LI++YSKLD SA+LVL T + +VV+W Sbjct: 20 SSMRLGRVVHGRIVKTLDSPP-PPFLANYLISLYSKLDHPESARLVLRFT--PARNVVSW 76 Query: 2314 TALISGSVQNGHFTSAFSYFSGMRRDNIFPNDFTFPCLFKASASLNSPFLGQQLHGLAVK 2135 T+L+SG V NGHF+ A F MRRD + PNDFTFPC FKA ASL P G+Q+HGLAVK Sbjct: 77 TSLVSGLVNNGHFSIALFEFVEMRRDGVSPNDFTFPCAFKAVASLRLPVTGKQIHGLAVK 136 Query: 2134 LKLIYDVFVACSAFDMYSKTRMNQDANKVFDRMSRRNIATWNAKISNAVTNGRPREAIVN 1955 
I DVFV CSAFDMY KTR+ DA ++FD + RN TWNA ISN+VT+GRPREAI Sbjct: 137 CGRILDVFVGCSAFDMYCKTRLRDDARQLFDEIPERNCETWNAFISNSVTDGRPREAIEA 196 Query: 1954 FIELLQSGEEAPNSITFCAFFNACADCLNLKLGQQLHGFVIRFGCENEVALLNGLIDFYG 1775 FIE + G + PN+ITFC F NAC+D L+L LG+QLHG V R G + +V++ NGLIDFYG Sbjct: 197 FIEFRRIGGQ-PNTITFCGFLNACSDGLHLNLGKQLHGLVFRCGFDTDVSVYNGLIDFYG 255 Query: 1774 KCKEVEFAEMIFNGMRERNIVSWCSMLAVYEQNNMGEKACQLFVETRKKGMGPTDFLVSS 1595 KCK++ +E++F M +N VSWCS++A Y QN+ EKA L++ +RK+ + +DF++SS Sbjct: 256 KCKQIICSEIVFAEMGTKNAVSWCSLVAAYVQNHEDEKASLLYLRSRKEIVETSDFMISS 315 Query: 1594 VISACAGLARLELGRAVHGIAVKGCIEGNLFVGSALVDMYGKCGCIEDCELAFYEMPERN 1415 +SACAG+A LELGR++H AVK C+E +FVGSALVDMYGKCGCIED E AF EMPE+N Sbjct: 316 ALSACAGMAGLELGRSIHAHAVKACVEMTIFVGSALVDMYGKCGCIEDSEQAFDEMPEKN 375 Query: 1414 LICWNALIGGYAHQGHAERALELSEEMTRESSRMMPNYVTFVSVLTACSRGGMVNQGVDF 1235 L+ N+LIGGYAHQG + AL L EEM PNY+TFVS+L+ACSR G V G+ Sbjct: 376 LVTLNSLIGGYAHQGEVDMALALFEEMAPRGCGPTPNYMTFVSLLSACSRAGAVENGMKI 435 Query: 1234 FESMRGKYGILPGAEHYACVVDMLGRAGLVDRAYEFVKNMPMQPTVSIWGALLGACKVYG 1055 F+SM+ YGI PGAEHY+C+VDMLGRAG+V++AY+F+K +P+QPT+S+WGAL AC+++G Sbjct: 436 FDSMKSIYGIEPGAEHYSCIVDMLGRAGMVEQAYKFIKKLPIQPTISVWGALQNACRMHG 495 Query: 1054 RPDIGKIAAEKLFELDPNDSGNHVILSNMFAAAGRWEDANLVRKEMKDVGMKKGTGYSWI 875 +P +G +AAE LF+LDP DSGNHV+LSN FAAAGRW +AN VR+EMK VG+KKG GYSWI Sbjct: 496 KPHLGIVAAENLFKLDPKDSGNHVLLSNTFAAAGRWAEANTVREEMKGVGIKKGAGYSWI 555 Query: 874 TVKNTVHVFQAKDESHERNSEIQAMLGKLKKDMKAAGYIPDTNVALYDLQEEEKESEVWY 695 TVKN VH FQAKD SH N +IQ ML KL+ +M+A+GY PD ++LYDL+EEEK +EV Y Sbjct: 556 TVKNQVHAFQAKDRSHIMNKDIQTMLTKLRNEMEASGYKPDLKLSLYDLEEEEKAAEVAY 615 Query: 694 HSEKIALAFGLIALPPGVPIRINKNLRVCVDCHSAIKFISGIVGREIIVRDNRRFHHFKG 515 HSEK+ALAFGL+ALP GVPIRI KNLR+C DCHS KF+S V REIIVRDN RFH FK Sbjct: 616 HSEKLALAFGLVALPLGVPIRITKNLRICGDCHSFFKFVSRSVKREIIVRDNNRFHRFKD 675 Query: 514 NECSCRDYW 488 CSCRDYW Sbjct: 676 GICSCRDYW 684