BLASTX nr result
ID: Catharanthus22_contig00018365
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00018365
         (2003 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score     E
Sequences producing significant alignments:                          (bits)  Value

gb|EMJ28505.1| hypothetical protein PRUPE_ppa016683mg [Prunus pe...   603   e-170
ref|XP_004149831.1| PREDICTED: pentatricopeptide repeat-containi...   600   e-169
gb|EXB60117.1| hypothetical protein L484_013382 [Morus notabilis]     598   e-168
ref|XP_006351204.1| PREDICTED: pentatricopeptide repeat-containi...   598   e-168
ref|XP_004244895.1| PREDICTED: pentatricopeptide repeat-containi...   595   e-167
ref|XP_004292714.1| PREDICTED: pentatricopeptide repeat-containi...   595   e-167
ref|XP_006472819.1| PREDICTED: pentatricopeptide repeat-containi...   584   e-164
ref|XP_006434253.1| hypothetical protein CICLE_v10003743mg [Citr...   583   e-164
gb|EOY16374.1| Pentatricopeptide repeat (PPR) superfamily protei...   583   e-164
gb|EOY16373.1| Pentatricopeptide repeat superfamily protein isof...   578   e-162
ref|XP_002279168.1| PREDICTED: pentatricopeptide repeat-containi...   570   e-159
ref|XP_002520332.1| pentatricopeptide repeat-containing protein,...   558   e-156
ref|XP_006386364.1| hypothetical protein POPTR_0002s08080g [Popu...   548   e-153
ref|XP_002887681.1| binding protein [Arabidopsis lyrata subsp. l...   548   e-153
ref|NP_177865.2| pentatricopeptide repeat-containing protein [Ar...   547   e-153
ref|XP_006390093.1| hypothetical protein EUTSA_v10019853mg [Eutr...   544   e-152
ref|XP_006302269.1| hypothetical protein CARUB_v10020312mg [Caps...   543   e-152
ref|XP_006857793.1| hypothetical protein AMTR_s00061p00214600 [A...
524 e-146 gb|ESW33688.1| hypothetical protein PHAVU_001G090600g [Phaseolus... 503 e-140 gb|EPS59883.1| binding protein, partial [Genlisea aurea] 502 e-139 >gb|EMJ28505.1| hypothetical protein PRUPE_ppa016683mg [Prunus persica] Length = 449 Score = 603 bits (1556), Expect = e-170 Identities = 291/430 (67%), Positives = 350/430 (81%), Gaps = 3/430 (0%) Frame = -2 Query: 1774 IVSQVFAAIVKNQPFECQVLASASTQHPSSTASIWTTETVTEVLRCVPRFLFRSPRSIGC 1595 +V+QV A++KNQPF ++ ASA+T P W +E+V++VL +PRF F+SP SIG Sbjct: 15 VVNQVLTAMLKNQPFNSELAASATTSQP------WISESVSQVLISIPRFFFQSPSSIGR 68 Query: 1594 QKGFRYRGPLKQRKLKIEAFKAQNGILVLGPAAHRDQEKVKLGLSKALEFYYWVETRFGF 1415 Q GFR+R LKQR L+ E+++ N +LVLGPAAHRD KV+LGL +ALEF+YWVET FGF Sbjct: 69 QHGFRHRAQLKQRNLRQESYRFHNNVLVLGPAAHRDLHKVQLGLDRALEFFYWVETHFGF 128 Query: 1414 VHNERTCREMSLVLAKGNGLKFLWGFLKEMSKRG---IVNTATVTCVVKVLGEEGLVDEA 1244 VHNE+TCR+M++VLA+GN LK LW FLKE+SKRG +V T T+TC++KVLGEEGLV +A Sbjct: 129 VHNEQTCRDMAVVLARGNKLKALWDFLKEISKRGSGGLVTTQTITCLIKVLGEEGLVTDA 188 Query: 1243 LAAFYRMKQFHCKPDVFAYNTIINALCRVGNFKKAKLLLEQMELPGFRIPPDTFTYTILI 1064 LAAFYRMKQFHCKPDV+AYNTII ALCRVGNF KA+ LLEQMELPGFR PPD FTYTILI Sbjct: 189 LAAFYRMKQFHCKPDVYAYNTIIYALCRVGNFNKARFLLEQMELPGFRCPPDVFTYTILI 248 Query: 1063 SSYCRYSLETGCRKATRRRMWEANHLFRIMLFKGFVPDIVTYNCLINGCCKTYRIGRALE 884 SSYCRY LETGCRKATRRRMWEANH+FR MLF+GFVPD+VTYN LINGCCKTYRI RALE Sbjct: 249 SSYCRYGLETGCRKATRRRMWEANHMFRNMLFRGFVPDVVTYNSLINGCCKTYRIERALE 308 Query: 883 LLDDMLRRGCLPNRVTYNSFIRYYSSVNEIDKAIEMLRRMEELNHGTPSTSSYTPIIHAL 704 L DDM R GC PNRVTY+SFIRYY++VNEIDKA++MLR+M+ + HG P++SSYTPIIHA Sbjct: 309 LFDDMNRMGCTPNRVTYDSFIRYYAAVNEIDKAVDMLRKMQNMKHGMPTSSSYTPIIHAF 368 Query: 703 CEAGRALEAQDFIDELVRRGSVPREYTYKLVRXXXXXXXXXXXLNRNLFEQIEEGIEKRT 524 CEAGR +EA+DF+ EL+ GS+PREYTYKLV L+ +L +I+ GIE R Sbjct: 369 CEAGRVIEARDFLAELIDGGSIPREYTYKLVCDALNSAGELNLLDNDLHRRIKYGIESRY 428 Query: 523 AQLMKVKPIV 494 Q+MKVKPI+ Sbjct: 429 RQIMKVKPIM 438 >ref|XP_004149831.1| PREDICTED: pentatricopeptide repeat-containing protein At1g77405-like [Cucumis 
sativus] gi|449518241|ref|XP_004166151.1| PREDICTED: pentatricopeptide repeat-containing protein At1g77405-like [Cucumis sativus] Length = 445 Score = 600 bits (1548), Expect = e-169 Identities = 290/431 (67%), Positives = 355/431 (82%), Gaps = 3/431 (0%) Frame = -2 Query: 1774 IVSQVFAAIVKNQPFECQVLASASTQHPSSTASIWTTETVTEVLRCVPRFLFRSPRSIGC 1595 +V Q+ A++KN+PF+ V ++AST S+T +W++++V++VLR VPRF F+S RSIG Sbjct: 14 LVDQILVAMLKNRPFDTHVHSAAST---STTHQLWSSDSVSDVLRSVPRFFFQSARSIGT 70 Query: 1594 QKGFRYRGPLKQRKLKIEAFKAQNGILVLGPAAHRDQEKVKLGLSKALEFYYWVETRFGF 1415 QKGFR+R PLKQRKLK EA+K +N +LVLGP AHRD K KLGL+KALEF+YWVET FGF Sbjct: 71 QKGFRHRTPLKQRKLKEEAYKFRNNVLVLGPGAHRDPFKAKLGLNKALEFFYWVETHFGF 130 Query: 1414 VHNERTCREMSLVLAKGNGLKFLWGFLKEMSKR---GIVNTATVTCVVKVLGEEGLVDEA 1244 H+E TCREM+ VLA+GN L LW FLKEMS+R G+V TAT+TC++KVLGEEGLV+EA Sbjct: 131 QHDEITCREMACVLARGNTLMGLWDFLKEMSRRENGGLVTTATITCLIKVLGEEGLVNEA 190 Query: 1243 LAAFYRMKQFHCKPDVFAYNTIINALCRVGNFKKAKLLLEQMELPGFRIPPDTFTYTILI 1064 L AFYRMKQFHCKPDV+AYNT+IN LCR+GNFKKA+ LLEQMELPGFR PPD FTYTILI Sbjct: 191 LTAFYRMKQFHCKPDVYAYNTVINVLCRIGNFKKARFLLEQMELPGFRCPPDIFTYTILI 250 Query: 1063 SSYCRYSLETGCRKATRRRMWEANHLFRIMLFKGFVPDIVTYNCLINGCCKTYRIGRALE 884 SSYC+YSL+TGCRKA RRR+WEANHLFRIMLFKGF PD+VTYN LI+GCCKTYRI RALE Sbjct: 251 SSYCKYSLQTGCRKAIRRRLWEANHLFRIMLFKGFSPDVVTYNSLIDGCCKTYRIQRALE 310 Query: 883 LLDDMLRRGCLPNRVTYNSFIRYYSSVNEIDKAIEMLRRMEELNHGTPSTSSYTPIIHAL 704 L +DM +RGC PNR+TYNSFIRYYS+VNEID+AI+MLR M+++NHG ++SSYTPIIHAL Sbjct: 311 LFEDMSKRGCTPNRLTYNSFIRYYSAVNEIDQAIKMLRMMQKMNHGIATSSSYTPIIHAL 370 Query: 703 CEAGRALEAQDFIDELVRRGSVPREYTYKLVRXXXXXXXXXXXLNRNLFEQIEEGIEKRT 524 CE G+ +EA+DF+ EL+ GSVPREYTY+LV L+ N+ E+I GIE R Sbjct: 371 CEGGKVIEARDFLLELLEEGSVPREYTYQLVCNLLNSAGKASLLDENVHERIRHGIENRY 430 Query: 523 AQLMKVKPIVN 491 ++ KVK I++ Sbjct: 431 REVKKVKLIMS 441 >gb|EXB60117.1| hypothetical protein L484_013382 [Morus notabilis] Length = 444 Score = 598 bits (1543), Expect = e-168 Identities = 295/437 (67%), Positives = 355/437 (81%), Gaps = 4/437 (0%) 
Frame = -2 Query: 1801 NRCLFNK-SYIVSQVFAAIVKNQPFECQVLASASTQHPSSTASIWTTETVTEVLRCVPRF 1625 ++ L++K + IV+Q+ A+++N+PF+ H +S++ W+ E V+EVLR VPRF Sbjct: 4 SKSLYSKYTRIVNQILTAMLQNRPFD---------SHLTSSSLRWSAEAVSEVLRSVPRF 54 Query: 1624 LFRSPRSIGCQKGFRYRGPLKQRKLKIEAFKAQNGILVLGPAAHRDQEKVKLGLSKALEF 1445 F+SPRSIG QKGFR+R PLKQR LK E+ K N +LVLGPAAHRD EKV+LGL KA++F Sbjct: 55 FFQSPRSIGRQKGFRHRSPLKQRNLKQESLKFSNNVLVLGPAAHRDPEKVQLGLDKAMDF 114 Query: 1444 YYWVETRFGFVHNERTCREMSLVLAKGNGLKFLWGFLKEMSKRG---IVNTATVTCVVKV 1274 ++WVE FGF HNE TCREM+LVLA+GN L LW FLKEMS RG +V +++TC++KV Sbjct: 115 FFWVEDSFGFAHNEATCREMALVLARGNSLGTLWNFLKEMSIRGSGQLVTISSITCLIKV 174 Query: 1273 LGEEGLVDEALAAFYRMKQFHCKPDVFAYNTIINALCRVGNFKKAKLLLEQMELPGFRIP 1094 LGEEGLV+EAL+ FYRMKQFHCKPDV+AYNTII ALCRVGNFKKA+ LLEQMELPGF P Sbjct: 175 LGEEGLVNEALSCFYRMKQFHCKPDVYAYNTIIYALCRVGNFKKARFLLEQMELPGFWCP 234 Query: 1093 PDTFTYTILISSYCRYSLETGCRKATRRRMWEANHLFRIMLFKGFVPDIVTYNCLINGCC 914 PDTFTYTILISSYCRYSLETGC+KA RRR+WEANHLFRIMLFKGFVPD+VT+N LINGCC Sbjct: 235 PDTFTYTILISSYCRYSLETGCKKAIRRRVWEANHLFRIMLFKGFVPDVVTFNSLINGCC 294 Query: 913 KTYRIGRALELLDDMLRRGCLPNRVTYNSFIRYYSSVNEIDKAIEMLRRMEELNHGTPST 734 KTYRIGRALEL +DM +RGC PN VTYNSFIRYYS+VNEID A++MLRRM+++NHG PS+ Sbjct: 295 KTYRIGRALELFEDMNKRGCTPNGVTYNSFIRYYSAVNEIDNAVDMLRRMQKMNHGIPSS 354 Query: 733 SSYTPIIHALCEAGRALEAQDFIDELVRRGSVPREYTYKLVRXXXXXXXXXXXLNRNLFE 554 SSYTPIIHALCEAG+ALEA+DF+ ELV GSVPREYTYKLV L+ + + Sbjct: 355 SSYTPIIHALCEAGKALEARDFLVELVDAGSVPREYTYKLVCDALMSAGQAYLLDDGMHK 414 Query: 553 QIEEGIEKRTAQLMKVK 503 +I++GIE R QL K K Sbjct: 415 RIKDGIENRFKQLGKFK 431 >ref|XP_006351204.1| PREDICTED: pentatricopeptide repeat-containing protein At1g77405-like [Solanum tuberosum] Length = 440 Score = 598 bits (1543), Expect = e-168 Identities = 296/445 (66%), Positives = 348/445 (78%), Gaps = 3/445 (0%) Frame = -2 Query: 1819 MLSTPQNRCLFNKSYIVSQVFAAIVKNQPFECQVLASASTQHPSSTASIWTTETVTEVLR 1640 MLST F ++ ++ QV AAI++N+ FE +L IWT ETV++VLR Sbjct: 1 MLSTRSLSSPFIQNLVMDQVMAAIIRNRSFEASILQP-----------IWTVETVSQVLR 49 
Query: 1639 CVPRFLFRSPRSIGCQKGFRYRGPLKQRKLKIEAFKAQNGILVLGPAAHRDQEKVKLGLS 1460 +PRFLF+SPRSIG Q GFRYR PLKQR LK E AQ G+L+LGPAAHRD +KV+LGL Sbjct: 50 SIPRFLFQSPRSIGRQNGFRYRTPLKQRNLKQEFHNAQKGVLILGPAAHRDPQKVQLGLE 109 Query: 1459 KALEFYYWVETRFGFVHNERTCREMSLVLAKGN-GLKFLWGFLKEMSKRGIVNTATVTCV 1283 KALEF++WVET GF HNE TCREMSLVLAKG+ K LW FLK MS+RG++ T TVTC+ Sbjct: 110 KALEFFHWVETHCGFTHNELTCREMSLVLAKGSCNSKLLWEFLKRMSRRGLLTTPTVTCL 169 Query: 1282 VKVLGEEGLVDEALAAFYRMKQFHCKPDVFAYNTIINALCRVGNFKKAKLLLEQMELPGF 1103 +K LGEEGLV+EAL FYRMKQFHCKPDV+AYNT+I ALCRVGNFKKAK L+EQMELPGF Sbjct: 170 IKCLGEEGLVNEALTTFYRMKQFHCKPDVYAYNTLIFALCRVGNFKKAKFLMEQMELPGF 229 Query: 1102 RIPPDTFTYTILISSYCRYSLETGCRKATRRRMWEANHLFRIMLFKGFVPDIVTYNCLIN 923 R PPD FTYTILISSYCRY +ETGCRKA RRR+WEANHLFR+MLFKG VPD+VTYN LIN Sbjct: 230 RCPPDVFTYTILISSYCRYGMETGCRKAIRRRIWEANHLFRVMLFKGLVPDVVTYNSLIN 289 Query: 922 GCCKTYRIGRALELLDDMLRRGCLPNRVTYNSFIRYYSSVNEIDKAIEMLRRMEELNHGT 743 GCCKT RI RALELLDDM++RG +PNR+TYNSFIRYYS NEIDKAIEMLRRM+ +NHG Sbjct: 290 GCCKTNRIERALELLDDMVKRGVVPNRITYNSFIRYYSVTNEIDKAIEMLRRMQGMNHGV 349 Query: 742 --PSTSSYTPIIHALCEAGRALEAQDFIDELVRRGSVPREYTYKLVRXXXXXXXXXXXLN 569 P SSYTPIIHA+CE GR +EA+D + EL +GS+PREYTYKLVR L+ Sbjct: 350 VLPCNSSYTPIIHAMCETGRVVEARDLLVELAEQGSIPREYTYKLVRDALESSGKIDLLD 409 Query: 568 RNLFEQIEEGIEKRTAQLMKVKPIV 494 L ++E+GI+ R Q+MKVKP++ Sbjct: 410 EELCTRLEDGIKGRIRQVMKVKPLL 434 >ref|XP_004244895.1| PREDICTED: pentatricopeptide repeat-containing protein At1g77405-like [Solanum lycopersicum] Length = 426 Score = 595 bits (1535), Expect = e-167 Identities = 291/429 (67%), Positives = 342/429 (79%), Gaps = 3/429 (0%) Frame = -2 Query: 1771 VSQVFAAIVKNQPFECQVLASASTQHPSSTASIWTTETVTEVLRCVPRFLFRSPRSIGCQ 1592 + Q AAI++N+PFE + SIWT ETV++VLR +PRFLF+SPRSIG Q Sbjct: 1 MDQAMAAIIRNRPFESSI-----------PQSIWTVETVSQVLRSIPRFLFQSPRSIGRQ 49 Query: 1591 KGFRYRGPLKQRKLKIEAFKAQNGILVLGPAAHRDQEKVKLGLSKALEFYYWVETRFGFV 1412 GFR+R PLKQR LK E A+ G+L+LGPAAHRD +KV+LGL KALEF++WVETR GF Sbjct: 50 
NGFRHRTPLKQRNLKQELHNARKGVLILGPAAHRDVQKVQLGLEKALEFFHWVETRCGFT 109 Query: 1411 HNERTCREMSLVLAKGN-GLKFLWGFLKEMSKRGIVNTATVTCVVKVLGEEGLVDEALAA 1235 HNE TCREMSLVLAKG+ KFLW FL+ MS+RG++ T TVTC++K LGEEGLV+EAL Sbjct: 110 HNELTCREMSLVLAKGSCNSKFLWEFLRRMSRRGLLTTPTVTCLIKCLGEEGLVNEALTT 169 Query: 1234 FYRMKQFHCKPDVFAYNTIINALCRVGNFKKAKLLLEQMELPGFRIPPDTFTYTILISSY 1055 FYRMKQFHC PDV+AYNT+I ALCRVGNFKKAK L+EQMELPGFR PPD FTYTILISSY Sbjct: 170 FYRMKQFHCMPDVYAYNTLIFALCRVGNFKKAKFLMEQMELPGFRCPPDVFTYTILISSY 229 Query: 1054 CRYSLETGCRKATRRRMWEANHLFRIMLFKGFVPDIVTYNCLINGCCKTYRIGRALELLD 875 CRY ETGCRKA RRR+WEANHLFRIMLFKGFVPD+VTYNCLINGCCKT RI RALELLD Sbjct: 230 CRYGTETGCRKAIRRRIWEANHLFRIMLFKGFVPDVVTYNCLINGCCKTNRIERALELLD 289 Query: 874 DMLRRGCLPNRVTYNSFIRYYSSVNEIDKAIEMLRRMEELNHGT--PSTSSYTPIIHALC 701 DM++RG +PNR+TYNSFIRYYS NEIDKAIEMLRRM+ +NHG P SSYTPIIHA+C Sbjct: 290 DMVKRGVVPNRITYNSFIRYYSVTNEIDKAIEMLRRMQGMNHGVVLPCNSSYTPIIHAMC 349 Query: 700 EAGRALEAQDFIDELVRRGSVPREYTYKLVRXXXXXXXXXXXLNRNLFEQIEEGIEKRTA 521 E GR +EA+D + E+ +GS+PREYTYKLVR L+ L ++E+GI+ R Sbjct: 350 ETGRVVEARDLLVEVAEQGSIPREYTYKLVRDALESSGKIDLLDEELCTRLEDGIKGRIR 409 Query: 520 QLMKVKPIV 494 Q+MKVKP++ Sbjct: 410 QVMKVKPLL 418 >ref|XP_004292714.1| PREDICTED: pentatricopeptide repeat-containing protein At1g77405-like [Fragaria vesca subsp. 
vesca] Length = 444 Score = 595 bits (1533), Expect = e-167 Identities = 287/432 (66%), Positives = 349/432 (80%), Gaps = 1/432 (0%) Frame = -2 Query: 1786 NKSYIVSQVFAAIVKNQPFECQVLASASTQHPSSTASIWTTETVTEVLRCVPRFLFRSPR 1607 N + +V+QV A++KNQPF+ +PS + WTT++V++VL +P F F SPR Sbjct: 11 NHTPLVNQVLTAMLKNQPFD---------PNPSPSTQPWTTDSVSQVLTSIPTFFFHSPR 61 Query: 1606 SIGCQKGFRYRGPLKQRKLKIEAFKAQNGILVLGPAAHRDQEKVKLGLSKALEFYYWVET 1427 SIG Q GFR+R PLKQR L+ E +K N IL+LGPAAHRD KV LG+ KAL+FYYWVE Sbjct: 62 SIGRQPGFRHRAPLKQRNLRQETYKFHNDILLLGPAAHRDLNKVNLGVEKALDFYYWVEN 121 Query: 1426 RFGFVHNERTCREMSLVLAKGNGLKFLWGFLKEMSKRG-IVNTATVTCVVKVLGEEGLVD 1250 +FGF HNERTCREM++VLAK N LK LW FLK+M++RG +V T ++TC++K LGEEGLV+ Sbjct: 122 QFGFHHNERTCREMAIVLAKANRLKALWDFLKDMARRGPLVTTQSITCLMKCLGEEGLVN 181 Query: 1249 EALAAFYRMKQFHCKPDVFAYNTIINALCRVGNFKKAKLLLEQMELPGFRIPPDTFTYTI 1070 EALAAFYRMKQFHCKPDV+AYNTII ALCRVGNFKKA+ LLEQMELPGFR PPD FTYTI Sbjct: 182 EALAAFYRMKQFHCKPDVYAYNTIIYALCRVGNFKKARFLLEQMELPGFRCPPDVFTYTI 241 Query: 1069 LISSYCRYSLETGCRKATRRRMWEANHLFRIMLFKGFVPDIVTYNCLINGCCKTYRIGRA 890 LISSYCRY LETGCRKATRRR+WEANH+FR M+F+GFVPD+VTYN LINGCCK YRI RA Sbjct: 242 LISSYCRYGLETGCRKATRRRLWEANHMFRNMVFRGFVPDVVTYNALINGCCKNYRIERA 301 Query: 889 LELLDDMLRRGCLPNRVTYNSFIRYYSSVNEIDKAIEMLRRMEELNHGTPSTSSYTPIIH 710 LEL +DM++R C PNRVTY+SFIRYY++VNEIDKA++MLRRM+++NHG ++SSYTPIIH Sbjct: 302 LELFEDMMKRDCTPNRVTYDSFIRYYAAVNEIDKAVDMLRRMQDMNHGLATSSSYTPIIH 361 Query: 709 ALCEAGRALEAQDFIDELVRRGSVPREYTYKLVRXXXXXXXXXXXLNRNLFEQIEEGIEK 530 ALCEAGRA EA+DF+ ELV GS+PREYTY LV L+ L E+IE+G+++ Sbjct: 362 ALCEAGRATEARDFLVELVDGGSIPREYTYNLVCNALSSAGEVNVLDDGLRERIEDGMQR 421 Query: 529 RTAQLMKVKPIV 494 R +MKVKPI+ Sbjct: 422 RYKHMMKVKPIM 433 >ref|XP_006472819.1| PREDICTED: pentatricopeptide repeat-containing protein At1g77405-like isoform X1 [Citrus sinensis] gi|568837618|ref|XP_006472820.1| PREDICTED: pentatricopeptide repeat-containing protein At1g77405-like isoform X2 [Citrus sinensis] Length = 451 Score = 584 bits (1506), Expect = e-164 Identities = 
285/431 (66%), Positives = 347/431 (80%), Gaps = 4/431 (0%) Frame = -2 Query: 1774 IVSQVFAAIVKNQPFECQVLASAS-TQHPSSTASIWTTETVTEVLRCVPRFLFRSPRSIG 1598 +V QV ++KN PF+ ++ AS + TQ+P +T E+V +VL+ +PRF F+SPRSIG Sbjct: 14 LVQQVLPLMLKNVPFDAKLAASTTKTQNP------FTIESVADVLKSIPRFFFQSPRSIG 67 Query: 1597 CQKGFRYRGPLKQRKLKIEAFKAQNGILVLGPAAHRDQEKVKLGLSKALEFYYWVETRFG 1418 Q GFR+R PL+QR LK EA+ N +LVLGPAA+RD +KV LGL+KA EFY+WVE F Sbjct: 68 RQTGFRHRTPLRQRILKKEAYNIANNVLVLGPAAYRDPQKVTLGLNKATEFYHWVERFFD 127 Query: 1417 FVHNERTCREMSLVLAKGNGLKFLWGFLKEMSKRG---IVNTATVTCVVKVLGEEGLVDE 1247 F HNE TC+EM +V A+GN +K LW FLK+MS+RG +V T++VTC++KVLGEEGLV+E Sbjct: 128 FFHNEMTCKEMGIVFARGNNVKGLWDFLKDMSRRGNGELVTTSSVTCLIKVLGEEGLVNE 187 Query: 1246 ALAAFYRMKQFHCKPDVFAYNTIINALCRVGNFKKAKLLLEQMELPGFRIPPDTFTYTIL 1067 ALA FYRMKQF C+PDV+AYN +INALCRVGNF KA+ LLEQMELPGFR PPD +TYTIL Sbjct: 188 ALATFYRMKQFRCRPDVYAYNVVINALCRVGNFNKARFLLEQMELPGFRCPPDVYTYTIL 247 Query: 1066 ISSYCRYSLETGCRKATRRRMWEANHLFRIMLFKGFVPDIVTYNCLINGCCKTYRIGRAL 887 ISSYC+Y ++TGCRKA RRR+WEANHLFR+MLFKGFVPD+V YNCLI+GCCKTYRI RAL Sbjct: 248 ISSYCKYGMQTGCRKAIRRRIWEANHLFRLMLFKGFVPDVVAYNCLIDGCCKTYRIERAL 307 Query: 886 ELLDDMLRRGCLPNRVTYNSFIRYYSSVNEIDKAIEMLRRMEELNHGTPSTSSYTPIIHA 707 EL DDM ++GC+PNRVTYNSFIRYYS VNEIDKAIEM+R+M+ LNHG P++SSYTPIIHA Sbjct: 308 ELFDDMNKKGCIPNRVTYNSFIRYYSVVNEIDKAIEMMRKMQNLNHGVPTSSSYTPIIHA 367 Query: 706 LCEAGRALEAQDFIDELVRRGSVPREYTYKLVRXXXXXXXXXXXLNRNLFEQIEEGIEKR 527 LCEAGR LEA+DF+ ELV GSVPREYTYKLV L+ L ++I +GIE R Sbjct: 368 LCEAGRVLEARDFLAELVDGGSVPREYTYKLVCDALNAAEEPSLLDDGLRKRIRDGIEYR 427 Query: 526 TAQLMKVKPIV 494 Q+MKVKPI+ Sbjct: 428 FRQVMKVKPIM 438 >ref|XP_006434253.1| hypothetical protein CICLE_v10003743mg [Citrus clementina] gi|557536375|gb|ESR47493.1| hypothetical protein CICLE_v10003743mg [Citrus clementina] Length = 451 Score = 583 bits (1503), Expect = e-164 Identities = 289/446 (64%), Positives = 351/446 (78%), Gaps = 4/446 (0%) Frame = -2 Query: 1819 MLSTPQNRCLFNKSYIVSQVFAAIVKNQPFECQVLASAS-TQHPSSTASIWTTETVTEVL 1643 ++S P N N + 
+V QV I+KN PF+ ++ AS + TQ+P +T E+V +VL Sbjct: 2 IVSKPLNS---NHTCLVQQVLPLILKNVPFDAKLAASTTKTQNP------FTIESVADVL 52 Query: 1642 RCVPRFLFRSPRSIGCQKGFRYRGPLKQRKLKIEAFKAQNGILVLGPAAHRDQEKVKLGL 1463 + +PRF F+SPRSIG Q GFR+R PLKQR LK EA N +LVLGPAA+R+ +KV LG+ Sbjct: 53 KSIPRFFFQSPRSIGRQTGFRHRTPLKQRILKKEADNIANNVLVLGPAAYRNPQKVTLGI 112 Query: 1462 SKALEFYYWVETRFGFVHNERTCREMSLVLAKGNGLKFLWGFLKEMSKRG---IVNTATV 1292 +KA EFY+WVE F F HNE TC+EM +V A+GN +K LW FLKEMS+RG +V T+TV Sbjct: 113 NKATEFYHWVERFFHFFHNEVTCKEMGIVFARGNNVKGLWDFLKEMSRRGNGELVTTSTV 172 Query: 1291 TCVVKVLGEEGLVDEALAAFYRMKQFHCKPDVFAYNTIINALCRVGNFKKAKLLLEQMEL 1112 TC++KVLGEEGLV+EALA FYRMKQF C+PDV+AYN +INALCRVGNF KA+ LLEQMEL Sbjct: 173 TCLIKVLGEEGLVNEALATFYRMKQFRCRPDVYAYNVVINALCRVGNFNKARFLLEQMEL 232 Query: 1111 PGFRIPPDTFTYTILISSYCRYSLETGCRKATRRRMWEANHLFRIMLFKGFVPDIVTYNC 932 PGFR PPD +TYTILISSYC+Y ++TGCRKA RRR+WEANHLFR+MLFKGFVPD+V YNC Sbjct: 233 PGFRCPPDVYTYTILISSYCKYGMQTGCRKAIRRRIWEANHLFRLMLFKGFVPDVVAYNC 292 Query: 931 LINGCCKTYRIGRALELLDDMLRRGCLPNRVTYNSFIRYYSSVNEIDKAIEMLRRMEELN 752 LI+GCCKTYRI RALEL DDM ++GC+PNRVTYNSFIRYYS VNEIDKAIEM+R+M+ LN Sbjct: 293 LIDGCCKTYRIERALELFDDMNKKGCVPNRVTYNSFIRYYSVVNEIDKAIEMMRKMQNLN 352 Query: 751 HGTPSTSSYTPIIHALCEAGRALEAQDFIDELVRRGSVPREYTYKLVRXXXXXXXXXXXL 572 H P++SSYTPIIHALCEAGR LEA+DF+ ELV GSVPREYTYKLV Sbjct: 353 HSVPTSSSYTPIIHALCEAGRVLEARDFLAELVDGGSVPREYTYKLVCDALNAAEEPSLP 412 Query: 571 NRNLFEQIEEGIEKRTAQLMKVKPIV 494 + L ++I +GIE R Q+MKVKPI+ Sbjct: 413 DDGLRKRIRDGIENRFRQVMKVKPIM 438 >gb|EOY16374.1| Pentatricopeptide repeat (PPR) superfamily protein isoform 2 [Theobroma cacao] Length = 438 Score = 583 bits (1503), Expect = e-164 Identities = 283/431 (65%), Positives = 347/431 (80%), Gaps = 3/431 (0%) Frame = -2 Query: 1780 SYIVSQVFAAIVKNQPFECQVLASASTQHPSSTASIWTTETVTEVLRCVPRFLFRSPRSI 1601 S IV QV + +++N+PF+ Q LAS++T +P WTT+ V+++LR V +F F+SPRSI Sbjct: 12 SAIVHQVLSIMLQNRPFDSQ-LASSTTSNP------WTTDAVSDILRSVSKFFFQSPRSI 64 Query: 1600 GCQKGFRYRGPLKQRKLKIEAFKAQNGILVLGPAAHRDQEKVKLGLSKALEFYYWVETRF 
1421 G Q GFR+R PLKQR +K E FK +L+LGPAA+RD ++V LGL KA+EFY WVE F Sbjct: 65 GSQTGFRHRAPLKQRNIKQENFKNYQNVLILGPAAYRDPKRVALGLDKAMEFYIWVENFF 124 Query: 1420 GFVHNERTCREMSLVLAKGNGLKFLWGFLKEMSKR---GIVNTATVTCVVKVLGEEGLVD 1250 GF HNE+TC+EM+ VLAKGN LK LW FLK+MS+R G+V T+TVTC++KVLGEEGLV+ Sbjct: 125 GFAHNEKTCKEMAFVLAKGNDLKVLWHFLKDMSRRENSGLVTTSTVTCLIKVLGEEGLVN 184 Query: 1249 EALAAFYRMKQFHCKPDVFAYNTIINALCRVGNFKKAKLLLEQMELPGFRIPPDTFTYTI 1070 EALA FYRMKQF CKPDVFAYN II+ALCRVGNF KA+ LLEQMELPGF PPD +TYTI Sbjct: 185 EALACFYRMKQFRCKPDVFAYNMIIHALCRVGNFNKARFLLEQMELPGFICPPDVYTYTI 244 Query: 1069 LISSYCRYSLETGCRKATRRRMWEANHLFRIMLFKGFVPDIVTYNCLINGCCKTYRIGRA 890 LISSYC++S++TGCRKA RRR++EANHLFR+MLFKGFVPD+VTYNCLI+GCCKTYRI RA Sbjct: 245 LISSYCKFSMQTGCRKAIRRRLYEANHLFRLMLFKGFVPDVVTYNCLIDGCCKTYRIERA 304 Query: 889 LELLDDMLRRGCLPNRVTYNSFIRYYSSVNEIDKAIEMLRRMEELNHGTPSTSSYTPIIH 710 LEL DDM +R C+PNR+TYNSFIRYY +VNEIDK IEM+RRM+++NHG + SSYTPIIH Sbjct: 305 LELYDDMNKRDCVPNRITYNSFIRYYCAVNEIDKGIEMMRRMQQMNHGLATNSSYTPIIH 364 Query: 709 ALCEAGRALEAQDFIDELVRRGSVPREYTYKLVRXXXXXXXXXXXLNRNLFEQIEEGIEK 530 ALCEAGR LEA+DF+ EL+ GS+PREYTYKLV ++ L ++I +GIE Sbjct: 365 ALCEAGRVLEAKDFLLELISGGSIPREYTYKLVCDTLNSVGAANLIDDELHKRIRDGIES 424 Query: 529 RTAQLMKVKPI 497 R Q+MKVK I Sbjct: 425 RCRQVMKVKQI 435 >gb|EOY16373.1| Pentatricopeptide repeat superfamily protein isoform 1 [Theobroma cacao] Length = 586 Score = 578 bits (1491), Expect = e-162 Identities = 280/427 (65%), Positives = 344/427 (80%), Gaps = 3/427 (0%) Frame = -2 Query: 1780 SYIVSQVFAAIVKNQPFECQVLASASTQHPSSTASIWTTETVTEVLRCVPRFLFRSPRSI 1601 S IV QV + +++N+PF+ Q LAS++T +P WTT+ V+++LR V +F F+SPRSI Sbjct: 12 SAIVHQVLSIMLQNRPFDSQ-LASSTTSNP------WTTDAVSDILRSVSKFFFQSPRSI 64 Query: 1600 GCQKGFRYRGPLKQRKLKIEAFKAQNGILVLGPAAHRDQEKVKLGLSKALEFYYWVETRF 1421 G Q GFR+R PLKQR +K E FK +L+LGPAA+RD ++V LGL KA+EFY WVE F Sbjct: 65 GSQTGFRHRAPLKQRNIKQENFKNYQNVLILGPAAYRDPKRVALGLDKAMEFYIWVENFF 124 Query: 1420 GFVHNERTCREMSLVLAKGNGLKFLWGFLKEMSKR---GIVNTATVTCVVKVLGEEGLVD 1250 GF HNE+TC+EM+ VLAKGN LK 
LW FLK+MS+R G+V T+TVTC++KVLGEEGLV+ Sbjct: 125 GFAHNEKTCKEMAFVLAKGNDLKVLWHFLKDMSRRENSGLVTTSTVTCLIKVLGEEGLVN 184 Query: 1249 EALAAFYRMKQFHCKPDVFAYNTIINALCRVGNFKKAKLLLEQMELPGFRIPPDTFTYTI 1070 EALA FYRMKQF CKPDVFAYN II+ALCRVGNF KA+ LLEQMELPGF PPD +TYTI Sbjct: 185 EALACFYRMKQFRCKPDVFAYNMIIHALCRVGNFNKARFLLEQMELPGFICPPDVYTYTI 244 Query: 1069 LISSYCRYSLETGCRKATRRRMWEANHLFRIMLFKGFVPDIVTYNCLINGCCKTYRIGRA 890 LISSYC++S++TGCRKA RRR++EANHLFR+MLFKGFVPD+VTYNCLI+GCCKTYRI RA Sbjct: 245 LISSYCKFSMQTGCRKAIRRRLYEANHLFRLMLFKGFVPDVVTYNCLIDGCCKTYRIERA 304 Query: 889 LELLDDMLRRGCLPNRVTYNSFIRYYSSVNEIDKAIEMLRRMEELNHGTPSTSSYTPIIH 710 LEL DDM +R C+PNR+TYNSFIRYY +VNEIDK IEM+RRM+++NHG + SSYTPIIH Sbjct: 305 LELYDDMNKRDCVPNRITYNSFIRYYCAVNEIDKGIEMMRRMQQMNHGLATNSSYTPIIH 364 Query: 709 ALCEAGRALEAQDFIDELVRRGSVPREYTYKLVRXXXXXXXXXXXLNRNLFEQIEEGIEK 530 ALCEAGR LEA+DF+ EL+ GS+PREYTYKLV ++ L ++I +GIE Sbjct: 365 ALCEAGRVLEAKDFLLELISGGSIPREYTYKLVCDTLNSVGAANLIDDELHKRIRDGIES 424 Query: 529 RTAQLMK 509 R Q+MK Sbjct: 425 RCRQVMK 431 >ref|XP_002279168.1| PREDICTED: pentatricopeptide repeat-containing protein At1g77405-like [Vitis vinifera] Length = 432 Score = 570 bits (1468), Expect = e-159 Identities = 287/434 (66%), Positives = 338/434 (77%), Gaps = 3/434 (0%) Frame = -2 Query: 1786 NKSYIVSQVFAAIVKNQPFECQVLASASTQHPSSTASIWTTETVTEVLRCVPRFLFRSPR 1607 ++S +V QV AA+V+N P + + S P WTT++V+EVLR +PR F+SPR Sbjct: 10 HRSSLVKQVLAAMVQNCPLDAS--PNKSCNQP------WTTDSVSEVLRSIPRLFFQSPR 61 Query: 1606 SIGCQKGFRYRGPLKQRKLKIEAFKAQNGILVLGPAAHRDQEKVKLGLSKALEFYYWVET 1427 SIG QKGFR+R PLKQR L E K +RD KVKLG+ KA+EFY WVET Sbjct: 62 SIGRQKGFRHRSPLKQRNLYQEPNKFHR---------YRDPHKVKLGVEKAMEFYSWVET 112 Query: 1426 RFGFVHNERTCREMSLVLAKGNGLKFLWGFLKEMSKRG---IVNTATVTCVVKVLGEEGL 1256 +FGF HNE TCREM VLA+GN LK LW FL EM+++G +V TAT+TC++KVLGEEGL Sbjct: 113 QFGFSHNEMTCREMGCVLARGNRLKVLWEFLHEMARKGGNGVVTTATITCLMKVLGEEGL 172 Query: 1255 VDEALAAFYRMKQFHCKPDVFAYNTIINALCRVGNFKKAKLLLEQMELPGFRIPPDTFTY 1076 ++ALAAFYRMKQFHCKPDV+AYNTII ALCRVGNF+KA+ LLEQMELPGFR PPD+FTY 
Sbjct: 173 ANQALAAFYRMKQFHCKPDVYAYNTIIYALCRVGNFRKARFLLEQMELPGFRCPPDSFTY 232 Query: 1075 TILISSYCRYSLETGCRKATRRRMWEANHLFRIMLFKGFVPDIVTYNCLINGCCKTYRIG 896 TILI SYC+YSL+TGCRKA RRR+WEANHLFRIMLFKGFVPD+VTYNCLI+GCCKTYRI Sbjct: 233 TILIGSYCKYSLQTGCRKAVRRRLWEANHLFRIMLFKGFVPDVVTYNCLIDGCCKTYRIE 292 Query: 895 RALELLDDMLRRGCLPNRVTYNSFIRYYSSVNEIDKAIEMLRRMEELNHGTPSTSSYTPI 716 RALEL DDM +RGC+PNRVTYNSFIRYYS+VNEIDKA++ML +M+E+NHG P+TSSYTPI Sbjct: 293 RALELFDDMNKRGCVPNRVTYNSFIRYYSAVNEIDKAVDMLCKMKEMNHGIPTTSSYTPI 352 Query: 715 IHALCEAGRALEAQDFIDELVRRGSVPREYTYKLVRXXXXXXXXXXXLNRNLFEQIEEGI 536 IHALCE GR LEA+DF+ ELV GSVPREYTYK+V L L +IE GI Sbjct: 353 IHALCETGRILEARDFLIELVDGGSVPREYTYKVVCDSLRSAGEANMLGDELRGRIENGI 412 Query: 535 EKRTAQLMKVKPIV 494 E R Q+MKVK I+ Sbjct: 413 ENRYKQVMKVKLIM 426 >ref|XP_002520332.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223540551|gb|EEF42118.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 441 Score = 558 bits (1439), Expect = e-156 Identities = 269/433 (62%), Positives = 340/433 (78%), Gaps = 6/433 (1%) Frame = -2 Query: 1774 IVSQVFAAIVKNQPFECQVLASASTQHPSSTASIWTTETVTEVLRCVPRFLFRSPRSIGC 1595 +++QV + +++++PF+ Q+ +S +T S+ ++ +++VLR +PRF F+S RS+G Sbjct: 10 LINQVISLMIQHRPFDIQLASSTTT-------SLLSSNLISDVLRSIPRFFFQSTRSVGR 62 Query: 1594 QKGFRYRGPLKQRKLKIEAFKAQNGILVLGPAAHRDQEKVKLGLSKALEFYYWVETRFGF 1415 Q R+R PLKQR LK E K N +L+LGPAA++D ++VKLG+ KA+EF+YWVET F Sbjct: 63 QSTTRHRSPLKQRSLKQETHKHNNKLLILGPAAYKDPKRVKLGVFKAMEFFYWVETNCDF 122 Query: 1414 VHNERTCREMSLVLAKGNGLKFLWGFLKEMSKRGI------VNTATVTCVVKVLGEEGLV 1253 +H E TCREM VLA+ N L LW FL+EM+KR + V T VTC++KVLGEEGLV Sbjct: 123 IHTESTCREMGFVLARANRLDKLWNFLQEMAKREVFDGRKLVTTNAVTCLIKVLGEEGLV 182 Query: 1252 DEALAAFYRMKQFHCKPDVFAYNTIINALCRVGNFKKAKLLLEQMELPGFRIPPDTFTYT 1073 EAL+ FYRMK++HCKPDV+AYNTII ALCR+GNFKKA+ LLEQMELPGF PPDTFTYT Sbjct: 183 KEALSLFYRMKKYHCKPDVYAYNTIIYALCRIGNFKKARYLLEQMELPGFYCPPDTFTYT 242 Query: 1072 
ILISSYCRYSLETGCRKATRRRMWEANHLFRIMLFKGFVPDIVTYNCLINGCCKTYRIGR 893 I+ISSYC+YSL+TGCRKA RRR+WEANHLFRIMLFKGF PD+VTYNCLI+GCCKTYRI R Sbjct: 243 IMISSYCKYSLQTGCRKAIRRRLWEANHLFRIMLFKGFAPDVVTYNCLIDGCCKTYRIER 302 Query: 892 ALELLDDMLRRGCLPNRVTYNSFIRYYSSVNEIDKAIEMLRRMEELNHGTPSTSSYTPII 713 ALEL +DM RRGC+PNRVTYNSFIRYYS+VNEIDKA+EMLRRM+ +NHG ++SSYTPII Sbjct: 303 ALELFEDMNRRGCVPNRVTYNSFIRYYSAVNEIDKAVEMLRRMQNMNHGLATSSSYTPII 362 Query: 712 HALCEAGRALEAQDFIDELVRRGSVPREYTYKLVRXXXXXXXXXXXLNRNLFEQIEEGIE 533 HALCEA R LEA+DF+ ELV GS+PREYTYKLV ++ +I++GI+ Sbjct: 363 HALCEADRVLEARDFLLELVDGGSIPREYTYKLVCETMKSAREVNLIDDEFHNRIKDGID 422 Query: 532 KRTAQLMKVKPIV 494 +R Q+ KVKPI+ Sbjct: 423 ERFRQVKKVKPIM 435 >ref|XP_006386364.1| hypothetical protein POPTR_0002s08080g [Populus trichocarpa] gi|550344529|gb|ERP64161.1| hypothetical protein POPTR_0002s08080g [Populus trichocarpa] Length = 448 Score = 548 bits (1412), Expect = e-153 Identities = 267/435 (61%), Positives = 345/435 (79%), Gaps = 6/435 (1%) Frame = -2 Query: 1774 IVSQVFAAIVKNQPFECQVLASASTQHPSSTASIWTTETVTEVLRCVPRFLFRSPRSIGC 1595 +++QV + +++N+PF+ ++ ++ST +P+ + T ++V+++LR +PRF F SPRSIG Sbjct: 16 VINQVLSIMIQNRPFDTKL--ASSTTNPN----LLTIDSVSDILRSIPRFFFLSPRSIGR 69 Query: 1594 QKGFRYRGPLKQRKLKIEAFKAQNGILVLGPAAHRDQEKVKLGLSKALEFYYWVETRFGF 1415 Q +R PLKQRKLK E K+++ +L+LGPAA+RD ++V LG++KA+EF+YW+E F F Sbjct: 70 QNTAFHRSPLKQRKLKEETHKSRHNVLILGPAAYRDPKRVALGVNKAVEFFYWLENHFSF 129 Query: 1414 VHNERTCREMSLVLAKGNGLKFLWGFLKEMSKR------GIVNTATVTCVVKVLGEEGLV 1253 H E TCREM++VLA+G+ L LW FLKEM++R G+V+ +TVTC++KVLGEEGLV Sbjct: 130 KHTEITCREMAVVLARGSKLDELWHFLKEMAQREHGNCLGLVSVSTVTCLIKVLGEEGLV 189 Query: 1252 DEALAAFYRMKQFHCKPDVFAYNTIINALCRVGNFKKAKLLLEQMELPGFRIPPDTFTYT 1073 +ALA FYRMKQ+H KPDV+AYNT+I ALCRVGNFK+A+ LLEQMELPGFR PPD +TYT Sbjct: 190 HQALALFYRMKQYHLKPDVYAYNTLIYALCRVGNFKRARFLLEQMELPGFRCPPDIYTYT 249 Query: 1072 ILISSYCRYSLETGCRKATRRRMWEANHLFRIMLFKGFVPDIVTYNCLINGCCKTYRIGR 893 ILISS CRY L+TGCRKA RRR+WEAN LFRIMLFKGFVPD+VTYNCLINGCCKT RI R Sbjct: 250 
ILISSCCRYGLQTGCRKAIRRRIWEANRLFRIMLFKGFVPDVVTYNCLINGCCKTNRIER 309 Query: 892 ALELLDDMLRRGCLPNRVTYNSFIRYYSSVNEIDKAIEMLRRMEELNHGTPSTSSYTPII 713 ALEL +DM +RGC+PNRVTYNSFIRY+S VNEIDKA+EMLRRM+++NHG +TSSYTPII Sbjct: 310 ALELFEDMNKRGCVPNRVTYNSFIRYFSVVNEIDKAVEMLRRMQKMNHGLATTSSYTPII 369 Query: 712 HALCEAGRALEAQDFIDELVRRGSVPREYTYKLVRXXXXXXXXXXXLNRNLFEQIEEGIE 533 HALCEAGR +EA+DF+ ELV G +PREYTY+LV L ++I++GIE Sbjct: 370 HALCEAGRVVEARDFLVELVDGGLIPREYTYRLVCDALKSVREGSSLGGEFDKRIKDGIE 429 Query: 532 KRTAQLMKVKPIVNC 488 R ++ VKPI+ C Sbjct: 430 DRYRKVKNVKPIMAC 444 >ref|XP_002887681.1| binding protein [Arabidopsis lyrata subsp. lyrata] gi|297333522|gb|EFH63940.1| binding protein [Arabidopsis lyrata subsp. lyrata] Length = 459 Score = 548 bits (1411), Expect = e-153 Identities = 261/431 (60%), Positives = 338/431 (78%), Gaps = 4/431 (0%) Frame = -2 Query: 1774 IVSQVFAAIVKNQPFECQVLASASTQHPSSTASIWTTETVTEVLRCVPRFLFRSPRSIGC 1595 I+ Q+ AA+++N+PF+ VLAS++ +P WT + V++VLR +PRF F SPRSIG Sbjct: 11 IIDQLIAAMIQNRPFDA-VLASSTVANP------WTQQLVSDVLRSIPRFFFISPRSIGR 63 Query: 1594 QKGFRYRGPLKQRKLKIEAFKAQNGILVLGPAAHRDQEKVKLGLSKALEFYYWVETRFGF 1415 QKGFR+R PLKQR L E+ + ++ +LVLGP A+ D +K+ LGL KALEF++W+E FGF Sbjct: 64 QKGFRHRSPLKQRNLSDESQRRRSEVLVLGPGAYIDPKKISLGLQKALEFFFWIEIHFGF 123 Query: 1414 VHNERTCREMSLVLAKGNGLKFLWGFLKEMSKR----GIVNTATVTCVVKVLGEEGLVDE 1247 HNE TCR+M+ +LAKGN K LW FL+++S+R +V TA++TC++K LGEEG V E Sbjct: 124 GHNEITCRDMACLLAKGNDFKGLWDFLRQVSRRENGKNVVTTASITCLMKCLGEEGFVKE 183 Query: 1246 ALAAFYRMKQFHCKPDVFAYNTIINALCRVGNFKKAKLLLEQMELPGFRIPPDTFTYTIL 1067 ALA FYRMK++HCKPDV+AYNTIINALCRVGNFKKA+ LL+QM+LPGFR PPDT+TYTIL Sbjct: 184 ALATFYRMKEYHCKPDVYAYNTIINALCRVGNFKKARFLLDQMQLPGFRYPPDTYTYTIL 243 Query: 1066 ISSYCRYSLETGCRKATRRRMWEANHLFRIMLFKGFVPDIVTYNCLINGCCKTYRIGRAL 887 ISSYCRY ++TGCRKA RRRMWEAN +FR MLF+GFVPD+VTYNCLI+GCCKT RIGRAL Sbjct: 244 ISSYCRYGMQTGCRKAIRRRMWEANRMFREMLFRGFVPDVVTYNCLIDGCCKTNRIGRAL 303 Query: 886 ELLDDMLRRGCLPNRVTYNSFIRYYSSVNEIDKAIEMLRRMEELNHGTPSTSSYTPIIHA 707 EL +DM +GC+PN+VTYNSFIRYYS NEI+ 
AIEM+R M+++ HG P +S+YTP+IHA Sbjct: 304 ELFEDMKTKGCVPNQVTYNSFIRYYSVTNEIEGAIEMMRTMKKMGHGVPGSSTYTPLIHA 363 Query: 706 LCEAGRALEAQDFIDELVRRGSVPREYTYKLVRXXXXXXXXXXXLNRNLFEQIEEGIEKR 527 L E RA EA+D + E+V G VPREYTYKLV L+ L +++ EGI++R Sbjct: 364 LVETRRAAEARDLLVEMVEAGLVPREYTYKLVWDALSSEGMAGTLDEELHKRMREGIQQR 423 Query: 526 TAQLMKVKPIV 494 ++MK+KP++ Sbjct: 424 YRRVMKIKPVM 434 >ref|NP_177865.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|122244095|sp|Q1PFC5.1|PP130_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At1g77405 gi|91806103|gb|ABE65780.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|332197853|gb|AEE35974.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 458 Score = 547 bits (1409), Expect = e-153 Identities = 262/429 (61%), Positives = 335/429 (78%), Gaps = 4/429 (0%) Frame = -2 Query: 1774 IVSQVFAAIVKNQPFECQVLASASTQHPSSTASIWTTETVTEVLRCVPRFLFRSPRSIGC 1595 IV Q+ A+++N+PF+ VLAS++ P WT + V++VL +PRF F SPRSIG Sbjct: 11 IVDQLITAMIQNRPFDA-VLASSTVAKP------WTQQLVSDVLHSIPRFFFISPRSIGR 63 Query: 1594 QKGFRYRGPLKQRKLKIEAFKAQNGILVLGPAAHRDQEKVKLGLSKALEFYYWVETRFGF 1415 QKGFR+R PLKQR L E+ + ++ +LVLGP A+ D +KV +GL KALEF++W+ET FGF Sbjct: 64 QKGFRHRSPLKQRNLSDESQRRRSEVLVLGPGAYMDPKKVSIGLQKALEFFFWIETHFGF 123 Query: 1414 VHNERTCREMSLVLAKGNGLKFLWGFLKEMSKR----GIVNTATVTCVVKVLGEEGLVDE 1247 HNE TCR+M+ +LAKGN K LW FL+++S+R +V TA++TC++K LGEEG V E Sbjct: 124 DHNEITCRDMACLLAKGNDFKGLWDFLRQVSRRENGKNVVTTASITCLMKCLGEEGFVKE 183 Query: 1246 ALAAFYRMKQFHCKPDVFAYNTIINALCRVGNFKKAKLLLEQMELPGFRIPPDTFTYTIL 1067 ALA FYRMK++HCKPDV+AYNTIINALCRVGNFKKA+ LL+QM+LPGFR PPDT+TYTIL Sbjct: 184 ALATFYRMKEYHCKPDVYAYNTIINALCRVGNFKKARFLLDQMQLPGFRYPPDTYTYTIL 243 Query: 1066 ISSYCRYSLETGCRKATRRRMWEANHLFRIMLFKGFVPDIVTYNCLINGCCKTYRIGRAL 887 ISSYCRY ++TGCRKA RRRMWEAN +FR MLF+GFVPD+VTYNCLI+GCCKT RIGRAL Sbjct: 244 ISSYCRYGMQTGCRKAIRRRMWEANRMFREMLFRGFVPDVVTYNCLIDGCCKTNRIGRAL 303 Query: 886 
ELLDDMLRRGCLPNRVTYNSFIRYYSSVNEIDKAIEMLRRMEELNHGTPSTSSYTPIIHA 707 EL +DM +GC+PN+VTYNSFIRYYS NEI+ AIEM+R M++L HG P +S+YTP+IHA Sbjct: 304 ELFEDMKTKGCVPNQVTYNSFIRYYSVTNEIEGAIEMMRTMKKLGHGVPGSSTYTPLIHA 363 Query: 706 LCEAGRALEAQDFIDELVRRGSVPREYTYKLVRXXXXXXXXXXXLNRNLFEQIEEGIEKR 527 L E RA EA+D + E+V G VPREYTYKLV L+ L +++ EGI++R Sbjct: 364 LVETRRAAEARDLVVEMVEAGLVPREYTYKLVCDALSSEGLASTLDEELHKRMREGIQQR 423 Query: 526 TAQLMKVKP 500 +++MK+KP Sbjct: 424 YSRVMKIKP 432 >ref|XP_006390093.1| hypothetical protein EUTSA_v10019853mg [Eutrema salsugineum] gi|557086527|gb|ESQ27379.1| hypothetical protein EUTSA_v10019853mg [Eutrema salsugineum] Length = 451 Score = 544 bits (1401), Expect = e-152 Identities = 256/431 (59%), Positives = 336/431 (77%), Gaps = 4/431 (0%) Frame = -2 Query: 1774 IVSQVFAAIVKNQPFECQVLASASTQHPSSTASIWTTETVTEVLRCVPRFLFRSPRSIGC 1595 I+ Q+ A+++N+PF+ VLAS + P W+ + V++VLR +PRF F SPRSIG Sbjct: 11 IIDQLITAMIQNRPFDA-VLASVTVSSP------WSQQIVSDVLRSIPRFFFISPRSIGR 63 Query: 1594 QKGFRYRGPLKQRKLKIEAFKAQNGILVLGPAAHRDQEKVKLGLSKALEFYYWVETRFGF 1415 QKGFR+R PLKQR L+ E+ + ++ +LV+GP A+ D +KV LGL KA+EF++WVET FGF Sbjct: 64 QKGFRHRSPLKQRNLRDESQRRRSEVLVMGPGAYMDPKKVSLGLQKAIEFFFWVETHFGF 123 Query: 1414 VHNERTCREMSLVLAKGNGLKFLWGFLKEMSKR----GIVNTATVTCVVKVLGEEGLVDE 1247 HNE TCR+M+ +LAKGN K LW FL+++S+R +V TA++TC++K LGEEG V E Sbjct: 124 DHNEITCRDMACLLAKGNDFKGLWDFLRQVSRRENGRNVVTTASITCLMKCLGEEGFVKE 183 Query: 1246 ALAAFYRMKQFHCKPDVFAYNTIINALCRVGNFKKAKLLLEQMELPGFRIPPDTFTYTIL 1067 ALA FYRMK++HCKPDV+AYNTIIN+LCRVGNFKKA+ LL+QM+LPGFR PPDT+TYTI Sbjct: 184 ALATFYRMKEYHCKPDVYAYNTIINSLCRVGNFKKARFLLDQMQLPGFRYPPDTYTYTIF 243 Query: 1066 ISSYCRYSLETGCRKATRRRMWEANHLFRIMLFKGFVPDIVTYNCLINGCCKTYRIGRAL 887 ISSYC+Y ++TGCRKA RRRMWEAN +FR MLF+GFVPD+VTYNCLI+GCCKT RIGRAL Sbjct: 244 ISSYCKYGMQTGCRKAIRRRMWEANRMFREMLFRGFVPDVVTYNCLIDGCCKTNRIGRAL 303 Query: 886 ELLDDMLRRGCLPNRVTYNSFIRYYSSVNEIDKAIEMLRRMEELNHGTPSTSSYTPIIHA 707 EL DDM +R C+PN+VTYNSF+RYYS NEI++A++M+R M+++ HG P +S+YTP+IHA Sbjct: 304 
ELFDDMKKRECVPNQVTYNSFVRYYSVTNEIERAVDMMRTMKKMGHGVPGSSTYTPLIHA 363 Query: 706 LCEAGRALEAQDFIDELVRRGSVPREYTYKLVRXXXXXXXXXXXLNRNLFEQIEEGIEKR 527 L E RA EA+D + E+V G VPREYTYK+V L+ L ++I EGIE R Sbjct: 364 LVETRRAAEARDLMVEMVEAGLVPREYTYKVVLDALSSEGMSSTLDEELHKRIREGIEHR 423 Query: 526 TAQLMKVKPIV 494 ++MK+KP++ Sbjct: 424 YRRVMKIKPVM 434 >ref|XP_006302269.1| hypothetical protein CARUB_v10020312mg [Capsella rubella] gi|482570979|gb|EOA35167.1| hypothetical protein CARUB_v10020312mg [Capsella rubella] Length = 438 Score = 543 bits (1400), Expect = e-152 Identities = 257/431 (59%), Positives = 337/431 (78%), Gaps = 4/431 (0%) Frame = -2 Query: 1774 IVSQVFAAIVKNQPFECQVLASASTQHPSSTASIWTTETVTEVLRCVPRFLFRSPRSIGC 1595 I+ Q+ AA+++N+PF+ +LAS++ +P W+ + V++VLR +PRF F SPRSIG Sbjct: 9 IIDQLIAAMIQNRPFDA-LLASSTVANP------WSQQLVSDVLRSIPRFFFISPRSIGR 61 Query: 1594 QKGFRYRGPLKQRKLKIEAFKAQNGILVLGPAAHRDQEKVKLGLSKALEFYYWVETRFGF 1415 QKGFR+R PLKQR L+ E+ + ++ +LVLGPAA+ D +KV LGL KA+EF++W+ET FGF Sbjct: 62 QKGFRHRSPLKQRNLREESQRRRSEVLVLGPAAYMDPKKVSLGLQKAMEFFFWIETHFGF 121 Query: 1414 VHNERTCREMSLVLAKGNGLKFLWGFLKEMSKR----GIVNTATVTCVVKVLGEEGLVDE 1247 HNE TCR+M+ +LAKGN K LW FL+++S+R +V TA++TC++K LGEEG V E Sbjct: 122 DHNEVTCRDMACLLAKGNDFKGLWDFLRQISRRENGQNVVTTASITCLMKCLGEEGFVKE 181 Query: 1246 ALAAFYRMKQFHCKPDVFAYNTIINALCRVGNFKKAKLLLEQMELPGFRIPPDTFTYTIL 1067 ALA FYRMK++HCKPDV+AYNTIINALCRVGNFKKA+ LL+QM+LPGFR PPD +TYTIL Sbjct: 182 ALATFYRMKEYHCKPDVYAYNTIINALCRVGNFKKARFLLDQMQLPGFRYPPDVYTYTIL 241 Query: 1066 ISSYCRYSLETGCRKATRRRMWEANHLFRIMLFKGFVPDIVTYNCLINGCCKTYRIGRAL 887 I SYC+Y L+TGCRKA RRRMWEAN +FR MLF+GFVPD+VTYNCLI+GCCKT RI RAL Sbjct: 242 IGSYCKYGLQTGCRKAIRRRMWEANRMFREMLFRGFVPDVVTYNCLIDGCCKTNRIARAL 301 Query: 886 ELLDDMLRRGCLPNRVTYNSFIRYYSSVNEIDKAIEMLRRMEELNHGTPSTSSYTPIIHA 707 EL +DM +RGC+PN+VTYNSFIRYYS NEI+ AIEM+R M+++ HG P S+YTP+IHA Sbjct: 302 ELFEDMNKRGCVPNQVTYNSFIRYYSVTNEIEHAIEMMRTMKKMGHGVPGASTYTPLIHA 361 Query: 706 LCEAGRALEAQDFIDELVRRGSVPREYTYKLVRXXXXXXXXXXXLNRNLFEQIEEGIEKR 527 L E R EA+D + E++ G 
+PREYTYKLV L+ L +++ EGIE+R Sbjct: 362 LVETRRVAEARDLVVEMLEAGWIPREYTYKLVLDALSSEGMVGTLDEELHKRMREGIEQR 421 Query: 526 TAQLMKVKPIV 494 ++M++KPI+ Sbjct: 422 YRRVMRIKPIM 432 >ref|XP_006857793.1| hypothetical protein AMTR_s00061p00214600 [Amborella trichopoda] gi|548861889|gb|ERN19260.1| hypothetical protein AMTR_s00061p00214600 [Amborella trichopoda] Length = 476 Score = 524 bits (1349), Expect = e-146 Identities = 257/429 (59%), Positives = 324/429 (75%), Gaps = 2/429 (0%) Frame = -2 Query: 1774 IVSQVFAAIVKNQPFECQVLASASTQHPSSTASIWTTETVTEVLRCVPRFLFRSPRSIGC 1595 IV+ V +V+NQ + L W+ + V+EVLR +PR+ F+S RS+GC Sbjct: 59 IVAPVIEVLVQNQGLDALRLNQ------------WSVDLVSEVLRAIPRYFFQSERSLGC 106 Query: 1594 QKGFRYRGPLKQRKLKIEAFKAQNGILVLGPAAHRDQEKVKLGLSKALEFYYWVETRFGF 1415 QKGFR R PL+QR L E ++ G+ + GPAA+R+ +KV+ G++KAL F++W+E+ GF Sbjct: 107 QKGFRRRAPLRQRNLYQETEDSKLGLRIRGPAAYRNPKKVEEGVNKALAFFFWLESEGGF 166 Query: 1414 VHNERTCREMSLVLAKGNGLKFLWGFLKEMSKRG--IVNTATVTCVVKVLGEEGLVDEAL 1241 H E TCREM+ LAKGN LK LW FL EM ++G +VNT TVTCV+K+LGEEGLV+EAL Sbjct: 167 QHTEITCREMACTLAKGNSLKILWKFLHEMHRKGAGLVNTVTVTCVIKILGEEGLVNEAL 226 Query: 1240 AAFYRMKQFHCKPDVFAYNTIINALCRVGNFKKAKLLLEQMELPGFRIPPDTFTYTILIS 1061 AFYRMKQFHCKPDV AYN II LCRV NFKKAK L+ QMELPG R PPDTFTYTI+I+ Sbjct: 227 GAFYRMKQFHCKPDVVAYNAIICVLCRVCNFKKAKFLMGQMELPGSRCPPDTFTYTIMIN 286 Query: 1060 SYCRYSLETGCRKATRRRMWEANHLFRIMLFKGFVPDIVTYNCLINGCCKTYRIGRALEL 881 YC+Y+++TGC KA RRR+WEANHLFR+M+FKGF PD+VTYNCLI+G CKTYRIGRALEL Sbjct: 287 FYCKYAMQTGCSKAIRRRLWEANHLFRLMVFKGFKPDVVTYNCLIDGLCKTYRIGRALEL 346 Query: 880 LDDMLRRGCLPNRVTYNSFIRYYSSVNEIDKAIEMLRRMEELNHGTPSTSSYTPIIHALC 701 L+DML++ C PN++TYNSFIR+YS+VN+IDK+I+MLR M +HG P++SSYTPIIHALC Sbjct: 347 LNDMLQK-CSPNKITYNSFIRFYSAVNDIDKSIKMLRDMISRDHGVPTSSSYTPIIHALC 405 Query: 700 EAGRALEAQDFIDELVRRGSVPREYTYKLVRXXXXXXXXXXXLNRNLFEQIEEGIEKRTA 521 E GR LEA+DF+ E+V RGS+PR YTYKLVR + L +IEEGI KR Sbjct: 406 EGGRVLEARDFLVEMVERGSIPRAYTYKLVRDALLLASEDAFMPSELCMRIEEGIRKRYE 465 Query: 520 QLMKVKPIV 494 + KVKP++ Sbjct: 466 YVKKVKPVM 474 
>gb|ESW33688.1| hypothetical protein PHAVU_001G090600g [Phaseolus vulgaris] gi|561035159|gb|ESW33689.1| hypothetical protein PHAVU_001G090600g [Phaseolus vulgaris] Length = 445 Score = 503 bits (1296), Expect = e-140 Identities = 253/430 (58%), Positives = 313/430 (72%), Gaps = 2/430 (0%) Frame = -2 Query: 1777 YIVSQVFAAIVKNQPFECQVLASASTQHPSSTASIWTTETVTEVLRCVPRFLFRSPRSIG 1598 ++ +QV ++K+ PF+ A PS + + WT + VTEVLR + R+ +SPRSIG Sbjct: 13 HLANQVLVLVIKDLPFD------AHPPQPSPSGAPWTNDAVTEVLRSISRYTLQSPRSIG 66 Query: 1597 CQKGFRYRGPLKQRKLKIEAFKAQNGILVLGPAAHRDQEKVKLGLSKALEFYYWVETRFG 1418 Q GFR+R PL+QR L +E K +N LVLGPAAH+D KV LG KALEF+ WVE RF Sbjct: 67 RQSGFRHRTPLRQRNLNLEHHKLRNNTLVLGPAAHQDPYKVHLGPLKALEFFRWVEARFA 126 Query: 1417 FVHNERTCREMSLVLAKGNGLKFLWGFLKEMSKRGIVNTATVTCVVKVLGEEGLVDEALA 1238 F H+E TCRE++ +LA+ + +K LW FLK+ V TATVTCV+K+LGE+GL DEAL Sbjct: 127 FSHSEATCRELACLLARASTIKPLWNFLKQYPH---VTTATVTCVIKLLGEQGLADEALL 183 Query: 1237 AFYRMKQFHCKPDVFAYNTIINALCRVGNFKKAKLLLEQMELPGFRIPPDTFTYTILISS 1058 F+RMKQF CKPD +YN +I+ALC VGNF KA+ LL+QMELPGF+ PPDTFTYTILISS Sbjct: 184 TFHRMKQFRCKPDTHSYNALIHALCCVGNFTKARSLLQQMELPGFQCPPDTFTYTILISS 243 Query: 1057 YCRYSLETGCRKATRRRMWEANHLFRIMLFKGFVPDIVTYNCLINGCCKTYRIGRALELL 878 YCR+ TGCRKATRRR++EA LFR +LFKG VPD+VTYN LI+GCCKT+R+ RALELL Sbjct: 244 YCRHGRLTGCRKATRRRIYEAGRLFRSLLFKGLVPDVVTYNALIDGCCKTWRVERALELL 303 Query: 877 DDMLRRGCLPNRVTYNSFIRYYSSVNEIDKAIEMLRRMEELNHGTPSTSSYTPIIHALCE 698 DDM RG +PN VTY SFIRYYS VNEIDKA+EMLR M+ L +G P +SSYTPIIHALCE Sbjct: 304 DDMKTRGVVPNHVTYGSFIRYYSVVNEIDKAVEMLREMQRLGYGVPGSSSYTPIIHALCE 363 Query: 697 AGRALEAQDFIDELVRRGSVPREYTYKLV--RXXXXXXXXXXXLNRNLFEQIEEGIEKRT 524 AGR +EA F+ ELV GSVPREYTY LV + ++I++GI R Sbjct: 364 AGRVVEAWGFLVELVDSGSVPREYTYGLVCDALRAAGECGLLLEEGGVHKRIKDGIWNRY 423 Query: 523 AQLMKVKPIV 494 Q+MKVKP++ Sbjct: 424 RQMMKVKPVM 433 >gb|EPS59883.1| binding protein, partial [Genlisea aurea] Length = 414 Score = 502 bits (1293), Expect = e-139 Identities = 247/425 (58%), Positives = 315/425 (74%), Gaps = 4/425 (0%) Frame 
= -2 Query: 1756 AAIVKNQPFECQVLASASTQHPSSTASIWTTETVTEVLRCVPRFLFRSPRSIGCQKGFRY 1577 AAI++N+PF+ S+ IWT++ V +VLR +P LF+SPRSIG Q+ FR+ Sbjct: 2 AAIIQNKPFD------------SAARGIWTSDAVIQVLRSIPLHLFQSPRSIGRQRTFRH 49 Query: 1576 RGPLKQRKLKIEAFKAQNGILVLGPAAHRDQEKVKLGLSKALEFYYWVETRFGFVHNERT 1397 R PLKQR LK E + Q G LVLGPAAHR+ + V LGL KALEFYYW+E+ F H+E T Sbjct: 50 RSPLKQRNLKEETARRQTGSLVLGPAAHRNPKSVNLGLEKALEFYYWLESSSRFRHDEAT 109 Query: 1396 CREMSLVLAKGNGLKFLWGFLKEMSKR----GIVNTATVTCVVKVLGEEGLVDEALAAFY 1229 C+EM+L+L KGN + LW FLK MSKR +V T T+T ++K LGEEG+ +EA +AFY Sbjct: 110 CKEMALILVKGNRMNLLWDFLKNMSKRHKAGSLVTTPTMTSLIKALGEEGMANEAASAFY 169 Query: 1228 RMKQFHCKPDVFAYNTIINALCRVGNFKKAKLLLEQMELPGFRIPPDTFTYTILISSYCR 1049 RMKQF CKPDV AYN +I+ALCRVG F +A+ LL +MELPGFR PPD +TYTI+I+SYCR Sbjct: 170 RMKQFSCKPDVCAYNNLIHALCRVGFFDRARSLLAKMELPGFRCPPDVYTYTIMIASYCR 229 Query: 1048 YSLETGCRKATRRRMWEANHLFRIMLFKGFVPDIVTYNCLINGCCKTYRIGRALELLDDM 869 ++ E G RKA RRR+WEAN LFR+M+FKG PD+VTYN LINGCCKT RI RALELL+DM Sbjct: 230 FAFECGSRKAVRRRIWEANRLFRLMIFKGHEPDVVTYNSLINGCCKTSRIERALELLEDM 289 Query: 868 LRRGCLPNRVTYNSFIRYYSSVNEIDKAIEMLRRMEELNHGTPSTSSYTPIIHALCEAGR 689 +RGCLPNR+TYNSFIRY+S VNE+++A++MLR M+E N G PS+SSYTPI+H+LCEAGR Sbjct: 290 EKRGCLPNRITYNSFIRYFSVVNEVERAVKMLRTMQEKNRGVPSSSSYTPIVHSLCEAGR 349 Query: 688 ALEAQDFIDELVRRGSVPREYTYKLVRXXXXXXXXXXXLNRNLFEQIEEGIEKRTAQLMK 509 EA F++E+V GSVPREYTYKLV ++ NL + +E ++ R + K Sbjct: 350 GGEALSFVEEMVAGGSVPREYTYKLV-----FKSVGGSVDPNLAKLVETKVQDRIRCVRK 404 Query: 508 VKPIV 494 KP++ Sbjct: 405 AKPLM 409