BLASTX nr result
ID: Mentha26_contig00044189
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00044189 (424 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gb|EYU42756.1| hypothetical protein MIMGU_mgv1a024236mg [Mimulus...   168   8e-40
emb|CAN78867.1| hypothetical protein VITISV_041982 [Vitis vinifera]   146   3e-33
ref|XP_006363010.1| PREDICTED: pentatricopeptide repeat-containi...   137   2e-30
ref|XP_006447755.1| hypothetical protein CICLE_v10014182mg [Citr...   135   8e-30
gb|EPS74565.1| hypothetical protein M569_00187, partial [Genlise...   132   4e-29
ref|XP_007214974.1| hypothetical protein PRUPE_ppa001796mg [Prun...   123   2e-26
ref|XP_007049305.1| Pentatricopeptide repeat-containing protein,...   110   3e-22
ref|XP_007049304.1| Pentatricopeptide repeat-containing protein,...   110   3e-22
ref|XP_003633341.1| PREDICTED: pentatricopeptide repeat-containi...   105   6e-21
ref|XP_002531694.1| pentatricopeptide repeat-containing protein,...   101   1e-19
ref|XP_006856878.1| hypothetical protein AMTR_s00055p00197790 [A...   100   2e-19
ref|XP_004295353.1| PREDICTED: pentatricopeptide repeat-containi...   100   2e-19
ref|XP_004163208.1| PREDICTED: pentatricopeptide repeat-containi...    97   2e-18
ref|XP_004149415.1| PREDICTED: pentatricopeptide repeat-containi...    97   2e-18
ref|XP_004243553.1| PREDICTED: pentatricopeptide repeat-containi...    96   7e-18
ref|XP_004139858.1| PREDICTED: pentatricopeptide repeat-containi...    86   4e-15
ref|XP_006838717.1| hypothetical protein AMTR_s00002p00251730 [A...    71   1e-10
ref|XP_002986176.1| hypothetical protein SELMODRAFT_182249 [Sela...    70   2e-10
ref|XP_006828862.1| hypothetical protein AMTR_s00001p00165480 [A...    70   3e-10
ref|XP_006391605.1| hypothetical protein EUTSA_v10023944mg [Eutr...    70   4e-10

>gb|EYU42756.1| hypothetical protein MIMGU_mgv1a024236mg [Mimulus guttatus] Length = 556 Score = 168 bits (425), Expect = 8e-40 Identities = 85/138 (61%), Positives = 103/138 (74%) Frame = -1 Query: 424 YCVLLKGIQKESLIVDEKVAMQHETVQSRILEEREVIVDTFCSLLVRMSEIGCEPSIDTF 245 YCVLL G+QK K+A+QHETV S ++ + VI +TFC+LLV+MSE+GCEP DT+ Sbjct: 332 YCVLLIGMQK-------KMAVQHETVHSHTIDAKCVIFETFCNLLVKMSEMGCEPCTDTY 384 Query: 244 SILIDGLCREGRSHEADQLVAFMQEKGQTPNEAIYCSLLKAYCKNLRIDXXXXXXXXXNS 65 ILI GLCR+GRS+EAD+LV FM+EKG +PNEAIYCSLL AYCKNLR+D Sbjct: 385 YILIAGLCRDGRSYEADKLVEFMEEKGLSPNEAIYCSLLGAYCKNLRVDAALDILNLLTV 444 Query: 64 RGFDLPLSVYAAMISALC 11 RGF PLS+YAA ISALC Sbjct: 445 RGFKPPLSIYAATISALC 462 Score = 63.2 bits (152), Expect = 4e-08 Identities = 35/101 (34%), Positives = 51/101 (50%) Frame = -1 Query: 313 VDTFCSLLVRMSEIGCEPSIDTFSILIDGLCREGRSHEADQLVAFMQEKGQTPNEAIYCS 134 VD L RM EIGC P+I+ ++ +++G +EGR EA++L + M E G PN Y + Sbjct: 170 VDDALILFKRMQEIGCRPNIEVYNAVLNGFSKEGRLSEAEKLCSKMAECGLLPNVITYST 229 Query: 133 LLKAYCKNLRIDXXXXXXXXXNSRGFDLPLSVYAAMISALC 11 L+ CKN ID R L Y+++I C Sbjct: 230 LIDGLCKNGGIDLALKIFHEMERRNCLPNLYTYSSLIYGQC 270 >emb|CAN78867.1| hypothetical protein VITISV_041982 [Vitis vinifera] Length = 962 Score = 146 bits (369), Expect = 3e-33 Identities = 73/138 (52%), Positives = 98/138 (71%) Frame = -1 Query: 424 YCVLLKGIQKESLIVDEKVAMQHETVQSRILEEREVIVDTFCSLLVRMSEIGCEPSIDTF 245 Y VLLKG+QKE L+++EKVA+QHE V S E++V + +LL RMSEIGCEP++DT+ Sbjct: 733 YSVLLKGLQKECLLLEEKVAVQHEAVYSFSPHEKDVNFEIVSNLLARMSEIGCEPTLDTY 792 Query: 244 SILIDGLCREGRSHEADQLVAFMQEKGQTPNEAIYCSLLKAYCKNLRIDXXXXXXXXXNS 65 S L+ GLCR+GR +EA+QLV M+E+G P+ IY SLL A+CKNL +D + Sbjct: 793 STLVSGLCRKGRFYEAEQLVKDMKERGFCPDREIYYSLLIAHCKNLEVDHALKIFHSIEA 852 Query: 64 RGFDLPLSVYAAMISALC 11 +GF L LS+Y A+I ALC Sbjct: 853 KGFQLHLSIYRALICALC 870 Score = 64.7 bits (156), Expect = 1e-08 Identities = 34/101 (33%), Positives = 54/101 (53%) Frame = -1 Query: 313 
VDTFCSLLVRMSEIGCEPSIDTFSILIDGLCREGRSHEADQLVAFMQEKGQTPNEAIYCS 134 VD SLL RM E+GC P++++++ +I+GL +E R EA+++ M E+G PN Y + Sbjct: 571 VDIALSLLERMEEMGCNPNVESYNAVINGLSKENRFSEAEKICDKMAEQGLLPNVITYTT 630 Query: 133 LLKAYCKNLRIDXXXXXXXXXNSRGFDLPLSVYAAMISALC 11 L+ C+N R R L Y+++I LC Sbjct: 631 LIDGLCRNGRTQFAFKIFHDMEKRKCLPNLYTYSSLIYGLC 671 >ref|XP_006363010.1| PREDICTED: pentatricopeptide repeat-containing protein At5g65560-like isoform X1 [Solanum tuberosum] gi|565394734|ref|XP_006363011.1| PREDICTED: pentatricopeptide repeat-containing protein At5g65560-like isoform X2 [Solanum tuberosum] gi|565394736|ref|XP_006363012.1| PREDICTED: pentatricopeptide repeat-containing protein At5g65560-like isoform X3 [Solanum tuberosum] Length = 913 Score = 137 bits (344), Expect = 2e-30 Identities = 67/138 (48%), Positives = 94/138 (68%) Frame = -1 Query: 418 VLLKGIQKESLIVDEKVAMQHETVQSRILEEREVIVDTFCSLLVRMSEIGCEPSIDTFSI 239 VLLKG+QKE ++ KV+++ ETV S + +V ++ C+LL RMSEIGCEP+ DT+ Sbjct: 704 VLLKGLQKEHELISGKVSVKRETVYSSTASKNDVSIELLCTLLNRMSEIGCEPNEDTYCT 763 Query: 238 LIDGLCREGRSHEADQLVAFMQEKGQTPNEAIYCSLLKAYCKNLRIDXXXXXXXXXNSRG 59 LI GL R+G+++EADQL+ M+EKG +P A YCSLL +YC NL++D +G Sbjct: 764 LILGLYRDGKTYEADQLIEHMREKGFSPTSAAYCSLLVSYCNNLKVDAALEIFDSLIQQG 823 Query: 58 FDLPLSVYAAMISALCIS 5 F PLS+Y ++I ALC S Sbjct: 824 FRPPLSIYQSLICALCRS 841 Score = 55.5 bits (132), Expect = 8e-06 Identities = 32/106 (30%), Positives = 50/106 (47%) Frame = -1 Query: 325 REVIVDTFCSLLVRMSEIGCEPSIDTFSILIDGLCREGRSHEADQLVAFMQEKGQTPNEA 146 +E VD +LL RM E GC P I+T++ +I+GL ++ R E +L + E PN Sbjct: 536 KEEKVDDALALLKRMEESGCSPGIETYNAIINGLSKKNRLLEVKRLCNKLAESELLPNVI 595 Query: 145 IYCSLLKAYCKNLRIDXXXXXXXXXNSRGFDLPLSVYAAMISALCI 8 Y +L+ C+N R L Y+++I LC+ Sbjct: 596 TYSTLIDGLCRNGETHLAFEILHDMERRNCMPNLYTYSSLIYGLCL 641 >ref|XP_006447755.1| hypothetical protein CICLE_v10014182mg [Citrus clementina] gi|568830449|ref|XP_006469511.1| PREDICTED: pentatricopeptide repeat-containing protein At5g65560-like [Citrus sinensis] 
gi|557550366|gb|ESR60995.1| hypothetical protein CICLE_v10014182mg [Citrus clementina] Length = 929 Score = 135 bits (339), Expect = 8e-30 Identities = 71/138 (51%), Positives = 92/138 (66%) Frame = -1 Query: 424 YCVLLKGIQKESLIVDEKVAMQHETVQSRILEEREVIVDTFCSLLVRMSEIGCEPSIDTF 245 Y VLLKG+QKES I+ EKV Q++ V + ++ C+LL R+ E GCEP++DT+ Sbjct: 690 YGVLLKGLQKESQILTEKVVAQNDVVYGCSSYGKVGNLELMCNLLSRLPEYGCEPTVDTY 749 Query: 244 SILIDGLCREGRSHEADQLVAFMQEKGQTPNEAIYCSLLKAYCKNLRIDXXXXXXXXXNS 65 S LI GLCREGRS+EADQLV M+EKG P+ AIY SLL A+C+NL +D Sbjct: 750 STLICGLCREGRSYEADQLVEIMKEKGFCPDRAIYYSLLVAHCRNLEVDSALEIFNLMGI 809 Query: 64 RGFDLPLSVYAAMISALC 11 G + LS+YAA+ISALC Sbjct: 810 SGLEPHLSIYAALISALC 827 Score = 58.9 bits (141), Expect = 7e-07 Identities = 32/105 (30%), Positives = 52/105 (49%) Frame = -1 Query: 325 REVIVDTFCSLLVRMSEIGCEPSIDTFSILIDGLCREGRSHEADQLVAFMQEKGQTPNEA 146 +E +D SL +M + C P I+T++ +I+GL ++ R EA++L M E+G PN Sbjct: 524 KEGKIDVALSLFEKMEQNNCRPKIETYNAIINGLSKDNRLLEAEKLCGKMAEQGLLPNVI 583 Query: 145 IYCSLLKAYCKNLRIDXXXXXXXXXNSRGFDLPLSVYAAMISALC 11 Y SL+ CKN + + L Y+++I LC Sbjct: 584 TYTSLIDGLCKNGGTNLAFKIFHEMERKNCLPNLHTYSSLIHGLC 628 >gb|EPS74565.1| hypothetical protein M569_00187, partial [Genlisea aurea] Length = 860 Score = 132 bits (333), Expect = 4e-29 Identities = 70/139 (50%), Positives = 89/139 (64%), Gaps = 1/139 (0%) Frame = -1 Query: 424 YCVLLKGIQ-KESLIVDEKVAMQHETVQSRILEEREVIVDTFCSLLVRMSEIGCEPSIDT 248 Y VLLKG+Q +E +V EKVA+Q E+ ++ + +EV DT CSLL RMSEIGC+PS++T Sbjct: 651 YSVLLKGLQIEECEVVVEKVAVQDESTRNHTTDAKEVAFDTICSLLARMSEIGCDPSVET 710 Query: 247 FSILIDGLCREGRSHEADQLVAFMQEKGQTPNEAIYCSLLKAYCKNLRIDXXXXXXXXXN 68 + LI LC G S EAD LV M+EKG P + I+CSLL YC+NL +D N Sbjct: 711 YETLIAHLCHRGGSCEADLLVNMMKEKGLNPTDEIFCSLLSGYCRNLGVDSALKLLDSLN 770 Query: 67 SRGFDLPLSVYAAMISALC 11 GF PLS Y +I ALC Sbjct: 771 ISGFKPPLSTYTEIIHALC 789 Score = 58.5 bits (140), Expect = 9e-07 Identities = 31/95 (32%), Positives = 48/95 (50%) Frame = -1 Query: 295 
LLVRMSEIGCEPSIDTFSILIDGLCREGRSHEADQLVAFMQEKGQTPNEAIYCSLLKAYC 116 L+ RM ++GC P I+ ++ +++GLC R EA +L+ + E G PN Y +L+ C Sbjct: 495 LMGRMQKVGCWPYIEAYNAVLNGLCTTKRLSEAHELLNEILESGLLPNTITYTTLIDGLC 554 Query: 115 KNLRIDXXXXXXXXXNSRGFDLPLSVYAAMISALC 11 KN +D R L Y+A+I LC Sbjct: 555 KNGDMDLAFEVFHDMEKRSCFPNLYTYSALIHGLC 589 >ref|XP_007214974.1| hypothetical protein PRUPE_ppa001796mg [Prunus persica] gi|462411124|gb|EMJ16173.1| hypothetical protein PRUPE_ppa001796mg [Prunus persica] Length = 763 Score = 123 bits (309), Expect = 2e-26 Identities = 65/140 (46%), Positives = 86/140 (61%), Gaps = 3/140 (2%) Frame = -1 Query: 424 YCVLLKGIQKESLIVDEKVA---MQHETVQSRILEEREVIVDTFCSLLVRMSEIGCEPSI 254 Y VL+KG+QKES ++ EKV QHE + S E + C+LL RMSE GCEP++ Sbjct: 531 YAVLVKGLQKESQLLTEKVVGLVAQHEGMYSCSSGESYNFFEALCNLLARMSENGCEPTV 590 Query: 253 DTFSILIDGLCREGRSHEADQLVAFMQEKGQTPNEAIYCSLLKAYCKNLRIDXXXXXXXX 74 DT+ L+ GLC EGR +EADQLV M++KG PN IY SL +C NL+++ Sbjct: 591 DTYGTLVRGLCTEGRYYEADQLVQHMKDKGLCPNRRIYLSLFFVHCTNLKVESALEIFGL 650 Query: 73 XNSRGFDLPLSVYAAMISAL 14 GF++ LS Y A+ISAL Sbjct: 651 MEDNGFEVHLSAYNALISAL 670 Score = 58.5 bits (140), Expect = 9e-07 Identities = 29/72 (40%), Positives = 46/72 (63%) Frame = -1 Query: 313 VDTFCSLLVRMSEIGCEPSIDTFSILIDGLCREGRSHEADQLVAFMQEKGQTPNEAIYCS 134 VDT SL +M E GC PSI+T++ +I+GL ++ + +A++L M+++G PN Y S Sbjct: 387 VDTALSLFEQMEEKGCCPSIETYNAIINGLSKDNQFVKAEKLCKKMEKQGLVPNVITYTS 446 Query: 133 LLKAYCKNLRID 98 L+ CK+ R D Sbjct: 447 LICGLCKSGRTD 458 >ref|XP_007049305.1| Pentatricopeptide repeat-containing protein, putative isoform 2 [Theobroma cacao] gi|590712142|ref|XP_007049306.1| Pentatricopeptide repeat-containing protein, putative isoform 2 [Theobroma cacao] gi|508701566|gb|EOX93462.1| Pentatricopeptide repeat-containing protein, putative isoform 2 [Theobroma cacao] gi|508701567|gb|EOX93463.1| Pentatricopeptide repeat-containing protein, putative isoform 2 [Theobroma cacao] Length = 716 Score = 110 bits (274), Expect = 3e-22 Identities = 59/138 (42%), Positives = 
83/138 (60%) Frame = -1 Query: 424 YCVLLKGIQKESLIVDEKVAMQHETVQSRILEEREVIVDTFCSLLVRMSEIGCEPSIDTF 245 + VL KG+QKE ++ EKV Q+ V +++R +LL +S GCEP++D + Sbjct: 488 FSVLSKGLQKEFKLLTEKVVSQNRVVCGGRIDDRFANFGLMRNLLSTLSGNGCEPNVDIY 547 Query: 244 SILIDGLCREGRSHEADQLVAFMQEKGQTPNEAIYCSLLKAYCKNLRIDXXXXXXXXXNS 65 S L+ GLCREGR +EA QLVA M+EKG PN+ I SL+ A C+NL +D Sbjct: 548 SALVTGLCREGRYYEASQLVAHMKEKGLCPNKDILFSLIFAQCRNLEVDHALETFNLTLI 607 Query: 64 RGFDLPLSVYAAMISALC 11 +G++ PLS Y +I ALC Sbjct: 608 KGWEPPLSNYREVICALC 625 Score = 60.5 bits (145), Expect = 2e-07 Identities = 32/105 (30%), Positives = 52/105 (49%) Frame = -1 Query: 325 REVIVDTFCSLLVRMSEIGCEPSIDTFSILIDGLCREGRSHEADQLVAFMQEKGQTPNEA 146 +E +D SL RM + GC P I+T++ +I+GL + + E ++L++ M EKG PN Sbjct: 322 KEGKMDAAVSLFERMEQHGCCPEIETYNAIINGLSQNNQFSEVEKLISKMVEKGLRPNVI 381 Query: 145 IYCSLLKAYCKNLRIDXXXXXXXXXNSRGFDLPLSVYAAMISALC 11 Y ++ CKN D R + Y+++I LC Sbjct: 382 TYTCMIDGICKNGGTDLAFRVFLEMKERNCSPNVYTYSSLIHGLC 426 >ref|XP_007049304.1| Pentatricopeptide repeat-containing protein, putative isoform 1 [Theobroma cacao] gi|508701565|gb|EOX93461.1| Pentatricopeptide repeat-containing protein, putative isoform 1 [Theobroma cacao] Length = 909 Score = 110 bits (274), Expect = 3e-22 Identities = 59/138 (42%), Positives = 83/138 (60%) Frame = -1 Query: 424 YCVLLKGIQKESLIVDEKVAMQHETVQSRILEEREVIVDTFCSLLVRMSEIGCEPSIDTF 245 + VL KG+QKE ++ EKV Q+ V +++R +LL +S GCEP++D + Sbjct: 681 FSVLSKGLQKEFKLLTEKVVSQNRVVCGGRIDDRFANFGLMRNLLSTLSGNGCEPNVDIY 740 Query: 244 SILIDGLCREGRSHEADQLVAFMQEKGQTPNEAIYCSLLKAYCKNLRIDXXXXXXXXXNS 65 S L+ GLCREGR +EA QLVA M+EKG PN+ I SL+ A C+NL +D Sbjct: 741 SALVTGLCREGRYYEASQLVAHMKEKGLCPNKDILFSLIFAQCRNLEVDHALETFNLTLI 800 Query: 64 RGFDLPLSVYAAMISALC 11 +G++ PLS Y +I ALC Sbjct: 801 KGWEPPLSNYREVICALC 818 Score = 60.5 bits (145), Expect = 2e-07 Identities = 32/105 (30%), Positives = 52/105 (49%) Frame = -1 Query: 325 REVIVDTFCSLLVRMSEIGCEPSIDTFSILIDGLCREGRSHEADQLVAFMQEKGQTPNEA 146 +E +D SL RM + GC P I+T++ +I+GL + + E ++L++ M EKG PN 
Sbjct: 515 KEGKMDAAVSLFERMEQHGCCPEIETYNAIINGLSQNNQFSEVEKLISKMVEKGLRPNVI 574 Query: 145 IYCSLLKAYCKNLRIDXXXXXXXXXNSRGFDLPLSVYAAMISALC 11 Y ++ CKN D R + Y+++I LC Sbjct: 575 TYTCMIDGICKNGGTDLAFRVFLEMKERNCSPNVYTYSSLIHGLC 619 >ref|XP_003633341.1| PREDICTED: pentatricopeptide repeat-containing protein At5g65560-like [Vitis vinifera] Length = 822 Score = 105 bits (262), Expect = 6e-21 Identities = 50/92 (54%), Positives = 66/92 (71%) Frame = -1 Query: 286 RMSEIGCEPSIDTFSILIDGLCREGRSHEADQLVAFMQEKGQTPNEAIYCSLLKAYCKNL 107 RMSEIGCEP++DT+S L+ GLCR+GR +EA+QLV M+E+G P+ IY SLL A+CKNL Sbjct: 639 RMSEIGCEPTLDTYSTLVSGLCRKGRFYEAEQLVKDMKERGFCPDREIYYSLLIAHCKNL 698 Query: 106 RIDXXXXXXXXXNSRGFDLPLSVYAAMISALC 11 +D ++GF L LS+Y A+I ALC Sbjct: 699 EVDHALKIFHSIEAKGFQLHLSIYRALICALC 730 Score = 64.3 bits (155), Expect = 2e-08 Identities = 34/101 (33%), Positives = 54/101 (53%) Frame = -1 Query: 313 VDTFCSLLVRMSEIGCEPSIDTFSILIDGLCREGRSHEADQLVAFMQEKGQTPNEAIYCS 134 VD SLL RM E+GC P++++++ +I+GL +E R EA+++ M E+G PN Y + Sbjct: 529 VDIALSLLKRMEEMGCNPNVESYNAVINGLSKENRFSEAEKICDKMVEQGLLPNVITYTT 588 Query: 133 LLKAYCKNLRIDXXXXXXXXXNSRGFDLPLSVYAAMISALC 11 L+ C+N R R L Y+++I LC Sbjct: 589 LIDGLCRNGRTQFAFKIFHDMEKRKCLPNLYTYSSLIYGLC 629 >ref|XP_002531694.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223528670|gb|EEF30685.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 821 Score = 101 bits (251), Expect = 1e-19 Identities = 48/92 (52%), Positives = 64/92 (69%) Frame = -1 Query: 286 RMSEIGCEPSIDTFSILIDGLCREGRSHEADQLVAFMQEKGQTPNEAIYCSLLKAYCKNL 107 R++E GCEP+IDT+S L+ GLCREGRS+EA QLV M+EKG +P+ IYCSLL A+CK+L Sbjct: 638 RLTENGCEPTIDTYSTLVSGLCREGRSNEASQLVENMKEKGLSPSMEIYCSLLVAHCKSL 697 Query: 106 RIDXXXXXXXXXNSRGFDLPLSVYAAMISALC 11 ++D +GF L +Y +I ALC Sbjct: 698 KVDCALEIFNLMAVKGFQPHLFIYKVLICALC 729 Score = 61.6 bits (148), Expect = 1e-07 Identities = 42/150 (28%), Positives = 65/150 (43%), Gaps = 12/150 (8%) Frame = -1 Query: 424 
YCVLLKGIQKESLIVDEKVAMQHETVQSRILEER---EVIVDTFC---------SLLVRM 281 YC L+ G K + D + +E ++ I + ++D +C SL RM Sbjct: 480 YCELISGFCKGGKL-DSATSFFYEMLKCGISPNQWTYTAMIDGYCKEGKIDVALSLFERM 538 Query: 280 SEIGCEPSIDTFSILIDGLCREGRSHEADQLVAFMQEKGQTPNEAIYCSLLKAYCKNLRI 101 E GC SI+T++ +I GL + R EA++ A M E+G PN Y SL+ CKN Sbjct: 539 EENGCSASIETYNAIISGLSKGNRFSEAEKFCAKMTEQGLQPNTITYTSLINGLCKNTAT 598 Query: 100 DXXXXXXXXXNSRGFDLPLSVYAAMISALC 11 + + Y ++I LC Sbjct: 599 NLAFKIFHEMEKKNCLPNAHTYTSLIYGLC 628 Score = 55.8 bits (133), Expect = 6e-06 Identities = 32/91 (35%), Positives = 44/91 (48%) Frame = -1 Query: 286 RMSEIGCEPSIDTFSILIDGLCREGRSHEADQLVAFMQEKGQTPNEAIYCSLLKAYCKNL 107 RM + GC P+ T+S LI+GLC EGR EA ++ M EKG P Y + + C Sbjct: 257 RMVKDGCNPNSVTYSTLINGLCNEGRIGEAMDMLEEMTEKGIEPTVYTYTVPISSLCDIG 316 Query: 106 RIDXXXXXXXXXNSRGFDLPLSVYAAMISAL 14 R+D +G + Y A+IS L Sbjct: 317 RVDDAINLVRSMGKKGCSPSVQTYTAIISGL 347 >ref|XP_006856878.1| hypothetical protein AMTR_s00055p00197790 [Amborella trichopoda] gi|548860812|gb|ERN18345.1| hypothetical protein AMTR_s00055p00197790 [Amborella trichopoda] Length = 940 Score = 100 bits (250), Expect = 2e-19 Identities = 51/138 (36%), Positives = 82/138 (59%) Frame = -1 Query: 424 YCVLLKGIQKESLIVDEKVAMQHETVQSRILEEREVIVDTFCSLLVRMSEIGCEPSIDTF 245 Y VL+KG+QKE ++ + A+Q + D SLL R+S+ E ++DT+ Sbjct: 722 YGVLIKGLQKEKQLMGSEKAIQRSNI------------DLIFSLLERLSQNNIEHTVDTY 769 Query: 244 SILIDGLCREGRSHEADQLVAFMQEKGQTPNEAIYCSLLKAYCKNLRIDXXXXXXXXXNS 65 +L+ GLCREG+ +EADQ++ M+E G NEA+Y SL+ AYCK +R++ + Sbjct: 770 GVLVCGLCREGKLYEADQVLGRMRENGFFLNEAMYASLIDAYCKEMRVESGLEMFHEMIT 829 Query: 64 RGFDLPLSVYAAMISALC 11 GF+ L++Y A++ +LC Sbjct: 830 NGFEPSLAIYKALLFSLC 847 Score = 55.8 bits (133), Expect = 6e-06 Identities = 36/121 (29%), Positives = 58/121 (47%), Gaps = 12/121 (9%) Frame = -1 Query: 424 YCVLLKGIQKESLIVDEKVAMQHETVQ---SRILEEREVIVDTFCSL---------LVRM 281 Y L+ G+ KE I DE + M ++ V+ + V++ T CSL + M Sbjct: 302 YSTLINGLCKEGRI-DEALVMLNQMVERDCQPTVYTYTVLLTTLCSLGRVKEAFDLVEDM 360 Query: 280 
SEIGCEPSIDTFSILIDGLCREGRSHEADQLVAFMQEKGQTPNEAIYCSLLKAYCKNLRI 101 GC P++ T++ LI GLCR + +A L+ M G PN + +L+ +C R+ Sbjct: 361 KNRGCPPNVQTYTTLISGLCRCKKLEDACDLLKEMISNGLVPNTVTFNALINGFCSEGRV 420 Query: 100 D 98 D Sbjct: 421 D 421 >ref|XP_004295353.1| PREDICTED: pentatricopeptide repeat-containing protein At5g65560-like [Fragaria vesca subsp. vesca] Length = 927 Score = 100 bits (249), Expect = 2e-19 Identities = 55/142 (38%), Positives = 84/142 (59%), Gaps = 4/142 (2%) Frame = -1 Query: 424 YCVLLKGIQKESLIVDEKV---AMQHETVQSRILEEREVIVDTFCSLLVRMSEIGCEPSI 254 + VL+KG+++ES ++ EKV A QHE S +R ++ C+LL ++SE GCEP+ Sbjct: 695 FTVLVKGLKRESQLLTEKVVGLATQHEVQCSSSSNKRCNDLEILCNLLDKISENGCEPTT 754 Query: 253 DTFSILIDGLCREGRSHEADQLVAFMQEKGQTPNEAIYCSLLKAYCKNLRIDXXXXXXXX 74 +T+ L+ GLC + + E DQLV M+EKG P+E Y + A CKNL++D Sbjct: 755 ETYHSLVRGLCEDRKYEEVDQLVEHMKEKGLYPSEEFYRPMFFANCKNLKLDSALEMLSG 814 Query: 73 XNS-RGFDLPLSVYAAMISALC 11 + RG ++ S+Y A+I A C Sbjct: 815 LMADRGLEVDFSIYTALICAFC 836 >ref|XP_004163208.1| PREDICTED: pentatricopeptide repeat-containing protein At5g65560-like [Cucumis sativus] Length = 830 Score = 97.4 bits (241), Expect = 2e-18 Identities = 47/96 (48%), Positives = 65/96 (67%) Frame = -1 Query: 298 SLLVRMSEIGCEPSIDTFSILIDGLCREGRSHEADQLVAFMQEKGQTPNEAIYCSLLKAY 119 +LL R++ GCEP++DT++ L+ GLC EGR +EADQLV MQ+KG P+E IY +LL Sbjct: 645 NLLARLTHYGCEPNVDTYTTLVKGLCGEGRCYEADQLVVSMQKKGLQPSEEIYRALLIGE 704 Query: 118 CKNLRIDXXXXXXXXXNSRGFDLPLSVYAAMISALC 11 CKNL+++ ++ GF L LS Y A+I ALC Sbjct: 705 CKNLKVESALNIFYSMDTLGFQLHLSDYKALICALC 740 >ref|XP_004149415.1| PREDICTED: pentatricopeptide repeat-containing protein At5g65560-like [Cucumis sativus] Length = 830 Score = 97.4 bits (241), Expect = 2e-18 Identities = 47/96 (48%), Positives = 65/96 (67%) Frame = -1 Query: 298 SLLVRMSEIGCEPSIDTFSILIDGLCREGRSHEADQLVAFMQEKGQTPNEAIYCSLLKAY 119 +LL R++ GCEP++DT++ L+ GLC EGR +EADQLV MQ+KG P+E IY +LL Sbjct: 645 NLLARLTHYGCEPNVDTYTTLVKGLCGEGRCYEADQLVVSMQKKGLQPSEEIYRALLIGE 704 Query: 118 
CKNLRIDXXXXXXXXXNSRGFDLPLSVYAAMISALC 11 CKNL+++ ++ GF L LS Y A+I ALC Sbjct: 705 CKNLKVESALNIFYSMDTLGFQLHLSDYKALICALC 740 >ref|XP_004243553.1| PREDICTED: pentatricopeptide repeat-containing protein At5g65560-like [Solanum lycopersicum] Length = 815 Score = 95.5 bits (236), Expect = 7e-18 Identities = 46/94 (48%), Positives = 62/94 (65%) Frame = -1 Query: 286 RMSEIGCEPSIDTFSILIDGLCREGRSHEADQLVAFMQEKGQTPNEAIYCSLLKAYCKNL 107 RMSE+G EP+ + LI GL REG+++EADQL+ M+EKG +P A YCSLL +YC NL Sbjct: 650 RMSEVGFEPNEGAYCTLILGLYREGKTYEADQLIEHMREKGFSPTSAAYCSLLVSYCNNL 709 Query: 106 RIDXXXXXXXXXNSRGFDLPLSVYAAMISALCIS 5 ++D +GF PLS+Y ++I ALC S Sbjct: 710 KVDAALEIFDSLIQQGFQPPLSIYQSLICALCRS 743 Score = 55.8 bits (133), Expect = 6e-06 Identities = 32/106 (30%), Positives = 50/106 (47%) Frame = -1 Query: 325 REVIVDTFCSLLVRMSEIGCEPSIDTFSILIDGLCREGRSHEADQLVAFMQEKGQTPNEA 146 +E VD +LL RM E GC P I+T++ +I+GL ++ R E +L + E PN Sbjct: 536 KEEKVDDALALLKRMEESGCSPGIETYNAIINGLSKKNRLLEVKRLCNKLAESELLPNVI 595 Query: 145 IYCSLLKAYCKNLRIDXXXXXXXXXNSRGFDLPLSVYAAMISALCI 8 Y +L+ C+N R L Y+++I LC+ Sbjct: 596 TYSTLINGLCRNGETHVAFEILHDMERRNCMPNLYTYSSLIYGLCL 641 >ref|XP_004139858.1| PREDICTED: pentatricopeptide repeat-containing protein At5g65560-like [Cucumis sativus] gi|449530677|ref|XP_004172320.1| PREDICTED: pentatricopeptide repeat-containing protein At5g65560-like [Cucumis sativus] Length = 839 Score = 86.3 bits (212), Expect = 4e-15 Identities = 42/92 (45%), Positives = 59/92 (64%) Frame = -1 Query: 286 RMSEIGCEPSIDTFSILIDGLCREGRSHEADQLVAFMQEKGQTPNEAIYCSLLKAYCKNL 107 R+ + GCEP++DT++ L+ GLC +GR +EADQLV M++KG P+E IY +LL CKNL Sbjct: 643 RLLDDGCEPNVDTYTTLVRGLCGKGRCYEADQLVESMKKKGLQPSEEIYRALLVGQCKNL 702 Query: 106 RIDXXXXXXXXXNSRGFDLPLSVYAAMISALC 11 ++ + GF LS Y A+I ALC Sbjct: 703 EVESALKIFDSMVTTGFQPCLSDYKALICALC 734 >ref|XP_006838717.1| hypothetical protein AMTR_s00002p00251730 [Amborella trichopoda] gi|548841223|gb|ERN01286.1| hypothetical protein AMTR_s00002p00251730 [Amborella trichopoda] Length = 904 Score = 
71.2 bits (173), Expect = 1e-10 Identities = 35/100 (35%), Positives = 53/100 (53%) Frame = -1 Query: 310 DTFCSLLVRMSEIGCEPSIDTFSILIDGLCREGRSHEADQLVAFMQEKGQTPNEAIYCSL 131 D LL MSE GC+P++ T+++LID LC++ + EAD+L+ M E+G P+ Y +L Sbjct: 319 DKAFGLLEEMSEKGCKPNVHTYTVLIDSLCKDNKLEEADRLMHEMTERGLAPSVVTYNAL 378 Query: 130 LKAYCKNLRIDXXXXXXXXXNSRGFDLPLSVYAAMISALC 11 + YCK ++D S G Y +I LC Sbjct: 379 IDGYCKEGKVDSAFGILEVMESSGVKPNARTYNELICGLC 418 Score = 58.5 bits (140), Expect = 9e-07 Identities = 37/111 (33%), Positives = 57/111 (51%), Gaps = 7/111 (6%) Frame = -1 Query: 424 YCVLLKGIQKE-------SLIVDEKVAMQHETVQSRILEEREVIVDTFCSLLVRMSEIGC 266 Y VL++ I +E S + + + H +V + I DT L+ RM +G Sbjct: 690 YTVLIRHIVQENHSTKELSFQIIDGLVEDHSSVTPSHFWMKVKIEDTL-KLMERMWGLGF 748 Query: 265 EPSIDTFSILIDGLCREGRSHEADQLVAFMQEKGQTPNEAIYCSLLKAYCK 113 +P+I T+ I G C GR EA++LV ++E G +PNE I+ SL+ CK Sbjct: 749 DPNIQTYGAFIAGFCNVGRLEEAEELVNLVRENGFSPNEDIFTSLIDCSCK 799 Score = 55.8 bits (133), Expect = 6e-06 Identities = 31/80 (38%), Positives = 40/80 (50%) Frame = -1 Query: 250 TFSILIDGLCREGRSHEADQLVAFMQEKGQTPNEAIYCSLLKAYCKNLRIDXXXXXXXXX 71 T+S LID LC++GR EA L+ + EKG NE IY SL+ YCK +ID Sbjct: 479 TYSPLIDALCKDGRIDEASALINSLPEKGIQANEVIYTSLIDGYCKLGKIDDARSLLDKM 538 Query: 70 NSRGFDLPLSVYAAMISALC 11 G Y ++I LC Sbjct: 539 IEHGCFPNSYTYNSVIDGLC 558 Score = 55.5 bits (132), Expect = 8e-06 Identities = 41/150 (27%), Positives = 67/150 (44%), Gaps = 12/150 (8%) Frame = -1 Query: 424 YCVLLKGIQKESLIVDEKVAMQHETVQSRILEE------------REVIVDTFCSLLVRM 281 Y VL+ + K++ + +E + HE + + +E VD+ +L M Sbjct: 340 YTVLIDSLCKDNKL-EEADRLMHEMTERGLAPSVVTYNALIDGYCKEGKVDSAFGILEVM 398 Query: 280 SEIGCEPSIDTFSILIDGLCREGRSHEADQLVAFMQEKGQTPNEAIYCSLLKAYCKNLRI 101 G +P+ T++ LI GLC+E + H+A L++ E G TP+ Y SL+ CK + Sbjct: 399 ESSGVKPNARTYNELICGLCKENKVHKAMGLLSKTLESGLTPSIVTYNSLIYGQCKAGHM 458 Query: 100 DXXXXXXXXXNSRGFDLPLSVYAAMISALC 11 D GF Y+ +I ALC Sbjct: 459 DSAFRLLDLMAGGGFTGDHWTYSPLIDALC 488 >ref|XP_002986176.1| hypothetical protein SELMODRAFT_182249 [Selaginella 
moellendorffii] gi|300146035|gb|EFJ12707.1| hypothetical protein SELMODRAFT_182249 [Selaginella moellendorffii] Length = 609 Score = 70.5 bits (171), Expect = 2e-10 Identities = 41/152 (26%), Positives = 70/152 (46%), Gaps = 12/152 (7%) Frame = -1 Query: 424 YCVLLKGIQKESLIVDEKVAMQHETVQSRILEE---REVIVDTFCS---------LLVRM 281 Y ++ G+ K I + +V ++ +L + +++ C LL RM Sbjct: 392 YNTVIDGLCKLGKIAEAQVILEQMQESGDVLPDVVTYSTVINGLCKSDMLVEAQKLLDRM 451 Query: 280 SEIGCEPSIDTFSILIDGLCREGRSHEADQLVAFMQEKGQTPNEAIYCSLLKAYCKNLRI 101 + GC P + T++ +IDGLC+ GR EA+ L+ M+ G PN Y +L+ CK ++ Sbjct: 452 CKAGCNPDVVTYTTIIDGLCKCGRLEEAEYLLQGMKRAGCAPNVVTYTTLISGLCKARKV 511 Query: 100 DXXXXXXXXXNSRGFDLPLSVYAAMISALCIS 5 D + G L Y M++ LC+S Sbjct: 512 DEAERVMEEMRNAGCPPNLVTYNTMVNGLCVS 543 Score = 60.5 bits (145), Expect = 2e-07 Identities = 34/112 (30%), Positives = 52/112 (46%), Gaps = 9/112 (8%) Frame = -1 Query: 319 VIVDTFCSLLV---------RMSEIGCEPSIDTFSILIDGLCREGRSHEADQLVAFMQEK 167 V+VD C L + +M E G P++ TF+ L+DG C+ G +A +L+ M K Sbjct: 219 VLVDALCKLSMVGAAQDVVKKMIEGGFAPNVMTFNSLVDGFCKRGNVDDARKLLGIMVAK 278 Query: 166 GQTPNEAIYCSLLKAYCKNLRIDXXXXXXXXXNSRGFDLPLSVYAAMISALC 11 G PN Y +L+ CK+ + +RG Y+A+I LC Sbjct: 279 GMRPNVVTYSALIDGLCKSQKFLEAKEVLEEMKTRGVTPDAFTYSALIHGLC 330 Score = 60.1 bits (144), Expect = 3e-07 Identities = 32/98 (32%), Positives = 49/98 (50%) Frame = -1 Query: 295 LLVRMSEIGCEPSIDTFSILIDGLCREGRSHEADQLVAFMQEKGQTPNEAIYCSLLKAYC 116 LL M E GC P++ T+++L+D LC+ A +V M E G PN + SL+ +C Sbjct: 201 LLEEMRERGCPPNLVTYNVLVDALCKLSMVGAAQDVVKKMIEGGFAPNVMTFNSLVDGFC 260 Query: 115 KNLRIDXXXXXXXXXNSRGFDLPLSVYAAMISALCISQ 2 K +D ++G + Y+A+I LC SQ Sbjct: 261 KRGNVDDARKLLGIMVAKGMRPNVVTYSALIDGLCKSQ 298 Score = 55.8 bits (133), Expect = 6e-06 Identities = 35/113 (30%), Positives = 55/113 (48%), Gaps = 9/113 (7%) Frame = -1 Query: 316 IVDTFCS---------LLVRMSEIGCEPSIDTFSILIDGLCREGRSHEADQLVAFMQEKG 164 +VD FC LL M G P++ T+S LIDGLC+ + EA +++ M+ +G Sbjct: 255 LVDGFCKRGNVDDARKLLGIMVAKGMRPNVVTYSALIDGLCKSQKFLEAKEVLEEMKTRG 314 Query: 163 
QTPNEAIYCSLLKAYCKNLRIDXXXXXXXXXNSRGFDLPLSVYAAMISALCIS 5 TP+ Y +L+ CK +I+ G + VY+++I A C S Sbjct: 315 VTPDAFTYSALIHGLCKADKIEEAEQMLRRMAGSGCTPDVVVYSSIIHAFCKS 367 >ref|XP_006828862.1| hypothetical protein AMTR_s00001p00165480 [Amborella trichopoda] gi|548833841|gb|ERM96278.1| hypothetical protein AMTR_s00001p00165480 [Amborella trichopoda] Length = 903 Score = 70.1 bits (170), Expect = 3e-10 Identities = 35/104 (33%), Positives = 54/104 (51%) Frame = -1 Query: 325 REVIVDTFCSLLVRMSEIGCEPSIDTFSILIDGLCREGRSHEADQLVAFMQEKGQTPNEA 146 R++ +D L+ ++ GCEP+ D + +LI G C+EGR EAD+L+ M +G P +A Sbjct: 738 RKMEIDVAFRLIEKLCGKGCEPTSDLYRVLITGFCKEGRIVEADRLIKDMVGRGIVPQKA 797 Query: 145 IYCSLLKAYCKNLRIDXXXXXXXXXNSRGFDLPLSVYAAMISAL 14 IY SL++ YC + GF S Y ++S L Sbjct: 798 IYASLIEGYCAESNVQSCYGLLNEMLDSGFLPSYSTYCMVVSGL 841 Score = 60.8 bits (146), Expect = 2e-07 Identities = 32/95 (33%), Positives = 46/95 (48%) Frame = -1 Query: 295 LLVRMSEIGCEPSIDTFSILIDGLCREGRSHEADQLVAFMQEKGQTPNEAIYCSLLKAYC 116 L M + GC P++ T+++LIDG CRE + EA+ L+ M EKG P+ Y +L+ YC Sbjct: 329 LFNEMVKKGCVPNVHTYTVLIDGWCRENKLEEANGLLKMMLEKGVLPSTITYNALINGYC 388 Query: 115 KNLRIDXXXXXXXXXNSRGFDLPLSVYAAMISALC 11 K R+ R Y +I LC Sbjct: 389 KEGRVFSAFELLSTMEKRKCKPNTRTYNELIDGLC 423 Score = 57.4 bits (137), Expect = 2e-06 Identities = 29/111 (26%), Positives = 55/111 (49%), Gaps = 9/111 (8%) Frame = -1 Query: 319 VIVDTFCS---------LLVRMSEIGCEPSIDTFSILIDGLCREGRSHEADQLVAFMQEK 167 ++++ FC +L RM + C P++ T+++++DGLC+ GR EA+ L+ + E Sbjct: 592 ILINGFCQAGKFSLSFEVLDRMIQENCFPNVYTYTVMVDGLCKRGRFEEAEMLLYNLFEM 651 Query: 166 GQTPNEAIYCSLLKAYCKNLRIDXXXXXXXXXNSRGFDLPLSVYAAMISAL 14 G PN Y L+K Y +++ +G + +Y A++ L Sbjct: 652 GICPNHVTYTVLVKGYVNAGKVERAFDLMTIMVKQGCEPNYRIYHALLKGL 702 Score = 55.8 bits (133), Expect = 6e-06 Identities = 34/101 (33%), Positives = 46/101 (45%) Frame = -1 Query: 313 VDTFCSLLVRMSEIGCEPSIDTFSILIDGLCREGRSHEADQLVAFMQEKGQTPNEAIYCS 134 +D +L M GC P+ T+SILI GLC +GR EA QL M EKG P+ Y Sbjct: 253 
LDNAFNLFEMMYGDGCIPNAVTYSILISGLCEQGRMEEAFQLFGEMTEKGCEPSVYTYTV 312 Query: 133 LLKAYCKNLRIDXXXXXXXXXNSRGFDLPLSVYAAMISALC 11 L+KA C ++ +G + Y +I C Sbjct: 313 LIKALCGIRQVYKSIDLFNEMVKKGCVPNVHTYTVLIDGWC 353 >ref|XP_006391605.1| hypothetical protein EUTSA_v10023944mg [Eutrema salsugineum] gi|557088111|gb|ESQ28891.1| hypothetical protein EUTSA_v10023944mg [Eutrema salsugineum] Length = 533 Score = 69.7 bits (169), Expect = 4e-10 Identities = 39/114 (34%), Positives = 56/114 (49%), Gaps = 9/114 (7%) Frame = -1 Query: 319 VIVDTFC---------SLLVRMSEIGCEPSIDTFSILIDGLCREGRSHEADQLVAFMQEK 167 +++D FC S+L +M ++G EPSI T L++G CR R H+A LV M + Sbjct: 118 ILIDCFCRSSRLSLALSVLGKMMKLGFEPSIVTLGSLLNGFCRRNRFHDAVPLVDTMAKS 177 Query: 166 GQTPNEAIYCSLLKAYCKNLRIDXXXXXXXXXNSRGFDLPLSVYAAMISALCIS 5 G PN IY +++ CKN D +G + Y +ISALC S Sbjct: 178 GYEPNVVIYNTVINGLCKNRDADNALELFNLMEKKGIRADVVTYNTLISALCNS 231