BLASTX nr result
ID: Mentha23_contig00041817
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Mentha23_contig00041817 (551 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EYU32269.1| hypothetical protein MIMGU_mgv1a026672mg [Mimulus... 295 5e-78 ref|XP_006339746.1| PREDICTED: pentatricopeptide repeat-containi... 252 4e-65 ref|XP_004230005.1| PREDICTED: pentatricopeptide repeat-containi... 244 1e-62 ref|XP_003633947.1| PREDICTED: pentatricopeptide repeat-containi... 243 2e-62 emb|CBI40590.3| unnamed protein product [Vitis vinifera] 243 2e-62 ref|XP_002523296.1| pentatricopeptide repeat-containing protein,... 242 5e-62 gb|EPS66384.1| hypothetical protein M569_08386 [Genlisea aurea] 236 2e-60 ref|XP_002301427.2| hypothetical protein POPTR_0002s17640g [Popu... 231 7e-59 gb|EXB83859.1| hypothetical protein L484_023466 [Morus notabilis] 230 2e-58 ref|XP_004510637.1| PREDICTED: pentatricopeptide repeat-containi... 229 3e-58 ref|XP_007051479.1| Pentatricopeptide repeat (PPR) superfamily p... 228 6e-58 ref|XP_004155716.1| PREDICTED: pentatricopeptide repeat-containi... 228 8e-58 ref|XP_004142577.1| PREDICTED: pentatricopeptide repeat-containi... 228 8e-58 ref|XP_003627527.1| Pentatricopeptide repeat-containing protein ... 224 1e-56 ref|XP_003548250.1| PREDICTED: pentatricopeptide repeat-containi... 223 2e-56 emb|CAN66974.1| hypothetical protein VITISV_022076 [Vitis vinifera] 223 3e-56 ref|XP_007135284.1| hypothetical protein PHAVU_010G116300g [Phas... 220 2e-55 ref|XP_006289662.1| hypothetical protein CARUB_v10003220mg [Caps... 199 5e-49 ref|NP_196468.1| pentatricopeptide repeat-containing protein [Ar... 198 8e-49 dbj|BAE98759.1| hypothetical protein [Arabidopsis thaliana] 198 8e-49 >gb|EYU32269.1| hypothetical protein MIMGU_mgv1a026672mg [Mimulus guttatus] Length = 516 Score = 295 bits (755), Expect = 5e-78 Identities = 139/183 (75%), Positives = 162/183 (88%) Frame = -1 Query: 551 CISLYPQLLSRRLLPGAHSFTLLFVACAKLSSPSHGQMLHAHFLKFGHHHDVYALTSLVD 372 C SLY Q+L P + FT LF ACAKLS+PS GQMLHAHF+KFG +DVYALT+LVD Sbjct: 66 CFSLYSQILHLSFSPNPNCFTFLFSACAKLSNPSQGQMLHAHFIKFGLDYDVYALTALVD 125 Query: 371 MYAKMGLLRFARRVFDEMDDKEVPTWNSLISGYARKGDMEEALRCFLDMPSRNVISWTAL 192 MYAKMGLLRF+R++FDEM+DK+ PTWNSLI+GYAR GDM EALR F +MPSRNVISWTA+ Sbjct: 126 MYAKMGLLRFSRKIFDEMNDKDAPTWNSLIAGYARNGDMSEALRLFSNMPSRNVISWTAI 185 Query: 191 ISGYSQNGRYREALEMYLEMEREGEVEPNHVTIASVLPACANIGALEVGQRIEAYARANG 12 ISG+SQNG+Y+EALEMYL MER+G+V+PNHVT+ASVLPACAN+GALEVGQRIEAYARANG Sbjct: 186 ISGFSQNGKYKEALEMYLAMERDGKVKPNHVTLASVLPACANLGALEVGQRIEAYARANG 245 Query: 11 YFR 3 YF+ Sbjct: 246 YFK 248 Score = 60.1 bits (144), Expect = 4e-07 Identities = 37/167 (22%), Positives = 79/167 (47%), Gaps = 6/167 (3%) Frame = -1 Query: 518 RLLPGAHSFTLLFVACAKLSSPSHGQMLHAHFLKFGHHHDVYALTSLVDMYAKMGLLRFA 339 ++ P + + ACA L + GQ + A+ G+ + + +++++YA+ G++ A Sbjct: 210 KVKPNHVTLASVLPACANLGALEVGQRIEAYARANGYFKNAFVCNAVLELYARCGVIEKA 269 Query: 338 RRVFDEM--DDKEVPTWNSLISGYARKGDMEEALRCFLDMPSRNV----ISWTALISGYS 177 +VFDE+ ++ + +WN+LI G A G + AL F M ++ V +++ I + Sbjct: 270 MQVFDEIGSGNRNLCSWNTLIMGLAVHGRCDGALEIFNQMLTKGVTPDDVTFVGAILACT 329 Query: 176 QNGRYREALEMYLEMEREGEVEPNHVTIASVLPACANIGALEVGQRI 36 G + E++ ME+ + P ++ G L+ ++ Sbjct: 330 HGGMVNKGREIFDSMEKRFSITPKIEHYGCMVDLLGRAGLLQEAYKL 376 >ref|XP_006339746.1| PREDICTED: pentatricopeptide repeat-containing protein At5g08510-like [Solanum tuberosum] Length = 508 Score = 252 bits (644), Expect = 4e-65 Identities = 120/183 (65%), Positives = 144/183 (78%) Frame = -1 Query: 551 CISLYPQLLSRRLLPGAHSFTLLFVACAKLSSPSHGQMLHAHFLKFGHHHDVYALTSLVD 372 C SLY Q+ + P HSFT LF AC SSP GQM H HF+K+G D+Y LT+LVD Sbjct: 66 CFSLYIQMRRQGCSPNPHSFTFLFAACTNSSSPIQGQMFHVHFIKWGFEFDIYTLTALVD 125 Query: 371 MYAKMGLLRFARRVFDEMDDKEVPTWNSLISGYARKGDMEEALRCFLDMPSRNVISWTAL 192 MYAKM LL AR++FDEM+ K+VPTWNSLI+GYA+ G++EEA + F MPSRNVISWTA+ Sbjct: 126 MYAKMSLLPSARKLFDEMEMKDVPTWNSLIAGYAKNGNVEEAFKLFSVMPSRNVISWTAM 185 Query: 191 ISGYSQNGRYREALEMYLEMEREGEVEPNHVTIASVLPACANIGALEVGQRIEAYARANG 12 ISGYSQNG+Y AL +Y EME++ V+PN VTIASVLPACAN+GALEVG+ IEAYARANG Sbjct: 186 ISGYSQNGKYANALAVYKEMEKDRRVKPNEVTIASVLPACANLGALEVGENIEAYARANG 245 Query: 11 YFR 3 YF+ Sbjct: 246 YFK 248 >ref|XP_004230005.1| PREDICTED: pentatricopeptide repeat-containing protein At5g08510-like [Solanum lycopersicum] Length = 508 Score = 244 bits (623), Expect = 1e-62 Identities = 115/183 (62%), Positives = 144/183 (78%) Frame = -1 Query: 551 CISLYPQLLSRRLLPGAHSFTLLFVACAKLSSPSHGQMLHAHFLKFGHHHDVYALTSLVD 372 C SLY ++ + P HSFT LF AC+ S+P GQM H HF+K+G D+Y LT+LVD Sbjct: 66 CFSLYIKMRRQGCSPNPHSFTFLFAACSNRSTPIQGQMFHVHFIKWGFEFDIYTLTALVD 125 Query: 371 MYAKMGLLRFARRVFDEMDDKEVPTWNSLISGYARKGDMEEALRCFLDMPSRNVISWTAL 192 MYAKM LL AR++FDEM+ K+VP WNSLI+GYA+ G++ EA + F MPSRNVISWTA+ Sbjct: 126 MYAKMSLLPSARKLFDEMEMKDVPIWNSLIAGYAKNGNVVEAFKLFSVMPSRNVISWTAM 185 Query: 191 ISGYSQNGRYREALEMYLEMEREGEVEPNHVTIASVLPACANIGALEVGQRIEAYARANG 12 ISGYSQNG+Y AL +Y +ME++ +V+PN VTIASVLPACAN+GALEVG+ IEAYARANG Sbjct: 186 ISGYSQNGKYANALAVYKQMEKDRKVKPNEVTIASVLPACANLGALEVGENIEAYARANG 245 Query: 11 YFR 3 YF+ Sbjct: 246 YFK 248 >ref|XP_003633947.1| PREDICTED: pentatricopeptide repeat-containing protein At5g08510-like [Vitis vinifera] Length = 512 Score = 243 bits (621), Expect = 2e-62 Identities = 118/183 (64%), Positives = 142/183 (77%) Frame = -1 Query: 551 CISLYPQLLSRRLLPGAHSFTLLFVACAKLSSPSHGQMLHAHFLKFGHHHDVYALTSLVD 372 C SLY Q+ + P HSFT LF ACA LSS G+MLH HF+K G DV+ALT+LVD Sbjct: 66 CFSLYTQMCLQGCSPNEHSFTFLFSACASLSSHQQGRMLHTHFVKSGFGCDVFALTALVD 125 Query: 371 MYAKMGLLRFARRVFDEMDDKEVPTWNSLISGYARKGDMEEALRCFLDMPSRNVISWTAL 192 MYAK+GLL AR+ FDEM ++VPTWNS+I+GYAR GD+E AL F MP+RNV SWTA+ Sbjct: 126 MYAKLGLLSLARKQFDEMTVRDVPTWNSMIAGYARCGDLEGALELFRLMPARNVTSWTAM 185 Query: 191 ISGYSQNGRYREALEMYLEMEREGEVEPNHVTIASVLPACANIGALEVGQRIEAYARANG 12 ISGY+QNG+Y +AL M+L ME E E+ PN VT+ASVLPACAN+GALEVG+RIE YAR NG Sbjct: 186 ISGYAQNGQYAKALSMFLMMEEETEMRPNEVTLASVLPACANLGALEVGERIEVYARGNG 245 Query: 11 YFR 3 YF+ Sbjct: 246 YFK 248 Score = 64.3 bits (155), Expect = 2e-08 Identities = 43/156 (27%), Positives = 72/156 (46%) Frame = -1 Query: 509 PGAHSFTLLFVACAKLSSPSHGQMLHAHFLKFGHHHDVYALTSLVDMYAKMGLLRFARRV 330 P + + ACA L + G+ + + G+ ++Y +L++MYA+ G + A V Sbjct: 213 PNEVTLASVLPACANLGALEVGERIEVYARGNGYFKNLYVSNALLEMYARCGRIDKAWGV 272 Query: 329 FDEMDDKEVPTWNSLISGYARKGDMEEALRCFLDMPSRNVISWTALISGYSQNGRYREAL 150 F+E+D + RN+ SW ++I G + +GR EA+ Sbjct: 273 FEEIDGR------------------------------RNLCSWNSMIMGLAVHGRCDEAI 302 Query: 149 EMYLEMEREGEVEPNHVTIASVLPACANIGALEVGQ 42 E++ +M REG P+ VT VL AC + G + GQ Sbjct: 303 ELFYKMLREG-AAPDDVTFVGVLLACTHGGMVVEGQ 337 >emb|CBI40590.3| unnamed protein product [Vitis vinifera] Length = 495 Score = 243 bits (621), Expect = 2e-62 Identities = 118/183 (64%), Positives = 142/183 (77%) Frame = -1 Query: 551 CISLYPQLLSRRLLPGAHSFTLLFVACAKLSSPSHGQMLHAHFLKFGHHHDVYALTSLVD 372 C SLY Q+ + P HSFT LF ACA LSS G+MLH HF+K G DV+ALT+LVD Sbjct: 66 CFSLYTQMCLQGCSPNEHSFTFLFSACASLSSHQQGRMLHTHFVKSGFGCDVFALTALVD 125 Query: 371 MYAKMGLLRFARRVFDEMDDKEVPTWNSLISGYARKGDMEEALRCFLDMPSRNVISWTAL 192 MYAK+GLL AR+ FDEM ++VPTWNS+I+GYAR GD+E AL F MP+RNV SWTA+ Sbjct: 126 MYAKLGLLSLARKQFDEMTVRDVPTWNSMIAGYARCGDLEGALELFRLMPARNVTSWTAM 185 Query: 191 ISGYSQNGRYREALEMYLEMEREGEVEPNHVTIASVLPACANIGALEVGQRIEAYARANG 12 ISGY+QNG+Y +AL M+L ME E E+ PN VT+ASVLPACAN+GALEVG+RIE YAR NG Sbjct: 186 ISGYAQNGQYAKALSMFLMMEEETEMRPNEVTLASVLPACANLGALEVGERIEVYARGNG 245 Query: 11 YFR 3 YF+ Sbjct: 246 YFK 248 Score = 64.3 bits (155), Expect = 2e-08 Identities = 43/156 (27%), Positives = 72/156 (46%) Frame = -1 Query: 509 PGAHSFTLLFVACAKLSSPSHGQMLHAHFLKFGHHHDVYALTSLVDMYAKMGLLRFARRV 330 P + + ACA L + G+ + + G+ ++Y +L++MYA+ G + A V Sbjct: 213 PNEVTLASVLPACANLGALEVGERIEVYARGNGYFKNLYVSNALLEMYARCGRIDKAWGV 272 Query: 329 FDEMDDKEVPTWNSLISGYARKGDMEEALRCFLDMPSRNVISWTALISGYSQNGRYREAL 150 F+E+D + RN+ SW ++I G + +GR EA+ Sbjct: 273 FEEIDGR------------------------------RNLCSWNSMIMGLAVHGRCDEAI 302 Query: 149 EMYLEMEREGEVEPNHVTIASVLPACANIGALEVGQ 42 E++ +M REG P+ VT VL AC + G + GQ Sbjct: 303 ELFYKMLREG-AAPDDVTFVGVLLACTHGGMVVEGQ 337 >ref|XP_002523296.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223537384|gb|EEF39012.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 353 Score = 242 bits (617), Expect = 5e-62 Identities = 114/183 (62%), Positives = 139/183 (75%) Frame = -1 Query: 551 CISLYPQLLSRRLLPGAHSFTLLFVACAKLSSPSHGQMLHAHFLKFGHHHDVYALTSLVD 372 C S+Y Q+ SR H+FT LF ACA SP H QMLH HF K G DV ALT+LVD Sbjct: 66 CFSIYSQMRSRNCTGNQHTFTFLFAACASFFSPLHAQMLHTHFKKSGFESDVIALTALVD 125 Query: 371 MYAKMGLLRFARRVFDEMDDKEVPTWNSLISGYARKGDMEEALRCFLDMPSRNVISWTAL 192 MY K+G++ FA RVFDE+ +++PTWN+LI+GY+R GDME AL+ F MP RNV+SWTA+ Sbjct: 126 MYCKLGMVAFAHRVFDEIPVRDIPTWNALIAGYSRCGDMEGALKIFKLMPDRNVVSWTAM 185 Query: 191 ISGYSQNGRYREALEMYLEMEREGEVEPNHVTIASVLPACANIGALEVGQRIEAYARANG 12 ISGYSQNGRY +ALE++L+ME+E + PN VTIAS+LPACAN+GALEVG RIE YAR NG Sbjct: 186 ISGYSQNGRYAKALELFLKMEKENGLRPNEVTIASILPACANLGALEVGDRIETYARENG 245 Query: 11 YFR 3 R Sbjct: 246 LLR 248 >gb|EPS66384.1| hypothetical protein M569_08386 [Genlisea aurea] Length = 486 Score = 236 bits (603), Expect = 2e-60 Identities = 116/183 (63%), Positives = 139/183 (75%), Gaps = 2/183 (1%) Frame = -1 Query: 551 CISLYPQLLSRRLLPGAHSFTLLFVACAKLSSPSHGQMLHAHFLKFGHHHDVYALTSLVD 372 C LY Q+L R L P A+SFT LF+ACA L PS M+HA F K G + DVYA T+L+D Sbjct: 41 CFYLYSQILRRSLAPVANSFTFLFIACANLRDPSSAHMIHAQFSKLGFNRDVYASTALID 100 Query: 371 MYAKMGLLRFARRVFDEMDD--KEVPTWNSLISGYARKGDMEEALRCFLDMPSRNVISWT 198 YAK+GLLR A FDEMDD K VPTWN L++ YAR G +EEA R F +MPSRNVISWT Sbjct: 101 TYAKLGLLRSATTAFDEMDDGAKGVPTWNCLLTAYARNGHLEEASRLFFEMPSRNVISWT 160 Query: 197 ALISGYSQNGRYREALEMYLEMEREGEVEPNHVTIASVLPACANIGALEVGQRIEAYARA 18 ALISG++QNG+Y EALE+Y EMER ++PN VTIAS+LP+CAN+GAL G+RIEAYAR Sbjct: 161 ALISGFTQNGKYGEALELYSEMERIPNLKPNAVTIASILPSCANLGALNTGRRIEAYARE 220 Query: 17 NGY 9 NG+ Sbjct: 221 NGF 223 >ref|XP_002301427.2| hypothetical protein POPTR_0002s17640g [Populus trichocarpa] gi|550345235|gb|EEE80700.2| hypothetical protein POPTR_0002s17640g [Populus trichocarpa] Length = 514 Score = 231 bits (590), Expect = 7e-59 Identities = 111/183 (60%), Positives = 141/183 (77%) Frame = -1 Query: 551 CISLYPQLLSRRLLPGAHSFTLLFVACAKLSSPSHGQMLHAHFLKFGHHHDVYALTSLVD 372 C+SLY Q+L + P +FT LF ACA S HG+++H HF+K G DVYALT+LV+ Sbjct: 66 CLSLYSQMLLKGCPPNELTFTFLFPACASFYSLLHGKVIHTHFIKSGFDFDVYALTALVN 125 Query: 371 MYAKMGLLRFARRVFDEMDDKEVPTWNSLISGYARKGDMEEALRCFLDMPSRNVISWTAL 192 MYAK+G+L AR+VFDEM +++PTWNSLI+GY+R GDME AL F MPSR+V+SWT + Sbjct: 126 MYAKLGVLMLARQVFDEMTVRDIPTWNSLIAGYSRSGDMEGALELFKLMPSRSVVSWTTM 185 Query: 191 ISGYSQNGRYREALEMYLEMEREGEVEPNHVTIASVLPACANIGALEVGQRIEAYARANG 12 ISGYSQNG Y +ALEM+L+ME++ EV PN VTIASV ACA +GALEVG+RIE+YAR NG Sbjct: 186 ISGYSQNGMYTKALEMFLKMEKDKEVRPNEVTIASVFSACAKLGALEVGERIESYARDNG 245 Query: 11 YFR 3 + Sbjct: 246 LMK 248 Score = 66.2 bits (160), Expect = 5e-09 Identities = 42/162 (25%), Positives = 78/162 (48%) Frame = -1 Query: 521 RRLLPGAHSFTLLFVACAKLSSPSHGQMLHAHFLKFGHHHDVYALTSLVDMYAKMGLLRF 342 + + P + +F ACAKL + G+ + ++ G ++Y +L++MYA+ G + Sbjct: 209 KEVRPNEVTIASVFSACAKLGALEVGERIESYARDNGLMKNLYVSNTLLEMYARCGKIDA 268 Query: 341 ARRVFDEMDDKEVPTWNSLISGYARKGDMEEALRCFLDMPSRNVISWTALISGYSQNGRY 162 AR VF+E+ + RN+ SW +++ G + +GR Sbjct: 269 ARHVFNEIGKR------------------------------RNLCSWNSMMMGLAVHGRS 298 Query: 161 REALEMYLEMEREGEVEPNHVTIASVLPACANIGALEVGQRI 36 EAL++Y +M EG +EP+ VT ++ AC + G + G ++ Sbjct: 299 NEALQLYDQMLGEG-IEPDDVTFVGLILACTHGGLVAKGWQL 339 >gb|EXB83859.1| hypothetical protein L484_023466 [Morus notabilis] Length = 513 Score = 230 bits (587), Expect = 2e-58 Identities = 108/183 (59%), Positives = 140/183 (76%) Frame = -1 Query: 551 CISLYPQLLSRRLLPGAHSFTLLFVACAKLSSPSHGQMLHAHFLKFGHHHDVYALTSLVD 372 C+ LY ++ + P HSFTLLF C+ LSS GQM+H+HF+K GH D++ALT+LVD Sbjct: 69 CLFLYRRMCLQGCTPNEHSFTLLFSVCSSLSSRQLGQMMHSHFVKLGHVRDIFALTALVD 128 Query: 371 MYAKMGLLRFARRVFDEMDDKEVPTWNSLISGYARKGDMEEALRCFLDMPSRNVISWTAL 192 MYAK+G+L AR+ FDE + PTWNS++SGYAR GDME A F MP RNV+SWTA+ Sbjct: 129 MYAKLGMLDCARKQFDEKRVRGTPTWNSMLSGYARSGDMEGASELFRLMPQRNVVSWTAM 188 Query: 191 ISGYSQNGRYREALEMYLEMEREGEVEPNHVTIASVLPACANIGALEVGQRIEAYARANG 12 ISGYS+NG+Y +AL M+L+ME+E +V PN +TIASVLPACAN+GALEVG+R+E YAR G Sbjct: 189 ISGYSKNGQYAKALAMFLQMEKERDVRPNAITIASVLPACANLGALEVGERVEEYARKVG 248 Query: 11 YFR 3 + + Sbjct: 249 FLK 251 Score = 65.9 bits (159), Expect = 7e-09 Identities = 47/175 (26%), Positives = 81/175 (46%), Gaps = 5/175 (2%) Frame = -1 Query: 521 RRLLPGAHSFTLLFVACAKLSSPSHGQMLHAHFLKFGHHHDVYALTSLVDMYAKMGLLRF 342 R + P A + + ACA L + G+ + + K G D+Y ++++MYAK G + Sbjct: 212 RDVRPNAITIASVLPACANLGALEVGERVEEYARKVGFLKDLYVSNAVLEMYAKCGRIDT 271 Query: 341 ARRVFDEMD-DKEVPTWNSLISGYARKGDMEEALRCFLDMPSRNV----ISWTALISGYS 177 ARRVFDE+ + + +WNS+I G A G EAL + M + + +++ LI + Sbjct: 272 ARRVFDEIGRRRNLCSWNSMIMGLAVHGRCNEALDLYEQMTTVRIAPDDVTFVGLILACT 331 Query: 176 QNGRYREALEMYLEMEREGEVEPNHVTIASVLPACANIGALEVGQRIEAYARANG 12 G + +++ ME + + P ++ G L+ EAY G Sbjct: 332 HGGMAMKGQQLFKSMEPKFGITPKLEHYGCMVDLLGRAGKLQ-----EAYDLIQG 381 >ref|XP_004510637.1| PREDICTED: pentatricopeptide repeat-containing protein At5g08510-like [Cicer arietinum] Length = 512 Score = 229 bits (584), Expect = 3e-58 Identities = 110/183 (60%), Positives = 138/183 (75%) Frame = -1 Query: 551 CISLYPQLLSRRLLPGAHSFTLLFVACAKLSSPSHGQMLHAHFLKFGHHHDVYALTSLVD 372 C LY Q+L P H+F LF A +SS S GQMLH HF+K G HDV+A T+L+D Sbjct: 67 CFFLYSQMLLHGHSPNQHTFNFLFKAGTSVSSISLGQMLHTHFIKSGFKHDVFASTALLD 126 Query: 371 MYAKMGLLRFARRVFDEMDDKEVPTWNSLISGYARKGDMEEALRCFLDMPSRNVISWTAL 192 MYAK+G L+ AR VFDEM +EVPTWN++++GY R GDME AL F MP+RNV+SWT + Sbjct: 127 MYAKLGSLKLARHVFDEMSVREVPTWNAMMAGYTRFGDMERALELFGLMPARNVVSWTTV 186 Query: 191 ISGYSQNGRYREALEMYLEMEREGEVEPNHVTIASVLPACANIGALEVGQRIEAYARANG 12 +SGYSQN +Y +ALE++L ME E +V PN VT+ASVLPACAN+GALE+GQR+EAYAR NG Sbjct: 187 VSGYSQNKQYEKALELFLRMEWEKDVIPNEVTLASVLPACANLGALEIGQRVEAYARENG 246 Query: 11 YFR 3 F+ Sbjct: 247 LFK 249 >ref|XP_007051479.1| Pentatricopeptide repeat (PPR) superfamily protein, putative [Theobroma cacao] gi|508703740|gb|EOX95636.1| Pentatricopeptide repeat (PPR) superfamily protein, putative [Theobroma cacao] Length = 515 Score = 228 bits (582), Expect = 6e-58 Identities = 111/182 (60%), Positives = 136/182 (74%) Frame = -1 Query: 551 CISLYPQLLSRRLLPGAHSFTLLFVACAKLSSPSHGQMLHAHFLKFGHHHDVYALTSLVD 372 C++LY Q+ P HSF LF ACA L S HGQ+LH FLK G D YALT+L+ Sbjct: 68 CLTLYSQMCLNNCSPNEHSFIFLFPACASLPSLLHGQILHTQFLKSGFGLDCYALTALLV 127 Query: 371 MYAKMGLLRFARRVFDEMDDKEVPTWNSLISGYARKGDMEEALRCFLDMPSRNVISWTAL 192 MYAK+ +L AR+VFDEM + +PTWN+LISGY+ GDM+EAL F MP +NV+SWT + Sbjct: 128 MYAKLRMLPLARKVFDEMRVRNLPTWNALISGYSMCGDMKEALELFKSMPEKNVVSWTTM 187 Query: 191 ISGYSQNGRYREALEMYLEMEREGEVEPNHVTIASVLPACANIGALEVGQRIEAYARANG 12 ISGYSQNG+Y +AL+M+L ME+E V+PN VTIASVLPACAN+GALEVG+RIE YAR NG Sbjct: 188 ISGYSQNGQYSKALDMFLRMEKETGVKPNRVTIASVLPACANLGALEVGERIETYARENG 247 Query: 11 YF 6 F Sbjct: 248 LF 249 >ref|XP_004155716.1| PREDICTED: pentatricopeptide repeat-containing protein At5g08510-like [Cucumis sativus] Length = 512 Score = 228 bits (581), Expect = 8e-58 Identities = 109/183 (59%), Positives = 142/183 (77%) Frame = -1 Query: 551 CISLYPQLLSRRLLPGAHSFTLLFVACAKLSSPSHGQMLHAHFLKFGHHHDVYALTSLVD 372 C LY Q+ S+ P +SFT LF ACA L + GQMLH+HF K G D++A+T+L+D Sbjct: 66 CWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAMTALLD 125 Query: 371 MYAKMGLLRFARRVFDEMDDKEVPTWNSLISGYARKGDMEEALRCFLDMPSRNVISWTAL 192 MYAK+G+LR AR++FDEM +++PTWNSLI+GYAR G ME AL F MP RNVISWTAL Sbjct: 126 MYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTAL 185 Query: 191 ISGYSQNGRYREALEMYLEMEREGEVEPNHVTIASVLPACANIGALEVGQRIEAYARANG 12 ISGY+QNG+Y +ALEM++ +E E +PN V+IASVLPAC+ +GAL++G+RIEAYAR NG Sbjct: 186 ISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNG 245 Query: 11 YFR 3 +F+ Sbjct: 246 FFK 248 >ref|XP_004142577.1| PREDICTED: pentatricopeptide repeat-containing protein At5g08510-like [Cucumis sativus] Length = 589 Score = 228 bits (581), Expect = 8e-58 Identities = 109/183 (59%), Positives = 142/183 (77%) Frame = -1 Query: 551 CISLYPQLLSRRLLPGAHSFTLLFVACAKLSSPSHGQMLHAHFLKFGHHHDVYALTSLVD 372 C LY Q+ S+ P +SFT LF ACA L + GQMLH+HF K G D++A+T+L+D Sbjct: 66 CWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAMTALLD 125 Query: 371 MYAKMGLLRFARRVFDEMDDKEVPTWNSLISGYARKGDMEEALRCFLDMPSRNVISWTAL 192 MYAK+G+LR AR++FDEM +++PTWNSLI+GYAR G ME AL F MP RNVISWTAL Sbjct: 126 MYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTAL 185 Query: 191 ISGYSQNGRYREALEMYLEMEREGEVEPNHVTIASVLPACANIGALEVGQRIEAYARANG 12 ISGY+QNG+Y +ALEM++ +E E +PN V+IASVLPAC+ +GAL++G+RIEAYAR NG Sbjct: 186 ISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNG 245 Query: 11 YFR 3 +F+ Sbjct: 246 FFK 248 >ref|XP_003627527.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355521549|gb|AET02003.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 550 Score = 224 bits (571), Expect = 1e-56 Identities = 104/183 (56%), Positives = 135/183 (73%) Frame = -1 Query: 551 CISLYPQLLSRRLLPGAHSFTLLFVACAKLSSPSHGQMLHAHFLKFGHHHDVYALTSLVD 372 C +LY Q+ P ++F LF C LSS S GQM+H F+K G HDV+A T+L+D Sbjct: 63 CFTLYSQMYLHGHSPNQYTFNFLFTTCTSLSSLSLGQMIHTQFMKSGFKHDVFASTALLD 122 Query: 371 MYAKMGLLRFARRVFDEMDDKEVPTWNSLISGYARKGDMEEALRCFLDMPSRNVISWTAL 192 MYAK+G L+FAR VFDEM KE+ TWN++++G R GDME AL F MPSRNV+SWT + Sbjct: 123 MYAKLGCLKFARNVFDEMSVKELATWNAMMAGCTRFGDMERALELFWLMPSRNVVSWTTM 182 Query: 191 ISGYSQNGRYREALEMYLEMEREGEVEPNHVTIASVLPACANIGALEVGQRIEAYARANG 12 +SGY QN +Y +AL +++ MERE +V PN VT+ASVLPACAN+GALE+GQR+E YAR NG Sbjct: 183 VSGYLQNKQYEKALGLFMRMEREKDVSPNEVTLASVLPACANLGALEIGQRVEVYARKNG 242 Query: 11 YFR 3 +F+ Sbjct: 243 FFK 245 Score = 56.6 bits (135), Expect = 4e-06 Identities = 46/176 (26%), Positives = 83/176 (47%), Gaps = 18/176 (10%) Frame = -1 Query: 509 PGAHSFTLLFVACAKLSSPSHGQMLHAHFLKFGHHHDVYALTSLVDMYAKMGLLRFARRV 330 P + + ACA L + GQ + + K G +++ ++++MYAK G + A +V Sbjct: 210 PNEVTLASVLPACANLGALEIGQRVEVYARKNGFFKNLFVCNAVLEMYAKCGKIDVAWKV 269 Query: 329 FDEMDD-KEVPTWNSLISGYARKGDMEEALRCFLDMPSRNVISWT----------ALISG 183 FDE+ + + +WNS+I G A G +A++ + M ++S++ +I G Sbjct: 270 FDEIGRFRNLCSWNSMIMGLAVHGQCHKAIQLYDQM----LVSYSLYLLFISFAFIMIRG 325 Query: 182 YSQNGRYREALEMYLEME-------REGEVEPNHVTIASVLPACANIGALEVGQRI 36 + E L +E REG + P+ VT +L AC + G +E G+ + Sbjct: 326 GHGLVNHINRTEPNLSVEMVRNNRTREGTL-PDDVTFVGLLLACTHGGMVEKGKHV 380 >ref|XP_003548250.1| PREDICTED: pentatricopeptide repeat-containing protein At5g08510-like [Glycine max] Length = 512 Score = 223 bits (568), Expect = 2e-56 Identities = 104/183 (56%), Positives = 138/183 (75%) Frame = -1 Query: 551 CISLYPQLLSRRLLPGAHSFTLLFVACAKLSSPSHGQMLHAHFLKFGHHHDVYALTSLVD 372 C SLY Q+L LP H+F LF AC LSSPS GQMLH HF+K G D++A T+L+D Sbjct: 67 CFSLYSQMLLHSFLPNQHTFNFLFSACTSLSSPSLGQMLHTHFIKSGFEPDLFAATALLD 126 Query: 371 MYAKMGLLRFARRVFDEMDDKEVPTWNSLISGYARKGDMEEALRCFLDMPSRNVISWTAL 192 MY K+G L AR++FD+M + VPTWN++++G+AR GDM+ AL F MPSRNV+SWT + Sbjct: 127 MYTKVGTLELARKLFDQMPVRGVPTWNAMMAGHARFGDMDVALELFRLMPSRNVVSWTTM 186 Query: 191 ISGYSQNGRYREALEMYLEMEREGEVEPNHVTIASVLPACANIGALEVGQRIEAYARANG 12 ISGYS++ +Y EAL ++L ME+E + PN VT+AS+ PA AN+GALE+GQR+EAYAR NG Sbjct: 187 ISGYSRSKKYGEALGLFLRMEQEKGMMPNAVTLASIFPAFANLGALEIGQRVEAYARKNG 246 Query: 11 YFR 3 +F+ Sbjct: 247 FFK 249 Score = 57.4 bits (137), Expect = 2e-06 Identities = 44/161 (27%), Positives = 75/161 (46%), Gaps = 1/161 (0%) Frame = -1 Query: 515 LLPGAHSFTLLFVACAKLSSPSHGQMLHAHFLKFGHHHDVYALTSLVDMYAKMGLLRFAR 336 ++P A + +F A A L + GQ + A+ K G ++Y ++++MYAK G + A Sbjct: 212 MMPNAVTLASIFPAFANLGALEIGQRVEAYARKNGFFKNLYVSNAVLEMYAKCGKIDVAW 271 Query: 335 RVFDEMDD-KEVPTWNSLISGYARKGDMEEALRCFLDMPSRNVISWTALISGYSQNGRYR 159 +VF+E+ + + +WNS+I G A G+ C Sbjct: 272 KVFNEIGSLRNLCSWNSMIMGLAVHGEC-----C-------------------------- 300 Query: 158 EALEMYLEMEREGEVEPNHVTIASVLPACANIGALEVGQRI 36 + L++Y +M EG P+ VT +L AC + G +E G+ I Sbjct: 301 KTLKLYDQMLGEG-TSPDDVTFVGLLLACTHGGMVEKGRHI 340 >emb|CAN66974.1| hypothetical protein VITISV_022076 [Vitis vinifera] Length = 967 Score = 223 bits (567), Expect = 3e-56 Identities = 109/167 (65%), Positives = 132/167 (79%) Frame = -1 Query: 503 AHSFTLLFVACAKLSSPSHGQMLHAHFLKFGHHHDVYALTSLVDMYAKMGLLRFARRVFD 324 A S+ L ACA LSS G+MLH HF+K G DV+ALT+LVDMYAK+GLL AR+ FD Sbjct: 567 ACSWKALISACASLSSHQQGRMLHTHFVKSGFGCDVFALTALVDMYAKLGLLSLARKQFD 626 Query: 323 EMDDKEVPTWNSLISGYARKGDMEEALRCFLDMPSRNVISWTALISGYSQNGRYREALEM 144 EM ++VPTWNS+I+GYAR GD+E AL F MP+RNV SWTA+ISGY+QNG+Y +AL M Sbjct: 627 EMTVRDVPTWNSMIAGYARCGDLEGALELFRLMPARNVTSWTAMISGYAQNGQYAKALSM 686 Query: 143 YLEMEREGEVEPNHVTIASVLPACANIGALEVGQRIEAYARANGYFR 3 +L ME E E+ PN VT+ASVLPACAN+GALEVG+RIE YAR NGYF+ Sbjct: 687 FLMMEEETEMRPNEVTLASVLPACANLGALEVGERIEVYARGNGYFK 733 >ref|XP_007135284.1| hypothetical protein PHAVU_010G116300g [Phaseolus vulgaris] gi|561008329|gb|ESW07278.1| hypothetical protein PHAVU_010G116300g [Phaseolus vulgaris] Length = 510 Score = 220 bits (561), Expect = 2e-55 Identities = 103/183 (56%), Positives = 138/183 (75%) Frame = -1 Query: 551 CISLYPQLLSRRLLPGAHSFTLLFVACAKLSSPSHGQMLHAHFLKFGHHHDVYALTSLVD 372 C SLY Q+ LP H+F LF AC L S S GQMLH HF+K G D++A T+L+D Sbjct: 67 CFSLYYQMRLHGFLPNQHTFNFLFSACTSLFSHSLGQMLHTHFIKSGFEPDLFAATALLD 126 Query: 371 MYAKMGLLRFARRVFDEMDDKEVPTWNSLISGYARKGDMEEALRCFLDMPSRNVISWTAL 192 MY K+G L AR++FDEM + VPTWN+++SGYA+ GDME AL F MP+RN++SWT + Sbjct: 127 MYCKVGTLGLARQLFDEMPVRGVPTWNAMMSGYAKFGDMEGALELFGLMPTRNLVSWTTM 186 Query: 191 ISGYSQNGRYREALEMYLEMEREGEVEPNHVTIASVLPACANIGALEVGQRIEAYARANG 12 ISGYS+N ++ EAL ++L+ME+E + PN VT+AS+LPAC+N+GALE+GQR+EAYAR NG Sbjct: 187 ISGYSRNKQFGEALGLFLKMEQEKGIVPNEVTLASILPACSNLGALEIGQRVEAYARKNG 246 Query: 11 YFR 3 +F+ Sbjct: 247 FFK 249 >ref|XP_006289662.1| hypothetical protein CARUB_v10003220mg [Capsella rubella] gi|482558368|gb|EOA22560.1| hypothetical protein CARUB_v10003220mg [Capsella rubella] Length = 511 Score = 199 bits (505), Expect = 5e-49 Identities = 95/181 (52%), Positives = 127/181 (70%) Frame = -1 Query: 548 ISLYPQLLSRRLLPGAHSFTLLFVACAKLSSPSHGQMLHAHFLKFGHHHDVYALTSLVDM 369 I LY L L P H+F +F A A SS ++LH+ F K G D + T+L+ Sbjct: 67 IVLYNLLSFDGLRPNHHTFNFIFAASASFSSARPLRLLHSQFFKSGFESDSFCCTALITA 126 Query: 368 YAKMGLLRFARRVFDEMDDKEVPTWNSLISGYARKGDMEEALRCFLDMPSRNVISWTALI 189 YAK+G L ARRVFDEM +++ P WN++I+GY R+GDM+ A+ F MP +NVISWT +I Sbjct: 127 YAKLGELCCARRVFDEMSNRDAPVWNTMITGYQRQGDMKAAMELFDSMPCKNVISWTTVI 186 Query: 188 SGYSQNGRYREALEMYLEMEREGEVEPNHVTIASVLPACANIGALEVGQRIEAYARANGY 9 SG+SQNG Y EAL M+L ME++ V+PNHVT+ SVLPACAN+G LE+G+R+E+YAR NG+ Sbjct: 187 SGFSQNGNYSEALTMFLCMEKDKSVKPNHVTLVSVLPACANLGELEIGRRLESYARENGF 246 Query: 8 F 6 F Sbjct: 247 F 247 Score = 67.0 bits (162), Expect = 3e-09 Identities = 39/147 (26%), Positives = 72/147 (48%) Frame = -1 Query: 476 ACAKLSSPSHGQMLHAHFLKFGHHHDVYALTSLVDMYAKMGLLRFARRVFDEMDDKEVPT 297 ACA L G+ L ++ + G ++Y + ++MY+K G++ A+++F E+ ++ Sbjct: 224 ACANLGELEIGRRLESYARENGFFDNIYVCNATLEMYSKCGMIDLAKQLFHEIGNQ---- 279 Query: 296 WNSLISGYARKGDMEEALRCFLDMPSRNVISWTALISGYSQNGRYREALEMYLEMEREGE 117 RN+ SW ++I + +G++ EALE+Y +M REGE Sbjct: 280 --------------------------RNLCSWNSMIGSLATHGKHHEALELYAQMLREGE 313 Query: 116 VEPNHVTIASVLPACANIGALEVGQRI 36 +P+ VT +L AC + G + G + Sbjct: 314 -KPDAVTFVGLLLACVHGGMVVKGHEL 339 >ref|NP_196468.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75171895|sp|Q9FNN7.1|PP371_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At5g08510 gi|9759345|dbj|BAB10000.1| unnamed protein product [Arabidopsis thaliana] gi|50897238|gb|AAT85758.1| At5g08510 [Arabidopsis thaliana] gi|332003930|gb|AED91313.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 511 Score = 198 bits (503), Expect = 8e-49 Identities = 93/181 (51%), Positives = 126/181 (69%) Frame = -1 Query: 548 ISLYPQLLSRRLLPGAHSFTLLFVACAKLSSPSHGQMLHAHFLKFGHHHDVYALTSLVDM 369 I LY L L P H+F +F A A SS ++LH+ F + G D + T+L+ Sbjct: 67 IVLYNLLSFDGLRPSHHTFNFIFAASASFSSARPLRLLHSQFFRSGFESDSFCCTTLITA 126 Query: 368 YAKMGLLRFARRVFDEMDDKEVPTWNSLISGYARKGDMEEALRCFLDMPSRNVISWTALI 189 YAK+G L ARRVFDEM ++VP WN++I+GY R+GDM+ A+ F MP +NV SWT +I Sbjct: 127 YAKLGALCCARRVFDEMSKRDVPVWNAMITGYQRRGDMKAAMELFDSMPRKNVTSWTTVI 186 Query: 188 SGYSQNGRYREALEMYLEMEREGEVEPNHVTIASVLPACANIGALEVGQRIEAYARANGY 9 SG+SQNG Y EAL+M+L ME++ V+PNH+T+ SVLPACAN+G LE+G+R+E YAR NG+ Sbjct: 187 SGFSQNGNYSEALKMFLCMEKDKSVKPNHITVVSVLPACANLGELEIGRRLEGYARENGF 246 Query: 8 F 6 F Sbjct: 247 F 247 Score = 63.2 bits (152), Expect = 4e-08 Identities = 35/147 (23%), Positives = 70/147 (47%), Gaps = 5/147 (3%) Frame = -1 Query: 476 ACAKLSSPSHGQMLHAHFLKFGHHHDVYALTSLVDMYAKMGLLRFARRVFDEM-DDKEVP 300 ACA L G+ L + + G ++Y + ++MY+K G++ A+R+F+E+ + + + Sbjct: 224 ACANLGELEIGRRLEGYARENGFFDNIYVCNATIEMYSKCGMIDVAKRLFEELGNQRNLC 283 Query: 299 TWNSLISGYARKGDMEEALRCFLDM----PSRNVISWTALISGYSQNGRYREALEMYLEM 132 +WNS+I A G +EAL F M + +++ L+ G + E++ M Sbjct: 284 SWNSMIGSLATHGKHDEALTLFAQMLREGEKPDAVTFVGLLLACVHGGMVVKGQELFKSM 343 Query: 131 EREGEVEPNHVTIASVLPACANIGALE 51 E ++ P ++ +G L+ Sbjct: 344 EEVHKISPKLEHYGCMIDLLGRVGKLQ 370 >dbj|BAE98759.1| hypothetical protein [Arabidopsis thaliana] Length = 504 Score = 198 bits (503), Expect = 8e-49 Identities = 93/181 (51%), Positives = 126/181 (69%) Frame = -1 Query: 548 ISLYPQLLSRRLLPGAHSFTLLFVACAKLSSPSHGQMLHAHFLKFGHHHDVYALTSLVDM 369 I LY L L P H+F +F A A SS ++LH+ F + G D + T+L+ Sbjct: 60 IVLYNLLSFDGLRPSHHTFNFIFAASASFSSARPLRLLHSQFFRSGFESDSFCCTTLITA 119 Query: 368 YAKMGLLRFARRVFDEMDDKEVPTWNSLISGYARKGDMEEALRCFLDMPSRNVISWTALI 189 YAK+G L ARRVFDEM ++VP WN++I+GY R+GDM+ A+ F MP +NV SWT +I Sbjct: 120 YAKLGALCCARRVFDEMSKRDVPVWNAMITGYQRRGDMKAAMELFDSMPRKNVTSWTTVI 179 Query: 188 SGYSQNGRYREALEMYLEMEREGEVEPNHVTIASVLPACANIGALEVGQRIEAYARANGY 9 SG+SQNG Y EAL+M+L ME++ V+PNH+T+ SVLPACAN+G LE+G+R+E YAR NG+ Sbjct: 180 SGFSQNGNYSEALKMFLCMEKDKSVKPNHITVVSVLPACANLGELEIGRRLEGYARENGF 239 Query: 8 F 6 F Sbjct: 240 F 240 Score = 63.2 bits (152), Expect = 4e-08 Identities = 35/147 (23%), Positives = 70/147 (47%), Gaps = 5/147 (3%) Frame = -1 Query: 476 ACAKLSSPSHGQMLHAHFLKFGHHHDVYALTSLVDMYAKMGLLRFARRVFDEM-DDKEVP 300 ACA L G+ L + + G ++Y + ++MY+K G++ A+R+F+E+ + + + Sbjct: 217 ACANLGELEIGRRLEGYARENGFFDNIYVCNATIEMYSKCGMIDVAKRLFEELGNQRNLC 276 Query: 299 TWNSLISGYARKGDMEEALRCFLDM----PSRNVISWTALISGYSQNGRYREALEMYLEM 132 +WNS+I A G +EAL F M + +++ L+ G + E++ M Sbjct: 277 SWNSMIGSLATHGKHDEALTLFAQMLREGEKPDAVTFVGLLLACVHGGMVVKGQELFKSM 336 Query: 131 EREGEVEPNHVTIASVLPACANIGALE 51 E ++ P ++ +G L+ Sbjct: 337 EEVHKISPKLEHYGCMIDLLGRVGKLQ 363