BLASTX nr result
ID: Mentha29_contig00004997
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Mentha29_contig00004997 (1201 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EYU22395.1| hypothetical protein MIMGU_mgv1a010291mg [Mimulus... 361 4e-97 ref|XP_007052035.1| Tetratricopeptide repeat (TPR)-like superfam... 354 4e-95 ref|XP_004229820.1| PREDICTED: pentatricopeptide repeat-containi... 354 5e-95 ref|XP_006339440.1| PREDICTED: pentatricopeptide repeat-containi... 353 6e-95 gb|EXB52663.1| hypothetical protein L484_022440 [Morus notabilis] 346 1e-92 emb|CBI38862.3| unnamed protein product [Vitis vinifera] 343 6e-92 ref|XP_002273719.1| PREDICTED: pentatricopeptide repeat-containi... 343 6e-92 ref|XP_007220375.1| hypothetical protein PRUPE_ppa018787mg [Prun... 337 5e-90 ref|XP_006445236.1| hypothetical protein CICLE_v10020287mg [Citr... 333 1e-88 ref|XP_002892034.1| pentatricopeptide repeat-containing protein ... 330 7e-88 ref|XP_004306911.1| PREDICTED: pentatricopeptide repeat-containi... 329 1e-87 ref|NP_171699.1| pentatricopeptide repeat-containing protein [Ar... 328 4e-87 ref|XP_006418475.1| hypothetical protein EUTSA_v10007755mg [Eutr... 325 2e-86 ref|XP_006307695.1| hypothetical protein CARUB_v10009324mg [Caps... 325 3e-86 ref|XP_003623723.1| Pentatricopeptide repeat-containing protein ... 323 7e-86 gb|EPS65198.1| hypothetical protein M569_09579 [Genlisea aurea] 322 2e-85 ref|XP_004134345.1| PREDICTED: pentatricopeptide repeat-containi... 320 6e-85 ref|XP_004492640.1| PREDICTED: pentatricopeptide repeat-containi... 320 1e-84 ref|XP_003552343.1| PREDICTED: pentatricopeptide repeat-containi... 316 1e-83 ref|XP_003533639.2| PREDICTED: pentatricopeptide repeat-containi... 315 2e-83 >gb|EYU22395.1| hypothetical protein MIMGU_mgv1a010291mg [Mimulus guttatus] Length = 317 Score = 361 bits (926), Expect = 4e-97 Identities = 182/253 (71%), Positives = 213/253 (84%) Frame = -1 Query: 1201 EESFEPNIRDYTKIIHGYAKQNKLKEAEHALEAMKKRDFMCDQVTLTALIHMYSKAGNLK 1022 EESFE NIRDYT IIHGYAKQN+L+EAE L AMK R +CDQV LTALIHMYSK+GNLK Sbjct: 65 EESFEANIRDYTNIIHGYAKQNRLQEAEQTLTAMKNRGLLCDQVILTALIHMYSKSGNLK 124 Query: 1021 LAEDTFEEMKLLGVRLDKRSYGSIVMAYIRAGKLTSAEKLLREMKFQEIYAPREVYKALL 842 A+ +FEEMK+LGV+LD+RSYGSI+MA+IRA KL+SAE LL+EM+ QEIYA REVYKALL Sbjct: 125 QAQYSFEEMKMLGVQLDRRSYGSIIMAHIRAEKLSSAETLLQEMEAQEIYAGREVYKALL 184 Query: 841 RTYSMLGDSEGAQRIFDEIQVAGIIPDVKVCGLLINAYVASGKTREARIAFENMRRSGLE 662 R YSM GD GAQRIFD IQVAGI PD +VCGLLINAYV SG+ +EA +AF NMRRSG+E Sbjct: 185 RAYSMKGDFSGAQRIFDAIQVAGITPDARVCGLLINAYVVSGRAQEACVAFGNMRRSGIE 244 Query: 661 PNDKCVALVLTAYEKEERLKEALDFLIELERDSIMIGKEASDLLVKWFRRLGVVEEVEHV 482 NDKCVALVL A+E E++LK+ALD LIELE + IM+GKE SDLLVKWF++LGVVEEV V Sbjct: 245 VNDKCVALVLAAFEMEDKLKDALDLLIELEGEGIMVGKEGSDLLVKWFQKLGVVEEVAIV 304 Query: 481 LKDFASRMVEPFL 443 L++ +M +P L Sbjct: 305 LRELGLKMAQPVL 317 >ref|XP_007052035.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative [Theobroma cacao] gi|508704296|gb|EOX96192.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative [Theobroma cacao] Length = 420 Score = 354 bits (909), Expect = 4e-95 Identities = 172/247 (69%), Positives = 213/247 (86%) Frame = -1 Query: 1201 EESFEPNIRDYTKIIHGYAKQNKLKEAEHALEAMKKRDFMCDQVTLTALIHMYSKAGNLK 1022 EESFE NIRD+TKIIHGY KQ +L+EAE+ L AMK+R F+CDQVTLT ++HMYSKAGNLK Sbjct: 165 EESFEANIRDFTKIIHGYGKQKRLQEAENILVAMKRRGFICDQVTLTTMVHMYSKAGNLK 224 Query: 1021 LAEDTFEEMKLLGVRLDKRSYGSIVMAYIRAGKLTSAEKLLREMKFQEIYAPREVYKALL 842 LAE+TFEE+KLLG +LDKRSYGS++MAYIR+G E LLREM QEIYA EVYKALL Sbjct: 225 LAEETFEEIKLLGQQLDKRSYGSMIMAYIRSGTPEQGEALLREMDSQEIYAGSEVYKALL 284 Query: 841 RTYSMLGDSEGAQRIFDEIQVAGIIPDVKVCGLLINAYVASGKTREARIAFENMRRSGLE 662 R YSMLGD+ GAQR+FD IQ+AGI PD ++CGLLINAY +G++ +A IAFENMRR+GLE Sbjct: 285 RAYSMLGDANGAQRVFDTIQLAGISPDARMCGLLINAYQLAGQSDKAHIAFENMRRAGLE 344 Query: 661 PNDKCVALVLTAYEKEERLKEALDFLIELERDSIMIGKEASDLLVKWFRRLGVVEEVEHV 482 P+DKCVALV+ AYEK+ +L +ALDFL+ELERD I++GKEAS +L +WF++LGVVE+VE V Sbjct: 345 PSDKCVALVVAAYEKQNKLNKALDFLMELERDGIVVGKEASGILAQWFKKLGVVEQVELV 404 Query: 481 LKDFASR 461 L++FA++ Sbjct: 405 LREFAAK 411 >ref|XP_004229820.1| PREDICTED: pentatricopeptide repeat-containing protein At1g01970-like [Solanum lycopersicum] Length = 415 Score = 354 bits (908), Expect = 5e-95 Identities = 173/245 (70%), Positives = 207/245 (84%) Frame = -1 Query: 1198 ESFEPNIRDYTKIIHGYAKQNKLKEAEHALEAMKKRDFMCDQVTLTALIHMYSKAGNLKL 1019 ESFE NIRDYTKIIHGYAKQN+LKEAE +MK R F CDQVTLTAL+HMYSKA NLKL Sbjct: 165 ESFEANIRDYTKIIHGYAKQNRLKEAESVFLSMKSRGFTCDQVTLTALVHMYSKASNLKL 224 Query: 1018 AEDTFEEMKLLGVRLDKRSYGSIVMAYIRAGKLTSAEKLLREMKFQEIYAPREVYKALLR 839 AEDTFEEM+LLGV LDKRS+GSI+MAY+RAGKL E LL+EM+ QE YA EVYKALLR Sbjct: 225 AEDTFEEMRLLGVPLDKRSFGSIIMAYVRAGKLGQGEALLKEMEEQETYAGPEVYKALLR 284 Query: 838 TYSMLGDSEGAQRIFDEIQVAGIIPDVKVCGLLINAYVASGKTREARIAFENMRRSGLEP 659 YSM GDS+GAQR+FD IQ+AG+IPD +CGLL+NAY+ +G+ E IAFENMRR G++P Sbjct: 285 AYSMSGDSKGAQRVFDTIQLAGVIPDATICGLLMNAYIMAGQLSETCIAFENMRRVGIKP 344 Query: 658 NDKCVALVLTAYEKEERLKEALDFLIELERDSIMIGKEASDLLVKWFRRLGVVEEVEHVL 479 NDKC+ L+LTAYE E +L +ALD L++LERD I++G+EAS+LL +WF+RLGVV EVE VL Sbjct: 345 NDKCITLLLTAYETENKLSKALDVLMDLERDGIVLGREASELLARWFKRLGVVGEVELVL 404 Query: 478 KDFAS 464 +D+AS Sbjct: 405 RDYAS 409 >ref|XP_006339440.1| PREDICTED: pentatricopeptide repeat-containing protein At1g01970-like [Solanum tuberosum] Length = 415 Score = 353 bits (907), Expect = 6e-95 Identities = 172/245 (70%), Positives = 207/245 (84%) Frame = -1 Query: 1198 ESFEPNIRDYTKIIHGYAKQNKLKEAEHALEAMKKRDFMCDQVTLTALIHMYSKAGNLKL 1019 ESFE NIRDYTKIIHGYAKQN+LKEAE +MK R F CDQVTLTAL+HMYSKAGNLKL Sbjct: 165 ESFEANIRDYTKIIHGYAKQNRLKEAESVFLSMKSRGFTCDQVTLTALVHMYSKAGNLKL 224 Query: 1018 AEDTFEEMKLLGVRLDKRSYGSIVMAYIRAGKLTSAEKLLREMKFQEIYAPREVYKALLR 839 AEDTFEEM+LLGV LDKRS+GSI+MAY+RAGKL E LL+EM+ QEIYA EVYKALLR Sbjct: 225 AEDTFEEMRLLGVPLDKRSFGSIIMAYVRAGKLGQGEALLKEMEEQEIYAGPEVYKALLR 284 Query: 838 TYSMLGDSEGAQRIFDEIQVAGIIPDVKVCGLLINAYVASGKTREARIAFENMRRSGLEP 659 YSM GDS+GAQR+FD Q+AG+IPD +CGLL+NAY+ +G+ EA I FENMRR G++P Sbjct: 285 AYSMSGDSKGAQRVFDTTQLAGVIPDATICGLLMNAYIMAGQLSEACITFENMRRVGIKP 344 Query: 658 NDKCVALVLTAYEKEERLKEALDFLIELERDSIMIGKEASDLLVKWFRRLGVVEEVEHVL 479 NDKC+ L+L AYE E +L +ALD L++LERD +++G+EAS+LL +WF+RLGVV EVE VL Sbjct: 345 NDKCITLLLKAYETENKLSKALDVLMDLERDGVVLGREASELLARWFKRLGVVGEVELVL 404 Query: 478 KDFAS 464 +D+AS Sbjct: 405 RDYAS 409 >gb|EXB52663.1| hypothetical protein L484_022440 [Morus notabilis] Length = 406 Score = 346 bits (888), Expect = 1e-92 Identities = 173/247 (70%), Positives = 208/247 (84%) Frame = -1 Query: 1201 EESFEPNIRDYTKIIHGYAKQNKLKEAEHALEAMKKRDFMCDQVTLTALIHMYSKAGNLK 1022 EESFE NIRDYTKIIH Y KQN+L++AE L AMK R F+ DQVTLT IHMYSKAGNLK Sbjct: 151 EESFEANIRDYTKIIHCYGKQNRLEDAEKTLLAMKSRGFIRDQVTLTTFIHMYSKAGNLK 210 Query: 1021 LAEDTFEEMKLLGVRLDKRSYGSIVMAYIRAGKLTSAEKLLREMKFQEIYAPREVYKALL 842 LAE+TFEE+KLLG LDKRSYGS++MAYIRAG E +LREM +EIYA EVYKALL Sbjct: 211 LAEETFEELKLLGQPLDKRSYGSMIMAYIRAGMPDQGENILREMDVEEIYAGSEVYKALL 270 Query: 841 RTYSMLGDSEGAQRIFDEIQVAGIIPDVKVCGLLINAYVASGKTREARIAFENMRRSGLE 662 R YSM GD+EGAQR+FD IQ+AGI+PD ++CGLLINAYV SG++ +A +AF NMRR+GLE Sbjct: 271 RAYSMTGDAEGAQRVFDAIQLAGILPDPRLCGLLINAYVESGQSEKACVAFGNMRRAGLE 330 Query: 661 PNDKCVALVLTAYEKEERLKEALDFLIELERDSIMIGKEASDLLVKWFRRLGVVEEVEHV 482 P+DKCVALVL AYEKE +L+ ALDFL+ELER IM+G+EAS+ LV WFR+LGVV+EV+ V Sbjct: 331 PSDKCVALVLCAYEKENKLQRALDFLMELERHGIMVGEEASETLVGWFRKLGVVKEVDLV 390 Query: 481 LKDFASR 461 L+++AS+ Sbjct: 391 LREYASK 397 >emb|CBI38862.3| unnamed protein product [Vitis vinifera] Length = 353 Score = 343 bits (881), Expect = 6e-92 Identities = 170/247 (68%), Positives = 211/247 (85%) Frame = -1 Query: 1201 EESFEPNIRDYTKIIHGYAKQNKLKEAEHALEAMKKRDFMCDQVTLTALIHMYSKAGNLK 1022 EESFE NIRDYTKII GY KQN+L++AE+ L AMK+R F+CDQVTLTA+I+MYSKAGNL+ Sbjct: 97 EESFEANIRDYTKIIDGYGKQNRLQDAENTLSAMKRRGFICDQVTLTAMINMYSKAGNLE 156 Query: 1021 LAEDTFEEMKLLGVRLDKRSYGSIVMAYIRAGKLTSAEKLLREMKFQEIYAPREVYKALL 842 LAE TFEE+KLLG LDKRSYGS++MAYIRAG E L++EM+ +EIYA REVYKALL Sbjct: 157 LAEKTFEEIKLLGHPLDKRSYGSMIMAYIRAGMPDQGEILVKEMEAKEIYAGREVYKALL 216 Query: 841 RTYSMLGDSEGAQRIFDEIQVAGIIPDVKVCGLLINAYVASGKTREARIAFENMRRSGLE 662 R YS D+EGAQR+FD IQ AGI PDVK+C LLINAY +G+T++A +AFENMRRSGL+ Sbjct: 217 RAYSNTSDAEGAQRVFDAIQFAGISPDVKLCALLINAYRVAGQTQKAHVAFENMRRSGLK 276 Query: 661 PNDKCVALVLTAYEKEERLKEALDFLIELERDSIMIGKEASDLLVKWFRRLGVVEEVEHV 482 PNDK +AL+L AYEKE +L +ALDFLI+LERD I++GKEAS+LL WF+RLGVV+EVE V Sbjct: 277 PNDKSIALMLAAYEKENKLNKALDFLIDLERDGIVLGKEASELLAAWFQRLGVVKEVELV 336 Query: 481 LKDFASR 461 L++++++ Sbjct: 337 LREYSAK 343 >ref|XP_002273719.1| PREDICTED: pentatricopeptide repeat-containing protein At1g01970-like [Vitis vinifera] Length = 352 Score = 343 bits (881), Expect = 6e-92 Identities = 170/247 (68%), Positives = 211/247 (85%) Frame = -1 Query: 1201 EESFEPNIRDYTKIIHGYAKQNKLKEAEHALEAMKKRDFMCDQVTLTALIHMYSKAGNLK 1022 EESFE NIRDYTKII GY KQN+L++AE+ L AMK+R F+CDQVTLTA+I+MYSKAGNL+ Sbjct: 97 EESFEANIRDYTKIIDGYGKQNRLQDAENTLSAMKRRGFICDQVTLTAMINMYSKAGNLE 156 Query: 1021 LAEDTFEEMKLLGVRLDKRSYGSIVMAYIRAGKLTSAEKLLREMKFQEIYAPREVYKALL 842 LAE TFEE+KLLG LDKRSYGS++MAYIRAG E L++EM+ +EIYA REVYKALL Sbjct: 157 LAEKTFEEIKLLGHPLDKRSYGSMIMAYIRAGMPDQGEILVKEMEAKEIYAGREVYKALL 216 Query: 841 RTYSMLGDSEGAQRIFDEIQVAGIIPDVKVCGLLINAYVASGKTREARIAFENMRRSGLE 662 R YS D+EGAQR+FD IQ AGI PDVK+C LLINAY +G+T++A +AFENMRRSGL+ Sbjct: 217 RAYSNTSDAEGAQRVFDAIQFAGISPDVKLCALLINAYRVAGQTQKAHVAFENMRRSGLK 276 Query: 661 PNDKCVALVLTAYEKEERLKEALDFLIELERDSIMIGKEASDLLVKWFRRLGVVEEVEHV 482 PNDK +AL+L AYEKE +L +ALDFLI+LERD I++GKEAS+LL WF+RLGVV+EVE V Sbjct: 277 PNDKSIALMLAAYEKENKLNKALDFLIDLERDGIVLGKEASELLAAWFQRLGVVKEVELV 336 Query: 481 LKDFASR 461 L++++++ Sbjct: 337 LREYSAK 343 >ref|XP_007220375.1| hypothetical protein PRUPE_ppa018787mg [Prunus persica] gi|462416837|gb|EMJ21574.1| hypothetical protein PRUPE_ppa018787mg [Prunus persica] Length = 377 Score = 337 bits (865), Expect = 5e-90 Identities = 165/245 (67%), Positives = 204/245 (83%) Frame = -1 Query: 1201 EESFEPNIRDYTKIIHGYAKQNKLKEAEHALEAMKKRDFMCDQVTLTALIHMYSKAGNLK 1022 EESFE N+RDYTKIIHGY KQN+++EA L MK R F+CDQVTLTA+I MYSKAG++K Sbjct: 122 EESFEVNLRDYTKIIHGYGKQNRIEEAVKILSNMKARGFICDQVTLTAMIDMYSKAGHVK 181 Query: 1021 LAEDTFEEMKLLGVRLDKRSYGSIVMAYIRAGKLTSAEKLLREMKFQEIYAPREVYKALL 842 LAE+TFEE+KLLG LDKRSYGS++MAYIRAG E LL EM QEIYA EVYKALL Sbjct: 182 LAEETFEEIKLLGQPLDKRSYGSMIMAYIRAGVPDQGESLLIEMDAQEIYAGSEVYKALL 241 Query: 841 RTYSMLGDSEGAQRIFDEIQVAGIIPDVKVCGLLINAYVASGKTREARIAFENMRRSGLE 662 R YSM+GD+EGAQR+F+ +Q+AGI PD K+CGLLINAY SG++++AR+AFENMR +G+ Sbjct: 242 RAYSMVGDTEGAQRVFNAVQLAGISPDAKLCGLLINAYGVSGQSQKARVAFENMRTAGIR 301 Query: 661 PNDKCVALVLTAYEKEERLKEALDFLIELERDSIMIGKEASDLLVKWFRRLGVVEEVEHV 482 P DKC+ALVL AYEKE +L++AL FL+ LERD IM+GKEA++ L WFR+LGVVEEV+ + Sbjct: 302 PTDKCIALVLAAYEKENKLQKALKFLMALERDGIMVGKEAAETLAAWFRKLGVVEEVDTI 361 Query: 481 LKDFA 467 L++FA Sbjct: 362 LREFA 366 >ref|XP_006445236.1| hypothetical protein CICLE_v10020287mg [Citrus clementina] gi|568875716|ref|XP_006490938.1| PREDICTED: pentatricopeptide repeat-containing protein At1g01970-like [Citrus sinensis] gi|557547498|gb|ESR58476.1| hypothetical protein CICLE_v10020287mg [Citrus clementina] Length = 423 Score = 333 bits (853), Expect = 1e-88 Identities = 164/247 (66%), Positives = 203/247 (82%) Frame = -1 Query: 1201 EESFEPNIRDYTKIIHGYAKQNKLKEAEHALEAMKKRDFMCDQVTLTALIHMYSKAGNLK 1022 EESFE NIRDYTKIIHGY K+ +++ AE+ L AMK+R F+CDQVTLT ++ MYSKAGNLK Sbjct: 163 EESFEANIRDYTKIIHGYGKKMQIQNAENTLLAMKRRGFICDQVTLTVMVVMYSKAGNLK 222 Query: 1021 LAEDTFEEMKLLGVRLDKRSYGSIVMAYIRAGKLTSAEKLLREMKFQEIYAPREVYKALL 842 +AE+TFEE+KLLG LDKRSYGS+VMAY+RAG L E LLREM QE+Y EVYKALL Sbjct: 223 MAEETFEEIKLLGEPLDKRSYGSMVMAYVRAGMLDRGEVLLREMDAQEVYVGSEVYKALL 282 Query: 841 RTYSMLGDSEGAQRIFDEIQVAGIIPDVKVCGLLINAYVASGKTREARIAFENMRRSGLE 662 R YSM G+SEGAQR+F+ IQ AGI PD ++C LLINAY +G++++A AF+NMR++GLE Sbjct: 283 RGYSMNGNSEGAQRVFEAIQFAGITPDARMCALLINAYQMAGQSQKAYTAFQNMRKAGLE 342 Query: 661 PNDKCVALVLTAYEKEERLKEALDFLIELERDSIMIGKEASDLLVKWFRRLGVVEEVEHV 482 P+DKCVAL+L+A EKE +L AL+FLI+LERD M+GKEAS L WF+RLGVVEEVEHV Sbjct: 343 PSDKCVALILSACEKENQLNRALEFLIDLERDGFMVGKEASCTLAAWFKRLGVVEEVEHV 402 Query: 481 LKDFASR 461 L+++ R Sbjct: 403 LREYGLR 409 >ref|XP_002892034.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297337876|gb|EFH68293.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 409 Score = 330 bits (846), Expect = 7e-88 Identities = 163/251 (64%), Positives = 205/251 (81%) Frame = -1 Query: 1201 EESFEPNIRDYTKIIHGYAKQNKLKEAEHALEAMKKRDFMCDQVTLTALIHMYSKAGNLK 1022 E+SFE N RDYTKIIH Y K N++++AE L +MK R F+ DQVTLTA++ +YSKAG K Sbjct: 158 EDSFEANARDYTKIIHYYGKLNQVEDAERTLLSMKNRGFLIDQVTLTAIVQLYSKAGYHK 217 Query: 1021 LAEDTFEEMKLLGVRLDKRSYGSIVMAYIRAGKLTSAEKLLREMKFQEIYAPREVYKALL 842 LAE+TF E+KL+G LD RSYGS++MAYIRAG E LLREM QEI A REVYKALL Sbjct: 218 LAEETFNEIKLIGEPLDNRSYGSMIMAYIRAGAPEKGEALLREMDSQEICAGREVYKALL 277 Query: 841 RTYSMLGDSEGAQRIFDEIQVAGIIPDVKVCGLLINAYVASGKTREARIAFENMRRSGLE 662 R YSM GD+EGA+R+FD +Q+AGI PDVK+CGLLINAY SG+++ AR+AFENMR++G++ Sbjct: 278 RAYSMGGDAEGAKRVFDAVQIAGITPDVKLCGLLINAYSVSGQSQNARLAFENMRKAGIK 337 Query: 661 PNDKCVALVLTAYEKEERLKEALDFLIELERDSIMIGKEASDLLVKWFRRLGVVEEVEHV 482 DKCVALVL AYEKEE+L EAL FL+ELE+DSIM+GKEAS +L +WF++LGVVEEVE + Sbjct: 338 ATDKCVALVLAAYEKEEKLNEALGFLVELEKDSIMVGKEASAVLAQWFKKLGVVEEVELL 397 Query: 481 LKDFASRMVEP 449 L++F+S +P Sbjct: 398 LREFSSSQSQP 408 >ref|XP_004306911.1| PREDICTED: pentatricopeptide repeat-containing protein At1g01970-like [Fragaria vesca subsp. vesca] Length = 415 Score = 329 bits (844), Expect = 1e-87 Identities = 158/246 (64%), Positives = 207/246 (84%) Frame = -1 Query: 1201 EESFEPNIRDYTKIIHGYAKQNKLKEAEHALEAMKKRDFMCDQVTLTALIHMYSKAGNLK 1022 ++SFEPN+RDYTKIIHGY K+N++++AE L MK R F+CDQVTLTA+I MYSKAG+LK Sbjct: 160 DDSFEPNVRDYTKIIHGYGKRNRIEDAESTLLNMKSRGFVCDQVTLTAMIDMYSKAGHLK 219 Query: 1021 LAEDTFEEMKLLGVRLDKRSYGSIVMAYIRAGKLTSAEKLLREMKFQEIYAPREVYKALL 842 LAEDTFE++KLLG ++DKR+YGS++MAYIRAG E +L EM QEI A EVYKALL Sbjct: 220 LAEDTFEDIKLLGQQVDKRAYGSMIMAYIRAGMPEQGETVLIEMDAQEIVAGSEVYKALL 279 Query: 841 RTYSMLGDSEGAQRIFDEIQVAGIIPDVKVCGLLINAYVASGKTREARIAFENMRRSGLE 662 R YSM+GD+EGAQR+F+ +Q+AGI PD K+CGLLINAY SG++++AR AFENMR++GL+ Sbjct: 280 RAYSMVGDTEGAQRVFNALQLAGISPDAKICGLLINAYGISGQSQKARAAFENMRKAGLK 339 Query: 661 PNDKCVALVLTAYEKEERLKEALDFLIELERDSIMIGKEASDLLVKWFRRLGVVEEVEHV 482 P+DKC+AL+L AYEKE +L+ AL FL+ LER+ IM+GKE ++ L WF++LGVVEEV+ V Sbjct: 340 PSDKCIALMLAAYEKENKLQMALKFLMGLEREGIMVGKEVAETLAGWFKKLGVVEEVDMV 399 Query: 481 LKDFAS 464 L++FA+ Sbjct: 400 LREFAA 405 >ref|NP_171699.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75264110|sp|Q9LPC4.1|PPR1_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At1g01970 gi|8570448|gb|AAF76475.1|AC020622_9 Contains similarity to an unknown protein gi|AAD26479 from Arabidopsis thaliana BAC gb|AC007169 and contains multiple PPR PF|01535 repeats [Arabidopsis thaliana] gi|34098825|gb|AAQ56795.1| At1g01970 [Arabidopsis thaliana] gi|110735700|dbj|BAE99830.1| hypothetical protein [Arabidopsis thaliana] gi|332189240|gb|AEE27361.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 409 Score = 328 bits (840), Expect = 4e-87 Identities = 163/251 (64%), Positives = 205/251 (81%) Frame = -1 Query: 1201 EESFEPNIRDYTKIIHGYAKQNKLKEAEHALEAMKKRDFMCDQVTLTALIHMYSKAGNLK 1022 ++SFE N RDYTKIIH Y K N++++AE L +MK R F+ DQVTLTA++ +YSKAG K Sbjct: 158 QDSFEANARDYTKIIHYYGKLNQVEDAERTLLSMKNRGFLIDQVTLTAMVQLYSKAGCHK 217 Query: 1021 LAEDTFEEMKLLGVRLDKRSYGSIVMAYIRAGKLTSAEKLLREMKFQEIYAPREVYKALL 842 LAE+TF E+KLLG LD RSYGS++MAYIRAG E LLREM QEI A REVYKALL Sbjct: 218 LAEETFNEIKLLGEPLDYRSYGSMIMAYIRAGVPEKGESLLREMDSQEICAGREVYKALL 277 Query: 841 RTYSMLGDSEGAQRIFDEIQVAGIIPDVKVCGLLINAYVASGKTREARIAFENMRRSGLE 662 R YSM GD+EGA+R+FD +Q+AGI PDVK+CGLLINAY SG+++ AR+AFENMR++G++ Sbjct: 278 RDYSMGGDAEGAKRVFDAVQIAGITPDVKLCGLLINAYSVSGQSQNARLAFENMRKAGIK 337 Query: 661 PNDKCVALVLTAYEKEERLKEALDFLIELERDSIMIGKEASDLLVKWFRRLGVVEEVEHV 482 DKCVALVL AYEKEE+L EAL FL+ELE+DSIM+GKEAS +L +WF++LGVVEEVE + Sbjct: 338 ATDKCVALVLAAYEKEEKLNEALGFLVELEKDSIMLGKEASAVLAQWFKKLGVVEEVELL 397 Query: 481 LKDFASRMVEP 449 L++F+S +P Sbjct: 398 LREFSSSQSQP 408 >ref|XP_006418475.1| hypothetical protein EUTSA_v10007755mg [Eutrema salsugineum] gi|557096246|gb|ESQ36828.1| hypothetical protein EUTSA_v10007755mg [Eutrema salsugineum] Length = 414 Score = 325 bits (833), Expect = 2e-86 Identities = 162/245 (66%), Positives = 201/245 (82%) Frame = -1 Query: 1201 EESFEPNIRDYTKIIHGYAKQNKLKEAEHALEAMKKRDFMCDQVTLTALIHMYSKAGNLK 1022 E+SFE N RDYTKIIH Y K N++++AE L AMK R F+ DQVTLTA++ +YSKAG + Sbjct: 163 EDSFEANARDYTKIIHYYGKLNQVEDAEKTLLAMKNRGFLVDQVTLTAMVQLYSKAGYHQ 222 Query: 1021 LAEDTFEEMKLLGVRLDKRSYGSIVMAYIRAGKLTSAEKLLREMKFQEIYAPREVYKALL 842 LAE+TF ++KLLG LD RSYGS++MAYIRAGK E LLREM EI A REVYKALL Sbjct: 223 LAEETFNDIKLLGEALDYRSYGSMIMAYIRAGKPEKGEALLREMDCLEICAGREVYKALL 282 Query: 841 RTYSMLGDSEGAQRIFDEIQVAGIIPDVKVCGLLINAYVASGKTREARIAFENMRRSGLE 662 R YSM GD+EGA+R+FD +Q+AGI PDVK+CGLLINAY SG+++ AR+AFENMR++G++ Sbjct: 283 RAYSMGGDAEGAKRVFDAVQIAGITPDVKLCGLLINAYSVSGQSQNARLAFENMRKAGIK 342 Query: 661 PNDKCVALVLTAYEKEERLKEALDFLIELERDSIMIGKEASDLLVKWFRRLGVVEEVEHV 482 DKCVALVL AYEKEE+L EAL FL+ELE+DSIM+GKEAS +L +WF+ LGVVEEVE V Sbjct: 343 ATDKCVALVLAAYEKEEKLNEALGFLVELEKDSIMVGKEASAVLARWFKELGVVEEVELV 402 Query: 481 LKDFA 467 L++F+ Sbjct: 403 LREFS 407 >ref|XP_006307695.1| hypothetical protein CARUB_v10009324mg [Capsella rubella] gi|482576406|gb|EOA40593.1| hypothetical protein CARUB_v10009324mg [Capsella rubella] Length = 404 Score = 325 bits (832), Expect = 3e-86 Identities = 164/251 (65%), Positives = 201/251 (80%) Frame = -1 Query: 1201 EESFEPNIRDYTKIIHGYAKQNKLKEAEHALEAMKKRDFMCDQVTLTALIHMYSKAGNLK 1022 E+SFE N RDYTKIIH Y K N+++EAE L AMK R F DQVTLTA++ +YSKAG K Sbjct: 153 EDSFEANARDYTKIIHFYGKLNQVEEAERTLLAMKNRLFPIDQVTLTAMVQLYSKAGYHK 212 Query: 1021 LAEDTFEEMKLLGVRLDKRSYGSIVMAYIRAGKLTSAEKLLREMKFQEIYAPREVYKALL 842 LAE+TF E+KLLG LD RSYGS+VMAYIRAG E LLREM QEI A REVYKALL Sbjct: 213 LAEETFNEIKLLGEPLDNRSYGSMVMAYIRAGAPEKGEALLREMDSQEICAGREVYKALL 272 Query: 841 RTYSMLGDSEGAQRIFDEIQVAGIIPDVKVCGLLINAYVASGKTREARIAFENMRRSGLE 662 R YSM GD+EGA+R+FD +Q+AGI PDVK+CGLLINAY G+++ AR+AFENMR++G++ Sbjct: 273 RAYSMSGDAEGAKRVFDAVQIAGITPDVKLCGLLINAYSVLGQSQNARLAFENMRKAGIK 332 Query: 661 PNDKCVALVLTAYEKEERLKEALDFLIELERDSIMIGKEASDLLVKWFRRLGVVEEVEHV 482 DKCVALVL AYEKEE+L EAL FL+ELE++S+M+ KEAS +L +WF++LGVVEEVE V Sbjct: 333 ATDKCVALVLAAYEKEEKLNEALGFLVELEKESVMVEKEASAVLAQWFKKLGVVEEVELV 392 Query: 481 LKDFASRMVEP 449 L++F+S P Sbjct: 393 LREFSSSQSRP 403 >ref|XP_003623723.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355498738|gb|AES79941.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 426 Score = 323 bits (829), Expect = 7e-86 Identities = 159/246 (64%), Positives = 200/246 (81%) Frame = -1 Query: 1201 EESFEPNIRDYTKIIHGYAKQNKLKEAEHALEAMKKRDFMCDQVTLTALIHMYSKAGNLK 1022 EESFEPN+RDYTK+IH Y+K+N+L+ AE+ MK+R F+CDQV LT ++HMYSKAG+L Sbjct: 174 EESFEPNLRDYTKLIHYYSKENQLEAAENIFTLMKQRGFICDQVILTTMVHMYSKAGHLD 233 Query: 1021 LAEDTFEEMKLLGVRLDKRSYGSIVMAYIRAGKLTSAEKLLREMKFQEIYAPREVYKALL 842 AE+ FEE+KLLG LDKRSYGS++MAYIRAG E LL EM Q+IYA EVYKALL Sbjct: 234 RAEEYFEEIKLLGEPLDKRSYGSMIMAYIRAGMPEKGESLLEEMDAQDIYAGSEVYKALL 293 Query: 841 RTYSMLGDSEGAQRIFDEIQVAGIIPDVKVCGLLINAYVASGKTREARIAFENMRRSGLE 662 R YS++G++EGAQR+FD IQ+AGIIPD K+C LLI AY +G++++ARIAFENM+R+G+E Sbjct: 294 RAYSVIGNAEGAQRVFDAIQLAGIIPDDKMCSLLIYAYSMAGQSQKARIAFENMKRAGIE 353 Query: 661 PNDKCVALVLTAYEKEERLKEALDFLIELERDSIMIGKEASDLLVKWFRRLGVVEEVEHV 482 P DKC++ VL AYEKE L AL+FLIELERD IM+ +E S +L WFR+LGVVEEVE V Sbjct: 354 PTDKCISSVLVAYEKENMLNTALEFLIELERDGIMVKEETSRILAGWFRKLGVVEEVELV 413 Query: 481 LKDFAS 464 L+DFA+ Sbjct: 414 LRDFAT 419 >gb|EPS65198.1| hypothetical protein M569_09579 [Genlisea aurea] Length = 436 Score = 322 bits (825), Expect = 2e-85 Identities = 159/245 (64%), Positives = 199/245 (81%), Gaps = 2/245 (0%) Frame = -1 Query: 1201 EESFEPNIRDYTKIIHGYAKQNKLKEAEHALEAMKKRDFMCDQVTLTALIHMYSKAGNLK 1022 EE+FE +RDYTK+IH YA++++L+EAE LE+MK F+CDQV LT+L+HMYSK GNL+ Sbjct: 156 EETFEAGVRDYTKLIHFYARRDQLREAERTLESMKSDGFVCDQVLLTSLLHMYSKNGNLR 215 Query: 1021 LAEDTFEEMKLLGVRLDKRSYGSIVMAYIRAGKLTSAEKLLREMKFQEIYAPREVYKALL 842 AE F EMK LGV LD+RSYG++ MA++RAGKL+ E LLRE + EIYA +EVYKALL Sbjct: 216 RAEAAFGEMKALGVPLDRRSYGAMAMAHVRAGKLSDGEALLREAEALEIYAGKEVYKALL 275 Query: 841 RTYSMLGDSEGAQRIFDEIQVAGIIPDVKVCGLLINAYVASGKTREARIAFENMRRSGLE 662 R YSM GDS GAQRIFD +QV GI+PD K+CGLL+NA+V SG+TREARIAF NM R+G+E Sbjct: 276 RAYSMRGDSRGAQRIFDSMQVVGIVPDAKICGLLMNAFVESGETREARIAFGNMMRAGIE 335 Query: 661 PNDKCVALVLTAYEKEERLKEALDFLIELERD--SIMIGKEASDLLVKWFRRLGVVEEVE 488 PN+KCVALVL+A KEE+L EALD L+ LE D + IG+EAS+LL KWFR +GV++EVE Sbjct: 336 PNEKCVALVLSACRKEEKLSEALDLLVRLEGDGFGLAIGEEASNLLAKWFREMGVLKEVE 395 Query: 487 HVLKD 473 HVL + Sbjct: 396 HVLSE 400 >ref|XP_004134345.1| PREDICTED: pentatricopeptide repeat-containing protein At1g01970-like [Cucumis sativus] gi|449480346|ref|XP_004155867.1| PREDICTED: pentatricopeptide repeat-containing protein At1g01970-like [Cucumis sativus] Length = 404 Score = 320 bits (821), Expect = 6e-85 Identities = 153/249 (61%), Positives = 204/249 (81%) Frame = -1 Query: 1201 EESFEPNIRDYTKIIHGYAKQNKLKEAEHALEAMKKRDFMCDQVTLTALIHMYSKAGNLK 1022 E +FE N RDYTKIIH Y KQN+L++AE L +M++R F+CDQ+TLT +IH+YSKA L Sbjct: 154 EITFEANTRDYTKIIHHYGKQNQLEDAEKVLLSMRERGFVCDQITLTTMIHIYSKADKLN 213 Query: 1021 LAEDTFEEMKLLGVRLDKRSYGSIVMAYIRAGKLTSAEKLLREMKFQEIYAPREVYKALL 842 LA+ TFEE+KLL LDKRS+G+++MAY+RAG EK+L+EM ++IYA EVYKALL Sbjct: 214 LAKQTFEELKLLEQPLDKRSFGAMIMAYVRAGFPEEGEKILKEMDAKDIYAGSEVYKALL 273 Query: 841 RTYSMLGDSEGAQRIFDEIQVAGIIPDVKVCGLLINAYVASGKTREARIAFENMRRSGLE 662 R YSM+G++EGAQR+FD IQ+A I PD K+CGLLINAY+ +G++REA+IAF+NMRR+G+E Sbjct: 274 RAYSMVGNAEGAQRVFDAIQLAAITPDEKLCGLLINAYLMAGQSREAQIAFDNMRRAGIE 333 Query: 661 PNDKCVALVLTAYEKEERLKEALDFLIELERDSIMIGKEASDLLVKWFRRLGVVEEVEHV 482 P+DKC+AL L+AYEKE RL AL+ LI+LE+D++M+GKEAS +L W +RLGVVEEVE V Sbjct: 334 PSDKCIALALSAYEKENRLNSALELLIDLEKDNVMVGKEASKILAAWLKRLGVVEEVEIV 393 Query: 481 LKDFASRMV 455 L+++ + V Sbjct: 394 LREYTEKEV 402 >ref|XP_004492640.1| PREDICTED: pentatricopeptide repeat-containing protein At1g01970-like isoform X1 [Cicer arietinum] gi|502104764|ref|XP_004492641.1| PREDICTED: pentatricopeptide repeat-containing protein At1g01970-like isoform X2 [Cicer arietinum] Length = 425 Score = 320 bits (819), Expect = 1e-84 Identities = 158/246 (64%), Positives = 199/246 (80%) Frame = -1 Query: 1201 EESFEPNIRDYTKIIHGYAKQNKLKEAEHALEAMKKRDFMCDQVTLTALIHMYSKAGNLK 1022 EESFEPN+RDYTK+IH Y+K+N+L+ AE+ MK+R F+CDQV LT ++HMYSKAG+L Sbjct: 175 EESFEPNLRDYTKLIHYYSKENQLEAAENIFTTMKQRGFICDQVILTTMVHMYSKAGHLD 234 Query: 1021 LAEDTFEEMKLLGVRLDKRSYGSIVMAYIRAGKLTSAEKLLREMKFQEIYAPREVYKALL 842 AE+ FEE+KLLG +LDKRSYGS++MAYIRAG E LL EM QEIYA EVYKALL Sbjct: 235 RAEEYFEEIKLLGEQLDKRSYGSMIMAYIRAGMPEQGESLLEEMDAQEIYAGSEVYKALL 294 Query: 841 RTYSMLGDSEGAQRIFDEIQVAGIIPDVKVCGLLINAYVASGKTREARIAFENMRRSGLE 662 R YS G++EGAQR+FD IQ+AGI PD K+C LLI AY +G++++A+IAFENM+++G+E Sbjct: 295 RAYSGSGNAEGAQRVFDAIQLAGITPDDKMCSLLIYAYGMAGQSQKAQIAFENMKKAGIE 354 Query: 661 PNDKCVALVLTAYEKEERLKEALDFLIELERDSIMIGKEASDLLVKWFRRLGVVEEVEHV 482 P DKC++LVL AYEKE L AL FLI+LERD IM+G+E S +L WFR+LGVVEEVE V Sbjct: 355 PTDKCISLVLFAYEKENMLDTALAFLIDLERDGIMVGEETSRILAGWFRKLGVVEEVELV 414 Query: 481 LKDFAS 464 L+DFA+ Sbjct: 415 LRDFAT 420 >ref|XP_003552343.1| PREDICTED: pentatricopeptide repeat-containing protein At1g01970-like isoform X1 [Glycine max] gi|571548118|ref|XP_006602756.1| PREDICTED: pentatricopeptide repeat-containing protein At1g01970-like isoform X2 [Glycine max] Length = 414 Score = 316 bits (810), Expect = 1e-83 Identities = 157/246 (63%), Positives = 199/246 (80%) Frame = -1 Query: 1201 EESFEPNIRDYTKIIHGYAKQNKLKEAEHALEAMKKRDFMCDQVTLTALIHMYSKAGNLK 1022 EESFE NIRDYTKIIH Y + N L++AE L MK+R F+ DQV LT ++HMYSKAGN Sbjct: 164 EESFEVNIRDYTKIIHYYGEHNLLEDAEKFLTLMKQRGFIYDQVILTTMVHMYSKAGNHD 223 Query: 1021 LAEDTFEEMKLLGVRLDKRSYGSIVMAYIRAGKLTSAEKLLREMKFQEIYAPREVYKALL 842 A++ FEE+KLLG LDKRSYGS++MAYIRAG E LL+EM+ QEI A EVYKALL Sbjct: 224 RAKEYFEEIKLLGKPLDKRSYGSMIMAYIRAGMPEEGENLLQEMEAQEILAGSEVYKALL 283 Query: 841 RTYSMLGDSEGAQRIFDEIQVAGIIPDVKVCGLLINAYVASGKTREARIAFENMRRSGLE 662 R YSM+G++EGAQR+FD IQ+AGI PD K+C L++NAYV +G++++A IAFENMRR+G++ Sbjct: 284 RAYSMIGNAEGAQRVFDAIQLAGITPDDKICSLVVNAYVMAGQSQKALIAFENMRRAGIK 343 Query: 661 PNDKCVALVLTAYEKEERLKEALDFLIELERDSIMIGKEASDLLVKWFRRLGVVEEVEHV 482 P+DKC+A VL AYEKE ++ AL+FLI+LERD IM+ +EAS +L KWFR+LGVVEEVE V Sbjct: 344 PSDKCIASVLVAYEKESKINTALEFLIDLERDGIMVEEEASAVLAKWFRKLGVVEEVELV 403 Query: 481 LKDFAS 464 L+DF + Sbjct: 404 LRDFVT 409 >ref|XP_003533639.2| PREDICTED: pentatricopeptide repeat-containing protein At1g01970-like [Glycine max] Length = 415 Score = 315 bits (807), Expect = 2e-83 Identities = 156/245 (63%), Positives = 198/245 (80%) Frame = -1 Query: 1201 EESFEPNIRDYTKIIHGYAKQNKLKEAEHALEAMKKRDFMCDQVTLTALIHMYSKAGNLK 1022 EESFE NIRDYTKIIH Y + N L++AE L MK+R F+ DQV LT ++HM SKAGN Sbjct: 164 EESFEVNIRDYTKIIHYYGEHNLLEDAEKFLTLMKQRGFIYDQVILTTMVHMSSKAGNHD 223 Query: 1021 LAEDTFEEMKLLGVRLDKRSYGSIVMAYIRAGKLTSAEKLLREMKFQEIYAPREVYKALL 842 A++ FEE+KLLG LDKRSYGS++MAYIRAG E LL++M+ QEI A E+YKALL Sbjct: 224 RAKEYFEEIKLLGEPLDKRSYGSMIMAYIRAGMPEEGENLLQQMEAQEILAGSEIYKALL 283 Query: 841 RTYSMLGDSEGAQRIFDEIQVAGIIPDVKVCGLLINAYVASGKTREARIAFENMRRSGLE 662 R YSM+G++EGAQR+FD IQ+AGI PD K+C LL+NAY +G++++A IAFENMRR+G++ Sbjct: 284 RAYSMIGNAEGAQRVFDAIQLAGITPDDKICSLLVNAYAMAGQSQKALIAFENMRRAGIK 343 Query: 661 PNDKCVALVLTAYEKEERLKEALDFLIELERDSIMIGKEASDLLVKWFRRLGVVEEVEHV 482 P+DKC+A VL AYEKE ++ AL+FLI+LERD IM+G+EAS +L KWFR+LGVVEEVE V Sbjct: 344 PSDKCIASVLVAYEKESKINTALEFLIDLERDGIMVGEEASAVLAKWFRKLGVVEEVELV 403 Query: 481 LKDFA 467 L+DFA Sbjct: 404 LRDFA 408