BLASTX nr result
ID: Mentha28_contig00014453
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00014453 (1378 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gb|EYU44305.1| hypothetical protein MIMGU_mgv1a003952mg [Mimulus...  568   e-159
ref|XP_002269078.1| PREDICTED: pentatricopeptide repeat-containi...  480   e-133
emb|CBI17575.3| unnamed protein product [Vitis vinifera]             471   e-130
ref|XP_006366541.1| PREDICTED: pentatricopeptide repeat-containi...  469   e-129
ref|XP_007047032.1| Tetratricopeptide repeat-like superfamily pr...  458   e-126
ref|XP_004228517.1| PREDICTED: pentatricopeptide repeat-containi...  456   e-125
ref|XP_006466716.1| PREDICTED: pentatricopeptide repeat-containi...  449   e-123
ref|XP_006383251.1| hypothetical protein POPTR_0005s12880g [Popu...  446   e-122
ref|XP_007204324.1| hypothetical protein PRUPE_ppa004294mg [Prun...  441   e-121
ref|XP_006425741.1| hypothetical protein CICLE_v10027297mg [Citr...  438   e-120
ref|XP_002525094.1| pentatricopeptide repeat-containing protein,...  425   e-116
ref|XP_007160534.1| hypothetical protein PHAVU_002G329700g [Phas...  413   e-113
ref|XP_003524343.2| PREDICTED: pentatricopeptide repeat-containi...  412   e-112
ref|XP_004503279.1| PREDICTED: pentatricopeptide repeat-containi...  407   e-111
ref|XP_003631109.1| Pentatricopeptide repeat-containing protein ...  406   e-110
ref|XP_004165472.1| PREDICTED: pentatricopeptide repeat-containi...  395   e-107
ref|XP_004148464.1| PREDICTED: pentatricopeptide repeat-containi...  390   e-106
ref|NP_178248.2| pentatricopeptide repeat-containing protein [Ar...  380   e-103
ref|XP_006398525.1| hypothetical protein EUTSA_v10000833mg [Eutr... 
380 e-102 ref|XP_006292569.1| hypothetical protein CARUB_v10018804mg [Caps... 374 e-101 >gb|EYU44305.1| hypothetical protein MIMGU_mgv1a003952mg [Mimulus guttatus] Length = 552 Score = 568 bits (1464), Expect = e-159 Identities = 285/414 (68%), Positives = 344/414 (83%), Gaps = 1/414 (0%) Frame = +2 Query: 2 DVVTYTSMMHWISNDGDVAGAIDLWKEMRAKGCRPTVVSYTAYMKILLDSKRVNVASDVY 181 DVVTYTS+MHW+SN GD GA+ LW+EM+AKGCRPTVVSYTAYMKIL K+V+ A+DVY Sbjct: 142 DVVTYTSIMHWLSNGGDFDGAVSLWEEMKAKGCRPTVVSYTAYMKILFGRKKVDEAADVY 201 Query: 182 KEMLESGLSPNCHTYTVLMEYLASSGKFSEALEIFDKMQHAGVQPDKATCNILINVCCIT 361 KEMLESG+ PNC+TYTVLME+LASSGKF EALEIFDKMQ A VQPDKA+CNILI + C T Sbjct: 202 KEMLESGVKPNCYTYTVLMEFLASSGKFDEALEIFDKMQEASVQPDKASCNILIEISCKT 261 Query: 362 GEIWAMVKVLEYMKQNFLVLRNPIYQKALETFKLAGESDALLREVNRHFSSEYSNEGTND 541 GEI AM K+LEYMK NF VLR PIYQKALE FK+AGESD LLR+VNRHFS+E E ND Sbjct: 262 GEIRAMTKILEYMKDNFFVLRYPIYQKALEAFKIAGESDMLLRQVNRHFSAE---ENLND 318 Query: 542 VDAVFH-NSEFDVDNGLVLNFMNKQNLLAVDSLLTDLAEKGVQLDSKVISNIIEVNITRR 718 ++ +++F ++N LVLN +NK++L+ VDSLL+D+AEKGV+L+S V+S +IEVN + Sbjct: 319 EPEKYNLDNDFALENSLVLNLLNKRSLVGVDSLLSDMAEKGVRLESGVVSKVIEVNSACQ 378 Query: 719 RQNGALLAYDYSVKMGINIERTAYLALIGFSIRVNSFHKVVEIVEKMSKQGLTLGTQLNS 898 RQNGALLA +YSVK GI+IERTAYLALIG +R++SF KVVEI EK++ QG +LGT LNS Sbjct: 379 RQNGALLACEYSVKSGIHIERTAYLALIGLFVRISSFPKVVEIFEKIAMQGFSLGTHLNS 438 Query: 899 LLIYRLGCGGEAAAAEKVFDFLPEKEKSCAEYTALISAYFSCGNSGKGIETFNIMKSKGV 1078 LIYRLGCG EA +A KVFD LPEKE+SCAEYTALI AYFS GN+ +ETF++MK GV Sbjct: 439 TLIYRLGCGEEATSAAKVFDLLPEKERSCAEYTALIGAYFSSGNADMALETFDVMKKAGV 498 Query: 1079 NVALGTYRVLIAGLEKCGKSREVEYYKKEKMRLQMEGCTTNISIEETICNLLFS 1240 V LGTY VL+AGLEKCGK+RE++YY+KEK RL+ G + N+S+EETICNLLF+ Sbjct: 499 KVVLGTYCVLLAGLEKCGKAREMDYYRKEKKRLE-NGRSFNVSMEETICNLLFA 551 >ref|XP_002269078.1| PREDICTED: pentatricopeptide repeat-containing protein At2g01390-like [Vitis vinifera] Length = 519 Score = 480 bits (1235), Expect = e-133 Identities = 246/419 (58%), Positives = 306/419 (73%), Gaps = 2/419 (0%) Frame = 
+2 Query: 2 DVVTYTSMMHWISNDGDVAGAIDLWKEMRAKGCRPTVVSYTAYMKILLDSKRVNVASDVY 181 D VTYTS+MHW SNDGD+ A+ +WKEM+AKGC TVVSYTAYMKIL D+KRV A+DVY Sbjct: 100 DAVTYTSLMHWFSNDGDIERAVRVWKEMKAKGCCLTVVSYTAYMKILFDNKRVKEAADVY 159 Query: 182 KEMLESGLSPNCHTYTVLMEYLASSGKFSEALEIFDKMQHAGVQPDKATCNILINVCCIT 361 KEML+SG +PNC+TYTVLME+L SGK+ ALEIF +MQ AGVQPDKATCNILI C T Sbjct: 160 KEMLQSGCAPNCYTYTVLMEHLTGSGKYKAALEIFSRMQEAGVQPDKATCNILIEKFCKT 219 Query: 362 GEIWAMVKVLEYMKQNFLVLRNPIYQKALETFKLAGESDALLREVNRHFSSEYSN-EGTN 538 G+ WA+ ++L YMK NFLVLR P+Y +AL+T K+AGESD LLR+VN H + S+ E Sbjct: 220 GQTWAITQILLYMKDNFLVLRYPVYLEALQTLKVAGESDILLRQVNPHLNVGLSSKEEIV 279 Query: 539 DVDAVFHNSEFDVDNGLVLNFMNKQNLLAVDSLLTDLAEKGVQLDSKVISNIIEVNITRR 718 + + +D G VL F+ KQN A+D LLT + K +LDS +IS IIEVN Sbjct: 280 EFKETVADVHSTIDRGFVLLFLTKQNFYAIDCLLTGMIHKNTRLDSVIISQIIEVNCAHC 339 Query: 719 RQNGALLAYDYSVKMGINIERTAYLALIGFSIRVNSFHKVVEIVEKMSKQGLTLGTQLNS 898 R NGALLA++YSVKMGINIER AYLAL+G IR NSF KV +IVEKM + G++LG L + Sbjct: 340 RINGALLAFEYSVKMGINIERIAYLALMGVFIRANSFPKVADIVEKMVRAGISLGMYLGA 399 Query: 899 LLIYRLGCGGEAAAAEKVFDFLPEKEKSCAEYTALISAYFSCGNSGKGIETFNIMKSKGV 1078 LLIYRLGC +A KVF LP+ +K A YTALISAYFS GN KG++ + M+ K + Sbjct: 400 LLIYRLGCARRPGSAAKVFGLLPDDQKGTAAYTALISAYFSSGNVDKGLKIYKTMQRKRI 459 Query: 1079 NVALGTYRVLIAGLEKCGKSREVEYYKKEKMRLQMEGCTTNI-SIEETICNLLFSGDFI 1252 + ALGTY +L+AGLEK G++RE E Y+KEK L G + +I S++E ICNLLFSG+ + Sbjct: 460 HPALGTYNLLLAGLEKKGRAREAEIYRKEKKTLHTHGHSQDIVSMDEKICNLLFSGNLV 518 >emb|CBI17575.3| unnamed protein product [Vitis vinifera] Length = 656 Score = 471 bits (1213), Expect = e-130 Identities = 246/430 (57%), Positives = 306/430 (71%), Gaps = 13/430 (3%) Frame = +2 Query: 2 DVVTYTSMMHWISNDGDVAGAIDLWKEMRAKGCRPTVVSYTAYMKILLDSKRVNVASDVY 181 D VTYTS+MHW SNDGD+ A+ +WKEM+AKGC TVVSYTAYMKIL D+KRV A+DVY Sbjct: 226 DAVTYTSLMHWFSNDGDIERAVRVWKEMKAKGCCLTVVSYTAYMKILFDNKRVKEAADVY 285 Query: 182 KEMLESGLSPNCHTYTVLMEYLASSGK-----------FSEALEIFDKMQHAGVQPDKAT 328 KEML+SG +PNC+TYTVLME+L SGK + ALEIF +MQ AGVQPDKAT Sbjct: 286 
KEMLQSGCAPNCYTYTVLMEHLTGSGKHLLQFSCIACKYKAALEIFSRMQEAGVQPDKAT 345 Query: 329 CNILINVCCITGEIWAMVKVLEYMKQNFLVLRNPIYQKALETFKLAGESDALLREVNRHF 508 CNILI C TG+ WA+ ++L YMK NFLVLR P+Y +AL+T K+AGESD LLR+VN H Sbjct: 346 CNILIEKFCKTGQTWAITQILLYMKDNFLVLRYPVYLEALQTLKVAGESDILLRQVNPHL 405 Query: 509 SSEYSN-EGTNDVDAVFHNSEFDVDNGLVLNFMNKQNLLAVDSLLTDLAEKGVQLDSKVI 685 + S+ E + + +D G VL F+ KQN A+D LLT + K +LDS +I Sbjct: 406 NVGLSSKEEIVEFKETVADVHSTIDRGFVLLFLTKQNFYAIDCLLTGMIHKNTRLDSVII 465 Query: 686 SNIIEVNITRRRQNGALLAYDYSVKMGINIERTAYLALIGFSIRVNSFHKVVEIVEKMSK 865 S IIEVN R NGALLA++YSVKMGINIER AYLAL+G IR NSF KV +IVEKM + Sbjct: 466 SQIIEVNCAHCRINGALLAFEYSVKMGINIERIAYLALMGVFIRANSFPKVADIVEKMVR 525 Query: 866 QGLTLGTQLNSLLIYRLGCGGEAAAAEKVFDFLPEKEKSCAEYTALISAYFSCGNSGKGI 1045 G++LG L +LLIYRLGC +A KVF LP+ +K A YTALISAYFS GN KG+ Sbjct: 526 AGISLGMYLGALLIYRLGCARRPGSAAKVFGLLPDDQKGTAAYTALISAYFSSGNVDKGL 585 Query: 1046 ETFNIMKSKGVNVALGTYRVLIAGLEKCGKSREVEYYKKEKMRLQMEGCTTNI-SIEETI 1222 + + M+ K ++ ALGTY +L+AGLEK G++RE E Y+KEK L G + +I S++E I Sbjct: 586 KIYKTMQRKRIHPALGTYNLLLAGLEKKGRAREAEIYRKEKKTLHTHGHSQDIVSMDEKI 645 Query: 1223 CNLLFSGDFI 1252 CNLLFSG+ + Sbjct: 646 CNLLFSGNLV 655 >ref|XP_006366541.1| PREDICTED: pentatricopeptide repeat-containing protein At2g01390-like isoform X1 [Solanum tuberosum] gi|565402130|ref|XP_006366542.1| PREDICTED: pentatricopeptide repeat-containing protein At2g01390-like isoform X2 [Solanum tuberosum] Length = 591 Score = 469 bits (1207), Expect = e-129 Identities = 231/417 (55%), Positives = 309/417 (74%), Gaps = 1/417 (0%) Frame = +2 Query: 2 DVVTYTSMMHWISNDGDVAGAIDLWKEMRAKGCRPTVVSYTAYMKILLDSKRVNVASDVY 181 D VTYTS++HW+S+ GD+ +I LW++M+ KGC P VV YTAYMK L D RV + +Y Sbjct: 173 DAVTYTSLLHWLSDHGDINESIKLWQDMKDKGCVPNVVCYTAYMKGLFDHNRVKEGAKIY 232 Query: 182 KEMLESGLSPNCHTYTVLMEYLASSGKFSEALEIFDKMQHAGVQPDKATCNILINVCCIT 361 KEML+SG SPNCHTYTVLME+LASSGKF ALEIF KMQ AGVQPDKATCNIL+ CC Sbjct: 233 KEMLQSGCSPNCHTYTVLMEHLASSGKFDGALEIFSKMQDAGVQPDKATCNILVGKCCKA 292 Query: 362 
GEIWAMVKVLEYMKQNFLVLRNPIYQKALETFKLAGESDALLREVNRHFSSEYSNEGTND 541 GE AM+K+L YMK+NFLVLR +YQ+A T K+AG SD LLR+VNRH S + N+ D Sbjct: 293 GETQAMMKILHYMKENFLVLRYSVYQEAFHTLKMAGVSDRLLRDVNRHLSLQNFNQDLID 352 Query: 542 V-DAVFHNSEFDVDNGLVLNFMNKQNLLAVDSLLTDLAEKGVQLDSKVISNIIEVNITRR 718 D + +S F +D+ ++L +NK++LLAVD LL L K ++LD +++S ++EVN + Sbjct: 353 ESDGIAESSCFTLDDRMILYLLNKKSLLAVDYLLDSLMNKRLKLDPRIVSTVVEVNCSCG 412 Query: 719 RQNGALLAYDYSVKMGINIERTAYLALIGFSIRVNSFHKVVEIVEKMSKQGLTLGTQLNS 898 R NGA LA++ SVK+GI IER YL ++G IR NSF +VV+IVE M GL+LG++L + Sbjct: 413 RVNGAFLAFEVSVKLGITIERITYLTMVGELIRTNSFSRVVDIVEAMVGAGLSLGSELTA 472 Query: 899 LLIYRLGCGGEAAAAEKVFDFLPEKEKSCAEYTALISAYFSCGNSGKGIETFNIMKSKGV 1078 LLI+RLGC E A+AEK+F LP+++KS A YTALI+ YF+ GN+ KG+E F M+ +G+ Sbjct: 473 LLIHRLGCAREPASAEKLFSILPDQQKSIAVYTALINTYFTFGNADKGLEIFETMRKQGI 532 Query: 1079 NVALGTYRVLIAGLEKCGKSREVEYYKKEKMRLQMEGCTTNISIEETICNLLFSGDF 1249 N+AL TY VL+ GLE+ G +V+ Y+KEK R + E C+ +S+EET+C+LLF+G+F Sbjct: 533 NLALSTYCVLLNGLERNGLFDKVQSYRKEKKRFESESCSREMSMEETVCDLLFAGNF 589 >ref|XP_007047032.1| Tetratricopeptide repeat-like superfamily protein [Theobroma cacao] gi|508699293|gb|EOX91189.1| Tetratricopeptide repeat-like superfamily protein [Theobroma cacao] Length = 577 Score = 458 bits (1178), Expect = e-126 Identities = 231/419 (55%), Positives = 301/419 (71%), Gaps = 2/419 (0%) Frame = +2 Query: 2 DVVTYTSMMHWISNDGDVAGAIDLWKEMRAKGCRPTVVSYTAYMKILLDSKRVNVASDVY 181 D V YTS++HW+S DV GA+++W+EMR KGC PTVVSYTAYMK+L D+KRV +DVY Sbjct: 158 DAVAYTSVLHWLSKSRDVDGAVEMWEEMRGKGCFPTVVSYTAYMKVLFDNKRVKEGTDVY 217 Query: 182 KEMLESGLSPNCHTYTVLMEYLASSGKFSEALEIFDKMQHAGVQPDKATCNILINVCCIT 361 KEML+SG+SPNCHTYTVLMEYL +GK EALEIF+KMQ AGV+PDKA CNIL+ CC Sbjct: 218 KEMLQSGISPNCHTYTVLMEYLFEAGKSEEALEIFNKMQEAGVKPDKAACNILVEKCCKA 277 Query: 362 GEIWAMVKVLEYMKQNFLVLRNPIYQKALETFKLAGESDALLREVNRHFSSE-YSNEGTN 538 E AM ++L+YMK+N+LVLR PI+ +ALETFK+AGES+ LLREV+ H S E NE Sbjct: 278 AETRAMTQILQYMKENYLVLRYPIFLEALETFKVAGESNVLLREVHPHISVECIGNETEA 337 Query: 539 
DVDAVFHNSEFDVDNGLVLNFMNKQNLLAVDSLLTDLAEKGVQLDSKVISNIIEVNITRR 718 + + D GL+ + KQNLLA+DSLLT+L +K ++LDS++IS II++N Sbjct: 338 EYKGNASEAPLSFDRGLMWALLKKQNLLAIDSLLTELMDKNIRLDSEMISTIIDINCNHC 397 Query: 719 RQNGALLAYDYSVKMGINIERTAYLALIGFSIRVNSFHKVVEIVEKMSKQGLTLGTQLNS 898 R +G+LLA+ YSVK GIN+ER AYL LIG IR N+F VVEIV +M++ G + G L S Sbjct: 398 RLDGSLLAFKYSVKAGINLERAAYLTLIGSLIRSNTFTDVVEIVVEMTRAGHSPGVYLGS 457 Query: 899 LLIYRLGCGGEAAAAEKVFDFLPEKEKSCAEYTALISAYFSCGNSGKGIETFNIMKSKGV 1078 LLIYRLGC A K+F+ LP+ +K A YTAL+ YF+ G + KG++ + M+SKG+ Sbjct: 458 LLIYRLGCARRPTCAAKIFNLLPDDQKCVATYTALVGVYFAAGTADKGLKIYKTMRSKGI 517 Query: 1079 NVALGTYRVLIAGLEKCGKSREVEYYKKEKMRLQMEG-CTTNISIEETICNLLFSGDFI 1252 + +LGTY VL+AGLEK G+ E Y+KEK LQ + +I IEE IC+LLF+ D + Sbjct: 518 SPSLGTYCVLLAGLEKLGRVSTAETYRKEKKSLQKDAYFRESIPIEEKICDLLFARDVV 576 >ref|XP_004228517.1| PREDICTED: pentatricopeptide repeat-containing protein At2g01390-like [Solanum lycopersicum] Length = 591 Score = 456 bits (1173), Expect = e-125 Identities = 224/417 (53%), Positives = 305/417 (73%), Gaps = 1/417 (0%) Frame = +2 Query: 2 DVVTYTSMMHWISNDGDVAGAIDLWKEMRAKGCRPTVVSYTAYMKILLDSKRVNVASDVY 181 D VTYTS++HW+S+ GD+ +I LW++M+ KGC P VV YTAYMK L D RV + +Y Sbjct: 173 DAVTYTSLLHWLSDHGDINESIKLWQDMKDKGCAPNVVCYTAYMKGLFDHNRVKEGAKIY 232 Query: 182 KEMLESGLSPNCHTYTVLMEYLASSGKFSEALEIFDKMQHAGVQPDKATCNILINVCCIT 361 KEML+SG SPNCHTYTVLME+LA SGKF LEIF KMQ AGVQPDKATCNIL+ CC Sbjct: 233 KEMLQSGCSPNCHTYTVLMEHLAKSGKFDGVLEIFSKMQDAGVQPDKATCNILVGKCCKA 292 Query: 362 GEIWAMVKVLEYMKQNFLVLRNPIYQKALETFKLAGESDALLREVNRHFS-SEYSNEGTN 538 GE AM+K+L YMK+NFLVLR +YQ+A +T K+AG SD LLR+VNRH S ++ + + Sbjct: 293 GETQAMMKILHYMKENFLVLRYSVYQEAFQTLKMAGVSDRLLRDVNRHLSLQNFNQDQID 352 Query: 539 DVDAVFHNSEFDVDNGLVLNFMNKQNLLAVDSLLTDLAEKGVQLDSKVISNIIEVNITRR 718 + D + +S F +D+ +VL +NK++LLAVD LL L K ++L ++S ++EVN + Sbjct: 353 ESDGIAESSCFTLDDRMVLYLLNKKSLLAVDYLLDGLMNKRLKLYPGIVSTVVEVNCSCG 412 Query: 719 RQNGALLAYDYSVKMGINIERTAYLALIGFSIRVNSFHKVVEIVEKMSKQGLTLGTQLNS 898 R NGA LA+ S+K+GI I+R YL ++G IR NSF 
+VV+IVE M GL+LG++L + Sbjct: 413 RVNGAFLAFKVSMKLGITIDRITYLTMVGELIRANSFSRVVDIVEVMVGAGLSLGSELTA 472 Query: 899 LLIYRLGCGGEAAAAEKVFDFLPEKEKSCAEYTALISAYFSCGNSGKGIETFNIMKSKGV 1078 LLI+RLGC A+AEK+F LP+++KS A YTALI+ YF+ GN+ KG+E F M+ +G+ Sbjct: 473 LLIHRLGCARAPASAEKLFSILPDQQKSIAVYTALINTYFTFGNADKGLEIFETMRKQGI 532 Query: 1079 NVALGTYRVLIAGLEKCGKSREVEYYKKEKMRLQMEGCTTNISIEETICNLLFSGDF 1249 N+AL TY VL+ GLE+ G +++ Y+KEK R + E C+ +S+EETIC+LLF+G+F Sbjct: 533 NLALSTYCVLLNGLERNGLFDKLQSYRKEKKRFESESCSREMSMEETICDLLFAGNF 589 >ref|XP_006466716.1| PREDICTED: pentatricopeptide repeat-containing protein At2g01390-like [Citrus sinensis] Length = 582 Score = 449 bits (1156), Expect = e-123 Identities = 225/425 (52%), Positives = 306/425 (72%), Gaps = 7/425 (1%) Frame = +2 Query: 2 DVVTYTSMMHWISNDGDVAGAIDLWKEMRAKGCRPTVVSYTAYMKILLDSKRVNVASDVY 181 D VTYTS+MHW+SN GDV GA+++W+EM+ K C PTVVSYTAYMKIL + RV A+DVY Sbjct: 163 DAVTYTSVMHWLSNAGDVDGAVNIWEEMKLKECYPTVVSYTAYMKILFLNDRVKEATDVY 222 Query: 182 KEMLESGLSPNCHTYTVLMEYLASSGKFSEALEIFDKMQHAGVQPDKATCNILINVCCIT 361 KEM++ GL PNC+TYTVLMEYL +GK+ EALEIF KMQ AGVQPDKA CNILI CC Sbjct: 223 KEMIQRGLPPNCYTYTVLMEYLVRAGKYEEALEIFSKMQEAGVQPDKAACNILIEKCCKA 282 Query: 362 GEIWAMVKVLEYMKQNFLVLRNPIYQKALETFKLAGESDALLREVNRHFSSEYSNEG--- 532 GE ++ +L YMK+N L LR P++++AL+TFK+A E+D+LL +V+ FS E+ ++ Sbjct: 283 GETRTIILILRYMKENRLALRYPVFKEALQTFKVADENDSLLWQVHPQFSPEFISDNDAV 342 Query: 533 ---TNDVDAVFHNSEFDVDNGLVLNFMNKQNLLAVDSLLTDLAEKGVQLDSKVISNIIEV 703 T D++ +D GLVL + K+NL+A+D LL+ + +K +QLDS VIS IIEV Sbjct: 343 EFVTTDIE-----GPLSIDQGLVLILLKKKNLVAIDCLLSGIMDKNIQLDSAVISTIIEV 397 Query: 704 NITRRRQNGALLAYDYSVKMGINIERTAYLALIGFSIRVNSFHKVVEIVEKMSKQGLTLG 883 N RR++GALLA++YSVKM +N+ERTAYLALIG I++N+F KV EIVE+M+K G +LG Sbjct: 398 NCDHRRRDGALLAFEYSVKMDLNLERTAYLALIGILIKLNTFPKVAEIVEEMTKAGHSLG 457 Query: 884 TQLNSLLIYRLGCGGEAAAAEKVFDFLPEKEKSCAEYTALISAYFSCGNSGKGIETFNIM 1063 L +LLI+RLG A K+F LPE +K A YTALI YFS G++ K ++ + M Sbjct: 458 VYLGALLIHRLGSARRPVPAAKIFSLLPEDQKCTATYTALIGVYFSAGSADKALKIYKTM 517 Query: 
1064 KSKGVNVALGTYRVLIAGLEKCGKSREVEYYKKEKMRLQMEGCTTN-ISIEETICNLLFS 1240 KG++ +LGT+ VL+AGLEK G+ + E Y+KEK +Q + + + + +EE IC+LL+ Sbjct: 518 CRKGIHPSLGTFNVLLAGLEKLGRVSDAEIYRKEKKSIQADALSKDTVPMEEKICDLLYG 577 Query: 1241 GDFII 1255 GD ++ Sbjct: 578 GDGVL 582 >ref|XP_006383251.1| hypothetical protein POPTR_0005s12880g [Populus trichocarpa] gi|550338833|gb|ERP61048.1| hypothetical protein POPTR_0005s12880g [Populus trichocarpa] Length = 517 Score = 446 bits (1147), Expect = e-122 Identities = 229/421 (54%), Positives = 297/421 (70%), Gaps = 4/421 (0%) Frame = +2 Query: 2 DVVTYTSMMHWISNDGDVAGAIDLWKEMRAKGCRPTVVSYTAYMKILLDSKRVNVASDVY 181 DVVTYTS+++W+S GDV GA+ +WKEMR C PTVVSYTAY+K+L D+KRV DVY Sbjct: 100 DVVTYTSILNWVSKSGDVDGAVKIWKEMRENMCFPTVVSYTAYLKVLFDNKRVKEGIDVY 159 Query: 182 KEMLESGLSPNCHTYTVLMEYLASSGKFSEALEIFDKMQHAGVQPDKATCNILINVCCIT 361 KEMLESG+SPNCHTYTVLME+L +GK+ E LEIF KMQ AGVQPDKA CNIL+ CC Sbjct: 160 KEMLESGISPNCHTYTVLMEHLVVTGKYQETLEIFSKMQEAGVQPDKAACNILVERCCKA 219 Query: 362 GEIWAMVKVLEYMKQNFLVLRNPIYQKALETFKLAGESDALLREVNRHFSSEYSNEGTND 541 GE M +L+YMKQN LVLR P++ +ALET K AGESDALLR+VN H + E D Sbjct: 220 GETTTMTHILQYMKQNHLVLRYPVFMEALETLKDAGESDALLRKVNPHIDT----ESIGD 275 Query: 542 VDA---VFHNSEFDVDNGLVLNFMNKQNLLAVDSLLTDLAEKGVQLDSKVISNIIEVNIT 712 VDA + + +D GLVL + KQNL+AVD LL + +K + LDS++++ IIE NI Sbjct: 276 VDAFETMTTVGDDALDGGLVLILLRKQNLVAVDHLLAGIMDKNILLDSRIVATIIERNID 335 Query: 713 RRRQNGALLAYDYSVKMGINIERTAYLALIGFSIRVNSFHKVVEIVEKMSKQGLTLGTQL 892 +R +GALLA++YS+KMGI +ERT+YLALIG SIR ++F KVV+I EKM+ G +LG Sbjct: 336 HQRPDGALLAFEYSMKMGIQLERTSYLALIGMSIRSDTFLKVVDIAEKMTVAGHSLGVYQ 395 Query: 893 NSLLIYRLGCGGEAAAAEKVFDFLPEKEKSCAEYTALISAYFSCGNSGKGIETFNIMKSK 1072 +LLIYRLGC A K+FD LPE +K A YTAL+S +FS G+ K ++ + MK + Sbjct: 396 AALLIYRLGCAKRPTCAVKIFDLLPEGQKCTATYTALVSVFFSAGSPQKALQIYENMKRE 455 Query: 1073 GVNVALGTYRVLIAGLEKCGKSREVEYYKKEKMRLQMEGCTTN-ISIEETICNLLFSGDF 1249 G++ +LGTY VL+AGLE G+ E + Y+KEK L + N + + + ICNLLF+ Sbjct: 456 GIHPSLGTYNVLLAGLESSGRISEAKTYRKEKKGLLINNHHQNSVPMGQKICNLLFASHL 515 
Query: 1250 I 1252 + Sbjct: 516 V 516 >ref|XP_007204324.1| hypothetical protein PRUPE_ppa004294mg [Prunus persica] gi|462399855|gb|EMJ05523.1| hypothetical protein PRUPE_ppa004294mg [Prunus persica] Length = 518 Score = 441 bits (1133), Expect = e-121 Identities = 226/418 (54%), Positives = 298/418 (71%), Gaps = 1/418 (0%) Frame = +2 Query: 2 DVVTYTSMMHWISNDGDVAGAIDLWKEMRAKGCRPTVVSYTAYMKILLDSKRVNVASDVY 181 D TYTS+MHW+S+ GDV GA+ +W+EMRA+GC PTVVSYTAYMK+L + RV A+DVY Sbjct: 100 DAFTYTSLMHWLSSAGDVDGALKVWEEMRAQGCVPTVVSYTAYMKVLFNDNRVKEAADVY 159 Query: 182 KEMLESGLSPNCHTYTVLMEYLASSGKFSEALEIFDKMQHAGVQPDKATCNILINVCCIT 361 KEML+SG SP CHTYTVLMEYL SGK EALEIF KMQ AGVQPDKA CNILI C Sbjct: 160 KEMLQSGCSPTCHTYTVLMEYLIGSGKCKEALEIFGKMQDAGVQPDKAACNILIENLCKV 219 Query: 362 GEIWAMVKVLEYMKQNFLVLRNPIYQKALETFKLAGESDALLREVNRHFSSEYSNEGTND 541 GE W M +VL +MK++ L LR P++ +A+ T K+AG SD+LLR+V+ HFS E N+ T + Sbjct: 220 GETWTMNQVLWFMKEHRLALRYPVFLEAIRTLKIAGVSDSLLRQVHPHFSIESGNQETGE 279 Query: 542 VDAVFHNSEFDVDNGLVLNFMNKQNLLAVDSLLTDLAEKGVQLDSKVISNIIEVNITRRR 721 + A ++ +D LVL + K+NL+A+D LL K ++L+S +IS IIEVN R Sbjct: 280 LRATAADAPSTMDEWLVLILLKKENLVAIDHLLAGNEGKNIKLNSAIISTIIEVNCGLCR 339 Query: 722 QNGALLAYDYSVKMGINIERTAYLALIGFSIRVNSFHKVVEIVEKMSKQGLTLGTQLNSL 901 +GALLA++YSVKMGI +ER AYL+LIG IR++SF KVVEIV +M + G + GT L++L Sbjct: 340 PDGALLAFEYSVKMGIIVERNAYLSLIGALIRLSSFPKVVEIVVEMIRAGYSPGTYLSAL 399 Query: 902 LIYRLGCGGEAAAAEKVFDFLPEKEKSCAEYTALISAYFSCGNSGKGIETFNIMKSKGVN 1081 LIYRLG A KVF+ LP+ K YTAL+ YFS G++ +G++ F+ M+ +G Sbjct: 400 LIYRLGRARRTNCAAKVFNLLPDDHKCTTTYTALMGVYFSAGSADRGLKIFDTMRGEGFL 459 Query: 1082 VALGTYRVLIAGLEKCGKSREVEYYKKEKMRLQMEGCTTN-ISIEETICNLLFSGDFI 1252 LGTY VL+AGL+K G+ RE E Y+KEK LQ +G + N + I++ IC+ LF+GD + Sbjct: 460 PFLGTYNVLLAGLQKLGRVREAEMYRKEKKSLQSDGPSQNAVPIDQKICDFLFAGDVV 517 >ref|XP_006425741.1| hypothetical protein CICLE_v10027297mg [Citrus clementina] gi|557527731|gb|ESR38981.1| hypothetical protein CICLE_v10027297mg [Citrus clementina] Length = 522 Score = 438 bits (1126), Expect = 
e-120 Identities = 224/424 (52%), Positives = 306/424 (72%), Gaps = 6/424 (1%) Frame = +2 Query: 2 DVVTYTSMMHWISNDGDVAGAIDLWKEMRAKGCRPTVVSYTAYMKILLDSKRVNVASDVY 181 D VTYTS+MHW+SN GDV GA+++W+EM+ K C PTVVSYTAYMKIL + RV A+DVY Sbjct: 100 DAVTYTSVMHWLSNAGDVDGAVNIWEEMKLKECYPTVVSYTAYMKILFLNDRVKEATDVY 159 Query: 182 KEMLESGLSPNCHTYTVLMEYLASSGKFS---EALEIFDKMQHAGVQPDKATCNILINVC 352 KEM++ GL PNC+TYTVLMEYL +G + E LEIF KMQ AGVQPDKA CNILI C Sbjct: 160 KEMIQRGLPPNCYTYTVLMEYLVRAGMRANKKEPLEIFSKMQEAGVQPDKAACNILIEKC 219 Query: 353 CITGEIWAMVKVLEYMKQNFLVLRNPIYQKALETFKLAGESDALLREVNRHFSSEYSNEG 532 C GE ++ +L YMK+N L LR P++++AL+TFK+A E+D+LL +V+ FS E+ ++ Sbjct: 220 CKAGETRTIILILRYMKENRLALRYPVFKEALQTFKVADENDSLLWQVHPQFSPEFISDN 279 Query: 533 TNDVDAVFHNSE--FDVDNGLVLNFMNKQNLLAVDSLLTDLAEKGVQLDSKVISNIIEVN 706 + V+ V + E +D GLVL + K+NL+A+D LL+ + +K +QLDS VIS IIEVN Sbjct: 280 -DAVEFVTTDIEGPLSIDQGLVLILLKKKNLVAIDCLLSGIMDKNIQLDSAVISTIIEVN 338 Query: 707 ITRRRQNGALLAYDYSVKMGINIERTAYLALIGFSIRVNSFHKVVEIVEKMSKQGLTLGT 886 RR++GALLA++YSVKM +N+ERTAYLALIG I++N+F KV EIVE+M+K G +LG Sbjct: 339 CDHRRRDGALLAFEYSVKMDLNLERTAYLALIGILIKLNTFPKVAEIVEEMTKAGHSLGV 398 Query: 887 QLNSLLIYRLGCGGEAAAAEKVFDFLPEKEKSCAEYTALISAYFSCGNSGKGIETFNIMK 1066 L +LLI+RLG A K+F LPE +K A YTALI YFS G++ K ++ + M Sbjct: 399 YLGALLIHRLGSARRPVPAAKIFSLLPEDQKCTATYTALIGVYFSAGSADKALKIYKTMC 458 Query: 1067 SKGVNVALGTYRVLIAGLEKCGKSREVEYYKKEKMRLQMEGCTTN-ISIEETICNLLFSG 1243 KG++ +LGT+ VL+AGLEK G+ + E Y+KEK +Q + + + + +EE IC+LL+ G Sbjct: 459 RKGIHPSLGTFNVLLAGLEKLGRVSDAEIYRKEKKSIQADALSKDTVPMEEKICDLLYGG 518 Query: 1244 DFII 1255 D ++ Sbjct: 519 DGVL 522 >ref|XP_002525094.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223535553|gb|EEF37221.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 472 Score = 425 bits (1092), Expect = e-116 Identities = 217/420 (51%), Positives = 299/420 (71%), Gaps = 3/420 (0%) Frame = +2 Query: 2 DVVTYTSMMHWISNDGDVAGAIDLWKEMRAKGCRPTVVSYTAYMKILLDSKRVNVASDVY 181 D 
VTYTS+MHW+S +GDV GAI +W+EM+ KG TVVSYTA+MKIL D+KRV A+++Y Sbjct: 55 DTVTYTSLMHWLSRNGDVDGAIKIWEEMKEKGLYLTVVSYTAFMKILFDNKRVKEATNIY 114 Query: 182 KEMLESGLSPNCHTYTVLMEYLASSGKFSEALEIFDKMQHAGVQPDKATCNILINVCCIT 361 KEMLE G+ PNCHTYTVLMEYL SGK EALEIF KMQ AGVQPDKA CNIL+ CC Sbjct: 115 KEMLECGIPPNCHTYTVLMEYLVVSGKCQEALEIFKKMQEAGVQPDKAMCNILVERCCEA 174 Query: 362 GEIWAMVKVLEYMKQNFLVLRNPIYQKALETFKLAGESDALLREVNRHFSSEYSNEGTND 541 GE M +L+YMK+N L LR PI+ +AL+ ++AGE DALL++VN H ++E N ND Sbjct: 175 GETNTMTPILQYMKENHLALRYPIFLEALKILRIAGECDALLKQVNPHCAAESIN---ND 231 Query: 542 VDAVFHNSEFD--VDNGLVLNFMNKQNLLAVDSLLTDLAEKGVQLDSKVISNIIEVNITR 715 + F S ++ +D GL+L + KQNL+AVD LL + +K + L+S ++S IIEVN ++ Sbjct: 232 DSSEFTTSAYNDPIDKGLLLILLRKQNLVAVDHLLAGVLDKNMVLESWIVSTIIEVNCSQ 291 Query: 716 RRQNGALLAYDYSVKMGINIERTAYLALIGFSIRVNSFHKVVEIVEKMSKQGLTLGTQLN 895 R + AL+A++YS+++GI++ERTAYLALIG IR N+ V+EIV++ + G +LG L+ Sbjct: 292 CRPDSALMAFEYSMRVGIDLERTAYLALIGIMIRSNTSLWVLEIVKETIRVGHSLGLYLS 351 Query: 896 SLLIYRLGCGGEAAAAEKVFDFLPEKEKSCAEYTALISAYFSCGNSGKGIETFNIMKSKG 1075 +LLI+RLG A K+FD LP+++K A YTA+I YFS G++ K ++ + MK Sbjct: 352 ALLIHRLGSARRPNCAAKIFDLLPDEQKCTATYTAMIGVYFSAGSAAKALKIYKTMKKNS 411 Query: 1076 VNVALGTYRVLIAGLEKCGKSREVEYYKKEKMRLQMEGCTTN-ISIEETICNLLFSGDFI 1252 +N +LGTY VL+AGLEK G+ E + ++KEK L +G + + +EE IC+LLF+G + Sbjct: 412 INPSLGTYNVLLAGLEKSGRVCETDNFRKEKRSLMADGNRLSCLPMEEKICDLLFAGSLV 471 >ref|XP_007160534.1| hypothetical protein PHAVU_002G329700g [Phaseolus vulgaris] gi|561033949|gb|ESW32528.1| hypothetical protein PHAVU_002G329700g [Phaseolus vulgaris] Length = 600 Score = 413 bits (1062), Expect = e-113 Identities = 220/420 (52%), Positives = 283/420 (67%), Gaps = 2/420 (0%) Frame = +2 Query: 2 DVVTYTSMMHWISNDGDVAGAIDLWKEMRAKGCRPTVVSYTAYMKILLDSKRVNVASDVY 181 D VTYTSMMHW+S+ G+V A+ +W+EM++KGC PTVVSYTAYMKIL D+K+V A+ VY Sbjct: 181 DSVTYTSMMHWLSSSGNVDEAMQVWEEMKSKGCYPTVVSYTAYMKILFDNKKVKEATRVY 240 Query: 182 KEMLESGLSPNCHTYTVLMEYLASSGKFSEALEIFDKMQHAGVQPDKATCNILINVCCIT 361 KEM+ SG+ PNCHTYTVLM+YL SG EALEIF+KMQ 
AG QPDKA CNILI C Sbjct: 241 KEMISSGVPPNCHTYTVLMDYLIGSGNCEEALEIFEKMQEAGAQPDKAACNILIERCSKV 300 Query: 362 GEIWAMVKVLEYMKQNFLVLRNPIYQKALETFKLAGESDALLREVNRHFSSEYS-NEGTN 538 G M +L+YMK+N LVLR P++ KALE K+AGESD LLR+VN F + S + N Sbjct: 301 GGTEFMTHILQYMKENRLVLRYPVFVKALEALKVAGESDTLLRKVNPQFYIDCSITKNKN 360 Query: 539 DVDAVFHNSEFDVDNGLVLNFMNKQNLLAVDSLLTDLAEKGVQLDSKVISNIIEVNITRR 718 D V +S ++D L+ + +N++A+D LLT + +K + LD V+S IIEVN + Sbjct: 361 DSITVAADSPTNMDKELLFVLLKNRNVVAIDHLLTGMMQKKLSLDHTVVSTIIEVNCSHC 420 Query: 719 RQNGALLAYDYSVKMGINIERTAYLALIGFSIRVNSFHKVVEIVEKMSKQGLTLGTQLNS 898 R GALLA+ YSV MGI I R YL+L+G R N F K+V IVE+M++ G +LG L S Sbjct: 421 RPEGALLAFKYSVTMGIGIARKGYLSLMGLLTRSNMFSKLVYIVEEMTRAGHSLGIYLAS 480 Query: 899 LLIYRLGCGGEAAAAEKVFDFLPEKEKSCAEYTALISAYFSCGNSGKGIETFNIMKSKGV 1078 LLI+RLGC + +A K+F+ LP+ K A YTALIS YFS K +E + M SKG Sbjct: 481 LLIFRLGCARKHTSAMKIFNLLPDNHKCTATYTALISVYFSVKRVRKALEIYKTMCSKGF 540 Query: 1079 NVALGTYRVLIAGLEKCGKSREVEYYKKEKMRLQM-EGCTTNISIEETICNLLFSGDFII 1255 LGTY VLIAGLE+ G+ E ++Y+K K L G ++ IE ICNL+FS D I+ Sbjct: 541 CPVLGTYNVLIAGLERNGRYAEADHYRKAKKTLHANSGSQESVLIEGKICNLIFSVDVIL 600 >ref|XP_003524343.2| PREDICTED: pentatricopeptide repeat-containing protein At2g01390-like [Glycine max] Length = 591 Score = 412 bits (1058), Expect = e-112 Identities = 217/422 (51%), Positives = 284/422 (67%), Gaps = 4/422 (0%) Frame = +2 Query: 2 DVVTYTSMMHWISNDGDVAGAIDLWKEMRAKGCRPTVVSYTAYMKILLDSKRVNVASDVY 181 D VTYTSMMHW+S+ G+ A+ +W +M++KG PTVVSYTAY+KIL ++RV A+ Y Sbjct: 170 DSVTYTSMMHWLSSSGNFDEAMQMWDQMKSKGFHPTVVSYTAYIKILFHNQRVKEATRAY 229 Query: 182 KEMLESGLSPNCHTYTVLMEYLASSGKFSEALEIFDKMQHAGVQPDKATCNILINVCCIT 361 KEM+ S ++PNCHTYTVLM+YL SG++ EALEIF+KMQ AG QPDKA CNILI C Sbjct: 230 KEMISSRVAPNCHTYTVLMDYLIGSGQYKEALEIFEKMQEAGAQPDKAACNILIERCSKV 289 Query: 362 GEIWAMVKVLEYMKQNFLVLRNPIYQKALETFKLAGESDALLREVNRHFSSE-YSNEGTN 538 G M +L+YMK+N LVLR P++ KALE K+AGESD LLR+VN F + + + Sbjct: 290 GGTEFMTHILQYMKENRLVLRYPVFVKALEALKIAGESDTLLRQVNPQFYMDCIIRKKAS 349 Query: 539 
DVDAVFHNSEFDVDNGLVLNFMNKQNLLAVDSLLTDLAEKGVQLDSKVISNIIEVNITRR 718 + V + ++D L+ + +N++A+D LLT + +K + LD KV+S IIEVN + Sbjct: 350 NTITVAADCPTNIDKELLFVLLKNRNVVAIDHLLTGMMDKKISLDHKVVSTIIEVNCSHC 409 Query: 719 RQNGALLAYDYSVKMGINIERTAYLALIGFSIRVNSFHKVVEIVEKMSKQGLTLGTQLNS 898 R GALLA+ YSV MGI+IERT YL+LIG IR N F K+ EIVEKM++ G +LG L S Sbjct: 410 RPEGALLAFKYSVTMGISIERTGYLSLIGLLIRSNMFSKLAEIVEKMTRAGHSLGIYLAS 469 Query: 899 LLIYRLGCGGEAAAAEKVFDFLPEKEKSCAEYTALISAYFSCGNSGKGIETFNIMKSKGV 1078 LLI+RLGC + A K+F+ LP+ K A YTALIS YFS + +E + IM SKG Sbjct: 470 LLIFRLGCARKHTLAMKIFNLLPDNHKCSATYTALISVYFSVRRVNEALEIYKIMCSKGF 529 Query: 1079 NVALGTYRVLIAGLEKCGKSREVEYYKKEKMRLQM---EGCTTNISIEETICNLLFSGDF 1249 LGTY VLIAGLE+ GK E E+Y+K K R + G ++ IE ICNLLFS D Sbjct: 530 CPVLGTYDVLIAGLERNGKYAEAEHYRKAKKRKSLRANSGSQESVCIEGKICNLLFSVDV 589 Query: 1250 II 1255 ++ Sbjct: 590 VL 591 >ref|XP_004503279.1| PREDICTED: pentatricopeptide repeat-containing protein At2g01390-like [Cicer arietinum] Length = 580 Score = 407 bits (1047), Expect = e-111 Identities = 214/420 (50%), Positives = 283/420 (67%), Gaps = 2/420 (0%) Frame = +2 Query: 2 DVVTYTSMMHWISNDGDVAGAIDLWKEMRAKGCRPTVVSYTAYMKILLDSKRVNVASDVY 181 D VTYTSMMHW+SN G++ AI LW EM++KGC PTVVSYTAY+KIL D++R A+ VY Sbjct: 161 DSVTYTSMMHWLSNSGNLDEAIALWDEMKSKGCYPTVVSYTAYIKILFDNQRTKEATGVY 220 Query: 182 KEMLESGLSPNCHTYTVLMEYLASSGKFSEALEIFDKMQHAGVQPDKATCNILINVCCIT 361 KEML SG PNC+TYTVLM++L +SGK EALEIF+KMQ AGV+PDKA CNILI C Sbjct: 221 KEMLHSGCVPNCYTYTVLMDHLIASGKCKEALEIFEKMQEAGVEPDKAACNILIEKCSKV 280 Query: 362 GEIWAMVKVLEYMKQNFLVLRNPIYQKALETFKLAGESDALLREVNRHFSSEYS-NEGTN 538 G M +L YMK+N LVLR+ ++ +A++ K+AGESD LLR+VN F + S + N Sbjct: 281 GGTVFMTHILHYMKENRLVLRHRVFLEAMKALKIAGESDTLLRQVNPQFYLDCSFRKKEN 340 Query: 539 DVDAVFHNSEFDVDNGLVLNFMNKQNLLAVDSLLTDLAEKGVQLDSKVISNIIEVNITRR 718 D V S ++DN L+ + + ++A+D LL + EK + +D+KVIS IIEVN Sbjct: 341 DSGTVIAGSSANIDNELLFVLLKNRKVVAIDHLLQGMMEKKIGVDNKVISTIIEVNCNCC 400 Query: 719 RQNGALLAYDYSVKMGINIERTAYLALIGFSIRVNSFHKVVEIVEKMSKQGLTLGTQLNS 898 R 
+GALLA++YSV MGI++ER YL+LIG IR N F K+V+IV +M+ G +LG L S Sbjct: 401 RPDGALLAFNYSVTMGISLERAGYLSLIGLLIRSNMFSKLVQIVAEMTNAGHSLGIYLAS 460 Query: 899 LLIYRLGCGGEAAAAEKVFDFLPEKEKSCAEYTALISAYFSCGNSGKGIETFNIMKSKGV 1078 LLIYRLGC + A K+F+ LP+ K A YTALIS Y S K +E + IM KG+ Sbjct: 461 LLIYRLGCARRPSFALKIFNLLPDNHKCTATYTALISTYLSARRVNKALEIYKIMCQKGI 520 Query: 1079 NVALGTYRVLIAGLEKCGKSREVEYYKKEKMRLQME-GCTTNISIEETICNLLFSGDFII 1255 GTY +L+AGLE+ G+ E E ++K K L G ++S E +C+LLF+GD I+ Sbjct: 521 CPTSGTYNILVAGLERNGRYSEAELHRKSKKNLHSNIGTRESVSTEGKMCDLLFAGDVIL 580 >ref|XP_003631109.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355525131|gb|AET05585.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 687 Score = 406 bits (1044), Expect = e-110 Identities = 218/422 (51%), Positives = 291/422 (68%), Gaps = 4/422 (0%) Frame = +2 Query: 2 DVVTYTSMMHWISNDGDVAGAIDLWKEMRAKGC-RPTVVSYTAYMKILLDSKRVNVASDV 178 D VTYTSMMHW+S G+V AI LW EM++KGC PTVVSYTA++KIL D+ RV A+ + Sbjct: 267 DSVTYTSMMHWLSTSGNVDEAIALWDEMKSKGCCYPTVVSYTAFIKILFDNHRVKEATAI 326 Query: 179 YKEMLESGLSPNCHTYTVLMEYLASSGKFSEALEIFDKMQHAGVQPDKATCNILINVCC- 355 YKEML +G PNC+TYTVLM++L +SGK EALEIF KMQ AGV+PDKA CNILI+ C Sbjct: 327 YKEMLHNGCVPNCYTYTVLMDHLIASGKCKEALEIFQKMQEAGVEPDKAACNILIDKCSK 386 Query: 356 ITGEIWAMVKVLEYMKQNFLVLRNPIYQKALETFKLAGESDALLREVNRHFSSEYS-NEG 532 + G ++ M K+L+YMK+N LVLR +Y KA+E K+AGESD LLR+VN HF + S E Sbjct: 387 VCGTVF-MTKILQYMKENRLVLRYRVYVKAMEALKIAGESDTLLRQVNPHFYLDSSFKEK 445 Query: 533 TNDVDAVFHNSEFDVDNGLVLNFMNKQNLLAVDSLLTDLAEKGVQLDSKVISNIIEVNIT 712 +D + V +S ++D L+L + +N++A+D L+ + +K + +D+KVIS IIEVN Sbjct: 446 AHDRNTVIADSSSNIDKELLLVLLRNRNVVAIDHLIQGMMDKKISVDNKVISTIIEVNCN 505 Query: 713 RRRQNGALLAYDYSVKMGINIERTAYLALIGFSIRVNSFHKVVEIVEKMSKQGLTLGTQL 892 R +GALLA++YSV MGI+IERT YL+L+G R N F K+VEIV +M++ G +LG L Sbjct: 506 CCRPDGALLAFNYSVTMGISIERTGYLSLVGLLNRSNMFSKLVEIVGEMTRAGHSLGIYL 565 Query: 893 NSLLIYRLGCGGEAAAAEKVFDFLPEKEKSCAEYTALISAYFSCGNSGKGIETFNIMKSK 1072 SLLIYRLGC + + A K+F+ LP+ K A 
YTALIS YFS K +E + IM K Sbjct: 566 ASLLIYRLGCARQLSIALKIFNLLPDNHKCVATYTALISIYFSARRVNKALEIYKIMCQK 625 Query: 1073 GVNVALGTYRVLIAGLEKCGKSREVEYYKKEKMRLQME-GCTTNISIEETICNLLFSGDF 1249 G GT+ +L+AGLE+ G+ E ++K K L G N+S E IC+LLF+GD Sbjct: 626 GNCPTSGTFNILVAGLERNGRFSEAGVHRKAKKNLNSNIGSQENLSTEGRICDLLFAGDV 685 Query: 1250 II 1255 I+ Sbjct: 686 IL 687 >ref|XP_004165472.1| PREDICTED: pentatricopeptide repeat-containing protein At2g01390-like [Cucumis sativus] Length = 737 Score = 395 bits (1015), Expect = e-107 Identities = 209/415 (50%), Positives = 274/415 (66%), Gaps = 3/415 (0%) Frame = +2 Query: 2 DVVTYTSMMHWISNDGDVAGAIDLWKEMRAKGCRPTVVSYTAYMKILLDSKRVNVASDVY 181 D VTYTS+MHW SN GDV GAI +WKEM+A GC PTVVSYTAY+KILLD+ ++N A+ Y Sbjct: 235 DAVTYTSLMHWRSNSGDVDGAIKVWKEMKANGCHPTVVSYTAYIKILLDNGQINEATATY 294 Query: 182 KEMLESGLSPNCHTYTVLMEYLASSGKFSEALEIFDKMQHAGVQPDKATCNILINVCCIT 361 K+ML+SGLSPNC TYT+LMEYL GK EAL+IF KMQ AGV PDKA CNILI CC + Sbjct: 295 KKMLQSGLSPNCCTYTILMEYLIGEGKCKEALDIFSKMQDAGVYPDKAACNILIQKCCKS 354 Query: 362 GEIWAMVKVLEYMKQNFLVLRNPIYQKALETFKLAGESDALLREVNRHFSSEYSNEG-TN 538 GE M ++LE+MK+N VLR P++ +A ET K S ALL++VN H E ++G Sbjct: 355 GERLVMTQILEFMKENRFVLRYPVFVEAHETLKSCSVSYALLKQVNPHMEIESISKGEVV 414 Query: 539 DVDAVFHNSEFDVDNGLVLNFMNKQNLLAVDSLLTDLAEKGVQLDSKVISNIIEVNITRR 718 DV + +VDN L+ + L AVD +L + +K +QLDS +I +IIEVN Sbjct: 415 DVSTGSNTVPPNVDNELLAMLLKDNKLTAVDHMLIGIVDKNIQLDSSIIYSIIEVNCKSN 474 Query: 719 RQNGALLAYDYSVKMGINIERTAYLALIGFSIRVNSFHKVVEIVEKMSKQGLTLGTQLNS 898 R N ALLA+DY +K +NI+R YL LIG IR + + K++EIV++M QG LG + Sbjct: 475 RPNSALLAFDYCLKNSVNIKRKLYLDLIGILIRSSIYPKLLEIVQEMYTQGHCLGLYHAT 534 Query: 899 LLIYRLGCGGEAAAAEKVFDFLPEKEKSCAEYTALISAYFSCGNSGKGIETFNIMKSKGV 1078 L++Y LG G+ A KVF+ LPE+ K A YTAL+ YFS G+SGKG++ F M+ KG Sbjct: 535 LILYSLGKAGKPQYARKVFNMLPEELKCTATYTALVDGYFSAGSSGKGLKIFETMRKKGF 594 Query: 1079 NVALGTYRVLIAGLEKCGKSREVEYYKKEKMRLQM--EGCTTNISIEETICNLLF 1237 +LGTY VL+ GL K G+ E+ Y++EK ++ I +E IC+LLF Sbjct: 595 TPSLGTYNVLLNGLAKNGRGVELNIYRREKKSFEISHHSRLNTILDDERICDLLF 
649 >ref|XP_004148464.1| PREDICTED: pentatricopeptide repeat-containing protein At2g01390-like [Cucumis sativus] Length = 1058 Score = 390 bits (1003), Expect = e-106 Identities = 208/415 (50%), Positives = 272/415 (65%), Gaps = 3/415 (0%) Frame = +2 Query: 2 DVVTYTSMMHWISNDGDVAGAIDLWKEMRAKGCRPTVVSYTAYMKILLDSKRVNVASDVY 181 D VTYTS+MHW SN GDV GAI LWKEM+A GC PTVVSYTAY+KILLD+ ++N A+ Y Sbjct: 216 DAVTYTSLMHWRSNSGDVDGAIKLWKEMKANGCHPTVVSYTAYIKILLDNGQINEATATY 275 Query: 182 KEMLESGLSPNCHTYTVLMEYLASSGKFSEALEIFDKMQHAGVQPDKATCNILINVCCIT 361 K+ML+SGLSPNC YT+LMEYL GK EAL+IF KMQ AGV PDKA CNILI CC + Sbjct: 276 KKMLQSGLSPNCCIYTILMEYLIGEGKCKEALDIFSKMQDAGVYPDKAACNILIQKCCKS 335 Query: 362 GEIWAMVKVLEYMKQNFLVLRNPIYQKALETFKLAGESDALLREVNRHFSSEYSNEG-TN 538 GE M ++LE+MK+N VLR P++ +A ET K S ALL++VN H E ++G Sbjct: 336 GERLVMTQILEFMKENRFVLRYPVFVEAHETLKSCSVSYALLKQVNPHMEIESISKGEVV 395 Query: 539 DVDAVFHNSEFDVDNGLVLNFMNKQNLLAVDSLLTDLAEKGVQLDSKVISNIIEVNITRR 718 DV + +VDN L+ + L AVD +L + +K +QLDS +I +IIEVN Sbjct: 396 DVSTGSNTVPPNVDNELLAMLLKDNKLTAVDHMLIGIVDKNIQLDSSIIYSIIEVNCKSN 455 Query: 719 RQNGALLAYDYSVKMGINIERTAYLALIGFSIRVNSFHKVVEIVEKMSKQGLTLGTQLNS 898 R N ALLA+DY +K +NI+R YL LIG IR + + K++EIV++M QG LG + Sbjct: 456 RPNSALLAFDYCLKNSVNIKRKLYLDLIGILIRSSIYPKLLEIVQEMYTQGHCLGLYHAT 515 Query: 899 LLIYRLGCGGEAAAAEKVFDFLPEKEKSCAEYTALISAYFSCGNSGKGIETFNIMKSKGV 1078 L++ LG G+ A KVF+ LPE+ K A YTAL+ YFS G+SGKG++ F M+ KG Sbjct: 516 LILCSLGKAGKPQYARKVFNMLPEELKCTATYTALVDGYFSAGSSGKGLKIFETMRKKGF 575 Query: 1079 NVALGTYRVLIAGLEKCGKSREVEYYKKEKMRLQM--EGCTTNISIEETICNLLF 1237 +LGTY VL+ GL K G+ E+ Y++EK ++ I +E IC+LLF Sbjct: 576 TPSLGTYNVLLNGLAKNGRGVELNIYRREKKSFEISHHSRLNTILDDERICDLLF 630 >ref|NP_178248.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|218546783|sp|Q9ZU29.2|PP139_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At2g01390 gi|330250351|gb|AEC05445.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 577 Score = 380 bits (976), Expect = 
e-103 Identities = 208/422 (49%), Positives = 279/422 (66%), Gaps = 5/422 (1%) Frame = +2 Query: 2 DVVTYTSMMHWISNDGDVAGAIDLWKEMRAKGCRPTVVSYTAYMKILLDSKRVNVASDVY 181 D VTYTS++HW+S+ GDV GA+ LW+EMR GC PTVVSYTAYMK+L RV A++VY Sbjct: 156 DTVTYTSLIHWVSSSGDVDGAMRLWEEMRDNGCEPTVVSYTAYMKMLFADGRVEEATEVY 215 Query: 182 KEMLESGLSPNCHTYTVLMEYLASSGKFSEALEIFDKMQHAGVQPDKATCNILINVCCIT 361 KEML S +SPNCHTYTVLMEYL ++GK EAL+IF KMQ GVQPDKA CNILI Sbjct: 216 KEMLRSRVSPNCHTYTVLMEYLVATGKCEEALDIFFKMQEIGVQPDKAACNILIAKALKF 275 Query: 362 GEIWAMVKVLEYMKQNFLVLRNPIYQKALETFKLAGESDALLREVNRHFSSE-YSNEGTN 538 GE M +VL YMK+N +VLR PI+ +ALET K AGESD LLREVN H S E + + Sbjct: 276 GETSFMTRVLVYMKENGVVLRYPIFVEALETLKAAGESDDLLREVNSHISVESLCSSDID 335 Query: 539 DVDAVFHNSEFDVDNGLVLN--FMNKQNLLAVDSLLTDLAEKGVQLDSKVISNIIEVNIT 712 + N + D+ V++ + KQNL+AVD LL + ++ ++LDS V+S IIE N Sbjct: 336 ETPTAEVNDTKNSDDSRVISSVLLMKQNLVAVDILLNQMRDRNIKLDSFVVSAIIETNCD 395 Query: 713 RRRQNGALLAYDYSVKMGINIERTAYLALIGFSIRVNSFHKVVEIVEKMSKQGLTLGTQL 892 R R GA LA+DYS++MGI+++++AYLALIG +R N KV+E+V++M K +LG Sbjct: 396 RCRTEGASLAFDYSLEMGIHLKKSAYLALIGNFLRSNELPKVIEVVKEMVKAQHSLGCYQ 455 Query: 893 NSLLIYRLGCGGEAAAAEKVFDFLPEKEKSCAEYTALISAYFSCGNSGKGIETFNIMKSK 1072 ++LI+RLG G A VFD LP+ +K A YTAL+ Y S G+ K ++ M+ + Sbjct: 456 GAMLIHRLGFGRRPRLAADVFDLLPDDQKGVAAYTALMDVYISAGSPEKAMKILREMRER 515 Query: 1073 GVNVALGTYRVLIAGLEKCGK-SREVEYYKKEKMRLQMEG-CTTNISIEETICNLLFSGD 1246 + +LGTY VL++GLEK +EV +KEK L N+ +E+ IC+LLF+ + Sbjct: 516 EIMPSLGTYDVLLSGLEKTSDFQKEVALLRKEKKSLVASARFRENVHVEDKICDLLFATN 575 Query: 1247 FI 1252 + Sbjct: 576 LL 577 >ref|XP_006398525.1| hypothetical protein EUTSA_v10000833mg [Eutrema salsugineum] gi|557099614|gb|ESQ39978.1| hypothetical protein EUTSA_v10000833mg [Eutrema salsugineum] Length = 576 Score = 380 bits (975), Expect = e-102 Identities = 205/418 (49%), Positives = 275/418 (65%), Gaps = 5/418 (1%) Frame = +2 Query: 2 DVVTYTSMMHWISNDGDVAGAIDLWKEMRAKGCRPTVVSYTAYMKILLDSKRVNVASDVY 181 D VTYTS++HW+S+ GDV GA+ LW+EMR +G PTVVSYTAYMK+ D+ +V A++VY Sbjct: 
156 DTVTYTSLIHWVSSSGDVDGAMKLWEEMRDQGSEPTVVSYTAYMKMFFDNGKVEEATEVY 215

Query: 182  KEMLESGLSPNCHTYTVLMEYLASSGKFSEALEIFDKMQHAGVQPDKATCNILINVCCIT 361
            KEML S +SPNCHTYTVLMEYL +GK EAL+IF KMQ GVQPDKA CNILI C
Sbjct: 216  KEMLRSRISPNCHTYTVLMEYLVGTGKCEEALDIFFKMQEIGVQPDKAACNILIGKACKF 275

Query: 362  GEIWAMVKVLEYMKQNFLVLRNPIYQKALETFKLAGESDALLREVNRHFSSE-YSNEGTN 538
            GE M ++L YMK+N +VLR+ ++ +ALET K AGESD LLRE N H S+E + +
Sbjct: 276  GETSFMTRILLYMKENGIVLRHTVFFEALETLKDAGESDYLLREANSHISAESLCSNNID 335

Query: 539  DVDAVFHNSEFDVDNGLVLN--FMNKQNLLAVDSLLTDLAEKGVQLDSKVISNIIEVNIT 712
            + V + E VD V++ + KQNL+AVD LL + ++ ++LDS V+S I+E N
Sbjct: 336  ETPKVEVDEEKSVDYSRVISSVLLMKQNLVAVDHLLNQMKDRNIKLDSFVVSAIVETNCD 395

Query: 713  RRRQNGALLAYDYSVKMGINIERTAYLALIGFSIRVNSFHKVVEIVEKMSKQGLTLGTQL 892
            R R GA LA DYS +MGI++E+TAYLAL+G +R N KV E V++M K +LG
Sbjct: 396  RCRTEGASLALDYSSEMGIHLEKTAYLALVGNFLRSNELAKVTETVKEMVKGQHSLGVYQ 455

Query: 893  NSLLIYRLGCGGEAAAAEKVFDFLPEKEKSCAEYTALISAYFSCGNSGKGIETFNIMKSK 1072
            + LI++LG G A +VFD +P+ +K A YTAL+ Y S G+ K ++ M+ +
Sbjct: 456  GATLIHKLGFGRRPRLAAEVFDLIPDDQKGVAAYTALMDVYISAGSPEKAMKILEAMRER 515

Query: 1073 GVNVALGTYRVLIAGLEKCGK-SREVEYYKKEKMRLQMEG-CTTNISIEETICNLLFS 1240
            + +LGTY VL++GLEK RE +KEK L G N+ +EE IC+LLF+
Sbjct: 516  EIMPSLGTYNVLLSGLEKTSDFQREAASLRKEKKSLVASGRFRENVDVEEKICDLLFA 573

>ref|XP_006292569.1| hypothetical protein CARUB_v10018804mg [Capsella rubella]
gi|482561276|gb|EOA25467.1| hypothetical protein CARUB_v10018804mg [Capsella rubella]
Length = 662

Score = 374 bits (961), Expect = e-101
Identities = 209/421 (49%), Positives = 274/421 (65%), Gaps = 8/421 (1%)
Frame = +2

Query: 2    DVVTYTSMMHWISNDGDVAGAIDLWKEMRAKGCRPTVVSYTAYMKILLDSKRVNVASDVY 181
            D VTYTS++HW+S+ GD+ GA+ LW+EMR KGC PTVVSYTAYMKIL RV A++VY
Sbjct: 242  DTVTYTSLIHWVSSSGDIDGAMRLWEEMRDKGCEPTVVSYTAYMKILFADGRVEEATEVY 301

Query: 182  KEMLESGLSPNCHTYTVLMEYLASSGKFSEALEIFDKMQHAGVQPDKATCNILINVCCIT 361
            K+ML S +SPNC+TYTVLMEYL +GK EAL+IF KMQ GVQPDKA CNILI C
Sbjct: 302  KDMLRSRVSPNCYTYTVLMEYLVGTGKCEEALDIFFKMQEIGVQPDKAACNILIGKACKF 361
Query: 362  GEIWAMVKVLEYMKQNFLVLRNPIYQKALETFKLAGESDALLREVNRHF------SSEYS 523
            GE M +VL YMKQN +VLR PI+ +A ET K AGESD LLREVN H SS+
Sbjct: 362  GETSFMARVLLYMKQNGIVLRYPIFLEASETLKAAGESDDLLREVNLHISAELLCSSDID 421

Query: 524  NEGTNDVDAVFHNSEFDVDNGLVLNFMNKQNLLAVDSLLTDLAEKGVQLDSKVISNIIEV 703
            T +VD + + V + ++L KQNL+AVD +L + E+ ++LDS V+S IIE
Sbjct: 422  ETPTAEVDDTKSSDDSRVISSVLL---MKQNLVAVDLMLNQMRERKMKLDSFVVSAIIET 478

Query: 704  NITRRRQNGALLAYDYSVKMGINIERTAYLALIGFSIRVNSFHKVVEIVEKMSKQGLTLG 883
            N R R GA LA DYS KMGI++E++AYLALIG +RVN V+E+V++M K +LG
Sbjct: 479  NCGRCRTEGASLALDYSSKMGIHLEKSAYLALIGQFLRVNELPNVIEVVKEMLKAQHSLG 538

Query: 884  TQLNSLLIYRLGCGGEAAAAEKVFDFLPEKEKSCAEYTALISAYFSCGNSGKGIETFNIM 1063
            ++LI+RLG A +VF LP+ +K A YTAL+ Y S G+ K ++ M
Sbjct: 539  VYQGAMLIHRLGFERRPRLAAEVFYLLPDDQKGVAAYTALMGVYVSAGSPEKAMKILEEM 598

Query: 1064 KSKGVNVALGTYRVLIAGLEKCGK-SREVEYYKKEKMRLQMEG-CTTNISIEETICNLLF 1237
            + + + +LGTY VL++GLEK RE +KE+ L N+ +E+ IC+LLF
Sbjct: 599  REREIMPSLGTYNVLLSGLEKTSDFQRETALLRKEEKSLVASARFRENVHVEDKICDLLF 658

Query: 1238 S 1240
            +
Sbjct: 659  A 659