BLASTX nr result
ID: Sinomenium21_contig00016674
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Sinomenium21_contig00016674 (1815 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002265876.1| PREDICTED: pentatricopeptide repeat-containi... 593 e-167 ref|XP_007222454.1| hypothetical protein PRUPE_ppa006191mg [Prun... 555 e-155 ref|XP_004301396.1| PREDICTED: pentatricopeptide repeat-containi... 545 e-152 ref|XP_006453186.1| hypothetical protein CICLE_v10010414mg [Citr... 525 e-146 ref|XP_006372219.1| pentatricopeptide repeat-containing family p... 523 e-145 ref|XP_002511816.1| pentatricopeptide repeat-containing protein,... 520 e-145 emb|CAN79718.1| hypothetical protein VITISV_012741 [Vitis vinifera] 511 e-142 ref|XP_007014560.1| Pentatricopeptide repeat superfamily protein... 496 e-137 ref|XP_006360648.1| PREDICTED: pentatricopeptide repeat-containi... 494 e-137 ref|XP_006850970.1| hypothetical protein AMTR_s00025p00206120 [A... 491 e-136 ref|XP_004152890.1| PREDICTED: pentatricopeptide repeat-containi... 488 e-135 ref|XP_006360650.1| PREDICTED: pentatricopeptide repeat-containi... 486 e-134 ref|XP_004240282.1| PREDICTED: pentatricopeptide repeat-containi... 485 e-134 ref|XP_006372218.1| hypothetical protein POPTR_0018s14360g [Popu... 478 e-132 ref|XP_006573403.1| PREDICTED: pentatricopeptide repeat-containi... 470 e-130 ref|XP_007134710.1| hypothetical protein PHAVU_010G069800g [Phas... 469 e-129 ref|XP_007134709.1| hypothetical protein PHAVU_010G069800g [Phas... 468 e-129 gb|ABA18111.1| pentatricopeptide repeat protein [Arabidopsis are... 467 e-129 ref|XP_007132326.1| hypothetical protein PHAVU_011G085400g [Phas... 465 e-128 ref|NP_566863.2| pentatricopeptide repeat-containing protein [Ar... 463 e-127 >ref|XP_002265876.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630 [Vitis vinifera] gi|297736023|emb|CBI24061.3| unnamed protein product [Vitis vinifera] Length = 423 Score = 593 bits (1529), Expect = e-167 Identities = 290/399 (72%), Positives = 339/399 (84%) Frame = +2 Query: 128 HQNKDSVGRSVAREIIWHQKRQASATGKDIYLNFASVLRLLGRKRMPHIAQQLWLEMNSE 307 HQN S R++AR++ WH K++ S GKD Y+++ +++ L RKR+PH+AQ+L EM SE Sbjct: 26 HQNY-SPNRALARKLFWHWKQERSVDGKDNYVDYTPLIQALSRKRLPHVAQELLFEMKSE 84 Query: 308 GFLPCYVTLSALMLCYADNGLFSYAQTIWDEIINSSYVPNIKVVSELIEAYGKMGRFDAV 487 GFLP TLSALMLCYADNGLF AQ +WDEIINSS+ PNI++VS+LI+AYGKMG F V Sbjct: 85 GFLPNNSTLSALMLCYADNGLFPKAQALWDEIINSSFGPNIQIVSKLIDAYGKMGHFGEV 144 Query: 488 SRILREVTLRDFKLCPEVYSSAICCFGKGGQLELMETTMKEMVSNGFPVDSATGNAFLQY 667 +RIL +V+ RDF EVYS AI CFGKGGQLE+ME +KEMVS GFPVDSATGNAF++Y Sbjct: 145 TRILHQVSSRDFNFMHEVYSLAISCFGKGGQLEMMENALKEMVSRGFPVDSATGNAFIRY 204 Query: 668 YSKFGSLTAMEAAYGRLKRSQILIEKEGIRAMASAYIKEGKFHKLGEFLRGVGLGRKNVG 847 YS FGSLT MEAAY RLK+S+ILIE+EGIRAM+ AYIKE K+++LG+FLR VGLGRKNVG Sbjct: 205 YSIFGSLTEMEAAYDRLKKSRILIEEEGIRAMSFAYIKEKKYYRLGQFLRDVGLGRKNVG 264 Query: 848 NLLWNFLLLSYAANFKMKSLQREFSRMAEAGFSPDLSTFNIRALAFSRMALFWDLHVSLE 1027 NLLWN LLLSYAANFKMKSLQREF M EAGF+PDL+TFNIRALAFSRM+LFWDLH+SLE Sbjct: 265 NLLWNLLLLSYAANFKMKSLQREFLEMVEAGFAPDLTTFNIRALAFSRMSLFWDLHLSLE 324 Query: 1028 HMKHEKVVPDLVTYGCVVDAFLDRKIGRNLNFALNKMNTNDSPLVSTDQLVFEVLGKGDF 1207 HM+H KVV DLVTYGCVVDA+LDR++G+NL+FAL KMN +DSPLVSTD VFEVLGKGDF Sbjct: 325 HMQHVKVVADLVTYGCVVDAYLDRRLGKNLDFALKKMNMDDSPLVSTDHFVFEVLGKGDF 384 Query: 1208 HSSSEAFLECNRQRNWTYRKLIALYLRKKYRSNQLFWNY 1324 HSSSEAFLE R WTYRKLIA YL+KKYRSNQ+FWNY Sbjct: 385 HSSSEAFLESKRNGKWTYRKLIATYLKKKYRSNQIFWNY 423 >ref|XP_007222454.1| hypothetical protein PRUPE_ppa006191mg [Prunus persica] gi|462419390|gb|EMJ23653.1| hypothetical protein PRUPE_ppa006191mg [Prunus persica] Length = 423 Score = 555 bits (1429), Expect = e-155 Identities = 277/429 (64%), Positives = 347/429 (80%) Frame = +2 Query: 38 MGVVSLLATSHCFSLSLFKCLRPKFHALYSHQNKDSVGRSVAREIIWHQKRQASATGKDI 217 MG LL+T+ C S SL +P+ + SHQ + S R++AR+II K++ GK I Sbjct: 1 MGGTLLLSTT-CVSSSL----KPQHLSFSSHQPQ-SQSRALARKIIRKWKQEECFDGKGI 54 Query: 218 YLNFASVLRLLGRKRMPHIAQQLWLEMNSEGFLPCYVTLSALMLCYADNGLFSYAQTIWD 397 Y++ ++R L R++MPH+AQ+L LEM S+G LP TLSALMLC+A+NGLF A+ IWD Sbjct: 55 YVDCVPLIRSLSRQKMPHVAQELVLEMKSDGLLPSNSTLSALMLCHANNGLFPQAEAIWD 114 Query: 398 EIINSSYVPNIKVVSELIEAYGKMGRFDAVSRILREVTLRDFKLCPEVYSSAICCFGKGG 577 E+++SS+VP+I+VVSEL +AYG +G F+ V+ IL ++ R+ L PEVYS AI CFGKGG Sbjct: 115 EMLHSSFVPSIQVVSELFDAYGNVGCFEKVNEILAQIRSRNLSLFPEVYSLAISCFGKGG 174 Query: 578 QLELMETTMKEMVSNGFPVDSATGNAFLQYYSKFGSLTAMEAAYGRLKRSQILIEKEGIR 757 QLELME T+KEM+S GFP+DSATGNAF++YYS FGSLT ME AYGRLKRS+ LIE+EGIR Sbjct: 175 QLELMEGTLKEMISRGFPLDSATGNAFIRYYSIFGSLTEMETAYGRLKRSRFLIEEEGIR 234 Query: 758 AMASAYIKEGKFHKLGEFLRGVGLGRKNVGNLLWNFLLLSYAANFKMKSLQREFSRMAEA 937 AM+ AY+K+ KF++L E L+ VGLGR+N+GNL WN LLLSYAA+FKMKSLQREF RM EA Sbjct: 235 AMSFAYLKKRKFYRLAELLKNVGLGRRNLGNLSWNLLLLSYAADFKMKSLQREFLRMVEA 294 Query: 938 GFSPDLSTFNIRALAFSRMALFWDLHVSLEHMKHEKVVPDLVTYGCVVDAFLDRKIGRNL 1117 GF PDL+TFNIRALAFSRM+L WDLH+SLEHMKHEKV PDLVT GCVVDA+L+R++G+N+ Sbjct: 295 GFHPDLTTFNIRALAFSRMSLLWDLHLSLEHMKHEKVFPDLVTCGCVVDAYLERRLGKNM 354 Query: 1118 NFALNKMNTNDSPLVSTDQLVFEVLGKGDFHSSSEAFLECNRQRNWTYRKLIALYLRKKY 1297 FALNKMN +DSPL+ TD VFEVLGKGDFH+SSEAFLE QR WTYR+LI++YL+K+Y Sbjct: 355 YFALNKMNLDDSPLILTDPFVFEVLGKGDFHASSEAFLEFQSQREWTYRRLISVYLKKQY 414 Query: 1298 RSNQLFWNY 1324 R NQ+FWNY Sbjct: 415 RRNQIFWNY 423 >ref|XP_004301396.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like [Fragaria vesca subsp. vesca] Length = 424 Score = 545 bits (1405), Expect = e-152 Identities = 262/409 (64%), Positives = 332/409 (81%) Frame = +2 Query: 98 LRPKFHALYSHQNKDSVGRSVAREIIWHQKRQASATGKDIYLNFASVLRLLGRKRMPHIA 277 L P ++ SHQ++ S R++AR+I+ K++ + GKD Y++ +++ L R++MPH+A Sbjct: 16 LNPNRLSVLSHQSQRSQNRALARKIVRTWKQEECSRGKDCYVDCVPLIQSLSRQKMPHVA 75 Query: 278 QQLWLEMNSEGFLPCYVTLSALMLCYADNGLFSYAQTIWDEIINSSYVPNIKVVSELIEA 457 Q++ L M SEG +P TLSA+MLC+A NGL A+ IWDE++NSS+VP I+VVSEL + Sbjct: 76 QEVLLVMKSEGLIPSNSTLSAVMLCHAKNGLLPQAEAIWDEMLNSSFVPGIQVVSELFDV 135 Query: 458 YGKMGRFDAVSRILREVTLRDFKLCPEVYSSAICCFGKGGQLELMETTMKEMVSNGFPVD 637 YG +G F V+ I+ ++ R+ L P+VYS AI CFGKGGQLELME T+KEMVS GFPVD Sbjct: 136 YGNVGSFGKVNEIVGQIRSRNLSLLPQVYSLAISCFGKGGQLELMEDTLKEMVSRGFPVD 195 Query: 638 SATGNAFLQYYSKFGSLTAMEAAYGRLKRSQILIEKEGIRAMASAYIKEGKFHKLGEFLR 817 SATGN F++YYS FGSLT ME AY RLKRS+ LIE+EGIRAM+ AY+K+ KF+ L EFL+ Sbjct: 196 SATGNVFIRYYSIFGSLTEMETAYDRLKRSRFLIEEEGIRAMSLAYLKKRKFYSLAEFLK 255 Query: 818 GVGLGRKNVGNLLWNFLLLSYAANFKMKSLQREFSRMAEAGFSPDLSTFNIRALAFSRMA 997 VGLGR+N+GNLLWN LLLSYAANFKMK+LQREF RM EAGF PDL+TFNIRALAFSRM+ Sbjct: 256 SVGLGRRNLGNLLWNLLLLSYAANFKMKTLQREFLRMVEAGFHPDLTTFNIRALAFSRMS 315 Query: 998 LFWDLHVSLEHMKHEKVVPDLVTYGCVVDAFLDRKIGRNLNFALNKMNTNDSPLVSTDQL 1177 L WDLH++LEHMKH KVVPDLVT GC+VDA+LDR++GRNL FALNKMN +DSP+V TD Sbjct: 316 LLWDLHLTLEHMKHVKVVPDLVTCGCIVDAYLDRRLGRNLYFALNKMNLDDSPVVLTDPF 375 Query: 1178 VFEVLGKGDFHSSSEAFLECNRQRNWTYRKLIALYLRKKYRSNQLFWNY 1324 VFEVLGKGDFH+SSEAFLE +Q+ WTY+KLI++YL+K+YR +Q+FWNY Sbjct: 376 VFEVLGKGDFHASSEAFLEFRKQKEWTYQKLISVYLKKQYRRDQIFWNY 424 >ref|XP_006453186.1| hypothetical protein CICLE_v10010414mg [Citrus clementina] gi|568840749|ref|XP_006474328.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like isoform X1 [Citrus sinensis] gi|568840751|ref|XP_006474329.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like isoform X2 [Citrus sinensis] gi|568840753|ref|XP_006474330.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like isoform X3 [Citrus sinensis] gi|568840755|ref|XP_006474331.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like isoform X4 [Citrus sinensis] gi|568840757|ref|XP_006474332.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like isoform X5 [Citrus sinensis] gi|568840759|ref|XP_006474333.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like isoform X6 [Citrus sinensis] gi|557556412|gb|ESR66426.1| hypothetical protein CICLE_v10010414mg [Citrus clementina] Length = 412 Score = 525 bits (1351), Expect = e-146 Identities = 264/417 (63%), Positives = 330/417 (79%) Frame = +2 Query: 74 FSLSLFKCLRPKFHALYSHQNKDSVGRSVAREIIWHQKRQASATGKDIYLNFASVLRLLG 253 FSLSL + K + SHQ G +AR+II ++K++ +++ AS++ LG Sbjct: 4 FSLSLHGSFKFKRFNVPSHQTHPKNG-DLARKIIRYRKQEG-------FVDCASLVEDLG 55 Query: 254 RKRMPHIAQQLWLEMNSEGFLPCYVTLSALMLCYADNGLFSYAQTIWDEIINSSYVPNIK 433 RK+ PH+A QL + SEG LP TL ALMLCYA+NG AQ +W+E+++SS+V +++ Sbjct: 56 RKKKPHLAHQLVNTVKSEGLLPDNSTLCALMLCYANNGFVLEAQVVWEELLSSSFVLSVQ 115 Query: 434 VVSELIEAYGKMGRFDAVSRILREVTLRDFKLCPEVYSSAICCFGKGGQLELMETTMKEM 613 V+S+L++AYG++G F+ + I+ +V+ R+ L PEVYS AI CFGK GQLELME T+KEM Sbjct: 116 VLSDLMDAYGRIGCFNEIISIIDQVSCRNADLLPEVYSRAISCFGKQGQLELMENTLKEM 175 Query: 614 VSNGFPVDSATGNAFLQYYSKFGSLTAMEAAYGRLKRSQILIEKEGIRAMASAYIKEGKF 793 VS GF VDSATGNAF+ YYS+FGSLT ME AYGRLKRS+ LI+KEGIRA++ Y+KE KF Sbjct: 176 VSRGFSVDSATGNAFIIYYSRFGSLTEMETAYGRLKRSRHLIDKEGIRAVSFTYLKERKF 235 Query: 794 HKLGEFLRGVGLGRKNVGNLLWNFLLLSYAANFKMKSLQREFSRMAEAGFSPDLSTFNIR 973 LGEFLR VGLGRK++GNLLWN LLLSYA NFKMKSLQREF RM+EAGF PDL+TFNIR Sbjct: 236 FMLGEFLRDVGLGRKDLGNLLWNLLLLSYAGNFKMKSLQREFMRMSEAGFHPDLTTFNIR 295 Query: 974 ALAFSRMALFWDLHVSLEHMKHEKVVPDLVTYGCVVDAFLDRKIGRNLNFALNKMNTNDS 1153 A+AFSRM++FWDLH+SLEHMKHE V PDLVTYGCVVDA+LD+++GRNL+F L+KMN +DS Sbjct: 296 AVAFSRMSMFWDLHLSLEHMKHESVGPDLVTYGCVVDAYLDKRLGRNLDFGLSKMNLDDS 355 Query: 1154 PLVSTDQLVFEVLGKGDFHSSSEAFLECNRQRNWTYRKLIALYLRKKYRSNQLFWNY 1324 P+VSTD VFE GKGDFHSSSEAFLE RQR WTYRKLIA+YL+K+ R NQ+FWNY Sbjct: 356 PVVSTDPYVFEAFGKGDFHSSSEAFLEFKRQRKWTYRKLIAVYLKKQLRRNQIFWNY 412 >ref|XP_006372219.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|550318750|gb|ERP50016.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 428 Score = 523 bits (1347), Expect = e-145 Identities = 261/430 (60%), Positives = 338/430 (78%), Gaps = 1/430 (0%) Frame = +2 Query: 38 MGVVSLLATSHCFSLSLFKCLRPKFHALYSHQNKDSVGRSVAREIIWHQKRQASATGKDI 217 M +++A + C++ ++ +PK A++S + +D R++A+++I KR GK+ Sbjct: 1 METKTVIAATTCYA-NVIGSYKPKRFAIFSIK-RDPKKRALAQKMIRQWKRDQGVFGKET 58 Query: 218 YLNFASVLRLLGRKRMPHIAQQLWLEMNSEGFLPCYVTLSALMLCYADNGLFSYAQTIWD 397 + AS+++ L + R PH+A++L LE+ EGFLP TLSA+MLCYAD+GL AQ IW+ Sbjct: 59 CADCASLIQTLCKHRRPHLAEELLLELKCEGFLPDNRTLSAMMLCYADSGLLPQAQAIWE 118 Query: 398 EIINSSYVPNIKVVSELIEAYGKMGRFDAVSRILREVT-LRDFKLCPEVYSSAICCFGKG 574 E++ SS+VP+++V+S+LI+ Y K G FD V +IL +++ LR F P+VYS AI CFGKG Sbjct: 119 EMLYSSFVPSVQVISDLIDIYAKSGLFDEVIKILDQLSSLRTFDFLPQVYSLAISCFGKG 178 Query: 575 GQLELMETTMKEMVSNGFPVDSATGNAFLQYYSKFGSLTAMEAAYGRLKRSQILIEKEGI 754 GQLELME T+K+MVS GF VDSATGNAF+ YYS GSL MEAAY RLKRS++LIE+EGI Sbjct: 179 GQLELMEDTLKKMVSKGFWVDSATGNAFVVYYSLHGSLAEMEAAYDRLKRSRLLIEREGI 238 Query: 755 RAMASAYIKEGKFHKLGEFLRGVGLGRKNVGNLLWNFLLLSYAANFKMKSLQREFSRMAE 934 RAM+ AYIKE KF+ L EFLR VGLGRKN+GNL+WN LLLSY+ANFKMK+LQREF M E Sbjct: 239 RAMSFAYIKERKFYGLSEFLRDVGLGRKNLGNLIWNLLLLSYSANFKMKTLQREFLNMLE 298 Query: 935 AGFSPDLSTFNIRALAFSRMALFWDLHVSLEHMKHEKVVPDLVTYGCVVDAFLDRKIGRN 1114 AGF PDL+TFNIRALAFSRM+L WDLH+ LEHMKH+KV PDLVTYGC+VDA+LDR++ RN Sbjct: 299 AGFHPDLTTFNIRALAFSRMSLLWDLHLGLEHMKHDKVAPDLVTYGCIVDAYLDRRLVRN 358 Query: 1115 LNFALNKMNTNDSPLVSTDQLVFEVLGKGDFHSSSEAFLECNRQRNWTYRKLIALYLRKK 1294 L FAL+KM+ ++SP++STD VFEV GKGDFHSSSEAF+E RQR WTYR+LI +YLRK+ Sbjct: 359 LEFALSKMHVDNSPVLSTDPFVFEVFGKGDFHSSSEAFMEFKRQRKWTYRELIKIYLRKQ 418 Query: 1295 YRSNQLFWNY 1324 +RS +FWNY Sbjct: 419 HRSKHIFWNY 428 >ref|XP_002511816.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223548996|gb|EEF50485.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 427 Score = 520 bits (1340), Expect = e-145 Identities = 252/402 (62%), Positives = 326/402 (81%) Frame = +2 Query: 119 LYSHQNKDSVGRSVAREIIWHQKRQASATGKDIYLNFASVLRLLGRKRMPHIAQQLWLEM 298 L+S Q +D R +AR+II+ K+ S + K++ + AS+++ L KR PH+AQ++ LEM Sbjct: 29 LFSSQ-RDPTNRPLARKIIYQWKQDQSFSCKEV--DCASLVQNLHSKRTPHLAQEILLEM 85 Query: 299 NSEGFLPCYVTLSALMLCYADNGLFSYAQTIWDEIINSSYVPNIKVVSELIEAYGKMGRF 478 S+G++ TLSA++LCYADNGL AQ IW ++N S+ P+I++VS LI+AY K G F Sbjct: 86 KSQGYVLNNPTLSAILLCYADNGLLPQAQAIWKHMLNGSFTPSIQIVSRLIDAYSKKGHF 145 Query: 479 DAVSRILREVTLRDFKLCPEVYSSAICCFGKGGQLELMETTMKEMVSNGFPVDSATGNAF 658 + V IL +++ +F L E YS AI CFGKGGQL+LME +K+MV GFPVD ATGNAF Sbjct: 146 NEVMNILDQLSYSNFSLLHEAYSLAISCFGKGGQLQLMENALKDMVLRGFPVDYATGNAF 205 Query: 659 LQYYSKFGSLTAMEAAYGRLKRSQILIEKEGIRAMASAYIKEGKFHKLGEFLRGVGLGRK 838 ++YYS GSLT ME+AY RLKRS+ L+++EGIRA++ AY+KE KF++LGEFLR VGLGRK Sbjct: 206 IRYYSIHGSLTDMESAYSRLKRSRHLVDREGIRAVSLAYVKERKFYRLGEFLRDVGLGRK 265 Query: 839 NVGNLLWNFLLLSYAANFKMKSLQREFSRMAEAGFSPDLSTFNIRALAFSRMALFWDLHV 1018 +VGNL+WNFLLLS+AANFKMKSLQREF RM EAGF PD++TFNIRALAFSRM+L WDLH+ Sbjct: 266 DVGNLIWNFLLLSFAANFKMKSLQREFLRMLEAGFHPDVTTFNIRALAFSRMSLLWDLHL 325 Query: 1019 SLEHMKHEKVVPDLVTYGCVVDAFLDRKIGRNLNFALNKMNTNDSPLVSTDQLVFEVLGK 1198 +LEHMKHEKV PD+VTYGC+VDA+LDR++G+NL+FA+ KMN + SP++ TD VFEVLGK Sbjct: 326 TLEHMKHEKVSPDIVTYGCIVDAYLDRRLGKNLDFAIKKMNLDGSPVLLTDPFVFEVLGK 385 Query: 1199 GDFHSSSEAFLECNRQRNWTYRKLIALYLRKKYRSNQLFWNY 1324 GDFHSS+EAFLE RQR WTYR+L+++YLRK+YRSNQ+FWNY Sbjct: 386 GDFHSSAEAFLEFKRQRKWTYRELVSIYLRKQYRSNQIFWNY 427 >emb|CAN79718.1| hypothetical protein VITISV_012741 [Vitis vinifera] Length = 446 Score = 511 bits (1315), Expect = e-142 Identities = 256/386 (66%), Positives = 301/386 (77%) Frame = +2 Query: 167 EIIWHQKRQASATGKDIYLNFASVLRLLGRKRMPHIAQQLWLEMNSEGFLPCYVTLSALM 346 E+ WH K++ S GKD Y+++ +++ L RKR+PH+AQ+L EM SE Sbjct: 100 ELFWHWKQERSVDGKDNYVDYTPLIQALSRKRLPHVAQELLFEMKSE------------- 146 Query: 347 LCYADNGLFSYAQTIWDEIINSSYVPNIKVVSELIEAYGKMGRFDAVSRILREVTLRDFK 526 DNGLF AQ +WDEIINSS+ PNI++VS+LI+AYGKMG F V+RIL + Sbjct: 147 ----DNGLFPKAQALWDEIINSSFGPNIQIVSKLIDAYGKMGHFGEVTRILHQ------- 195 Query: 527 LCPEVYSSAICCFGKGGQLELMETTMKEMVSNGFPVDSATGNAFLQYYSKFGSLTAMEAA 706 GGQLE+ME +KEMVS GFPVDSATGNAF++YYS FGSLT MEAA Sbjct: 196 ---------------GGQLEMMENALKEMVSRGFPVDSATGNAFIRYYSIFGSLTEMEAA 240 Query: 707 YGRLKRSQILIEKEGIRAMASAYIKEGKFHKLGEFLRGVGLGRKNVGNLLWNFLLLSYAA 886 Y RLK+S+ILIE+EGIRAM+ AYIKE K+++LG+FLR VGLGRKNVGNLLWN LLLSYAA Sbjct: 241 YDRLKKSRILIEEEGIRAMSFAYIKEKKYYRLGQFLRDVGLGRKNVGNLLWNLLLLSYAA 300 Query: 887 NFKMKSLQREFSRMAEAGFSPDLSTFNIRALAFSRMALFWDLHVSLEHMKHEKVVPDLVT 1066 NFKMKSLQREF M EAGF+PDL+TFNIRALAFSRM+LFWDLH+SLEHM+H KVV DLVT Sbjct: 301 NFKMKSLQREFLEMVEAGFAPDLTTFNIRALAFSRMSLFWDLHLSLEHMQHVKVVADLVT 360 Query: 1067 YGCVVDAFLDRKIGRNLNFALNKMNTNDSPLVSTDQLVFEVLGKGDFHSSSEAFLECNRQ 1246 YGCVVDA+LDR++G+NL+FAL KMN +DSPLVSTD VFEVLGKGDFHSSSEAFLE R Sbjct: 361 YGCVVDAYLDRRLGKNLDFALKKMNMDDSPLVSTDHFVFEVLGKGDFHSSSEAFLESKRN 420 Query: 1247 RNWTYRKLIALYLRKKYRSNQLFWNY 1324 WTYRKLIA YL+KKYRSNQ+FWNY Sbjct: 421 GKWTYRKLIATYLKKKYRSNQIFWNY 446 >ref|XP_007014560.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma cacao] gi|508784923|gb|EOY32179.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma cacao] Length = 429 Score = 496 bits (1277), Expect = e-137 Identities = 252/404 (62%), Positives = 313/404 (77%), Gaps = 4/404 (0%) Frame = +2 Query: 125 SHQNKDSVGRSVAREIIWHQKRQASATGKDIYLNFASVLRLLGRKRMP--HIAQQLWLEM 298 S+ N R + R +W + G+D +++F S+L+ L K+MP H+ L L+ Sbjct: 32 SNNNLPLARRQIIR--LWKRDGSILGVGRDNFVDFDSLLQTLASKKMPQPHVVHHLLLQ- 88 Query: 299 NSEGFLPCYVTLSALMLCYADNGLFSYAQTIWDEIINS-SYVPNIKVVSELIEAYGKMGR 475 G +P TLS +ML YADNGLF AQ IW+E++N+ S+ P+I+VVS+ ++AYGKMG Sbjct: 89 ---GLIPNNSTLSEIMLWYADNGLFPQAQAIWEEMLNTTSFTPSIQVVSKFMDAYGKMGH 145 Query: 476 FDAVSRILREVTLRDFKLCPEVYSSAICCFGKGGQLELMETTMKEMVSNGFPVDSATGNA 655 F V +IL V L L PEVY AI CFGK G+L+LME T+KEMVS G PVDSATGNA Sbjct: 146 FHKVHKILDRVILLRVNLLPEVYPVAISCFGKHGRLDLMENTLKEMVSRGLPVDSATGNA 205 Query: 656 FLQYYSKFGSLTAMEAAYGRLKRSQILIEKEGIRAMASAYIKEGKFHKLGEFLRGVGLGR 835 F++YYS FGSL+ ME AY RLKRS+ LIE+EGIRAM+SAYIKEGKF++LGEFL +GLGR Sbjct: 206 FVRYYSIFGSLSEMEIAYARLKRSRHLIEEEGIRAMSSAYIKEGKFYRLGEFLNDLGLGR 265 Query: 836 KNVGNLLWNFLLLSYAANFKMKSLQREFSRMAEAGFSPDLSTFNIRALAFSRMALFWDLH 1015 +N+GNLLWN LLLSYAANFKMK++QR F +M ++GF PDL+TFNIRA AFSRM++FWDLH Sbjct: 266 RNLGNLLWNLLLLSYAANFKMKTMQRLFLKMMDSGFRPDLTTFNIRAWAFSRMSMFWDLH 325 Query: 1016 VSLEHMKHEKVVPDLVTYGCVVDAFLDRKIGRNLNFALNKMNTNDSPLVSTDQLVFEVLG 1195 +SLEHMKHE VV DLVTYGCVVDA+LDR++ RNL+FALN MN +DSPLV TD LVFE LG Sbjct: 326 LSLEHMKHESVVSDLVTYGCVVDAYLDRRLARNLDFALNHMNADDSPLVLTDPLVFEALG 385 Query: 1196 KGDFHSSSEAFLECNRQ-RNWTYRKLIALYLRKKYRSNQLFWNY 1324 KGDFHSS+EAFLE RQ + WTYR+LIA+YL+K+ R NQ+FWNY Sbjct: 386 KGDFHSSAEAFLEFKRQKKKWTYRQLIAVYLKKQLRRNQIFWNY 429 >ref|XP_006360648.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like isoform X1 [Solanum tuberosum] gi|565389826|ref|XP_006360649.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like isoform X2 [Solanum tuberosum] Length = 416 Score = 494 bits (1272), Expect = e-137 Identities = 251/423 (59%), Positives = 320/423 (75%) Frame = +2 Query: 56 LATSHCFSLSLFKCLRPKFHALYSHQNKDSVGRSVAREIIWHQKRQASATGKDIYLNFAS 235 +A S+++ LRP +L SHQN+ S A++ W K+ + + Y + AS Sbjct: 1 MAAGLVVSIAVTPKLRP--FSLISHQNQSS-----AQKRRWRMKQGGNIDPRGNYADCAS 53 Query: 236 VLRLLGRKRMPHIAQQLWLEMNSEGFLPCYVTLSALMLCYADNGLFSYAQTIWDEIINSS 415 +++ L RK++P A++L LEM SEGF+P TLSALMLCYA NGLF A WDEI+NSS Sbjct: 54 LIQGLSRKKLPVAAERLVLEMKSEGFVPDSSTLSALMLCYASNGLFYKALAAWDEIMNSS 113 Query: 416 YVPNIKVVSELIEAYGKMGRFDAVSRILREVTLRDFKLCPEVYSSAICCFGKGGQLELME 595 ++P++ V++ELI+ Y G D RIL ++ L+D L +VY+ AI FGK GQLELME Sbjct: 114 FLPDVHVIAELIDIYVCKGYLDVAVRILHQIQLKDSNLLRDVYAQAISRFGKKGQLELME 173 Query: 596 TTMKEMVSNGFPVDSATGNAFLQYYSKFGSLTAMEAAYGRLKRSQILIEKEGIRAMASAY 775 +KEMVS GFPVDS TGNA++ YYS FG L+ ME AYGRLK S+ILIE+E IR+++ AY Sbjct: 174 VMLKEMVSMGFPVDSTTGNAYVIYYSNFGMLSEMEVAYGRLKMSRILIEEEAIRSISLAY 233 Query: 776 IKEGKFHKLGEFLRGVGLGRKNVGNLLWNFLLLSYAANFKMKSLQREFSRMAEAGFSPDL 955 +K+ KF+ LG+F+R VGL R+NVGNLLWN LLLSYAANFKMKSLQREF RM E+GF PDL Sbjct: 234 LKKEKFYSLGQFVRDVGLCRRNVGNLLWNLLLLSYAANFKMKSLQREFVRMVESGFFPDL 293 Query: 956 STFNIRALAFSRMALFWDLHVSLEHMKHEKVVPDLVTYGCVVDAFLDRKIGRNLNFALNK 1135 +TFNIRALAFS+M+LFWDLHV+LEHMKHEKVVPDLVTYG VVDA+LDR +GRNL+FAL K Sbjct: 294 NTFNIRALAFSKMSLFWDLHVTLEHMKHEKVVPDLVTYGSVVDAYLDRGLGRNLDFALRK 353 Query: 1136 MNTNDSPLVSTDQLVFEVLGKGDFHSSSEAFLECNRQRNWTYRKLIALYLRKKYRSNQLF 1315 +N ND +V+T+ LVFE +GKGDFH SS+A LE ++ +NWTY +LI YL+K +R NQ+F Sbjct: 354 LNINDCVIVATEPLVFEAIGKGDFHLSSDARLEFSKNKNWTYEELITTYLKKYFRRNQIF 413 Query: 1316 WNY 1324 WNY Sbjct: 414 WNY 416 >ref|XP_006850970.1| hypothetical protein AMTR_s00025p00206120 [Amborella trichopoda] gi|548854641|gb|ERN12551.1| hypothetical protein AMTR_s00025p00206120 [Amborella trichopoda] Length = 354 Score = 491 bits (1263), Expect = e-136 Identities = 237/354 (66%), Positives = 282/354 (79%) Frame = +2 Query: 263 MPHIAQQLWLEMNSEGFLPCYVTLSALMLCYADNGLFSYAQTIWDEIINSSYVPNIKVVS 442 MPH+ Q+L+ E+ S+ F TLSALM+C A+NGLFS + IW EIINSS+ +I VVS Sbjct: 1 MPHVVQRLFTEIESQNFRTGCTTLSALMICCAENGLFSLSNAIWTEIINSSFELDIGVVS 60 Query: 443 ELIEAYGKMGRFDAVSRILREVTLRDFKLCPEVYSSAICCFGKGGQLELMETTMKEMVSN 622 EL+ AYGK +D V R+L E R+F LCPE+Y+ AI CFGKG QLELME T+KEMVS Sbjct: 61 ELMHAYGKANLYDEVYRMLNEAISREFNLCPEIYTVAISCFGKGAQLELMEATIKEMVSR 120 Query: 623 GFPVDSATGNAFLQYYSKFGSLTAMEAAYGRLKRSQILIEKEGIRAMASAYIKEGKFHKL 802 GF VDS TGNAF+ YYS FGSL ME AYGRLK S+ILIE+E IRAMASAYI+E KF K+ Sbjct: 121 GFKVDSNTGNAFIIYYSSFGSLAEMEIAYGRLKCSRILIEREAIRAMASAYIRERKFFKM 180 Query: 803 GEFLRGVGLGRKNVGNLLWNFLLLSYAANFKMKSLQREFSRMAEAGFSPDLSTFNIRALA 982 GEFLR VGLGR+N GNLLWN LLLSYAANFKMKSLQR F M EAGFSPD++TFNIR LA Sbjct: 181 GEFLRDVGLGRRNSGNLLWNLLLLSYAANFKMKSLQRTFLGMLEAGFSPDITTFNIRTLA 240 Query: 983 FSRMALFWDLHVSLEHMKHEKVVPDLVTYGCVVDAFLDRKIGRNLNFALNKMNTNDSPLV 1162 FSRM +FWDLH+S+EHM+H V+PDLVTYGC+VDA+++R+ GRNL F L MN + SPL+ Sbjct: 241 FSRMCMFWDLHLSIEHMRHMNVIPDLVTYGCIVDAYVERRFGRNLGFGLKCMNLDSSPLI 300 Query: 1163 STDQLVFEVLGKGDFHSSSEAFLECNRQRNWTYRKLIALYLRKKYRSNQLFWNY 1324 TD +V+EV GKGDFHSSSEA LE ++ WTY KL+A YL+K+YRSNQ+FWNY Sbjct: 301 LTDPIVYEVFGKGDFHSSSEALLELKWKKEWTYSKLVAFYLKKRYRSNQIFWNY 354 >ref|XP_004152890.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like [Cucumis sativus] gi|449507537|ref|XP_004163059.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like [Cucumis sativus] Length = 388 Score = 488 bits (1257), Expect = e-135 Identities = 236/384 (61%), Positives = 303/384 (78%) Frame = +2 Query: 173 IWHQKRQASATGKDIYLNFASVLRLLGRKRMPHIAQQLWLEMNSEGFLPCYVTLSALMLC 352 I+ Q + S+ +N + V++ L R+RMP +A++++LE+ SEGF TLS +M+ Sbjct: 5 IFQQHNEGSSVDDSFNINNSQVIKKLSRRRMPILAKEIFLELKSEGFPLNNSTLSTIMVH 64 Query: 353 YADNGLFSYAQTIWDEIINSSYVPNIKVVSELIEAYGKMGRFDAVSRILREVTLRDFKLC 532 Y D+G AQ +W+E++NS + P+++V+S+L AYGKMG FD ++++L +V LR L Sbjct: 65 YIDDGSPLQAQAMWEEMLNSCFEPSVQVISKLFNAYGKMGHFDYITKVLDQVKLRYSHLL 124 Query: 533 PEVYSSAICCFGKGGQLELMETTMKEMVSNGFPVDSATGNAFLQYYSKFGSLTAMEAAYG 712 PE YS AI CFGK QLELME+T++EMVS+GF V+SATGN+F+ YYS FGSL ME AYG Sbjct: 125 PEAYSLAISCFGKHKQLELMESTLREMVSSGFTVNSATGNSFIIYYSMFGSLVEMETAYG 184 Query: 713 RLKRSQILIEKEGIRAMASAYIKEGKFHKLGEFLRGVGLGRKNVGNLLWNFLLLSYAANF 892 RLKRS+ LIEK+GI AMA AYI++ KF++LGEFLR VGLGRKNVGNLLWN LLLSYAANF Sbjct: 185 RLKRSRFLIEKKGIMAMAFAYIRKRKFYRLGEFLRDVGLGRKNVGNLLWNLLLLSYAANF 244 Query: 893 KMKSLQREFSRMAEAGFSPDLSTFNIRALAFSRMALFWDLHVSLEHMKHEKVVPDLVTYG 1072 KMKSLQREF +M +AGF+PDL+TFNIRALAFSRM L WDLH+SLEHMKH + PDLVTYG Sbjct: 245 KMKSLQREFLQMVDAGFNPDLTTFNIRALAFSRMDLLWDLHLSLEHMKHMNIEPDLVTYG 304 Query: 1073 CVVDAFLDRKIGRNLNFALNKMNTNDSPLVSTDQLVFEVLGKGDFHSSSEAFLECNRQRN 1252 CVVDA++DR++GRNL F L+KMN + P+ TD VFE LGKGDFH SSEAF++ +Q+ Sbjct: 305 CVVDAYVDRRLGRNLEFILSKMNPDQPPVSLTDSFVFEALGKGDFHMSSEAFMQFRKQKK 364 Query: 1253 WTYRKLIALYLRKKYRSNQLFWNY 1324 WTYR+LI+LYL+K +R NQ+FWNY Sbjct: 365 WTYRELISLYLKKHHRRNQVFWNY 388 >ref|XP_006360650.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like isoform X3 [Solanum tuberosum] Length = 409 Score = 486 bits (1251), Expect = e-134 Identities = 242/390 (62%), Positives = 305/390 (78%), Gaps = 5/390 (1%) Frame = +2 Query: 170 IIWHQKRQASATGKDI-----YLNFASVLRLLGRKRMPHIAQQLWLEMNSEGFLPCYVTL 334 +I HQ + ++ G +I Y + AS+++ L RK++P A++L LEM SEGF+P TL Sbjct: 20 LISHQNQSSAQKGGNIDPRGNYADCASLIQGLSRKKLPVAAERLVLEMKSEGFVPDSSTL 79 Query: 335 SALMLCYADNGLFSYAQTIWDEIINSSYVPNIKVVSELIEAYGKMGRFDAVSRILREVTL 514 SALMLCYA NGLF A WDEI+NSS++P++ V++ELI+ Y G D RIL ++ L Sbjct: 80 SALMLCYASNGLFYKALAAWDEIMNSSFLPDVHVIAELIDIYVCKGYLDVAVRILHQIQL 139 Query: 515 RDFKLCPEVYSSAICCFGKGGQLELMETTMKEMVSNGFPVDSATGNAFLQYYSKFGSLTA 694 +D L +VY+ AI FGK GQLELME +KEMVS GFPVDS TGNA++ YYS FG L+ Sbjct: 140 KDSNLLRDVYAQAISRFGKKGQLELMEVMLKEMVSMGFPVDSTTGNAYVIYYSNFGMLSE 199 Query: 695 MEAAYGRLKRSQILIEKEGIRAMASAYIKEGKFHKLGEFLRGVGLGRKNVGNLLWNFLLL 874 ME AYGRLK S+ILIE+E IR+++ AY+K+ KF+ LG+F+R VGL R+NVGNLLWN LLL Sbjct: 200 MEVAYGRLKMSRILIEEEAIRSISLAYLKKEKFYSLGQFVRDVGLCRRNVGNLLWNLLLL 259 Query: 875 SYAANFKMKSLQREFSRMAEAGFSPDLSTFNIRALAFSRMALFWDLHVSLEHMKHEKVVP 1054 SYAANFKMKSLQREF RM E+GF PDL+TFNIRALAFS+M+LFWDLHV+LEHMKHEKVVP Sbjct: 260 SYAANFKMKSLQREFVRMVESGFFPDLNTFNIRALAFSKMSLFWDLHVTLEHMKHEKVVP 319 Query: 1055 DLVTYGCVVDAFLDRKIGRNLNFALNKMNTNDSPLVSTDQLVFEVLGKGDFHSSSEAFLE 1234 DLVTYG VVDA+LDR +GRNL+FAL K+N ND +V+T+ LVFE +GKGDFH SS+A LE Sbjct: 320 DLVTYGSVVDAYLDRGLGRNLDFALRKLNINDCVIVATEPLVFEAIGKGDFHLSSDARLE 379 Query: 1235 CNRQRNWTYRKLIALYLRKKYRSNQLFWNY 1324 ++ +NWTY +LI YL+K +R NQ+FWNY Sbjct: 380 FSKNKNWTYEELITTYLKKYFRRNQIFWNY 409 >ref|XP_004240282.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like [Solanum lycopersicum] Length = 381 Score = 485 bits (1248), Expect = e-134 Identities = 239/369 (64%), Positives = 296/369 (80%) Frame = +2 Query: 218 YLNFASVLRLLGRKRMPHIAQQLWLEMNSEGFLPCYVTLSALMLCYADNGLFSYAQTIWD 397 Y + AS+++ L RK++P A++L LEM SEGF+P TLSALMLCYA NGLF A WD Sbjct: 13 YRDCASLIQGLSRKKLPVAAERLVLEMKSEGFVPDSSTLSALMLCYATNGLFCKALAAWD 72 Query: 398 EIINSSYVPNIKVVSELIEAYGKMGRFDAVSRILREVTLRDFKLCPEVYSSAICCFGKGG 577 EI+NSS++P++ V++ELI+ YG G D RIL ++ L+D L +VY+ AI FGK G Sbjct: 73 EIMNSSFLPDVHVIAELIDIYGCKGYLDVAVRILHQIQLKDSNLLRDVYAQAISRFGKKG 132 Query: 578 QLELMETTMKEMVSNGFPVDSATGNAFLQYYSKFGSLTAMEAAYGRLKRSQILIEKEGIR 757 QLELME ++EMVS GFPVDS TGNA++ YYS FG+L+ ME AYGRLK S+ILIE+E IR Sbjct: 133 QLELMEVMLEEMVSMGFPVDSTTGNAYVIYYSNFGTLSEMEVAYGRLKMSRILIEEEAIR 192 Query: 758 AMASAYIKEGKFHKLGEFLRGVGLGRKNVGNLLWNFLLLSYAANFKMKSLQREFSRMAEA 937 +++ AY+K+ KF+ LG+F+R VGL R+NVGNLLWN LLLSYAANFKMKSLQREF RM E+ Sbjct: 193 SISLAYLKKEKFYSLGQFVRDVGLCRRNVGNLLWNLLLLSYAANFKMKSLQREFVRMVES 252 Query: 938 GFSPDLSTFNIRALAFSRMALFWDLHVSLEHMKHEKVVPDLVTYGCVVDAFLDRKIGRNL 1117 GF PDL+TFNIRALAFS+M+LFWDLHV+LEHMKHEKVVPDLVTYG VVDA+LDR +GRNL Sbjct: 253 GFFPDLNTFNIRALAFSKMSLFWDLHVTLEHMKHEKVVPDLVTYGSVVDAYLDRGLGRNL 312 Query: 1118 NFALNKMNTNDSPLVSTDQLVFEVLGKGDFHSSSEAFLECNRQRNWTYRKLIALYLRKKY 1297 +FAL K+NTND V+T+ LVFE +GKGDFH SSEA LE +++ NWTY LI YL+K + Sbjct: 313 DFALRKLNTNDCVTVATEPLVFEAMGKGDFHLSSEARLEFSKKTNWTYEVLITTYLKKYF 372 Query: 1298 RSNQLFWNY 1324 R NQ+FWNY Sbjct: 373 RRNQIFWNY 381 >ref|XP_006372218.1| hypothetical protein POPTR_0018s14360g [Populus trichocarpa] gi|550318749|gb|ERP50015.1| hypothetical protein POPTR_0018s14360g [Populus trichocarpa] Length = 392 Score = 478 bits (1231), Expect = e-132 Identities = 245/429 (57%), Positives = 314/429 (73%) Frame = +2 Query: 38 MGVVSLLATSHCFSLSLFKCLRPKFHALYSHQNKDSVGRSVAREIIWHQKRQASATGKDI 217 M +++A + C++ ++ +PK A++S + +D R++A+++I KR GK+ Sbjct: 1 METKTVIAATTCYA-NVIGSYKPKRFAIFSIK-RDPKKRALAQKMIRQWKRDQGVFGKET 58 Query: 218 YLNFASVLRLLGRKRMPHIAQQLWLEMNSEGFLPCYVTLSALMLCYADNGLFSYAQTIWD 397 + AS+++ L + R PH+A++L LE+ EGFLP TLSA+MLCYAD+GL AQ IW+ Sbjct: 59 CADCASLIQTLCKHRRPHLAEELLLELKCEGFLPDNRTLSAMMLCYADSGLLPQAQAIWE 118 Query: 398 EIINSSYVPNIKVVSELIEAYGKMGRFDAVSRILREVTLRDFKLCPEVYSSAICCFGKGG 577 E++ SS+VP++ +VYS AI CFGKGG Sbjct: 119 EMLYSSFVPSV-----------------------------------QVYSLAISCFGKGG 143 Query: 578 QLELMETTMKEMVSNGFPVDSATGNAFLQYYSKFGSLTAMEAAYGRLKRSQILIEKEGIR 757 QLELME T+K+MVS GF VDSATGNAF+ YYS GSL MEAAY RLKRS++LIE+EGIR Sbjct: 144 QLELMEDTLKKMVSKGFWVDSATGNAFVVYYSLHGSLAEMEAAYDRLKRSRLLIEREGIR 203 Query: 758 AMASAYIKEGKFHKLGEFLRGVGLGRKNVGNLLWNFLLLSYAANFKMKSLQREFSRMAEA 937 AM+ AYIKE KF+ L EFLR VGLGRKN+GNL+WN LLLSY+ANFKMK+LQREF M EA Sbjct: 204 AMSFAYIKERKFYGLSEFLRDVGLGRKNLGNLIWNLLLLSYSANFKMKTLQREFLNMLEA 263 Query: 938 GFSPDLSTFNIRALAFSRMALFWDLHVSLEHMKHEKVVPDLVTYGCVVDAFLDRKIGRNL 1117 GF PDL+TFNIRALAFSRM+L WDLH+ LEHMKH+KV PDLVTYGC+VDA+LDR++ RNL Sbjct: 264 GFHPDLTTFNIRALAFSRMSLLWDLHLGLEHMKHDKVAPDLVTYGCIVDAYLDRRLVRNL 323 Query: 1118 NFALNKMNTNDSPLVSTDQLVFEVLGKGDFHSSSEAFLECNRQRNWTYRKLIALYLRKKY 1297 FAL+KM+ ++SP++STD VFEV GKGDFHSSSEAF+E RQR WTYR+LI +YLRK++ Sbjct: 324 EFALSKMHVDNSPVLSTDPFVFEVFGKGDFHSSSEAFMEFKRQRKWTYRELIKIYLRKQH 383 Query: 1298 RSNQLFWNY 1324 RS +FWNY Sbjct: 384 RSKHIFWNY 392 >ref|XP_006573403.1| PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like [Glycine max] Length = 415 Score = 470 bits (1210), Expect = e-130 Identities = 240/418 (57%), Positives = 300/418 (71%), Gaps = 2/418 (0%) Frame = +2 Query: 77 SLSLFKCLRPKFHALYSHQNKDSVGRSVARE--IIWHQKRQASATGKDIYLNFASVLRLL 250 SLSL C P KDS S ++ +IW Q + GKD ++ +S+ + Sbjct: 5 SLSLPSCRMPILL-------KDSHSGSPQQQNKLIWWQNEKGVIGGKDNSVDCSSLAQNS 57 Query: 251 GRKRMPHIAQQLWLEMNSEGFLPCYVTLSALMLCYADNGLFSYAQTIWDEIINSSYVPNI 430 RKRM H + ++ EG++P +L ML Y +NG F AQT+W++++NSS+VP++ Sbjct: 58 SRKRMIHQSDGSLHDIKVEGYMPKQTSLCVSMLYYTENGFFPQAQTLWEQLVNSSFVPSV 117 Query: 431 KVVSELIEAYGKMGRFDAVSRILREVTLRDFKLCPEVYSSAICCFGKGGQLELMETTMKE 610 + +S L +AY K +FD V ILR V +R+F + P+VY AI CFG+ GQLELME E Sbjct: 118 QFISRLFDAYAKHRKFDVVIDILRYVDMRNFSILPDVYWLAISCFGREGQLELMEDMANE 177 Query: 611 MVSNGFPVDSATGNAFLQYYSKFGSLTAMEAAYGRLKRSQILIEKEGIRAMASAYIKEGK 790 M S+G + S T NAFL YYS FG+L ME YGRLK+S+ LIEKE IRA+ASAYIKE K Sbjct: 178 MASSGVHIYSRTANAFLLYYSLFGTLEEMENTYGRLKKSRFLIEKEVIRAVASAYIKERK 237 Query: 791 FHKLGEFLRGVGLGRKNVGNLLWNFLLLSYAANFKMKSLQREFSRMAEAGFSPDLSTFNI 970 F++LGEFLR VGL RKNVGNLLWN +LLSYAANFKMKSLQREF M E+GF PD++TFNI Sbjct: 238 FYELGEFLRDVGLRRKNVGNLLWNLMLLSYAANFKMKSLQREFIGMVESGFRPDITTFNI 297 Query: 971 RALAFSRMALFWDLHVSLEHMKHEKVVPDLVTYGCVVDAFLDRKIGRNLNFALNKMNTND 1150 RALAFSRMALFWDLH+S+EHM+H K++PDLVT+GCVVDA+LDR++GRNL+FALNKMN +D Sbjct: 298 RALAFSRMALFWDLHLSIEHMEHTKIIPDLVTFGCVVDAYLDRRLGRNLDFALNKMNLDD 357 Query: 1151 SPLVSTDQLVFEVLGKGDFHSSSEAFLECNRQRNWTYRKLIALYLRKKYRSNQLFWNY 1324 SP + TD V+E LGKG F SSEAF E QR WTYR LI YL+K YR NQ+FWNY Sbjct: 358 SPRLLTDPFVYEALGKGGFQMSSEAFFEYKTQRKWTYRSLIQKYLKKHYRKNQIFWNY 415 >ref|XP_007134710.1| hypothetical protein PHAVU_010G069800g [Phaseolus vulgaris] gi|593265068|ref|XP_007134712.1| hypothetical protein PHAVU_010G069800g [Phaseolus vulgaris] gi|561007755|gb|ESW06704.1| hypothetical protein PHAVU_010G069800g [Phaseolus vulgaris] gi|561007757|gb|ESW06706.1| hypothetical protein PHAVU_010G069800g [Phaseolus vulgaris] Length = 423 Score = 469 bits (1207), Expect = e-129 Identities = 227/381 (59%), Positives = 290/381 (76%) Frame = +2 Query: 182 QKRQASATGKDIYLNFASVLRLLGRKRMPHIAQQLWLEMNSEGFLPCYVTLSALMLCYAD 361 + + ++ G ++ +S+L+ RKRM + ++ + +G++P +L LML Y + Sbjct: 43 RNEKGASGGMHSSVDSSSLLQKNSRKRMFPQSDGVFPDTKDDGYMPKQTSLCVLMLYYTE 102 Query: 362 NGLFSYAQTIWDEIINSSYVPNIKVVSELIEAYGKMGRFDAVSRILREVTLRDFKLCPEV 541 NGLF AQT W++++ SS+VP+++ +S L +AY K G+FD V ILR V +R+F + P V Sbjct: 103 NGLFPLAQTTWEQLLYSSFVPSVEFISRLFDAYAKHGKFDEVVNILRYVDMRNFSILPNV 162 Query: 542 YSSAICCFGKGGQLELMETTMKEMVSNGFPVDSATGNAFLQYYSKFGSLTAMEAAYGRLK 721 YS AICCFG+ GQLELME KEM S G V S TGNAF+ YYS FGSL ME AYGRLK Sbjct: 163 YSLAICCFGREGQLELMEDMAKEMASRGVHVSSKTGNAFVLYYSIFGSLKDMENAYGRLK 222 Query: 722 RSQILIEKEGIRAMASAYIKEGKFHKLGEFLRGVGLGRKNVGNLLWNFLLLSYAANFKMK 901 +S+ LIE+E IRAMASAY +E +F++LGEF+R VGLGRK++GNLLWN +LLSYA NFKMK Sbjct: 223 KSRFLIEREVIRAMASAYTRERQFYELGEFIRDVGLGRKDLGNLLWNLMLLSYAVNFKMK 282 Query: 902 SLQREFSRMAEAGFSPDLSTFNIRALAFSRMALFWDLHVSLEHMKHEKVVPDLVTYGCVV 1081 SLQ+EF +M E+GF PD++TFNIRALAFSRMALFWDLH+S+EHM+HE V+PDLVT+GCVV Sbjct: 283 SLQKEFLQMVESGFRPDITTFNIRALAFSRMALFWDLHLSIEHMEHENVIPDLVTFGCVV 342 Query: 1082 DAFLDRKIGRNLNFALNKMNTNDSPLVSTDQLVFEVLGKGDFHSSSEAFLECNRQRNWTY 1261 DA+LDR +GRNLNFALNKMN +DSP++ TD V+E LGKGDF SSEAF E R WTY Sbjct: 343 DAYLDRGLGRNLNFALNKMNLDDSPMLLTDPFVYEALGKGDFQMSSEAFFEFKTHRKWTY 402 Query: 1262 RKLIALYLRKKYRSNQLFWNY 1324 R LI YL+K YR NQ+FWNY Sbjct: 403 RALIQKYLKKHYRRNQIFWNY 423 >ref|XP_007134709.1| hypothetical protein PHAVU_010G069800g [Phaseolus vulgaris] gi|593265066|ref|XP_007134711.1| hypothetical protein PHAVU_010G069800g [Phaseolus vulgaris] gi|561007754|gb|ESW06703.1| hypothetical protein PHAVU_010G069800g [Phaseolus vulgaris] gi|561007756|gb|ESW06705.1| hypothetical protein PHAVU_010G069800g [Phaseolus vulgaris] Length = 372 Score = 468 bits (1205), Expect = e-129 Identities = 226/365 (61%), Positives = 283/365 (77%) Frame = +2 Query: 230 ASVLRLLGRKRMPHIAQQLWLEMNSEGFLPCYVTLSALMLCYADNGLFSYAQTIWDEIIN 409 +S+L+ RKRM + ++ + +G++P +L LML Y +NGLF AQT W++++ Sbjct: 8 SSLLQKNSRKRMFPQSDGVFPDTKDDGYMPKQTSLCVLMLYYTENGLFPLAQTTWEQLLY 67 Query: 410 SSYVPNIKVVSELIEAYGKMGRFDAVSRILREVTLRDFKLCPEVYSSAICCFGKGGQLEL 589 SS+VP+++ +S L +AY K G+FD V ILR V +R+F + P VYS AICCFG+ GQLEL Sbjct: 68 SSFVPSVEFISRLFDAYAKHGKFDEVVNILRYVDMRNFSILPNVYSLAICCFGREGQLEL 127 Query: 590 METTMKEMVSNGFPVDSATGNAFLQYYSKFGSLTAMEAAYGRLKRSQILIEKEGIRAMAS 769 ME KEM S G V S TGNAF+ YYS FGSL ME AYGRLK+S+ LIE+E IRAMAS Sbjct: 128 MEDMAKEMASRGVHVSSKTGNAFVLYYSIFGSLKDMENAYGRLKKSRFLIEREVIRAMAS 187 Query: 770 AYIKEGKFHKLGEFLRGVGLGRKNVGNLLWNFLLLSYAANFKMKSLQREFSRMAEAGFSP 949 AY +E +F++LGEF+R VGLGRK++GNLLWN +LLSYA NFKMKSLQ+EF +M E+GF P Sbjct: 188 AYTRERQFYELGEFIRDVGLGRKDLGNLLWNLMLLSYAVNFKMKSLQKEFLQMVESGFRP 247 Query: 950 DLSTFNIRALAFSRMALFWDLHVSLEHMKHEKVVPDLVTYGCVVDAFLDRKIGRNLNFAL 1129 D++TFNIRALAFSRMALFWDLH+S+EHM+HE V+PDLVT+GCVVDA+LDR +GRNLNFAL Sbjct: 248 DITTFNIRALAFSRMALFWDLHLSIEHMEHENVIPDLVTFGCVVDAYLDRGLGRNLNFAL 307 Query: 1130 NKMNTNDSPLVSTDQLVFEVLGKGDFHSSSEAFLECNRQRNWTYRKLIALYLRKKYRSNQ 1309 NKMN +DSP++ TD V+E LGKGDF SSEAF E R WTYR LI YL+K YR NQ Sbjct: 308 NKMNLDDSPMLLTDPFVYEALGKGDFQMSSEAFFEFKTHRKWTYRALIQKYLKKHYRRNQ 367 Query: 1310 LFWNY 1324 +FWNY Sbjct: 368 IFWNY 372 >gb|ABA18111.1| pentatricopeptide repeat protein [Arabidopsis arenosa] Length = 419 Score = 467 bits (1201), Expect = e-129 Identities = 233/416 (56%), Positives = 312/416 (75%) Frame = +2 Query: 77 SLSLFKCLRPKFHALYSHQNKDSVGRSVAREIIWHQKRQASATGKDIYLNFASVLRLLGR 256 +LS L+P+ L S DS S+AR++I K + K +++A +++ L + Sbjct: 5 NLSHHLSLKPQHLKLLSCYT-DSSAPSIARKLIKESKLSREFSRKIQIVDYAPLVQTLSQ 63 Query: 257 KRMPHIAQQLWLEMNSEGFLPCYVTLSALMLCYADNGLFSYAQTIWDEIINSSYVPNIKV 436 +R+P +A +++++ S LP Y TL ALMLC+A+NG A+TIWDEI+NSS+VP++ V Sbjct: 64 RRLPDVAHEIFIQTKSVNLLPNYRTLCALMLCFAENGFVLRARTIWDEILNSSFVPDVFV 123 Query: 437 VSELIEAYGKMGRFDAVSRILREVTLRDFKLCPEVYSSAICCFGKGGQLELMETTMKEMV 616 VS+LI AY ++G FD V++I ++V R L P VYS AI CFGK GQLELME ++EM Sbjct: 124 VSKLISAYEQLGFFDEVAKITKDVAARHSTLLPVVYSLAISCFGKNGQLELMEGVIEEMD 183 Query: 617 SNGFPVDSATGNAFLQYYSKFGSLTAMEAAYGRLKRSQILIEKEGIRAMASAYIKEGKFH 796 S G +DSAT NA ++Y+S FG+L +E AYGRLK+ I+IE+E IRA+ AY+K+ KF+ Sbjct: 184 SKGMSLDSATANAIVRYFSFFGTLDKIEHAYGRLKKFGIVIEEEEIRAVLLAYLKQRKFY 243 Query: 797 KLGEFLRGVGLGRKNVGNLLWNFLLLSYAANFKMKSLQREFSRMAEAGFSPDLSTFNIRA 976 +L EFL VGLGR+N+GN+LWN +LLSYAA FKMKSLQREF M +AGFSPDL+TFNIRA Sbjct: 244 RLREFLSDVGLGRRNLGNMLWNSVLLSYAAEFKMKSLQREFIEMLDAGFSPDLTTFNIRA 303 Query: 977 LAFSRMALFWDLHVSLEHMKHEKVVPDLVTYGCVVDAFLDRKIGRNLNFALNKMNTNDSP 1156 LAFSRMALFWDLH++LEHM+ +VPDLVT+GCVVDA++D+++ RNL F N+MN +DSP Sbjct: 304 LAFSRMALFWDLHLTLEHMRRLNIVPDLVTFGCVVDAYMDKRLARNLEFVYNQMNLDDSP 363 Query: 1157 LVSTDQLVFEVLGKGDFHSSSEAFLECNRQRNWTYRKLIALYLRKKYRSNQLFWNY 1324 +V TD L FEVLGKGDFH SSEA LE + ++NWTYRKLI +Y++KK R +Q+FWNY Sbjct: 364 VVLTDPLAFEVLGKGDFHLSSEAVLEFSTEKNWTYRKLIGVYVKKKLRRDQIFWNY 419 >ref|XP_007132326.1| hypothetical protein PHAVU_011G085400g [Phaseolus vulgaris] gi|593195390|ref|XP_007132327.1| hypothetical protein PHAVU_011G085400g [Phaseolus vulgaris] gi|561005326|gb|ESW04320.1| hypothetical protein PHAVU_011G085400g [Phaseolus vulgaris] gi|561005327|gb|ESW04321.1| hypothetical protein PHAVU_011G085400g [Phaseolus vulgaris] Length = 411 Score = 465 bits (1196), Expect = e-128 Identities = 226/386 (58%), Positives = 291/386 (75%) Frame = +2 Query: 167 EIIWHQKRQASATGKDIYLNFASVLRLLGRKRMPHIAQQLWLEMNSEGFLPCYVTLSALM 346 ++IW + + + G ++ +S+++ RKRM + ++ + +G++P +L LM Sbjct: 26 QMIWWRNEKGAFGGMHSSVDSSSLVQNNSRKRMFPQSDGVFHDTKDDGYMPKQTSLCVLM 85 Query: 347 LCYADNGLFSYAQTIWDEIINSSYVPNIKVVSELIEAYGKMGRFDAVSRILREVTLRDFK 526 L Y +NGLF AQT W++++ SS+VP+++ +S L +AY K G+FD V ILR V +R+F Sbjct: 86 LYYTENGLFPQAQTTWEQLLYSSFVPSVEFISRLFDAYAKHGKFDEVVNILRYVDMRNFS 145 Query: 527 LCPEVYSSAICCFGKGGQLELMETTMKEMVSNGFPVDSATGNAFLQYYSKFGSLTAMEAA 706 + P VYS AI CFG+ GQLELME KEM S G + S T NAF+ YYS FGSL ME A Sbjct: 146 ILPNVYSLAISCFGREGQLELMEDMAKEMASRGVHISSKTANAFVLYYSIFGSLKDMENA 205 Query: 707 YGRLKRSQILIEKEGIRAMASAYIKEGKFHKLGEFLRGVGLGRKNVGNLLWNFLLLSYAA 886 YGRLK+S+ LIE+E IRAMASAY +E +F++LGEFLR VGL RK+VGNLLWN +LLSYAA Sbjct: 206 YGRLKKSRFLIEREVIRAMASAYTRERQFYELGEFLRDVGLVRKDVGNLLWNLMLLSYAA 265 Query: 887 NFKMKSLQREFSRMAEAGFSPDLSTFNIRALAFSRMALFWDLHVSLEHMKHEKVVPDLVT 1066 NFKMKSLQ+EF +M E+GF PD++TFNIRALAFSRMALFWDLH+S+EHM+HE V+PDLVT Sbjct: 266 NFKMKSLQKEFLQMVESGFRPDITTFNIRALAFSRMALFWDLHLSIEHMEHENVIPDLVT 325 Query: 1067 YGCVVDAFLDRKIGRNLNFALNKMNTNDSPLVSTDQLVFEVLGKGDFHSSSEAFLECNRQ 1246 +GCVVDA+LDR +G+NLNFALNKMN +DSP++ TD V+E LGKGDF SSEAF E Sbjct: 326 FGCVVDAYLDRGLGKNLNFALNKMNLDDSPMLLTDPFVYEALGKGDFQMSSEAFFEFKTH 385 Query: 1247 RNWTYRKLIALYLRKKYRSNQLFWNY 1324 R WTYR LI YL+K YR NQ+FWNY Sbjct: 386 RKWTYRALIQKYLKKHYRRNQIFWNY 411 >ref|NP_566863.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|218546757|sp|Q9M2A1.2|PP263_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At3g42630 gi|332644221|gb|AEE77742.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 415 Score = 463 bits (1191), Expect = e-127 Identities = 234/415 (56%), Positives = 309/415 (74%) Frame = +2 Query: 80 LSLFKCLRPKFHALYSHQNKDSVGRSVAREIIWHQKRQASATGKDIYLNFASVLRLLGRK 259 LSL L+P+ L S DS S+A+++I K + K +++A +++ L ++ Sbjct: 2 LSLNLSLKPQHLKLLSCYT-DSSAPSIAKKLIKESKLSRDFSQKIQIVDYAPLVQTLSQR 60 Query: 260 RMPHIAQQLWLEMNSEGFLPCYVTLSALMLCYADNGLFSYAQTIWDEIINSSYVPNIKVV 439 R+P +A +++L+ S LP Y TL ALMLC+A+NG A+TIWDEIINS +VP++ VV Sbjct: 61 RLPDVAHEIFLQTKSVNLLPNYRTLCALMLCFAENGFVLRARTIWDEIINSCFVPDVFVV 120 Query: 440 SELIEAYGKMGRFDAVSRILREVTLRDFKLCPEVYSSAICCFGKGGQLELMETTMKEMVS 619 S+LI AY + G FD V++I ++V R KL P V S AI CFGK GQLELME ++EM S Sbjct: 121 SKLISAYEQFGCFDEVAKITKDVAARHSKLLPVVSSLAISCFGKNGQLELMEGVIEEMDS 180 Query: 620 NGFPVDSATGNAFLQYYSKFGSLTAMEAAYGRLKRSQILIEKEGIRAMASAYIKEGKFHK 799 G +++ T N ++YYS FGSL ME AYGR+K+ I+IE+E IRA+ AY+K+ KF++ Sbjct: 181 KGVLLEAETANVIVRYYSFFGSLDKMEKAYGRVKKFGIVIEEEEIRAVVLAYLKQRKFYR 240 Query: 800 LGEFLRGVGLGRKNVGNLLWNFLLLSYAANFKMKSLQREFSRMAEAGFSPDLSTFNIRAL 979 L EFL VGLGR+N+GN+LWN +LLSYAA+FKMKSLQREF M +AGFSPDL+TFNIRAL Sbjct: 241 LREFLSDVGLGRRNLGNMLWNSVLLSYAADFKMKSLQREFIGMLDAGFSPDLTTFNIRAL 300 Query: 980 AFSRMALFWDLHVSLEHMKHEKVVPDLVTYGCVVDAFLDRKIGRNLNFALNKMNTNDSPL 1159 AFSRMALFWDLH++LEHM+ +VPDLVT+GCVVDA++D+++ RNL F N+MN +DSPL Sbjct: 301 AFSRMALFWDLHLTLEHMRRLNIVPDLVTFGCVVDAYMDKRLARNLEFVYNRMNLDDSPL 360 Query: 1160 VSTDQLVFEVLGKGDFHSSSEAFLECNRQRNWTYRKLIALYLRKKYRSNQLFWNY 1324 V TD L FEVLGKGDFH SSEA LE + ++NWTYRKLI +YL+KK R +Q+FWNY Sbjct: 361 VLTDPLAFEVLGKGDFHLSSEAVLEFSPRKNWTYRKLIGVYLKKKLRRDQIFWNY 415