BLASTX nr result
ID: Rheum21_contig00010940
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Rheum21_contig00010940 (2324 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003631269.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope... 588 e-165 emb|CBI28530.3| unnamed protein product [Vitis vinifera] 587 e-165 ref|XP_006494986.1| PREDICTED: pentatricopeptide repeat-containi... 575 e-161 ref|XP_006440653.1| hypothetical protein CICLE_v10023621mg [Citr... 573 e-160 gb|EOY21933.1| Pentatricopeptide repeat superfamily protein [The... 554 e-155 ref|XP_004508741.1| PREDICTED: pentatricopeptide repeat-containi... 553 e-154 ref|XP_004138304.1| PREDICTED: pentatricopeptide repeat-containi... 552 e-154 gb|EMJ11418.1| hypothetical protein PRUPE_ppa015814mg [Prunus pe... 551 e-154 gb|EXB38552.1| hypothetical protein L484_008580 [Morus notabilis] 548 e-153 ref|XP_002514722.1| pentatricopeptide repeat-containing protein,... 546 e-152 ref|XP_003622167.1| Pentatricopeptide repeat protein [Medicago t... 542 e-151 ref|XP_006280363.1| hypothetical protein CARUB_v10026291mg [Caps... 536 e-149 ref|XP_002866430.1| hypothetical protein ARALYDRAFT_496296 [Arab... 536 e-149 gb|ESW27283.1| hypothetical protein PHAVU_003G188300g [Phaseolus... 530 e-147 gb|AAM65325.1| unknown [Arabidopsis thaliana] 528 e-147 ref|NP_200945.1| pentatricopeptide repeat-containing protein [Ar... 528 e-147 ref|XP_002318601.2| hypothetical protein POPTR_0012s07030g, part... 522 e-145 ref|XP_006394515.1| hypothetical protein EUTSA_v10004085mg [Eutr... 515 e-143 ref|XP_006849319.1| hypothetical protein AMTR_s00164p00020970 [A... 383 e-103 gb|EEC66969.1| hypothetical protein OsI_33629 [Oryza sativa Indi... 359 3e-96 >ref|XP_003631269.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At5g61370, mitochondrial-like [Vitis vinifera] Length = 505 Score = 588 bits (1515), Expect = e-165 Identities = 282/411 (68%), Positives = 343/411 (83%) Frame = +2 Query: 782 VQEICNLVSTPVGSLDDLESGLDKCGAPLTSSMVVQVVEHCKNNSPSRRLLRFFTWSCKR 961 +QE+CN+VS VGSLDDLE+ LD+ A TSS++ Q+++ CKN +P+RRLLRFF WS K+ Sbjct: 40 LQELCNVVSNGVGSLDDLEASLDRLDASFTSSLISQILDTCKNEAPTRRLLRFFLWSSKK 99 Query: 962 LNGGLVDKEFNFAIQAFAEMKDGRAVEILLSDISKEGRVLDAQTFCDVVDVLVKLGREDD 1141 N L D +FN+AIQ FAE KD +A++IL+SD+S EGR + AQTF V + LV LGREDD Sbjct: 100 FNCKLEDDDFNYAIQVFAEKKDLKAIDILVSDLSNEGREMKAQTFGIVAETLVSLGREDD 159 Query: 1142 ALGIFKNLDKFGCPHDKTTVTAIITALCTKGHARRAEGVLRHHGDKISGVESCVYKSLLH 1321 ALG+FKNLDKF C +D TVTAI+ ALC+KGHARRAEGV+RHH DKI GV+ C+Y+SL + Sbjct: 160 ALGLFKNLDKFKCSYDSVTVTAIVNALCSKGHARRAEGVVRHHKDKILGVKPCIYRSLFY 219 Query: 1322 GWSLKENVKEARRILQEMKSKKVMLDLYCFNTFLKCLCENNLKKNPSGLVPEALNVMMEM 1501 GWS ++NVKEARRIL+EMKS +M DL+C+NTFL+CLCE NLK NPSGLVPEALNVMMEM Sbjct: 220 GWSEQKNVKEARRILKEMKSVGIMPDLFCYNTFLRCLCERNLKSNPSGLVPEALNVMMEM 279 Query: 1502 RSYKIMPNSISYNILLSCLGRTRRVKETLNTIELMRKTRCNPDWVSYYLVARVLYLSGRF 1681 RS +I P SISYNILLSCLGRTRRVKE+ ++LM++ C+PDWVSYYLVARVLYL+GRF Sbjct: 280 RSNRITPTSISYNILLSCLGRTRRVKESCRILDLMKRLGCSPDWVSYYLVARVLYLTGRF 339 Query: 1682 GKGNQIVEEMIEAGVDPPPKLYHNLVGVLCGVERVNYALELFERMKRSSVGEYGPVYDLL 1861 GKGNQIV+EMIE G+ P K Y++L+GVLCGVERVNYALE+FERMKRSS+G YGPVYD+L Sbjct: 340 GKGNQIVDEMIEEGLVPDRKFYYDLIGVLCGVERVNYALEMFERMKRSSLGGYGPVYDVL 399 Query: 1862 IPKLCRGGDFVKGKELWDEAERMGVALCCSRVALDPSITEVYVCKRQDDEE 2014 IPKLCR GDF KG+ELWDEA R+GV L CS LDPSIT+V+ R+D+E+ Sbjct: 400 IPKLCRSGDFGKGRELWDEATRVGVLLHCSSEVLDPSITKVFKPARKDEEK 450 >emb|CBI28530.3| unnamed protein product [Vitis vinifera] Length = 452 Score = 587 bits (1514), Expect = e-165 Identities = 282/410 (68%), Positives = 342/410 (83%) Frame = +2 Query: 782 VQEICNLVSTPVGSLDDLESGLDKCGAPLTSSMVVQVVEHCKNNSPSRRLLRFFTWSCKR 961 +QE+CN+VS VGSLDDLE+ LD+ A TSS++ Q+++ CKN +P+RRLLRFF WS K+ Sbjct: 9 LQELCNVVSNGVGSLDDLEASLDRLDASFTSSLISQILDTCKNEAPTRRLLRFFLWSSKK 68 Query: 962 LNGGLVDKEFNFAIQAFAEMKDGRAVEILLSDISKEGRVLDAQTFCDVVDVLVKLGREDD 1141 N L D +FN+AIQ FAE KD +A++IL+SD+S EGR + AQTF V + LV LGREDD Sbjct: 69 FNCKLEDDDFNYAIQVFAEKKDLKAIDILVSDLSNEGREMKAQTFGIVAETLVSLGREDD 128 Query: 1142 ALGIFKNLDKFGCPHDKTTVTAIITALCTKGHARRAEGVLRHHGDKISGVESCVYKSLLH 1321 ALG+FKNLDKF C +D TVTAI+ ALC+KGHARRAEGV+RHH DKI GV+ C+Y+SL + Sbjct: 129 ALGLFKNLDKFKCSYDSVTVTAIVNALCSKGHARRAEGVVRHHKDKILGVKPCIYRSLFY 188 Query: 1322 GWSLKENVKEARRILQEMKSKKVMLDLYCFNTFLKCLCENNLKKNPSGLVPEALNVMMEM 1501 GWS ++NVKEARRIL+EMKS +M DL+C+NTFL+CLCE NLK NPSGLVPEALNVMMEM Sbjct: 189 GWSEQKNVKEARRILKEMKSVGIMPDLFCYNTFLRCLCERNLKSNPSGLVPEALNVMMEM 248 Query: 1502 RSYKIMPNSISYNILLSCLGRTRRVKETLNTIELMRKTRCNPDWVSYYLVARVLYLSGRF 1681 RS +I P SISYNILLSCLGRTRRVKE+ ++LM++ C+PDWVSYYLVARVLYL+GRF Sbjct: 249 RSNRITPTSISYNILLSCLGRTRRVKESCRILDLMKRLGCSPDWVSYYLVARVLYLTGRF 308 Query: 1682 GKGNQIVEEMIEAGVDPPPKLYHNLVGVLCGVERVNYALELFERMKRSSVGEYGPVYDLL 1861 GKGNQIV+EMIE G+ P K Y++L+GVLCGVERVNYALE+FERMKRSS+G YGPVYD+L Sbjct: 309 GKGNQIVDEMIEEGLVPDRKFYYDLIGVLCGVERVNYALEMFERMKRSSLGGYGPVYDVL 368 Query: 1862 IPKLCRGGDFVKGKELWDEAERMGVALCCSRVALDPSITEVYVCKRQDDE 2011 IPKLCR GDF KG+ELWDEA R+GV L CS LDPSIT+V+ R+D+E Sbjct: 369 IPKLCRSGDFGKGRELWDEATRVGVLLHCSSEVLDPSITKVFKPARKDEE 418 >ref|XP_006494986.1| PREDICTED: pentatricopeptide repeat-containing protein At5g61370, mitochondrial-like [Citrus sinensis] Length = 495 Score = 575 bits (1481), Expect = e-161 Identities = 273/415 (65%), Positives = 343/415 (82%) Frame = +2 Query: 779 QVQEICNLVSTPVGSLDDLESGLDKCGAPLTSSMVVQVVEHCKNNSPSRRLLRFFTWSCK 958 +++E+C +VS+ +G LDDLE L++ LTSS+V QV++ CK +P+RRLLRFF WSCK Sbjct: 46 ELKELCKVVSSTIGGLDDLELSLNQFTGSLTSSLVTQVIDSCKQEAPTRRLLRFFLWSCK 105 Query: 959 RLNGGLVDKEFNFAIQAFAEMKDGRAVEILLSDISKEGRVLDAQTFCDVVDVLVKLGRED 1138 ++ L DK++N AI+ FAE +D A+ IL+SD+ KEGRV+++Q+F +V+ LVKLGRED Sbjct: 106 NMSASLEDKDYNHAIRVFAEKRDHTAMNILVSDLRKEGRVMESQSFGVLVETLVKLGRED 165 Query: 1139 DALGIFKNLDKFGCPHDKTTVTAIITALCTKGHARRAEGVLRHHGDKISGVESCVYKSLL 1318 +ALGIFKNL+KF C D TV+AI++ALC KGHARRAEGV+ HH DKISGVE C+Y+SL+ Sbjct: 166 EALGIFKNLEKFKCVQDSVTVSAIVSALCAKGHARRAEGVVYHHKDKISGVELCIYRSLI 225 Query: 1319 HGWSLKENVKEARRILQEMKSKKVMLDLYCFNTFLKCLCENNLKKNPSGLVPEALNVMME 1498 +GWS++ENVK AR+I++EMKS +M DL+C+NTFL+ LCE NLK+NPSGLVPEALNVMME Sbjct: 226 YGWSMQENVKAARKIIKEMKSAGIMPDLFCYNTFLRGLCERNLKRNPSGLVPEALNVMME 285 Query: 1499 MRSYKIMPNSISYNILLSCLGRTRRVKETLNTIELMRKTRCNPDWVSYYLVARVLYLSGR 1678 MRSY+I P SISYNILLSCLGRTRRVKE+ +E M+K+ C PDWVSYYLVARVLYLSGR Sbjct: 286 MRSYRIAPTSISYNILLSCLGRTRRVKESCQVLEQMKKSGCAPDWVSYYLVARVLYLSGR 345 Query: 1679 FGKGNQIVEEMIEAGVDPPPKLYHNLVGVLCGVERVNYALELFERMKRSSVGEYGPVYDL 1858 FGKGN+IV+EMIE G+ P K Y++L+G+LCGVERVN+ALELFERMKRSS+G YGPVYD+ Sbjct: 346 FGKGNKIVDEMIEEGLIPDRKFYYDLIGILCGVERVNFALELFERMKRSSLGGYGPVYDV 405 Query: 1859 LIPKLCRGGDFVKGKELWDEAERMGVALCCSRVALDPSITEVYVCKRQDDEELRG 2023 LIPK+CRGGDFVKG+ELWDEA MG+ L CS LDPSI EV+ +R+ E G Sbjct: 406 LIPKVCRGGDFVKGRELWDEAMVMGLTLSCSSNVLDPSIIEVFQPRRKPTESCLG 460 >ref|XP_006440653.1| hypothetical protein CICLE_v10023621mg [Citrus clementina] gi|557542915|gb|ESR53893.1| hypothetical protein CICLE_v10023621mg [Citrus clementina] Length = 488 Score = 573 bits (1477), Expect = e-160 Identities = 274/415 (66%), Positives = 343/415 (82%) Frame = +2 Query: 779 QVQEICNLVSTPVGSLDDLESGLDKCGAPLTSSMVVQVVEHCKNNSPSRRLLRFFTWSCK 958 +++E+C +VS+ +G LDDLE L++ L+SS+V QV++ CK+ +P+RRLLRFF WSCK Sbjct: 41 ELKELCKVVSSTIGGLDDLELSLNQFTGSLSSSLVTQVIDSCKHEAPTRRLLRFFLWSCK 100 Query: 959 RLNGGLVDKEFNFAIQAFAEMKDGRAVEILLSDISKEGRVLDAQTFCDVVDVLVKLGRED 1138 L+ L DK++N AI+ FAE KD A+ IL+SD+ KEGRV++ Q+F +V+ LVKLGRED Sbjct: 101 NLSASLEDKDYNHAIRVFAEKKDHMAMNILVSDLRKEGRVMETQSFGVLVETLVKLGRED 160 Query: 1139 DALGIFKNLDKFGCPHDKTTVTAIITALCTKGHARRAEGVLRHHGDKISGVESCVYKSLL 1318 +ALGIFKNL+KF C D TV+AI++ALC KGHARRAEGV+ HH DKISGVE C+Y+SL+ Sbjct: 161 EALGIFKNLEKFKCVQDSVTVSAIVSALCAKGHARRAEGVVYHHKDKISGVELCIYRSLI 220 Query: 1319 HGWSLKENVKEARRILQEMKSKKVMLDLYCFNTFLKCLCENNLKKNPSGLVPEALNVMME 1498 +GWS++ENVK AR+I++EMKS M DL+C+NTFL+ LCE NLK+NPSGLVPEALNVMME Sbjct: 221 YGWSMQENVKAARKIIKEMKSAGFMPDLFCYNTFLRGLCERNLKRNPSGLVPEALNVMME 280 Query: 1499 MRSYKIMPNSISYNILLSCLGRTRRVKETLNTIELMRKTRCNPDWVSYYLVARVLYLSGR 1678 MRSY+I P SISYNILLSCLGRTRRVKE+ +E M+K+ C PDWVSYYLVARVLYLSGR Sbjct: 281 MRSYRIAPTSISYNILLSCLGRTRRVKESCRVLEQMKKSGCAPDWVSYYLVARVLYLSGR 340 Query: 1679 FGKGNQIVEEMIEAGVDPPPKLYHNLVGVLCGVERVNYALELFERMKRSSVGEYGPVYDL 1858 FGKGN+IV+EMIE G+ P K Y++L+G+LCGVERVN+ALELFERMKRSS+G YGPVYD+ Sbjct: 341 FGKGNKIVDEMIEEGLIPDRKFYYDLIGILCGVERVNFALELFERMKRSSLGGYGPVYDV 400 Query: 1859 LIPKLCRGGDFVKGKELWDEAERMGVALCCSRVALDPSITEVYVCKRQDDEELRG 2023 LIPK+C+GGDFVKG+ELWDEA MG+ L CS LDPSITEV+ +R+ E G Sbjct: 401 LIPKVCQGGDFVKGRELWDEAMVMGLTLSCSSNVLDPSITEVFHPRRKPTEGCLG 455 >gb|EOY21933.1| Pentatricopeptide repeat superfamily protein [Theobroma cacao] Length = 487 Score = 554 bits (1427), Expect = e-155 Identities = 263/411 (63%), Positives = 330/411 (80%) Frame = +2 Query: 779 QVQEICNLVSTPVGSLDDLESGLDKCGAPLTSSMVVQVVEHCKNNSPSRRLLRFFTWSCK 958 + +E+C +VS+ +G LDDLES L++ L+ +V QV+ C+N +P+RRLLRFF WS K Sbjct: 39 EFEELCKVVSSSMGGLDDLESSLNRFKLSLSPLLVTQVINSCENEAPTRRLLRFFLWSVK 98 Query: 959 RLNGGLVDKEFNFAIQAFAEMKDGRAVEILLSDISKEGRVLDAQTFCDVVDVLVKLGRED 1138 L+ L DK+ N ++ FA+ KD A+ IL+SDI GR +++QTF V ++LVKLGRED Sbjct: 99 NLSSSLEDKDLNNVVRVFAKKKDHTAMGILVSDIRNRGRTMESQTFSVVAEMLVKLGRED 158 Query: 1139 DALGIFKNLDKFGCPHDKTTVTAIITALCTKGHARRAEGVLRHHGDKISGVESCVYKSLL 1318 +ALGIFKNL+KF CP D ++TAI+ ALC KGHAR+AEGV+ HH D I+GVE C+Y+ LL Sbjct: 159 EALGIFKNLEKFKCPRDSFSLTAIVNALCAKGHARKAEGVVYHHKDTIAGVEPCIYRCLL 218 Query: 1319 HGWSLKENVKEARRILQEMKSKKVMLDLYCFNTFLKCLCENNLKKNPSGLVPEALNVMME 1498 +GWS++ENVKEARR+++EMKS LDLYC+NTFL+CLC N K+NPSGLVPEALNVMME Sbjct: 219 YGWSVQENVKEARRVIKEMKSAGFELDLYCYNTFLRCLCGKNAKRNPSGLVPEALNVMME 278 Query: 1499 MRSYKIMPNSISYNILLSCLGRTRRVKETLNTIELMRKTRCNPDWVSYYLVARVLYLSGR 1678 MRS +I P S+SYNILLSCLGRTRRVKE+ +ELM+K C PDW+SYYLVARVLYL+GR Sbjct: 279 MRSQRIAPTSVSYNILLSCLGRTRRVKESCQILELMKKAGCAPDWISYYLVARVLYLTGR 338 Query: 1679 FGKGNQIVEEMIEAGVDPPPKLYHNLVGVLCGVERVNYALELFERMKRSSVGEYGPVYDL 1858 FGKGN+IV+EMIE G+ P K Y++L+GVLCGVERVN+ALELFERMKRSS+G YGPVYD+ Sbjct: 339 FGKGNKIVDEMIEQGLTPDRKFYYDLIGVLCGVERVNFALELFERMKRSSLGGYGPVYDV 398 Query: 1859 LIPKLCRGGDFVKGKELWDEAERMGVALCCSRVALDPSITEVYVCKRQDDE 2011 LIPKLCRGGDF KG+ELWDEA GV+L CS LDPSITEV+ R+ ++ Sbjct: 399 LIPKLCRGGDFEKGRELWDEAVATGVSLSCSSDVLDPSITEVFKPTRKAEK 449 >ref|XP_004508741.1| PREDICTED: pentatricopeptide repeat-containing protein At3g22690-like [Cicer arietinum] Length = 1253 Score = 553 bits (1424), Expect = e-154 Identities = 277/456 (60%), Positives = 349/456 (76%) Frame = +2 Query: 638 RSA*VRN*MISQNSLARLSALLKPTNAWKFKLLYFCSCTXXXXXXXXQVQEICNLVSTPV 817 RSA + M+ ++L R K T+ ++ + F S T Q+QE+CN+V++ V Sbjct: 756 RSARQKMHMLLNSALKRFGLQNKSTHKFQLLSVSFYS-TLHSISAPPQLQELCNIVTSTV 814 Query: 818 GSLDDLESGLDKCGAPLTSSMVVQVVEHCKNNSPSRRLLRFFTWSCKRLNGGLVDKEFNF 997 G LDDLE L+K + SS+V Q ++ K+ + +RRLLRFF WS K L+ L D ++N+ Sbjct: 815 GGLDDLELSLNKFKGSINSSLVAQAIDSIKHEAHTRRLLRFFLWSNKHLSRDLEDNDYNY 874 Query: 998 AIQAFAEMKDGRAVEILLSDISKEGRVLDAQTFCDVVDVLVKLGREDDALGIFKNLDKFG 1177 A++ FAE KD A++ILL D+ KEGRV+DAQTF V + VKLG+ED+ALGIFKNLDK+ Sbjct: 875 ALRVFAEKKDYTAMDILLGDLKKEGRVMDAQTFGLVAETFVKLGKEDEALGIFKNLDKYK 934 Query: 1178 CPHDKTTVTAIITALCTKGHARRAEGVLRHHGDKISGVESCVYKSLLHGWSLKENVKEAR 1357 C D+ TVTAII ALC+KGHA+RAEGV+ HH DK+ GV C+Y+SLL+GWS++ NVKEAR Sbjct: 935 CFIDEFTVTAIINALCSKGHAKRAEGVVWHHKDKVKGVLPCIYRSLLYGWSVQRNVKEAR 994 Query: 1358 RILQEMKSKKVMLDLYCFNTFLKCLCENNLKKNPSGLVPEALNVMMEMRSYKIMPNSISY 1537 RI+QEMKS V DL C+NTFL+CLCE NL+ NPSGLVPEALNVMMEMR YK++P SISY Sbjct: 995 RIIQEMKSNGVNPDLVCYNTFLRCLCERNLRHNPSGLVPEALNVMMEMRFYKVLPTSISY 1054 Query: 1538 NILLSCLGRTRRVKETLNTIELMRKTRCNPDWVSYYLVARVLYLSGRFGKGNQIVEEMIE 1717 NILLSCLG+TRRVKE+ +E M K+ PDWVSYYLVARVL+LSGRFGKG +IV++MIE Sbjct: 1055 NILLSCLGKTRRVKESCQILEAMNKSGVAPDWVSYYLVARVLFLSGRFGKGKEIVDQMIE 1114 Query: 1718 AGVDPPPKLYHNLVGVLCGVERVNYALELFERMKRSSVGEYGPVYDLLIPKLCRGGDFVK 1897 G+ P K Y++L+G+LCGVERVN+ALELFE+MK SS+G YGPVYD+LIPKLCRGG F K Sbjct: 1115 KGLVPNHKFYYSLIGILCGVERVNHALELFEKMKGSSLGGYGPVYDVLIPKLCRGGAFEK 1174 Query: 1898 GKELWDEAERMGVALCCSRVALDPSITEVYVCKRQD 2005 G+ELWDEA+ MG+ L CSR LDPSITEVY KR + Sbjct: 1175 GRELWDEAKCMGITLQCSRDVLDPSITEVYKPKRPE 1210 >ref|XP_004138304.1| PREDICTED: pentatricopeptide repeat-containing protein At5g61370, mitochondrial-like [Cucumis sativus] gi|449477571|ref|XP_004155060.1| PREDICTED: pentatricopeptide repeat-containing protein At5g61370, mitochondrial-like [Cucumis sativus] Length = 487 Score = 552 bits (1422), Expect = e-154 Identities = 263/412 (63%), Positives = 334/412 (81%) Frame = +2 Query: 782 VQEICNLVSTPVGSLDDLESGLDKCGAPLTSSMVVQVVEHCKNNSPSRRLLRFFTWSCKR 961 V ++C ++S +G LD+LES L+KC LTSS+V QV++ KN +P+RRLLRFF WS K+ Sbjct: 50 VSKLCEVISCTIGGLDELESSLNKCTISLTSSLVTQVIDSSKNEAPTRRLLRFFLWSLKK 109 Query: 962 LNGGLVDKEFNFAIQAFAEMKDGRAVEILLSDISKEGRVLDAQTFCDVVDVLVKLGREDD 1141 LN L D++FN AI+ FA+ KD AV ILLS++ K R +D QTF V + VK+ RED+ Sbjct: 110 LNHTLEDEDFNNAIRFFAQKKDYTAVNILLSNLKKADRAMDGQTFGFVAEAFVKMDREDE 169 Query: 1142 ALGIFKNLDKFGCPHDKTTVTAIITALCTKGHARRAEGVLRHHGDKISGVESCVYKSLLH 1321 ALG+FKNL+K+ CPHD+ TV AIITALC+KGHA+RAEGV+ HH DKIS SC+Y+SLL+ Sbjct: 170 ALGLFKNLEKYKCPHDQFTVVAIITALCSKGHAKRAEGVVLHHKDKISSTMSCIYRSLLY 229 Query: 1322 GWSLKENVKEARRILQEMKSKKVMLDLYCFNTFLKCLCENNLKKNPSGLVPEALNVMMEM 1501 GWS+K+N KEARRIL+EMKS M DL+C+NTFLKCLCE N++KNPSGLVPE+LNVMMEM Sbjct: 230 GWSIKKNTKEARRILKEMKSDGTMPDLFCYNTFLKCLCEKNVEKNPSGLVPESLNVMMEM 289 Query: 1502 RSYKIMPNSISYNILLSCLGRTRRVKETLNTIELMRKTRCNPDWVSYYLVARVLYLSGRF 1681 RSYKI PNSISYNILLSCL +TRRVKE+ +E+M++T C PD VSYYL+ARVL+L+GRF Sbjct: 290 RSYKISPNSISYNILLSCLCKTRRVKESCKILEMMKRTGCQPDCVSYYLMARVLFLTGRF 349 Query: 1682 GKGNQIVEEMIEAGVDPPPKLYHNLVGVLCGVERVNYALELFERMKRSSVGEYGPVYDLL 1861 GKG +IV+EMIE G+ P K Y++L+G+LCGVER NYALELFE+MKRSS+G YGPVYD+L Sbjct: 350 GKGREIVDEMIEEGLTPDRKFYYDLIGILCGVERTNYALELFEKMKRSSLGGYGPVYDVL 409 Query: 1862 IPKLCRGGDFVKGKELWDEAERMGVALCCSRVALDPSITEVYVCKRQDDEEL 2017 IPKLCRGG+F G++LW+EA MGV+L CS LDPSIT+V+ R+ + ++ Sbjct: 410 IPKLCRGGEFEMGRQLWEEAMAMGVSLNCSSEILDPSITKVFKPTRKIENKI 461 >gb|EMJ11418.1| hypothetical protein PRUPE_ppa015814mg [Prunus persica] Length = 524 Score = 551 bits (1421), Expect = e-154 Identities = 264/411 (64%), Positives = 331/411 (80%) Frame = +2 Query: 779 QVQEICNLVSTPVGSLDDLESGLDKCGAPLTSSMVVQVVEHCKNNSPSRRLLRFFTWSCK 958 ++QE+C +VS +G LDDLE L+K LTSS+V QV++ CK+ +P+RRLLRFF+W K Sbjct: 7 ELQELCTIVSRAIGGLDDLELSLNKFTGSLTSSLVTQVIDSCKSEAPTRRLLRFFSWCHK 66 Query: 959 RLNGGLVDKEFNFAIQAFAEMKDGRAVEILLSDISKEGRVLDAQTFCDVVDVLVKLGRED 1138 L+ GL DK++N+ I+ FAE KD A+ ILLSD+ K GR ++AQTF V LVKLGRED Sbjct: 67 NLDYGLKDKDYNYGIRVFAEKKDHTAMHILLSDLVKTGRAMEAQTFGLVAQALVKLGRED 126 Query: 1139 DALGIFKNLDKFGCPHDKTTVTAIITALCTKGHARRAEGVLRHHGDKISGVESCVYKSLL 1318 +ALG+FKNL + CP D TVT+I+ ALC++GHA+RAEGV+ HH DKI+G+E C+YKSLL Sbjct: 127 EALGLFKNLSTYKCPQDGHTVTSIVNALCSRGHAKRAEGVVWHHRDKIAGIEPCIYKSLL 186 Query: 1319 HGWSLKENVKEARRILQEMKSKKVMLDLYCFNTFLKCLCENNLKKNPSGLVPEALNVMME 1498 +GWS++ENVKE RRI++EMKS +M DL+C+NTFL+ LC NLK NPSGLVPEALNVM+E Sbjct: 187 YGWSVQENVKEERRIIKEMKSAGIMPDLFCYNTFLRSLCMKNLKCNPSGLVPEALNVMIE 246 Query: 1499 MRSYKIMPNSISYNILLSCLGRTRRVKETLNTIELMRKTRCNPDWVSYYLVARVLYLSGR 1678 M++Y+I PNSISYNILLSCLGRTRRVKE+ N +E M+KT C+PDWVSYYLVARVLYLSGR Sbjct: 247 MKTYRIFPNSISYNILLSCLGRTRRVKESCNILETMKKTGCSPDWVSYYLVARVLYLSGR 306 Query: 1679 FGKGNQIVEEMIEAGVDPPPKLYHNLVGVLCGVERVNYALELFERMKRSSVGEYGPVYDL 1858 FGKGN++V+EM+ G+ P K Y++L+G+L G ER YALELFERMK SS+G YGPVYD+ Sbjct: 307 FGKGNKMVDEMLAEGLQPNCKFYYDLIGILVGNERPYYALELFERMKASSLGGYGPVYDV 366 Query: 1859 LIPKLCRGGDFVKGKELWDEAERMGVALCCSRVALDPSITEVYVCKRQDDE 2011 LIPK CRGGDF KG+ELWDEA MGV L CS LDPSITEV+ R +++ Sbjct: 367 LIPKFCRGGDFEKGRELWDEAMAMGVTLRCSSDLLDPSITEVFKPTRNEEK 417 >gb|EXB38552.1| hypothetical protein L484_008580 [Morus notabilis] Length = 518 Score = 548 bits (1411), Expect = e-153 Identities = 270/441 (61%), Positives = 337/441 (76%), Gaps = 1/441 (0%) Frame = +2 Query: 698 LLKPTNAWKFKLLYFCSCTXXXXXXXX-QVQEICNLVSTPVGSLDDLESGLDKCGAPLTS 874 LL+ A KF+ L SC ++QE+C +VS +G LDDLES L LTS Sbjct: 13 LLRSFTAQKFRQL---SCLPNSNLSSASRLQELCTIVSRTIGGLDDLESSLSDFRGSLTS 69 Query: 875 SMVVQVVEHCKNNSPSRRLLRFFTWSCKRLNGGLVDKEFNFAIQAFAEMKDGRAVEILLS 1054 S+V QV++ CK +P+RRLLRFF WS K L L DK++N AI+ FA KD A+EIL+S Sbjct: 70 SLVTQVIDSCKTEAPTRRLLRFFLWSHKNLKCDLEDKDYNHAIRVFAGKKDHTALEILVS 129 Query: 1055 DISKEGRVLDAQTFCDVVDVLVKLGREDDALGIFKNLDKFGCPHDKTTVTAIITALCTKG 1234 D+ K GR L++QT+ V + LVKLGRED+ALGIFKN DK+ CP + TVTA++ ALC +G Sbjct: 130 DLKKGGRALESQTYAIVAETLVKLGREDEALGIFKNSDKYKCPQNSFTVTAVVNALCAQG 189 Query: 1235 HARRAEGVLRHHGDKISGVESCVYKSLLHGWSLKENVKEARRILQEMKSKKVMLDLYCFN 1414 HA+RAEGV+ HH D+ISG+E C+Y+SLL+GWS +ENVKEARRI++EMKS + DL+C+N Sbjct: 190 HAKRAEGVVGHHKDRISGMERCIYRSLLYGWSEQENVKEARRIIKEMKSAGINPDLFCYN 249 Query: 1415 TFLKCLCENNLKKNPSGLVPEALNVMMEMRSYKIMPNSISYNILLSCLGRTRRVKETLNT 1594 TFL+CLCE NLK+NPSGLVPEALNVMMEMRSY I PNSISYNILLSCLGR RRVKE Sbjct: 250 TFLRCLCERNLKRNPSGLVPEALNVMMEMRSYMITPNSISYNILLSCLGRARRVKEACQI 309 Query: 1595 IELMRKTRCNPDWVSYYLVARVLYLSGRFGKGNQIVEEMIEAGVDPPPKLYHNLVGVLCG 1774 +E M++ C+PDW+SYYLV RVLYL+ RFGKGN++V+EMI G+ P K Y++L+GVLCG Sbjct: 310 LERMKQAGCSPDWMSYYLVIRVLYLTMRFGKGNKLVDEMIGEGLVPNCKFYYDLIGVLCG 369 Query: 1775 VERVNYALELFERMKRSSVGEYGPVYDLLIPKLCRGGDFVKGKELWDEAERMGVALCCSR 1954 VER YALELFE MK+ S+G YGPVYD+LIPKLCRGGDF KG+ELW EA MGV CCS Sbjct: 370 VERPYYALELFEHMKKRSLGGYGPVYDVLIPKLCRGGDFEKGRELWIEAMNMGVDFCCSS 429 Query: 1955 VALDPSITEVYVCKRQDDEEL 2017 LDPSIT+V+ R+++E++ Sbjct: 430 DVLDPSITKVFKPTRKEEEKI 450 >ref|XP_002514722.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223546326|gb|EEF47828.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 479 Score = 546 bits (1406), Expect = e-152 Identities = 270/442 (61%), Positives = 344/442 (77%), Gaps = 3/442 (0%) Frame = +2 Query: 713 NAWKFKLLYFCSCTXXXXXXXXQVQEICNLVSTPVGSLDDLESGLDKCGAPLTSSMVVQV 892 NA K K T ++QEIC VS+ +G LDDLES L+ LTS +V QV Sbjct: 2 NANKSKHFVCLYSTISHNRVPLELQEICKAVSSSIGGLDDLESSLNGFRGNLTSQIVTQV 61 Query: 893 VEHCKNNSPSRRLLRFFTWSCKRLNGGLVDKEFNFAIQAFAEMKDGRAVEILLSDISKEG 1072 ++ CK+ +P+RRLLRFF WS KRL+ + D++FN AI+ AE KD A++IL+SD+ KEG Sbjct: 62 IDCCKHEAPTRRLLRFFLWSYKRLDFSMKDEDFNHAIRVLAEKKDHTAMQILISDLRKEG 121 Query: 1073 RVLDAQTFCDVVDVLVKLGREDDALGIFKNLDKFGCPHDKTTVTAIITALCTKGHARRAE 1252 RV++ QTF V + LVKLGRED+ALGIFKNLDKF CP D TVTAIITALC +GHA++A Sbjct: 122 RVMEPQTFGLVAEALVKLGREDEALGIFKNLDKFKCPQDCETVTAIITALCAEGHAKKAY 181 Query: 1253 GVLRHHGDKISGV-ESCVYKSLLHGWSLKENVKEARRILQEMKSKKVMLDLYCFNTFLKC 1429 GV+ HH DK+S V C+Y+SL++GWS+++NVK AR ++QEMK + DL+C+NTFL+C Sbjct: 182 GVVLHHKDKLSEVIRPCIYRSLIYGWSMQKNVKRAREVIQEMKRNGIKPDLFCYNTFLRC 241 Query: 1430 LCENNLKKNPSGLVPEALNVMMEMRSYKIMPNSISYNILLSCLGRTRRVKETLNTIELMR 1609 LCE N+++NPSGLVPE+LNVMMEMRSY+I PNSISYNILLSCLGR RRV+E+ +ELM+ Sbjct: 242 LCERNVERNPSGLVPESLNVMMEMRSYRIEPNSISYNILLSCLGRVRRVQESCKILELMK 301 Query: 1610 KTRCNPDWVSYYLVARVLYLSGRFGKGNQIVEEMIEAGVDPPPKLYHNLVGVLCGVERVN 1789 K+ C PDWVSYYLVA+VLYL+GRFGKGN+IV+EMIE + P K Y++L+G+LCGVERVN Sbjct: 302 KSSCAPDWVSYYLVAKVLYLTGRFGKGNKIVDEMIERRLVPDRKFYYDLIGILCGVERVN 361 Query: 1790 YALELFERMKRSSVGEYGPVYDLLIPKLCRGGDFVKGKELWDEAERMGVALCCSRVALDP 1969 +AL+LF++MKRSS G YGPVYDLLIPKLC GG+F KGKELWDEA MGV + CS LDP Sbjct: 362 FALKLFDQMKRSSSGGYGPVYDLLIPKLCIGGNFEKGKELWDEAMAMGVTVHCSSEVLDP 421 Query: 1970 SITEVY--VCKRQDDEELRGRD 2029 SIT+V+ K +++EE+R +D Sbjct: 422 SITKVFEPTRKVEEEEEVRLQD 443 >ref|XP_003622167.1| Pentatricopeptide repeat protein [Medicago truncatula] gi|355497182|gb|AES78385.1| Pentatricopeptide repeat protein [Medicago truncatula] Length = 563 Score = 542 bits (1397), Expect = e-151 Identities = 269/436 (61%), Positives = 337/436 (77%), Gaps = 1/436 (0%) Frame = +2 Query: 701 LKPTNAWKFKLLYFCS-CTXXXXXXXXQVQEICNLVSTPVGSLDDLESGLDKCGAPLTSS 877 L+ T+ KF+LL T +Q++C++V++ VG LDDLES L+K LTS Sbjct: 86 LQNTSTHKFQLLSVSLFSTLHPISTPPLLQDLCDIVTSTVGGLDDLESCLNKFKGSLTSP 145 Query: 878 MVVQVVEHCKNNSPSRRLLRFFTWSCKRLNGGLVDKEFNFAIQAFAEMKDGRAVEILLSD 1057 +V QV++ K+ + +RRLLRFF WS K L+ L DK++N+A++ F E KD A++ILL D Sbjct: 146 LVAQVIDSVKHEAHTRRLLRFFLWSNKNLSNDLEDKDYNYALRVFIEKKDYTAMDILLGD 205 Query: 1058 ISKEGRVLDAQTFCDVVDVLVKLGREDDALGIFKNLDKFGCPHDKTTVTAIITALCTKGH 1237 K+GRV++AQTF V + VKLG+ED+ALGIFKNLDK+ C D+ TVTAII ALC+KGH Sbjct: 206 FKKQGRVMEAQTFGVVAETYVKLGKEDEALGIFKNLDKYKCLIDEFTVTAIINALCSKGH 265 Query: 1238 ARRAEGVLRHHGDKISGVESCVYKSLLHGWSLKENVKEARRILQEMKSKKVMLDLYCFNT 1417 A+RAEGV HH DKI G CVY+SLL+GWSL+ NVKE+RRI+QEMK+ V DL C+NT Sbjct: 266 AKRAEGVAWHHKDKIKGALPCVYRSLLYGWSLERNVKESRRIIQEMKTNGVTPDLVCYNT 325 Query: 1418 FLKCLCENNLKKNPSGLVPEALNVMMEMRSYKIMPNSISYNILLSCLGRTRRVKETLNTI 1597 FL+CLCE NL+ NPSGLV EALNVMMEMRSYK+ P SISYNILLSCLG+TRRVKE+ + Sbjct: 326 FLRCLCERNLRNNPSGLVLEALNVMMEMRSYKVFPTSISYNILLSCLGKTRRVKESCQIL 385 Query: 1598 ELMRKTRCNPDWVSYYLVARVLYLSGRFGKGNQIVEEMIEAGVDPPPKLYHNLVGVLCGV 1777 E M K+ PDWVSYYLV+RVL+LSGRFGKG +IV++MIE G+ P K Y++L+G+LCGV Sbjct: 386 EAMNKSGVAPDWVSYYLVSRVLFLSGRFGKGKEIVDQMIEKGLVPNHKFYYSLIGILCGV 445 Query: 1778 ERVNYALELFERMKRSSVGEYGPVYDLLIPKLCRGGDFVKGKELWDEAERMGVALCCSRV 1957 ERVN+AL+LFE+MK SSVG YGPVYD+LIPKLCRGGDF KG+ELWDE MG+ L CS+ Sbjct: 446 ERVNHALDLFEKMKGSSVGGYGPVYDVLIPKLCRGGDFEKGRELWDEGTYMGITLQCSKD 505 Query: 1958 ALDPSITEVYVCKRQD 2005 LDPSITEVY+ KR + Sbjct: 506 VLDPSITEVYIPKRPE 521 >ref|XP_006280363.1| hypothetical protein CARUB_v10026291mg [Capsella rubella] gi|482549067|gb|EOA13261.1| hypothetical protein CARUB_v10026291mg [Capsella rubella] Length = 490 Score = 536 bits (1381), Expect = e-149 Identities = 259/428 (60%), Positives = 323/428 (75%), Gaps = 2/428 (0%) Frame = +2 Query: 737 YFCS--CTXXXXXXXXQVQEICNLVSTPVGSLDDLESGLDKCGAPLTSSMVVQVVEHCKN 910 YFCS ++QE LVS+P+G LDDLE L++ +S +V QV+E CKN Sbjct: 23 YFCSHHLVDRLDHSSSELQEFIRLVSSPIGGLDDLEENLNRVSVSPSSKLVTQVIESCKN 82 Query: 911 NSPSRRLLRFFTWSCKRLNGGLVDKEFNFAIQAFAEMKDGRAVEILLSDISKEGRVLDAQ 1090 + RRLLRFF+WSCK L L DKEFN ++ AE KD A++ILLSD+ KE R +D Q Sbjct: 83 ETSPRRLLRFFSWSCKNLGSSLHDKEFNHVLRVLAEKKDNTAIQILLSDLRKENRAMDKQ 142 Query: 1091 TFCDVVDVLVKLGREDDALGIFKNLDKFGCPHDKTTVTAIITALCTKGHARRAEGVLRHH 1270 TF V + LVK+G+EDDA+GIFK LDKF CP D TVTAII+ALC++GH +RA GV+ HH Sbjct: 143 TFSIVAETLVKIGKEDDAIGIFKILDKFSCPQDSFTVTAIISALCSRGHVKRALGVMHHH 202 Query: 1271 GDKISGVESCVYKSLLHGWSLKENVKEARRILQEMKSKKVMLDLYCFNTFLKCLCENNLK 1450 D ISG E VY+SLL GWS++ NVKEARR++Q+MKS + DL+CFN+ L CLCE N+ Sbjct: 203 KDAISGNELSVYRSLLFGWSVQRNVKEARRVIQDMKSAGITPDLFCFNSLLTCLCERNVN 262 Query: 1451 KNPSGLVPEALNVMMEMRSYKIMPNSISYNILLSCLGRTRRVKETLNTIELMRKTRCNPD 1630 +NPSGLVPEALN+M+EM+SYKI P SISYN LLSCLGRTRRVKE+ +E M+++ C+PD Sbjct: 263 RNPSGLVPEALNIMLEMKSYKIQPTSISYNTLLSCLGRTRRVKESCQILEQMKRSGCDPD 322 Query: 1631 WVSYYLVARVLYLSGRFGKGNQIVEEMIEAGVDPPPKLYHNLVGVLCGVERVNYALELFE 1810 SYY V RVLYL+GRFGKGNQIV+EMIE + P K Y++L+GVLCGVERVN+AL+LFE Sbjct: 323 TASYYFVVRVLYLTGRFGKGNQIVDEMIERELRPERKFYYDLIGVLCGVERVNFALQLFE 382 Query: 1811 RMKRSSVGEYGPVYDLLIPKLCRGGDFVKGKELWDEAERMGVALCCSRVALDPSITEVYV 1990 +MKRSSVG YGPVYDLLIPKLC+GG+F KGKELW+EA + V LC S LDPS+TEV+ Sbjct: 383 KMKRSSVGGYGPVYDLLIPKLCKGGNFEKGKELWEEAMSLDVTLCSSIDLLDPSVTEVFK 442 Query: 1991 CKRQDDEE 2014 ++ + E Sbjct: 443 PMKKKEVE 450 >ref|XP_002866430.1| hypothetical protein ARALYDRAFT_496296 [Arabidopsis lyrata subsp. lyrata] gi|297312265|gb|EFH42689.1| hypothetical protein ARALYDRAFT_496296 [Arabidopsis lyrata subsp. lyrata] Length = 489 Score = 536 bits (1380), Expect = e-149 Identities = 261/447 (58%), Positives = 335/447 (74%), Gaps = 2/447 (0%) Frame = +2 Query: 677 SLARLSALLKPTNAWKFKLLYFCS--CTXXXXXXXXQVQEICNLVSTPVGSLDDLESGLD 850 S+ R + ++ TN K YFCS ++QE+ +VS+P+G LDDLE L+ Sbjct: 3 SIVRSNGIVFVTNTIKLTR-YFCSHHLVDRPDRASTELQEVIRIVSSPIGGLDDLEKNLN 61 Query: 851 KCGAPLTSSMVVQVVEHCKNNSPSRRLLRFFTWSCKRLNGGLVDKEFNFAIQAFAEMKDG 1030 + +S++V QV+E CKN + RRLLRFF+WSCK L + DKEFN ++ AE KD Sbjct: 62 QVSVSPSSNLVTQVIESCKNETSPRRLLRFFSWSCKSLGSNVHDKEFNHVLRVLAEKKDH 121 Query: 1031 RAVEILLSDISKEGRVLDAQTFCDVVDVLVKLGREDDALGIFKNLDKFGCPHDKTTVTAI 1210 A++ILLSD+ +E R +D QTF V + LVK+G+E+DA+GIFK LDKF CP D TVTAI Sbjct: 122 TAIQILLSDLRQENRAMDKQTFSIVAETLVKIGKEEDAIGIFKILDKFLCPQDSFTVTAI 181 Query: 1211 ITALCTKGHARRAEGVLRHHGDKISGVESCVYKSLLHGWSLKENVKEARRILQEMKSKKV 1390 I+ALC++GH +RA GV+ HH D ISG E VY+SLL GWS++ NVKEARR++Q+MKS + Sbjct: 182 ISALCSRGHVKRALGVMHHHKDAISGNELSVYRSLLFGWSVQRNVKEARRVIQDMKSAGI 241 Query: 1391 MLDLYCFNTFLKCLCENNLKKNPSGLVPEALNVMMEMRSYKIMPNSISYNILLSCLGRTR 1570 DL+CFN+ L CLCE N+ +NPSGLVPEALN+M+EMRSYKI P SISYNILLSCLGRTR Sbjct: 242 TPDLFCFNSLLTCLCERNVNRNPSGLVPEALNIMLEMRSYKIQPTSISYNILLSCLGRTR 301 Query: 1571 RVKETLNTIELMRKTRCNPDWVSYYLVARVLYLSGRFGKGNQIVEEMIEAGVDPPPKLYH 1750 RV+E+ +E M+++ C+PD SYY V RVLYL+GRFGKGNQIV+EMIE G+ P K Y+ Sbjct: 302 RVRESCQILEQMKRSGCDPDTASYYFVVRVLYLTGRFGKGNQIVDEMIERGLRPEHKFYY 361 Query: 1751 NLVGVLCGVERVNYALELFERMKRSSVGEYGPVYDLLIPKLCRGGDFVKGKELWDEAERM 1930 +L+GVLCGVERVN+AL+LFE+MKRSSV YGPVYDLLIPKLC+GG+F KGKELW+EA + Sbjct: 362 DLIGVLCGVERVNFALQLFEKMKRSSVDGYGPVYDLLIPKLCKGGNFEKGKELWEEAMSL 421 Query: 1931 GVALCCSRVALDPSITEVYVCKRQDDE 2011 V L CS LDPS+TEV+ ++ +E Sbjct: 422 NVTLSCSISLLDPSVTEVFKPMKKKEE 448 >gb|ESW27283.1| hypothetical protein PHAVU_003G188300g [Phaseolus vulgaris] Length = 494 Score = 530 bits (1364), Expect = e-147 Identities = 253/403 (62%), Positives = 322/403 (79%) Frame = +2 Query: 779 QVQEICNLVSTPVGSLDDLESGLDKCGAPLTSSMVVQVVEHCKNNSPSRRLLRFFTWSCK 958 Q+QE+C++V + VG LDDLE L+K LTSS+V Q ++ K+ + +RRLLRFF WS K Sbjct: 45 QLQELCSVVVSTVGGLDDLEFSLNKFKDSLTSSLVAQAIDSSKHEAHTRRLLRFFLWSSK 104 Query: 959 RLNGGLVDKEFNFAIQAFAEMKDGRAVEILLSDISKEGRVLDAQTFCDVVDVLVKLGRED 1138 L+ L +K++N A++ FAE D A++IL+ D+ KEGRV+DA+TF V D LVKLG+ED Sbjct: 105 NLSHSLENKDYNHALRVFAEKNDYTAMDILMEDLKKEGRVMDAETFGLVADTLVKLGKED 164 Query: 1139 DALGIFKNLDKFGCPHDKTTVTAIITALCTKGHARRAEGVLRHHGDKISGVESCVYKSLL 1318 ALG+FKNLDK+ C D+ TVTAII ALC+KGHA+RAEGV+ HH DKI+G + C+Y+SLL Sbjct: 165 QALGVFKNLDKYKCSIDEFTVTAIINALCSKGHAKRAEGVVWHHRDKITGAKPCIYRSLL 224 Query: 1319 HGWSLKENVKEARRILQEMKSKKVMLDLYCFNTFLKCLCENNLKKNPSGLVPEALNVMME 1498 +GWS++ NVKEARRI++EMK+ V DL C+NTFL+CLCE NL+ NPSGLVPEALNVMME Sbjct: 225 YGWSVQRNVKEARRIIKEMKANGVTPDLLCYNTFLRCLCERNLRHNPSGLVPEALNVMME 284 Query: 1499 MRSYKIMPNSISYNILLSCLGRTRRVKETLNTIELMRKTRCNPDWVSYYLVARVLYLSGR 1678 MRS ++ P ISYNILLSCLG+TRRVKE+ +E M C+PDWVSYYLVA+VL+LSGR Sbjct: 285 MRSCRVFPTPISYNILLSCLGKTRRVKESCQILETMTNGGCDPDWVSYYLVAKVLFLSGR 344 Query: 1679 FGKGNQIVEEMIEAGVDPPPKLYHNLVGVLCGVERVNYALELFERMKRSSVGEYGPVYDL 1858 FGKG IV++MI G+ P K Y++L+G+LCGVERVN+ALELFE+MK++S+G YGPVYD+ Sbjct: 345 FGKGKDIVDQMIGKGLMPNHKFYYSLIGILCGVERVNHALELFEKMKKNSMGGYGPVYDV 404 Query: 1859 LIPKLCRGGDFVKGKELWDEAERMGVALCCSRVALDPSITEVY 1987 LIPKLC GG+F KG+ELWDEA MG+ L CS LDPSIT+VY Sbjct: 405 LIPKLCTGGNFEKGRELWDEATSMGIILQCSEDVLDPSITQVY 447 >gb|AAM65325.1| unknown [Arabidopsis thaliana] Length = 487 Score = 528 bits (1361), Expect = e-147 Identities = 256/425 (60%), Positives = 323/425 (76%) Frame = +2 Query: 737 YFCSCTXXXXXXXXQVQEICNLVSTPVGSLDDLESGLDKCGAPLTSSMVVQVVEHCKNNS 916 YFCS + E+ +VS+PVG LDDLE L++ +S++V QV+E CKN + Sbjct: 23 YFCS-HHLVDRSETALHEVIRIVSSPVGGLDDLEENLNQVSVSPSSNLVTQVIESCKNET 81 Query: 917 PSRRLLRFFTWSCKRLNGGLVDKEFNFAIQAFAEMKDGRAVEILLSDISKEGRVLDAQTF 1096 RRLLRFF+WSCK L L DKEFN+ ++ AE KD A++ILLSD+ KE R +D QTF Sbjct: 82 SPRRLLRFFSWSCKSLGSSLHDKEFNYVLRVLAEKKDHTAMQILLSDLRKENRAMDKQTF 141 Query: 1097 CDVVDVLVKLGREDDALGIFKNLDKFGCPHDKTTVTAIITALCTKGHARRAEGVLRHHGD 1276 V + LVK+G+E+DA+GIFK LDKF CP D TVTAII+ALC++GH +RA GV+ HH D Sbjct: 142 SIVAETLVKIGKEEDAIGIFKILDKFSCPQDGFTVTAIISALCSRGHVKRALGVMHHHKD 201 Query: 1277 KISGVESCVYKSLLHGWSLKENVKEARRILQEMKSKKVMLDLYCFNTFLKCLCENNLKKN 1456 ISG E VY+SLL GWS++ NVKEARR++Q+MKS + DL+CFN+ L CLCE N+ +N Sbjct: 202 VISGNELSVYRSLLFGWSVQRNVKEARRVIQDMKSAGITPDLFCFNSLLTCLCERNVNRN 261 Query: 1457 PSGLVPEALNVMMEMRSYKIMPNSISYNILLSCLGRTRRVKETLNTIELMRKTRCNPDWV 1636 PSGLVPEALN+M+EMRSYKI P S+SYNILLSCLGRTRRV+E+ +E M+++ C+PD Sbjct: 262 PSGLVPEALNIMLEMRSYKIQPTSMSYNILLSCLGRTRRVRESCQILEQMKRSGCDPDTG 321 Query: 1637 SYYLVARVLYLSGRFGKGNQIVEEMIEAGVDPPPKLYHNLVGVLCGVERVNYALELFERM 1816 SYY V RVLYL+GRFGKGNQIV+EMIE G P K Y++L+GVLCGVERVN+AL+LFE+M Sbjct: 322 SYYFVVRVLYLTGRFGKGNQIVDEMIERGFRPERKFYYDLIGVLCGVERVNFALQLFEKM 381 Query: 1817 KRSSVGEYGPVYDLLIPKLCRGGDFVKGKELWDEAERMGVALCCSRVALDPSITEVYVCK 1996 KRSSVG YG VYDLLIPKLC+GG+F KG+ELW+EA + V L CS LDPS+TEV+ Sbjct: 382 KRSSVGGYGQVYDLLIPKLCKGGNFEKGRELWEEALSIDVTLSCSISLLDPSVTEVFKPM 441 Query: 1997 RQDDE 2011 + +E Sbjct: 442 KMKEE 446 >ref|NP_200945.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75171474|sp|Q9FLJ6.1|PP439_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At5g61370, mitochondrial; Flags: Precursor gi|9757858|dbj|BAB08492.1| unnamed protein product [Arabidopsis thaliana] gi|17529064|gb|AAL38742.1| unknown protein [Arabidopsis thaliana] gi|23296891|gb|AAN13197.1| unknown protein [Arabidopsis thaliana] gi|332010076|gb|AED97459.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 487 Score = 528 bits (1360), Expect = e-147 Identities = 256/425 (60%), Positives = 323/425 (76%) Frame = +2 Query: 737 YFCSCTXXXXXXXXQVQEICNLVSTPVGSLDDLESGLDKCGAPLTSSMVVQVVEHCKNNS 916 YFCS + E+ +VS+PVG LDDLE L++ +S++V QV+E CKN + Sbjct: 23 YFCS-HHLVDRSETALHEVIRIVSSPVGGLDDLEENLNQVSVSPSSNLVTQVIESCKNET 81 Query: 917 PSRRLLRFFTWSCKRLNGGLVDKEFNFAIQAFAEMKDGRAVEILLSDISKEGRVLDAQTF 1096 RRLLRFF+WSCK L L DKEFN+ ++ AE KD A++ILLSD+ KE R +D QTF Sbjct: 82 SPRRLLRFFSWSCKSLGSSLHDKEFNYVLRVLAEKKDHTAMQILLSDLRKENRAMDKQTF 141 Query: 1097 CDVVDVLVKLGREDDALGIFKNLDKFGCPHDKTTVTAIITALCTKGHARRAEGVLRHHGD 1276 V + LVK+G+E+DA+GIFK LDKF CP D TVTAII+ALC++GH +RA GV+ HH D Sbjct: 142 SIVAETLVKVGKEEDAIGIFKILDKFSCPQDGFTVTAIISALCSRGHVKRALGVMHHHKD 201 Query: 1277 KISGVESCVYKSLLHGWSLKENVKEARRILQEMKSKKVMLDLYCFNTFLKCLCENNLKKN 1456 ISG E VY+SLL GWS++ NVKEARR++Q+MKS + DL+CFN+ L CLCE N+ +N Sbjct: 202 VISGNELSVYRSLLFGWSVQRNVKEARRVIQDMKSAGITPDLFCFNSLLTCLCERNVNRN 261 Query: 1457 PSGLVPEALNVMMEMRSYKIMPNSISYNILLSCLGRTRRVKETLNTIELMRKTRCNPDWV 1636 PSGLVPEALN+M+EMRSYKI P S+SYNILLSCLGRTRRV+E+ +E M+++ C+PD Sbjct: 262 PSGLVPEALNIMLEMRSYKIQPTSMSYNILLSCLGRTRRVRESCQILEQMKRSGCDPDTG 321 Query: 1637 SYYLVARVLYLSGRFGKGNQIVEEMIEAGVDPPPKLYHNLVGVLCGVERVNYALELFERM 1816 SYY V RVLYL+GRFGKGNQIV+EMIE G P K Y++L+GVLCGVERVN+AL+LFE+M Sbjct: 322 SYYFVVRVLYLTGRFGKGNQIVDEMIERGFRPERKFYYDLIGVLCGVERVNFALQLFEKM 381 Query: 1817 KRSSVGEYGPVYDLLIPKLCRGGDFVKGKELWDEAERMGVALCCSRVALDPSITEVYVCK 1996 KRSSVG YG VYDLLIPKLC+GG+F KG+ELW+EA + V L CS LDPS+TEV+ Sbjct: 382 KRSSVGGYGQVYDLLIPKLCKGGNFEKGRELWEEALSIDVTLSCSISLLDPSVTEVFKPM 441 Query: 1997 RQDDE 2011 + +E Sbjct: 442 KMKEE 446 >ref|XP_002318601.2| hypothetical protein POPTR_0012s07030g, partial [Populus trichocarpa] gi|550326549|gb|EEE96821.2| hypothetical protein POPTR_0012s07030g, partial [Populus trichocarpa] Length = 410 Score = 522 bits (1344), Expect = e-145 Identities = 256/399 (64%), Positives = 316/399 (79%), Gaps = 3/399 (0%) Frame = +2 Query: 794 CNLVSTPVGSLDDLESGLDKCGAPLTSSMVVQVVEHCKNNSPSRRLLRFFTWSCKRLNGG 973 C ++S+ +G LDDLE L++ LT +V Q++ CK+ +PSRR+LRFF WS K L+ Sbjct: 2 CKVISSWIGGLDDLELSLNQFKGQLTYPLVTQIINSCKHEAPSRRILRFFLWSNKVLDSE 61 Query: 974 -LVDKEFNFAIQAFAEMKDGRAVEILLSDISKEGRVLDAQTFCDVVDVLVKLGREDDALG 1150 L D +FN I+ AE KD + IL+SD+ KEGRV+D QTF V + LVKLGRED+ALG Sbjct: 62 KLKDDDFNHVIRVLAEKKDHTGMRILISDLRKEGRVMDPQTFALVAETLVKLGREDEALG 121 Query: 1151 IFKNLDKFGCPHDKTTVTAIITALCTKGHARRAEGVLRHH-GDKISGVESCV-YKSLLHG 1324 IFKNL+KF CP D VTAII+ALC KGHA++A+GV HH +KISG+E CV Y+ LL+G Sbjct: 122 IFKNLEKFKCPQDGFAVTAIISALCAKGHAKKAQGVFSHHKNNKISGLEPCVVYRCLLYG 181 Query: 1325 WSLKENVKEARRILQEMKSKKVMLDLYCFNTFLKCLCENNLKKNPSGLVPEALNVMMEMR 1504 WS++ENVKEAR+I+QEMK ++ DL+C+NTFLKCLCE NLK+NPSGLVPEALNVMMEMR Sbjct: 182 WSVQENVKEARKIIQEMKGDGLIPDLFCYNTFLKCLCERNLKRNPSGLVPEALNVMMEMR 241 Query: 1505 SYKIMPNSISYNILLSCLGRTRRVKETLNTIELMRKTRCNPDWVSYYLVARVLYLSGRFG 1684 SY+I PNSISYN LLS LGR RRVKE+ +E M+ T C PDWVSY+LVA+V+YL+GRFG Sbjct: 242 SYRIEPNSISYNTLLSSLGRARRVKESYRMLETMKTTGCAPDWVSYFLVAKVMYLTGRFG 301 Query: 1685 KGNQIVEEMIEAGVDPPPKLYHNLVGVLCGVERVNYALELFERMKRSSVGEYGPVYDLLI 1864 KGN+IV+EMI G+ P K Y+NL+GVLCGVERV+YALELFERMK SS+G YGPVYD+LI Sbjct: 302 KGNEIVDEMIGQGLLPDRKFYYNLIGVLCGVERVSYALELFERMKTSSLGGYGPVYDILI 361 Query: 1865 PKLCRGGDFVKGKELWDEAERMGVALCCSRVALDPSITE 1981 PKLC+GGDF +G+ELW+EA MGV+ CS LDPSITE Sbjct: 362 PKLCKGGDFERGRELWEEATAMGVSFSCSSDVLDPSITE 400 >ref|XP_006394515.1| hypothetical protein EUTSA_v10004085mg [Eutrema salsugineum] gi|557091154|gb|ESQ31801.1| hypothetical protein EUTSA_v10004085mg [Eutrema salsugineum] Length = 489 Score = 515 bits (1326), Expect = e-143 Identities = 245/412 (59%), Positives = 315/412 (76%) Frame = +2 Query: 779 QVQEICNLVSTPVGSLDDLESGLDKCGAPLTSSMVVQVVEHCKNNSPSRRLLRFFTWSCK 958 ++ E+ +VS+P+G LDDLE L++ +S +V +V++ CK+ + RRLLRFF+WSCK Sbjct: 38 ELHEVIRIVSSPIGGLDDLEESLNQVSVSPSSKLVHKVIDSCKDETSPRRLLRFFSWSCK 97 Query: 959 RLNGGLVDKEFNFAIQAFAEMKDGRAVEILLSDISKEGRVLDAQTFCDVVDVLVKLGRED 1138 L L DK FN ++ AE KD A++ILLSD+ K+ R +D QTF V + LVK+GRE+ Sbjct: 98 NLGSCLEDKTFNHVLRVLAEKKDHTAIQILLSDLRKQNRAMDKQTFSLVAETLVKIGREE 157 Query: 1139 DALGIFKNLDKFGCPHDKTTVTAIITALCTKGHARRAEGVLRHHGDKISGVESCVYKSLL 1318 DA+GIFK LDKF C D TVTAII+ALC++GH +RA GV+ HH ISG E VY+SLL Sbjct: 158 DAIGIFKILDKFSCQQDSFTVTAIISALCSRGHVKRALGVMHHHKALISGNELSVYRSLL 217 Query: 1319 HGWSLKENVKEARRILQEMKSKKVMLDLYCFNTFLKCLCENNLKKNPSGLVPEALNVMME 1498 GWS++ NVKEARR++Q+MKS ++ DL+C+NT L CLCE N+ +NPSGLVPEALN+M+E Sbjct: 218 FGWSVQRNVKEARRVIQDMKSSRITPDLFCYNTMLTCLCERNVNRNPSGLVPEALNIMLE 277 Query: 1499 MRSYKIMPNSISYNILLSCLGRTRRVKETLNTIELMRKTRCNPDWVSYYLVARVLYLSGR 1678 MRSYKI P ISYNILLSCL RTRRVKE+ +E M+K+ C+PD SYY V RVLYL+GR Sbjct: 278 MRSYKIQPTCISYNILLSCLARTRRVKESCQILEQMKKSGCDPDTASYYFVVRVLYLTGR 337 Query: 1679 FGKGNQIVEEMIEAGVDPPPKLYHNLVGVLCGVERVNYALELFERMKRSSVGEYGPVYDL 1858 FGKGNQ V+EMIE G+ P + Y++L+GVLCGV+RVN+AL+LF +MKRSSVG YGPVYDL Sbjct: 338 FGKGNQTVDEMIERGLRPERRFYYDLIGVLCGVKRVNFALQLFAKMKRSSVGGYGPVYDL 397 Query: 1859 LIPKLCRGGDFVKGKELWDEAERMGVALCCSRVALDPSITEVYVCKRQDDEE 2014 LIPKLC+GGDF KG+ELW+EA + V L CS LDPS+TEV+ ++ EE Sbjct: 398 LIPKLCKGGDFEKGRELWEEAMSLDVTLSCSVDLLDPSLTEVFKPMKKKKEE 449 >ref|XP_006849319.1| hypothetical protein AMTR_s00164p00020970 [Amborella trichopoda] gi|548852840|gb|ERN10900.1| hypothetical protein AMTR_s00164p00020970 [Amborella trichopoda] Length = 459 Score = 383 bits (984), Expect = e-103 Identities = 185/397 (46%), Positives = 275/397 (69%) Frame = +2 Query: 815 VGSLDDLESGLDKCGAPLTSSMVVQVVEHCKNNSPSRRLLRFFTWSCKRLNGGLVDKEFN 994 +G+LDD+ES L++ ++ +V QV+E C + + +RRLLRFFTWS K+ L D FN Sbjct: 35 IGNLDDIESNLNQSEILISPPLVTQVMESCTHRAQTRRLLRFFTWSAKQPTCKLPDTLFN 94 Query: 995 FAIQAFAEMKDGRAVEILLSDISKEGRVLDAQTFCDVVDVLVKLGREDDALGIFKNLDKF 1174 AI+ FA +KD RA+E+L++++ +E R + T+ + +V G+ED A+GIFKN++K+ Sbjct: 95 HAIKLFASLKDLRAMELLVTELKRESRGMGIDTWAAIATTMVDHGKEDQAIGIFKNIEKY 154 Query: 1175 GCPHDKTTVTAIITALCTKGHARRAEGVLRHHGDKISGVESCVYKSLLHGWSLKENVKEA 1354 CP D+ ++ ++ ALC +GHAR+AEGV+ + + +S ++S ++ +L+HGW +K K+A Sbjct: 155 RCPRDEKSLNLLVHALCARGHARKAEGVVWNAKNWVS-MDSYIFTTLIHGWCIKGEFKDA 213 Query: 1355 RRILQEMKSKKVMLDLYCFNTFLKCLCENNLKKNPSGLVPEALNVMMEMRSYKIMPNSIS 1534 RR+ +EM+S +L +++ ++C+C NL+ NPS LV + ++MEMRS + P +IS Sbjct: 214 RRVFEEMRSNGFSPNLVAYHSLIRCVCAKNLRINPSALVRDFFELVMEMRSNSVCPTTIS 273 Query: 1535 YNILLSCLGRTRRVKETLNTIELMRKTRCNPDWVSYYLVARVLYLSGRFGKGNQIVEEMI 1714 +NIL+S LGR RRVKE M + C+PD+VSY+LV R+LYL+GR GKGN++V+EMI Sbjct: 274 FNILISYLGRARRVKEADQVFRAMVQEGCDPDYVSYFLVVRLLYLTGRMGKGNEMVDEMI 333 Query: 1715 EAGVDPPPKLYHNLVGVLCGVERVNYALELFERMKRSSVGEYGPVYDLLIPKLCRGGDFV 1894 + G+ P + YH+L GVLCGVE+V++AL L RMK + YGP YDLLI KLC+GG F Sbjct: 334 QIGLKPKARFYHSLTGVLCGVEKVDHALWLLARMKENCSEVYGPTYDLLITKLCKGGKFE 393 Query: 1895 KGKELWDEAERMGVALCCSRVALDPSITEVYVCKRQD 2005 G++LWDEA G L CS LDPS TEVY KR++ Sbjct: 394 IGRKLWDEALERGAVLQCSVDLLDPSKTEVYKPKRKE 430 >gb|EEC66969.1| hypothetical protein OsI_33629 [Oryza sativa Indica Group] Length = 648 Score = 359 bits (921), Expect = 3e-96 Identities = 192/409 (46%), Positives = 267/409 (65%), Gaps = 15/409 (3%) Frame = +2 Query: 800 LVSTPVGSLDDLESGLDKCGAPLTSSMVVQVVEHCKNNSPS-RRLLRFFTWSCKRLNGGL 976 +V + GSLD++ LD+ G P++ +MV +V++ C S RRLLRF +W + GG+ Sbjct: 30 VVCSGAGSLDEVGGALDRLGVPVSPAMVARVIDACSERMGSGRRLLRFLSWCRSKDAGGI 89 Query: 977 VDKEFNFAIQAFAEMKDGRAVEILLSDISKEGRVLDAQTFCDVVDVLVKLGREDDALGIF 1156 D+ + AI A A M D A+ I ++D K+GR + +TF VV+ LVKLG+ED+A+ +F Sbjct: 90 GDEALDSAIAALARMGDLTAMRIAVADAEKDGRRMSPETFTVVVEALVKLGKEDEAVRLF 149 Query: 1157 KNLDKFGCPHDK----------TTVTAIITALCTKGHARRAEGVLRHHGDKIS--GVESC 1300 + L++ + ++ A++ ALC KGHAR A+GV+ HH ++S + S Sbjct: 150 RGLERQRLLPQRDAGDGGEGVWSSSLAMVQALCMKGHAREAQGVVWHHKSELSVEPMVSI 209 Query: 1301 VYKSLLHGWSLKENVKEARRILQEMKSKKVMLDLYCFNTFLKCLCENNLKKNPSGLVPEA 1480 V +SLLHGW + N KEARR+L ++KS L L FN +L CLC NLK NPS LV EA Sbjct: 210 VQRSLLHGWCVHGNAKEARRVLDDIKSSCTPLGLPSFNDYLHCLCHRNLKFNPSALVTEA 269 Query: 1481 LNVMMEMRSYKIMPNSISYNILLSCLGRTRRVKETLNTIELMR--KTRCNPDWVSYYLVA 1654 ++V+ EMRSY + P++ S NILLSCLGR RRVKE+ + LMR K C+PDWVSYYLV Sbjct: 270 MDVLSEMRSYGVTPDASSLNILLSCLGRARRVKESYRILYLMREGKAGCSPDWVSYYLVV 329 Query: 1655 RVLYLSGRFGKGNQIVEEMIEAGVDPPPKLYHNLVGVLCGVERVNYALELFERMKRSSVG 1834 RVLYL+GR +G ++V++M+E+GV P K +H L+GVLCG E+V++ L++F MKR + Sbjct: 330 RVLYLTGRIIRGKRLVDDMLESGVLPTAKFFHGLIGVLCGTEKVDHGLDMFRLMKRCQLV 389 Query: 1835 EYGPVYDLLIPKLCRGGDFVKGKELWDEAERMGVALCCSRVALDPSITE 1981 + YDLLI KLCR G F GKELWD+A++ G L CS LDP TE Sbjct: 390 D-THTYDLLIEKLCRNGRFENGKELWDDAKKNGFMLGCSEDLLDPLKTE 437