BLASTX nr result
ID: Atropa21_contig00011447
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Atropa21_contig00011447 (1088 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_004237380.1| PREDICTED: pentatricopeptide repeat-containi... 462 e-128 ref|XP_006350412.1| PREDICTED: pentatricopeptide repeat-containi... 459 e-127 ref|XP_002269673.1| PREDICTED: pentatricopeptide repeat-containi... 320 8e-85 emb|CBI39461.3| unnamed protein product [Vitis vinifera] 319 1e-84 ref|XP_002526313.1| conserved hypothetical protein [Ricinus comm... 311 3e-82 ref|XP_004298657.1| PREDICTED: pentatricopeptide repeat-containi... 309 1e-81 gb|EOX92648.1| Uncharacterized protein TCM_046974 [Theobroma cacao] 305 2e-80 ref|XP_002515828.1| conserved hypothetical protein [Ricinus comm... 301 2e-79 ref|XP_006478887.1| PREDICTED: pentatricopeptide repeat-containi... 296 7e-78 ref|XP_006373907.1| hypothetical protein POPTR_0016s10300g [Popu... 295 2e-77 ref|XP_006443149.1| hypothetical protein CICLE_v10021498mg [Citr... 294 5e-77 ref|XP_006373908.1| hypothetical protein POPTR_0016s10300g [Popu... 283 6e-74 ref|XP_002323526.2| hypothetical protein POPTR_0016s10300g [Popu... 283 6e-74 ref|XP_002309089.2| hypothetical protein POPTR_0006s09260g [Popu... 283 1e-73 ref|XP_004511665.1| PREDICTED: pentatricopeptide repeat-containi... 282 2e-73 ref|XP_004136857.1| PREDICTED: pentatricopeptide repeat-containi... 278 2e-72 ref|NP_563712.1| uncharacterized protein [Arabidopsis thaliana] ... 278 3e-72 ref|NP_001241921.1| uncharacterized protein LOC100795658 [Glycin... 276 8e-72 ref|XP_002892228.1| EMB2748 [Arabidopsis lyrata subsp. lyrata] g... 275 2e-71 gb|ESW29328.1| hypothetical protein PHAVU_002G061400g [Phaseolus... 274 4e-71 >ref|XP_004237380.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like [Solanum lycopersicum] Length = 281 Score = 462 bits (1190), Expect = e-128 Identities = 227/281 (80%), Positives = 244/281 (86%) Frame = +3 Query: 129 MMSRSAILTRLARQISQLTVNRTYVLTCSFNTYVRHSITKRSDAETIGSFDDRFSNKSLS 308 MMS+ AI+T LARQISQLTVNR+ VLTCS++T V HSI+ R DAET GS DRF KSLS Sbjct: 1 MMSKLAIITTLARQISQLTVNRSSVLTCSYSTDVWHSISNRGDAETTGSLGDRFGYKSLS 60 Query: 309 SFAQDPSGGECKPQIGENVSRKDKISFLVNTLLDLKDSKEAVYGALDAWVAWERTFPIGP 488 S A P GG KPQ+GENVSRKDK+SFLVNTLLDL+DSKEAVYGALDAWVAWER FPIG Sbjct: 61 SLAGKPIGGNSKPQVGENVSRKDKVSFLVNTLLDLEDSKEAVYGALDAWVAWERNFPIGS 120 Query: 489 XXXXXXXXXXXXXWHRVVQVIKWMLSKGQGNTMGTYEQLIKALDMDHRAKEAHEFWKKKI 668 WHR+VQVIKWMLSKGQGNTMGTYEQLIKALDMDHRAKEAHEFW KKI Sbjct: 121 LKQVLLKLEKEQQWHRIVQVIKWMLSKGQGNTMGTYEQLIKALDMDHRAKEAHEFWNKKI 180 Query: 669 GSDLHSVPWRLCSLMISVYYRNHMLEDLTKLFKGLEAFDRKPPDKSIVQKVADTYEVLGF 848 GSDLHSVPWRLCSLMISVYYRNHMLEDL KLFKGLE+FDRKPPDKSI+QKVADTYEV G+ Sbjct: 181 GSDLHSVPWRLCSLMISVYYRNHMLEDLIKLFKGLESFDRKPPDKSIIQKVADTYEVQGY 240 Query: 849 LDEKDRLLEKYKDLFTETWDGNPKGLRGSRSQRKEKQAQEN 971 +D+KDRLLEKYKDLFTETW+GNPKGLRGSR QRKEKQAQE+ Sbjct: 241 VDQKDRLLEKYKDLFTETWNGNPKGLRGSRPQRKEKQAQED 281 >ref|XP_006350412.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21190-like [Solanum tuberosum] Length = 280 Score = 459 bits (1182), Expect = e-127 Identities = 231/281 (82%), Positives = 242/281 (86%) Frame = +3 Query: 129 MMSRSAILTRLARQISQLTVNRTYVLTCSFNTYVRHSITKRSDAETIGSFDDRFSNKSLS 308 MMS+ AI+TRLARQISQLTVNRT VLTCS++T VRHS + R D ET GSF RF KSLS Sbjct: 1 MMSKLAIITRLARQISQLTVNRTSVLTCSYSTDVRHSTSNRGDGETTGSFGYRFGYKSLS 60 Query: 309 SFAQDPSGGECKPQIGENVSRKDKISFLVNTLLDLKDSKEAVYGALDAWVAWERTFPIGP 488 S A P G KPQ+GENVSRKDKISFLVNTLLDLKDSKEAVYGALDAWVAWER FPIG Sbjct: 61 SLAGKPIGNS-KPQVGENVSRKDKISFLVNTLLDLKDSKEAVYGALDAWVAWERNFPIGS 119 Query: 489 XXXXXXXXXXXXXWHRVVQVIKWMLSKGQGNTMGTYEQLIKALDMDHRAKEAHEFWKKKI 668 WH++VQVIKWMLSKGQGNTMGTYEQLIKALDMDHRAKEAHEFW KKI Sbjct: 120 LKQVLLKLEKEQQWHKIVQVIKWMLSKGQGNTMGTYEQLIKALDMDHRAKEAHEFWNKKI 179 Query: 669 GSDLHSVPWRLCSLMISVYYRNHMLEDLTKLFKGLEAFDRKPPDKSIVQKVADTYEVLGF 848 GSDLHSVPWRLCSLMISVYYRNHMLEDL KLFKGLEAFDRKPPDKSIVQKVADTYEV G Sbjct: 180 GSDLHSVPWRLCSLMISVYYRNHMLEDLIKLFKGLEAFDRKPPDKSIVQKVADTYEVQGN 239 Query: 849 LDEKDRLLEKYKDLFTETWDGNPKGLRGSRSQRKEKQAQEN 971 LD+KDRLLEKYKDLFTETW+GNPKGLRGSR QRKEKQAQE+ Sbjct: 240 LDQKDRLLEKYKDLFTETWNGNPKGLRGSRPQRKEKQAQED 280 >ref|XP_002269673.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like isoform 1 [Vitis vinifera] Length = 300 Score = 320 bits (819), Expect = 8e-85 Identities = 160/277 (57%), Positives = 196/277 (70%) Frame = +3 Query: 123 LHMMSRSAILTRLARQISQLTVNRTYVLTCSFNTYVRHSITKRSDAETIGSFDDRFSNKS 302 L MS+S + L RQ +QL R L S++T+ + ++ S+ + + +N+ Sbjct: 2 LMAMSKSKAMVNLVRQFTQLGATRVQTLASSYSTFTQTQMSDTSNVGEVAFLGGQCNNQP 61 Query: 303 LSSFAQDPSGGECKPQIGENVSRKDKISFLVNTLLDLKDSKEAVYGALDAWVAWERTFPI 482 + + + K QIGENVSRKDKI+FLV TLLDLKDSKEAVYGALDAWVAWE+ FPI Sbjct: 62 MYHDSGKDAASVHKHQIGENVSRKDKINFLVTTLLDLKDSKEAVYGALDAWVAWEQNFPI 121 Query: 483 GPXXXXXXXXXXXXXWHRVVQVIKWMLSKGQGNTMGTYEQLIKALDMDHRAKEAHEFWKK 662 WHRV+QV+KWMLSKGQG TMGTY QLI+ALDMDHRA+EAHEFW K Sbjct: 122 ASLKRVLITLEKEQQWHRVIQVVKWMLSKGQGTTMGTYGQLIRALDMDHRAEEAHEFWVK 181 Query: 663 KIGSDLHSVPWRLCSLMISVYYRNHMLEDLTKLFKGLEAFDRKPPDKSIVQKVADTYEVL 842 KIG+DLHSVPW LC MISVYYRN+MLE+L KLFKGLEAFDRKP DK +V+KVAD YE+L Sbjct: 182 KIGTDLHSVPWHLCHRMISVYYRNNMLENLVKLFKGLEAFDRKPQDKLVVKKVADAYEML 241 Query: 843 GFLDEKDRLLEKYKDLFTETWDGNPKGLRGSRSQRKE 953 G L+EK+R+ EKY LFTET G PK + S++K+ Sbjct: 242 GLLEEKERIFEKYDYLFTETVAGKPKKSKKFLSEKKK 278 >emb|CBI39461.3| unnamed protein product [Vitis vinifera] Length = 296 Score = 319 bits (818), Expect = 1e-84 Identities = 159/274 (58%), Positives = 195/274 (71%) Frame = +3 Query: 132 MSRSAILTRLARQISQLTVNRTYVLTCSFNTYVRHSITKRSDAETIGSFDDRFSNKSLSS 311 MS+S + L RQ +QL R L S++T+ + ++ S+ + + +N+ + Sbjct: 1 MSKSKAMVNLVRQFTQLGATRVQTLASSYSTFTQTQMSDTSNVGEVAFLGGQCNNQPMYH 60 Query: 312 FAQDPSGGECKPQIGENVSRKDKISFLVNTLLDLKDSKEAVYGALDAWVAWERTFPIGPX 491 + + K QIGENVSRKDKI+FLV TLLDLKDSKEAVYGALDAWVAWE+ FPI Sbjct: 61 DSGKDAASVHKHQIGENVSRKDKINFLVTTLLDLKDSKEAVYGALDAWVAWEQNFPIASL 120 Query: 492 XXXXXXXXXXXXWHRVVQVIKWMLSKGQGNTMGTYEQLIKALDMDHRAKEAHEFWKKKIG 671 WHRV+QV+KWMLSKGQG TMGTY QLI+ALDMDHRA+EAHEFW KKIG Sbjct: 121 KRVLITLEKEQQWHRVIQVVKWMLSKGQGTTMGTYGQLIRALDMDHRAEEAHEFWVKKIG 180 Query: 672 SDLHSVPWRLCSLMISVYYRNHMLEDLTKLFKGLEAFDRKPPDKSIVQKVADTYEVLGFL 851 +DLHSVPW LC MISVYYRN+MLE+L KLFKGLEAFDRKP DK +V+KVAD YE+LG L Sbjct: 181 TDLHSVPWHLCHRMISVYYRNNMLENLVKLFKGLEAFDRKPQDKLVVKKVADAYEMLGLL 240 Query: 852 DEKDRLLEKYKDLFTETWDGNPKGLRGSRSQRKE 953 +EK+R+ EKY LFTET G PK + S++K+ Sbjct: 241 EEKERIFEKYDYLFTETVAGKPKKSKKFLSEKKK 274 >ref|XP_002526313.1| conserved hypothetical protein [Ricinus communis] gi|223534394|gb|EEF36102.1| conserved hypothetical protein [Ricinus communis] Length = 300 Score = 311 bits (797), Expect = 3e-82 Identities = 166/277 (59%), Positives = 194/277 (70%), Gaps = 4/277 (1%) Frame = +3 Query: 132 MSRSAILTRLARQISQLTVNRTYVLTCSFNTY----VRHSITKRSDAETIGSFDDRFSNK 299 M RS + L ++SQ+ V R L CS Y V+ I+ R+ D + K Sbjct: 1 MWRSPAFSSLTGRLSQVGVAR---LQCSNGRYSSTMVQAQISNRNTPSPRPEDQDDY--K 55 Query: 300 SLSSFAQDPSGGECKPQIGENVSRKDKISFLVNTLLDLKDSKEAVYGALDAWVAWERTFP 479 + + +GG K QIG+NVSRK+KI FL+ TLLDLKDSKEAVYGALDAWVAWE FP Sbjct: 56 TTCHNSNQSAGGVQKNQIGKNVSRKEKIDFLLKTLLDLKDSKEAVYGALDAWVAWEHNFP 115 Query: 480 IGPXXXXXXXXXXXXXWHRVVQVIKWMLSKGQGNTMGTYEQLIKALDMDHRAKEAHEFWK 659 I WH+VVQVIKWMLSKGQGNTMGTY QLI+ALDMDHRA EAH FW Sbjct: 116 IASLKRVLILLEKEQQWHKVVQVIKWMLSKGQGNTMGTYGQLIRALDMDHRANEAHMFWL 175 Query: 660 KKIGSDLHSVPWRLCSLMISVYYRNHMLEDLTKLFKGLEAFDRKPPDKSIVQKVADTYEV 839 KKIG DLHSVPW+LC MISVYYRN+MLE L KLFKGLEAFDRKPPDKSI+QKVAD YE+ Sbjct: 176 KKIGLDLHSVPWQLCHRMISVYYRNNMLESLVKLFKGLEAFDRKPPDKSILQKVADAYEM 235 Query: 840 LGFLDEKDRLLEKYKDLFTETWDGNPKGLRGSRSQRK 950 LG L+EK+R+L+KYKDLF ET G PK R + +++K Sbjct: 236 LGMLEEKERVLQKYKDLFKETEKGRPKKSRSTLAKKK 272 >ref|XP_004298657.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like [Fragaria vesca subsp. vesca] Length = 275 Score = 309 bits (792), Expect = 1e-81 Identities = 156/275 (56%), Positives = 198/275 (72%) Frame = +3 Query: 132 MSRSAILTRLARQISQLTVNRTYVLTCSFNTYVRHSITKRSDAETIGSFDDRFSNKSLSS 311 M +S ++ L +++QL V R VLT S++T + + + S +D+ SN+ + Sbjct: 1 MWKSPPMSYLVGRLTQLGVIRAQVLTSSYSTAAHAQLYHHTTGKAAVSLEDQHSNQGIRH 60 Query: 312 FAQDPSGGECKPQIGENVSRKDKISFLVNTLLDLKDSKEAVYGALDAWVAWERTFPIGPX 491 F + +GGE + QIG NVSRKDK++FLV TLLDL DSKEAVYG LD WVAWE+ FPIG Sbjct: 61 FPEKNAGGENRNQIGWNVSRKDKVNFLVKTLLDLNDSKEAVYGTLDGWVAWEQDFPIGKL 120 Query: 492 XXXXXXXXXXXXWHRVVQVIKWMLSKGQGNTMGTYEQLIKALDMDHRAKEAHEFWKKKIG 671 WHR++QVIKWMLSKGQG TMGTY QLI ALDMD R +EAH+FWKKKIG Sbjct: 121 RMALIALEKEQQWHRIIQVIKWMLSKGQGTTMGTYGQLIHALDMDQRPEEAHKFWKKKIG 180 Query: 672 SDLHSVPWRLCSLMISVYYRNHMLEDLTKLFKGLEAFDRKPPDKSIVQKVADTYEVLGFL 851 DLH+VPW+LC M+S+YYRN+MLE+L KLF+GLEAFDRKPP KSIV+KVAD YE+LG L Sbjct: 181 MDLHAVPWQLCKSMMSIYYRNNMLENLIKLFEGLEAFDRKPPQKSIVRKVADAYEILGRL 240 Query: 852 DEKDRLLEKYKDLFTETWDGNPKGLRGSRSQRKEK 956 ++K+R+LEKY LFTE D + K R + S+ K+K Sbjct: 241 EKKERVLEKYNYLFTE--DQSRKKPRKALSKEKKK 273 >gb|EOX92648.1| Uncharacterized protein TCM_046974 [Theobroma cacao] Length = 285 Score = 305 bits (781), Expect = 2e-80 Identities = 154/246 (62%), Positives = 186/246 (75%) Frame = +3 Query: 213 SFNTYVRHSITKRSDAETIGSFDDRFSNKSLSSFAQDPSGGECKPQIGENVSRKDKISFL 392 SF Y +I+K +E D+ N++ + ++ GG K QIG+NVSRKDKI FL Sbjct: 32 SFAAY--QAISKGQGSEAHQIVKDQGGNQAENLSSKPNIGGILKHQIGQNVSRKDKIKFL 89 Query: 393 VNTLLDLKDSKEAVYGALDAWVAWERTFPIGPXXXXXXXXXXXXXWHRVVQVIKWMLSKG 572 V TLLDLKD KEAVYGALDAWVAWE+ FPIGP WHRVVQVIKWMLSKG Sbjct: 90 VTTLLDLKDGKEAVYGALDAWVAWEQNFPIGPLKNVILALEKEHQWHRVVQVIKWMLSKG 149 Query: 573 QGNTMGTYEQLIKALDMDHRAKEAHEFWKKKIGSDLHSVPWRLCSLMISVYYRNHMLEDL 752 QGNTMGTY QLI+ALDMD+RA+EAH+FW KK+ +DLHSVPW+LC MISVYYRN+MLE+L Sbjct: 150 QGNTMGTYVQLIRALDMDNRAEEAHQFWLKKVSADLHSVPWQLCRQMISVYYRNNMLENL 209 Query: 753 TKLFKGLEAFDRKPPDKSIVQKVADTYEVLGFLDEKDRLLEKYKDLFTETWDGNPKGLRG 932 KLFKGLEAFDRKPP+KSIVQ+VAD YE+LG L+EK+R+LEKYKD+ T+T + K + Sbjct: 210 VKLFKGLEAFDRKPPEKSIVQRVADAYEMLGLLEEKERVLEKYKDIPTKTDKVHKKSKQA 269 Query: 933 SRSQRK 950 S ++K Sbjct: 270 SSKRKK 275 >ref|XP_002515828.1| conserved hypothetical protein [Ricinus communis] gi|223545057|gb|EEF46570.1| conserved hypothetical protein [Ricinus communis] Length = 317 Score = 301 bits (772), Expect = 2e-79 Identities = 148/229 (64%), Positives = 174/229 (75%) Frame = +3 Query: 279 DDRFSNKSLSSFAQDPSGGECKPQIGENVSRKDKISFLVNTLLDLKDSKEAVYGALDAWV 458 +D+ K+ + +GG K QIG+NVSRK+KI FL+ TLLDLKDSKEAVYGA+DAWV Sbjct: 17 EDQDDYKTTCHNSNQSAGGVQKNQIGKNVSRKEKIDFLLKTLLDLKDSKEAVYGAVDAWV 76 Query: 459 AWERTFPIGPXXXXXXXXXXXXXWHRVVQVIKWMLSKGQGNTMGTYEQLIKALDMDHRAK 638 AWE FPI WHRVVQVIKW++SKGQGNTMGTY QLI+ALDMDHRA Sbjct: 77 AWEHNFPIASLKRVLILLEKEQQWHRVVQVIKWIISKGQGNTMGTYGQLIRALDMDHRAN 136 Query: 639 EAHEFWKKKIGSDLHSVPWRLCSLMISVYYRNHMLEDLTKLFKGLEAFDRKPPDKSIVQK 818 EAH FW KKIG DLHSVPW+LC MISVYYRN+MLE L KL KGLEAFD KPPDKSIVQK Sbjct: 137 EAHMFWLKKIGLDLHSVPWQLCHRMISVYYRNNMLESLVKLSKGLEAFDHKPPDKSIVQK 196 Query: 819 VADTYEVLGFLDEKDRLLEKYKDLFTETWDGNPKGLRGSRSQRKEKQAQ 965 VAD YE+LG L+EK+R+L+KYKDLF ET G PK R + +++K +++ Sbjct: 197 VADAYEMLGMLEEKERVLQKYKDLFKETEKGRPKKSRSTLAKKKSARSE 245 >ref|XP_006478887.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like isoform X1 [Citrus sinensis] Length = 288 Score = 296 bits (759), Expect = 7e-78 Identities = 145/236 (61%), Positives = 181/236 (76%) Frame = +3 Query: 258 AETIGSFDDRFSNKSLSSFAQDPSGGECKPQIGENVSRKDKISFLVNTLLDLKDSKEAVY 437 A ++ S + + +N+S+ + + + +IGENV RKDKI+FLVNTLLDLK+SKE VY Sbjct: 35 AMSMSSLEGQRTNQSVDQYPERNAASTRNFRIGENVPRKDKINFLVNTLLDLKNSKEDVY 94 Query: 438 GALDAWVAWERTFPIGPXXXXXXXXXXXXXWHRVVQVIKWMLSKGQGNTMGTYEQLIKAL 617 G LDAWVAWE+ FP+G WHRVVQVIKWMLSKGQG+TMGT QLI+AL Sbjct: 95 GTLDAWVAWEQNFPVGSLKKALLALEKEQQWHRVVQVIKWMLSKGQGSTMGTCGQLIRAL 154 Query: 618 DMDHRAKEAHEFWKKKIGSDLHSVPWRLCSLMISVYYRNHMLEDLTKLFKGLEAFDRKPP 797 DMDHRA+EAH+FW+K+IG DLHSVPW+LC MI++YYRN+MLE L KLFKGLEAFDRKPP Sbjct: 155 DMDHRAEEAHKFWEKRIGIDLHSVPWQLCKSMIAIYYRNNMLERLIKLFKGLEAFDRKPP 214 Query: 798 DKSIVQKVADTYEVLGFLDEKDRLLEKYKDLFTETWDGNPKGLRGSRSQRKEKQAQ 965 +KSIVQ+VAD YEVLG L+EK+R+LEKYKDLFTE + K + S + K+K+ + Sbjct: 215 EKSIVQRVADAYEVLGLLEEKERVLEKYKDLFTEKEKRSNKKSKSSSMKGKKKKGR 270 >ref|XP_006373907.1| hypothetical protein POPTR_0016s10300g [Populus trichocarpa] gi|550321203|gb|ERP51704.1| hypothetical protein POPTR_0016s10300g [Populus trichocarpa] Length = 295 Score = 295 bits (755), Expect = 2e-77 Identities = 139/206 (67%), Positives = 165/206 (80%) Frame = +3 Query: 348 QIGENVSRKDKISFLVNTLLDLKDSKEAVYGALDAWVAWERTFPIGPXXXXXXXXXXXXX 527 QIG+NVS+KDKI FL+ TLLDL DSK++VYGALDAWVAWE+ FPI Sbjct: 62 QIGDNVSKKDKIKFLITTLLDLNDSKDSVYGALDAWVAWEQKFPIASIKQVLIALEKEQQ 121 Query: 528 WHRVVQVIKWMLSKGQGNTMGTYEQLIKALDMDHRAKEAHEFWKKKIGSDLHSVPWRLCS 707 WHR+VQVIKWMLSKGQG TMGTY Q I+ALDMDHRAKEAHEFW KKIG DLHSVPW+LC+ Sbjct: 122 WHRIVQVIKWMLSKGQGTTMGTYAQFIRALDMDHRAKEAHEFWLKKIGRDLHSVPWQLCN 181 Query: 708 LMISVYYRNHMLEDLTKLFKGLEAFDRKPPDKSIVQKVADTYEVLGFLDEKDRLLEKYKD 887 MIS+YYRN+MLE+L KLFKGLEAFDR+PP+KSIVQKVAD+YE+LG L+EK+R+LEKY Sbjct: 182 RMISIYYRNNMLENLIKLFKGLEAFDRQPPEKSIVQKVADSYEMLGLLEEKERVLEKYNH 241 Query: 888 LFTETWDGNPKGLRGSRSQRKEKQAQ 965 +F E G K LR + S++ +K + Sbjct: 242 IFVEAGKGQNKKLRNASSKKNKKSGK 267 >ref|XP_006443149.1| hypothetical protein CICLE_v10021498mg [Citrus clementina] gi|568850372|ref|XP_006478888.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like isoform X2 [Citrus sinensis] gi|557545411|gb|ESR56389.1| hypothetical protein CICLE_v10021498mg [Citrus clementina] Length = 287 Score = 294 bits (752), Expect = 5e-77 Identities = 144/232 (62%), Positives = 178/232 (76%) Frame = +3 Query: 258 AETIGSFDDRFSNKSLSSFAQDPSGGECKPQIGENVSRKDKISFLVNTLLDLKDSKEAVY 437 A ++ S + + +N+S+ + + + +IGENV RKDKI+FLVNTLLDLK+SKE VY Sbjct: 35 AMSMSSLEGQRTNQSVDQYPERNAASTRNFRIGENVPRKDKINFLVNTLLDLKNSKEDVY 94 Query: 438 GALDAWVAWERTFPIGPXXXXXXXXXXXXXWHRVVQVIKWMLSKGQGNTMGTYEQLIKAL 617 G LDAWVAWE+ FP+G WHRVVQVIKWMLSKGQG+TMGT QLI+AL Sbjct: 95 GTLDAWVAWEQNFPVGSLKKALLALEKEQQWHRVVQVIKWMLSKGQGSTMGTCGQLIRAL 154 Query: 618 DMDHRAKEAHEFWKKKIGSDLHSVPWRLCSLMISVYYRNHMLEDLTKLFKGLEAFDRKPP 797 DMDHRA+EAH+FW+K+IG DLHSVPW+LC MI++YYRN+MLE L KLFKGLEAFDRKPP Sbjct: 155 DMDHRAEEAHKFWEKRIGIDLHSVPWQLCKSMIAIYYRNNMLERLIKLFKGLEAFDRKPP 214 Query: 798 DKSIVQKVADTYEVLGFLDEKDRLLEKYKDLFTETWDGNPKGLRGSRSQRKE 953 +KSIVQ+VAD YEVLG L+EK+R+LEKYKDLFTE + K + S + K+ Sbjct: 215 EKSIVQRVADAYEVLGLLEEKERVLEKYKDLFTEKEKRSNKKSKSSSMKGKK 266 >ref|XP_006373908.1| hypothetical protein POPTR_0016s10300g [Populus trichocarpa] gi|550321204|gb|ERP51705.1| hypothetical protein POPTR_0016s10300g [Populus trichocarpa] Length = 314 Score = 283 bits (725), Expect = 6e-74 Identities = 139/225 (61%), Positives = 165/225 (73%), Gaps = 19/225 (8%) Frame = +3 Query: 348 QIGENVSRKDKISFLVNT-------------------LLDLKDSKEAVYGALDAWVAWER 470 QIG+NVS+KDKI FL+ T LLDL DSK++VYGALDAWVAWE+ Sbjct: 62 QIGDNVSKKDKIKFLITTVSTQNPNYQSLFICMVVFTLLDLNDSKDSVYGALDAWVAWEQ 121 Query: 471 TFPIGPXXXXXXXXXXXXXWHRVVQVIKWMLSKGQGNTMGTYEQLIKALDMDHRAKEAHE 650 FPI WHR+VQVIKWMLSKGQG TMGTY Q I+ALDMDHRAKEAHE Sbjct: 122 KFPIASIKQVLIALEKEQQWHRIVQVIKWMLSKGQGTTMGTYAQFIRALDMDHRAKEAHE 181 Query: 651 FWKKKIGSDLHSVPWRLCSLMISVYYRNHMLEDLTKLFKGLEAFDRKPPDKSIVQKVADT 830 FW KKIG DLHSVPW+LC+ MIS+YYRN+MLE+L KLFKGLEAFDR+PP+KSIVQKVAD+ Sbjct: 182 FWLKKIGRDLHSVPWQLCNRMISIYYRNNMLENLIKLFKGLEAFDRQPPEKSIVQKVADS 241 Query: 831 YEVLGFLDEKDRLLEKYKDLFTETWDGNPKGLRGSRSQRKEKQAQ 965 YE+LG L+EK+R+LEKY +F E G K LR + S++ +K + Sbjct: 242 YEMLGLLEEKERVLEKYNHIFVEAGKGQNKKLRNASSKKNKKSGK 286 >ref|XP_002323526.2| hypothetical protein POPTR_0016s10300g [Populus trichocarpa] gi|550321202|gb|EEF05287.2| hypothetical protein POPTR_0016s10300g [Populus trichocarpa] Length = 312 Score = 283 bits (725), Expect = 6e-74 Identities = 139/225 (61%), Positives = 165/225 (73%), Gaps = 19/225 (8%) Frame = +3 Query: 348 QIGENVSRKDKISFLVNT-------------------LLDLKDSKEAVYGALDAWVAWER 470 QIG+NVS+KDKI FL+ T LLDL DSK++VYGALDAWVAWE+ Sbjct: 62 QIGDNVSKKDKIKFLITTVSTQNPNYQSLFICMVVFTLLDLNDSKDSVYGALDAWVAWEQ 121 Query: 471 TFPIGPXXXXXXXXXXXXXWHRVVQVIKWMLSKGQGNTMGTYEQLIKALDMDHRAKEAHE 650 FPI WHR+VQVIKWMLSKGQG TMGTY Q I+ALDMDHRAKEAHE Sbjct: 122 KFPIASIKQVLIALEKEQQWHRIVQVIKWMLSKGQGTTMGTYAQFIRALDMDHRAKEAHE 181 Query: 651 FWKKKIGSDLHSVPWRLCSLMISVYYRNHMLEDLTKLFKGLEAFDRKPPDKSIVQKVADT 830 FW KKIG DLHSVPW+LC+ MIS+YYRN+MLE+L KLFKGLEAFDR+PP+KSIVQKVAD+ Sbjct: 182 FWLKKIGRDLHSVPWQLCNRMISIYYRNNMLENLIKLFKGLEAFDRQPPEKSIVQKVADS 241 Query: 831 YEVLGFLDEKDRLLEKYKDLFTETWDGNPKGLRGSRSQRKEKQAQ 965 YE+LG L+EK+R+LEKY +F E G K LR + S++ +K + Sbjct: 242 YEMLGLLEEKERVLEKYNHIFVEAGKGQNKKLRNASSKKNKKSGK 286 >ref|XP_002309089.2| hypothetical protein POPTR_0006s09260g [Populus trichocarpa] gi|550335841|gb|EEE92612.2| hypothetical protein POPTR_0006s09260g [Populus trichocarpa] Length = 286 Score = 283 bits (723), Expect = 1e-73 Identities = 142/222 (63%), Positives = 161/222 (72%), Gaps = 16/222 (7%) Frame = +3 Query: 348 QIGENVSRKDKISFLVNTL----------------LDLKDSKEAVYGALDAWVAWERTFP 479 QIG+NVS+KDKI FL+ TL LDL DSK+AVYGALDAWVAWE+ FP Sbjct: 62 QIGDNVSKKDKIKFLITTLVLYQLLYDKTILHMQLLDLNDSKDAVYGALDAWVAWEQKFP 121 Query: 480 IGPXXXXXXXXXXXXXWHRVVQVIKWMLSKGQGNTMGTYEQLIKALDMDHRAKEAHEFWK 659 I WHR+VQVIKWMLSKGQG TM TY QLI+ALDMDHRAKEAHEFW Sbjct: 122 IASIKQVLIALEKEQQWHRIVQVIKWMLSKGQGTTMATYAQLIRALDMDHRAKEAHEFWL 181 Query: 660 KKIGSDLHSVPWRLCSLMISVYYRNHMLEDLTKLFKGLEAFDRKPPDKSIVQKVADTYEV 839 KKIG DLHSVPW+LC+ MI++YYRN+MLE+L KLFKGLEAFDRKPP+KSIVQKVAD YE+ Sbjct: 182 KKIGRDLHSVPWKLCNSMITIYYRNNMLENLIKLFKGLEAFDRKPPEKSIVQKVADAYEM 241 Query: 840 LGFLDEKDRLLEKYKDLFTETWDGNPKGLRGSRSQRKEKQAQ 965 LG L+EK RLLEKY LF ET G K R S++ K + Sbjct: 242 LGLLEEKGRLLEKYNHLFIETGKGWNKNFRVVSSKKNNKSGK 283 >ref|XP_004511665.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like isoform X1 [Cicer arietinum] gi|502160198|ref|XP_004511666.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like isoform X2 [Cicer arietinum] Length = 264 Score = 282 bits (721), Expect = 2e-73 Identities = 139/209 (66%), Positives = 160/209 (76%) Frame = +3 Query: 342 KPQIGENVSRKDKISFLVNTLLDLKDSKEAVYGALDAWVAWERTFPIGPXXXXXXXXXXX 521 K IGENVSRKD+ FL+ TL D+ DSKEA+YGALDAWVAWE+ FPIG Sbjct: 50 KHYIGENVSRKDRTMFLLTTLRDIDDSKEAIYGALDAWVAWEQKFPIGSLRNILIRLEME 109 Query: 522 XXWHRVVQVIKWMLSKGQGNTMGTYEQLIKALDMDHRAKEAHEFWKKKIGSDLHSVPWRL 701 WHRVVQVIKWMLSKGQG TMGTY QLI+ALDMDHR +EAH+FW+ KIG+DLHSVPW+L Sbjct: 110 QQWHRVVQVIKWMLSKGQGTTMGTYGQLIRALDMDHRVEEAHKFWEMKIGTDLHSVPWQL 169 Query: 702 CSLMISVYYRNHMLEDLTKLFKGLEAFDRKPPDKSIVQKVADTYEVLGFLDEKDRLLEKY 881 C LMISVYYRN MLEDL KLFKGLEAFDRKP DK I+QKVA+ YE+LG ++EK+R++EKY Sbjct: 170 CHLMISVYYRNKMLEDLVKLFKGLEAFDRKPRDKLIIQKVANAYEMLGLVEEKERIMEKY 229 Query: 882 KDLFTETWDGNPKGLRGSRSQRKEKQAQE 968 LF E G K R S+ KE+Q E Sbjct: 230 NHLFAE--KGPTKKSRRKLSKTKEEQPDE 256 >ref|XP_004136857.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic-like [Cucumis sativus] Length = 302 Score = 278 bits (712), Expect = 2e-72 Identities = 149/268 (55%), Positives = 184/268 (68%), Gaps = 2/268 (0%) Frame = +3 Query: 168 QISQLTVNRTYVLTCSFNTYVRHSITKRSDAETIGSFDDRFSNKSLSSFAQDPSGGECKP 347 Q +L V+R V + + T ++ + ++ A+ D S+K+L ++ G K Sbjct: 24 QTMELGVSRLQVGSSCYCTTIQDQMCQQL-ADKDRKDKDVNSSKALGHISEQNIGDIRKH 82 Query: 348 QIGENVSRKDKISFLVNTLLDLKDSKEAVYGALDAWVAWERTFPIGPXXXXXXXXXXXXX 527 QIG+N+SRKDKI FLVNTLLDL+DSKEAVYGALDAWVAWE+ FPI Sbjct: 83 QIGKNISRKDKIHFLVNTLLDLRDSKEAVYGALDAWVAWEQVFPIASLKHVLAALEKEQQ 142 Query: 528 WHRVVQVIKWMLSKGQGNTMGTYEQLIKALDMDHRAKEAHEFWKKKIGSDLHSVPWRLCS 707 WHR+VQVIKWMLSKGQG TM Y QLI+ALDMDHR +EAH+FW KIGSDLHSVPW++C Sbjct: 143 WHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQVCR 202 Query: 708 LMISVYYRNHMLEDLTKLFKGLEAFDRKPPDKSIVQKVADTYEVLGFLDEKDRLLEKYKD 887 M+++YYRN LEDL KLFK LEAF RKPPDKSIVQ+VAD E+LG L+EK+R+L KYK Sbjct: 203 SMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKY 262 Query: 888 LFTETWDGNPKGLRGS--RSQRKEKQAQ 965 LF E K R S +S+RK K + Sbjct: 263 LFDEKEGPMKKYKRISFEKSKRKRKSTK 290 >ref|NP_563712.1| uncharacterized protein [Arabidopsis thaliana] gi|13430646|gb|AAK25945.1|AF360235_1 unknown protein [Arabidopsis thaliana] gi|14532820|gb|AAK64092.1| unknown protein [Arabidopsis thaliana] gi|332189597|gb|AEE27718.1| uncharacterized protein AT1G04590 [Arabidopsis thaliana] Length = 381 Score = 278 bits (711), Expect = 3e-72 Identities = 136/229 (59%), Positives = 170/229 (74%) Frame = +3 Query: 279 DDRFSNKSLSSFAQDPSGGECKPQIGENVSRKDKISFLVNTLLDLKDSKEAVYGALDAWV 458 ++ FS+ S A++P K QIGEN+ +KDKI FLVNTLLD++D+KEAVYGALDAWV Sbjct: 116 EEDFSDSSKKGNAENPR----KHQIGENIPKKDKIKFLVNTLLDIEDNKEAVYGALDAWV 171 Query: 459 AWERTFPIGPXXXXXXXXXXXXXWHRVVQVIKWMLSKGQGNTMGTYEQLIKALDMDHRAK 638 AWER FPI WHR+VQVIKW+LSKGQGNTMGTY QLI+ALDMD RA+ Sbjct: 172 AWERNFPIASLKIVIASLEKEHQWHRMVQVIKWILSKGQGNTMGTYGQLIRALDMDRRAE 231 Query: 639 EAHEFWKKKIGSDLHSVPWRLCSLMISVYYRNHMLEDLTKLFKGLEAFDRKPPDKSIVQK 818 EAH W+KK+G+DLHSVPW+LC M+ +Y+RN+ML++L KLFK LE++DRKPPDK IVQ Sbjct: 232 EAHVIWRKKVGNDLHSVPWQLCLQMMRIYFRNNMLQELVKLFKDLESYDRKPPDKHIVQT 291 Query: 819 VADTYEVLGFLDEKDRLLEKYKDLFTETWDGNPKGLRGSRSQRKEKQAQ 965 VAD YE+LG LDEK+R++ KY L G P + SRS RK+K+ + Sbjct: 292 VADAYELLGMLDEKERVVTKYSHLLL----GTPSDDKPSRSSRKKKKPE 336 >ref|NP_001241921.1| uncharacterized protein LOC100795658 [Glycine max] gi|255637229|gb|ACU18945.1| unknown [Glycine max] Length = 300 Score = 276 bits (707), Expect = 8e-72 Identities = 142/219 (64%), Positives = 160/219 (73%), Gaps = 13/219 (5%) Frame = +3 Query: 285 RFSNKSLSSFAQDPSG-----GECKPQ--------IGENVSRKDKISFLVNTLLDLKDSK 425 RF + +S+ PS +C P IGENVSRKDK +L TLL+L DSK Sbjct: 30 RFCHSQVSTVLPPPSNLQHQQTQCDPPHTAVPRNYIGENVSRKDKNKYLYTTLLELNDSK 89 Query: 426 EAVYGALDAWVAWERTFPIGPXXXXXXXXXXXXXWHRVVQVIKWMLSKGQGNTMGTYEQL 605 EAVYGALDAWVAWE+ FPI WHRVVQVIKWMLSKGQG TMGTY QL Sbjct: 90 EAVYGALDAWVAWEQNFPIASLKTILISLEKDQQWHRVVQVIKWMLSKGQGMTMGTYGQL 149 Query: 606 IKALDMDHRAKEAHEFWKKKIGSDLHSVPWRLCSLMISVYYRNHMLEDLTKLFKGLEAFD 785 I+ALDMDHR +EA +FW+ KIGSDLHSVPW+LC LMISVYYRN+ML+DL KLFKGLEAFD Sbjct: 150 IRALDMDHRVEEAQKFWEIKIGSDLHSVPWQLCHLMISVYYRNNMLQDLVKLFKGLEAFD 209 Query: 786 RKPPDKSIVQKVADTYEVLGFLDEKDRLLEKYKDLFTET 902 RKP DKSI+QKVA+ YEVLG + EK R+LEKY LFTET Sbjct: 210 RKPRDKSIIQKVANAYEVLGLVKEKVRVLEKYNHLFTET 248 >ref|XP_002892228.1| EMB2748 [Arabidopsis lyrata subsp. lyrata] gi|297338070|gb|EFH68487.1| EMB2748 [Arabidopsis lyrata subsp. lyrata] Length = 395 Score = 275 bits (703), Expect = 2e-71 Identities = 134/226 (59%), Positives = 169/226 (74%) Frame = +3 Query: 279 DDRFSNKSLSSFAQDPSGGECKPQIGENVSRKDKISFLVNTLLDLKDSKEAVYGALDAWV 458 ++ FS+ S A+ P K QIGEN+ +KDKI FLVNTLLD++D+KEAVYGALDAWV Sbjct: 117 EEDFSDSSKKENAETPR----KHQIGENIPKKDKIKFLVNTLLDMEDNKEAVYGALDAWV 172 Query: 459 AWERTFPIGPXXXXXXXXXXXXXWHRVVQVIKWMLSKGQGNTMGTYEQLIKALDMDHRAK 638 AWER FPI WHR++QVIKW+LSKGQGNTMGTY QLI+ALDMD RA+ Sbjct: 173 AWERNFPIASLKRVIAILEKEHQWHRMIQVIKWILSKGQGNTMGTYGQLIRALDMDRRAE 232 Query: 639 EAHEFWKKKIGSDLHSVPWRLCSLMISVYYRNHMLEDLTKLFKGLEAFDRKPPDKSIVQK 818 EAH W+KKIG+DLHSVPW+LC M+ +Y+RN+ML++L KLFK LE++DRKPPDK IVQ Sbjct: 233 EAHVIWRKKIGNDLHSVPWQLCLQMMRIYFRNNMLQELVKLFKDLESYDRKPPDKHIVQT 292 Query: 819 VADTYEVLGFLDEKDRLLEKYKDLFTETWDGNPKGLRGSRSQRKEK 956 VADTYE+LG +DEK+R++ KY L T + K R SR ++K++ Sbjct: 293 VADTYELLGMVDEKERVMTKYSHLLLGT-ASDDKPRRSSRKKKKQE 337 >gb|ESW29328.1| hypothetical protein PHAVU_002G061400g [Phaseolus vulgaris] Length = 277 Score = 274 bits (701), Expect = 4e-71 Identities = 131/183 (71%), Positives = 149/183 (81%) Frame = +3 Query: 351 IGENVSRKDKISFLVNTLLDLKDSKEAVYGALDAWVAWERTFPIGPXXXXXXXXXXXXXW 530 IGENVSRKDK +L +TLL+L DSKEAVYGALDAW+AWE+ FPI W Sbjct: 60 IGENVSRKDKTKYLYSTLLELNDSKEAVYGALDAWIAWEQNFPIASLKTILNSLEKEQQW 119 Query: 531 HRVVQVIKWMLSKGQGNTMGTYEQLIKALDMDHRAKEAHEFWKKKIGSDLHSVPWRLCSL 710 HRVVQVIKWMLSKGQG TMGTY QLI+ALDMDHR +EA +FW+ KIGSDLHSVPW+LC L Sbjct: 120 HRVVQVIKWMLSKGQGTTMGTYGQLIRALDMDHRVEEAQKFWEMKIGSDLHSVPWQLCHL 179 Query: 711 MISVYYRNHMLEDLTKLFKGLEAFDRKPPDKSIVQKVADTYEVLGFLDEKDRLLEKYKDL 890 MISVYYRN+MLEDL KLFKGLEAFDRKP DK+I+QKVA+ YE+LG L EK+++L KY L Sbjct: 180 MISVYYRNNMLEDLVKLFKGLEAFDRKPRDKTIIQKVANAYEMLGLLKEKEKVLAKYSHL 239 Query: 891 FTE 899 FTE Sbjct: 240 FTE 242