BLASTX nr result
ID: Sinomenium21_contig00040745
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00040745
         (412 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_003632994.1| PREDICTED: pentatricopeptide repeat-containi...   182   6e-50
ref|XP_004295634.1| PREDICTED: pentatricopeptide repeat-containi...   179   2e-49
ref|XP_007212650.1| hypothetical protein PRUPE_ppa018206mg, part...   179   5e-48
ref|XP_007147940.1| hypothetical protein PHAVU_006G167300g [Phas...   172   2e-45
ref|XP_003541672.2| PREDICTED: pentatricopeptide repeat-containi...   167   7e-45
ref|XP_004485987.1| PREDICTED: pentatricopeptide repeat-containi...   161   2e-43
ref|XP_002531058.1| pentatricopeptide repeat-containing protein,...   178   8e-43
ref|XP_006468073.1| PREDICTED: pentatricopeptide repeat-containi...   159   2e-42
gb|EXB51999.1| hypothetical protein L484_019777 [Morus notabilis]     160   2e-42
ref|XP_007041101.1| Tetratricopeptide repeat (TPR)-like superfam...   164   1e-41
ref|XP_006355278.1| PREDICTED: pentatricopeptide repeat-containi...   173   2e-41
ref|XP_004244886.1| PREDICTED: pentatricopeptide repeat-containi...   170   2e-40
gb|EPS63069.1| hypothetical protein M569_11717 [Genlisea aurea]       166   3e-39
ref|XP_004158687.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   153   2e-35
ref|XP_004134903.1| PREDICTED: pentatricopeptide repeat-containi...   153   2e-35
ref|XP_006836321.1| hypothetical protein AMTR_s00092p00064890 [A...   147   1e-33
gb|EYU19817.1| hypothetical protein MIMGU_mgv1a017899mg, partial...   145   6e-33
ref|XP_002869909.1| binding protein [Arabidopsis lyrata subsp. l...   130   3e-30
ref|XP_006413827.1| hypothetical protein EUTSA_v10027143mg [Eutr...
127 3e-30 emb|CAB45902.1| putative protein (fragment) [Arabidopsis thalian... 126 7e-30 >ref|XP_003632994.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Vitis vinifera] Length = 613 Score = 182 bits (462), Expect(2) = 6e-50 Identities = 84/115 (73%), Positives = 99/115 (86%) Frame = +3 Query: 66 DMTKHLIFNLVFLSAPTSYAHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHHQMLK 245 DM K+LIF L+ +P SYAH IFSQIQ+PNIFTWNTMIRGYAES+NP PA+E++ QM Sbjct: 74 DMGKYLIFTLLSFCSPMSYAHQIFSQIQNPNIFTWNTMIRGYAESENPMPALELYRQMHV 133 Query: 246 ASVEPDTHTYPFLLKACAKLMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYAAC 410 + +EPDTHTYPFLLKA AKLM VREGEKVH +A++NGFESLVF+QNT VH+YAAC Sbjct: 134 SCIEPDTHTYPFLLKAIAKLMDVREGEKVHSIAIRNGFESLVFVQNTLVHMYAAC 188 Score = 41.2 bits (95), Expect(2) = 6e-50 Identities = 19/23 (82%), Positives = 20/23 (86%) Frame = +2 Query: 2 ALLLDCASSKSKLKQIHAFSIRH 70 ALLL CASSK K +QIHAFSIRH Sbjct: 44 ALLLSCASSKFKFRQIHAFSIRH 66 Score = 70.5 bits (171), Expect = 2e-10 Identities = 36/96 (37%), Positives = 51/96 (53%) Frame = +3 Query: 123 AHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHHQMLKASVEPDTHTYPFLLKACAK 302 AH +F + N+ TWN++I GYA + P A+ + +M VEPD T LL ACA+ Sbjct: 194 AHKLFELMAERNLVTWNSVINGYALNGRPNEALTLFREMGLRGVEPDGFTMVSLLSACAE 253 Query: 303 LMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYAAC 410 L A+ G + H VK G + + N + LYA C Sbjct: 254 LGALALGRRAHVYMVKVGLDGNLHAGNALLDLYAKC 289 >ref|XP_004295634.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Fragaria vesca subsp. 
vesca] Length = 611 Score = 179 bits (455), Expect(2) = 2e-49 Identities = 82/118 (69%), Positives = 98/118 (83%) Frame = +3 Query: 57 SQSDMTKHLIFNLVFLSAPTSYAHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHHQ 236 S DM KHLIF V LS+P SYAH+IFSQI+ PN+FTWNTMIRGYAESQNP P I+++ Q Sbjct: 69 SNPDMGKHLIFTSVSLSSPMSYAHHIFSQIKHPNVFTWNTMIRGYAESQNPMPVIQLYRQ 128 Query: 237 MLKASVEPDTHTYPFLLKACAKLMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYAAC 410 M + +EPDTHTYPFLLKA AKL+ VREGEKVHC+A++NG ESLVF++N +HLYA C Sbjct: 129 MRVSCIEPDTHTYPFLLKAVAKLLDVREGEKVHCIALRNGLESLVFVKNALLHLYAVC 186 Score = 42.0 bits (97), Expect(2) = 2e-49 Identities = 20/23 (86%), Positives = 20/23 (86%) Frame = +2 Query: 2 ALLLDCASSKSKLKQIHAFSIRH 70 ALL CASS SKLKQIHAFSIRH Sbjct: 42 ALLQSCASSNSKLKQIHAFSIRH 64 Score = 63.9 bits (154), Expect = 2e-08 Identities = 33/112 (29%), Positives = 54/112 (48%) Frame = +3 Query: 75 KHLIFNLVFLSAPTSYAHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHHQMLKASV 254 K+ + +L + AH +F + ++ WN++I G++ + P A+ I +M V Sbjct: 176 KNALLHLYAVCGQVESAHKVFESMSERDLVAWNSVINGFSLNGRPNEALTIFREMSLEGV 235 Query: 255 EPDTHTYPFLLKACAKLMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYAAC 410 PD T LL ACA+L A+ G ++H VK G N + +YA C Sbjct: 236 VPDGFTMVSLLGACAELGALALGGRIHVYMVKLGLTRNAHASNALLDVYAKC 287 >ref|XP_007212650.1| hypothetical protein PRUPE_ppa018206mg, partial [Prunus persica] gi|462408515|gb|EMJ13849.1| hypothetical protein PRUPE_ppa018206mg, partial [Prunus persica] Length = 604 Score = 179 bits (454), Expect(2) = 5e-48 Identities = 82/118 (69%), Positives = 97/118 (82%) Frame = +3 Query: 57 SQSDMTKHLIFNLVFLSAPTSYAHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHHQ 236 S DM KHLIF V L AP YAH IFSQI+SPN+FTWNTMIRGYAES+NP P ++++HQ Sbjct: 62 SSPDMGKHLIFTTVSLKAPMPYAHQIFSQIRSPNVFTWNTMIRGYAESENPTPVLQLYHQ 121 Query: 237 MLKASVEPDTHTYPFLLKACAKLMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYAAC 410 M SVEPDTHTYPFLLKA AKL VREGEK+H +A++NGFESLVF++NT +H+YA C Sbjct: 122 MHVNSVEPDTHTYPFLLKAVAKLTNVREGEKIHSIALRNGFESLVFVKNTLLHMYACC 179 Score = 37.7 bits (86), Expect(2) = 5e-48 Identities = 17/23 (73%), Positives = 
20/23 (86%) Frame = +2 Query: 2 ALLLDCASSKSKLKQIHAFSIRH 70 ALL CASSK K++QIHAFS+RH Sbjct: 35 ALLQCCASSKLKMQQIHAFSVRH 57 Score = 63.9 bits (154), Expect = 2e-08 Identities = 32/112 (28%), Positives = 52/112 (46%) Frame = +3 Query: 75 KHLIFNLVFLSAPTSYAHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHHQMLKASV 254 K+ + ++ AH +F I ++ WN++I G+A + P A+ + M V Sbjct: 169 KNTLLHMYACCGHVESAHRVFESISERDLVAWNSVINGFALNGRPNEALTVFRDMSLEGV 228 Query: 255 EPDTHTYPFLLKACAKLMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYAAC 410 +PD T LL ACA+L + G ++H +K G N + LYA C Sbjct: 229 QPDGFTMVSLLSACAELGTLALGRRIHVYMLKVGLTGNSHATNALLDLYAKC 280 >ref|XP_007147940.1| hypothetical protein PHAVU_006G167300g [Phaseolus vulgaris] gi|561021163|gb|ESW19934.1| hypothetical protein PHAVU_006G167300g [Phaseolus vulgaris] Length = 611 Score = 172 bits (436), Expect(2) = 2e-45 Identities = 76/115 (66%), Positives = 96/115 (83%) Frame = +3 Query: 66 DMTKHLIFNLVFLSAPTSYAHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHHQMLK 245 DM KHLIF +V LSAP SYA+N+F++I +PN+FTWNTMIRGYAESQNP PA+ + QM Sbjct: 72 DMAKHLIFTIVSLSAPMSYAYNVFTRIHNPNVFTWNTMIRGYAESQNPSPALHFYRQMTV 131 Query: 246 ASVEPDTHTYPFLLKACAKLMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYAAC 410 + VEPDTHTYPFLLKA +K + VREGE +H V ++NGF+SLVF+QN+ +H+YAAC Sbjct: 132 SCVEPDTHTYPFLLKAISKSLNVREGEAIHSVTIRNGFQSLVFVQNSLLHIYAAC 186 Score = 36.2 bits (82), Expect(2) = 2e-45 Identities = 18/27 (66%), Positives = 20/27 (74%) Frame = +2 Query: 5 LLLDCASSKSKLKQIHAFSIRHDQTPH 85 LL ASSK KL+QIHAFSIRH + H Sbjct: 43 LLQSSASSKYKLRQIHAFSIRHGVSLH 69 Score = 68.6 bits (166), Expect = 9e-10 Identities = 33/99 (33%), Positives = 54/99 (54%) Frame = +3 Query: 114 TSYAHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHHQMLKASVEPDTHTYPFLLKA 293 T A+ +F ++ ++ WN++I G+A + P A+ + +M VEPD T LL A Sbjct: 189 TESAYKVFELMKERDLVAWNSVINGFALNGRPNEALTLFREMSVEGVEPDGFTVVSLLSA 248 Query: 294 CAKLMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYAAC 410 CA+L A+ G +VH +K G ++ N+ + LYA C Sbjct: 249 CAELGALELGRRVHVYLLKVGLRENSYVTNSLLDLYAKC 287 >ref|XP_003541672.2| PREDICTED: pentatricopeptide 
repeat-containing protein At4g21065-like [Glycine max] Length = 607 Score = 167 bits (422), Expect(2) = 7e-45 Identities = 75/115 (65%), Positives = 94/115 (81%) Frame = +3 Query: 66 DMTKHLIFNLVFLSAPTSYAHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHHQMLK 245 DM KHLIF +V LSAP SYA+N+F+ I +PN+FTWNT+IRGYAES NP PA + QM+ Sbjct: 68 DMGKHLIFTIVSLSAPMSYAYNVFTVIHNPNVFTWNTIIRGYAESDNPSPAFLFYRQMVV 127 Query: 246 ASVEPDTHTYPFLLKACAKLMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYAAC 410 + VEPDTHTYPFLLKA +K + VREGE +H V ++NGFESLVF+QN+ +H+YAAC Sbjct: 128 SCVEPDTHTYPFLLKAISKSLNVREGEAIHSVTIRNGFESLVFVQNSLLHIYAAC 182 Score = 39.7 bits (91), Expect(2) = 7e-45 Identities = 19/23 (82%), Positives = 20/23 (86%) Frame = +2 Query: 2 ALLLDCASSKSKLKQIHAFSIRH 70 +LL CASSK KLKQIHAFSIRH Sbjct: 38 SLLQFCASSKHKLKQIHAFSIRH 60 Score = 64.7 bits (156), Expect = 1e-08 Identities = 33/99 (33%), Positives = 52/99 (52%) Frame = +3 Query: 114 TSYAHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHHQMLKASVEPDTHTYPFLLKA 293 T A+ +F ++ ++ WN+MI G+A + P A+ + +M VEPD T LL A Sbjct: 185 TESAYKVFELMKERDLVAWNSMINGFALNGRPNEALTLFREMSVEGVEPDGFTVVSLLSA 244 Query: 294 CAKLMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYAAC 410 A+L A+ G +VH +K G + N+ + LYA C Sbjct: 245 SAELGALELGRRVHVYLLKVGLSKNSHVTNSLLDLYAKC 283 >ref|XP_004485987.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Cicer arietinum] Length = 610 Score = 161 bits (407), Expect(2) = 2e-43 Identities = 71/115 (61%), Positives = 94/115 (81%) Frame = +3 Query: 66 DMTKHLIFNLVFLSAPTSYAHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHHQMLK 245 DM K+LIF +V LSAP SYA+N+F+ + +PN+FTWNTMIRGYAES N PA+ + +ML Sbjct: 71 DMGKYLIFTVVSLSAPMSYAYNVFTLLHNPNVFTWNTMIRGYAESDNSSPALPFYRKMLV 130 Query: 246 ASVEPDTHTYPFLLKACAKLMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYAAC 410 + VEPDTHTYPFLLKA +K + VREGE +H V ++NGFESL+F++N+ +H+YAAC Sbjct: 131 SCVEPDTHTYPFLLKAISKSLNVREGEAIHSVTIRNGFESLIFVRNSLLHIYAAC 185 Score = 40.8 bits (94), Expect(2) = 2e-43 Identities = 20/23 (86%), Positives = 20/23 (86%) Frame = +2 Query: 2 ALLLDCASSKSKLKQIHAFSIRH 70 ALL 
CASSK KLKQIHAFSIRH Sbjct: 41 ALLQYCASSKHKLKQIHAFSIRH 63 Score = 67.0 bits (162), Expect = 3e-09 Identities = 33/99 (33%), Positives = 52/99 (52%) Frame = +3 Query: 114 TSYAHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHHQMLKASVEPDTHTYPFLLKA 293 T A+ +F + ++ WN++I G+A + P A+ + +M VEPD T LL A Sbjct: 188 TESAYKVFELMGERDLVAWNSVINGFALNGKPNEALSLFREMSLEGVEPDGFTVVSLLSA 247 Query: 294 CAKLMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYAAC 410 CA+L AV G +VH +K G + + N+ + YA C Sbjct: 248 CAELGAVELGRRVHVYLLKIGLTENLHVNNSLLDFYAKC 286 >ref|XP_002531058.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223529353|gb|EEF31319.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 341 Score = 178 bits (451), Expect = 8e-43 Identities = 77/119 (64%), Positives = 105/119 (88%) Frame = +3 Query: 54 PSQSDMTKHLIFNLVFLSAPTSYAHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHH 233 P+ DM KHLI+++V +SAP +YAHNIF+ IQ+PNIFTWNTMIRG+AES+NP+PAIE++H Sbjct: 70 PNNPDMGKHLIYSIVSVSAPMTYAHNIFTLIQNPNIFTWNTMIRGHAESENPKPAIELYH 129 Query: 234 QMLKASVEPDTHTYPFLLKACAKLMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYAAC 410 ++ S+EPDTHTYPFLLKA +K++ VR GEK+H ++++NGFESLVF+QN+ +H+YAAC Sbjct: 130 RLHFNSIEPDTHTYPFLLKAVSKMVNVRVGEKIHSISIRNGFESLVFVQNSLMHMYAAC 188 Score = 69.7 bits (169), Expect = 4e-10 Identities = 34/96 (35%), Positives = 53/96 (55%) Frame = +3 Query: 123 AHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHHQMLKASVEPDTHTYPFLLKACAK 302 AH +F + ++ WNT I G+A + P A++++ +M VEPD T LL ACA+ Sbjct: 194 AHKLFEFMPDRDLVAWNTAISGFALNGKPNEALKLYMEMGLEGVEPDGFTLVSLLSACAE 253 Query: 303 LMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYAAC 410 L A+ G ++H VK G + + N+ + LYA C Sbjct: 254 LGALALGRRIHAYMVKVGLDENLHANNSLIDLYAKC 289 >ref|XP_006468073.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Citrus sinensis] Length = 616 Score = 159 bits (403), Expect(2) = 2e-42 Identities = 74/114 (64%), Positives = 93/114 (81%) Frame = +3 Query: 66 DMTKHLIFNLVFLSAPTSYAHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHHQMLK 245 D+ K+LI+ +V LS P SYAHNIFS +Q 
PNIFTWNTMIRGYAES NP A+E++ +M Sbjct: 77 DLGKYLIYAIVSLSFPMSYAHNIFSHVQDPNIFTWNTMIRGYAESANPLLAVELYSKMHV 136 Query: 246 ASVEPDTHTYPFLLKACAKLMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYAA 407 + ++PDTHTYPFLLKA +KL VR GE+ H VA++NGFESLVF+QN+ VH+YAA Sbjct: 137 SGIKPDTHTYPFLLKAISKLADVRMGEQTHSVAIRNGFESLVFVQNSLVHMYAA 190 Score = 38.9 bits (89), Expect(2) = 2e-42 Identities = 18/22 (81%), Positives = 19/22 (86%) Frame = +2 Query: 5 LLLDCASSKSKLKQIHAFSIRH 70 LL CASSK KLKQ+HAFSIRH Sbjct: 48 LLQVCASSKHKLKQVHAFSIRH 69 Score = 62.0 bits (149), Expect = 8e-08 Identities = 30/96 (31%), Positives = 47/96 (48%) Frame = +3 Query: 123 AHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHHQMLKASVEPDTHTYPFLLKACAK 302 A +F + ++ WN++I G+A + P A+ I +M VEPD +T L ACA+ Sbjct: 197 ACKVFELMSERDLVAWNSVINGFASNGKPNEALTIFREMASEGVEPDGYTMVSLFSACAE 256 Query: 303 LMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYAAC 410 L A+ G + H K G V + N + Y+ C Sbjct: 257 LGALALGRRAHTYVWKVGLSDNVNVNNALLDFYSKC 292 >gb|EXB51999.1| hypothetical protein L484_019777 [Morus notabilis] Length = 623 Score = 160 bits (406), Expect(2) = 2e-42 Identities = 74/116 (63%), Positives = 95/116 (81%), Gaps = 1/116 (0%) Frame = +3 Query: 66 DMTKHLIFNLVFLSAPTSYAHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHHQMLK 245 DM KHLIF V LSA SYA+N+FSQI PNI+TWNTM RGYAES+NPR A++++H+ ++ Sbjct: 83 DMGKHLIFTAVSLSASMSYANNVFSQIDRPNIYTWNTMFRGYAESENPRLALDLYHRFIR 142 Query: 246 -ASVEPDTHTYPFLLKACAKLMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYAAC 410 +SV+PDTHTYPF+LKA AKL V EG K+H VA++NGFESLV++QN +H YA+C Sbjct: 143 VSSVKPDTHTYPFVLKAVAKLADVEEGGKIHSVALRNGFESLVYVQNALLHFYASC 198 Score = 37.4 bits (85), Expect(2) = 2e-42 Identities = 18/23 (78%), Positives = 20/23 (86%) Frame = +2 Query: 2 ALLLDCASSKSKLKQIHAFSIRH 70 +LL CASS+SKL QIHAFSIRH Sbjct: 53 SLLQLCASSESKLMQIHAFSIRH 75 Score = 57.4 bits (137), Expect = 2e-06 Identities = 32/99 (32%), Positives = 47/99 (47%) Frame = +3 Query: 114 TSYAHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHHQMLKASVEPDTHTYPFLLKA 293 T AH +F + ++ WNT+I G+A + P A+ + M V PD T LL A Sbjct: 201 
TDSAHKMFVLMAHRDLVAWNTVINGFALNGRPNEALVLFRDMGFEGVGPDGFTMVSLLSA 260 Query: 294 CAKLMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYAAC 410 C +L A+ G + H +K G + N + LYA C Sbjct: 261 CGELGALALGRRAHVYMLKVGLCLNLIANNALLDLYAKC 299 >ref|XP_007041101.1| Tetratricopeptide repeat (TPR)-like superfamily protein isoform 1 [Theobroma cacao] gi|590681507|ref|XP_007041102.1| Tetratricopeptide repeat (TPR)-like superfamily protein isoform 1 [Theobroma cacao] gi|590681511|ref|XP_007041103.1| Tetratricopeptide repeat (TPR)-like superfamily protein isoform 1 [Theobroma cacao] gi|508705036|gb|EOX96932.1| Tetratricopeptide repeat (TPR)-like superfamily protein isoform 1 [Theobroma cacao] gi|508705037|gb|EOX96933.1| Tetratricopeptide repeat (TPR)-like superfamily protein isoform 1 [Theobroma cacao] gi|508705038|gb|EOX96934.1| Tetratricopeptide repeat (TPR)-like superfamily protein isoform 1 [Theobroma cacao] Length = 616 Score = 164 bits (415), Expect(2) = 1e-41 Identities = 73/115 (63%), Positives = 94/115 (81%) Frame = +3 Query: 66 DMTKHLIFNLVFLSAPTSYAHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHHQMLK 245 D+ KHLI++LV LS P SY ++IFS+IQS N+F WNTMIRGYAES+NP PA+E++ QM Sbjct: 77 DIGKHLIYSLVSLSTPMSYPYSIFSRIQSSNVFIWNTMIRGYAESENPEPALELYRQMQA 136 Query: 246 ASVEPDTHTYPFLLKACAKLMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYAAC 410 + +EPDTHTYPFLLKA AKL +R GE +H ++NGFESLVF+QN+ +H+YAAC Sbjct: 137 SCIEPDTHTYPFLLKAVAKLADIRVGENMHSTVIRNGFESLVFVQNSMLHMYAAC 191 Score = 31.6 bits (70), Expect(2) = 1e-41 Identities = 14/23 (60%), Positives = 19/23 (82%) Frame = +2 Query: 2 ALLLDCASSKSKLKQIHAFSIRH 70 +LL + SS+ KL+QIHAFS+RH Sbjct: 47 SLLQNYGSSELKLRQIHAFSLRH 69 Score = 64.7 bits (156), Expect = 1e-08 Identities = 31/96 (32%), Positives = 52/96 (54%) Frame = +3 Query: 123 AHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHHQMLKASVEPDTHTYPFLLKACAK 302 A+ +F + + ++ WN++I G+A + P A+ + +M VEPD T L ACA+ Sbjct: 197 AYKMFELMPARDVVAWNSVINGFALNGKPNEALTLFREMGLEGVEPDGFTLVSLFSACAE 256 Query: 303 LMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYAAC 410 L A+ G ++H VK G + ++N + LYA C Sbjct: 257 
LGALALGNRIHVYIVKVGLSENLHVKNALLDLYAKC 292 >ref|XP_006355278.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Solanum tuberosum] Length = 585 Score = 173 bits (439), Expect = 2e-41 Identities = 83/117 (70%), Positives = 95/117 (81%) Frame = +3 Query: 57 SQSDMTKHLIFNLVFLSAPTSYAHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHHQ 236 S +M K+LIF LV LS P YA IF+QIQ PNIFTWNTMIRGYAES+NP PAIEIH+Q Sbjct: 43 SSPEMGKYLIFTLVSLSGPMCYAKKIFNQIQFPNIFTWNTMIRGYAESENPYPAIEIHNQ 102 Query: 237 MLKASVEPDTHTYPFLLKACAKLMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYAA 407 M V PDTHTYPFLLKA AK++ VREGEKVHC+A++NGFESLVF+QN+ VH Y A Sbjct: 103 MCVNYVAPDTHTYPFLLKAIAKVIDVREGEKVHCIAIRNGFESLVFVQNSLVHFYGA 159 Score = 63.5 bits (153), Expect = 3e-08 Identities = 30/96 (31%), Positives = 48/96 (50%) Frame = +3 Query: 123 AHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHHQMLKASVEPDTHTYPFLLKACAK 302 AH +F ++ N+ WN++I GYA + P + + +M+ PD T LL A A+ Sbjct: 166 AHKVFEEMSDKNLVAWNSVINGYALNSRPNETLTLFRKMVVEGARPDGFTLVSLLTASAE 225 Query: 303 LMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYAAC 410 L A+ G + H +K G + + N + LYA C Sbjct: 226 LGALALGRRAHVYMLKVGLDKNLHAANALLDLYAKC 261 >ref|XP_004244886.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Solanum lycopersicum] Length = 585 Score = 170 bits (431), Expect = 2e-40 Identities = 82/113 (72%), Positives = 92/113 (81%) Frame = +3 Query: 69 MTKHLIFNLVFLSAPTSYAHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHHQMLKA 248 M K+LIF LV LS P YA IF+QIQ PNIFTWNTMIRGYAES NP PAIEIH+ M Sbjct: 47 MGKYLIFTLVSLSGPMCYAQQIFNQIQFPNIFTWNTMIRGYAESINPYPAIEIHNDMCVN 106 Query: 249 SVEPDTHTYPFLLKACAKLMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYAA 407 SV PDTHTYPFLLKA AK++ VREGEKVHC+A++NGFESLVF+QN+ VH Y A Sbjct: 107 SVAPDTHTYPFLLKAIAKVIDVREGEKVHCIAIRNGFESLVFVQNSLVHFYGA 159 Score = 65.5 bits (158), Expect = 8e-09 Identities = 31/96 (32%), Positives = 49/96 (51%) Frame = +3 Query: 123 AHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHHQMLKASVEPDTHTYPFLLKACAK 302 AH +F ++ N+ WN++I GYA + P + + +M+ V PD T LL A A+ Sbjct: 166 
AHKVFEEMSDKNLVAWNSVINGYALNSRPNETLTLFRKMVLEGVRPDGFTLVSLLTASAE 225 Query: 303 LMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYAAC 410 L A+ G + H +K G + + N + LYA C Sbjct: 226 LGALALGRRAHVYMLKVGLDKNLHASNALLDLYAKC 261 >gb|EPS63069.1| hypothetical protein M569_11717 [Genlisea aurea] Length = 601 Score = 166 bits (420), Expect = 3e-39 Identities = 79/123 (64%), Positives = 97/123 (78%) Frame = +3 Query: 42 SRSTPSQSDMTKHLIFNLVFLSAPTSYAHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAI 221 S S+PS M KHLIF LV LS P YAH +F QI PNIFTW+TMIRGYAESQ+P PA+ Sbjct: 56 SLSSPS---MGKHLIFTLVSLSEPMQYAHKVFDQIPHPNIFTWDTMIRGYAESQDPSPAL 112 Query: 222 EIHHQMLKASVEPDTHTYPFLLKACAKLMAVREGEKVHCVAVKNGFESLVFIQNTSVHLY 401 I+H++ AS+ PDTHTYPFLLKA AKL ++EG+KVHC A+K+G ESLVF+QN+ +HLY Sbjct: 113 SIYHRLRLASLRPDTHTYPFLLKAFAKLTMLKEGQKVHCSALKDGLESLVFVQNSLLHLY 172 Query: 402 AAC 410 +C Sbjct: 173 GSC 175 Score = 56.6 bits (135), Expect = 4e-06 Identities = 31/94 (32%), Positives = 48/94 (51%), Gaps = 1/94 (1%) Frame = +3 Query: 132 IFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHHQMLKASVEPDTHTYPFLLKACAKLMA 311 +F + + WN++I GYA + P A++++ +M V+PD T LL A A+L A Sbjct: 184 LFQSMTCKTLVAWNSVINGYALNNRPNEALKLYREMGLEGVKPDGFTVVSLLTASAELGA 243 Query: 312 VREGEKVHCVAVKNGFESL-VFIQNTSVHLYAAC 410 + G + H K G ES + N + LYA C Sbjct: 244 LALGRRAHAYMAKVGLESTNLHAANALLVLYAKC 277 >ref|XP_004158687.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At4g21065-like [Cucumis sativus] Length = 609 Score = 153 bits (387), Expect = 2e-35 Identities = 75/118 (63%), Positives = 91/118 (77%), Gaps = 1/118 (0%) Frame = +3 Query: 54 PSQSDMTKHLIFNLVFLSAPTSYAHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHH 233 P D KHLIF LV LSAP S+A IF+QIQ+PNIFTWNTMIRG+AES+NP PA+E+ Sbjct: 65 PQNPDFNKHLIFALVSLSAPMSFAAQIFNQIQAPNIFTWNTMIRGFAESENPSPAVELFS 124 Query: 234 QMLKA-SVEPDTHTYPFLLKACAKLMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYA 404 QM A S+ PDTHT+PFL KA AKLM V GE +H V V+NGF+SL F+QN+ VH+Y+ Sbjct: 125 QMHAASSILPDTHTFPFLFKAVAKLMDVSLGEGIHSVVVRNGFDSLRFVQNSLVHMYS 182 Score = 60.5 bits (145), Expect = 2e-07 
Identities = 32/96 (33%), Positives = 49/96 (51%) Frame = +3 Query: 123 AHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHHQMLKASVEPDTHTYPFLLKACAK 302 A+ +F + + WN++I G+A + P A+ ++ +M VEPD T LL AC + Sbjct: 190 AYQVFEIMSYRDRVAWNSVINGFALNGMPNEALTLYREMGSEGVEPDGFTMVSLLSACVE 249 Query: 303 LMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYAAC 410 L A+ GE+VH VK G N + LY+ C Sbjct: 250 LGALALGERVHMYMVKVGLVQNQHASNALLDLYSKC 285 >ref|XP_004134903.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Cucumis sativus] Length = 609 Score = 153 bits (387), Expect = 2e-35 Identities = 75/118 (63%), Positives = 91/118 (77%), Gaps = 1/118 (0%) Frame = +3 Query: 54 PSQSDMTKHLIFNLVFLSAPTSYAHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHH 233 P D KHLIF LV LSAP S+A IF+QIQ+PNIFTWNTMIRG+AES+NP PA+E+ Sbjct: 65 PQNPDFNKHLIFALVSLSAPMSFAAQIFNQIQAPNIFTWNTMIRGFAESENPSPAVELFS 124 Query: 234 QMLKA-SVEPDTHTYPFLLKACAKLMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYA 404 QM A S+ PDTHT+PFL KA AKLM V GE +H V V+NGF+SL F+QN+ VH+Y+ Sbjct: 125 QMHAASSILPDTHTFPFLFKAVAKLMDVSLGEGIHSVVVRNGFDSLRFVQNSLVHMYS 182 Score = 60.5 bits (145), Expect = 2e-07 Identities = 32/96 (33%), Positives = 49/96 (51%) Frame = +3 Query: 123 AHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHHQMLKASVEPDTHTYPFLLKACAK 302 A+ +F + + WN++I G+A + P A+ ++ +M VEPD T LL AC + Sbjct: 190 AYQVFEIMSYRDRVAWNSVINGFALNGMPNEALTLYREMGSEGVEPDGFTMVSLLSACVE 249 Query: 303 LMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYAAC 410 L A+ GE+VH VK G N + LY+ C Sbjct: 250 LGALALGERVHMYMVKVGLVQNQHASNALLDLYSKC 285 >ref|XP_006836321.1| hypothetical protein AMTR_s00092p00064890 [Amborella trichopoda] gi|548838839|gb|ERM99174.1| hypothetical protein AMTR_s00092p00064890 [Amborella trichopoda] Length = 285 Score = 147 bits (372), Expect = 1e-33 Identities = 68/119 (57%), Positives = 87/119 (73%) Frame = +3 Query: 54 PSQSDMTKHLIFNLVFLSAPTSYAHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHH 233 PS + KHLIF+LV LS P YA NIFS IQ PN FTWNTMI+G+++ + + +I+ H Sbjct: 85 PSDPLVGKHLIFSLVSLSTPMRYALNIFSHIQFPNAFTWNTMIKGFSDQEQAQKSIDFFH 144 Query: 234 
QMLKASVEPDTHTYPFLLKACAKLMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYAAC 410 QM+ + PDTHTYPF LKACA L ++RE EK+HC A+K+GF SLVF QN +H Y+AC Sbjct: 145 QMVNDGIAPDTHTYPFSLKACAMLNSLRESEKIHCKALKDGFGSLVFGQNALIHAYSAC 203 >gb|EYU19817.1| hypothetical protein MIMGU_mgv1a017899mg, partial [Mimulus guttatus] Length = 452 Score = 145 bits (366), Expect = 6e-33 Identities = 65/97 (67%), Positives = 79/97 (81%) Frame = +3 Query: 120 YAHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHHQMLKASVEPDTHTYPFLLKACA 299 YA +F QI PNIFTW+TMIRGYAES++P PA+ I+ ++ +SVEPDTHTYPFLLKA A Sbjct: 3 YARKVFDQIPHPNIFTWDTMIRGYAESEDPSPALHIYQRLRLSSVEPDTHTYPFLLKAIA 62 Query: 300 KLMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYAAC 410 KLM VREGEKVHC +K+GFESL+F+QN +H Y AC Sbjct: 63 KLMIVREGEKVHCSTLKDGFESLMFVQNALLHFYGAC 99 Score = 61.2 bits (147), Expect = 1e-07 Identities = 30/96 (31%), Positives = 49/96 (51%) Frame = +3 Query: 123 AHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHHQMLKASVEPDTHTYPFLLKACAK 302 AH +F ++ ++ WN++I GYA + P + + +M +V+PD T L ACA+ Sbjct: 105 AHCLFEKMPYKDLVAWNSVINGYALNNMPNETLTLFRKMGSENVKPDGFTLVSLFTACAE 164 Query: 303 LMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYAAC 410 L A+ G + H K G + + N + LYA C Sbjct: 165 LGALSLGRRAHVYMTKTGLDKNLHAANALLVLYAKC 200 >ref|XP_002869909.1| binding protein [Arabidopsis lyrata subsp. lyrata] gi|297315745|gb|EFH46168.1| binding protein [Arabidopsis lyrata subsp. 
lyrata] Length = 595 Score = 130 bits (327), Expect(2) = 3e-30 Identities = 64/122 (52%), Positives = 88/122 (72%), Gaps = 4/122 (3%) Frame = +3 Query: 57 SQSDMTKHLIFNLVFLSAPT--SYAHNIFSQIQSP-NIFTWNTMIRGYAESQNPRPAIEI 227 S +++ KHLIF LV L +P SYAH +FS+I+ P N+F WNT+IRGYAE N A+ + Sbjct: 48 SDAELGKHLIFYLVSLPSPPPMSYAHKVFSKIEKPINVFIWNTLIRGYAEIGNSVSAVSL 107 Query: 228 HHQMLKAS-VEPDTHTYPFLLKACAKLMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYA 404 + +M + VEPDTHTYPFLLKA K+ VR GE +H V +++GF SL+++QN+ +HLYA Sbjct: 108 YREMRASGFVEPDTHTYPFLLKAVGKMADVRLGETIHSVVIRSGFGSLIYVQNSLLHLYA 167 Query: 405 AC 410 C Sbjct: 168 NC 169 Score = 27.3 bits (59), Expect(2) = 3e-30 Identities = 12/16 (75%), Positives = 15/16 (93%) Frame = +2 Query: 23 SSKSKLKQIHAFSIRH 70 SS +KL+QIHAFSIR+ Sbjct: 28 SSLTKLRQIHAFSIRN 43 Score = 65.5 bits (158), Expect = 8e-09 Identities = 30/96 (31%), Positives = 53/96 (55%) Frame = +3 Query: 123 AHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHHQMLKASVEPDTHTYPFLLKACAK 302 A+ +F ++ ++ WN++I G+AE+ P A+ ++ +M ++PD T LL ACAK Sbjct: 175 AYKVFDKMPEKDLVAWNSVINGFAENGKPEEALALYTEMDLKGIKPDGFTIVSLLSACAK 234 Query: 303 LMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYAAC 410 + A+ G++ H +K G + N + LYA C Sbjct: 235 IGALTLGKRFHVYMIKVGLTRNLHSSNVLLDLYARC 270 >ref|XP_006413827.1| hypothetical protein EUTSA_v10027143mg [Eutrema salsugineum] gi|557114997|gb|ESQ55280.1| hypothetical protein EUTSA_v10027143mg [Eutrema salsugineum] Length = 595 Score = 127 bits (320), Expect(2) = 3e-30 Identities = 64/122 (52%), Positives = 86/122 (70%), Gaps = 4/122 (3%) Frame = +3 Query: 57 SQSDMTKHLIFNLVFLSAPT--SYAHNIFSQIQSP-NIFTWNTMIRGYAESQNPRPAIEI 227 S ++ KHLIF LV L +P SYAH +FS+I+ P N+F WNT+IRGYAE + A+ + Sbjct: 48 SDAEFGKHLIFYLVSLPSPPPMSYAHKVFSKIEKPINVFIWNTLIRGYAEIGDSVSAVSL 107 Query: 228 HHQM-LKASVEPDTHTYPFLLKACAKLMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYA 404 + +M + VEPDTHTYPFLLKA AK+ R GE +H V +++GF SL+F QN+ +HLYA Sbjct: 108 YREMRVSGFVEPDTHTYPFLLKAVAKMADARLGETIHSVVIRSGFGSLIFAQNSLLHLYA 167 Query: 405 AC 410 C Sbjct: 168 NC 169 Score = 30.0 bits (66), 
Expect(2) = 3e-30 Identities = 15/23 (65%), Positives = 18/23 (78%), Gaps = 1/23 (4%) Frame = +2 Query: 5 LLLDCA-SSKSKLKQIHAFSIRH 70 LL C SS +KLK++HAFSIRH Sbjct: 21 LLQTCGVSSLTKLKKVHAFSIRH 43 Score = 66.6 bits (161), Expect = 3e-09 Identities = 30/98 (30%), Positives = 54/98 (55%) Frame = +3 Query: 117 SYAHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHHQMLKASVEPDTHTYPFLLKAC 296 S A+ +F ++ ++ WN++I G+AE+ P A++++ +M ++PD T LL AC Sbjct: 173 SSAYKVFDKMPVKDLVAWNSVINGFAENGKPNEALKLYTEMDSKGIKPDGFTVVSLLSAC 232 Query: 297 AKLMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYAAC 410 AK+ A+ G +VH +K G + N + Y+ C Sbjct: 233 AKIGALTLGRRVHVYMIKAGLTRKLHSSNVLLDFYSRC 270 >emb|CAB45902.1| putative protein (fragment) [Arabidopsis thaliana] gi|7268904|emb|CAB79107.1| putative protein (fragment) [Arabidopsis thaliana] Length = 1495 Score = 126 bits (317), Expect(2) = 7e-30 Identities = 62/122 (50%), Positives = 86/122 (70%), Gaps = 4/122 (3%) Frame = +3 Query: 57 SQSDMTKHLIFNLVFLSAPT--SYAHNIFSQIQSP-NIFTWNTMIRGYAESQNPRPAIEI 227 S +++ KHLIF LV L +P SYAH +FS+I+ P N+F WNT+IRGYAE N A + Sbjct: 48 SDAELGKHLIFYLVSLPSPPPMSYAHKVFSKIEKPINVFIWNTLIRGYAEIGNSISAFSL 107 Query: 228 HHQM-LKASVEPDTHTYPFLLKACAKLMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYA 404 + +M + VEPDTHTYPFL+KA + VR GE +H V +++GF SL+++QN+ +HLYA Sbjct: 108 YREMRVSGLVEPDTHTYPFLIKAVTTMADVRLGETIHSVVIRSGFGSLIYVQNSLLHLYA 167 Query: 405 AC 410 C Sbjct: 168 NC 169 Score = 29.6 bits (65), Expect(2) = 7e-30 Identities = 13/16 (81%), Positives = 15/16 (93%) Frame = +2 Query: 23 SSKSKLKQIHAFSIRH 70 SS +KL+QIHAFSIRH Sbjct: 28 SSITKLRQIHAFSIRH 43 Score = 68.6 bits (166), Expect = 9e-10 Identities = 31/96 (32%), Positives = 54/96 (56%) Frame = +3 Query: 123 AHNIFSQIQSPNIFTWNTMIRGYAESQNPRPAIEIHHQMLKASVEPDTHTYPFLLKACAK 302 A+ +F ++ ++ WN++I G+AE+ P A+ ++ +M ++PD T LL ACAK Sbjct: 175 AYKVFDKMPEKDLVAWNSVINGFAENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAK 234 Query: 303 LMAVREGEKVHCVAVKNGFESLVFIQNTSVHLYAAC 410 + A+ G++VH +K G + N + LYA C Sbjct: 235 IGALTLGKRVHVYMIKVGLTRNLHSSNVLLDLYARC 270