BLASTX nr result
ID: Sinomenium22_contig00028021
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Sinomenium22_contig00028021 (657 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002266698.1| PREDICTED: pentatricopeptide repeat-containi... 57 2e-16 emb|CBI15289.3| unnamed protein product [Vitis vinifera] 57 1e-15 ref|XP_003525037.2| PREDICTED: pentatricopeptide repeat-containi... 53 1e-14 ref|XP_007157658.1| hypothetical protein PHAVU_002G087700g [Phas... 58 3e-14 ref|XP_002511505.1| pentatricopeptide repeat-containing protein,... 55 1e-13 ref|XP_003527053.1| PREDICTED: pentatricopeptide repeat-containi... 57 7e-13 ref|XP_002321537.1| pentatricopeptide repeat-containing family p... 57 3e-12 ref|XP_006578589.1| PREDICTED: pentatricopeptide repeat-containi... 54 3e-11 ref|XP_003523047.2| PREDICTED: pentatricopeptide repeat-containi... 54 3e-11 ref|XP_006578590.1| PREDICTED: pentatricopeptide repeat-containi... 54 3e-11 gb|EXC34220.1| hypothetical protein L484_010090 [Morus notabilis] 54 1e-10 gb|EYU35898.1| hypothetical protein MIMGU_mgv1a019674mg [Mimulus... 57 1e-09 ref|XP_006306742.1| hypothetical protein CARUB_v10008274mg [Caps... 46 3e-09 ref|XP_006416553.1| hypothetical protein EUTSA_v10009547mg [Eutr... 45 4e-09 ref|XP_006390391.1| hypothetical protein EUTSA_v10018113mg [Eutr... 45 5e-09 gb|AAF79278.1|AC068602_1 F14D16.2 [Arabidopsis thaliana] 46 5e-08 ref|NP_001185030.1| pentatricopeptide repeat-containing protein ... 46 5e-08 ref|NP_173324.1| pentatricopeptide repeat-containing protein [Ar... 46 5e-08 ref|NP_177613.1| pentatricopeptide repeat-containing protein [Ar... 44 5e-08 ref|XP_002890305.1| pentatricopeptide repeat-containing protein ... 46 8e-08 >ref|XP_002266698.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74750 [Vitis vinifera] Length = 875 Score = 57.0 bits (136), Expect(2) = 2e-16 Identities = 25/41 (60%), Positives = 31/41 (75%) Frame = -3 Query: 124 GSCC*KALGNLNCSLGVYQANQILKQLQDNSVAHGFFYWVK 2 G +AL NLNC + YQANQ+LKQ+QD+ VA GFFYW+K Sbjct: 330 GPAAEEALRNLNCLMDAYQANQVLKQIQDHPVALGFFYWLK 370 Score = 55.5 bits (132), Expect(2) = 2e-16 Identities = 51/196 (26%), Positives = 80/196 (40%), Gaps = 13/196 (6%) Frame = -1 Query: 657 GVFSSTQSHKVDSSRSIPNVGSSNTKSHRDENFSKVQARQSVNVAGGLESKNRSHDLKVK 478 G+ Q+ VD ++ + + S+N K R FSKV+A S N+A S + H + K Sbjct: 159 GMLKLPQNCMVDPTKPLSKIKSTNIKPIRKGKFSKVRAESSANIAAASNSTSSYHSTRGK 218 Query: 477 SEKLGSVEALEFGPSFE*FYREISWDS*SNFRCSVSKAICNPEIRXXXXXXXXXXXXXSD 298 +K GSV+ +W + S+S N Sbjct: 219 GDKSGSVKGCSHVGD--------TWTRNTVDTRSLSSDTHNKRSMPQKSKAYSNYS---- 266 Query: 297 RLTPGCEVTDDIRYVSNIQKP-PKFTGEIS---TKP---------APLAKQSNNDRQVVE 157 T + + SN++ P+F G I+ +KP AP+++Q + VVE Sbjct: 267 --------TSNSNFNSNVRNSEPRFVGGIAGGFSKPLRDTKMIGIAPVSRQFGSSGHVVE 318 Query: 156 SVFHTLKHLKWGPAAE 109 +V L+ L WGPAAE Sbjct: 319 NVSRILRQLSWGPAAE 334 >emb|CBI15289.3| unnamed protein product [Vitis vinifera] Length = 793 Score = 57.0 bits (136), Expect(2) = 1e-15 Identities = 25/41 (60%), Positives = 31/41 (75%) Frame = -3 Query: 124 GSCC*KALGNLNCSLGVYQANQILKQLQDNSVAHGFFYWVK 2 G +AL NLNC + YQANQ+LKQ+QD+ VA GFFYW+K Sbjct: 339 GPAAEEALRNLNCLMDAYQANQVLKQIQDHPVALGFFYWLK 379 Score = 52.4 bits (124), Expect(2) = 1e-15 Identities = 47/183 (25%), Positives = 72/183 (39%) Frame = -1 Query: 657 GVFSSTQSHKVDSSRSIPNVGSSNTKSHRDENFSKVQARQSVNVAGGLESKNRSHDLKVK 478 G+ Q+ VD ++ + + S+N K R FSKV+A S N+A S + H + K Sbjct: 186 GMLKLPQNCMVDPTKPLSKIKSTNIKPIRKGKFSKVRAESSANIAAASNSTSSYHSTRGK 245 Query: 477 SEKLGSVEALEFGPSFE*FYREISWDS*SNFRCSVSKAICNPEIRXXXXXXXXXXXXXSD 298 +K GSV+ +W + S+S N Sbjct: 246 GDKSGSVKGCSHVGD--------TWTRNTVDTRSLSSDTHNKRSMPQKSKAYSNYS---- 293 Query: 297 RLTPGCEVTDDIRYVSNIQKPPKFTGEISTKPAPLAKQSNNDRQVVESVFHTLKHLKWGP 118 T + + SN + K G AP+++Q + VVE+V L+ L WGP Sbjct: 294 --------TSNSNFNSNPLRDTKMIGI-----APVSRQFGSSGHVVENVSRILRQLSWGP 340 Query: 117 AAE 109 AAE Sbjct: 341 AAE 343 >ref|XP_003525037.2| PREDICTED: pentatricopeptide repeat-containing protein At1g74750-like [Glycine max] Length = 876 Score = 53.1 bits (126), Expect(2) = 1e-14 Identities = 47/199 (23%), Positives = 77/199 (38%), Gaps = 15/199 (7%) Frame = -1 Query: 657 GVFSSTQSHKVDSSRSIPNVGSSNTKSHRDENFSKVQARQSVNVAGGLESKNRSHDLKVK 478 G+ S +++ VD++R+ PN+ SSN K + ENF+ V R SV+ + H K K Sbjct: 164 GILSYSKNCMVDTARTPPNIRSSNVKQIKRENFTSVHPRPSVSTNSRSKRAGHHHSGKCK 223 Query: 477 SEKLGSVEALEFGPSFE*FYREISWDS*SNFRCSVSKAICNPEIRXXXXXXXXXXXXXSD 298 +K + + PS + K++ +P I Sbjct: 224 GDKSNLGKGFKHIPS-----------------SGMEKSVVSPNI---------PLNNHEH 257 Query: 297 RLTPGCEVTDDIRYVSNIQKPPKFTGEISTKPAPLAKQSNN---------------DRQV 163 R P T V+N + + + P K+S N +R++ Sbjct: 258 RAFPQRTTTKSNHIVTNFGSYMRASNTQMVEVVPTIKESFNKHPRDLKMSARTAPMNRRI 317 Query: 162 VESVFHTLKHLKWGPAAER 106 VE V L+ L+WGP AE+ Sbjct: 318 VEVVSDILRQLRWGPTAEK 336 Score = 53.1 bits (126), Expect(2) = 1e-14 Identities = 27/43 (62%), Positives = 31/43 (72%) Frame = -3 Query: 130 KMGSCC*KALGNLNCSLGVYQANQILKQLQDNSVAHGFFYWVK 2 + G KAL NLN S+ YQANQILKQLQD SVA GFF W++ Sbjct: 329 RWGPTAEKALYNLNFSMDAYQANQILKQLQDPSVALGFFDWLR 371 >ref|XP_007157658.1| hypothetical protein PHAVU_002G087700g [Phaseolus vulgaris] gi|561031073|gb|ESW29652.1| hypothetical protein PHAVU_002G087700g [Phaseolus vulgaris] Length = 881 Score = 58.2 bits (139), Expect(2) = 3e-14 Identities = 29/43 (67%), Positives = 32/43 (74%) Frame = -3 Query: 130 KMGSCC*KALGNLNCSLGVYQANQILKQLQDNSVAHGFFYWVK 2 K G KAL NLN S+ YQANQILKQLQD+SVA FFYW+K Sbjct: 334 KWGPATEKALCNLNFSIDAYQANQILKQLQDHSVALSFFYWLK 376 Score = 46.6 bits (109), Expect(2) = 3e-14 Identities = 45/203 (22%), Positives = 73/203 (35%), Gaps = 19/203 (9%) Frame = -1 Query: 657 GVFSSTQSHKVDSSRSIPNVGSSNTKSHRDENFSKVQARQSVNV-AGGLESKNRSHDLKV 481 G+ + ++++ VD R++P++ SSN K R E+F+ V + V+ G + N H K Sbjct: 165 GILNYSKNYMVDPGRALPSIRSSNVKQIRKESFTAVHPKPPVSTHPGPSKPTNNHHGAKG 224 Query: 480 KSEKLGSVEALEFGPSFE*FYREISWDS*SNFRCSVSKAICNPEIRXXXXXXXXXXXXXS 301 K +K + K + +P I Sbjct: 225 KGDKSNLAKGF--------------------------KPVASPGIEKSGEAPNIPVNSHD 258 Query: 300 DRLTPGCEVTDDIRYVSNI-----QKPPKFTGE-------------ISTKPAPLAKQSNN 175 R P T R+V+N P+ G ++ AP + N Sbjct: 259 RRALPQRTRTRPNRFVTNFGSNMPSSNPQMAGSFKESFCKYTRNVNMAAGIAPSNRHFTN 318 Query: 174 DRQVVESVFHTLKHLKWGPAAER 106 VV+ V L+ LKWGPA E+ Sbjct: 319 SGHVVDMVKDMLRQLKWGPATEK 341 >ref|XP_002511505.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223550620|gb|EEF52107.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 876 Score = 54.7 bits (130), Expect(2) = 1e-13 Identities = 25/43 (58%), Positives = 32/43 (74%) Frame = -3 Query: 130 KMGSCC*KALGNLNCSLGVYQANQILKQLQDNSVAHGFFYWVK 2 + G +AL NLN S+ YQANQ+LKQLQD++VA FFYW+K Sbjct: 329 RWGPAAEEALANLNYSMDPYQANQVLKQLQDHTVALNFFYWLK 371 Score = 47.8 bits (112), Expect(2) = 1e-13 Identities = 46/185 (24%), Positives = 80/185 (43%), Gaps = 5/185 (2%) Frame = -1 Query: 648 SSTQSHKVDSSRSIPNVGSSNTKSHRDENFSKVQARQSVNVAGGLESKNRSHDLKVKSEK 469 +S ++ VD +R V SSN K R EN SKV + S A + N + KSEK Sbjct: 162 NSPKNCMVDPTRPQSTVRSSNVKPIRRENCSKVYPKASPEAAVSSSTSNYD-STRDKSEK 220 Query: 468 LGSVEALEFGPSFE*FYREISWDS*SNFR-CSVSKAICNPEIRXXXXXXXXXXXXXSDRL 292 ++ + R + + ++ + CS++ C+ I + Sbjct: 221 SSFIKGSK---------RVSNTPAGNSVKTCSIASDTCDRRIIPQKSKGQSNRSTANFNA 271 Query: 291 TPGCEVTDDIRY----VSNIQKPPKFTGEISTKPAPLAKQSNNDRQVVESVFHTLKHLKW 124 T D +Y + +KPP+ T ++ P ++ ++ +VE+V H L+ ++W Sbjct: 272 NVQTVQTSDTKYGEYVAEDYRKPPRET-KMPVVRVPSTRRFASNGHIVENVAHILRQIRW 330 Query: 123 GPAAE 109 GPAAE Sbjct: 331 GPAAE 335 >ref|XP_003527053.1| PREDICTED: pentatricopeptide repeat-containing protein At1g18900-like [Glycine max] Length = 882 Score = 57.0 bits (136), Expect(2) = 7e-13 Identities = 28/43 (65%), Positives = 32/43 (74%) Frame = -3 Query: 130 KMGSCC*KALGNLNCSLGVYQANQILKQLQDNSVAHGFFYWVK 2 + G KAL NLN S+ YQANQILKQLQD+SVA FFYW+K Sbjct: 335 RWGPATEKALYNLNFSIDAYQANQILKQLQDHSVALSFFYWLK 377 Score = 43.1 bits (100), Expect(2) = 7e-13 Identities = 44/203 (21%), Positives = 73/203 (35%), Gaps = 19/203 (9%) Frame = -1 Query: 657 GVFSSTQSHKVDSSRSIPNVGSSNTKSHRDENFSKVQARQSVNV-AGGLESKNRSHDLKV 481 G+ + +++ VD +R++P + SSN + R ENF+ V + V G + N +H K Sbjct: 166 GILNYSKNCMVDPARALPKIRSSNVQQIRTENFTSVHPKPPVPAHPGPSKHTNNNHGAKG 225 Query: 480 KSEKLGSVEALEFGPSFE*FYREISWDS*SNFRCSVSKAICNPEIRXXXXXXXXXXXXXS 301 K+ K + ++ + K+ P I Sbjct: 226 KANKSNLAKGFKYVAA-----------------SGTEKSGAAPNIPVNNHDR-------- 260 Query: 300 DRLTPGCEVTDDIRYVSNIQKPPKFTGEISTKP------------------APLAKQSNN 175 R P T+ +V+N + + +P AP + N Sbjct: 261 -RALPQRTRTNSNHFVTNFGSNMQSSNPQMARPFKESFNKHTRDLNMPAGIAPTRRHFTN 319 Query: 174 DRQVVESVFHTLKHLKWGPAAER 106 VVE V LK L+WGPA E+ Sbjct: 320 SGHVVEGVKDILKQLRWGPATEK 342 >ref|XP_002321537.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|222868533|gb|EEF05664.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 834 Score = 56.6 bits (135), Expect(2) = 3e-12 Identities = 25/43 (58%), Positives = 33/43 (76%) Frame = -3 Query: 130 KMGSCC*KALGNLNCSLGVYQANQILKQLQDNSVAHGFFYWVK 2 + G +AL NLNC + YQANQ+LKQLQD++VA GFF+W+K Sbjct: 287 RWGPSAEEALVNLNCHMDAYQANQVLKQLQDHTVALGFFHWLK 329 Score = 41.2 bits (95), Expect(2) = 3e-12 Identities = 43/182 (23%), Positives = 69/182 (37%) Frame = -1 Query: 654 VFSSTQSHKVDSSRSIPNVGSSNTKSHRDENFSKVQARQSVNVAGGLESKNRSHDLKVKS 475 V +ST + +D +R + N+ SSN K R ENF+K S + G + + +K + Sbjct: 144 VINSTINCMIDPTRQLSNIKSSNVKPIRRENFTKAYPNSSAEIPVGSNAAVNYNSMKDRG 203 Query: 474 EKLGSVEALEFGPSFE*FYREISWDS*SNFRCSVSKAICNPEIRXXXXXXXXXXXXXSDR 295 K V + S + S DS S + K P+ Sbjct: 204 NKSSFVRGFKQVSSIA---ADSSLDSHSLPSDAFDKRRTIPQ------------------ 242 Query: 294 LTPGCEVTDDIRYVSNIQKPPKFTGEISTKPAPLAKQSNNDRQVVESVFHTLKHLKWGPA 115 R + + P ++ A A+Q + VVE+V L+ L+WGP+ Sbjct: 243 -----------RLKAQPNRRPSRDTKMPAVVARSARQFVSTGHVVENVSQILRQLRWGPS 291 Query: 114 AE 109 AE Sbjct: 292 AE 293 >ref|XP_006578589.1| PREDICTED: pentatricopeptide repeat-containing protein At1g18900-like isoform X2 [Glycine max] Length = 898 Score = 54.3 bits (129), Expect(2) = 3e-11 Identities = 27/43 (62%), Positives = 31/43 (72%) Frame = -3 Query: 130 KMGSCC*KALGNLNCSLGVYQANQILKQLQDNSVAHGFFYWVK 2 + G K L NLN S+ YQANQILKQLQD+SVA GFF W+K Sbjct: 351 RWGPATEKTLYNLNFSIDAYQANQILKQLQDHSVAVGFFCWLK 393 Score = 40.4 bits (93), Expect(2) = 3e-11 Identities = 46/203 (22%), Positives = 72/203 (35%), Gaps = 19/203 (9%) Frame = -1 Query: 657 GVFSSTQSHKVDSSRSIPNVGSSNTKSHRDENFSKVQARQSVNV-AGGLESKNRSHDLKV 481 G+ + ++++ VD +R++P + SSN + + ENF+ V + V G + N H K Sbjct: 182 GILNYSKNYMVDPARALPKIRSSNVQQIKKENFTAVHPKPPVPTHPGPSKHTNNHHGAKG 241 Query: 480 KSEKLGSVEALEFGPSFE*FYREISWDS*SNFRCSVSKAICNPEIRXXXXXXXXXXXXXS 301 K++K + + S K+ P I Sbjct: 242 KADKSNLAKGFKHVAS-----------------SGTEKSGAAPNIPVNNHDR-------- 276 Query: 300 DRLTPGCEVTDDIRYVSNI-----QKPPKFTGEISTK----------PA---PLAKQSNN 175 R P T +V+N P+ G PA P + N Sbjct: 277 -RALPQRTRTHSNHFVANFGSNMQSSNPQMAGPFKESFNKHTRDLNMPAGIVPTKRHFTN 335 Query: 174 DRQVVESVFHTLKHLKWGPAAER 106 VVE V LK L+WGPA E+ Sbjct: 336 SGHVVEVVKDILKQLRWGPATEK 358 >ref|XP_003523047.2| PREDICTED: pentatricopeptide repeat-containing protein At1g18900-like isoform X1 [Glycine max] Length = 882 Score = 54.3 bits (129), Expect(2) = 3e-11 Identities = 27/43 (62%), Positives = 31/43 (72%) Frame = -3 Query: 130 KMGSCC*KALGNLNCSLGVYQANQILKQLQDNSVAHGFFYWVK 2 + G K L NLN S+ YQANQILKQLQD+SVA GFF W+K Sbjct: 335 RWGPATEKTLYNLNFSIDAYQANQILKQLQDHSVAVGFFCWLK 377 Score = 40.4 bits (93), Expect(2) = 3e-11 Identities = 46/203 (22%), Positives = 72/203 (35%), Gaps = 19/203 (9%) Frame = -1 Query: 657 GVFSSTQSHKVDSSRSIPNVGSSNTKSHRDENFSKVQARQSVNV-AGGLESKNRSHDLKV 481 G+ + ++++ VD +R++P + SSN + + ENF+ V + V G + N H K Sbjct: 166 GILNYSKNYMVDPARALPKIRSSNVQQIKKENFTAVHPKPPVPTHPGPSKHTNNHHGAKG 225 Query: 480 KSEKLGSVEALEFGPSFE*FYREISWDS*SNFRCSVSKAICNPEIRXXXXXXXXXXXXXS 301 K++K + + S K+ P I Sbjct: 226 KADKSNLAKGFKHVAS-----------------SGTEKSGAAPNIPVNNHDR-------- 260 Query: 300 DRLTPGCEVTDDIRYVSNI-----QKPPKFTGEISTK----------PA---PLAKQSNN 175 R P T +V+N P+ G PA P + N Sbjct: 261 -RALPQRTRTHSNHFVANFGSNMQSSNPQMAGPFKESFNKHTRDLNMPAGIVPTKRHFTN 319 Query: 174 DRQVVESVFHTLKHLKWGPAAER 106 VVE V LK L+WGPA E+ Sbjct: 320 SGHVVEVVKDILKQLRWGPATEK 342 >ref|XP_006578590.1| PREDICTED: pentatricopeptide repeat-containing protein At1g18900-like isoform X3 [Glycine max] Length = 879 Score = 54.3 bits (129), Expect(2) = 3e-11 Identities = 27/43 (62%), Positives = 31/43 (72%) Frame = -3 Query: 130 KMGSCC*KALGNLNCSLGVYQANQILKQLQDNSVAHGFFYWVK 2 + G K L NLN S+ YQANQILKQLQD+SVA GFF W+K Sbjct: 332 RWGPATEKTLYNLNFSIDAYQANQILKQLQDHSVAVGFFCWLK 374 Score = 40.4 bits (93), Expect(2) = 3e-11 Identities = 46/203 (22%), Positives = 72/203 (35%), Gaps = 19/203 (9%) Frame = -1 Query: 657 GVFSSTQSHKVDSSRSIPNVGSSNTKSHRDENFSKVQARQSVNV-AGGLESKNRSHDLKV 481 G+ + ++++ VD +R++P + SSN + + ENF+ V + V G + N H K Sbjct: 163 GILNYSKNYMVDPARALPKIRSSNVQQIKKENFTAVHPKPPVPTHPGPSKHTNNHHGAKG 222 Query: 480 KSEKLGSVEALEFGPSFE*FYREISWDS*SNFRCSVSKAICNPEIRXXXXXXXXXXXXXS 301 K++K + + S K+ P I Sbjct: 223 KADKSNLAKGFKHVAS-----------------SGTEKSGAAPNIPVNNHDR-------- 257 Query: 300 DRLTPGCEVTDDIRYVSNI-----QKPPKFTGEISTK----------PA---PLAKQSNN 175 R P T +V+N P+ G PA P + N Sbjct: 258 -RALPQRTRTHSNHFVANFGSNMQSSNPQMAGPFKESFNKHTRDLNMPAGIVPTKRHFTN 316 Query: 174 DRQVVESVFHTLKHLKWGPAAER 106 VVE V LK L+WGPA E+ Sbjct: 317 SGHVVEVVKDILKQLRWGPATEK 339 >gb|EXC34220.1| hypothetical protein L484_010090 [Morus notabilis] Length = 872 Score = 54.3 bits (129), Expect(2) = 1e-10 Identities = 24/43 (55%), Positives = 33/43 (76%) Frame = -3 Query: 130 KMGSCC*KALGNLNCSLGVYQANQILKQLQDNSVAHGFFYWVK 2 + G +AL NLN ++ +QANQ+LKQLQD++VA GFFYW+K Sbjct: 325 RWGRAAEEALENLNYAMDAFQANQVLKQLQDHNVALGFFYWLK 367 Score = 38.1 bits (87), Expect(2) = 1e-10 Identities = 48/185 (25%), Positives = 74/185 (40%), Gaps = 2/185 (1%) Frame = -1 Query: 657 GVFSST--QSHKVDSSRSIPNVGSSNTKSHRDENFSKVQARQSVNVAGGLESKNRSHDLK 484 G+F++ Q+ VD +R ++ SS+ + +NFS V R SV A N + K Sbjct: 157 GIFNNNLPQNCMVDPARLSTSIRSSHVNHVKRKNFSGVHPRPSVEAA---VQYNSTSSTK 213 Query: 483 VKSEKLGSVEALEFGPSFE*FYREISWDS*SNFRCSVSKAICNPEIRXXXXXXXXXXXXX 304 K K SV+ + P+ SW + S + + + Sbjct: 214 SKDSKSSSVKGVNNVPNTR---NGNSWATRSVPAEARDRRAIPNRTKACLNSFKADFSSD 270 Query: 303 SDRLTPGCEVTDDIRYVSNIQKPPKFTGEISTKPAPLAKQSNNDRQVVESVFHTLKHLKW 124 S++ T G V + +PP+ T AP+ + N VVE V H L L+W Sbjct: 271 SNQSTDGGNVGFGNK---GFNRPPREMN-FPTGYAPIKRPYANTANVVERVSHMLHGLRW 326 Query: 123 GPAAE 109 G AAE Sbjct: 327 GRAAE 331 >gb|EYU35898.1| hypothetical protein MIMGU_mgv1a019674mg [Mimulus guttatus] Length = 846 Score = 56.6 bits (135), Expect(2) = 1e-09 Identities = 26/36 (72%), Positives = 30/36 (83%) Frame = -3 Query: 109 KALGNLNCSLGVYQANQILKQLQDNSVAHGFFYWVK 2 +AL LNC+L YQANQILKQLQD +VA GFFYW+K Sbjct: 306 EALCKLNCTLDAYQANQILKQLQDYTVALGFFYWLK 341 Score = 32.3 bits (72), Expect(2) = 1e-09 Identities = 17/44 (38%), Positives = 24/44 (54%) Frame = -1 Query: 240 KPPKFTGEISTKPAPLAKQSNNDRQVVESVFHTLKHLKWGPAAE 109 K + T + T+PA A+ + +VESV L HL+WGP E Sbjct: 263 KDIRHTKVVITRPAQ-ARPFSTSSPIVESVSRILHHLQWGPPTE 305 >ref|XP_006306742.1| hypothetical protein CARUB_v10008274mg [Capsella rubella] gi|482575453|gb|EOA39640.1| hypothetical protein CARUB_v10008274mg [Capsella rubella] Length = 878 Score = 46.2 bits (108), Expect(2) = 3e-09 Identities = 21/43 (48%), Positives = 28/43 (65%) Frame = -3 Query: 130 KMGSCC*KALGNLNCSLGVYQANQILKQLQDNSVAHGFFYWVK 2 + G +AL NL+ + YQANQ+LKQ+ D A GFFYW+K Sbjct: 331 RWGPAAEEALQNLDFRIDAYQANQVLKQMNDYGNALGFFYWLK 373 Score = 41.6 bits (96), Expect(2) = 3e-09 Identities = 47/182 (25%), Positives = 76/182 (41%), Gaps = 1/182 (0%) Frame = -1 Query: 651 FSSTQSHKVDSSRSIPNVGSSNTKSHRDENFSKVQARQSVNVAGGLESKNRSHDLKVKSE 472 F +S+ VD +R I +V SSN K+ R E+F+KV R + + + +N S + + Sbjct: 182 FGFGKSYMVDPTRPISSVKSSNVKAIRREHFAKVYPRSAAKESSVGKIRNSSSNFR---- 237 Query: 471 KLGSVEALEFGPSFE*FYREISWDS*SNFRCSVSKAICNPEIRXXXXXXXXXXXXXSDRL 292 G+ EA G F ++++S V K + P ++ Sbjct: 238 --GAKEAERTG--FVKGFKQVS-------NSGVGKLL--PTTNNTYGKRTSVLQRPNN-- 282 Query: 291 TPGCEVTDDIRYVSN-IQKPPKFTGEISTKPAPLAKQSNNDRQVVESVFHTLKHLKWGPA 115 D R+V N T ++ A ++Q N +VE+V LK +WGPA Sbjct: 283 -------DSNRFVPNGFSNSSMDTVKVPPGAALTSRQYCNSGHIVENVSSVLKRFRWGPA 335 Query: 114 AE 109 AE Sbjct: 336 AE 337 >ref|XP_006416553.1| hypothetical protein EUTSA_v10009547mg [Eutrema salsugineum] gi|557094324|gb|ESQ34906.1| hypothetical protein EUTSA_v10009547mg [Eutrema salsugineum] Length = 847 Score = 45.4 bits (106), Expect(2) = 4e-09 Identities = 20/35 (57%), Positives = 25/35 (71%) Frame = -3 Query: 106 ALGNLNCSLGVYQANQILKQLQDNSVAHGFFYWVK 2 AL NL + YQANQ+LKQ+ D+ A GFFYW+K Sbjct: 308 ALQNLGLRMDPYQANQVLKQMNDHGNALGFFYWLK 342 Score = 42.0 bits (97), Expect(2) = 4e-09 Identities = 43/181 (23%), Positives = 77/181 (42%) Frame = -1 Query: 651 FSSTQSHKVDSSRSIPNVGSSNTKSHRDENFSKVQARQSVNVAGGLESKNRSHDLKVKSE 472 F ++S VD +R I +V SSN K+ R E+F+KV R + + + +N + + + Sbjct: 161 FGFSKSCMVDPTRPITSVKSSNVKAIRREHFAKVYPRSAAKESSMDKIRNPTSNFR---- 216 Query: 471 KLGSVEALEFGPSFE*FYREISWDS*SNFRCSVSKAICNPEIRXXXXXXXXXXXXXSDRL 292 G+ EA G F ++++S + +N + + P I Sbjct: 217 --GAKEAERAG--FVKGFKQVSITATNNTYVKRTSMLQRPNI------------------ 254 Query: 291 TPGCEVTDDIRYVSNIQKPPKFTGEISTKPAPLAKQSNNDRQVVESVFHTLKHLKWGPAA 112 D R+VSN ++ + + ++Q N +VE+V L+ +WGP A Sbjct: 255 -------DSNRFVSNGSSTEMM--KVHSGTSLTSRQYCNSGHIVENVSSVLRRFRWGPDA 305 Query: 111 E 109 E Sbjct: 306 E 306 >ref|XP_006390391.1| hypothetical protein EUTSA_v10018113mg [Eutrema salsugineum] gi|557086825|gb|ESQ27677.1| hypothetical protein EUTSA_v10018113mg [Eutrema salsugineum] Length = 859 Score = 45.1 bits (105), Expect(2) = 5e-09 Identities = 52/183 (28%), Positives = 75/183 (40%), Gaps = 6/183 (3%) Frame = -1 Query: 639 QSHKVDSSRSIPNVGSSNTKSHRDENFSKVQARQSVNVAGGLESKNRSHDLKVKSEKLGS 460 +S VD +R I +V SSN K R E+ +KV R + + N S KV +E+ G Sbjct: 156 KSRMVDPTRPISSVKSSNVKVIRREHLAKVYPRSASKDSPDQVHVNNSRGTKV-AERSGF 214 Query: 459 VEALEF------GPSFE*FYREISWDS*SNFRCSVSKAICNPEIRXXXXXXXXXXXXXSD 298 V+ L G FE + + ++ S R S+ + P I Sbjct: 215 VKGLNHASNDVAGNPFE--AQGLPTNTASGKRKSMPQ---RPSI--------------DS 255 Query: 297 RLTPGCEVTDDIRYVSNIQKPPKFTGEISTKPAPLAKQSNNDRQVVESVFHTLKHLKWGP 118 R G + V KP + ++ AP +Q N VVE+V L+ KWGP Sbjct: 256 RHASGGYDYNVHSSVEGFSKPSREMVRVTPGTAPTPRQYGNSGYVVENVSSILRRFKWGP 315 Query: 117 AAE 109 AAE Sbjct: 316 AAE 318 Score = 42.0 bits (97), Expect(2) = 5e-09 Identities = 19/43 (44%), Positives = 27/43 (62%) Frame = -3 Query: 130 KMGSCC*KALGNLNCSLGVYQANQILKQLQDNSVAHGFFYWVK 2 K G +AL + + YQANQ+LKQ+ + + A GFFYW+K Sbjct: 312 KWGPAAEEALQSFGFRMDAYQANQVLKQMDNYANALGFFYWLK 354 >gb|AAF79278.1|AC068602_1 F14D16.2 [Arabidopsis thaliana] Length = 977 Score = 46.2 bits (108), Expect(2) = 5e-08 Identities = 21/43 (48%), Positives = 27/43 (62%) Frame = -3 Query: 130 KMGSCC*KALGNLNCSLGVYQANQILKQLQDNSVAHGFFYWVK 2 + G +AL NL + YQANQ+LKQ+ D A GFFYW+K Sbjct: 430 RWGPAAEEALQNLGLRIDAYQANQVLKQMNDYGNALGFFYWLK 472 Score = 37.4 bits (85), Expect(2) = 5e-08 Identities = 46/182 (25%), Positives = 74/182 (40%), Gaps = 1/182 (0%) Frame = -1 Query: 651 FSSTQSHKVDSSRSIPNVGSSNTKSHRDENFSKVQARQSVNVAGGLESKNRSHDLKVKSE 472 F +S VD +R I +V SSN K+ R E+F+K+ R + + ++N S + + Sbjct: 281 FGLPKSCMVDPTRPISSVKSSNVKAIRREHFAKIYPRSAAKESSVGTTRNPSSNFR---- 336 Query: 471 KLGSVEALEFGPSFE*FYREISWDS*SNFRCSVSKAI-CNPEIRXXXXXXXXXXXXXSDR 295 G+ EA G F +R++S V K++ S+R Sbjct: 337 --GAKEAERTG--FVKGFRQVS-------NSVVGKSLPTTNNTYGKRTSVLQRPHIDSNR 385 Query: 294 LTPGCEVTDDIRYVSNIQKPPKFTGEISTKPAPLAKQSNNDRQVVESVFHTLKHLKWGPA 115 P + + K P T A ++Q N +VE+V L+ +WGPA Sbjct: 386 FVPSGFSNSSVE----MMKGPSGT-------ALTSRQYCNSGHIVENVSSVLRRFRWGPA 434 Query: 114 AE 109 AE Sbjct: 435 AE 436 >ref|NP_001185030.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|332191659|gb|AEE29780.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 886 Score = 46.2 bits (108), Expect(2) = 5e-08 Identities = 21/43 (48%), Positives = 27/43 (62%) Frame = -3 Query: 130 KMGSCC*KALGNLNCSLGVYQANQILKQLQDNSVAHGFFYWVK 2 + G +AL NL + YQANQ+LKQ+ D A GFFYW+K Sbjct: 313 RWGPAAEEALQNLGLRIDAYQANQVLKQMNDYGNALGFFYWLK 355 Score = 37.4 bits (85), Expect(2) = 5e-08 Identities = 46/182 (25%), Positives = 74/182 (40%), Gaps = 1/182 (0%) Frame = -1 Query: 651 FSSTQSHKVDSSRSIPNVGSSNTKSHRDENFSKVQARQSVNVAGGLESKNRSHDLKVKSE 472 F +S VD +R I +V SSN K+ R E+F+K+ R + + ++N S + + Sbjct: 164 FGLPKSCMVDPTRPISSVKSSNVKAIRREHFAKIYPRSAAKESSVGTTRNPSSNFR---- 219 Query: 471 KLGSVEALEFGPSFE*FYREISWDS*SNFRCSVSKAI-CNPEIRXXXXXXXXXXXXXSDR 295 G+ EA G F +R++S V K++ S+R Sbjct: 220 --GAKEAERTG--FVKGFRQVS-------NSVVGKSLPTTNNTYGKRTSVLQRPHIDSNR 268 Query: 294 LTPGCEVTDDIRYVSNIQKPPKFTGEISTKPAPLAKQSNNDRQVVESVFHTLKHLKWGPA 115 P + + K P T A ++Q N +VE+V L+ +WGPA Sbjct: 269 FVPSGFSNSSVE----MMKGPSGT-------ALTSRQYCNSGHIVENVSSVLRRFRWGPA 317 Query: 114 AE 109 AE Sbjct: 318 AE 319 >ref|NP_173324.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|42571539|ref|NP_973860.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75151479|sp|Q8GYP6.1|PPR49_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At1g18900 gi|26450017|dbj|BAC42129.1| unknown protein [Arabidopsis thaliana] gi|28827402|gb|AAO50545.1| unknown protein [Arabidopsis thaliana] gi|332191657|gb|AEE29778.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|332191658|gb|AEE29779.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 860 Score = 46.2 bits (108), Expect(2) = 5e-08 Identities = 21/43 (48%), Positives = 27/43 (62%) Frame = -3 Query: 130 KMGSCC*KALGNLNCSLGVYQANQILKQLQDNSVAHGFFYWVK 2 + G +AL NL + YQANQ+LKQ+ D A GFFYW+K Sbjct: 313 RWGPAAEEALQNLGLRIDAYQANQVLKQMNDYGNALGFFYWLK 355 Score = 37.4 bits (85), Expect(2) = 5e-08 Identities = 46/182 (25%), Positives = 74/182 (40%), Gaps = 1/182 (0%) Frame = -1 Query: 651 FSSTQSHKVDSSRSIPNVGSSNTKSHRDENFSKVQARQSVNVAGGLESKNRSHDLKVKSE 472 F +S VD +R I +V SSN K+ R E+F+K+ R + + ++N S + + Sbjct: 164 FGLPKSCMVDPTRPISSVKSSNVKAIRREHFAKIYPRSAAKESSVGTTRNPSSNFR---- 219 Query: 471 KLGSVEALEFGPSFE*FYREISWDS*SNFRCSVSKAI-CNPEIRXXXXXXXXXXXXXSDR 295 G+ EA G F +R++S V K++ S+R Sbjct: 220 --GAKEAERTG--FVKGFRQVS-------NSVVGKSLPTTNNTYGKRTSVLQRPHIDSNR 268 Query: 294 LTPGCEVTDDIRYVSNIQKPPKFTGEISTKPAPLAKQSNNDRQVVESVFHTLKHLKWGPA 115 P + + K P T A ++Q N +VE+V L+ +WGPA Sbjct: 269 FVPSGFSNSSVE----MMKGPSGT-------ALTSRQYCNSGHIVENVSSVLRRFRWGPA 317 Query: 114 AE 109 AE Sbjct: 318 AE 319 >ref|NP_177613.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75207514|sp|Q9SSF9.1|PP123_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At1g74750 gi|5882748|gb|AAD55301.1|AC008263_32 Contains 2 PF|01535 DUF domains [Arabidopsis thaliana] gi|332197508|gb|AEE35629.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 855 Score = 43.9 bits (102), Expect(2) = 5e-08 Identities = 20/43 (46%), Positives = 27/43 (62%) Frame = -3 Query: 130 KMGSCC*KALGNLNCSLGVYQANQILKQLQDNSVAHGFFYWVK 2 K G +AL N + YQANQ+LKQ+ + + A GFFYW+K Sbjct: 308 KWGHAAEEALHNFGFRMDAYQANQVLKQMDNYANALGFFYWLK 350 Score = 39.7 bits (91), Expect(2) = 5e-08 Identities = 47/179 (26%), Positives = 70/179 (39%), Gaps = 2/179 (1%) Frame = -1 Query: 639 QSHKVDSSRSIPNVGSSNTKSHRDENFSKVQARQS--VNVAGGLESKNRSHDLKVKSEKL 466 +S VD +R I V SSN K R E+ +KV R + V + +K S+D+ Sbjct: 160 KSCMVDPTRPISGVKSSNVKVIRREHLAKVYPRSADRVPINSSPGTKQASNDVA------ 213 Query: 465 GSVEALEFGPSFE*FYREISWDS*SNFRCSVSKAICNPEIRXXXXXXXXXXXXXSDRLTP 286 G SFE ++ ++ S R + + R DR Sbjct: 214 --------GKSFE--AHDLLSNNVSGKRKIMPQRPYTDSTRYASGGCDYSVHSSDDRTI- 262 Query: 285 GCEVTDDIRYVSNIQKPPKFTGEISTKPAPLAKQSNNDRQVVESVFHTLKHLKWGPAAE 109 I V KP + +++ + AP +Q N VVE+V L+ KWG AAE Sbjct: 263 -------ISSVEGFGKPSREMMKVTPRTAPTPRQHCNPGYVVENVSSILRRFKWGHAAE 314 >ref|XP_002890305.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297336147|gb|EFH66564.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 860 Score = 46.2 bits (108), Expect(2) = 8e-08 Identities = 21/43 (48%), Positives = 27/43 (62%) Frame = -3 Query: 130 KMGSCC*KALGNLNCSLGVYQANQILKQLQDNSVAHGFFYWVK 2 + G +AL NL + YQANQ+LKQ+ D A GFFYW+K Sbjct: 313 RWGPAAEEALQNLGLRIDAYQANQVLKQMNDYGNALGFFYWLK 355 Score = 36.6 bits (83), Expect(2) = 8e-08 Identities = 47/182 (25%), Positives = 73/182 (40%), Gaps = 1/182 (0%) Frame = -1 Query: 651 FSSTQSHKVDSSRSIPNVGSSNTKSHRDENFSKVQARQSVNVAGGLESKNRSHDLKVKSE 472 F +S VD +R I +V SS+ K+ R E FSKV R + + +++N S + + Sbjct: 164 FGLPKSCMVDPTRPISSVKSSSVKAIRREQFSKVYPRSAAKESSIGKTRNPSSNFR---- 219 Query: 471 KLGSVEALEFGPSFE*FYREISWDS*SNFRCSVSKAI-CNPEIRXXXXXXXXXXXXXSDR 295 G+ EA G F +R++S V K++ S+R Sbjct: 220 --GAKEAERTG--FVKGFRQVS-------NSMVGKSLPTTNNTYGKRTSVLQRPHIDSNR 268 Query: 294 LTPGCEVTDDIRYVSNIQKPPKFTGEISTKPAPLAKQSNNDRQVVESVFHTLKHLKWGPA 115 P + V PP A ++Q N +VE+V L+ +WGPA Sbjct: 269 FVPSGFSNSSMEMVKG---PPG--------TALTSRQYCNSGYIVENVSSVLRRFRWGPA 317 Query: 114 AE 109 AE Sbjct: 318 AE 319