BLASTX nr result
ID: Sinomenium22_contig00047063
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00047063 (381 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

emb|CBI17926.3| unnamed protein product [Vitis vinifera]             155   4e-36
emb|CAN60259.1| hypothetical protein VITISV_007741 [Vitis vinifera]  155   6e-36
ref|XP_004306129.1| PREDICTED: pentatricopeptide repeat-containi...  117   2e-24
ref|XP_006838888.1| hypothetical protein AMTR_s00002p00268120 [A...  115   6e-24
ref|XP_007141849.1| hypothetical protein PHAVU_008G230800g [Phas...  113   3e-23
ref|XP_007139542.1| hypothetical protein PHAVU_008G038800g [Phas...  112   4e-23
ref|XP_002530005.1| pentatricopeptide repeat-containing protein,...  112   7e-23
emb|CAN60969.1| hypothetical protein VITISV_033859 [Vitis vinifera]  112   7e-23
ref|XP_004292199.1| PREDICTED: pentatricopeptide repeat-containi...  111   9e-23
ref|XP_003545249.1| PREDICTED: pentatricopeptide repeat-containi...  110   2e-22
ref|XP_007014360.1| Pentatricopeptide repeat superfamily protein...  110   2e-22
ref|XP_007014358.1| Pentatricopeptide repeat superfamily protein...  110   2e-22
ref|XP_007014357.1| Pentatricopeptide repeat superfamily protein...  110   2e-22
ref|XP_004301874.1| PREDICTED: pentatricopeptide repeat-containi...  110   3e-22
ref|XP_002278681.2| PREDICTED: pentatricopeptide repeat-containi...  109   5e-22
ref|XP_003527818.1| PREDICTED: pentatricopeptide repeat-containi...  109   5e-22
emb|CBI22251.3| unnamed protein product [Vitis vinifera]             109   5e-22
sp|Q56X05.2|PPR15_ARATH RecName: Full=Pentatricopeptide repeat-c...  108   6e-22
ref|XP_006343635.1| PREDICTED: pentatricopeptide repeat-containi...
108 6e-22 ref|NP_172105.4| transcription factor EMB1444 [Arabidopsis thali... 108 6e-22 >emb|CBI17926.3| unnamed protein product [Vitis vinifera] Length = 602 Score = 155 bits (393), Expect = 4e-36 Identities = 73/127 (57%), Positives = 94/127 (74%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 FDEM Q GLVLWT++IR YV E+A +F MREVG+ PD VA++TVVSACG LGD Sbjct: 149 FDEMRQPGLVLWTLIIRAYVCVTFPEKALELFRTMREVGLTPDMVAISTVVSACGLLGDL 208 Query: 201 NTGRSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVRNVVVWNTIIHQC 22 +++H FI KSGIE+D FV S L+ YGECGSLD+AY F E ++N+VVWNT+IHQ Sbjct: 209 GVAKAMHCFIEKSGIEVDAFVSSTLISTYGECGSLDYAYRFFQETPMKNIVVWNTMIHQS 268 Query: 21 VKHEDMD 1 V+H +++ Sbjct: 269 VEHNNLE 275 Score = 81.6 bits (200), Expect = 1e-13 Identities = 40/123 (32%), Positives = 64/123 (52%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 F MP R +V W +I G+ +EA F +M GV P+A+ L + +SAC G Sbjct: 281 FQSMPDRDVVSWNSMIGGFARIGQYQEALTWFHEMEFSGVSPNALTLLSTLSACASHGAL 340 Query: 201 NTGRSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVRNVVVWNTIIHQC 22 +TG +H ++ K+ + D + S L+ MY +CG +D A +F E R++ W +I+ Sbjct: 341 DTGAWIHAYVDKNDMNRDGSLDSSLIDMYSKCGDIDKAVQIFEESTRRDLFTWTSIVCGL 400 Query: 21 VKH 13 H Sbjct: 401 AMH 403 Score = 58.2 bits (139), Expect = 1e-06 Identities = 38/135 (28%), Positives = 60/135 (44%), Gaps = 8/135 (5%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 F+E +R L WT ++ G E+A F KM+E V PD V + V+SAC G Sbjct: 382 FEESTRRDLFTWTSIVCGLAMHGRGEKALHYFSKMKEAQVQPDDVTMVGVLSACAHAGLL 441 Query: 201 NTG-------RSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVR-NVVV 46 + G V G + K ++ + +V + G G L AY L M + N ++ Sbjct: 442 DQGWWYFQSMEKVFGLVPK----VEHY--GCMVDLLGRMGCLKEAYDLIMGMPMEANEII 495 Query: 45 WNTIIHQCVKHEDMD 1 W + C H +++ Sbjct: 496 WGAFLSACRVHNNVE 510 >emb|CAN60259.1| hypothetical protein VITISV_007741 [Vitis vinifera] Length = 602 Score = 155 bits (392), Expect = 6e-36 Identities = 73/127 (57%), 
Positives = 94/127 (74%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 FDEM Q GLVLWT++IR YV E+A +F MREVG+ PD VA++TVVSACG LGD Sbjct: 149 FDEMRQPGLVLWTLIIRAYVCVTFPEKALELFRTMREVGLTPDMVAVSTVVSACGLLGDL 208 Query: 201 NTGRSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVRNVVVWNTIIHQC 22 +++H FI KSGIE+D FV S L+ YGECGSLD+AY F E ++N+VVWNT+IHQ Sbjct: 209 GVAKAMHCFIEKSGIEVDAFVSSTLISTYGECGSLDYAYRFFQETPMKNIVVWNTMIHQS 268 Query: 21 VKHEDMD 1 V+H +++ Sbjct: 269 VEHNNLE 275 Score = 81.6 bits (200), Expect = 1e-13 Identities = 40/123 (32%), Positives = 64/123 (52%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 F MP R +V W +I G+ +EA F +M GV P+A+ L + +SAC G Sbjct: 281 FQSMPDRDVVSWNSMIGGFARIGQYQEALTWFHEMEFSGVSPNALTLLSTLSACASHGAL 340 Query: 201 NTGRSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVRNVVVWNTIIHQC 22 +TG +H ++ K+ + D + S L+ MY +CG +D A +F E R++ W +I+ Sbjct: 341 DTGAWIHAYVDKNDMNRDGSLDSSLIDMYSKCGDIDKAVQIFEESTRRDLFTWTSIVCGL 400 Query: 21 VKH 13 H Sbjct: 401 AMH 403 Score = 59.3 bits (142), Expect = 5e-07 Identities = 38/135 (28%), Positives = 60/135 (44%), Gaps = 8/135 (5%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 F+E +R L WT ++ G E+A F KM+E V PD V + V+SAC G Sbjct: 382 FEESTRRDLFTWTSIVCGLAMHGRGEKALHYFSKMKEAQVQPDDVTMVGVLSACAHAGLL 441 Query: 201 NTG-------RSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVR-NVVV 46 + G V G + K ++ + +V + G G L AY L M + N ++ Sbjct: 442 DQGWWYFQSMEKVFGLVPK----VEHY--GXMVDLLGRMGCLKEAYDLIMGMPMEANEII 495 Query: 45 WNTIIHQCVKHEDMD 1 W + C H +++ Sbjct: 496 WGAFLSACRVHNNVE 510 >ref|XP_004306129.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74630-like [Fragaria vesca subsp. 
vesca] Length = 487 Score = 117 bits (292), Expect = 2e-24 Identities = 53/127 (41%), Positives = 78/127 (61%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 FDEMP+R +V WT +I GY C+ EA +F KM GV+PD V + +V+SAC LGD Sbjct: 199 FDEMPERDVVSWTTMISGYSQAKCSREALTLFWKMNCEGVMPDEVTMVSVISACTDLGDV 258 Query: 201 NTGRSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVRNVVVWNTIIHQC 22 TG S+H FI ++G + + L+ MY +CG +D A+ LF M+ ++ + WN++I C Sbjct: 259 QTGISIHRFIEENGFAWMVSLCNALIDMYAKCGCMDRAWQLFDSMSQKSYITWNSMISAC 318 Query: 21 VKHEDMD 1 H + D Sbjct: 319 ANHGNAD 325 Score = 56.6 bits (135), Expect = 4e-06 Identities = 39/129 (30%), Positives = 58/129 (44%), Gaps = 2/129 (1%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 FD M Q+ + W +I + A++AFG+F M GV PD V ++ A G Sbjct: 300 FDSMSQKSYITWNSMISACANHGNADDAFGLFECMVSAGVPPDGVTFLALLVAYTHKGLV 359 Query: 201 NTG-RSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAV-RNVVVWNTIIH 28 + G R + IE + M G+ G L+ AY L M + N VVW T++ Sbjct: 360 DEGLRLFERMQREYRIEACIEHYGCVADMLGKAGRLEEAYRLISRMPIPGNDVVWGTLLA 419 Query: 27 QCVKHEDMD 1 C + D+D Sbjct: 420 ACRSYGDVD 428 >ref|XP_006838888.1| hypothetical protein AMTR_s00002p00268120 [Amborella trichopoda] gi|548841394|gb|ERN01457.1| hypothetical protein AMTR_s00002p00268120 [Amborella trichopoda] Length = 364 Score = 115 bits (288), Expect = 6e-24 Identities = 59/127 (46%), Positives = 79/127 (62%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 FDEM Q +V WT +I GY++ EA +F +M VG PD+V L V+S CG+LGD Sbjct: 165 FDEMNQPEIVAWTAMINGYLTQGELNEALALFKRMCMVGPEPDSVTLTVVLSVCGKLGDL 224 Query: 201 NTGRSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVRNVVVWNTIIHQC 22 G VH FI K IE D F+ + L+ MY ECG LD A+ +F EM R VV +N+II Q Sbjct: 225 GVGMQVHSFIEKHVIERDSFLGNSLMIMYAECGFLDIAHKVFDEMPRRTVVCFNSIISQY 284 Query: 21 VKHEDMD 1 +KH +++ Sbjct: 285 LKHGEVE 291 Score = 55.8 bits (133), Expect = 6e-06 Identities = 31/113 (27%), Positives = 55/113 (48%), Gaps = 
3/113 (2%) Frame = -1 Query: 357 LVLWTVLIRGYVSGNCAE---EAFGVFVKMREVGVVPDAVALATVVSACGQLGDFNTGRS 187 L + LI+ + S N + F ++ ++R V PD+ L ++ A G S Sbjct: 69 LFTYNTLIKSFSSSNPSSLTIHPFCLYKQLRHSSVSPDSHTLTFMIKALASNPKLKDGNS 128 Query: 186 VHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVRNVVVWNTIIH 28 +H + +SG + + FV S L+ +Y + GS+ A +F EM +V W +I+ Sbjct: 129 IHSHLIRSGFDSNQFVMSSLIKLYTKGGSIVSARQIFDEMNQPEIVAWTAMIN 181 >ref|XP_007141849.1| hypothetical protein PHAVU_008G230800g [Phaseolus vulgaris] gi|561014982|gb|ESW13843.1| hypothetical protein PHAVU_008G230800g [Phaseolus vulgaris] Length = 610 Score = 113 bits (282), Expect = 3e-23 Identities = 54/118 (45%), Positives = 74/118 (62%), Gaps = 1/118 (0%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKM-REVGVVPDAVALATVVSACGQLGD 205 FDE+P R LV W +I GY CA+EA VF +M R G PD ++L +V+ ACG+LGD Sbjct: 176 FDEIPHRDLVSWNSMIAGYAKAGCAKEAVEVFGEMGRRDGFEPDEMSLVSVLGACGELGD 235 Query: 204 FNTGRSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVRNVVVWNTII 31 GR V GF+ + G+ ++ FV S L+ MY +CG L A +F MA R+V+ WN +I Sbjct: 236 LELGRWVEGFVVERGMALNSFVGSALISMYAKCGDLGSARRIFDSMATRDVITWNAVI 293 Score = 90.9 bits (224), Expect = 2e-16 Identities = 41/123 (33%), Positives = 66/123 (53%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 FD M R ++ W +I GY A+EA +F M++ V + + L V+SAC +G Sbjct: 278 FDSMATRDVITWNAVISGYAQNGMADEAISLFHAMKDDNVKENKITLTAVLSACATIGAL 337 Query: 201 NTGRSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVRNVVVWNTIIHQC 22 + G+ ++ + S+ G + D FV + L+ MY +CGSL+ +F EM +N WN +I Sbjct: 338 DLGKQINEYASQRGFQHDIFVATALIDMYAKCGSLESGQRVFKEMPQKNEASWNAMISAL 397 Query: 21 VKH 13 H Sbjct: 398 ASH 400 >ref|XP_007139542.1| hypothetical protein PHAVU_008G038800g [Phaseolus vulgaris] gi|561012675|gb|ESW11536.1| hypothetical protein PHAVU_008G038800g [Phaseolus vulgaris] Length = 488 Score = 112 bits (281), Expect = 4e-23 Identities = 52/127 (40%), Positives = 78/127 (61%) Frame = -1 Query: 381 
FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 FDEMP R +V WT ++ GY + +A +F +MR GV PD V + +V+SAC LGD Sbjct: 200 FDEMPHRDVVTWTAMLSGYSRASRPRDALELFREMRHAGVWPDEVTMVSVISACATLGDV 259 Query: 201 NTGRSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVRNVVVWNTIIHQC 22 TGR VH F++++G + + L+ MYG+CG LD A+ +F M R+++ WN++I C Sbjct: 260 ETGRMVHHFVNENGFGWMVALCNALIDMYGKCGCLDEAWYVFHGMTRRSLITWNSMITVC 319 Query: 21 VKHEDMD 1 H + D Sbjct: 320 ANHGNAD 326 Score = 55.8 bits (133), Expect = 6e-06 Identities = 40/129 (31%), Positives = 61/129 (47%), Gaps = 2/129 (1%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 F M +R L+ W +I + A++AF +F M GVVPD+V L ++ A G Sbjct: 301 FHGMTRRSLITWNSMITVCANHGNADDAFRLFEWMVCSGVVPDSVTLLALLVAFAHKGLV 360 Query: 201 NTGRSVHGFISKS-GIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVR-NVVVWNTIIH 28 + G + + + GIE +V M G G L AY L +++ N VVW ++ Sbjct: 361 DDGIRLFERMERDYGIEPRIEHYGAVVDMLGRAGRLQEAYDLLTNISIPCNDVVWGALLG 420 Query: 27 QCVKHEDMD 1 C H D+D Sbjct: 421 ACRIHGDVD 429 >ref|XP_002530005.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223530484|gb|EEF32367.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 499 Score = 112 bits (279), Expect = 7e-23 Identities = 46/123 (37%), Positives = 78/123 (63%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 FD+MP++ +V WT ++ GY NC+ EA +F +M + G+ PD V + +V+SAC LGD Sbjct: 211 FDDMPEKDVVSWTAMVSGYSKANCSREALELFWEMSDAGIRPDEVTIVSVISACTNLGDV 270 Query: 201 NTGRSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVRNVVVWNTIIHQC 22 TG +VH +I+++G + + L+ MY +CG +D A+ +F M ++++ WN++I C Sbjct: 271 ETGMNVHSYINENGFGWMVSLCNALINMYAKCGCVDRAWRVFNNMKRKSLITWNSMISAC 330 Query: 21 VKH 13 H Sbjct: 331 ANH 333 >emb|CAN60969.1| hypothetical protein VITISV_033859 [Vitis vinifera] Length = 722 Score = 112 bits (279), Expect = 7e-23 Identities = 51/123 (41%), Positives = 77/123 (62%) Frame = -1 Query: 381 
FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 FDEMP+R +V WTV++ GY + EA +F +MR+VGV PD VA+ V+SAC LGD Sbjct: 434 FDEMPERDVVSWTVMVSGYAQAKRSREALELFREMRDVGVRPDEVAMVIVISACTSLGDL 493 Query: 201 NTGRSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVRNVVVWNTIIHQC 22 TG VH +I ++G + + L+ MY +CG +D A+ +F M ++++ WN++I C Sbjct: 494 ETGFEVHRYIDENGFGWMVSLCNALIDMYAKCGCMDLAWQVFNNMERKSLITWNSMISAC 553 Query: 21 VKH 13 H Sbjct: 554 ANH 556 Score = 56.6 bits (135), Expect = 4e-06 Identities = 36/129 (27%), Positives = 61/129 (47%), Gaps = 2/129 (1%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 F+ M ++ L+ W +I + AE+AF VF M G+ PD V +++A G Sbjct: 535 FNNMERKSLITWNSMISACANHGNAEDAFRVFTLMLXSGIRPDGVTFLALLTAYTHKGWV 594 Query: 201 NTGRSVHGFISKS-GIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVR-NVVVWNTIIH 28 + G + + + G+E +V M G G L+ AY L M++ N VVW ++ Sbjct: 595 DDGYGLFESMQRDYGVEAGVEHYGCMVDMLGRAGRLEEAYKLITSMSMPCNDVVWGALLA 654 Query: 27 QCVKHEDMD 1 C + D++ Sbjct: 655 ACRIYGDVE 663 >ref|XP_004292199.1| PREDICTED: pentatricopeptide repeat-containing protein At1g06145-like [Fragaria vesca subsp. 
vesca] Length = 532 Score = 111 bits (278), Expect = 9e-23 Identities = 50/127 (39%), Positives = 77/127 (60%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 FD+MP R L+ WT +I Y EA VF +MR GV PDAV ++TVVSAC LG Sbjct: 199 FDQMPARDLISWTAMINCYCQNKRFGEALAVFDEMRINGVSPDAVTMSTVVSACAHLGAL 258 Query: 201 NTGRSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVRNVVVWNTIIHQC 22 + G+ +H ++ ++G ++D ++ S L+ MY +CG+LD A +F + +N+ WN++I Sbjct: 259 DLGKEIHYYVMRNGFDLDVYIGSALIDMYAKCGALDRALVVFFNLREKNLFCWNSVIEGL 318 Query: 21 VKHEDMD 1 H D + Sbjct: 319 AAHGDAE 325 >ref|XP_003545249.1| PREDICTED: pentatricopeptide repeat-containing protein At2g34400-like isoform X1 [Glycine max] Length = 608 Score = 110 bits (276), Expect = 2e-22 Identities = 52/118 (44%), Positives = 74/118 (62%), Gaps = 1/118 (0%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKM-REVGVVPDAVALATVVSACGQLGD 205 FDE+P+R LV W +I GY CA EA VF +M R G PD ++L +V+ ACG+LGD Sbjct: 174 FDEIPRRDLVSWNSMIAGYAKAGCAREAVEVFGEMGRRDGFEPDEMSLVSVLGACGELGD 233 Query: 204 FNTGRSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVRNVVVWNTII 31 GR V GF+ + G+ ++ ++ S L+ MY +CG L A +F MA R+V+ WN +I Sbjct: 234 LELGRWVEGFVVERGMTLNSYIGSALISMYAKCGDLGSARRIFDGMAARDVITWNAVI 291 Score = 90.1 bits (222), Expect = 3e-16 Identities = 43/123 (34%), Positives = 65/123 (52%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 FD M R ++ W +I GY A+EA +F M+E V + + L V+SAC +G Sbjct: 276 FDGMAARDVITWNAVISGYAQNGMADEAISLFHAMKEDCVTENKITLTAVLSACATIGAL 335 Query: 201 NTGRSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVRNVVVWNTIIHQC 22 + G+ + + S+ G + D FV + L+ MY +CGSL A +F EM +N WN +I Sbjct: 336 DLGKQIDEYASQRGFQHDIFVATALIDMYAKCGSLASAQRVFKEMPQKNEASWNAMISAL 395 Query: 21 VKH 13 H Sbjct: 396 ASH 398 >ref|XP_007014360.1| Pentatricopeptide repeat superfamily protein isoform 4 [Theobroma cacao] gi|590581496|ref|XP_007014363.1| Pentatricopeptide repeat superfamily protein isoform 4 [Theobroma cacao] 
gi|508784723|gb|EOY31979.1| Pentatricopeptide repeat superfamily protein isoform 4 [Theobroma cacao] gi|508784726|gb|EOY31982.1| Pentatricopeptide repeat superfamily protein isoform 4 [Theobroma cacao] Length = 619 Score = 110 bits (275), Expect = 2e-22 Identities = 54/117 (46%), Positives = 73/117 (62%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 FDE+ +R LV W +I GY A EA G+F KMRE G VPD + L +V+ ACG LGD Sbjct: 185 FDEISERDLVSWNSMISGYSKMGYANEAVGLFGKMREEGFVPDEMTLVSVLGACGDLGDL 244 Query: 201 NTGRSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVRNVVVWNTII 31 + GR V GF + I+++ F+ S L+GMYG+CG A +F M ++VV WN +I Sbjct: 245 SLGRWVEGFAIEHKIKLNSFIASALIGMYGKCGDFVSARGVFDGMEGKDVVTWNAMI 301 Score = 99.4 bits (246), Expect = 5e-19 Identities = 46/123 (37%), Positives = 72/123 (58%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 FD M + +V W +I GY ++EA +F M++ GV+PD + L V+SAC +G Sbjct: 286 FDGMEGKDVVTWNAMITGYAQNGMSDEAIKLFHGMKDAGVIPDKITLVGVLSACASIGAL 345 Query: 201 NTGRSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVRNVVVWNTIIHQC 22 + G+ + + S+ G++ + FV + LV MY +CGSLD+A +F M V+N V WN +I Sbjct: 346 DLGKRIDTYASQRGLQRNIFVSTALVDMYAKCGSLDNAQRVFENMPVKNEVSWNAMISAL 405 Query: 21 VKH 13 H Sbjct: 406 AFH 408 Score = 60.8 bits (146), Expect = 2e-07 Identities = 31/118 (26%), Positives = 60/118 (50%), Gaps = 1/118 (0%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSG-NCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGD 205 F ++PQ + V+IRG + + +M+ +G+ P+ + AC L + Sbjct: 83 FSQIPQPNDYAFNVMIRGLTTTWQHYSTTLHFYYQMKFLGLKPNKFTYPFLFIACANLLE 142 Query: 204 FNTGRSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVRNVVVWNTII 31 + G++ H + + G+++D L+ MY CG L A +F E++ R++V WN++I Sbjct: 143 LSHGQAAHSSVFRLGLDVDSHTTHSLITMYARCGELGSARRVFDEISERDLVSWNSMI 200 >ref|XP_007014358.1| Pentatricopeptide repeat superfamily protein isoform 2 [Theobroma cacao] gi|590581482|ref|XP_007014359.1| Pentatricopeptide repeat superfamily protein isoform 2 [Theobroma cacao] 
gi|590581490|ref|XP_007014361.1| Pentatricopeptide repeat superfamily protein isoform 2 [Theobroma cacao] gi|590581493|ref|XP_007014362.1| Pentatricopeptide repeat superfamily protein isoform 2 [Theobroma cacao] gi|508784721|gb|EOY31977.1| Pentatricopeptide repeat superfamily protein isoform 2 [Theobroma cacao] gi|508784722|gb|EOY31978.1| Pentatricopeptide repeat superfamily protein isoform 2 [Theobroma cacao] gi|508784724|gb|EOY31980.1| Pentatricopeptide repeat superfamily protein isoform 2 [Theobroma cacao] gi|508784725|gb|EOY31981.1| Pentatricopeptide repeat superfamily protein isoform 2 [Theobroma cacao] Length = 565 Score = 110 bits (275), Expect = 2e-22 Identities = 54/117 (46%), Positives = 73/117 (62%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 FDE+ +R LV W +I GY A EA G+F KMRE G VPD + L +V+ ACG LGD Sbjct: 131 FDEISERDLVSWNSMISGYSKMGYANEAVGLFGKMREEGFVPDEMTLVSVLGACGDLGDL 190 Query: 201 NTGRSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVRNVVVWNTII 31 + GR V GF + I+++ F+ S L+GMYG+CG A +F M ++VV WN +I Sbjct: 191 SLGRWVEGFAIEHKIKLNSFIASALIGMYGKCGDFVSARGVFDGMEGKDVVTWNAMI 247 Score = 99.4 bits (246), Expect = 5e-19 Identities = 46/123 (37%), Positives = 72/123 (58%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 FD M + +V W +I GY ++EA +F M++ GV+PD + L V+SAC +G Sbjct: 232 FDGMEGKDVVTWNAMITGYAQNGMSDEAIKLFHGMKDAGVIPDKITLVGVLSACASIGAL 291 Query: 201 NTGRSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVRNVVVWNTIIHQC 22 + G+ + + S+ G++ + FV + LV MY +CGSLD+A +F M V+N V WN +I Sbjct: 292 DLGKRIDTYASQRGLQRNIFVSTALVDMYAKCGSLDNAQRVFENMPVKNEVSWNAMISAL 351 Query: 21 VKH 13 H Sbjct: 352 AFH 354 Score = 60.8 bits (146), Expect = 2e-07 Identities = 31/118 (26%), Positives = 60/118 (50%), Gaps = 1/118 (0%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSG-NCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGD 205 F ++PQ + V+IRG + + +M+ +G+ P+ + AC L + Sbjct: 29 FSQIPQPNDYAFNVMIRGLTTTWQHYSTTLHFYYQMKFLGLKPNKFTYPFLFIACANLLE 88 Query: 
204 FNTGRSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVRNVVVWNTII 31 + G++ H + + G+++D L+ MY CG L A +F E++ R++V WN++I Sbjct: 89 LSHGQAAHSSVFRLGLDVDSHTTHSLITMYARCGELGSARRVFDEISERDLVSWNSMI 146 >ref|XP_007014357.1| Pentatricopeptide repeat superfamily protein isoform 1 [Theobroma cacao] gi|508784720|gb|EOY31976.1| Pentatricopeptide repeat superfamily protein isoform 1 [Theobroma cacao] Length = 656 Score = 110 bits (275), Expect = 2e-22 Identities = 54/117 (46%), Positives = 73/117 (62%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 FDE+ +R LV W +I GY A EA G+F KMRE G VPD + L +V+ ACG LGD Sbjct: 222 FDEISERDLVSWNSMISGYSKMGYANEAVGLFGKMREEGFVPDEMTLVSVLGACGDLGDL 281 Query: 201 NTGRSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVRNVVVWNTII 31 + GR V GF + I+++ F+ S L+GMYG+CG A +F M ++VV WN +I Sbjct: 282 SLGRWVEGFAIEHKIKLNSFIASALIGMYGKCGDFVSARGVFDGMEGKDVVTWNAMI 338 Score = 99.4 bits (246), Expect = 5e-19 Identities = 46/123 (37%), Positives = 72/123 (58%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 FD M + +V W +I GY ++EA +F M++ GV+PD + L V+SAC +G Sbjct: 323 FDGMEGKDVVTWNAMITGYAQNGMSDEAIKLFHGMKDAGVIPDKITLVGVLSACASIGAL 382 Query: 201 NTGRSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVRNVVVWNTIIHQC 22 + G+ + + S+ G++ + FV + LV MY +CGSLD+A +F M V+N V WN +I Sbjct: 383 DLGKRIDTYASQRGLQRNIFVSTALVDMYAKCGSLDNAQRVFENMPVKNEVSWNAMISAL 442 Query: 21 VKH 13 H Sbjct: 443 AFH 445 Score = 60.8 bits (146), Expect = 2e-07 Identities = 31/118 (26%), Positives = 60/118 (50%), Gaps = 1/118 (0%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSG-NCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGD 205 F ++PQ + V+IRG + + +M+ +G+ P+ + AC L + Sbjct: 120 FSQIPQPNDYAFNVMIRGLTTTWQHYSTTLHFYYQMKFLGLKPNKFTYPFLFIACANLLE 179 Query: 204 FNTGRSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVRNVVVWNTII 31 + G++ H + + G+++D L+ MY CG L A +F E++ R++V WN++I Sbjct: 180 LSHGQAAHSSVFRLGLDVDSHTTHSLITMYARCGELGSARRVFDEISERDLVSWNSMI 237 >ref|XP_004301874.1| PREDICTED: 
pentatricopeptide repeat-containing protein At2g22410, mitochondrial-like [Fragaria vesca subsp. vesca] Length = 655 Score = 110 bits (274), Expect = 3e-22 Identities = 53/127 (41%), Positives = 76/127 (59%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 FDEM +R LV W +I GY C +EAFG+F +MR GV PD L ++S C Q D Sbjct: 188 FDEMSERSLVSWNSMIGGYSGVGCWKEAFGLFREMRGFGVEPDKYTLVNLLSVCSQSCDL 247 Query: 201 NTGRSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVRNVVVWNTIIHQC 22 + GR +H F+ SG+ +D +R+ L+ MYG+CG L A +F +M +NVV W +++ Sbjct: 248 DLGRYLHHFVVVSGMVVDHILRNALLDMYGKCGHLASAEMVFNQMGCKNVVSWTSMVRAY 307 Query: 21 VKHEDMD 1 KH +D Sbjct: 308 AKHGCID 314 Score = 95.9 bits (237), Expect = 5e-18 Identities = 47/123 (38%), Positives = 68/123 (55%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 FD+MP + +V W LI YV EA +F KM + GV PD L ++SAC Q+GD Sbjct: 320 FDQMPLKNVVSWNSLISCYVREGQCREALDLFQKMLDSGVAPDEATLVFILSACSQVGDL 379 Query: 201 NTGRSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVRNVVVWNTIIHQC 22 G+ H ISKS I + + + L+ MY +CG++ A +F ++ +N+V WN II Sbjct: 380 VIGKEAHDHISKSNIALTVTLYNSLIDMYAKCGAVRTAMDIFTQIPEKNLVSWNIIISAL 439 Query: 21 VKH 13 H Sbjct: 440 ALH 442 Score = 58.9 bits (141), Expect = 7e-07 Identities = 36/131 (27%), Positives = 66/131 (50%), Gaps = 4/131 (3%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 F ++P++ LV W ++I EA +F +M+E G+ PD + ++SAC G Sbjct: 421 FTQIPEKNLVSWNIIISALALHGYGSEAIRIFEQMQEGGIWPDEITFIGLLSACSHSGLL 480 Query: 201 NTGRSVH---GFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVR-NVVVWNTI 34 + GR + + EI+ + + LV + G G L+ A +L M ++ ++VVW + Sbjct: 481 DLGRFFFERMKSVYRISPEIEHY--ACLVDLLGRRGCLEEAITLMRGMPMKPDIVVWGAM 538 Query: 33 IHQCVKHEDMD 1 + C H ++D Sbjct: 539 LGACRIHGNVD 549 Score = 58.5 bits (140), Expect = 9e-07 Identities = 34/117 (29%), Positives = 60/117 (51%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 FD++ + ++ LIRGY + N +A ++ 
M G+ P+ L V+ C + Sbjct: 87 FDQVHEPNKYMYNSLIRGYSNSNDTFKAMSLYYHMINSGLSPNEFTLPFVLKVCAAKTAY 146 Query: 201 NTGRSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVRNVVVWNTII 31 G +V K GI V++ L+ +Y CG + A ++F EM+ R++V WN++I Sbjct: 147 WEGVAVQCQAVKLGIGCQVCVQNGLINVYSVCGLVHCARNVFDEMSERSLVSWNSMI 203 >ref|XP_002278681.2| PREDICTED: pentatricopeptide repeat-containing protein At1g74630-like [Vitis vinifera] Length = 663 Score = 109 bits (272), Expect = 5e-22 Identities = 50/123 (40%), Positives = 77/123 (62%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 F EMP+R +V WTV++ GY + EA +F +MR+VGV PD VA+ +V+SAC LGD Sbjct: 375 FYEMPERDVVSWTVMVSGYAQAKRSREALELFREMRDVGVRPDEVAMVSVISACTSLGDL 434 Query: 201 NTGRSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVRNVVVWNTIIHQC 22 TG VH +I ++G + + L+ MY +CG +D A+ +F M ++++ WN++I C Sbjct: 435 ETGFEVHRYIDENGFGWMVSLCNALIDMYAKCGCMDLAWQVFNNMERKSLITWNSMISAC 494 Query: 21 VKH 13 H Sbjct: 495 ANH 497 Score = 56.2 bits (134), Expect = 5e-06 Identities = 36/129 (27%), Positives = 61/129 (47%), Gaps = 2/129 (1%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 F+ M ++ L+ W +I + AE+AF VF M G+ PD V +++A G Sbjct: 476 FNNMERKSLITWNSMISACANHGNAEDAFRVFTLMLYSGIRPDGVTFLALLTAYTHKGWV 535 Query: 201 NTGRSVHGFISKS-GIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVR-NVVVWNTIIH 28 + G + + + G+E +V M G G L+ AY L M++ N VVW ++ Sbjct: 536 DDGYGLFESMQRDYGVEAGVEHYGCMVDMLGRAGRLEEAYKLITSMSMPCNDVVWGALLA 595 Query: 27 QCVKHEDMD 1 C + D++ Sbjct: 596 ACRIYGDVE 604 >ref|XP_003527818.1| PREDICTED: pentatricopeptide repeat-containing protein At2g20540-like [Glycine max] Length = 535 Score = 109 bits (272), Expect = 5e-22 Identities = 50/123 (40%), Positives = 76/123 (61%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 FDEMP R +V WT +I GY G C +A G+F +M+ VG+ PD +++ +V+ AC QLG Sbjct: 195 FDEMPCRTIVSWTTMINGYARGGCYADALGIFREMQVVGIEPDEISVISVLPACAQLGAL 254 Query: 201 
NTGRSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVRNVVVWNTIIHQC 22 G+ +H + KSG + V + LV MY +CG +D A+ LF +M ++V+ W+T+I Sbjct: 255 EVGKWIHKYSEKSGFLKNAGVFNALVEMYAKCGCIDEAWGLFNQMIEKDVISWSTMIGGL 314 Query: 21 VKH 13 H Sbjct: 315 ANH 317 Score = 56.2 bits (134), Expect = 5e-06 Identities = 32/118 (27%), Positives = 53/118 (44%), Gaps = 1/118 (0%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREV-GVVPDAVALATVVSACGQLGD 205 F ++ + + +IR Y + A VF +M PD V+ +C L Sbjct: 62 FQQLENPNVFSYNAIIRTYTHNHKHPLAITVFNQMLTTKSASPDKFTFPFVIKSCAGLLC 121 Query: 204 FNTGRSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVRNVVVWNTII 31 G+ VH + K G + + L+ MY +CG + AY ++ EM R+ V WN++I Sbjct: 122 RRLGQQVHAHVCKFGPKTHAITENALIDMYTKCGDMSGAYQVYEEMTERDAVSWNSLI 179 >emb|CBI22251.3| unnamed protein product [Vitis vinifera] Length = 476 Score = 109 bits (272), Expect = 5e-22 Identities = 50/123 (40%), Positives = 77/123 (62%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 F EMP+R +V WTV++ GY + EA +F +MR+VGV PD VA+ +V+SAC LGD Sbjct: 188 FYEMPERDVVSWTVMVSGYAQAKRSREALELFREMRDVGVRPDEVAMVSVISACTSLGDL 247 Query: 201 NTGRSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVRNVVVWNTIIHQC 22 TG VH +I ++G + + L+ MY +CG +D A+ +F M ++++ WN++I C Sbjct: 248 ETGFEVHRYIDENGFGWMVSLCNALIDMYAKCGCMDLAWQVFNNMERKSLITWNSMISAC 307 Query: 21 VKH 13 H Sbjct: 308 ANH 310 Score = 56.2 bits (134), Expect = 5e-06 Identities = 36/129 (27%), Positives = 61/129 (47%), Gaps = 2/129 (1%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 F+ M ++ L+ W +I + AE+AF VF M G+ PD V +++A G Sbjct: 289 FNNMERKSLITWNSMISACANHGNAEDAFRVFTLMLYSGIRPDGVTFLALLTAYTHKGWV 348 Query: 201 NTGRSVHGFISKS-GIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVR-NVVVWNTIIH 28 + G + + + G+E +V M G G L+ AY L M++ N VVW ++ Sbjct: 349 DDGYGLFESMQRDYGVEAGVEHYGCMVDMLGRAGRLEEAYKLITSMSMPCNDVVWGALLA 408 Query: 27 QCVKHEDMD 1 C + D++ Sbjct: 409 ACRIYGDVE 417 >sp|Q56X05.2|PPR15_ARATH RecName: Full=Pentatricopeptide repeat-containing 
protein At1g06145; AltName: Full=Protein EMBRYO DEFECTIVE 1444 Length = 577 Score = 108 bits (271), Expect = 6e-22 Identities = 48/123 (39%), Positives = 74/123 (60%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 F++MP + ++ WT +I+GY EA VF KM E G++PD V ++TV+SAC LG Sbjct: 244 FNQMPVKDIISWTTMIKGYSQNKRYREAIAVFYKMMEEGIIPDEVTMSTVISACAHLGVL 303 Query: 201 NTGRSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVRNVVVWNTIIHQC 22 G+ VH + ++G +D ++ S LV MY +CGSL+ A +F + +N+ WN+II Sbjct: 304 EIGKEVHMYTLQNGFVLDVYIGSALVDMYSKCGSLERALLVFFNLPKKNLFCWNSIIEGL 363 Query: 21 VKH 13 H Sbjct: 364 AAH 366 Score = 56.2 bits (134), Expect = 5e-06 Identities = 34/128 (26%), Positives = 59/128 (46%), Gaps = 2/128 (1%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 F +P++ L W +I G + A+EA +F KM V P+AV +V +AC G Sbjct: 345 FFNLPKKNLFCWNSIIEGLAAHGFAQEALKMFAKMEMESVKPNAVTFVSVFTACTHAGLV 404 Query: 201 NTGRSVH-GFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVR-NVVVWNTIIH 28 + GR ++ I I + +V ++ + G + A L M N V+W ++ Sbjct: 405 DEGRRIYRSMIDDYSIVSNVEHYGGMVHLFSKAGLIYEALELIGNMEFEPNAVIWGALLD 464 Query: 27 QCVKHEDM 4 C H+++ Sbjct: 465 GCRIHKNL 472 >ref|XP_006343635.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Solanum tuberosum] Length = 476 Score = 108 bits (271), Expect = 6e-22 Identities = 52/117 (44%), Positives = 72/117 (61%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 FDEMPQR +V WTVLI GY ++A VF KMR+ GV P+ V + +SAC G Sbjct: 176 FDEMPQRDVVSWTVLIMGYRDCGKFDDALVVFEKMRDSGVAPNRVTMVNALSACANCGAL 235 Query: 201 NTGRSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVRNVVVWNTII 31 + G +H I +SG E+D + + L+ MYG+CG ++H + +F EM RNV WN +I Sbjct: 236 DMGVLIHDEIRRSGWEMDVILGTSLIDMYGKCGKIEHGFWVFQEMKHRNVYTWNAVI 292 Score = 60.1 bits (144), Expect = 3e-07 Identities = 29/103 (28%), Positives = 52/103 (50%), Gaps = 3/103 (2%) Frame = -1 Query: 306 
EEAFGVFVKMREVGVVPDAVALATVVSACGQLGDFNTGRSVHGFISKSGIEIDDFVRSEL 127 + + ++V M + G+ P+ V+ + L + G+SVH + K G D +V++ L Sbjct: 100 QNSISMYVHMHKEGIFPNNYTYPFVLKSLSDLKELKLGKSVHTHVVKWGYVCDIYVQNSL 159 Query: 126 VGMYGECGSLDHAYSLFCEMAVRNVVVWNTII---HQCVKHED 7 + +Y CG ++ +F EM R+VV W +I C K +D Sbjct: 160 LNLYASCGEIEFCQQVFDEMPQRDVVSWTVLIMGYRDCGKFDD 202 >ref|NP_172105.4| transcription factor EMB1444 [Arabidopsis thaliana] gi|8810477|gb|AAF80138.1|AC024174_20 Contains similarity to an unknown protein T5J8.5 gi|4263522 from Arabidopsis thaliana BAC T5J8 gb|AC004044 and contains multiple PPR PF|01535 repeats. ESTs gb|AV565358, gb|AV558710, gb|AV524184 come from this gene [Arabidopsis thaliana] gi|332189826|gb|AEE27947.1| bHLH transcription factor LHL1 [Arabidopsis thaliana] Length = 1322 Score = 108 bits (271), Expect = 6e-22 Identities = 48/123 (39%), Positives = 74/123 (60%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 F++MP + ++ WT +I+GY EA VF KM E G++PD V ++TV+SAC LG Sbjct: 989 FNQMPVKDIISWTTMIKGYSQNKRYREAIAVFYKMMEEGIIPDEVTMSTVISACAHLGVL 1048 Query: 201 NTGRSVHGFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVRNVVVWNTIIHQC 22 G+ VH + ++G +D ++ S LV MY +CGSL+ A +F + +N+ WN+II Sbjct: 1049 EIGKEVHMYTLQNGFVLDVYIGSALVDMYSKCGSLERALLVFFNLPKKNLFCWNSIIEGL 1108 Query: 21 VKH 13 H Sbjct: 1109 AAH 1111 Score = 56.2 bits (134), Expect = 5e-06 Identities = 34/128 (26%), Positives = 59/128 (46%), Gaps = 2/128 (1%) Frame = -1 Query: 381 FDEMPQRGLVLWTVLIRGYVSGNCAEEAFGVFVKMREVGVVPDAVALATVVSACGQLGDF 202 F +P++ L W +I G + A+EA +F KM V P+AV +V +AC G Sbjct: 1090 FFNLPKKNLFCWNSIIEGLAAHGFAQEALKMFAKMEMESVKPNAVTFVSVFTACTHAGLV 1149 Query: 201 NTGRSVH-GFISKSGIEIDDFVRSELVGMYGECGSLDHAYSLFCEMAVR-NVVVWNTIIH 28 + GR ++ I I + +V ++ + G + A L M N V+W ++ Sbjct: 1150 DEGRRIYRSMIDDYSIVSNVEHYGGMVHLFSKAGLIYEALELIGNMEFEPNAVIWGALLD 1209 Query: 27 QCVKHEDM 4 C H+++ Sbjct: 1210 GCRIHKNL 1217