BLASTX nr result
ID: Glycyrrhiza32_contig00033884
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza32_contig00033884
         (556 letters)

Database: ./nr
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

XP_004490153.1 PREDICTED: pentatricopeptide repeat-containing pr...  283   1e-90
XP_003614017.1 PPR containing plant-like protein [Medicago trunc...  280   1e-89
KYP65771.1 Pentatricopeptide repeat-containing protein At2g20540...  269   2e-85
KHN02018.1 Pentatricopeptide repeat-containing protein [Glycine ...  268   4e-85
XP_003520106.1 PREDICTED: pentatricopeptide repeat-containing pr...  268   4e-85
OIW02168.1 hypothetical protein TanjilG_02392 [Lupinus angustifo...  261   1e-82
KHN03629.1 Pentatricopeptide repeat-containing protein [Glycine ...  258   2e-82
XP_019459635.1 PREDICTED: pentatricopeptide repeat-containing pr...  261   2e-82
KRH75002.1 hypothetical protein GLYMA_01G056200 [Glycine max]        257   3e-81
XP_007157581.1 hypothetical protein PHAVU_002G081200g [Phaseolus...  252   8e-79
XP_017435484.1 PREDICTED: pentatricopeptide repeat-containing pr...  248   5e-77
XP_015966522.1 PREDICTED: pentatricopeptide repeat-containing pr...  247   1e-76
XP_014492913.1 PREDICTED: pentatricopeptide repeat-containing pr...  244   1e-75
XP_016203711.1 PREDICTED: pentatricopeptide repeat-containing pr...  243   3e-75
XP_011463034.1 PREDICTED: pentatricopeptide repeat-containing pr...  229   1e-69
OAY59401.1 hypothetical protein MANES_01G029700 [Manihot esculenta]  228   2e-69
OMO71979.1 hypothetical protein COLO4_27920 [Corchorus olitorius]    227   7e-69
GAU50198.1 hypothetical protein TSUD_408860 [Trifolium subterran...  229   2e-68
XP_008389431.1 PREDICTED: pentatricopeptide repeat-containing pr...
224 1e-67 EOY01760.1 Pentatricopeptide repeat-containing protein [Theobrom... 224 1e-67 >XP_004490153.1 PREDICTED: pentatricopeptide repeat-containing protein At2g20540-like [Cicer arietinum] Length = 523 Score = 283 bits (723), Expect = 1e-90 Identities = 142/172 (82%), Positives = 150/172 (87%) Frame = +3 Query: 3 YSGMASEGLKLLDKMCSVYNIEPKSEHYGCLVDLLSRAGLFEEAMVIIRRMTNSWNGSEE 182 YSGMA EGLKLLDKMCSVYNIEPKSEHYGCLVDLLSRAGLFEEAMVIIRR+TNSW G+ E Sbjct: 353 YSGMAFEGLKLLDKMCSVYNIEPKSEHYGCLVDLLSRAGLFEEAMVIIRRITNSWIGNAE 412 Query: 183 TLAWRALLSACCNHGXXXXXXXXXXXXXRLDNHSGVYVLLSNLYAASGKHADARRVRDMM 362 TLAWRA LSACCNHG +L+NHSGVYVLLSNLYA+SGKH+DARRVRDMM Sbjct: 413 TLAWRAFLSACCNHGKTQLAEVAAEKLLQLENHSGVYVLLSNLYASSGKHSDARRVRDMM 472 Query: 363 KNKGANKAAPGCSSVEIDGVVSEFIAGEKTHPQMEEIHSVLERMHMQLD*NR 518 K KGANK APGCSS EIDGVVSEFIAGEK HPQMEEIHSVLE+MHMQLD N+ Sbjct: 473 KIKGANK-APGCSSTEIDGVVSEFIAGEKIHPQMEEIHSVLEKMHMQLDYNQ 523 >XP_003614017.1 PPR containing plant-like protein [Medicago truncatula] AES96975.1 PPR containing plant-like protein [Medicago truncatula] Length = 525 Score = 280 bits (717), Expect = 1e-89 Identities = 142/174 (81%), Positives = 151/174 (86%), Gaps = 2/174 (1%) Frame = +3 Query: 3 YSGMASEGLKLLDKMCSVYNIEPKSEHYGCLVDLLSRAGLFEEAMVIIRRMTNSWNGSEE 182 YSGMA EGL LLDKMCSVYNI PKSEHYGCLVDLLSRAGLFEEAMV+IR++TNSWNGSEE Sbjct: 353 YSGMAYEGLMLLDKMCSVYNIVPKSEHYGCLVDLLSRAGLFEEAMVMIRKITNSWNGSEE 412 Query: 183 TLAWRALLSACCNHGXXXXXXXXXXXXXRLDN--HSGVYVLLSNLYAASGKHADARRVRD 356 TLAWRA LSACCNHG +LDN HSGVYVLLSNLYAASGKH+DARRVRD Sbjct: 413 TLAWRAFLSACCNHGETQLAELAAEKVLQLDNHIHSGVYVLLSNLYAASGKHSDARRVRD 472 Query: 357 MMKNKGANKAAPGCSSVEIDGVVSEFIAGEKTHPQMEEIHSVLERMHMQLD*NR 518 MMK KG NK APGCSSVEIDGV+SEFIAGEKTHPQMEEIHSVL++MHMQLD N+ Sbjct: 473 MMKIKGTNK-APGCSSVEIDGVISEFIAGEKTHPQMEEIHSVLKKMHMQLDYNQ 525 >KYP65771.1 Pentatricopeptide repeat-containing protein At2g20540 family [Cajanus cajan] Length = 500 Score = 269 bits (687), Expect = 2e-85 Identities = 134/169 (79%), Positives = 144/169 (85%) Frame = +3 
Query: 3 YSGMASEGLKLLDKMCSVYNIEPKSEHYGCLVDLLSRAGLFEEAMVIIRRMTNSWNGSEE 182 YSG+A EGL+LL KMCSVY I PKSEHYGCLVDLLSR+G FEEAMV++RR+TNSWN SEE Sbjct: 333 YSGLAHEGLQLLHKMCSVYKIVPKSEHYGCLVDLLSRSGHFEEAMVMMRRITNSWNASEE 392 Query: 183 TLAWRALLSACCNHGXXXXXXXXXXXXXRLDNHSGVYVLLSNLYAASGKHADARRVRDMM 362 TLAWRA LSACCN G LDNHSGVYVLLSNLYAASGKH+DARRVRDMM Sbjct: 393 TLAWRAFLSACCNQGQAQLAESAAERLLLLDNHSGVYVLLSNLYAASGKHSDARRVRDMM 452 Query: 363 KNKGANKAAPGCSSVEIDGVVSEFIAGEKTHPQMEEIHSVLERMHMQLD 509 KNKG K APGCSSVEIDGVVSEFIAGE+THPQM+EIHSVLE+MHMQLD Sbjct: 453 KNKGVEK-APGCSSVEIDGVVSEFIAGEETHPQMKEIHSVLEKMHMQLD 500 >KHN02018.1 Pentatricopeptide repeat-containing protein [Glycine soja] Length = 518 Score = 268 bits (686), Expect = 4e-85 Identities = 136/170 (80%), Positives = 148/170 (87%), Gaps = 1/170 (0%) Frame = +3 Query: 3 YSGMASEGLKLLDKMCSVYNIEPKSEHYGCLVDLLSRAGLFEEAMVIIRRMTN-SWNGSE 179 YSGMA EGL+LLDKM S+Y IEPKSEHYGCLVDLLSRAGLF EAMV+IRR+T+ SWNGSE Sbjct: 350 YSGMAHEGLQLLDKMSSLYEIEPKSEHYGCLVDLLSRAGLFGEAMVMIRRITSTSWNGSE 409 Query: 180 ETLAWRALLSACCNHGXXXXXXXXXXXXXRLDNHSGVYVLLSNLYAASGKHADARRVRDM 359 ETLAWRA LSACCNHG RL+NHSGVYVLLSNLYAASGKH+DARRVR+M Sbjct: 410 ETLAWRAFLSACCNHGQAQLAERAAKRLLRLENHSGVYVLLSNLYAASGKHSDARRVRNM 469 Query: 360 MKNKGANKAAPGCSSVEIDGVVSEFIAGEKTHPQMEEIHSVLERMHMQLD 509 M+NKG +K APGCSSVEIDGVVSEFIAGE+THPQMEEIHSVLE +HMQLD Sbjct: 470 MRNKGVDK-APGCSSVEIDGVVSEFIAGEETHPQMEEIHSVLEILHMQLD 518 >XP_003520106.1 PREDICTED: pentatricopeptide repeat-containing protein At2g20540-like [Glycine max] KRH70861.1 hypothetical protein GLYMA_02G114500 [Glycine max] Length = 518 Score = 268 bits (686), Expect = 4e-85 Identities = 136/170 (80%), Positives = 148/170 (87%), Gaps = 1/170 (0%) Frame = +3 Query: 3 YSGMASEGLKLLDKMCSVYNIEPKSEHYGCLVDLLSRAGLFEEAMVIIRRMTN-SWNGSE 179 YSGMA EGL+LLDKM S+Y IEPKSEHYGCLVDLLSRAGLF EAMV+IRR+T+ SWNGSE Sbjct: 350 YSGMAHEGLQLLDKMSSLYEIEPKSEHYGCLVDLLSRAGLFGEAMVMIRRITSTSWNGSE 409 Query: 180 ETLAWRALLSACCNHGXXXXXXXXXXXXXRLDNHSGVYVLLSNLYAASGKHADARRVRDM 359 ETLAWRA LSACCNHG 
RL+NHSGVYVLLSNLYAASGKH+DARRVR+M Sbjct: 410 ETLAWRAFLSACCNHGQAQLAERAAKRLLRLENHSGVYVLLSNLYAASGKHSDARRVRNM 469 Query: 360 MKNKGANKAAPGCSSVEIDGVVSEFIAGEKTHPQMEEIHSVLERMHMQLD 509 M+NKG +K APGCSSVEIDGVVSEFIAGE+THPQMEEIHSVLE +HMQLD Sbjct: 470 MRNKGVDK-APGCSSVEIDGVVSEFIAGEETHPQMEEIHSVLEILHMQLD 518 >OIW02168.1 hypothetical protein TanjilG_02392 [Lupinus angustifolius] Length = 500 Score = 261 bits (668), Expect = 1e-82 Identities = 130/169 (76%), Positives = 145/169 (85%) Frame = +3 Query: 3 YSGMASEGLKLLDKMCSVYNIEPKSEHYGCLVDLLSRAGLFEEAMVIIRRMTNSWNGSEE 182 +SGMA EGL+LL+KMCSVY IEPKSEHYGC+VDLLSRA LFEEAM +IRR+T+S NGSEE Sbjct: 333 HSGMAYEGLQLLEKMCSVYKIEPKSEHYGCIVDLLSRASLFEEAMAVIRRITSSSNGSEE 392 Query: 183 TLAWRALLSACCNHGXXXXXXXXXXXXXRLDNHSGVYVLLSNLYAASGKHADARRVRDMM 362 TLAWRA LSACCNHG +L+NHSGVYVLLSNLYAAS KH+DARRVRDMM Sbjct: 393 TLAWRAFLSACCNHGQAQLAEFAAERLLQLENHSGVYVLLSNLYAASEKHSDARRVRDMM 452 Query: 363 KNKGANKAAPGCSSVEIDGVVSEFIAGEKTHPQMEEIHSVLERMHMQLD 509 KNKGA+K PGCSSVEIDGVV+EFIAGEK HP ME+I+SVLE+MHMQLD Sbjct: 453 KNKGADK-TPGCSSVEIDGVVNEFIAGEKVHPLMEDIYSVLEKMHMQLD 500 >KHN03629.1 Pentatricopeptide repeat-containing protein [Glycine soja] Length = 416 Score = 258 bits (660), Expect = 2e-82 Identities = 128/170 (75%), Positives = 145/170 (85%), Gaps = 1/170 (0%) Frame = +3 Query: 3 YSGMASEGLKLLDKMCSVYNIEPKSEHYGCLVDLLSRAGLFEEAMVIIRRMT-NSWNGSE 179 YSGMA EGL+LL KMCSVY IEPKSE YGCLVDLL+RAGLFEEAMV++RR+T NSWNGSE Sbjct: 248 YSGMAHEGLQLLHKMCSVYKIEPKSEQYGCLVDLLTRAGLFEEAMVMMRRITSNSWNGSE 307 Query: 180 ETLAWRALLSACCNHGXXXXXXXXXXXXXRLDNHSGVYVLLSNLYAASGKHADARRVRDM 359 ETLAWRA LSACCNHG RL+NHSGVYVLLS+LY ASGKH+++RRVRDM Sbjct: 308 ETLAWRAFLSACCNHGHAQLAQCAAERLLRLENHSGVYVLLSSLYGASGKHSNSRRVRDM 367 Query: 360 MKNKGANKAAPGCSSVEIDGVVSEFIAGEKTHPQMEEIHSVLERMHMQLD 509 M+NKG +K APGCS+VE DGVVSEFIAGE+TH QMEEIH +LE++HMQLD Sbjct: 368 MRNKGVDK-APGCSTVESDGVVSEFIAGEETHSQMEEIHPILEKLHMQLD 416 >XP_019459635.1 PREDICTED: pentatricopeptide repeat-containing protein At2g20540-like [Lupinus angustifolius] Length = 522 
Score = 261 bits (668), Expect = 2e-82 Identities = 130/169 (76%), Positives = 145/169 (85%) Frame = +3 Query: 3 YSGMASEGLKLLDKMCSVYNIEPKSEHYGCLVDLLSRAGLFEEAMVIIRRMTNSWNGSEE 182 +SGMA EGL+LL+KMCSVY IEPKSEHYGC+VDLLSRA LFEEAM +IRR+T+S NGSEE Sbjct: 355 HSGMAYEGLQLLEKMCSVYKIEPKSEHYGCIVDLLSRASLFEEAMAVIRRITSSSNGSEE 414 Query: 183 TLAWRALLSACCNHGXXXXXXXXXXXXXRLDNHSGVYVLLSNLYAASGKHADARRVRDMM 362 TLAWRA LSACCNHG +L+NHSGVYVLLSNLYAAS KH+DARRVRDMM Sbjct: 415 TLAWRAFLSACCNHGQAQLAEFAAERLLQLENHSGVYVLLSNLYAASEKHSDARRVRDMM 474 Query: 363 KNKGANKAAPGCSSVEIDGVVSEFIAGEKTHPQMEEIHSVLERMHMQLD 509 KNKGA+K PGCSSVEIDGVV+EFIAGEK HP ME+I+SVLE+MHMQLD Sbjct: 475 KNKGADK-TPGCSSVEIDGVVNEFIAGEKVHPLMEDIYSVLEKMHMQLD 522 >KRH75002.1 hypothetical protein GLYMA_01G056200 [Glycine max] Length = 477 Score = 257 bits (657), Expect = 3e-81 Identities = 127/170 (74%), Positives = 145/170 (85%), Gaps = 1/170 (0%) Frame = +3 Query: 3 YSGMASEGLKLLDKMCSVYNIEPKSEHYGCLVDLLSRAGLFEEAMVIIRRMT-NSWNGSE 179 YSGMA EGL+LL KMCSVY IEPKSE YGCLVDLL+RAGLFEEAMV++RR+T NSWNGSE Sbjct: 309 YSGMAHEGLQLLHKMCSVYKIEPKSEQYGCLVDLLTRAGLFEEAMVMMRRITSNSWNGSE 368 Query: 180 ETLAWRALLSACCNHGXXXXXXXXXXXXXRLDNHSGVYVLLSNLYAASGKHADARRVRDM 359 ETLAWRA LSACCNHG RL+NHSGVYVLLS+LY ASGKH+++RRVRDM Sbjct: 369 ETLAWRAFLSACCNHGHAQLAQCAAERLLRLENHSGVYVLLSSLYGASGKHSNSRRVRDM 428 Query: 360 MKNKGANKAAPGCSSVEIDGVVSEFIAGEKTHPQMEEIHSVLERMHMQLD 509 M+NKG +K APGCS+VE DGVV+EFIAGE+TH QMEEIH +LE++HMQLD Sbjct: 429 MRNKGVDK-APGCSTVESDGVVNEFIAGEETHSQMEEIHPILEKLHMQLD 477 >XP_007157581.1 hypothetical protein PHAVU_002G081200g [Phaseolus vulgaris] ESW29575.1 hypothetical protein PHAVU_002G081200g [Phaseolus vulgaris] Length = 517 Score = 252 bits (644), Expect = 8e-79 Identities = 126/169 (74%), Positives = 142/169 (84%) Frame = +3 Query: 3 YSGMASEGLKLLDKMCSVYNIEPKSEHYGCLVDLLSRAGLFEEAMVIIRRMTNSWNGSEE 182 YSGMA EGL+LL KMCSVY IEPK+EHY CLVDLLSRAGLF+EAMV++RR++N N S+E Sbjct: 350 YSGMAHEGLQLLCKMCSVYKIEPKNEHYSCLVDLLSRAGLFQEAMVMMRRISNLGNVSDE 409 Query: 183 
TLAWRALLSACCNHGXXXXXXXXXXXXXRLDNHSGVYVLLSNLYAASGKHADARRVRDMM 362 TLAWRA LSACCNHG RL+NHSGVYVLLSNLY+ASGKH+DARRVRDMM Sbjct: 410 TLAWRAFLSACCNHGQAQLAERAAERLIRLENHSGVYVLLSNLYSASGKHSDARRVRDMM 469 Query: 363 KNKGANKAAPGCSSVEIDGVVSEFIAGEKTHPQMEEIHSVLERMHMQLD 509 +NKG +K PG SSVEI GVVSEFIAGE+THP M+EIHSVLE+MHMQLD Sbjct: 470 RNKGVDK-VPGSSSVEIGGVVSEFIAGEETHPMMKEIHSVLEKMHMQLD 517 >XP_017435484.1 PREDICTED: pentatricopeptide repeat-containing protein At2g20540-like [Vigna angularis] KOM53127.1 hypothetical protein LR48_Vigan09g178600 [Vigna angularis] BAT99107.1 hypothetical protein VIGAN_10049200 [Vigna angularis var. angularis] Length = 517 Score = 248 bits (632), Expect = 5e-77 Identities = 125/169 (73%), Positives = 140/169 (82%) Frame = +3 Query: 3 YSGMASEGLKLLDKMCSVYNIEPKSEHYGCLVDLLSRAGLFEEAMVIIRRMTNSWNGSEE 182 YSGMA EGL+LL KMCSVY IEPKSEHY CLVDLLSRAG FEEAMV++RR+++S N SEE Sbjct: 350 YSGMAHEGLQLLYKMCSVYKIEPKSEHYSCLVDLLSRAGHFEEAMVMLRRISSSGNVSEE 409 Query: 183 TLAWRALLSACCNHGXXXXXXXXXXXXXRLDNHSGVYVLLSNLYAASGKHADARRVRDMM 362 TLAWRA LSACCNHG RL NHSGVYVLLSN+Y+ASGKH+DARRVRDMM Sbjct: 410 TLAWRAFLSACCNHGQAQLAERAAERLLRLQNHSGVYVLLSNVYSASGKHSDARRVRDMM 469 Query: 363 KNKGANKAAPGCSSVEIDGVVSEFIAGEKTHPQMEEIHSVLERMHMQLD 509 +NK +K PG SSVEI GVVSEFIAGE+THP M+EIHSVLE+MH+QLD Sbjct: 470 RNKRVDK-VPGSSSVEIGGVVSEFIAGEETHPMMKEIHSVLEKMHLQLD 517 >XP_015966522.1 PREDICTED: pentatricopeptide repeat-containing protein At5g06540-like [Arachis duranensis] Length = 520 Score = 247 bits (630), Expect = 1e-76 Identities = 124/170 (72%), Positives = 142/170 (83%), Gaps = 1/170 (0%) Frame = +3 Query: 3 YSGMASEGLKLLDKMCSVYNIEPKSEHYGCLVDLLSRAGLFEEAMVIIRRMTNSWNG-SE 179 YSGM EGLKLLDKMC+V+ IEPK EHYGCLVDLLSRAGLF+EAMV+IRR+ +S +G SE Sbjct: 352 YSGMVLEGLKLLDKMCNVHKIEPKIEHYGCLVDLLSRAGLFKEAMVMIRRIRDSSSGTSE 411 Query: 180 ETLAWRALLSACCNHGXXXXXXXXXXXXXRLDNHSGVYVLLSNLYAASGKHADARRVRDM 359 E L+WRA LSACCNHG +L+NHSGVYVLLSNLYA +GK+++ RRVRDM Sbjct: 412 EALSWRAFLSACCNHGQTKFAELAAAKLLKLENHSGVYVLLSNLYATTGKYSETRRVRDM 471 Query: 360 
MKNKGANKAAPGCSSVEIDGVVSEFIAGEKTHPQMEEIHSVLERMHMQLD 509 MKNKG+ K APGCSSVEIDGV+SEFIAGEKTH QM EIHSVL++MHMQLD Sbjct: 472 MKNKGSEK-APGCSSVEIDGVISEFIAGEKTHLQMHEIHSVLKKMHMQLD 520 >XP_014492913.1 PREDICTED: pentatricopeptide repeat-containing protein At2g20540-like [Vigna radiata var. radiata] Length = 517 Score = 244 bits (623), Expect = 1e-75 Identities = 123/169 (72%), Positives = 139/169 (82%) Frame = +3 Query: 3 YSGMASEGLKLLDKMCSVYNIEPKSEHYGCLVDLLSRAGLFEEAMVIIRRMTNSWNGSEE 182 YSGMA EGL+LL KMCS Y IEPKSEHY CLVDLLSRAGLFEEAMV+IRR+++S N SEE Sbjct: 350 YSGMAHEGLQLLHKMCSEYKIEPKSEHYSCLVDLLSRAGLFEEAMVMIRRISSSGNVSEE 409 Query: 183 TLAWRALLSACCNHGXXXXXXXXXXXXXRLDNHSGVYVLLSNLYAASGKHADARRVRDMM 362 TLAWRA LSACCNHG RL NHSGVYVLLSN+Y+ASGKH+DARRVR+MM Sbjct: 410 TLAWRAFLSACCNHGQAQLAERVAERLLRLQNHSGVYVLLSNVYSASGKHSDARRVREMM 469 Query: 363 KNKGANKAAPGCSSVEIDGVVSEFIAGEKTHPQMEEIHSVLERMHMQLD 509 +NK +K PG SSVEI GVVSEFIAGE+THP +EIHSVLE++H+QLD Sbjct: 470 RNKRVDK-VPGSSSVEIGGVVSEFIAGEETHPMTKEIHSVLEKIHLQLD 517 >XP_016203711.1 PREDICTED: pentatricopeptide repeat-containing protein At5g06540-like [Arachis ipaensis] Length = 518 Score = 243 bits (620), Expect = 3e-75 Identities = 122/170 (71%), Positives = 141/170 (82%), Gaps = 1/170 (0%) Frame = +3 Query: 3 YSGMASEGLKLLDKMCSVYNIEPKSEHYGCLVDLLSRAGLFEEAMVIIRRMTNSWNG-SE 179 YSGM EGLKLLDKMC+V+ I+PK EHYGCLVDLLSRAGLF+EAMV+IRR+ +S +G SE Sbjct: 350 YSGMVLEGLKLLDKMCNVHKIKPKIEHYGCLVDLLSRAGLFKEAMVMIRRIRDSSSGTSE 409 Query: 180 ETLAWRALLSACCNHGXXXXXXXXXXXXXRLDNHSGVYVLLSNLYAASGKHADARRVRDM 359 E L+WRA LSACCNHG +L+NHSGVYVLLSNLYA +GK+++ RRVRDM Sbjct: 410 EALSWRAFLSACCNHGQTKIAEFAAAKLLKLENHSGVYVLLSNLYATTGKYSETRRVRDM 469 Query: 360 MKNKGANKAAPGCSSVEIDGVVSEFIAGEKTHPQMEEIHSVLERMHMQLD 509 MKNKG+ K APGCSSVEIDGV+SEFIA EKTH QM EIHSVL++MHMQLD Sbjct: 470 MKNKGSEK-APGCSSVEIDGVISEFIASEKTHLQMHEIHSVLKKMHMQLD 518 >XP_011463034.1 PREDICTED: pentatricopeptide repeat-containing protein At5g43790-like [Fragaria vesca subsp. 
vesca] Length = 534 Score = 229 bits (584), Expect = 1e-69 Identities = 112/169 (66%), Positives = 134/169 (79%) Frame = +3 Query: 3 YSGMASEGLKLLDKMCSVYNIEPKSEHYGCLVDLLSRAGLFEEAMVIIRRMTNSWNGSEE 182 Y+GMA EG+++LDKMC+VYNI PKSEHYGC+VDLLSRAGLFEEA II R+ +S N SEE Sbjct: 352 YAGMAYEGMRVLDKMCNVYNIRPKSEHYGCIVDLLSRAGLFEEAREIIERIPSSSNPSEE 411 Query: 183 TLAWRALLSACCNHGXXXXXXXXXXXXXRLDNHSGVYVLLSNLYAASGKHADARRVRDMM 362 +AWRA LSACCNHG RL+ HSGVYVLLSNLYAA+GKH DARR+R++M Sbjct: 412 AVAWRAFLSACCNHGQAELTEVAAEKLFRLERHSGVYVLLSNLYAAAGKHGDARRMRNLM 471 Query: 363 KNKGANKAAPGCSSVEIDGVVSEFIAGEKTHPQMEEIHSVLERMHMQLD 509 +N+G +K PGCSSVEID V EFIAGEKTHPQMEEI VL+ ++ Q++ Sbjct: 472 RNRGVDK-VPGCSSVEIDRAVYEFIAGEKTHPQMEEIQLVLQTINKQIE 519 >OAY59401.1 hypothetical protein MANES_01G029700 [Manihot esculenta] Length = 535 Score = 228 bits (582), Expect = 2e-69 Identities = 111/169 (65%), Positives = 134/169 (79%) Frame = +3 Query: 3 YSGMASEGLKLLDKMCSVYNIEPKSEHYGCLVDLLSRAGLFEEAMVIIRRMTNSWNGSEE 182 YSG A EGL++LD+MC+V+NIEPKSEHYGC+VDLLSRAGL +EA II+RM NS + SEE Sbjct: 354 YSGRAHEGLRILDRMCNVHNIEPKSEHYGCMVDLLSRAGLLQEAKEIIQRMPNSRSSSEE 413 Query: 183 TLAWRALLSACCNHGXXXXXXXXXXXXXRLDNHSGVYVLLSNLYAASGKHADARRVRDMM 362 +AWRALLSACCN G +L+ HSG YVLLSNLYA +GKH DA+R++ MM Sbjct: 414 AIAWRALLSACCNQGQAQLAEVAAERLLQLELHSGAYVLLSNLYATAGKHNDAKRIKKMM 473 Query: 363 KNKGANKAAPGCSSVEIDGVVSEFIAGEKTHPQMEEIHSVLERMHMQLD 509 +NKG NK APGCSS++IDG+V EF+AGEKTH QMEEI SVLE+M QL+ Sbjct: 474 RNKGVNK-APGCSSIKIDGIVHEFVAGEKTHKQMEEIESVLEKMKKQLN 521 >OMO71979.1 hypothetical protein COLO4_27920 [Corchorus olitorius] Length = 523 Score = 227 bits (578), Expect = 7e-69 Identities = 108/169 (63%), Positives = 138/169 (81%) Frame = +3 Query: 3 YSGMASEGLKLLDKMCSVYNIEPKSEHYGCLVDLLSRAGLFEEAMVIIRRMTNSWNGSEE 182 YSGM EGL++LD+MC+VY IEPKSEH+GC++DLLSR GLFEEA II+ + +S N S+E Sbjct: 347 YSGMVFEGLRILDRMCNVYKIEPKSEHFGCIIDLLSRGGLFEEANQIIQGIPDSSNPSDE 406 Query: 183 TLAWRALLSACCNHGXXXXXXXXXXXXXRLDNHSGVYVLLSNLYAASGKHADARRVRDMM 362 +AWRALLS+CC++G +L++HSGVYVLLSNLYAASGKH DA+R++ 
MM Sbjct: 407 AIAWRALLSSCCSNGQTKLAEVAAKKLMQLEHHSGVYVLLSNLYAASGKHNDAKRIKQMM 466 Query: 363 KNKGANKAAPGCSSVEIDGVVSEFIAGEKTHPQMEEIHSVLERMHMQLD 509 KN+G NK APGCSSV+I+GVV EFIAGEK+HPQ+E+IHS+LE++ QLD Sbjct: 467 KNRGVNK-APGCSSVKINGVVHEFIAGEKSHPQLEDIHSILEKLDNQLD 514 >GAU50198.1 hypothetical protein TSUD_408860 [Trifolium subterraneum] Length = 660 Score = 229 bits (583), Expect = 2e-68 Identities = 117/145 (80%), Positives = 122/145 (84%), Gaps = 2/145 (1%) Frame = +3 Query: 3 YSGMASEGLKLLDKMCSVYNIEPKSEHYGCLVDLLSRAGLFEEAMVIIRRMTNSWNGSEE 182 YSGMA EGLKLLDKMCSVYN+EPKSEHY CLVDLLSR GLFE+AMVIIR+MTNSWNGSEE Sbjct: 353 YSGMAYEGLKLLDKMCSVYNMEPKSEHYSCLVDLLSRKGLFEKAMVIIRKMTNSWNGSEE 412 Query: 183 TLAWRALLSACCNHGXXXXXXXXXXXXXRLDN--HSGVYVLLSNLYAASGKHADARRVRD 356 TLAWRA LSACCNHG RLDN HSGVYVLLSNLYA SGKH DARRVRD Sbjct: 413 TLAWRAFLSACCNHGETQLAELAAEKVLRLDNHIHSGVYVLLSNLYATSGKHTDARRVRD 472 Query: 357 MMKNKGANKAAPGCSSVEIDGVVSE 431 +MK KGANK APGCSSVEIDGVV+E Sbjct: 473 VMKIKGANK-APGCSSVEIDGVVNE 496 >XP_008389431.1 PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like [Malus domestica] XP_008350909.1 PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like [Malus domestica] Length = 536 Score = 224 bits (571), Expect = 1e-67 Identities = 111/169 (65%), Positives = 128/169 (75%) Frame = +3 Query: 3 YSGMASEGLKLLDKMCSVYNIEPKSEHYGCLVDLLSRAGLFEEAMVIIRRMTNSWNGSEE 182 YSGMA EG+K+ DKMC +YNIEPKSEH+GC VDLLSRAGLFEEA II R+ S SEE Sbjct: 355 YSGMADEGMKVFDKMCRIYNIEPKSEHFGCFVDLLSRAGLFEEAKEIIARIPTSSKPSEE 414 Query: 183 TLAWRALLSACCNHGXXXXXXXXXXXXXRLDNHSGVYVLLSNLYAASGKHADARRVRDMM 362 +AWRA LSACCNHG +L+ HSGVYVLLSNLYAASGKH DARR+R++M Sbjct: 415 AVAWRAFLSACCNHGQAQLAEVAAEKLFQLERHSGVYVLLSNLYAASGKHGDARRIRNLM 474 Query: 363 KNKGANKAAPGCSSVEIDGVVSEFIAGEKTHPQMEEIHSVLERMHMQLD 509 +N+G K APGCSSVEI+ V EFIAGEKTHP+M+EI VLE + LD Sbjct: 475 RNRGVEK-APGCSSVEINRAVHEFIAGEKTHPKMDEIQLVLETIKKHLD 522 >EOY01760.1 Pentatricopeptide repeat-containing protein [Theobroma cacao] Length = 523 Score = 224 bits (570), 
Expect = 1e-67 Identities = 108/169 (63%), Positives = 135/169 (79%) Frame = +3 Query: 3 YSGMASEGLKLLDKMCSVYNIEPKSEHYGCLVDLLSRAGLFEEAMVIIRRMTNSWNGSEE 182 YSGMA EGL +LD+MC VYNIEPKSEH+GC++DLLSR GLFEEA II+RM +S N S+E Sbjct: 350 YSGMAFEGLTILDRMCKVYNIEPKSEHFGCIIDLLSRGGLFEEANKIIQRMPDSSNPSDE 409 Query: 183 TLAWRALLSACCNHGXXXXXXXXXXXXXRLDNHSGVYVLLSNLYAASGKHADARRVRDMM 362 +AWRALLS+CC++G +L++HSGVYVLLSNLYAASGK+ DA+ ++ MM Sbjct: 410 AIAWRALLSSCCSNGQTKLAEVAAEKLMQLEDHSGVYVLLSNLYAASGKYYDAKIIKQMM 469 Query: 363 KNKGANKAAPGCSSVEIDGVVSEFIAGEKTHPQMEEIHSVLERMHMQLD 509 KN+G NK PGCSSV+I GVV EFIAGEK+HPQME+IH +LE++ Q+D Sbjct: 470 KNRGVNK-VPGCSSVKIIGVVHEFIAGEKSHPQMEDIHLILEKLEKQMD 517