BLASTX nr result
ID: Glycyrrhiza23_contig00003927
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Glycyrrhiza23_contig00003927 (2051 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003592720.1| Pentatricopeptide repeat-containing protein ... 873 0.0 ref|XP_003555843.1| PREDICTED: pentatricopeptide repeat-containi... 852 0.0 ref|XP_003536733.1| PREDICTED: pentatricopeptide repeat-containi... 854 0.0 ref|XP_002298285.1| predicted protein [Populus trichocarpa] gi|2... 719 0.0 ref|XP_004149531.1| PREDICTED: pentatricopeptide repeat-containi... 713 0.0 >ref|XP_003592720.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355481768|gb|AES62971.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 553 Score = 873 bits (2256), Expect = 0.0 Identities = 437/526 (83%), Positives = 472/526 (89%), Gaps = 2/526 (0%) Frame = +1 Query: 82 MYSFVSKSHCHLG--QFPFPPLHTGKLEYGIAKVKMGGRMEVVCKGMLAPXXXXXXXXXX 255 M S +SKSHCHLG Q PL T KLEYGI VKM GR++V+CKGML P Sbjct: 1 MCSLISKSHCHLGLGQLSLQPLQTRKLEYGIVNVKMSGRLKVMCKGMLTPRKFMQRKRKM 60 Query: 256 XXXXDAADEAYQKNWWKLMNQIDETGSAVSVLSSEMMKNQTIPKDLVIGTLIRFKQLQKW 435 DAADEA QKNWW+LM IDETGSAVSVL+SE MKNQTIPK LV+GTL+RFKQL+KW Sbjct: 61 VVFKDAADEAEQKNWWRLMKLIDETGSAVSVLNSEKMKNQTIPKALVVGTLMRFKQLKKW 120 Query: 436 NLVAETLEWLWTQSWWNFGKMDFFMLITAYGKLGDFNGAEKVLALMNKNGYASNVVSQTA 615 NLVAE LEWL Q+WW+FGKMDFFMLITAYGKLGDFNGAEKVL LMNKNGYA NVVSQTA Sbjct: 121 NLVAEILEWLRAQNWWDFGKMDFFMLITAYGKLGDFNGAEKVLGLMNKNGYAPNVVSQTA 180 Query: 616 LMEAYGRGGKYNNAEAIFRRMQRFGPEPSALTYQIILKTFVRGDKFKEAEEVFNNLLDDE 795 LMEAYG+GG+YNNAEAIFRRMQ FGPEPSA TYQIILKTFV+G+KFKEAEEVF+ LL+DE Sbjct: 181 LMEAYGKGGRYNNAEAIFRRMQTFGPEPSAFTYQIILKTFVQGNKFKEAEEVFDKLLNDE 240 Query: 796 KSPLKPDQKMFNMMIYMYRKAGSYEKARKTFALMAERGIEQSTVTYNSLMSFETNYKEVS 975 KSPL+PDQKMFNMMIYMY+K+GS+EKAR+TFALMAERGI+++TVTYNSLMSFETNYKEVS Sbjct: 241 KSPLRPDQKMFNMMIYMYKKSGSHEKARQTFALMAERGIKKATVTYNSLMSFETNYKEVS 300 Query: 976 NIYDQMQRADVRPDVVSYALLINAYGKARREEEALAVFEEMLDAGVRPTRKAYNILLDAF 1155 NIYDQMQRAD+RPDVVSYALLINAYGKARREEEALAVFEEMLDAGVRPTRKAYNILLDAF Sbjct: 301 NIYDQMQRADLRPDVVSYALLINAYGKARREEEALAVFEEMLDAGVRPTRKAYNILLDAF 360 Query: 1156 SISGMVDQARTVFKSMRRDKYFPDLCSYTTMLSAYVNAPDMEGAEKFFKRLIQDGIEPNV 1335 SISGMV+QAR VFKSMRRDKY PDLCSYTTMLSAYVNAPDMEGAEKFFKRLIQDG EPNV Sbjct: 361 SISGMVEQARIVFKSMRRDKYMPDLCSYTTMLSAYVNAPDMEGAEKFFKRLIQDGFEPNV 420 Query: 1336 VTYGTLIKGYAKSNDLEKVMEKYEEMLGRGIKANQTILTTIMDAHGKNGDFDSAVHWFKE 1515 VTYGTLIKGYAK+ND+EKVMEKYEEMLGRGIKANQTILTTIMDAHGKNGDFDSAV+WFKE Sbjct: 421 VTYGTLIKGYAKANDIEKVMEKYEEMLGRGIKANQTILTTIMDAHGKNGDFDSAVNWFKE 480 Query: 1516 MESSGVPPDQKAKNVLLSLARTEEEKIEVNELVQHSIENNNLLKAN 1653 M +G+ PDQKAKN+LLSLA+TEE+ E NELV HSIE NNL K N Sbjct: 481 MALNGLLPDQKAKNILLSLAKTEEDIKEANELVLHSIEINNLPKVN 526 >ref|XP_003555843.1| PREDICTED: pentatricopeptide repeat-containing protein At3g59040-like [Glycine max] Length = 562 Score = 852 bits (2200), Expect(2) = 0.0 Identities = 424/531 (79%), Positives = 473/531 (89%), Gaps = 1/531 (0%) Frame = +1 Query: 82 MYSFVSKSHCHLGQ-FPFPPLHTGKLEYGIAKVKMGGRMEVVCKGMLAPXXXXXXXXXXX 258 M S +SKSHCHL Q P P TG GIA+V+M GRMEVVC+GML P Sbjct: 1 MCSLISKSHCHLRQQLPLQPTRTGN---GIARVRMSGRMEVVCRGMLKPRKFMQRRRKFE 57 Query: 259 XXXDAADEAYQKNWWKLMNQIDETGSAVSVLSSEMMKNQTIPKDLVIGTLIRFKQLQKWN 438 D+ADEA QKNW ++M +I+E+GSAVSVLS+E + NQ IPKDLV+GTLIRFKQL+KWN Sbjct: 58 VFKDSADEADQKNWRRIMTEIEESGSAVSVLSAEKINNQNIPKDLVVGTLIRFKQLKKWN 117 Query: 439 LVAETLEWLWTQSWWNFGKMDFFMLITAYGKLGDFNGAEKVLALMNKNGYASNVVSQTAL 618 LV E LEWL TQ+WW+FGKMDFFMLITAYGKLGDFNGAEKVL LMNKNGYA NVVSQTAL Sbjct: 118 LVVEILEWLRTQNWWDFGKMDFFMLITAYGKLGDFNGAEKVLGLMNKNGYAPNVVSQTAL 177 Query: 619 MEAYGRGGKYNNAEAIFRRMQRFGPEPSALTYQIILKTFVRGDKFKEAEEVFNNLLDDEK 798 MEAYG+GG+YNNAEAIFRRMQ++GPEPSA TYQIILKTFV+G+KF+EAEE+F+NLL+DE Sbjct: 178 MEAYGKGGRYNNAEAIFRRMQKWGPEPSAFTYQIILKTFVQGNKFREAEELFDNLLNDEN 237 Query: 799 SPLKPDQKMFNMMIYMYRKAGSYEKARKTFALMAERGIEQSTVTYNSLMSFETNYKEVSN 978 SPLKPDQKMFNMMIYM++KAGSYEKARKTFA MAE GI+Q+TVTYNSLMSFETNYKEVSN Sbjct: 238 SPLKPDQKMFNMMIYMHKKAGSYEKARKTFAQMAELGIQQTTVTYNSLMSFETNYKEVSN 297 Query: 979 IYDQMQRADVRPDVVSYALLINAYGKARREEEALAVFEEMLDAGVRPTRKAYNILLDAFS 1158 IYDQMQRAD+RPDVVSYALL++AYGKARREEEALAVFEEMLDAG+RPTRKAYNILLDAFS Sbjct: 298 IYDQMQRADLRPDVVSYALLVSAYGKARREEEALAVFEEMLDAGIRPTRKAYNILLDAFS 357 Query: 1159 ISGMVDQARTVFKSMRRDKYFPDLCSYTTMLSAYVNAPDMEGAEKFFKRLIQDGIEPNVV 1338 ISGMV+QA+TVFKSMRRD+YFPDLCSYTTMLSAY+NA DMEGAEKFFKRLIQDG EPNVV Sbjct: 358 ISGMVEQAQTVFKSMRRDRYFPDLCSYTTMLSAYINADDMEGAEKFFKRLIQDGFEPNVV 417 Query: 1339 TYGTLIKGYAKSNDLEKVMEKYEEMLGRGIKANQTILTTIMDAHGKNGDFDSAVHWFKEM 1518 TYGTLIKGYAK NDLE VM+KYEEML RGIKANQTILTTIMDA+GK+GDFDSAVHWFKEM Sbjct: 418 TYGTLIKGYAKINDLEMVMKKYEEMLMRGIKANQTILTTIMDAYGKSGDFDSAVHWFKEM 477 Query: 1519 ESSGVPPDQKAKNVLLSLARTEEEKIEVNELVQHSIENNNLLKANGAVRFV 1671 ES+G+PPDQKAKNVLLSLA+T+EE+ E NELV H EN++L K NG V+ V Sbjct: 478 ESNGIPPDQKAKNVLLSLAKTDEEREEANELVVHFSENSSLPKVNGIVKLV 528 Score = 33.5 bits (75), Expect(2) = 0.0 Identities = 13/18 (72%), Positives = 15/18 (83%) Frame = +3 Query: 1785 DNYEYFDAQLGRAYDEQS 1838 DNYEYFDAQL RAY+ + Sbjct: 535 DNYEYFDAQLARAYEHST 552 >ref|XP_003536733.1| PREDICTED: pentatricopeptide repeat-containing protein At3g59040-like [Glycine max] Length = 553 Score = 854 bits (2206), Expect(2) = 0.0 Identities = 424/530 (80%), Positives = 471/530 (88%) Frame = +1 Query: 82 MYSFVSKSHCHLGQFPFPPLHTGKLEYGIAKVKMGGRMEVVCKGMLAPXXXXXXXXXXXX 261 M S +SKSHCHL Q P P TG GIA V+M GRMEVVC+GML P Sbjct: 1 MCSLISKSHCHLRQLPLPHTRTGN---GIASVRMSGRMEVVCRGMLKPRKFMQRRRKFEV 57 Query: 262 XXDAADEAYQKNWWKLMNQIDETGSAVSVLSSEMMKNQTIPKDLVIGTLIRFKQLQKWNL 441 DAADEA QKNW ++M +I+E+GSAVSVLSSE + NQ IPKDL++GTLIRFKQL+KW+L Sbjct: 58 FKDAADEADQKNWRRIMTEIEESGSAVSVLSSEKINNQNIPKDLLVGTLIRFKQLKKWHL 117 Query: 442 VAETLEWLWTQSWWNFGKMDFFMLITAYGKLGDFNGAEKVLALMNKNGYASNVVSQTALM 621 V E L+WL TQ+WW+FGKMDFFMLITAYGKLGDFNGAEKVL LMNKNGY NVVSQTALM Sbjct: 118 VVEILDWLRTQNWWDFGKMDFFMLITAYGKLGDFNGAEKVLGLMNKNGYVPNVVSQTALM 177 Query: 622 EAYGRGGKYNNAEAIFRRMQRFGPEPSALTYQIILKTFVRGDKFKEAEEVFNNLLDDEKS 801 EAYG+GG+YNNAEAIFRRMQ++GPEPSA TYQIILKTFV+G+K++EAEE+F+NLL+DE S Sbjct: 178 EAYGKGGRYNNAEAIFRRMQKWGPEPSAFTYQIILKTFVQGNKYREAEELFDNLLNDENS 237 Query: 802 PLKPDQKMFNMMIYMYRKAGSYEKARKTFALMAERGIEQSTVTYNSLMSFETNYKEVSNI 981 PLKPDQKMFNMMIYMY+KAGSYEKARKTFALMAERGI+Q+TVTYNSLMSFET+YKEVSNI Sbjct: 238 PLKPDQKMFNMMIYMYKKAGSYEKARKTFALMAERGIQQTTVTYNSLMSFETDYKEVSNI 297 Query: 982 YDQMQRADVRPDVVSYALLINAYGKARREEEALAVFEEMLDAGVRPTRKAYNILLDAFSI 1161 YDQMQRAD+RPDVVSYALL++AYGKARREEEALAVFEEMLDAGVRPTRKAYNILLDAFSI Sbjct: 298 YDQMQRADLRPDVVSYALLVSAYGKARREEEALAVFEEMLDAGVRPTRKAYNILLDAFSI 357 Query: 1162 SGMVDQARTVFKSMRRDKYFPDLCSYTTMLSAYVNAPDMEGAEKFFKRLIQDGIEPNVVT 1341 SGMV+QA+TVFKSMRRD+YFPDLCSYTTMLSAYVNA DMEGAEKFFKRLIQD EPNVVT Sbjct: 358 SGMVEQAQTVFKSMRRDRYFPDLCSYTTMLSAYVNADDMEGAEKFFKRLIQDDFEPNVVT 417 Query: 1342 YGTLIKGYAKSNDLEKVMEKYEEMLGRGIKANQTILTTIMDAHGKNGDFDSAVHWFKEME 1521 YGTLIKGYAK NDLE VM+KYEEML RGIKANQTILTTIMDA+GK+GDFDSAVHWFKEME Sbjct: 418 YGTLIKGYAKINDLEMVMKKYEEMLVRGIKANQTILTTIMDAYGKSGDFDSAVHWFKEME 477 Query: 1522 SSGVPPDQKAKNVLLSLARTEEEKIEVNELVQHSIENNNLLKANGAVRFV 1671 S+G+PPDQKAKNVLLSL +T+EE+ E NELV H ENN+L K NG V+ V Sbjct: 478 SNGIPPDQKAKNVLLSLPKTDEEREEANELVGHFSENNSLSKVNGIVKLV 527 Score = 28.5 bits (62), Expect(2) = 0.0 Identities = 11/18 (61%), Positives = 14/18 (77%) Frame = +3 Query: 1785 DNYEYFDAQLGRAYDEQS 1838 + YEYFDAQL RAY+ + Sbjct: 533 NKYEYFDAQLERAYEHST 550 >ref|XP_002298285.1| predicted protein [Populus trichocarpa] gi|222845543|gb|EEE83090.1| predicted protein [Populus trichocarpa] Length = 575 Score = 719 bits (1855), Expect = 0.0 Identities = 365/501 (72%), Positives = 418/501 (83%) Frame = +1 Query: 166 IAKVKMGGRMEVVCKGMLAPXXXXXXXXXXXXXXDAADEAYQKNWWKLMNQIDETGSAVS 345 IA +K+ R+EVV GML+P DA+DEA QKNW +LM QI++TGSAVS Sbjct: 7 IANIKIHRRLEVVSMGMLSPRKFLQKRRKVEVFKDASDEADQKNWRRLMKQIEDTGSAVS 66 Query: 346 VLSSEMMKNQTIPKDLVIGTLIRFKQLQKWNLVAETLEWLWTQSWWNFGKMDFFMLITAY 525 VL E +K +P+DLV+GTL+RFKQL+KW+LV+E LEWL +Q WW+F +MDF MLITAY Sbjct: 67 VLRRERIKKDGLPRDLVLGTLVRFKQLKKWDLVSEILEWLQSQHWWDFNEMDFLMLITAY 126 Query: 526 GKLGDFNGAEKVLALMNKNGYASNVVSQTALMEAYGRGGKYNNAEAIFRRMQRFGPEPSA 705 GKLGDFNGAE VL MN NGY NVVS TALMEAYGRGG+YNNAEAIFRRMQ GPEPSA Sbjct: 127 GKLGDFNGAEMVLRSMNGNGYVPNVVSHTALMEAYGRGGRYNNAEAIFRRMQTSGPEPSA 186 Query: 706 LTYQIILKTFVRGDKFKEAEEVFNNLLDDEKSPLKPDQKMFNMMIYMYRKAGSYEKARKT 885 LTYQIILKTFV G+KFKEAEEVF LL+ E SPL+PDQKMF+MMIYM +KAG+YEKARK Sbjct: 187 LTYQIILKTFVEGNKFKEAEEVFETLLNKENSPLEPDQKMFHMMIYMQKKAGNYEKARKV 246 Query: 886 FALMAERGIEQSTVTYNSLMSFETNYKEVSNIYDQMQRADVRPDVVSYALLINAYGKARR 1065 FALMAERG+ QSTVTYNSLMSFETNYKEVS IYDQMQR+ +RPDVVSYALLI AYG+ARR Sbjct: 247 FALMAERGVPQSTVTYNSLMSFETNYKEVSKIYDQMQRSGLRPDVVSYALLIKAYGRARR 306 Query: 1066 EEEALAVFEEMLDAGVRPTRKAYNILLDAFSISGMVDQARTVFKSMRRDKYFPDLCSYTT 1245 EEEALAVFEEMLDAGVRP+ KAYNILLDAF+ISGMV+QAR VFKSMRRD+ PDLCSYTT Sbjct: 307 EEEALAVFEEMLDAGVRPSHKAYNILLDAFAISGMVEQARVVFKSMRRDRCTPDLCSYTT 366 Query: 1246 MLSAYVNAPDMEGAEKFFKRLIQDGIEPNVVTYGTLIKGYAKSNDLEKVMEKYEEMLGRG 1425 MLSAYVNA DMEGAE FFKRL QDG++PNVVTYG LIKG+AK N+LEK+ME YEEM Sbjct: 367 MLSAYVNASDMEGAENFFKRLRQDGLKPNVVTYGALIKGHAKVNNLEKMMEIYEEMQLNS 426 Query: 1426 IKANQTILTTIMDAHGKNGDFDSAVHWFKEMESSGVPPDQKAKNVLLSLARTEEEKIEVN 1605 IKANQTILTTIMDA+GKN DF SAV W+KEME GVPPDQKA+N+LLSLA+T++E+ E + Sbjct: 427 IKANQTILTTIMDAYGKNKDFGSAVIWYKEMEHHGVPPDQKAQNILLSLAKTQDEQKEAS 486 Query: 1606 ELVQHSIENNNLLKANGAVRF 1668 +LV + ++ + NGA RF Sbjct: 487 QLVGYP-DDCGIQSINGASRF 506 >ref|XP_004149531.1| PREDICTED: pentatricopeptide repeat-containing protein At3g59040-like [Cucumis sativus] Length = 580 Score = 713 bits (1841), Expect = 0.0 Identities = 358/490 (73%), Positives = 413/490 (84%) Frame = +1 Query: 169 AKVKMGGRMEVVCKGMLAPXXXXXXXXXXXXXXDAADEAYQKNWWKLMNQIDETGSAVSV 348 + V + ++ V C GML P D ADEA QKNW +LMN+I+ETGSAVSV Sbjct: 30 SNVNVRRKLVVTCMGMLTPRKFLQKRKKLEVFKDEADEAEQKNWRRLMNEIEETGSAVSV 89 Query: 349 LSSEMMKNQTIPKDLVIGTLIRFKQLQKWNLVAETLEWLWTQSWWNFGKMDFFMLITAYG 528 L SE +KN+ IPKDLV+GTL+RFKQL+KWNLV+E LEWL TQSWWNF +MDF MLITAYG Sbjct: 90 LRSERIKNEAIPKDLVLGTLVRFKQLKKWNLVSEILEWLRTQSWWNFSEMDFVMLITAYG 149 Query: 529 KLGDFNGAEKVLALMNKNGYASNVVSQTALMEAYGRGGKYNNAEAIFRRMQRFGPEPSAL 708 KLGDFN AEKVL LMNK GYA NVVS TALMEAYGRG +YNNAEAIFRRMQ GPEPSAL Sbjct: 150 KLGDFNRAEKVLNLMNKKGYAPNVVSHTALMEAYGRGRRYNNAEAIFRRMQSGGPEPSAL 209 Query: 709 TYQIILKTFVRGDKFKEAEEVFNNLLDDEKSPLKPDQKMFNMMIYMYRKAGSYEKARKTF 888 TYQI+LKTFV G KFKEAEE+F++LL+ EK LKPDQKMF+M+IYM++KAG+YEKARK F Sbjct: 210 TYQIMLKTFVEGSKFKEAEELFDSLLNKEKPVLKPDQKMFHMIIYMFKKAGNYEKARKVF 269 Query: 889 ALMAERGIEQSTVTYNSLMSFETNYKEVSNIYDQMQRADVRPDVVSYALLINAYGKARRE 1068 A MA RG+ Q+TVTYNSLMSFETNYKEVS IYDQMQRA ++PDVVSYALLI+AYGKARRE Sbjct: 270 AEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGLQPDVVSYALLISAYGKARRE 329 Query: 1069 EEALAVFEEMLDAGVRPTRKAYNILLDAFSISGMVDQARTVFKSMRRDKYFPDLCSYTTM 1248 EEALAVFEEMLDAG+RPT KAYNILLDAF+ISGMV+QA+ VFKSM+RD+ PD+CSYTTM Sbjct: 330 EEALAVFEEMLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMKRDRCSPDICSYTTM 389 Query: 1249 LSAYVNAPDMEGAEKFFKRLIQDGIEPNVVTYGTLIKGYAKSNDLEKVMEKYEEMLGRGI 1428 LSAYVNA DMEGAE FF+RL QDG PNVVTYGTLIKGYAK N+LEK++++YEEM GI Sbjct: 390 LSAYVNASDMEGAENFFRRLKQDGFRPNVVTYGTLIKGYAKINNLEKMIKRYEEMKVNGI 449 Query: 1429 KANQTILTTIMDAHGKNGDFDSAVHWFKEMESSGVPPDQKAKNVLLSLARTEEEKIEVNE 1608 + NQTILTTIMDA+GKN DF SAV WF E+ES G+ PDQKAKN+LLSLA+T EE E N+ Sbjct: 450 RVNQTILTTIMDAYGKNKDFGSAVIWFNEIESCGLRPDQKAKNILLSLAKTAEELDEANQ 509 Query: 1609 LVQHSIENNN 1638 LV +S ++++ Sbjct: 510 LVGYSSQSSS 519