BLASTX nr result
ID: Glycyrrhiza23_contig00000429
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Glycyrrhiza23_contig00000429 (2497 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003550222.1| PREDICTED: uncharacterized protein LOC100796... 996 0.0 ref|XP_003544580.1| PREDICTED: uncharacterized protein LOC100799... 980 0.0 ref|XP_003523656.1| PREDICTED: pentatricopeptide repeat-containi... 859 0.0 ref|XP_003527761.1| PREDICTED: pentatricopeptide repeat-containi... 886 0.0 ref|XP_002516159.1| pentatricopeptide repeat-containing protein,... 814 0.0 >ref|XP_003550222.1| PREDICTED: uncharacterized protein LOC100796128 isoform 1 [Glycine max] gi|356563956|ref|XP_003550223.1| PREDICTED: uncharacterized protein LOC100796128 isoform 2 [Glycine max] Length = 573 Score = 996 bits (2575), Expect = 0.0 Identities = 500/573 (87%), Positives = 527/573 (91%) Frame = +2 Query: 365 KKGSQTDTKLSSPNKATALNPNAAEFIPFALRSLPSGSTSSVDATTRITTAGSLGKAVLD 544 KKGSQTDTKLSS NKAT LNPNAAEF+PFALRS PSGSTSSVDA R TTAGSLGKAVLD Sbjct: 5 KKGSQTDTKLSSLNKATYLNPNAAEFVPFALRSSPSGSTSSVDAAARFTTAGSLGKAVLD 64 Query: 545 RXXXXXXXXXXXXAHQYWRCQLPDDITPDFKVMGEDDSQGLDNLSLAGLSIHDDNESSMF 724 R AHQYWRCQLPDDITPDFKVMGED+SQGL+NLSLAGLSI+DDNESSMF Sbjct: 65 RSESSISNNSDDEAHQYWRCQLPDDITPDFKVMGEDESQGLNNLSLAGLSINDDNESSMF 124 Query: 725 PSSKGSRYMLNEQQELSQQHLNGNTFADKLRFSNSTYREEPSSASFLNTLAKPWDRQIGN 904 PSSKGSRY+LNEQ ELS QHLNGNTFADKLRFSNSTYREEPSS S LN+ AKPWDRQIGN Sbjct: 125 PSSKGSRYILNEQLELSPQHLNGNTFADKLRFSNSTYREEPSSGSILNSSAKPWDRQIGN 184 Query: 905 TNLHVSSGQEALAYDDNASHGFLNDVLAENAIMDDTDFNPLEFLASLFPGFASESLAEVF 1084 T+LHV+SGQE L YD+N+ HGFLNDV A N++++DTD NPLEFLASLFPGFASESLAEVF Sbjct: 185 TDLHVTSGQEELVYDENSGHGFLNDVFAGNSLVNDTDLNPLEFLASLFPGFASESLAEVF 244 Query: 1085 FANGCDLHLTTEMLTQLEIQVDGNFSQNPSPKTLSAPNLTAMDFPALTSTNGQTTTAKYA 1264 FAN CDLHLT EMLTQLEIQVDG F+QNPSPKTLS+PNL+AMDFPALTS+NGQ T+KYA Sbjct: 245 FANACDLHLTIEMLTQLEIQVDGGFNQNPSPKTLSSPNLSAMDFPALTSSNGQ-NTSKYA 303 Query: 1265 ADNVQQSGNPYLSSDKDMLMFKSSSSIPSRGAIDFASAVRKMASQDSGIWKYDRNGSGDA 1444 ADNVQQSG PY+SSDKDMLMFKS SSIPSRG++DFASAVRK+ASQDSGIWKYD+NGSGDA Sbjct: 304 ADNVQQSGIPYISSDKDMLMFKSGSSIPSRGSVDFASAVRKLASQDSGIWKYDKNGSGDA 363 Query: 1445 STGSSRSLNVLASAYNGGQGRANFGDRLQNRGSGRAAPVWLETGDTVANMYSELREEARD 1624 STGSSR LN LASAYNGGQGR N GDRLQ+RGS RAAPVWLETGD VANMYSELREEARD Sbjct: 364 STGSSRGLNALASAYNGGQGRVNIGDRLQSRGSARAAPVWLETGDAVANMYSELREEARD 423 Query: 1625 HARLRNAYFEQARQAYLIGNKALAKELSVKGQLHNMHMKAAHGKAQESIYRQRNPVGPEM 1804 HARLRNAYFEQARQAYLIGNKALAKELSVKGQLHNMHMKAAHGKAQESIYRQRNPV PE Sbjct: 424 HARLRNAYFEQARQAYLIGNKALAKELSVKGQLHNMHMKAAHGKAQESIYRQRNPVAPE- 482 Query: 1805 QLNGRGHERMIDLHGLHVSEAIHVLKHELSVLRSTARAAEQRLQVYICVGTGHHTRGSRT 1984 NGRGH+RMIDLHGLHVSEAIHVLKHELSVLRSTARAAEQRLQVYICVGTGHHTRGSRT Sbjct: 483 --NGRGHQRMIDLHGLHVSEAIHVLKHELSVLRSTARAAEQRLQVYICVGTGHHTRGSRT 540 Query: 1985 PARLPIAVQRYLLEEEGLDFTEPQPGLLRVVLY 2083 PARLPIAVQRYLLEEEGLDFTEPQPGLLRVV+Y Sbjct: 541 PARLPIAVQRYLLEEEGLDFTEPQPGLLRVVIY 573 >ref|XP_003544580.1| PREDICTED: uncharacterized protein LOC100799961 [Glycine max] Length = 572 Score = 980 bits (2534), Expect = 0.0 Identities = 496/573 (86%), Positives = 522/573 (91%) Frame = +2 Query: 365 KKGSQTDTKLSSPNKATALNPNAAEFIPFALRSLPSGSTSSVDATTRITTAGSLGKAVLD 544 KKGSQTD KLSS NKAT LNPNAAEF+PFALRS PSGSTS VDAT R AGSLGKAVLD Sbjct: 5 KKGSQTDAKLSSLNKATYLNPNAAEFVPFALRSSPSGSTSLVDATARFAAAGSLGKAVLD 64 Query: 545 RXXXXXXXXXXXXAHQYWRCQLPDDITPDFKVMGEDDSQGLDNLSLAGLSIHDDNESSMF 724 R AHQYWRCQLPDDITPDFKVMGED+SQGL+NLSLAGLSI+DDNESSMF Sbjct: 65 RAESSISNNSDDEAHQYWRCQLPDDITPDFKVMGEDESQGLNNLSLAGLSINDDNESSMF 124 Query: 725 PSSKGSRYMLNEQQELSQQHLNGNTFADKLRFSNSTYREEPSSASFLNTLAKPWDRQIGN 904 PSSKG RY+LNEQQELSQQHLNGNTFADKLRFSNSTYR+EPSSAS LN+ AKPWDRQI N Sbjct: 125 PSSKGFRYILNEQQELSQQHLNGNTFADKLRFSNSTYRDEPSSASILNSSAKPWDRQIRN 184 Query: 905 TNLHVSSGQEALAYDDNASHGFLNDVLAENAIMDDTDFNPLEFLASLFPGFASESLAEVF 1084 T+LHVSSGQEAL YDDN HGF NDV A N++++DTD NPLEFLASLFPGFASESL+EVF Sbjct: 185 TDLHVSSGQEALVYDDNTGHGFFNDVFAGNSLVNDTDLNPLEFLASLFPGFASESLSEVF 244 Query: 1085 FANGCDLHLTTEMLTQLEIQVDGNFSQNPSPKTLSAPNLTAMDFPALTSTNGQTTTAKYA 1264 FANGCDLHLT EMLTQLEIQVD +F+QNPSPKTLS+PNL+AMDFPALTS+NGQ +KYA Sbjct: 245 FANGCDLHLTIEMLTQLEIQVDSSFNQNPSPKTLSSPNLSAMDFPALTSSNGQ-NASKYA 303 Query: 1265 ADNVQQSGNPYLSSDKDMLMFKSSSSIPSRGAIDFASAVRKMASQDSGIWKYDRNGSGDA 1444 ADNVQQSGNPYLSSDKDMLMFKS SSIPSRGA+DFASAVRK+ASQDSGIWKYD+NGSGDA Sbjct: 304 ADNVQQSGNPYLSSDKDMLMFKSGSSIPSRGAVDFASAVRKLASQDSGIWKYDKNGSGDA 363 Query: 1445 STGSSRSLNVLASAYNGGQGRANFGDRLQNRGSGRAAPVWLETGDTVANMYSELREEARD 1624 STGSSRSLN LASAYNGGQGR N GDRLQNRGS RAAPVWLETGD VANMYSELREEARD Sbjct: 364 STGSSRSLNALASAYNGGQGRVNNGDRLQNRGSARAAPVWLETGDAVANMYSELREEARD 423 Query: 1625 HARLRNAYFEQARQAYLIGNKALAKELSVKGQLHNMHMKAAHGKAQESIYRQRNPVGPEM 1804 HARLRNAYFEQARQAYL+GNKALAKELSVKGQLHN+HMKAAHGKAQESIYRQRNPV PE Sbjct: 424 HARLRNAYFEQARQAYLVGNKALAKELSVKGQLHNVHMKAAHGKAQESIYRQRNPVAPE- 482 Query: 1805 QLNGRGHERMIDLHGLHVSEAIHVLKHELSVLRSTARAAEQRLQVYICVGTGHHTRGSRT 1984 NGRG +RMIDLHGLHVSEAIHVLKHELSVLRSTARA EQRLQVYICVGTGHHTRGSRT Sbjct: 483 --NGRGPQRMIDLHGLHVSEAIHVLKHELSVLRSTARAPEQRLQVYICVGTGHHTRGSRT 540 Query: 1985 PARLPIAVQRYLLEEEGLDFTEPQPGLLRVVLY 2083 PARLPIAVQRYLL EEGLDFTEPQPGLL VV+Y Sbjct: 541 PARLPIAVQRYLL-EEGLDFTEPQPGLLCVVIY 572 >ref|XP_003523656.1| PREDICTED: pentatricopeptide repeat-containing protein At4g33170-like [Glycine max] Length = 1611 Score = 859 bits (2219), Expect(2) = 0.0 Identities = 433/564 (76%), Positives = 488/564 (86%), Gaps = 3/564 (0%) Frame = +2 Query: 389 KLSSPNKATALNPNAAEFIPFALRSLP-SGSTSSVDATTRITTAGSLGKAVLDRXXXXXX 565 K NKAT LNPNAAEF+ FALRS SG+TSSVDAT +++ AG+LGKAV DR Sbjct: 50 KAIKENKATTLNPNAAEFVSFALRSSSLSGTTSSVDATAKLSAAGALGKAVFDRSESSIS 109 Query: 566 XXXXXXAHQYWRCQLPDDITPDFKVMGEDDSQGLDNLSLAGLSIHDDNESSMFPSSKGSR 745 HQYWRCQLPDDITPDFKVMGED+S+ LD+LSLAGLSIHDDNE+S FPSSKGS+ Sbjct: 110 NNSDDEVHQYWRCQLPDDITPDFKVMGEDESRVLDDLSLAGLSIHDDNEASRFPSSKGSK 169 Query: 746 YMLNEQQELSQQHLNGNTFADKLRFSNSTYREEPSSASFLNTLAKPWDRQIGNTNLHVSS 925 Y++NEQ+E+SQQH+NGN+ ADKL FSNS+YRE+PSS SFLN LAKPW+ IG+ + +SS Sbjct: 170 YIINEQEEISQQHVNGNSLADKLGFSNSSYREDPSSGSFLNVLAKPWEGPIGSADQCISS 229 Query: 926 GQEALAYDDNASHGFLNDVLAENAIMDDTDFNPLEFLASLFPGFASESLAEVFFANGCDL 1105 GQE L YDDN+ HG+LNDVL ENAI+DDTD NPLEFLASLFPGFA+ESLAE +FANGCDL Sbjct: 230 GQEGLTYDDNSIHGYLNDVLVENAIVDDTDLNPLEFLASLFPGFAAESLAEAYFANGCDL 289 Query: 1106 HLTTEMLTQLEIQVDGNFSQNPSPKTLSAPNLTAMDFPALTSTNGQTTTAKYAADNVQQS 1285 HLTTEML QLE+QVDG F+QN + KTLSAPNL+AMD+PALTS +GQT + KY DNVQQS Sbjct: 290 HLTTEMLNQLELQVDGGFNQNLNSKTLSAPNLSAMDYPALTSPDGQTASVKYVVDNVQQS 349 Query: 1286 GNPYLSSDKDMLMFKSSSSIPS-RGAIDFASAVRKMASQDS-GIWKYDRNGSGDASTGSS 1459 GNPY S D D+L+FKS SS+PS GAI+FASAVRK+ASQDS GIWKY++NGSGDA+ GSS Sbjct: 350 GNPYRSFDSDVLLFKSGSSVPSGGGAIEFASAVRKLASQDSGGIWKYEKNGSGDAAIGSS 409 Query: 1460 RSLNVLASAYNGGQGRANFGDRLQNRGSGRAAPVWLETGDTVANMYSELREEARDHARLR 1639 R+ NVLAS YNGGQGRA+F DRLQN GS RAAPVWLET D VANM+SELREEARDHA LR Sbjct: 410 RTSNVLASDYNGGQGRAHFVDRLQNVGSARAAPVWLETSDAVANMFSELREEARDHACLR 469 Query: 1640 NAYFEQARQAYLIGNKALAKELSVKGQLHNMHMKAAHGKAQESIYRQRNPVGPEMQLNGR 1819 NAYFEQA+QAYLIG+KALAKELS KGQLHNMHMKAAHGKAQESIYRQRNPV PE+Q NGR Sbjct: 470 NAYFEQAQQAYLIGDKALAKELSAKGQLHNMHMKAAHGKAQESIYRQRNPVAPEVQGNGR 529 Query: 1820 GHERMIDLHGLHVSEAIHVLKHELSVLRSTARAAEQRLQVYICVGTGHHTRGSRTPARLP 1999 G+ER++DLHGLH SEAIHVLKHELSVL+STA AAEQRLQVYI VGTGHHTRGSRTPARLP Sbjct: 530 GNERIVDLHGLHASEAIHVLKHELSVLKSTAIAAEQRLQVYILVGTGHHTRGSRTPARLP 589 Query: 2000 IAVQRYLLEEEGLDFTEPQPGLLR 2071 IAVQR+LL EEG+DF E QPGLLR Sbjct: 590 IAVQRFLL-EEGIDFMETQPGLLR 612 Score = 56.6 bits (135), Expect(2) = 0.0 Identities = 27/39 (69%), Positives = 30/39 (76%) Frame = +1 Query: 262 KGLVEAVWKFGVTVFETHLCFSHFVHILVNNELIQERIP 378 +GLVEA W VT F+ LC S+FVHILVNNELIQER P Sbjct: 8 QGLVEADWTCAVTTFKVCLCCSYFVHILVNNELIQERTP 46 >ref|XP_003527761.1| PREDICTED: pentatricopeptide repeat-containing protein At4g33170-like [Glycine max] Length = 1582 Score = 886 bits (2290), Expect = 0.0 Identities = 445/574 (77%), Positives = 500/574 (87%), Gaps = 3/574 (0%) Frame = +2 Query: 365 KKGSQTDTKLSSPNKATALNPNAAEFIPFALRSLP-SGSTSSVDATTRITTAGSLGKAVL 541 KKG TD KL SPNKAT LNPNAAEF+PFALRS SG+TS VDAT R+T AG+LGKAVL Sbjct: 5 KKGPLTDAKLLSPNKATTLNPNAAEFVPFALRSSSLSGTTSLVDATARLTAAGTLGKAVL 64 Query: 542 DRXXXXXXXXXXXXAHQYWRCQLPDDITPDFKVMGEDDSQGLDNLSLAGLSIHDDNESSM 721 DR H+YWRCQLPDDITPDFKV+GED+S+ LD++SLAGLSIHDDNE+S Sbjct: 65 DRSESSISNNSDDEVHRYWRCQLPDDITPDFKVLGEDESRVLDDISLAGLSIHDDNEASR 124 Query: 722 FPSSKGSRYMLNEQQELSQQHLNGNTFADKLRFSNSTYREEPSSASFLNTLAKPWDRQIG 901 FPSSKGS+Y++NEQ+E+SQQH NGN+ ADKL FSNS+YRE+PSS SFLN LAKPW+R IG Sbjct: 125 FPSSKGSKYIINEQEEISQQHANGNSLADKLGFSNSSYREDPSSGSFLNALAKPWERPIG 184 Query: 902 NTNLHVSSGQEALAYDDNASHGFLNDVLAENAIMDDTDFNPLEFLASLFPGFASESLAEV 1081 + + ++SGQE L YDDN+ HG+LND+LAENAI+DDTD NPLEFLASLFPGFA+ESLAE Sbjct: 185 SADQRINSGQEGLTYDDNSRHGYLNDILAENAIVDDTDLNPLEFLASLFPGFAAESLAEA 244 Query: 1082 FFANGCDLHLTTEMLTQLEIQVDGNFSQNPSPKTLSAPNLTAMDFPALTSTNGQTTTAKY 1261 +FAN CDLHLTTEML QLE+QVDG F+QN + KTLSAPNL+AMD+PALTS +GQT + +Y Sbjct: 245 YFANRCDLHLTTEMLNQLELQVDGGFNQNLNSKTLSAPNLSAMDYPALTSPDGQTASVEY 304 Query: 1262 AADNVQQSGNPYLSSDKDMLMFKSSSSIPSR-GAIDFASAVRKMASQDS-GIWKYDRNGS 1435 DNVQQSGNPY S D D+L+FKS SS+ SR GAIDFASAVRK+AS+DS GIWKYD+NGS Sbjct: 305 VVDNVQQSGNPYRSYDSDVLLFKSVSSVSSRGGAIDFASAVRKLASRDSGGIWKYDKNGS 364 Query: 1436 GDASTGSSRSLNVLASAYNGGQGRANFGDRLQNRGSGRAAPVWLETGDTVANMYSELREE 1615 GDA+ GSSR+ NVLAS YNGGQGRA+FGDRLQNRGS RA PVWLETGD VANMYSELREE Sbjct: 365 GDAAIGSSRNSNVLASGYNGGQGRAHFGDRLQNRGSARAFPVWLETGDAVANMYSELREE 424 Query: 1616 ARDHARLRNAYFEQARQAYLIGNKALAKELSVKGQLHNMHMKAAHGKAQESIYRQRNPVG 1795 ARDHA LRNAYFEQA+QAYLIGNKALAKELS KGQLHNMHMK AHGKAQESIY QRNPV Sbjct: 425 ARDHACLRNAYFEQAQQAYLIGNKALAKELSAKGQLHNMHMKVAHGKAQESIYLQRNPVA 484 Query: 1796 PEMQLNGRGHERMIDLHGLHVSEAIHVLKHELSVLRSTARAAEQRLQVYICVGTGHHTRG 1975 PE+Q +GRG+ER+IDLHGLH SEAIHVLKHELSVL+STA AAEQRLQVYI VGTGHHTRG Sbjct: 485 PELQGDGRGNERIIDLHGLHASEAIHVLKHELSVLKSTAIAAEQRLQVYILVGTGHHTRG 544 Query: 1976 SRTPARLPIAVQRYLLEEEGLDFTEPQPGLLRVV 2077 SRTPARLPIAVQR+LL EEG+DFTE QPGLLRVV Sbjct: 545 SRTPARLPIAVQRFLL-EEGIDFTETQPGLLRVV 577 >ref|XP_002516159.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223544645|gb|EEF46161.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 1439 Score = 814 bits (2102), Expect = 0.0 Identities = 416/576 (72%), Positives = 487/576 (84%), Gaps = 7/576 (1%) Frame = +2 Query: 365 KKGSQTDT-KLSSPNKATALNPNAAEFIPFALRSL--PSGSTSSVDATT-RITTAGSLGK 532 KKGSQT+T KL+ P+KATALNPNAAEF+PF+LRSL PSGSTS+ ATT R T+G +GK Sbjct: 5 KKGSQTNTTKLNIPSKATALNPNAAEFVPFSLRSLSSPSGSTSNAAATTARFATSGPVGK 64 Query: 533 AVLDRXXXXXXXXXXXXAHQYWRCQLPDDITPDFKVMGEDDSQGLDNLSLAGLSIHDDNE 712 AVLDR AHQ+WR QLPDDITPDFKVMGED+SQ L LSLAGLS+HD +E Sbjct: 65 AVLDRSESSISTTSDEEAHQFWRHQLPDDITPDFKVMGEDESQSLGGLSLAGLSLHDSSE 124 Query: 713 SSMFPSSKGSRYMLNEQQELSQQHLNGNTFADKLRFSNSTYREEPSSAS-FLNTLAKPWD 889 FP+S GS Y+L EQQE S +H+NG++F++K+R++ ++Y E+P+SA+ +LN KPWD Sbjct: 125 VPKFPASVGSGYILTEQQEPSPRHINGSSFSEKMRYAIASYGEDPTSAAGYLNLPTKPWD 184 Query: 890 RQIGNTNLHVSSGQEALAYDDNASHGFLNDVLAENAIMDDTDFNPLEFLASLFPGFASES 1069 +QI N + + +G+E Y+ N+ GF+ND+L E AI+D+ D NPL+FLAS FPGFA+ES Sbjct: 185 KQIINNDHLLGNGREVHPYNGNSRRGFMNDMLGEQAIVDEPDMNPLDFLASHFPGFAAES 244 Query: 1070 LAEVFFANGCDLHLTTEMLTQLEIQVDGNFSQNPSPKTLSAPNLTAMDFPALTSTNGQTT 1249 LAEV+FANG DL+LT EMLTQLE+QVDG F+QN + K LSAPNL+AMDFPAL + Q + Sbjct: 245 LAEVYFANGYDLNLTIEMLTQLELQVDGGFNQNMNSKALSAPNLSAMDFPALPVPDSQNS 304 Query: 1250 TAKYAADNVQQSGNPYLSSDKD-MLMFKSSSSIPSRG-AIDFASAVRKMASQDSGIWKYD 1423 +KY+ D++QQSGNPY SSDK+ +L+FKSSSS PSRG AIDFASAVRK+ASQDSGIWKY+ Sbjct: 305 PSKYSGDDIQQSGNPYRSSDKENILLFKSSSSTPSRGGAIDFASAVRKLASQDSGIWKYE 364 Query: 1424 RNGSGDASTGSSRSLNVLASAYNGGQGRANFGDRLQNRGSGRAAPVWLETGDTVANMYSE 1603 RNGS D++ GSSRS +VLAS+Y+ G GR + +R QNRGS RAAPVWLETG+ VANMYSE Sbjct: 365 RNGSADSAVGSSRSSHVLASSYSSGNGRGIYSERAQNRGSARAAPVWLETGEAVANMYSE 424 Query: 1604 LREEARDHARLRNAYFEQARQAYLIGNKALAKELSVKGQLHNMHMKAAHGKAQESIYRQR 1783 LREEARDHARLRNAYFEQARQAYLIGNKALAKELSVKGQLHNMHMKAAHGKAQESIYR R Sbjct: 425 LREEARDHARLRNAYFEQARQAYLIGNKALAKELSVKGQLHNMHMKAAHGKAQESIYRLR 484 Query: 1784 NPVGPEMQLNGRGHERMIDLHGLHVSEAIHVLKHELSVLRSTARAAEQRLQVYICVGTGH 1963 NP+ EMQ NGRGHERMIDLHGLHVSEAIHVLKHELSVLRSTARAA+QRLQVYICVGTGH Sbjct: 485 NPISSEMQGNGRGHERMIDLHGLHVSEAIHVLKHELSVLRSTARAADQRLQVYICVGTGH 544 Query: 1964 HTRGSRTPARLPIAVQRYLLEEEGLDFTEPQPGLLR 2071 HTRGSRTPARLPIAVQ+YLLEEEGLD+TEPQPGLLR Sbjct: 545 HTRGSRTPARLPIAVQQYLLEEEGLDYTEPQPGLLR 580