BLASTX nr result
ID: Glycyrrhiza23_contig00010692
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Glycyrrhiza23_contig00010692 (2332 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003539136.1| PREDICTED: uncharacterized protein LOC100796... 575 e-161 ref|XP_003517960.1| PREDICTED: uncharacterized protein LOC100783... 559 e-156 ref|XP_002320047.1| predicted protein [Populus trichocarpa] gi|2... 453 e-125 ref|XP_002301271.1| predicted protein [Populus trichocarpa] gi|2... 449 e-123 ref|XP_004140484.1| PREDICTED: uncharacterized protein LOC101203... 434 e-119 >ref|XP_003539136.1| PREDICTED: uncharacterized protein LOC100796904 [Glycine max] Length = 510 Score = 575 bits (1483), Expect = e-161 Identities = 300/407 (73%), Positives = 333/407 (81%) Frame = -3 Query: 1805 EITETTSHDLISSIFAAVSAFEASYFQLQTAHVPFVEENVKNADRVLVSHLQRLSELKKF 1626 +I ETT H LISS+FAAVSAFEASYFQLQ+AHVPFVEE+V +AD+VLVSHLQRLSELKKF Sbjct: 122 QIRETT-HALISSVFAAVSAFEASYFQLQSAHVPFVEEHVTSADKVLVSHLQRLSELKKF 180 Query: 1625 YXXXXXXXXXXXXXSLEAQVEENQNKLRTLGTVSNRLQSELEHKHDEVLSLRRKLNEIHK 1446 Y L A+VEENQ+KLRTLGTVSNRLQ ELE KHDEV++LR KL+EIH+ Sbjct: 181 YCNPEPRGFPFGSR-LGAEVEENQSKLRTLGTVSNRLQWELEQKHDEVVALRAKLDEIHR 239 Query: 1445 GNANLTKKLCATTSTTMNRGCDVLLTVRVFDSLLHDASRAAHKFAKILIGLMRKAGWDLG 1266 GN NL+KKLCA +N DVLLTV+VFDSLLHDASRA H+F KILIGLMRKAGWDLG Sbjct: 240 GNVNLSKKLCARA---LNPSSDVLLTVKVFDSLLHDASRATHRFTKILIGLMRKAGWDLG 296 Query: 1265 LVANVVHPGIEYAKKGHNQYALLSYVCLGMFQGFDLPCFGLSSNNDDNEEKGGGELEVCN 1086 L AN VHP ++YAKKGHNQYALLSYVCLG+F GFD FG+ + G G L Sbjct: 297 LAANAVHPNVDYAKKGHNQYALLSYVCLGIFHGFDSMNFGMEDGEELVVSNGHGSL---- 352 Query: 1085 
NGDLHDLGLVENLKNSCLKQLLEHVSSNPLELLGIHPGCEFSRFCERKYERLVHPSMESS 906 DL D ++ CLKQLLEHVSSNP+ELLGIHPGCEFSRFCE KYERL+HPSMESS Sbjct: 353 --DLED-------RDGCLKQLLEHVSSNPMELLGIHPGCEFSRFCEHKYERLIHPSMESS 403 Query: 905 IFVDLDQNEAVVNSWRSLSMFYEAFVGMASSVWTLHKLSHAFDPKVEIFQVERGVEFSMI 726 IFV+L++ EAV+NSWRSLSMFYEAFVGMAS+VWTLHKLS+ FDP VEIFQVERGVEFSMI Sbjct: 404 IFVNLEEKEAVLNSWRSLSMFYEAFVGMASAVWTLHKLSYTFDPTVEIFQVERGVEFSMI 463 Query: 725 YMEDVTKRLTWPNKGRAKVGFTVFPGFKIGERIVIQSQVYISNFTST 585 YMEDVTKRLTWPNKGRAKVGFTV PGF+IG R+VIQSQVYISNF T Sbjct: 464 YMEDVTKRLTWPNKGRAKVGFTVLPGFRIG-RVVIQSQVYISNFKCT 509 Score = 105 bits (263), Expect = 4e-20 Identities = 55/70 (78%), Positives = 63/70 (90%), Gaps = 2/70 (2%) Frame = -3 Query: 2183 MPEMDGIPNSKPPQISEMFQKFALAFKTKTFEFFADDENAST--IDDSDGFSLLDSAEEI 2010 MPEMDG ++KPPQISEMFQKFALAFKTKTFEFF+++EN ++ +DD DGFSLLDS EEI Sbjct: 1 MPEMDG-SSAKPPQISEMFQKFALAFKTKTFEFFSEEENNASPLLDDIDGFSLLDSTEEI 59 Query: 2009 ITDQKVVVIK 1980 ITDQKVVVIK Sbjct: 60 ITDQKVVVIK 69 >ref|XP_003517960.1| PREDICTED: uncharacterized protein LOC100783971 [Glycine max] Length = 506 Score = 559 bits (1441), Expect = e-156 Identities = 294/407 (72%), Positives = 331/407 (81%) Frame = -3 Query: 1805 EITETTSHDLISSIFAAVSAFEASYFQLQTAHVPFVEENVKNADRVLVSHLQRLSELKKF 1626 +I E T H L+SS+FAAVSAFEASYFQLQ+AHVPFVEE+V +AD+VLVSHLQRLSELK+F Sbjct: 122 QIREMT-HALVSSVFAAVSAFEASYFQLQSAHVPFVEEHVTSADKVLVSHLQRLSELKRF 180 Query: 1625 YXXXXXXXXXXXXXSLEAQVEENQNKLRTLGTVSNRLQSELEHKHDEVLSLRRKLNEIHK 1446 Y LEA+VEENQ+KLRTLGTVSNRLQ ELE KHDEV++LR KL+EIH+ Sbjct: 181 YSNSEPCGFPLGLR-LEAEVEENQSKLRTLGTVSNRLQWELEQKHDEVVALRAKLDEIHR 239 Query: 1445 GNANLTKKLCATTSTTMNRGCDVLLTVRVFDSLLHDASRAAHKFAKILIGLMRKAGWDLG 1266 GN NL+KKLCA +N DVLLTV+VFDSLL DASRA H+F KILIGLMRKAGWDLG Sbjct: 240 GNVNLSKKLCARA---LNPSSDVLLTVKVFDSLLLDASRATHRFTKILIGLMRKAGWDLG 296 Query: 1265 LVANVVHPGIEYAKKGHNQYALLSYVCLGMFQGFDLPCFGLSSNNDDNEEKGGGELEVCN 1086 L AN VHP ++YAKKGHNQYALLSYVCLGMF GFD FG+ E V Sbjct: 297 LAANAVHPNVDYAKKGHNQYALLSYVCLGMFHGFDSLNFGM-------------EEPVVL 343 
Query: 1085 NGDLHDLGLVENLKNSCLKQLLEHVSSNPLELLGIHPGCEFSRFCERKYERLVHPSMESS 906 NG DL ++ CLKQLLEHVSSNP++LLGIHPGC+FSRFCE KYERL+HPS+ESS Sbjct: 344 NGHGSDL----EDRDGCLKQLLEHVSSNPMDLLGIHPGCKFSRFCEHKYERLIHPSIESS 399 Query: 905 IFVDLDQNEAVVNSWRSLSMFYEAFVGMASSVWTLHKLSHAFDPKVEIFQVERGVEFSMI 726 IFV+L++ EAV+NSWRSLSMFYE FVGMAS+VWTLHKLS+AF+P VEIFQVERGVEFSMI Sbjct: 400 IFVNLEEKEAVLNSWRSLSMFYETFVGMASAVWTLHKLSYAFNPAVEIFQVERGVEFSMI 459 Query: 725 YMEDVTKRLTWPNKGRAKVGFTVFPGFKIGERIVIQSQVYISNFTST 585 YMEDVTKRLTWPNKGRAKVGF+V PGFKIG R+VIQSQVYISNF T Sbjct: 460 YMEDVTKRLTWPNKGRAKVGFSVLPGFKIG-RVVIQSQVYISNFRCT 505 Score = 102 bits (253), Expect = 6e-19 Identities = 54/71 (76%), Positives = 60/71 (84%), Gaps = 3/71 (4%) Frame = -3 Query: 2183 MPEMDGIPNSKPPQISEMFQKFALAFKTKTFEFFADDEN---ASTIDDSDGFSLLDSAEE 2013 MPEMDG ++KPPQISEMFQKFALAFKTKTFEFF+++EN + DD DGFSLLDS EE Sbjct: 1 MPEMDG-SSAKPPQISEMFQKFALAFKTKTFEFFSEEENNNASPLFDDIDGFSLLDSTEE 59 Query: 2012 IITDQKVVVIK 1980 II DQKVVVIK Sbjct: 60 IIPDQKVVVIK 70 >ref|XP_002320047.1| predicted protein [Populus trichocarpa] gi|222860820|gb|EEE98362.1| predicted protein [Populus trichocarpa] Length = 466 Score = 453 bits (1166), Expect = e-125 Identities = 245/412 (59%), Positives = 300/412 (72%), Gaps = 4/412 (0%) Frame = -3 Query: 1808 REITETTSHDLISSIFAAVSAFEASYFQLQTAHVPFVEENVKNADRVLVSHLQRLSELKK 1629 + + ++ LISS+FA VS+FEASY QLQ AHVPF EEN+K AD+ VS LQRLS+LK+ Sbjct: 74 KHLNTQLANTLISSVFAKVSSFEASYLQLQIAHVPFNEENIKVADKASVSVLQRLSDLKQ 133 Query: 1628 FYXXXXXXXXXXXXXS----LEAQVEENQNKLRTLGTVSNRLQSELEHKHDEVLSLRRKL 1461 Y LEAQVEENQ+KLR +GTVSN LQ+E++ K EV +L++KL Sbjct: 134 VYRDMCKNPDSGDDLPIGSCLEAQVEENQSKLRIMGTVSNSLQAEIDKKDCEVSALKKKL 193 Query: 1460 NEIHKGNANLTKKLCATTSTTMNRGCDVLLTVRVFDSLLHDASRAAHKFAKILIGLMRKA 1281 E+ K N+ L+K+L ++ +N +VLLTV+VFDS+L+DA R HKF KIL+ LMRKA Sbjct: 194 IEVQKSNSLLSKRLLSS----LNLNSEVLLTVKVFDSVLNDACRTMHKFTKILVDLMRKA 249 Query: 1280 GWDLGLVANVVHPGIEYAKKGHNQYALLSYVCLGMFQGFDLPCFGLSSNNDDNEEKGGGE 1101 GWDL L AN VH + Y K+GHN+YA LSYVCLGMF+GFDL FGL S+ GE Sbjct: 
250 GWDLDLAANSVHSDVGYVKRGHNRYAFLSYVCLGMFKGFDLEGFGLKSD---------GE 300 Query: 1100 LEVCNNGDLHDLGLVENLKNSCLKQLLEHVSSNPLELLGIHPGCEFSRFCERKYERLVHP 921 + +CN HD V++ NS LKQLLEHVSSNP+ELL ++P CEF RFCE+KY+ L+HP Sbjct: 301 I-LCNG---HDSVSVKS--NSALKQLLEHVSSNPMELLSMNPTCEFLRFCEKKYQELIHP 354 Query: 920 SMESSIFVDLDQNEAVVNSWRSLSMFYEAFVGMASSVWTLHKLSHAFDPKVEIFQVERGV 741 +MESSIF + DQNE V+NSWRSL MFYE+FV MASSVWTLHKL+ +FDP V+IFQVERGV Sbjct: 355 TMESSIFSNFDQNEFVLNSWRSLGMFYESFVNMASSVWTLHKLAFSFDPVVDIFQVERGV 414 Query: 740 EFSMIYMEDVTKRLTWPNKGRAKVGFTVFPGFKIGERIVIQSQVYISNFTST 585 +FSM+YMEDVT R T P K R KVGFTV PGFKIG R IQSQVY+ T T Sbjct: 415 DFSMVYMEDVTGRCTMPGKTRLKVGFTVVPGFKIG-RTAIQSQVYLCGSTCT 465 Score = 68.2 bits (165), Expect = 1e-08 Identities = 34/51 (66%), Positives = 41/51 (80%) Frame = -3 Query: 2132 MFQKFALAFKTKTFEFFADDENASTIDDSDGFSLLDSAEEIITDQKVVVIK 1980 MF KFALAFKTKTFEFFAD+ + D +GFSLLDSAE+ I DQKV+++K Sbjct: 1 MFSKFALAFKTKTFEFFADEIS----DADEGFSLLDSAEDFIPDQKVIILK 47 >ref|XP_002301271.1| predicted protein [Populus trichocarpa] gi|222842997|gb|EEE80544.1| predicted protein [Populus trichocarpa] Length = 483 Score = 449 bits (1155), Expect = e-123 Identities = 241/399 (60%), Positives = 297/399 (74%), Gaps = 4/399 (1%) Frame = -3 Query: 1787 SHDLISSIFAAVSAFEASYFQLQTAHVPFVEENVKNADRVLVSHLQRLSELKKFYXXXXX 1608 ++ LISS+F++VS+FEASY QLQTAHVPF EE++K AD+ LVS LQRLS+LK+ Y Sbjct: 103 ANTLISSVFSSVSSFEASYLQLQTAHVPFNEESIKVADKALVSALQRLSDLKQVYRDLCK 162 Query: 1607 XXXXXXXXS----LEAQVEENQNKLRTLGTVSNRLQSELEHKHDEVLSLRRKLNEIHKGN 1440 LEAQV+ENQ+KLR LGTVSN LQ+E++ K EV L++KL+E+ K N Sbjct: 163 NPDFGDDLPIGSCLEAQVDENQSKLRILGTVSNSLQAEIDQKDSEVSVLKKKLSEVQKFN 222 Query: 1439 ANLTKKLCATTSTTMNRGCDVLLTVRVFDSLLHDASRAAHKFAKILIGLMRKAGWDLGLV 1260 + +K+LC++ +N +VLLTV+VFDS+L+DA R HKF KIL+ LMRKA WDL L Sbjct: 223 SLSSKRLCSS----LNLNSEVLLTVKVFDSVLNDACRTMHKFTKILVDLMRKARWDLDLA 278 Query: 1259 ANVVHPGIEYAKKGHNQYALLSYVCLGMFQGFDLPCFGLSSNNDDNEEKGGGELEVCNNG 1080 AN VH ++Y K+GHN+YA LSYV L M++GF+L FGL S GE+ CN Sbjct: 279 
ANSVHSDVDYVKRGHNRYAFLSYVSLVMYKGFNLEGFGLESE---------GEVS-CNK- 327 Query: 1079 DLHDLGLVENLKNSCLKQLLEHVSSNPLELLGIHPGCEFSRFCERKYERLVHPSMESSIF 900 LGL NS LKQLLEHVSSNP+ELL +P CEFSRFCE+KY+ L+HP+MESSIF Sbjct: 328 ----LGLDSVKSNSSLKQLLEHVSSNPMELLSRNPTCEFSRFCEKKYQELMHPAMESSIF 383 Query: 899 VDLDQNEAVVNSWRSLSMFYEAFVGMASSVWTLHKLSHAFDPKVEIFQVERGVEFSMIYM 720 +LDQNE V+NSWRSLSMFYE+FV M+SSVWTLHKL+ +FDP V+IFQVERGV+FS +YM Sbjct: 384 SNLDQNEVVLNSWRSLSMFYESFVNMSSSVWTLHKLAFSFDPVVDIFQVERGVDFSTVYM 443 Query: 719 EDVTKRLTWPNKGRAKVGFTVFPGFKIGERIVIQSQVYI 603 EDVT+R T PNK R KVGFTV PGFKIG R VIQSQVY+ Sbjct: 444 EDVTRRCTMPNKTRLKVGFTVVPGFKIG-RTVIQSQVYL 481 Score = 85.1 bits (209), Expect = 8e-14 Identities = 43/60 (71%), Positives = 49/60 (81%), Gaps = 2/60 (3%) Frame = -3 Query: 2153 KPPQISEMFQKFALAFKTKTFEFFADDENAS--TIDDSDGFSLLDSAEEIITDQKVVVIK 1980 K PQISEMF KFALAFKTKTFEFFAD+ A+ T D DGFSLLDSAE+ I DQKV+++K Sbjct: 10 KQPQISEMFSKFALAFKTKTFEFFADETTAADETTDVDDGFSLLDSAEDFIPDQKVIILK 69 >ref|XP_004140484.1| PREDICTED: uncharacterized protein LOC101203555 [Cucumis sativus] gi|449505090|ref|XP_004162373.1| PREDICTED: uncharacterized protein LOC101226600 [Cucumis sativus] Length = 494 Score = 434 bits (1115), Expect = e-119 Identities = 230/396 (58%), Positives = 287/396 (72%), Gaps = 4/396 (1%) Frame = -3 Query: 1778 LISSIFAAVSAFEASYFQLQTAHVPFVEENVKNADRVLVSHLQRLSELKKFYXXXXXXXX 1599 L+SSIFA VS+FEASY QLQTAHVPFVEE V ADRVLVSH ++LS+LK FY Sbjct: 111 LVSSIFATVSSFEASYIQLQTAHVPFVEEKVTAADRVLVSHFKQLSDLKFFYKDFRTNPE 170 Query: 1598 XXXXXS----LEAQVEENQNKLRTLGTVSNRLQSELEHKHDEVLSLRRKLNEIHKGNANL 1431 LEAQV+ENQ+KLR LGTVS+R QSE++ K EV++LR+KL E+ K N L Sbjct: 171 EDISIPVGSCLEAQVQENQSKLRVLGTVSDRAQSEIDRKDSEVMALRKKLGELQKSNLRL 230 Query: 1430 TKKLCATTSTTMNRGCDVLLTVRVFDSLLHDASRAAHKFAKILIGLMRKAGWDLGLVANV 1251 +KKL S ++N CDVLL+VRVFDS+LHDA RAA+ F+K+L+ LM+KA WD+ L AN Sbjct: 231 SKKL----SASLNAPCDVLLSVRVFDSILHDACRAAYNFSKVLMELMKKASWDMDLAANS 286 Query: 1250 VHPGIEYAKKGHNQYALLSYVCLGMFQGFDLPCFGLSSNNDDNEEKGGGELEVCNNGDLH 1071 VH I YAKK H +YA 
LSYVCL MF+ FD +G++ C + Sbjct: 287 VHCEIRYAKKAHIRYAFLSYVCLWMFRSFDSEVYGVTETES-----------FCTEQSQN 335 Query: 1070 DLGLVENLKNSCLKQLLEHVSSNPLELLGIHPGCEFSRFCERKYERLVHPSMESSIFVDL 891 G+ + LKQLLEHVSSNP+ELL ++P C F++FCE+KY+ L+HP+MESSIF +L Sbjct: 336 FDGI-----SISLKQLLEHVSSNPMELLSVNPQCAFAKFCEKKYQELIHPTMESSIFSNL 390 Query: 890 DQNEAVVNSWRSLSMFYEAFVGMASSVWTLHKLSHAFDPKVEIFQVERGVEFSMIYMEDV 711 D+ EA++NSWRS+S+FY++FV MASSVW LHKL+ +FDP VEIFQVERG EFSM++MEDV Sbjct: 391 DRKEAILNSWRSVSVFYKSFVKMASSVWMLHKLAFSFDPIVEIFQVERGAEFSMVFMEDV 450 Query: 710 TKRLTWPNKGRAKVGFTVFPGFKIGERIVIQSQVYI 603 T+R P K RAKVGFTV PGFKIG + VIQSQVY+ Sbjct: 451 TRRYIPPFKSRAKVGFTVVPGFKIG-KTVIQSQVYL 485 Score = 103 bits (257), Expect = 2e-19 Identities = 55/68 (80%), Positives = 58/68 (85%) Frame = -3 Query: 2183 MPEMDGIPNSKPPQISEMFQKFALAFKTKTFEFFADDENASTIDDSDGFSLLDSAEEIIT 2004 M +MDG N K PQIS+MFQKFALAFKTKTFEFFADD+ DDSDGFSLLDSAEEIIT Sbjct: 1 MCDMDGSSNYKTPQISQMFQKFALAFKTKTFEFFADDD---APDDSDGFSLLDSAEEIIT 57 Query: 2003 DQKVVVIK 1980 DQKVVVIK Sbjct: 58 DQKVVVIK 65
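The hit summary near the top of this report uses the legacy NCBI BLAST notation in which an E-value such as "e-161" is written without its leading digit and stands for 1e-161. The following is a minimal sketch of reading those summary lines, assuming they are available one hit per line. The names HIT_RE, parse_evalue and parse_summary are illustrative only and are not part of BLAST or of any library.

```python
import re

# Summary lines copied from the report above (descriptions are truncated by BLAST).
SUMMARY = """\
ref|XP_003539136.1| PREDICTED: uncharacterized protein LOC100796... 575 e-161
ref|XP_003517960.1| PREDICTED: uncharacterized protein LOC100783... 559 e-156
ref|XP_002320047.1| predicted protein [Populus trichocarpa] gi|2... 453 e-125
ref|XP_002301271.1| predicted protein [Populus trichocarpa] gi|2... 449 e-123
ref|XP_004140484.1| PREDICTED: uncharacterized protein LOC101203... 434 e-119
"""

# Hypothetical pattern: identifier, free-text description, bit score, E-value.
HIT_RE = re.compile(
    r"^(?P<id>\S+)\s+(?P<desc>.*?)\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)$"
)


def parse_evalue(text):
    """Convert a BLAST E-value string to float, restoring a dropped leading 1."""
    return float("1" + text) if text.startswith("e") else float(text)


def parse_summary(block):
    """Return one dict per summary line that matches HIT_RE."""
    hits = []
    for line in block.splitlines():
        match = HIT_RE.match(line.strip())
        if match:
            hits.append(
                {
                    "id": match.group("id"),
                    "description": match.group("desc"),
                    "bits": float(match.group("bits")),
                    "evalue": parse_evalue(match.group("evalue")),
                }
            )
    return hits


if __name__ == "__main__":
    for hit in parse_summary(SUMMARY):
        print(hit["id"], hit["bits"], hit["evalue"])
```

The same parse_evalue helper also accepts the fully written values on the per-HSP "Expect =" lines, such as 4e-20. For production use, an established BLAST report parser is likely a safer choice than a hand-written regular expression.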