BLASTX nr result

ID: Glycyrrhiza23_contig00008494 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00008494
         (1621 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AFK40295.1| unknown [Medicago truncatula]                          634   e-179
ref|XP_003589780.1| hypothetical protein MTR_1g039190 [Medicago ...   634   e-179
ref|NP_001239662.1| uncharacterized protein LOC100806152 [Glycin...   618   e-174
ref|XP_002310360.1| predicted protein [Populus trichocarpa] gi|2...   607   e-171
ref|XP_002513137.1| conserved hypothetical protein [Ricinus comm...   582   e-163

>gb|AFK40295.1| unknown [Medicago truncatula]
          Length = 428

 Score =  634 bits (1636), Expect = e-179
 Identities = 314/415 (75%), Positives = 346/415 (83%), Gaps = 1/415 (0%)
 Frame = +1

Query: 172  ARPFLRGELESIDKDLPSLVGILRSVGAGECWHKHGSFLDHLVDIYRILNLWKSPRPVSL 351
            A+PFLR EL SID  LPSL+ ILRSVGA ECWHKHG+FL+HL+DI+RIL+LWKSP  VSL
Sbjct: 22   AKPFLRNELISIDPKLPSLITILRSVGASECWHKHGTFLEHLIDIFRILHLWKSPYSVSL 81

Query: 352  CGLFHSAYSNSYVNLAIFDPSTAREFVRGHVGHDAERLIHLFCVVPRQTIIHDNLLFHYS 531
            CGLFHSAYSNSYVNLAIFDPST+RE VRGHVG +AERLIHLFCVVPRQ++IHD+LLFHYS
Sbjct: 82   CGLFHSAYSNSYVNLAIFDPSTSREVVRGHVGVEAERLIHLFCVVPRQSLIHDDLLFHYS 141

Query: 532  DSELVQHLGESEVSVRNAKEKGIFETKEGVCWRKKLQGLVPADGVVVKHIRTGEDXXXXX 711
            D EL   L +SE+S+RNAKEKGIF   E   WRKKLQGLVPADG+ VKHIRTGED     
Sbjct: 142  DKELCHDLEKSELSLRNAKEKGIFNKDES--WRKKLQGLVPADGIKVKHIRTGEDVKLSR 199

Query: 712  XXXXXFLMMTMADFCDQLFGFQDKLFNNFDGRLEFKGNNFGALWPGDGRPGLWLNSISRM 891
                 F+MMTMADFCDQLFGFQD LF NFDGRLEFKGNNFGA+WPG+G+PGLWLNSISRM
Sbjct: 200  RVVAVFVMMTMADFCDQLFGFQDMLFENFDGRLEFKGNNFGAVWPGNGKPGLWLNSISRM 259

Query: 892  GAVCNLIVREEEIFLEEKKKRVVVDG-QCYETERDEDIELVLPPVFANCTKVLDAGDQIV 1068
            GAV NLI+REEEIFLEEKKK + V G    + ERDE IELVLPPVFA CTKVLDA DQIV
Sbjct: 260  GAVYNLILREEEIFLEEKKKMLGVKGVNGVDYERDEHIELVLPPVFAKCTKVLDARDQIV 319

Query: 1069 ARDLYWEAVSINDVARXXXXXXXXXXXXXXSIEKNPFVGEPYVVLSQVYLTEGRFEEAER 1248
            ARDLYWEA+   +                 SIEKNPFVGEPYVVLSQVYLT+GRFEE E+
Sbjct: 320  ARDLYWEAMICEE------GLEKIEELLVKSIEKNPFVGEPYVVLSQVYLTKGRFEEGEK 373

Query: 1249 HAERGLTLLLEWGCPWDKRVSWEGWISWTRVLLMKAKEKSWPHSSWGILNLGLVR 1413
             AERGLTLLLEWGC WDKR+SWEGWI+WTRVLLMKAKEKSWP++SWGILNLGLV+
Sbjct: 374  EAERGLTLLLEWGCHWDKRISWEGWIAWTRVLLMKAKEKSWPNTSWGILNLGLVK 428


>ref|XP_003589780.1| hypothetical protein MTR_1g039190 [Medicago truncatula]
            gi|355478828|gb|AES60031.1| hypothetical protein
            MTR_1g039190 [Medicago truncatula]
          Length = 462

 Score =  634 bits (1636), Expect = e-179
 Identities = 314/415 (75%), Positives = 346/415 (83%), Gaps = 1/415 (0%)
 Frame = +1

Query: 172  ARPFLRGELESIDKDLPSLVGILRSVGAGECWHKHGSFLDHLVDIYRILNLWKSPRPVSL 351
            A+PFLR EL SID  LPSL+ ILRSVGA ECWHKHG+FL+HL+DI+RIL+LWKSP  VSL
Sbjct: 22   AKPFLRNELISIDPKLPSLITILRSVGASECWHKHGTFLEHLIDIFRILHLWKSPYSVSL 81

Query: 352  CGLFHSAYSNSYVNLAIFDPSTAREFVRGHVGHDAERLIHLFCVVPRQTIIHDNLLFHYS 531
            CGLFHSAYSNSYVNLAIFDPST+RE VRGHVG +AERLIHLFCVVPRQ++IHD+LLFHYS
Sbjct: 82   CGLFHSAYSNSYVNLAIFDPSTSREVVRGHVGVEAERLIHLFCVVPRQSLIHDDLLFHYS 141

Query: 532  DSELVQHLGESEVSVRNAKEKGIFETKEGVCWRKKLQGLVPADGVVVKHIRTGEDXXXXX 711
            D EL   L +SE+S+RNAKEKGIF   E   WRKKLQGLVPADG+ VKHIRTGED     
Sbjct: 142  DKELCHDLEKSELSLRNAKEKGIFNKDES--WRKKLQGLVPADGIKVKHIRTGEDVKLSR 199

Query: 712  XXXXXFLMMTMADFCDQLFGFQDKLFNNFDGRLEFKGNNFGALWPGDGRPGLWLNSISRM 891
                 F+MMTMADFCDQLFGFQD LF NFDGRLEFKGNNFGA+WPG+G+PGLWLNSISRM
Sbjct: 200  RVVAVFVMMTMADFCDQLFGFQDMLFENFDGRLEFKGNNFGAVWPGNGKPGLWLNSISRM 259

Query: 892  GAVCNLIVREEEIFLEEKKKRVVVDG-QCYETERDEDIELVLPPVFANCTKVLDAGDQIV 1068
            GAV NLI+REEEIFLEEKKK + V G    + ERDE IELVLPPVFA CTKVLDA DQIV
Sbjct: 260  GAVYNLILREEEIFLEEKKKMLGVKGVNGVDYERDEHIELVLPPVFAKCTKVLDARDQIV 319

Query: 1069 ARDLYWEAVSINDVARXXXXXXXXXXXXXXSIEKNPFVGEPYVVLSQVYLTEGRFEEAER 1248
            ARDLYWEA+   +                 SIEKNPFVGEPYVVLSQVYLT+GRFEE E+
Sbjct: 320  ARDLYWEAMICEE------GLEKIEELLVKSIEKNPFVGEPYVVLSQVYLTKGRFEEGEK 373

Query: 1249 HAERGLTLLLEWGCPWDKRVSWEGWISWTRVLLMKAKEKSWPHSSWGILNLGLVR 1413
             AERGLTLLLEWGC WDKR+SWEGWI+WTRVLLMKAKEKSWP++SWGILNLGLV+
Sbjct: 374  EAERGLTLLLEWGCHWDKRISWEGWIAWTRVLLMKAKEKSWPNTSWGILNLGLVK 428


>ref|NP_001239662.1| uncharacterized protein LOC100806152 [Glycine max]
            gi|255646517|gb|ACU23736.1| unknown [Glycine max]
          Length = 429

 Score =  618 bits (1593), Expect = e-174
 Identities = 306/415 (73%), Positives = 340/415 (81%), Gaps = 3/415 (0%)
 Frame = +1

Query: 175  RPFLRGELESIDKDLPSLVGILRSVGAGECWHKHGSFLDHLVDIYRILNLWKSPRPVSLC 354
            RPFLRG+LESIDK+LP LVG+L+SVGAGECWHKHGSFL HLVDI+RIL LWK+P  V LC
Sbjct: 23   RPFLRGDLESIDKNLPRLVGVLQSVGAGECWHKHGSFLHHLVDIFRILKLWKAPHSVCLC 82

Query: 355  GLFHSAYSNSYVNLAIFDPSTAREFVRGHVGHDAERLIHLFCVVPRQTIIHDNLLFHYSD 534
            GLFHSAYSNSYVNLAIFDPST RE VR  VG +AE LIHLFC+VPRQ +IHD+LLFHYSD
Sbjct: 83   GLFHSAYSNSYVNLAIFDPSTGREVVRALVGEEAESLIHLFCIVPRQPLIHDDLLFHYSD 142

Query: 535  SELVQHLGESEVSVRNAK--EKGIFETK-EGVCWRKKLQGLVPADGVVVKHIRTGEDXXX 705
             ELVQHL +SE+S+RNAK  EKG+F+   E   WRKKLQGLVPA+GV VKHIRTGE    
Sbjct: 143  EELVQHLAQSEISLRNAKKMEKGLFDDDGELEGWRKKLQGLVPAEGVQVKHIRTGEGVHV 202

Query: 706  XXXXXXXFLMMTMADFCDQLFGFQDKLFNNFDGRLEFKGNNFGALWPGDGRPGLWLNSIS 885
                   F+MMTMADFCDQLFGFQD LF+N +GRLEF GNNFGALWPGDG+PGLWLNSIS
Sbjct: 203  SRRIVAVFIMMTMADFCDQLFGFQDLLFDNANGRLEFSGNNFGALWPGDGKPGLWLNSIS 262

Query: 886  RMGAVCNLIVREEEIFLEEKKKRVVVDGQCYETERDEDIELVLPPVFANCTKVLDAGDQI 1065
            RMGAV  LI REEEIF++E+K++V V     + ER+EDIELVLPPVF  C KVL+AGDQI
Sbjct: 263  RMGAVYTLIAREEEIFIQERKRKVGV-AVVPDLERNEDIELVLPPVFDYCRKVLEAGDQI 321

Query: 1066 VARDLYWEAVSINDVARXXXXXXXXXXXXXXSIEKNPFVGEPYVVLSQVYLTEGRFEEAE 1245
            VARDLYWEAV                     SIEKNPFVGEPYVVLSQVYLTEGRFEEAE
Sbjct: 322  VARDLYWEAV--------CEGGSKAEELLLESIEKNPFVGEPYVVLSQVYLTEGRFEEAE 373

Query: 1246 RHAERGLTLLLEWGCPWDKRVSWEGWISWTRVLLMKAKEKSWPHSSWGILNLGLV 1410
            +HAERGL LLLEWGCPWDKR SWEGW++WTRVLLM+AK+KSWP +SWGILNLGLV
Sbjct: 374  KHAERGLKLLLEWGCPWDKRTSWEGWVAWTRVLLMRAKDKSWPQTSWGILNLGLV 428


>ref|XP_002310360.1| predicted protein [Populus trichocarpa] gi|222853263|gb|EEE90810.1|
            predicted protein [Populus trichocarpa]
          Length = 430

 Score =  607 bits (1564), Expect = e-171
 Identities = 297/414 (71%), Positives = 339/414 (81%)
 Frame = +1

Query: 172  ARPFLRGELESIDKDLPSLVGILRSVGAGECWHKHGSFLDHLVDIYRILNLWKSPRPVSL 351
            ARPFLRGELESIDK+LPSL+ +LRSVGAGECWHKHGSFLDHLV+IYRIL +WK+P  V L
Sbjct: 26   ARPFLRGELESIDKNLPSLISVLRSVGAGECWHKHGSFLDHLVEIYRILKIWKAPDSVCL 85

Query: 352  CGLFHSAYSNSYVNLAIFDPSTAREFVRGHVGHDAERLIHLFCVVPRQTIIHDNLLFHYS 531
            CGLFHSAYSNSYVNLAIFDP+T R+ VR HVG  AERLIHLFC+VPRQ++IHD+LLF YS
Sbjct: 86   CGLFHSAYSNSYVNLAIFDPNTGRDVVRNHVGEAAERLIHLFCIVPRQSLIHDDLLFKYS 145

Query: 532  DSELVQHLGESEVSVRNAKEKGIFETKEGVCWRKKLQGLVPADGVVVKHIRTGEDXXXXX 711
            D ELV+HL  SE+S+RNA EKG+F  +E   WRKKL  L+PA G+ VKHI++GED     
Sbjct: 146  DIELVEHLKASELSLRNAGEKGLFNGEES--WRKKLASLLPASGITVKHIKSGEDVLVTR 203

Query: 712  XXXXXFLMMTMADFCDQLFGFQDKLFNNFDGRLEFKGNNFGALWPGDGRPGLWLNSISRM 891
                 FL+MTMADF DQLFGFQD LF NFDGRLEF GNNFGALWPGDG+PGLW+NSISRM
Sbjct: 204  RMVGVFLLMTMADFSDQLFGFQDLLFENFDGRLEFLGNNFGALWPGDGKPGLWINSISRM 263

Query: 892  GAVCNLIVREEEIFLEEKKKRVVVDGQCYETERDEDIELVLPPVFANCTKVLDAGDQIVA 1071
            GA+ +LIVREEEIF+EE+K+     G   + ERDEDIELVL PVF NCT+VLDA +Q+VA
Sbjct: 264  GAIYSLIVREEEIFIEERKR---AGGFEVDRERDEDIELVLAPVFENCTQVLDAREQVVA 320

Query: 1072 RDLYWEAVSINDVARXXXXXXXXXXXXXXSIEKNPFVGEPYVVLSQVYLTEGRFEEAERH 1251
            RDLYWEAV   D ++              SIEKNPFVGEP+VVL Q YLT+GRFEEAE+ 
Sbjct: 321  RDLYWEAVC--DTSK--GGLERAEELLVSSIEKNPFVGEPHVVLGQFYLTKGRFEEAEKE 376

Query: 1252 AERGLTLLLEWGCPWDKRVSWEGWISWTRVLLMKAKEKSWPHSSWGILNLGLVR 1413
            AERG+TLLLEWG PWDKR+SWEGWI+W RVLLMKAKEKSWP +SWGILNLGLVR
Sbjct: 377  AERGVTLLLEWGSPWDKRMSWEGWIAWARVLLMKAKEKSWPQTSWGILNLGLVR 430


>ref|XP_002513137.1| conserved hypothetical protein [Ricinus communis]
            gi|223548148|gb|EEF49640.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 424

 Score =  582 bits (1500), Expect = e-163
 Identities = 284/413 (68%), Positives = 332/413 (80%)
 Frame = +1

Query: 175  RPFLRGELESIDKDLPSLVGILRSVGAGECWHKHGSFLDHLVDIYRILNLWKSPRPVSLC 354
            RPFLRGELES+DK+LP+L+ +LRSVGAGECWHKHGSFLDHLVDIYRIL +W +   V LC
Sbjct: 21   RPFLRGELESVDKNLPALISVLRSVGAGECWHKHGSFLDHLVDIYRILKIWNASDSVCLC 80

Query: 355  GLFHSAYSNSYVNLAIFDPSTAREFVRGHVGHDAERLIHLFCVVPRQTIIHDNLLFHYSD 534
            GLFHSAYSNSYVNLAIFDP+T R+ VRGHVG  AERLIHLFC+VPRQ +IHD+LLF+YSD
Sbjct: 81   GLFHSAYSNSYVNLAIFDPNTGRDVVRGHVGPAAERLIHLFCIVPRQPLIHDDLLFNYSD 140

Query: 535  SELVQHLGESEVSVRNAKEKGIFETKEGVCWRKKLQGLVPADGVVVKHIRTGEDXXXXXX 714
            SELVQHL  SE+S++NAKEKG+F+ ++   WRKK+  L+PA G+ VK I+TGED      
Sbjct: 141  SELVQHLLLSEISLKNAKEKGVFDAEDS--WRKKINSLLPAAGITVKRIKTGEDVLVTRR 198

Query: 715  XXXXFLMMTMADFCDQLFGFQDKLFNNFDGRLEFKGNNFGALWPGDGRPGLWLNSISRMG 894
                F+MMTMADF DQLF FQD LF+N DGRLEF GNN  +LWPGDG+PGLW+NSISRMG
Sbjct: 199  IVAVFVMMTMADFSDQLFSFQDLLFDNSDGRLEFSGNNLASLWPGDGKPGLWINSISRMG 258

Query: 895  AVCNLIVREEEIFLEEKKKRVVVDGQCYETERDEDIELVLPPVFANCTKVLDAGDQIVAR 1074
            A+  LI REEEIF+EE   R+   G   + ERDEDIELV+PPVF  CT++LDA  QI AR
Sbjct: 259  AIYTLIRREEEIFVEE---RIRAGGIEVDEERDEDIELVVPPVFDKCTRILDARQQIEAR 315

Query: 1075 DLYWEAVSINDVARXXXXXXXXXXXXXXSIEKNPFVGEPYVVLSQVYLTEGRFEEAERHA 1254
            DLYWEAV   D+++               IEKNP+VGEP+VVLSQVYLT+ RFEEAER A
Sbjct: 316  DLYWEAVC--DLSK--RGLDKVEELLLSCIEKNPYVGEPHVVLSQVYLTKDRFEEAEREA 371

Query: 1255 ERGLTLLLEWGCPWDKRVSWEGWISWTRVLLMKAKEKSWPHSSWGILNLGLVR 1413
            E+G+TL+LEWG PWDKR SWEGWI+W RVLLMKAKEKSWP++SWGILNLGLVR
Sbjct: 372  EKGVTLMLEWGSPWDKRTSWEGWIAWGRVLLMKAKEKSWPNTSWGILNLGLVR 424


Top