BLASTX nr result

ID: Glycyrrhiza23_contig00013519 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00013519
         (2263 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003533974.1| PREDICTED: uncharacterized protein LOC100809...   800   0.0  
ref|XP_003549017.1| PREDICTED: uncharacterized protein LOC100797...   736   0.0  
ref|XP_002513636.1| conserved hypothetical protein [Ricinus comm...   651   0.0  
ref|XP_002327502.1| predicted protein [Populus trichocarpa] gi|2...   617   e-174
ref|XP_004145902.1| PREDICTED: uncharacterized protein LOC101215...   607   e-171

>ref|XP_003533974.1| PREDICTED: uncharacterized protein LOC100809082 [Glycine max]
          Length = 532

 Score =  800 bits (2067), Expect = 0.0
 Identities = 417/539 (77%), Positives = 448/539 (83%), Gaps = 1/539 (0%)
 Frame = -3

Query: 2195 MALPSEXXXXXXXXXXHISSFLQSTASTFASLFNPHQNXXXXXXXXXXXXSAISLPLFLP 2016
            MALPSE           IS+FLQSTAS FASLFNP               S+ SLPLF  
Sbjct: 1    MALPSEPHRRRRHNH--ISTFLQSTASNFASLFNPPN---PPSLALPHPPSSFSLPLFFA 55

Query: 2015 PPLTASAKTVDSPATEPARPAAKSVRVARLNSNGKVGG-PAFVGQVFSMCDLSGTGLMAV 1839
            PPL++S   VDS   EPARPAAKSVR+ARL +NGK GG P FVGQVFSMCDLSGTGLMAV
Sbjct: 56   PPLSSST-AVDSATAEPARPAAKSVRIARLGANGKGGGGPVFVGQVFSMCDLSGTGLMAV 114

Query: 1838 STHFDIPFISKRTPQWLKKMFSAITKSERNGPVFRFFIDLGDAVSYVKKLNIPSGVVGAC 1659
            STHFDIPFISKRTP+WLKK+F+AITKSERNGPVFRFFIDLGDAVSYVKKLNIPSGVVGAC
Sbjct: 115  STHFDIPFISKRTPEWLKKVFAAITKSERNGPVFRFFIDLGDAVSYVKKLNIPSGVVGAC 174

Query: 1658 RLDLAYEHFKEKPHLFQFVPNEKQVKAANKLLKTILHDGVRKKVDGVPVFSAQNLDIAIA 1479
            RLDLAYEHFKEKPHLFQFVPNEKQVKAANKLLKTI   G +KKVDGVPVFSAQNLDIAIA
Sbjct: 175  RLDLAYEHFKEKPHLFQFVPNEKQVKAANKLLKTISEHGEKKKVDGVPVFSAQNLDIAIA 234

Query: 1478 TTDGIKWYTPYFFDKNMLDNILEEAVDQHFHTLIQTRHMQRRRDVVDDNLAAEVIEEMGD 1299
            TTDGIKWYTPYFFDKNMLDNILEEAVDQHFHTLIQTRHM RRRDVVDDNLAAEVIEEMGD
Sbjct: 235  TTDGIKWYTPYFFDKNMLDNILEEAVDQHFHTLIQTRHMHRRRDVVDDNLAAEVIEEMGD 294

Query: 1298 SLGDPPEVQEVLEEMGHPGIPLSVISKAAELQLHYTVDKVLLGNRWLRKATGIQPKFPYM 1119
            SLG+PPEVQE+L+EMGHP IPLSVISKAAELQ  YTVDKV LGNRWLRKATGIQP FPYM
Sbjct: 295  SLGEPPEVQELLDEMGHPSIPLSVISKAAELQFQYTVDKVFLGNRWLRKATGIQPIFPYM 354

Query: 1118 VDSFERRSEASFLRLTESSSCLDNPKVEDDSKHSECTAXXXXXXXXXXSEAVKDLHSKPR 939
            VDSFERRSEAS LR TESSS L+N KVEDD K++EC            +EA+K    +  
Sbjct: 355  VDSFERRSEASLLRATESSSSLENSKVEDDRKNAEC-IDSSKCSLDGNTEAIKQSSPRLS 413

Query: 938  LPFGDWFGHPSPKQSHEKVGLPRNGLNKQDLKQSPFLPKITMVGLSTEEAGQMSKASLKK 759
            LPFG+WF H  PKQ  +KVG  R G+NK+++K +PFLPKITMVGLSTEEAGQMSKA+LKK
Sbjct: 414  LPFGNWFHHLWPKQCRKKVGSSRKGVNKEEMKPAPFLPKITMVGLSTEEAGQMSKANLKK 473

Query: 758  TMDDLTRELEKTEVDNVNGGDGNEFKVEDRDPLFVANVGDYYSSLGRTGTARWIRGGSN 582
            TMDDLTRELEKTE+D +  G   E KVEDRDPLFVANVGDYYSSLG+ G+ RWIRGGSN
Sbjct: 474  TMDDLTRELEKTELDIMTDGGSKECKVEDRDPLFVANVGDYYSSLGKPGSGRWIRGGSN 532


>ref|XP_003549017.1| PREDICTED: uncharacterized protein LOC100797355 [Glycine max]
          Length = 543

 Score =  736 bits (1901), Expect = 0.0
 Identities = 388/541 (71%), Positives = 432/541 (79%), Gaps = 2/541 (0%)
 Frame = -3

Query: 2234 SLTRTKPTS-HAHCMALPSEXXXXXXXXXXHISSFLQSTASTFASLFNPHQNXXXXXXXX 2058
            S T++KP+S +A  MALPSE          HIS+FLQSTAS FASLFNP           
Sbjct: 2    SSTQSKPSSVNARGMALPSEPHRRRRHNHNHISTFLQSTASNFASLFNPPN---PPSLTL 58

Query: 2057 XXXXSAISLPLFLPPPLTASAKTVDSPATEPARPAAKSVRVARLNSNGKVGG-PAFVGQV 1881
                +++SLPLF  PPL+ S+ TVDS  ++PA P AKSVR+ARL +NGK GG P F+G+V
Sbjct: 59   PHPPTSVSLPLFFAPPLSGSS-TVDSATSKPAHPPAKSVRIARLGANGKGGGGPVFLGEV 117

Query: 1880 FSMCDLSGTGLMAVSTHFDIPFISKRTPQWLKKMFSAITKSERNGPVFRFFIDLGDAVSY 1701
            FS+CDLSGTGL+A S HF IPFIS+RTP+WLKK+F+ ITKSERNGPVFRFFIDL DAVSY
Sbjct: 118  FSLCDLSGTGLIAASKHFGIPFISERTPEWLKKIFAPITKSERNGPVFRFFIDLEDAVSY 177

Query: 1700 VKKLNIPSGVVGACRLDLAYEHFKEKPHLFQFVPNEKQVKAANKLLKTILHDGVRKKVDG 1521
            V+KLNIPS VVGA RLDLAY+ FKEKPHLFQFVPNEKQVKAANKLLKTI   G +KKVDG
Sbjct: 178  VEKLNIPSCVVGAFRLDLAYKQFKEKPHLFQFVPNEKQVKAANKLLKTISEHGDKKKVDG 237

Query: 1520 VPVFSAQNLDIAIATTDGIKWYTPYFFDKNMLDNILEEAVDQHFHTLIQTRHMQRRRDVV 1341
            VPVF AQNLDIAIATTDGIKWYTPYFFDKNMLDNILE+AVDQHFH LIQTRHMQRRRDVV
Sbjct: 238  VPVFGAQNLDIAIATTDGIKWYTPYFFDKNMLDNILEDAVDQHFHNLIQTRHMQRRRDVV 297

Query: 1340 DDNLAAEVIEEMGDSLGDPPEVQEVLEEMGHPGIPLSVISKAAELQLHYTVDKVLLGNRW 1161
            DDNLAAEVIEEM D LG+PPEVQE+L+EMGHP IPLSVISKAA LQ  YTVDKVLLGNRW
Sbjct: 298  DDNLAAEVIEEMSDRLGEPPEVQELLDEMGHPSIPLSVISKAAGLQFQYTVDKVLLGNRW 357

Query: 1160 LRKATGIQPKFPYMVDSFERRSEASFLRLTESSSCLDNPKVEDDSKHSECTAXXXXXXXX 981
            LRKATGIQPKFPYMVDSFERRSEAS LR TESSSCL+N KVEDD + SEC          
Sbjct: 358  LRKATGIQPKFPYMVDSFERRSEASLLRATESSSCLENSKVEDDRQISEC-LDSSNCSLD 416

Query: 980  XXSEAVKDLHSKPRLPFGDWFGHPSPKQSHEKVGLPRNGLNKQDLKQSPFLPKITMVGLS 801
              +EA+K       LPFG+WF H  PKQ  EK    R G+NK+++K  P LPK+TMVGLS
Sbjct: 417  GNTEAIKQPSPSLSLPFGNWFHHLWPKQCREKASSSRKGVNKEEMKPRPILPKVTMVGLS 476

Query: 800  TEEAGQMSKASLKKTMDDLTRELEKTEVDNVNGGDGNEFKVEDRDPLFVANVGDYYSSLG 621
            TEEAGQ SKA+LKKTMDDLTRELEKTE+D +  G   E KVEDRDPLFVAN G+YYSS+ 
Sbjct: 477  TEEAGQTSKANLKKTMDDLTRELEKTELDIMTDGGSKECKVEDRDPLFVANGGNYYSSMR 536

Query: 620  R 618
            R
Sbjct: 537  R 537


>ref|XP_002513636.1| conserved hypothetical protein [Ricinus communis]
            gi|223547544|gb|EEF49039.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 544

 Score =  651 bits (1680), Expect = 0.0
 Identities = 334/479 (69%), Positives = 388/479 (81%), Gaps = 3/479 (0%)
 Frame = -3

Query: 2012 PLTASAKTVDSPATEPARPAAKSVRVARLNSNGKVGGPAFVGQVFSMCDLSGTGLMAVST 1833
            P +A+ K + S  +    P+  +VR+A LNSNGK GGPAFVGQVFSMCDLSGTGLMAVST
Sbjct: 76   PKSAAVKGLSSAESSSGFPS--TVRIAGLNSNGKGGGPAFVGQVFSMCDLSGTGLMAVST 133

Query: 1832 HFDIPFISKRTPQWLKKMFSAITKSERNGPVFRFFIDLGDAVSYVKKLNIPSGVVGACRL 1653
            HFDIPFISKRTP+WLKK+F+ +TKSER GPVFRFF+DLGDAV+YVK+LNIPSGVVGACRL
Sbjct: 134  HFDIPFISKRTPEWLKKVFTTVTKSERKGPVFRFFMDLGDAVTYVKRLNIPSGVVGACRL 193

Query: 1652 DLAYEHFKEKPHLFQFVPNEKQVKAANKLLKTILHDGVRKKVDGVPVFSAQNLDIAIATT 1473
            DLAYEHFKEKPHLFQFVPNEKQVKAAN+LLKTI     R+KVDGVPVFSAQNLDIAIATT
Sbjct: 194  DLAYEHFKEKPHLFQFVPNEKQVKAANQLLKTIPQSDGRRKVDGVPVFSAQNLDIAIATT 253

Query: 1472 DGIKWYTPYFFDKNMLDNILEEAVDQHFHTLIQTRHMQRRRDVVDDNLAAEVIEEMGDSL 1293
            DGIKWYTPYFFDK+MLDNILEE+VDQHFH LIQTRHMQRRRDV+DDNLAAEVIEEMGDS+
Sbjct: 254  DGIKWYTPYFFDKSMLDNILEESVDQHFHALIQTRHMQRRRDVIDDNLAAEVIEEMGDSM 313

Query: 1292 GDPPEVQEVLEEMGHPGIPLSVISKAAELQLHYTVDKVLLGNRWLRKATGIQPKFPYMVD 1113
             +PPEVQE+++E+GHP IPL+VISKAAE+QL Y VD+V+LGNRWLRKATGIQPKFPYMVD
Sbjct: 314  LEPPEVQEMMDEIGHPAIPLNVISKAAEIQLLYAVDRVILGNRWLRKATGIQPKFPYMVD 373

Query: 1112 SFERRSEASFLRLTESSSCLDNPKVEDDSKHSECTAXXXXXXXXXXSEAVKDLHSKPRLP 933
            SFE+RS +SF R +E +S L   K + D+     +            E + DL    RL 
Sbjct: 374  SFEKRSASSFRRASEPASYLAKSKTDADT-----SKLNLEDGAQANHEPITDL----RLQ 424

Query: 932  FGDWFGHPSPKQSHE-KVGLPRNGLNKQDLKQSPFLPKITMVGLSTEEAGQMSKASLKKT 756
            FGDWF     KQ  + + G   +   KQ L+ +PFLPKITMVG+ST EAGQMSKASLKKT
Sbjct: 425  FGDWFKSLGLKQQQKPEKGSEISECRKQKLEMNPFLPKITMVGISTGEAGQMSKASLKKT 484

Query: 755  MDDLTRELEKTEVDNVNG--GDGNEFKVEDRDPLFVANVGDYYSSLGRTGTARWIRGGS 585
            M+DLTRELE T+ +N  G   +GN+ ++EDRDPLFVANVGDYYS + +T + R +RGGS
Sbjct: 485  MEDLTRELEHTDRENAPGSSNNGNDLEMEDRDPLFVANVGDYYSGMSKTNSPRLVRGGS 543


>ref|XP_002327502.1| predicted protein [Populus trichocarpa] gi|222836056|gb|EEE74477.1|
            predicted protein [Populus trichocarpa]
          Length = 424

 Score =  617 bits (1590), Expect = e-174
 Identities = 313/433 (72%), Positives = 357/433 (82%), Gaps = 9/433 (2%)
 Frame = -3

Query: 1874 MCDLSGTGLMAVSTHFDIPFISKRTPQWLKKMFSAITKSERNGPVFRFFIDLGDAVSYVK 1695
            MCDLSGTGLMAVSTHFD+PFISKRTP+WLKK+F+ +TKSERNGPVFRFF+DLGDAV+YVK
Sbjct: 1    MCDLSGTGLMAVSTHFDVPFISKRTPEWLKKIFATVTKSERNGPVFRFFMDLGDAVAYVK 60

Query: 1694 KLNIPSGVVGACRLDLAYEHFKEKPHLFQFVPNEKQVKAANKLLKTILHDGVRKKVDGVP 1515
            +LNIPSGVVGACRLDLAYEHFKEKPHLFQFVPNEKQVKAAN+LLK+I H    ++VDGVP
Sbjct: 61   RLNIPSGVVGACRLDLAYEHFKEKPHLFQFVPNEKQVKAANQLLKSIPHGDGSRRVDGVP 120

Query: 1514 VFSAQNLDIAIATTDGIKWYTPYFFDKNMLDNILEEAVDQHFHTLIQTRHMQRRRDVVDD 1335
            VFSAQNLDIAIATTDGIKWYTPYFFDKNMLDNILEE+VDQHFH LIQTRHMQRRRDV+DD
Sbjct: 121  VFSAQNLDIAIATTDGIKWYTPYFFDKNMLDNILEESVDQHFHALIQTRHMQRRRDVIDD 180

Query: 1334 NLAAEVIEEMGDSLGDPPEVQEVLEEMGHPGIPLSVISKAAELQLHYTVDKVLLGNRWLR 1155
            N+AAEVIEEMGDSL +PPEVQEVL+EMGHP IPLSVISKAAE+QL Y VDKVLLGNRWLR
Sbjct: 181  NVAAEVIEEMGDSLLEPPEVQEVLDEMGHPAIPLSVISKAAEIQLLYAVDKVLLGNRWLR 240

Query: 1154 KATGIQPKFPYMVDSFERRSEASFLRLTESSSCLDNPKVEDDSKHSECTAXXXXXXXXXX 975
            KATGIQPKFPY+VDSFERRS +S  R  ES+SCL N K++D +   +             
Sbjct: 241  KATGIQPKFPYLVDSFERRSASSLRRALESTSCLANSKIDDSTSEHK-----LKDNVQTD 295

Query: 974  SEAVKDLHSKPRLPFGDWFGHPSPK---QSHEKVGLPRNGLNKQDLK----QSPFLPKIT 816
             E  KDL    RLPFGDWF HP  K   +S  +    + GL+K  LK     +PFLPK+T
Sbjct: 296  HEQRKDL----RLPFGDWFSHPWLKKHSKSERESDTRKEGLSKDCLKWKSESNPFLPKVT 351

Query: 815  MVGLSTEEAGQMSKASLKKTMDDLTRELEKTEV--DNVNGGDGNEFKVEDRDPLFVANVG 642
            MVG+ST +AGQ+SK+SLKKTM+DLT+ELE+T+   D+      +EFKV DRDPLFVANVG
Sbjct: 352  MVGVSTGDAGQLSKSSLKKTMEDLTKELEQTDEANDSFISNSSSEFKVNDRDPLFVANVG 411

Query: 641  DYYSSLGRTGTAR 603
            DYYS + +TG +R
Sbjct: 412  DYYSGMAKTGISR 424


>ref|XP_004145902.1| PREDICTED: uncharacterized protein LOC101215938 [Cucumis sativus]
          Length = 554

 Score =  607 bits (1565), Expect = e-171
 Identities = 312/479 (65%), Positives = 375/479 (78%), Gaps = 2/479 (0%)
 Frame = -3

Query: 2021 LPPPLTASAKTVDS-PATEPARPAAKSVRVARLNSNGKVGGPAFVGQVFSMCDLSGTGLM 1845
            L  P +A+ K + S P  +   P+  ++R++ LNS+GK GGPAFVGQVFSMCDLSG GLM
Sbjct: 80   LDSPKSAAVKGLSSSPNFDSGFPS--TLRISGLNSDGKTGGPAFVGQVFSMCDLSGAGLM 137

Query: 1844 AVSTHFDIPFISKRTPQWLKKMFSAITKSERNGPVFRFFIDLGDAVSYVKKLNIPSGVVG 1665
            AV+++ +IPF+SKRT +WLKKMFS ITKS+RN P+FRFF DLGDAV+YVK+LNIPS VVG
Sbjct: 138  AVTSNMNIPFVSKRTEEWLKKMFSTITKSKRNAPIFRFFTDLGDAVTYVKRLNIPSAVVG 197

Query: 1664 ACRLDLAYEHFKEKPHLFQFVPNEKQVKAANKLLKTILHDGVRKKVDGVPVFSAQNLDIA 1485
             CRLDLAYEHFKEKPHLFQF+PNEKQVKAANKLLK +  +G  KK+DGVPVFSAQNLDIA
Sbjct: 198  VCRLDLAYEHFKEKPHLFQFIPNEKQVKAANKLLKGLPQNGGSKKIDGVPVFSAQNLDIA 257

Query: 1484 IATTDGIKWYTPYFFDKNMLDNILEEAVDQHFHTLIQTRHMQRRRDVVDDNLAAEVIEEM 1305
            IATT+GIKWYTPYFFDKNMLDNILEE+VDQHFH LIQTR +QRRR++VDDN AAEV+EEM
Sbjct: 258  IATTNGIKWYTPYFFDKNMLDNILEESVDQHFHALIQTRRLQRRREIVDDNAAAEVLEEM 317

Query: 1304 GDSLGDPPEVQEVLEEMGHPGIPLSVISKAAELQLHYTVDKVLLGNRWLRKATGIQPKFP 1125
            GDSL +PPEVQEV++EMG+PGIPLSVISK AE+QL YTVDKV+LGNRWLRKA GIQPKFP
Sbjct: 318  GDSLLEPPEVQEVMDEMGNPGIPLSVISKVAEMQLLYTVDKVILGNRWLRKAVGIQPKFP 377

Query: 1124 YMVDSFERRSEASFLRLTESSSCLDNPKVEDDSKHSECTAXXXXXXXXXXSEAVKDLHSK 945
            YMVDSFERRS AS LR+ ES+S L N +  +++K  +C +           EA ++    
Sbjct: 378  YMVDSFERRSAASLLRIQESASGLTNSESVEETKELQCYS-SSPLNTEDNREANQEPKQH 436

Query: 944  PRLPFGDWFGHPSPKQ-SHEKVGLPRNGLNKQDLKQSPFLPKITMVGLSTEEAGQMSKAS 768
               PF +WFGH   KQ   +     R    KQ+++ SPFLPKITMVG+ST ++G  SKA+
Sbjct: 437  SFNPFRNWFGHLWSKQRQRDDFSQER---TKQNVQISPFLPKITMVGISTGDSGHTSKAN 493

Query: 767  LKKTMDDLTRELEKTEVDNVNGGDGNEFKVEDRDPLFVANVGDYYSSLGRTGTARWIRG 591
            LKKTM+DLTRELE  +  N    +  EF  E+RDPLFVANV  + S L + G+ARW+RG
Sbjct: 494  LKKTMEDLTRELEHIDQGNAASHNEYEFNNEERDPLFVANVSHFSSGLSKAGSARWVRG 552

