BLASTX nr result

ID: Glycyrrhiza23_contig00005090 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00005090
         (1685 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003526342.1| PREDICTED: uncharacterized protein LOC100798...   522   e-145
ref|XP_003522853.1| PREDICTED: uncharacterized protein LOC100804...   518   e-144
ref|XP_003523899.1| PREDICTED: uncharacterized protein LOC100779...   375   e-101
ref|XP_003526341.1| PREDICTED: uncharacterized protein LOC100797...   368   2e-99
ref|XP_002336026.1| predicted protein [Populus trichocarpa] gi|2...   281   4e-73

>ref|XP_003526342.1| PREDICTED: uncharacterized protein LOC100798367 [Glycine max]
          Length = 674

 Score =  522 bits (1345), Expect = e-145
 Identities = 280/455 (61%), Positives = 316/455 (69%), Gaps = 23/455 (5%)
 Frame = -3

Query: 1683 DEKNKEKEIQNGSTSGKSPIEKSRRARRNKMLKLDKNEQKPKNEERRREPDKGARSRIKK 1504
            D +NKEKE QN ST  KSPI+ S +A +NK++K+DK+EQK K++E+  E DKGA SRI +
Sbjct: 238  DGRNKEKESQNRSTDDKSPIDNSHQAEKNKIIKVDKSEQKQKSKEKHTESDKGAGSRINR 297

Query: 1503 TKDIGTENTXXXXXXXEKIAGFIFMCNAKTKPDCFRHRVMGVPVGKKEVVLGIKPGTXXX 1324
            TK    ENT       EK +GFIFMC+AKTKPDCFR+RVMGV   KK++VL IKPGT   
Sbjct: 298  TKHSRVENTNSSEKRREKPSGFIFMCSAKTKPDCFRYRVMGVSATKKDIVLSIKPGTKLF 357

Query: 1323 XXXXXXXXXYGVYKASSSGGMKLEPRAFGGNFPAQVRFSVAADCFPLPERIFKKAIKENY 1144
                     YG+Y+ASSSGGMKLEPRAFGGNFPAQVRF+VA+DCFPLPE IFKKAIKENY
Sbjct: 358  LYDFDLRLLYGIYRASSSGGMKLEPRAFGGNFPAQVRFNVASDCFPLPESIFKKAIKENY 417

Query: 1143 NEKNKFKTELTVRQVRKLTELFRPMDIHSARQPARSPPKAIIRDREGRDDVRGSWSHLHR 964
            NEKNKFKTELT RQV+KLTELFRP+D+ S  QP RSPPKAII DR   D VRGSWS+LHR
Sbjct: 418  NEKNKFKTELTARQVKKLTELFRPVDVRSGLQPVRSPPKAIIPDRVAPDGVRGSWSYLHR 477

Query: 963  EIPSVLSHXXXXXXXXXXXXXXXXXXDLFLTERSYRAYGLQGERRNVTPASQVNPIMXXX 784
            E    +                    DLFLTE+SYRAYGLQG++RN  PA QVNP +   
Sbjct: 478  ERDPQIER------------QEEIPRDLFLTEKSYRAYGLQGDKRNAIPAPQVNPTLDPY 525

Query: 783  XXXXXXXXXXHLDPIYRNNVPAHVESLRADPLRADPLHFNENEHQAYYRGAISEHADDPY 604
                      H+DP+YR NVP++ ES     L ADP HFNE+EHQ Y RG IS HADDPY
Sbjct: 526  ERDYERDHLHHVDPLYRTNVPSYRES-----LHADPHHFNESEHQTYLRGGISAHADDPY 580

Query: 603  HAYRYGASSRDPYLPP-----------------------LQRRETVTDRLYSTYSAADAL 493
            H Y YGAS RDPYLPP                       LQRRE V DRLYSTYSAADAL
Sbjct: 581  HQYPYGASPRDPYLPPMSRQEISSSTYLAGGRSLTASDNLQRREAVQDRLYSTYSAADAL 640

Query: 492  SEYNRMQHYQGEELEATAVPVSSRYSFAGPPYSFR 388
            SEYNRMQHYQ E L+ATAVPVSSRYSFAGP YS R
Sbjct: 641  SEYNRMQHYQ-ESLKATAVPVSSRYSFAGPSYSLR 674


>ref|XP_003522853.1| PREDICTED: uncharacterized protein LOC100804883 [Glycine max]
          Length = 540

 Score =  518 bits (1335), Expect = e-144
 Identities = 278/455 (61%), Positives = 312/455 (68%), Gaps = 23/455 (5%)
 Frame = -3

Query: 1683 DEKNKEKEIQNGSTSGKSPIEKSRRARRNKMLKLDKNEQKPKNEERRREPDKGARSRIKK 1504
            D +NKEKE QN ST  KSPIE S RA +NK++K+DK+EQK KN+E+  E DKGA SRI +
Sbjct: 104  DGRNKEKESQNRSTHEKSPIENSHRAEKNKIIKVDKSEQKQKNKEKHTESDKGAGSRINR 163

Query: 1503 TKDIGTENTXXXXXXXEKIAGFIFMCNAKTKPDCFRHRVMGVPVGKKEVVLGIKPGTXXX 1324
            TK    ENT        K +GFIFMC+AKTKPDCFR+RVMGV   KK++VL I PGT   
Sbjct: 164  TKHGRVENTVSSEKKRGKPSGFIFMCSAKTKPDCFRYRVMGVSATKKDIVLSINPGTKLF 223

Query: 1323 XXXXXXXXXYGVYKASSSGGMKLEPRAFGGNFPAQVRFSVAADCFPLPERIFKKAIKENY 1144
                     YG+YKASSSGGMKLEPRAFGGNFPAQVRF +A+DCFPLPE IFKKAI+ENY
Sbjct: 224  LYDFDLRLLYGIYKASSSGGMKLEPRAFGGNFPAQVRFKIASDCFPLPESIFKKAIQENY 283

Query: 1143 NEKNKFKTELTVRQVRKLTELFRPMDIHSARQPARSPPKAIIRDREGRDDVRGSWSHLHR 964
            NEK+KFKTELT RQVRKLTELFRP+D+HS  QP RSPP+AII DR+  D VRGSWSHLHR
Sbjct: 284  NEKHKFKTELTARQVRKLTELFRPVDVHSGLQPVRSPPRAIIHDRDALDGVRGSWSHLHR 343

Query: 963  EIPSVLSHXXXXXXXXXXXXXXXXXXDLFLTERSYRAYGLQGERRNVTPASQVNPIMXXX 784
            E    +                    DLFLTE+SYRAYGLQG+RRNV PASQVNP +   
Sbjct: 344  ERDPQIER------------QEEIPRDLFLTEKSYRAYGLQGDRRNVIPASQVNPTLDPY 391

Query: 783  XXXXXXXXXXHLDPIYRNNVPAHVESLRADPLRADPLHFNENEHQAYYRGAISEHADDPY 604
                      H+D +Y  NVP++ ES     L  D  H NE+EHQ Y  G IS HA++PY
Sbjct: 392  ERDYEREHLHHVDTLYHTNVPSYRES-----LHGDLHHLNESEHQTYLHGGISTHANNPY 446

Query: 603  HAYRYGASSRDPYLPP-----------------------LQRRETVTDRLYSTYSAADAL 493
            H YRYGAS RDPYLPP                       LQRRE V DRLYSTYSAADAL
Sbjct: 447  HPYRYGASPRDPYLPPMSREEISSSSYLAGGRSLIGSDILQRREAVQDRLYSTYSAADAL 506

Query: 492  SEYNRMQHYQGEELEATAVPVSSRYSFAGPPYSFR 388
            SEYNRMQHYQ E LEATAVPVSSRYSFAGP YS R
Sbjct: 507  SEYNRMQHYQ-ESLEATAVPVSSRYSFAGPSYSLR 540


>ref|XP_003523899.1| PREDICTED: uncharacterized protein LOC100779568 [Glycine max]
          Length = 957

 Score =  375 bits (964), Expect = e-101
 Identities = 227/476 (47%), Positives = 270/476 (56%), Gaps = 47/476 (9%)
 Frame = -3

Query: 1683 DEKNKEKEIQNGSTSGKSPIEKSRRARRNKMLKLDKNEQKPKNEERRREPDKGARSRIKK 1504
            + +NK KE QN STSGKSP EKS++A+++K  +LDK EQK +++E+ RE  KG+ SR  K
Sbjct: 502  EAENKVKESQNRSTSGKSPREKSQQAQKDKTSQLDKVEQKQESKEKHRELSKGSSSRKNK 561

Query: 1503 TKDIGTENTXXXXXXXEKIAGFIFMCNAKTKPDCFRHRVMGVPVGKKEVVLGIKPGTXXX 1324
             K  G E +       EKI GFIF+CNAKTKPDCFR+ VMGV  GKK+ VL IKPG    
Sbjct: 562  GKSSGMERSQLKGEKGEKIGGFIFLCNAKTKPDCFRYHVMGVSAGKKDDVLQIKPGLKLF 621

Query: 1323 XXXXXXXXXYGVYKASSSGGMKLEPRAFGGNFPAQVRFSVAADCFPLPERIFKKAIKENY 1144
                     YG+YKASSSG MKLEP+AFGG FPAQVRF +A+DCFPLPE IFKKAIK+NY
Sbjct: 622  LYDFDLKLLYGIYKASSSGAMKLEPKAFGGKFPAQVRFKIASDCFPLPESIFKKAIKDNY 681

Query: 1143 NEKNKFKTELTVRQVRKLTELFRPM--------------------DIHSARQPARSPPKA 1024
            NEK+KF+TELTVRQVRKLT+LFRP+                    +IHSA  P  S PK 
Sbjct: 682  NEKHKFRTELTVRQVRKLTQLFRPVGIHSAVHPVHSQPKVIIREREIHSAVHPVHSQPKV 741

Query: 1023 IIRDREGRDDVRGSWSHLHREIPSV----LSHXXXXXXXXXXXXXXXXXXDLFLTERSYR 856
            IIR+RE  D +RGSWSHL RE  +V                         DLF  E  Y 
Sbjct: 742  IIRERESLDGIRGSWSHLQRESYNVRFIDRDQFGWREEIANDLFRVGIPHDLFRME-GYT 800

Query: 855  AYGLQGERRNVTPASQVNPIMXXXXXXXXXXXXXHLDPIYRNNVPAHVESLRADPLRADP 676
               L  +RRN+   S VNP++             HLD  Y  N PAHVESLR DPL  D 
Sbjct: 801  PSHLPRDRRNLANTSHVNPLL---EFYEGDYQPHHLDRGYPRNAPAHVESLRTDPLYLD- 856

Query: 675  LHFNENEHQAYYRGAISEHADDPYHAYRYGASSRDPYLPPLQR----------------- 547
                               + DPYHAYR G S  D Y  PL R                 
Sbjct: 857  ------------------GSRDPYHAYRRGVSPMDAYFAPLSREEISPNSYLVGGRPFVG 898

Query: 546  ------RETVTDRLYSTYSAADALSEYNRMQHYQGEELEATAVPVSSRYSFAGPPY 397
                  RE V DR Y  YSA DALS+++R + Y G++LEA+  PVSSRYSFAGP +
Sbjct: 899  TDNLPKREAVQDRCYPIYSAPDALSDHHRRRQYHGDKLEASGGPVSSRYSFAGPSF 954


>ref|XP_003526341.1| PREDICTED: uncharacterized protein LOC100797837 [Glycine max]
          Length = 958

 Score =  368 bits (945), Expect = 2e-99
 Identities = 222/480 (46%), Positives = 269/480 (56%), Gaps = 51/480 (10%)
 Frame = -3

Query: 1683 DEKNKEKEIQNGSTSGKSPIEKSRRARRNKMLKLDKNEQKPKNEERRREPDKGARSRIKK 1504
            + +NK K  QN STSGKSP E S++A+++K  +LDK EQK K++E+ R+  KG+  R  K
Sbjct: 503  EAENKVKGSQNRSTSGKSPREMSQQAQKDKTSQLDKTEQKQKSKEKHRKLSKGSMIRKNK 562

Query: 1503 TKDIGTENTXXXXXXXEKIAGFIFMCNAKTKPDCFRHRVMGVPVGKKEVVLGIKPGTXXX 1324
             K  G E         EK+ GFIF+CNAKTKPDCFR+ VMGV  GKK+ VL IKPG    
Sbjct: 563  DKSSGMERNQLKGKKGEKLGGFIFLCNAKTKPDCFRYHVMGVSAGKKDDVLQIKPGLKLF 622

Query: 1323 XXXXXXXXXYGVYKASSSGGMKLEPRAFGGNFPAQVRFSVAADCFPLPERIFKKAIKENY 1144
                     YG+YKAS SGGMKLEP+AF G FPAQVRF +A+DCFP+PE IFKKAIK+NY
Sbjct: 623  LYDFDLKLLYGIYKASCSGGMKLEPKAFSGKFPAQVRFKIASDCFPIPESIFKKAIKDNY 682

Query: 1143 NEKNKFKTELTVRQVRKLTELFRPMDIH--------------------SARQPARSPPKA 1024
            NEK+KF+TELTVRQVRKLT+LFRP+ IH                    SA QP  S PK 
Sbjct: 683  NEKHKFRTELTVRQVRKLTQLFRPVGIHSAVHPVHSQPKVIIQEREIRSAVQPVHSQPKV 742

Query: 1023 IIRDREGRDDVRGSWSHLHREIPSVLSHXXXXXXXXXXXXXXXXXXDLFLTERSYRAYGL 844
            IIR+RE  D VRGSW+HL RE     S+                  DLF  E  +  + +
Sbjct: 743  IIRERESSDSVRGSWTHLQRE-----SYNVRSINRDQFDRREEIADDLFRVEIPHDLFRM 797

Query: 843  QG--------ERRNVTPASQVNPIMXXXXXXXXXXXXXHLDPIYRNNVPAHVESLRADPL 688
            +G        +RRNV   S VNP++             HLD  Y  NV AHVESLR DPL
Sbjct: 798  EGYTPSHLPRDRRNVATTSHVNPLL---EYYEGDYQPYHLDRGYPRNVSAHVESLRTDPL 854

Query: 687  RADPLHFNENEHQAYYRGAISEHADDPYHAYRYGASSRDPYLPPLQ-------------- 550
              D                    + DPYHAY  G S+RD Y  PL               
Sbjct: 855  YLD-------------------DSWDPYHAYHRGVSARDAYFAPLSREEISPNSYLAGGR 895

Query: 549  ---------RRETVTDRLYSTYSAADALSEYNRMQHYQGEELEATAVPVSSRYSFAGPPY 397
                     RRE V DR Y  YSA DALS+++RM+ Y G++ EA+  PVSSRYSFAGP +
Sbjct: 896  PFVGTDNLPRREAVQDRHYPIYSAPDALSDHHRMRPYHGDKFEASRGPVSSRYSFAGPSF 955


>ref|XP_002336026.1| predicted protein [Populus trichocarpa] gi|222838883|gb|EEE77234.1|
            predicted protein [Populus trichocarpa]
          Length = 622

 Score =  281 bits (719), Expect = 4e-73
 Identities = 183/477 (38%), Positives = 246/477 (51%), Gaps = 57/477 (11%)
 Frame = -3

Query: 1647 STSGKSPIEKSRRARRN--KMLKLDKNEQKPKNEERRRE-PDKGARSRIKKTKDIGTENT 1477
            + S K  IE  +  R+N  + +  +K ++  K+ E+     D+ +R++  K K    E  
Sbjct: 152  NVSKKERIENGQEIRKNQERFVGSNKGQRNQKSGEKHGVLVDRSSRNQKNKGKHDEREKN 211

Query: 1476 XXXXXXXEKIAGFIFMCNAKTKPDCFRHRVMGVPVGKKEVVLGIKPGTXXXXXXXXXXXX 1297
                   EK+ G IFMC+AKTKPDCF +RVMGV + KKE++LG+KPG             
Sbjct: 212  GWDEKKKEKLGGMIFMCSAKTKPDCFLYRVMGVTMNKKELILGVKPGLKLFLYDFDLKLM 271

Query: 1296 YGVYKASSSGGMKLEPRAFGGNFPAQVRFSVAADCFPLPERIFKKAIKENYNEKNKFKTE 1117
            YG+Y+ASS+GG+KLEP+AFGG+FP QVRF V  DCFP+ E +FKKAIK+NYNEKNKFKTE
Sbjct: 272  YGIYEASSAGGVKLEPKAFGGSFPFQVRFVVHKDCFPITESVFKKAIKDNYNEKNKFKTE 331

Query: 1116 LTVRQVRKLTELFRPMDIHSARQPARSPPKAIIRDREGRDDVRGSWSHLHREIPSVLSH- 940
            LTVRQV KL+ LFRP+       P RSPP   ++DRE     R    HL RE  +  +H 
Sbjct: 332  LTVRQVLKLSALFRPV-----IGPVRSPPMVTVQDREVYAGARDLQVHLEREAFARGNHD 386

Query: 939  ------------XXXXXXXXXXXXXXXXXXDLFLTERSYRAYGLQGERRNVTPASQVNPI 796
                                          DLF++E+ YR YGL GERR +TP+  +   
Sbjct: 387  ARRYSMLSDERDGHVEYQQAGSMHRDEFPCDLFMSEKEYRTYGLSGERRKLTPSHHIPST 446

Query: 795  MXXXXXXXXXXXXXHL-DPIYRNNVPAHVESLRADPLRADPLHFNENEHQ---------- 649
            +              L DPIYR+ VP   E++ A PL  +  + +    +          
Sbjct: 447  LDPYQRDQEREHLLRLPDPIYRDTVPLQREAVLAVPLYLNQPYNSSGRRELPPAVTSIPP 506

Query: 648  ---AYYRGAISEHADDPYHAYRYGASSRDPYLP--------------------------P 556
                    A+  +  DPY+ Y +GASS D Y+P                          P
Sbjct: 507  TSSGSALAALDPYTRDPYYTYHHGASSADAYVPPPRRDELSSGSYYVDGRRETYLFEADP 566

Query: 555  LQRRETVTD-RLYSTYSAADALSEYNRMQHYQGEELEATAVPVSSRYSFAGPPYSFR 388
            L+RRE   + RLYST+ A+DALS YN++  Y G + E     VSSRYSFAGP   +R
Sbjct: 567  LRRREADQEGRLYSTH-ASDALSNYNKLLQYHGAKPETAPPSVSSRYSFAGPSVYYR 622

