BLASTX nr result

ID: Glycyrrhiza23_contig00009027 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00009027
         (2104 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003524452.1| PREDICTED: uncharacterized protein LOC100819...   760   0.0  
ref|XP_003532418.1| PREDICTED: uncharacterized protein LOC100816...   731   0.0  
ref|XP_002509831.1| DNA binding protein, putative [Ricinus commu...   456   e-125
ref|XP_002298024.1| predicted protein [Populus trichocarpa] gi|2...   439   e-120
ref|XP_002273481.2| PREDICTED: AT-rich interactive domain-contai...   437   e-120

>ref|XP_003524452.1| PREDICTED: uncharacterized protein LOC100819320 [Glycine max]
          Length = 618

 Score =  760 bits (1962), Expect = 0.0
 Identities = 389/580 (67%), Positives = 440/580 (75%), Gaps = 1/580 (0%)
 Frame = +2

Query: 2    LKQKCLFYQLFPDYLKENSCRGNVRPVPVLLGDGHLLDLYQLFSLVKERGGYAAVSRKGL 181
            LKQK LFYQLFP YLK++     VRP+PVL+ DG  LDLY+LFSLVKERGGYA VS+KGL
Sbjct: 46   LKQKYLFYQLFPAYLKDSCSMDGVRPLPVLV-DGQQLDLYKLFSLVKERGGYARVSKKGL 104

Query: 182  WGSVTKDXXXXXXXXXXXXXXYDKYLNDFEGWVRKTFEEKNFKNGNHGGDRGFKSLPLDL 361
            WGSVTK+              +DKYLNDFE W++KTFEEKN KNGNHG D G K LPLD+
Sbjct: 105  WGSVTKELGLNPVVCWSVKLVFDKYLNDFERWLKKTFEEKNLKNGNHGCDWGLKWLPLDI 164

Query: 362  EKEFRGLLCSNLKDKDDD-FVQLESDKFMKYIDLVNHKSDASLSDTENQNNKCEXXXXXX 538
            EKEFR LLCSNLKDKDDD  V+ +S K  K  DLVNHK+  +L DT +QNNK +      
Sbjct: 165  EKEFRALLCSNLKDKDDDRLVKSKSIKKKKNADLVNHKNGNNLLDTNDQNNKSKDVQRIE 224

Query: 539  XXXXXXXXXEKLCNGVKDDLSASVAEIAEKGFNSRKRKREALSGMLNWMKDVAKHPLDPL 718
                     EK  NG+K + +   AE AEK FN RKRKR+ L GMLNWMK +AKHPLDPL
Sbjct: 225  GDND-----EKSANGIKGNPATLGAEGAEKEFNPRKRKRDVLFGMLNWMKHIAKHPLDPL 279

Query: 719  TQPIPKLSKFKEYKGQDFVVQVLRARDVLSLRQHVEPNSGPSSLQKPKICPAMYEDHVAL 898
            TQPIPK SK+KEY GQDF  Q LRAR+ LSLRQH EPNSG SSLQK K+ PAMYEDHVAL
Sbjct: 280  TQPIPKPSKWKEYNGQDFFGQFLRAREALSLRQHEEPNSGSSSLQKQKMHPAMYEDHVAL 339

Query: 899  GHHGTMKLRCSERLPISVKKXXXXXXXXXXANENGLPSSVNTEAEKCPLEKTTAKPDAVT 1078
            G   T KLRCSERLP  VK            N N L  S N EAEKCP EKTT   D  T
Sbjct: 340  GRPATGKLRCSERLPSFVKSRSCSCCNPCSPNGNRLAGSHNMEAEKCPPEKTTETLDVST 399

Query: 1079 EKVISDPAGDDVREKQVSIGPLFQAEVPQWTGVVSESDSKWLGTQVWPVKHDSKPATETN 1258
             K I++P+GD+  EKQVS+GP FQAEVP+WTGVVSESDSKWLGTQVW +K+D++ ATET+
Sbjct: 400  TKTIAEPSGDESLEKQVSVGPRFQAEVPEWTGVVSESDSKWLGTQVWTLKNDTEHATETD 459

Query: 1259 LIGRGRQEKCGCRVRGSVECIRFHIAENRMKLKLELGSVFYHWGFDQMGEEVSLRWTPEE 1438
             IGRGRQEKC C   GSVEC+R HIAENRMKLKLELGS FY WGFD+MGEEVSL+WT EE
Sbjct: 460  -IGRGRQEKCSCEFHGSVECVRLHIAENRMKLKLELGSEFYRWGFDRMGEEVSLQWTTEE 518

Query: 1439 EKKFKDVMRSSIPSQNRTFWNNPSKYFHKKTRKDLVSYYFNVFLIQLRRYQNRVTPKSID 1618
            EK+FKD+M+S+IPS+N+ FWNNPSKYF KKTR++LVSYYFN FLIQLR YQNRV+PKS+D
Sbjct: 519  EKRFKDIMKSNIPSKNKYFWNNPSKYFPKKTRRNLVSYYFNAFLIQLRTYQNRVSPKSVD 578

Query: 1619 SDNDEVEFGSVGDGFGMEAVKGPGVDILKCSENKQCTDLE 1738
            SD+DEVEFGS  DGFGMEAVKGP  D L+CS NKQCTD E
Sbjct: 579  SDDDEVEFGSFSDGFGMEAVKGPDDDFLECSLNKQCTDFE 618


>ref|XP_003532418.1| PREDICTED: uncharacterized protein LOC100816021 [Glycine max]
          Length = 632

 Score =  731 bits (1886), Expect = 0.0
 Identities = 381/589 (64%), Positives = 440/589 (74%), Gaps = 1/589 (0%)
 Frame = +2

Query: 2    LKQKCLFYQLFPDYLKENSCRGNVRPVPVLLGDGHLLDLYQLFSLVKERGGYAAVSRKGL 181
            LKQK LFYQLFP YLK++  +G VRP+PVL+ DG  LDLY+LF LVKERGGYA VS+K L
Sbjct: 48   LKQKYLFYQLFPAYLKDSCSKGGVRPLPVLV-DGQWLDLYKLFFLVKERGGYARVSKKRL 106

Query: 182  WGSVTKDXXXXXXXXXXXXXXYDKYLNDFEGWVRKTFEEKNFKNGNHGGDRGFKSLPLDL 361
            WGSVTK+              YDKYLNDF+ W++KTFEEKN K+GNHG D G K LP D+
Sbjct: 107  WGSVTKELGLNLVVCWSVKLVYDKYLNDFDRWLKKTFEEKNLKSGNHGCDWGLKWLPFDI 166

Query: 362  EKEFRGLLCSNLKDKDDD-FVQLESDKFMKYIDLVNHKSDASLSDTENQNNKCEXXXXXX 538
            EKEFR LLC NLKDKDD   V+ +S    K  DLVNHK+  +L DT++QNNK E      
Sbjct: 167  EKEFRALLCPNLKDKDDHKLVKSKSINKKKNTDLVNHKNGNNLLDTKDQNNKSEDVQRIE 226

Query: 539  XXXXXXXXXEKLCNGVKDDLSASVAEIAEKGFNSRKRKREALSGMLNWMKDVAKHPLDPL 718
                     EK  NGVKD+ +   AE A+K FN  KRKR+ALSGMLNWMK VAKH LDPL
Sbjct: 227  GDND-----EKSTNGVKDNPATLGAEGAKKEFNPHKRKRDALSGMLNWMKHVAKHALDPL 281

Query: 719  TQPIPKLSKFKEYKGQDFVVQVLRARDVLSLRQHVEPNSGPSSLQKPKICPAMYEDHVAL 898
            TQPIPK SK+KEYKGQDF  Q LRAR+VLS RQH EP+S  SSLQK K+ PAMYEDHVAL
Sbjct: 282  TQPIPKPSKWKEYKGQDFFGQFLRAREVLSPRQHEEPSSELSSLQKQKMHPAMYEDHVAL 341

Query: 899  GHHGTMKLRCSERLPISVKKXXXXXXXXXXANENGLPSSVNTEAEKCPLEKTTAKPDAVT 1078
            G H T KLRCSERLP  VK            N N L  S   EAEKCPLEKTT  PD  T
Sbjct: 342  GCHATRKLRCSERLPSFVKSRSCSCCNPCSPNGNRLTGSHIMEAEKCPLEKTTETPDVST 401

Query: 1079 EKVISDPAGDDVREKQVSIGPLFQAEVPQWTGVVSESDSKWLGTQVWPVKHDSKPATETN 1258
               I+ P+GD+  EKQVS+GP FQAEVP+WTGV SESDSKWLGT VW +K+D++PAT T+
Sbjct: 402  TITIAKPSGDESLEKQVSVGPRFQAEVPEWTGVFSESDSKWLGTHVWSLKNDTEPATATD 461

Query: 1259 LIGRGRQEKCGCRVRGSVECIRFHIAENRMKLKLELGSVFYHWGFDQMGEEVSLRWTPEE 1438
             +GRGRQE C C   GSVEC+R HIAENRMKLKLELGS FY  GFD++GEEVSL+WT EE
Sbjct: 462  -VGRGRQEMCSCEFHGSVECVRLHIAENRMKLKLELGSEFYRLGFDRIGEEVSLQWTTEE 520

Query: 1439 EKKFKDVMRSSIPSQNRTFWNNPSKYFHKKTRKDLVSYYFNVFLIQLRRYQNRVTPKSID 1618
            E++FKD+M+S+I S+N+ FWNNPSKYF KKTR++LV+YYFNVFLIQLR YQNRVTP+S+D
Sbjct: 521  EQRFKDIMKSNISSKNKYFWNNPSKYFPKKTRRNLVNYYFNVFLIQLRTYQNRVTPESVD 580

Query: 1619 SDNDEVEFGSVGDGFGMEAVKGPGVDILKCSENKQCTDLEEETKLE*QF 1765
            SD+DEVEFGS  DGFGM+AVK  G D L+CS NKQCTDL + TKLE +F
Sbjct: 581  SDDDEVEFGSFSDGFGMDAVKSLGDDFLECSLNKQCTDL-KYTKLECKF 628


>ref|XP_002509831.1| DNA binding protein, putative [Ricinus communis]
            gi|223549730|gb|EEF51218.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 656

 Score =  456 bits (1172), Expect = e-125
 Identities = 265/613 (43%), Positives = 347/613 (56%), Gaps = 34/613 (5%)
 Frame = +2

Query: 2    LKQKCLFYQLFPDYLKENSCRGNVRPVPVLLGDGHLLDLYQLFSLVKERGGYAAVSRKGL 181
            +K +CLF Q+   +  E + RG+ RP+P LLG G  LDL++LF +V++RGG+  V+  G 
Sbjct: 50   VKLRCLFDQVLSVFANEAAARGSFRPIPALLGGGKSLDLFKLFRVVRKRGGFDLVN--GF 107

Query: 182  WGSVTKDXXXXXXXXXXXXXXYDKYLNDFEGWVRKTFEEKNFKNGNHGGDRGFKSLPLDL 361
            W  V K+              Y KYL + E W+R +   +   NG       F  L ++L
Sbjct: 108  WSFVVKELGLDLAASASVKLVYFKYLYELERWLRGSNSSRRLGNGQCRPGGKFNCLSMEL 167

Query: 362  EKEFRGLLCSNLKDKDDDFVQLESDKFMKYIDLVNHKSDASLSDTENQNN---------- 511
            E EFR LL +  K   D   + +S  F   I++V  KS   L DT++ ++          
Sbjct: 168  EAEFRKLLSNGSKKGKDGKYKKKSKNFG--INVV--KSKIGLPDTKDVHSAGSRHTDDDE 223

Query: 512  -------KCEXXXXXXXXXXXXXXX---EKLCNGVK------------DDLSASVAEIAE 625
                   KC+                  E L   VK            DD+      I E
Sbjct: 224  NFQEYTGKCKGKSADICAHPPPTPALAEEHLVRRVKPYNEKCSTDNDGDDVVILDPSIGE 283

Query: 626  KGFNSRKRKREALSGMLNWMKDVAKHPLDPLTQPIPKLSKFKEYKGQDFVVQVLRARDVL 805
            K F+ RKRKRE+LS MLNW+   AK P  P    IP LSK K+ KG +   Q +RARD L
Sbjct: 284  KLFSPRKRKRESLSRMLNWVIQAAKSPDHPSIGNIPPLSKCKDNKGNELWAQAIRARDAL 343

Query: 806  SLRQHVEPNSGPSSLQK-PKICPAMYEDHVALGHHGTMKLRCSERLPISVKKXXXXXXXX 982
              R+ V      S LQ   KI P+MY+D +      + ++RCSERLP  VK         
Sbjct: 344  VRRRQVNSGCERSLLQNHQKIHPSMYKDAIPPSDPSSERVRCSERLPALVKPRSCSCCNS 403

Query: 983  XXANENGLPSSVNTEAEKCPLEKTTAKPDAVTEKVISDPAGDDVREKQVSIGPLFQAEVP 1162
              A ++ L S   TE E  P  K     D          +GD    + VS+G  FQAEVP
Sbjct: 404  CSAPKSQLISPPKTELENAPKAKVLMAEDLSAATATLSSSGDIHIHRHVSVGRRFQAEVP 463

Query: 1163 QWTGVVSESDSKWLGTQVWPVKHDSKPA-TETNLIGRGRQEKCGCRVRGSVECIRFHIAE 1339
            +WTG+VSES+SKWLGTQ WP++     A  + + IG+GR E CGC + GSVEC+RFHIAE
Sbjct: 464  EWTGLVSESESKWLGTQAWPLEFGEHNAMVQEDTIGKGRPESCGCELPGSVECVRFHIAE 523

Query: 1340 NRMKLKLELGSVFYHWGFDQMGEEVSLRWTPEEEKKFKDVMRSSIPSQNRTFWNNPSKYF 1519
            NR+KLK+ELGSVFYHW FD MGEE++LRWT EEEK+FKDV+R ++PS ++ FW+N  KYF
Sbjct: 524  NRIKLKIELGSVFYHWKFDCMGEEIALRWTAEEEKRFKDVVRFNLPSLDKFFWDNSRKYF 583

Query: 1520 HKKTRKDLVSYYFNVFLIQLRRYQNRVTPKSIDSDNDEVEFGSVGDGFGMEAVKGPGVDI 1699
             +KT+++LVSYYFNVFL+Q R YQNRVTPK IDSD+DE EFGS+ D +G +AV  PG  +
Sbjct: 584  RRKTKEELVSYYFNVFLVQRRSYQNRVTPKHIDSDDDESEFGSLSDTYGQQAVTVPGTKM 643

Query: 1700 LKCSENKQCTDLE 1738
            L CSENKQCTD +
Sbjct: 644  LMCSENKQCTDFK 656


>ref|XP_002298024.1| predicted protein [Populus trichocarpa] gi|222845282|gb|EEE82829.1|
            predicted protein [Populus trichocarpa]
          Length = 648

 Score =  439 bits (1128), Expect = e-120
 Identities = 246/588 (41%), Positives = 339/588 (57%), Gaps = 13/588 (2%)
 Frame = +2

Query: 14   CLFYQLFPDYLKENSCRGNVRPVPVLLGDGHLLDLYQLFSLVKERGGYAAVSRKGLWGSV 193
            CLF +L   +L E +  G +RP+P LLGDG  LDL++LF +V++RGG+  V+  G W  V
Sbjct: 64   CLFDELVSKFLNETAEGGCIRPIPALLGDGQSLDLFKLFWVVRKRGGFDLVN--GFWSFV 121

Query: 194  TKDXXXXXXXXXXXXXXYDKYLNDFEGWVRKTFEEKNFKNGNHGGDRGFKSLPLDLEKEF 373
             K+              Y KYL + E  + ++ +E+  + G    D       L+LE + 
Sbjct: 122  AKELGLELQFSPSVKLIYIKYLYELEKCMSRSCKEEKIRKGKRQCDGNLSCSSLELEMQL 181

Query: 374  RGLLCSNLKDKDDD--FVQLESDKFMKYIDLVNHKSDASLSDTENQNNKCEXXXXXXXXX 547
            R LL      K +D  F     +K  +Y+++   K    L DT+  +             
Sbjct: 182  RSLLLHRCDRKQEDCKFASEVYEKNGRYVEMDTGKGKIGLLDTKAVHRVHNGVGNRNCDY 241

Query: 548  XXXXXXEK--LCNGVKDDLSASVAEIAEKGFNSRKRKREALSGMLNWMKDVAKHPLDPLT 721
                  E+    N   DD+      I +K FNSRKRKRE L+ MLNW+  +AK P DP  
Sbjct: 242  DKKFHAERNNYFNDSDDDVVILDPSIVKKEFNSRKRKREPLTRMLNWVIQIAKCPDDPSI 301

Query: 722  QPIPKLSKFKEYKGQDFVVQVLRARDVLSLRQHVEPNSGPSSLQ--------KPKICPAM 877
                   K+K +KG +  +Q +RAR+ L  R+H +PN   S LQ          K+ P+M
Sbjct: 302  GVRSPQFKWKNHKGNELWLQAIRAREALLHRRHFDPNIEQSLLQVYNMNFQDNRKMHPSM 361

Query: 878  YEDHVALGHHGTMKLRCSERLPISVKKXXXXXXXXXXANENGLPSSVNTEAEKCPLEKTT 1057
            +ED   L  H   + RCS+RLP   K           A ++   S + TE E    E+  
Sbjct: 362  FEDVSVLSEHFAERSRCSKRLPALAKPDVCSCCNSCSAPQSKSTSPLKTECENGLKEQEL 421

Query: 1058 AKPDAVTEKVISDPAGDDVREKQVSIGPLFQAEVPQWTGVVSESDSKWLGTQVWPVKHDS 1237
            A  D  ++    D +GD    + V++GPLFQAEVP+WT VVSESDSKWLGT++WP++ ++
Sbjct: 422  AV-DLSSKNATFDGSGDGHVRRHVAVGPLFQAEVPEWTSVVSESDSKWLGTRLWPLECEN 480

Query: 1238 KPAT-ETNLIGRGRQEKCGCRVRGSVECIRFHIAENRMKLKLELGSVFYHWGFDQMGEEV 1414
              A    + IG GR   CGC++ GSV C+RFHIAE R+KLKLELG +FYHW FD+MGEEV
Sbjct: 481  HNAVFAMDPIGNGRPSVCGCQLPGSVGCVRFHIAEKRIKLKLELGYLFYHWQFDRMGEEV 540

Query: 1415 SLRWTPEEEKKFKDVMRSSIPSQNRTFWNNPSKYFHKKTRKDLVSYYFNVFLIQLRRYQN 1594
            SLRWT EEEK+FKD+++ ++ S  + FW+N  KYF +KTR++LVSYYFN +L++ R YQN
Sbjct: 541  SLRWTTEEEKRFKDMVKFNLLSAGKCFWDNKHKYFPRKTREELVSYYFNAYLVRRRSYQN 600

Query: 1595 RVTPKSIDSDNDEVEFGSVGDGFGMEAVKGPGVDILKCSENKQCTDLE 1738
            RVTPK+IDSD+DE EFGS  DG+G E +  PG  +L CSENKQCTD +
Sbjct: 601  RVTPKNIDSDDDETEFGSFSDGYGHEVLMVPGAYMLICSENKQCTDFK 648


>ref|XP_002273481.2| PREDICTED: AT-rich interactive domain-containing protein 2-like
            [Vitis vinifera]
          Length = 601

 Score =  437 bits (1123), Expect = e-120
 Identities = 248/584 (42%), Positives = 344/584 (58%), Gaps = 6/584 (1%)
 Frame = +2

Query: 5    KQKCLFYQLFPDYLKENSCRGNVRPVPVLLGDGHLLDLYQLFSLVKERGGYAAVSRKGLW 184
            K + LF Q+   ++KE      VRP+P +LGDG  +DL++LF +V+ +GGY  VS KGLW
Sbjct: 42   KLQVLFDQVLLIFMKEVLGNECVRPIPAMLGDGRSVDLFKLFWVVRGKGGYEWVSDKGLW 101

Query: 185  GSVTKDXXXXXXXXXXXXXXYDKYLNDFEGWVRKTFEEKNFKNGNHGGDRGFKSLPLDLE 364
            G V ++              Y KYL+  + W+     + +   G         S+  +L 
Sbjct: 102  GLVAEECGLDVGVKTCLKLIYFKYLDQLDQWLLGILRDGSLDKGEGECGGKLDSVLEELG 161

Query: 365  KEFRGLLCSNL--KDKDDDFVQLESDKFMKYIDLVNHKSDASLSDTENQNNKCEXXXXXX 538
             EFRGL+      K+KDD   +LES++    +DL   KS  +LS    +  K        
Sbjct: 162  TEFRGLILGGTGPKEKDDGVFELESERTDNCVDLDKEKSILNLSIVLRRAEKSPNDDD-- 219

Query: 539  XXXXXXXXXEKLCNGVKDDLSASVAEIAEKGFNSRKRKREALSGMLNWMKDVAKHPLDPL 718
                     EK C    DD++     IA+K   SRKRKRE+ SGMLNW++++AKHP +P 
Sbjct: 220  ---------EKNCVDGGDDVAIQDPIIAKKSSFSRKRKRESFSGMLNWVREIAKHPEEPY 270

Query: 719  TQPIPKLSKFKEYKGQDFVVQVLRARDVLSLRQHVEPNSGPSSLQ-KPKICPAMYEDHVA 895
                    K  +   + F  Q+LR R+ L +R+ V      S LQ K K+ P+MYED++ 
Sbjct: 271  --------KGNKNGNEVFSTQILRVRESLLIRRKVHTKEEQSLLQVKQKMHPSMYEDNIV 322

Query: 896  LGHHGTMKLRCSERLPISVKKXXXXXXXXXXANENGLPSSVNTEAEKCPLEK--TTAKPD 1069
            + H  T K RCS RL  SVK           A ++ LP    T++E CP E+  T  + +
Sbjct: 323  VNHQSTEKSRCSRRL-FSVKSHLCSCS----AAQSKLPGPHRTKSESCPKEQALTPNEQE 377

Query: 1070 AVTEKVISDPAGDDVREKQVSIGPLFQAEVPQWTGVVSESDSKWLGTQVWPVKH-DSKPA 1246
                  + D   DD+  K + +GP FQAEVP+WTG V+ESDSKWLGTQVWP+++ +    
Sbjct: 378  PENTNTLDDLFPDDLFLKPIPVGPNFQAEVPEWTGEVTESDSKWLGTQVWPLENGECSFR 437

Query: 1247 TETNLIGRGRQEKCGCRVRGSVECIRFHIAENRMKLKLELGSVFYHWGFDQMGEEVSLRW 1426
             E + +GRG+ + C CR  GSVEC+RFHIAE RMKLKL+LGS+FYHW FD+MGEE+SL W
Sbjct: 438  IEKDCVGRGKPDSCDCRFSGSVECVRFHIAEKRMKLKLDLGSLFYHWRFDRMGEEISLAW 497

Query: 1427 TPEEEKKFKDVMRSSIPSQNRTFWNNPSKYFHKKTRKDLVSYYFNVFLIQLRRYQNRVTP 1606
            T EEEK+FK ++R +   Q+ +FW+N  + F  KTR+ LVSYYFNVFLI+ R YQNRVTP
Sbjct: 498  TTEEEKRFKHMIRLNSSLQSPSFWDNALRIFPTKTREALVSYYFNVFLIRRRIYQNRVTP 557

Query: 1607 KSIDSDNDEVEFGSVGDGFGMEAVKGPGVDILKCSENKQCTDLE 1738
            + IDSD+DE+EFGS+G  FG E +K P  + L C++N+Q T+L+
Sbjct: 558  RKIDSDDDELEFGSLGGSFGHEVIKVPWSEFLTCTQNEQSTELD 601


Top