BLASTX nr result

ID: Glycyrrhiza23_contig00020395

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00020395
         (1375 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003532564.1| PREDICTED: uncharacterized protein LOC100810...   682   0.0  
ref|XP_003528426.1| PREDICTED: uncharacterized protein LOC100787...   654   0.0  
ref|XP_003547131.1| PREDICTED: uncharacterized protein LOC100802...   551   e-154
ref|XP_003541752.1| PREDICTED: uncharacterized protein LOC100799...   547   e-153
ref|XP_002524700.1| conserved hypothetical protein [Ricinus comm...   438   e-120

>ref|XP_003532564.1| PREDICTED: uncharacterized protein LOC100810673 [Glycine max]
          Length = 1047

 Score =  682 bits (1761), Expect = 0.0
 Identities = 328/441 (74%), Positives = 366/441 (82%), Gaps = 2/441 (0%)
 Frame = -3

Query: 1319 YSEGELRRELPNGVMEISPASTPNV--GVGSHCDVKVGGADHKAVTPRYFRSKNVERVPA 1146
            Y++ ELRRELPNGVMEISPAS       VGSHCDVKVG  D K VTPRYFRSKNV+RVPA
Sbjct: 190  YTKEELRRELPNGVMEISPASPTRDYNNVGSHCDVKVG-VDSKTVTPRYFRSKNVDRVPA 248

Query: 1145 GKLQVVPYGPNLKKGNTKRKKCHWCQKSDSWSLIKCSSCQKEFFCMDCIKERYFDSQNEV 966
            GKLQ+VPYG NLKKG  KRKKCHWCQ+S+S +LI+CSSCQ+EFFCMDC+KERYFD++NE+
Sbjct: 249  GKLQIVPYGSNLKKG--KRKKCHWCQRSESGNLIQCSSCQREFFCMDCVKERYFDAENEI 306

Query: 965  KKVCPVCRGTCTCKDCLASQCKDSESKACLSGKSRVDRILHFHYLICMLLPVLKRISEDQ 786
            KK CPVCRGTC CK C ASQCKDSESK CL+GKSRVDRILHFHYLICMLLPVLK+ISEDQ
Sbjct: 307  KKACPVCRGTCPCKYCSASQCKDSESKECLTGKSRVDRILHFHYLICMLLPVLKQISEDQ 366

Query: 785  DTELETEAKIKGKSISDIQIKQVEFDCNQKNYCNHCKTPVLDLHRXXXXXXXXXXXXXXX 606
            + ELETE KIKGK+ISDIQIKQVEF C++KNYCNHCKTP+LDLHR               
Sbjct: 367  NIELETEVKIKGKNISDIQIKQVEFGCSEKNYCNHCKTPILDLHRSCPSCSYSLCSSCCQ 426

Query: 605  XXSQGRIFGEINSTMLNLPDKRKAYVDSEGHTLDQKAISSDNLTAALILPQQTKCDDIEN 426
              SQG+  G +NS++   PDK K    SE HTL+++A S  NLT   +LP+ T  + I++
Sbjct: 427  ELSQGKASGAMNSSVFKRPDKMKPCSASENHTLEERATSIGNLTDTSVLPEWTNGNGIDS 486

Query: 425  ASCPPTGLGGCGKGHLELRCIFPSSWIKEMELKAEEIVCSYDFPETLDKSSSCSLCFDTD 246
             SCPPT LGGCGK HLELR +FPSSWIKEME KAEEIVCSYDFPET DKSSSCSLCFDTD
Sbjct: 487  LSCPPTELGGCGKSHLELRSVFPSSWIKEMEAKAEEIVCSYDFPETSDKSSSCSLCFDTD 546

Query: 245  HNTNRYKQLQKAALREDPSDNCLFCPTVFDISGDNFEHFQKHWGKGHPIVVRDVLQSTSN 66
            H TNRYKQLQ+AALRED +DN LFCPTV DISGDNFEHFQKHWGKGHPIVV+D L+STSN
Sbjct: 547  HGTNRYKQLQEAALREDSNDNYLFCPTVMDISGDNFEHFQKHWGKGHPIVVQDALRSTSN 606

Query: 65   LSWNPLIMFCTYLEQSITRYE 3
            LSW+PL MFCTYLEQSITRYE
Sbjct: 607  LSWDPLTMFCTYLEQSITRYE 627


>ref|XP_003528426.1| PREDICTED: uncharacterized protein LOC100787798 [Glycine max]
          Length = 1030

 Score =  654 bits (1686), Expect = 0.0
 Identities = 320/441 (72%), Positives = 358/441 (81%), Gaps = 2/441 (0%)
 Frame = -3

Query: 1319 YSEGELRRELPNGVMEISPASTPNV--GVGSHCDVKVGGADHKAVTPRYFRSKNVERVPA 1146
            Y++ ELRRELPNGVMEISPAS       VGSHCDVKVG  D K V PRYFRSKNV+RVPA
Sbjct: 177  YTKEELRRELPNGVMEISPASPTRDYNNVGSHCDVKVG-VDSKTVAPRYFRSKNVDRVPA 235

Query: 1145 GKLQVVPYGPNLKKGNTKRKKCHWCQKSDSWSLIKCSSCQKEFFCMDCIKERYFDSQNEV 966
            GKLQ+VPYG    KG  KRKKCHWCQ+S+S +LI+C SCQ+EFFCMDC+KERYFD+QNE+
Sbjct: 236  GKLQIVPYG---SKG--KRKKCHWCQRSESGNLIQCLSCQREFFCMDCVKERYFDTQNEI 290

Query: 965  KKVCPVCRGTCTCKDCLASQCKDSESKACLSGKSRVDRILHFHYLICMLLPVLKRISEDQ 786
            KK CPVC GTCTCKDC ASQCKDSESK  L+GKS+VDRILHFHYLICMLLPVLK+IS+DQ
Sbjct: 291  KKACPVCCGTCTCKDCSASQCKDSESKEYLTGKSKVDRILHFHYLICMLLPVLKQISKDQ 350

Query: 785  DTELETEAKIKGKSISDIQIKQVEFDCNQKNYCNHCKTPVLDLHRXXXXXXXXXXXXXXX 606
            + ELE EAK+KGK+ISDIQIKQV F  N+KNYCNHCKTP+LDLHR               
Sbjct: 351  NIELEAEAKVKGKNISDIQIKQVGFGYNEKNYCNHCKTPILDLHRSCPSCSYSLCSSCCQ 410

Query: 605  XXSQGRIFGEINSTMLNLPDKRKAYVDSEGHTLDQKAISSDNLTAALILPQQTKCDDIEN 426
              SQG+  GEINS++   P K K    +E H LD+KA SS NLT   +LP+    + I+ 
Sbjct: 411  ELSQGKASGEINSSVFKRPGKMKPCGANESHNLDEKATSSGNLTDTSMLPEWKNGNGIDT 470

Query: 425  ASCPPTGLGGCGKGHLELRCIFPSSWIKEMELKAEEIVCSYDFPETLDKSSSCSLCFDTD 246
             SCPPT LGGCGK HLELR +FPSSWIKEME+KAEEIVCSYDFPET DKSSSCSLCFDTD
Sbjct: 471  LSCPPTELGGCGKSHLELRSVFPSSWIKEMEVKAEEIVCSYDFPETSDKSSSCSLCFDTD 530

Query: 245  HNTNRYKQLQKAALREDPSDNCLFCPTVFDISGDNFEHFQKHWGKGHPIVVRDVLQSTSN 66
            H+TNRYKQLQ+AALRED +DN LFCPTV DISGDNFEHFQKH GKGHPIVV+D L+STSN
Sbjct: 531  HSTNRYKQLQEAALREDSNDNYLFCPTVMDISGDNFEHFQKHCGKGHPIVVQDALRSTSN 590

Query: 65   LSWNPLIMFCTYLEQSITRYE 3
            LSW+PL MFCTYLEQSITRYE
Sbjct: 591  LSWDPLTMFCTYLEQSITRYE 611


>ref|XP_003547131.1| PREDICTED: uncharacterized protein LOC100802129 [Glycine max]
          Length = 951

 Score =  551 bits (1420), Expect = e-154
 Identities = 273/444 (61%), Positives = 323/444 (72%), Gaps = 1/444 (0%)
 Frame = -3

Query: 1331 LHRNYSEGELRRELPNGVMEISPASTPNVGVGSHCDVKVGGADHKAVTPRYFRSKNVERV 1152
            LH N+   EL++ELPNGVM I+                         + RYFRSKN ER 
Sbjct: 118  LHFNH---ELKKELPNGVMAIAS---------------------NMASSRYFRSKNAERG 153

Query: 1151 PAGKLQVVPYGPNLKKGNTKRKKCHWCQKSDSWSLIKCSSCQKEFFCMDCIKERYFDSQN 972
               KLQVV  G ++KKG  +RKKCHWCQ+SDSWSL+ CSSCQ+EFFCM+CIK+RYF +QN
Sbjct: 154  SVSKLQVVQCGQSIKKG--RRKKCHWCQRSDSWSLVMCSSCQREFFCMECIKQRYFATQN 211

Query: 971  EVKKVCPVCRGTCTCKDCLASQCKDSESKACLSGKSRVDRILHFHYLICMLLPVLKRISE 792
            EVK  CPVCRGTCTCKDCL+SQ ++SESK  L+GK+RVDRILHFHYL+CMLLPVLK+I E
Sbjct: 212  EVKMACPVCRGTCTCKDCLSSQYEESESKEYLAGKNRVDRILHFHYLVCMLLPVLKQIKE 271

Query: 791  DQDTELETEAKIKG-KSISDIQIKQVEFDCNQKNYCNHCKTPVLDLHRXXXXXXXXXXXX 615
            D    +E  AKIKG K  SDI IK V+F CN+KNYCN+CKTP+LDLHR            
Sbjct: 272  DHHVGVEKTAKIKGGKRTSDIIIKPVDFVCNEKNYCNYCKTPILDLHRSCLSCSYSLCLS 331

Query: 614  XXXXXSQGRIFGEINSTMLNLPDKRKAYVDSEGHTLDQKAISSDNLTAALILPQQTKCDD 435
                 SQG    EINS++ NLPDK  A + SEGH LD K IS+ NLT    L + T C+ 
Sbjct: 332  CSQALSQGSTSEEINSSISNLPDKINACIFSEGHLLDDKVISNGNLTDTSTLVEWTNCNG 391

Query: 434  IENASCPPTGLGGCGKGHLELRCIFPSSWIKEMELKAEEIVCSYDFPETLDKSSSCSLCF 255
             +  SCPPT LG CG  HL+L+ +FP SWIKEME+KAEEIVCSYDFPETLD+SSSCSLC 
Sbjct: 392  ADIVSCPPTKLGDCGDSHLDLKYVFPLSWIKEMEVKAEEIVCSYDFPETLDRSSSCSLCV 451

Query: 254  DTDHNTNRYKQLQKAALREDPSDNCLFCPTVFDISGDNFEHFQKHWGKGHPIVVRDVLQS 75
            D DH T+RYKQL +AA RED +DN LF PT+ DIS ++FEHF+KHWG GHP+VVRDVLQS
Sbjct: 452  DKDHKTSRYKQLPEAAQREDSNDNFLFYPTILDISCNHFEHFRKHWGIGHPVVVRDVLQS 511

Query: 74   TSNLSWNPLIMFCTYLEQSITRYE 3
              NLSW+PL+MFCTYLE+S+TRYE
Sbjct: 512  MPNLSWDPLVMFCTYLERSMTRYE 535


>ref|XP_003541752.1| PREDICTED: uncharacterized protein LOC100799234 [Glycine max]
          Length = 922

 Score =  547 bits (1410), Expect = e-153
 Identities = 265/434 (61%), Positives = 314/434 (72%)
 Frame = -3

Query: 1304 LRRELPNGVMEISPASTPNVGVGSHCDVKVGGADHKAVTPRYFRSKNVERVPAGKLQVVP 1125
            L++ELPNGVM I+                         + RYFRSKN +     KLQVV 
Sbjct: 97   LKKELPNGVMAIAS---------------------NVASSRYFRSKNADSGSVSKLQVVQ 135

Query: 1124 YGPNLKKGNTKRKKCHWCQKSDSWSLIKCSSCQKEFFCMDCIKERYFDSQNEVKKVCPVC 945
             G ++ KG  +RKKCHWCQ+SDSWSL+ CSSCQ+EFFCM+CIK+RYFD+QNEVK  CPVC
Sbjct: 136  CGRSMNKG--RRKKCHWCQRSDSWSLVMCSSCQREFFCMECIKQRYFDTQNEVKMACPVC 193

Query: 944  RGTCTCKDCLASQCKDSESKACLSGKSRVDRILHFHYLICMLLPVLKRISEDQDTELETE 765
            RGTCTCKDCL+SQ +DSESK  L+GK+RVD ILHFHYL+CMLLPVLK+I ED   ++E  
Sbjct: 194  RGTCTCKDCLSSQYEDSESKEYLAGKNRVDGILHFHYLVCMLLPVLKQIKEDHHVDVEET 253

Query: 764  AKIKGKSISDIQIKQVEFDCNQKNYCNHCKTPVLDLHRXXXXXXXXXXXXXXXXXSQGRI 585
            AK KGK  SDI IK V+F CN+KNYCN+CKTP+LDLHR                 SQG  
Sbjct: 254  AKTKGKRTSDILIKPVDFVCNEKNYCNYCKTPILDLHRSCLSCSYSLCLSCSQALSQGST 313

Query: 584  FGEINSTMLNLPDKRKAYVDSEGHTLDQKAISSDNLTAALILPQQTKCDDIENASCPPTG 405
              EINS++ NLPDK  A + SE H LD K IS+ NLT    L + T C+     SCPPT 
Sbjct: 314  SEEINSSISNLPDKINACISSESHLLDDKVISNGNLTDTSTLLEWTNCNGAGIVSCPPTK 373

Query: 404  LGGCGKGHLELRCIFPSSWIKEMELKAEEIVCSYDFPETLDKSSSCSLCFDTDHNTNRYK 225
            LG CG  HL+L+ +FP SWIKEME+KAEEIVCSYDFPET DKSSSCSLC D DH T+RYK
Sbjct: 374  LGDCGDNHLDLKYVFPLSWIKEMEVKAEEIVCSYDFPETSDKSSSCSLCVDKDHKTSRYK 433

Query: 224  QLQKAALREDPSDNCLFCPTVFDISGDNFEHFQKHWGKGHPIVVRDVLQSTSNLSWNPLI 45
            QL +AA RED +DN LF PT+ DIS ++FEHF+KHWGKGHP+VVRDVLQ T NLSW+P++
Sbjct: 434  QLPEAAQREDSNDNYLFYPTILDISCNHFEHFRKHWGKGHPVVVRDVLQCTPNLSWDPVV 493

Query: 44   MFCTYLEQSITRYE 3
            MFCTYLE+S+TRYE
Sbjct: 494  MFCTYLERSMTRYE 507


>ref|XP_002524700.1| conserved hypothetical protein [Ricinus communis]
            gi|223536061|gb|EEF37719.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 1033

 Score =  438 bits (1126), Expect = e-120
 Identities = 224/459 (48%), Positives = 295/459 (64%), Gaps = 19/459 (4%)
 Frame = -3

Query: 1322 NYSEGELRRELPNGVMEISPASTPNVGVGS----HCDVKVGGA--DHKAVTPRYFRSKNV 1161
            N  EGEL R+LPNG+M ISPA        S     CD+K+GGA  D  A T R FRSKN+
Sbjct: 163  NSEEGELMRDLPNGLMAISPAKHNLSNAASCSTTPCDIKIGGAAADSSAFTRRCFRSKNI 222

Query: 1160 ERVPAGKLQVVPYGPN---LKKGNTKRKKCHWCQKSDSWSLIKCSSCQKEFFCMDCIKER 990
            E +P G LQVVP+  +   L+KG  KRKKCH+C++S   +LI+CSSC+K+FFCMDCIK++
Sbjct: 223  EPMPIGTLQVVPFKKDMVRLRKG--KRKKCHFCRRSGLKTLIRCSSCRKQFFCMDCIKDQ 280

Query: 989  YFDSQNEVKKVCPVCRGTCTCKDCLASQCKDSESKACLSGKSRVDRILHFHYLICMLLPV 810
            YF+ Q EVK  C VCRGTC+CK C A QC++ E K     KS+V+++LHFHYLICMLLPV
Sbjct: 281  YFNMQEEVKIACSVCRGTCSCKACSAIQCRNIECKGFSKDKSKVNKVLHFHYLICMLLPV 340

Query: 809  LKRISEDQDTELETEAKIKGKSISDIQIKQVEFDCNQKNYCNHCKTPVLDLHRXXXXXXX 630
            LK I++DQ  ELE EAKI+G+  SD+QI+Q E  CN++  C++CKT ++D HR       
Sbjct: 341  LKEINQDQSIELEIEAKIRGQKPSDLQIQQAEVGCNKRWCCDNCKTSIMDFHRSCPSCSY 400

Query: 629  XXXXXXXXXXSQGRIFGEINSTMLNLPDKRKAYVDSEGHTLDQKAIS----------SDN 480
                       QG +   +   +   P+++KA +  +  + + K++           SD 
Sbjct: 401  NLCLSCCQDIYQGSLLRSVKGLLCKCPNRKKACLSGKQFS-EMKSVCTYKQNNGIKYSDF 459

Query: 479  LTAALILPQQTKCDDIENASCPPTGLGGCGKGHLELRCIFPSSWIKEMELKAEEIVCSYD 300
              + L L      D      CPPT  GGCGK  L+L CIFPSSW KE+E+ AEEI+  Y+
Sbjct: 460  SMSLLSLKAP---DGNGGIPCPPTEFGGCGKSLLDLCCIFPSSWTKELEISAEEIIGCYE 516

Query: 299  FPETLDKSSSCSLCFDTDHNTNRYKQLQKAALREDPSDNCLFCPTVFDISGDNFEHFQKH 120
             PET+D  S CSLC   D   N   QLQ+AA RE+ +DN L+ PTV DI  DN EHFQKH
Sbjct: 517  LPETVDVFSRCSLCIGMDCEVNESLQLQEAATREESNDNFLYYPTVVDIHSDNLEHFQKH 576

Query: 119  WGKGHPIVVRDVLQSTSNLSWNPLIMFCTYLEQSITRYE 3
            WGKG P++VR+VLQ TS+LSW+P++MFCTYL+ +  + E
Sbjct: 577  WGKGQPVIVRNVLQGTSDLSWDPIVMFCTYLKNNAAKSE 615

