BLASTX nr result

ID: Glycyrrhiza23_contig00020298 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00020298
         (1957 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003636796.1| Cysteine proteinase [Medicago truncatula] gi...   795   0.0  
ref|XP_003542981.1| PREDICTED: cysteine proteinase RD21a-like [G...   748   0.0  
ref|XP_003590277.1| Cysteine protease [Medicago truncatula] gi|3...   743   0.0  
ref|XP_002266308.2| PREDICTED: oryzain alpha chain-like [Vitis v...   684   0.0  
ref|XP_002305743.1| predicted protein [Populus trichocarpa] gi|2...   669   0.0  

>ref|XP_003636796.1| Cysteine proteinase [Medicago truncatula] gi|355502731|gb|AES83934.1|
            Cysteine proteinase [Medicago truncatula]
          Length = 475

 Score =  795 bits (2053), Expect = 0.0
 Identities = 383/506 (75%), Positives = 417/506 (82%), Gaps = 1/506 (0%)
 Frame = -2

Query: 1851 MGSHQKNQXXXXXXXLIWGSWAFL-CYGISSDEYSILALDLDKLPSEEQVVELFQQWKQD 1675
            MGSHQKN        L+W S  FL CYGI S EYSILA DL+K PSEEQVVELFQQWK++
Sbjct: 1    MGSHQKN--LLLLFTLLWCSLTFLSCYGIPS-EYSILAFDLNKFPSEEQVVELFQQWKKE 57

Query: 1674 HQKFYRHPEEAALRLENFKRNLKYIIEKNAMRNSPLSHRLGLNRFADMSNEEFRNKFISK 1495
            HQKFY HPEEAALRLENFKRNLKYI+E+NAMRNSP+ H LGLNRFADMSNEEF+NKFISK
Sbjct: 58   HQKFYIHPEEAALRLENFKRNLKYIVERNAMRNSPVGHHLGLNRFADMSNEEFKNKFISK 117

Query: 1494 VKKPFGKRNSLTRSSGLHVKDDSCEDAPSSLDWRKKGAVTGVKDQGNCGSCWAFSSTGAM 1315
            V                    +SC+DAP SLDWRKKG VTGVKDQGNCGSCW+FSSTGA+
Sbjct: 118  V--------------------ESCDDAPYSLDWRKKGVVTGVKDQGNCGSCWSFSSTGAI 157

Query: 1314 EGVNAIVTGDLISLSEQELVDCDTSNDGCDGGYMDYAFEWVINNGGIDTEANYPYTGLDG 1135
            EGVNAIVTGDLISLSEQELVDCDT+NDGC+GGYMDYAFEWVINNGGIDTEA+YPY G+ G
Sbjct: 158  EGVNAIVTGDLISLSEQELVDCDTTNDGCEGGYMDYAFEWVINNGGIDTEADYPYIGVGG 217

Query: 1134 TCNVTKEETKVVSIDGYTDVAQSDSAVLCATVKQPISAGIDGGSLDFQLYTGGIYDGDCS 955
            TCNVTKEETKVV+IDGYTDV QSDSA+ CATVKQPIS GIDG +LDFQLYTGGIYDGDCS
Sbjct: 218  TCNVTKEETKVVTIDGYTDVTQSDSALFCATVKQPISVGIDGSTLDFQLYTGGIYDGDCS 277

Query: 954  SNPDDIDHAILIVGYGSEGDEDYWIVKNSWGTSWGMEGYIYIRRNTNLKYGVCAINYMAS 775
            SNPDDIDHA+LIVGYGS+G++DYWIVKNSWGTSWG+EG+IYIRRNTNLKYGVCAINYMAS
Sbjct: 278  SNPDDIDHAVLIVGYGSDGNQDYWIVKNSWGTSWGIEGFIYIRRNTNLKYGVCAINYMAS 337

Query: 774  YPTKESTAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSQCGDFSYCPFDETCCCLYEFF 595
            +PTKEST+                              S+CGDFSYC  +ETCCCLYE F
Sbjct: 338  FPTKESTS--------ISPTSPPSPPSPPPPTPPSPTPSKCGDFSYCTTEETCCCLYELF 389

Query: 594  DFCLIYGCCEYENAVCCTGTDYCCPSDYPICDIEDGLCLQNSGDIMGVAAKKKKMGRHKF 415
            DFCL YGCCEYENAVCCTGT YCCPSDYPICD EDGLCLQN GD+MGVAAKKKKMG+HKF
Sbjct: 390  DFCLAYGCCEYENAVCCTGTKYCCPSDYPICDTEDGLCLQNYGDLMGVAAKKKKMGKHKF 449

Query: 414  PWTKFEQTKMTHYILQMRRNPFAAMR 337
            PWTK+EQTK THY LQ+RR  FA +R
Sbjct: 450  PWTKYEQTKKTHYPLQLRRGAFATVR 475


>ref|XP_003542981.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 517

 Score =  748 bits (1931), Expect = 0.0
 Identities = 357/509 (70%), Positives = 395/509 (77%), Gaps = 21/509 (4%)
 Frame = -2

Query: 1803 IWGSWAFLCYGISSDEYSILALDLDKLPSEEQVVELFQQWKQDHQKFYRHPEEAALRLEN 1624
            +WGSW FLCYG+ S EYSILAL++DK PSEE V+ELFQ+WK++++K YR P++  LR EN
Sbjct: 15   VWGSWTFLCYGLPS-EYSILALEIDKFPSEEGVIELFQRWKEENKKIYRSPDQEKLRFEN 73

Query: 1623 FKRNLKYIIEKNAMRNSPLSHRLGLNRFADMSNEEFRNKFISKVKKPFGKRNSLTRSSGL 1444
            FKRNLKYI EKN+ R SP    LGLNRFADMSNEEF++KF SKVKKPF KRN      GL
Sbjct: 74   FKRNLKYIAEKNSKRISPYGQSLGLNRFADMSNEEFKSKFTSKVKKPFSKRN------GL 127

Query: 1443 HVKDDSCEDAPSSLDWRKKGAVTGVKDQGNCGSCWAFSSTGAMEGVNAIVTGDLISLSEQ 1264
              KD SCEDAP SLDWRKKG VT VKDQG CG CWAFSSTGA+EG+NAIV+GDLISLSE 
Sbjct: 128  SGKDHSCEDAPYSLDWRKKGVVTAVKDQGYCGCCWAFSSTGAIEGINAIVSGDLISLSEP 187

Query: 1263 ELVDCDTSNDGCDGGYMDYAFEWVINNGGIDTEANYPYTGLDGTCNVTKEETKVVSIDGY 1084
            ELVDCD +NDGCDGG+MDYAFEWV++NGGIDTE NYPY+G DGTCNV KEETKV+ IDGY
Sbjct: 188  ELVDCDRTNDGCDGGHMDYAFEWVMHNGGIDTETNYPYSGADGTCNVAKEETKVIGIDGY 247

Query: 1083 TDVAQSDSAVLCATVKQPISAGIDGGSLDFQLYTGGIYDGDCSSNPDDIDHAILIVGYGS 904
             +V QSD ++LCATVKQPISAGIDG S DFQLY GGIYDGDCSS+PDDIDHAIL+VGYGS
Sbjct: 248  YNVEQSDRSLLCATVKQPISAGIDGSSWDFQLYIGGIYDGDCSSDPDDIDHAILVVGYGS 307

Query: 903  EGDEDYWIVKNSWGTSWGMEGYIYIRRNTNLKYGVCAINYMASYPTKESTA--------- 751
            EGDEDYWIVKNSWGTSWGMEGYIYIRRNTNLKYGVCAINYMASYPTKE TA         
Sbjct: 308  EGDEDYWIVKNSWGTSWGMEGYIYIRRNTNLKYGVCAINYMASYPTKEPTAPSPSSPPSP 367

Query: 750  ------------XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSQCGDFSYCPFDETCCCL 607
                                                      S+CG FSYCP  ETCCCL
Sbjct: 368  PSSPPPSPLTPPALPPPSPPATPPLSPPLPPATPPPLPPPPPSKCGQFSYCPAHETCCCL 427

Query: 606  YEFFDFCLIYGCCEYENAVCCTGTDYCCPSDYPICDIEDGLCLQNSGDIMGVAAKKKKMG 427
            YEFF FCL+YGCCEY+NAVCC  T+YCCPSDYPICDI DGLCLQ  GD+MGVAAKK K G
Sbjct: 428  YEFFGFCLVYGCCEYKNAVCCIWTEYCCPSDYPICDIRDGLCLQKHGDLMGVAAKKIKKG 487

Query: 426  RHKFPWTKFEQTKMTHYILQMRRNPFAAM 340
            RHK PWTKFEQT+ T++ LQ  RN FAA+
Sbjct: 488  RHKLPWTKFEQTEKTYHHLQTGRNAFAAV 516


>ref|XP_003590277.1| Cysteine protease [Medicago truncatula] gi|355479325|gb|AES60528.1|
            Cysteine protease [Medicago truncatula]
          Length = 514

 Score =  743 bits (1917), Expect = 0.0
 Identities = 366/519 (70%), Positives = 399/519 (76%), Gaps = 45/519 (8%)
 Frame = -2

Query: 1845 SHQKNQXXXXXXXLIWGSWAFL-CYGISSDEYSILALDLDKLPSEEQVVELFQQWKQDHQ 1669
            S Q  +       L+W S  FL CYGI S EYSILA DL+K PSEEQVVELFQQWK++HQ
Sbjct: 2    SQQSKKNLLLLFTLLWCSLTFLSCYGIPS-EYSILAFDLNKFPSEEQVVELFQQWKKEHQ 60

Query: 1668 KFYRHPEEAALRLENFKRNLKYIIEKNAMRNSPLSHRLGLNRFADMSNEEFRNKFISKVK 1489
            KFY HPEEAALRLENFKRNLKYI+E+NAMRNSP+ H LGLNRFADMSNEEF+NKFISKVK
Sbjct: 61   KFYIHPEEAALRLENFKRNLKYIVERNAMRNSPVGHHLGLNRFADMSNEEFKNKFISKVK 120

Query: 1488 KPFGKRNSLTRSSGLHVKDDSCEDAPSSLDWRKKGAVTGVKDQGNCG------------- 1348
            KP  KR     +S LHVK +SC+DAP SLDWRKKG VTGVKDQGNCG             
Sbjct: 121  KPISKR-----ASNLHVKVESCDDAPYSLDWRKKGVVTGVKDQGNCGKLLYFMHFKSFLV 175

Query: 1347 -------------------------------SCWAFSSTGAMEGVNAIVTGDLISLSEQE 1261
                                           SCW+FSSTGA+EGVNAIVTGDLISLSEQE
Sbjct: 176  IYILELTTNFPLYSFESQFCILEKKKLDFVGSCWSFSSTGAIEGVNAIVTGDLISLSEQE 235

Query: 1260 LVDCDTSNDGCDGGYMDYAFEWVINNGGIDTEANYPYTGLDGTCNVTKEETKVVSIDGYT 1081
            LVDCDT+NDGC+GGYMDYAFEWVINNGGIDTEA+YPY G+ GTCNVTKEETKVV+IDGYT
Sbjct: 236  LVDCDTTNDGCEGGYMDYAFEWVINNGGIDTEADYPYIGVGGTCNVTKEETKVVTIDGYT 295

Query: 1080 DVAQSDSAVLCATVKQPISAGIDGGSLDFQLYTGGIYDGDCSSNPDDIDHAILIVGYGSE 901
            DV QSDSA+ CATVKQPIS GIDG +LDFQLYTGGIYDGDCSSNPDDIDHA+LIVGYGS+
Sbjct: 296  DVTQSDSALFCATVKQPISVGIDGSTLDFQLYTGGIYDGDCSSNPDDIDHAVLIVGYGSD 355

Query: 900  GDEDYWIVKNSWGTSWGMEGYIYIRRNTNLKYGVCAINYMASYPTKESTAXXXXXXXXXX 721
            G++DYWIVKNSWGTSWG+EG+IYIRRNTNLKYGVCAINYMAS+PTKEST+          
Sbjct: 356  GNQDYWIVKNSWGTSWGIEGFIYIRRNTNLKYGVCAINYMASFPTKESTS--------IS 407

Query: 720  XXXXXXXXXXXXXXXXXXXXSQCGDFSYCPFDETCCCLYEFFDFCLIYGCCEYENAVCCT 541
                                S+CGDFSYC  +ETCCCLYE FDFCL YGCCEYENAVCCT
Sbjct: 408  PTSPPSPPSPPPPTPPSPTPSKCGDFSYCTTEETCCCLYELFDFCLAYGCCEYENAVCCT 467

Query: 540  GTDYCCPSDYPICDIEDGLCLQNSGDIMGVAAKKKKMGR 424
            GT YCCPSDYPICD EDGLCLQN GD+MGVAAKKKK G+
Sbjct: 468  GTKYCCPSDYPICDTEDGLCLQNYGDLMGVAAKKKKNGK 506


>ref|XP_002266308.2| PREDICTED: oryzain alpha chain-like [Vitis vinifera]
          Length = 501

 Score =  684 bits (1766), Expect = 0.0
 Identities = 321/495 (64%), Positives = 381/495 (76%), Gaps = 6/495 (1%)
 Frame = -2

Query: 1803 IWGSWAFLCYGISSDEYSILALDLDKLPSEEQVVELFQQWKQDHQKFYRHPEEAALRLEN 1624
            IW S A L   + ++ Y    +  ++  SEE+V ELF  WK+ H++ Y+H EE A R E 
Sbjct: 14   IWASLACLSSSLPTEFY----ITGEEFASEERVRELFHLWKERHKRVYKHAEETAKRFEI 69

Query: 1623 FKRNLKYIIEKNAMRNSPLSHRLGLNRFADMSNEEFRNKFISKVKKPFGKRNSLTRSSGL 1444
            FK NLKY+IE+N+  +    H LG+N+FADMSNEEF+ K++SK+KKP  K+N+  R S  
Sbjct: 70   FKENLKYVIERNSKGHR---HTLGMNKFADMSNEEFKEKYLSKIKKPINKKNNYLRRSMQ 126

Query: 1443 HVKDDSCEDAPSSLDWRKKGAVTGVKDQGNCGSCWAFSSTGAMEGVNAIVTGDLISLSEQ 1264
              K  +  +APSSLDWRKKG VTG+KDQG+CGSCWAFSSTGAMEG+NAIVTGDLISLSEQ
Sbjct: 127  QKKGTASCEAPSSLDWRKKGVVTGIKDQGDCGSCWAFSSTGAMEGINAIVTGDLISLSEQ 186

Query: 1263 ELVDCDTSNDGCDGGYMDYAFEWVINNGGIDTEANYPYTGLDGTCNVTKEETKVVSIDGY 1084
            ELVDCDT+N GC+GGYMDYAFEWVI+NGGID+E++YPYTG DGTCN TKE+TKVVSIDGY
Sbjct: 187  ELVDCDTTNYGCEGGYMDYAFEWVISNGGIDSESDYPYTGTDGTCNTTKEDTKVVSIDGY 246

Query: 1083 TDVAQSDSAVLCATVKQPISAGIDGGSLDFQLYTGGIYDGDCSSNPDDIDHAILIVGYGS 904
             DV +SDSA+LCA V QPIS G+DG +LDFQLYT GIY GDCS +PDDIDHA+LIVGYGS
Sbjct: 247  KDVDESDSALLCAAVNQPISVGMDGSALDFQLYTSGIYAGDCSDDPDDIDHAVLIVGYGS 306

Query: 903  EGDEDYWIVKNSWGTSWGMEGYIYIRRNTNLKYGVCAINYMASYPTKEST------AXXX 742
            E  EDYWI KNSWGTSWGMEGY YI+RNT+L YG CAIN MASYPTKES+      +   
Sbjct: 307  EDSEDYWICKNSWGTSWGMEGYFYIKRNTDLPYGECAINAMASYPTKESSSPSPYPSPAV 366

Query: 741  XXXXXXXXXXXXXXXXXXXXXXXXXXXSQCGDFSYCPFDETCCCLYEFFDFCLIYGCCEY 562
                                       S+CGDFSYCP DETCCC+YEF+DFCLIYGCCEY
Sbjct: 367  PPPPPPPPSPPPPPPPSPPPPSPGPSPSECGDFSYCPSDETCCCIYEFYDFCLIYGCCEY 426

Query: 561  ENAVCCTGTDYCCPSDYPICDIEDGLCLQNSGDIMGVAAKKKKMGRHKFPWTKFEQTKMT 382
            ENAVCCTGT+YCCPSDYPICD+E+GLCL+N GD +GVAAKK+KM +HKFPWTK E+T+ T
Sbjct: 427  ENAVCCTGTEYCCPSDYPICDVEEGLCLKNQGDYLGVAAKKRKMAKHKFPWTKIEETQKT 486

Query: 381  HYILQMRRNPFAAMR 337
            +  L+ +RN FAAMR
Sbjct: 487  YQPLEWKRNRFAAMR 501


>ref|XP_002305743.1| predicted protein [Populus trichocarpa] gi|222848707|gb|EEE86254.1|
            predicted protein [Populus trichocarpa]
          Length = 494

 Score =  669 bits (1727), Expect = 0.0
 Identities = 314/476 (65%), Positives = 365/476 (76%), Gaps = 2/476 (0%)
 Frame = -2

Query: 1758 EYSILALDLDKLPSEEQVVELFQQWKQDHQKFYRHPEEAALRLENFKRNLKYIIEKNAMR 1579
            EYSI+  D  +LP +E ++E+FQQW+  HQK Y+H EEA  R  NFKRNLKYIIEK   +
Sbjct: 22   EYSIVGNDFSELPPDESIIEIFQQWRDRHQKAYKHAEEAEKRFGNFKRNLKYIIEKTG-K 80

Query: 1578 NSPLSHRLGLNRFADMSNEEFRNKFISKVKKPFGKRNSLTRSSGLHVKDDSCEDAPSSLD 1399
             + L HR+GLN+FAD+SNEEF+  ++SKVKKP  K   +           SC DAPSSLD
Sbjct: 81   ETTLRHRVGLNKFADLSNEEFKQLYLSKVKKPINKTR-IDAEDRSRRNLQSC-DAPSSLD 138

Query: 1398 WRKKGAVTGVKDQGNCGSCWAFSSTGAMEGVNAIVTGDLISLSEQELVDCDTSNDGCDGG 1219
            WRKKG VT VKDQG+CGSCW+FS+TGA+EG+NAIVT DLISLSEQELVDCDT+N GC+GG
Sbjct: 139  WRKKGVVTAVKDQGDCGSCWSFSTTGAIEGINAIVTSDLISLSEQELVDCDTTNYGCEGG 198

Query: 1218 YMDYAFEWVINNGGIDTEANYPYTGLDGTCNVTKEETKVVSIDGYTDVAQSDSAVLCATV 1039
            YMDYAFEWVINNGGIDTEANYPYTG+DGTCN  KEE KVVSIDGY DV ++DSA+LCA  
Sbjct: 199  YMDYAFEWVINNGGIDTEANYPYTGVDGTCNTAKEEIKVVSIDGYKDVDETDSALLCAAA 258

Query: 1038 KQPISAGIDGGSLDFQLYTGGIYDGDCSSNPDDIDHAILIVGYGSEGDEDYWIVKNSWGT 859
            +QPIS GIDG ++DFQLYTGGIYDGDCS +PDDIDHA+LIVGYGSE  EDYWIVKNSWGT
Sbjct: 259  QQPISVGIDGSAIDFQLYTGGIYDGDCSDDPDDIDHAVLIVGYGSENGEDYWIVKNSWGT 318

Query: 858  SWGMEGYIYIRRNTNLKYGVCAINYMASYPTKESTA-XXXXXXXXXXXXXXXXXXXXXXX 682
            SWG+EGY YI+RNT+L YGVCAIN MASYPTKE++A                        
Sbjct: 319  SWGIEGYFYIKRNTDLPYGVCAINAMASYPTKEASAQSPTSPPSPPSPPPPPPPPPTPVP 378

Query: 681  XXXXXXXSQCGDFSYCPFDETCCCLYEFFDFCLIYGCCEYENAVCCTGTDYCCPSDYPIC 502
                   S CGDFSYCP DETCCC+   FD+CL+YGCC YENAVCC  + YCCPSDYPIC
Sbjct: 379  PPPSPQPSDCGDFSYCPSDETCCCILNVFDYCLVYGCCAYENAVCCADSVYCCPSDYPIC 438

Query: 501  DIEDGLCLQNSGDIMGVAAKKKKMGRHKFPWTKF-EQTKMTHYILQMRRNPFAAMR 337
            D+E+GLCL+  GD +GVAA K+ M +HKFPWTK  E+ K  H +LQ +RNPFAAMR
Sbjct: 439  DVEEGLCLKGQGDYLGVAASKRHMAKHKFPWTKLQERAKTDHRVLQWKRNPFAAMR 494


Top