BLASTX nr result

ID: Glycyrrhiza23_contig00013744 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00013744
         (1662 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003540701.1| PREDICTED: uncharacterized protein LOC100800...   526   e-147
ref|XP_003597511.1| hypothetical protein MTR_2g098830 [Medicago ...   468   e-129
emb|CBI36653.3| unnamed protein product [Vitis vinifera]              299   2e-78
ref|XP_002533047.1| hypothetical protein RCOM_0068670 [Ricinus c...   241   3e-61
gb|AAF80120.1|AC024174_2 Contains similarity to an unknown prote...   200   1e-48

>ref|XP_003540701.1| PREDICTED: uncharacterized protein LOC100800099 [Glycine max]
          Length = 677

 Score =  526 bits (1355), Expect = e-147
 Identities = 291/472 (61%), Positives = 342/472 (72%), Gaps = 1/472 (0%)
 Frame = +3

Query: 18   YKKKRVIKKPTKDESKVDEAEFLQVGYSAVKEAIGINNTDIMLLESYTVYSQSKEKAASR 197
            YKK+RVIKK  + E KVDE  FLQVGYSAVKEA GINNTDIMLLES TVYS+SKEK +  
Sbjct: 213  YKKRRVIKKSAQKELKVDEDVFLQVGYSAVKEATGINNTDIMLLESGTVYSESKEKLSDD 272

Query: 198  FYIMQCSQSINQEVIQVPLKNVIESLQGPLVKKSSSSWTITPVVAYFHVLPYSEIISKWI 377
                + S S ++E+   P+      LQGPLV KSS SW IT V+ YFHVLPYSEIISKWI
Sbjct: 273  SG-PRGSVSDDREMGLFPV-----CLQGPLVSKSSRSWMITSVIDYFHVLPYSEIISKWI 326

Query: 378  SREAFSNSLQNSRVTEKNIMVDSPEVTESYVTRDMFTGLDSKPSKDTIELLEQKENKGSC 557
            SR AFS SLQ+SRVTEKNI V++PE T+ YV +D FT LDSKP+ D I+L +QKE+ GSC
Sbjct: 327  SRGAFSTSLQDSRVTEKNIKVNTPEATDFYVNKDTFTALDSKPNSDNIDLPKQKEHHGSC 386

Query: 558  TLRLSDSIKEPREMDVNKPSMFPSKNKEKCQN-IANTVQVGEDQEENNPSVQYNSNGXXX 734
            T  LS  I EP EMDVN+ S+F S+NKEKCQ  I NTVQVG DQE+N  S++YNSN    
Sbjct: 387  TPALSYYINEPIEMDVNENSIFKSQNKEKCQYIIGNTVQVGVDQEKNYLSLKYNSNAYAS 446

Query: 735  XXXXXXXXXTRMLITEGEINNIASCHKIRANGPNSSYKKDTIDVCTQSAKHSNSDTEKLQ 914
                     TRMLI EG +NN+AS H   ANGPN+S +K  +   T +A HSNSD EKLQ
Sbjct: 447  AVKALKVDSTRMLIAEGGVNNLASLHNNYANGPNTSSEKGILVNYTPTANHSNSDLEKLQ 506

Query: 915  ILLDSKKILSRTALTALIRKRNELALQQRKIEEEIAICDKKIQRLSKGDVEDNFELKIES 1094
            IL DSKKILS+TAL ALIRKRNELALQQRKIE+EIA+CDKKIQR+   D EDNF+LKIES
Sbjct: 507  ILSDSKKILSQTALAALIRKRNELALQQRKIEDEIAVCDKKIQRMLT-DGEDNFKLKIES 565

Query: 1095 IVDGCNDIWVSNQERMCGQQSFPLEKKKLSEAVFIRLSPCQELDGVCHENNWVLPTYHLS 1274
            I++GCN  WV NQER   QQS P E+KKLSEAVF+  SP QELD +  EN WVLPTYHLS
Sbjct: 566  IIEGCNGTWVRNQERTSEQQSLPFERKKLSEAVFLTQSPFQELDNIFRENYWVLPTYHLS 625

Query: 1275 QSDGGFQANVTVNGVGFRCSLAGSVCXXXXXXXXXXXXMMLTNLRSMAKVAK 1430
             ++GGF+ANV V G  F+CS  G +              MLT+ ++MAK+A+
Sbjct: 626  HANGGFKANVIVKGEDFQCSFEGIMGSNPPEARESAAAQMLTHFKNMAKLAQ 677


>ref|XP_003597511.1| hypothetical protein MTR_2g098830 [Medicago truncatula]
            gi|355486559|gb|AES67762.1| hypothetical protein
            MTR_2g098830 [Medicago truncatula]
          Length = 1588

 Score =  468 bits (1205), Expect = e-129
 Identities = 259/482 (53%), Positives = 327/482 (67%), Gaps = 10/482 (2%)
 Frame = +3

Query: 3    ETKHAYKKKRVIKKPTKDESKVDEAEFLQVGYSAVKEAIG--INNTDIMLLESYTVYSQS 176
            E KH Y+K+RVI+ PTK+   VD+ EFLQVGYSAVKEA G  +N+ DIMLLESYTVYSQ 
Sbjct: 1113 ELKHTYQKRRVIQNPTKNGLNVDDDEFLQVGYSAVKEATGQGVNSNDIMLLESYTVYSQR 1172

Query: 177  KEKAASRFYIMQCSQSINQEVIQVPLKNVIESLQGPLVKKSSSSWTITPVVAYFHVLPYS 356
            KEK ASRFYIM+CSQS     IQVP+K++IES +GPL+KKSSSSWTIT VV YFHVLPYS
Sbjct: 1173 KEKTASRFYIMKCSQSTADGSIQVPIKDLIESFRGPLLKKSSSSWTITSVVEYFHVLPYS 1232

Query: 357  EIISKWISREAFSNSLQNSRVTEKNIMVDSPEVTESYV-TRDMFTGLDSKPSKDTIELLE 533
            EIIS WISRE FSNSLQ+S++ EK       EVTES+V ++ ++  LD+KP  DT   L 
Sbjct: 1233 EIISDWISRETFSNSLQDSKLAEKQF--PKHEVTESHVSSKGLYIDLDNKPGSDTKVALN 1290

Query: 534  QKENKGSCTLRLSDSIKEPREMDVNKPSMFPSKNKEKCQNIANTVQVGEDQEENNPSVQY 713
            QKE  G    +  DS+KE  +MDV+K  + PSKNKE  ++ ANT+ + EDQ+  NPSVQ+
Sbjct: 1291 QKEKNGCGITKRCDSVKEDWDMDVDKSLVLPSKNKECQKHTANTLHISEDQKIENPSVQH 1350

Query: 714  NSNGXXXXXXXXXXXXTRMLITEGEINNIASCHKIRANGPNSSYKKDTIDVCTQSAKHSN 893
            +SN              R  ITEG I + ++  KI A    ++++ D+++ C  +A  SN
Sbjct: 1351 HSNECTRPSKAEKAVSKRKHITEGGIKDQSAFDKICA---GTTFENDSVEKCILNANSSN 1407

Query: 894  SDTEKLQILLDSK-KILSRTALTALIRKRNELALQQRKIEEEIAICDKKIQRLSKGDVED 1070
             + EK+Q  + SK  ILS+TAL ALIRKRN LALQQR IE+E+A+C+ KI R   G+ ED
Sbjct: 1408 KNLEKIQTFIASKGTILSQTALNALIRKRNALALQQRAIEDEMAVCNMKIHRWLAGE-ED 1466

Query: 1071 NFELKIESIVDGCNDIWVSNQERMCGQ----QSFP--LEKKKLSEAVFIRLSPCQELDGV 1232
            +FELK+ES+++GCN  W+ NQ RMC Q    Q  P  ++ K+L+EAV    SPCQELDG+
Sbjct: 1467 DFELKLESVIEGCNGTWLRNQGRMCSQYLDDQCLPQSVKSKRLTEAVLTLHSPCQELDGI 1526

Query: 1233 CHENNWVLPTYHLSQSDGGFQANVTVNGVGFRCSLAGSVCXXXXXXXXXXXXMMLTNLRS 1412
            CHENNW+LPTY +S SDG F A V V GV F  S  G+ C             MLT  RS
Sbjct: 1527 CHENNWILPTYSVSLSDGEFHATVRVKGVDFEYSCEGNTCPFPREARDSAAAQMLTKFRS 1586

Query: 1413 MA 1418
            MA
Sbjct: 1587 MA 1588



 Score =  214 bits (546), Expect = 4e-53
 Identities = 144/382 (37%), Positives = 207/382 (54%), Gaps = 61/382 (15%)
 Frame = +3

Query: 18   YKKKRVIKKPTKDESKVDEAEFLQVGYSAVKEAIGINNTDIMLLESYTVYSQSKEKAASR 197
            YKK+RV+KKP+++ S VDE   LQVGYSAVKEA G+N+ DIMLLESYTVYSQSKEK +SR
Sbjct: 193  YKKRRVVKKPSRNGSNVDEDRILQVGYSAVKEAAGVNSIDIMLLESYTVYSQSKEKTSSR 252

Query: 198  FYIMQCSQSINQEVIQVPLKNVIES--------------------LQGPLVKKSSSSWTI 317
            FYIM+CSQSI++   QVP+K++IE                     ++GPLVK+SS SW +
Sbjct: 253  FYIMKCSQSIDEGFTQVPIKDLIERFVTIGMVFLLADHSGLEGEFVRGPLVKRSSDSWKV 312

Query: 318  TPVVAYFHVLPYSEIISKWISREAFSNSLQNSRVTEK-----------------NIMVDS 446
            TPVV YFH+LPYS+IIS+WISRE FSNSLQ+S++ EK                 ++ +D+
Sbjct: 313  TPVVEYFHMLPYSKIISEWISRETFSNSLQDSKLAEKQFPKLEVKESHISSKALSVGLDN 372

Query: 447  PEVTESYV------------------TRDMFTGLDSKPSKDTIELLEQKENKGSCTLRLS 572
             + +E+ V                  +  M  GLD+K   DTI  L QKE  G  T+   
Sbjct: 373  KQCSETIVALNQKQLLKLEVKEMHVSSEGMSAGLDNKACSDTIVTLNQKEKNGCGTITQC 432

Query: 573  DSIKEPREMDVNKPSMFPSKNKEKCQNIANT-VQVGEDQEENNPSV----QYNSNGXXXX 737
             S+K+ ++MDV+  S   +  +    ++++  + VG D ++++ ++    Q  +NG    
Sbjct: 433  GSVKKDQDMDVDNSSRVKTNLEVTESHVSSEGMSVGLDNKQSSDTIAALNQKENNGC--- 489

Query: 738  XXXXXXXXTRMLITEGEINNIASCHKIR-ANGPNSSYKKDTIDVCTQSAKHSNSDTEKLQ 914
                           G I    S  K    +  NSS KK  ++V       S+  +E + 
Sbjct: 490  ---------------GIITQCGSVKKDEDMDVDNSSIKKTNLEV-----TESHVSSEGMS 529

Query: 915  ILLDSKKILSRTALTALIRKRN 980
            + LD+K       + AL +K N
Sbjct: 530  VDLDNKP--CSDTIAALNQKEN 549



 Score =  109 bits (272), Expect = 2e-21
 Identities = 68/169 (40%), Positives = 103/169 (60%), Gaps = 2/169 (1%)
 Frame = +3

Query: 450  EVTESYVTRD-MFTGLDSKPSKDTIELLEQKENKGSCTLRLSDSIKEPREMDVNKPSMFP 626
            EVTES+V+ + M  GLD+KP  DTI  L+QKEN     +    S+KE ++MDV+  S FP
Sbjct: 769  EVTESHVSSEGMSVGLDNKPCSDTIVALDQKENSCCGKITRCSSVKEDQDMDVDNCSTFP 828

Query: 627  SK-NKEKCQNIANTVQVGEDQEENNPSVQYNSNGXXXXXXXXXXXXTRMLITEGEINNIA 803
            SK N+E  +++ANT+QV EDQ+  N SVQ++SN             TRM I EG I + +
Sbjct: 829  SKLNEEYQKHVANTLQVNEDQKIENSSVQHHSNECTRPSEAEKVVSTRMHIIEGGIKDES 888

Query: 804  SCHKIRANGPNSSYKKDTIDVCTQSAKHSNSDTEKLQILLDSKKILSRT 950
            +  KI     +++ + ++I+ CT  A + N+D EK++  +DSK  + R+
Sbjct: 889  AFDKICV---DATVENESIEKCTPIADNFNADFEKVRSFVDSKGKMGRS 934


>emb|CBI36653.3| unnamed protein product [Vitis vinifera]
          Length = 691

 Score =  299 bits (765), Expect = 2e-78
 Identities = 196/517 (37%), Positives = 279/517 (53%), Gaps = 44/517 (8%)
 Frame = +3

Query: 9    KHAYKKKRVIKKPTKDESKVDEAEFLQVGYSAVKEAIGINNTDIMLLESYTVYSQSKEKA 188
            K  Y+KKR  KK T+DE+  DEA   Q+ +SAVK+A GI+  D+M+LES+ VYS SKE+ 
Sbjct: 182  KQIYRKKRTTKKSTRDETGSDEASLQQLAFSAVKKAAGISQADLMVLESHVVYSLSKERT 241

Query: 189  ASRFYIMQCSQSINQEVIQVPLKNVIESLQGPLVKKSSSSWTITPVVAYFHVLPYSEIIS 368
            A RFYIMQC+QSIN+++ Q+P+   I+SLQGPLVK SS SWT+T VV YFHVLPY  I+S
Sbjct: 242  ACRFYIMQCTQSINEDISQIPINEAIDSLQGPLVKNSSCSWTVTSVVEYFHVLPYKGILS 301

Query: 369  KWISREAFSNSLQNSRVTEKNIMVDSPEVT---------------ESYVTRDMFTGLDSK 503
             W+SR  FSNSLQ+ RV   N  ++S + T                S   R    G ++ 
Sbjct: 302  DWLSRGVFSNSLQDLRVGLGNEKLNSTQRTAEPLDAEVKRNSNESHSNCGRVDVLGNENM 361

Query: 504  PSKDTIELLEQKENKGSCT---------LRLSDSI-KEPREMDVNKPSMFPSKNKEKCQN 653
             + +   +  Q E+              L  +D +  +    D   P    S N+ + ++
Sbjct: 362  NADNPCMVCPQNEDDAEVNKKRVGSDRYLSTADVLGNKSSGTDTESPRWKYSNNESRSKS 421

Query: 654  IANTVQVGEDQEENNPSVQYNSNGXXXXXXXXXXXXTRMLITEGEINNI-----ASC--H 812
            + N+V+V   Q+E       + N             T  +  EG  N++      SC   
Sbjct: 422  LPNSVEVNLHQKEMTLFTARDLNA----------AATGAMAKEGIANSVMTPCTTSCRGE 471

Query: 813  KIRANGPNSSY---KKDTI---DVCTQSAKHSNSDTEKLQILLDSK-KILSRTALTALIR 971
            K+   G   ++    +D +   D    + + ++   +KLQI + SK K+LS+TAL  L+R
Sbjct: 472  KVANGGEICNFIMPDQDGMLIEDRALVTYESNSEHLDKLQITIASKEKLLSQTALKVLLR 531

Query: 972  KRNELALQQRKIEEEIAICDKKIQRLSKGDVEDNFELKIESIVDGCNDIWVSNQERM--- 1142
            KR+ L+ QQRK+E+EIA CDK IQ +  G  ED+  LKIESI++ CND     ++R    
Sbjct: 532  KRDRLSHQQRKLEDEIAQCDKNIQTILDGG-EDDLALKIESILEFCNDACPQTRDRTYRH 590

Query: 1143 CGQQSFP--LEKKKLSEAVFIRLSPCQELDGVCHENNWVLPTYHLSQSDGGFQANVTVNG 1316
               Q  P  +++K+LSEA+      CQELDG+C+ENNW+LPTY +S  DG  +  V+V G
Sbjct: 591  LEDQESPQHIKRKRLSEAILNIQKSCQELDGICYENNWILPTYRVSLLDGKSEGTVSVKG 650

Query: 1317 VGFRCSLAGSVCXXXXXXXXXXXXMMLTNLRSMAKVA 1427
            V F  S+ G  C             ML  L+SMA  A
Sbjct: 651  VDFEISVVGEPCDTPREARESAAAQMLAKLQSMATAA 687


>ref|XP_002533047.1| hypothetical protein RCOM_0068670 [Ricinus communis]
            gi|223527166|gb|EEF29337.1| hypothetical protein
            RCOM_0068670 [Ricinus communis]
          Length = 624

 Score =  241 bits (616), Expect = 3e-61
 Identities = 162/469 (34%), Positives = 244/469 (52%), Gaps = 22/469 (4%)
 Frame = +3

Query: 3    ETKHAYKKKRVIKKPTKDESKVDEAEFLQVGYSAVKEAIGINNTDIMLLESYTVYSQSKE 182
            + +H  KKKR I+KP  + S  DEA+  Q+ +SAVKE  GIN  D+++LE + VYS SKE
Sbjct: 188  DPEHVTKKKRFIRKPLNNVSVADEADLQQLAFSAVKEVTGINQNDLLILERHDVYSTSKE 247

Query: 183  KAASRFYIMQCSQSINQEVIQVPLKNVIESLQGPLVKKSSSSWTITPVVAYFHVLPYSEI 362
            KAA+ FYIMQ +QS N  + + P+++ + SLQGPL  +SSS W  T VV YFH+LPY+ I
Sbjct: 248  KAAACFYIMQYAQSNNNNITKTPIEDALNSLQGPLFVRSSSRWIHTSVVEYFHLLPYAGI 307

Query: 363  ISKWISREAFSNSLQNSRVTEKNIMVDS---------PEVTESYVTRDMFTGLDSKPSKD 515
            +S W++R+  S+SLQ      + I V+S         PE  +S    D+ +   S  +K 
Sbjct: 308  LSDWLARK--SSSLQVQNPGSETINVNSSKRIERPCIPEAPKSSHDGDLGSKKGSGSAK- 364

Query: 516  TIELLEQKENKGSCTLRLSDSIKEPREMDVNKPSMFPSKNKEKCQNIANTVQVGEDQEEN 695
                  Q+E+ G   + L+D I  PR+M+V+   +  ++ K+K  N+ + V+    Q++ 
Sbjct: 365  ------QREDDGFYAVDLTDDIDGPRKMEVDDSFVAHAETKDKVTNVVSKVKPQNCQKKT 418

Query: 696  NPSVQYNSNGXXXXXXXXXXXXTRMLITEGEINNIASCHKIRANGPNSSYKKDTIDVCTQ 875
              S+  +SNG                + +  + +I    KI  + P +S  KD     + 
Sbjct: 419  --SLDGSSNGS---------------VDKANMADILKRQKISRDEPAASRNKDLKGTSSD 461

Query: 876  SA------------KHSNSDTEKLQILLDSK-KILSRTALTALIRKRNELALQQRKIEEE 1016
                          + +++D +KL+ ++ SK + LS+ AL  ++ KR +L LQ R IE++
Sbjct: 462  QDGIPRNGHAIVKDRSNSNDLDKLRTVIASKDQELSQAALQVVLSKRAKLCLQLRDIEDQ 521

Query: 1017 IAICDKKIQRLSKGDVEDNFELKIESIVDGCNDIWVSNQERMCGQQSFPLEKKKLSEAVF 1196
            IA CDK IQ +  G  E +  LKIES+++GCND                           
Sbjct: 522  IAQCDKNIQTILNGG-EGDLALKIESLLEGCND--------------------------- 553

Query: 1197 IRLSPCQELDGVCHENNWVLPTYHLSQSDGGFQANVTVNGVGFRCSLAG 1343
                   ELD +C + NW+LPTY +S  DGGFQANV V G     S  G
Sbjct: 554  -------ELDDLCRQKNWILPTYQVSAPDGGFQANVIVKGKDCEYSTGG 595


>gb|AAF80120.1|AC024174_2 Contains similarity to an unknown protein T11A7.7 gi|2335096 from
            Arabidopsis thaliana BAC T11A7 gb|AC002339 and contains a
            tropomyosin PF|00261 domain. ESTs gb|AI995205, gb|N37925,
            gb|F13889, gb|AV523107, gb|AV535948, gb|AV558461,
            gb|F13888 come from this gene [Arabidopsis thaliana]
          Length = 1628

 Score =  200 bits (508), Expect = 1e-48
 Identities = 147/462 (31%), Positives = 235/462 (50%), Gaps = 22/462 (4%)
 Frame = +3

Query: 24   KKRVIKKPTKDESKVDEAEFLQVGYSAVKEAIGINNTDIMLLESYTVYSQSKEKAASRFY 203
            +K + K+    E++ +E  F +V ++ VKEA G+N+ DI++LE + V S S+EK A RFY
Sbjct: 1170 EKPIEKEKAARENQKEEGVFQKVAFAVVKEATGVNHKDIVILERHLVCSLSEEKTAVRFY 1229

Query: 204  IMQCSQSINQEVIQVPLKNVIESLQGPLVKKSSSSWTITPVVAYFHVLPYSEIISKWISR 383
            IM+C+ S ++   + P++ V+  +QGPL +KS S WT+  +V YFHVLPY+ +I  W SR
Sbjct: 1230 IMKCT-SQDKFSGENPVEEVLSCMQGPLFEKSFSDWTMNSIVEYFHVLPYATLIEDWFSR 1288

Query: 384  ------------EAFSNSLQNSRV--TEKNIMVDSPEVTESYVTRDMFTGLDSKPSKDTI 521
                        EA  + +++++V  T+++ + D  E  E    +  +       +K   
Sbjct: 1289 RGDTEFVIEKEPEAVCDDIESNKVDATKESEVSDIFERREKAALKRRY----EIKAKKVA 1344

Query: 522  ELLEQKENKGSCTLRLSD-----SIKEPREMDVNKPSMFPSKNKEKCQNIANTVQVGEDQ 686
             LL     +G  T RL +     S+   +E +V+  ++   K K    N+ N +   +D 
Sbjct: 1345 ALLSHPGARGKATTRLQNRYLKGSMSGAKEPNVHSETVVALKAK----NVGNEMSPCKDN 1400

Query: 687  EENNPSVQYNSNGXXXXXXXXXXXXTRMLITEGEINNIASCHKIRANGPNSSYKKDTIDV 866
              N         G              +   +   + + S HK+ +              
Sbjct: 1401 YSNG-----EKGGFEVASDPKELKERGLQRKKAVPDRLNSIHKLNST------------- 1442

Query: 867  CTQSAKHSNSDTEKLQILLDSKKI-LSRTALTALIRKRNELALQQRKIEEEIAICDKKIQ 1043
               SA +SN + E+LQ  L SK   LS TAL  L+ KR++L  QQR IE+EIA CDK IQ
Sbjct: 1443 -PASAHNSNPNLEELQTSLLSKATSLSETALKVLLCKRDKLTRQQRNIEDEIAKCDKCIQ 1501

Query: 1044 RLSKGDVEDNFELKIESIVDGCNDIWVSN--QERMCGQQSFPLEKKKLSEAVFIRLSPCQ 1217
             + KGD    +EL++E++++ CN+ +     QE +        ++ KLSE +    S CQ
Sbjct: 1502 NI-KGD----WELQLETVLECCNETYPRRNLQESLDKSACQSNKRLKLSETLPSTKSLCQ 1556

Query: 1218 ELDGVCHENNWVLPTYHLSQSDGGFQANVTVNGVGFRCSLAG 1343
             LD +C  NNWVLP Y ++ SDGG++A V + G    C++ G
Sbjct: 1557 RLDDICLMNNWVLPNYRVAPSDGGYEAEVRITGNHVACTIHG 1598


Top