BLASTX nr result

ID: Glycyrrhiza23_contig00014205 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00014205
         (1558 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003537798.1| PREDICTED: uncharacterized protein LOC100798...   398   e-108
ref|XP_003540800.1| PREDICTED: uncharacterized protein LOC100820...   394   e-107
ref|XP_002275899.1| PREDICTED: uncharacterized protein LOC100262...   279   1e-72
emb|CAN77162.1| hypothetical protein VITISV_029831 [Vitis vinifera]   275   3e-71
ref|XP_004147022.1| PREDICTED: uncharacterized protein LOC101214...   268   3e-69

>ref|XP_003537798.1| PREDICTED: uncharacterized protein LOC100798129 [Glycine max]
          Length = 377

 Score =  398 bits (1023), Expect = e-108
 Identities = 241/404 (59%), Positives = 264/404 (65%), Gaps = 19/404 (4%)
 Frame = +2

Query: 95   MAATVS-AWSKPGAWALDSEEHEAELLXXXXXXXXXXDTKPLADFPSLXXXXXXXXXXXX 271
            MAATVS AWSKPGAWALDSEEHEAELL          + KPLADFPSL            
Sbjct: 1    MAATVSSAWSKPGAWALDSEEHEAELLQQN-------NDKPLADFPSLAAAAAKPKKKKA 53

Query: 272  XQTLSLAEFNAKPDSSFTNPDPVDLPTGPRERTAEELDRDRNRLGGGFRSYGDRPNRNS- 448
             QT SLAEF AKPD+SF + DPV LPTGPR+RTAEELDR   RLGGGFR+YGDRPNRN+ 
Sbjct: 54   -QTYSLAEFTAKPDTSFADQDPVVLPTGPRQRTAEELDR--TRLGGGFRNYGDRPNRNNS 110

Query: 449  -GGDEXXXXXXXXXXXXD---RNGFGSRDRDSNRDLAPSRADEIDNWAAMKKSSTASXXX 616
             GGDE            D   RNGFG+RD  SNR+L PSRADE DNWAA KK S      
Sbjct: 111  GGGDESSNSRWGSSRVSDEPRRNGFGARD--SNRELPPSRADETDNWAASKKPSGGGFER 168

Query: 617  XXXXXXXXXXXXXXSQSKADESDSWVTNKSFVPXXXXXXXXXXXXXXXXXXXXXKVGFGT 796
                          SQS+ADESDSWV+NKSFVP                      VGFG+
Sbjct: 169  RERDKGGFFD----SQSRADESDSWVSNKSFVPSEGRRFSSNGGGERRV------VGFGS 218

Query: 797  SGGADSDNWNKKKGEFSVVGSERTTTESVGGRPRLNLQPRSALSVSNENNDVAKPKGTNP 976
            SGGADSDNWN KK   S +GS  +    VGGRP+L LQPR+ LSVSNE ++V KPKG NP
Sbjct: 219  SGGADSDNWNNKKKSESNIGSSESV--GVGGRPKLVLQPRT-LSVSNEGDNVGKPKGVNP 275

Query: 977  FGEARPREQVLAEKGQDWKKIDEQLESMKIKETGPVVDGGFGKRAFGS----GNGRASLP 1144
            FGEARPREQVLAEKGQDWKKIDEQLES+KIKET      GFGKR FGS    G GRA LP
Sbjct: 276  FGEARPREQVLAEKGQDWKKIDEQLESVKIKETSGGGGDGFGKRGFGSSNGGGGGRAILP 335

Query: 1145 EDRTERTWRKPLSESDDGRPQSAEKVEDE---------QHVEEN 1249
            E RTER+WRKP  +SDD RP+SAEKVE+E         +HVEEN
Sbjct: 336  ESRTERSWRKP--QSDDDRPKSAEKVENEPDQKKEVEDEHVEEN 377


>ref|XP_003540800.1| PREDICTED: uncharacterized protein LOC100820014 [Glycine max]
          Length = 380

 Score =  394 bits (1011), Expect = e-107
 Identities = 243/406 (59%), Positives = 265/406 (65%), Gaps = 21/406 (5%)
 Frame = +2

Query: 95   MAATVS-AWSKPGAWALDSEEHEAELLXXXXXXXXXXDTKPLADFPSLXXXXXXXXXXXX 271
            MAATVS AWSKPGAWALDSEEHEAELL          + KPLADFPSL            
Sbjct: 1    MAATVSSAWSKPGAWALDSEEHEAELLQQNNNNP---NDKPLADFPSLAAAAATKPKKKK 57

Query: 272  XQTLSLAEFNAKPDSSFTNPDPVDLPTGPRERTAEELDRDRNRLGGGFRSYGDRPNRN-- 445
             QT SLAEF AKPDS+F + DPV LPTGPR+RTAEELDR   RLGGGFR+YGDRPNRN  
Sbjct: 58   AQTYSLAEFTAKPDSAFADQDPVVLPTGPRQRTAEELDR--TRLGGGFRNYGDRPNRNNS 115

Query: 446  SGGDEXXXXXXXXXXXXD---RNGFGSRDRDSNRDLAPSRADEIDNWAAMKKSSTASXXX 616
            SGGDE            D   RNGFG+RD  SNR+L PSRADE DNWAA KK S      
Sbjct: 116  SGGDESSNSRWGSSRVSDEPRRNGFGARD--SNRELPPSRADETDNWAAAKKPSGG---- 169

Query: 617  XXXXXXXXXXXXXXSQSKADESDSWVTNKSFVPXXXXXXXXXXXXXXXXXXXXXKVGFGT 796
                          SQS+ADESDSWV+NKSFVP                      VGFG+
Sbjct: 170  -FERRERDKGGFFDSQSRADESDSWVSNKSFVPSEGRRFGSNGGGFERERRV---VGFGS 225

Query: 797  SGGADSDNWNKKKGEFSVVGSERTTTESVGGRPRLNLQPRSALSVSNEN---NDVAKPKG 967
            SGGADSDNWN KKGE S VGSE     SVGGRP+L LQPR+ +SVS+E    N+  KPKG
Sbjct: 226  SGGADSDNWNTKKGE-SNVGSE-----SVGGRPKLVLQPRT-VSVSDEGVDGNNAGKPKG 278

Query: 968  TNPFGEARPREQVLAEKGQDWKKIDEQLESMKIKETGPVVDGGFGKRAFGS---GNGRAS 1138
             NPFGEARPREQVLAEKGQDWKKIDEQLES+KIKE       GFGKR FGS   G GRA+
Sbjct: 279  VNPFGEARPREQVLAEKGQDWKKIDEQLESVKIKEASG--GDGFGKRGFGSSNGGGGRAT 336

Query: 1139 LPEDRTERTWRKPLSESDDGRPQSAEKVEDE---------QHVEEN 1249
            LPE RTER+WRKP  + DD RP+SAEKVEDE         +HVE+N
Sbjct: 337  LPESRTERSWRKP--QFDDDRPKSAEKVEDEPDQKKEVEDEHVEKN 380


>ref|XP_002275899.1| PREDICTED: uncharacterized protein LOC100262348 [Vitis vinifera]
          Length = 401

 Score =  279 bits (714), Expect = 1e-72
 Identities = 188/417 (45%), Positives = 230/417 (55%), Gaps = 38/417 (9%)
 Frame = +2

Query: 95   MAATVSAWSKPGAWALDSEEHEAELLXXXXXXXXXXD---------TKPLADFPSLXXXX 247
            MAATVS W K GAWALDSEEHE ELL          +          +  ADFP+L    
Sbjct: 1    MAATVSPWGKAGAWALDSEEHEDELLQQQRDDKVNGEFSGGEGRQAPEASADFPTLATAA 60

Query: 248  XXXXXXXXXQTLSLAEFNA---------KPDSSFTNPDPVDLPTGPRERTAEELDRDRNR 400
                     QTLSL+EF+A               T+ D + LPTGPR+R+AEELDR   R
Sbjct: 61   ATKSKKKKGQTLSLSEFSAFGAGKSAQPSQTKGLTHEDLMMLPTGPRQRSAEELDR--GR 118

Query: 401  LGGGFRSYGDRPN------RNSGGDEXXXXXXXXXXXXDRN--GFGSRDRDSNRDLAPSR 556
            LGGGFRSYG   +      R  GG++            +R   GFG   RDS+R+LAPSR
Sbjct: 119  LGGGFRSYGSNGSYEGGRSRYGGGEDSANPRWGPRGSEERRQGGFG---RDSSRELAPSR 175

Query: 557  ADEIDNWAAMKKSSTASXXXXXXXXXXXXXXXXXSQSKADESDSWVTNKSFVPXXXXXXX 736
            ADEID+W A KKS+  +                 SQS+ADES SWV+NKSF P       
Sbjct: 176  ADEIDDWGAAKKSTVGNGFERRDRGGFFD-----SQSRADESASWVSNKSFTPSEGRRFG 230

Query: 737  XXXXXXXXXXXXXXKVGFGTS----GGADSDNWNKKKGEFSVVGSERTTTESVGGRPRLN 904
                          + GF ++    GGADS++W +KK E S          S G RP+L 
Sbjct: 231  GGGGFESLRER---RGGFDSASDGGGGADSESWGRKKEEGS-----GNANGSAGSRPKLI 282

Query: 905  LQPRSALSVSNE---NNDVAKPKGTNPFGEARPREQVLAEKGQDWKKIDEQLESMKIKET 1075
            LQPR+      +   +  VAKPKG NPFGEARPRE+VLAEKGQDWK+I+E+LES+K+K+ 
Sbjct: 283  LQPRTVPVNDGQQPGSGSVAKPKGPNPFGEARPREEVLAEKGQDWKEIEEKLESVKLKDV 342

Query: 1076 GP----VVDG-GFGKRAFGSGNGRASLPEDRTERTWRKPLSESDDGRPQSAEKVEDE 1231
            G       DG  FGKR+FGSGN RASLPE R+E++WRKP  ES+D R   A K EDE
Sbjct: 343  GSPGVGQTDGPSFGKRSFGSGNARASLPESRSEKSWRKP--ESEDVR---AAKTEDE 394


>emb|CAN77162.1| hypothetical protein VITISV_029831 [Vitis vinifera]
          Length = 1434

 Score =  275 bits (702), Expect = 3e-71
 Identities = 181/402 (45%), Positives = 221/402 (54%), Gaps = 38/402 (9%)
 Frame = +2

Query: 95   MAATVSAWSKPGAWALDSEEHEAELLXXXXXXXXXXD---------TKPLADFPSLXXXX 247
            MAATVS W K GAWALDSEEHE ELL          +          +  ADFP+L    
Sbjct: 1    MAATVSPWGKAGAWALDSEEHEDELLQQQRDDKVNGEFSGGEGRQAPEASADFPTLATAA 60

Query: 248  XXXXXXXXXQTLSLAEFNA---------KPDSSFTNPDPVDLPTGPRERTAEELDRDRNR 400
                     QTLSL+EF+A               T+ D + LPTGPR+R+AEELDR   R
Sbjct: 61   ATKSKKKKGQTLSLSEFSAFGAGKSAQPSQTKGLTHEDLMMLPTGPRQRSAEELDR--GR 118

Query: 401  LGGGFRSYGDRPN------RNSGGDEXXXXXXXXXXXXDRN--GFGSRDRDSNRDLAPSR 556
            LGGGFRSYG   +      R  GG++            +R   GFG   RDS+R+LAPSR
Sbjct: 119  LGGGFRSYGSNGSYEGGRSRYGGGEDSANPRWGPRGSEERRQGGFG---RDSSRELAPSR 175

Query: 557  ADEIDNWAAMKKSSTASXXXXXXXXXXXXXXXXXSQSKADESDSWVTNKSFVPXXXXXXX 736
            ADEID+W A KKS+  +                 SQS+ADES SWV+NKSF P       
Sbjct: 176  ADEIDDWGAAKKSTVGNGFERRDRGGFFD-----SQSRADESASWVSNKSFTPSEGRRFG 230

Query: 737  XXXXXXXXXXXXXXKVGFGTS----GGADSDNWNKKKGEFSVVGSERTTTESVGGRPRLN 904
                          + GF ++    GGADS++W +KK E S          S G RP+L 
Sbjct: 231  GGGGFESLRER---RGGFDSASDGGGGADSESWGRKKEEGS-----GNANGSAGSRPKLI 282

Query: 905  LQPRSALSVSNE---NNDVAKPKGTNPFGEARPREQVLAEKGQDWKKIDEQLESMKIKET 1075
            LQPR+      +   +  VAKPKG NPFGEARPRE+VLAEKGQDWK+I+E+LES+K+K+ 
Sbjct: 283  LQPRTVPVNDGQQPGSGSVAKPKGPNPFGEARPREEVLAEKGQDWKEIEEKLESVKLKDV 342

Query: 1076 GP----VVDG-GFGKRAFGSGNGRASLPEDRTERTWRKPLSE 1186
            G       DG  FGKR+FGSGN RASLPE R E++WRKP SE
Sbjct: 343  GSPGVGQTDGPSFGKRSFGSGNARASLPESRXEKSWRKPESE 384


>ref|XP_004147022.1| PREDICTED: uncharacterized protein LOC101214573 [Cucumis sativus]
            gi|449489695|ref|XP_004158389.1| PREDICTED:
            uncharacterized LOC101214573 [Cucumis sativus]
          Length = 405

 Score =  268 bits (685), Expect = 3e-69
 Identities = 189/426 (44%), Positives = 235/426 (55%), Gaps = 41/426 (9%)
 Frame = +2

Query: 95   MAATVSAWSKPGAWALDSEEHEAELLXXXXXXXXXXDTKPLADFPSLXXXXXXXXXXXXX 274
            MAATVS W KPGAWALD+EEHEAELL          + +P ADFPSL             
Sbjct: 1    MAATVSPWGKPGAWALDAEEHEAELLKDQEEQSRHQE-EPSADFPSLAAAAATKPKKKKG 59

Query: 275  QTLSLAEFNA----KPDSSFTNP------DPVDLPTGPRERTAEELDRDRNRLGGGFRSY 424
            Q++ L+EF      KP +  ++P      D + LPTGPR+RTAEE+DR  NRLGGGF+S+
Sbjct: 60   QSIPLSEFQTYGGPKPSAQSSDPKGLTAEDLMMLPTGPRQRTAEEMDR--NRLGGGFKSW 117

Query: 425  G-----DRPNRNSGGDEXXXXXXXXXXXXD--RNGFGSRDRDSNRDLAPSRADEIDNWAA 583
            G     DR NR S  ++            +  R   GS DR+  R+  PSRADEID+W A
Sbjct: 118  GQNSLYDRGNRYSNSEDSPNSRRSSRVFDESRRTNDGS-DREFRRESLPSRADEIDDWGA 176

Query: 584  MKKSSTASXXXXXXXXXXXXXXXXXSQSKADESDSWVTNKSFVPXXXXXXXXXXXXXXXX 763
             KK    +                 S SKADESDSWV++KSF P                
Sbjct: 177  GKKPMVGNGFERRERGGGGGFFDSHS-SKADESDSWVSSKSFTPSEGRRSGGFDRER--- 232

Query: 764  XXXXXKVGFGTSGG-ADSDNWNKKK-GEFSVVGSERTTTES-------------VGGRPR 898
                 + GF TSGG ADSDNW +K  G    +G    + +S             +G RPR
Sbjct: 233  -----RGGFPTSGGGADSDNWGRKPDGARGGIGENGGSADSENWGKRSEGVRSGIGERPR 287

Query: 899  LNLQPRSALSVSNENNDVA----KPKGTNPFGEARPREQVLAEKGQDWKKIDEQLESMKI 1066
            LNLQPRS + ++N N + +    KPKG+NPFG ARPRE+VLAEKGQDWKKIDEQLES+KI
Sbjct: 288  LNLQPRS-IPLNNGNQEASGVAVKPKGSNPFGNARPREEVLAEKGQDWKKIDEQLESVKI 346

Query: 1067 KETGPVVDGGFG-----KRAFGSGNGRASLPEDRTERTWRKPLSESDDGRPQSAEKVEDE 1231
            K+T    +   G     K+ FG+ +GR+  P+  + RTWRKP  ES + RPQSAE VED 
Sbjct: 347  KDTVERAETSSGASFERKKGFGARSGRS--PD--SGRTWRKP--ESVESRPQSAELVEDG 400

Query: 1232 QHVEEN 1249
               EEN
Sbjct: 401  P-AEEN 405


Top