BLASTX nr result

ID: Astragalus22_contig00005121 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00005121
         (1228 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_012570569.1| PREDICTED: titin [Cicer arietinum] >gi|82830...   135   1e-30
dbj|GAU34054.1| hypothetical protein TSUD_16420 [Trifolium subte...    80   2e-12
gb|PNY07853.1| hypothetical protein L195_g004359 [Trifolium prat...    80   3e-12
ref|XP_003530323.1| PREDICTED: plectin-like [Glycine max] >gi|73...    69   1e-08
gb|KRG89177.1| hypothetical protein GLYMA_20G006500 [Glycine max]      67   3e-08
ref|XP_003556620.1| PREDICTED: plectin-like [Glycine max] >gi|94...    67   3e-08
ref|XP_016180080.1| proton pump-interactor BIP103 isoform X2 [Ar...    62   2e-06
ref|XP_016180077.1| immunoglobulin A1 protease autotransporter i...    62   2e-06

>ref|XP_012570569.1| PREDICTED: titin [Cicer arietinum]
 ref|XP_012570570.1| PREDICTED: titin [Cicer arietinum]
 ref|XP_012570571.1| PREDICTED: titin [Cicer arietinum]
          Length = 1114

 Score =  135 bits (339), Expect = 1e-30
 Identities = 134/341 (39%), Positives = 158/341 (46%), Gaps = 59/341 (17%)
 Frame = -2

Query: 966 DHHKIKGDHCNGDAHAKEKXXXXXXXXXXXXVADSGASDPIVTVDGNGTASEDRKVDGES 787
           DHHKI  +H NGD     K            V+DS  SDPIVTVDGN  A ED KV+ ES
Sbjct: 21  DHHKIITEHRNGDV----KEISENGNVGGEVVSDSSTSDPIVTVDGNDAAVEDHKVEDES 76

Query: 786 IVKVTEGARDCKVVESNNEENDCVDAPDLSNGSAEGNCETTTFDVVEREGESGIIIDGQN 607
                   R+C+VV  +N  N         N SA   CET T DVVE+EGE       QN
Sbjct: 77  -------QRECEVVNDDNNSN------IKENDSA---CETKTVDVVEKEGEI-----CQN 115

Query: 606 GSGSAVVSVDENENDVHVIDNNTVVEGGDEFRTVQNGVSD---SEIRDEVMVG----ASD 448
           GSGS V       NDVHV D  TV E GDEF +VQNGVSD   +EIR+ V V        
Sbjct: 116 GSGSDV-------NDVHVSD--TVAEVGDEFASVQNGVSDKESNEIREGVKVDDDRELES 166

Query: 447 VQNDVVAENEIC--DDVNKTIE-----XXXXXXXXXXXDWEVVAVENGVV-ES------- 313
           V+N VV+ENEIC   DV+K+                    E V V+NGVV ES       
Sbjct: 167 VENGVVSENEICVAADVDKSDREYEGVENGAVDRDEEVKLESVDVQNGVVLESEICVDAD 226

Query: 312 --------------AXXXXXXXXXXXXDASESVDSNVQE-------NHEKLESEEI---- 208
                                      DA ESVDS+V E       +HE +ES+ +    
Sbjct: 227 VVRDEKDKEIEVPVVVEEEVTTAAAATDAVESVDSDVVEGSESKSKDHEIVESKNVDGVD 286

Query: 207 ---VEKNEIPV---------DVKECADEDSKHGLEKVPEEN 121
               EKNEI V         DVKECA ED+++GLE    E+
Sbjct: 287 VVSDEKNEIAVDVDGVCDDADVKECAVEDTQNGLENAVVES 327


>dbj|GAU34054.1| hypothetical protein TSUD_16420 [Trifolium subterraneum]
          Length = 763

 Score = 80.5 bits (197), Expect = 2e-12
 Identities = 85/241 (35%), Positives = 102/241 (42%), Gaps = 24/241 (9%)
 Frame = -2

Query: 966 DHHKIKGDHCNGDAHAKEKXXXXXXXXXXXXVADSGASDPIVTVDGNGTAS--------- 814
           DHHKI  +HCNG      K            V+D   SDPIVTVD N   +         
Sbjct: 24  DHHKIVTEHCNG------KEISVNGNVGGEVVSDIVDSDPIVTVDVNSNGNSNGNSNGVV 77

Query: 813 --EDRKVDGESIVKVTEGARDCKVVESNNEENDCVDAPDLSNGSAEGNCETTTFDVVERE 640
             ED KV+GES VKV                         SNGS +   ETT  DVVERE
Sbjct: 78  DLEDHKVEGESEVKVISN----------------------SNGSVD---ETTKVDVVERE 112

Query: 639 GESGIIIDGQNGSGS---AVVSVDENENDVH--VIDNN---TVVEGGDEFRTVQNGVSDS 484
           GE       QNGSGS    V+  D+   +VH   +DN      VE  +    ++  VSDS
Sbjct: 113 GEI-----YQNGSGSDDNHVLVTDDTVAEVHNGALDNELSACAVESSE--NEIRGEVSDS 165

Query: 483 EIRDEVMVGASD-----VQNDVVAENEICDDVNKTIEXXXXXXXXXXXDWEVVAVENGVV 319
            + + V     D     V+N VV ENEICD V                + E V VENGVV
Sbjct: 166 VVEEGVKDRDQDRELESVENGVVEENEICDGV--------ADVDNSDPELEGVVVENGVV 217

Query: 318 E 316
           +
Sbjct: 218 D 218


>gb|PNY07853.1| hypothetical protein L195_g004359 [Trifolium pratense]
          Length = 1261

 Score = 80.1 bits (196), Expect = 3e-12
 Identities = 101/303 (33%), Positives = 120/303 (39%), Gaps = 50/303 (16%)
 Frame = -2

Query: 966 DHHKIKGDHCNGDAHAKEKXXXXXXXXXXXXVADSGASDPIVTVD----GNGTA---SED 808
           DHHKI  +HCNG      K            V+D   SDPIVTVD    GNG       +
Sbjct: 24  DHHKIVTEHCNG------KEISVNGNVGGEVVSDIVDSDPIVTVDVNSNGNGVVVLEDHN 77

Query: 807 RKVDGESIVKVTEGARDCKVVESNNEENDCVDAPDLSNGSAEGNCETTTFDVVEREGESG 628
            KV+GES V V                         SNGS +   +TT  DVVEREGE  
Sbjct: 78  HKVEGESEVTVISN----------------------SNGSVD---KTTIVDVVEREGEI- 111

Query: 627 IIIDGQNGSGSAVVSVDENENDVHVIDNNTVVEGGDEFRTVQNGV------------SDS 484
                QNGSGS V       N VHV D NTVVE       VQNG             S+ 
Sbjct: 112 ----YQNGSGSDV-------NHVHVTD-NTVVE-------VQNGALDKESSECAVESSED 152

Query: 483 EIRDEVMVGA------------SDVQNDVVAENEICD---DVNKTIEXXXXXXXXXXXDW 349
           EIRDEV                   +N VV ENEICD   DV+K+             + 
Sbjct: 153 EIRDEVSDSVVEEGVKVQDRELESFENGVVEENEICDVVADVDKS-----------DREL 201

Query: 348 EVVAVENGVVESAXXXXXXXXXXXXDASESV-----------DSNV-----QENHEKLES 217
           E   VENGVV++                E V           DS+V      ++HE +ES
Sbjct: 202 EGGVVENGVVDTGGGDVEEIEVPVPVVVEEVAAASTEAVEAGDSDVVGTAESKDHESVES 261

Query: 216 EEI 208
           E++
Sbjct: 262 EKV 264


>ref|XP_003530323.1| PREDICTED: plectin-like [Glycine max]
 gb|KHN19392.1| S-antigen protein [Glycine soja]
 gb|KRH49503.1| hypothetical protein GLYMA_07G159600 [Glycine max]
          Length = 1296

 Score = 68.9 bits (167), Expect = 1e-08
 Identities = 83/297 (27%), Positives = 120/297 (40%), Gaps = 56/297 (18%)
 Frame = -2

Query: 723 DCVDAPDLSNGSAEGNCETTTFDVVEREGESGIIIDGQNGSGSAVVSVDENEN------- 565
           +C DA  +SNG+AE   ET T DVV RE E     D QNG GS    V +N++       
Sbjct: 51  NCNDA--VSNGTAEEGTETATVDVVSREDELKSA-DSQNGMGSVQNGVVDNDDKSANAVA 107

Query: 564 -------DVHVIDNNTVVEGGDEFRTVQNGVSDSEIRDEVMVGASDVQNDVVAENE--IC 412
                  + +V+  ++ V+ GD+     NGV + E+ D     AS  +N VV E E  +C
Sbjct: 108 EELVTDHEEYVVVGDSDVQNGDD--VTANGVEECEMLDGAE--ASGDENGVVVEGEEDVC 163

Query: 411 DDVNKTIEXXXXXXXXXXXDWEVVAVENGVVESAXXXXXXXXXXXXDASESVDSNVQENH 232
              ++  E             E     N V   +            ++   V ++V +  
Sbjct: 164 QS-DREFECVDVHDDVTATTDENGGNGNDVQGRSESVSDKDVNKRGESENVVSADVSDEK 222

Query: 231 EKL-----ESEEIVEKNEIPV---------DVKECADEDSKHGLEK-------------- 136
           + +     + EE+VEKNE+PV         DVKEC  ED+++ LEK              
Sbjct: 223 DIVTDGDHDVEEVVEKNEVPVVVDGGSASTDVKECEPEDAQNSLEKGQVESVSGLAEPVL 282

Query: 135 ----VPEENEIPDDXXXXXXXXXXXXXXXXEVIPEGEKL--------SDKVVDGDVE 1
                 EENEI  +                E++PEGE L        SD  V+ D E
Sbjct: 283 EPSECTEENEIAVEGEPGSKLERSEEEAGSEIVPEGEILTALSCTDVSDIAVESDGE 339


>gb|KRG89177.1| hypothetical protein GLYMA_20G006500 [Glycine max]
          Length = 1498

 Score = 67.4 bits (163), Expect = 3e-08
 Identities = 92/353 (26%), Positives = 137/353 (38%), Gaps = 70/353 (19%)
 Frame = -2

Query: 852  DPIVTVDG--NGTASEDRKVDGESIVKVTEGARDCKVVESNN----EENDCVDAPDLSNG 691
            + +V V G  + + ++ R V    +     G  D  +   NN     E  C DA  +SNG
Sbjct: 4    EQVVNVRGEVSDSVTDHRSVKANGVAHGVGG--DLDLAADNNGAALSEKICKDA--VSNG 59

Query: 690  SAEGNCETTTFDVVEREGESGIIIDGQNGSGSAVVSVDENEN--------------DVHV 553
            +AE   ET   +VV R+ E     DGQN +GS    V EN++              D +V
Sbjct: 60   TAEEVSETAMVNVVSRDDELKCA-DGQNDTGSVQNGVVENDDKSANAVAEELVTDHDEYV 118

Query: 552  IDNNTVVEGGDEFRTVQNGVSDSEIRDEVMVGASDVQNDVVAENEICDDVNKTIEXXXXX 373
            +  ++ V+ GD+     NGV + E+ D    G+ D    VV+  E   DVN +       
Sbjct: 119  VVGDSDVQNGDDVNANANGVEECEMLDGA-EGSGDENGVVVSAVEGDADVNHSDREFECV 177

Query: 372  XXXXXXDWEVVAVENGVVESAXXXXXXXXXXXXDASESV-DSNVQENHEKL--------- 223
                    E V  E   V +               SESV D +V ++ E +         
Sbjct: 178  DVHNDVAVETVEEE---VTATTDQNVGNGNDVQGRSESVSDEDVDKSGESVNVVSADVLD 234

Query: 222  ----------ESEEIVEKNEIPV---------DVKECADEDSKHGLEK------------ 136
                      ++EE++EKNEI V         D+K+C  ED+++  EK            
Sbjct: 235  EKDIVTDGDHDAEEVLEKNEILVDADGVSATTDLKQCEPEDARNSSEKGQVESVSGLAKP 294

Query: 135  ----VPEENEIPDDXXXXXXXXXXXXXXXXEVIPEGEKL-----SDKVVDGDV 4
                  EENEI  +                E++P+GE L     +D   DGDV
Sbjct: 295  EPSECTEENEIAVEGEPGSKLERSEEEAGSEIVPQGENLTALNSTDVTGDGDV 347


>ref|XP_003556620.1| PREDICTED: plectin-like [Glycine max]
 gb|KRG89178.1| hypothetical protein GLYMA_20G006500 [Glycine max]
          Length = 1501

 Score = 67.4 bits (163), Expect = 3e-08
 Identities = 92/353 (26%), Positives = 137/353 (38%), Gaps = 70/353 (19%)
 Frame = -2

Query: 852  DPIVTVDG--NGTASEDRKVDGESIVKVTEGARDCKVVESNN----EENDCVDAPDLSNG 691
            + +V V G  + + ++ R V    +     G  D  +   NN     E  C DA  +SNG
Sbjct: 4    EQVVNVRGEVSDSVTDHRSVKANGVAHGVGG--DLDLAADNNGAALSEKICKDA--VSNG 59

Query: 690  SAEGNCETTTFDVVEREGESGIIIDGQNGSGSAVVSVDENEN--------------DVHV 553
            +AE   ET   +VV R+ E     DGQN +GS    V EN++              D +V
Sbjct: 60   TAEEVSETAMVNVVSRDDELKCA-DGQNDTGSVQNGVVENDDKSANAVAEELVTDHDEYV 118

Query: 552  IDNNTVVEGGDEFRTVQNGVSDSEIRDEVMVGASDVQNDVVAENEICDDVNKTIEXXXXX 373
            +  ++ V+ GD+     NGV + E+ D    G+ D    VV+  E   DVN +       
Sbjct: 119  VVGDSDVQNGDDVNANANGVEECEMLDGA-EGSGDENGVVVSAVEGDADVNHSDREFECV 177

Query: 372  XXXXXXDWEVVAVENGVVESAXXXXXXXXXXXXDASESV-DSNVQENHEKL--------- 223
                    E V  E   V +               SESV D +V ++ E +         
Sbjct: 178  DVHNDVAVETVEEE---VTATTDQNVGNGNDVQGRSESVSDEDVDKSGESVNVVSADVLD 234

Query: 222  ----------ESEEIVEKNEIPV---------DVKECADEDSKHGLEK------------ 136
                      ++EE++EKNEI V         D+K+C  ED+++  EK            
Sbjct: 235  EKDIVTDGDHDAEEVLEKNEILVDADGVSATTDLKQCEPEDARNSSEKGQVESVSGLAKP 294

Query: 135  ----VPEENEIPDDXXXXXXXXXXXXXXXXEVIPEGEKL-----SDKVVDGDV 4
                  EENEI  +                E++P+GE L     +D   DGDV
Sbjct: 295  EPSECTEENEIAVEGEPGSKLERSEEEAGSEIVPQGENLTALNSTDVTGDGDV 347


>ref|XP_016180080.1| proton pump-interactor BIP103 isoform X2 [Arachis ipaensis]
          Length = 1101

 Score = 62.0 bits (149), Expect = 2e-06
 Identities = 93/337 (27%), Positives = 131/337 (38%), Gaps = 50/337 (14%)
 Frame = -2

Query: 966 DHHKIKGDHCNGDA---HAKEKXXXXXXXXXXXXVADSGASDPIVTVDGNGTASEDRKVD 796
           D H IK  +CNGDA   +                  DSGASD IV V+       + K  
Sbjct: 21  DRHNIKVKYCNGDAAHENGSVAGDGDGAVVSGNGGGDSGASDLIVVVEDK----VEGKAL 76

Query: 795 GESIVKVTEGAR----DCKVVESNNEENDCVDAPDLSNGSAE----GNCETTTFDVVE-- 646
           GES V V+E +     +C+   +  EEND  +  ++SNG+      G  +  +  V+E  
Sbjct: 77  GESDVTVSEASPVAECECEEAVNKEEENDLEEKTEISNGTIPVGWGGQNDEGSVVVLEDN 136

Query: 645 -REGESGIIIDG----QNGSGSAVVS-VDE-NENDVHVIDNNTVVEGGDEF--------- 514
            +E    I  D     QNG+ S +   VDE N  D + I  N  + G ++          
Sbjct: 137 AKEVNETITCDHELAVQNGAKSEIRDGVDEVNGADANGIQQNGEIHGDEKKEPVTVVEVD 196

Query: 513 RTVQNGVSDSEIRDEVMVGASDVQNDVV-AENEICDDVNKTIEXXXXXXXXXXXDWEVV- 340
           RT  N  S SE+          V+N VV  E  +  DVN+  +             E V 
Sbjct: 197 RTDGNNGSHSELESVA------VENCVVNKEVSVTMDVNEFADKDGESKSAEKAQLEAVD 250

Query: 339 ----------AVENGVVESAXXXXXXXXXXXXDASESVDSNVQENHEKLESEEIVEKNEI 190
                     +V  G  ES             + ++   +NV    ++  S E  EK+EI
Sbjct: 251 SGGGIEEGGGSVLEGTTESTSDAVSDEKAVAQEVTDREFTNVVNGDDQNGSAE-TEKDEI 309

Query: 189 P---------VDVKECADEDSKHGLEKVPEENEIPDD 106
           P         VDVKECA ED+  G +    E E   D
Sbjct: 310 PIGIDGVHVSVDVKECAGEDAHTGSDVEKSEAEAVTD 346


>ref|XP_016180077.1| immunoglobulin A1 protease autotransporter isoform X1 [Arachis
           ipaensis]
          Length = 1278

 Score = 62.0 bits (149), Expect = 2e-06
 Identities = 93/337 (27%), Positives = 131/337 (38%), Gaps = 50/337 (14%)
 Frame = -2

Query: 966 DHHKIKGDHCNGDA---HAKEKXXXXXXXXXXXXVADSGASDPIVTVDGNGTASEDRKVD 796
           D H IK  +CNGDA   +                  DSGASD IV V+       + K  
Sbjct: 21  DRHNIKVKYCNGDAAHENGSVAGDGDGAVVSGNGGGDSGASDLIVVVEDK----VEGKAL 76

Query: 795 GESIVKVTEGAR----DCKVVESNNEENDCVDAPDLSNGSAE----GNCETTTFDVVE-- 646
           GES V V+E +     +C+   +  EEND  +  ++SNG+      G  +  +  V+E  
Sbjct: 77  GESDVTVSEASPVAECECEEAVNKEEENDLEEKTEISNGTIPVGWGGQNDEGSVVVLEDN 136

Query: 645 -REGESGIIIDG----QNGSGSAVVS-VDE-NENDVHVIDNNTVVEGGDEF--------- 514
            +E    I  D     QNG+ S +   VDE N  D + I  N  + G ++          
Sbjct: 137 AKEVNETITCDHELAVQNGAKSEIRDGVDEVNGADANGIQQNGEIHGDEKKEPVTVVEVD 196

Query: 513 RTVQNGVSDSEIRDEVMVGASDVQNDVV-AENEICDDVNKTIEXXXXXXXXXXXDWEVV- 340
           RT  N  S SE+          V+N VV  E  +  DVN+  +             E V 
Sbjct: 197 RTDGNNGSHSELESVA------VENCVVNKEVSVTMDVNEFADKDGESKSAEKAQLEAVD 250

Query: 339 ----------AVENGVVESAXXXXXXXXXXXXDASESVDSNVQENHEKLESEEIVEKNEI 190
                     +V  G  ES             + ++   +NV    ++  S E  EK+EI
Sbjct: 251 SGGGIEEGGGSVLEGTTESTSDAVSDEKAVAQEVTDREFTNVVNGDDQNGSAE-TEKDEI 309

Query: 189 P---------VDVKECADEDSKHGLEKVPEENEIPDD 106
           P         VDVKECA ED+  G +    E E   D
Sbjct: 310 PIGIDGVHVSVDVKECAGEDAHTGSDVEKSEAEAVTD 346

