BLASTX nr result

ID: Dioscorea21_contig00003377

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00003377
         (1755 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002321060.1| predicted protein [Populus trichocarpa] gi|2...   154   5e-35
ref|XP_002874890.1| hypothetical protein ARALYDRAFT_490272 [Arab...   154   7e-35
ref|XP_003536443.1| PREDICTED: uncharacterized protein LOC100820...   149   3e-33
gb|AAD15341.1| hypothetical protein [Arabidopsis thaliana] gi|72...   149   3e-33
ref|XP_003520215.1| PREDICTED: uncharacterized protein LOC100789...   145   4e-32

>ref|XP_002321060.1| predicted protein [Populus trichocarpa] gi|222861833|gb|EEE99375.1|
            predicted protein [Populus trichocarpa]
          Length = 549

 Score =  154 bits (390), Expect = 5e-35
 Identities = 152/536 (28%), Positives = 230/536 (42%), Gaps = 36/536 (6%)
 Frame = -2

Query: 1640 MGFEDVLAALVEIFPQVDYCMLQAVASQHSEDVNAALDFILHDVLPSFP----ASWTPCT 1473
            MGF  V   L ++FPQVD  +L+AVA +HS+D + A + +L +V+PS      A   PC 
Sbjct: 1    MGFSTVYKCLTDVFPQVDARILKAVAIEHSKDADIAAEVVLSEVIPSLSRHSAAPSPPCE 60

Query: 1472 SNSSDWGHQGVEEAHAQWPSESGLLSSGIACNSDFQVNDSMTCMDKPVGRDAEYETKTNG 1293
              S      G  E       E+GL    ++     + ++     ++  G+       T+G
Sbjct: 61   DTSPSLPLDGQTEQE----EETGLRHRQVSLVKSVRSSEPGLIAEEDDGKTE----LTSG 112

Query: 1292 MGYPTNSTEANSNMHPKERFVEFGVCLPSDHTNDFK-----VNSMNETLAGPDLQEVSAC 1128
            +    ++ + N    P        + +PS    D       + +  E   G   ++VS  
Sbjct: 113  VNDGDSTHQENRQDQP--------IVVPSGANADTNQLQGHIETEQEEETGLRHRQVSLV 164

Query: 1127 ----SAGPTSIHENSAMQDKYT----DTATSHGAQDPFASILNKSIEHLKEFPSQREIEN 972
                S+ P  I E    + + T    D  ++H        ++  S  +      Q  IE+
Sbjct: 165  KSVRSSEPGLIAEEDDGKTELTGGVNDGDSTHQEIRQDQPVVVPSGANADTNQLQGHIES 224

Query: 971  DDNL---SSQGDIGSFDVNGQSSSIASVTDLFEG-------------IKYLEDLTIDAKN 840
            D+ +     Q   G        + I    DL  G             I+ LE++   AK+
Sbjct: 225  DELILLGKPQHQEGISQPGSSQTLILVSNDLLLGVNAENMNSKQYRQIELLEEIVEAAKD 284

Query: 839  SKVILTSAMESTINMEKEVELHEERAKQAKIEASLAGEDTLVHVXXXXXXXXXXXXRNDM 660
            +K  L SAMES +NM KEVEL E  A+QAK EA+  G D LV V             NDM
Sbjct: 285  NKKTLFSAMESVMNMMKEVELQEISAEQAKEEAARGGLDILVEVEKLKQMLVHAKEANDM 344

Query: 659  HAEEVYGERSMLAAEAQGLQSWLLSVSNERDKSLGIIDEIRNTLETRXXXXXXXXXXXXX 480
            HA EVYGE+++LA E + LQ+ LLS+S+ERD +L I+DE+R TLE+R             
Sbjct: 345  HAGEVYGEKAILATEVRELQARLLSLSDERDNALAILDEMRQTLESRLAAAEELRKTAEL 404

Query: 479  XXXXXXXXXXXXXADGRLFAASIFXXXXXXXXXXXXXXXXXXXLMNHGQIIDILRLEIAS 300
                         A+  +    +                    LM+ G ++D L+ EI+ 
Sbjct: 405  EKLEKEETARNALAEQEIIMEKVVQESKILQKEAEENAKLQEFLMDRGCVVDTLQGEISV 464

Query: 299  ILKDVMTLKERVDGHIPLSRSASHASVTSNLVSLSTELSD--SQIAPSEG-TSRNP 141
            I +DV  LKER D  +PLS+S S +  +  L S  + +    S +A   G TS  P
Sbjct: 465  ICQDVRLLKERFDERVPLSKSVSSSQTSCILASSGSSIKSMASNLAAETGETSELP 520


>ref|XP_002874890.1| hypothetical protein ARALYDRAFT_490272 [Arabidopsis lyrata subsp.
            lyrata] gi|297320727|gb|EFH51149.1| hypothetical protein
            ARALYDRAFT_490272 [Arabidopsis lyrata subsp. lyrata]
          Length = 559

 Score =  154 bits (389), Expect = 7e-35
 Identities = 135/516 (26%), Positives = 231/516 (44%), Gaps = 31/516 (6%)
 Frame = -2

Query: 1640 MGFEDVLAALVEIFPQVDYCMLQAVASQHSEDVNAALDFILHDVLPSFPASWTPCTSNSS 1461
            MG++ V  +L E+FPQ+D  +L+AVA +H +D N A   ++ +++P F         N +
Sbjct: 1    MGYKAVYRSLTELFPQIDARLLKAVAIEHPKDANEAAAVVVSEIVPFF-------YPNLA 53

Query: 1460 DWGHQGVEEAHAQWPSE-SGLLSSGIACNSDFQVNDSMTCMDKPVGRDAEYETKT----- 1299
            D   Q         P++    + +G+   S+     S +    P+  D ++E++      
Sbjct: 54   DNSTQPENRTPGNVPNKVERAMQNGVLSGSE--TGSSSSSGSIPLAVDCDHESRAPITES 111

Query: 1298 ----NGMGY--PTNSTEANSNMHPKERFVEFGVCLPSDHTNDFKVNSMNET--------- 1164
                N + +  P    +  SN        E    + S++   F+  + + +         
Sbjct: 112  ISSRNQLTHVMPNVDLDIQSNAKIGLSGSEESGVVSSENPVSFQAGAKSTSHGCQGVGFH 171

Query: 1163 LAGPDLQEVSACSAGPTSIH------ENSAMQDKYTDTATSHGAQDPFASILNKSIEHLK 1002
            + G +  E S  S    ++H      +NSAM  K        G+ D      + S+  ++
Sbjct: 172  ITGSNQAEASTSSESEDAVHKLVYPADNSAMTQKSPPLQIRFGSIDIVNETSSGSLA-VE 230

Query: 1001 EFPSQREIENDDNLSSQGDI----GSFDVNGQSSSIASVTDLFEGIKYLEDLTIDAKNSK 834
               ++    N  +++S+G +    G  ++ G  SS+ S +     I +LE +  DAK++K
Sbjct: 231  NSDAELSGSNLVDVTSKGSLAVENGDPELVGAFSSVVSRSTQGCNIVHLEQIIEDAKSNK 290

Query: 833  VILTSAMESTINMEKEVELHEERAKQAKIEASLAGEDTLVHVXXXXXXXXXXXXRNDMHA 654
              L + MES +N+ +EVEL E+ A++AK +AS  G DTL  V             NDM A
Sbjct: 291  KTLFTVMESIMNLMREVELQEKDAEKAKEDASRGGFDTLDKVEELKKMLEHAKEANDMDA 350

Query: 653  EEVYGERSMLAAEAQGLQSWLLSVSNERDKSLGIIDEIRNTLETRXXXXXXXXXXXXXXX 474
             EVYGERS+L  E   L++ LL++S ERDKSL ++DE+R  LE R               
Sbjct: 351  GEVYGERSILTTEVNELENRLLNLSEERDKSLSVLDEMREVLEIRLAAALEIKNAAEQEK 410

Query: 473  XXXXXXXXXXXADGRLFAASIFXXXXXXXXXXXXXXXXXXXLMNHGQIIDILRLEIASIL 294
                       A+       +                    LM+HG+I+D L+ EI+ I 
Sbjct: 411  QEKEGSARMAFAEQEAIMEKVVQESKLLQQEAEENSKLREFLMDHGRIVDSLQGEISVIC 470

Query: 293  KDVMTLKERVDGHIPLSRSASHASVTSNLVSLSTEL 186
            +D+  LKE+ D  +PLS+S + +  +  L S ++ +
Sbjct: 471  QDIRHLKEKFDNRVPLSQSITSSQTSCKLASSASSM 506


>ref|XP_003536443.1| PREDICTED: uncharacterized protein LOC100820331 [Glycine max]
          Length = 546

 Score =  149 bits (375), Expect = 3e-33
 Identities = 144/526 (27%), Positives = 231/526 (43%), Gaps = 17/526 (3%)
 Frame = -2

Query: 1640 MGFEDVLAALVEIFPQVDYCMLQAVASQHSEDVNAALDFILHDVLP----SFPASWTPCT 1473
            MGF  V   L EIFPQVD  +L+AVA +H +D + A   +L +V+P      PA+  P  
Sbjct: 1    MGFNSVYRNLQEIFPQVDPRLLRAVAIEHPKDADLAAGIVLAEVIPFMSKKLPAAIPPQH 60

Query: 1472 SNSS-------DWGHQGVEEAHAQWPSE-----SGLLSSGIACNSDFQVNDSMTCMDKPV 1329
            ++         +   +G    H Q   +     S  LS+G  CNS     +    MD   
Sbjct: 61   NDHGAPLDVEVESEEEGNRLRHCQRVDDVNVGPSSTLSNG--CNSKDDT-EKFLGMDDIK 117

Query: 1328 GRDAEYETKTNGMGYPTNS-TEANSNMHPKERFVEFGVCLPSDHTNDFKVNSMNETLAGP 1152
              D     + N +G   N   +  SN   +E   E     P D   +  ++S ++    P
Sbjct: 118  ELDIFQNAEDNFIGETLNEIAQEMSNGFIQEEDNENFERQPVDFDCENLISSADDYDVTP 177

Query: 1151 DLQEVSACSAGPTSIHENSAMQDKYTDTATSHGAQDPFASILNKSIEHLKEFPSQREIEN 972
                +  C      +  + A +  +    T + ++D   S L+          S  ++EN
Sbjct: 178  S-HRLEECETYLIELESSEAQEVCHVQGDTLN-SKDSLQSELDAGSSTAGGNTS--DVEN 233

Query: 971  DDNLSSQGDIGSFDVNGQSSSIASVTDLFEGIKYLEDLTIDAKNSKVILTSAMESTINME 792
            D+   S G         Q S ++ + DL      LE++  +AK +K  L S+MES IN+ 
Sbjct: 234  DNGAKSAGS--------QYSQVSRI-DL------LEEIIDEAKTNKKTLFSSMESLINLM 278

Query: 791  KEVELHEERAKQAKIEASLAGEDTLVHVXXXXXXXXXXXXRNDMHAEEVYGERSMLAAEA 612
            +EVE+ E+ A+QA +EA+  G + L  +             NDMHA EVYGE+++LA E 
Sbjct: 279  REVEVQEKAAEQANMEAATGGSNILARIEEYKTMLVQAKEANDMHAGEVYGEKAILATEL 338

Query: 611  QGLQSWLLSVSNERDKSLGIIDEIRNTLETRXXXXXXXXXXXXXXXXXXXXXXXXXXADG 432
            + LQS LL +S+ERDKSL I+DE+R+ LE R                           + 
Sbjct: 339  KELQSRLLGLSDERDKSLAILDEMRHILEERLAAAEESRKAAEQQKLEKEESARKALVEQ 398

Query: 431  RLFAASIFXXXXXXXXXXXXXXXXXXXLMNHGQIIDILRLEIASILKDVMTLKERVDGHI 252
                  +                    L++ G+++D+L+ EI+ I +D+  LKE+ D ++
Sbjct: 399  ERLVEMVVHESQRLQQEAEENSKLQEFLIDRGRVVDMLQGEISVICQDIKLLKEKFDANL 458

Query: 251  PLSRSASHASVTSNLVSLSTELSDSQIAPSEGTSRNPHSDGVSVSS 114
            PLS+S + +  +  L S     S  +   S+  S +  S G+  +S
Sbjct: 459  PLSKSFTSSQTSCKLASSG---SSHKTLASDAGSDHSESSGIRKTS 501


>gb|AAD15341.1| hypothetical protein [Arabidopsis thaliana]
            gi|7269773|emb|CAB77773.1| hypothetical protein
            [Arabidopsis thaliana]
          Length = 539

 Score =  149 bits (375), Expect = 3e-33
 Identities = 133/503 (26%), Positives = 223/503 (44%), Gaps = 18/503 (3%)
 Frame = -2

Query: 1640 MGFEDVLAALVEIFPQVDYCMLQAVASQHSEDVNAALDFILHDVLPSFPASWTPCTSNSS 1461
            MG++ V  +L E+FPQ+D  +L+AVA +H +DVN A   ++ +++P F         N +
Sbjct: 1    MGYKAVYRSLTELFPQIDARLLKAVAIEHPKDVNEAAAVVVSEIVPFF-------YPNLA 53

Query: 1460 DWGHQGVEEAHAQWPSESGLLSSGIACNSDFQVNDSMTCMDKPVGR---------DAEYE 1308
            D   Q   +     P+E    S   + +  F+ +++   + + V +         +   +
Sbjct: 54   DSSTQPENKTPGNVPTEEMGGSYSGSASMAFEYHETRAPVTESVSKRNQLTHVMPNVVVD 113

Query: 1307 TKTNGMGYPTNSTEANSNMHPKERFVEFGVCLPSDHTNDFKVNSMNETLAGPDLQEVSAC 1128
             +  G    + S E+           + G     D     + +S           E S  
Sbjct: 114  IQRKGKIGLSGSDESGVVSSEPPVSCQAGAKSTGDDWQGVEFHSTGNQA------EASTS 167

Query: 1127 SAGPTSIHENSAMQDKYTDTATSHGAQDPFASI--LNK-SIEHLKEFPSQREIENDDNLS 957
            +    ++H+     D    T  SH  Q  F SI  +N+ S   L    S  E+   + + 
Sbjct: 168  ADSEDAVHKLVYPADNLAITQNSHPLQIRFGSIDVVNETSSGSLAVENSDAELSGSNLVD 227

Query: 956  --SQGDI----GSFDVNGQSSSIASVTDLFEGIKYLEDLTIDAKNSKVILTSAMESTINM 795
              S+G +    G  +++G  SS+ + +     + +LE +  DAK++K  L + MES +N+
Sbjct: 228  EISKGSLADENGDPELDGAVSSVGNRSTQGCNMVHLEQIIEDAKSNKRTLFTVMESIMNL 287

Query: 794  EKEVELHEERAKQAKIEASLAGEDTLVHVXXXXXXXXXXXXRNDMHAEEVYGERSMLAAE 615
             +EVEL E+ A++AK +AS+ G DTL  V             NDM A EVYGERS+L  E
Sbjct: 288  MREVELQEKEAEKAKEDASIGGFDTLDKVEELKKMLEHAKEANDMAAGEVYGERSILTTE 347

Query: 614  AQGLQSWLLSVSNERDKSLGIIDEIRNTLETRXXXXXXXXXXXXXXXXXXXXXXXXXXAD 435
               L++ L+S+S ERD SL ++DE+R  LE R                          A+
Sbjct: 348  VNELENRLISLSEERDNSLSVLDEMRVDLEIRLATALGIKNAAEQEKQEKEGSARKAFAE 407

Query: 434  GRLFAASIFXXXXXXXXXXXXXXXXXXXLMNHGQIIDILRLEIASILKDVMTLKERVDGH 255
                   +                    LM+HG+I+D L+ EI+ I +D+  LKE+ D  
Sbjct: 408  QEAIMERVVQESKLLQQEAEENSKLREFLMDHGRIVDSLQGEISVICQDIRHLKEKFDNR 467

Query: 254  IPLSRSASHASVTSNLVSLSTEL 186
            +PLS+S S +  +  L S ++ +
Sbjct: 468  VPLSQSISSSQTSCKLASSASSM 490


>ref|XP_003520215.1| PREDICTED: uncharacterized protein LOC100789476 [Glycine max]
          Length = 603

 Score =  145 bits (365), Expect = 4e-32
 Identities = 149/571 (26%), Positives = 240/571 (42%), Gaps = 68/571 (11%)
 Frame = -2

Query: 1640 MGFEDVLAALVEIFPQVDYCMLQAVASQHSEDVNAALDFILHDVLP----SFPASWTPCT 1473
            MGF  V  +L EIFPQVD  +L+AVA +H +D + A   ++ +V+P      PA+  P  
Sbjct: 1    MGFNSVYRSLQEIFPQVDPRLLRAVAIEHPKDADLAAGIVIAEVIPFMSKKLPAAIPPQH 60

Query: 1472 SN----------SSDWGH-----QGVEEAHAQWPSESGLLSSGIACNSDFQ-VNDSMTCM 1341
            +N          S + G+     Q V++      S    +S  +   +D+  V D    +
Sbjct: 61   NNYVASLNVEVESEEEGNRLRHRQLVDDVTVGPSSAPHSISVEVIKTADYSFVPDLNEAL 120

Query: 1340 DKPV----GRDAEYE------------TKTNGMGYPTN----------STEANSNMHPK- 1242
            DK      G D   E             + N  G   N          S E N N   + 
Sbjct: 121  DKSTMSNDGTDKFLEMNDIKELDIYQNAEDNFSGETLNEIAQEMSNGFSQEDNENFERRF 180

Query: 1241 -----ERFVEFGVC--LPSDHTNDFKVNSMNETLAGPDLQEVSACSAGPTSIHENSAMQD 1083
                 E  +  G+C  +   H N  K  + N      D   +   S     +   S++ D
Sbjct: 181  VDVDCENLISSGICQEMEPKHNNLSKEAASNNG----DGNRIGNDSNEMGWLEVVSSLVD 236

Query: 1082 KYTDTATSHGAQDPFASILNKSIEHLKEFPSQREIEND-----DNLSSQGDIGSFDVNGQ 918
             Y D  TSH  ++    ++        E P    ++ D     D+L S+   GS      
Sbjct: 237  DY-DATTSHRLEECETYLIELETS---EAPKVCHVQGDALNYKDSLQSELVAGSSSTGDN 292

Query: 917  SSSIAS-VTDLFEGIKY--------LEDLTIDAKNSKVILTSAMESTINMEKEVELHEER 765
            +S +   +     G +Y        LE++  +AK +K +L S+MES IN+ +EVEL E+ 
Sbjct: 293  TSDVEDDIGAKNAGSQYSHVCRIDLLEEIIDEAKTNKKMLFSSMESLINLMREVELQEKA 352

Query: 764  AKQAKIEASLAGEDTLVHVXXXXXXXXXXXXRNDMHAEEVYGERSMLAAEAQGLQSWLLS 585
            A+QA +EA+  G + L  +             NDMH+ EVYGE+++L  E + LQS LL 
Sbjct: 353  AEQANMEAATGGSNILARIEEYKTMVVQANEANDMHSGEVYGEKAILTTELKELQSRLLG 412

Query: 584  VSNERDKSLGIIDEIRNTLETRXXXXXXXXXXXXXXXXXXXXXXXXXXADGRLFAASIFX 405
            +S+ERD+SL I+DEIR+ LE R                           +       +  
Sbjct: 413  LSDERDRSLAILDEIRHILEVRLAAAEELRKAAEQLKLEKEESARKALVEQERLVEKVVH 472

Query: 404  XXXXXXXXXXXXXXXXXXLMNHGQIIDILRLEIASILKDVMTLKERVDGHIPLSRSASHA 225
                              L++ G+++D+L+ EI+ I +D+  LKE+ D ++PLS+S + +
Sbjct: 473  ESQRLQQEAEENSKLQEFLIDRGRVVDMLQGEISVICQDIKLLKEKFDANLPLSKSFTSS 532

Query: 224  SVTSNLVSLSTELSDSQIAPSEGTSRNPHSD 132
              +  L S  +  S   +A   G+  +  S+
Sbjct: 533  QTSCKLASSGS--SHKTLASDAGSEHSESSE 561

