BLASTX nr result

ID: Dioscorea21_contig00016589 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00016589
         (1874 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004159451.1| PREDICTED: uncharacterized LOC101213190 [Cuc...   203   1e-49
ref|XP_004140922.1| PREDICTED: uncharacterized protein LOC101213...   203   1e-49
emb|CAN73830.1| hypothetical protein VITISV_043067 [Vitis vinifera]   198   4e-48
ref|NP_001078603.1| uncharacterized protein [Arabidopsis thalian...   185   4e-44
ref|XP_003545319.1| PREDICTED: uncharacterized protein LOC100786...   184   7e-44

>ref|XP_004159451.1| PREDICTED: uncharacterized LOC101213190 [Cucumis sativus]
          Length = 552

 Score =  203 bits (516), Expect = 1e-49
 Identities = 141/338 (41%), Positives = 194/338 (57%), Gaps = 14/338 (4%)
 Frame = +2

Query: 641  VKDGESGILNSESSSNNNFTKTFHRSGEREATQMRFQKSQFHHGKYAKGGVKPFAKNGGQ 820
            + DG +G   S S SNN+  + F R+ ++      FQK+Q HH K  K   K F   GGQ
Sbjct: 215  ISDGGNG---SNSISNNSAHRNFMRNSKKG-----FQKNQTHHLKNEK---KKFGFPGGQ 263

Query: 821  GQQNW-KERNHQFNKGSKQAPV-ECKRPIPINYTENEIKAWREARRKNFPTNANIAKKLA 994
             ++ +  ER ++F   +    V E KR + + YT+ EI+ WREARRKN+P++ NI KKL 
Sbjct: 264  KEKGFHNERRNKFCGTNPTDQVKEQKRSLSLVYTDQEIRQWREARRKNYPSSTNIQKKLT 323

Query: 995  GNVKHVEDADGDDAKLRRQQLKEILAQQAKLGVEVAEVPSSYLSDSENLVSNSKGDRKDF 1174
            G   +    D  +AKL RQ+LKEILA+QA+LGVEVAE+P  YLS SE      K D +  
Sbjct: 324  GKQTNCTLVD-KEAKLLRQELKEILAKQAELGVEVAEIPPEYLSYSE------KHDNRKQ 376

Query: 1175 RHGRKSW------------NARGRLQNDNKRRNNQYQDKPCGKRPKFANDDTSKNPSKIR 1318
            R GR +             N++ RL   NKR   + +++P  K+ KF    ++K P K R
Sbjct: 377  RGGRSTLGEEAEEASIEKENSQNRL---NKRGRCKKKNRP-RKKGKFEKHLSNKPPLKKR 432

Query: 1319 EPTLLQKLLSKEIKKDKSKLLQVFRFMVMNSFFKHWPEKPLEFPVITIKDPDSETVAAVE 1498
            EPTLLQKLL  +++KDKS+LLQ  RF VMNSFFK WP KPL+FP +T+K+ + ET     
Sbjct: 433  EPTLLQKLLKADVRKDKSQLLQALRFTVMNSFFKEWPNKPLKFPSVTVKENEGET----- 487

Query: 1499 TDTLLNGLESTPASGKNENERIDELNTAVDDVESGPID 1612
                 N ++ T  S  N N +    N+ V++  S  ID
Sbjct: 488  -----NVVDETSLSTGNFNLQETNNNSLVENDGSHDID 520


>ref|XP_004140922.1| PREDICTED: uncharacterized protein LOC101213190 [Cucumis sativus]
          Length = 599

 Score =  203 bits (516), Expect = 1e-49
 Identities = 141/338 (41%), Positives = 194/338 (57%), Gaps = 14/338 (4%)
 Frame = +2

Query: 641  VKDGESGILNSESSSNNNFTKTFHRSGEREATQMRFQKSQFHHGKYAKGGVKPFAKNGGQ 820
            + DG +G   S S SNN+  + F R+ ++      FQK+Q HH K  K   K F   GGQ
Sbjct: 262  ISDGGNG---SNSISNNSAHRNFMRNSKKG-----FQKNQTHHLKNEK---KKFGFPGGQ 310

Query: 821  GQQNW-KERNHQFNKGSKQAPV-ECKRPIPINYTENEIKAWREARRKNFPTNANIAKKLA 994
             ++ +  ER ++F   +    V E KR + + YT+ EI+ WREARRKN+P++ NI KKL 
Sbjct: 311  KEKGFHNERRNKFCGTNPTDQVKEQKRSLSLVYTDQEIRQWREARRKNYPSSTNIQKKLT 370

Query: 995  GNVKHVEDADGDDAKLRRQQLKEILAQQAKLGVEVAEVPSSYLSDSENLVSNSKGDRKDF 1174
            G   +    D  +AKL RQ+LKEILA+QA+LGVEVAE+P  YLS SE      K D +  
Sbjct: 371  GKQTNCTLVD-KEAKLLRQELKEILAKQAELGVEVAEIPPEYLSYSE------KHDNRKQ 423

Query: 1175 RHGRKSW------------NARGRLQNDNKRRNNQYQDKPCGKRPKFANDDTSKNPSKIR 1318
            R GR +             N++ RL   NKR   + +++P  K+ KF    ++K P K R
Sbjct: 424  RGGRSTLGEEAEEASIEKENSQNRL---NKRGRCKKKNRP-RKKGKFEKHLSNKPPLKKR 479

Query: 1319 EPTLLQKLLSKEIKKDKSKLLQVFRFMVMNSFFKHWPEKPLEFPVITIKDPDSETVAAVE 1498
            EPTLLQKLL  +++KDKS+LLQ  RF VMNSFFK WP KPL+FP +T+K+ + ET     
Sbjct: 480  EPTLLQKLLKADVRKDKSQLLQALRFTVMNSFFKEWPNKPLKFPSVTVKENEGET----- 534

Query: 1499 TDTLLNGLESTPASGKNENERIDELNTAVDDVESGPID 1612
                 N ++ T  S  N N +    N+ V++  S  ID
Sbjct: 535  -----NVVDETSLSTGNFNLQETNNNSLVENDGSHDID 567


>emb|CAN73830.1| hypothetical protein VITISV_043067 [Vitis vinifera]
          Length = 605

 Score =  198 bits (504), Expect = 4e-48
 Identities = 130/369 (35%), Positives = 195/369 (52%), Gaps = 13/369 (3%)
 Frame = +2

Query: 653  ESGILNSESSSNNNFTKTFHRSGEREATQMRFQKSQFHHGKYAKGGVKPFAKNGGQGQQN 832
            ++G+ NS  +  N+  K F  + + + +    +KSQ HH +  +G      +N G+G  N
Sbjct: 256  DAGVNNSNPNWKNSSRKNFMXNPKGKNSHWGSRKSQLHHMQNGRGKAGISNENRGKGLSN 315

Query: 833  WKERNHQFNKGSKQAPVECKRPIPINYTENEIKAWREARRKNFPTNANIAKKLAGNVKHV 1012
                N      + Q  VE KRP+P+NYTE EI+ WRE R+KN+P+  N+ KK A  + + 
Sbjct: 316  NMAGNLCRPNFTYQDKVEKKRPLPLNYTEQEIQNWREERKKNYPSKINLEKKSAEKLTNS 375

Query: 1013 EDADGDDAKLRRQQLKEILAQQAKLGVEVAEVPSSYLSDSENLVSNSKGDRKDFRHGRKS 1192
            E  + +  K RRQQLKEILA+QA+LGVEVAE+P  YLSDSE    + + +      G+K 
Sbjct: 376  EVIEAE-VKSRRQQLKEILAKQAELGVEVAEIPPHYLSDSEKQQVHGREENNKKAFGKKE 434

Query: 1193 -WNARGRLQNDNKRRNNQYQDKPCG--KRPKFANDDT------SKNPSKIREPTLLQKLL 1345
             +  RG  +  + R+  Q  D+  G  K+ + A  D+      ++ P   ++ TLLQKLL
Sbjct: 435  RFQNRGNKRRRHDRKQWQRHDQEDGFTKKQRLAGTDSGDTNASNQPPLNKKKQTLLQKLL 494

Query: 1346 SKEIKKDKSKLLQVFRFMVMNSFFKHWPEKPLEFPVITIKDP--DSETVAAVETDTLLNG 1519
            S +IK+DK  LLQVFRFM MNSFFK WPEKPL+FP++ +K+     E V    + T    
Sbjct: 495  STDIKRDKRHLLQVFRFMAMNSFFKDWPEKPLKFPLVAVKETGCQGEVVDRKSSPTSKGV 554

Query: 1520 LESTPASGKNENERIDELN--TAVDDVESGPIDGCXXXXXXXXXXXXXXXXHNQNEDLHE 1693
             +    +   E   +DE +   A +    G ++G                  +  +++  
Sbjct: 555  PQGGRKTXAEEFSNVDEADDAQAKETAHFGKVEG------------------SSGDEIER 596

Query: 1694 KLEEGEITE 1720
              EEGEI +
Sbjct: 597  SEEEGEIID 605


>ref|NP_001078603.1| uncharacterized protein [Arabidopsis thaliana]
            gi|145358174|ref|NP_197345.2| uncharacterized protein
            [Arabidopsis thaliana] gi|60547897|gb|AAX23912.1|
            hypothetical protein At5g18440 [Arabidopsis thaliana]
            gi|71905555|gb|AAZ52755.1| hypothetical protein At5g18440
            [Arabidopsis thaliana] gi|71905557|gb|AAZ52756.1|
            hypothetical protein At5g18440 [Arabidopsis thaliana]
            gi|332005180|gb|AED92563.1| uncharacterized protein
            [Arabidopsis thaliana] gi|332005181|gb|AED92564.1|
            uncharacterized protein [Arabidopsis thaliana]
          Length = 470

 Score =  185 bits (469), Expect = 4e-44
 Identities = 122/316 (38%), Positives = 183/316 (57%), Gaps = 8/316 (2%)
 Frame = +2

Query: 668  NSESSSNNNFTKTFHRSGEREATQMRFQKSQFHHGKYAKGGVKPFAKNGGQGQQNWKERN 847
            N    + N+F   F +    +     FQ+ Q H     K       K+G       K  N
Sbjct: 178  NGSGPNGNDFRNKFPKHQNFKGPGQGFQRPQLHQADNGK------RKSGFNKDHRGKGNN 231

Query: 848  HQFNKGSKQAPV-----ECKRPIPINYTENEIKAWREARRKNFPTNANIAKKLAGNVKHV 1012
            ++   G   +       E KR   + YT  E++ WREARRKN+PT   + KK+  NV   
Sbjct: 232  NKMKTGLDGSDTGNIAKEKKRSYALMYTPREVQQWREARRKNYPTKFLVEKKVKKNVS-- 289

Query: 1013 EDADGDDAKLRRQQLKEILAQQAKLGVEVAEVPSSYLSDSENLVSNSKGDRKDFRHGRKS 1192
                 ++AK+RRQQL+E+LA+QA+LGVEVAEVPS YLS+++  V+  +G+     +GRK 
Sbjct: 290  ASILDEEAKMRRQQLREVLAKQAELGVEVAEVPSHYLSNNDEQVNGDRGNN----NGRK- 344

Query: 1193 WNARGRLQND--NKRRNNQYQDKPCGKRPKFANDDTSKNPS-KIREPTLLQKLLSKEIKK 1363
                GR QN+  NKRR+++ +DK   K+P+  +  +S++ S   R+PTLL+KLLS +IK+
Sbjct: 345  ----GRFQNNRRNKRRHDR-KDKFDNKKPRLEDKKSSQDSSITTRKPTLLEKLLSADIKR 399

Query: 1364 DKSKLLQVFRFMVMNSFFKHWPEKPLEFPVITIKDPDSETVAAVETDTLLNGLESTPASG 1543
            DKS+LLQVFRFMVMNS  K +PE+PL+ P+IT+K+   E   A+E D  + GL      G
Sbjct: 400  DKSQLLQVFRFMVMNSLLKEFPEQPLKLPLITVKETGCED--AME-DPSIEGL----CDG 452

Query: 1544 KNENERIDELNTAVDD 1591
             ++ + +D  +++ D+
Sbjct: 453  LSDGDDVDGDDSSCDE 468


>ref|XP_003545319.1| PREDICTED: uncharacterized protein LOC100786384 [Glycine max]
          Length = 822

 Score =  184 bits (467), Expect = 7e-44
 Identities = 128/340 (37%), Positives = 183/340 (53%), Gaps = 23/340 (6%)
 Frame = +2

Query: 662  ILNSESSSNNNF-------------TKTFHRSGEREATQMRFQKSQFHHGKYAKGGVKPF 802
            +L +E   N+N              +K F     R   Q  FQKS+FH     K G    
Sbjct: 466  VLKTEEKPNSNIKTNVPNSNWKGSPSKNFKNKPNRGGFQAGFQKSKFHDVNNGKKGSGFP 525

Query: 803  AKNGGQGQQNWKERNHQFN-KGSKQAPVECKRPIPINYTENEIKAWREARRKNFPTNANI 979
             ++ G+G  + +  ++    K  KQ P   +R + + YT  EI+ WREAR+KN P N NI
Sbjct: 526  IEHNGKGPNSGRGGHYGLKPKEHKQQP---ERSLSVTYTVQEIQQWREARKKNHPFNNNI 582

Query: 980  AKKLAGNVKHVEDADGDDAKLRRQQLKEILAQQAKLGVEVAEVPSSYLSDSENLVSNSKG 1159
             KK   + +H +D    + ++ +++LKE+LA+QA+LGVEVAE+PS YL +S+N    S+G
Sbjct: 583  QKK---HSEHPKDRKAINREVLQRELKEVLAKQAELGVEVAEIPSYYLKNSDNQALQSEG 639

Query: 1160 DRKDFRHGRKSWNARGRLQNDNKRRNNQYQDKPCGKRPKFANDDTSKNPS-KIREPTLLQ 1336
              K F   RK  N   + ++D K R          KR KF + D S++PS K R+PTLLQ
Sbjct: 640  KNK-FTDKRKFQNKFNK-KSDRKGR--------FAKRQKFDDKDFSESPSLKKRKPTLLQ 689

Query: 1337 KLLSKEIKKDKSKLLQVFRFMVMNSFFKHWPEKPLEFPVITIKDPDSETVAAVE-----T 1501
            KLLS ++K+DKS L+QVFRFMVMNSFFKH  +KPL +P++ +K+  SE     +      
Sbjct: 690  KLLSSDVKRDKSHLIQVFRFMVMNSFFKHCLDKPLRYPLVVVKEKGSEVDGEEKYLHTGK 749

Query: 1502 DTLLNGLESTP---ASGKNENERIDELNTAVDDVESGPID 1612
            D L  G E T     +  N+N    E   + DD     +D
Sbjct: 750  DVLKGGNEETVQKIVTFNNDNSHDCEDEDSDDDENDSIVD 789


Top