BLASTX nr result

ID: Dioscorea21_contig00027279

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00027279
         (1123 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAG51464.1|AC069160_10 gypsy/Ty3 element polyprotein, putativ...   206   1e-50
emb|CAN60890.1| hypothetical protein VITISV_011880 [Vitis vinifera]   171   3e-40
emb|CAN80132.1| hypothetical protein VITISV_012031 [Vitis vinifera]   167   4e-39
emb|CAN80491.1| hypothetical protein VITISV_042679 [Vitis vinifera]   164   5e-38
ref|XP_003525745.1| PREDICTED: uncharacterized protein LOC100809...   160   4e-37

>gb|AAG51464.1|AC069160_10 gypsy/Ty3 element polyprotein, putative [Arabidopsis thaliana]
          Length = 1447

 Score =  206 bits (523), Expect = 1e-50
 Identities = 125/363 (34%), Positives = 193/363 (53%), Gaps = 25/363 (6%)
 Frame = +1

Query: 106  QLTKLEFPKFNGERLREWVLRCIQFFELDCTPENTKVKIASLHMEGRALQWHQNFMKGRL 285
            +L K++FP+F+G R+ EW+ +  +FF +D TPE  KVK+ ++H +  A  WH +F++  +
Sbjct: 113  RLGKIDFPRFDGSRINEWLFKVEEFFGVDFTPEEMKVKMVAIHFDSHAATWHHSFIQSGI 172

Query: 286  SGE-FPVWGEYVSALNSRFGSELHYDPMMELTNLKQLGSVQTYSDKFDELLNKVSLSTEH 462
              + F  W EYV  L  RF  +   DPM EL  L++   +  Y  +F+ +  +++LS E+
Sbjct: 173  GLDVFFNWPEYVKLLKDRF-EDACDDPMAELKKLQETDGIVEYHQQFELIKVRLNLSEEY 231

Query: 463  ALSLFLGGLKEELQYTVRMLAPKNLQQAISLAKLQELAVENAKRREKSIQRDIDS-ASFL 639
             +S++L GL+ + Q  VRM  PK ++  + L K  E A           Q+   S  S+ 
Sbjct: 232  LVSVYLAGLRTDTQMHVRMFEPKTVRDCLRLGKYYERAHPKKTVSSTWSQKGTRSGGSYR 291

Query: 640  PTPKMPQDGGGV---------------------NVGRIDINNQ-EDIIEAEENNIPGVDP 753
            P  ++ Q    +                      + R+D++ + ED +E   ++      
Sbjct: 292  PVKEVEQKSDHLGLCYFCDEKFTPEHYLVHKKTQLFRMDVDEEFEDAVEVLSDD------ 345

Query: 754  LDEETVP-PVVSSNALDGIPSLANYNIMRVSGSVQSQKIHILIDSGSTHNFIDSSTTKRL 930
             D E  P P +S NA+ GI   + Y  M V G+V  + + ILIDSGSTHNFIDS+   +L
Sbjct: 346  -DHEQKPMPQISVNAVSGI---SGYKTMGVKGTVDKRDLFILIDSGSTHNFIDSTVAAKL 401

Query: 931  GCKISNYHPVTVAKADGNKILCDKVCAGMKWKMQSVEFRTDLLGIPLKGCQMVLGIQWLI 1110
            GC + +     VA ADG K+  D    G  WK+QS  F++D+L IPL+G  MVLG+QWL 
Sbjct: 402  GCHVESAGLTKVAVADGRKLNVDGQIKGFTWKLQSTTFQSDILLIPLQGVDMVLGVQWLE 461

Query: 1111 LLG 1119
             LG
Sbjct: 462  TLG 464


>emb|CAN60890.1| hypothetical protein VITISV_011880 [Vitis vinifera]
          Length = 1378

 Score =  171 bits (433), Expect = 3e-40
 Identities = 82/168 (48%), Positives = 115/168 (68%)
 Frame = +1

Query: 112 TKLEFPKFNGERLREWVLRCIQFFELDCTPENTKVKIASLHMEGRALQWHQNFMKGRLSG 291
           TK++FPKFNG  L  W+LR   FFE+D TP   +V++A+LH+EG+A+QWHQ ++K R + 
Sbjct: 127 TKVDFPKFNGCGLDGWLLRVEYFFEVDRTPPEARVRLAALHLEGKAIQWHQGYIKTRGNE 186

Query: 292 EFPVWGEYVSALNSRFGSELHYDPMMELTNLKQLGSVQTYSDKFDELLNKVSLSTEHALS 471
            +  W EYV ALN+RFG  +  DP+ +L NL+Q GS+Q+Y D+FDEL  +  +   HALS
Sbjct: 187 AYLDWSEYVIALNARFGQHVFDDPIADLRNLRQTGSLQSYMDEFDELYPRADIKESHALS 246

Query: 472 LFLGGLKEELQYTVRMLAPKNLQQAISLAKLQELAVENAKRREKSIQR 615
            FL GL +ELQ  VRM  P+ L  A SLA+LQE+AV   + + K + +
Sbjct: 247 FFLSGLIDELQMPVRMFKPQTLADAYSLARLQEIAVAALQNKPKPVSK 294



 Score = 63.2 bits (152), Expect = 1e-07
 Identities = 27/84 (32%), Positives = 49/84 (58%)
 Frame = +1

Query: 793  ALDGIPSLANYNIMRVSGSVQSQKIHILIDSGSTHNFIDSSTTKRLGCKISNYHPVTVAK 972
            +L+ + S  +   M ++G+ + + + +LIDSGS+HNF+ S   KR+ C       + V  
Sbjct: 424  SLNALMSNEDSQTMTLNGNYKGRSLFVLIDSGSSHNFLSSKVAKRVDCCWQKARGIRVTV 483

Query: 973  ADGNKILCDKVCAGMKWKMQSVEF 1044
            A+G ++ C  +C+  +W+MQ  EF
Sbjct: 484  ANGQELHCTALCSDFRWRMQGQEF 507


>emb|CAN80132.1| hypothetical protein VITISV_012031 [Vitis vinifera]
          Length = 1371

 Score =  167 bits (423), Expect = 4e-39
 Identities = 80/168 (47%), Positives = 113/168 (67%)
 Frame = +1

Query: 112 TKLEFPKFNGERLREWVLRCIQFFELDCTPENTKVKIASLHMEGRALQWHQNFMKGRLSG 291
           TK++FPKFNG  L  W+LR   FFE+D TP   +V++A+LH+EG+A+QWHQ ++K R + 
Sbjct: 75  TKVDFPKFNGGGLDGWLLRVEYFFEVDRTPPEARVRLAALHLEGKAIQWHQGYIKTRGNE 134

Query: 292 EFPVWGEYVSALNSRFGSELHYDPMMELTNLKQLGSVQTYSDKFDELLNKVSLSTEHALS 471
            +  W EYV ALN+RFG  +  DP+ +L NL+Q GS+Q+Y D+FDEL  +  +   HALS
Sbjct: 135 AYLDWSEYVIALNARFGQHVFBDPIADLRNLRQTGSLQSYMDEFDELYPRADIKESHALS 194

Query: 472 LFLGGLKEELQYTVRMLAPKNLQQAISLAKLQELAVENAKRREKSIQR 615
            FL  L +ELQ  VRM  P+ L  A SLA+LQE+A    + + K + +
Sbjct: 195 FFLSXLIDELQMPVRMFKPQTLADAYSLARLQEIAXAALQNKPKPVSK 242



 Score = 83.6 bits (205), Expect = 8e-14
 Identities = 45/134 (33%), Positives = 75/134 (55%)
 Frame = +1

Query: 718  EAEENNIPGVDPLDEETVPPVVSSNALDGIPSLANYNIMRVSGSVQSQKIHILIDSGSTH 897
            E  E N+  ++ L EE     +S NAL    S  +   M ++G+ + + + +LIDSGS+H
Sbjct: 351  EGPEGNLQ-MEGLGEEDEQIQLSLNAL---MSNEDSQTMTLNGNYKGRSLFVLIDSGSSH 406

Query: 898  NFIDSSTTKRLGCKISNYHPVTVAKADGNKILCDKVCAGMKWKMQSVEFRTDLLGIPLKG 1077
            NF+ S   KR+ C       + V  A+G+++ C  +C+  +W+MQ  EF  ++  +PL+ 
Sbjct: 407  NFLSSKVAKRVDCCWQKARGIRVTVANGHELHCTALCSDFRWRMQGQEFIAEVYVLPLET 466

Query: 1078 CQMVLGIQWLILLG 1119
              ++LG QWL  LG
Sbjct: 467  YDLILGTQWLATLG 480


>emb|CAN80491.1| hypothetical protein VITISV_042679 [Vitis vinifera]
          Length = 1412

 Score =  164 bits (414), Expect = 5e-38
 Identities = 79/168 (47%), Positives = 112/168 (66%)
 Frame = +1

Query: 112 TKLEFPKFNGERLREWVLRCIQFFELDCTPENTKVKIASLHMEGRALQWHQNFMKGRLSG 291
           TK++F KFNG  L  W+LR   FFE+D TP   +V++A+LH+EG+A+QWHQ ++K R + 
Sbjct: 75  TKVDFXKFNGXGLDGWLLRVEYFFEVDRTPPEARVRLAALHLEGKAIQWHQGYIKTRGNE 134

Query: 292 EFPVWGEYVSALNSRFGSELHYDPMMELTNLKQLGSVQTYSDKFDELLNKVSLSTEHALS 471
            +  W E V ALN+RFG  +  DP+ +L NL+Q GS+Q+Y D+FDEL  +  +   HALS
Sbjct: 135 AYLDWSEXVIALNARFGQHVFDDPIADLRNLRQTGSLQSYMDEFDELYPRADIKESHALS 194

Query: 472 LFLGGLKEELQYTVRMLAPKNLQQAISLAKLQELAVENAKRREKSIQR 615
            FL GL +EL   VRM  P+ L  A SLA+LQE+AV   + + K + +
Sbjct: 195 FFLSGLIDELXMPVRMFKPQTLADAYSLARLQEIAVAALQNKPKPVSK 242



 Score = 83.2 bits (204), Expect = 1e-13
 Identities = 45/134 (33%), Positives = 74/134 (55%)
 Frame = +1

Query: 718  EAEENNIPGVDPLDEETVPPVVSSNALDGIPSLANYNIMRVSGSVQSQKIHILIDSGSTH 897
            E  E N+  ++ L EE     +S NAL    S  +   M ++G+ + + + +LIDSGS+H
Sbjct: 351  EGPEGNLQ-MEGLXEEDEQIQLSLNAL---MSNEDSQTMTLNGNYKGRSLFVLIDSGSSH 406

Query: 898  NFIDSSTTKRLGCKISNYHPVTVAKADGNKILCDKVCAGMKWKMQSVEFRTDLLGIPLKG 1077
            NF+ S   KR+ C       + V  A+G ++ C  +C+  +W+MQ  EF  ++  +PL+ 
Sbjct: 407  NFLSSKVAKRVDCCWQKARGIRVTVANGQELHCTALCSDFRWRMQGQEFIAEVYVLPLET 466

Query: 1078 CQMVLGIQWLILLG 1119
              ++LG QWL  LG
Sbjct: 467  YDLILGTQWLATLG 480


>ref|XP_003525745.1| PREDICTED: uncharacterized protein LOC100809540 [Glycine max]
          Length = 1232

 Score =  160 bits (406), Expect = 4e-37
 Identities = 112/347 (32%), Positives = 164/347 (47%), Gaps = 12/347 (3%)
 Frame = +1

Query: 115  KLEFPKFNGERLREWVLRCIQFFELDCTPENTKVKIASLHMEGRALQWHQNFMKGRLSGE 294
            KLE P+F+G+    W+ +  QFF+     +  ++ +AS +MEG AL W Q   +   +G 
Sbjct: 2    KLEVPRFDGKDPLGWIFKITQFFDYQGVSDAERLTVASFYMEGPALCWFQWMSR---NGF 58

Query: 295  FPVWGEYVSALNSRFGSELHYDPMMELTNLKQLGSVQTYSDKFDELLNKV-SLSTEHALS 471
               W   + AL +RF    + DP   L  ++Q G+V  Y  +F+ L N+V  L+   +LS
Sbjct: 59   LTSWQAMLQALETRFAPSYYDDPYGALFKIQQRGTVNEYLSEFERLANRVVGLAPPLSLS 118

Query: 472  LFLGGLKEELQYTVRMLAPKNLQQAISLAKLQELAVENAKRREKSIQRDIDSASFLPTPK 651
             F+ GL  EL   V+ L P  L QA++LAKLQE  + + +R  +       +    PTP 
Sbjct: 119  CFISGLNPELHREVQALQPMCLPQAMALAKLQEDKLLDRRRNHRH------NFPVSPTPH 172

Query: 652  MPQDGGGVNVGRIDINNQEDIIEAEENNIPGVDPLDEETVPPVVSS-----------NAL 798
             P             NN          N P +  L  E +                 NAL
Sbjct: 173  HP-------------NNPPPSSSTSATNKPPIRRLTPEEMALKRDKGLCYHCDEKCLNAL 219

Query: 799  DGIPSLANYNIMRVSGSVQSQKIHILIDSGSTHNFIDSSTTKRLGCKISNYHPVTVAKAD 978
             G+P+   +   RV G+V+  ++ IL+D GSTHNF+     K LG   +   P+ V   D
Sbjct: 220  SGMPAPETF---RVYGTVRRHQLTILVDGGSTHNFVQLRVAKFLGLPSTPMTPLPVMVGD 276

Query: 979  GNKILCDKVCAGMKWKMQSVEFRTDLLGIPLKGCQMVLGIQWLILLG 1119
            G  I CD     +   +Q  +F TDL G+PL G  +VLG+QWL  LG
Sbjct: 277  GGVIHCDCRYPQVSITIQGHQFTTDLFGLPLSGADLVLGVQWLRALG 323