BLASTX nr result

ID: Dioscorea21_contig00018480 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00018480
         (1390 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAT38758.1| Putative gag-pol polyprotein, identical [Solanum ...   507   e-141
emb|CAN79061.1| hypothetical protein VITISV_024577 [Vitis vinifera]   504   e-140
gb|ABA95820.1| retrotransposon protein, putative, unclassified [...   498   e-138
gb|AAT47077.1| putative polyprotein [Oryza sativa Japonica Group]     493   e-137
gb|ABW74566.1| integrase [Boechera divaricarpa]                       488   e-135

>gb|AAT38758.1| Putative gag-pol polyprotein, identical [Solanum demissum]
          Length = 1333

 Score =  507 bits (1306), Expect = e-141
 Identities = 251/442 (56%), Positives = 320/442 (72%)
 Frame = +3

Query: 3    KGYSQQPGVDFNETFAPVVRMETIRAVLAIAAQLELEVFQLDVKSAFLNGHLEEEVYVEQ 182
            KGYSQQ GVDF+ETF+PV R ET+R VLA+AAQL L V+Q DVKSAFLNG LEEEVYV Q
Sbjct: 886  KGYSQQQGVDFDETFSPVARFETVRVVLALAAQLHLPVYQFDVKSAFLNGDLEEEVYVSQ 945

Query: 183  PQGFIVKGQENKVYRLKKALYGLKQAPRAWNMKIDCYLQQNGFSRSQNEPSLYVKTWDNS 362
            PQGF++ G ENKVY+L+KALYGLKQAPRAW  KID + Q +GF RS NEP+LY+K     
Sbjct: 946  PQGFMITGNENKVYKLRKALYGLKQAPRAWYSKIDSFFQGSGFRRSDNEPTLYLKKQGTD 1005

Query: 363  DFLVVCLYVDDIIYTSTNLRMADEFKNAMMHNFEMTDLGRMKYFLGIQVKQSKGEIFICQ 542
            +FL+VCLYVDD+IY  ++  + ++FK+ MM NFEM+DLG +KYFLG++V Q K  IFI Q
Sbjct: 1006 EFLLVCLYVDDMIYIGSSKSLVNDFKSNMMRNFEMSDLGLLKYFLGLEVIQDKDGIFISQ 1065

Query: 543  EKYTEDLLKKFHMESCKPMPTPLALNEKLQAIDGAQNANPDVYRSLVGSLIYLTNTRPDI 722
            +KY EDLLKKF M +C+   TP+ +NEKLQ  DG + ANP ++RSLVG L YLT+TRPDI
Sbjct: 1066 KKYAEDLLKKFQMMNCEVATTPMNINEKLQRADGTEKANPKLFRSLVGGLNYLTHTRPDI 1125

Query: 723  LHAVSYISRFMNNPSKLHFNAAKRILRYLQGTKKLGIKYAKEENNKLIGYTDSDWAGALD 902
              +VS +SRF+ +P+K HF AAKR+LRY+ GT   GI Y+K  N +L+G+TDSD+AG LD
Sbjct: 1126 AFSVSVVSRFLQSPTKQHFGAAKRVLRYVAGTTDFGIWYSKAPNFRLVGFTDSDYAGCLD 1185

Query: 903  DRKSTSGYLFCLGTKVISWSSKKQNTVXXXXXXXXXXXXXXXXXXXVWLRRLLSDLKQKQ 1082
            DRKSTSG  F  G+ V++WSSKKQ TV                   +WLR+LL D   +Q
Sbjct: 1186 DRKSTSGSCFSFGSGVVTWSSKKQETVALSTSEAEYTAASLAARQALWLRKLLEDFSYEQ 1245

Query: 1083 EGPTEIFCDNMSAIAMTKNPVFHGRTKHIELRHHFIRDLVNRKEVQLNYINTTNQPADVL 1262
            +  TEIF D+ SAIAM KNP FHGRTKHI++++HFIR LV    + L + +T  Q AD+ 
Sbjct: 1246 KESTEIFSDSKSAIAMAKNPSFHGRTKHIDVQYHFIRTLVADGRIVLKFCSTNEQAADIF 1305

Query: 1263 TKAVSKDKLVSFKDHLKITN*E 1328
            TK++ + K   F+  L + + E
Sbjct: 1306 TKSLPQAKHEYFRLQLGVCDFE 1327


>emb|CAN79061.1| hypothetical protein VITISV_024577 [Vitis vinifera]
          Length = 1424

 Score =  504 bits (1299), Expect = e-140
 Identities = 244/438 (55%), Positives = 320/438 (73%)
 Frame = +3

Query: 3    KGYSQQPGVDFNETFAPVVRMETIRAVLAIAAQLELEVFQLDVKSAFLNGHLEEEVYVEQ 182
            KGYSQQPG DF+ETFAPV R++TIR ++A+AAQ    ++QLD+KSAFLNG LE E+YVEQ
Sbjct: 976  KGYSQQPGXDFHETFAPVARLDTIRTIIAVAAQKGWLLYQLDIKSAFLNGKLEXEIYVEQ 1035

Query: 183  PQGFIVKGQENKVYRLKKALYGLKQAPRAWNMKIDCYLQQNGFSRSQNEPSLYVKTWDNS 362
            PQGF+V G+ENKVY+LKKALYGLKQAPRAW  +ID Y  +NGF RS++EP+LYVK+ DNS
Sbjct: 1036 PQGFVVDGEENKVYKLKKALYGLKQAPRAWYTQIDSYFIENGFIRSKSEPTLYVKSKDNS 1095

Query: 363  DFLVVCLYVDDIIYTSTNLRMADEFKNAMMHNFEMTDLGRMKYFLGIQVKQSKGEIFICQ 542
              L+V LYVDD+I+T  + +M ++F+N MM  +EM+D+G + YFLGI+V Q +  +FICQ
Sbjct: 1096 QILIVALYVDDLIFTGNDEKMVEKFRNEMMKKYEMSDMGLLHYFLGIEVYQEEDGVFICQ 1155

Query: 543  EKYTEDLLKKFHMESCKPMPTPLALNEKLQAIDGAQNANPDVYRSLVGSLIYLTNTRPDI 722
            ++Y E +LKKF M  C  + TPL +NEKL+  DG +  +   +RSLVG+L+YLT TRPDI
Sbjct: 1156 KRYVEHILKKFGMAGCNXVSTPLVVNEKLRKEDGGKMVDETHFRSLVGNLLYLTATRPDI 1215

Query: 723  LHAVSYISRFMNNPSKLHFNAAKRILRYLQGTKKLGIKYAKEENNKLIGYTDSDWAGALD 902
            + A S +SRFM+ PS LH  AAKR+LRYLQGT +LGIKY +    KLIG+ DSDW G +D
Sbjct: 1216 MFAASLLSRFMHYPSHLHLGAAKRVLRYLQGTVELGIKYFRNIEVKLIGHCDSDWGGCID 1275

Query: 903  DRKSTSGYLFCLGTKVISWSSKKQNTVXXXXXXXXXXXXXXXXXXXVWLRRLLSDLKQKQ 1082
            D KSTSGY F LG+ VISW SKKQ +V                   +WLRR+L D+K+KQ
Sbjct: 1276 DMKSTSGYAFSLGSGVISWVSKKQGSVAQSSAEAEYISASLATSQAIWLRRILEDIKEKQ 1335

Query: 1083 EGPTEIFCDNMSAIAMTKNPVFHGRTKHIELRHHFIRDLVNRKEVQLNYINTTNQPADVL 1262
               T + CDN SAIA+ KN VFH RT+HI +++HFI+++++  EVQL Y  +  Q AD+ 
Sbjct: 1336 NEATYLLCDNKSAIAIAKNXVFHSRTRHIAVKYHFIKEVISDGEVQLMYCKSEEQXADIF 1395

Query: 1263 TKAVSKDKLVSFKDHLKI 1316
            TKA+  +KLV F+  L +
Sbjct: 1396 TKALPLEKLVHFRKLLGV 1413


>gb|ABA95820.1| retrotransposon protein, putative, unclassified [Oryza sativa
            Japonica Group]
          Length = 1142

 Score =  498 bits (1283), Expect = e-138
 Identities = 242/430 (56%), Positives = 320/430 (74%)
 Frame = +3

Query: 3    KGYSQQPGVDFNETFAPVVRMETIRAVLAIAAQLELEVFQLDVKSAFLNGHLEEEVYVEQ 182
            KG+ Q+PG+D+ ET+APV R+ETIR ++A+AAQ   +++QLDVKSAFLNG+L+EE+YVEQ
Sbjct: 703  KGFKQKPGIDYYETYAPVARLETIRTIIALAAQKRWKIYQLDVKSAFLNGYLDEEIYVEQ 762

Query: 183  PQGFIVKGQENKVYRLKKALYGLKQAPRAWNMKIDCYLQQNGFSRSQNEPSLYVKTWDNS 362
            P+GF V+G ENKV+RLKKALYGLKQAPR W  +ID Y  Q GF++S +EP+LYV     +
Sbjct: 763  PEGFSVQGGENKVFRLKKALYGLKQAPRVWYSQIDKYFIQKGFAKSISEPTLYVNK-TGT 821

Query: 363  DFLVVCLYVDDIIYTSTNLRMADEFKNAMMHNFEMTDLGRMKYFLGIQVKQSKGEIFICQ 542
            D L+V LYVDD+IYT  + +M  +FK  MMH +EM+DLG + YFLG++V QS   IFI Q
Sbjct: 822  DILIVSLYVDDLIYTGNSEKMMQDFKKDMMHTYEMSDLGLLYYFLGMEVHQSDEGIFISQ 881

Query: 543  EKYTEDLLKKFHMESCKPMPTPLALNEKLQAIDGAQNANPDVYRSLVGSLIYLTNTRPDI 722
             KY E++LKKF M++CK + TPL  NEK +A DGA   +P +YRSLVGSL+YLT TRPDI
Sbjct: 882  RKYAENILKKFKMDNCKSVTTPLLPNEKQKARDGADKVDPTIYRSLVGSLLYLTATRPDI 941

Query: 723  LHAVSYISRFMNNPSKLHFNAAKRILRYLQGTKKLGIKYAKEENNKLIGYTDSDWAGALD 902
            + A S +SR+M++PS+L+F AAKR+LRY++GT   GI Y   + +KLIGYTDSDWAG LD
Sbjct: 942  MFAASLLSRYMSSPSQLNFTAAKRVLRYIKGTADYGIWYKPVKESKLIGYTDSDWAGCLD 1001

Query: 903  DRKSTSGYLFCLGTKVISWSSKKQNTVXXXXXXXXXXXXXXXXXXXVWLRRLLSDLKQKQ 1082
            D K TSGY F LG+ + SWS+KKQN V                   VWLRR++ DL +KQ
Sbjct: 1002 DMKGTSGYAFSLGSGMCSWSTKKQNIVALSSAEAEYVAASKAVSQVVWLRRIMEDLGEKQ 1061

Query: 1083 EGPTEIFCDNMSAIAMTKNPVFHGRTKHIELRHHFIRDLVNRKEVQLNYINTTNQPADVL 1262
              PT I+CD+ SAIA+++NPV H RTKHI +++H+IR+ V+R+EV+L +  T  Q AD+ 
Sbjct: 1062 YQPTTIYCDSKSAIAISENPVSHDRTKHIAIKYHYIREAVDRQEVKLEFCRTDEQLADIF 1121

Query: 1263 TKAVSKDKLV 1292
            TKA+SK+K V
Sbjct: 1122 TKALSKEKFV 1131


>gb|AAT47077.1| putative polyprotein [Oryza sativa Japonica Group]
          Length = 1136

 Score =  493 bits (1268), Expect = e-137
 Identities = 241/430 (56%), Positives = 319/430 (74%)
 Frame = +3

Query: 3    KGYSQQPGVDFNETFAPVVRMETIRAVLAIAAQLELEVFQLDVKSAFLNGHLEEEVYVEQ 182
            KG+ Q+PG+D+ ET+A V R+ETI  ++A+AAQ   +++QLDVKSAFLNG+L+EE+YVEQ
Sbjct: 697  KGFKQKPGIDYYETYAHVARLETIHTIIALAAQKRWKIYQLDVKSAFLNGYLDEEIYVEQ 756

Query: 183  PQGFIVKGQENKVYRLKKALYGLKQAPRAWNMKIDCYLQQNGFSRSQNEPSLYVKTWDNS 362
            P+ F V+G ENKV+RLKKALYGLKQAPRAW  +ID Y  Q GF++S +EP LYVK    +
Sbjct: 757  PERFSVQGGENKVFRLKKALYGLKQAPRAWYSQIDKYFIQKGFAKSISEPILYVKK-TGT 815

Query: 363  DFLVVCLYVDDIIYTSTNLRMADEFKNAMMHNFEMTDLGRMKYFLGIQVKQSKGEIFICQ 542
            D L+V LYVDD+IYT  + ++  +FK  MMH +EM+DLG + YFLG++V QS   IFI Q
Sbjct: 816  DILIVSLYVDDLIYTGNSEKLMQDFKKDMMHTYEMSDLGLLHYFLGMEVHQSDEGIFISQ 875

Query: 543  EKYTEDLLKKFHMESCKPMPTPLALNEKLQAIDGAQNANPDVYRSLVGSLIYLTNTRPDI 722
             KY E++LKKF M++CK + TPL  NEK +A DGA  A+P +YRSLVGSL+YLT TRPDI
Sbjct: 876  RKYAENILKKFKMDNCKSVTTPLLPNEKQKARDGADKADPTIYRSLVGSLLYLTATRPDI 935

Query: 723  LHAVSYISRFMNNPSKLHFNAAKRILRYLQGTKKLGIKYAKEENNKLIGYTDSDWAGALD 902
            + A S +SR+M++PS+L+F AAKR+LRY++GT   GI Y   + +KLIGYTDSDWAG LD
Sbjct: 936  MFAASLLSRYMSSPSQLNFTAAKRVLRYIKGTAYYGIWYKPVKESKLIGYTDSDWAGCLD 995

Query: 903  DRKSTSGYLFCLGTKVISWSSKKQNTVXXXXXXXXXXXXXXXXXXXVWLRRLLSDLKQKQ 1082
            D KSTSGY F LG+ + SWS+KKQN V                   VWLRR++ DL +KQ
Sbjct: 996  DMKSTSGYAFSLGSGMWSWSTKKQNIVALSSAEAEYVAASKAVSQVVWLRRIMEDLGEKQ 1055

Query: 1083 EGPTEIFCDNMSAIAMTKNPVFHGRTKHIELRHHFIRDLVNRKEVQLNYINTTNQPADVL 1262
              PT I+CD+ SAIA+ +NPV H RTKHI +++H+IR+ V+R+EV+L +  T  Q AD+ 
Sbjct: 1056 YQPTTIYCDSKSAIAINENPVSHDRTKHIAIKYHYIREAVDRQEVKLEFCRTDEQLADIF 1115

Query: 1263 TKAVSKDKLV 1292
            TKA+SK+K +
Sbjct: 1116 TKALSKEKFI 1125


>gb|ABW74566.1| integrase [Boechera divaricarpa]
          Length = 1165

 Score =  488 bits (1257), Expect = e-135
 Identities = 240/444 (54%), Positives = 310/444 (69%)
 Frame = +3

Query: 3    KGYSQQPGVDFNETFAPVVRMETIRAVLAIAAQLELEVFQLDVKSAFLNGHLEEEVYVEQ 182
            KGY+Q+ GVD+ +TF+PV R +T+R +LA+ A +   ++Q DVKSAFLNG L EEVYV+Q
Sbjct: 717  KGYAQEYGVDYEKTFSPVARFDTLRTLLALGAYMHWPIYQFDVKSAFLNGELREEVYVDQ 776

Query: 183  PQGFIVKGQENKVYRLKKALYGLKQAPRAWNMKIDCYLQQNGFSRSQNEPSLYVKTWDNS 362
            P+GFIV+G+E  VYRL KALYGLKQAPRAW  KID Y  + GF RS++EP+LY+K     
Sbjct: 777  PEGFIVEGREGFVYRLYKALYGLKQAPRAWYNKIDSYFAETGFERSKSEPTLYIKKQGAG 836

Query: 363  DFLVVCLYVDDIIYTSTNLRMADEFKNAMMHNFEMTDLGRMKYFLGIQVKQSKGEIFICQ 542
            D LVVCLYVDD+IY  ++  +  EFK +MM  FEMTDLG + +FLG++VKQ +  +F+ Q
Sbjct: 837  DILVVCLYVDDMIYMGSSASLVSEFKASMMEKFEMTDLGLLYFFLGLEVKQVEDGVFVSQ 896

Query: 543  EKYTEDLLKKFHMESCKPMPTPLALNEKLQAIDGAQNANPDVYRSLVGSLIYLTNTRPDI 722
             KY  DLLK+F M  C  + TP+ +NEKL A DG + A+   +RSLVG LIYLT+TRPDI
Sbjct: 897  HKYACDLLKRFDMAGCNAVETPMNVNEKLLAGDGTEKADATKFRSLVGGLIYLTHTRPDI 956

Query: 723  LHAVSYISRFMNNPSKLHFNAAKRILRYLQGTKKLGIKYAKEENNKLIGYTDSDWAGALD 902
              AVS ISRFM+ P+K HF AAKR+LRY+  T + G+ Y      KL+G+TDSDWAG + 
Sbjct: 957  CFAVSAISRFMHGPTKQHFGAAKRLLRYIARTAEYGLWYCSVSKFKLVGFTDSDWAGCVQ 1016

Query: 903  DRKSTSGYLFCLGTKVISWSSKKQNTVXXXXXXXXXXXXXXXXXXXVWLRRLLSDLKQKQ 1082
            DRKSTSG++F LG+  + WSSKKQN                     VWLRR+L+D+KQ+Q
Sbjct: 1017 DRKSTSGHVFNLGSGAVCWSSKKQNVTALSSSEAEYTAATAAACQAVWLRRILADIKQEQ 1076

Query: 1083 EGPTEIFCDNMSAIAMTKNPVFHGRTKHIELRHHFIRDLVNRKEVQLNYINTTNQPADVL 1262
            E  T IFCDN + IAM KNP +HGRTKHI ++ HFIRDLV+   V L Y +T  Q ADVL
Sbjct: 1077 EKATTIFCDNKATIAMNKNPAYHGRTKHISIKVHFIRDLVSEGSVTLEYCSTNEQSADVL 1136

Query: 1263 TKAVSKDKLVSFKDHLKITN*EGV 1334
            TKA+S++K   F+  L +   E +
Sbjct: 1137 TKALSRNKFDYFRSKLGVCKFESM 1160


Top