BLASTX nr result

ID: Dioscorea21_contig00018270

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00018270
         (1682 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ABW74566.1| integrase [Boechera divaricarpa]                       524   e-146
gb|AAT38758.1| Putative gag-pol polyprotein, identical [Solanum ...   498   e-138
emb|CAN71759.1| hypothetical protein VITISV_020777 [Vitis vinifera]   489   e-136
emb|CAN74767.1| hypothetical protein VITISV_041860 [Vitis vinifera]   478   e-132
emb|CAN68235.1| hypothetical protein VITISV_037104 [Vitis vinifera]   476   e-131

>gb|ABW74566.1| integrase [Boechera divaricarpa]
          Length = 1165

 Score =  524 bits (1350), Expect = e-146
 Identities = 270/512 (52%), Positives = 342/512 (66%)
 Frame = +1

Query: 145  GTSEGLQATQIREFNIEDSPGQKVRSLRDIYESCTFALSVTDPSTYEEAIKSLHWQKAME 324
            G+S+G  ++ I       SP QK RSLR+IYE    A    DP T  EA     W+KAME
Sbjct: 616  GSSDGEGSSSI-------SPPQKFRSLREIYEE-QHAFFSADPVTVNEAATKEEWRKAME 667

Query: 325  VEMDSIQKNGTWRLTDLPLERKVIGVKWVYKTKYNPDGAIDKYKARLVAKGYVQEHGVDY 504
             E+ SI+KN TW+L +LP E+  IGVKWV+KTKY  D  I KYKARLV KGY QE+GVDY
Sbjct: 668  EEIASIEKNQTWQLVELPEEKHSIGVKWVFKTKYQADDNIQKYKARLVVKGYAQEYGVDY 727

Query: 505  EEVFSPVARLETVRIFLAIAAYRHWPVYQLDVKSAFLNGEIEEEVYVAQPRGFEIPGKEK 684
            E+ FSPVAR +T+R  LA+ AY HWP+YQ DVKSAFLNGE+ EEVYV QP GF + G+E 
Sbjct: 728  EKTFSPVARFDTLRTLLALGAYMHWPIYQFDVKSAFLNGELREEVYVDQPEGFIVEGREG 787

Query: 685  MVYKLSKALYGLKQAPRAWYEKLDSWFKLQNFQRSQIEHTLYKKITQNGDLIVVCVYVDD 864
             VY+L KALYGLKQAPRAWY K+DS+F    F+RS+ E TLY K    GD++VVC+YVDD
Sbjct: 788  FVYRLYKALYGLKQAPRAWYNKIDSYFAETGFERSKSEPTLYIKKQGAGDILVVCLYVDD 847

Query: 865  LIYMGSSLKIVRKFKEDMENVFEMNDLGLMKYFLGFEIKQDEHGIHLSQRKYAEDLLKLY 1044
            +IYMGSS  +V +FK  M   FEM DLGL+ +FLG E+KQ E G+ +SQ KYA DLLK +
Sbjct: 848  MIYMGSSASLVSEFKASMMEKFEMTDLGLLYFFLGLEVKQVEDGVFVSQHKYACDLLKRF 907

Query: 1045 NMQGCKAVSTPMSYSTKQQLFEQSEEANATIYRCLIGKLLYLSHSRPDLMFAVSLLSRFM 1224
            +M GC AV TPM+ + K    + +E+A+AT +R L+G L+YL+H+RPD+ FAVS +SRFM
Sbjct: 908  DMAGCNAVETPMNVNEKLLAGDGTEKADATKFRSLVGGLIYLTHTRPDICFAVSAISRFM 967

Query: 1225 ASPTRIQFAAARNVLRYVSGTLNYGIQYSGSAEFALEGYADSDWCGDVRDRKSTSXXXXX 1404
              PT+  F AA+ +LRY++ T  YG+ Y   ++F L G+ DSDW G V+DRKSTS     
Sbjct: 968  HGPTKQHFGAAKRLLRYIARTAEYGLWYCSVSKFKLVGFTDSDWAGCVQDRKSTS----- 1022

Query: 1405 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFNFGSGAVCWASKKQDVVALSTTEAEYISL 1584
                                          FN GSGAVCW+SKKQ+V ALS++EAEY + 
Sbjct: 1023 ---------------------------GHVFNLGSGAVCWSSKKQNVTALSSSEAEYTAA 1055

Query: 1585 CAACCHGVWMKRIVADFGIQCENPIPIWCDNK 1680
             AA C  VW++RI+AD   + E    I+CDNK
Sbjct: 1056 TAAACQAVWLRRILADIKQEQEKATTIFCDNK 1087


>gb|AAT38758.1| Putative gag-pol polyprotein, identical [Solanum demissum]
          Length = 1333

 Score =  498 bits (1282), Expect = e-138
 Identities = 254/515 (49%), Positives = 342/515 (66%)
 Frame = +1

Query: 136  PSEGTSEGLQATQIREFNIEDSPGQKVRSLRDIYESCTFALSVTDPSTYEEAIKSLHWQK 315
            P E + E +    +R    E  P  K  +   +  SC FAL V+DP  YEEA++   W+ 
Sbjct: 779  PDESSVEPIP---LRRSTREKKPNPKYSNT--VNTSCQFALLVSDPICYEEAVEQSEWKN 833

Query: 316  AMEVEMDSIQKNGTWRLTDLPLERKVIGVKWVYKTKYNPDGAIDKYKARLVAKGYVQEHG 495
            AM  E+ +I++N TW L D P  + VIG+KWV++TKYN DG+I K+KARLVAKGY Q+ G
Sbjct: 834  AMIEEIQAIERNSTWELVDAPEGKNVIGLKWVFRTKYNADGSIQKHKARLVAKGYSQQQG 893

Query: 496  VDYEEVFSPVARLETVRIFLAIAAYRHWPVYQLDVKSAFLNGEIEEEVYVAQPRGFEIPG 675
            VD++E FSPVAR ETVR+ LA+AA  H PVYQ DVKSAFLNG++EEEVYV+QP+GF I G
Sbjct: 894  VDFDETFSPVARFETVRVVLALAAQLHLPVYQFDVKSAFLNGDLEEEVYVSQPQGFMITG 953

Query: 676  KEKMVYKLSKALYGLKQAPRAWYEKLDSWFKLQNFQRSQIEHTLYKKITQNGDLIVVCVY 855
             E  VYKL KALYGLKQAPRAWY K+DS+F+   F+RS  E TLY K     + ++VC+Y
Sbjct: 954  NENKVYKLRKALYGLKQAPRAWYSKIDSFFQGSGFRRSDNEPTLYLKKQGTDEFLLVCLY 1013

Query: 856  VDDLIYMGSSLKIVRKFKEDMENVFEMNDLGLMKYFLGFEIKQDEHGIHLSQRKYAEDLL 1035
            VDD+IY+GSS  +V  FK +M   FEM+DLGL+KYFLG E+ QD+ GI +SQ+KYAEDLL
Sbjct: 1014 VDDMIYIGSSKSLVNDFKSNMMRNFEMSDLGLLKYFLGLEVIQDKDGIFISQKKYAEDLL 1073

Query: 1036 KLYNMQGCKAVSTPMSYSTKQQLFEQSEEANATIYRCLIGKLLYLSHSRPDLMFAVSLLS 1215
            K + M  C+  +TPM+ + K Q  + +E+AN  ++R L+G L YL+H+RPD+ F+VS++S
Sbjct: 1074 KKFQMMNCEVATTPMNINEKLQRADGTEKANPKLFRSLVGGLNYLTHTRPDIAFSVSVVS 1133

Query: 1216 RFMASPTRIQFAAARNVLRYVSGTLNYGIQYSGSAEFALEGYADSDWCGDVRDRKSTSXX 1395
            RF+ SPT+  F AA+ VLRYV+GT ++GI YS +  F L G+ DSD+ G + DRKSTS  
Sbjct: 1134 RFLQSPTKQHFGAAKRVLRYVAGTTDFGIWYSKAPNFRLVGFTDSDYAGCLDDRKSTS-- 1191

Query: 1396 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFNFGSGAVCWASKKQDVVALSTTEAEY 1575
                                             F+FGSG V W+SKKQ+ VALST+EAEY
Sbjct: 1192 ------------------------------GSCFSFGSGVVTWSSKKQETVALSTSEAEY 1221

Query: 1576 ISLCAACCHGVWMKRIVADFGIQCENPIPIWCDNK 1680
             +   A    +W+++++ DF  + +    I+ D+K
Sbjct: 1222 TAASLAARQALWLRKLLEDFSYEQKESTEIFSDSK 1256


>emb|CAN71759.1| hypothetical protein VITISV_020777 [Vitis vinifera]
          Length = 1472

 Score =  489 bits (1260), Expect = e-136
 Identities = 240/475 (50%), Positives = 321/475 (67%)
 Frame = +1

Query: 256  LSVTDPSTYEEAIKSLHWQKAMEVEMDSIQKNGTWRLTDLPLERKVIGVKWVYKTKYNPD 435
            +  TDP+T+EEA++   W  AM+ E+ +I+KN TW L +LP ++ VIGVKWV++TKY  D
Sbjct: 743  IPATDPTTFEEAVEKEEWCSAMKEEIAAIEKNETWELVELPEDKNVIGVKWVFRTKYLAD 802

Query: 436  GAIDKYKARLVAKGYVQEHGVDYEEVFSPVARLETVRIFLAIAAYRHWPVYQLDVKSAFL 615
            G+I K+KARLVAKGY Q+HGVDY++ FSPVAR ETVR  LA+AA+ HW  YQ DVKSAFL
Sbjct: 803  GSIQKHKARLVAKGYAQQHGVDYDDTFSPVARFETVRTLLALAAHMHWCXYQFDVKSAFL 862

Query: 616  NGEIEEEVYVAQPRGFEIPGKEKMVYKLSKALYGLKQAPRAWYEKLDSWFKLQNFQRSQI 795
            NGE+ EEVYV+Q  GF +P KE+ VY+L KALYGLKQAPRAWY K+DS+F    F+RS+ 
Sbjct: 863  NGELVEEVYVSQXEGFIVPXKEEHVYRLKKALYGLKQAPRAWYSKIDSYFVENGFERSKS 922

Query: 796  EHTLYKKITQNGDLIVVCVYVDDLIYMGSSLKIVRKFKEDMENVFEMNDLGLMKYFLGFE 975
            E  LY K     DL+++C+YVDD+IYMGSS  ++ +FK  M+  FEM++LGL+ +FL  E
Sbjct: 923  EPNLYLKRQGKNDLLIICLYVDDMIYMGSSSSLINEFKACMKKKFEMSBLGLLHFFLXLE 982

Query: 976  IKQDEHGIHLSQRKYAEDLLKLYNMQGCKAVSTPMSYSTKQQLFEQSEEANATIYRCLIG 1155
            +KQ E G+ +SQRKY  DLLK +NM  CK V+T M+ + K Q  + +E A+A  +  L+ 
Sbjct: 983  VKQVEDGVFVSQRKYXVDLLKKFNMLNCKVVATXMNSNEKLQAEDGTERADARRFXSLVR 1042

Query: 1156 KLLYLSHSRPDLMFAVSLLSRFMASPTRIQFAAARNVLRYVSGTLNYGIQYSGSAEFALE 1335
             L+YL+H+RPD+ F V ++SRFM  P++    AA+ +LRY+ GT ++GI Y    EF L 
Sbjct: 1043 GLIYLTHTRPDIAFPVEVISRFMHCPSKQHLGAAKRLLRYIVGTYDFGIWYGHVQEFKLV 1102

Query: 1336 GYADSDWCGDVRDRKSTSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFNFGSGA 1515
            GY DSDW G + DRKSTS                                   F+ GSGA
Sbjct: 1103 GYTDSDWAGCLEDRKSTS--------------------------------GYMFSLGSGA 1130

Query: 1516 VCWASKKQDVVALSTTEAEYISLCAACCHGVWMKRIVADFGIQCENPIPIWCDNK 1680
            VCW+SKKQ V ALS++EAEY +  ++ C  VW++RI+AD   + E P  I+CDNK
Sbjct: 1131 VCWSSKKQAVTALSSSEAEYTAATSSACQAVWLRRILADINQEHEEPTVIYCDNK 1185


>emb|CAN74767.1| hypothetical protein VITISV_041860 [Vitis vinifera]
          Length = 1945

 Score =  478 bits (1229), Expect = e-132
 Identities = 240/495 (48%), Positives = 326/495 (65%)
 Frame = +1

Query: 196  DSPGQKVRSLRDIYESCTFALSVTDPSTYEEAIKSLHWQKAMEVEMDSIQKNGTWRLTDL 375
            D+P  K+R L D+YE C   L   +P+ Y EA + L W +AM+ E+D+I++NGTW+LT+L
Sbjct: 860  DTPVLKMRPLSDVYERCN--LVHAEPTCYTEAARFLEWIEAMKAEIDAIERNGTWKLTEL 917

Query: 376  PLERKVIGVKWVYKTKYNPDGAIDKYKARLVAKGYVQEHGVDYEEVFSPVARLETVRIFL 555
            P  +  IGVKWV++TK+N DG+I ++KARLV KG+ Q  GVDY + F+PVAR +T+R+ L
Sbjct: 918  PEAKNAIGVKWVFRTKFNSDGSIFRHKARLVVKGFAQVAGVDYGDTFAPVARHDTIRLLL 977

Query: 556  AIAAYRHWPVYQLDVKSAFLNGEIEEEVYVAQPRGFEIPGKEKMVYKLSKALYGLKQAPR 735
            A+A    W VY LDVKSAFLNG + EE+YV QP GFE+ G E  VYKL KALYGLKQAPR
Sbjct: 978  ALAGQMGWKVYHLDVKSAFLNGILLEEIYVQQPEGFEVIGHEHKVYKLHKALYGLKQAPR 1037

Query: 736  AWYEKLDSWFKLQNFQRSQIEHTLYKKITQNGDLIVVCVYVDDLIYMGSSLKIVRKFKED 915
            AWY ++DS      F+RS+ E TLY K   +G  +VV +YVDD++  GS++K++  FK +
Sbjct: 1038 AWYSRIDSHLIQLGFRRSENEATLYLKQNDDGLQLVVSLYVDDMLVTGSNVKLLADFKME 1097

Query: 916  MENVFEMNDLGLMKYFLGFEIKQDEHGIHLSQRKYAEDLLKLYNMQGCKAVSTPMSYSTK 1095
            M++VFEM+DLG+M YFLG EI Q   GI +SQRKYA D+LK + ++ CK V+TP++ + K
Sbjct: 1098 MQDVFEMSDLGIMNYFLGMEIYQCSWGIFISQRKYAMDILKKFKLESCKEVATPLAQNEK 1157

Query: 1096 QQLFEQSEEANATIYRCLIGKLLYLSHSRPDLMFAVSLLSRFMASPTRIQFAAARNVLRY 1275
                +  +    + YR L+G LLYL+ +RPDLMF  SLLSRF++SP+ +    ++ VL+Y
Sbjct: 1158 ISKNDGEKLEEPSAYRSLVGSLLYLTVTRPDLMFPTSLLSRFLSSPSNVHMGVSKRVLKY 1217

Query: 1276 VSGTLNYGIQYSGSAEFALEGYADSDWCGDVRDRKSTSXXXXXXXXXXXXXXXXXXXXXX 1455
            V GT N GI Y  +    L+GYADSDW G V D KSTS                      
Sbjct: 1218 VKGTTNLGIWYLKTVGVKLDGYADSDWAGSVDDMKSTS---------------------- 1255

Query: 1456 XXXXXXXXXXXXXFNFGSGAVCWASKKQDVVALSTTEAEYISLCAACCHGVWMKRIVADF 1635
                         F  GSG +CW S+KQ+VVA STTEAEYISL AA    +W+++++AD 
Sbjct: 1256 ----------SYVFTIGSGVICWNSRKQEVVAQSTTEAEYISLAAAANQAIWLRKLLADL 1305

Query: 1636 GIQCENPIPIWCDNK 1680
            G +  +P  ++CDNK
Sbjct: 1306 GQEQTSPTELYCDNK 1320


>emb|CAN68235.1| hypothetical protein VITISV_037104 [Vitis vinifera]
          Length = 2041

 Score =  476 bits (1224), Expect = e-131
 Identities = 240/495 (48%), Positives = 325/495 (65%)
 Frame = +1

Query: 196  DSPGQKVRSLRDIYESCTFALSVTDPSTYEEAIKSLHWQKAMEVEMDSIQKNGTWRLTDL 375
            D+P  K+R L D+YE C   L   +P+ Y EA + L W +AM+ E+D+I++NGTW+LT+L
Sbjct: 1505 DTPVLKMRPLFDVYERCN--LVHAEPTCYTEAARFLEWIEAMKAEIDAIERNGTWKLTEL 1562

Query: 376  PLERKVIGVKWVYKTKYNPDGAIDKYKARLVAKGYVQEHGVDYEEVFSPVARLETVRIFL 555
            P  +  IGVKWV++TK+N DG+I ++KARLV KG+ Q  GVDY + F+PVAR +T+R+ L
Sbjct: 1563 PEAKNAIGVKWVFRTKFNSDGSIFRHKARLVVKGFAQVAGVDYGDTFAPVARHDTIRLLL 1622

Query: 556  AIAAYRHWPVYQLDVKSAFLNGEIEEEVYVAQPRGFEIPGKEKMVYKLSKALYGLKQAPR 735
            A+A    W VY LDVKSAFLNG + EE+YV QP GFE+ G E  VYKL KALYGLKQAPR
Sbjct: 1623 ALAGQMGWKVYHLDVKSAFLNGILLEEIYVQQPEGFEVIGHEHKVYKLHKALYGLKQAPR 1682

Query: 736  AWYEKLDSWFKLQNFQRSQIEHTLYKKITQNGDLIVVCVYVDDLIYMGSSLKIVRKFKED 915
            AWY ++DS      F+RS+ E TLY K   +G  +VV +YVDD++  GS++K++  FK +
Sbjct: 1683 AWYSRIDSHLIQLGFRRSENEATLYLKQNDDGLQLVVSLYVDDMLVTGSNVKLLADFKME 1742

Query: 916  MENVFEMNDLGLMKYFLGFEIKQDEHGIHLSQRKYAEDLLKLYNMQGCKAVSTPMSYSTK 1095
            M++VFEM DLG+M YFLG EI Q   GI +SQRKYA D+LK + ++ CK V+TP++ + K
Sbjct: 1743 MQDVFEMFDLGIMNYFLGMEIYQCSWGIFISQRKYAMDILKKFKLESCKEVATPLAQNEK 1802

Query: 1096 QQLFEQSEEANATIYRCLIGKLLYLSHSRPDLMFAVSLLSRFMASPTRIQFAAARNVLRY 1275
                +  +    + YR L+G LLYL+ ++PDLMF  SLLSRFM+SP+ +    A+ VL+Y
Sbjct: 1803 ISKNDGEKLEEPSAYRSLVGSLLYLTVTKPDLMFPASLLSRFMSSPSNVHMGVAKRVLKY 1862

Query: 1276 VSGTLNYGIQYSGSAEFALEGYADSDWCGDVRDRKSTSXXXXXXXXXXXXXXXXXXXXXX 1455
            + GT N GI Y  +    L+GYADSDW G V D KSTS                      
Sbjct: 1863 LKGTTNLGIWYLKTGGVKLDGYADSDWAGSVDDMKSTS---------------------- 1900

Query: 1456 XXXXXXXXXXXXXFNFGSGAVCWASKKQDVVALSTTEAEYISLCAACCHGVWMKRIVADF 1635
                         F  GSG +CW S+KQ+VVA STTEAEYISL AA    +W+++++AD 
Sbjct: 1901 ----------GYAFTIGSGVICWNSRKQEVVAQSTTEAEYISLAAAANQAIWLRKLLADL 1950

Query: 1636 GIQCENPIPIWCDNK 1680
            G +  +P  ++CDNK
Sbjct: 1951 GQEQSSPTELYCDNK 1965

