BLASTX nr result

ID: Cephaelis21_contig00019471 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00019471
         (1807 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ABI34321.1| RNase H family protein [Solanum demissum]              160   8e-37
emb|CCA65995.1| hypothetical protein [Beta vulgaris subsp. vulga...   149   2e-33
dbj|BAE79382.1| unnamed protein product [Ipomoea batatas]             134   6e-29
emb|CCA66009.1| hypothetical protein [Beta vulgaris subsp. vulga...   132   3e-28
gb|AAT38805.1| hypothetical protein SDM1_47t00008 [Solanum demis...   122   4e-25

>gb|ABI34321.1| RNase H family protein [Solanum demissum]
          Length = 945

 Score =  160 bits (406), Expect = 8e-37
 Identities = 141/530 (26%), Positives = 234/530 (44%), Gaps = 15/530 (2%)
 Frame = +2

Query: 164  WRAWDRLAYPISENGLGLRSLSSILDAFSCKLWWKLHKGIGVWSKYVTGISWQHSSFHSR 343
            WR    L   + ++G+GLR+LS I D+ + K   ++H             S + S   S+
Sbjct: 400  WRLQRELLQDLKKSGIGLRNLSPISDSVAYK---RVHP-------VAKAKSSKQSHTWSK 449

Query: 344  LQAVKAFADNHCHTRVGDGSSSF*TSNWFGIGPL-NRLVPHSLFPNISLHEAYGENGWHP 520
            +  ++   +N+    +  G+ S    NW G G L N L P S +   ++ +   +  W  
Sbjct: 450  MLKIRHSVENNILWIIYAGNVSMWWDNWMGNGALSNILPPPSHYNKDNVKDFIHKREWDF 509

Query: 521  QVLAEL--PQADLQVVQPATFHFSDHPDLLIWDLTDSGEFTTKSVFSGVRRRQPEVQFHR 694
              L+++  PQ   Q+V        +  D  IW  +++G FTTKS +      + +     
Sbjct: 510  DKLSDILPPQVVNQIVS-IPIGDPNQSDYAIWIPSENGHFTTKSAYVDCSNTREKNDMRN 568

Query: 695  WVWRTGVPIKISFFLWRILNGLLPFDDVLARLGFSLASKCPFCSS--ADSISHGFYEWGL 868
             +W    P K+SF  WR++   LPF D + +   ++ S C  C +   ++I+H F    +
Sbjct: 569  KIWHGKFPFKMSFLTWRLVQNKLPFYDTVGKFVDNIDSNCVCCKNMKTETINHVFLNSDV 628

Query: 869  ARATWSFFGALLGVHFVERDLKGFLYSLWN---HRDPNSQLLCLLPAVVCWSLWSARNTY 1039
            A   W  FG  LG+          L + WN   H   ++ ++  LP ++ W +W  R   
Sbjct: 629  ASYLWKKFGGTLGIDTRASSTINLLKTWWNVQTHNSIHNVIIHTLPILIFWEIWKRRCAC 688

Query: 1040 LFDGKQ-----TSPNMVIASVSQLLKELTSLKPPRVSGELPADLLWSNVTLMRRYRTPIA 1204
             +  ++     T  N V  ++   L+      P    G    DLL + V  +R Y     
Sbjct: 689  KYGDQKKMWYRTMENHVWWNLKMSLRMTF---PSFEIGNSWRDLL-NKVESLRPYPKWKI 744

Query: 1205 VH*NRPFARW-KLNVDGSSRGNPGHAGAGIIVRDITGTVVLTEAIYLGQLTSLFVESLAL 1381
            VH N P     K+N DGS   + G+AG G IVRD T  +++  +I     ++   E+LA 
Sbjct: 745  VHWNTPNINCVKINTDGSF--SSGNAGLGWIVRDHTRRMIMAFSIPSSCSSNNLAEALAA 802

Query: 1382 LHGLKMCTERRLFPLEVETDSLVLFRMLRAGSSWPWRIHSVLSSIHPLLGLDAISFAHIY 1561
              G+  C ++      +E DS ++  M+R G +   +I  V+  I  ++        H Y
Sbjct: 803  RFGILWCLQQGFHNCYLELDSKLVVDMVRNGQATNLKIKGVVEDIIQVVAKMNCEVNHCY 862

Query: 1562 REANTVADYLATHA-XXXXXXXXXXXXXLPRKLVGLVLLDQLGCPNLRMR 1708
            REAN VAD LA HA              +P+  VG   LD++  P++R+R
Sbjct: 863  REANQVADALAKHAVISNEAHMYHDWRDIPKLAVGSYQLDKMQMPSIRIR 912


>emb|CCA65995.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1389

 Score =  149 bits (377), Expect = 2e-33
 Identities = 149/581 (25%), Positives = 239/581 (41%), Gaps = 50/581 (8%)
 Frame = +2

Query: 2    LIAGGRIILVRHVLQALPVHTLAAMNPPKSLLRQMEGLFAKFIWGSHGEQARRVWRAWDR 181
            L   GR +L++  L +     + +   PK +L  ++  +  F W          +  W++
Sbjct: 800  LSQAGRTVLIQSNLASKASFQMQSFTLPKKVLTTLDTTYRNFFWNKDPAAKSANFIGWNK 859

Query: 182  LAYPISENGLGLRSLSSILDAFSCKLWWKLH-KGIGVWSKYVT----------------G 310
            +  P S  G+G R       A   KL WK+      +W K VT                 
Sbjct: 860  ICQPKSVGGVGFRKAEVTNIALQMKLLWKIMVSKDNIWVKLVTQKYLKEQNLLVCKIPSN 919

Query: 311  ISWQHSSFHSRLQAVKAFADNHCHTRVGDGSS-SF*TSNWFGIGPLN-RLVPHSLFPNIS 484
             SWQ  +    L   + F        +GDG   SF T NW    PLN + VP     NI 
Sbjct: 920  ASWQWKN----LLRHRNFFSKGLRWLIGDGQDISFWTDNWIFQYPLNSKYVPTVGSENIK 975

Query: 485  LHEAY-GENGWH-PQVLAELPQADLQVVQPATFHFSDHPDLLIWDLTDSGEFTTKSVFSG 658
            + E + G  GW  P++L  +P   ++ +       S   D L+W LT +G+++ KS  S 
Sbjct: 976  VAECFNGLGGWDIPKLLTLVPPNIVKAISSVFIPSSSQQDRLLWGLTPTGQYSVKSGASL 1035

Query: 659  VRRRQ----PEVQFHRWVWRTGVPIKISFFLWRILNGLLPFDDVLARLGFSLASKCPFCS 826
            +R        +V+F+ W+W    P KI  FLW+  N  L     L R    +   C FC 
Sbjct: 1036 IREVNGGTIEKVEFN-WIWGIHAPPKIKNFLWKACNDGLATTSRLERSHIFVPQNCCFCD 1094

Query: 827  -SADSISHGFYEWGLARATWS-----FFGALLGVHFVERDLKGFLYSLWN-HRDPNSQLL 985
              +++I H  ++       +S     F        F    L  F   L   H +   + L
Sbjct: 1095 CPSETICHLCFQCPFTLDIYSHLEDKFQWPAYPSWFSTLQLSSFRSVLEACHINLTLEYL 1154

Query: 986  CLLPAVVCWSLWSARNTYLFDGKQTSPNMVIASVSQLLK--ELTSLKPPRVSGELPAD-- 1153
              L ++V W +W  RN  +F+ + TS +     +   +   E  +L+ P  +  LP D  
Sbjct: 1155 TKL-SIVWWHVWYFRNKLIFNNESTSFSQASFIIHSFMGKWEKANLEIPSFNTPLPKDCK 1213

Query: 1154 --------LLWS--NVTLMRRYRTPIAVH*NRPFARWKLNVDGSSRGNPGHAGAGIIVRD 1303
                    L+WS  N  ++                  K+N DGS   N G A  G ++R+
Sbjct: 1214 LPVRSGKNLIWSPPNEDVL------------------KVNFDGSKLDN-GQAAYGFVIRN 1254

Query: 1304 ITGTVVLTEAIYLGQLTS-LFVESLALLHGLKMCTERRLFPLEV--ETDSLVLFRMLRAG 1474
              G V++  A  LG   S L  E++ LL G+K     + +  ++  E D++ +   +   
Sbjct: 1255 SNGEVLMARAKALGVYPSILMAEAMGLLEGIKGAISLQNWSRKIIFEGDNIAVINAMSPS 1314

Query: 1475 SSWPWRIHSVLSSIHPLLG-LDAISFAHIYREANTVADYLA 1594
            ++ PW I +++     LLG    + F H YREAN +AD++A
Sbjct: 1315 ATGPWTIANIILDAGALLGHFQEVKFQHCYREANRLADFMA 1355


>dbj|BAE79382.1| unnamed protein product [Ipomoea batatas]
          Length = 1366

 Score =  134 bits (338), Expect = 6e-29
 Identities = 144/559 (25%), Positives = 219/559 (39%), Gaps = 32/559 (5%)
 Frame = +2

Query: 14   GRIILVRHVLQALPVHTLAAMNPPKSLLRQMEGLFAKFIWGSHGEQARRVWRA-WDRLAY 190
            GR +LV+  L  +P +T+  M  P S   +++     F+WG H    R++    W  +  
Sbjct: 792  GRRVLVQASLATVPTYTMQVMALPVSTCNEIDKTCRNFLWG-HDTNTRKLHSVNWAEICK 850

Query: 191  PISENGLGLRSLSSILDAFSCKLWWKLHKGIG-VW-----SKYVTGISWQHSSFHSR--- 343
            P +E GLGLR       AF  K+ W++   I  +W      KYV    + H    S    
Sbjct: 851  PRNEGGLGLRMARDFNRAFLTKMAWQIFSNIDKLWVKVLREKYVKNADFLHLQSQSNCSW 910

Query: 344  ----LQAVKAFADNHCHTRVGDGSS-SF*TSNWFGIGPLNRLV-----PHSLFPNISLHE 493
                +   K          VG+G   +F    W G GPL         PH    +I + +
Sbjct: 911  GWRSIMKGKDVLAGAIKWNVGNGRKINFWNDWWVGDGPLASNTDCINQPH--MTDIKVED 968

Query: 494  AY-GENGWHPQVLAE-LPQADLQVVQPATFHF-SDHPDLLIWDLTDSGEFTTKSVFSGVR 664
                +  W    L   LP   + +V+       S+  D L W  + +G  T  S +S + 
Sbjct: 969  LITSQRRWDTGALHNILPTNMIDMVRATPIAINSEQEDFLSWPHSTTGMVTVSSAYSLIA 1028

Query: 665  RRQPEVQFHRWVWRTGVPIKISFFLWRILNGLLPFDDVLARLGFSLASKCPFCSSAD-SI 841
                + + H W+WR     KI  F+W+I+   L  +    R G + A+ CP C   D ++
Sbjct: 1029 GHDGDDRSHDWIWRATCTEKIKLFMWKIVKNGLMVNVERKRRGLADAASCPVCGEEDETL 1088

Query: 842  SHGFYEWGLARATWSF------FGALLGVHFVERDLKGFLYSLWNHRDPNSQLLCLLPAV 1003
             H F    LA A W        F     +H +   +K    S    +D  S    L+   
Sbjct: 1089 DHLFRRCLLAEACWDSAVPPLTFQTSNHLH-MHSWMKAACSS--QQKDGYSTNWSLIFPY 1145

Query: 1004 VCWSLWSARNTYLFDGKQTSPNMVIASVSQLLKELTSLKPPRVSGELPADLLWSNVTLMR 1183
            + W+LW ARN  +FD   T+P+ ++        E   L   R               L  
Sbjct: 1146 ILWNLWKARNRLVFDNNITAPSDILNRSFMESSEARCLLAKRTG-------------LQT 1192

Query: 1184 RYRTPIAVH*NRPFARW-KLNVDGSSRGNPGHAGAGIIVRDITGTVVLTEAIYLGQLTSL 1360
             ++T +    + P A + KLN DG+ + +   A AG ++R+  G  V      +G   S 
Sbjct: 1193 AFQTWVVW--SPPAAGFTKLNSDGACKSHSHLASAGGLLRNENGLWVAGYTCNIGTANSF 1250

Query: 1361 FVESLALLHGLKMCTERRLFPLEVETDSLVLFRMLRAGSSWPWRIHSVLSSIHPLLG-LD 1537
              E   L  GL +   R    L  ETDS  + ++LR           ++     LL    
Sbjct: 1251 LAELWGLREGLLLAKNRGFTKLIAETDSEAVVQVLRKDGPVTPDASILVKDCKLLLDHFQ 1310

Query: 1538 AISFAHIYREANTVADYLA 1594
             I   HI RE N  AD+LA
Sbjct: 1311 EIKVTHILREGNQCADFLA 1329


>emb|CCA66009.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1378

 Score =  132 bits (332), Expect = 3e-28
 Identities = 158/604 (26%), Positives = 252/604 (41%), Gaps = 36/604 (5%)
 Frame = +2

Query: 2    LIAGGRIILVRHVLQALPVHTLAAMNPPKSLLRQMEGLFAKFIWGSHGEQARRVWRAWDR 181
            L   GR  L++    ++P +T+ +   P+S    ++     F+WG    + R    AW+ 
Sbjct: 790  LSIAGRATLIQSAFSSIPYYTMQSTKLPRSTCDDIDRKSRSFLWGEQEGKRRVHLVAWEN 849

Query: 182  LAYPISENGLGLRSLSSILDAFSCKLWWK-LHKGIGVWSKYVTG----------ISWQHS 328
            ++    E GLG+RS+     AF  KL W+ L +   +WS+ +            +  + S
Sbjct: 850  ISKSKKEGGLGIRSMRQANSAFLVKLGWRLLAEPSSLWSRILRAKYCDNRCDIDMFKEKS 909

Query: 329  SFHSRLQAVKAFAD---NHCHTRVGDGSSS-F*TSNWFGIGPLNRLVPHSLFPNISLHEA 496
            +  S  + + +  D      ++ VG+G+ + F    W    PL  L   S  P I L +A
Sbjct: 910  NASSTWRGILSSIDVVRKGINSAVGNGAKTLFWHHRWATSEPLISLA--SPIPPIELQDA 967

Query: 497  YGE------NGWHPQVLAE-LPQADLQVVQPATFHFSDHP---DLLIWDLTDSGEFTTKS 646
              +      +GW   V A  LP+A L+++  A     D     D + W+ + SG FT  S
Sbjct: 968  TVKEMWDLVSGWKVDVFANYLPEATLKLI--AAHELIDDEEAIDDIYWNGSPSGGFTIGS 1025

Query: 647  VFSGVRRRQ-PEVQFH-RW--VWRTGVPIKISFFLWRILNGLLPFDDVLARLGFSLASKC 814
              +  R  +   +  H +W  VW+   P ++ FF+W  +   L  +        +   +C
Sbjct: 1026 AMNITRNAELANMDAHPKWSAVWKIPTPQRVRFFIWLAIQDRLMTNSNRFLRRLTDDPRC 1085

Query: 815  PFCSSA-DSISHGFYEWGLARATWSFFGALLGVHFVERDLKG--FLYSLWNHRDPNSQLL 985
              C    ++  H      +AR  W   G +LG H  E    G     +L       S+ L
Sbjct: 1086 LVCGEVEENTDHILRRCPVARILWRKLG-MLGEHNREEINLGSWITKNLSADTMMGSEWL 1144

Query: 986  CLLPAVVCWSLWSARNTYLFDGKQTSPNMVIASVSQLLKELTSLKPPRVSGELPADLLWS 1165
             +  AV CW LW  RN   F+    +P++ I  VS +   +  +K      +       +
Sbjct: 1145 RVF-AVSCWWLWRWRNDRCFN---RNPSIPIDQVSFIFARVKEIKEAMDRND-------T 1193

Query: 1166 NVTLMRRYRTPIAVH*NRPFARW-KLNVDGSSRGNPGHAGAGIIVRDITGTVVLTEAIYL 1342
            N +     R  I V    P   W KLN DG+S+GNPG AG G ++R   G +    AI  
Sbjct: 1194 NKSQHSGRRKEILVRWQCPKEGWVKLNTDGASKGNPGPAGGGGLIRGPRGEIHEVFAINC 1253

Query: 1343 GQLTSLFVESLALLHGLKMCTERRLFPLEVETDSLVLFRML--RAGSSWPWRIHSVLSSI 1516
            G  T    E LA+L GL +  E     + V  DS ++ ++L   A  S P+ IH +   +
Sbjct: 1254 GSCTCTKAELLAVLRGLMIAWEGNHKQVIVSVDSELVAKLLISNAPPSSPY-IHIINRCL 1312

Query: 1517 HPLLGLD-AISFAHIYREANTVADYLATHAXXXXXXXXXXXXXLPRKLVGLVLLDQLGCP 1693
              +   +  I   H YRE N  AD LA +              +P+ L  ++L D  G  
Sbjct: 1313 SLIARKEWKIVIEHCYRETNRAADRLA-NMGVCAVERVVMIEAIPKDLHAILLEDLSGVA 1371

Query: 1694 NLRM 1705
              RM
Sbjct: 1372 WTRM 1375


>gb|AAT38805.1| hypothetical protein SDM1_47t00008 [Solanum demissum]
          Length = 1155

 Score =  122 bits (305), Expect = 4e-25
 Identities = 77/279 (27%), Positives = 118/279 (42%), Gaps = 3/279 (1%)
 Frame = +2

Query: 2    LIAGGRIILVRHVLQALPVHTLAAMNPPKSLLRQMEGLFAKFIWGSHGEQARRVWRAWDR 181
            L  GGR++L++HVL A+P H L A+ P K  L  +E   A+F W S  E  +  W AW  
Sbjct: 705  LSTGGRVVLIKHVLLAIPTHLLVALQPTKGTLDNIEIYLARFFWSSKEEGGKHHWIAWRT 764

Query: 182  LAYPISENGLGLRSLSSILDAFSCKLWWKLHKGIGVWSKYVTGISWQHSSFHSRLQAVKA 361
            L  P  E  + +R +  + +AFS                                   K 
Sbjct: 765  LCLPFEEGWINIRRIGDVCNAFS----------------------------------YKE 790

Query: 362  FADNHCHTRVGDGSSSF*TSNWFGIGPLNRLVPHSL-FPNISLHEAYGENGWHPQVLA-E 535
             A+N     +G    +F   NW  IGPL +L+P      NI + E  G   W+  +L  +
Sbjct: 791  IAENEIKRILGRDEVNFWCDNWSQIGPLYKLLPTGFSVHNIMVKEILGNGKWNWSILQNQ 850

Query: 536  LP-QADLQVVQPATFHFSDHPDLLIWDLTDSGEFTTKSVFSGVRRRQPEVQFHRWVWRTG 712
            LP Q  + +        ++  D  +W  T +G FT  S ++  R +  E+     +W   
Sbjct: 851  LPNQIKVMITGLGLNLNNEEQDKPVWSPTSAGNFTVTSAWNICRHKGIEITDFNKIWHKD 910

Query: 713  VPIKISFFLWRILNGLLPFDDVLARLGFSLASKCPFCSS 829
            +P K+SF  W+ +   LP      RLG  L+  C  C++
Sbjct: 911  IPFKMSFLTWKAIINRLPTGAKFKRLGIPLSPTCYCCTN 949



 Score = 82.8 bits (203), Expect = 3e-13
 Identities = 52/157 (33%), Positives = 81/157 (51%), Gaps = 1/157 (0%)
 Frame = +2

Query: 1235 KLNVDGSSRGNPGHAGAGIIVRDITGTVVLTEAIYLGQLTSLFVESLALLHGLKMCTERR 1414
            KLN DGS     G  G G I+R+  G V++   I LG+ TS + E++++LHG+++C +R 
Sbjct: 1001 KLNTDGSCVN--GRCGGGGILRNALGQVIMAFTIKLGEGTSSWAEAMSMLHGMQLCIQRG 1058

Query: 1415 LFPLEVETDSLVLFRMLRAGSSWPWRIHSVLSSIHPLLGLDAISFAHIYREANTVADYLA 1594
            +  +  ETDS++L + +    S PWR++  +  I  ++        H  REAN  AD LA
Sbjct: 1059 VNMIIGETDSILLAKAITENWSIPWRMYIPVKKIQKMVEEHGFIINHCLREANQPADKLA 1118

Query: 1595 T-HAXXXXXXXXXXXXXLPRKLVGLVLLDQLGCPNLR 1702
            +                LP  + GLV LD++  P  R
Sbjct: 1119 SISLSTDVNHVFKSYANLPSLVKGLVNLDRMNLPTFR 1155
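
The per-alignment statistics lines in this report (Score, Expect, Identities, Positives, Gaps) follow a regular layout, so they can be pulled out with a few regular expressions. The following is only a minimal sketch in Python, assuming the legacy text layout shown in this report; the function name parse_hsp_stats and the dictionary keys are hypothetical and are not part of BLAST or any library. It also relies on the observation that the percentages printed in this report appear to be truncated rather than rounded (for example, 141/530 is about 26.6% but is printed as 26%).

```python
import re

# Regexes for the statistics lines of a legacy BLAST text report.
# Assumption: the layout matches the report above ("Score = ... bits (...), Expect = ...").
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)"
)
COUNT_RE = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)\s*\((\d+)%\)")


def parse_hsp_stats(text):
    """Return one dict per 'Score = ...' block found in the report text.

    Hypothetical helper for illustration; keys are chosen here, not by BLAST.
    """
    hsps = []
    for line in text.splitlines():
        m = SCORE_RE.search(line)
        if m:
            expect = m.group(3)
            # Legacy BLAST may print very small values as "e-100" (no leading digit).
            if expect[0] in "eE":
                expect = "1" + expect
            hsps.append(
                {"bits": float(m.group(1)), "raw": int(m.group(2)), "expect": float(expect)}
            )
            continue
        if hsps:
            # Attach Identities/Positives/Gaps to the most recent Score block.
            for name, num, den, pct in COUNT_RE.findall(line):
                hsps[-1][name.lower()] = (int(num), int(den), int(pct))
    return hsps


def truncated_percent(numerator, denominator):
    """Integer percentage with the fractional part dropped (floor)."""
    return numerator * 100 // denominator
```

Applied to the whole report, such a function would be expected to yield six entries, one per "Score =" block, including the two separate alignments reported for gb|AAT38805.1|. For anything beyond a quick inspection, a maintained BLAST parser is likely a safer choice than hand-written patterns.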

