BLASTX nr result

ID: Cimicifuga21_contig00001871 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cimicifuga21_contig00001871
         (1839 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002280960.2| PREDICTED: DNA topoisomerase 1-like [Vitis v...   652   0.0  
emb|CAN67233.1| hypothetical protein VITISV_020021 [Vitis vinifera]   649   0.0  
ref|XP_002528003.1| prokaryotic DNA topoisomerase, putative [Ric...   613   e-173
ref|XP_002334451.1| predicted protein [Populus trichocarpa] gi|2...   606   e-171
ref|NP_194849.3| DNA topoisomerase, type IA, core [Arabidopsis t...   602   e-170

>ref|XP_002280960.2| PREDICTED: DNA topoisomerase 1-like [Vitis vinifera]
          Length = 1185

 Score =  652 bits (1683), Expect = 0.0
 Identities = 355/536 (66%), Positives = 404/536 (75%), Gaps = 4/536 (0%)
 Frame = +2

Query: 242  DGKKDVKLESPVSTSVKSSGNNKNPQAQGKKKQPGNKREKALSGSTVPPGD-AKINGSKQ 418
            +G+KD  L   +STS  S+ N  +   + ++KQ   K+ K    ST    D A+   +K 
Sbjct: 230  NGRKDADLSPSISTSPVSNNNRGSKATEKQRKQSRTKKNKEQVTSTDASSDVAQKKSTKS 289

Query: 419  VFHVKKPNVATGGGQSSQVSEILLAGDTPLLKIDNSTLTH---EKSVGKPARKRKPQKKV 589
                 K N+ T   QS Q S+    G+ P+  +D+S  T    +K+ G   +K K  K  
Sbjct: 290  SSEANKSNI-TKKSQSPQASKNNSTGNKPVEALDSSVSTKSQSKKATGSSNKKGKSPK-- 346

Query: 590  VVHMKPDXXXXXXXXXXXXXXXXXXGLKSLYPPTGKSVVVVESLTKAKVIQGYLGDMFEV 769
            V +  P                    LK LYP +GKSVVVVES+TKAKVIQGYLGDM+EV
Sbjct: 347  VANESPKKQTVHTMGKIKSLEQRP--LKKLYPSSGKSVVVVESVTKAKVIQGYLGDMYEV 404

Query: 770  LPSYGHVRDLAGRSGSVRPDDDFSMVWEVPSAAWTHLKSIKVALNGAENLILASDPDREG 949
            LPSYGHVRDLAGRSGSVRPDDDFSMVWEVPSAAWTHLKSIKVAL GAENLILASDPDREG
Sbjct: 405  LPSYGHVRDLAGRSGSVRPDDDFSMVWEVPSAAWTHLKSIKVALGGAENLILASDPDREG 464

Query: 950  EAIAWHITEMLQQQDALNDDINVSRVVFHEITEPSIKSALQAPRDIDANLVHAYLARRAL 1129
            EAIAWHI EML QQDAL+ D+ V+RVVFHEITE SIKSAL APR+ID NLVHAYLARRAL
Sbjct: 465  EAIAWHIIEMLLQQDALHKDLTVARVVFHEITESSIKSALDAPREIDVNLVHAYLARRAL 524

Query: 1130 DYLIGFSISPLLWRKLPACQSAGRVQSAALALLCDREMEIDEFRPHEYWTVEVGFHKTEP 1309
            DYLIGF+ISPLLWRKLP CQSAGRVQSAALAL+CDREMEIDEF+P EYWTVEV F++ + 
Sbjct: 525  DYLIGFNISPLLWRKLPGCQSAGRVQSAALALICDREMEIDEFKPQEYWTVEVEFNRKQ- 583

Query: 1310 GSSANVSSFPSHLTHLDFKKLEQLSISSCEEAHAIEQKMTSSSFEVSGSKRSKSRRNPPT 1489
            GSS N   FPS+LTH D KKL Q SISS  EA AIEQ++ S  F+V GSKR+K R+NPPT
Sbjct: 584  GSSMNSKFFPSYLTHFDSKKLNQFSISSHTEAKAIEQEINSLEFKVIGSKRNKMRKNPPT 643

Query: 1490 PYITSTLQQDAANKLNFSASYTMKLAQKLYEGVKLTNDEATGLITYMRTDGLHISDVAAK 1669
            PYITSTLQQDAANKL+FSA YTMKLAQ+LYEGV+L++ +A GLITYMRTDGLH+SD AAK
Sbjct: 644  PYITSTLQQDAANKLHFSAMYTMKLAQRLYEGVQLSDGKAAGLITYMRTDGLHVSDEAAK 703

Query: 1670 DICSLVIERYGQVFASESPRKYFKKVKNAQEAHEAIRPTSIRRLPSMLTGVLDEDS 1837
            DI SLV ERYG   AS+  RKYFKKVKNAQEAHEAIRPT I+RLPSML GVLDEDS
Sbjct: 704  DIRSLVAERYGSNLASDGVRKYFKKVKNAQEAHEAIRPTDIQRLPSMLAGVLDEDS 759


>emb|CAN67233.1| hypothetical protein VITISV_020021 [Vitis vinifera]
          Length = 1039

 Score =  649 bits (1674), Expect = 0.0
 Identities = 357/549 (65%), Positives = 406/549 (73%), Gaps = 17/549 (3%)
 Frame = +2

Query: 242  DGKKDVKLESPVSTSVKSSGNNKNPQAQGKKKQPGNKREKALSGSTVPPGD-AKINGSKQ 418
            +G+KDV L   +STS  S+ N  +   + ++KQ   K+ K    ST    D A+   +K 
Sbjct: 230  NGRKDVDLSPSISTSPVSNNNRGSKATEKQRKQSRTKKNKEQVTSTDASSDVAQKKSTKS 289

Query: 419  VFHVKKPNVATGGGQSSQVSEIL-------------LAGDTPLLKIDNSTLTH---EKSV 550
                 K N+ T   QS Q S++                G+ P+  +D+S  T    +K+ 
Sbjct: 290  SSEANKSNI-TKKSQSPQASKVRHVSICIYELFQNNSTGNKPVEALDSSVSTKSQSKKAT 348

Query: 551  GKPARKRKPQKKVVVHMKPDXXXXXXXXXXXXXXXXXXGLKSLYPPTGKSVVVVESLTKA 730
            G   +K K  K  V +  P                    LK LYP +GKSVVVVES+TKA
Sbjct: 349  GSSNKKGKSPK--VANESPKKQTVHTMGKIKSLEQRP--LKKLYPSSGKSVVVVESVTKA 404

Query: 731  KVIQGYLGDMFEVLPSYGHVRDLAGRSGSVRPDDDFSMVWEVPSAAWTHLKSIKVALNGA 910
            KVIQGYLGDM+EVLPSYGHVRDLAGRSGSVRPDDDFSMVWEVPSAAWTHLKSIKVAL GA
Sbjct: 405  KVIQGYLGDMYEVLPSYGHVRDLAGRSGSVRPDDDFSMVWEVPSAAWTHLKSIKVALGGA 464

Query: 911  ENLILASDPDREGEAIAWHITEMLQQQDALNDDINVSRVVFHEITEPSIKSALQAPRDID 1090
            ENLILASDPDREGEAIAWHI EML QQDAL+ D+ V+RVVFHEITE SIKSAL APR+ID
Sbjct: 465  ENLILASDPDREGEAIAWHIIEMLLQQDALHKDLTVARVVFHEITESSIKSALDAPREID 524

Query: 1091 ANLVHAYLARRALDYLIGFSISPLLWRKLPACQSAGRVQSAALALLCDREMEIDEFRPHE 1270
             NLVHAYLARRALDYLIGF+ISPLLWRKLP CQSAGRVQSAALAL+CDREMEIDEF+P E
Sbjct: 525  VNLVHAYLARRALDYLIGFNISPLLWRKLPGCQSAGRVQSAALALICDREMEIDEFKPQE 584

Query: 1271 YWTVEVGFHKTEPGSSANVSSFPSHLTHLDFKKLEQLSISSCEEAHAIEQKMTSSSFEVS 1450
            YWTVEV F++ + GSS N   FPS+LTH D KKL Q SISS  EA AIEQ++ S  F+V 
Sbjct: 585  YWTVEVEFNRKQ-GSSMNSKFFPSYLTHFDSKKLNQFSISSHTEAKAIEQEINSLEFKVI 643

Query: 1451 GSKRSKSRRNPPTPYITSTLQQDAANKLNFSASYTMKLAQKLYEGVKLTNDEATGLITYM 1630
            GSKR+K R+NPPTPYITSTLQQDAANKL+FSA YTMKLAQ+LYEGV+L++ +A GLITYM
Sbjct: 644  GSKRNKMRKNPPTPYITSTLQQDAANKLHFSAMYTMKLAQRLYEGVQLSDGKAAGLITYM 703

Query: 1631 RTDGLHISDVAAKDICSLVIERYGQVFASESPRKYFKKVKNAQEAHEAIRPTSIRRLPSM 1810
            RTDGLH+SD AAKDI SLV ERYG   AS+  RKYFKKVKNAQEAHEAIRPT IRRLPSM
Sbjct: 704  RTDGLHVSDEAAKDIRSLVAERYGSNLASDGVRKYFKKVKNAQEAHEAIRPTDIRRLPSM 763

Query: 1811 LTGVLDEDS 1837
            L GVLDEDS
Sbjct: 764  LAGVLDEDS 772


>ref|XP_002528003.1| prokaryotic DNA topoisomerase, putative [Ricinus communis]
            gi|223532629|gb|EEF34415.1| prokaryotic DNA
            topoisomerase, putative [Ricinus communis]
          Length = 1071

 Score =  613 bits (1581), Expect = e-173
 Identities = 344/578 (59%), Positives = 399/578 (69%), Gaps = 16/578 (2%)
 Frame = +2

Query: 152  SSVSVSHQPVKKSPDAIKSV-PKNLHVPNHKDGK----------KDVKLESPVSTSVKSS 298
            SS++ S  P    P  + S+ P +L + +H+             K  K   P +  + + 
Sbjct: 95   SSINPSMLPQTSPPLFLSSLTPPHLFLTSHRHFSSVNQRSAGFSKYWKRTKPFTAHLNNK 154

Query: 299  GNNKNPQAQGKKKQPGNKREKALSGSTVPPGDAKINGSKQVFHVKKPNVATGGGQSSQV- 475
             N+ NP  Q       +      + ++ P      +      + KK   +T   +++Q  
Sbjct: 155  DNDANPTEQSSGDVNLDSTAPVTTPNSKPTPTTTRHKKHSKTNSKKQQSSTSVEEAAQAK 214

Query: 476  -SEILLAGDTPLLKIDNSTLTH---EKSVGKPARKRKPQKKVVVHMKPDXXXXXXXXXXX 643
             S        P   +D S  T     K   KP  K K  K V V   PD           
Sbjct: 215  TSSSTKTKTKPKELLDASASTKPQPNKKTRKPTGKPKAIKTVKV--SPDKQQQHKPMHKS 272

Query: 644  XXXXXXXGLKSLYPPTGKSVVVVESLTKAKVIQGYLGDMFEVLPSYGHVRDLAGRSGSVR 823
                    LK LYPPT KSVVVVES+TKAKVIQGYLG MFEVLPSYGHVRDLA RSGSVR
Sbjct: 273  KPFRQG-SLKPLYPPTAKSVVVVESVTKAKVIQGYLGPMFEVLPSYGHVRDLAARSGSVR 331

Query: 824  PDDDFSMVWEVPSAAWTHLKSIKVALNGAENLILASDPDREGEAIAWHITEMLQQQDALN 1003
            PDDDFSMVWEVPS AWTHLK+IKVALNGAENL+LASDPDREGEAIAWHI EMLQQQDAL+
Sbjct: 332  PDDDFSMVWEVPSPAWTHLKTIKVALNGAENLVLASDPDREGEAIAWHIIEMLQQQDALH 391

Query: 1004 DDINVSRVVFHEITEPSIKSALQAPRDIDANLVHAYLARRALDYLIGFSISPLLWRKLPA 1183
              + V+RVVFHEITE SIK+ALQAPR+ID NLVHAYLARRALDYLIGF+ISPLLWRKLP 
Sbjct: 392  QGVTVARVVFHEITEQSIKNALQAPREIDLNLVHAYLARRALDYLIGFNISPLLWRKLPG 451

Query: 1184 CQSAGRVQSAALALLCDREMEIDEFRPHEYWTVEVGFHKTEPGSSANVSSFPSHLTHLDF 1363
            CQSAGRVQSAAL+L+CDREMEIDEF P EYWT++V  +  E GSS N     + LTH DF
Sbjct: 452  CQSAGRVQSAALSLICDREMEIDEFTPQEYWTIDVELYIMETGSSVN-----ARLTHFDF 506

Query: 1364 KKLEQLSISSCEEAHAIEQKMTSSSFEVSGSKRSKSRRNPPTPYITSTLQQDAANKLNFS 1543
             KL QLS+ S  EA  IEQK+  +SF V+G+K SK RRNPPTPYITSTLQQDAANKL+FS
Sbjct: 507  NKLNQLSVRSHTEARDIEQKINVASFLVAGAKESKMRRNPPTPYITSTLQQDAANKLHFS 566

Query: 1544 ASYTMKLAQKLYEGVKLTNDEATGLITYMRTDGLHISDVAAKDICSLVIERYGQVFASES 1723
            + YTMKLAQKLYEGV+L++ + TGLITY+RTDGLHIS  A K+I SLVIERYGQ FAS+S
Sbjct: 567  SMYTMKLAQKLYEGVQLSDGKTTGLITYIRTDGLHISGEAVKEIHSLVIERYGQDFASDS 626

Query: 1724 PRKYFKKVKNAQEAHEAIRPTSIRRLPSMLTGVLDEDS 1837
            PRKYFKKVKNAQEAHEA+RPT++R LPSML  VLDEDS
Sbjct: 627  PRKYFKKVKNAQEAHEAVRPTNVRMLPSMLANVLDEDS 664


>ref|XP_002334451.1| predicted protein [Populus trichocarpa] gi|222872210|gb|EEF09341.1|
            predicted protein [Populus trichocarpa]
          Length = 780

 Score =  606 bits (1562), Expect = e-171
 Identities = 341/538 (63%), Positives = 391/538 (72%), Gaps = 10/538 (1%)
 Frame = +2

Query: 254  DVKLESPVS-TSVKSSGNNKNPQAQGKKKQPGNKREKALSGSTVPPGDAK----INGSKQ 418
            D KL S VS T  K S  N+  + + K K+  NK E + +  +     AK     +  K+
Sbjct: 181  DAKLGSAVSRTRNKKSKVNETEKKESKSKK--NKGEASFTAVSEKAAGAKSMTTTSQLKK 238

Query: 419  VFHVKKPNVATGGGQSSQVSEILLAGDTPLL---KIDNSTL--THEKSVGKPARKRKPQK 583
                K   VA    Q     E +  G T      K  NS++  T +K+V   A K+KPQK
Sbjct: 239  SLPSKSGQVAARASQKKATEE-MPNGSTKQQSNKKNGNSSIKRTTKKAVNGSAEKQKPQK 297

Query: 584  KVVVHMKPDXXXXXXXXXXXXXXXXXXGLKSLYPPTGKSVVVVESLTKAKVIQGYLGDMF 763
              +  ++P                    LKSLYP  GK VVVVES+TKAKVIQGYLGDM+
Sbjct: 298  --IGKLQP---------------FAQGKLKSLYPSAGKCVVVVESVTKAKVIQGYLGDMY 340

Query: 764  EVLPSYGHVRDLAGRSGSVRPDDDFSMVWEVPSAAWTHLKSIKVALNGAENLILASDPDR 943
            EVLPSYGHVRDLA RSGSVRPDDDFS+VWEVPSAAWTHL SIKVAL+GA+ LILASDPDR
Sbjct: 341  EVLPSYGHVRDLAARSGSVRPDDDFSIVWEVPSAAWTHLNSIKVALSGAKILILASDPDR 400

Query: 944  EGEAIAWHITEMLQQQDALNDDINVSRVVFHEITEPSIKSALQAPRDIDANLVHAYLARR 1123
            EGEAIAWHI EMLQQQDAL  D+ V+RVVFHEITE SIK+ALQAPR ID NLVHAYLARR
Sbjct: 401  EGEAIAWHIVEMLQQQDALRQDLTVARVVFHEITETSIKNALQAPRGIDVNLVHAYLARR 460

Query: 1124 ALDYLIGFSISPLLWRKLPACQSAGRVQSAALALLCDREMEIDEFRPHEYWTVEVGFHKT 1303
            ALDYLIGF+ISPLLWRKLP CQSAGRVQSAAL+LLCDREMEIDEF   EYWT+     K 
Sbjct: 461  ALDYLIGFNISPLLWRKLPGCQSAGRVQSAALSLLCDREMEIDEFNSQEYWTIGAKLKKQ 520

Query: 1304 EPGSSANVSSFPSHLTHLDFKKLEQLSISSCEEAHAIEQKMTSSSFEVSGSKRSKSRRNP 1483
            E  S  N     +HLTH D KKL Q SISS  EA  +EQK+ S+ F+V GSK++K+ RNP
Sbjct: 521  EQDSLVN-----AHLTHFDSKKLNQFSISSDTEAKDMEQKINSADFQVVGSKKTKTHRNP 575

Query: 1484 PTPYITSTLQQDAANKLNFSASYTMKLAQKLYEGVKLTNDEATGLITYMRTDGLHISDVA 1663
            PTPYITSTLQQDAANKL+F+ SYTMK+AQKLYEGV+L++ +ATGLITY+RTDGLHISD A
Sbjct: 576  PTPYITSTLQQDAANKLDFATSYTMKVAQKLYEGVQLSDGKATGLITYLRTDGLHISDEA 635

Query: 1664 AKDICSLVIERYGQVFASESPRKYFKKVKNAQEAHEAIRPTSIRRLPSMLTGVLDEDS 1837
              +I SL+IERYGQ FAS+ PRKYF+KVKNAQEAHEAIRPT+I  LPSML GVLDEDS
Sbjct: 636  VGNIRSLIIERYGQDFASKGPRKYFRKVKNAQEAHEAIRPTNIHLLPSMLVGVLDEDS 693


>ref|NP_194849.3| DNA topoisomerase, type IA, core [Arabidopsis thaliana]
            gi|332660478|gb|AEE85878.1| DNA topoisomerase, type IA,
            core [Arabidopsis thaliana]
          Length = 1284

 Score =  602 bits (1553), Expect = e-170
 Identities = 342/611 (55%), Positives = 416/611 (68%), Gaps = 9/611 (1%)
 Frame = +2

Query: 32   PLVSVSSQP--VKKSASNAIKSVPKNLHA--PKANDGKKGVKLESSVSVSHQPVKKSPDA 199
            P  S+S Q       ASNA K    N+     KA   K G K  +     H  +  S   
Sbjct: 262  PSNSMSEQQHWTSTKASNAPKQEQDNIVGGDEKAGGNKVGFKKFNKNRKKHNVLASSEAE 321

Query: 200  IKSVPKNLHVPNHKDGKKDVKLESPVSTSVKSSGNNKNPQAQGKKKQPGNKREKALSGST 379
            + +  +    P   DG   +K E   + S  S+GN         K++P NK+ +  S S 
Sbjct: 322  VVTSTE----PVIGDGSSGIKAELSTAASPASNGNQATTVKS--KRRPKNKKVEDKSSSV 375

Query: 380  VPPGDAKINGSKQVFHVKKPNVATGGGQSSQVSEILLAGDTPLLKIDNSTLTHEKSVGK- 556
            VP  +A ++  +    V KP  +  G + S  ++  +A + P+ +  +   ++ KS  + 
Sbjct: 376  VPVLEA-VSLDESPISVPKPKHSGSGNRKSSSAKKEVAKNHPVEEPKSPAPSNSKSEQQH 434

Query: 557  ----PARKRKPQKKVVVHMKPDXXXXXXXXXXXXXXXXXXGLKSLYPPTGKSVVVVESLT 724
                 A K   QK V  HMK                      K LYPP+GKSV+VVES+T
Sbjct: 435  LKSTKASKAPKQKLVPQHMKNSIEHRGQNAS-----------KPLYPPSGKSVIVVESMT 483

Query: 725  KAKVIQGYLGDMFEVLPSYGHVRDLAGRSGSVRPDDDFSMVWEVPSAAWTHLKSIKVALN 904
            KAK+IQGYLGDM+EVLPSYGH+RDLA RSGSVRPDDDFSMVWEVPS+AWTH+KSIKVALN
Sbjct: 484  KAKIIQGYLGDMYEVLPSYGHIRDLATRSGSVRPDDDFSMVWEVPSSAWTHIKSIKVALN 543

Query: 905  GAENLILASDPDREGEAIAWHITEMLQQQDALNDDINVSRVVFHEITEPSIKSALQAPRD 1084
            GAENLILASDPDREGEAIAWHI EMLQQQ AL++ + V+RVVFHEITE +IKSALQ+PR+
Sbjct: 544  GAENLILASDPDREGEAIAWHIIEMLQQQGALHESMTVARVVFHEITESAIKSALQSPRE 603

Query: 1085 IDANLVHAYLARRALDYLIGFSISPLLWRKLPACQSAGRVQSAALALLCDREMEIDEFRP 1264
            ID +LVHAYLARRALDYLIGF+ISPLLWRKLP C SAGRVQSAALAL+CDRE EID F+P
Sbjct: 604  IDGDLVHAYLARRALDYLIGFNISPLLWRKLPGCPSAGRVQSAALALVCDRESEIDGFKP 663

Query: 1265 HEYWTVEVGFHKTEPGSSANVSSFPSHLTHLDFKKLEQLSISSCEEAHAIEQKMTSSSFE 1444
             EYWTV +     +     N ++F +HLT L+ K+L QLSISS   A  IEQ++ S  F 
Sbjct: 664  QEYWTVGI-----KVKGKDNSATFSAHLTSLNSKRLNQLSISSEANAQDIEQRIKSEGFL 718

Query: 1445 VSGSKRSKSRRNPPTPYITSTLQQDAANKLNFSASYTMKLAQKLYEGVKLTNDEATGLIT 1624
            V G+K S +R+NPPTPYITSTLQQDAANKL+FS ++TMKLAQKLYEGV+L++ ++ GLIT
Sbjct: 719  VKGTKTSTTRKNPPTPYITSTLQQDAANKLHFSTAHTMKLAQKLYEGVQLSDGKSAGLIT 778

Query: 1625 YMRTDGLHISDVAAKDICSLVIERYGQVFASESPRKYFKKVKNAQEAHEAIRPTSIRRLP 1804
            YMRTDGLHI+D A KDI SLV ERYG+ F S+SPRKYFKKVKNAQEAHEAIRPT IRRLP
Sbjct: 779  YMRTDGLHIADEAIKDIQSLVAERYGKNFTSDSPRKYFKKVKNAQEAHEAIRPTDIRRLP 838

Query: 1805 SMLTGVLDEDS 1837
            S +  +LD DS
Sbjct: 839  STIASLLDADS 849

