BLASTX nr result

ID: Coptis23_contig00003403 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis23_contig00003403
         (1718 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002279655.1| PREDICTED: uncharacterized protein LOC100255...   724   0.0  
ref|XP_002327325.1| predicted protein [Populus trichocarpa] gi|2...   713   0.0  
ref|XP_003530919.1| PREDICTED: uncharacterized protein LOC100809...   701   0.0  
ref|XP_002881945.1| hypothetical protein ARALYDRAFT_903808 [Arab...   695   0.0  
ref|NP_566005.1| transcription termination factor domain-contain...   695   0.0  

>ref|XP_002279655.1| PREDICTED: uncharacterized protein LOC100255927 [Vitis vinifera]
          Length = 481

 Score =  724 bits (1869), Expect = 0.0
 Identities = 369/455 (81%), Positives = 397/455 (87%)
 Frame = +1

Query: 235  FASDSSKFPEYEMPTVTWGVVQGRKEKLVSRVIICDYLKNLGIVPDELEHLELPSTVQVM 414
            F S SSKFPEYEMP+VTWGVV GRKE+LVSRVII DYLK LGI+PDELE +ELPSTV+VM
Sbjct: 28   FLSSSSKFPEYEMPSVTWGVVLGRKERLVSRVIISDYLKTLGIIPDELEQVELPSTVEVM 87

Query: 415  KERVEFLQKIGLTIDDINEYPLMLGCSVRKNIIPVLGYLEKIGIKRSSMGEFVKNYPQXX 594
            +ERVEFLQK+G+TID +NEYPLMLGCSVRKN+IPVLGYLEKIGI RS +GEFV NYPQ  
Sbjct: 88   RERVEFLQKLGVTIDHLNEYPLMLGCSVRKNMIPVLGYLEKIGIPRSKLGEFVVNYPQVL 147

Query: 595  XXXXXXXXXXXXXXXRGLDVDRQDIGYVLQKYPELLGFKLEGTMSTSVAYLVSIGVSPRD 774
                           RGLDVD+QDIGYVL KYPELLGFKLEGTMSTSVAYLVSIGVSPRD
Sbjct: 148  HASVVVELAPVVKFLRGLDVDKQDIGYVLMKYPELLGFKLEGTMSTSVAYLVSIGVSPRD 207

Query: 775  IGPMVTQYPYLLGMRVGTVIKPIVDYLVSLGIPKKILARMFEKRAYVLGYDLEETIKSNV 954
            IGPMVTQYPY LGMRVGTVIKPIVDYLVSLG+PKK+LARMFEKRAYVLGYDLEE IK NV
Sbjct: 208  IGPMVTQYPYFLGMRVGTVIKPIVDYLVSLGLPKKVLARMFEKRAYVLGYDLEECIKPNV 267

Query: 955  DCLLSFGIRREALASVIAQYPQILGLPLKAKLSSQQYFFNLKLKIDPDGFARVIEKMPQM 1134
            DCL+SFGIRREALASVIAQ+PQILGLPLKAKLSSQQYFFNLKLKIDPDGFARVIE+MPQ+
Sbjct: 268  DCLVSFGIRREALASVIAQFPQILGLPLKAKLSSQQYFFNLKLKIDPDGFARVIERMPQI 327

Query: 1135 VSLNQSVIMKPIEFLRGRTIPAEDVAKMVVKCPQLVAQRVELMKNSYYFFKSEMKWSLQE 1314
            VSLNQ+VIMKP+EFL GR IPA DVAKMVVKCPQLVA RVELMKN YYFFKSEM   ++E
Sbjct: 328  VSLNQNVIMKPVEFLLGRGIPAVDVAKMVVKCPQLVALRVELMKNGYYFFKSEMGRQVKE 387

Query: 1315 LVEFPEYFTYSLESRIKPRYQSLSSKGVKFSLAWFLNCSDQRFEERLQADYIETESAGPS 1494
            LVEFPEYFTYSLESRIKPRYQ L SKGV+ SL WFLNCSDQRFEERLQADYIE E+ GPS
Sbjct: 388  LVEFPEYFTYSLESRIKPRYQRLQSKGVRSSLDWFLNCSDQRFEERLQADYIEMETIGPS 447

Query: 1495 FYVGGKLELPGSEIVXXXXXXXXXXXILYRRTVSL 1599
            F +GGKL+LPG+E+V            LYRRTVSL
Sbjct: 448  FCMGGKLQLPGNEVVSDEEDESDDEE-LYRRTVSL 481


>ref|XP_002327325.1| predicted protein [Populus trichocarpa] gi|222835695|gb|EEE74130.1|
            predicted protein [Populus trichocarpa]
          Length = 514

 Score =  713 bits (1840), Expect = 0.0
 Identities = 366/489 (74%), Positives = 410/489 (83%), Gaps = 4/489 (0%)
 Frame = +1

Query: 145  LISKTIDSS--QNPQFTNAQSKNAFFNVKVRNFASDSSKFPEYEMPTVTWGVVQGRKEKL 318
            LISKT  +   QNP   N      F+ +    F++ +SKF EYEMP+VTWGVVQG+KEKL
Sbjct: 30   LISKTQQNPCPQNPLTQNPLGVLQFYAL----FSTQASKFHEYEMPSVTWGVVQGKKEKL 85

Query: 319  VSRVIICDYLKNLGIVPDELEHLELPSTVQVMKERVEFLQKIGLTIDDINEYPLMLGCSV 498
            V+RVIICDYLK LGI+PDELE LELPSTV+VMKERVEFLQ++GLTIDDINEYPLMLGCSV
Sbjct: 86   VNRVIICDYLKGLGIIPDELESLELPSTVEVMKERVEFLQRMGLTIDDINEYPLMLGCSV 145

Query: 499  RKNIIPVLGYLEKIGIKRSSMGEFVKNYPQXXXXXXXXXXXXXXXXXRGLDVDRQDIGYV 678
            RKNIIPVLGYLEKIGI RS +GEFVK+YPQ                 RGLDVD+ DIGYV
Sbjct: 146  RKNIIPVLGYLEKIGISRSKLGEFVKSYPQVLHASVVVELQPVIKFLRGLDVDKLDIGYV 205

Query: 679  LQKYPELLGFKLEGTMSTSVAYLVSIGVSPRDIGPMVTQYPYLLGMRVGTVIKPIVDYLV 858
            LQKYPELLGFKLEGTMSTSVAYLVSIGVSPRDIGPMVTQYPYLLGMRVGT+IKP+VDYLV
Sbjct: 206  LQKYPELLGFKLEGTMSTSVAYLVSIGVSPRDIGPMVTQYPYLLGMRVGTMIKPLVDYLV 265

Query: 859  SLGIPKKILARMFEKRAYVLGYDLEETIKSNVDCLLSFGIRREALASVIAQYPQILGLPL 1038
            SLG+PKKI+ARM EKR YVLGYDL+ET+K NVDCL+SFGIRRE LAS++AQYP ILGLPL
Sbjct: 266  SLGLPKKIVARMLEKRPYVLGYDLQETVKPNVDCLISFGIRREVLASIVAQYPPILGLPL 325

Query: 1039 KAKLSSQQYFFNLKLKIDPDGFARVIEKMPQMVSLNQSVIMKPIEFLRGRTIPAEDVAKM 1218
            KAKLSSQQYFFNLKLKIDP+ FARVIEKMPQ+VSLNQ+VIMKP++FL  R IP+EDVA M
Sbjct: 326  KAKLSSQQYFFNLKLKIDPERFARVIEKMPQIVSLNQNVIMKPVQFLLERAIPSEDVATM 385

Query: 1219 VVKCPQLVAQRVELMKNSYYFFKSEMKWSLQELVEFPEYFTYSLESRIKPRYQSLSSKGV 1398
            V+KCPQL+A RV LMKNSYYFFKSEM   L+ELVEFPEYFTYSLESRIKPRY+ L SKG+
Sbjct: 386  VIKCPQLLALRVPLMKNSYYFFKSEMGRPLKELVEFPEYFTYSLESRIKPRYEMLKSKGI 445

Query: 1399 KFSLAWFLNCSDQRFEERLQADYIETESAGPSFYVGGKLELPGSEIV--XXXXXXXXXXX 1572
            + SL WFLNCSD+RFEERL+ DYIE+ES GPSF +GGKLELPG EI+             
Sbjct: 446  RSSLNWFLNCSDKRFEERLEGDYIESESLGPSFCMGGKLELPGCEILSDEEDEIDDDEDE 505

Query: 1573 ILYRRTVSL 1599
            +L+RRTVSL
Sbjct: 506  VLFRRTVSL 514


>ref|XP_003530919.1| PREDICTED: uncharacterized protein LOC100809590 [Glycine max]
          Length = 499

 Score =  701 bits (1810), Expect = 0.0
 Identities = 351/453 (77%), Positives = 388/453 (85%)
 Frame = +1

Query: 241  SDSSKFPEYEMPTVTWGVVQGRKEKLVSRVIICDYLKNLGIVPDELEHLELPSTVQVMKE 420
            S +SK PEYEMP+VTWGV+QGRKEKLVSRVII DYLK LGI+PDEL  LELPSTV VM+E
Sbjct: 48   SSASKLPEYEMPSVTWGVIQGRKEKLVSRVIIFDYLKGLGIIPDELHDLELPSTVDVMRE 107

Query: 421  RVEFLQKIGLTIDDINEYPLMLGCSVRKNIIPVLGYLEKIGIKRSSMGEFVKNYPQXXXX 600
            RVEFLQK+GLT+DDIN YPLMLGCSVRKN+IPVLGYLEKIGI R  +G FVKNYPQ    
Sbjct: 108  RVEFLQKLGLTVDDINNYPLMLGCSVRKNMIPVLGYLEKIGIARPKLGGFVKNYPQVLHA 167

Query: 601  XXXXXXXXXXXXXRGLDVDRQDIGYVLQKYPELLGFKLEGTMSTSVAYLVSIGVSPRDIG 780
                         RGLDV++ DIGYVLQKYPELLGFKLEGTMSTSVAYLVSIGV+PRDIG
Sbjct: 168  SVIVELAPVVKFLRGLDVEKDDIGYVLQKYPELLGFKLEGTMSTSVAYLVSIGVNPRDIG 227

Query: 781  PMVTQYPYLLGMRVGTVIKPIVDYLVSLGIPKKILARMFEKRAYVLGYDLEETIKSNVDC 960
            PMVTQYPYLLGMRVGTVIKP++DYLV LG+PKK+LARM EKRAYVLGYDLEET+K NV+C
Sbjct: 228  PMVTQYPYLLGMRVGTVIKPMIDYLVDLGLPKKVLARMLEKRAYVLGYDLEETVKPNVEC 287

Query: 961  LLSFGIRREALASVIAQYPQILGLPLKAKLSSQQYFFNLKLKIDPDGFARVIEKMPQMVS 1140
            L+SFG+ R+ LAS+IAQYPQILGLPLKAKLS+QQYFF+LKLK+DP+GFARV+E MPQ+VS
Sbjct: 288  LISFGVGRDCLASIIAQYPQILGLPLKAKLSTQQYFFSLKLKVDPEGFARVVENMPQVVS 347

Query: 1141 LNQSVIMKPIEFLRGRTIPAEDVAKMVVKCPQLVAQRVELMKNSYYFFKSEMKWSLQELV 1320
            L+Q VIMKP+EFL GRTIPA+DVA MVVKCPQLVA RVELMKNSYYFFKSEM   LQELV
Sbjct: 348  LHQHVIMKPVEFLLGRTIPAQDVASMVVKCPQLVALRVELMKNSYYFFKSEMGRPLQELV 407

Query: 1321 EFPEYFTYSLESRIKPRYQSLSSKGVKFSLAWFLNCSDQRFEERLQADYIETESAGPSFY 1500
            EFPEYFTYSLESRIKPRYQ L SKG++ SL W LNCSDQRFEERLQ  YIETES GP F 
Sbjct: 408  EFPEYFTYSLESRIKPRYQRLKSKGIRCSLNWMLNCSDQRFEERLQGHYIETESVGPRFC 467

Query: 1501 VGGKLELPGSEIVXXXXXXXXXXXILYRRTVSL 1599
            +GGKLELPG+ +V           +LYRRTVSL
Sbjct: 468  MGGKLELPGNGLV-SDEEEESDDELLYRRTVSL 499


>ref|XP_002881945.1| hypothetical protein ARALYDRAFT_903808 [Arabidopsis lyrata subsp.
            lyrata] gi|297327784|gb|EFH58204.1| hypothetical protein
            ARALYDRAFT_903808 [Arabidopsis lyrata subsp. lyrata]
          Length = 508

 Score =  695 bits (1794), Expect = 0.0
 Identities = 341/506 (67%), Positives = 414/506 (81%)
 Frame = +1

Query: 82   IAFRKNSSIQYLLKKLLIPNVLISKTIDSSQNPQFTNAQSKNAFFNVKVRNFASDSSKFP 261
            +  R+N  +  + ++ LI  +  ++T  +  NP     Q +N     ++ ++A+ SSKFP
Sbjct: 4    LLLRRNKFLALIRRQSLIFPITSTET-KTLINPDPNIPQFQNPCSIFRIAHYATQSSKFP 62

Query: 262  EYEMPTVTWGVVQGRKEKLVSRVIICDYLKNLGIVPDELEHLELPSTVQVMKERVEFLQK 441
            EYEMPTVTWGV+QG+KEKLV+RV ICDYLK LGI+ DELE +ELPST++VM ERVEFLQK
Sbjct: 63   EYEMPTVTWGVIQGKKEKLVNRVKICDYLKGLGIITDELESIELPSTIEVMCERVEFLQK 122

Query: 442  IGLTIDDINEYPLMLGCSVRKNIIPVLGYLEKIGIKRSSMGEFVKNYPQXXXXXXXXXXX 621
            +GLTIDDINEYPLMLGCSVRKN+IPVL YLEKIGI RS +GEFVKNYPQ           
Sbjct: 123  LGLTIDDINEYPLMLGCSVRKNLIPVLAYLEKIGISRSKLGEFVKNYPQVLHASVVVELA 182

Query: 622  XXXXXXRGLDVDRQDIGYVLQKYPELLGFKLEGTMSTSVAYLVSIGVSPRDIGPMVTQYP 801
                  RGLDV++QD+GYVL KYPELLGFKLEGTMSTSVAYLVSIGVSPRDIGPMVTQYP
Sbjct: 183  PVVKFLRGLDVEKQDLGYVLMKYPELLGFKLEGTMSTSVAYLVSIGVSPRDIGPMVTQYP 242

Query: 802  YLLGMRVGTVIKPIVDYLVSLGIPKKILARMFEKRAYVLGYDLEETIKSNVDCLLSFGIR 981
            YLLGMRVGT+IKP+VDYL+S+G+PKKI+ARM EKRAY++GY+LEET+K NVDCL+SFG++
Sbjct: 243  YLLGMRVGTMIKPLVDYLISIGLPKKIVARMLEKRAYIVGYNLEETVKPNVDCLISFGVK 302

Query: 982  REALASVIAQYPQILGLPLKAKLSSQQYFFNLKLKIDPDGFARVIEKMPQMVSLNQSVIM 1161
            +E L  +IAQYPQILGLP+KAK+S+QQYFF+LKLKIDP+GFARV+EKMPQ+VSL Q+VIM
Sbjct: 303  KELLPLLIAQYPQILGLPVKAKMSTQQYFFSLKLKIDPEGFARVVEKMPQIVSLKQNVIM 362

Query: 1162 KPIEFLRGRTIPAEDVAKMVVKCPQLVAQRVELMKNSYYFFKSEMKWSLQELVEFPEYFT 1341
            KPIEFL GR    ED+AKMVV+CPQ++  RVELMKNSYYF+K+EM   ++ELVE+PEYFT
Sbjct: 363  KPIEFLLGRAFQVEDIAKMVVRCPQILCSRVELMKNSYYFYKTEMGRPMKELVEYPEYFT 422

Query: 1342 YSLESRIKPRYQSLSSKGVKFSLAWFLNCSDQRFEERLQADYIETESAGPSFYVGGKLEL 1521
            YSLESRIKPRYQ L SKG++ SL WFLNCSDQRFEERLQ ++I+ ++ GP F +GGKLE+
Sbjct: 423  YSLESRIKPRYQKLQSKGIRSSLNWFLNCSDQRFEERLQGNFIDPDTEGPMFDMGGKLEM 482

Query: 1522 PGSEIVXXXXXXXXXXXILYRRTVSL 1599
            PG EIV           +LYRRT++L
Sbjct: 483  PGGEIVSDEEEDESDDEVLYRRTLTL 508


>ref|NP_566005.1| transcription termination factor domain-containing protein
            [Arabidopsis thaliana] gi|3212859|gb|AAC23410.1|
            expressed protein [Arabidopsis thaliana]
            gi|14532592|gb|AAK64024.1| unknown protein [Arabidopsis
            thaliana] gi|19310761|gb|AAL85111.1| unknown protein
            [Arabidopsis thaliana] gi|330255268|gb|AEC10362.1|
            transcription termination factor domain-containing
            protein [Arabidopsis thaliana]
          Length = 507

 Score =  695 bits (1793), Expect = 0.0
 Identities = 342/508 (67%), Positives = 413/508 (81%)
 Frame = +1

Query: 76   MRIAFRKNSSIQYLLKKLLIPNVLISKTIDSSQNPQFTNAQSKNAFFNVKVRNFASDSSK 255
            M    R+N  +  L ++ LI  +  S    +  NP     Q +N     ++ ++A+ SSK
Sbjct: 1    MSYLLRRNKFVALLKRQSLIFPIT-STEAKTLINPDPNIPQFQNPCSIFRIAHYATQSSK 59

Query: 256  FPEYEMPTVTWGVVQGRKEKLVSRVIICDYLKNLGIVPDELEHLELPSTVQVMKERVEFL 435
            FPEYEMPTVTWGV+QG+KEKLV+RV ICDYLK LGI+ DELE +ELPST++VM ERVEFL
Sbjct: 60   FPEYEMPTVTWGVIQGKKEKLVNRVKICDYLKGLGIITDELESIELPSTIEVMCERVEFL 119

Query: 436  QKIGLTIDDINEYPLMLGCSVRKNIIPVLGYLEKIGIKRSSMGEFVKNYPQXXXXXXXXX 615
            QK+GLTIDDINEYPLMLGCSVRKN+IPVL YLEKIGI RS +GEFVKNYPQ         
Sbjct: 120  QKLGLTIDDINEYPLMLGCSVRKNLIPVLAYLEKIGISRSKLGEFVKNYPQVLHASVVVE 179

Query: 616  XXXXXXXXRGLDVDRQDIGYVLQKYPELLGFKLEGTMSTSVAYLVSIGVSPRDIGPMVTQ 795
                    RGLDV++QD+GYVL KYPELLGFKLEGTMSTSVAYLVSIGVSPRDIGPMVTQ
Sbjct: 180  LAPVVKFLRGLDVEKQDLGYVLMKYPELLGFKLEGTMSTSVAYLVSIGVSPRDIGPMVTQ 239

Query: 796  YPYLLGMRVGTVIKPIVDYLVSLGIPKKILARMFEKRAYVLGYDLEETIKSNVDCLLSFG 975
            YPYLLGMRVGT+IKP+VDYL+S+G+PKKI+ARM EKR+Y++GY+LEET+K NVDCL+SFG
Sbjct: 240  YPYLLGMRVGTMIKPLVDYLISIGLPKKIVARMLEKRSYIVGYNLEETVKPNVDCLISFG 299

Query: 976  IRREALASVIAQYPQILGLPLKAKLSSQQYFFNLKLKIDPDGFARVIEKMPQMVSLNQSV 1155
            +++E L  +IAQYPQILGLP+KAK+S+QQYFF+LKLKIDP+GFARV+EKMPQ+VSL Q+V
Sbjct: 300  VKKELLPLLIAQYPQILGLPVKAKMSTQQYFFSLKLKIDPEGFARVVEKMPQIVSLKQNV 359

Query: 1156 IMKPIEFLRGRTIPAEDVAKMVVKCPQLVAQRVELMKNSYYFFKSEMKWSLQELVEFPEY 1335
            IMKPIEFL GR    ED+AKMVV+CPQ++  RVELMKNSYYF+K+EM   ++ELVE+PEY
Sbjct: 360  IMKPIEFLLGRAFQVEDIAKMVVRCPQILCSRVELMKNSYYFYKTEMGRPMKELVEYPEY 419

Query: 1336 FTYSLESRIKPRYQSLSSKGVKFSLAWFLNCSDQRFEERLQADYIETESAGPSFYVGGKL 1515
            FTYSLESRIKPRYQ L SKG++ SL WFLNCSDQRFEERLQ ++I+ ++ GP+F +GGKL
Sbjct: 420  FTYSLESRIKPRYQKLQSKGIRSSLNWFLNCSDQRFEERLQGNFIDPDTEGPTFDMGGKL 479

Query: 1516 ELPGSEIVXXXXXXXXXXXILYRRTVSL 1599
            E+PG EIV           +LYRRT++L
Sbjct: 480  EMPGGEIVTDEEEDESDDEVLYRRTLTL 507


Top