BLASTX nr result

ID: Atractylodes21_contig00019212 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes21_contig00019212
         (1838 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002279655.1| PREDICTED: uncharacterized protein LOC100255...   729   0.0  
ref|NP_566005.1| transcription termination factor domain-contain...   713   0.0  
ref|XP_002881945.1| hypothetical protein ARALYDRAFT_903808 [Arab...   713   0.0  
ref|XP_002327325.1| predicted protein [Populus trichocarpa] gi|2...   708   0.0  
ref|XP_003530919.1| PREDICTED: uncharacterized protein LOC100809...   704   0.0  

>ref|XP_002279655.1| PREDICTED: uncharacterized protein LOC100255927 [Vitis vinifera]
          Length = 481

 Score =  729 bits (1881), Expect = 0.0
 Identities = 369/466 (79%), Positives = 407/466 (87%), Gaps = 4/466 (0%)
 Frame = -3

Query: 1809 SNPRVLS----FSTQSSKFPEYEMPTVTWGVVQGRKEKLVSRVIICDYLKSIGIIPDELE 1642
            SNPR L     F + SSKFPEYEMP+VTWGVV GRKE+LVSRVII DYLK++GIIPDELE
Sbjct: 17   SNPRTLRPFLRFLSSSSKFPEYEMPSVTWGVVLGRKERLVSRVIISDYLKTLGIIPDELE 76

Query: 1641 DLELPSTVDVMRERVEFLQKIGLTIDDINEYPLMLGCSVRKNIIPVLGYLEKVGVQRSKM 1462
             +ELPSTV+VMRERVEFLQK+G+TID +NEYPLMLGCSVRKN+IPVLGYLEK+G+ RSK+
Sbjct: 77   QVELPSTVEVMRERVEFLQKLGVTIDHLNEYPLMLGCSVRKNMIPVLGYLEKIGIPRSKL 136

Query: 1461 GEFVKKYPQXXXXXXXXXXXXXXXXLRGLDVERQDIGYVLMKYPELLGFKLEGTMSTSVA 1282
            GEFV  YPQ                LRGLDV++QDIGYVLMKYPELLGFKLEGTMSTSVA
Sbjct: 137  GEFVVNYPQVLHASVVVELAPVVKFLRGLDVDKQDIGYVLMKYPELLGFKLEGTMSTSVA 196

Query: 1281 YLVSIGVNPRDIGPMVTQYPYFLGMRVGTMIKPLVEYLISLGLPKKVLARMFEKRAYLVG 1102
            YLVSIGV+PRDIGPMVTQYPYFLGMRVGT+IKP+V+YL+SLGLPKKVLARMFEKRAY++G
Sbjct: 197  YLVSIGVSPRDIGPMVTQYPYFLGMRVGTVIKPIVDYLVSLGLPKKVLARMFEKRAYVLG 256

Query: 1101 YDLEEMVKPNVDCLVSFGIRREAIASIIAQYPQILGLPLKAKLSTQQYFFNLKLKIDPEG 922
            YDLEE +KPNVDCLVSFGIRREA+AS+IAQ+PQILGLPLKAKLS+QQYFFNLKLKIDP+G
Sbjct: 257  YDLEECIKPNVDCLVSFGIRREALASVIAQFPQILGLPLKAKLSSQQYFFNLKLKIDPDG 316

Query: 921  FARVIERMPQVVSLGQKVIMKPVEFLLGRGISAEDVAKMIVRCPQLVALQVGIMKNSYYF 742
            FARVIERMPQ+VSL Q VIMKPVEFLLGRGI A DVAKM+V+CPQLVAL+V +MKN YYF
Sbjct: 317  FARVIERMPQIVSLNQNVIMKPVEFLLGRGIPAVDVAKMVVKCPQLVALRVELMKNGYYF 376

Query: 741  FKSEMGRPVKELVEFPEYFTYGLESRIKPRYQRLQHKGIRSSLSWFLNCSDQRFEERLYA 562
            FKSEMGR VKELVEFPEYFTY LESRIKPRYQRLQ KG+RSSL WFLNCSDQRFEERL A
Sbjct: 377  FKSEMGRQVKELVEFPEYFTYSLESRIKPRYQRLQSKGVRSSLDWFLNCSDQRFEERLQA 436

Query: 561  DYIETEIEGPSFVMGGKLELPGGKQMVXXXXXXXXXEILYRRTVSL 424
            DYIE E  GPSF MGGKL+LP G ++V         E LYRRTVSL
Sbjct: 437  DYIEMETIGPSFCMGGKLQLP-GNEVVSDEEDESDDEELYRRTVSL 481


>ref|NP_566005.1| transcription termination factor domain-containing protein
            [Arabidopsis thaliana] gi|3212859|gb|AAC23410.1|
            expressed protein [Arabidopsis thaliana]
            gi|14532592|gb|AAK64024.1| unknown protein [Arabidopsis
            thaliana] gi|19310761|gb|AAL85111.1| unknown protein
            [Arabidopsis thaliana] gi|330255268|gb|AEC10362.1|
            transcription termination factor domain-containing
            protein [Arabidopsis thaliana]
          Length = 507

 Score =  713 bits (1841), Expect = 0.0
 Identities = 344/469 (73%), Positives = 405/469 (86%), Gaps = 4/469 (0%)
 Frame = -3

Query: 1818 PQFSNP----RVLSFSTQSSKFPEYEMPTVTWGVVQGRKEKLVSRVIICDYLKSIGIIPD 1651
            PQF NP    R+  ++TQSSKFPEYEMPTVTWGV+QG+KEKLV+RV ICDYLK +GII D
Sbjct: 39   PQFQNPCSIFRIAHYATQSSKFPEYEMPTVTWGVIQGKKEKLVNRVKICDYLKGLGIITD 98

Query: 1650 ELEDLELPSTVDVMRERVEFLQKIGLTIDDINEYPLMLGCSVRKNIIPVLGYLEKVGVQR 1471
            ELE +ELPST++VM ERVEFLQK+GLTIDDINEYPLMLGCSVRKN+IPVL YLEK+G+ R
Sbjct: 99   ELESIELPSTIEVMCERVEFLQKLGLTIDDINEYPLMLGCSVRKNLIPVLAYLEKIGISR 158

Query: 1470 SKMGEFVKKYPQXXXXXXXXXXXXXXXXLRGLDVERQDIGYVLMKYPELLGFKLEGTMST 1291
            SK+GEFVK YPQ                LRGLDVE+QD+GYVLMKYPELLGFKLEGTMST
Sbjct: 159  SKLGEFVKNYPQVLHASVVVELAPVVKFLRGLDVEKQDLGYVLMKYPELLGFKLEGTMST 218

Query: 1290 SVAYLVSIGVNPRDIGPMVTQYPYFLGMRVGTMIKPLVEYLISLGLPKKVLARMFEKRAY 1111
            SVAYLVSIGV+PRDIGPMVTQYPY LGMRVGTMIKPLV+YLIS+GLPKK++ARM EKR+Y
Sbjct: 219  SVAYLVSIGVSPRDIGPMVTQYPYLLGMRVGTMIKPLVDYLISIGLPKKIVARMLEKRSY 278

Query: 1110 LVGYDLEEMVKPNVDCLVSFGIRREAIASIIAQYPQILGLPLKAKLSTQQYFFNLKLKID 931
            +VGY+LEE VKPNVDCL+SFG+++E +  +IAQYPQILGLP+KAK+STQQYFF+LKLKID
Sbjct: 279  IVGYNLEETVKPNVDCLISFGVKKELLPLLIAQYPQILGLPVKAKMSTQQYFFSLKLKID 338

Query: 930  PEGFARVIERMPQVVSLGQKVIMKPVEFLLGRGISAEDVAKMIVRCPQLVALQVGIMKNS 751
            PEGFARV+E+MPQ+VSL Q VIMKP+EFLLGR    ED+AKM+VRCPQ++  +V +MKNS
Sbjct: 339  PEGFARVVEKMPQIVSLKQNVIMKPIEFLLGRAFQVEDIAKMVVRCPQILCSRVELMKNS 398

Query: 750  YYFFKSEMGRPVKELVEFPEYFTYGLESRIKPRYQRLQHKGIRSSLSWFLNCSDQRFEER 571
            YYF+K+EMGRP+KELVE+PEYFTY LESRIKPRYQ+LQ KGIRSSL+WFLNCSDQRFEER
Sbjct: 399  YYFYKTEMGRPMKELVEYPEYFTYSLESRIKPRYQKLQSKGIRSSLNWFLNCSDQRFEER 458

Query: 570  LYADYIETEIEGPSFVMGGKLELPGGKQMVXXXXXXXXXEILYRRTVSL 424
            L  ++I+ + EGP+F MGGKLE+PGG+ +          E+LYRRT++L
Sbjct: 459  LQGNFIDPDTEGPTFDMGGKLEMPGGEIVTDEEEDESDDEVLYRRTLTL 507


>ref|XP_002881945.1| hypothetical protein ARALYDRAFT_903808 [Arabidopsis lyrata subsp.
            lyrata] gi|297327784|gb|EFH58204.1| hypothetical protein
            ARALYDRAFT_903808 [Arabidopsis lyrata subsp. lyrata]
          Length = 508

 Score =  713 bits (1840), Expect = 0.0
 Identities = 345/469 (73%), Positives = 404/469 (86%), Gaps = 4/469 (0%)
 Frame = -3

Query: 1818 PQFSNP----RVLSFSTQSSKFPEYEMPTVTWGVVQGRKEKLVSRVIICDYLKSIGIIPD 1651
            PQF NP    R+  ++TQSSKFPEYEMPTVTWGV+QG+KEKLV+RV ICDYLK +GII D
Sbjct: 40   PQFQNPCSIFRIAHYATQSSKFPEYEMPTVTWGVIQGKKEKLVNRVKICDYLKGLGIITD 99

Query: 1650 ELEDLELPSTVDVMRERVEFLQKIGLTIDDINEYPLMLGCSVRKNIIPVLGYLEKVGVQR 1471
            ELE +ELPST++VM ERVEFLQK+GLTIDDINEYPLMLGCSVRKN+IPVL YLEK+G+ R
Sbjct: 100  ELESIELPSTIEVMCERVEFLQKLGLTIDDINEYPLMLGCSVRKNLIPVLAYLEKIGISR 159

Query: 1470 SKMGEFVKKYPQXXXXXXXXXXXXXXXXLRGLDVERQDIGYVLMKYPELLGFKLEGTMST 1291
            SK+GEFVK YPQ                LRGLDVE+QD+GYVLMKYPELLGFKLEGTMST
Sbjct: 160  SKLGEFVKNYPQVLHASVVVELAPVVKFLRGLDVEKQDLGYVLMKYPELLGFKLEGTMST 219

Query: 1290 SVAYLVSIGVNPRDIGPMVTQYPYFLGMRVGTMIKPLVEYLISLGLPKKVLARMFEKRAY 1111
            SVAYLVSIGV+PRDIGPMVTQYPY LGMRVGTMIKPLV+YLIS+GLPKK++ARM EKRAY
Sbjct: 220  SVAYLVSIGVSPRDIGPMVTQYPYLLGMRVGTMIKPLVDYLISIGLPKKIVARMLEKRAY 279

Query: 1110 LVGYDLEEMVKPNVDCLVSFGIRREAIASIIAQYPQILGLPLKAKLSTQQYFFNLKLKID 931
            +VGY+LEE VKPNVDCL+SFG+++E +  +IAQYPQILGLP+KAK+STQQYFF+LKLKID
Sbjct: 280  IVGYNLEETVKPNVDCLISFGVKKELLPLLIAQYPQILGLPVKAKMSTQQYFFSLKLKID 339

Query: 930  PEGFARVIERMPQVVSLGQKVIMKPVEFLLGRGISAEDVAKMIVRCPQLVALQVGIMKNS 751
            PEGFARV+E+MPQ+VSL Q VIMKP+EFLLGR    ED+AKM+VRCPQ++  +V +MKNS
Sbjct: 340  PEGFARVVEKMPQIVSLKQNVIMKPIEFLLGRAFQVEDIAKMVVRCPQILCSRVELMKNS 399

Query: 750  YYFFKSEMGRPVKELVEFPEYFTYGLESRIKPRYQRLQHKGIRSSLSWFLNCSDQRFEER 571
            YYF+K+EMGRP+KELVE+PEYFTY LESRIKPRYQ+LQ KGIRSSL+WFLNCSDQRFEER
Sbjct: 400  YYFYKTEMGRPMKELVEYPEYFTYSLESRIKPRYQKLQSKGIRSSLNWFLNCSDQRFEER 459

Query: 570  LYADYIETEIEGPSFVMGGKLELPGGKQMVXXXXXXXXXEILYRRTVSL 424
            L  ++I+ + EGP F MGGKLE+PGG+ +          E+LYRRT++L
Sbjct: 460  LQGNFIDPDTEGPMFDMGGKLEMPGGEIVSDEEEDESDDEVLYRRTLTL 508


>ref|XP_002327325.1| predicted protein [Populus trichocarpa] gi|222835695|gb|EEE74130.1|
            predicted protein [Populus trichocarpa]
          Length = 514

 Score =  708 bits (1827), Expect = 0.0
 Identities = 359/474 (75%), Positives = 405/474 (85%), Gaps = 7/474 (1%)
 Frame = -3

Query: 1824 QNPQFSNPR-VLSF----STQSSKFPEYEMPTVTWGVVQGRKEKLVSRVIICDYLKSIGI 1660
            QNP   NP  VL F    STQ+SKF EYEMP+VTWGVVQG+KEKLV+RVIICDYLK +GI
Sbjct: 41   QNPLTQNPLGVLQFYALFSTQASKFHEYEMPSVTWGVVQGKKEKLVNRVIICDYLKGLGI 100

Query: 1659 IPDELEDLELPSTVDVMRERVEFLQKIGLTIDDINEYPLMLGCSVRKNIIPVLGYLEKVG 1480
            IPDELE LELPSTV+VM+ERVEFLQ++GLTIDDINEYPLMLGCSVRKNIIPVLGYLEK+G
Sbjct: 101  IPDELESLELPSTVEVMKERVEFLQRMGLTIDDINEYPLMLGCSVRKNIIPVLGYLEKIG 160

Query: 1479 VQRSKMGEFVKKYPQXXXXXXXXXXXXXXXXLRGLDVERQDIGYVLMKYPELLGFKLEGT 1300
            + RSK+GEFVK YPQ                LRGLDV++ DIGYVL KYPELLGFKLEGT
Sbjct: 161  ISRSKLGEFVKSYPQVLHASVVVELQPVIKFLRGLDVDKLDIGYVLQKYPELLGFKLEGT 220

Query: 1299 MSTSVAYLVSIGVNPRDIGPMVTQYPYFLGMRVGTMIKPLVEYLISLGLPKKVLARMFEK 1120
            MSTSVAYLVSIGV+PRDIGPMVTQYPY LGMRVGTMIKPLV+YL+SLGLPKK++ARM EK
Sbjct: 221  MSTSVAYLVSIGVSPRDIGPMVTQYPYLLGMRVGTMIKPLVDYLVSLGLPKKIVARMLEK 280

Query: 1119 RAYLVGYDLEEMVKPNVDCLVSFGIRREAIASIIAQYPQILGLPLKAKLSTQQYFFNLKL 940
            R Y++GYDL+E VKPNVDCL+SFGIRRE +ASI+AQYP ILGLPLKAKLS+QQYFFNLKL
Sbjct: 281  RPYVLGYDLQETVKPNVDCLISFGIRREVLASIVAQYPPILGLPLKAKLSSQQYFFNLKL 340

Query: 939  KIDPEGFARVIERMPQVVSLGQKVIMKPVEFLLGRGISAEDVAKMIVRCPQLVALQVGIM 760
            KIDPE FARVIE+MPQ+VSL Q VIMKPV+FLL R I +EDVA M+++CPQL+AL+V +M
Sbjct: 341  KIDPERFARVIEKMPQIVSLNQNVIMKPVQFLLERAIPSEDVATMVIKCPQLLALRVPLM 400

Query: 759  KNSYYFFKSEMGRPVKELVEFPEYFTYGLESRIKPRYQRLQHKGIRSSLSWFLNCSDQRF 580
            KNSYYFFKSEMGRP+KELVEFPEYFTY LESRIKPRY+ L+ KGIRSSL+WFLNCSD+RF
Sbjct: 401  KNSYYFFKSEMGRPLKELVEFPEYFTYSLESRIKPRYEMLKSKGIRSSLNWFLNCSDKRF 460

Query: 579  EERLYADYIETEIEGPSFVMGGKLELPGGKQM--VXXXXXXXXXEILYRRTVSL 424
            EERL  DYIE+E  GPSF MGGKLELPG + +            E+L+RRTVSL
Sbjct: 461  EERLEGDYIESESLGPSFCMGGKLELPGCEILSDEEDEIDDDEDEVLFRRTVSL 514


>ref|XP_003530919.1| PREDICTED: uncharacterized protein LOC100809590 [Glycine max]
          Length = 499

 Score =  704 bits (1816), Expect = 0.0
 Identities = 358/471 (76%), Positives = 396/471 (84%), Gaps = 5/471 (1%)
 Frame = -3

Query: 1821 NPQFSNPRVLS---FSTQSS--KFPEYEMPTVTWGVVQGRKEKLVSRVIICDYLKSIGII 1657
            NP    P+ L    + TQSS  K PEYEMP+VTWGV+QGRKEKLVSRVII DYLK +GII
Sbjct: 30   NPFTKIPKTLFRVYYGTQSSASKLPEYEMPSVTWGVIQGRKEKLVSRVIIFDYLKGLGII 89

Query: 1656 PDELEDLELPSTVDVMRERVEFLQKIGLTIDDINEYPLMLGCSVRKNIIPVLGYLEKVGV 1477
            PDEL DLELPSTVDVMRERVEFLQK+GLT+DDIN YPLMLGCSVRKN+IPVLGYLEK+G+
Sbjct: 90   PDELHDLELPSTVDVMRERVEFLQKLGLTVDDINNYPLMLGCSVRKNMIPVLGYLEKIGI 149

Query: 1476 QRSKMGEFVKKYPQXXXXXXXXXXXXXXXXLRGLDVERQDIGYVLMKYPELLGFKLEGTM 1297
             R K+G FVK YPQ                LRGLDVE+ DIGYVL KYPELLGFKLEGTM
Sbjct: 150  ARPKLGGFVKNYPQVLHASVIVELAPVVKFLRGLDVEKDDIGYVLQKYPELLGFKLEGTM 209

Query: 1296 STSVAYLVSIGVNPRDIGPMVTQYPYFLGMRVGTMIKPLVEYLISLGLPKKVLARMFEKR 1117
            STSVAYLVSIGVNPRDIGPMVTQYPY LGMRVGT+IKP+++YL+ LGLPKKVLARM EKR
Sbjct: 210  STSVAYLVSIGVNPRDIGPMVTQYPYLLGMRVGTVIKPMIDYLVDLGLPKKVLARMLEKR 269

Query: 1116 AYLVGYDLEEMVKPNVDCLVSFGIRREAIASIIAQYPQILGLPLKAKLSTQQYFFNLKLK 937
            AY++GYDLEE VKPNV+CL+SFG+ R+ +ASIIAQYPQILGLPLKAKLSTQQYFF+LKLK
Sbjct: 270  AYVLGYDLEETVKPNVECLISFGVGRDCLASIIAQYPQILGLPLKAKLSTQQYFFSLKLK 329

Query: 936  IDPEGFARVIERMPQVVSLGQKVIMKPVEFLLGRGISAEDVAKMIVRCPQLVALQVGIMK 757
            +DPEGFARV+E MPQVVSL Q VIMKPVEFLLGR I A+DVA M+V+CPQLVAL+V +MK
Sbjct: 330  VDPEGFARVVENMPQVVSLHQHVIMKPVEFLLGRTIPAQDVASMVVKCPQLVALRVELMK 389

Query: 756  NSYYFFKSEMGRPVKELVEFPEYFTYGLESRIKPRYQRLQHKGIRSSLSWFLNCSDQRFE 577
            NSYYFFKSEMGRP++ELVEFPEYFTY LESRIKPRYQRL+ KGIR SL+W LNCSDQRFE
Sbjct: 390  NSYYFFKSEMGRPLQELVEFPEYFTYSLESRIKPRYQRLKSKGIRCSLNWMLNCSDQRFE 449

Query: 576  ERLYADYIETEIEGPSFVMGGKLELPGGKQMVXXXXXXXXXEILYRRTVSL 424
            ERL   YIETE  GP F MGGKLELP G  +V         E+LYRRTVSL
Sbjct: 450  ERLQGHYIETESVGPRFCMGGKLELP-GNGLVSDEEEESDDELLYRRTVSL 499

