BLASTX nr result
ID: Atractylodes22_contig00028681
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Atractylodes22_contig00028681 (1701 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002273611.1| PREDICTED: DUF21 domain-containing protein A... 493 e-137 ref|XP_002327610.1| predicted protein [Populus trichocarpa] gi|2... 473 e-131 ref|XP_004140829.1| PREDICTED: DUF21 domain-containing protein A... 448 e-123 ref|NP_200091.2| CBS domain-containing protein with a domain of ... 439 e-121 dbj|BAA98097.1| unnamed protein product [Arabidopsis thaliana] 439 e-121 >ref|XP_002273611.1| PREDICTED: DUF21 domain-containing protein At5g52790 [Vitis vinifera] gi|302143780|emb|CBI22641.3| unnamed protein product [Vitis vinifera] Length = 540 Score = 493 bits (1270), Expect = e-137 Identities = 269/470 (57%), Positives = 336/470 (71%), Gaps = 29/470 (6%) Frame = +3 Query: 255 MAANDVPCCETMFWVYLASAIALVAFAXXXXXXXXXXXXXXXXXXEVLIKAGQPNDRKNA 434 MAA+DVPCCETMFW+YL +ALV+FA EVL KAG+P DR+NA Sbjct: 1 MAASDVPCCETMFWIYLVICVALVSFAGLMSGLTLGLMSLSLVDLEVLAKAGRPQDRRNA 60 Query: 435 EKIMPIVKNHHLLLCTLLICNAIAMEALPIFLDSILLPWTAILISVTLVVAFGEIIPQAV 614 EKI+PIVKN HLLLCTLLI N++AMEALPIFLD+++ W AILISVTL++AFGEIIPQAV Sbjct: 61 EKILPIVKNQHLLLCTLLIGNSLAMEALPIFLDALVPAWGAILISVTLILAFGEIIPQAV 120 Query: 615 CSRYGLAIGAKXXXXXXXXXXXXFPIAYPLSKLLDLILGKGHSVLLRRAELKTLVDMHGN 794 CS+YGL++GAK FPI+YP+SKLLD +LGKGHS LLRRAELKTLVDMHGN Sbjct: 121 CSQYGLSVGAKLSVVVRLLVLVLFPISYPISKLLDWLLGKGHSALLRRAELKTLVDMHGN 180 Query: 795 KAGKGGELTNDEITIISGALDLAQKTVKDAMTLISEIFSLELNSKLNEDTMSLLLSRGHS 974 +AG+GGELT+DE TIISG LD+ QKT KDAMT ISEIFSL++N++L+EDTMSL+L+RGHS Sbjct: 181 EAGRGGELTHDETTIISGVLDMTQKTAKDAMTPISEIFSLDINTRLDEDTMSLILNRGHS 240 Query: 975 RVPVYLGRSENIIGLILVKRLIKYRPEDEVPIKNLSVQKIPRIHESLPLYEMLNLFQKGQ 1154 R+PV+ G NIIGLILVK LIK R EDE PI+NL++++IPR+++ LPLY++LN FQKG Sbjct: 241 RIPVFSGSLTNIIGLILVKNLIKCRAEDETPIRNLTIRRIPRVYDCLPLYDILNQFQKGH 300 Query: 1155 SHMAVVVRSKSALNDAAKRTTAKHEIVKTHINSNLTQILVDNRG------QNTKVSIYRS 1316 SHMAVVV+ + + + K + NSN Q N+G + +++I R+ Sbjct: 301 SHMAVVVKCRKDVKTNTENANTKPCTFAIN-NSNSRQRQAKNKGVDNQFCPSVQLNISRN 359 Query: 1317 SSDPA-----------SQNSAP--------DQNIANCGLDSFPNPDEEIIGIITMEDVLE 1439 S + + ++P D N+ + L+S PN DEE+IGIITMEDV+E Sbjct: 360 VSSESKNPTLKKMMEQGKGASPRLKKWGSGDGNVTDEDLESLPNLDEEVIGIITMEDVME 419 Query: 1440 ELLQEPILDERNEYIDVDNIMKINMLP---SSLRSGA-ASASHLDWKTLV 1577 ELLQE ILDE +EYIDV N +KINMLP SS RS A A ASHL W++ V Sbjct: 420 ELLQEEILDETDEYIDVHNKIKINMLPSRRSSSRSPAVALASHLHWRSPV 469 >ref|XP_002327610.1| predicted protein [Populus trichocarpa] gi|222836164|gb|EEE74585.1| predicted protein [Populus trichocarpa] Length = 446 Score = 473 bits (1218), Expect = e-131 Identities = 251/434 (57%), Positives = 305/434 (70%), Gaps = 1/434 (0%) Frame = +3 Query: 255 MAANDVPCCETMFWVYLASAIALVAFAXXXXXXXXXXXXXXXXXXEVLIKAGQPNDRKNA 434 MAANDVPCCE MFW YL +ALV+FA EVLIKAGQP +RKNA Sbjct: 1 MAANDVPCCEPMFWTYLIICMALVSFAGLMSGLTLGLMSLTVVDLEVLIKAGQPQERKNA 60 Query: 435 EKIMPIVKNHHLLLCTLLICNAIAMEALPIFLDSILLPWTAILISVTLVVAFGEIIPQAV 614 EKI+PIVKN HLLLCTLLI NA+AMEALPIFLD++L W AILISVTL++ FGEIIPQAV Sbjct: 61 EKILPIVKNQHLLLCTLLIGNALAMEALPIFLDALLPAWGAILISVTLILTFGEIIPQAV 120 Query: 615 CSRYGLAIGAKXXXXXXXXXXXXFPIAYPLSKLLDLILGKGHSVLLRRAELKTLVDMHGN 794 CSRYGL+IGAK FP+AYP+SKLLD ILG+ HS LLRRAELKTLVDMHGN Sbjct: 121 CSRYGLSIGAKLSIVVRFIVIVLFPLAYPISKLLDWILGEKHSALLRRAELKTLVDMHGN 180 Query: 795 KAGKGGELTNDEITIISGALDLAQKTVKDAMTLISEIFSLELNSKLNEDTMSLLLSRGHS 974 +AGKGGELT+DE TII+GALDL QKT KDAMT ISE FSL++N KL+E TM L++ +GHS Sbjct: 181 EAGKGGELTHDETTIITGALDLTQKTAKDAMTPISETFSLDINCKLDEKTMGLIIRKGHS 240 Query: 975 RVPVYLGRSENIIGLILVKRLIKYRPEDEVPIKNLSVQKIPRIHESLPLYEMLNLFQKGQ 1154 RVP+Y G NIIGLILVK LI+ RPEDE PI++L++++IPR+ + LPLY+++N FQKG Sbjct: 241 RVPIYTGNPTNIIGLILVKNLIRCRPEDETPIRDLTIRRIPRVPDLLPLYDIMNQFQKGH 300 Query: 1155 SHMAVVVRSKSALNDAAKRTTAKHEIVKTHINSNLTQILVDNRGQNTKVSIYRSSSDPAS 1334 SHMAVVV+SK+ N+ A++ K I H P Sbjct: 301 SHMAVVVKSKNDANETAQKANYKPTIDNLH---------------------------PKL 333 Query: 1335 QNSAPDQ-NIANCGLDSFPNPDEEIIGIITMEDVLEELLQEPILDERNEYIDVDNIMKIN 1511 QN N+++ L+ DEE+IG+IT+EDV+EEL+QE ILDE +EY+DV N + IN Sbjct: 334 QNQEHQHGNLSHEELEFLSASDEEVIGVITLEDVMEELIQEEILDETDEYVDVHNKITIN 393 Query: 1512 MLPSSLRSGAASAS 1553 M+P GA +AS Sbjct: 394 MIPPRRSPGAGTAS 407 >ref|XP_004140829.1| PREDICTED: DUF21 domain-containing protein At5g52790-like [Cucumis sativus] Length = 490 Score = 448 bits (1153), Expect = e-123 Identities = 234/441 (53%), Positives = 299/441 (67%) Frame = +3 Query: 255 MAANDVPCCETMFWVYLASAIALVAFAXXXXXXXXXXXXXXXXXXEVLIKAGQPNDRKNA 434 MAANDVPCCE FW+YL + LVAFA EVL+K+G+P+DRKNA Sbjct: 1 MAANDVPCCEPRFWMYLLICVGLVAFAGLMSGLTLGLMSLSLVDLEVLVKSGRPDDRKNA 60 Query: 435 EKIMPIVKNHHLLLCTLLICNAIAMEALPIFLDSILLPWTAILISVTLVVAFGEIIPQAV 614 KI+PIVKN HLLLCTLLI NA+AMEALPIF+D++L W AI+ISVTL++ FGEIIPQA+ Sbjct: 61 AKILPIVKNQHLLLCTLLISNAMAMEALPIFIDALLPAWGAIVISVTLILTFGEIIPQAI 120 Query: 615 CSRYGLAIGAKXXXXXXXXXXXXFPIAYPLSKLLDLILGKGHSVLLRRAELKTLVDMHGN 794 CSRYGL++GAK FP++YP+SKLLD +LGKGH LLRRAELKT VDMHGN Sbjct: 121 CSRYGLSVGAKLSVVVRVLVLVLFPLSYPISKLLDWLLGKGHFALLRRAELKTFVDMHGN 180 Query: 795 KAGKGGELTNDEITIISGALDLAQKTVKDAMTLISEIFSLELNSKLNEDTMSLLLSRGHS 974 KAGKGGELT +E TII+GALD+ KT KDAMT ++++FSL++NSKL+E TM L+L +GHS Sbjct: 181 KAGKGGELTQEETTIITGALDMTLKTAKDAMTPLAKLFSLDINSKLDEKTMELILRKGHS 240 Query: 975 RVPVYLGRSENIIGLILVKRLIKYRPEDEVPIKNLSVQKIPRIHESLPLYEMLNLFQKGQ 1154 RVP+Y G NIIG+ILVK LIK+ PEDE PI+NL+++K+PR+ E+LPLY++LN FQ+G Sbjct: 241 RVPIYSGYPTNIIGIILVKNLIKFHPEDETPIRNLTIRKVPRVRENLPLYDILNEFQQGH 300 Query: 1155 SHMAVVVRSKSALNDAAKRTTAKHEIVKTHINSNLTQILVDNRGQNTKVSIYRSSSDPAS 1334 SHMAVV++S H K +SN ++ ++ + Sbjct: 301 SHMAVVIKS--------------HNEAKRPADSNKPELETATPVTEMELGHIKLQIGNIC 346 Query: 1335 QNSAPDQNIANCGLDSFPNPDEEIIGIITMEDVLEELLQEPILDERNEYIDVDNIMKINM 1514 N D + S P+ DE +IGIIT+EDV+EELLQE ILDE +EY+ V N +K+NM Sbjct: 347 SNGDTDTD-----GKSMPDFDENVIGIITLEDVMEELLQEEILDETDEYVAVHNKLKVNM 401 Query: 1515 LPSSLRSGAASASHLDWKTLV 1577 S + L W + V Sbjct: 402 EVRRSTSESPGGPRLQWMSPV 422 >ref|NP_200091.2| CBS domain-containing protein with a domain of unknown function (DUF21) [Arabidopsis thaliana] gi|342179476|sp|Q9LTD8.2|Y5279_ARATH RecName: Full=DUF21 domain-containing protein At5g52790; AltName: Full=CBS domain-containing protein CBSDUF5 gi|332008877|gb|AED96260.1| CBS domain-containing protein with a domain of unknown function (DUF21) [Arabidopsis thaliana] Length = 500 Score = 439 bits (1130), Expect = e-121 Identities = 236/420 (56%), Positives = 297/420 (70%) Frame = +3 Query: 255 MAANDVPCCETMFWVYLASAIALVAFAXXXXXXXXXXXXXXXXXXEVLIKAGQPNDRKNA 434 MAANDVPCCETMFWVYL +ALV FA EV+IKAG+P+DRKNA Sbjct: 1 MAANDVPCCETMFWVYLLVCVALVVFAGLMSGLTLGLMSLSIVELEVMIKAGEPHDRKNA 60 Query: 435 EKIMPIVKNHHLLLCTLLICNAIAMEALPIFLDSILLPWTAILISVTLVVAFGEIIPQAV 614 EKI+P+VKN HLLLCTLLI NA+AMEALPIF+DS+L W AILISVTL++AFGEIIPQAV Sbjct: 61 EKILPLVKNQHLLLCTLLIGNALAMEALPIFVDSLLPAWGAILISVTLILAFGEIIPQAV 120 Query: 615 CSRYGLAIGAKXXXXXXXXXXXXFPIAYPLSKLLDLILGKGHSVLLRRAELKTLVDMHGN 794 CSRYGL+IGAK FP++YP+SKLLDL+LGK HS LL RAELK+LV MHGN Sbjct: 121 CSRYGLSIGAKLSFLVRLIIIVFFPLSYPISKLLDLLLGKRHSTLLGRAELKSLVYMHGN 180 Query: 795 KAGKGGELTNDEITIISGALDLAQKTVKDAMTLISEIFSLELNSKLNEDTMSLLLSRGHS 974 +AGKGGELT+DE TIISGALD++QK+ KDAMT +S+IFSL++N KL+E TM L+ S GHS Sbjct: 181 EAGKGGELTHDETTIISGALDMSQKSAKDAMTPVSQIFSLDINFKLDEKTMGLIASAGHS 240 Query: 975 RVPVYLGRSENIIGLILVKRLIKYRPEDEVPIKNLSVQKIPRIHESLPLYEMLNLFQKGQ 1154 R+P+Y IIG ILVK LIK RPEDE I++L ++++P++ +LPLY++LN+FQ G+ Sbjct: 241 RIPIYSVNPNVIIGFILVKNLIKVRPEDETSIRDLPIRRMPKVDLNLPLYDILNIFQTGR 300 Query: 1155 SHMAVVVRSKSALNDAAKRTTAKHEIVKTHINSNLTQILVDNRGQNTKVSIYRSSSDPAS 1334 SHMA VV +K+ N T HE IN + N+ N +SI PA Sbjct: 301 SHMAAVVGTKNHTN----TNTPVHE---KSINGS------PNKDANVFLSI------PAL 341 Query: 1335 QNSAPDQNIANCGLDSFPNPDEEIIGIITMEDVLEELLQEPILDERNEYIDVDNIMKINM 1514 +S +DS + DEE+IGIIT+EDV+EEL+QE I DE ++Y+++ + INM Sbjct: 342 NSSETSHQSPIRYIDSISDEDEEVIGIITLEDVMEELIQEEIYDETDQYVELHKRITINM 401 >dbj|BAA98097.1| unnamed protein product [Arabidopsis thaliana] Length = 519 Score = 439 bits (1130), Expect = e-121 Identities = 236/420 (56%), Positives = 297/420 (70%) Frame = +3 Query: 255 MAANDVPCCETMFWVYLASAIALVAFAXXXXXXXXXXXXXXXXXXEVLIKAGQPNDRKNA 434 MAANDVPCCETMFWVYL +ALV FA EV+IKAG+P+DRKNA Sbjct: 1 MAANDVPCCETMFWVYLLVCVALVVFAGLMSGLTLGLMSLSIVELEVMIKAGEPHDRKNA 60 Query: 435 EKIMPIVKNHHLLLCTLLICNAIAMEALPIFLDSILLPWTAILISVTLVVAFGEIIPQAV 614 EKI+P+VKN HLLLCTLLI NA+AMEALPIF+DS+L W AILISVTL++AFGEIIPQAV Sbjct: 61 EKILPLVKNQHLLLCTLLIGNALAMEALPIFVDSLLPAWGAILISVTLILAFGEIIPQAV 120 Query: 615 CSRYGLAIGAKXXXXXXXXXXXXFPIAYPLSKLLDLILGKGHSVLLRRAELKTLVDMHGN 794 CSRYGL+IGAK FP++YP+SKLLDL+LGK HS LL RAELK+LV MHGN Sbjct: 121 CSRYGLSIGAKLSFLVRLIIIVFFPLSYPISKLLDLLLGKRHSTLLGRAELKSLVYMHGN 180 Query: 795 KAGKGGELTNDEITIISGALDLAQKTVKDAMTLISEIFSLELNSKLNEDTMSLLLSRGHS 974 +AGKGGELT+DE TIISGALD++QK+ KDAMT +S+IFSL++N KL+E TM L+ S GHS Sbjct: 181 EAGKGGELTHDETTIISGALDMSQKSAKDAMTPVSQIFSLDINFKLDEKTMGLIASAGHS 240 Query: 975 RVPVYLGRSENIIGLILVKRLIKYRPEDEVPIKNLSVQKIPRIHESLPLYEMLNLFQKGQ 1154 R+P+Y IIG ILVK LIK RPEDE I++L ++++P++ +LPLY++LN+FQ G+ Sbjct: 241 RIPIYSVNPNVIIGFILVKNLIKVRPEDETSIRDLPIRRMPKVDLNLPLYDILNIFQTGR 300 Query: 1155 SHMAVVVRSKSALNDAAKRTTAKHEIVKTHINSNLTQILVDNRGQNTKVSIYRSSSDPAS 1334 SHMA VV +K+ N T HE IN + N+ N +SI PA Sbjct: 301 SHMAAVVGTKNHTN----TNTPVHE---KSINGS------PNKDANVFLSI------PAL 341 Query: 1335 QNSAPDQNIANCGLDSFPNPDEEIIGIITMEDVLEELLQEPILDERNEYIDVDNIMKINM 1514 +S +DS + DEE+IGIIT+EDV+EEL+QE I DE ++Y+++ + INM Sbjct: 342 NSSETSHQSPIRYIDSISDEDEEVIGIITLEDVMEELIQEEIYDETDQYVELHKRITINM 401