BLASTX nr result
ID: Atractylodes22_contig00005363
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes22_contig00005363
         (1456 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

emb|CAN75603.1| hypothetical protein VITISV_016382 [Vitis vinifera]   398   e-108
ref|XP_002315275.1| predicted protein [Populus trichocarpa] gi|2...   393   e-107
ref|XP_002523905.1| hypothetical protein RCOM_1068550 [Ricinus c...   392   e-106
ref|XP_004143691.1| PREDICTED: uncharacterized protein LOC101204...   354   3e-95
ref|XP_003535180.1| PREDICTED: uncharacterized protein LOC100784...   349   1e-93

>emb|CAN75603.1| hypothetical protein VITISV_016382 [Vitis vinifera]
          Length = 1887

 Score = 398 bits (1023), Expect = e-108
 Identities = 237/497 (47%), Positives = 308/497 (61%), Gaps = 40/497 (8%)
 Frame = -1

Query: 1372 EQSTGNEVTKPLDNE----------SSFSLHQLCYFRPPEKGGQFSVSDLVWGKIRSHPW 1223
            EQ T NE K L+ + + HQ Y PPE G+FSVSDLVWGK+RSHPW
Sbjct: 1210 EQGTDNEQQKSLEEKMVKRATLKPGNLIRGHQATYQLPPESEGEFSVSDLVWGKVRSHPW 1269

Query: 1222 WPGQIFSPSDASEKAKKYHKKDCFLVAYFGDGTFAWTDSAVLKPFRANFSQIEKHANTGA 1043
            WPGQIF PSDASEKA KYHKKDCFLVAYFGD TFAW ++++LKPFR +FSQI K +N+
Sbjct: 1270 WPGQIFDPSDASEKAMKYHKKDCFLVAYFGDRTFAWNEASLLKPFRTHFSQIVKQSNSEV 1329

Query: 1042 FKNAVRCALEEVSRRVELGLACSCVPQDIYDKIKYQIVENGGIRQKSNRRHGTDESASVS 863
            F NAV CAL+EVSRRVELGLACSC+P+D YD+IK QIVEN GIR +S+RR G D+SA++S
Sbjct: 1330 FHNAVDCALDEVSRRVELGLACSCIPKDDYDEIKCQIVENTGIRPESSRRDGVDKSATMS 1389

Query: 862  SFEPDKLVDYVRLLAQFP-GEGDKMELTIAKAQLSSYGRYKGYRQLPELLFYGDILEDDS 686
            EPD V+Y++ LAQFP G D++EL IAKAQL ++ R KGY +LPE + G + E+D+
Sbjct: 1390 LLEPDTFVEYIKALAQFPSGGADQLELVIAKAQLLAFSRLKGYHRLPEFQYCGGLQENDA 1449

Query: 685  -MLCEGVEQVNKYGAKISERRNIPGEYNASQKFENNSMDSGCP-NKERSLSDLTDDAPHS 512
            + C ++ + + + ++S K ++N DS P KERSLS+L +S
Sbjct: 1450 DISCFNEMMEHETDVLMGDDGKFKIQNSSSHKRKHNLKDSAYPRKKERSLSELMSGMAYS 1509

Query: 511  GDGDGAEGXXXXXXNLVSKSGFKKRKAPD-----------RPSVRPRKVPTALISTPKPS 365
            D D + VS SG +KRK D S+ KV +P+ S
Sbjct: 1510 PD-DENDSDGKATSKPVSSSG-RKRKVVDSFGNDSEVQDRTESIFVAKVSNTSAPSPRQS 1567

Query: 364  FKVGECIRRVASQLTGPPSE---HGQHADQLV----------GSAVSLQIPEDSQRGRMI 224
            FKVG+CIRR ASQLTG PS G+ ++V GS VSL PED Q RMI
Sbjct: 1568 FKVGDCIRRAASQLTGSPSILKCSGERPQKVVDGSIGKLGGPGSDVSLMSPEDPQ--RMI 1625

Query: 223  VEGKQSSVAEMLSQLHLTAQDPMKGYGFLNTIIPFFYERRAAVF--SRSLRKNSSIERPV 50
            + + S+ EMLSQL L A+DPMKGY FL+TI+ FF E R ++ S R++ ++++
Sbjct: 1626 IPMEYPSLDEMLSQLRLAARDPMKGYSFLDTIVSFFSEFRNSILLGRYSGRESLTMDKVA 1685

Query: 49   NARKRKACNE-NDPEEF 2
            R++K+ PEEF
Sbjct: 1686 GNRRKKSSQPIGSPEEF 1702

>ref|XP_002315275.1| predicted protein [Populus trichocarpa]
 gi|222864315|gb|EEF01446.1| predicted protein [Populus trichocarpa]
          Length = 1405

 Score = 393 bits (1009), Expect = e-107
 Identities = 232/481 (48%), Positives = 306/481 (63%), Gaps = 27/481 (5%)
 Frame = -1

Query: 1456 DKEVDVIREQTNFCREQKAEALDHGSEAEQSTGNEVTKPLDNESSFSLHQLCYFRPPEKG 1277
            D E ++ Q EQ + + E+S+ V KP +E Q CY PP+
Sbjct: 732  DAEQVDLQGQEMEVEEQDTDTEQLNTMEEKSSKLSVLKPGSSEKE---DQACYLLPPDNE 788

Query: 1276 GQFSVSDLVWGKIRSHPWWPGQIFSPSDASEKAKKYHKKDCFLVAYFGDGTFAWTDSAVL 1097
            G+FSVSDLVWGK+RSHPWWPGQIF PSDASEKA +YHKKDC+LVAYFGD TFAW ++++L
Sbjct: 789  GEFSVSDLVWGKVRSHPWWPGQIFDPSDASEKAMRYHKKDCYLVAYFGDRTFAWNEASLL 848

Query: 1096 KPFRANFSQIEKHANTGAFKNAVRCALEEVSRRVELGLACSCVPQDIYDKIKYQIVENGG 917
            KPFR++FSQ+EK +N+ F+NAV C+LEEVSRRVELGLACSC+P+D YD+IK Q+VEN G
Sbjct: 849  KPFRSHFSQVEKQSNSEVFQNAVDCSLEEVSRRVELGLACSCLPKDAYDEIKCQVVENTG 908

Query: 916  IRQKSNRRHGTDESASVSSFEPDKLVDYVRLLAQFP-GEGDKMELTIAKAQLSSYGRYKG 740
            IR +++ R G D+ S F+PDKLVDY++ LAQ P G +++E IAK+QL ++ R KG
Sbjct: 909  IRPEASTRDGVDKDMSADLFQPDKLVDYMKALAQSPSGGANRLEFVIAKSQLLAFYRLKG 968

Query: 739  YRQLPELLFYGDILE-------DDSMLCEGVEQVNKYGAKISERRNIPGEYNASQKFENN 581
            Y +LPE F G +LE +D + +G S + + +S K ++N
Sbjct: 969  YSELPEYQFCGGLLEKSDALQFEDGSIDHTSAVYEDHGQISSGEEILQTQRGSSHKRKHN 1028

Query: 580  SMDSGCP-NKERSLSDLTDDAPHS-GDGDGAEGXXXXXXNLVSKSGFKKRKAPDR----- 422
            DS P KER+LSDL D+ S GD G++G LVS SG KKRK D
Sbjct: 1029 LKDSIYPRKKERNLSDLISDSWDSVGDEIGSDG--KANSMLVSPSG-KKRKGSDTFADDA 1085

Query: 421  -PSVRPRKVPTALISTP--KPSFKVGECIRRVASQLTGPPS-------EHGQHADQLV-- 278
            + R + + A +S+ KPSFK+GECI+RVASQ+TG PS + +D LV
Sbjct: 1086 YMTGRRKTISFAKVSSTALKPSFKIGECIQRVASQMTGSPSILKCNSPKVDGSSDGLVGD 1145

Query: 277  GSAVSLQIPEDSQRGRMIVEGKQSSVAEMLSQLHLTAQDPMKGYGFLNTIIPFFYERRAA 98
            GS S ED++ R+IV + SS+ ++LSQLHLTAQDP+KGYGFLN II FF + R +
Sbjct: 1146 GSDASFLHSEDAEIKRIIVPTEYSSLDDLLSQLHLTAQDPLKGYGFLNIIISFFSDFRNS 1205

Query: 97   V 95
            V
Sbjct: 1206 V 1206

>ref|XP_002523905.1| hypothetical protein RCOM_1068550 [Ricinus communis]
 gi|223536835|gb|EEF38474.1| hypothetical protein RCOM_1068550 [Ricinus communis]
          Length = 1557

 Score = 392 bits (1006), Expect = e-106
 Identities = 238/492 (48%), Positives = 311/492 (63%), Gaps = 38/492 (7%)
 Frame = -1

Query: 1456 DKEVDVIREQTNFCREQKAE----ALDHGSEAE----QSTGNEVTKPLDNESSFSLHQLC 1301
            D E D+ R+ + +E AE AL G E E ++T ++ L E++ +Q
Sbjct: 868  DIESDIGRQTAD--QEHDAEVQQIALHEGQEIEAEQPKTTDDKQEAALPPENTVKAYQAT 925

Query: 1300 YFRPPEKGGQFSVSDLVWGKIRSHPWWPGQIFSPSDASEKAKKYHKKDCFLVAYFGDGTF 1121
            Y PP+ G+FSVSDLVWGK+RSHPWWPGQIF PSDASEKA KY+K+DCFLVAYFGD TF
Sbjct: 926  YQLPPDDEGEFSVSDLVWGKVRSHPWWPGQIFDPSDASEKAMKYYKRDCFLVAYFGDRTF 985

Query: 1120 AWTDSAVLKPFRANFSQIEKHANTGAFKNAVRCALEEVSRRVELGLACSCVPQDIYDKIK 941
            AW ++++LKPFR+NFS +EK +N+ F+NAV CALEEVSRRVE GLACSC+P+++YDKIK
Sbjct: 986  AWNEASLLKPFRSNFSLVEKQSNSEIFQNAVDCALEEVSRRVEFGLACSCLPRNMYDKIK 1045

Query: 940  YQIVENGGIRQKSNRRHGTDESASVSSFEPDKLVDYVRLLAQFP-GEGDKMELTIAKAQL 764
            +QIVEN GIRQ+S+ R DES F PDKLV+Y++ L Q P G D++EL IAK+QL
Sbjct: 1046 FQIVENAGIRQESSVRDSVDESLHADVFGPDKLVEYMKALGQSPAGGADRLELVIAKSQL 1105

Query: 763  SSYGRYKGYRQLPELLFYGDILED------DSMLCEGVEQVNKYGAKISERRNIPGEYNA 602
            S+ R KGY QLPE F G +LE+ + + EG + K + S + I +
Sbjct: 1106 LSFYRLKGYSQLPEFQFCGGLLENADTLPVEDEVTEGASALYKDDGQSSSGQEILQTQRS 1165

Query: 601  S-QKFENNSMDSGCP-NKERSLSDLTDDAPHSGDGD-GAEGXXXXXXNLVSKSGFKKRKA 431
            S K ++N D+ P KERSLS+L DD+ S D + GA+G L+S S KKR+
Sbjct: 1166 SYHKRKHNLKDTIYPRKKERSLSELMDDSWDSVDDEIGADG--KPSNKLLSPSSGKKRRG 1223

Query: 430  PD-----------RPSVRPRKVPTALISTPKPSFKVGECIRRVASQLTGPPS-------E 305
            D R ++ KV T ++ PKPSFK+GECIRRVASQ+TG PS +
Sbjct: 1224 SDSFADDAAMIEGRKTISLAKVSTP-VTLPKPSFKIGECIRRVASQMTGSPSILRPNSQK 1282

Query: 304  HGQHADQLV--GSAVSLQIPEDSQRGRMIVEGKQSSVAEMLSQLHLTAQDPMKGYGFLNT 131
            +D LV GS + +Q ED + RM V + SS+ E+LSQL L A+DP+KGY FL
Sbjct: 1283 PDGGSDGLVGDGSDILIQHSEDLEMRRMNVPTEYSSLDELLSQLLLAARDPLKGYSFLTV 1342

Query: 130  IIPFFYERRAAV 95
            II FF + R V
Sbjct: 1343 IISFFSDFRNTV 1354

>ref|XP_004143691.1| PREDICTED: uncharacterized protein LOC101204371 [Cucumis sativus]
          Length = 1936

 Score = 354 bits (909), Expect = 3e-95
 Identities = 226/507 (44%), Positives = 305/507 (60%), Gaps = 46/507 (9%)
 Frame = -1

Query: 1411 EQKAEALDHGSEAEQSTGNEVTKPLDN----ESSFSLHQLCYFRPPEKGGQFSVSDLVWG 1244
            + K A G E+ G +VT D+ ESS LHQ CY P E G FSVSDLVWG
Sbjct: 503  DHKFNANQMGLHGEEEDG-DVTGIEDDDDQLESSVQLHQACYHLPSENEGDFSVSDLVWG 561

Query: 1243 KIRSHPWWPGQIFSPSDASEKAKKYHKKDCFLVAYFGDGTFAWTDSAVLKPFRANFSQIE 1064
            K+RSHPWWPGQIF PSD+S++A KY+KKD +LVAYFGD TFAW + + LKPFR +FSQ E
Sbjct: 562  KVRSHPWWPGQIFDPSDSSDQAMKYYKKDFYLVAYFGDRTFAWNEVSHLKPFRTHFSQEE 621

Query: 1063 KHANTGAFKNAVRCALEEVSRRVELGLACSCVPQDIYDKIKYQIVENGGIRQKSNRRHGT 884
            +++ AF+N+V CALEEVSRR ELGLAC+C P++ YD +K QI+EN GIR++S+RR+G
Sbjct: 622  MQSHSEAFQNSVECALEEVSRRAELGLACACTPKEAYDMVKCQIIENAGIREESSRRYGV 681

Query: 883  DESASVSSFEPDKLVDYVRLLAQFPGEG-DKMELTIAKAQLSSYGRYKGYRQLPELLFYG 707
            D+SAS +SFEP KL++Y+R LA+FP +G D++EL IAKAQL+++ R KGY LP+ F G
Sbjct: 682  DKSASATSFEPAKLIEYIRDLAKFPSDGSDRLELVIAKAQLTAFYRLKGYCGLPQFQFGG 741

Query: 706  -------DILEDDSMLCEGVE----QVNKYGAKISE-------RRNIPGEYNASQKFENN 581
            L D+ + G+E + + A + + N+ ++ K ++N
Sbjct: 742  LPQFQFCGGLADNELDSLGIEMQSSDFDHHAAPCQDDAQASPSKENVEVRSSSYHKRKHN 801

Query: 580  SMDSGCP-NKERSLSDLTDDAPHSGDGDGAEGXXXXXXNLVSKSGFKKRKAPDRP---SV 413
            D P KE+SL +L + + + DG LVS S K+RK + P S
Sbjct: 802  LKDGLYPKKKEKSLYELMGE--NFDNIDGENWSDARTSTLVSPS-CKRRKTVEHPIDGSG 858

Query: 412  RP---RKVPTALIS---TPKPSFKVGECIRRVASQLTGPP-----SEHGQHAD------Q 284
            P + + A +S + K SFK+G+CIRRVASQLTG P E Q D
Sbjct: 859  APDGRKTISVAKVSGTASLKQSFKIGDCIRRVASQLTGTPPIKSTCERFQKPDGSFDGNA 918

Query: 283  LVGSAVSLQIPEDSQRGRMIVEGKQSSVAEMLSQLHLTAQDPMKGYGFLNTIIPFFYERR 104
            L S V LQ +D+QRG++ + SS+ E+L QL L A DPMK Y FLN I+ FF + R
Sbjct: 919  LHESDVFLQNFDDAQRGKVNFPPEYSSLDELLDQLQLVASDPMKEYSFLNVIVSFFTDFR 978

Query: 103  AAVFSRSLRKNSSIERPV--NARKRKA 29
            ++ LR++ IE + N KRKA
Sbjct: 979  DSLI---LRQHPGIEEALERNGGKRKA 1002

>ref|XP_003535180.1| PREDICTED: uncharacterized protein LOC100784689 [Glycine max]
          Length = 1019

 Score = 349 bits (895), Expect = 1e-93
 Identities = 207/464 (44%), Positives = 274/464 (59%), Gaps = 25/464 (5%)
 Frame = -1

Query: 1318 SLHQLCYFRPPEKGGQFSVSDLVWGKIRSHPWWPGQIFSPSDASEKAKKYHKKDCFLVAY 1139
            SLH Y P EK G+FSVSD+VWGK+RSHPWWPGQIF PSD+SEKA K++KKDC LVAY
Sbjct: 351  SLHNARYLLPIEKEGEFSVSDMVWGKVRSHPWWPGQIFDPSDSSEKAMKHYKKDCHLVAY 410

Query: 1138 FGDGTFAWTDSAVLKPFRANFSQIEKHANTGAFKNAVRCALEEVSRRVELGLACSCVPQD 959
            FGD TFAW + + LKPFR +FS IEK + + +F+NAV CA++EV+RR E GLACSC+P+D
Sbjct: 411  FGDRTFAWNEESQLKPFRTHFSSIEKQSTSESFQNAVDCAVDEVTRRAEYGLACSCIPKD 470

Query: 958  IYDKIKYQIVENGGIRQKSNRRHGTDESASVSSFEPDKLVDYVRLLAQFPGEG-DKMELT 782
            YD IK+Q VEN GIR + + RHG DES + SSF P LV+Y++ L+ P G D++EL
Sbjct: 471  TYDSIKFQTVENTGIRSELSARHGVDESLNASSFSPGNLVEYLKTLSALPTGGFDRLELE 530

Query: 781  IAKAQLSSYGRYKGYRQLPELLFYGDILEDDSMLCEGVEQVNKYGAKISER------RNI 620
            IAKAQL S+ R+KGY LPEL + G +D L E N + A +S+ N+
Sbjct: 531  IAKAQLLSFYRFKGYSCLPELQYCGGFDDDMDSLVHDDE--NNHAAPVSKNYGQAGSGNL 588

Query: 619  PGEYNASQKFENNSMD-SGCPNKERSLSDLTDDAPHSGDGDGAEGXXXXXXNLVSKSGFK 443
            + ++ +K ++N D KERSLS+L P S DGD NLVS K
Sbjct: 589  KNQSSSHRKRKHNLKDIMHETKKERSLSELMGGTPDSPDGD-YWSEEKVIDNLVSPGRSK 647

Query: 442  KRKAPD-------RPSVRPRKVPTALISTPKPSFKVGECIRRVASQLTGPPS---EHGQH 293
            KR+ D +P R + +T KPSF +G+ IRRVAS+LTG PS G
Sbjct: 648  KRRTVDHYADDFGKPDGRKTISVAKVSNTTKPSFLIGDRIRRVASKLTGSPSTVKSSGDR 707

Query: 292  ADQLVGSAVSLQ------IPEDSQRGRMIVEGKQSSVAEMLSQLHLTAQDPMKGYGFLNT 131
            + + GS E++QR M + SS+ +LS LHL AQ+P+ Y FLN
Sbjct: 708  SQKTDGSTDGFSGNGTDFSFEEAQRSSMAAPTEYSSLDNLLSSLHLVAQEPLGDYNFLNP 767

Query: 130  IIPFFYE-RRAAVFSRSLRKNSSIERPVNARKRKACNENDPEEF 2
            I+ FF + R + V + K + V +++K PE F
Sbjct: 768  IVSFFSDFRNSIVVADDSVKGIFCKEKVGTKRKKLPPAGLPESF 811
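
Each hit in the report carries the same set of statistics (Score, Expect, Identities, Positives, Gaps, Frame), and the printed percentages are whole numbers. As a minimal sketch, the fields could be extracted and the percentages recomputed from the raw counts as below. The names `HIT_STATS`, `parse_evalue` and `hit_stats` are hypothetical helpers introduced here for illustration, not part of BLAST, and the pattern assumes the legacy text layout of this report, with all six fields present in this order.

```python
import re

# Hypothetical helper, not part of BLAST itself. It assumes the legacy
# text layout seen in this report, where every hit carries Score, Expect,
# Identities, Positives, Gaps and Frame fields in that order. Whitespace
# is matched loosely, so flattened and line-broken text both work.
HIT_STATS = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>\S+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s*\(\d+%\),\s*"
    r"Gaps\s*=\s*(?P<gaps>\d+)/\d+\s*\(\d+%\)\s*"
    r"Frame\s*=\s*(?P<frame>[+-]\d)"
)


def parse_evalue(text):
    """Convert a BLAST E-value string to float ("e-108" is read as 1e-108)."""
    return float("1" + text if text.startswith("e") else text)


def hit_stats(report):
    """Return one dict per hit, with percentages recomputed from the counts."""
    hits = []
    for m in HIT_STATS.finditer(report):
        length = int(m["length"])  # alignment length, gaps included
        hits.append({
            "bits": float(m["bits"]),
            "raw_score": int(m["raw"]),
            "evalue": parse_evalue(m["evalue"]),
            "identity_pct": 100.0 * int(m["ident"]) / length,
            "positive_pct": 100.0 * int(m["pos"]) / length,
            "gap_pct": 100.0 * int(m["gaps"]) / length,
            "frame": int(m["frame"]),
        })
    return hits


# Statistics of the first hit (emb|CAN75603.1|), copied from the report.
sample = (
    "Score = 398 bits (1023), Expect = e-108 "
    "Identities = 237/497 (47%), Positives = 308/497 (61%), "
    "Gaps = 40/497 (8%) Frame = -1"
)

for hit in hit_stats(sample):
    print(hit)
```

Passing the full report text to `hit_stats` instead of `sample` should yield one entry per hit, provided every hit prints all six fields. For production use, an established BLAST parser is likely a safer choice than a hand-written pattern.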