BLASTX nr result
ID: Glycyrrhiza23_contig00013744
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Glycyrrhiza23_contig00013744 (1662 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003540701.1| PREDICTED: uncharacterized protein LOC100800... 526 e-147 ref|XP_003597511.1| hypothetical protein MTR_2g098830 [Medicago ... 468 e-129 emb|CBI36653.3| unnamed protein product [Vitis vinifera] 299 2e-78 ref|XP_002533047.1| hypothetical protein RCOM_0068670 [Ricinus c... 241 3e-61 gb|AAF80120.1|AC024174_2 Contains similarity to an unknown prote... 200 1e-48 >ref|XP_003540701.1| PREDICTED: uncharacterized protein LOC100800099 [Glycine max] Length = 677 Score = 526 bits (1355), Expect = e-147 Identities = 291/472 (61%), Positives = 342/472 (72%), Gaps = 1/472 (0%) Frame = +3 Query: 18 YKKKRVIKKPTKDESKVDEAEFLQVGYSAVKEAIGINNTDIMLLESYTVYSQSKEKAASR 197 YKK+RVIKK + E KVDE FLQVGYSAVKEA GINNTDIMLLES TVYS+SKEK + Sbjct: 213 YKKRRVIKKSAQKELKVDEDVFLQVGYSAVKEATGINNTDIMLLESGTVYSESKEKLSDD 272 Query: 198 FYIMQCSQSINQEVIQVPLKNVIESLQGPLVKKSSSSWTITPVVAYFHVLPYSEIISKWI 377 + S S ++E+ P+ LQGPLV KSS SW IT V+ YFHVLPYSEIISKWI Sbjct: 273 SG-PRGSVSDDREMGLFPV-----CLQGPLVSKSSRSWMITSVIDYFHVLPYSEIISKWI 326 Query: 378 SREAFSNSLQNSRVTEKNIMVDSPEVTESYVTRDMFTGLDSKPSKDTIELLEQKENKGSC 557 SR AFS SLQ+SRVTEKNI V++PE T+ YV +D FT LDSKP+ D I+L +QKE+ GSC Sbjct: 327 SRGAFSTSLQDSRVTEKNIKVNTPEATDFYVNKDTFTALDSKPNSDNIDLPKQKEHHGSC 386 Query: 558 TLRLSDSIKEPREMDVNKPSMFPSKNKEKCQN-IANTVQVGEDQEENNPSVQYNSNGXXX 734 T LS I EP EMDVN+ S+F S+NKEKCQ I NTVQVG DQE+N S++YNSN Sbjct: 387 TPALSYYINEPIEMDVNENSIFKSQNKEKCQYIIGNTVQVGVDQEKNYLSLKYNSNAYAS 446 Query: 735 
XXXXXXXXXTRMLITEGEINNIASCHKIRANGPNSSYKKDTIDVCTQSAKHSNSDTEKLQ 914 TRMLI EG +NN+AS H ANGPN+S +K + T +A HSNSD EKLQ Sbjct: 447 AVKALKVDSTRMLIAEGGVNNLASLHNNYANGPNTSSEKGILVNYTPTANHSNSDLEKLQ 506 Query: 915 ILLDSKKILSRTALTALIRKRNELALQQRKIEEEIAICDKKIQRLSKGDVEDNFELKIES 1094 IL DSKKILS+TAL ALIRKRNELALQQRKIE+EIA+CDKKIQR+ D EDNF+LKIES Sbjct: 507 ILSDSKKILSQTALAALIRKRNELALQQRKIEDEIAVCDKKIQRMLT-DGEDNFKLKIES 565 Query: 1095 IVDGCNDIWVSNQERMCGQQSFPLEKKKLSEAVFIRLSPCQELDGVCHENNWVLPTYHLS 1274 I++GCN WV NQER QQS P E+KKLSEAVF+ SP QELD + EN WVLPTYHLS Sbjct: 566 IIEGCNGTWVRNQERTSEQQSLPFERKKLSEAVFLTQSPFQELDNIFRENYWVLPTYHLS 625 Query: 1275 QSDGGFQANVTVNGVGFRCSLAGSVCXXXXXXXXXXXXMMLTNLRSMAKVAK 1430 ++GGF+ANV V G F+CS G + MLT+ ++MAK+A+ Sbjct: 626 HANGGFKANVIVKGEDFQCSFEGIMGSNPPEARESAAAQMLTHFKNMAKLAQ 677 >ref|XP_003597511.1| hypothetical protein MTR_2g098830 [Medicago truncatula] gi|355486559|gb|AES67762.1| hypothetical protein MTR_2g098830 [Medicago truncatula] Length = 1588 Score = 468 bits (1205), Expect = e-129 Identities = 259/482 (53%), Positives = 327/482 (67%), Gaps = 10/482 (2%) Frame = +3 Query: 3 ETKHAYKKKRVIKKPTKDESKVDEAEFLQVGYSAVKEAIG--INNTDIMLLESYTVYSQS 176 E KH Y+K+RVI+ PTK+ VD+ EFLQVGYSAVKEA G +N+ DIMLLESYTVYSQ Sbjct: 1113 ELKHTYQKRRVIQNPTKNGLNVDDDEFLQVGYSAVKEATGQGVNSNDIMLLESYTVYSQR 1172 Query: 177 KEKAASRFYIMQCSQSINQEVIQVPLKNVIESLQGPLVKKSSSSWTITPVVAYFHVLPYS 356 KEK ASRFYIM+CSQS IQVP+K++IES +GPL+KKSSSSWTIT VV YFHVLPYS Sbjct: 1173 KEKTASRFYIMKCSQSTADGSIQVPIKDLIESFRGPLLKKSSSSWTITSVVEYFHVLPYS 1232 Query: 357 EIISKWISREAFSNSLQNSRVTEKNIMVDSPEVTESYV-TRDMFTGLDSKPSKDTIELLE 533 EIIS WISRE FSNSLQ+S++ EK EVTES+V ++ ++ LD+KP DT L Sbjct: 1233 EIISDWISRETFSNSLQDSKLAEKQF--PKHEVTESHVSSKGLYIDLDNKPGSDTKVALN 1290 Query: 534 QKENKGSCTLRLSDSIKEPREMDVNKPSMFPSKNKEKCQNIANTVQVGEDQEENNPSVQY 713 QKE G + DS+KE +MDV+K + PSKNKE ++ ANT+ + EDQ+ NPSVQ+ Sbjct: 1291 QKEKNGCGITKRCDSVKEDWDMDVDKSLVLPSKNKECQKHTANTLHISEDQKIENPSVQH 1350 Query: 714 NSNGXXXXXXXXXXXXTRMLITEGEINNIASCHKIRANGPNSSYKKDTIDVCTQSAKHSN 893 +SN R 
ITEG I + ++ KI A ++++ D+++ C +A SN Sbjct: 1351 HSNECTRPSKAEKAVSKRKHITEGGIKDQSAFDKICA---GTTFENDSVEKCILNANSSN 1407 Query: 894 SDTEKLQILLDSK-KILSRTALTALIRKRNELALQQRKIEEEIAICDKKIQRLSKGDVED 1070 + EK+Q + SK ILS+TAL ALIRKRN LALQQR IE+E+A+C+ KI R G+ ED Sbjct: 1408 KNLEKIQTFIASKGTILSQTALNALIRKRNALALQQRAIEDEMAVCNMKIHRWLAGE-ED 1466 Query: 1071 NFELKIESIVDGCNDIWVSNQERMCGQ----QSFP--LEKKKLSEAVFIRLSPCQELDGV 1232 +FELK+ES+++GCN W+ NQ RMC Q Q P ++ K+L+EAV SPCQELDG+ Sbjct: 1467 DFELKLESVIEGCNGTWLRNQGRMCSQYLDDQCLPQSVKSKRLTEAVLTLHSPCQELDGI 1526 Query: 1233 CHENNWVLPTYHLSQSDGGFQANVTVNGVGFRCSLAGSVCXXXXXXXXXXXXMMLTNLRS 1412 CHENNW+LPTY +S SDG F A V V GV F S G+ C MLT RS Sbjct: 1527 CHENNWILPTYSVSLSDGEFHATVRVKGVDFEYSCEGNTCPFPREARDSAAAQMLTKFRS 1586 Query: 1413 MA 1418 MA Sbjct: 1587 MA 1588 Score = 214 bits (546), Expect = 4e-53 Identities = 144/382 (37%), Positives = 207/382 (54%), Gaps = 61/382 (15%) Frame = +3 Query: 18 YKKKRVIKKPTKDESKVDEAEFLQVGYSAVKEAIGINNTDIMLLESYTVYSQSKEKAASR 197 YKK+RV+KKP+++ S VDE LQVGYSAVKEA G+N+ DIMLLESYTVYSQSKEK +SR Sbjct: 193 YKKRRVVKKPSRNGSNVDEDRILQVGYSAVKEAAGVNSIDIMLLESYTVYSQSKEKTSSR 252 Query: 198 FYIMQCSQSINQEVIQVPLKNVIES--------------------LQGPLVKKSSSSWTI 317 FYIM+CSQSI++ QVP+K++IE ++GPLVK+SS SW + Sbjct: 253 FYIMKCSQSIDEGFTQVPIKDLIERFVTIGMVFLLADHSGLEGEFVRGPLVKRSSDSWKV 312 Query: 318 TPVVAYFHVLPYSEIISKWISREAFSNSLQNSRVTEK-----------------NIMVDS 446 TPVV YFH+LPYS+IIS+WISRE FSNSLQ+S++ EK ++ +D+ Sbjct: 313 TPVVEYFHMLPYSKIISEWISRETFSNSLQDSKLAEKQFPKLEVKESHISSKALSVGLDN 372 Query: 447 PEVTESYV------------------TRDMFTGLDSKPSKDTIELLEQKENKGSCTLRLS 572 + +E+ V + M GLD+K DTI L QKE G T+ Sbjct: 373 KQCSETIVALNQKQLLKLEVKEMHVSSEGMSAGLDNKACSDTIVTLNQKEKNGCGTITQC 432 Query: 573 DSIKEPREMDVNKPSMFPSKNKEKCQNIANT-VQVGEDQEENNPSV----QYNSNGXXXX 737 S+K+ ++MDV+ S + + ++++ + VG D ++++ ++ Q +NG Sbjct: 433 GSVKKDQDMDVDNSSRVKTNLEVTESHVSSEGMSVGLDNKQSSDTIAALNQKENNGC--- 489 Query: 738 XXXXXXXXTRMLITEGEINNIASCHKIR-ANGPNSSYKKDTIDVCTQSAKHSNSDTEKLQ 914 G I S K + NSS KK ++V S+ +E + Sbjct: 490 
---------------GIITQCGSVKKDEDMDVDNSSIKKTNLEV-----TESHVSSEGMS 529 Query: 915 ILLDSKKILSRTALTALIRKRN 980 + LD+K + AL +K N Sbjct: 530 VDLDNKP--CSDTIAALNQKEN 549 Score = 109 bits (272), Expect = 2e-21 Identities = 68/169 (40%), Positives = 103/169 (60%), Gaps = 2/169 (1%) Frame = +3 Query: 450 EVTESYVTRD-MFTGLDSKPSKDTIELLEQKENKGSCTLRLSDSIKEPREMDVNKPSMFP 626 EVTES+V+ + M GLD+KP DTI L+QKEN + S+KE ++MDV+ S FP Sbjct: 769 EVTESHVSSEGMSVGLDNKPCSDTIVALDQKENSCCGKITRCSSVKEDQDMDVDNCSTFP 828 Query: 627 SK-NKEKCQNIANTVQVGEDQEENNPSVQYNSNGXXXXXXXXXXXXTRMLITEGEINNIA 803 SK N+E +++ANT+QV EDQ+ N SVQ++SN TRM I EG I + + Sbjct: 829 SKLNEEYQKHVANTLQVNEDQKIENSSVQHHSNECTRPSEAEKVVSTRMHIIEGGIKDES 888 Query: 804 SCHKIRANGPNSSYKKDTIDVCTQSAKHSNSDTEKLQILLDSKKILSRT 950 + KI +++ + ++I+ CT A + N+D EK++ +DSK + R+ Sbjct: 889 AFDKICV---DATVENESIEKCTPIADNFNADFEKVRSFVDSKGKMGRS 934 >emb|CBI36653.3| unnamed protein product [Vitis vinifera] Length = 691 Score = 299 bits (765), Expect = 2e-78 Identities = 196/517 (37%), Positives = 279/517 (53%), Gaps = 44/517 (8%) Frame = +3 Query: 9 KHAYKKKRVIKKPTKDESKVDEAEFLQVGYSAVKEAIGINNTDIMLLESYTVYSQSKEKA 188 K Y+KKR KK T+DE+ DEA Q+ +SAVK+A GI+ D+M+LES+ VYS SKE+ Sbjct: 182 KQIYRKKRTTKKSTRDETGSDEASLQQLAFSAVKKAAGISQADLMVLESHVVYSLSKERT 241 Query: 189 ASRFYIMQCSQSINQEVIQVPLKNVIESLQGPLVKKSSSSWTITPVVAYFHVLPYSEIIS 368 A RFYIMQC+QSIN+++ Q+P+ I+SLQGPLVK SS SWT+T VV YFHVLPY I+S Sbjct: 242 ACRFYIMQCTQSINEDISQIPINEAIDSLQGPLVKNSSCSWTVTSVVEYFHVLPYKGILS 301 Query: 369 KWISREAFSNSLQNSRVTEKNIMVDSPEVT---------------ESYVTRDMFTGLDSK 503 W+SR FSNSLQ+ RV N ++S + T S R G ++ Sbjct: 302 DWLSRGVFSNSLQDLRVGLGNEKLNSTQRTAEPLDAEVKRNSNESHSNCGRVDVLGNENM 361 Query: 504 PSKDTIELLEQKENKGSCT---------LRLSDSI-KEPREMDVNKPSMFPSKNKEKCQN 653 + + + Q E+ L +D + + D P S N+ + ++ Sbjct: 362 NADNPCMVCPQNEDDAEVNKKRVGSDRYLSTADVLGNKSSGTDTESPRWKYSNNESRSKS 421 Query: 654 IANTVQVGEDQEENNPSVQYNSNGXXXXXXXXXXXXTRMLITEGEINNI-----ASC--H 812 + N+V+V Q+E + N T + EG N++ SC Sbjct: 422 
LPNSVEVNLHQKEMTLFTARDLNA----------AATGAMAKEGIANSVMTPCTTSCRGE 471 Query: 813 KIRANGPNSSY---KKDTI---DVCTQSAKHSNSDTEKLQILLDSK-KILSRTALTALIR 971 K+ G ++ +D + D + + ++ +KLQI + SK K+LS+TAL L+R Sbjct: 472 KVANGGEICNFIMPDQDGMLIEDRALVTYESNSEHLDKLQITIASKEKLLSQTALKVLLR 531 Query: 972 KRNELALQQRKIEEEIAICDKKIQRLSKGDVEDNFELKIESIVDGCNDIWVSNQERM--- 1142 KR+ L+ QQRK+E+EIA CDK IQ + G ED+ LKIESI++ CND ++R Sbjct: 532 KRDRLSHQQRKLEDEIAQCDKNIQTILDGG-EDDLALKIESILEFCNDACPQTRDRTYRH 590 Query: 1143 CGQQSFP--LEKKKLSEAVFIRLSPCQELDGVCHENNWVLPTYHLSQSDGGFQANVTVNG 1316 Q P +++K+LSEA+ CQELDG+C+ENNW+LPTY +S DG + V+V G Sbjct: 591 LEDQESPQHIKRKRLSEAILNIQKSCQELDGICYENNWILPTYRVSLLDGKSEGTVSVKG 650 Query: 1317 VGFRCSLAGSVCXXXXXXXXXXXXMMLTNLRSMAKVA 1427 V F S+ G C ML L+SMA A Sbjct: 651 VDFEISVVGEPCDTPREARESAAAQMLAKLQSMATAA 687 >ref|XP_002533047.1| hypothetical protein RCOM_0068670 [Ricinus communis] gi|223527166|gb|EEF29337.1| hypothetical protein RCOM_0068670 [Ricinus communis] Length = 624 Score = 241 bits (616), Expect = 3e-61 Identities = 162/469 (34%), Positives = 244/469 (52%), Gaps = 22/469 (4%) Frame = +3 Query: 3 ETKHAYKKKRVIKKPTKDESKVDEAEFLQVGYSAVKEAIGINNTDIMLLESYTVYSQSKE 182 + +H KKKR I+KP + S DEA+ Q+ +SAVKE GIN D+++LE + VYS SKE Sbjct: 188 DPEHVTKKKRFIRKPLNNVSVADEADLQQLAFSAVKEVTGINQNDLLILERHDVYSTSKE 247 Query: 183 KAASRFYIMQCSQSINQEVIQVPLKNVIESLQGPLVKKSSSSWTITPVVAYFHVLPYSEI 362 KAA+ FYIMQ +QS N + + P+++ + SLQGPL +SSS W T VV YFH+LPY+ I Sbjct: 248 KAAACFYIMQYAQSNNNNITKTPIEDALNSLQGPLFVRSSSRWIHTSVVEYFHLLPYAGI 307 Query: 363 ISKWISREAFSNSLQNSRVTEKNIMVDS---------PEVTESYVTRDMFTGLDSKPSKD 515 +S W++R+ S+SLQ + I V+S PE +S D+ + S +K Sbjct: 308 LSDWLARK--SSSLQVQNPGSETINVNSSKRIERPCIPEAPKSSHDGDLGSKKGSGSAK- 364 Query: 516 TIELLEQKENKGSCTLRLSDSIKEPREMDVNKPSMFPSKNKEKCQNIANTVQVGEDQEEN 695 Q+E+ G + L+D I PR+M+V+ + ++ K+K N+ + V+ Q++ Sbjct: 365 ------QREDDGFYAVDLTDDIDGPRKMEVDDSFVAHAETKDKVTNVVSKVKPQNCQKKT 418 Query: 696 NPSVQYNSNGXXXXXXXXXXXXTRMLITEGEINNIASCHKIRANGPNSSYKKDTIDVCTQ 875 S+ +SNG + + + +I KI + P +S KD + 
Sbjct: 419 --SLDGSSNGS---------------VDKANMADILKRQKISRDEPAASRNKDLKGTSSD 461 Query: 876 SA------------KHSNSDTEKLQILLDSK-KILSRTALTALIRKRNELALQQRKIEEE 1016 + +++D +KL+ ++ SK + LS+ AL ++ KR +L LQ R IE++ Sbjct: 462 QDGIPRNGHAIVKDRSNSNDLDKLRTVIASKDQELSQAALQVVLSKRAKLCLQLRDIEDQ 521 Query: 1017 IAICDKKIQRLSKGDVEDNFELKIESIVDGCNDIWVSNQERMCGQQSFPLEKKKLSEAVF 1196 IA CDK IQ + G E + LKIES+++GCND Sbjct: 522 IAQCDKNIQTILNGG-EGDLALKIESLLEGCND--------------------------- 553 Query: 1197 IRLSPCQELDGVCHENNWVLPTYHLSQSDGGFQANVTVNGVGFRCSLAG 1343 ELD +C + NW+LPTY +S DGGFQANV V G S G Sbjct: 554 -------ELDDLCRQKNWILPTYQVSAPDGGFQANVIVKGKDCEYSTGG 595 >gb|AAF80120.1|AC024174_2 Contains similarity to an unknown protein T11A7.7 gi|2335096 from Arabidopsis thaliana BAC T11A7 gb|AC002339 and contains a tropomyosin PF|00261 domain. ESTs gb|AI995205, gb|N37925, gb|F13889, gb|AV523107, gb|AV535948, gb|AV558461, gb|F13888 come from this gene [Arabidopsis thaliana] Length = 1628 Score = 200 bits (508), Expect = 1e-48 Identities = 147/462 (31%), Positives = 235/462 (50%), Gaps = 22/462 (4%) Frame = +3 Query: 24 KKRVIKKPTKDESKVDEAEFLQVGYSAVKEAIGINNTDIMLLESYTVYSQSKEKAASRFY 203 +K + K+ E++ +E F +V ++ VKEA G+N+ DI++LE + V S S+EK A RFY Sbjct: 1170 EKPIEKEKAARENQKEEGVFQKVAFAVVKEATGVNHKDIVILERHLVCSLSEEKTAVRFY 1229 Query: 204 IMQCSQSINQEVIQVPLKNVIESLQGPLVKKSSSSWTITPVVAYFHVLPYSEIISKWISR 383 IM+C+ S ++ + P++ V+ +QGPL +KS S WT+ +V YFHVLPY+ +I W SR Sbjct: 1230 IMKCT-SQDKFSGENPVEEVLSCMQGPLFEKSFSDWTMNSIVEYFHVLPYATLIEDWFSR 1288 Query: 384 ------------EAFSNSLQNSRV--TEKNIMVDSPEVTESYVTRDMFTGLDSKPSKDTI 521 EA + +++++V T+++ + D E E + + +K Sbjct: 1289 RGDTEFVIEKEPEAVCDDIESNKVDATKESEVSDIFERREKAALKRRY----EIKAKKVA 1344 Query: 522 ELLEQKENKGSCTLRLSD-----SIKEPREMDVNKPSMFPSKNKEKCQNIANTVQVGEDQ 686 LL +G T RL + S+ +E +V+ ++ K K N+ N + +D Sbjct: 1345 ALLSHPGARGKATTRLQNRYLKGSMSGAKEPNVHSETVVALKAK----NVGNEMSPCKDN 1400 Query: 687 EENNPSVQYNSNGXXXXXXXXXXXXTRMLITEGEINNIASCHKIRANGPNSSYKKDTIDV 866 N G + + + + S HK+ + Sbjct: 1401 
YSNG-----EKGGFEVASDPKELKERGLQRKKAVPDRLNSIHKLNST------------- 1442 Query: 867 CTQSAKHSNSDTEKLQILLDSKKI-LSRTALTALIRKRNELALQQRKIEEEIAICDKKIQ 1043 SA +SN + E+LQ L SK LS TAL L+ KR++L QQR IE+EIA CDK IQ Sbjct: 1443 -PASAHNSNPNLEELQTSLLSKATSLSETALKVLLCKRDKLTRQQRNIEDEIAKCDKCIQ 1501 Query: 1044 RLSKGDVEDNFELKIESIVDGCNDIWVSN--QERMCGQQSFPLEKKKLSEAVFIRLSPCQ 1217 + KGD +EL++E++++ CN+ + QE + ++ KLSE + S CQ Sbjct: 1502 NI-KGD----WELQLETVLECCNETYPRRNLQESLDKSACQSNKRLKLSETLPSTKSLCQ 1556 Query: 1218 ELDGVCHENNWVLPTYHLSQSDGGFQANVTVNGVGFRCSLAG 1343 LD +C NNWVLP Y ++ SDGG++A V + G C++ G Sbjct: 1557 RLDDICLMNNWVLPNYRVAPSDGGYEAEVRITGNHVACTIHG 1598