BLASTX nr result
ID: Glycyrrhiza24_contig00004007
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza24_contig00004007
         (1395 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_003522290.1| PREDICTED: uncharacterized protein LOC100787...   630   e-178
ref|XP_003528229.1| PREDICTED: uncharacterized protein LOC100805...   592   e-167
ref|XP_003525442.1| PREDICTED: uncharacterized protein LOC100775...   461   e-127
ref|XP_002328635.1| predicted protein [Populus trichocarpa] gi|2...   449   e-124
ref|XP_002514640.1| conserved hypothetical protein [Ricinus comm...   449   e-123

>ref|XP_003522290.1| PREDICTED: uncharacterized protein LOC100787391 [Glycine max]
          Length = 1247

 Score = 630 bits (1626), Expect = e-178
 Identities = 337/429 (78%), Positives = 362/429 (84%), Gaps = 2/429 (0%)
 Frame = -2

Query: 1394 SDSNKPRRQSGKKATDSGSPGGKVRAKVLNSQHSDEQLSEISNESRSLSFQGDEISLQSD 1215
            SDSNKPRRQSGKKAT+SGSPGG+ R K LN  H DEQLSEISNE RSLSFQGDEISLQS+
Sbjct: 821  SDSNKPRRQSGKKATESGSPGGRQRPKSLNVPHGDEQLSEISNEPRSLSFQGDEISLQSN 880

Query: 1214 SITVNSKMDMEVTSSLRSCEINDSQSPSLKAMKQLVSETVQKKSTPRLDEDETIAELATV 1035
            S+TVNSKMDMEVTSSL++ EI+DSQSPSLKA+KQL+SETVQKKSTPRLDEDET+AELAT
Sbjct: 881  SLTVNSKMDMEVTSSLQTVEIDDSQSPSLKAVKQLISETVQKKSTPRLDEDETVAELATD 940

Query: 1034 APEHPSPISVLDGGSVYRDDVSSPVKQISRVPKADDAQESQENEVKDQWKPADSLSFSST 855
             PEHPSPISVLD GSVYRDD+ SPVKQIS   K +DAQES+ENE+KDQW PADSLSF+ T
Sbjct: 941  TPEHPSPISVLD-GSVYRDDMPSPVKQISEDSKGEDAQESKENEIKDQWNPADSLSFNCT 999

Query: 854  GSGEINRKKLQSIDHLVQKLRRLNSSHDEARIDYIASLCENTNPDHRYISEIXXXXXXXX 675
            GS EINRKKLQ+IDHLVQKLRRLNSSHDEARIDYIASLCENTNPDHRYISEI
Sbjct: 1000 GSLEINRKKLQNIDHLVQKLRRLNSSHDEARIDYIASLCENTNPDHRYISEILLASGLLL 1059

Query: 674  XXXXXXXLTFQLHSLGNPINPELFLVLEQTXXXXXXXXXXXSPGKVAFLKQNTEKLHRKL 495
                   LTFQLHS G+PINPELFLVLEQT           SPGK A +K N EK HRKL
Sbjct: 1060 RDLSSELLTFQLHSSGHPINPELFLVLEQTKASSLLSKEESSPGKDANMKLNKEKFHRKL 1119

Query: 494  IFDAVNEILGAKLGSSPEPWFQP--NRLTKKTLSAQKLLKELCFEIEKAQAKEPECCLXX 321
            IFD+VNEILGAK GSSPEP FQP  NRLTKKTLSAQKLLKELCFEIEK QAK+PECCL
Sbjct: 1120 IFDSVNEILGAKFGSSPEPCFQPNSNRLTKKTLSAQKLLKELCFEIEKIQAKKPECCL-E 1178

Query: 320  XXXDGLKCMLWEDVMHGSESWENFTGELPGVVLDVERLVFKDLVDEIVIGEAAGLRVKSS 141
               DGLK ML EDVMHGSESW +F G LPGVVLDVERL+FKDLVDE+VIGE++GLRVK S
Sbjct: 1179 DDHDGLKNMLCEDVMHGSESWTDFHGYLPGVVLDVERLLFKDLVDEVVIGESSGLRVKPS 1238

Query: 140  VRRRKLFGK 114
            VRRRKLFGK
Sbjct: 1239 VRRRKLFGK 1247

>ref|XP_003528229.1| PREDICTED: uncharacterized protein LOC100805643 [Glycine max]
          Length = 1092

 Score = 592 bits (1526), Expect = e-167
 Identities = 321/429 (74%), Positives = 348/429 (81%), Gaps = 2/429 (0%)
 Frame = -2

Query: 1394 SDSNKPRRQSGKKATDSGSPGGKVRAKVLNSQHSDEQLSEISNESRSLSFQGDEISLQSD 1215
            SDSNKPRRQSGKKAT+ GSPGG+ R K LN  H DEQLSEISNESRSLS QGD +SLQSD
Sbjct: 674  SDSNKPRRQSGKKATELGSPGGRQRPKSLNLPHGDEQLSEISNESRSLSCQGDGVSLQSD 733

Query: 1214 SITVNSKMDMEVTSSLRSCEINDSQSPSLKAMKQLVSETVQKKSTPRLDEDETIAELATV 1035
            S+TVNSKMDMEVTSSLR+ EI+DS+SPSLKA K+L+SETVQKKSTPRLDE+ET+AELAT
Sbjct: 734  SLTVNSKMDMEVTSSLRTVEIDDSRSPSLKAAKRLISETVQKKSTPRLDEEETVAELATD 793

Query: 1034 APEHPSPISVLDGGSVYRDDVSSPVKQISRVPKADDAQESQENEVKDQWKPADSLSFSST 855
            APEHPSPISVLD GSVYRDDV SPVKQIS        ++S+ENE+KDQW P DSLSF+ST
Sbjct: 794  APEHPSPISVLD-GSVYRDDVPSPVKQIS--------EDSKENEIKDQWNPEDSLSFNST 844

Query: 854  GSGEINRKKLQSIDHLVQKLRRLNSSHDEARIDYIASLCENTNPDHRYISEIXXXXXXXX 675
            G  EINRKKLQ+I+HLVQKLRRLNSSHDEARIDYIASLCENTNPDHRYISEI
Sbjct: 845  GPLEINRKKLQNINHLVQKLRRLNSSHDEARIDYIASLCENTNPDHRYISEILLASGLLL 904

Query: 674  XXXXXXXLTFQLHSLGNPINPELFLVLEQTXXXXXXXXXXXSPGKVAFLKQNTEKLHRKL 495
                   LTFQLHS  +PINPELFLVLEQT            PGK A  K N EK HRKL
Sbjct: 905  RDLSSELLTFQLHSSVHPINPELFLVLEQTKASSLLSKEESIPGKDANSKLNKEKFHRKL 964

Query: 494  IFDAVNEILGAKLGSSPEPWFQP--NRLTKKTLSAQKLLKELCFEIEKAQAKEPECCLXX 321
            IFD+VNEILGAK  SSPEPW QP  NRLTKKTLSAQKLLKELCFEIEK QAK+ EC L
Sbjct: 965  IFDSVNEILGAKFSSSPEPWIQPNSNRLTKKTLSAQKLLKELCFEIEKIQAKKTECSL-E 1023

Query: 320  XXXDGLKCMLWEDVMHGSESWENFTGELPGVVLDVERLVFKDLVDEIVIGEAAGLRVKSS 141
               DGLK +L EDV+HGSESW +F G LPGVVLDVERL+FKDLVDE+VIGE+ GLRVKS
Sbjct: 1024 EEDDGLKNILCEDVLHGSESWTDFHGYLPGVVLDVERLIFKDLVDEVVIGESTGLRVKSL 1083

Query: 140  VRRRKLFGK 114
            VRRRKLFGK
Sbjct: 1084 VRRRKLFGK 1092

>ref|XP_003525442.1| PREDICTED: uncharacterized protein LOC100775311 [Glycine max]
          Length = 1051

 Score = 461 bits (1187), Expect = e-127
 Identities = 263/421 (62%), Positives = 302/421 (71%)
 Frame = -2

Query: 1394 SDSNKPRRQSGKKATDSGSPGGKVRAKVLNSQHSDEQLSEISNESRSLSFQGDEISLQSD 1215
            SDSN PRRQS K+ T+SGSP  K+R KV NS +SD++LSE SNE RSLS Q DEISLQSD
Sbjct: 637  SDSNNPRRQSCKQTTESGSPSRKLRPKVANSWYSDDRLSETSNELRSLSSQWDEISLQSD 696

Query: 1214 SITVNSKMDMEVTSSLRSCEINDSQSPSLKAMKQLVSETVQKKSTPRLDEDETIAELATV 1035
            SITV+SKMD+EVTSSL+S +  DSQ  S+KA + LVS +  KKST R DEDE+IAE AT
Sbjct: 697  SITVDSKMDIEVTSSLQSDDTIDSQFRSMKANEHLVSGSTHKKSTLRWDEDESIAEPATD 756

Query: 1034 APEHPSPISVLDGGSVYRDDVSSPVKQISRVPKADDAQESQENEVKDQWKPADSLSFSST 855
            A +HPS  SV D  SVY+ D+ SPVK  S  PKAD+ QE + N+  D W PAD    ++T
Sbjct: 757  ASDHPSLDSV-DDVSVYKYDMPSPVKSKSNAPKADNGQEYKANDNTDHWNPADGFFVNNT 815

Query: 854  GSGEINRKKLQSIDHLVQKLRRLNSSHDEARIDYIASLCENTNPDHRYISEIXXXXXXXX 675
                INRKK QS+D L+QKLR+LNSSHDE RIDYIASLCENTNPDHRYI+EI
Sbjct: 816  ----INRKKFQSVDCLIQKLRQLNSSHDETRIDYIASLCENTNPDHRYIAEILLASGLLL 871

Query: 674  XXXXXXXLTFQLHSLGNPINPELFLVLEQTXXXXXXXXXXXSPGKVAFLKQNTEKLHRKL 495
                   LTFQ HS G+PINPELFLVLEQT           S GKVA+++ NTEK HRKL
Sbjct: 872  RALSSELLTFQHHSSGHPINPELFLVLEQTKLSSLLSKDESSFGKVAYMRLNTEKWHRKL 931

Query: 494  IFDAVNEILGAKLGSSPEPWFQPNRLTKKTLSAQKLLKELCFEIEKAQAKEPECCLXXXX 315
            IFDAVNEILG KLGS  EP  +PN L  K +SAQKLLKELCFE++K Q  +P+C L
Sbjct: 932  IFDAVNEILGEKLGSFVEPCLKPNGLATKFVSAQKLLKELCFEVQKLQYVKPDCSL-EDE 990

Query: 314  XDGLKCMLWEDVMHGSESWENFTGELPGVVLDVERLVFKDLVDEIVIGEAAGLRVKSSVR 135
             DGLK ML EDVM  SE+W  F GELPGVVLDVERL+FKDL+DE VI E A LRVK S
Sbjct: 991  GDGLKSMLREDVMCHSENWTGFPGELPGVVLDVERLIFKDLIDEFVIDEMASLRVKFSKH 1050

Query: 134  R 132
            R
Sbjct: 1051 R 1051

>ref|XP_002328635.1| predicted protein [Populus trichocarpa]
 gi|222838811|gb|EEE77162.1| predicted protein [Populus trichocarpa]
          Length = 1027

 Score = 449 bits (1155), Expect = e-124
 Identities = 255/433 (58%), Positives = 298/433 (68%), Gaps = 6/433 (1%)
 Frame = -2

Query: 1394 SDSNKPRRQSGKKATDSGSPGGKVRAKVLNSQHSDEQLSEISNESRSLSFQGDEISLQSD 1215
            SD++K R QS ++ T+ GSPG K R K      SD+QLS+ISNESR+ S QGD+ISLQSD
Sbjct: 598  SDTSKQRTQSNRQPTEIGSPGRKHRVKYPKVPPSDDQLSQISNESRTSSHQGDDISLQSD 657

Query: 1214 SITVNSKMDMEVTSSLRSCEINDSQSPSLKAMKQLVSETVQKKSTPRLDEDETIAELATV 1035
              T + K DMEVTS+ RS +    QSP+L A  +LVS ++QKKST   +ED T AELA V
Sbjct: 658  GTTFDLKTDMEVTSTERSTDNYSGQSPTLNAASRLVSGSLQKKSTFMFEEDRTSAELAVV 717

Query: 1034 APEHPSPISVLDGGSVYRDDVSSPVKQISRVPKADDAQESQENEVKDQWKPADSLSFSST 855
            APEHPSP+SVLD  SVYRDD  SPVKQ+  + K D  ++    + +DQW PAD+L  +S
Sbjct: 718  APEHPSPVSVLD-ASVYRDDALSPVKQMPNLIKGDVPKDFHYQQSEDQWNPADNLLSNSV 776

Query: 854  GSG---EINRKKLQSIDHLVQKLRRLNSSHDEARIDYIASLCENTNPDHRYISEIXXXXX 684
             SG   +INRKKLQ I++LVQKLR+LNS+HDE+  DYIASLCENTNPDHRYISEI
Sbjct: 777  ASGLSSDINRKKLQKIENLVQKLRQLNSTHDESSTDYIASLCENTNPDHRYISEILLASG 836

Query: 683  XXXXXXXXXXLTFQLHSLGNPINPELFLVLEQTXXXXXXXXXXXSPGKVAFLKQNTEKLH 504
                       TFQLH  G+PINPELF VLEQT           SPGK    K N EK H
Sbjct: 837  LLLRDLSSGLSTFQLHPSGHPINPELFFVLEQTKASNLVSKEECSPGKSFHSKPNPEKFH 896

Query: 503  RKLIFDAVNEILGAKLG---SSPEPWFQPNRLTKKTLSAQKLLKELCFEIEKAQAKEPEC 333
            RKLIFDAVNEIL  KL     SPEPW + ++L KKTLSAQKLLKELC E+E+   K+ EC
Sbjct: 897  RKLIFDAVNEILVKKLALVEPSPEPWLKSDKLAKKTLSAQKLLKELCSEMEQLLVKKSEC 956

Query: 332  CLXXXXXDGLKCMLWEDVMHGSESWENFTGELPGVVLDVERLVFKDLVDEIVIGEAAGLR 153
             L     DGLK +L  DVMH SESW +F  E  GVVLDVERLVFKDLVDEIVIGEAAG+R
Sbjct: 957  SL--EEEDGLKSILCYDVMHRSESWIDFHSETSGVVLDVERLVFKDLVDEIVIGEAAGIR 1014

Query: 152  VKSSVRRRKLFGK 114
             K    RR+LFGK
Sbjct: 1015 TKPGRSRRQLFGK 1027

>ref|XP_002514640.1| conserved hypothetical protein [Ricinus communis]
 gi|223546244|gb|EEF47746.1| conserved hypothetical protein [Ricinus communis]
          Length = 1094

 Score = 449 bits (1154), Expect = e-123
 Identities = 256/434 (58%), Positives = 298/434 (68%), Gaps = 7/434 (1%)
 Frame = -2

Query: 1394 SDSNKPRRQSGKKATDSGSPGGKVRAKVLNSQHSDEQLSEISNESRSLSFQGDEISLQSD 1215
            SDSNKPRRQS K   + GSPGGK R K      SD+QLS+ISNESR+ S QGD+ISLQSD
Sbjct: 669  SDSNKPRRQSKKMLNELGSPGGKNRPKSHKLPTSDDQLSQISNESRTSSHQGDDISLQSD 728

Query: 1214 SITV-NSKMDMEVTSSLRSCEINDSQSPSLKAMKQLVSETVQKKSTPRLDEDETIAELAT 1038
            +  V + K DMEVTS+ +  E+N   SPS  A+  +VS + Q   TPRL+ED T+A+ A
Sbjct: 729  NTVVFDLKTDMEVTSTEQPNELNIDHSPSSNAVSHVVSGSKQNNPTPRLEEDGTLADFAV 788

Query: 1037 VAPEHPSPISVLDGGSVYRDDVSSPVKQISRVPKADDAQESQENEVKDQWKPADSLSFSS 858
              PEHPSPISVLD  SVYRDD  SPVKQI  +PK D A+ S     KDQW PAD+    S
Sbjct: 789  DTPEHPSPISVLDA-SVYRDDALSPVKQIPNLPKGDSAEAS-----KDQWDPADNFLSDS 842

Query: 857  TGS---GEINRKKLQSIDHLVQKLRRLNSSHDEARIDYIASLCENTNPDHRYISEIXXXX 687
             GS    EI+RKKLQ++++LV+KLRRLNS+HDEA  DYIASLCENTNPDHRYISEI
Sbjct: 843  VGSVLTSEISRKKLQNVENLVKKLRRLNSTHDEASTDYIASLCENTNPDHRYISEILLAS 902

Query: 686  XXXXXXXXXXXLTFQLHSLGNPINPELFLVLEQTXXXXXXXXXXXSPGKVAFLKQNTEKL 507
                        TFQLHS G+PINPELF VLEQT           +PGK    K N E+
Sbjct: 903  GLLLRDLGSGMTTFQLHSSGHPINPELFFVLEQTKASTLASKEECNPGKTYHSKPNPERF 962

Query: 506  HRKLIFDAVNEILGAKLG---SSPEPWFQPNRLTKKTLSAQKLLKELCFEIEKAQAKEPE 336
            HRKLIFDAVNE++  KL     SPEPW + ++L KKTLSAQKLLKELC EIE+ Q K+ E
Sbjct: 963  HRKLIFDAVNEMIVKKLALEEQSPEPWLKSDKLAKKTLSAQKLLKELCSEIEQLQDKKSE 1022

Query: 335  CCLXXXXXDGLKCMLWEDVMHGSESWENFTGELPGVVLDVERLVFKDLVDEIVIGEAAGL 156
            C L     D LK +LW+DVM  SESW +F  EL GVVLDVER +FKDLVDEIVIGEAAG
Sbjct: 1023 CSL-EDEEDDLKGVLWDDVMRRSESWTDFHSELSGVVLDVERSIFKDLVDEIVIGEAAGS 1081

Query: 155  RVKSSVRRRKLFGK 114
            R+K   RRR+LF K
Sbjct: 1082 RIKPG-RRRQLFAK 1094