BLASTX nr result

ID: Glycyrrhiza23_contig00021827 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00021827
         (1309 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003622783.1| hypothetical protein MTR_7g052250 [Medicago ...   580   e-163
ref|XP_003532247.1| PREDICTED: UPF0361 protein C3orf37 homolog [...   552   e-155
ref|XP_002303080.1| predicted protein [Populus trichocarpa] gi|2...   432   e-118
ref|XP_003635244.1| PREDICTED: UPF0361 protein C3orf37 homolog [...   419   e-115
ref|XP_002527247.1| conserved hypothetical protein [Ricinus comm...   406   e-111

>ref|XP_003622783.1| hypothetical protein MTR_7g052250 [Medicago truncatula]
            gi|355497798|gb|AES79001.1| hypothetical protein
            MTR_7g052250 [Medicago truncatula]
          Length = 354

 Score =  580 bits (1495), Expect = e-163
 Identities = 284/355 (80%), Positives = 312/355 (87%), Gaps = 2/355 (0%)
 Frame = -2

Query: 1266 MCGRGRCTLRADDIPRACHRTVAPARVLHVDRYRPSYNVSPGFNLPVVRRED--ASDSEG 1093
            MCGR RC+LRADD+PRACHRT AP+R+LH+DRYRPS NVSPGFN+PVVRRED  +++S+G
Sbjct: 1    MCGRTRCSLRADDVPRACHRTTAPSRLLHIDRYRPSNNVSPGFNIPVVRREDNASAESDG 60

Query: 1092 YAVHWMKWGLIPSFTKKSEKPDHYKMFNARSESIDEKASFRRLLPKNRCLVAFEGFYEWK 913
            + VH MKWGLIPSFTKK++KPDHYKMFNARSESIDEKASFRRLLPKNRCLVA EGFYEWK
Sbjct: 61   HVVHCMKWGLIPSFTKKTDKPDHYKMFNARSESIDEKASFRRLLPKNRCLVAVEGFYEWK 120

Query: 912  KDGSRKQPYYIHFKDGRPLVFAALYDSWQNSEDEILHTFTIVTTSSSSALQWLHDRMPVI 733
            KDGS+KQPYYIHFKDGRPLVFAALYDSWQNSE EIL+TFTIVTTSSSSA +WLHDRMPVI
Sbjct: 121  KDGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGEILYTFTIVTTSSSSAFKWLHDRMPVI 180

Query: 732  LGDKDSTDTWLSSPASSCKSVLKPYEESDLVWYPVTPAMGKPSFDGPECIKEIQVKAEGN 553
            LGDKD+TDTWLSS ASS KSV+KPYEESDLVWYPVTPAMGKPSFDGPECIKEIQ+K EG 
Sbjct: 181  LGDKDTTDTWLSS-ASSFKSVMKPYEESDLVWYPVTPAMGKPSFDGPECIKEIQIKTEGY 239

Query: 552  TPISAFFSRKGAEVEDTEPDHKILCREFVKTEHTKDQSEGAKTEEGDDDLKSSGSSHSQN 373
             PIS FFS+K AEVEDT+P+HKIL  E VKTE TKD SE AKTEEGD DLKSSG S SQN
Sbjct: 240  IPISKFFSKKEAEVEDTKPEHKILSHEPVKTEQTKDVSEEAKTEEGDTDLKSSGISPSQN 299

Query: 372  VTKFPIKREYEAFSADSKPFIANDDQVSAYPXXXXXXXXXXXXKQPTLFSYFGKK 208
            V +F IKREY+A S+DSKP +AN+DQVSA P            KQPTLFSYFGK+
Sbjct: 300  VNRFAIKREYDAISSDSKPSLANNDQVSANPAKKKEKAKTADDKQPTLFSYFGKR 354


>ref|XP_003532247.1| PREDICTED: UPF0361 protein C3orf37 homolog [Glycine max]
          Length = 382

 Score =  552 bits (1422), Expect = e-155
 Identities = 274/379 (72%), Positives = 301/379 (79%), Gaps = 27/379 (7%)
 Frame = -2

Query: 1266 MCGRGRCTLRADDIPRACHRTVAPARVLHVDRYRPSYNVSPGFNLPVVRREDASDSEGYA 1087
            MCGR RCTLRADD+PRACHR+ +P R LH+DRYRP+YNVSPGF++PVVRR+DAS  EGY 
Sbjct: 1    MCGRARCTLRADDVPRACHRSTSPTRTLHIDRYRPAYNVSPGFDVPVVRRDDASGGEGYV 60

Query: 1086 VHWMKWGLIPSFTKKSEKPDHYKMFNARSESIDEKASFRRLLPKNRCLVAFEGFYEWKKD 907
            +  MKWGLIPSFTKK+EKPDHY+MFNARSESIDEKASFRRLLPK+RCLVA EGFYEWKKD
Sbjct: 61   LQCMKWGLIPSFTKKTEKPDHYRMFNARSESIDEKASFRRLLPKSRCLVAVEGFYEWKKD 120

Query: 906  GSRKQPYYIHFKDGRPLVFAALYDSWQNSEDEILHTFTIVTTSSSSALQWLHDRMPVILG 727
            GS+KQPYYIHFKDGRPLVFAALYDSWQNSE E L+TFTIVTTSSSSALQWLHDRMPVILG
Sbjct: 121  GSKKQPYYIHFKDGRPLVFAALYDSWQNSEGETLYTFTIVTTSSSSALQWLHDRMPVILG 180

Query: 726  DKDSTDTWLSSPASSCKSVLKPYEESDLVWYPVTPAMGKPSFDGPECIKEIQVKAEGNTP 547
             K+STD WLSS ASS KSV+KPYEESDLVWYPVT AMGK SFDGPECIKEIQVKA+GNT 
Sbjct: 181  SKESTDIWLSSSASSFKSVMKPYEESDLVWYPVTSAMGKASFDGPECIKEIQVKAQGNTS 240

Query: 546  ISAFFSRKGAEVEDTEPDHKILCREFVKTEHTKDQSEG---------------------- 433
            IS FFS+KG E +DT+P+ K  C E VKTEHT+D +E                       
Sbjct: 241  ISMFFSKKGDESKDTKPEQKASCPEVVKTEHTEDLTESKDTKPEQKTSSHEFVKTEPTED 300

Query: 432  ----AKTEEGDDDLKSSGSSHSQNVTKFPIKREYEAFS-ADSKPFIANDDQVSAYPXXXX 268
                AKTEEG +DLK  GSSHSQNV+  PIKREYE FS ADSKP +AN DQ+S  P    
Sbjct: 301  LRERAKTEEGGNDLKFHGSSHSQNVSMLPIKREYETFSAADSKPALANHDQISPNPAKKK 360

Query: 267  XXXXXXXXKQPTLFSYFGK 211
                    KQPTLFSYFGK
Sbjct: 361  EKAKTANDKQPTLFSYFGK 379


>ref|XP_002303080.1| predicted protein [Populus trichocarpa] gi|222844806|gb|EEE82353.1|
            predicted protein [Populus trichocarpa]
          Length = 367

 Score =  432 bits (1110), Expect = e-118
 Identities = 233/371 (62%), Positives = 269/371 (72%), Gaps = 18/371 (4%)
 Frame = -2

Query: 1266 MCGRGRCTLRADDIPRACHRTVAPARVLHVDRYRPSYNVSPGFNLPVVRREDA------S 1105
            MCGR RCTLRADDIPRACHR  A  R +++DRYRPSYN SPG NL VVRR+DA      S
Sbjct: 1    MCGRARCTLRADDIPRACHRNTATVRSVNMDRYRPSYNASPGSNLAVVRRDDAASGDGAS 60

Query: 1104 DSEGYAVHWMKWGLIPSFTKKSEKPDHYKMFNARSESIDEKASFRRLLPKNRCLVAFEGF 925
              +GYA+H MKWGLIP FTKKSEKPD YKMFNARSES+ EKASFRRL+PK+RCLVA EGF
Sbjct: 61   GGDGYAIHCMKWGLIPGFTKKSEKPDFYKMFNARSESLSEKASFRRLIPKSRCLVAVEGF 120

Query: 924  YEWKKDGSRKQPYYIHFKDGRPLVFAALYDSWQNSEDEILHTFTIVTTSSSSALQWLHDR 745
            YEWKKDGS+KQPYYIHFKDGRPLVFAALYDSWQNSE EIL+TFTIVTT++SSA+QWLH+R
Sbjct: 121  YEWKKDGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGEILYTFTIVTTAASSAIQWLHER 180

Query: 744  MPVILGDKDSTDTWLS-SPASSCKSVLKPYEESDLVWYPVTPAMGKPSFDGPECIKEIQV 568
            MPVILGDK++TDTWLS S  S   +VLKPYE SDLVWYPVTPAMGKPSFDGPECIKEI +
Sbjct: 181  MPVILGDKEATDTWLSVSSNSKFDTVLKPYEHSDLVWYPVTPAMGKPSFDGPECIKEIHL 240

Query: 567  KAEGNTPISAFFSRKGAEVEDTEPDHKILCREF-VKTEHTKDQSEGAKTEEG-------D 412
            K E    IS FFSRK  + E++ P+     +   ++ +  K+++E  +  E        D
Sbjct: 241  KMEEKGTISKFFSRKEFK-EESNPEESTHGKSLKLEPKSVKEENESEEKLETPCSAKTVD 299

Query: 411  DDLKSSGSSHS-QNVTKFPIKREYEAFSADSKPFIANDDQVS--AYPXXXXXXXXXXXXK 241
             DLKS   + S +  TK   KR+ E    DSK  +  D+ V   A P            K
Sbjct: 300  YDLKSELETFSHEGETKCKTKRDREEL-VDSK--LKTDEIVKPRASPAKKKANLKSVDDK 356

Query: 240  QPTLFSYFGKK 208
            QPTL SYFGKK
Sbjct: 357  QPTLLSYFGKK 367


>ref|XP_003635244.1| PREDICTED: UPF0361 protein C3orf37 homolog [Vitis vinifera]
            gi|296090568|emb|CBI40918.3| unnamed protein product
            [Vitis vinifera]
          Length = 392

 Score =  419 bits (1078), Expect = e-115
 Identities = 232/400 (58%), Positives = 263/400 (65%), Gaps = 48/400 (12%)
 Frame = -2

Query: 1266 MCGRGRCTLRADDIPRACHRTVAPARVLHVDRYRPSYNVSPGFNLPVVRREDASDSEGYA 1087
            MCGR RCTLR D+I RAC+    P + + +DRYRPSYNVSPG NLPVVRR   ++ E   
Sbjct: 1    MCGRARCTLRPDNIARACNLNTLPTQNIQMDRYRPSYNVSPGANLPVVRRGGGTEGEEAI 60

Query: 1086 VHWMKWGLIPSFTKKSEKPDHYKMFNARSESIDEKASFRRLLPKNRCLVAFEGFYEWKKD 907
            VH MKWGL+PSFTKKSEKPDHYKMFNARSES+ EKASFRRL+PKNRCLVA EGFYEWKKD
Sbjct: 61   VHCMKWGLVPSFTKKSEKPDHYKMFNARSESVCEKASFRRLVPKNRCLVAVEGFYEWKKD 120

Query: 906  GSRKQPYYIHFKDGRPLVFAALYDSWQNSEDEILHTFTIVTTSSSSALQWLHDRMPVILG 727
            GS+KQPYYIH KDGRPLVFAAL+DSW NSE EIL+T TI+TTSSSSALQWLHDRMPVILG
Sbjct: 121  GSKKQPYYIHLKDGRPLVFAALFDSWANSEGEILYTCTILTTSSSSALQWLHDRMPVILG 180

Query: 726  DKDSTDTWLSSPASS-CKSVLKPYEESDLVWYPVTPAMGKPSFDGPECIKEIQVKAEGNT 550
            DK+STD WL+  +SS   +VLKPYE+ DLVWYPVT AMGKPSF+GPECIKEIQ+K E   
Sbjct: 181  DKESTDAWLNGSSSSQFNTVLKPYEDPDLVWYPVTQAMGKPSFEGPECIKEIQLKNE-QR 239

Query: 549  PISAFFSRKGAEVED---TEP----------------------------DHKILCREFVK 463
            PIS FFS KG + E     EP                            DH   C   + 
Sbjct: 240  PISKFFSTKGIKNEQGLSNEPVKSNLPQSLKEEPAIENSTGLPSSTVKGDHDSTCSRSIP 299

Query: 462  TEHT-------KDQSEGAKTEE-------GDDDLKSSGSSHSQNVTKFPIKREYEAFSAD 325
             E +       K   +  +TE+       GD D K       +  TK PIKR++E FSAD
Sbjct: 300  QEESTWFTNLPKSLKQEPETEDKTGLPFPGDHDSKCD-----EEATKLPIKRDFEEFSAD 354

Query: 324  SKPFIANDDQVS--AYPXXXXXXXXXXXXKQPTLFSYFGK 211
            SKP   N D V   +              KQPTLFSYFGK
Sbjct: 355  SKP---NTDTVEKPSPVTKKGKLNKNAGDKQPTLFSYFGK 391


>ref|XP_002527247.1| conserved hypothetical protein [Ricinus communis]
            gi|223533340|gb|EEF35091.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 409

 Score =  406 bits (1043), Expect = e-111
 Identities = 228/409 (55%), Positives = 265/409 (64%), Gaps = 56/409 (13%)
 Frame = -2

Query: 1266 MCGRGRCTLRADDIPRACHRTVAPARVLHVDRYRPSYNVSPGFNLPVVRRE-DASDS-EG 1093
            MCGR RCTLRADDIPRACHRT  P R +++DR+RPSYNVSPG N+PVV RE D SD  +G
Sbjct: 1    MCGRARCTLRADDIPRACHRTTGPVRSVNMDRWRPSYNVSPGSNMPVVCREGDGSDGGDG 60

Query: 1092 YAVHWMKWGLIPSFTKKSEKPDHYKMFNARSESIDEKASFRRLLPKNRCLVAFEGFYEWK 913
            + V  M WGLIPSFTKK+EKPD YKMFNARSES+ EKASFRRLLPK+RCLVA EGFYEWK
Sbjct: 61   FFVQCMTWGLIPSFTKKTEKPDFYKMFNARSESVGEKASFRRLLPKSRCLVAAEGFYEWK 120

Query: 912  KDGSRKQPYYIHFKDGRPLVFAALYDSWQNSEDEILHTFTIVTTSSSSALQWLHDRMPVI 733
            KDGS+KQPYYIHFKDGRPLVFAALYDSWQNSE EIL+TFTI+TTSSSSAL+WLHDRMPVI
Sbjct: 121  KDGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGEILYTFTILTTSSSSALEWLHDRMPVI 180

Query: 732  LGDKDSTDTWLS-SPASSCKSVLKPYEESDLVWYPVTPAMGKPSFDGPECIKEIQVKAEG 556
            LGDK+STDTWL+ S +S    VL+ YE SDLVW PVTPAMGK SFDGPEC+KEI VK E 
Sbjct: 181  LGDKESTDTWLNGSSSSKYDVVLESYESSDLVWCPVTPAMGKSSFDGPECVKEIHVKTES 240

Query: 555  NTPISAFFSRKGAEVEDTEPDHKILCREFVKTEHTKDQSEGAKTEE----------GDDD 406
             + IS FFSRK  + E      +    + VK +  +   E  ++EE           D D
Sbjct: 241  KSTISKFFSRKEIKGEQELNSRESTFDKSVKMDLPESVKEEYESEEKLDIPPSNQINDQD 300

Query: 405  LKSSGSS-------------HSQ----------------------NVTKFP--------I 355
            LKS+ S+             H +                      NV+K P         
Sbjct: 301  LKSNVSTIPCEDETKCQIPDHDETKCQIPDHDETKCQIPDHDLISNVSKLPHEDATLGQP 360

Query: 354  KREYEAFSADSKPFIANDDQVSAYPXXXXXXXXXXXXKQPTLFSYFGKK 208
            KR +E    D +     ++++   P            KQPTL SYF KK
Sbjct: 361  KRHHEEALIDRELNPDGNEKLRRNPARKKANLKSGGDKQPTLLSYFRKK 409

