BLASTX nr result
ID: Glycyrrhiza23_contig00007776
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Glycyrrhiza23_contig00007776 (1980 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003527209.1| PREDICTED: polyadenylate-binding protein, cy... 422 e-115 ref|XP_003541062.1| PREDICTED: polyadenylate-binding protein, cy... 419 e-114 ref|XP_004137219.1| PREDICTED: uncharacterized protein LOC101207... 369 1e-99 ref|XP_002300424.1| predicted protein [Populus trichocarpa] gi|2... 361 4e-97 ref|NP_188119.1| RNA recognition motif-containing protein [Arabi... 353 1e-94 >ref|XP_003527209.1| PREDICTED: polyadenylate-binding protein, cytoplasmic and nuclear-like [Glycine max] Length = 425 Score = 422 bits (1084), Expect = e-115 Identities = 216/259 (83%), Positives = 231/259 (89%), Gaps = 1/259 (0%) Frame = -3 Query: 1849 MDPLKKRKLDENGFGGGEP-DHLKLSPSDARKMIERFTTDQLLDILQDAVSRHSDVLAAV 1673 MDP KKRKLDENGF + D LKLSPS+ RK+IERFT+DQLLDILQDAV+RH DVLAAV Sbjct: 1 MDPTKKRKLDENGFNNNDSSDPLKLSPSEVRKLIERFTSDQLLDILQDAVARHLDVLAAV 60 Query: 1672 RSVADPDVSQRKLFIRGLGWDTTTDGLRSLFSAYGELEEAVVILDKATGKSKGYGFVTFR 1493 R+VADPDVSQRKLFIRGLGWDTTTDGLRSLFS +G+LEEAVVILDKATGKSKGYGFVTFR Sbjct: 61 RAVADPDVSQRKLFIRGLGWDTTTDGLRSLFSTFGDLEEAVVILDKATGKSKGYGFVTFR 120 Query: 1492 HVDGALLALREPSKRIDGRVTVTQLXXXXXXXXXXXXADIALRKIYVANVPPDLPADKLL 1313 HVDGALLALREPSKRIDGRVTVTQL AD+ALRKIYVANVPPDLPADKLL Sbjct: 121 HVDGALLALREPSKRIDGRVTVTQLAAAGNSASNVNPADVALRKIYVANVPPDLPADKLL 180 Query: 1312 AHFSLYGEIEEGPLGFDKQTGKSKGFALFVYKTPEGAQSALLDPVKTVEGRQLNCKLAIT 1133 AHFS+YGEIEEGPLGFDKQTGKSKGFALFVYK+PEGAQ+AL+DPVKTVEGRQL+CKLAIT Sbjct: 181 AHFSVYGEIEEGPLGFDKQTGKSKGFALFVYKSPEGAQAALIDPVKTVEGRQLSCKLAIT 240 Query: 1132 DGKQGKRGAGAGPDAVQGH 1076 DGKQGKR GPD+ Q H Sbjct: 241 DGKQGKR---VGPDSAQAH 256 Score = 90.5 bits (223), Expect = 1e-15 Identities = 42/62 (67%), Positives = 47/62 (75%), Gaps = 2/62 (3%) Frame = -3 Query: 697 LYRLPXXXXXXXXXG--YPESGHYSLSASSGYQNQHHHPPTGASPVPRVPPGSMYPNVPP 524 +YRLP YP+SGHY LSASSGYQNQHH PP+G SP+PRVPPGSMYPN+PP Sbjct: 365 MYRLPGSGGMPAGGTGGYPDSGHYGLSASSGYQNQHH-PPSGTSPMPRVPPGSMYPNMPP 423 Query: 523 YY 518 YY Sbjct: 424 YY 425 >ref|XP_003541062.1| PREDICTED: polyadenylate-binding protein, cytoplasmic and nuclear-like [Glycine max] Length = 437 Score = 419 bits (1078), Expect = e-114 Identities = 214/259 (82%), Positives = 230/259 (88%), Gaps = 1/259 (0%) Frame = -3 Query: 1849 MDPLKKRKLDENGF-GGGEPDHLKLSPSDARKMIERFTTDQLLDILQDAVSRHSDVLAAV 1673 MDP KKRKLDENGF + LKLSPSDARK+IERFTTDQLLDILQD V+RH DVLAAV Sbjct: 1 MDPTKKRKLDENGFINNDSSEPLKLSPSDARKLIERFTTDQLLDILQDTVARHPDVLAAV 60 Query: 1672 RSVADPDVSQRKLFIRGLGWDTTTDGLRSLFSAYGELEEAVVILDKATGKSKGYGFVTFR 1493 R+V+DPDVSQRKLFIRGLGWDTTTDGLRSLFS YG+LEEAVVILDKATGKSKGYGFVTFR Sbjct: 61 RAVSDPDVSQRKLFIRGLGWDTTTDGLRSLFSTYGDLEEAVVILDKATGKSKGYGFVTFR 120 Query: 1492 HVDGALLALREPSKRIDGRVTVTQLXXXXXXXXXXXXADIALRKIYVANVPPDLPADKLL 1313 HVDGALLALREPSKRIDGRVTVTQL D+ALRKIYVANVPPDLPADKLL Sbjct: 121 HVDGALLALREPSKRIDGRVTVTQLAAAGNSALNANAVDVALRKIYVANVPPDLPADKLL 180 Query: 1312 AHFSLYGEIEEGPLGFDKQTGKSKGFALFVYKTPEGAQSALLDPVKTVEGRQLNCKLAIT 1133 AHFS+YGEIEEGPLGFDKQTGKSKGFALFVYK+PEGA++AL+DP+KTVEGRQL+CKLAIT Sbjct: 181 AHFSVYGEIEEGPLGFDKQTGKSKGFALFVYKSPEGAKAALIDPMKTVEGRQLSCKLAIT 240 Query: 1132 DGKQGKRGAGAGPDAVQGH 1076 DGKQGKR +GPD+ Q H Sbjct: 241 DGKQGKR---SGPDSGQAH 256 Score = 85.9 bits (211), Expect = 4e-14 Identities = 44/61 (72%), Positives = 47/61 (77%), Gaps = 1/61 (1%) Frame = -3 Query: 697 LYRLPXXXXXXXXXG-YPESGHYSLSASSGYQNQHHHPPTGASPVPRVPPGSMYPNVPPY 521 +YRLP G YP+SGHY LSASSGYQNQ HHPP+GASPVPRVPP MYPNVPPY Sbjct: 380 MYRLPGSGGMPAGGGGYPDSGHYGLSASSGYQNQ-HHPPSGASPVPRVPP--MYPNVPPY 436 Query: 520 Y 518 Y Sbjct: 437 Y 437 >ref|XP_004137219.1| PREDICTED: uncharacterized protein LOC101207569 [Cucumis sativus] gi|449483196|ref|XP_004156519.1| PREDICTED: uncharacterized protein LOC101231665 [Cucumis sativus] Length = 434 Score = 369 bits (948), Expect = 1e-99 Identities = 187/259 (72%), Positives = 215/259 (83%) Frame = -3 Query: 1849 MDPLKKRKLDENGFGGGEPDHLKLSPSDARKMIERFTTDQLLDILQDAVSRHSDVLAAVR 1670 MD KKR++DENG E +++P DARK+I+RFT DQL+DILQDAVSRH DVL AVR Sbjct: 1 MDVTKKRRMDENGVDSSESSFSRITPEDARKIIDRFTPDQLIDILQDAVSRHLDVLDAVR 60 Query: 1669 SVADPDVSQRKLFIRGLGWDTTTDGLRSLFSAYGELEEAVVILDKATGKSKGYGFVTFRH 1490 S+AD DVSQRKLFIRGL DT+T+GLRSLFS+YGELEEAVVI+DKATGKSKGYGFVTF+H Sbjct: 61 SIADRDVSQRKLFIRGLSCDTSTEGLRSLFSSYGELEEAVVIIDKATGKSKGYGFVTFKH 120 Query: 1489 VDGALLALREPSKRIDGRVTVTQLXXXXXXXXXXXXADIALRKIYVANVPPDLPADKLLA 1310 VDGALLAL+EPSK IDGRVTVTQL AD++LRKIYVANVP D+PADKLLA Sbjct: 121 VDGALLALKEPSKTIDGRVTVTQLAAVGISGQNSNAADLSLRKIYVANVPMDMPADKLLA 180 Query: 1309 HFSLYGEIEEGPLGFDKQTGKSKGFALFVYKTPEGAQSALLDPVKTVEGRQLNCKLAITD 1130 HFSLYGEIEEGPLGFDKQTGK +G+ALFVYK PEGAQ+AL+DP+KT++GRQL+CK A D Sbjct: 181 HFSLYGEIEEGPLGFDKQTGKCRGYALFVYKKPEGAQAALVDPIKTIDGRQLSCKFA-ND 239 Query: 1129 GKQGKRGAGAGPDAVQGHG 1073 GK+GK G G + QG G Sbjct: 240 GKKGKPGGGPDGNQTQGAG 258 Score = 86.7 bits (213), Expect = 2e-14 Identities = 39/60 (65%), Positives = 44/60 (73%) Frame = -3 Query: 697 LYRLPXXXXXXXXXGYPESGHYSLSASSGYQNQHHHPPTGASPVPRVPPGSMYPNVPPYY 518 LYRLP GYP+SGHYS+S++SG+ NQHH P G SP PRVPPG MYPNVPPYY Sbjct: 376 LYRLPQSSVGMPSGGYPDSGHYSMSSASGHPNQHHQP-AGTSPAPRVPPGGMYPNVPPYY 434 >ref|XP_002300424.1| predicted protein [Populus trichocarpa] gi|222847682|gb|EEE85229.1| predicted protein [Populus trichocarpa] Length = 310 Score = 361 bits (926), Expect = 4e-97 Identities = 189/273 (69%), Positives = 216/273 (79%), Gaps = 14/273 (5%) Frame = -3 Query: 1849 MDPLKKRKLDENGFGGGEPD---HLKLSPSDARKMIERFTTDQLLDILQDAVSRHSDVLA 1679 MDP KKRKL+ENG D KL+P DARKM+ERFT DQLLDILQ+AV RH D+L Sbjct: 1 MDPTKKRKLEENGIVSSTTDLDSPYKLTPQDARKMMERFTPDQLLDILQNAVVRHPDILE 60 Query: 1678 AVRSVADPDVSQRKLFIRGLGWDTTTDGLRSLFSAYGELEEAVVILDKATGKSKGYGFVT 1499 AVRS+ADPD +QRKLFIRGLGW+TTT+ LR+LFS YGELEEAVVILDK TGKSKGYGFV Sbjct: 61 AVRSIADPDATQRKLFIRGLGWETTTENLRNLFSTYGELEEAVVILDKNTGKSKGYGFVI 120 Query: 1498 FRHVDGALLALREPSKRIDGRVTVTQL--------XXXXXXXXXXXXADIALRKIYVANV 1343 ++HVDGALLAL+EPSK+IDGRVTVTQL D+A+RKIYVANV Sbjct: 121 YKHVDGALLALKEPSKKIDGRVTVTQLAIAGNSGANNNNNSSANPGVVDVAMRKIYVANV 180 Query: 1342 PPDLPADKLLAHFSLYGEIEEGPLGFDKQTGKSKGFALFVYKTPEGAQSALLDPVKTVEG 1163 P ++P+DKLL HF+ YGEIEEGPLGFDKQTGKSKGFALFVYKT EGAQ+ALL+PVK +EG Sbjct: 181 PYEMPSDKLLNHFAQYGEIEEGPLGFDKQTGKSKGFALFVYKTAEGAQAALLEPVKMIEG 240 Query: 1162 RQLNCKLAITDGKQGKR---GAGAGPDAVQGHG 1073 RQLNCKLAI DGK+G++ G G G D +QG G Sbjct: 241 RQLNCKLAI-DGKRGRQPGGGQGPGQDGLQGQG 272 >ref|NP_188119.1| RNA recognition motif-containing protein [Arabidopsis thaliana] gi|42572443|ref|NP_974317.1| RNA recognition motif-containing protein [Arabidopsis thaliana] gi|8777484|dbj|BAA97064.1| unnamed protein product [Arabidopsis thaliana] gi|17380744|gb|AAL36202.1| putative RNA-binding protein [Arabidopsis thaliana] gi|20259621|gb|AAM14167.1| putative RNA-binding protein [Arabidopsis thaliana] gi|222422941|dbj|BAH19456.1| AT3G15010 [Arabidopsis thaliana] gi|332642081|gb|AEE75602.1| RNA recognition motif-containing protein [Arabidopsis thaliana] gi|332642082|gb|AEE75603.1| RNA recognition motif-containing protein [Arabidopsis thaliana] Length = 404 Score = 353 bits (905), Expect = 1e-94 Identities = 181/265 (68%), Positives = 208/265 (78%), Gaps = 5/265 (1%) Frame = -3 Query: 1849 MDPLKKRKLDENGFG-----GGEPDHLKLSPSDARKMIERFTTDQLLDILQDAVSRHSDV 1685 MD +KKRKLDENG G GG +LSP DARK+IERFTTDQLLD+LQ+A+ RH DV Sbjct: 1 MDMMKKRKLDENGNGLNTNGGGTIGPTRLSPQDARKIIERFTTDQLLDLLQEAIVRHPDV 60 Query: 1684 LAAVRSVADPDVSQRKLFIRGLGWDTTTDGLRSLFSAYGELEEAVVILDKATGKSKGYGF 1505 L +VR AD D+SQRKLFIRGL DTTT+GLRSLFS+YG+LEEA+VILDK TGKSKGYGF Sbjct: 61 LESVRLTADSDISQRKLFIRGLAADTTTEGLRSLFSSYGDLEEAIVILDKVTGKSKGYGF 120 Query: 1504 VTFRHVDGALLALREPSKRIDGRVTVTQLXXXXXXXXXXXXADIALRKIYVANVPPDLPA 1325 VTF HVDGALLAL+EPSK+IDGRVTVTQL ADI++RKIYVANVP D+PA Sbjct: 121 VTFMHVDGALLALKEPSKKIDGRVTVTQLAASGNQGTGSQIADISMRKIYVANVPFDMPA 180 Query: 1324 DKLLAHFSLYGEIEEGPLGFDKQTGKSKGFALFVYKTPEGAQSALLDPVKTVEGRQLNCK 1145 D+LL HF YG++EEGPLGFDK TGKS+GFALFVYKT EGAQ+AL DPVK ++G+ LNCK Sbjct: 181 DRLLNHFMAYGDVEEGPLGFDKVTGKSRGFALFVYKTAEGAQAALADPVKVIDGKHLNCK 240 Query: 1144 LAITDGKQGKRGAGAGPDAVQGHGN 1070 LA+ K GK G D GHG+ Sbjct: 241 LAVDGKKGGKPGMPQAQDGGSGHGH 265 Score = 68.9 bits (167), Expect = 5e-09 Identities = 31/45 (68%), Positives = 33/45 (73%) Frame = -3 Query: 652 YPESGHYSLSASSGYQNQHHHPPTGASPVPRVPPGSMYPNVPPYY 518 YPESGHY LS+S+GY QHH G SPVPRVP G MYPN PP Y Sbjct: 361 YPESGHYGLSSSAGYPGQHHQA-VGTSPVPRVPHGGMYPNGPPNY 404