BLASTX nr result
ID: Glycyrrhiza23_contig00016525
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Glycyrrhiza23_contig00016525 (1910 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003626608.1| Pentatricopeptide repeat-containing protein ... 755 0.0 ref|XP_002320901.1| predicted protein [Populus trichocarpa] gi|2... 682 0.0 emb|CAN72416.1| hypothetical protein VITISV_027905 [Vitis vinifera] 636 e-180 ref|XP_004138384.1| PREDICTED: pentatricopeptide repeat-containi... 605 e-170 ref|NP_001154199.1| uncharacterized protein [Arabidopsis thalian... 578 e-162 >ref|XP_003626608.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|87240852|gb|ABD32710.1| Tetratricopeptide-like helical [Medicago truncatula] gi|355501623|gb|AES82826.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 451 Score = 755 bits (1950), Expect = 0.0 Identities = 373/424 (87%), Positives = 397/424 (93%), Gaps = 1/424 (0%) Frame = +3 Query: 243 FHSSNNHSCSSPIGSPTRVQKLIASQSDPLLAKEIFDYASLQPNFRHSYSTYLVLILKLG 422 FHSS+ S SSPIGSPTRVQKLIASQSDPLLAKEIFDYASLQPNFRH+YSTYL+LILK G Sbjct: 29 FHSSS--SSSSPIGSPTRVQKLIASQSDPLLAKEIFDYASLQPNFRHNYSTYLILILKFG 86 Query: 423 RFRYFSLVDDLLRRLKSES-QPVTPTLFSYLIRIYGEADLPDKALKTFYTMVQFDCKPLP 599 R ++FSL+DDLLRRLKSES QP+TPTLFSYLI+IYGEA+LPDKAL TFY M+QF+ KPL Sbjct: 87 RSKHFSLLDDLLRRLKSESSQPITPTLFSYLIKIYGEANLPDKALNTFYIMLQFNIKPLT 146 Query: 600 KHLNRILEILVAHRNYVRPAFDLFRDAHRHGVFPSTKSYNIMMRAFCFNGDISIAYNLFN 779 KHLNRIL+ILV+HRNY+RPAFDLF+DAH+HGVFP TKSYNI+MRAFC NGDISIAY LFN Sbjct: 147 KHLNRILDILVSHRNYLRPAFDLFKDAHKHGVFPDTKSYNILMRAFCLNGDISIAYTLFN 206 Query: 780 
KMFKRDVVPDIESYRILMQALCRKSQVNGAVDLLEDMLNKGFVPDSLTYTTLLNSLCRKK 959 KMFKRDVVPDI+SYRILMQALCRKSQVNGAVDL EDMLNKGFVPDS TYTTLLNSLCRKK Sbjct: 207 KMFKRDVVPDIQSYRILMQALCRKSQVNGAVDLFEDMLNKGFVPDSFTYTTLLNSLCRKK 266 Query: 960 KLREAYKLLCRMKVKGCNPDIVHYNTVILGFCREGRAHDACKVIDDMHANGCLPNLVSYR 1139 KLREAYKLLCRMKVKGCNPDIVHYNTVILGFCREGRAHDACKVIDDM ANGCLPNLVSYR Sbjct: 267 KLREAYKLLCRMKVKGCNPDIVHYNTVILGFCREGRAHDACKVIDDMQANGCLPNLVSYR 326 Query: 1140 TLVNSLCDMGMLDEANKYMEEMLSKGFSPHFAVIHSLVKGFCNVGRTEEACGVLTKSLEH 1319 TLVN LC +GMLDEA KY+EEMLSKGFSPHFAVIH+LVKGFCNVGR EEACGVLTKSLEH Sbjct: 327 TLVNGLCHLGMLDEATKYVEEMLSKGFSPHFAVIHALVKGFCNVGRIEEACGVLTKSLEH 386 Query: 1320 GEAPHTDTWMIIVPLICEVEDAVKIGGVLEEVLRIEIKGHTRIVDAGIGLENYLIGKIRA 1499 EAPH DTWMIIVP ICEV+D VKI GVLEEVL+IEIKG TRIVDAGIGLE+YLI KIRA Sbjct: 387 REAPHKDTWMIIVPQICEVDDGVKIDGVLEEVLKIEIKGDTRIVDAGIGLEDYLIRKIRA 446 Query: 1500 KSRQ 1511 KSRQ Sbjct: 447 KSRQ 450 >ref|XP_002320901.1| predicted protein [Populus trichocarpa] gi|222861674|gb|EEE99216.1| predicted protein [Populus trichocarpa] Length = 475 Score = 682 bits (1759), Expect = 0.0 Identities = 331/473 (69%), Positives = 398/473 (84%), Gaps = 6/473 (1%) Frame = +3 Query: 105 MHRPFSKT----LMTVASDGIFLSLRKKQPPVLSGVSSLPHQLQNQKRQ--PFHSSNNHS 266 MH+PF T L+T + + K P L SS PH Q KR+ P S N + Sbjct: 1 MHKPFLVTCKILLLTTPPRTRTVPILPK-PQSLFFYSSSPHHHQQHKRELEPSDSHPNAN 59 Query: 267 CSSPIGSPTRVQKLIASQSDPLLAKEIFDYASLQPNFRHSYSTYLVLILKLGRFRYFSLV 446 SPIGSP+RVQKLIASQSDPLLAKEIFDYAS QPNF+HSYS+YL+LILKLGR +YFS + Sbjct: 60 TKSPIGSPSRVQKLIASQSDPLLAKEIFDYASRQPNFQHSYSSYLILILKLGRAKYFSFI 119 Query: 447 DDLLRRLKSESQPVTPTLFSYLIRIYGEADLPDKALKTFYTMVQFDCKPLPKHLNRILEI 626 DDLL LKS++ PVTPTLFSY+I IYGEA+LPDKALK FYT+++FDC P PKHLN ILEI Sbjct: 120 DDLLTDLKSKNYPVTPTLFSYIINIYGEANLPDKALKIFYTILKFDCNPSPKHLNGILEI 179 Query: 627 LVAHRNYVRPAFDLFRDAHRHGVFPSTKSYNIMMRAFCFNGDISIAYNLFNKMFKRDVVP 806 LV+H+NY++PAFDLF+DAH + VFP+TKSYNI++RAFC NG IS+AY+LFN+MFKRDV+P Sbjct: 180 LVSHQNYIKPAFDLFKDAHTYDVFPNTKSYNILIRAFCLNGQISMAYSLFNQMFKRDVMP 239 Query: 807 
DIESYRILMQALCRKSQVNGAVDLLEDMLNKGFVPDSLTYTTLLNSLCRKKKLREAYKLL 986 D+ESYRILMQALCRKSQVNGAVDLLEDMLNKG+VPD+L+YTTLLNSLCRKKKLREAYKLL Sbjct: 240 DVESYRILMQALCRKSQVNGAVDLLEDMLNKGYVPDALSYTTLLNSLCRKKKLREAYKLL 299 Query: 987 CRMKVKGCNPDIVHYNTVILGFCREGRAHDACKVIDDMHANGCLPNLVSYRTLVNSLCDM 1166 CRMKVKGCNPDI+HYNTVILGFCREGRA DACKV++DM +NGC+PNLVSYRTLV LCD Sbjct: 300 CRMKVKGCNPDIIHYNTVILGFCREGRAMDACKVLEDMESNGCMPNLVSYRTLVGGLCDQ 359 Query: 1167 GMLDEANKYMEEMLSKGFSPHFAVIHSLVKGFCNVGRTEEACGVLTKSLEHGEAPHTDTW 1346 GM DEA ++EEM+ KGFSPHFAV ++L+KGFCNVG+ EEACGV+ + L+HGEAPHT+TW Sbjct: 360 GMFDEAKSHLEEMMMKGFSPHFAVSNALIKGFCNVGKIEEACGVVEELLKHGEAPHTETW 419 Query: 1347 MIIVPLICEVEDAVKIGGVLEEVLRIEIKGHTRIVDAGIGLENYLIGKIRAKS 1505 +++V ICEV+D +IG +L++V ++E+KG TRIV+AGIGLE YLI + + K+ Sbjct: 420 VMMVSRICEVDDLQRIGEILDKVKKVELKGDTRIVEAGIGLEEYLIKRTQQKA 472 >emb|CAN72416.1| hypothetical protein VITISV_027905 [Vitis vinifera] Length = 422 Score = 636 bits (1641), Expect = e-180 Identities = 302/418 (72%), Positives = 360/418 (86%) Frame = +3 Query: 261 HSCSSPIGSPTRVQKLIASQSDPLLAKEIFDYASLQPNFRHSYSTYLVLILKLGRFRYFS 440 H SPIGSP+RVQKLIASQSDPLLAKEIFD ASLQPNF+HSYS++ +LILKLG R FS Sbjct: 5 HVKPSPIGSPSRVQKLIASQSDPLLAKEIFDLASLQPNFKHSYSSFHILILKLGWARQFS 64 Query: 441 LVDDLLRRLKSESQPVTPTLFSYLIRIYGEADLPDKALKTFYTMVQFDCKPLPKHLNRIL 620 L+ DLL RLKSE + P+LFS +I IYGEA+LPD+ALKTF++M+QF KPLPKHLN +L Sbjct: 65 LMQDLLMRLKSEQYSINPSLFSDIIEIYGEANLPDQALKTFHSMLQFHSKPLPKHLNXLL 124 Query: 621 EILVAHRNYVRPAFDLFRDAHRHGVFPSTKSYNIMMRAFCFNGDISIAYNLFNKMFKRDV 800 ++LV+HRNY+RPAFDLF+ AHR+GV P TKSYNI+M AFCFNGD+SIAY LFN+MFKRDV Sbjct: 125 QLLVSHRNYIRPAFDLFKSAHRYGVSPDTKSYNILMSAFCFNGDLSIAYTLFNQMFKRDV 184 Query: 801 VPDIESYRILMQALCRKSQVNGAVDLLEDMLNKGFVPDSLTYTTLLNSLCRKKKLREAYK 980 PD+ESYRILMQ LCRKSQVN AVDLLEDMLNKG+VPD+L+YTTLLNSLCRKKKL+EAYK Sbjct: 185 APDVESYRILMQGLCRKSQVNRAVDLLEDMLNKGYVPDALSYTTLLNSLCRKKKLKEAYK 244 Query: 981 LLCRMKVKGCNPDIVHYNTVILGFCREGRAHDACKVIDDMHANGCLPNLVSYRTLVNSLC 1160 LLCRMKVKGCNPDIVHYNTVILGFCREGR DACKV++DM +NGC PNL+SY TLV+ LC Sbjct: 245 
LLCRMKVKGCNPDIVHYNTVILGFCREGRXLDACKVLEDMPSNGCSPNLMSYGTLVSGLC 304 Query: 1161 DMGMLDEANKYMEEMLSKGFSPHFAVIHSLVKGFCNVGRTEEACGVLTKSLEHGEAPHTD 1340 D G+ DEA Y+EEMLSKGFSPHF+V H+L+ GFCNVG+ EEAC VL + L HGEA HT+ Sbjct: 305 DQGLYDEAKNYVEEMLSKGFSPHFSVFHALINGFCNVGKLEEACEVLXEMLRHGEAXHTE 364 Query: 1341 TWMIIVPLICEVEDAVKIGGVLEEVLRIEIKGHTRIVDAGIGLENYLIGKIRAKSRQS 1514 TW+ I+P ICEV+ V++ + +E L++EI +TR+V+AGIGLE Y+I K+R KSR++ Sbjct: 365 TWVAIIPRICEVDKLVRMENIFDEXLKLEITPNTRLVEAGIGLEEYVIRKVRDKSRKA 422 >ref|XP_004138384.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like [Cucumis sativus] gi|449499186|ref|XP_004160743.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like [Cucumis sativus] Length = 482 Score = 605 bits (1559), Expect = e-170 Identities = 302/469 (64%), Positives = 367/469 (78%), Gaps = 10/469 (2%) Frame = +3 Query: 123 KTLMTVASDGIFLSLRKKQP---PVLSGVSSL--PH-QLQNQKRQPFHSSNNHSCSSP-- 278 +T+ TVA+ + +K P ++S SSL PH + N+ + + + C Sbjct: 12 RTIETVAAAHV----ARKSPLLRNLISSSSSLYQPHLNVHNESKFLITNVKHEQCEDQPD 67 Query: 279 --IGSPTRVQKLIASQSDPLLAKEIFDYASLQPNFRHSYSTYLVLILKLGRFRYFSLVDD 452 IGSP RVQKLIASQSDPLLAKEIFDYA QP+FR S S+ LVLILKLGR +YFSL+DD Sbjct: 68 FSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRSKYFSLIDD 127 Query: 453 LLRRLKSESQPVTPTLFSYLIRIYGEADLPDKALKTFYTMVQFDCKPLPKHLNRILEILV 632 LL KS PVTPT FSY+I+IYGEADLPDKALK FYTM+ F C P K LNRILEILV Sbjct: 128 LLLSFKSRRYPVTPTAFSYIIKIYGEADLPDKALKVFYTMIDFGCTPSSKQLNRILEILV 187 Query: 633 AHRNYVRPAFDLFRDAHRHGVFPSTKSYNIMMRAFCFNGDISIAYNLFNKMFKRDVVPDI 812 +HRN++RPAFDLF++A HGV P+TKSYNI++RAFC+NG+ISIAY LFNKMF+R+V+PD+ Sbjct: 188 SHRNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWNGNISIAYTLFNKMFERNVIPDV 247 Query: 813 ESYRILMQALCRKSQVNGAVDLLEDMLNKGFVPDSLTYTTLLNSLCRKKKLREAYKLLCR 992 E+YR LMQ LCRK+QVNGAVDLLEDMLNKG++PD+L+Y TLLNSLCRKKKLREAYKLLCR Sbjct: 248 ETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCR 307 Query: 993 MKVKGCNPDIVHYNTVILGFCREGRAHDACKVIDDMHANGCLPNLVSYRTLVNSLCDMGM 1172 
MKVKGCNPDI HYNTVI+GFCREGRA DACK+++DM +NGCLPNLVSY +L N LCD GM Sbjct: 308 MKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTNGLCDQGM 367 Query: 1173 LDEANKYMEEMLSKGFSPHFAVIHSLVKGFCNVGRTEEACGVLTKSLEHGEAPHTDTWMI 1352 + A Y+EEM KGF PHF+VIH+LVKGF ++GR E+C VL L+ G+APH+DTW I Sbjct: 368 FELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEI 427 Query: 1353 IVPLICEVEDAVKIGGVLEEVLRIEIKGHTRIVDAGIGLENYLIGKIRA 1499 I+ ICEVED K V E++L+ +++ TRIV+AG GL YLI K++A Sbjct: 428 IISGICEVEDTAKFCEVWEKILKKDVRRDTRIVEAGTGLGEYLIRKLQA 476 >ref|NP_001154199.1| uncharacterized protein [Arabidopsis thaliana] gi|223635643|sp|Q8LDU5.2|PP298_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g01400, mitochondrial; Flags: Precursor gi|332656621|gb|AEE82021.1| uncharacterized protein [Arabidopsis thaliana] Length = 466 Score = 578 bits (1490), Expect = e-162 Identities = 275/445 (61%), Positives = 350/445 (78%), Gaps = 8/445 (1%) Frame = +3 Query: 207 LPHQLQNQKRQPFHSSNNHSC--------SSPIGSPTRVQKLIASQSDPLLAKEIFDYAS 362 L L R F+SS+ H SPIGSPTRVQKLIASQSDPLLAKEIFDYAS Sbjct: 16 LTSPLSTSSRFLFYSSSEHEARKPIVSNPKSPIGSPTRVQKLIASQSDPLLAKEIFDYAS 75 Query: 363 LQPNFRHSYSTYLVLILKLGRFRYFSLVDDLLRRLKSESQPVTPTLFSYLIRIYGEADLP 542 QPNFRHS S++L+LILKLGR RYF+L+DD+L + +S P+T +F+YLI++Y EA LP Sbjct: 76 QQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGEIFTYLIKVYAEAKLP 135 Query: 543 DKALKTFYTMVQFDCKPLPKHLNRILEILVAHRNYVRPAFDLFRDAHRHGVFPSTKSYNI 722 +K L TFY M++F+ P PKHLNRIL++LV+HR Y++ AF+LF+ + HGV P+T+SYN+ Sbjct: 136 EKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFELFKSSRLHGVMPNTRSYNL 195 Query: 723 MMRAFCFNGDISIAYNLFNKMFKRDVVPDIESYRILMQALCRKSQVNGAVDLLEDMLNKG 902 +M+AFC N D+SIAY LF KM +RDVVPD++SY+IL+Q CRK QVNGA++LL+DMLNKG Sbjct: 196 LMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQVNGAMELLDDMLNKG 255 Query: 903 FVPDSLTYTTLLNSLCRKKKLREAYKLLCRMKVKGCNPDIVHYNTVILGFCREGRAHDAC 1082 FVPD L+YTTLLNSLCRK +LREAYKLLCRMK+KGCNPD+VHYNT+ILGFCRE RA DA Sbjct: 256 FVPDRLSYTTLLNSLCRKTQLREAYKLLCRMKLKGCNPDLVHYNTMILGFCREDRAMDAR 315 Query: 1083 
KVIDDMHANGCLPNLVSYRTLVNSLCDMGMLDEANKYMEEMLSKGFSPHFAVIHSLVKGF 1262 KV+DDM +NGC PN VSYRTL+ LCD GM DE KY+EEM+SKGFSPHF+V + LVKGF Sbjct: 316 KVLDDMLSNGCSPNSVSYRTLIGGLCDQGMFDEGKKYLEEMISKGFSPHFSVSNCLVKGF 375 Query: 1263 CNVGRTEEACGVLTKSLEHGEAPHTDTWMIIVPLICEVEDAVKIGGVLEEVLRIEIKGHT 1442 C+ G+ EEAC V+ +++GE H+DTW +++PLIC +++ KI LE+ ++ EI G T Sbjct: 376 CSFGKVEEACDVVEVVMKNGETLHSDTWEMVIPLICNEDESEKIKLFLEDAVKEEITGDT 435 Query: 1443 RIVDAGIGLENYLIGKIRAKSRQSQ 1517 RIVD GIGL +YL K++ K + ++ Sbjct: 436 RIVDVGIGLGSYLSSKLQMKRKNAR 460
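The per-hit statistics lines in the report above ("Score = ... bits (...), Expect = ..., Identities = ..., Positives = ..., Gaps = ..., Frame = ...") can be pulled out with a regular expression. The following is a minimal Python sketch under stated assumptions, not part of the BLAST distribution: the names HitStats, parse_evalue and parse_hit_stats are hypothetical, and the pattern only targets the legacy text layout seen in this report. It treats the Gaps field as optional, because the Vitis vinifera hit has none, and it prefixes "1" to E-values such as "e-180", which this report prints without a mantissa.

```python
import re
from typing import List, NamedTuple


class HitStats(NamedTuple):
    """Statistics of one alignment (hypothetical container, for illustration)."""
    bits: float
    raw_score: int
    evalue: float
    identities: int
    positives: int
    aligned: int
    gaps: int
    frame: int

    @property
    def pct_identity(self) -> float:
        # Unrounded percent identity over the aligned length.
        return 100.0 * self.identities / self.aligned


# Pattern for the statistics block as it appears in this report.
# \s* also matches newlines, so it works on flattened or line-broken text.
_STATS = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[^\s,]+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<aligned>\d+)\s*\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s*\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+\s*\(\d+%\))?\s*"
    r"Frame\s*=\s*(?P<frame>[+-]\d)"
)


def parse_evalue(text: str) -> float:
    """Convert an E-value string; 'e-180' is read as 1e-180."""
    return float("1" + text if text.startswith("e") else text)


def parse_hit_stats(report: str) -> List[HitStats]:
    """Return one HitStats per statistics block found in the report text."""
    hits = []
    for m in _STATS.finditer(report):
        hits.append(
            HitStats(
                bits=float(m.group("bits")),
                raw_score=int(m.group("raw")),
                evalue=parse_evalue(m.group("evalue")),
                identities=int(m.group("ident")),
                positives=int(m.group("pos")),
                aligned=int(m.group("aligned")),
                gaps=int(m.group("gaps") or 0),  # absent field means no gaps
                frame=int(m.group("frame")),
            )
        )
    return hits


if __name__ == "__main__":
    # Two statistics blocks copied from the report above.
    sample = (
        "Score = 755 bits (1950), Expect = 0.0 Identities = 373/424 (87%), "
        "Positives = 397/424 (93%), Gaps = 1/424 (0%) Frame = +3 "
        "Score = 636 bits (1641), Expect = e-180 Identities = 302/418 (72%), "
        "Positives = 360/418 (86%) Frame = +3"
    )
    for hit in parse_hit_stats(sample):
        print(hit, round(hit.pct_identity, 2))
```

Calling parse_hit_stats on the full report text should yield one record per hit, in report order. For the Medicago truncatula hit, pct_identity comes to about 87.97, while the report prints 87%.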