BLASTX nr result
ID: Glycyrrhiza23_contig00023616
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00023616
         (589 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

ref|XP_003532384.1| PREDICTED: pentatricopeptide repeat-containi...   311   7e-83
ref|XP_002318186.1| predicted protein [Populus trichocarpa] gi|2...   271   6e-71
ref|XP_002281032.2| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   260   1e-67
ref|XP_002511070.1| pentatricopeptide repeat-containing protein,...   253   1e-65
ref|XP_002866455.1| hypothetical protein ARALYDRAFT_496347 [Arab...   251   8e-65

>ref|XP_003532384.1| PREDICTED: pentatricopeptide repeat-containing protein
          At5g61800-like [Glycine max]
          Length = 577

 Score = 311 bits (796), Expect = 7e-83
 Identities = 154/195 (78%), Positives = 170/195 (87%)
 Frame = +3

Query: 3   DAHNLFSQSPHRDVVSYNAMIDGFVKSSQISRARELFDEMLVRDSVSWGTMIAGYSHAKL 182
           DAH LF + PH DVVSYNA+I G VK+ QISRARELFDEM VRD +SWGTMIAGYSH KL
Sbjct: 162 DAHKLFYECPHGDVVSYNALIHGLVKTRQISRARELFDEMPVRDEISWGTMIAGYSHLKL 221

Query: 183 CREAIELFNEMISLGGVWPDNIALVSVLSACAQLGELEQGKFVHDYITRNGIRVDSFLAT 362
           C +AIELFNEM+ L  V PDNIALVSVLSACAQLGELEQG  VHDYI RN IRVDS+LAT
Sbjct: 222 CNQAIELFNEMMRLE-VKPDNIALVSVLSACAQLGELEQGSIVHDYIKRNRIRVDSYLAT 280

Query: 363 GLVDLYAKCGCVETARDIFESSKDKDVFTWNAMLVGLAIHGKGSILMDYFSRMVAAGVRP 542
           GLVDLYAKCGCVETARD+FES  +K VFTWNAMLVG AIHG+GS++++YFSRMV+ GV+P
Sbjct: 281 GLVDLYAKCGCVETARDVFESCMEKYVFTWNAMLVGFAIHGEGSMVLEYFSRMVSEGVKP 340

Query: 543 DGVTFLGVLVGCSHA 587
           DGVT LGVLVGCSHA
Sbjct: 341 DGVTLLGVLVGCSHA 355

 Score = 62.4 bits (150), Expect = 5e-08
 Identities = 36/144 (25%), Positives = 72/144 (50%), Gaps = 2/144 (1%)
 Frame = +3

Query: 60  MIDGFVKSSQISRARELFDEMLVRDSVSWGTMIAGYSHAKLCREAIELFNEMISLGGVWP 239
           ++D + K   +  AR++F+  + +   +W  M+ G++        +E F+ M+S  GV P
Sbjct: 282 LVDLYAKCGCVETARDVFESCMEKYVFTWNAMLVGFAIHGEGSMVLEYFSRMVS-EGVKP 340

Query: 240 DNIALVSVLSACAQLGELEQGKFVHDYITR-NGIRVDSFLATGLVDLYAKCGCVETARDI 416
           D + L+ VL  C+  G + + + + D +    G++ +      + D+ A+ G +E   ++
Sbjct: 341 DGVTLLGVLVGCSHAGLVLEARRIFDEMENVYGVKREGKHYGCMADMLARAGLIEEGVEM 400

Query: 417 FESSKDK-DVFTWNAMLVGLAIHG 485
            ++     DVF W  +L G  IHG
Sbjct: 401 VKAMPSGGDVFAWGGLLGGCRIHG 424

>ref|XP_002318186.1| predicted protein [Populus trichocarpa]
 gi|222858859|gb|EEE96406.1| predicted protein [Populus trichocarpa]
          Length = 537

 Score = 271 bits (693), Expect = 6e-71
 Identities = 129/195 (66%), Positives = 161/195 (82%)
 Frame = +3

Query: 3   DAHNLFSQSPHRDVVSYNAMIDGFVKSSQISRARELFDEMLVRDSVSWGTMIAGYSHAKL 182
           DA+ +F +SP RDVVSYN +IDGFVK+  + +ARELFD M VRDSVSW T+IAG +
Sbjct: 177 DAYKVFDESPQRDVVSYNVLIDGFVKAGDVVKARELFDLMPVRDSVSWNTIIAGCAKGDY 236

Query: 183 CREAIELFNEMISLGGVWPDNIALVSVLSACAQLGELEQGKFVHDYITRNGIRVDSFLAT 362
           C EAIELF+ M+ L  + PDN+ALVS LSACAQLGELE+GK +HDYI RN ++VD+FL+T
Sbjct: 237 CEEAIELFDFMMDLE-IRPDNVALVSTLSACAQLGELEKGKKIHDYIERNAMKVDTFLST 295

Query: 363 GLVDLYAKCGCVETARDIFESSKDKDVFTWNAMLVGLAIHGKGSILMDYFSRMVAAGVRP 542
           GLVD YAKCGCV+ A  IF+SS DK++FTWNAMLVGLA+HG G +L++YFSRM+ AGV+P
Sbjct: 296 GLVDFYAKCGCVDIALKIFDSSSDKNLFTWNAMLVGLAMHGYGELLLEYFSRMIEAGVKP 355

Query: 543 DGVTFLGVLVGCSHA 587
           DG++ LGVLVGCSH+
Sbjct: 356 DGISILGVLVGCSHS 370

>ref|XP_002281032.2| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide
          repeat-containing protein At5g61800 [Vitis vinifera]
          Length = 576

 Score = 260 bits (664), Expect = 1e-67
 Identities = 123/195 (63%), Positives = 161/195 (82%), Gaps = 1/195 (0%)
 Frame = +3

Query: 6   AHNLFSQSPHRDVVSYNAMIDGFVKSSQISRARELFDEMLVRDSVSWGTMIAGYSHA-KL 182
           A  +F+++  +DVVSYNA+I GF+K     RAR LFD+M +RD+VSWGT++AGY+ +  L
Sbjct: 183 ACQVFNETSLKDVVSYNALIGGFIKVGDTDRARRLFDKMPIRDAVSWGTLLAGYAQSGDL 242

Query: 183 CREAIELFNEMISLGGVWPDNIALVSVLSACAQLGELEQGKFVHDYITRNGIRVDSFLAT 362
           C +AI+LFN M+ +  V PDNIALVS LSACAQLGELEQGK +H YI +N I +++FL+T
Sbjct: 243 CMDAIQLFNRML-ISTVRPDNIALVSALSACAQLGELEQGKSIHVYIKQNRIPINAFLST 301

Query: 363 GLVDLYAKCGCVETARDIFESSKDKDVFTWNAMLVGLAIHGKGSILMDYFSRMVAAGVRP 542
           GLVDLYAKCGC+ETAR+IFESS DK++FTWNA+LVGL +HG+G + + YFSRM+ AG++P
Sbjct: 302 GLVDLYAKCGCIETAREIFESSPDKNLFTWNALLVGLGMHGRGHLSLHYFSRMIEAGIKP 361

Query: 543 DGVTFLGVLVGCSHA 587
           DGV+FLG+LVGC HA
Sbjct: 362 DGVSFLGILVGCGHA 376

 Score = 55.1 bits (131), Expect = 9e-06
 Identities = 39/166 (23%), Positives = 69/166 (41%), Gaps = 2/166 (1%)
 Frame = +3

Query: 60  MIDGFVKSSQISRARELFDEMLVRDSVSWGTMIAGYSHAKLCREAIELFNEMISLGGVWP 239
           ++D + K   I  ARE+F+    ++  +W  ++ G         ++  F+ MI   G+ P
Sbjct: 303 LVDLYAKCGCIETAREIFESSPDKNLFTWNALLVGLGMHGRGHLSLHYFSRMIE-AGIKP 361

Query: 240 DNIALVSVLSACAQLGEL-EQGKFVHDYITRNGIRVDSFLATGLVDLYAKCGCVETARDI 416
           D ++ + +L  C   G + E   F  +      +  +      + DL  + G +  A ++
Sbjct: 362 DGVSFLGILVGCGHAGLVCEARNFFQEMEVVYRVPRELKHYGCMADLLGRAGLIREAMEM 421

Query: 417 FESSK-DKDVFTWNAMLVGLAIHGKGSILMDYFSRMVAAGVRPDGV 551
            E      DVF W  +L G  IHG   I       ++A     DGV
Sbjct: 422 IERMPMGGDVFVWGGVLGGCRIHGNVEIAEKAAENVMALNPEDDGV 467

>ref|XP_002511070.1| pentatricopeptide repeat-containing protein, putative
          [Ricinus communis]
 gi|223550185|gb|EEF51672.1| pentatricopeptide repeat-containing protein,
          putative [Ricinus communis]
          Length = 551

 Score = 253 bits (647), Expect = 1e-65
 Identities = 120/193 (62%), Positives = 155/193 (80%)
 Frame = +3

Query: 6   AHNLFSQSPHRDVVSYNAMIDGFVKSSQISRARELFDEMLVRDSVSWGTMIAGYSHAKLC 185
           A  +F +S  RDVVSYNA++DGFVK+ +  +ARE+FD M +RDSVSWG++IAGY+    C
Sbjct: 191 ACQVFDESSDRDVVSYNALVDGFVKAGEFVKAREIFDLMPMRDSVSWGSLIAGYAQGSYC 250

Query: 186 REAIELFNEMISLGGVWPDNIALVSVLSACAQLGELEQGKFVHDYITRNGIRVDSFLATG 365
            EAI LF+ M+ L  + PDNIALVS LSACAQLGELE+GK +HDYI +N I+ DSFL+TG
Sbjct: 251 NEAIGLFDLMMGLK-LEPDNIALVSALSACAQLGELEKGKQIHDYIKKNRIQADSFLSTG 309

Query: 366 LVDLYAKCGCVETARDIFESSKDKDVFTWNAMLVGLAIHGKGSILMDYFSRMVAAGVRPD 545
           LVD YAK GC++TA  +FE S DK + TWNAML+G+A+HG   +L++YFSRM+ AG++PD
Sbjct: 310 LVDFYAKSGCIDTAIKVFELSPDKSLITWNAMLIGIAMHGNSHLLLNYFSRMMEAGIKPD 369

Query: 546 GVTFLGVLVGCSH 584
           G++FLGVLVGCSH
Sbjct: 370 GISFLGVLVGCSH 382

 Score = 56.2 bits (134), Expect = 4e-06
 Identities = 36/144 (25%), Positives = 71/144 (49%), Gaps = 2/144 (1%)
 Frame = +3

Query: 60  MIDGFVKSSQISRARELFDEMLVRDSVSWGTMIAGYSHAKLCREAIELFNEMISLGGVWP 239
           ++D + KS  I  A ++F+    +  ++W  M+ G +        +  F+ M+   G+ P
Sbjct: 310 LVDFYAKSGCIDTAIKVFELSPDKSLITWNAMLIGIAMHGNSHLLLNYFSRMME-AGIKP 368

Query: 240 DNIALVSVLSACAQLGELEQGKFVHDYITR-NGIRVDSFLATGLVDLYAKCGCVETARDI 416
           D I+ + VL  C+  G +++ K + D +    G+R +      + DL A+ G ++ A ++
Sbjct: 369 DGISFLGVLVGCSHGGLVDEAKKLFDEMESIYGVRRELKHYGCMADLLARAGLIKEAVEL 428

Query: 417 FES-SKDKDVFTWNAMLVGLAIHG 485
            +      D+F W+ +L G  IHG
Sbjct: 429 TKGLPMGGDIFVWSGLLGGCRIHG 452

>ref|XP_002866455.1| hypothetical protein ARALYDRAFT_496347 [Arabidopsis
          lyrata subsp. lyrata]
 gi|297312290|gb|EFH42714.1| hypothetical protein ARALYDRAFT_496347
          [Arabidopsis lyrata subsp. lyrata]
          Length = 1401

 Score = 251 bits (640), Expect = 8e-65
 Identities = 117/194 (60%), Positives = 152/194 (78%)
 Frame = +3

Query: 6   AHNLFSQSPHRDVVSYNAMIDGFVKSSQISRARELFDEMLVRDSVSWGTMIAGYSHAKLC 185
           A  LF ++P RDVV+YN +IDG VK+ +I RARELFD M  RD VSW ++IAGY+    C
Sbjct: 585 ALQLFDENPQRDVVTYNVLIDGLVKACEIVRARELFDSMPFRDLVSWNSLIAGYAQMNQC 644

Query: 186 REAIELFNEMISLGGVWPDNIALVSVLSACAQLGELEQGKFVHDYITRNGIRVDSFLATG 365
           REAI LF+EMI LG + PDN+A+VS LSACAQ G+LE+GK +HDY  +  + +DSFLATG
Sbjct: 645 REAISLFDEMIGLG-LKPDNVAIVSTLSACAQSGDLEKGKAIHDYTKKKRLFIDSFLATG 703

Query: 366 LVDLYAKCGCVETARDIFESSKDKDVFTWNAMLVGLAIHGKGSILMDYFSRMVAAGVRPD 545
           LVD YAKCG ++TA +IF  S DK +FTWNAM+ GLA+HG G + +DYF +MV++G++PD
Sbjct: 704 LVDFYAKCGFIDTAMEIFHLSSDKTLFTWNAMITGLAMHGNGELTVDYFHKMVSSGIKPD 763

Query: 546 GVTFLGVLVGCSHA 587
           GV+F+ VLVGCSH+
Sbjct: 764 GVSFISVLVGCSHS 777

 Score = 57.4 bits (137), Expect = 2e-06
 Identities = 42/171 (24%), Positives = 77/171 (45%), Gaps = 7/171 (4%)
 Frame = +3

Query: 60  MIDGFVKSSQISRARELFDEMLVRDSVSWGTMIAGYSHAKLCREAIELFNEMISLGGVWP 239
           ++D + K   I  A E+F     +   +W  MI G +        ++ F++M+S  G+ P
Sbjct: 704 LVDFYAKCGFIDTAMEIFHLSSDKTLFTWNAMITGLAMHGNGELTVDYFHKMVS-SGIKP 762

Query: 240 DNIALVSVLSACAQLGELEQGKFVHDYITRNGIRVDSFLA--TGLVDLYAKCGCVETARD 413
           D ++ +SVL  C+  G + + + + D + R+   VD  +     + DL  + G +E A +
Sbjct: 763 DGVSFISVLVGCSHSGLVGEARKLFDQM-RSLYDVDREMKHYGCMADLLGRAGLIEEAAE 821

Query: 414 IFE-----SSKDKDVFTWNAMLVGLAIHGKGSILMDYFSRMVAAGVRPDGV 551
           + E       K + +  W+ +L G  IHG   +      R+ A      GV
Sbjct: 822 MIEQMPKDGGKREKLLAWSGLLGGCRIHGNIEVAEKAAKRVKALSPEDGGV 872
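
The summary figures in the report above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the BLAST output: the function names percent and aligned_columns are hypothetical helpers, and the truncating percentage is an assumption inferred from this report (154/195 is 78.97%, printed as 78%). Because BLASTX translates the nucleotide query, a query span in frame +3 covers one alignment column per three bases, plus any gap columns inserted in the query.

```python
def percent(numerator: int, denominator: int) -> int:
    """Whole-number percentage, truncated rather than rounded (assumption)."""
    return numerator * 100 // denominator


def aligned_columns(query_start: int, query_end: int, gaps_in_query: int = 0) -> int:
    """Alignment columns covered by a translated query span (1-based, inclusive)."""
    span = abs(query_end - query_start) + 1
    if span % 3 != 0:
        raise ValueError("translated span must be a whole number of codons")
    return span // 3 + gaps_in_query


# Figures copied from the first HSP of the top hit (ref|XP_003532384.1|).
identities = percent(154, 195)
positives = percent(170, 195)
columns = aligned_columns(3, 587)

print(identities, positives, columns)  # prints: 78 87 195
```

The same check applies to the shorter HSPs, for example aligned_columns(240, 413, gaps_in_query=2) for the 60-column block whose query row contains two gap characters.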