BLASTX nr result
ID: Glycyrrhiza24_contig00008081
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza24_contig00008081
         (1418 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_003549191.1| PREDICTED: pentatricopeptide repeat-containi...   574   e-161
ref|XP_002284744.1| PREDICTED: pentatricopeptide repeat-containi...   504   e-140
ref|XP_002301973.1| predicted protein [Populus trichocarpa] gi|2...   486   e-135
ref|XP_002890375.1| pentatricopeptide repeat-containing protein ...   449   e-124
ref|NP_173449.1| pentatricopeptide repeat-containing protein [Ar...   448   e-123

>ref|XP_003549191.1| PREDICTED: pentatricopeptide repeat-containing protein
 At1g20230-like [Glycine max]
          Length = 748

 Score = 574 bits (1480), Expect = e-161
 Identities = 272/314 (86%), Positives = 295/314 (93%)
 Frame = +1

Query: 1   DKMSAPNLVSWNAVMSGYAMHGMARETIEMFDMMIQSGQKPDAITFTCVLSACTQNGITE 180
           DKMSA NLVSWNAVM GYAMHG A+ET+EMF MM+QSGQKPD +TFTCVLSAC QNG+TE
Sbjct: 435 DKMSALNLVSWNAVMKGYAMHGKAKETMEMFHMMLQSGQKPDLVTFTCVLSACAQNGLTE 494

Query: 181 EGWNFFNSMSKEHGIEPKMEHYACMVTLLSRVGKLEEAYSIIKEMSFELDACIWGSLLSS 360
           EGW +NSMS+EHGIEPKMEHYAC+VTLLSRVGKLEEAYSIIKEM FE DAC+WG+LLSS
Sbjct: 495 EGWRCYNSMSEEHGIEPKMEHYACLVTLLSRVGKLEEAYSIIKEMPFEPDACVWGALLSS 554

Query: 361 CRVHHNLSLGEIAAEKLFLLEPDNPGNYILMSNIYASKGMWDEVNRMRDVMKSKGLRKNP 540
           CRVH+NLSLGEIAAEKLF LEP NPGNYIL+SNIYASKG+WDE NR+R+VMKSKGLRKNP
Sbjct: 555 CRVHNNLSLGEIAAEKLFFLEPTNPGNYILLSNIYASKGLWDEENRIREVMKSKGLRKNP 614

Query: 541 GCSWIEIGRSVHTLLAGDKSHPQMKEISEKLDKLSIEMKKSGYLPMTDFVLQDVEEQDKE 720
           G SWIE+G VH LLAGD+SHPQMK+I EKLDKL+++MKKSGYLP T+FVLQDVEEQDKE
Sbjct: 615 GYSWIEVGHKVHMLLAGDQSHPQMKDILEKLDKLNMQMKKSGYLPKTNFVLQDVEEQDKE 674

Query: 721 QILCGHSEKLAVVLGLLNTSPGQPLQVIKNLRICDDCHAVIKVISRLEGREIFVRDTNRF 900
           QILCGHSEKLAVVLGLLNTSPGQPLQVIKNLRICDDCHAVIKVISRLEGREI+VRDTNRF
Sbjct: 675 QILCGHSEKLAVVLGLLNTSPGQPLQVIKNLRICDDCHAVIKVISRLEGREIYVRDTNRF 734

Query: 901 HHFKDGVCSCGDFW 942
           HHFKDGVCSCGDFW
Sbjct: 735 HHFKDGVCSCGDFW 748

>ref|XP_002284744.1| PREDICTED: pentatricopeptide repeat-containing protein
 At1g20230 [Vitis vinifera]
          Length = 758

 Score = 504 bits (1298), Expect = e-140
 Identities = 236/314 (75%), Positives = 273/314 (86%)
 Frame = +1

Query: 1   DKMSAPNLVSWNAVMSGYAMHGMARETIEMFDMMIQSGQKPDAITFTCVLSACTQNGITE 180
           D + NLV WNAV++GYAMHG A+E +E+FD+M +SGQKPD I+FTCVLSAC+Q+G+TE
Sbjct: 445 DGIPTKNLVCWNAVIAGYAMHGKAKEAMEIFDLMQRSGQKPDIISFTCVLSACSQSGLTE 504

Query: 181 EGWNFFNSMSKEHGIEPKMEHYACMVTLLSRVGKLEEAYSIIKEMSFELDACIWGSLLSS 360
           EG +FNSMS ++GIE ++EHYACMVTLLSR GKLE+AY++I+ M DAC+WG+LLSS
Sbjct: 505 EGSYYFNSMSSKYGIEARVEHYACMVTLLSRAGKLEQAYAMIRRMPVNPDACVWGALLSS 564

Query: 361 CRVHHNLSLGEIAAEKLFLLEPDNPGNYILMSNIYASKGMWDEVNRMRDVMKSKGLRKNP 540
           CRVH+N+SLGE+AAEKLF LEP NPGNYIL+SNIYASKGMW+EVNR+RD+MK+KGLRKNP
Sbjct: 565 CRVHNNVSLGEVAAEKLFELEPSNPGNYILLSNIYASKGMWNEVNRVRDMMKNKGLRKNP 624

Query: 541 GCSWIEIGRSVHTLLAGDKSHPQMKEISEKLDKLSIEMKKSGYLPMTDFVLQDVEEQDKE 720
           GCSWIE+ VH LLAGDKSHPQM +I EKLDKLS+EMKK GY P +FVLQDVEEQDKE
Sbjct: 625 GCSWIEVKNKVHMLLAGDKSHPQMTQIIEKLDKLSMEMKKLGYFPEINFVLQDVEEQDKE 684

Query: 721 QILCGHSEKLAVVLGLLNTSPGQPLQVIKNLRICDDCHAVIKVISRLEGREIFVRDTNRF 900
           QILCGHSEKLAVV GLLNT PG PLQVIKNLRIC DCH VIK IS E REIFVRDTNRF
Sbjct: 685 QILCGHSEKLAVVFGLLNTPPGYPLQVIKNLRICGDCHVVIKFISSFERREIFVRDTNRF 744

Query: 901 HHFKDGVCSCGDFW 942
           HHFK+G CSCGD+W
Sbjct: 745 HHFKEGACSCGDYW 758

>ref|XP_002301973.1| predicted protein [Populus trichocarpa]
 gi|222843699|gb|EEE81246.1| predicted protein [Populus trichocarpa]
          Length = 716

 Score = 486 bits (1251), Expect = e-135
 Identities = 228/314 (72%), Positives = 268/314 (85%)
 Frame = +1

Query: 1   DKMSAPNLVSWNAVMSGYAMHGMARETIEMFDMMIQSGQKPDAITFTCVLSACTQNGITE 180
           D M NLVSWN++M+GYAMHG E I +F++M + GQKPD ++FTCVLSACTQ G+TE
Sbjct: 403 DMMPNRNLVSWNSLMAGYAMHGKTFEAINIFELMQRCGQKPDHVSFTCVLSACTQGGLTE 462

Query: 181 EGWNFFNSMSKEHGIEPKMEHYACMVTLLSRVGKLEEAYSIIKEMSFELDACIWGSLLSS 360
           EGW +F+SMS+ HG+E +MEHY+CMVTLL R G+LEEAY++IK+M FE D+C+WG+LLSS
Sbjct: 463 EGWFYFDSMSRNHGVEARMEHYSCMVTLLGRSGRLEEAYAMIKQMPFEPDSCVWGALLSS 522

Query: 361 CRVHHNLSLGEIAAEKLFLLEPDNPGNYILMSNIYASKGMWDEVNRMRDVMKSKGLRKNP 540
           CRVH+ + LGEIAA+++F LEP NPGNYIL+SNIYASK MW EV+ +RD+M+S+GL+KNP
Sbjct: 523 CRVHNRVDLGEIAAKRVFELEPRNPGNYILLSNIYASKAMWVEVDMVRDMMRSRGLKKNP 582

Query: 541 GCSWIEIGRSVHTLLAGDKSHPQMKEISEKLDKLSIEMKKSGYLPMTDFVLQDVEEQDKE 720
           G SWIEI VH LLAGD SHPQM +I EKL KL++EMKKSGY+P TDFVLQDVEEQDKE
Sbjct: 583 GYSWIEIKNKVHMLLAGDSSHPQMPQIIEKLAKLTVEMKKSGYVPHTDFVLQDVEEQDKE 642

Query: 721 QILCGHSEKLAVVLGLLNTSPGQPLQVIKNLRICDDCHAVIKVISRLEGREIFVRDTNRF 900
           QILCGHSEKLAVVLGLLNT PG PLQVIKNLRIC DCHAVIK IS E REIFVRDTNRF
Sbjct: 643 QILCGHSEKLAVVLGLLNTKPGFPLQVIKNLRICRDCHAVIKFISDFEKREIFVRDTNRF 702

Query: 901 HHFKDGVCSCGDFW 942
           H FK GVCSCGD+W
Sbjct: 703 HQFKGGVCSCGDYW 716

>ref|XP_002890375.1| pentatricopeptide repeat-containing protein [Arabidopsis
 lyrata subsp. lyrata]
 gi|297336217|gb|EFH66634.1| pentatricopeptide repeat-containing protein
 [Arabidopsis lyrata subsp. lyrata]
          Length = 760

 Score = 449 bits (1155), Expect = e-124
 Identities = 203/312 (65%), Positives = 261/312 (83%)
 Frame = +1

Query: 7   MSAPNLVSWNAVMSGYAMHGMARETIEMFDMMIQSGQKPDAITFTCVLSACTQNGITEEG 186
           M NLV WN++M+GY+MHG A+E + +F+ ++++ KPD I+FT +LSAC Q G+T+EG
Sbjct: 449 MPTKNLVCWNSLMNGYSMHGKAKEVMSIFESLMRTRLKPDFISFTSLLSACGQVGLTDEG 508

Query: 187 WNFFNSMSKEHGIEPKMEHYACMVTLLSRVGKLEEAYSIIKEMSFELDACIWGSLLSSCR 366
           W +FN MS+E+GI+P++EHY+CMV LL R GKL+EAY +IKE+ FE D+C+WG+LL+SCR
Sbjct: 509 WKYFNMMSEEYGIKPRLEHYSCMVNLLGRAGKLQEAYDLIKEIPFEPDSCVWGALLNSCR 568

Query: 367 VHHNLSLGEIAAEKLFLLEPDNPGNYILMSNIYASKGMWDEVNRMRDVMKSKGLRKNPGC 546
           + +N+ L EIAA+KLF LEP+NPG Y+LMSNIYA+KGMW EV+ +R+ M+S GL+KNPGC
Sbjct: 569 LQNNVDLAEIAAQKLFHLEPENPGTYVLMSNIYAAKGMWTEVDSIRNKMESLGLKKNPGC 628

Query: 547 SWIEIGRSVHTLLAGDKSHPQMKEISEKLDKLSIEMKKSGYLPMTDFVLQDVEEQDKEQI 726
           SWI++ V+TLLA DKSHPQ+ +I+EK+D++S EM+KSG+ P DF LQDVEEQ++EQ+
Sbjct: 629 SWIQVKNKVYTLLACDKSHPQIDQITEKMDEISEEMRKSGHRPNLDFALQDVEEQEQEQM 688

Query: 727 LCGHSEKLAVVLGLLNTSPGQPLQVIKNLRICDDCHAVIKVISRLEGREIFVRDTNRFHH 906
           L GHSEKLAVV GLLNT G PLQVIKNLRIC DCHAVIK IS GREIF+RDTNRFHH
Sbjct: 689 LWGHSEKLAVVFGLLNTPDGTPLQVIKNLRICGDCHAVIKFISSYAGREIFIRDTNRFHH 748

Query: 907 FKDGVCSCGDFW 942
           FKDG+CSCGDFW
Sbjct: 749 FKDGICSCGDFW 760

>ref|NP_173449.1| pentatricopeptide repeat-containing protein [Arabidopsis
 thaliana]
 gi|193806503|sp|Q9LNU6.2|PPR53_ARATH RecName: Full=Pentatricopeptide
 repeat-containing protein At1g20230
 gi|332191832|gb|AEE29953.1| pentatricopeptide repeat-containing protein
 [Arabidopsis thaliana]
          Length = 760

 Score = 448 bits (1152), Expect = e-123
 Identities = 202/312 (64%), Positives = 260/312 (83%)
 Frame = +1

Query: 7   MSAPNLVSWNAVMSGYAMHGMARETIEMFDMMIQSGQKPDAITFTCVLSACTQNGITEEG 186
           M NLV WN++M+G++MHG A+E + +F+ ++++ KPD I+FT +LSAC Q G+T+EG
Sbjct: 449 MPTKNLVCWNSLMNGFSMHGKAKEVMSIFESLMRTRLKPDFISFTSLLSACGQVGLTDEG 508

Query: 187 WNFFNSMSKEHGIEPKMEHYACMVTLLSRVGKLEEAYSIIKEMSFELDACIWGSLLSSCR 366
           W +F MS+E+GI+P++EHY+CMV LL R GKL+EAY +IKEM FE D+C+WG+LL+SCR
Sbjct: 509 WKYFKMMSEEYGIKPRLEHYSCMVNLLGRAGKLQEAYDLIKEMPFEPDSCVWGALLNSCR 568

Query: 367 VHHNLSLGEIAAEKLFLLEPDNPGNYILMSNIYASKGMWDEVNRMRDVMKSKGLRKNPGC 546
           + +N+ L EIAAEKLF LEP+NPG Y+L+SNIYA+KGMW EV+ +R+ M+S GL+KNPGC
Sbjct: 569 LQNNVDLAEIAAEKLFHLEPENPGTYVLLSNIYAAKGMWTEVDSIRNKMESLGLKKNPGC 628

Query: 547 SWIEIGRSVHTLLAGDKSHPQMKEISEKLDKLSIEMKKSGYLPMTDFVLQDVEEQDKEQI 726
           SWI++ V+TLLAGDKSHPQ+ +I+EK+D++S EM+KSG+ P DF L DVEEQ++EQ+
Sbjct: 629 SWIQVKNRVYTLLAGDKSHPQIDQITEKMDEISKEMRKSGHRPNLDFALHDVEEQEQEQM 688

Query: 727 LCGHSEKLAVVLGLLNTSPGQPLQVIKNLRICDDCHAVIKVISRLEGREIFVRDTNRFHH 906
           L GHSEKLAVV GLLNT G PLQVIKNLRIC DCHAVIK IS GREIF+RDTNRFHH
Sbjct: 689 LWGHSEKLAVVFGLLNTPDGTPLQVIKNLRICGDCHAVIKFISSYAGREIFIRDTNRFHH
Sbjct: 749 FKDG+CSCGDFW 760