BLASTX nr result
ID: Glycyrrhiza23_contig00012379
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Glycyrrhiza23_contig00012379 (1978 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003590960.1| Pentatricopeptide repeat-containing protein ... 849 0.0 ref|XP_003555568.1| PREDICTED: pentatricopeptide repeat-containi... 766 0.0 ref|XP_003521773.1| PREDICTED: pentatricopeptide repeat-containi... 757 0.0 ref|XP_002266822.1| PREDICTED: pentatricopeptide repeat-containi... 748 0.0 ref|XP_002521980.1| pentatricopeptide repeat-containing protein,... 746 0.0 >ref|XP_003590960.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355480008|gb|AES61211.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 590 Score = 849 bits (2194), Expect = 0.0 Identities = 420/543 (77%), Positives = 472/543 (86%) Frame = -1 Query: 1900 NNKEHARFTTPTPSETTRPQVVQNYNFGDTHFMKVLNRSCKTGKYNESLYFLQHMVNKGY 1721 NN E +F +ET + Q+Y+F DT+FMK LNRSCK+ KY+ESLYFLQHMVN+GY Sbjct: 52 NNNEQQQFRV---NETKPTKHDQDYDFRDTNFMKTLNRSCKSAKYDESLYFLQHMVNRGY 108 Query: 1720 KPDVILCTKLIKGFFNSKRIDKAIQVMEVLEKHGDPDVFAYNAVISGFCKADRIDAANKV 1541 KPDVILCTKLIKGFFN K+I+KAIQVME+LEKHG PDVFAYNAVISGFCKADR+D A+KV Sbjct: 109 KPDVILCTKLIKGFFNMKKIEKAIQVMEILEKHGKPDVFAYNAVISGFCKADRVDHASKV 168 Query: 1540 LDRMKKRGFSPDVVTYNILIGNLCGKGKLDLALRVMDQLLKDNCKPTVITYTILIEATII 1361 LDRMKKRGF PDVVTYNILIGN CG+G+LDLALRVMDQLLKDNCKPTVITYTILIEATI Sbjct: 169 LDRMKKRGFEPDVVTYNILIGNFCGRGRLDLALRVMDQLLKDNCKPTVITYTILIEATIT 228 Query: 1360 EGGIDEAMKLLDEMLSRGLQPDMYTYNVIVKGMCREGLVDRAFEFVSSISDKGCAAGVSS 1181 +GGIDEAMKLLDEMLSRGL+PD YTYNV+V GMC+EG++DRAFEF+S IS GC AGVS+ Sbjct: 229 QGGIDEAMKLLDEMLSRGLRPDRYTYNVVVNGMCKEGMLDRAFEFLSRISKNGCVAGVST 288 Query: 1180 YNILLRGLLNEGKWEAVERLINDMFAKGCEPNVVTYSTLIGSLCRDGRIVEAKNVLKVMK 1001 YNILLR LLNEGKWE E+L++DM KGCEPN +TYSTLI +LCRDG+I EAKNVLKVMK Sbjct: 289 YNILLRDLLNEGKWEYGEKLMSDMLVKGCEPNPITYSTLITALCRDGKIDEAKNVLKVMK 348 Query: 1000 EKGLTPDSYSYDPLISAFCREGRVDLAIEFMVNMIADGCLPDVFSYNRILASLCKNGNAD 821 EK L PD YSYDPLISA CREG+VDLAIEF+ +MI+ G LPD+ SYN ILASLCKNGNAD Sbjct: 349 EKALAPDGYSYDPLISALCREGKVDLAIEFLDDMISGGHLPDILSYNSILASLCKNGNAD 408 Query: 820 EALSIFEKLGEVGCPLDASSYNTMLGALWSSGDKTRALGMVLKMLSNGVDPDGITYNSLI 641 EAL+IFEKLGEVGCP +A SYNT+ GALWSSGDK RALGM+L+MLSNG+DPD ITYNSLI Sbjct: 409 EALNIFEKLGEVGCPPNAGSYNTLFGALWSSGDKIRALGMILEMLSNGIDPDEITYNSLI 468 Query: 640 SCLCRDGMVDEAIELLVDMLEMGSGSRCKPTVISYNIVLLGLCKVHRIVDAIEVLAAMVD 461 SCLCRDG+VD+AIELLVDM E +C+PTVISYN VLLGLCKV RI+DAIEVLAAMV+ Sbjct: 469 SCLCRDGLVDQAIELLVDMFE---SEKCQPTVISYNTVLLGLCKVQRIIDAIEVLAAMVN 525 Query: 460 KGCRPNETSYTLLIQGIGFAGWPNDAMELASSLVSMGAISEYSFNRLNKTFPVLDVYKEL 281 +GC PNET+YTLLIQGIGFAGW DAMELA+ LV+M AISE SF R K FPV D +KEL Sbjct: 526 EGCLPNETTYTLLIQGIGFAGWRYDAMELANLLVNMDAISEDSFKRFQKIFPVFDAHKEL 585 Query: 280 AFS 272 A S Sbjct: 586 ALS 588 >ref|XP_003555568.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04760, chloroplastic-like [Glycine max] Length = 576 Score = 766 bits (1978), Expect = 0.0 Identities = 387/555 (69%), Positives = 454/555 (81%) Frame = -1 Query: 1978 TIVTCTILHLKDEXXXXXXXXXXXXSNNKEHARFTTPTPSETTRPQVVQNYNFGDTHFMK 1799 T++TC I L ++ NNK H R T S TRPQ Q+Y+F DTH MK Sbjct: 30 TVITCRIPLLNEDNPSKRRLNNNN--NNKGHTRVT----SSDTRPQQ-QHYDFRDTHHMK 82 Query: 1798 VLNRSCKTGKYNESLYFLQHMVNKGYKPDVILCTKLIKGFFNSKRIDKAIQVMEVLEKHG 1619 LNR CKTGKY E+LYFL+ MV +GYKPDVILCTKLIKG F SKR +KA++VME+LE++G Sbjct: 83 ALNRLCKTGKYTEALYFLEQMVKRGYKPDVILCTKLIKGLFTSKRTEKAVRVMEILEQYG 142 Query: 1618 DPDVFAYNAVISGFCKADRIDAANKVLDRMKKRGFSPDVVTYNILIGNLCGKGKLDLALR 1439 DPD FAYNAVISGFC++DR DAAN+V+ RMK RGFSPDVVTYNILIG+LC +GKLDLAL+ Sbjct: 143 DPDSFAYNAVISGFCRSDRFDAANRVILRMKYRGFSPDVVTYNILIGSLCARGKLDLALK 202 Query: 1438 VMDQLLKDNCKPTVITYTILIEATIIEGGIDEAMKLLDEMLSRGLQPDMYTYNVIVKGMC 1259 VMDQLL+DNC PTVITYTILIEATII G ID+AM+LLDEM+SRGLQPDMYTYNVIV+GMC Sbjct: 203 VMDQLLEDNCNPTVITYTILIEATIIHGSIDDAMRLLDEMMSRGLQPDMYTYNVIVRGMC 262 Query: 1258 REGLVDRAFEFVSSISDKGCAAGVSSYNILLRGLLNEGKWEAVERLINDMFAKGCEPNVV 1079 + GLVDRAFEFVS+++ ++ YN+LL+GLLNEG+WEA ERL++DM KGCEPN+V Sbjct: 263 KRGLVDRAFEFVSNLN---TTPSLNLYNLLLKGLLNEGRWEAGERLMSDMIVKGCEPNIV 319 Query: 1078 TYSTLIGSLCRDGRIVEAKNVLKVMKEKGLTPDSYSYDPLISAFCREGRVDLAIEFMVNM 899 TYS LI SLCRDG+ EA +VL+VMKEKGL PD+Y YDPLISAFC+EG+VDLAI F+ +M Sbjct: 320 TYSVLISSLCRDGKAGEAVDVLRVMKEKGLNPDAYCYDPLISAFCKEGKVDLAIGFVDDM 379 Query: 898 IADGCLPDVFSYNRILASLCKNGNADEALSIFEKLGEVGCPLDASSYNTMLGALWSSGDK 719 I+ G LPD+ +YN I+ SLCK G ADEAL+IF+KL EVGCP +ASSYNTM GALWSSGDK Sbjct: 380 ISAGWLPDIVNYNTIMGSLCKKGRADEALNIFKKLEEVGCPPNASSYNTMFGALWSSGDK 439 Query: 718 TRALGMVLKMLSNGVDPDGITYNSLISCLCRDGMVDEAIELLVDMLEMGSGSRCKPTVIS 539 RAL M+L+MLSNGVDPD ITYNSLIS LCRDGMVDEAI LLVDM + +PTVIS Sbjct: 440 IRALTMILEMLSNGVDPDRITYNSLISSLCRDGMVDEAIGLLVDM----ERTEWQPTVIS 495 Query: 538 YNIVLLGLCKVHRIVDAIEVLAAMVDKGCRPNETSYTLLIQGIGFAGWPNDAMELASSLV 359 YNIVLLGLCK HRIVDAIEVLA MVD GC+PNET+YTLL++G+G+AGW + A+ELA SLV Sbjct: 496 YNIVLLGLCKAHRIVDAIEVLAVMVDNGCQPNETTYTLLVEGVGYAGWRSYAVELAKSLV 555 Query: 358 SMGAISEYSFNRLNK 314 SM AIS+ F RL K Sbjct: 556 SMNAISQDLFRRLQK 570 >ref|XP_003521773.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04760, chloroplastic-like [Glycine max] Length = 570 Score = 757 bits (1955), Expect = 0.0 Identities = 375/533 (70%), Positives = 442/533 (82%) Frame = -1 Query: 1867 TPSETTRPQVVQNYNFGDTHFMKVLNRSCKTGKYNESLYFLQHMVNKGYKPDVILCTKLI 1688 T S ++PQ +F DTH +K L+RSCK G +NESLYFL+H+VNKG+KPDV+LCTKLI Sbjct: 41 TLSSVSKPQT-HTLDFKDTHLLKSLSRSCKAGNFNESLYFLRHLVNKGHKPDVVLCTKLI 99 Query: 1687 KGFFNSKRIDKAIQVMEVLEKHGDPDVFAYNAVISGFCKADRIDAANKVLDRMKKRGFSP 1508 G F SK IDKAIQVM +LE HG PD+ AYNA+I+GFC+A+RID+A +VLDRMK +GFSP Sbjct: 100 HGLFTSKTIDKAIQVMHILENHGHPDLIAYNAIITGFCRANRIDSAYQVLDRMKNKGFSP 159 Query: 1507 DVVTYNILIGNLCGKGKLDLALRVMDQLLKDNCKPTVITYTILIEATIIEGGIDEAMKLL 1328 D+VTYNILIG+LC +G LD AL +QLLK+NCKPTV+TYTILIEAT+++GGIDEAMKLL Sbjct: 160 DIVTYNILIGSLCSRGMLDSALEFKNQLLKENCKPTVVTYTILIEATLLQGGIDEAMKLL 219 Query: 1327 DEMLSRGLQPDMYTYNVIVKGMCREGLVDRAFEFVSSISDKGCAAGVSSYNILLRGLLNE 1148 DEML LQPDM+TYN I++GMCREG VDRAF+ +SSIS KG A V +YNILLRGLLN+ Sbjct: 220 DEMLEINLQPDMFTYNSIIRGMCREGYVDRAFQIISSISSKGYAPDVITYNILLRGLLNQ 279 Query: 1147 GKWEAVERLINDMFAKGCEPNVVTYSTLIGSLCRDGRIVEAKNVLKVMKEKGLTPDSYSY 968 GKWEA L++DM A+GCE NVVTYS LI S+CRDG++ E +LK MK+KGL PD Y Y Sbjct: 280 GKWEAGYELMSDMVARGCEANVVTYSVLISSVCRDGKVEEGVGLLKDMKKKGLKPDGYCY 339 Query: 967 DPLISAFCREGRVDLAIEFMVNMIADGCLPDVFSYNRILASLCKNGNADEALSIFEKLGE 788 DPLI+A C+EGRVDLAIE + MI+DGC+PD+ +YN ILA LCK ADEALSIFEKLGE Sbjct: 340 DPLIAALCKEGRVDLAIEVLDVMISDGCVPDIVNYNTILACLCKQKRADEALSIFEKLGE 399 Query: 787 VGCPLDASSYNTMLGALWSSGDKTRALGMVLKMLSNGVDPDGITYNSLISCLCRDGMVDE 608 VGC +ASSYN+M ALWS+G K RALGM+L+ML GVDPDGITYNSLISCLCRDGMVDE Sbjct: 400 VGCSPNASSYNSMFSALWSTGHKVRALGMILEMLDKGVDPDGITYNSLISCLCRDGMVDE 459 Query: 607 AIELLVDMLEMGSGSRCKPTVISYNIVLLGLCKVHRIVDAIEVLAAMVDKGCRPNETSYT 428 AIELLVDM EM S S CKP+V+SYNIVLLGLCKV R+ DAIEVLAAMVDKGCRPNET+YT Sbjct: 460 AIELLVDM-EMES-SECKPSVVSYNIVLLGLCKVSRVSDAIEVLAAMVDKGCRPNETTYT 517 Query: 427 LLIQGIGFAGWPNDAMELASSLVSMGAISEYSFNRLNKTFPVLDVYKELAFSD 269 LI+GIGF G NDA +LA++LV+M AISE+SF RL KTF LDVY++L SD Sbjct: 518 FLIEGIGFGGCLNDARDLATTLVNMDAISEHSFERLYKTFCKLDVYRQLNLSD 570 >ref|XP_002266822.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04760, chloroplastic-like [Vitis vinifera] Length = 582 Score = 748 bits (1930), Expect = 0.0 Identities = 368/531 (69%), Positives = 442/531 (83%) Frame = -1 Query: 1861 SETTRPQVVQNYNFGDTHFMKVLNRSCKTGKYNESLYFLQHMVNKGYKPDVILCTKLIKG 1682 S RP +Q+Y+F +TH MK+LNRSCK GK+NESLYFL+ +VNKGY PDVILCTKLIKG Sbjct: 53 SAEARPAHLQSYDFRETHLMKLLNRSCKAGKFNESLYFLECLVNKGYTPDVILCTKLIKG 112 Query: 1681 FFNSKRIDKAIQVMEVLEKHGDPDVFAYNAVISGFCKADRIDAANKVLDRMKKRGFSPDV 1502 FFN K I+KA +VME+LE H +PDVFAYNAVISGFCK +RI+AA +VL+RMK RGF PD+ Sbjct: 113 FFNFKNIEKASRVMEILESHTEPDVFAYNAVISGFCKVNRIEAATQVLNRMKARGFLPDI 172 Query: 1501 VTYNILIGNLCGKGKLDLALRVMDQLLKDNCKPTVITYTILIEATIIEGGIDEAMKLLDE 1322 VTYNI+IG+LC + KL LAL+V+DQLL DNC PTVITYTILIEATI+EGGI+EAMKLL+E Sbjct: 173 VTYNIMIGSLCNRRKLGLALKVLDQLLLDNCMPTVITYTILIEATIVEGGINEAMKLLEE 232 Query: 1321 MLSRGLQPDMYTYNVIVKGMCREGLVDRAFEFVSSISDKGCAAGVSSYNILLRGLLNEGK 1142 ML+RGL PDMYTYN I++GMC+EG+V+RA E ++S++ KGC V SYNILLR LN+GK Sbjct: 233 MLARGLLPDMYTYNAIIRGMCKEGMVERAAELITSLTSKGCKPDVISYNILLRAFLNQGK 292 Query: 1141 WEAVERLINDMFAKGCEPNVVTYSTLIGSLCRDGRIVEAKNVLKVMKEKGLTPDSYSYDP 962 W+ E+L+ +MF++GCEPN VTYS LI SLCR GRI EA +VLKVM EK LTPD+YSYDP Sbjct: 293 WDEGEKLVAEMFSRGCEPNKVTYSILISSLCRFGRIDEAISVLKVMIEKELTPDTYSYDP 352 Query: 961 LISAFCREGRVDLAIEFMVNMIADGCLPDVFSYNRILASLCKNGNADEALSIFEKLGEVG 782 LISA C+EGR+DLAI M MI++GCLPD+ +YN ILA+LCKNGNA++AL IF KL +G Sbjct: 353 LISALCKEGRLDLAIGIMDYMISNGCLPDIVNYNTILAALCKNGNANQALEIFNKLRGMG 412 Query: 781 CPLDASSYNTMLGALWSSGDKTRALGMVLKMLSNGVDPDGITYNSLISCLCRDGMVDEAI 602 CP + SSYNTM+ ALWS GD++RALGMV M+S GVDPD ITYNSLISCLCRDG+V+EAI Sbjct: 413 CPPNVSSYNTMISALWSCGDRSRALGMVPAMISKGVDPDEITYNSLISCLCRDGLVEEAI 472 Query: 601 ELLVDMLEMGSGSRCKPTVISYNIVLLGLCKVHRIVDAIEVLAAMVDKGCRPNETSYTLL 422 LL DM + G +PTVISYNIVLLGLCKV RI DAI + A M++KGCRPNET+Y LL Sbjct: 473 GLLDDMEQSG----FRPTVISYNIVLLGLCKVRRIDDAIGMFAEMIEKGCRPNETTYILL 528 Query: 421 IQGIGFAGWPNDAMELASSLVSMGAISEYSFNRLNKTFPVLDVYKELAFSD 269 I+GIGFAGW +AMELA+SL S IS+ SF RLNKTFP+LDVYKEL+ S+ Sbjct: 529 IEGIGFAGWRTEAMELANSLFSRDVISQDSFKRLNKTFPMLDVYKELSNSE 579 >ref|XP_002521980.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223538784|gb|EEF40384.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 584 Score = 746 bits (1926), Expect = 0.0 Identities = 359/531 (67%), Positives = 442/531 (83%) Frame = -1 Query: 1861 SETTRPQVVQNYNFGDTHFMKVLNRSCKTGKYNESLYFLQHMVNKGYKPDVILCTKLIKG 1682 S TR V +++F + H MK+LNRSC+ GKYNESLYFL+ MV+KGY PDVILCTKLIKG Sbjct: 54 SAETRQTHVLSFDFKEVHLMKLLNRSCRAGKYNESLYFLECMVDKGYTPDVILCTKLIKG 113 Query: 1681 FFNSKRIDKAIQVMEVLEKHGDPDVFAYNAVISGFCKADRIDAANKVLDRMKKRGFSPDV 1502 FFNS+ I KA +VME+LE++G PDVFAYNA+ISGF KA++++ AN+VLDRMK RGF PDV Sbjct: 114 FFNSRNIGKATRVMEILERYGKPDVFAYNALISGFIKANQLENANRVLDRMKSRGFLPDV 173 Query: 1501 VTYNILIGNLCGKGKLDLALRVMDQLLKDNCKPTVITYTILIEATIIEGGIDEAMKLLDE 1322 VTYNI+IG+ C +GKLDLAL + ++LLKDNC+PTVITYTILIEATI++GGID AMKLLDE Sbjct: 174 VTYNIMIGSFCSRGKLDLALEIFEELLKDNCEPTVITYTILIEATILDGGIDVAMKLLDE 233 Query: 1321 MLSRGLQPDMYTYNVIVKGMCREGLVDRAFEFVSSISDKGCAAGVSSYNILLRGLLNEGK 1142 MLS+GL+PD TYN I++GMC+E +VD+AFE + S+S +GC + +YNILLR LL+ GK Sbjct: 234 MLSKGLEPDTLTYNAIIRGMCKEMMVDKAFELLRSLSSRGCKPDIITYNILLRTLLSRGK 293 Query: 1141 WEAVERLINDMFAKGCEPNVVTYSTLIGSLCRDGRIVEAKNVLKVMKEKGLTPDSYSYDP 962 W E+LI++M + GC+PNVVT+S LIG+LCRDG++ EA N+L+ MKEKGL PD+Y YDP Sbjct: 294 WSEGEKLISEMISIGCKPNVVTHSILIGTLCRDGKVEEAVNLLRSMKEKGLKPDAYCYDP 353 Query: 961 LISAFCREGRVDLAIEFMVNMIADGCLPDVFSYNRILASLCKNGNADEALSIFEKLGEVG 782 LI+ FCREGR+DLA EF+ MI+DGCLPD+ +YN I+A LC+ G AD+AL +FEKL EVG Sbjct: 354 LIAGFCREGRLDLATEFLEYMISDGCLPDIVNYNTIMAGLCRTGKADQALEVFEKLDEVG 413 Query: 781 CPLDASSYNTMLGALWSSGDKTRALGMVLKMLSNGVDPDGITYNSLISCLCRDGMVDEAI 602 CP + SSYNT+ ALWSSGD+ RAL M+LK+L+ G+DPD ITYNSLISCLCRDGMVDEAI Sbjct: 414 CPPNVSSYNTLFSALWSSGDRYRALEMILKLLNQGIDPDEITYNSLISCLCRDGMVDEAI 473 Query: 601 ELLVDMLEMGSGSRCKPTVISYNIVLLGLCKVHRIVDAIEVLAAMVDKGCRPNETSYTLL 422 ELLVDM R +P V+SYNI+LLGLCKV+R DAIEVLAAM +KGC+PNET+Y LL Sbjct: 474 ELLVDM----QSGRYRPNVVSYNIILLGLCKVNRANDAIEVLAAMTEKGCQPNETTYILL 529 Query: 421 IQGIGFAGWPNDAMELASSLVSMGAISEYSFNRLNKTFPVLDVYKELAFSD 269 I+GIGF+G +AMELA+SL M AISE SFNRLNKTFP+LDVYK+L FSD Sbjct: 530 IEGIGFSGLRAEAMELANSLHGMNAISEDSFNRLNKTFPLLDVYKDLTFSD 580