BLASTX nr result
ID: Glycyrrhiza24_contig00006614
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Glycyrrhiza24_contig00006614 (3343 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003551233.1| PREDICTED: pentatricopeptide repeat-containi... 803 0.0 ref|XP_003538312.1| PREDICTED: pentatricopeptide repeat-containi... 790 0.0 ref|XP_002278434.1| PREDICTED: pentatricopeptide repeat-containi... 666 0.0 ref|XP_002313976.1| predicted protein [Populus trichocarpa] gi|2... 649 0.0 ref|XP_002521193.1| conserved hypothetical protein [Ricinus comm... 627 e-177 >ref|XP_003551233.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100, chloroplastic-like [Glycine max] Length = 508 Score = 803 bits (2075), Expect = 0.0 Identities = 409/515 (79%), Positives = 439/515 (85%) Frame = -3 Query: 1763 MAYAHGLAPTFKLGFMFSSLCSPQQRPHLAVFPASKCGFSLKFCDGVSARSCKFQNPSLV 1584 MA AHGLAP FKLGF+FSS+ Q++ H +FPAS CGFSLKF G+SARSCKF+NPS V Sbjct: 1 MASAHGLAPIFKLGFVFSSVSPSQRKRHPLMFPASHCGFSLKFYGGLSARSCKFKNPSFV 60 Query: 1583 AAKPFSIRCFSRRKSVELDQYVTSXXXXXXXXXXXXXXXEAVEELERMTREPSDILEEMN 1404 +AK S+R F KSVE+DQYVTS A+EELERMTREPSD+LEEMN Sbjct: 61 SAKHGSLRGFRALKSVEMDQYVTSNDEMSDGFFE------AIEELERMTREPSDVLEEMN 114 Query: 1403 NRLSARELQLVLVYFSQDGRDSWCALEVFDWLRKENRVDKETMELMVAIMCGWVKKLIQE 1224 +RLSARELQLVLVYFSQDGRDSWCALEVFDWLRKENRVDKETMELMVAIMCGWVKKLIQ+ Sbjct: 115 DRLSARELQLVLVYFSQDGRDSWCALEVFDWLRKENRVDKETMELMVAIMCGWVKKLIQQ 174 Query: 1223 EKHXXXXXXXXXXXXXXXXLRPGFSMIEKVISLYWEMGEKEGAALFVEEVLRRGISCAED 1044 + H LRPGFSMIEKVISLYWEMGEKEGA LFVEEVLRRGI E+ Sbjct: 175 Q-HGVGDVVDLLVDMDCVGLRPGFSMIEKVISLYWEMGEKEGAVLFVEEVLRRGIPYVEE 233 Query: 1043 DPEGHKGGPTGYLAWKMMVEGDYRGAVRLVIRFREAGLKPEVYSYLVAMTAVVKELNEFA 864 D EGHKGGPTGYLAWKMM EGDYR AVRLVIRFRE+GLKPE+YSYLVAMTAVVKELNEFA Sbjct: 234 DEEGHKGGPTGYLAWKMMAEGDYRNAVRLVIRFRESGLKPEIYSYLVAMTAVVKELNEFA 293 Query: 863 KALRKLKGFARSGSVAELDPEGVELAEKYQSDLLADGLRLSNWVIQDGSPSLHGVIHERL 684 KALRKLKGF R+G VAELD E VEL EKYQSD LADG+RLSNWVIQDGSPSLHG++HERL Sbjct: 294 KALRKLKGFTRAGLVAELDLEDVELTEKYQSDTLADGVRLSNWVIQDGSPSLHGIVHERL 353 Query: 683 LAMYICAGHGVEAERQLWEMKLVGKEADGDLYDIVLAICASQKEXXXXXXXXXXLEVASS 504 LAMYICAGHG+EAERQLWEMKLVGKEADGDLYDIVLAICASQKE LEV SS Sbjct: 354 LAMYICAGHGIEAERQLWEMKLVGKEADGDLYDIVLAICASQKESNATARLLTRLEVVSS 413 Query: 503 PQKRKTLSWLLRGYIKGGHFNEAAETVMKMLELGFYPEYLDRAAVLQGLRKRIQQLGNLD 324 PQK+K+LSWLLRGYIKGGHFNEAAET+MKMLELGFYPEYLDRAAVLQGLRKRIQQ GNLD Sbjct: 414 PQKKKSLSWLLRGYIKGGHFNEAAETIMKMLELGFYPEYLDRAAVLQGLRKRIQQYGNLD 473 Query: 323 TYIKLCKSLSDANLIGPCLVYLYIRKYKVWVVKML 219 TY++LCKSLSDANLIGPCLV+LYIRKYK+WVVKML Sbjct: 474 TYVRLCKSLSDANLIGPCLVHLYIRKYKLWVVKML 508 >ref|XP_003538312.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100, chloroplastic-like [Glycine max] Length = 510 Score = 790 bits (2039), Expect = 0.0 Identities = 410/516 (79%), Positives = 433/516 (83%), Gaps = 1/516 (0%) Frame = -3 Query: 1763 MAYAHGLAPTFKLGFMFSSLCSPQQRPHLAVFPASKCGFSLKFCDGV-SARSCKFQNPSL 1587 MAYAHG AP FKLGF+FSS+ SP Q+ H VFPAS CG+SLKF DGV SARSCKF+NPS Sbjct: 1 MAYAHGFAPIFKLGFVFSSV-SPSQKRHPLVFPASHCGYSLKFYDGVLSARSCKFKNPSF 59 Query: 1586 VAAKPFSIRCFSRRKSVELDQYVTSXXXXXXXXXXXXXXXEAVEELERMTREPSDILEEM 1407 V K SIR F KSVELDQYVTS A+EELERMTREPSD+LEEM Sbjct: 60 V--KQGSIRGFRALKSVELDQYVTSDDEEDEMSDGFFE---AIEELERMTREPSDVLEEM 114 Query: 1406 NNRLSARELQLVLVYFSQDGRDSWCALEVFDWLRKENRVDKETMELMVAIMCGWVKKLIQ 1227 N+RLSARELQLVLVYFSQDGRDSWCALEVFDWLRKENRVDKETMELMVAIMCGWVKKLIQ Sbjct: 115 NDRLSARELQLVLVYFSQDGRDSWCALEVFDWLRKENRVDKETMELMVAIMCGWVKKLIQ 174 Query: 1226 EEKHXXXXXXXXXXXXXXXXLRPGFSMIEKVISLYWEMGEKEGAALFVEEVLRRGISCAE 1047 E LRPGFSMIEKVISLYWEMGEKEGA LFVEEVLRRGI E Sbjct: 175 EHHGVVGDVVDLLVDMDCVGLRPGFSMIEKVISLYWEMGEKEGAVLFVEEVLRRGIPYLE 234 Query: 1046 DDPEGHKGGPTGYLAWKMMVEGDYRGAVRLVIRFREAGLKPEVYSYLVAMTAVVKELNEF 867 +D EGHKGGPTGYLAWKMM EGDY AVRLVI F E+GLKPEVYSYLVAMTAVVKELNE Sbjct: 235 EDEEGHKGGPTGYLAWKMMAEGDYTSAVRLVIHFTESGLKPEVYSYLVAMTAVVKELNEL 294 Query: 866 AKALRKLKGFARSGSVAELDPEGVELAEKYQSDLLADGLRLSNWVIQDGSPSLHGVIHER 687 AKALRKLK FAR+G VAELD E VEL EKYQSDLL DG+RLSNW IQDGSPSLHG+IHER Sbjct: 295 AKALRKLKSFARTGLVAELDLEDVELTEKYQSDLLGDGVRLSNWAIQDGSPSLHGIIHER 354 Query: 686 LLAMYICAGHGVEAERQLWEMKLVGKEADGDLYDIVLAICASQKEXXXXXXXXXXLEVAS 507 LLAMYICAGHG+EAE+QLWEMKLVGKEADGDLYDIVLAICASQKE LEVAS Sbjct: 355 LLAMYICAGHGIEAEKQLWEMKLVGKEADGDLYDIVLAICASQKESNATARLLTRLEVAS 414 Query: 506 SPQKRKTLSWLLRGYIKGGHFNEAAETVMKMLELGFYPEYLDRAAVLQGLRKRIQQLGNL 327 SPQK+K+LSWLLRGYIKGGHFNEAAET+MKML+LGFYPEYLDRAAVLQGLRKRIQQ GNL Sbjct: 415 SPQKKKSLSWLLRGYIKGGHFNEAAETIMKMLDLGFYPEYLDRAAVLQGLRKRIQQYGNL 474 Query: 326 DTYIKLCKSLSDANLIGPCLVYLYIRKYKVWVVKML 219 DTY++LCKSLSDANLIGPCLV+LYIRKYK+WVVKML Sbjct: 475 DTYVRLCKSLSDANLIGPCLVHLYIRKYKLWVVKML 510 >ref|XP_002278434.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Vitis vinifera] Length = 511 Score = 666 bits (1718), Expect = 0.0 Identities = 352/519 (67%), Positives = 403/519 (77%), Gaps = 4/519 (0%) Frame = -3 Query: 1763 MAYAHGLAPTF----KLGFMFSSLCSPQQRPHLAVFPASKCGFSLKFCDGVSARSCKFQN 1596 MA AHG A + +LGF SS S Q RP L V P F ++C + C QN Sbjct: 1 MASAHGFASSLMSPTELGFTLSSSFSIQ-RPRLIV-PKFSRSFLGEYCSRATT-ICNHQN 57 Query: 1595 PSLVAAKPFSIRCFSRRKSVELDQYVTSXXXXXXXXXXXXXXXEAVEELERMTREPSDIL 1416 P V K IR F KSVELDQ++TS A+EELERMTREPSD+L Sbjct: 58 PRFVVPKRDKIREFRLFKSVELDQFLTSDDEDEMSEGFFE----AIEELERMTREPSDVL 113 Query: 1415 EEMNNRLSARELQLVLVYFSQDGRDSWCALEVFDWLRKENRVDKETMELMVAIMCGWVKK 1236 EEMN+RLSARELQLVLVYFSQ+GRDSWCALEVF+WLRKENRVDKETMELMV+IMC WVKK Sbjct: 114 EEMNDRLSARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCSWVKK 173 Query: 1235 LIQEEKHXXXXXXXXXXXXXXXXLRPGFSMIEKVISLYWEMGEKEGAALFVEEVLRRGIS 1056 LI+ E H L+PGFSMIEKVISLYWEM EKE A LFV+EVLRR I+ Sbjct: 174 LIEGE-HDVGDVVDLLVDMDCVGLKPGFSMIEKVISLYWEMEEKEKAVLFVKEVLRREIA 232 Query: 1055 CAEDDPEGHKGGPTGYLAWKMMVEGDYRGAVRLVIRFREAGLKPEVYSYLVAMTAVVKEL 876 +EDD +GHKGGPTGYLAWKMM EG+YRGAV+LVI RE+GLKPEVYSYL+AMTAVVKEL Sbjct: 233 YSEDDGDGHKGGPTGYLAWKMMAEGNYRGAVKLVIHLRESGLKPEVYSYLIAMTAVVKEL 292 Query: 875 NEFAKALRKLKGFARSGSVAELDPEGVELAEKYQSDLLADGLRLSNWVIQDGSPSLHGVI 696 NEFAKALRKLKGF +SG +AELD E VEL EKYQSDLLADG+RLS+WVIQ+G LHGV+ Sbjct: 293 NEFAKALRKLKGFTKSGLIAELDAENVELIEKYQSDLLADGVRLSSWVIQEGRSPLHGVV 352 Query: 695 HERLLAMYICAGHGVEAERQLWEMKLVGKEADGDLYDIVLAICASQKEXXXXXXXXXXLE 516 +ERLLAMYICAG G+EAERQLWEMKLVGKEAD +LYDIVLAICAS+KE +E Sbjct: 353 YERLLAMYICAGRGLEAERQLWEMKLVGKEADRELYDIVLAICASKKEASAISRLLTGME 412 Query: 515 VASSPQKRKTLSWLLRGYIKGGHFNEAAETVMKMLELGFYPEYLDRAAVLQGLRKRIQQL 336 V SS +++KTLSWLLRGYIKG HF++A+ET++KML+LG PEYLDRAAVLQGLR RIQQ Sbjct: 413 VTSSIRRKKTLSWLLRGYIKGSHFDDASETIIKMLDLGLCPEYLDRAAVLQGLRNRIQQT 472 Query: 335 GNLDTYIKLCKSLSDANLIGPCLVYLYIRKYKVWVVKML 219 GN++TY+KLCK LSDANLIGPCLVYLYI+KYK+W++K + Sbjct: 473 GNVETYLKLCKHLSDANLIGPCLVYLYIKKYKLWILKTI 511 >ref|XP_002313976.1| predicted protein [Populus trichocarpa] gi|222850384|gb|EEE87931.1| predicted protein [Populus trichocarpa] Length = 500 Score = 649 bits (1675), Expect = 0.0 Identities = 331/478 (69%), Positives = 385/478 (80%), Gaps = 3/478 (0%) Frame = -3 Query: 1643 LKFCDGVSARSCKFQNP---SLVAAKPFSIRCFSRRKSVELDQYVTSXXXXXXXXXXXXX 1473 +K C VS C +Q P + V AK +R F KSVELDQYVTS Sbjct: 28 MKPCCMVSTIICNYQTPKRPNFVVAKTTKVREFRLFKSVELDQYVTSDDEEEMGEGFFE- 86 Query: 1472 XXEAVEELERMTREPSDILEEMNNRLSARELQLVLVYFSQDGRDSWCALEVFDWLRKENR 1293 A+EELERMTREPSDILEEMN+RLSARELQLVLVYFSQ+GRDSWCALEVF+WLRKENR Sbjct: 87 ---AIEELERMTREPSDILEEMNDRLSARELQLVLVYFSQEGRDSWCALEVFEWLRKENR 143 Query: 1292 VDKETMELMVAIMCGWVKKLIQEEKHXXXXXXXXXXXXXXXXLRPGFSMIEKVISLYWEM 1113 VDKETMELMV+IMC WVKKLI+ E+ +P FSMIEKVISLYW+M Sbjct: 144 VDKETMELMVSIMCSWVKKLIEGEQDVGDVVDLLVDMDCVGL-KPSFSMIEKVISLYWDM 202 Query: 1112 GEKEGAALFVEEVLRRGISCAEDDPEGHKGGPTGYLAWKMMVEGDYRGAVRLVIRFREAG 933 G+KEGA FV+EVLRRGI+ + DD EG KGGPTGYL WKMMV+G+YR AV+LVI RE+G Sbjct: 203 GKKEGAVSFVKEVLRRGIAYSGDDGEGQKGGPTGYLTWKMMVDGNYRNAVKLVIHLRESG 262 Query: 932 LKPEVYSYLVAMTAVVKELNEFAKALRKLKGFARSGSVAELDPEGVELAEKYQSDLLADG 753 LKPE+Y+YL+AMTAVVKELNEF+KALRKLKG++RSG V ELD E VEL EKYQSDLLADG Sbjct: 263 LKPEIYAYLIAMTAVVKELNEFSKALRKLKGYSRSGMVTELDAENVELVEKYQSDLLADG 322 Query: 752 LRLSNWVIQDGSPSLHGVIHERLLAMYICAGHGVEAERQLWEMKLVGKEADGDLYDIVLA 573 + LS+WVIQ+GSP+L+GV+HERLLAMYICAG G++AERQLWEMKLVGKEADGDLYDIVLA Sbjct: 323 VCLSSWVIQEGSPALYGVVHERLLAMYICAGRGLDAERQLWEMKLVGKEADGDLYDIVLA 382 Query: 572 ICASQKEXXXXXXXXXXLEVASSPQKRKTLSWLLRGYIKGGHFNEAAETVMKMLELGFYP 393 ICASQKE +EVASS +K+K+LSWLLRGYIKGGH+ EAAET++KML+LG P Sbjct: 383 ICASQKEASAVARLLTRIEVASSMRKKKSLSWLLRGYIKGGHYGEAAETLIKMLDLGLSP 442 Query: 392 EYLDRAAVLQGLRKRIQQLGNLDTYIKLCKSLSDANLIGPCLVYLYIRKYKVWVVKML 219 +YLDR AV+QGLRKRIQQ GN+++Y+KLCK LSD NLIGP LVYLYI+KYK+W++K+L Sbjct: 443 DYLDRVAVMQGLRKRIQQWGNVESYLKLCKRLSDVNLIGPSLVYLYIKKYKLWIMKLL 500 >ref|XP_002521193.1| conserved hypothetical protein [Ricinus communis] gi|223539607|gb|EEF41193.1| conserved hypothetical protein [Ricinus communis] Length = 499 Score = 627 bits (1618), Expect = e-177 Identities = 317/448 (70%), Positives = 366/448 (81%) Frame = -3 Query: 1562 RCFSRRKSVELDQYVTSXXXXXXXXXXXXXXXEAVEELERMTREPSDILEEMNNRLSARE 1383 R F KSVELDQY+ S A+EELERMTREPSD+LEEMN++LSARE Sbjct: 57 REFRVLKSVELDQYIASDDEEEMSEGFFE----AIEELERMTREPSDVLEEMNDKLSARE 112 Query: 1382 LQLVLVYFSQDGRDSWCALEVFDWLRKENRVDKETMELMVAIMCGWVKKLIQEEKHXXXX 1203 LQLVLVYFSQ+GRDSWCALEVF+WLRKENRVDKETMELMV+IMC W+KKLI+ E H Sbjct: 113 LQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCSWIKKLIEGE-HEIGD 171 Query: 1202 XXXXXXXXXXXXLRPGFSMIEKVISLYWEMGEKEGAALFVEEVLRRGISCAEDDPEGHKG 1023 L+P FSMIEKVISLYWE+GEKE + FV+EVLRR ++ EDD EG KG Sbjct: 172 VVDLLVDMDCVGLKPSFSMIEKVISLYWEIGEKEKSVSFVKEVLRREVAYFEDDGEGQKG 231 Query: 1022 GPTGYLAWKMMVEGDYRGAVRLVIRFREAGLKPEVYSYLVAMTAVVKELNEFAKALRKLK 843 GPTGYLAWKMMV+G+YR AV+LVI FRE+GLKPEVYSYL+AMTAVVKELNEFAKALRKLK Sbjct: 232 GPTGYLAWKMMVDGNYRDAVKLVIHFRESGLKPEVYSYLIAMTAVVKELNEFAKALRKLK 291 Query: 842 GFARSGSVAELDPEGVELAEKYQSDLLADGLRLSNWVIQDGSPSLHGVIHERLLAMYICA 663 GFA+SG +AELD E L EKYQSDL+ADG+ LS+WVIQ+GSPSL+GV+HERLLAMYICA Sbjct: 292 GFAKSGLIAELDAENTRLIEKYQSDLIADGVCLSSWVIQEGSPSLYGVVHERLLAMYICA 351 Query: 662 GHGVEAERQLWEMKLVGKEADGDLYDIVLAICASQKEXXXXXXXXXXLEVASSPQKRKTL 483 G G++AERQLWEMKLVGK ADGDLYDIVLAICASQKE +EV SS QK+KTL Sbjct: 352 GRGLDAERQLWEMKLVGKHADGDLYDIVLAICASQKEASAVSRLLTRVEVTSSLQKKKTL 411 Query: 482 SWLLRGYIKGGHFNEAAETVMKMLELGFYPEYLDRAAVLQGLRKRIQQLGNLDTYIKLCK 303 SWLLRGY+KGG ++EAAE ++KML++G P+YLDR AVLQGLRKRIQQ GN+++Y+ LCK Sbjct: 412 SWLLRGYLKGGQYDEAAEALVKMLDMGLCPDYLDRVAVLQGLRKRIQQWGNVESYLNLCK 471 Query: 302 SLSDANLIGPCLVYLYIRKYKVWVVKML 219 LSD NLIGP LVYLYI+KYK+W++KML Sbjct: 472 RLSDENLIGPSLVYLYIKKYKLWIMKML 499