BLASTX nr result

ID: Glycyrrhiza23_contig00022269 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00022269
         (1235 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003527867.1| PREDICTED: pentatricopeptide repeat-containi...   676   0.0  
ref|XP_003602939.1| Pentatricopeptide repeat-containing protein ...   676   0.0  
ref|XP_002283907.2| PREDICTED: pentatricopeptide repeat-containi...   580   e-163
ref|XP_004136259.1| PREDICTED: pentatricopeptide repeat-containi...   565   e-159
ref|XP_002532248.1| pentatricopeptide repeat-containing protein,...   562   e-158

>ref|XP_003527867.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18475-like
            [Glycine max]
          Length = 546

 Score =  676 bits (1745), Expect = 0.0
 Identities = 328/401 (81%), Positives = 368/401 (91%)
 Frame = -2

Query: 1234 DRVLHQMTYETCKFHEGTFINLMKHYSKSSLHEKVLWAFFSIQPIVREKPSPKAISTCLN 1055
            DRVLHQMTYETCKFHEG F+NLMKH+SKSSLHEK+L A+FSIQPIVREKPSPKA+STCLN
Sbjct: 146  DRVLHQMTYETCKFHEGIFVNLMKHFSKSSLHEKLLHAYFSIQPIVREKPSPKALSTCLN 205

Query: 1054 LLVESNRVDLARQLLLHAKRTLTYRPNVCIFNILVKYHCKHGDLDSAFDVVKEMRNSKSS 875
            LL++SNRVDLAR+LLLHAKR LT +PNVC+FNILVKYHCK+GDLDSAF++V+EMRNS+ S
Sbjct: 206  LLLDSNRVDLARKLLLHAKRDLTRKPNVCVFNILVKYHCKNGDLDSAFEIVEEMRNSEFS 265

Query: 874  YPNIVTYSTLMDGLCQKGRVKEAFELFEEMVSRDRIVPDPLTYNVLINGFCRGGKPDRAR 695
            YPN+VTYSTLMDGLC+ GRVKEAF+LFEEMVSRD IVPDPLTYNVLINGFCRGGKPDRAR
Sbjct: 266  YPNLVTYSTLMDGLCRNGRVKEAFDLFEEMVSRDHIVPDPLTYNVLINGFCRGGKPDRAR 325

Query: 694  NVIEFMKSNGCRPNVFNYSALVNGLCKVGKLQDAKEVFAEMKSSGLKPDTVGYTCLINFF 515
            NVI+FMKSNGC PNV+NYSALV+GLCKVGKL+DAK V AE+K SGLKPD V YT LINF 
Sbjct: 326  NVIQFMKSNGCYPNVYNYSALVDGLCKVGKLEDAKGVLAEIKGSGLKPDAVTYTSLINFL 385

Query: 514  CRNGQIDEAIELLNEMKENECRADTVTFNVILGGLCREGRFEQALDMVEKLPQQGVYLNK 335
            CRNG+ DEAIELL EMKEN C+AD+VTFNV+LGGLCREG+FE+ALDMVEKLPQQGVYLNK
Sbjct: 386  CRNGKSDEAIELLEEMKENGCQADSVTFNVLLGGLCREGKFEEALDMVEKLPQQGVYLNK 445

Query: 334  SSYRIVLNSLTQKCELERAKKLLGLMLSRGFLPHYATSNELLVRLCEXXXXXXXXXALFD 155
             SYRIVLNSLTQKCEL+RAK+LLGLML RGF PHYATSNELLV LC+         ALFD
Sbjct: 446  GSYRIVLNSLTQKCELKRAKELLGLMLRRGFQPHYATSNELLVCLCKAGMVDDAAVALFD 505

Query: 154  LVDMGFQPRLDSWELLIDLICRDRKLLYVFELLDELVITDS 32
            LV+MGFQP L++WE+LI LICR+RKLLYVFELLDELV+T++
Sbjct: 506  LVEMGFQPGLETWEVLIGLICRERKLLYVFELLDELVVTNT 546


>ref|XP_003602939.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355491987|gb|AES73190.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 586

 Score =  676 bits (1743), Expect = 0.0
 Identities = 331/401 (82%), Positives = 361/401 (90%)
 Frame = -2

Query: 1234 DRVLHQMTYETCKFHEGTFINLMKHYSKSSLHEKVLWAFFSIQPIVREKPSPKAISTCLN 1055
            DRVLHQMTYE CKFHEG FINLMKHYSK   HEKV  AF SIQ IVREKPSPKAIS+CLN
Sbjct: 186  DRVLHQMTYEACKFHEGVFINLMKHYSKCGFHEKVFDAFLSIQTIVREKPSPKAISSCLN 245

Query: 1054 LLVESNRVDLARQLLLHAKRTLTYRPNVCIFNILVKYHCKHGDLDSAFDVVKEMRNSKSS 875
            LLV+SN+VDL R+LLL+AKR+L Y+PNVCIFNILVKYHC+ GD+DSAF+VVKEMRNSK S
Sbjct: 246  LLVDSNQVDLVRKLLLYAKRSLVYKPNVCIFNILVKYHCRRGDIDSAFEVVKEMRNSKYS 305

Query: 874  YPNIVTYSTLMDGLCQKGRVKEAFELFEEMVSRDRIVPDPLTYNVLINGFCRGGKPDRAR 695
            YPN++TYSTLMDGLC+ GR+KEAFELFEEMVS+D+IVPDPLTYNVLINGFCR GK DRAR
Sbjct: 306  YPNVITYSTLMDGLCRNGRLKEAFELFEEMVSKDQIVPDPLTYNVLINGFCREGKADRAR 365

Query: 694  NVIEFMKSNGCRPNVFNYSALVNGLCKVGKLQDAKEVFAEMKSSGLKPDTVGYTCLINFF 515
            NVIEFMK+NGC PNVFNYSALV+GLCK GKLQDAK V AEMKSSGLKPD + YT LINFF
Sbjct: 366  NVIEFMKNNGCCPNVFNYSALVDGLCKAGKLQDAKGVLAEMKSSGLKPDAITYTSLINFF 425

Query: 514  CRNGQIDEAIELLNEMKENECRADTVTFNVILGGLCREGRFEQALDMVEKLPQQGVYLNK 335
             RNGQIDEAIELL EMKEN+C+ADTVTFNVILGGLCREGRF++ALDM+EKLPQQGVYLNK
Sbjct: 426  SRNGQIDEAIELLTEMKENDCQADTVTFNVILGGLCREGRFDEALDMIEKLPQQGVYLNK 485

Query: 334  SSYRIVLNSLTQKCELERAKKLLGLMLSRGFLPHYATSNELLVRLCEXXXXXXXXXALFD 155
             SYRIVLNSLTQ CEL +A KLLGLMLSRGF+PHYATSNELLVRLC+         ALFD
Sbjct: 486  GSYRIVLNSLTQNCELRKANKLLGLMLSRGFVPHYATSNELLVRLCKEGMANDAATALFD 545

Query: 154  LVDMGFQPRLDSWELLIDLICRDRKLLYVFELLDELVITDS 32
            LVDMGFQP+ DSWELLIDLICRDRKLLYVFELLDELV ++S
Sbjct: 546  LVDMGFQPQHDSWELLIDLICRDRKLLYVFELLDELVTSNS 586


>ref|XP_002283907.2| PREDICTED: pentatricopeptide repeat-containing protein At5g18475-like
            [Vitis vinifera]
          Length = 513

 Score =  580 bits (1495), Expect = e-163
 Identities = 279/400 (69%), Positives = 339/400 (84%)
 Frame = -2

Query: 1234 DRVLHQMTYETCKFHEGTFINLMKHYSKSSLHEKVLWAFFSIQPIVREKPSPKAISTCLN 1055
            D VLHQMTYETCKFHEG F+NLMKH+SK SLHE+V+  F +I+PIVREKPS KAISTCLN
Sbjct: 113  DAVLHQMTYETCKFHEGIFLNLMKHFSKLSLHERVVEMFDAIRPIVREKPSLKAISTCLN 172

Query: 1054 LLVESNRVDLARQLLLHAKRTLTYRPNVCIFNILVKYHCKHGDLDSAFDVVKEMRNSKSS 875
            LLVESN+VDL R+ LL++K++L   PN CIFNILVK+HCK+GD+DSAF+VV+EM+ S  S
Sbjct: 173  LLVESNQVDLTRKFLLNSKKSLNLEPNTCIFNILVKHHCKNGDIDSAFEVVEEMKKSHVS 232

Query: 874  YPNIVTYSTLMDGLCQKGRVKEAFELFEEMVSRDRIVPDPLTYNVLINGFCRGGKPDRAR 695
            YPN++TYSTL++GLC  GR+KEA ELFEEMVS+D+I+PD LTYN LINGFC G K DRA 
Sbjct: 233  YPNLITYSTLINGLCGSGRLKEAIELFEEMVSKDQILPDALTYNALINGFCHGEKVDRAL 292

Query: 694  NVIEFMKSNGCRPNVFNYSALVNGLCKVGKLQDAKEVFAEMKSSGLKPDTVGYTCLINFF 515
             ++EFMK NGC PNVFNYSAL+NG CK G+L++AKEVF EMKS GLKPDTVGYT LINFF
Sbjct: 293  KIMEFMKKNGCNPNVFNYSALMNGFCKEGRLEEAKEVFDEMKSLGLKPDTVGYTTLINFF 352

Query: 514  CRNGQIDEAIELLNEMKENECRADTVTFNVILGGLCREGRFEQALDMVEKLPQQGVYLNK 335
            CR G++DEA+ELL +M+EN+CRADTVTFNVILGGLCREGRFE+A  M+E+LP +GVYLNK
Sbjct: 353  CRAGRVDEAMELLKDMRENKCRADTVTFNVILGGLCREGRFEEARGMLERLPYEGVYLNK 412

Query: 334  SSYRIVLNSLTQKCELERAKKLLGLMLSRGFLPHYATSNELLVRLCEXXXXXXXXXALFD 155
            +SYRIVLNSL ++ EL++A +L+GLML RG LPH+ATSNELLV LCE         AL  
Sbjct: 413  ASYRIVLNSLCREGELQKATQLVGLMLGRGVLPHFATSNELLVHLCEAGKVGDAVMALLG 472

Query: 154  LVDMGFQPRLDSWELLIDLICRDRKLLYVFELLDELVITD 35
            L+++GF+P  +SW LL++LICR+RKLL  FELLD+LVI +
Sbjct: 473  LLELGFKPEPNSWALLVELICRERKLLPAFELLDDLVIQE 512


>ref|XP_004136259.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18475-like
            [Cucumis sativus] gi|449497032|ref|XP_004160294.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At5g18475-like [Cucumis sativus]
          Length = 504

 Score =  565 bits (1456), Expect = e-159
 Identities = 270/397 (68%), Positives = 333/397 (83%)
 Frame = -2

Query: 1234 DRVLHQMTYETCKFHEGTFINLMKHYSKSSLHEKVLWAFFSIQPIVREKPSPKAISTCLN 1055
            D VLHQMTY+TCK HEG F+NLMKH+SKSS+HE+VL  F++I+ IVREKPS KAISTCLN
Sbjct: 103  DGVLHQMTYDTCKVHEGIFLNLMKHFSKSSMHERVLDMFYAIKSIVREKPSLKAISTCLN 162

Query: 1054 LLVESNRVDLARQLLLHAKRTLTYRPNVCIFNILVKYHCKHGDLDSAFDVVKEMRNSKSS 875
            LLVES+RVDLAR+LL++A+  L  RPN CIFNILVK+HC++GDL +AF+VVKEM++++ S
Sbjct: 163  LLVESDRVDLARKLLVNARSKLNLRPNTCIFNILVKHHCRNGDLQAAFEVVKEMKSARVS 222

Query: 874  YPNIVTYSTLMDGLCQKGRVKEAFELFEEMVSRDRIVPDPLTYNVLINGFCRGGKPDRAR 695
            YPN+VTYSTL+ GLC+ G++KEA E FEEMVS+D I+PD LTYN+LINGFC+ GK DRAR
Sbjct: 223  YPNLVTYSTLIGGLCENGKLKEAIEFFEEMVSKDNILPDALTYNILINGFCQRGKVDRAR 282

Query: 694  NVIEFMKSNGCRPNVFNYSALVNGLCKVGKLQDAKEVFAEMKSSGLKPDTVGYTCLINFF 515
             ++EFMKSNGC PNVFNYS L+NG CK G+LQ+AKEVF E+KS G+KPDT+ YT LIN  
Sbjct: 283  TILEFMKSNGCSPNVFNYSVLMNGYCKEGRLQEAKEVFNEIKSLGMKPDTISYTTLINCL 342

Query: 514  CRNGQIDEAIELLNEMKENECRADTVTFNVILGGLCREGRFEQALDMVEKLPQQGVYLNK 335
            CR G++DEA ELL +MK+ +CRADTVTFNV+LGGLCREGRF++ALDMV+KLP +G YLNK
Sbjct: 343  CRTGRVDEATELLQQMKDKDCRADTVTFNVMLGGLCREGRFDEALDMVQKLPFEGFYLNK 402

Query: 334  SSYRIVLNSLTQKCELERAKKLLGLMLSRGFLPHYATSNELLVRLCEXXXXXXXXXALFD 155
             SYRIVLN LTQK EL +A +LLGLML+RGF+PH+ATSN LL+ LC          +L  
Sbjct: 403  GSYRIVLNFLTQKGELRKATELLGLMLNRGFVPHHATSNTLLLLLCNNGMVKDAVESLLG 462

Query: 154  LVDMGFQPRLDSWELLIDLICRDRKLLYVFELLDELV 44
            L++MGF+P  +SW  L+DLICR+RK+L VFELLD LV
Sbjct: 463  LLEMGFKPEHESWFTLVDLICRERKMLPVFELLDVLV 499



 Score =  129 bits (324), Expect = 2e-27
 Identities = 78/311 (25%), Positives = 158/311 (50%)
 Frame = -2

Query: 1171 LMKHYSKSSLHEKVLWAFFSIQPIVREKPSPKAISTCLNLLVESNRVDLARQLLLHAKRT 992
            L+KH+ ++   +        ++      P+    ST +  L E+ ++  A +        
Sbjct: 196  LVKHHCRNGDLQAAFEVVKEMKSARVSYPNLVTYSTLIGGLCENGKLKEAIEFFEEMVSK 255

Query: 991  LTYRPNVCIFNILVKYHCKHGDLDSAFDVVKEMRNSKSSYPNIVTYSTLMDGLCQKGRVK 812
                P+   +NIL+   C+ G +D A  +++ M+++  S PN+  YS LM+G C++GR++
Sbjct: 256  DNILPDALTYNILINGFCQRGKVDRARTILEFMKSNGCS-PNVFNYSVLMNGYCKEGRLQ 314

Query: 811  EAFELFEEMVSRDRIVPDPLTYNVLINGFCRGGKPDRARNVIEFMKSNGCRPNVFNYSAL 632
            EA E+F E+ S   + PD ++Y  LIN  CR G+ D A  +++ MK   CR +   ++ +
Sbjct: 315  EAKEVFNEIKSLG-MKPDTISYTTLINCLCRTGRVDEATELLQQMKDKDCRADTVTFNVM 373

Query: 631  VNGLCKVGKLQDAKEVFAEMKSSGLKPDTVGYTCLINFFCRNGQIDEAIELLNEMKENEC 452
            + GLC+ G+  +A ++  ++   G   +   Y  ++NF  + G++ +A ELL  M     
Sbjct: 374  LGGLCREGRFDEALDMVQKLPFEGFYLNKGSYRIVLNFLTQKGELRKATELLGLMLNRGF 433

Query: 451  RADTVTFNVILGGLCREGRFEQALDMVEKLPQQGVYLNKSSYRIVLNSLTQKCELERAKK 272
                 T N +L  LC  G  + A++ +  L + G      S+  +++ + ++ ++    +
Sbjct: 434  VPHHATSNTLLLLLCNNGMVKDAVESLLGLLEMGFKPEHESWFTLVDLICRERKMLPVFE 493

Query: 271  LLGLMLSRGFL 239
            LL +++++ +L
Sbjct: 494  LLDVLVTQEYL 504


>ref|XP_002532248.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223528066|gb|EEF30142.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 521

 Score =  562 bits (1448), Expect = e-158
 Identities = 277/405 (68%), Positives = 327/405 (80%)
 Frame = -2

Query: 1234 DRVLHQMTYETCKFHEGTFINLMKHYSKSSLHEKVLWAFFSIQPIVREKPSPKAISTCLN 1055
            D +LHQMTYETCKFHE  F+NLMKH+ KSSLHE+VL  F++IQPIVREKPS KAISTCLN
Sbjct: 112  DALLHQMTYETCKFHENIFLNLMKHFYKSSLHERVLEMFYAIQPIVREKPSLKAISTCLN 171

Query: 1054 LLVESNRVDLARQLLLHAKRTLTYRPNVCIFNILVKYHCKHGDLDSAFDVVKEMRNSKSS 875
            +LVES ++DLA++ LL+    L  RPN CIFNILVK+HCK GDL+SA +V+ EM+ S+ S
Sbjct: 172  ILVESKQIDLAQKCLLYVNEHLKVRPNTCIFNILVKHHCKSGDLESALEVMHEMKKSRRS 231

Query: 874  YPNIVTYSTLMDGLCQKGRVKEAFELFEEMVSRDRIVPDPLTYNVLINGFCRGGKPDRAR 695
            YPN++TYSTL+DGLC  GR+KEA ELFEEMVS+D+I+PD LTY+VLI GFC GGK DRAR
Sbjct: 232  YPNVITYSTLIDGLCGNGRLKEAIELFEEMVSKDQILPDALTYSVLIKGFCHGGKADRAR 291

Query: 694  NVIEFMKSNGCRPNVFNYSALVNGLCKVGKLQDAKEVFAEMKSSGLKPDTVGYTCLINFF 515
             ++EFM+SNGC PNVFNYS L+NG CK G+L++AKEVF EMKSSGLKPDTVGYT LIN F
Sbjct: 292  KIMEFMRSNGCDPNVFNYSVLMNGFCKEGRLEEAKEVFDEMKSSGLKPDTVGYTTLINCF 351

Query: 514  CRNGQIDEAIELLNEMKENECRADTVTFNVILGGLCREGRFEQALDMVEKLPQQGVYLNK 335
            C  G+IDEA+ELL EM E +C+AD VTFNV+L GLCREGRF++AL M+E L  +GVYLNK
Sbjct: 352  CGVGRIDEAMELLKEMTEMKCKADAVTFNVLLKGLCREGRFDEALRMLENLAYEGVYLNK 411

Query: 334  SSYRIVLNSLTQKCELERAKKLLGLMLSRGFLPHYATSNELLVRLCEXXXXXXXXXALFD 155
             SYRIVLN L QK ELE++  LLGLMLSRGF+PHYATSNELLV LCE         ALF 
Sbjct: 412  GSYRIVLNFLCQKGELEKSCALLGLMLSRGFVPHYATSNELLVCLCEAGMVDNAVTALFG 471

Query: 154  LVDMGFQPRLDSWELLIDLICRDRKLLYVFELLDELVITDS*YLH 20
            L  MGF P   SW  LI+ ICR+RKLL+VFEL+DELV  +S   H
Sbjct: 472  LTQMGFTPEPKSWAHLIEYICRERKLLFVFELVDELVEKESGKFH 516



 Score = 92.8 bits (229), Expect = 2e-16
 Identities = 77/332 (23%), Positives = 138/332 (41%), Gaps = 38/332 (11%)
 Frame = -2

Query: 928  DLDSAFDVVKEMRNSKSSYPNIVTYSTLMDGLCQKGRV---------------------- 815
            D   A ++   +   K    N  TYSTL+  L Q  +                       
Sbjct: 71   DPQHALEIFNMVGEQKGFNHNHATYSTLIHKLAQTKKFHAVDALLHQMTYETCKFHENIF 130

Query: 814  -------------KEAFELFEEMVSRDRIVPDPLTYNVLINGFCRGGKPDRARNVIEFMK 674
                         +   E+F  +    R  P     +  +N      + D A+  + ++ 
Sbjct: 131  LNLMKHFYKSSLHERVLEMFYAIQPIVREKPSLKAISTCLNILVESKQIDLAQKCLLYVN 190

Query: 673  SN-GCRPNVFNYSALVNGLCKVGKLQDAKEVFAEMKSSGLK-PDTVGYTCLINFFCRNGQ 500
             +   RPN   ++ LV   CK G L+ A EV  EMK S    P+ + Y+ LI+  C NG+
Sbjct: 191  EHLKVRPNTCIFNILVKHHCKSGDLESALEVMHEMKKSRRSYPNVITYSTLIDGLCGNGR 250

Query: 499  IDEAIELLNEM-KENECRADTVTFNVILGGLCREGRFEQALDMVEKLPQQGVYLNKSSYR 323
            + EAIEL  EM  +++   D +T++V++ G C  G+ ++A  ++E +   G   N  +Y 
Sbjct: 251  LKEAIELFEEMVSKDQILPDALTYSVLIKGFCHGGKADRARKIMEFMRSNGCDPNVFNYS 310

Query: 322  IVLNSLTQKCELERAKKLLGLMLSRGFLPHYATSNELLVRLCEXXXXXXXXXALFDLVDM 143
            +++N   ++  LE AK++   M S G  P       L+   C           L ++ +M
Sbjct: 311  VLMNGFCKEGRLEEAKEVFDEMKSSGLKPDTVGYTTLINCFCGVGRIDEAMELLKEMTEM 370

Query: 142  GFQPRLDSWELLIDLICRDRKLLYVFELLDEL 47
              +    ++ +L+  +CR+ +      +L+ L
Sbjct: 371  KCKADAVTFNVLLKGLCREGRFDEALRMLENL 402


Top