BLASTX nr result

ID: Glycyrrhiza23_contig00005509 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00005509
         (943 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003598903.1| Pentatricopeptide repeat-containing protein ...   447   e-123
ref|XP_003555182.1| PREDICTED: pentatricopeptide repeat-containi...   437   e-120
ref|XP_002269066.2| PREDICTED: pentatricopeptide repeat-containi...   395   e-108
emb|CBI17752.3| unnamed protein product [Vitis vinifera]              395   e-108
ref|XP_002513116.1| pentatricopeptide repeat-containing protein,...   387   e-105

>ref|XP_003598903.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355487951|gb|AES69154.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 767

 Score =  447 bits (1150), Expect = e-123
 Identities = 222/262 (84%), Positives = 242/262 (92%)
 Frame = +1

Query: 1    EQMEKLGLPVIDDLSKFFSLLVERKGPIMGLEAFTHLKEKGYVSVEIYNVLMDSLHMTGQ 180
            E+M+KLG PVIDDLSKFFS LVE+KGP M LE FTHLKEK YVSVEIYN+ M+SLH++G+
Sbjct: 459  EKMKKLGFPVIDDLSKFFSHLVEKKGPEMALEIFTHLKEKSYVSVEIYNIFMESLHLSGK 518

Query: 181  MKKALSLFDEIKGSDLKPDSSTYGIAILCLVDLGEIQEACVCHNKIIEMSCIPSVAAYCC 360
            ++KALSLFDEIKGSDL+PDSSTY IAILCLVD G+I+EAC CHNKIIEMS IPSVAAY C
Sbjct: 519  VEKALSLFDEIKGSDLEPDSSTYNIAILCLVDHGQIKEACECHNKIIEMSSIPSVAAYNC 578

Query: 361  LAKGLCKIGEIDEAMMLVRDCLGNVTSGPMEFKYSLTVIHACKSNDAEKVIDVLNEMMQQ 540
            LAKGLC IGEIDEAM+LVRDCLGNVTSGPMEFKY LT+I  CKSN AEK+IDVLNEMMQ+
Sbjct: 579  LAKGLCNIGEIDEAMLLVRDCLGNVTSGPMEFKYCLTIIRMCKSNVAEKLIDVLNEMMQE 638

Query: 541  GCPPDNVVCSAVISGMCKHGTIEEARKIFSNLRERKLLTESDTIVYDELLIDHMKKKTAD 720
            GC  DNVVCSA+ISGMCK+GTIEEARK+FS LRERKLLTESDTIVYDELLIDHMKKKTAD
Sbjct: 639  GCSLDNVVCSAIISGMCKYGTIEEARKVFSILRERKLLTESDTIVYDELLIDHMKKKTAD 698

Query: 721  LVISGLKFFGLESKLKSKGCKL 786
            LVISGLKFFGLESKLKSKGCKL
Sbjct: 699  LVISGLKFFGLESKLKSKGCKL 720



 Score = 63.2 bits (152), Expect = 9e-08
 Identities = 61/249 (24%), Positives = 102/249 (40%), Gaps = 34/249 (13%)
 Frame = +1

Query: 133 VEIYNVLMDSLHMTGQMKKALSLFDEIKGSDLKPDSSTYGIAILCLVDLGEIQEACVCHN 312
           V +YN +MD+L  TG +  ALS++++ +   L  +S T+ I I  L   G+I E      
Sbjct: 225 VFLYNRIMDALVKTGHLDLALSVYNDFREDGLVEESVTFMILIKGLCKGGKIDEMLEVLG 284

Query: 313 KIIEMSCIPSVAAYCCLAKGLCKIGEIDEAMMLVRDCL---------------------G 429
           ++ E  C P V AY  L + + K G +D  + + ++                       G
Sbjct: 285 RMREKLCKPDVFAYTALVRIMVKEGNLDGCLRVWKEMKRDRVDPDVMAYGTIIGGLAKGG 344

Query: 430 NVTSGPMEFK-------------YSLTVIHACKSNDAEKVIDVLNEMMQQGCPPDNVVCS 570
            V+ G   FK             Y   V      N      D+L +++  G   D  + +
Sbjct: 345 RVSEGYELFKEMKSKGHLIDRAIYGSLVESFVAGNKVGLAFDLLKDLVSSGYRADLGMYN 404

Query: 571 AVISGMCKHGTIEEARKIFSNLRERKLLTESDTIVYDELLIDHMKKKTADLVISGLKFFG 750
            +I G+C    +E+A K+F    +  L  E D +    LL+ + + K  +      +FF 
Sbjct: 405 NLIEGLCNLNKVEKAYKLFQVTIQEGL--EPDFLSVKPLLLAYAEAKRME------EFFM 456

Query: 751 LESKLKSKG 777
           L  K+K  G
Sbjct: 457 LLEKMKKLG 465


>ref|XP_003555182.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like
            [Glycine max]
          Length = 733

 Score =  437 bits (1123), Expect = e-120
 Identities = 214/265 (80%), Positives = 238/265 (89%)
 Frame = +1

Query: 1    EQMEKLGLPVIDDLSKFFSLLVERKGPIMGLEAFTHLKEKGYVSVEIYNVLMDSLHMTGQ 180
            EQM+KLG PVI DLSKFFS+LVE+KGPIM LE F  LKEKG+VSVEIYN+ MDSLH  G+
Sbjct: 469  EQMQKLGFPVIADLSKFFSVLVEKKGPIMALETFGQLKEKGHVSVEIYNIFMDSLHKIGE 528

Query: 181  MKKALSLFDEIKGSDLKPDSSTYGIAILCLVDLGEIQEACVCHNKIIEMSCIPSVAAYCC 360
            +KKALSLFDE+KG  LKPDS TY  AILCLVDLGEI+EAC CHN+IIEMSCIPSVAAY  
Sbjct: 529  VKKALSLFDEMKGLSLKPDSFTYCTAILCLVDLGEIKEACACHNRIIEMSCIPSVAAYSS 588

Query: 361  LAKGLCKIGEIDEAMMLVRDCLGNVTSGPMEFKYSLTVIHACKSNDAEKVIDVLNEMMQQ 540
            L KGLC+IGEIDEAM+LVRDCLGNV+ GP+EFKYSLT+IHACKSN AEKVIDVLNEM++Q
Sbjct: 589  LTKGLCQIGEIDEAMLLVRDCLGNVSDGPLEFKYSLTIIHACKSNVAEKVIDVLNEMIEQ 648

Query: 541  GCPPDNVVCSAVISGMCKHGTIEEARKIFSNLRERKLLTESDTIVYDELLIDHMKKKTAD 720
            GC  DNV+  ++ISGMCKHGTIEEARK+FSNLRER  LTES+TIVYDELLIDHMKKKTAD
Sbjct: 649  GCSLDNVIYCSIISGMCKHGTIEEARKVFSNLRERNFLTESNTIVYDELLIDHMKKKTAD 708

Query: 721  LVISGLKFFGLESKLKSKGCKLLPS 795
            LV+S LKFFGLESKLK+KGCKLLPS
Sbjct: 709  LVLSSLKFFGLESKLKAKGCKLLPS 733



 Score = 72.0 bits (175), Expect = 2e-10
 Identities = 47/168 (27%), Positives = 79/168 (47%)
 Frame = +1

Query: 133 VEIYNVLMDSLHMTGQMKKALSLFDEIKGSDLKPDSSTYGIAILCLVDLGEIQEACVCHN 312
           V +YN +MD+L  TG +  ALS++D++K   L  +S T+ + +  L   G I E      
Sbjct: 235 VFLYNRVMDALVRTGHLDLALSVYDDLKEDGLVEESVTFMVLVKGLCKCGRIDEMLEVLG 294

Query: 313 KIIEMSCIPSVAAYCCLAKGLCKIGEIDEAMMLVRDCLGNVTSGPMEFKYSLTVIHACKS 492
           ++ E  C P V AY  L K L   G +D A + V + +      P    Y+  ++   K 
Sbjct: 295 RMRERLCKPDVFAYTALVKILVPAGNLD-ACLRVWEEMKRDRVEPDVKAYATMIVGLAKG 353

Query: 493 NDAEKVIDVLNEMMQQGCPPDNVVCSAVISGMCKHGTIEEARKIFSNL 636
              ++  ++  EM  +GC  D V+  A++      G +E A  +  +L
Sbjct: 354 GRVQEGYELFREMKGKGCLVDRVIYGALVEAFVAEGKVELAFDLLKDL 401


>ref|XP_002269066.2| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like
            [Vitis vinifera]
          Length = 1294

 Score =  395 bits (1015), Expect = e-108
 Identities = 190/264 (71%), Positives = 229/264 (86%), Gaps = 1/264 (0%)
 Frame = +1

Query: 4    QMEKLGLPVIDDLSKFFSLLVERKGPI-MGLEAFTHLKEKGYVSVEIYNVLMDSLHMTGQ 180
            QM+KLG PVIDDLSKFFS+++E+   + + LE F HLK KGY S+ IYN+LM+++H TG+
Sbjct: 961  QMQKLGFPVIDDLSKFFSVMIEKGERLKLALEVFEHLKAKGYCSISIYNILMEAIHRTGE 1020

Query: 181  MKKALSLFDEIKGSDLKPDSSTYGIAILCLVDLGEIQEACVCHNKIIEMSCIPSVAAYCC 360
            +KKALSLFD+IK S+ KPDSSTY  AI+C V++G++QEAC C+NKIIEM  +PSVAAY  
Sbjct: 1021 VKKALSLFDDIKDSNFKPDSSTYSNAIICFVEVGDVQEACACYNKIIEMCQLPSVAAYRS 1080

Query: 361  LAKGLCKIGEIDEAMMLVRDCLGNVTSGPMEFKYSLTVIHACKSNDAEKVIDVLNEMMQQ 540
            L KGLCK  EID A+MLVRDCL NVTSGPMEFKY+LT++HACKS +AEKVIDVLNEMMQ+
Sbjct: 1081 LVKGLCKSEEIDAAIMLVRDCLANVTSGPMEFKYTLTILHACKSGNAEKVIDVLNEMMQE 1140

Query: 541  GCPPDNVVCSAVISGMCKHGTIEEARKIFSNLRERKLLTESDTIVYDELLIDHMKKKTAD 720
            GC PD V  SA+ISGMCKHGT+EEARK+FSN+RERKLLTE++ IVYDE+LI+HMKKKTAD
Sbjct: 1141 GCTPDEVTYSALISGMCKHGTLEEARKVFSNMRERKLLTEANVIVYDEILIEHMKKKTAD 1200

Query: 721  LVISGLKFFGLESKLKSKGCKLLP 792
            LV+SGLKFFGLESKL+SKG  LLP
Sbjct: 1201 LVLSGLKFFGLESKLRSKGSTLLP 1224



 Score = 64.7 bits (156), Expect = 3e-08
 Identities = 52/207 (25%), Positives = 88/207 (42%), Gaps = 11/207 (5%)
 Frame = +1

Query: 133  VEIYNVLMDSLHMTGQMKKALSLFDEIKGSDLKPDSSTYGIAILCLVDLGEIQEACVCHN 312
            V +YN +MD L  TG +  A+S++++ K   L  +S TY I +  L   G I E      
Sbjct: 761  VFLYNRIMDGLVKTGHLDLAMSVYEDFKEDGLVEESVTYMILVKGLCKAGRIDEVLEVWE 820

Query: 313  KIIEMSCIPSVAAYCCLAKGLCKIGEIDEAMMLVRD-----------CLGNVTSGPMEFK 459
            ++ +    P V AY  L   LC    + E   L ++             G++  G   F 
Sbjct: 821  EMRKDKVEPDVMAYTTLVAALCNGNRVGEGFELFKEMKQKKYLIDRAIYGSLIEG---FV 877

Query: 460  YSLTVIHACKSNDAEKVIDVLNEMMQQGCPPDNVVCSAVISGMCKHGTIEEARKIFSNLR 639
             +  V  AC         D+L ++M  G   D  + +++I GMC    +++A K+F    
Sbjct: 878  VNERVGSAC---------DLLKDLMDSGYRADLAIYNSLIEGMCNVKQVDKAYKLFQVTV 928

Query: 640  ERKLLTESDTIVYDELLIDHMKKKTAD 720
               L  E + +    +L+ + + K  D
Sbjct: 929  HESL--EPNFLTVKPMLVSYAEMKRMD 953


>emb|CBI17752.3| unnamed protein product [Vitis vinifera]
          Length = 729

 Score =  395 bits (1015), Expect = e-108
 Identities = 190/264 (71%), Positives = 229/264 (86%), Gaps = 1/264 (0%)
 Frame = +1

Query: 4    QMEKLGLPVIDDLSKFFSLLVERKGPI-MGLEAFTHLKEKGYVSVEIYNVLMDSLHMTGQ 180
            QM+KLG PVIDDLSKFFS+++E+   + + LE F HLK KGY S+ IYN+LM+++H TG+
Sbjct: 457  QMQKLGFPVIDDLSKFFSVMIEKGERLKLALEVFEHLKAKGYCSISIYNILMEAIHRTGE 516

Query: 181  MKKALSLFDEIKGSDLKPDSSTYGIAILCLVDLGEIQEACVCHNKIIEMSCIPSVAAYCC 360
            +KKALSLFD+IK S+ KPDSSTY  AI+C V++G++QEAC C+NKIIEM  +PSVAAY  
Sbjct: 517  VKKALSLFDDIKDSNFKPDSSTYSNAIICFVEVGDVQEACACYNKIIEMCQLPSVAAYRS 576

Query: 361  LAKGLCKIGEIDEAMMLVRDCLGNVTSGPMEFKYSLTVIHACKSNDAEKVIDVLNEMMQQ 540
            L KGLCK  EID A+MLVRDCL NVTSGPMEFKY+LT++HACKS +AEKVIDVLNEMMQ+
Sbjct: 577  LVKGLCKSEEIDAAIMLVRDCLANVTSGPMEFKYTLTILHACKSGNAEKVIDVLNEMMQE 636

Query: 541  GCPPDNVVCSAVISGMCKHGTIEEARKIFSNLRERKLLTESDTIVYDELLIDHMKKKTAD 720
            GC PD V  SA+ISGMCKHGT+EEARK+FSN+RERKLLTE++ IVYDE+LI+HMKKKTAD
Sbjct: 637  GCTPDEVTYSALISGMCKHGTLEEARKVFSNMRERKLLTEANVIVYDEILIEHMKKKTAD 696

Query: 721  LVISGLKFFGLESKLKSKGCKLLP 792
            LV+SGLKFFGLESKL+SKG  LLP
Sbjct: 697  LVLSGLKFFGLESKLRSKGSTLLP 720



 Score = 63.9 bits (154), Expect = 5e-08
 Identities = 58/253 (22%), Positives = 105/253 (41%), Gaps = 13/253 (5%)
 Frame = +1

Query: 1   EQMEKLGL-PVIDDLSKFFSLLVERKGPIMGLEAFTHLKEKGYVSVEI-YNVLMDSLHMT 174
           E+M+K G+ P +   ++    LV+     + +  +   KE G V   + Y +L+  L   
Sbjct: 211 EKMKKFGIKPRVFLYNRIMDGLVKTGHLDLAMSVYEDFKEDGLVEESVTYMILVKGLCKA 270

Query: 175 GQMKKALSLFDEIKGSDLKPDSSTYGIAILCLVDLGEIQEACVCHNKIIEMSCIPSVAAY 354
           G++ + L L D ++G+  KPD   Y   +  LV  G +        ++ +    P V AY
Sbjct: 271 GRIDEVLELLDRMRGNLCKPDVFAYTAMVKVLVAEGNLDGCLRVWEEMRKDKVEPDVMAY 330

Query: 355 CCLAKGLCKIGEIDEAMMLVRDC-----------LGNVTSGPMEFKYSLTVIHACKSNDA 501
             L   LC    + E   L ++             G++  G   F  +  V  AC     
Sbjct: 331 TTLVAALCNGNRVGEGFELFKEMKQKKYLIDRAIYGSLIEG---FVVNERVGSAC----- 382

Query: 502 EKVIDVLNEMMQQGCPPDNVVCSAVISGMCKHGTIEEARKIFSNLRERKLLTESDTIVYD 681
               D+L ++M  G   D  + +++I GMC    +++A K+F       L  E + +   
Sbjct: 383 ----DLLKDLMDSGYRADLAIYNSLIEGMCNVKQVDKAYKLFQVTVHESL--EPNFLTVK 436

Query: 682 ELLIDHMKKKTAD 720
            +L+ + + K  D
Sbjct: 437 PMLVSYAEMKRMD 449


>ref|XP_002513116.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223548127|gb|EEF49619.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 1128

 Score =  387 bits (994), Expect = e-105
 Identities = 188/263 (71%), Positives = 223/263 (84%), Gaps = 1/263 (0%)
 Frame = +1

Query: 4    QMEKLGLPVIDDLSKFFSLLVERKGPI-MGLEAFTHLKEKGYVSVEIYNVLMDSLHMTGQ 180
            QME+LG  V+DD+SK FS LV R+  I + LE F  LK KGY+SV IYN LM++L   G+
Sbjct: 864  QMERLGFSVMDDISKLFSFLVRREEIITLALEVFEELKVKGYISVLIYNTLMEALLKVGE 923

Query: 181  MKKALSLFDEIKGSDLKPDSSTYGIAILCLVDLGEIQEACVCHNKIIEMSCIPSVAAYCC 360
            ++KALSLF E+K  + +PDS+TY IA++C V+ G IQEACVCHNKIIEMS +PSVAAYC 
Sbjct: 924  VRKALSLFSEMKDLNCEPDSNTYSIAVICFVEDGNIQEACVCHNKIIEMSSVPSVAAYCS 983

Query: 361  LAKGLCKIGEIDEAMMLVRDCLGNVTSGPMEFKYSLTVIHACKSNDAEKVIDVLNEMMQQ 540
            L KGLC IGEIDEAMMLVRDCLGNVTSGPMEFKY+LTV+H C+S DAEKVI+VLNEMM +
Sbjct: 984  LTKGLCDIGEIDEAMMLVRDCLGNVTSGPMEFKYTLTVLHVCRSGDAEKVIEVLNEMMHE 1043

Query: 541  GCPPDNVVCSAVISGMCKHGTIEEARKIFSNLRERKLLTESDTIVYDELLIDHMKKKTAD 720
             CPP+ V+ SA+ISGMCKHGT+EEARK+F+NLRERKLLTE+ TI YDE LI+HMKKKTAD
Sbjct: 1044 NCPPNEVILSAIISGMCKHGTLEEARKVFTNLRERKLLTEAKTIFYDERLIEHMKKKTAD 1103

Query: 721  LVISGLKFFGLESKLKSKGCKLL 789
            LV+SGLKFFGLESKL++KGC LL
Sbjct: 1104 LVVSGLKFFGLESKLRAKGCTLL 1126



 Score = 73.6 bits (179), Expect = 7e-11
 Identities = 52/194 (26%), Positives = 86/194 (44%)
 Frame = +1

Query: 139  IYNVLMDSLHMTGQMKKALSLFDEIKGSDLKPDSSTYGIAILCLVDLGEIQEACVCHNKI 318
            +YN +MD+L  T  +  AL ++D+ K   L  DS TY I I  L   G I E      ++
Sbjct: 666  LYNRIMDALIKTAHLDLALVVYDDFKSDGLVEDSVTYMILIKGLCKFGRIDEMMEVWEEM 725

Query: 319  IEMSCIPSVAAYCCLAKGLCKIGEIDEAMMLVRDCLGNVTSGPMEFKYSLTVIHACKSND 498
                  P V AY  +  GLCK G + E   L ++   N         Y + +    K   
Sbjct: 726  KRDGVNPDVMAYATVVTGLCKGGRVAEGYELFKEMKENKVLIDRAI-YGVLIEAFVKDGK 784

Query: 499  AEKVIDVLNEMMQQGCPPDNVVCSAVISGMCKHGTIEEARKIFSNLRERKLLTESDTIVY 678
                 D+L  ++  G   D  + +++I G+C    +++ARK+F  + +  L  E D    
Sbjct: 785  IGSACDLLQGLVDSGYRADLGIYNSLIEGLCNVKRVDKARKLFQIMVQEGL--ELDFKTV 842

Query: 679  DELLIDHMKKKTAD 720
            + +L+ + + K  D
Sbjct: 843  NPMLVSYAEMKRMD 856


Top