BLASTX nr result

ID: Dioscorea21_contig00001841

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00001841
         (2468 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002263650.1| PREDICTED: pentatricopeptide repeat-containi...   309   3e-81
ref|XP_002867617.1| hypothetical protein ARALYDRAFT_354257 [Arab...   304   7e-80
ref|XP_003533519.1| PREDICTED: pentatricopeptide repeat-containi...   304   8e-80
ref|NP_194257.1| pentatricopeptide repeat-containing protein [Ar...   303   2e-79
ref|XP_002322407.1| predicted protein [Populus trichocarpa] gi|2...   300   9e-79

>ref|XP_002263650.1| PREDICTED: pentatricopeptide repeat-containing protein At4g25270,
            chloroplastic [Vitis vinifera]
            gi|296084180|emb|CBI24568.3| unnamed protein product
            [Vitis vinifera]
          Length = 516

 Score =  309 bits (791), Expect = 3e-81
 Identities = 152/258 (58%), Positives = 194/258 (75%), Gaps = 1/258 (0%)
 Frame = -1

Query: 2468 YAHHGLAREAWDVCRGMLAAGLEPNSIAISTMLAKFSYNSKLGYEIHGWVLRQGMEWNLS 2289
            Y  HGL  +A  + R ML  G EP+++AIST++     + KL  +IHGWVLR+G++WNLS
Sbjct: 258  YIRHGLPLQALSIFRRMLQYGFEPDAVAISTVVTGVP-SLKLAGQIHGWVLRRGVQWNLS 316

Query: 2288 IANALIAMYAEHKQLRCARILFESMPEKDLVTWNTMISVHRKDCRAINIFKQMEDSGVQP 2109
            IAN+LI +Y+ H +L  A  LF+ MPE+D+V+WN++IS HRKD +AI  F +M+ + V P
Sbjct: 317  IANSLIVLYSNHGKLDQACWLFDHMPERDVVSWNSIISAHRKDLKAITYFSRMQKADVLP 376

Query: 2108 DRVTFVSLLSSCANLGMVDSGRRLFNKMKKKYRIAPGMEHYGCMVNMLGRAGLVDEAYE- 1932
            D VTFVSLLS+CA+LG+V  G  LF+ M++ Y + P MEHY CMVN+ GRAGL++EAYE 
Sbjct: 377  DVVTFVSLLSACAHLGLVKDGEGLFSMMREDYGMIPSMEHYACMVNLYGRAGLIEEAYEI 436

Query: 1931 MAKTMPFDGGPRVWGALLYACSVHGNVTIGEVAAERLFELEPDNEHNFELLMRIYRNAGR 1752
            + K M F+ GP VWGALLYAC  H NV IG++AAE LFELEPDNEHNFELLM IYRN GR
Sbjct: 437  IEKRMEFEAGPTVWGALLYACYFHHNVDIGKIAAECLFELEPDNEHNFELLMNIYRNVGR 496

Query: 1751 LEDVETVRMMMRERGLDT 1698
            LEDVE VR MM +RG D+
Sbjct: 497  LEDVEKVRKMMADRGFDS 514


>ref|XP_002867617.1| hypothetical protein ARALYDRAFT_354257 [Arabidopsis lyrata subsp.
            lyrata] gi|297313453|gb|EFH43876.1| hypothetical protein
            ARALYDRAFT_354257 [Arabidopsis lyrata subsp. lyrata]
          Length = 758

 Score =  304 bits (779), Expect = 7e-80
 Identities = 146/258 (56%), Positives = 191/258 (74%), Gaps = 1/258 (0%)
 Frame = -1

Query: 2468 YAHHGLAREAWDVCRGMLAAGLEPNSIAISTMLAKFSYNSKLGYEIHGWVLRQGMEWNLS 2289
            Y HHGL  EA D+ R M+  G++P+ +AIS++LA+   + K G ++HGWV+R+GMEW LS
Sbjct: 502  YLHHGLLHEALDIFRLMVQNGIDPDKVAISSVLARV-LSFKHGRQLHGWVIRRGMEWELS 560

Query: 2288 IANALIAMYAEHKQLRCARILFESMPEKDLVTWNTMISVHRKDCRAINIFKQMEDSGVQP 2109
            +ANALI +Y++  QL  A  +F+ M E+D V+WN +IS H +D      F+QM+ +  +P
Sbjct: 561  VANALIVLYSKRGQLGQACFIFDQMLERDTVSWNAIISAHSRDSNGFKYFEQMQHADAKP 620

Query: 2108 DRVTFVSLLSSCANLGMVDSGRRLFNKMKKKYRIAPGMEHYGCMVNMLGRAGLVDEAYEM 1929
            D +TFVS+LS CAN GMV+ G RLF+ M K+Y I P MEHY CMVN+ GRAG+++EAY M
Sbjct: 621  DGITFVSVLSLCANTGMVEDGERLFSLMSKEYGINPKMEHYACMVNLYGRAGMMEEAYSM 680

Query: 1928 -AKTMPFDGGPRVWGALLYACSVHGNVTIGEVAAERLFELEPDNEHNFELLMRIYRNAGR 1752
              + M F+ GP VWGALLYAC +HGN  IGEV+A+RLFELEPDNEHNFELLMRIY  A R
Sbjct: 681  IVQEMEFEAGPTVWGALLYACYLHGNTDIGEVSAQRLFELEPDNEHNFELLMRIYSKAKR 740

Query: 1751 LEDVETVRMMMRERGLDT 1698
             EDVE VR M+ +RGL+T
Sbjct: 741  AEDVERVRQMLVDRGLET 758



 Score = 66.6 bits (161), Expect = 3e-08
 Identities = 69/268 (25%), Positives = 127/268 (47%), Gaps = 12/268 (4%)
 Frame = -1

Query: 2468 YAHHGLAREAWDVCRGMLAAGLEPNSIAISTMLAKFSY--NSKLGYEIHGWVLRQGMEWN 2295
            YA  G   +A  +   M   G++P+      +L       + ++G  IH  +++ G  ++
Sbjct: 401  YAELGQYEDAMALYFQMAEDGVKPDRFTFPRVLKACGGIGSVQIGEAIHRDLVKAGFGYD 460

Query: 2294 LSIANALIAMYAEHKQLRCARILFESMPEKDLVTWNTMIS---VHRKDCRAINIFKQMED 2124
            + + NAL+ MYA+   +  AR +F+ +P KD V+WN+M++    H     A++IF+ M  
Sbjct: 461  VHVLNALVDMYAKCGDIVKARNVFDMIPNKDYVSWNSMLTGYLHHGLLHEALDIFRLMVQ 520

Query: 2123 SGVQPDRVTFVSLLSSCANLGMVDSGRRLFNKMKKKYRIAPGMEHYGCMVNML----GRA 1956
            +G+ PD+V   S+L   A +     GR+L       + I  GME    + N L     + 
Sbjct: 521  NGIDPDKVAISSVL---ARVLSFKHGRQLHG-----WVIRRGMEWELSVANALIVLYSKR 572

Query: 1955 GLVDEAYEMAKTMPFDGGPRVWGALLYACSVHGNVTIGEVAAERL--FELEPDNEHNFEL 1782
            G + +A  +   M  +     W A++   S H   + G    E++   + +PD    F  
Sbjct: 573  GQLGQACFIFDQM-LERDTVSWNAII---SAHSRDSNGFKYFEQMQHADAKPDG-ITFVS 627

Query: 1781 LMRIYRNAGRLEDVETV-RMMMRERGLD 1701
            ++ +  N G +ED E +  +M +E G++
Sbjct: 628  VLSLCANTGMVEDGERLFSLMSKEYGIN 655


>ref|XP_003533519.1| PREDICTED: pentatricopeptide repeat-containing protein At4g25270,
            chloroplastic-like [Glycine max]
          Length = 526

 Score =  304 bits (778), Expect = 8e-80
 Identities = 153/262 (58%), Positives = 194/262 (74%), Gaps = 6/262 (2%)
 Frame = -1

Query: 2468 YAHHGLAREAWDVCRGMLAAGLEPNSIAISTMLAKFSYNSKLGYEIHGWVLRQGMEWNLS 2289
            Y HHGL  +A ++ R ML  G EP+S++IST+L   S +  LG +IHGWV+ QG EWNLS
Sbjct: 269  YVHHGLEVQAMNIFRQMLLEGCEPDSVSISTVLTGVS-SLGLGVQIHGWVISQGHEWNLS 327

Query: 2288 IANALIAMYAEHKQLRCARILFESMPEKDLVTWNTMISVHRKDCRAINIFKQMEDSGVQP 2109
            IAN+LI MY+ H +L  AR +F  MPE+D+V+WN++IS H K   A+  F+QME +GVQP
Sbjct: 328  IANSLIMMYSNHGRLEKARWVFNLMPERDVVSWNSIISAHCKRREALAFFEQMEGAGVQP 387

Query: 2108 DRVTFVSLLSSCANLGMVDSGRRLFNKMKKKYRIAPGMEHYGCMVNMLGRAGLVDEAYEM 1929
            D++TFVS+LS+CA LG++  G RLF  M  KY+I P MEHYGCMVN+ GRAGL+ +AY +
Sbjct: 388  DKITFVSILSACAYLGLLKDGERLFALMCGKYKIKPIMEHYGCMVNLYGRAGLIKKAYSI 447

Query: 1928 AKTMPFDG------GPRVWGALLYACSVHGNVTIGEVAAERLFELEPDNEHNFELLMRIY 1767
                  DG      GP +WGALLYAC +HG+ TIGE+AA  LF+LEPDNEHNF LLMRIY
Sbjct: 448  I----VDGIGTEAAGPTLWGALLYACFMHGDATIGEIAANWLFDLEPDNEHNFVLLMRIY 503

Query: 1766 RNAGRLEDVETVRMMMRERGLD 1701
             NAGRLED+E VRMM+ +RGLD
Sbjct: 504  ENAGRLEDMERVRMMLVDRGLD 525



 Score = 69.3 bits (168), Expect = 5e-09
 Identities = 44/142 (30%), Positives = 79/142 (55%), Gaps = 5/142 (3%)
 Frame = -1

Query: 2468 YAHHGLAREAWDVCRGMLAAGLEPNSIAISTMLAKFSY--NSKLGYEIHGWVLRQGMEWN 2295
            YA  G   EA  +   M+  G+E +      +L   +   + ++G E+H   +R G   +
Sbjct: 168  YAQVGHYDEAIALYFQMVEEGVEADLFTFPRVLKVCAGIGSVQVGEEVHRHAIRAGFAAD 227

Query: 2294 LSIANALIAMYAEHKQLRCARILFESMPEKDLVTWNTMISV---HRKDCRAINIFKQMED 2124
              I NAL+ MY++   +  AR +F+ MP +D V+WN+M++    H  + +A+NIF+QM  
Sbjct: 228  GFILNALVDMYSKCGDIVKARKVFDKMPHRDPVSWNSMLTAYVHHGLEVQAMNIFRQMLL 287

Query: 2123 SGVQPDRVTFVSLLSSCANLGM 2058
             G +PD V+  ++L+  ++LG+
Sbjct: 288  EGCEPDSVSISTVLTGVSSLGL 309



 Score = 61.2 bits (147), Expect = 1e-06
 Identities = 65/249 (26%), Positives = 104/249 (41%), Gaps = 38/249 (15%)
 Frame = -1

Query: 2342 GYEIHGWVLRQGMEWNLSIANALIAMYAEHKQLRCARILFESMPEKDLVT--WNTMISVH 2169
            G  +H  +    +  N+ I++ L+ +YA    L  A  LF+ M ++D     WN++IS +
Sbjct: 109  GIRVHRLIPTSLLHKNVGISSKLLRLYASCGYLDDAHDLFDQMAKRDTSAFPWNSLISGY 168

Query: 2168 RKDCR---AINIFKQMEDSGVQPDRVTFVSLLSSCANLGMVDSGRRLFNKMKKKYRIAPG 1998
             +      AI ++ QM + GV+ D  TF  +L  CA +G V  G  +     +    A G
Sbjct: 169  AQVGHYDEAIALYFQMVEEGVEADLFTFPRVLKVCAGIGSVQVGEEVHRHAIRAGFAADG 228

Query: 1997 MEHYGCMVNMLGRAGLVDEAYEMAKTMPFDGGPRVWGALLYACSVHG-NVTIGEVAAERL 1821
                  +V+M  + G + +A ++   MP    P  W ++L A   HG  V    +  + L
Sbjct: 229  F-ILNALVDMYSKCGDIVKARKVFDKMP-HRDPVSWNSMLTAYVHHGLEVQAMNIFRQML 286

Query: 1820 FE-LEPDN--------------------------EHNFEL-----LMRIYRNAGRLEDVE 1737
             E  EPD+                           H + L     L+ +Y N GRLE   
Sbjct: 287  LEGCEPDSVSISTVLTGVSSLGLGVQIHGWVISQGHEWNLSIANSLIMMYSNHGRLEKAR 346

Query: 1736 TVRMMMRER 1710
             V  +M ER
Sbjct: 347  WVFNLMPER 355


>ref|NP_194257.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75265547|sp|Q9SB36.1|PP337_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At4g25270, chloroplastic; Flags: Precursor
            gi|4454015|emb|CAA23068.1| putative protein [Arabidopsis
            thaliana] gi|7269378|emb|CAB81338.1| putative protein
            [Arabidopsis thaliana] gi|332659633|gb|AEE85033.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 527

 Score =  303 bits (775), Expect = 2e-79
 Identities = 147/258 (56%), Positives = 190/258 (73%), Gaps = 1/258 (0%)
 Frame = -1

Query: 2468 YAHHGLAREAWDVCRGMLAAGLEPNSIAISTMLAKFSYNSKLGYEIHGWVLRQGMEWNLS 2289
            Y HHGL  EA D+ R M+  G+EP+ +AIS++LA+   + K G ++HGWV+R+GMEW LS
Sbjct: 271  YLHHGLLHEALDIFRLMVQNGIEPDKVAISSVLARV-LSFKHGRQLHGWVIRRGMEWELS 329

Query: 2288 IANALIAMYAEHKQLRCARILFESMPEKDLVTWNTMISVHRKDCRAINIFKQMEDSGVQP 2109
            +ANALI +Y++  QL  A  +F+ M E+D V+WN +IS H K+   +  F+QM  +  +P
Sbjct: 330  VANALIVLYSKRGQLGQACFIFDQMLERDTVSWNAIISAHSKNSNGLKYFEQMHRANAKP 389

Query: 2108 DRVTFVSLLSSCANLGMVDSGRRLFNKMKKKYRIAPGMEHYGCMVNMLGRAGLVDEAYEM 1929
            D +TFVS+LS CAN GMV+ G RLF+ M K+Y I P MEHY CMVN+ GRAG+++EAY M
Sbjct: 390  DGITFVSVLSLCANTGMVEDGERLFSLMSKEYGIDPKMEHYACMVNLYGRAGMMEEAYSM 449

Query: 1928 -AKTMPFDGGPRVWGALLYACSVHGNVTIGEVAAERLFELEPDNEHNFELLMRIYRNAGR 1752
              + M  + GP VWGALLYAC +HGN  IGEVAA+RLFELEPDNEHNFELL+RIY  A R
Sbjct: 450  IVQEMGLEAGPTVWGALLYACYLHGNTDIGEVAAQRLFELEPDNEHNFELLIRIYSKAKR 509

Query: 1751 LEDVETVRMMMRERGLDT 1698
             EDVE VR MM +RGL+T
Sbjct: 510  AEDVERVRQMMVDRGLET 527



 Score = 70.9 bits (172), Expect = 2e-09
 Identities = 71/268 (26%), Positives = 129/268 (48%), Gaps = 12/268 (4%)
 Frame = -1

Query: 2468 YAHHGLAREAWDVCRGMLAAGLEPNSIAISTMLAKFSY--NSKLGYEIHGWVLRQGMEWN 2295
            YA  G   +A  +   M   G++P+      +L       + ++G  IH  ++++G  ++
Sbjct: 170  YAELGQYEDAMALYFQMAEDGVKPDRFTFPRVLKACGGIGSVQIGEAIHRDLVKEGFGYD 229

Query: 2294 LSIANALIAMYAEHKQLRCARILFESMPEKDLVTWNTMIS---VHRKDCRAINIFKQMED 2124
            + + NAL+ MYA+   +  AR +F+ +P KD V+WN+M++    H     A++IF+ M  
Sbjct: 230  VYVLNALVVMYAKCGDIVKARNVFDMIPHKDYVSWNSMLTGYLHHGLLHEALDIFRLMVQ 289

Query: 2123 SGVQPDRVTFVSLLSSCANLGMVDSGRRLFNKMKKKYRIAPGMEHYGCMVNML----GRA 1956
            +G++PD+V   S+L   A +     GR+L       + I  GME    + N L     + 
Sbjct: 290  NGIEPDKVAISSVL---ARVLSFKHGRQLHG-----WVIRRGMEWELSVANALIVLYSKR 341

Query: 1955 GLVDEAYEMAKTMPFDGGPRVWGALLYACSVHGNVTIGEVAAERLF--ELEPDNEHNFEL 1782
            G + +A  +   M  +     W A++ A S + N   G    E++     +PD    F  
Sbjct: 342  GQLGQACFIFDQM-LERDTVSWNAIISAHSKNSN---GLKYFEQMHRANAKPDG-ITFVS 396

Query: 1781 LMRIYRNAGRLEDVETV-RMMMRERGLD 1701
            ++ +  N G +ED E +  +M +E G+D
Sbjct: 397  VLSLCANTGMVEDGERLFSLMSKEYGID 424


>ref|XP_002322407.1| predicted protein [Populus trichocarpa] gi|222869403|gb|EEF06534.1|
            predicted protein [Populus trichocarpa]
          Length = 442

 Score =  300 bits (769), Expect = 9e-79
 Identities = 146/256 (57%), Positives = 194/256 (75%), Gaps = 1/256 (0%)
 Frame = -1

Query: 2468 YAHHGLAREAWDVCRGMLAAGLEPNSIAISTMLAKFSYNSKLGYEIHGWVLRQGMEWNLS 2289
            Y  HGL  EA      M+  G+E +S+A+ST+LA  S + ++  +IHGW++R+GMEW+ S
Sbjct: 188  YIRHGLIAEALHTFHSMVHDGMELDSVAVSTILANVS-SFEVAVQIHGWIVRRGMEWDFS 246

Query: 2288 IANALIAMYAEHKQLRCARILFESMPEKDLVTWNTMISVHRKDCRAINIFKQMEDSGVQP 2109
            IAN+LIA+Y+  ++L  AR LF+ MP+KD+V+WN++IS H KD +A+  F+ ME  G  P
Sbjct: 247  IANSLIAVYSNGRKLDRARWLFDHMPKKDIVSWNSIISAHCKDLKALTYFELMERDGALP 306

Query: 2108 DRVTFVSLLSSCANLGMVDSGRRLFNKMKKKYRIAPGMEHYGCMVNMLGRAGLVDEAYEM 1929
            D++TFVSLLS+CA+LG+V  G RLF+ MK KY+I P MEHY CMVN+ GRAGL++EAY +
Sbjct: 307  DKITFVSLLSACAHLGLVKDGERLFSLMKAKYQINPIMEHYACMVNLYGRAGLINEAYAI 366

Query: 1928 AK-TMPFDGGPRVWGALLYACSVHGNVTIGEVAAERLFELEPDNEHNFELLMRIYRNAGR 1752
             +  M F+ GP VWGALLY+C +H NV  GE+AA+ LF+LEPDNEHNFELLM+IY NAGR
Sbjct: 367  IRDQMEFEAGPTVWGALLYSCYLHRNVDTGEIAAQYLFDLEPDNEHNFELLMKIYDNAGR 426

Query: 1751 LEDVETVRMMMRERGL 1704
            LED E VR MM +RGL
Sbjct: 427  LEDAERVRKMMVDRGL 442

