BLASTX nr result
ID: Dioscorea21_contig00001841
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00001841 (2468 letters)

Database: ./nr
23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

Sequences producing significant alignments:  Score (bits)  E Value

ref|XP_002263650.1| PREDICTED: pentatricopeptide repeat-containi...  309  3e-81
ref|XP_002867617.1| hypothetical protein ARALYDRAFT_354257 [Arab...  304  7e-80
ref|XP_003533519.1| PREDICTED: pentatricopeptide repeat-containi...  304  8e-80
ref|NP_194257.1| pentatricopeptide repeat-containing protein [Ar...  303  2e-79
ref|XP_002322407.1| predicted protein [Populus trichocarpa] gi|2...  300  9e-79

>ref|XP_002263650.1| PREDICTED: pentatricopeptide repeat-containing protein At4g25270, chloroplastic [Vitis vinifera]
gi|296084180|emb|CBI24568.3| unnamed protein product [Vitis vinifera]
Length = 516

Score = 309 bits (791), Expect = 3e-81
Identities = 152/258 (58%), Positives = 194/258 (75%), Gaps = 1/258 (0%)
Frame = -1

Query: 2468 YAHHGLAREAWDVCRGMLAAGLEPNSIAISTMLAKFSYNSKLGYEIHGWVLRQGMEWNLS 2289
Y HGL +A + R ML G EP+++AIST++ + KL +IHGWVLR+G++WNLS
Sbjct: 258 YIRHGLPLQALSIFRRMLQYGFEPDAVAISTVVTGVP-SLKLAGQIHGWVLRRGVQWNLS 316

Query: 2288 IANALIAMYAEHKQLRCARILFESMPEKDLVTWNTMISVHRKDCRAINIFKQMEDSGVQP 2109
IAN+LI +Y+ H +L A LF+ MPE+D+V+WN++IS HRKD +AI F +M+ + V P
Sbjct: 317 IANSLIVLYSNHGKLDQACWLFDHMPERDVVSWNSIISAHRKDLKAITYFSRMQKADVLP 376

Query: 2108 DRVTFVSLLSSCANLGMVDSGRRLFNKMKKKYRIAPGMEHYGCMVNMLGRAGLVDEAYE- 1932
D VTFVSLLS+CA+LG+V G LF+ M++ Y + P MEHY CMVN+ GRAGL++EAYE
Sbjct: 377 DVVTFVSLLSACAHLGLVKDGEGLFSMMREDYGMIPSMEHYACMVNLYGRAGLIEEAYEI 436

Query: 1931 MAKTMPFDGGPRVWGALLYACSVHGNVTIGEVAAERLFELEPDNEHNFELLMRIYRNAGR 1752
+ K M F+ GP VWGALLYAC H NV IG++AAE LFELEPDNEHNFELLM IYRN GR
Sbjct: 437 IEKRMEFEAGPTVWGALLYACYFHHNVDIGKIAAECLFELEPDNEHNFELLMNIYRNVGR 496

Query: 1751 LEDVETVRMMMRERGLDT 1698
LEDVE VR MM +RG D+
Sbjct: 497 LEDVEKVRKMMADRGFDS 514

>ref|XP_002867617.1| hypothetical protein ARALYDRAFT_354257 [Arabidopsis lyrata subsp. lyrata]
gi|297313453|gb|EFH43876.1| hypothetical protein ARALYDRAFT_354257 [Arabidopsis lyrata subsp. lyrata]
Length = 758

Score = 304 bits (779), Expect = 7e-80
Identities = 146/258 (56%), Positives = 191/258 (74%), Gaps = 1/258 (0%)
Frame = -1

Query: 2468 YAHHGLAREAWDVCRGMLAAGLEPNSIAISTMLAKFSYNSKLGYEIHGWVLRQGMEWNLS 2289
Y HHGL EA D+ R M+ G++P+ +AIS++LA+ + K G ++HGWV+R+GMEW LS
Sbjct: 502 YLHHGLLHEALDIFRLMVQNGIDPDKVAISSVLARV-LSFKHGRQLHGWVIRRGMEWELS 560

Query: 2288 IANALIAMYAEHKQLRCARILFESMPEKDLVTWNTMISVHRKDCRAINIFKQMEDSGVQP 2109
+ANALI +Y++ QL A +F+ M E+D V+WN +IS H +D F+QM+ + +P
Sbjct: 561 VANALIVLYSKRGQLGQACFIFDQMLERDTVSWNAIISAHSRDSNGFKYFEQMQHADAKP 620

Query: 2108 DRVTFVSLLSSCANLGMVDSGRRLFNKMKKKYRIAPGMEHYGCMVNMLGRAGLVDEAYEM 1929
D +TFVS+LS CAN GMV+ G RLF+ M K+Y I P MEHY CMVN+ GRAG+++EAY M
Sbjct: 621 DGITFVSVLSLCANTGMVEDGERLFSLMSKEYGINPKMEHYACMVNLYGRAGMMEEAYSM 680

Query: 1928 -AKTMPFDGGPRVWGALLYACSVHGNVTIGEVAAERLFELEPDNEHNFELLMRIYRNAGR 1752
+ M F+ GP VWGALLYAC +HGN IGEV+A+RLFELEPDNEHNFELLMRIY A R
Sbjct: 681 IVQEMEFEAGPTVWGALLYACYLHGNTDIGEVSAQRLFELEPDNEHNFELLMRIYSKAKR 740

Query: 1751 LEDVETVRMMMRERGLDT 1698
EDVE VR M+ +RGL+T
Sbjct: 741 AEDVERVRQMLVDRGLET 758

Score = 66.6 bits (161), Expect = 3e-08
Identities = 69/268 (25%), Positives = 127/268 (47%), Gaps = 12/268 (4%)
Frame = -1

Query: 2468 YAHHGLAREAWDVCRGMLAAGLEPNSIAISTMLAKFSY--NSKLGYEIHGWVLRQGMEWN 2295
YA G +A + M G++P+ +L + ++G IH +++ G ++
Sbjct: 401 YAELGQYEDAMALYFQMAEDGVKPDRFTFPRVLKACGGIGSVQIGEAIHRDLVKAGFGYD 460

Query: 2294 LSIANALIAMYAEHKQLRCARILFESMPEKDLVTWNTMIS---VHRKDCRAINIFKQMED 2124
+ + NAL+ MYA+ + AR +F+ +P KD V+WN+M++ H A++IF+ M
Sbjct: 461 VHVLNALVDMYAKCGDIVKARNVFDMIPNKDYVSWNSMLTGYLHHGLLHEALDIFRLMVQ 520

Query: 2123 SGVQPDRVTFVSLLSSCANLGMVDSGRRLFNKMKKKYRIAPGMEHYGCMVNML----GRA 1956
+G+ PD+V S+L A + GR+L + I GME + N L +
Sbjct: 521 NGIDPDKVAISSVL---ARVLSFKHGRQLHG-----WVIRRGMEWELSVANALIVLYSKR 572

Query: 1955 GLVDEAYEMAKTMPFDGGPRVWGALLYACSVHGNVTIGEVAAERL--FELEPDNEHNFEL 1782
G + +A + M + W A++ S H + G E++ + +PD F
Sbjct: 573 GQLGQACFIFDQM-LERDTVSWNAII---SAHSRDSNGFKYFEQMQHADAKPDG-ITFVS 627

Query: 1781 LMRIYRNAGRLEDVETV-RMMMRERGLD 1701
++ + N G +ED E + +M +E G++
Sbjct: 628 VLSLCANTGMVEDGERLFSLMSKEYGIN 655

>ref|XP_003533519.1| PREDICTED: pentatricopeptide repeat-containing protein At4g25270, chloroplastic-like [Glycine max]
Length = 526

Score = 304 bits (778), Expect = 8e-80
Identities = 153/262 (58%), Positives = 194/262 (74%), Gaps = 6/262 (2%)
Frame = -1

Query: 2468 YAHHGLAREAWDVCRGMLAAGLEPNSIAISTMLAKFSYNSKLGYEIHGWVLRQGMEWNLS 2289
Y HHGL +A ++ R ML G EP+S++IST+L S + LG +IHGWV+ QG EWNLS
Sbjct: 269 YVHHGLEVQAMNIFRQMLLEGCEPDSVSISTVLTGVS-SLGLGVQIHGWVISQGHEWNLS 327

Query: 2288 IANALIAMYAEHKQLRCARILFESMPEKDLVTWNTMISVHRKDCRAINIFKQMEDSGVQP 2109
IAN+LI MY+ H +L AR +F MPE+D+V+WN++IS H K A+ F+QME +GVQP
Sbjct: 328 IANSLIMMYSNHGRLEKARWVFNLMPERDVVSWNSIISAHCKRREALAFFEQMEGAGVQP 387

Query: 2108 DRVTFVSLLSSCANLGMVDSGRRLFNKMKKKYRIAPGMEHYGCMVNMLGRAGLVDEAYEM 1929
D++TFVS+LS+CA LG++ G RLF M KY+I P MEHYGCMVN+ GRAGL+ +AY +
Sbjct: 388 DKITFVSILSACAYLGLLKDGERLFALMCGKYKIKPIMEHYGCMVNLYGRAGLIKKAYSI 447

Query: 1928 AKTMPFDG------GPRVWGALLYACSVHGNVTIGEVAAERLFELEPDNEHNFELLMRIY 1767
DG GP +WGALLYAC +HG+ TIGE+AA LF+LEPDNEHNF LLMRIY
Sbjct: 448 I----VDGIGTEAAGPTLWGALLYACFMHGDATIGEIAANWLFDLEPDNEHNFVLLMRIY 503

Query: 1766 RNAGRLEDVETVRMMMRERGLD 1701
NAGRLED+E VRMM+ +RGLD
Sbjct: 504 ENAGRLEDMERVRMMLVDRGLD 525

Score = 69.3 bits (168), Expect = 5e-09
Identities = 44/142 (30%), Positives = 79/142 (55%), Gaps = 5/142 (3%)
Frame = -1

Query: 2468 YAHHGLAREAWDVCRGMLAAGLEPNSIAISTMLAKFSY--NSKLGYEIHGWVLRQGMEWN 2295
YA G EA + M+ G+E + +L + + ++G E+H +R G +
Sbjct: 168 YAQVGHYDEAIALYFQMVEEGVEADLFTFPRVLKVCAGIGSVQVGEEVHRHAIRAGFAAD 227

Query: 2294 LSIANALIAMYAEHKQLRCARILFESMPEKDLVTWNTMISV---HRKDCRAINIFKQMED 2124
I NAL+ MY++ + AR +F+ MP +D V+WN+M++ H + +A+NIF+QM
Sbjct: 228 GFILNALVDMYSKCGDIVKARKVFDKMPHRDPVSWNSMLTAYVHHGLEVQAMNIFRQMLL 287

Query: 2123 SGVQPDRVTFVSLLSSCANLGM 2058
G +PD V+ ++L+ ++LG+
Sbjct: 288 EGCEPDSVSISTVLTGVSSLGL 309

Score = 61.2 bits (147), Expect = 1e-06
Identities = 65/249 (26%), Positives = 104/249 (41%), Gaps = 38/249 (15%)
Frame = -1

Query: 2342 GYEIHGWVLRQGMEWNLSIANALIAMYAEHKQLRCARILFESMPEKDLVT--WNTMISVH 2169
G +H + + N+ I++ L+ +YA L A LF+ M ++D WN++IS +
Sbjct: 109 GIRVHRLIPTSLLHKNVGISSKLLRLYASCGYLDDAHDLFDQMAKRDTSAFPWNSLISGY 168

Query: 2168 RKDCR---AINIFKQMEDSGVQPDRVTFVSLLSSCANLGMVDSGRRLFNKMKKKYRIAPG 1998
+ AI ++ QM + GV+ D TF +L CA +G V G + + A G
Sbjct: 169 AQVGHYDEAIALYFQMVEEGVEADLFTFPRVLKVCAGIGSVQVGEEVHRHAIRAGFAADG 228

Query: 1997 MEHYGCMVNMLGRAGLVDEAYEMAKTMPFDGGPRVWGALLYACSVHG-NVTIGEVAAERL 1821
+V+M + G + +A ++ MP P W ++L A HG V + + L
Sbjct: 229 F-ILNALVDMYSKCGDIVKARKVFDKMP-HRDPVSWNSMLTAYVHHGLEVQAMNIFRQML 286

Query: 1820 FE-LEPDN--------------------------EHNFEL-----LMRIYRNAGRLEDVE 1737
E EPD+ H + L L+ +Y N GRLE
Sbjct: 287 LEGCEPDSVSISTVLTGVSSLGLGVQIHGWVISQGHEWNLSIANSLIMMYSNHGRLEKAR 346

Query: 1736 TVRMMMRER 1710
V +M ER
Sbjct: 347 WVFNLMPER 355

>ref|NP_194257.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
gi|75265547|sp|Q9SB36.1|PP337_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g25270, chloroplastic; Flags: Precursor
gi|4454015|emb|CAA23068.1| putative protein [Arabidopsis thaliana]
gi|7269378|emb|CAB81338.1| putative protein [Arabidopsis thaliana]
gi|332659633|gb|AEE85033.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
Length = 527

Score = 303 bits (775), Expect = 2e-79
Identities = 147/258 (56%), Positives = 190/258 (73%), Gaps = 1/258 (0%)
Frame = -1

Query: 2468 YAHHGLAREAWDVCRGMLAAGLEPNSIAISTMLAKFSYNSKLGYEIHGWVLRQGMEWNLS 2289
Y HHGL EA D+ R M+ G+EP+ +AIS++LA+ + K G ++HGWV+R+GMEW LS
Sbjct: 271 YLHHGLLHEALDIFRLMVQNGIEPDKVAISSVLARV-LSFKHGRQLHGWVIRRGMEWELS 329

Query: 2288 IANALIAMYAEHKQLRCARILFESMPEKDLVTWNTMISVHRKDCRAINIFKQMEDSGVQP 2109
+ANALI +Y++ QL A +F+ M E+D V+WN +IS H K+ + F+QM + +P
Sbjct: 330 VANALIVLYSKRGQLGQACFIFDQMLERDTVSWNAIISAHSKNSNGLKYFEQMHRANAKP 389

Query: 2108 DRVTFVSLLSSCANLGMVDSGRRLFNKMKKKYRIAPGMEHYGCMVNMLGRAGLVDEAYEM 1929
D +TFVS+LS CAN GMV+ G RLF+ M K+Y I P MEHY CMVN+ GRAG+++EAY M
Sbjct: 390 DGITFVSVLSLCANTGMVEDGERLFSLMSKEYGIDPKMEHYACMVNLYGRAGMMEEAYSM 449

Query: 1928 -AKTMPFDGGPRVWGALLYACSVHGNVTIGEVAAERLFELEPDNEHNFELLMRIYRNAGR 1752
+ M + GP VWGALLYAC +HGN IGEVAA+RLFELEPDNEHNFELL+RIY A R
Sbjct: 450 IVQEMGLEAGPTVWGALLYACYLHGNTDIGEVAAQRLFELEPDNEHNFELLIRIYSKAKR 509

Query: 1751 LEDVETVRMMMRERGLDT 1698
EDVE VR MM +RGL+T
Sbjct: 510 AEDVERVRQMMVDRGLET 527

Score = 70.9 bits (172), Expect = 2e-09
Identities = 71/268 (26%), Positives = 129/268 (48%), Gaps = 12/268 (4%)
Frame = -1

Query: 2468 YAHHGLAREAWDVCRGMLAAGLEPNSIAISTMLAKFSY--NSKLGYEIHGWVLRQGMEWN 2295
YA G +A + M G++P+ +L + ++G IH ++++G ++
Sbjct: 170 YAELGQYEDAMALYFQMAEDGVKPDRFTFPRVLKACGGIGSVQIGEAIHRDLVKEGFGYD 229

Query: 2294 LSIANALIAMYAEHKQLRCARILFESMPEKDLVTWNTMIS---VHRKDCRAINIFKQMED 2124
+ + NAL+ MYA+ + AR +F+ +P KD V+WN+M++ H A++IF+ M
Sbjct: 230 VYVLNALVVMYAKCGDIVKARNVFDMIPHKDYVSWNSMLTGYLHHGLLHEALDIFRLMVQ 289

Query: 2123 SGVQPDRVTFVSLLSSCANLGMVDSGRRLFNKMKKKYRIAPGMEHYGCMVNML----GRA 1956
+G++PD+V S+L A + GR+L + I GME + N L +
Sbjct: 290 NGIEPDKVAISSVL---ARVLSFKHGRQLHG-----WVIRRGMEWELSVANALIVLYSKR 341

Query: 1955 GLVDEAYEMAKTMPFDGGPRVWGALLYACSVHGNVTIGEVAAERLF--ELEPDNEHNFEL 1782
G + +A + M + W A++ A S + N G E++ +PD F
Sbjct: 342 GQLGQACFIFDQM-LERDTVSWNAIISAHSKNSN---GLKYFEQMHRANAKPDG-ITFVS 396

Query: 1781 LMRIYRNAGRLEDVETV-RMMMRERGLD 1701
++ + N G +ED E + +M +E G+D
Sbjct: 397 VLSLCANTGMVEDGERLFSLMSKEYGID 424

>ref|XP_002322407.1| predicted protein [Populus trichocarpa]
gi|222869403|gb|EEF06534.1| predicted protein [Populus trichocarpa]
Length = 442

Score = 300 bits (769), Expect = 9e-79
Identities = 146/256 (57%), Positives = 194/256 (75%), Gaps = 1/256 (0%)
Frame = -1

Query: 2468 YAHHGLAREAWDVCRGMLAAGLEPNSIAISTMLAKFSYNSKLGYEIHGWVLRQGMEWNLS 2289
Y HGL EA M+ G+E +S+A+ST+LA S + ++ +IHGW++R+GMEW+ S
Sbjct: 188 YIRHGLIAEALHTFHSMVHDGMELDSVAVSTILANVS-SFEVAVQIHGWIVRRGMEWDFS 246

Query: 2288 IANALIAMYAEHKQLRCARILFESMPEKDLVTWNTMISVHRKDCRAINIFKQMEDSGVQP 2109
IAN+LIA+Y+ ++L AR LF+ MP+KD+V+WN++IS H KD +A+ F+ ME G P
Sbjct: 247 IANSLIAVYSNGRKLDRARWLFDHMPKKDIVSWNSIISAHCKDLKALTYFELMERDGALP 306

Query: 2108 DRVTFVSLLSSCANLGMVDSGRRLFNKMKKKYRIAPGMEHYGCMVNMLGRAGLVDEAYEM 1929
D++TFVSLLS+CA+LG+V G RLF+ MK KY+I P MEHY CMVN+ GRAGL++EAY +
Sbjct: 307 DKITFVSLLSACAHLGLVKDGERLFSLMKAKYQINPIMEHYACMVNLYGRAGLINEAYAI 366

Query: 1928 AK-TMPFDGGPRVWGALLYACSVHGNVTIGEVAAERLFELEPDNEHNFELLMRIYRNAGR 1752
+ M F+ GP VWGALLY+C +H NV GE+AA+ LF+LEPDNEHNFELLM+IY NAGR
Sbjct: 367 IRDQMEFEAGPTVWGALLYSCYLHRNVDTGEIAAQYLFDLEPDNEHNFELLMKIYDNAGR
 426

Query: 1751 LEDVETVRMMMRERGL 1704
LED E VR MM +RGL
Sbjct: 427 LEDAERVRKMMVDRGL 442
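
The summary rows and the per-alignment statistics in this report follow a regular text pattern, so they can be extracted with a couple of regular expressions. What follows is only a minimal sketch in Python, assuming the legacy BLAST 2.2.x plain-text layout seen in this report. The names `parse_hit_row` and `parse_hsp_stats` are hypothetical helpers written for illustration and are not part of BLAST or of any library.

```python
import re

# Hypothetical pattern for one row of the "Sequences producing significant
# alignments" table: db|accession| description  bits  evalue
HIT_ROW = re.compile(
    r"^(?P<db>\w+)\|(?P<accession>[^|]+)\|\s*(?P<description>.*?)\s+"
    r"(?P<bits>\d+(?:\.\d+)?)\s+"
    r"(?P<evalue>\d*\.?\d*e[-+]?\d+|\d+(?:\.\d+)?)\s*$"
)

# Hypothetical pattern for the "Identities = a/b (p%), Positives = c/d (q%)" line.
HSP_STATS = re.compile(
    r"Identities\s*=\s*(\d+)/(\d+)\s*\(\d+%\),\s*"
    r"Positives\s*=\s*(\d+)/(\d+)"
)


def _to_float(evalue):
    # Legacy BLAST may print E-values such as "e-100" with no leading digit.
    return float("1" + evalue if evalue.startswith("e") else evalue)


def parse_hit_row(line):
    """Return accession, description, bit score and E-value, or None."""
    match = HIT_ROW.match(line.strip())
    if match is None:
        return None
    return {
        "accession": match.group("accession"),
        "description": match.group("description"),
        "bits": float(match.group("bits")),
        "evalue": _to_float(match.group("evalue")),
    }


def parse_hsp_stats(text):
    """Return identity and positive counts plus the alignment length, or None."""
    match = HSP_STATS.search(text)
    if match is None:
        return None
    identities, length, positives, _ = (int(g) for g in match.groups())
    return {
        "identities": identities,
        "positives": positives,
        "length": length,
        "identity_fraction": identities / length,
    }


if __name__ == "__main__":
    row = "ref|XP_002263650.1| PREDICTED: pentatricopeptide repeat-containi... 309 3e-81"
    print(parse_hit_row(row))
    stats = "Identities = 152/258 (58%), Positives = 194/258 (75%), Gaps = 1/258 (0%)"
    print(parse_hsp_stats(stats))
```

The identity fraction is computed from the raw counts rather than from the printed percentage, because the report shows that percentage only as a whole number.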