BLASTX nr result
ID: Dioscorea21_contig00013597
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Dioscorea21_contig00013597 (1326 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002320514.1| predicted protein [Populus trichocarpa] gi|2... 506 e-141 ref|XP_002530223.1| pentatricopeptide repeat-containing protein,... 498 e-138 ref|XP_002269867.1| PREDICTED: pentatricopeptide repeat-containi... 484 e-134 ref|XP_002869597.1| pentatricopeptide repeat-containing protein ... 474 e-131 ref|NP_194398.1| pentatricopeptide repeat-containing protein [Ar... 469 e-130 >ref|XP_002320514.1| predicted protein [Populus trichocarpa] gi|222861287|gb|EEE98829.1| predicted protein [Populus trichocarpa] Length = 478 Score = 506 bits (1302), Expect = e-141 Identities = 248/431 (57%), Positives = 317/431 (73%) Frame = +3 Query: 6 NHLLLKVHRDPILSVELYNRILIQNPSSQSLDTHAIILHILTKSRKFISAESLLRKTLIP 185 NH+LLK+ +D +LS+E +N + NP S +L+TH+IILHILTK KF SA+S+LR L Sbjct: 46 NHILLKIQKDHVLSLEFFNSLKTLNPISLTLETHSIILHILTKKSKFKSAQSILRTLLAS 105 Query: 186 QTLTSSSDLFDAILHSYRLCDSTPNVFDSLFKTYAHLKKFRNATETFHRMRDHGFLPTVR 365 +++ LFD +L SYR+CDS+P VFDSLFKTYAH+ KFRNAT+ F RM+D+GFLPTV Sbjct: 106 RSIDLPGKLFDTLLFSYRMCDSSPRVFDSLFKTYAHMNKFRNATDVFSRMKDYGFLPTVE 165 Query: 366 SCNAFLSSLLNHGRNDIVVSFYREMKRCRISPNVFTLNMVLCALCGLGRLEKAMDELEKM 545 SCNA+LSSLL+ R DI ++FYREM+RCRISPN +T N+VL ALC G+LEKA++ L +M Sbjct: 166 SCNAYLSSLLDFHRVDIALTFYREMRRCRISPNSYTFNLVLSALCKSGKLEKAVEVLREM 225 Query: 546 EIMGISPNVASFNTMIAGYCXXXXXXXXXXXXXXXXXXXXEPNVITFNTLIHQFCMEGKM 725 E +GI+PNV S+NT+IAG+C EPNV+TFN+LIH FC EGK+ Sbjct: 226 ESVGITPNVVSYNTLIAGHCNKGLLSIATKLKNLMGKNGLEPNVVTFNSLIHGFCKEGKL 285 Query: 726 HEANRIFREMKVAEVSPNTVTFNTLIAGYCRENNGEMGFRVYEEMVKNGVEVDILTYNSL 905 HEANR F EMKV V+PNTVT+NTLI GY + N M +VYEEM++NGV+ DILTYN+L Sbjct: 286 HEANRFFSEMKVMNVTPNTVTYNTLINGYGQVGNSNMAGKVYEEMMRNGVKADILTYNAL 345 Query: 906 ILGLCNEGKTRKAAHLVRELDRGNLVPNASTYFALITGQCKKQNSERALEIYKVMKMSGC 1085 ILGLC EGKT+KAA LV+ELD+ NLVPNASTY ALI+GQC ++NS+RA ++YK M SGC Sbjct: 346 ILGLCKEGKTKKAAFLVKELDKENLVPNASTYSALISGQCARKNSDRAFQLYKSMVRSGC 405 Query: 1086 HPNAETFDLLISTFVKNKDFEGAAEVLKEMLERWMVPGKVLLTELFDGLHLSGKRHLMDE 1265 HPN +TF +L S FVKN+DFEGA VL +M R M L E++DGL GK +L + Sbjct: 406 HPNEQTFKMLTSAFVKNEDFEGAFNVLMDMFARSMASDSNTLLEIYDGLCQCGKENLAMK 465 Query: 1266 LLSDVNGGRFV 1298 L ++ R + Sbjct: 466 LCHEMEARRLI 476 >ref|XP_002530223.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223530270|gb|EEF32170.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 517 Score = 498 bits (1281), Expect = e-138 Identities = 241/432 (55%), Positives = 322/432 (74%) Frame = +3 Query: 9 HLLLKVHRDPILSVELYNRILIQNPSSQSLDTHAIILHILTKSRKFISAESLLRKTLIPQ 188 H+LLK+ +D +LS+E +N + +NPSS +L+TH++ILHILTK+RKF SAE +L+ L+ Sbjct: 64 HILLKIQKDHVLSLEFFNWVQTENPSSHTLETHSMILHILTKNRKFKSAELILKSVLVKG 123 Query: 189 TLTSSSDLFDAILHSYRLCDSTPNVFDSLFKTYAHLKKFRNATETFHRMRDHGFLPTVRS 368 + LF+AIL+SYR+CDS+P VFDSLFKT AH+KKFRNAT+TF +M+ +GFLPTV S Sbjct: 124 FIDLPDKLFEAILYSYRMCDSSPRVFDSLFKTLAHMKKFRNATDTFLQMKGYGFLPTVES 183 Query: 369 CNAFLSSLLNHGRNDIVVSFYREMKRCRISPNVFTLNMVLCALCGLGRLEKAMDELEKME 548 CNA+LSSLL+ R DI ++FY+EM+RCRISPNV+T NMV+ A C G+LEKA+ E+ME Sbjct: 184 CNAYLSSLLDLHRVDIALAFYKEMRRCRISPNVYTRNMVMRAFCKSGKLEKAVQVFEEME 243 Query: 549 IMGISPNVASFNTMIAGYCXXXXXXXXXXXXXXXXXXXXEPNVITFNTLIHQFCMEGKMH 728 +GISPN S+NT+I GYC E NV+TFN+LI FC EGK+H Sbjct: 244 SVGISPNDTSYNTLIMGYCRKGLLNSAVKLKNSMRAKGVEANVVTFNSLIDGFCKEGKLH 303 Query: 729 EANRIFREMKVAEVSPNTVTFNTLIAGYCRENNGEMGFRVYEEMVKNGVEVDILTYNSLI 908 EA+++F EMKV V+PNT+T+NTLI G+ + N EMG R+YEEM +NGV+ DILTYN+LI Sbjct: 304 EASKVFSEMKVLNVAPNTITYNTLINGHSQMGNSEMGRRLYEEMSRNGVKADILTYNALI 363 Query: 909 LGLCNEGKTRKAAHLVRELDRGNLVPNASTYFALITGQCKKQNSERALEIYKVMKMSGCH 1088 LGLC EGKT+KAA++V+ELD+ NLVPNAST+ ALI+GQC + NS+RA ++YK M GCH Sbjct: 364 LGLCKEGKTKKAAYMVKELDKENLVPNASTFSALISGQCIRNNSDRAFQLYKSMVRIGCH 423 Query: 1089 PNAETFDLLISTFVKNKDFEGAAEVLKEMLERWMVPGKVLLTELFDGLHLSGKRHLMDEL 1268 PN +TF++L+S F KN+DFEGA VL EM ER PG +L+E++ GL GK HL +L Sbjct: 424 PNEQTFNMLVSAFCKNEDFEGAFLVLMEMFERCFTPGSDVLSEIYHGLCCCGKEHLAMKL 483 Query: 1269 LSDVNGGRFVSR 1304 S++ +++ Sbjct: 484 SSELKARHMMTK 495 >ref|XP_002269867.1| PREDICTED: pentatricopeptide repeat-containing protein At4g26680, mitochondrial-like [Vitis vinifera] Length = 616 Score = 484 bits (1246), Expect = e-134 Identities = 236/424 (55%), Positives = 317/424 (74%) Frame = +3 Query: 9 HLLLKVHRDPILSVELYNRILIQNPSSQSLDTHAIILHILTKSRKFISAESLLRKTLIPQ 188 H++LK+ +D +LS E +N + QNP+ Q+L+T++IILHILTK+ KF SAES+L+ L Sbjct: 162 HIMLKIKKDHVLSFEFFNWVKAQNPNCQTLETYSIILHILTKNHKFKSAESVLKGILGSG 221 Query: 189 TLTSSSDLFDAILHSYRLCDSTPNVFDSLFKTYAHLKKFRNATETFHRMRDHGFLPTVRS 368 ++ S LF+AIL+SYR+CDS+P VFDSLFKTYA +KK RNA + F +M+D+GFLP V S Sbjct: 222 SIDHPSKLFEAILYSYRICDSSPCVFDSLFKTYAQMKKLRNAIDVFCQMKDYGFLPRVES 281 Query: 369 CNAFLSSLLNHGRNDIVVSFYREMKRCRISPNVFTLNMVLCALCGLGRLEKAMDELEKME 548 CNA++S+ ++ R DI ++FYREM+R RISPNV+TLNMV+CA C G+LEKA++ ++ME Sbjct: 282 CNAYISASISLQRGDIALTFYREMQRYRISPNVYTLNMVMCAFCKWGKLEKAIEVFKRME 341 Query: 549 IMGISPNVASFNTMIAGYCXXXXXXXXXXXXXXXXXXXXEPNVITFNTLIHQFCMEGKMH 728 MG SP + S+NT+IAGYC P+ +TFNTLI+ FC GK+H Sbjct: 342 TMGFSPTITSYNTLIAGYCNKGLLNSGMKLKILMEKNGVRPDDVTFNTLINGFCRGGKLH 401 Query: 729 EANRIFREMKVAEVSPNTVTFNTLIAGYCRENNGEMGFRVYEEMVKNGVEVDILTYNSLI 908 EAN+IF EMK +V PNT+T+NTLI GY + N EMG R+++EM++NG++ DILTYN+LI Sbjct: 402 EANKIFSEMKANDVVPNTITYNTLINGYSQVGNSEMGGRLHDEMLRNGIKADILTYNALI 461 Query: 909 LGLCNEGKTRKAAHLVRELDRGNLVPNASTYFALITGQCKKQNSERALEIYKVMKMSGCH 1088 LGLC EG+T+KAA+LV+ELDR NLVPN+ST+ ALITGQC ++NSERA ++YK M SGCH Sbjct: 462 LGLCMEGRTKKAAYLVKELDRENLVPNSSTFSALITGQCVRKNSERAFQLYKSMIRSGCH 521 Query: 1089 PNAETFDLLISTFVKNKDFEGAAEVLKEMLERWMVPGKVLLTELFDGLHLSGKRHLMDEL 1268 PN TF +LISTF KN+DF+GA EV++EM ER + P L+EL GL LSGK L +L Sbjct: 522 PNYHTFKMLISTFCKNEDFDGAVEVVREMSERSIAPDSDTLSELCRGLWLSGKEELALKL 581 Query: 1269 LSDV 1280 ++ Sbjct: 582 CKEM 585 Score = 90.9 bits (224), Expect = 7e-16 Identities = 71/310 (22%), Positives = 123/310 (39%), Gaps = 37/310 (11%) Frame = +3 Query: 225 LHSYRLCDSTPNVFDSLFKTYAHLK--KFRNATETFHRMRDHGFLPTVRSCNAFLSSLLN 398 + YR+ +PNV+ A K K A E F RM GF PT+ S N ++ N Sbjct: 305 MQRYRI---SPNVYTLNMVMCAFCKWGKLEKAIEVFKRMETMGFSPTITSYNTLIAGYCN 361 Query: 399 HGRNDIVVSFYREMKRCRISPNVFTLNMVLCALCGLGRLEKAMDELEKMEIMGISPNVAS 578 G + + M++ + P+ T N ++ C G+L +A +M+ + PN + Sbjct: 362 KGLLNSGMKLKILMEKNGVRPDDVTFNTLINGFCRGGKLHEANKIFSEMKANDVVPNTIT 421 Query: 579 FNTMIAGYCXXXXXXXXXXXXXXXXXXXXEPNVITFNTLIHQFCMEGKMHEANRIFREMK 758 +NT+I GY + +++T+N LI CMEG+ +A + +E+ Sbjct: 422 YNTLINGYSQVGNSEMGGRLHDEMLRNGIKADILTYNALILGLCMEGRTKKAAYLVKELD 481 Query: 759 VAEVSPNTVTFNTLI-----------------------------------AGYCRENNGE 833 + PN+ TF+ LI + +C+ + + Sbjct: 482 RENLVPNSSTFSALITGQCVRKNSERAFQLYKSMIRSGCHPNYHTFKMLISTFCKNEDFD 541 Query: 834 MGFRVYEEMVKNGVEVDILTYNSLILGLCNEGKTRKAAHLVRELDRGNLVPNASTYFALI 1013 V EM + + D T + L GL GK A L +E++ +L+P I Sbjct: 542 GAVEVVREMSERSIAPDSDTLSELCRGLWLSGKEELALKLCKEMEMKHLMPEGFDKSKTI 601 Query: 1014 TGQCKKQNSE 1043 + + + E Sbjct: 602 NFRAESEEKE 611 >ref|XP_002869597.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297315433|gb|EFH45856.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 538 Score = 474 bits (1220), Expect = e-131 Identities = 231/434 (53%), Positives = 314/434 (72%) Frame = +3 Query: 9 HLLLKVHRDPILSVELYNRILIQNPSSQSLDTHAIILHILTKSRKFISAESLLRKTLIPQ 188 ++LLK+ +D +LS E +N +NP+S SL+THAI+LH LTK+RKF SAES+LR L+ Sbjct: 86 NVLLKIQKDYLLSFEFFNWAKTRNPASHSLETHAIVLHTLTKNRKFKSAESILRDVLVNG 145 Query: 189 TLTSSSDLFDAILHSYRLCDSTPNVFDSLFKTYAHLKKFRNATETFHRMRDHGFLPTVRS 368 + + +FDA+L+SYR CDSTP VFDSLFKT+AHLKKFRNAT+TF +M+D+GFLPTV S Sbjct: 146 GVDLPAKVFDALLYSYRECDSTPRVFDSLFKTFAHLKKFRNATDTFMQMKDYGFLPTVES 205 Query: 369 CNAFLSSLLNHGRNDIVVSFYREMKRCRISPNVFTLNMVLCALCGLGRLEKAMDELEKME 548 CNA++SSLL GR DI + FYREM+RC+ISPN +TLNMV+ C G+L+K ++ L+ ME Sbjct: 206 CNAYMSSLLGQGRVDIALRFYREMRRCKISPNTYTLNMVMSGYCRSGKLDKGIELLQDME 265 Query: 549 IMGISPNVASFNTMIAGYCXXXXXXXXXXXXXXXXXXXXEPNVITFNTLIHQFCMEGKMH 728 +G S+NT+IAG+C +PNV+TFNTLIH FC K+ Sbjct: 266 RLGFRATHVSYNTLIAGHCEKGLLSSALKLKNMMGKNGLQPNVVTFNTLIHGFCRAVKLQ 325 Query: 729 EANRIFREMKVAEVSPNTVTFNTLIAGYCRENNGEMGFRVYEEMVKNGVEVDILTYNSLI 908 EA+++F EMK + PNTVT+NTLI GY ++ + EM FR YE+MV NG++ DILTYN+LI Sbjct: 326 EASKVFGEMKALNLPPNTVTYNTLINGYSQQGDHEMAFRFYEDMVCNGIQRDILTYNTLI 385 Query: 909 LGLCNEGKTRKAAHLVRELDRGNLVPNASTYFALITGQCKKQNSERALEIYKVMKMSGCH 1088 LGLC + KTRKAA V+ELD+ NLVPN+ST+ ALI GQC ++N++R E+YK M SGCH Sbjct: 386 LGLCKQAKTRKAAQFVKELDKENLVPNSSTFSALIMGQCVRRNADRGFELYKSMIRSGCH 445 Query: 1089 PNAETFDLLISTFVKNKDFEGAAEVLKEMLERWMVPGKVLLTELFDGLHLSGKRHLMDEL 1268 PN +TF++LIS F KN+DF+GAA+VL+EM+ R + + ++ +GL+ GK L+ EL Sbjct: 446 PNEQTFNILISAFCKNEDFDGAAQVLREMVRRSIPLDSRTVHQVCNGLNHQGKDQLVKEL 505 Query: 1269 LSDVNGGRFVSRVL 1310 L ++ G +F+ L Sbjct: 506 LQEMEGKKFLQEPL 519 >ref|NP_194398.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|334186944|ref|NP_001190849.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75213515|sp|Q9SZ10.1|PP338_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g26680, mitochondrial; Flags: Precursor gi|4455191|emb|CAB36514.1| putative protein [Arabidopsis thaliana] gi|7269520|emb|CAB79523.1| putative protein [Arabidopsis thaliana] gi|332659836|gb|AEE85236.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|332659837|gb|AEE85237.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 521 Score = 469 bits (1206), Expect = e-130 Identities = 226/430 (52%), Positives = 312/430 (72%) Frame = +3 Query: 9 HLLLKVHRDPILSVELYNRILIQNPSSQSLDTHAIILHILTKSRKFISAESLLRKTLIPQ 188 ++LLK+ +D +LS+E +N +NP S SL+THAI+LH LTK+RKF SAES+LR L+ Sbjct: 86 NVLLKIQKDYLLSLEFFNWAKTRNPGSHSLETHAIVLHTLTKNRKFKSAESILRDVLVNG 145 Query: 189 TLTSSSDLFDAILHSYRLCDSTPNVFDSLFKTYAHLKKFRNATETFHRMRDHGFLPTVRS 368 + + +FDA+L+SYR CDSTP VFDSLFKT+AHLKKFRNAT+TF +M+D+GFLPTV S Sbjct: 146 GVDLPAKVFDALLYSYRECDSTPRVFDSLFKTFAHLKKFRNATDTFMQMKDYGFLPTVES 205 Query: 369 CNAFLSSLLNHGRNDIVVSFYREMKRCRISPNVFTLNMVLCALCGLGRLEKAMDELEKME 548 CNA++SSLL GR DI + FYREM+RC+ISPN +TLNMV+ C G+L+K ++ L+ ME Sbjct: 206 CNAYMSSLLGQGRVDIALRFYREMRRCKISPNPYTLNMVMSGYCRSGKLDKGIELLQDME 265 Query: 549 IMGISPNVASFNTMIAGYCXXXXXXXXXXXXXXXXXXXXEPNVITFNTLIHQFCMEGKMH 728 +G S+NT+IAG+C +PNV+TFNTLIH FC K+ Sbjct: 266 RLGFRATDVSYNTLIAGHCEKGLLSSALKLKNMMGKSGLQPNVVTFNTLIHGFCRAMKLQ 325 Query: 729 EANRIFREMKVAEVSPNTVTFNTLIAGYCRENNGEMGFRVYEEMVKNGVEVDILTYNSLI 908 EA+++F EMK V+PNTVT+NTLI GY ++ + EM FR YE+MV NG++ DILTYN+LI Sbjct: 326 EASKVFGEMKAVNVAPNTVTYNTLINGYSQQGDHEMAFRFYEDMVCNGIQRDILTYNALI 385 Query: 909 LGLCNEGKTRKAAHLVRELDRGNLVPNASTYFALITGQCKKQNSERALEIYKVMKMSGCH 1088 GLC + KTRKAA V+ELD+ NLVPN+ST+ ALI GQC ++N++R E+YK M SGCH Sbjct: 386 FGLCKQAKTRKAAQFVKELDKENLVPNSSTFSALIMGQCVRKNADRGFELYKSMIRSGCH 445 Query: 1089 PNAETFDLLISTFVKNKDFEGAAEVLKEMLERWMVPGKVLLTELFDGLHLSGKRHLMDEL 1268 PN +TF++L+S F +N+DF+GA++VL+EM+ R + + ++ +GL GK L+ +L Sbjct: 446 PNEQTFNMLVSAFCRNEDFDGASQVLREMVRRSIPLDSRTVHQVCNGLKHQGKDQLVKKL 505 Query: 1269 LSDVNGGRFV 1298 L ++ G +F+ Sbjct: 506 LQEMEGKKFL 515