BLASTX nr result

ID: Dioscorea21_contig00013597

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00013597
         (1326 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002320514.1| predicted protein [Populus trichocarpa] gi|2...   506   e-141
ref|XP_002530223.1| pentatricopeptide repeat-containing protein,...   498   e-138
ref|XP_002269867.1| PREDICTED: pentatricopeptide repeat-containi...   484   e-134
ref|XP_002869597.1| pentatricopeptide repeat-containing protein ...   474   e-131
ref|NP_194398.1| pentatricopeptide repeat-containing protein [Ar...   469   e-130

>ref|XP_002320514.1| predicted protein [Populus trichocarpa] gi|222861287|gb|EEE98829.1|
            predicted protein [Populus trichocarpa]
          Length = 478

 Score =  506 bits (1302), Expect = e-141
 Identities = 248/431 (57%), Positives = 317/431 (73%)
 Frame = +3

Query: 6    NHLLLKVHRDPILSVELYNRILIQNPSSQSLDTHAIILHILTKSRKFISAESLLRKTLIP 185
            NH+LLK+ +D +LS+E +N +   NP S +L+TH+IILHILTK  KF SA+S+LR  L  
Sbjct: 46   NHILLKIQKDHVLSLEFFNSLKTLNPISLTLETHSIILHILTKKSKFKSAQSILRTLLAS 105

Query: 186  QTLTSSSDLFDAILHSYRLCDSTPNVFDSLFKTYAHLKKFRNATETFHRMRDHGFLPTVR 365
            +++     LFD +L SYR+CDS+P VFDSLFKTYAH+ KFRNAT+ F RM+D+GFLPTV 
Sbjct: 106  RSIDLPGKLFDTLLFSYRMCDSSPRVFDSLFKTYAHMNKFRNATDVFSRMKDYGFLPTVE 165

Query: 366  SCNAFLSSLLNHGRNDIVVSFYREMKRCRISPNVFTLNMVLCALCGLGRLEKAMDELEKM 545
            SCNA+LSSLL+  R DI ++FYREM+RCRISPN +T N+VL ALC  G+LEKA++ L +M
Sbjct: 166  SCNAYLSSLLDFHRVDIALTFYREMRRCRISPNSYTFNLVLSALCKSGKLEKAVEVLREM 225

Query: 546  EIMGISPNVASFNTMIAGYCXXXXXXXXXXXXXXXXXXXXEPNVITFNTLIHQFCMEGKM 725
            E +GI+PNV S+NT+IAG+C                    EPNV+TFN+LIH FC EGK+
Sbjct: 226  ESVGITPNVVSYNTLIAGHCNKGLLSIATKLKNLMGKNGLEPNVVTFNSLIHGFCKEGKL 285

Query: 726  HEANRIFREMKVAEVSPNTVTFNTLIAGYCRENNGEMGFRVYEEMVKNGVEVDILTYNSL 905
            HEANR F EMKV  V+PNTVT+NTLI GY +  N  M  +VYEEM++NGV+ DILTYN+L
Sbjct: 286  HEANRFFSEMKVMNVTPNTVTYNTLINGYGQVGNSNMAGKVYEEMMRNGVKADILTYNAL 345

Query: 906  ILGLCNEGKTRKAAHLVRELDRGNLVPNASTYFALITGQCKKQNSERALEIYKVMKMSGC 1085
            ILGLC EGKT+KAA LV+ELD+ NLVPNASTY ALI+GQC ++NS+RA ++YK M  SGC
Sbjct: 346  ILGLCKEGKTKKAAFLVKELDKENLVPNASTYSALISGQCARKNSDRAFQLYKSMVRSGC 405

Query: 1086 HPNAETFDLLISTFVKNKDFEGAAEVLKEMLERWMVPGKVLLTELFDGLHLSGKRHLMDE 1265
            HPN +TF +L S FVKN+DFEGA  VL +M  R M      L E++DGL   GK +L  +
Sbjct: 406  HPNEQTFKMLTSAFVKNEDFEGAFNVLMDMFARSMASDSNTLLEIYDGLCQCGKENLAMK 465

Query: 1266 LLSDVNGGRFV 1298
            L  ++   R +
Sbjct: 466  LCHEMEARRLI 476


>ref|XP_002530223.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223530270|gb|EEF32170.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 517

 Score =  498 bits (1281), Expect = e-138
 Identities = 241/432 (55%), Positives = 322/432 (74%)
 Frame = +3

Query: 9    HLLLKVHRDPILSVELYNRILIQNPSSQSLDTHAIILHILTKSRKFISAESLLRKTLIPQ 188
            H+LLK+ +D +LS+E +N +  +NPSS +L+TH++ILHILTK+RKF SAE +L+  L+  
Sbjct: 64   HILLKIQKDHVLSLEFFNWVQTENPSSHTLETHSMILHILTKNRKFKSAELILKSVLVKG 123

Query: 189  TLTSSSDLFDAILHSYRLCDSTPNVFDSLFKTYAHLKKFRNATETFHRMRDHGFLPTVRS 368
             +     LF+AIL+SYR+CDS+P VFDSLFKT AH+KKFRNAT+TF +M+ +GFLPTV S
Sbjct: 124  FIDLPDKLFEAILYSYRMCDSSPRVFDSLFKTLAHMKKFRNATDTFLQMKGYGFLPTVES 183

Query: 369  CNAFLSSLLNHGRNDIVVSFYREMKRCRISPNVFTLNMVLCALCGLGRLEKAMDELEKME 548
            CNA+LSSLL+  R DI ++FY+EM+RCRISPNV+T NMV+ A C  G+LEKA+   E+ME
Sbjct: 184  CNAYLSSLLDLHRVDIALAFYKEMRRCRISPNVYTRNMVMRAFCKSGKLEKAVQVFEEME 243

Query: 549  IMGISPNVASFNTMIAGYCXXXXXXXXXXXXXXXXXXXXEPNVITFNTLIHQFCMEGKMH 728
             +GISPN  S+NT+I GYC                    E NV+TFN+LI  FC EGK+H
Sbjct: 244  SVGISPNDTSYNTLIMGYCRKGLLNSAVKLKNSMRAKGVEANVVTFNSLIDGFCKEGKLH 303

Query: 729  EANRIFREMKVAEVSPNTVTFNTLIAGYCRENNGEMGFRVYEEMVKNGVEVDILTYNSLI 908
            EA+++F EMKV  V+PNT+T+NTLI G+ +  N EMG R+YEEM +NGV+ DILTYN+LI
Sbjct: 304  EASKVFSEMKVLNVAPNTITYNTLINGHSQMGNSEMGRRLYEEMSRNGVKADILTYNALI 363

Query: 909  LGLCNEGKTRKAAHLVRELDRGNLVPNASTYFALITGQCKKQNSERALEIYKVMKMSGCH 1088
            LGLC EGKT+KAA++V+ELD+ NLVPNAST+ ALI+GQC + NS+RA ++YK M   GCH
Sbjct: 364  LGLCKEGKTKKAAYMVKELDKENLVPNASTFSALISGQCIRNNSDRAFQLYKSMVRIGCH 423

Query: 1089 PNAETFDLLISTFVKNKDFEGAAEVLKEMLERWMVPGKVLLTELFDGLHLSGKRHLMDEL 1268
            PN +TF++L+S F KN+DFEGA  VL EM ER   PG  +L+E++ GL   GK HL  +L
Sbjct: 424  PNEQTFNMLVSAFCKNEDFEGAFLVLMEMFERCFTPGSDVLSEIYHGLCCCGKEHLAMKL 483

Query: 1269 LSDVNGGRFVSR 1304
             S++     +++
Sbjct: 484  SSELKARHMMTK 495


>ref|XP_002269867.1| PREDICTED: pentatricopeptide repeat-containing protein At4g26680,
            mitochondrial-like [Vitis vinifera]
          Length = 616

 Score =  484 bits (1246), Expect = e-134
 Identities = 236/424 (55%), Positives = 317/424 (74%)
 Frame = +3

Query: 9    HLLLKVHRDPILSVELYNRILIQNPSSQSLDTHAIILHILTKSRKFISAESLLRKTLIPQ 188
            H++LK+ +D +LS E +N +  QNP+ Q+L+T++IILHILTK+ KF SAES+L+  L   
Sbjct: 162  HIMLKIKKDHVLSFEFFNWVKAQNPNCQTLETYSIILHILTKNHKFKSAESVLKGILGSG 221

Query: 189  TLTSSSDLFDAILHSYRLCDSTPNVFDSLFKTYAHLKKFRNATETFHRMRDHGFLPTVRS 368
            ++   S LF+AIL+SYR+CDS+P VFDSLFKTYA +KK RNA + F +M+D+GFLP V S
Sbjct: 222  SIDHPSKLFEAILYSYRICDSSPCVFDSLFKTYAQMKKLRNAIDVFCQMKDYGFLPRVES 281

Query: 369  CNAFLSSLLNHGRNDIVVSFYREMKRCRISPNVFTLNMVLCALCGLGRLEKAMDELEKME 548
            CNA++S+ ++  R DI ++FYREM+R RISPNV+TLNMV+CA C  G+LEKA++  ++ME
Sbjct: 282  CNAYISASISLQRGDIALTFYREMQRYRISPNVYTLNMVMCAFCKWGKLEKAIEVFKRME 341

Query: 549  IMGISPNVASFNTMIAGYCXXXXXXXXXXXXXXXXXXXXEPNVITFNTLIHQFCMEGKMH 728
             MG SP + S+NT+IAGYC                     P+ +TFNTLI+ FC  GK+H
Sbjct: 342  TMGFSPTITSYNTLIAGYCNKGLLNSGMKLKILMEKNGVRPDDVTFNTLINGFCRGGKLH 401

Query: 729  EANRIFREMKVAEVSPNTVTFNTLIAGYCRENNGEMGFRVYEEMVKNGVEVDILTYNSLI 908
            EAN+IF EMK  +V PNT+T+NTLI GY +  N EMG R+++EM++NG++ DILTYN+LI
Sbjct: 402  EANKIFSEMKANDVVPNTITYNTLINGYSQVGNSEMGGRLHDEMLRNGIKADILTYNALI 461

Query: 909  LGLCNEGKTRKAAHLVRELDRGNLVPNASTYFALITGQCKKQNSERALEIYKVMKMSGCH 1088
            LGLC EG+T+KAA+LV+ELDR NLVPN+ST+ ALITGQC ++NSERA ++YK M  SGCH
Sbjct: 462  LGLCMEGRTKKAAYLVKELDRENLVPNSSTFSALITGQCVRKNSERAFQLYKSMIRSGCH 521

Query: 1089 PNAETFDLLISTFVKNKDFEGAAEVLKEMLERWMVPGKVLLTELFDGLHLSGKRHLMDEL 1268
            PN  TF +LISTF KN+DF+GA EV++EM ER + P    L+EL  GL LSGK  L  +L
Sbjct: 522  PNYHTFKMLISTFCKNEDFDGAVEVVREMSERSIAPDSDTLSELCRGLWLSGKEELALKL 581

Query: 1269 LSDV 1280
              ++
Sbjct: 582  CKEM 585



 Score = 90.9 bits (224), Expect = 7e-16
 Identities = 71/310 (22%), Positives = 123/310 (39%), Gaps = 37/310 (11%)
 Frame = +3

Query: 225  LHSYRLCDSTPNVFDSLFKTYAHLK--KFRNATETFHRMRDHGFLPTVRSCNAFLSSLLN 398
            +  YR+   +PNV+       A  K  K   A E F RM   GF PT+ S N  ++   N
Sbjct: 305  MQRYRI---SPNVYTLNMVMCAFCKWGKLEKAIEVFKRMETMGFSPTITSYNTLIAGYCN 361

Query: 399  HGRNDIVVSFYREMKRCRISPNVFTLNMVLCALCGLGRLEKAMDELEKMEIMGISPNVAS 578
             G  +  +     M++  + P+  T N ++   C  G+L +A     +M+   + PN  +
Sbjct: 362  KGLLNSGMKLKILMEKNGVRPDDVTFNTLINGFCRGGKLHEANKIFSEMKANDVVPNTIT 421

Query: 579  FNTMIAGYCXXXXXXXXXXXXXXXXXXXXEPNVITFNTLIHQFCMEGKMHEANRIFREMK 758
            +NT+I GY                     + +++T+N LI   CMEG+  +A  + +E+ 
Sbjct: 422  YNTLINGYSQVGNSEMGGRLHDEMLRNGIKADILTYNALILGLCMEGRTKKAAYLVKELD 481

Query: 759  VAEVSPNTVTFNTLI-----------------------------------AGYCRENNGE 833
               + PN+ TF+ LI                                   + +C+  + +
Sbjct: 482  RENLVPNSSTFSALITGQCVRKNSERAFQLYKSMIRSGCHPNYHTFKMLISTFCKNEDFD 541

Query: 834  MGFRVYEEMVKNGVEVDILTYNSLILGLCNEGKTRKAAHLVRELDRGNLVPNASTYFALI 1013
                V  EM +  +  D  T + L  GL   GK   A  L +E++  +L+P        I
Sbjct: 542  GAVEVVREMSERSIAPDSDTLSELCRGLWLSGKEELALKLCKEMEMKHLMPEGFDKSKTI 601

Query: 1014 TGQCKKQNSE 1043
              + + +  E
Sbjct: 602  NFRAESEEKE 611


>ref|XP_002869597.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297315433|gb|EFH45856.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 538

 Score =  474 bits (1220), Expect = e-131
 Identities = 231/434 (53%), Positives = 314/434 (72%)
 Frame = +3

Query: 9    HLLLKVHRDPILSVELYNRILIQNPSSQSLDTHAIILHILTKSRKFISAESLLRKTLIPQ 188
            ++LLK+ +D +LS E +N    +NP+S SL+THAI+LH LTK+RKF SAES+LR  L+  
Sbjct: 86   NVLLKIQKDYLLSFEFFNWAKTRNPASHSLETHAIVLHTLTKNRKFKSAESILRDVLVNG 145

Query: 189  TLTSSSDLFDAILHSYRLCDSTPNVFDSLFKTYAHLKKFRNATETFHRMRDHGFLPTVRS 368
             +   + +FDA+L+SYR CDSTP VFDSLFKT+AHLKKFRNAT+TF +M+D+GFLPTV S
Sbjct: 146  GVDLPAKVFDALLYSYRECDSTPRVFDSLFKTFAHLKKFRNATDTFMQMKDYGFLPTVES 205

Query: 369  CNAFLSSLLNHGRNDIVVSFYREMKRCRISPNVFTLNMVLCALCGLGRLEKAMDELEKME 548
            CNA++SSLL  GR DI + FYREM+RC+ISPN +TLNMV+   C  G+L+K ++ L+ ME
Sbjct: 206  CNAYMSSLLGQGRVDIALRFYREMRRCKISPNTYTLNMVMSGYCRSGKLDKGIELLQDME 265

Query: 549  IMGISPNVASFNTMIAGYCXXXXXXXXXXXXXXXXXXXXEPNVITFNTLIHQFCMEGKMH 728
             +G      S+NT+IAG+C                    +PNV+TFNTLIH FC   K+ 
Sbjct: 266  RLGFRATHVSYNTLIAGHCEKGLLSSALKLKNMMGKNGLQPNVVTFNTLIHGFCRAVKLQ 325

Query: 729  EANRIFREMKVAEVSPNTVTFNTLIAGYCRENNGEMGFRVYEEMVKNGVEVDILTYNSLI 908
            EA+++F EMK   + PNTVT+NTLI GY ++ + EM FR YE+MV NG++ DILTYN+LI
Sbjct: 326  EASKVFGEMKALNLPPNTVTYNTLINGYSQQGDHEMAFRFYEDMVCNGIQRDILTYNTLI 385

Query: 909  LGLCNEGKTRKAAHLVRELDRGNLVPNASTYFALITGQCKKQNSERALEIYKVMKMSGCH 1088
            LGLC + KTRKAA  V+ELD+ NLVPN+ST+ ALI GQC ++N++R  E+YK M  SGCH
Sbjct: 386  LGLCKQAKTRKAAQFVKELDKENLVPNSSTFSALIMGQCVRRNADRGFELYKSMIRSGCH 445

Query: 1089 PNAETFDLLISTFVKNKDFEGAAEVLKEMLERWMVPGKVLLTELFDGLHLSGKRHLMDEL 1268
            PN +TF++LIS F KN+DF+GAA+VL+EM+ R +      + ++ +GL+  GK  L+ EL
Sbjct: 446  PNEQTFNILISAFCKNEDFDGAAQVLREMVRRSIPLDSRTVHQVCNGLNHQGKDQLVKEL 505

Query: 1269 LSDVNGGRFVSRVL 1310
            L ++ G +F+   L
Sbjct: 506  LQEMEGKKFLQEPL 519


>ref|NP_194398.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|334186944|ref|NP_001190849.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
            gi|75213515|sp|Q9SZ10.1|PP338_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At4g26680, mitochondrial; Flags: Precursor
            gi|4455191|emb|CAB36514.1| putative protein [Arabidopsis
            thaliana] gi|7269520|emb|CAB79523.1| putative protein
            [Arabidopsis thaliana] gi|332659836|gb|AEE85236.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana] gi|332659837|gb|AEE85237.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 521

 Score =  469 bits (1206), Expect = e-130
 Identities = 226/430 (52%), Positives = 312/430 (72%)
 Frame = +3

Query: 9    HLLLKVHRDPILSVELYNRILIQNPSSQSLDTHAIILHILTKSRKFISAESLLRKTLIPQ 188
            ++LLK+ +D +LS+E +N    +NP S SL+THAI+LH LTK+RKF SAES+LR  L+  
Sbjct: 86   NVLLKIQKDYLLSLEFFNWAKTRNPGSHSLETHAIVLHTLTKNRKFKSAESILRDVLVNG 145

Query: 189  TLTSSSDLFDAILHSYRLCDSTPNVFDSLFKTYAHLKKFRNATETFHRMRDHGFLPTVRS 368
             +   + +FDA+L+SYR CDSTP VFDSLFKT+AHLKKFRNAT+TF +M+D+GFLPTV S
Sbjct: 146  GVDLPAKVFDALLYSYRECDSTPRVFDSLFKTFAHLKKFRNATDTFMQMKDYGFLPTVES 205

Query: 369  CNAFLSSLLNHGRNDIVVSFYREMKRCRISPNVFTLNMVLCALCGLGRLEKAMDELEKME 548
            CNA++SSLL  GR DI + FYREM+RC+ISPN +TLNMV+   C  G+L+K ++ L+ ME
Sbjct: 206  CNAYMSSLLGQGRVDIALRFYREMRRCKISPNPYTLNMVMSGYCRSGKLDKGIELLQDME 265

Query: 549  IMGISPNVASFNTMIAGYCXXXXXXXXXXXXXXXXXXXXEPNVITFNTLIHQFCMEGKMH 728
             +G      S+NT+IAG+C                    +PNV+TFNTLIH FC   K+ 
Sbjct: 266  RLGFRATDVSYNTLIAGHCEKGLLSSALKLKNMMGKSGLQPNVVTFNTLIHGFCRAMKLQ 325

Query: 729  EANRIFREMKVAEVSPNTVTFNTLIAGYCRENNGEMGFRVYEEMVKNGVEVDILTYNSLI 908
            EA+++F EMK   V+PNTVT+NTLI GY ++ + EM FR YE+MV NG++ DILTYN+LI
Sbjct: 326  EASKVFGEMKAVNVAPNTVTYNTLINGYSQQGDHEMAFRFYEDMVCNGIQRDILTYNALI 385

Query: 909  LGLCNEGKTRKAAHLVRELDRGNLVPNASTYFALITGQCKKQNSERALEIYKVMKMSGCH 1088
             GLC + KTRKAA  V+ELD+ NLVPN+ST+ ALI GQC ++N++R  E+YK M  SGCH
Sbjct: 386  FGLCKQAKTRKAAQFVKELDKENLVPNSSTFSALIMGQCVRKNADRGFELYKSMIRSGCH 445

Query: 1089 PNAETFDLLISTFVKNKDFEGAAEVLKEMLERWMVPGKVLLTELFDGLHLSGKRHLMDEL 1268
            PN +TF++L+S F +N+DF+GA++VL+EM+ R +      + ++ +GL   GK  L+ +L
Sbjct: 446  PNEQTFNMLVSAFCRNEDFDGASQVLREMVRRSIPLDSRTVHQVCNGLKHQGKDQLVKKL 505

Query: 1269 LSDVNGGRFV 1298
            L ++ G +F+
Sbjct: 506  LQEMEGKKFL 515
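
The statistics lines that open each alignment above (Score, Expect, Identities, Positives, Gaps) can be read programmatically. The following is a minimal Python sketch, assuming the legacy text layout shown in this report. The names `parse_expect` and `parse_hsp_stats` are hypothetical helpers written for illustration and are not part of BLAST or any library. It also assumes that an E-value printed without a leading digit, such as `e-141`, is shorthand for `1e-141`.

```python
import re

# Patterns for the statistics lines of one alignment (HSP) in legacy BLAST text output.
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s+bits\s+\((\d+)\),\s+Expect(?:\(\d+\))?\s*=\s*(\S+)"
)
FRACTION_RE = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)")


def parse_expect(text):
    """Convert an E-value string to float; 'e-141' is treated as '1e-141' (assumption)."""
    text = text.rstrip(",")
    if text.startswith("e"):
        text = "1" + text
    return float(text)


def parse_hsp_stats(lines):
    """Collect bit score, raw score, E-value and match counts from HSP header lines."""
    stats = {}
    for line in lines:
        match = SCORE_RE.search(line)
        if match:
            stats["bits"] = float(match.group(1))
            stats["raw_score"] = int(match.group(2))
            stats["expect"] = parse_expect(match.group(3))
        # Each fraction is stored as (matching positions, alignment length).
        for name, num, den in FRACTION_RE.findall(line):
            stats[name.lower()] = (int(num), int(den))
    return stats


# Header of the second alignment to XP_002269867.1, copied from this report.
example = [
    " Score = 90.9 bits (224), Expect = 7e-16",
    " Identities = 71/310 (22%), Positives = 123/310 (39%), Gaps = 37/310 (11%)",
    " Frame = +3",
]
stats = parse_hsp_stats(example)
print(stats)
```

The counts are kept as integer pairs rather than percentages so that the rounding used in the report does not have to be guessed.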

