BLASTX nr result

ID: Dioscorea21_contig00030147

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00030147
         (1213 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002311390.1| predicted protein [Populus trichocarpa] gi|2...   457   e-126
ref|XP_002265079.1| PREDICTED: pentatricopeptide repeat-containi...   328   1e-87
ref|XP_002281942.1| PREDICTED: pentatricopeptide repeat-containi...   327   5e-87
emb|CAN61593.1| hypothetical protein VITISV_030555 [Vitis vinifera]   325   1e-86
ref|NP_193619.1| pentatricopeptide repeat-containing protein [Ar...   325   2e-86

>ref|XP_002311390.1| predicted protein [Populus trichocarpa] gi|222851210|gb|EEE88757.1|
            predicted protein [Populus trichocarpa]
          Length = 594

 Score =  457 bits (1176), Expect = e-126
 Identities = 217/382 (56%), Positives = 292/382 (76%), Gaps = 1/382 (0%)
 Frame = +1

Query: 70   DQPDSLRYAHQLFHSPHCPRTPFFYNSLIKSYSINGYPKTAFLLFCDMLL-HGVAEPDRY 246
            D   SL YA +LF +   PR  F Y ++IK+Y+  G P+ AF  +  ML       P+ +
Sbjct: 77   DNLGSLNYAQKLFDTVDIPRNSFMYTTMIKAYANFGNPREAFAFYSRMLCDQRYVYPNDF 136

Query: 247  TYTFVCNACSKAMLVFEGKQVHARLVKNANGVSPESWNSLMDFYLNIGEDVRRVRRILDG 426
            T+T+V +ACSK   VFEGKQ HA+++K        SWNSL+DFY  +GE    VRR+ D 
Sbjct: 137  TFTYVFSACSKFNGVFEGKQAHAQMIKFPFEFGVHSWNSLLDFYGKVGEVGIVVRRVFDK 196

Query: 427  MKDPCIVSWNCLLDGYVKSGEIEDARKVFEEMPERDTVSWTTMLLGYVNEGMLDEACCLF 606
            ++ P +VSWNCL++GYVKSG++++AR++F+EMPERD VSWT ML+GY + G L EA CLF
Sbjct: 197  IEGPDVVSWNCLINGYVKSGDLDEARRLFDEMPERDVVSWTIMLVGYADAGFLSEASCLF 256

Query: 607  DEMPEKNMVSWSVMIKGFWRSGCYNEALDLFKEMQVLDIEIDKITLTTLLSACAGLGALD 786
            DEMP++N+VSWS +IKG+ + GCY++AL+LFKEMQV  +++D++ +TTLLSACA LGALD
Sbjct: 257  DEMPKRNLVSWSALIKGYIQIGCYSKALELFKEMQVAKVKMDEVIVTTLLSACARLGALD 316

Query: 787  QGCWIHAFIDKHGVEVDAHLCTALVDMYAKCGRLDLARKVFQGFKKRKVFVWNAMLGGLA 966
            QG W+H +IDKHG++VDAHL TAL+DMY+KCGR+D+A KVFQ    +KVFVW++M+GGLA
Sbjct: 317  QGRWLHMYIDKHGIKVDAHLSTALIDMYSKCGRIDMAWKVFQETGDKKVFVWSSMIGGLA 376

Query: 967  MHSLGLEAVELFSEMLRSGIRPNEITFICVLSACSHSGLVKDGLQIFHSMAEDYKIKPCV 1146
            MHS G +A+ELF++M+  GI P+EIT+I +L+AC+HSGLV  GLQIF+ M E+ K KP +
Sbjct: 377  MHSFGEKAIELFAKMIECGIEPSEITYINILAACTHSGLVDVGLQIFNRMVENQKPKPRM 436

Query: 1147 QHYGCLVDLLGRAGLFEEAKRV 1212
            QHYGC+VDLLGRAGL  +A RV
Sbjct: 437  QHYGCIVDLLGRAGLLHDAFRV 458



 Score = 57.0 bits (136), Expect = 9e-06
 Identities = 57/271 (21%), Positives = 119/271 (43%), Gaps = 9/271 (3%)
 Frame = +1

Query: 142  YNSLIKSYSINGYPKTAFLLFCDMLLHGVAEPDRYTYTFVCNACSKAMLVFEGKQVHARL 321
            +++LIK Y   G    A  LF +M +  V + D    T + +AC++   + +G+ +H  +
Sbjct: 267  WSALIKGYIQIGCYSKALELFKEMQVAKV-KMDEVIVTTLLSACARLGALDQGRWLHMYI 325

Query: 322  VKNANGVSPESWNSLMDFYLNIGEDVRRVRRILDGMKDPCIVSWNCLLDGYVKSGEIEDA 501
             K+   V      +L+D Y   G  +    ++     D  +  W+ ++ G       E A
Sbjct: 326  DKHGIKVDAHLSTALIDMYSKCGR-IDMAWKVFQETGDKKVFVWSSMIGGLAMHSFGEKA 384

Query: 502  RKVFEEMPE----RDTVSWTTMLLGYVNEGMLDEACCLFDEM-----PEKNMVSWSVMIK 654
             ++F +M E       +++  +L    + G++D    +F+ M     P+  M  +  ++ 
Sbjct: 385  IELFAKMIECGIEPSEITYINILAACTHSGLVDVGLQIFNRMVENQKPKPRMQHYGCIVD 444

Query: 655  GFWRSGCYNEALDLFKEMQVLDIEIDKITLTTLLSACAGLGALDQGCWIHAFIDKHGVEV 834
               R+G  ++A   F+ ++ + ++ D      LLSAC     ++ G  +   + K   + 
Sbjct: 445  LLGRAGLLHDA---FRVVETMPVKADPAIWRALLSACKLHRNVELGEQVGRILIKMEPQN 501

Query: 835  DAHLCTALVDMYAKCGRLDLARKVFQGFKKR 927
            D +      ++YA   R D++ K+ +  K R
Sbjct: 502  DMNY-VLFSNVYAAVNRWDISGKLRREMKVR 531


>ref|XP_002265079.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18840-like
            [Vitis vinifera]
          Length = 536

 Score =  328 bits (842), Expect = 1e-87
 Identities = 160/377 (42%), Positives = 246/377 (65%)
 Frame = +1

Query: 82   SLRYAHQLFHSPHCPRTPFFYNSLIKSYSINGYPKTAFLLFCDMLLHGVAEPDRYTYTFV 261
            ++ YAH +F     P + + +N++I++Y+ +  P+ A  +F  ML H    PD+YT+TF 
Sbjct: 57   AIPYAHSIFSRIPNPNS-YMWNTIIRAYANSPTPEAALTIFHQML-HASVLPDKYTFTFA 114

Query: 262  CNACSKAMLVFEGKQVHARLVKNANGVSPESWNSLMDFYLNIGEDVRRVRRILDGMKDPC 441
              +C     V EG+Q+H  ++K   G      N+L+  Y + G  +   R +LD M +  
Sbjct: 115  LKSCGSFSGVEEGRQIHGHVLKTGLGDDLFIQNTLIHLYASCG-CIEDARHLLDRMLERD 173

Query: 442  IVSWNCLLDGYVKSGEIEDARKVFEEMPERDTVSWTTMLLGYVNEGMLDEACCLFDEMPE 621
            +VSWN LL  Y + G +E A  +F+EM ER+  SW  M+ GYV  G+L+EA  +F E P 
Sbjct: 174  VVSWNALLSAYAERGLMELACHLFDEMTERNVESWNFMISGYVGVGLLEEARRVFGETPV 233

Query: 622  KNMVSWSVMIKGFWRSGCYNEALDLFKEMQVLDIEIDKITLTTLLSACAGLGALDQGCWI 801
            KN+VSW+ MI G+  +G ++E L LF++MQ   ++ D  TL ++LSACA +GAL QG W+
Sbjct: 234  KNVVSWNAMITGYSHAGRFSEVLVLFEDMQHAGVKPDNCTLVSVLSACAHVGALSQGEWV 293

Query: 802  HAFIDKHGVEVDAHLCTALVDMYAKCGRLDLARKVFQGFKKRKVFVWNAMLGGLAMHSLG 981
            HA+IDK+G+ +D  + TALVDMY+KCG ++ A +VF    ++ +  WN+++ GL+ H  G
Sbjct: 294  HAYIDKNGISIDGFVATALVDMYSKCGSIEKALEVFNSCLRKDISTWNSIISGLSTHGSG 353

Query: 982  LEAVELFSEMLRSGIRPNEITFICVLSACSHSGLVKDGLQIFHSMAEDYKIKPCVQHYGC 1161
              A+++FSEML  G +PNE+TF+CVLSACS +GL+ +G ++F+ M   + I+P ++HYGC
Sbjct: 354  QHALQIFSEMLVEGFKPNEVTFVCVLSACSRAGLLDEGREMFNLMVHVHGIQPTIEHYGC 413

Query: 1162 LVDLLGRAGLFEEAKRV 1212
            +VDLLGR GL EEA+ +
Sbjct: 414  MVDLLGRVGLLEEAEEL 430



 Score = 86.7 bits (213), Expect = 1e-14
 Identities = 83/368 (22%), Positives = 155/368 (42%), Gaps = 41/368 (11%)
 Frame = +1

Query: 136  FFYNSLIKSYSINGYPKTAFLLFCDMLLHGVAEPDRYTYTFVCNACSKAMLVFEGKQVHA 315
            F  N+LI  Y+  G  + A  L   ML     E D  ++  + +A ++  L+    ++  
Sbjct: 144  FIQNTLIHLYASCGCIEDARHLLDRML-----ERDVVSWNALLSAYAERGLM----ELAC 194

Query: 316  RLVKNANGVSPESWNSLMDFYLNIGEDVRRVRRILDGMKDPCIVSWNCLLDGYVKSGEIE 495
             L       + ESWN ++  Y+ +G  +   RR+        +VSWN ++ GY  +G   
Sbjct: 195  HLFDEMTERNVESWNFMISGYVGVGL-LEEARRVFGETPVKNVVSWNAMITGYSHAGRFS 253

Query: 496  DARKVFEEM------PERDTV----------------SW-----------------TTML 558
            +   +FE+M      P+  T+                 W                 T ++
Sbjct: 254  EVLVLFEDMQHAGVKPDNCTLVSVLSACAHVGALSQGEWVHAYIDKNGISIDGFVATALV 313

Query: 559  LGYVNEGMLDEACCLFDEMPEKNMVSWSVMIKGFWRSGCYNEALDLFKEMQVLDIEIDKI 738
              Y   G +++A  +F+    K++ +W+ +I G    G    AL +F EM V   + +++
Sbjct: 314  DMYSKCGSIEKALEVFNSCLRKDISTWNSIISGLSTHGSGQHALQIFSEMLVEGFKPNEV 373

Query: 739  TLTTLLSACAGLGALDQGC-WIHAFIDKHGVEVDAHLCTALVDMYAKCGRLDLARKVFQG 915
            T   +LSAC+  G LD+G    +  +  HG++        +VD+  + G L+ A ++ Q 
Sbjct: 374  TFVCVLSACSRAGLLDEGREMFNLMVHVHGIQPTIEHYGCMVDLLGRVGLLEEAEELVQK 433

Query: 916  F-KKRKVFVWNAMLGGLAMHSLGLEAVELFSEMLRSGIRPNEITFICVLSACSHSGLVKD 1092
              +K    VW ++LG    H   +E  E  ++ L         +F+ + +  +  G  KD
Sbjct: 434  MPQKEASVVWESLLGACRNHG-NVELAERVAQKLLELSPQESSSFVQLSNMYASMGRWKD 492

Query: 1093 GLQIFHSM 1116
             +++   M
Sbjct: 493  VMEVRQKM 500


>ref|XP_002281942.1| PREDICTED: pentatricopeptide repeat-containing protein At5g48910
            isoform 1 [Vitis vinifera]
          Length = 672

 Score =  327 bits (837), Expect = 5e-87
 Identities = 172/361 (47%), Positives = 240/361 (66%), Gaps = 2/361 (0%)
 Frame = +1

Query: 136  FFYNSLIKSYSINGYPKTAFLLFCDMLLHGVAEPDRYTYTFVCNACSKAMLVFEGKQVHA 315
            F +N +IK    N  P  A LL+ +M++     P++YTY  V  ACS A +V EG QVHA
Sbjct: 103  FLWNCMIKVCIENNEPFKAILLYYEMMVAHF-RPNKYTYPAVLKACSDAGVVAEGVQVHA 161

Query: 316  RLVKNANGVSPESWNSLMDFYLNIGEDVRRVRRILDGMKDPC-IVSWNCLLDGYVKSGEI 492
             LVK+  G      +S +  Y + G  V   RRILD        V WN ++DGY++ GE+
Sbjct: 162  HLVKHGLGGDGHILSSAIRMYASFGRLVE-ARRILDDKGGEVDAVCWNAMIDGYLRFGEV 220

Query: 493  EDARKVFEEMPERDTVS-WTTMLLGYVNEGMLDEACCLFDEMPEKNMVSWSVMIKGFWRS 669
            E AR++FE MP+R  +S W  M+ G+   GM++ A   FDEM E++ +SWS MI G+ + 
Sbjct: 221  EAARELFEGMPDRSMISTWNAMISGFSRCGMVEVAREFFDEMKERDEISWSAMIDGYIQE 280

Query: 670  GCYNEALDLFKEMQVLDIEIDKITLTTLLSACAGLGALDQGCWIHAFIDKHGVEVDAHLC 849
            GC+ EAL++F +MQ   I   K  L ++LSACA LGALDQG WIH +  ++ +++D  L 
Sbjct: 281  GCFMEALEIFHQMQKEKIRPRKFVLPSVLSACANLGALDQGRWIHTYAKRNSIQLDGVLG 340

Query: 850  TALVDMYAKCGRLDLARKVFQGFKKRKVFVWNAMLGGLAMHSLGLEAVELFSEMLRSGIR 1029
            T+LVDMYAKCGR+DLA +VF+    ++V  WNAM+GGLAMH    +A++LFS+M    I 
Sbjct: 341  TSLVDMYAKCGRIDLAWEVFEKMSNKEVSSWNAMIGGLAMHGRAEDAIDLFSKM---DIN 397

Query: 1030 PNEITFICVLSACSHSGLVKDGLQIFHSMAEDYKIKPCVQHYGCLVDLLGRAGLFEEAKR 1209
            PNEITF+ VL+AC+H GLV+ GL IF+SM ++Y ++P ++HYGC+VDLLGRAGL  EA++
Sbjct: 398  PNEITFVGVLNACAHGGLVQKGLTIFNSMRKEYGVEPQIEHYGCIVDLLGRAGLLTEAEK 457

Query: 1210 V 1212
            V
Sbjct: 458  V 458



 Score = 61.2 bits (147), Expect = 5e-07
 Identities = 34/106 (32%), Positives = 59/106 (55%), Gaps = 5/106 (4%)
 Frame = +1

Query: 802  HAFIDKHGVEVDAHLCTALVDMYAKCGR-----LDLARKVFQGFKKRKVFVWNAMLGGLA 966
            HA I + G   D+++  +LV  YA          + + +VF   +K  VF+WN M+    
Sbjct: 54   HALILRTGHLQDSYIAGSLVKSYANVSTNRYLSFESSLRVFDFVRKPNVFLWNCMIKVCI 113

Query: 967  MHSLGLEAVELFSEMLRSGIRPNEITFICVLSACSHSGLVKDGLQI 1104
             ++   +A+ L+ EM+ +  RPN+ T+  VL ACS +G+V +G+Q+
Sbjct: 114  ENNEPFKAILLYYEMMVAHFRPNKYTYPAVLKACSDAGVVAEGVQV 159


>emb|CAN61593.1| hypothetical protein VITISV_030555 [Vitis vinifera]
          Length = 673

 Score =  325 bits (834), Expect = 1e-86
 Identities = 171/361 (47%), Positives = 241/361 (66%), Gaps = 2/361 (0%)
 Frame = +1

Query: 136  FFYNSLIKSYSINGYPKTAFLLFCDMLLHGVAEPDRYTYTFVCNACSKAMLVFEGKQVHA 315
            F +N +IK    N  P  A LL+ +M++   + P++YTY  V  ACS + +V EG QVHA
Sbjct: 104  FLWNCMIKVCIENNEPFKAILLYYEMVV-AHSRPNKYTYPAVLKACSDSGVVAEGVQVHA 162

Query: 316  RLVKNANGVSPESWNSLMDFYLNIGEDVRRVRRILDGMKDPC-IVSWNCLLDGYVKSGEI 492
             LVK+  G      +S +  Y + G  V   RRILD        V WN ++DGY++ GE+
Sbjct: 163  HLVKHGLGGDGHILSSAIRMYASFGRLVE-ARRILDDKGGEVDAVCWNAMIDGYLRFGEV 221

Query: 493  EDARKVFEEMPERDTVS-WTTMLLGYVNEGMLDEACCLFDEMPEKNMVSWSVMIKGFWRS 669
            E AR++FE MP+R  +S W  M+ G+   GM++ A   FDEM E++ +SWS MI G+ + 
Sbjct: 222  EAARELFEGMPDRSMISTWNAMISGFSRCGMVEVAREFFDEMKERDEISWSAMIDGYIQE 281

Query: 670  GCYNEALDLFKEMQVLDIEIDKITLTTLLSACAGLGALDQGCWIHAFIDKHGVEVDAHLC 849
            GC+ EAL++F +MQ   I   K  L ++LSACA LGALDQG WIH +  ++ +++D  L 
Sbjct: 282  GCFMEALEIFHQMQKEKIRPRKFVLPSVLSACANLGALDQGRWIHTYAKRNSIQLDGVLG 341

Query: 850  TALVDMYAKCGRLDLARKVFQGFKKRKVFVWNAMLGGLAMHSLGLEAVELFSEMLRSGIR 1029
            T+LVDMYAKCGR+DLA +VF+    ++V  WNAM+GGLAMH    +A++LFS+M    I 
Sbjct: 342  TSLVDMYAKCGRIDLAWEVFEKMSNKEVSSWNAMIGGLAMHGRAEDAIDLFSKM---DIY 398

Query: 1030 PNEITFICVLSACSHSGLVKDGLQIFHSMAEDYKIKPCVQHYGCLVDLLGRAGLFEEAKR 1209
            PNEITF+ VL+AC+H GLV+ GL IF+SM ++Y ++P ++HYGC+VDLLGRAGL  EA++
Sbjct: 399  PNEITFVGVLNACAHGGLVQKGLTIFNSMRKEYGVEPQIEHYGCIVDLLGRAGLLTEAEK 458

Query: 1210 V 1212
            V
Sbjct: 459  V 459



 Score = 61.2 bits (147), Expect = 5e-07
 Identities = 35/106 (33%), Positives = 59/106 (55%), Gaps = 5/106 (4%)
 Frame = +1

Query: 802  HAFIDKHGVEVDAHLCTALVDMYAKCGR-----LDLARKVFQGFKKRKVFVWNAMLGGLA 966
            HA I + G   D+++  +LV  YA          + + +VF   +K  VF+WN M+    
Sbjct: 55   HALILRTGHLQDSYIAGSLVKSYANVSTNRYLSFESSLRVFDFVRKPNVFLWNCMIKVCI 114

Query: 967  MHSLGLEAVELFSEMLRSGIRPNEITFICVLSACSHSGLVKDGLQI 1104
             ++   +A+ L+ EM+ +  RPN+ T+  VL ACS SG+V +G+Q+
Sbjct: 115  ENNEPFKAILLYYEMVVAHSRPNKYTYPAVLKACSDSGVVAEGVQV 160


>ref|NP_193619.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75098703|sp|O49399.2|PP321_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At4g18840 gi|5738365|emb|CAA16741.2| putative protein
            [Arabidopsis thaliana] gi|7268678|emb|CAB78886.1|
            putative protein [Arabidopsis thaliana]
            gi|332658697|gb|AEE84097.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 545

 Score =  325 bits (832), Expect = 2e-86
 Identities = 162/383 (42%), Positives = 248/383 (64%), Gaps = 3/383 (0%)
 Frame = +1

Query: 73   QPDSLRYAHQLFHSPHCPRTPFFYNSLIKSYSINGYPKTAFLLFCDMLLHGVAEPDRYTY 252
            +P ++ YAH + +    P   F +NS+I++Y+ +  P+ A  +F +MLL G   PD+Y++
Sbjct: 86   EPKTVSYAHSILNRIGSPNG-FTHNSVIRAYANSSTPEVALTVFREMLL-GPVFPDKYSF 143

Query: 253  TFVCNACSKAMLVFEGKQVHARLVKNANGVSPESWNSLMDFYLNIGEDVRRVRRILDGMK 432
            TFV  AC+      EG+Q+H   +K+         N+L++ Y   G      R++LD M 
Sbjct: 144  TFVLKACAAFCGFEEGRQIHGLFIKSGLVTDVFVENTLVNVYGRSGY-FEIARKVLDRMP 202

Query: 433  DPCIVSWNCLLDGYVKSGEIEDARKVFEEMPERDTVSWTTMLLGYVNEGMLDEACCLFDE 612
                VSWN LL  Y++ G +++AR +F+EM ER+  SW  M+ GY   G++ EA  +FD 
Sbjct: 203  VRDAVSWNSLLSAYLEKGLVDEARALFDEMEERNVESWNFMISGYAAAGLVKEAKEVFDS 262

Query: 613  MPEKNMVSWSVMIKGFWRSGCYNEALDLFKEMQVLDIEIDK---ITLTTLLSACAGLGAL 783
            MP +++VSW+ M+  +   GCYNE L++F +M  LD   +K    TL ++LSACA LG+L
Sbjct: 263  MPVRDVVSWNAMVTAYAHVGCYNEVLEVFNKM--LDDSTEKPDGFTLVSVLSACASLGSL 320

Query: 784  DQGCWIHAFIDKHGVEVDAHLCTALVDMYAKCGRLDLARKVFQGFKKRKVFVWNAMLGGL 963
             QG W+H +IDKHG+E++  L TALVDMY+KCG++D A +VF+   KR V  WN+++  L
Sbjct: 321  SQGEWVHVYIDKHGIEIEGFLATALVDMYSKCGKIDKALEVFRATSKRDVSTWNSIISDL 380

Query: 964  AMHSLGLEAVELFSEMLRSGIRPNEITFICVLSACSHSGLVKDGLQIFHSMAEDYKIKPC 1143
            ++H LG +A+E+FSEM+  G +PN ITFI VLSAC+H G++    ++F  M+  Y+++P 
Sbjct: 381  SVHGLGKDALEIFSEMVYEGFKPNGITFIGVLSACNHVGMLDQARKLFEMMSSVYRVEPT 440

Query: 1144 VQHYGCLVDLLGRAGLFEEAKRV 1212
            ++HYGC+VDLLGR G  EEA+ +
Sbjct: 441  IEHYGCMVDLLGRMGKIEEAEEL 463

