BLASTX nr result

ID: Dioscorea21_contig00034797

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00034797
         (552 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003522169.1| PREDICTED: pentatricopeptide repeat-containi...   101   9e-20
ref|XP_002890108.1| pentatricopeptide repeat-containing protein ...    97   2e-18
ref|XP_002535423.1| pentatricopeptide repeat-containing protein,...    95   7e-18
ref|NP_173004.1| pentatricopeptide repeat-containing protein [Ar...    94   1e-17
ref|XP_004137012.1| PREDICTED: pentatricopeptide repeat-containi...    94   1e-17

>ref|XP_003522169.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g13650-like [Glycine max]
          Length = 751

 Score =  101 bits (251), Expect = 9e-20
 Identities = 64/187 (34%), Positives = 101/187 (54%), Gaps = 4/187 (2%)
 Frame = -1

Query: 552 KVFDEMLERNVVTWNSLISGYANAGKGALALEMFYRMRCVETEVAADGFTISSTLMACAG 373
           K+FD+M +RN+V+W S+I+G+A+  +   AL  F +MR +E E+A   F +SS L AC  
Sbjct: 131 KLFDKMSQRNMVSWTSIITGFAHNSRFQEALSSFCQMR-IEGEIATQ-FALSSVLQACTS 188

Query: 372 LGDRQSGVQVHAFLVVAGFGCDSAVLNSLANMYFRVGDXXXXXXXXXXXXXXXXSVILKL 193
           LG  Q G QVH  +V  GFGC+  V ++L +MY + G+                  +L  
Sbjct: 189 LGAIQFGTQVHCLVVKCGFGCELFVGSNLTDMYSKCGE--LSDACKAFEEMPCKDAVLWT 246

Query: 192 MMVKGYVFNELYEDALRSINLGNDFVKMIV----VDPSVVGSILTACSNLRLLHLGRQIH 25
            M+ G+V N  ++ AL +      ++KM+     +D  V+ S L+ACS L+    G+ +H
Sbjct: 247 SMIDGFVKNGDFKKALTA------YMKMVTDDVFIDQHVLCSTLSACSALKASSFGKSLH 300

Query: 24  GLIVTTG 4
             I+  G
Sbjct: 301 ATILKLG 307



 Score = 69.7 bits (169), Expect = 3e-10
 Identities = 53/181 (29%), Positives = 85/181 (46%), Gaps = 1/181 (0%)
 Frame = -1

Query: 552 KVFDEMLERNVVTWNSLISGYANAGKGALALEMFYRMRCVETEVAADGFTISSTLMACAG 373
           K F+EM  ++ V W S+I G+   G    AL  + +M  V  +V  D   + STL AC+ 
Sbjct: 232 KAFEEMPCKDAVLWTSMIDGFVKNGDFKKALTAYMKM--VTDDVFIDQHVLCSTLSACSA 289

Query: 372 LGDRQSGVQVHAFLVVAGFGCDSAVLNSLANMYFRVGDXXXXXXXXXXXXXXXXSVILKL 193
           L     G  +HA ++  GF  ++ + N+L +MY + GD                 V L  
Sbjct: 290 LKASSFGKSLHATILKLGFEYETFIGNALTDMYSKSGDMVSASNVFQIHSDCISIVSL-T 348

Query: 192 MMVKGYVFNELYEDALRS-INLGNDFVKMIVVDPSVVGSILTACSNLRLLHLGRQIHGLI 16
            ++ GYV  +  E AL + ++L     + I  +     S++ AC+N   L  G Q+HG +
Sbjct: 349 AIIDGYVEMDQIEKALSTFVDLRR---RGIEPNEFTFTSLIKACANQAKLEHGSQLHGQV 405

Query: 15  V 13
           V
Sbjct: 406 V 406


>ref|XP_002890108.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297335950|gb|EFH66367.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 866

 Score = 96.7 bits (239), Expect = 2e-18
 Identities = 58/180 (32%), Positives = 93/180 (51%), Gaps = 1/180 (0%)
 Frame = -1

Query: 549 VFDEMLERNVVTWNSLISGYANAGKGALALEMFYRMRCVETEVAADGFTISSTLMACAGL 370
           +FD M  R++++WN++ISGY   G G   L++F+ MR +  +   D  T++S + AC  L
Sbjct: 253 LFDRMPRRDIISWNAMISGYFENGMGHEGLKLFFAMRGLSVD--PDLMTLTSVISACELL 310

Query: 369 GDRQSGVQVHAFLVVAGFGCDSAVLNSLANMYFRVGDXXXXXXXXXXXXXXXXSVILKLM 190
           GDR+ G  +HA+++  GF  D +V NSL  MY   G                  ++    
Sbjct: 311 GDRRLGRDIHAYVITTGFAVDISVCNSLTQMYLYAGS--WREAEKLFSRMDCKDIVSWTT 368

Query: 189 MVKGYVFNELYEDALRSIN-LGNDFVKMIVVDPSVVGSILTACSNLRLLHLGRQIHGLIV 13
           M+ GY +N L E A+ +   +  D VK    D   V ++L+AC+ L  L  G ++H L +
Sbjct: 369 MISGYEYNFLPEKAIDTYRMMDQDSVK---PDEITVAAVLSACATLGDLDTGVELHKLAI 425



 Score = 84.0 bits (206), Expect = 2e-14
 Identities = 58/184 (31%), Positives = 89/184 (48%), Gaps = 2/184 (1%)
 Frame = -1

Query: 549 VFDEMLERNVVTWNSLISGYANAGKGALALEMFYRMRCVETEVAADGFTISSTLMACAGL 370
           VF +M ERN+ +WN L+ GYA  G    A+ +++RM  V   V  D +T    L  C G+
Sbjct: 151 VFGKMSERNLFSWNVLVGGYAKQGYFDEAICLYHRMLWVGG-VKPDVYTFPCVLRTCGGI 209

Query: 369 GDRQSGVQVHAFLVVAGFGCDSAVLNSLANMYFRVGDXXXXXXXXXXXXXXXXSVILKLM 190
            D   G +VH  +V  G+  D  V+N+L  MY + GD                 +I    
Sbjct: 210 PDLARGREVHVHVVRYGYELDIDVVNALITMYVKCGD--VKSARLLFDRMPRRDIISWNA 267

Query: 189 MVKGYVFNELYEDALRSINLGNDFVKMIVVDPSV--VGSILTACSNLRLLHLGRQIHGLI 16
           M+ GY  N +  + L+        ++ + VDP +  + S+++AC  L    LGR IH  +
Sbjct: 268 MISGYFENGMGHEGLKLFFA----MRGLSVDPDLMTLTSVISACELLGDRRLGRDIHAYV 323

Query: 15  VTTG 4
           +TTG
Sbjct: 324 ITTG 327



 Score = 55.5 bits (132), Expect = 6e-06
 Identities = 29/96 (30%), Positives = 52/96 (54%)
 Frame = -1

Query: 549 VFDEMLERNVVTWNSLISGYANAGKGALALEMFYRMRCVETEVAADGFTISSTLMACAGL 370
           +F  +  +NV++W S+I+G     +   AL  F +M+     +  +  T+++ L ACA +
Sbjct: 455 IFHNIPRKNVISWTSIIAGLRLNNRCFEALIFFRQMKMT---LQPNAITLTAALAACARI 511

Query: 369 GDRQSGVQVHAFLVVAGFGCDSAVLNSLANMYFRVG 262
           G    G ++HA ++  G G D  + N+L +MY R G
Sbjct: 512 GALMCGKEIHAHVLRTGVGLDDFLPNALLDMYVRCG 547


>ref|XP_002535423.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223523164|gb|EEF26960.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 563

 Score = 95.1 bits (235), Expect = 7e-18
 Identities = 62/183 (33%), Positives = 91/183 (49%)
 Frame = -1

Query: 549 VFDEMLERNVVTWNSLISGYANAGKGALALEMFYRMRCVETEVAADGFTISSTLMACAGL 370
           VF+EM  R++V+WNSLISGY+  G    ALE++Y +R     +  D FT+SS L AC GL
Sbjct: 90  VFEEMTHRDIVSWNSLISGYSANGYWDEALEIYYELRI--AGLKPDNFTLSSVLPACGGL 147

Query: 369 GDRQSGVQVHAFLVVAGFGCDSAVLNSLANMYFRVGDXXXXXXXXXXXXXXXXSVILKLM 190
              + G  +H  +   G   D  + N L +MYF+ G                   +    
Sbjct: 148 LAVKEGEVIHGLVEKLGMNIDVIMSNGLLSMYFKFG--RLMDAQRVFNKMVVKDYVSWNT 205

Query: 189 MVKGYVFNELYEDALRSINLGNDFVKMIVVDPSVVGSILTACSNLRLLHLGRQIHGLIVT 10
           ++ GY   EL+E+   SI L  + VK    D   + S+L AC  LR L  G+ +H  I+ 
Sbjct: 206 LICGYCQMELFEE---SIQLFREMVKRFRPDLLTITSVLRACGLLRDLEFGKFVHDYILR 262

Query: 9   TGV 1
           +G+
Sbjct: 263 SGI 265



 Score = 69.3 bits (168), Expect = 4e-10
 Identities = 48/171 (28%), Positives = 80/171 (46%)
 Frame = -1

Query: 513 WNSLISGYANAGKGALALEMFYRMRCVETEVAADGFTISSTLMACAGLGDRQSGVQVHAF 334
           WNS+I    + G  + AL+++++M+  +  V  D +T  S + ACA LGD + G  V   
Sbjct: 1   WNSVIRALTHNGLFSKALDLYFKMK--DFNVKPDTYTFPSVINACAALGDFEIGNVVQNH 58

Query: 333 LVVAGFGCDSAVLNSLANMYFRVGDXXXXXXXXXXXXXXXXSVILKLMMVKGYVFNELYE 154
           ++  GFG D  + N+L +MY R GD                 ++    ++ GY  N  ++
Sbjct: 59  VLEIGFGFDLYIGNALVDMYARFGD--LVKARNVFEEMTHRDIVSWNSLISGYSANGYWD 116

Query: 153 DALRSINLGNDFVKMIVVDPSVVGSILTACSNLRLLHLGRQIHGLIVTTGV 1
           +AL         +  +  D   + S+L AC  L  +  G  IHGL+   G+
Sbjct: 117 EALEIYY--ELRIAGLKPDNFTLSSVLPACGGLLAVKEGEVIHGLVEKLGM 165



 Score = 56.2 bits (134), Expect = 3e-06
 Identities = 30/98 (30%), Positives = 51/98 (52%)
 Frame = -1

Query: 552 KVFDEMLERNVVTWNSLISGYANAGKGALALEMFYRMRCVETEVAADGFTISSTLMACAG 373
           K FD +  R+ V+WN+LI+GY  +      +++F +M+    ++  D  T  + L     
Sbjct: 290 KAFDRIKCRDSVSWNTLINGYIQSRSYGEGVKLFKKMKM---DLKPDSITFVTLLSISTR 346

Query: 372 LGDRQSGVQVHAFLVVAGFGCDSAVLNSLANMYFRVGD 259
           L D + G ++H  L   GF  D  V N+L +MY + G+
Sbjct: 347 LADTELGKEIHCDLAKLGFDSDLVVSNALVDMYSKCGN 384


>ref|NP_173004.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75191104|sp|Q9M9E2.1|PPR45_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At1g15510, chloroplastic; Flags: Precursor
           gi|8072389|gb|AAF71977.1|AC013453_2 Hypothetical protein
           [Arabidopsis thaliana] gi|300825685|gb|ADK35876.1|
           chloroplast vanilla cream 1 [Arabidopsis thaliana]
           gi|332191210|gb|AEE29331.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 866

 Score = 94.4 bits (233), Expect = 1e-17
 Identities = 57/180 (31%), Positives = 92/180 (51%), Gaps = 1/180 (0%)
 Frame = -1

Query: 549 VFDEMLERNVVTWNSLISGYANAGKGALALEMFYRMRCVETEVAADGFTISSTLMACAGL 370
           +FD M  R++++WN++ISGY   G     LE+F+ MR +  +   D  T++S + AC  L
Sbjct: 253 LFDRMPRRDIISWNAMISGYFENGMCHEGLELFFAMRGLSVD--PDLMTLTSVISACELL 310

Query: 369 GDRQSGVQVHAFLVVAGFGCDSAVLNSLANMYFRVGDXXXXXXXXXXXXXXXXSVILKLM 190
           GDR+ G  +HA+++  GF  D +V NSL  MY   G                  ++    
Sbjct: 311 GDRRLGRDIHAYVITTGFAVDISVCNSLTQMYLNAGS--WREAEKLFSRMERKDIVSWTT 368

Query: 189 MVKGYVFNELYEDALRSIN-LGNDFVKMIVVDPSVVGSILTACSNLRLLHLGRQIHGLIV 13
           M+ GY +N L + A+ +   +  D VK    D   V ++L+AC+ L  L  G ++H L +
Sbjct: 369 MISGYEYNFLPDKAIDTYRMMDQDSVK---PDEITVAAVLSACATLGDLDTGVELHKLAI 425



 Score = 84.0 bits (206), Expect = 2e-14
 Identities = 58/184 (31%), Positives = 88/184 (47%), Gaps = 2/184 (1%)
 Frame = -1

Query: 549 VFDEMLERNVVTWNSLISGYANAGKGALALEMFYRMRCVETEVAADGFTISSTLMACAGL 370
           VF +M ERN+ +WN L+ GYA  G    A+ +++RM  V   V  D +T    L  C G+
Sbjct: 151 VFGKMSERNLFSWNVLVGGYAKQGYFDEAMCLYHRMLWVGG-VKPDVYTFPCVLRTCGGI 209

Query: 369 GDRQSGVQVHAFLVVAGFGCDSAVLNSLANMYFRVGDXXXXXXXXXXXXXXXXSVILKLM 190
            D   G +VH  +V  G+  D  V+N+L  MY + GD                 +I    
Sbjct: 210 PDLARGKEVHVHVVRYGYELDIDVVNALITMYVKCGD--VKSARLLFDRMPRRDIISWNA 267

Query: 189 MVKGYVFNELYEDALRSINLGNDFVKMIVVDPSV--VGSILTACSNLRLLHLGRQIHGLI 16
           M+ GY  N +  + L         ++ + VDP +  + S+++AC  L    LGR IH  +
Sbjct: 268 MISGYFENGMCHEGLELFFA----MRGLSVDPDLMTLTSVISACELLGDRRLGRDIHAYV 323

Query: 15  VTTG 4
           +TTG
Sbjct: 324 ITTG 327



 Score = 62.0 bits (149), Expect = 6e-08
 Identities = 49/185 (26%), Positives = 86/185 (46%), Gaps = 1/185 (0%)
 Frame = -1

Query: 552 KVFDEMLERNVVTWNSLISGYANAGKGALALEMFYRMRCVETEVAADGFTISSTLMACAG 373
           K+F  M  +++V+W ++ISGY        A++ +  M   +  V  D  T+++ L ACA 
Sbjct: 353 KLFSRMERKDIVSWTTMISGYEYNFLPDKAIDTYRMMD--QDSVKPDEITVAAVLSACAT 410

Query: 372 LGDRQSGVQVHAFLVVAGFGCDSAVLNSLANMYFRVGDXXXXXXXXXXXXXXXXSVILKL 193
           LGD  +GV++H   + A       V N+L NMY +                   +VI   
Sbjct: 411 LGDLDTGVELHKLAIKARLISYVIVANNLINMYSKC--KCIDKALDIFHNIPRKNVISWT 468

Query: 192 MMVKGYVFNELYEDALRSINLGNDFVKMIVVDPSV-VGSILTACSNLRLLHLGRQIHGLI 16
            ++ G   N    +AL  +      +KM +   ++ + + L AC+ +  L  G++IH  +
Sbjct: 469 SIIAGLRLNNRCFEALIFLRQ----MKMTLQPNAITLTAALAACARIGALMCGKEIHAHV 524

Query: 15  VTTGV 1
           + TGV
Sbjct: 525 LRTGV 529


>ref|XP_004137012.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g03540-like [Cucumis sativus]
           gi|449493172|ref|XP_004159212.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At1g03540-like [Cucumis sativus]
          Length = 605

 Score = 94.0 bits (232), Expect = 1e-17
 Identities = 55/183 (30%), Positives = 95/183 (51%)
 Frame = -1

Query: 552 KVFDEMLERNVVTWNSLISGYANAGKGALALEMFYRMRCVETEVAADGFTISSTLMACAG 373
           +VFD +  ++VV+W S+I+GY   GK  +A+E+F+ M  +++ +  +GFT+S+ + AC+ 
Sbjct: 117 RVFDGLFVKDVVSWASMITGYVREGKSGIAIELFWDM--LDSGIEPNGFTLSAVIKACSE 174

Query: 372 LGDRQSGVQVHAFLVVAGFGCDSAVLNSLANMYFRVGDXXXXXXXXXXXXXXXXSVILKL 193
           +G+   G   H  +V  GF  +  +L+SL +MY R  +                  +   
Sbjct: 175 IGNLVLGKCFHGVVVRRGFDSNPVILSSLIDMYGR--NSVSSDARQLFDELLEPDPVCWT 232

Query: 192 MMVKGYVFNELYEDALRSINLGNDFVKMIVVDPSVVGSILTACSNLRLLHLGRQIHGLIV 13
            ++  +  N+LYE+AL    L +     +  D    GS+LTAC NL  L  G +IH  ++
Sbjct: 233 TVISAFTRNDLYEEALGFFYLKHR-AHRLCPDNYTFGSVLTACGNLGRLRQGEEIHAKVI 291

Query: 12  TTG 4
             G
Sbjct: 292 AYG 294



 Score = 67.4 bits (163), Expect = 1e-09
 Identities = 51/183 (27%), Positives = 82/183 (44%)
 Frame = -1

Query: 552 KVFDEMLERNVVTWNSLISGYANAGKGALALEMFYRMRCVETEVAADGFTISSTLMACAG 373
           ++FDE+LE + V W ++IS +        AL  FY ++     +  D +T  S L AC  
Sbjct: 218 QLFDELLEPDPVCWTTVISAFTRNDLYEEALGFFY-LKHRAHRLCPDNYTFGSVLTACGN 276

Query: 372 LGDRQSGVQVHAFLVVAGFGCDSAVLNSLANMYFRVGDXXXXXXXXXXXXXXXXSVILKL 193
           LG  + G ++HA ++  GF  +    +SL +MY + G                      L
Sbjct: 277 LGRLRQGEEIHAKVIAYGFSGNVVTESSLVDMYGKCGAVEKSQRLFDRMSNRNSVSWSAL 336

Query: 192 MMVKGYVFNELYEDALRSINLGNDFVKMIVVDPSVVGSILTACSNLRLLHLGRQIHGLIV 13
           + V  Y  N  YE A+      N F +M  VD    G+++ AC+ L  +  G++IH   +
Sbjct: 337 LAV--YCHNGDYEKAV------NLFREMKEVDLYSFGTVIRACAGLAAVTPGKEIHCQYI 388

Query: 12  TTG 4
             G
Sbjct: 389 RKG 391

