BLASTX nr result
ID: Dioscorea21_contig00033489
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Dioscorea21_contig00033489 (516 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002284545.2| PREDICTED: pentatricopeptide repeat-containi... 264 5e-69 ref|XP_003546945.1| PREDICTED: putative pentatricopeptide repeat... 256 2e-66 ref|XP_004154387.1| PREDICTED: pentatricopeptide repeat-containi... 252 3e-65 emb|CBI36131.3| unnamed protein product [Vitis vinifera] 250 1e-64 ref|XP_002516017.1| pentatricopeptide repeat-containing protein,... 245 3e-63 >ref|XP_002284545.2| PREDICTED: pentatricopeptide repeat-containing protein At2g27610-like [Vitis vinifera] Length = 648 Score = 264 bits (675), Expect = 5e-69 Identities = 120/170 (70%), Positives = 144/170 (84%) Frame = +3 Query: 3 KAVDPGKQLHARLLLSGLGFDAILATKLVNVYSVCNHLHDAYNLFDRIPQRNVFLWNVLI 182 KA+ PGKQLHA++ L+G GFD ++ATKLVN+Y VC+ L A LFDRIP+ N+FLWNVLI Sbjct: 89 KAIKPGKQLHAQVCLAGFGFDTVIATKLVNLYCVCDSLSSARLLFDRIPKHNIFLWNVLI 148 Query: 183 RGFAWEGPHEMALSLYYRMVEAGLQPDNFTFPFVLKACSALSALEVGREIHEHVVRSRWD 362 RG+AW GP+E A+ LYY+M + GL PDNFTFPFVLKAC+ALSA+E GREIHEHVV++ W+ Sbjct: 149 RGYAWNGPYEAAVQLYYQMFDYGLVPDNFTFPFVLKACAALSAIEHGREIHEHVVQTGWE 208 Query: 363 SDVFVGAGLIDMYAKCGCVDDARQVFDRITKRDVVLWNAMIAAYSQNGHP 512 DVFVGA LIDMYAKCGCV AR+VFD+I RD VLWN+M+AAYSQNGHP Sbjct: 209 KDVFVGAALIDMYAKCGCVGSAREVFDKILVRDAVLWNSMLAAYSQNGHP 258 Score = 115 bits (287), Expect = 5e-24 Identities = 59/168 (35%), Positives = 95/168 (56%) Frame = +3 Query: 6 AVDPGKQLHARLLLSGLGFDAILATKLVNVYSVCNHLHDAYNLFDRIPQRNVFLWNVLIR 185 A++ G+++H ++ +G D + L+++Y+ C + A +FD+I R+ LWN ++ Sbjct: 191 AIEHGREIHEHVVQTGWEKDVFVGAALIDMYAKCGCVGSAREVFDKILVRDAVLWNSMLA 250 Query: 186 GFAWEGPHEMALSLYYRMVEAGLQPDNFTFPFVLKACSALSALEVGREIHEHVVRSRWDS 365 ++ G + LSL MV GL+P T + A + +AL GRE+H R ++S Sbjct: 251 AYSQNGHPDACLSLCSEMVLTGLRPTEATLVTAISASADNAALPQGRELHGLSWRQEFES 310 Query: 366 DVFVGAGLIDMYAKCGCVDDARQVFDRITKRDVVLWNAMIAAYSQNGH 509 V L+DMYAKCG V AR +F+R+ + VV WNAMI Y+ +GH Sbjct: 311 HDKVKTALVDMYAKCGSVRVARNLFERLGVKRVVSWNAMITGYAMHGH 358 Score = 67.4 bits (163), Expect = 1e-09 Identities = 48/162 (29%), Positives = 78/162 (48%), Gaps = 2/162 (1%) Frame = +3 Query: 6 AVDPGKQLHARLLLSGLGFDAILATKLVNVYSVCNHLHDAYNLFDRIPQRNVFLWNVLIR 185 A+ G++LH + T LV++Y+ C + A NLF+R+ + V WN +I Sbjct: 292 ALPQGRELHGLSWRQEFESHDKVKTALVDMYAKCGSVRVARNLFERLGVKRVVSWNAMIT 351 Query: 186 GFAWEGPHEMALSLYYRMVEAGLQPDNFTFPFVLKACSALSALEVGREIHEHVVRS-RWD 362 G+A G AL L+ M +PD+ TF VL ACS LE G E ++R + D Sbjct: 352 GYAMHGHATEALDLFEEMNRVA-KPDHITFVGVLSACSHGGLLEEGWMFFETMIRDYKID 410 Query: 363 SDVFVGAGLIDMYAKCGCVDDARQVFDRI-TKRDVVLWNAMI 485 V ++D+ G +D+A + ++ D +W A++ Sbjct: 411 PTVQHYTCMVDLLGHSGRLDEAYNLIMQMKVLPDSGVWGALL 452 Score = 63.2 bits (152), Expect = 2e-08 Identities = 27/85 (31%), Positives = 48/85 (56%) Frame = +3 Query: 252 LQPDNFTFPFVLKACSALSALEVGREIHEHVVRSRWDSDVFVGAGLIDMYAKCGCVDDAR 431 L P + +L++C A A++ G+++H V + + D + L+++Y C + AR Sbjct: 71 LTPTYSNYASLLQSCIARKAIKPGKQLHAQVCLAGFGFDTVIATKLVNLYCVCDSLSSAR 130 Query: 432 QVFDRITKRDVVLWNAMIAAYSQNG 506 +FDRI K ++ LWN +I Y+ NG Sbjct: 131 LLFDRIPKHNIFLWNVLIRGYAWNG 155 >ref|XP_003546945.1| PREDICTED: putative pentatricopeptide repeat-containing protein At3g23330-like [Glycine max] Length = 631 Score = 256 bits (653), Expect = 2e-66 Identities = 117/170 (68%), Positives = 142/170 (83%) Frame = +3 Query: 3 KAVDPGKQLHARLLLSGLGFDAILATKLVNVYSVCNHLHDAYNLFDRIPQRNVFLWNVLI 182 KA++PGKQLHARL G+ ++ LATKLVN YSVCN L +A++LFD+IP+ N+FLWNVLI Sbjct: 72 KALEPGKQLHARLCQLGIAYNLDLATKLVNFYSVCNSLRNAHHLFDKIPKGNLFLWNVLI 131 Query: 183 RGFAWEGPHEMALSLYYRMVEAGLQPDNFTFPFVLKACSALSALEVGREIHEHVVRSRWD 362 R +AW GPHE A+SLY++M+E GL+PDNFT PFVLKACSALS + GR IHE V+RS W+ Sbjct: 132 RAYAWNGPHETAISLYHQMLEYGLKPDNFTLPFVLKACSALSTIGEGRVIHERVIRSGWE 191 Query: 363 SDVFVGAGLIDMYAKCGCVDDARQVFDRITKRDVVLWNAMIAAYSQNGHP 512 DVFVGA L+DMYAKCGCV DAR VFD+I RD VLWN+M+AAY+QNGHP Sbjct: 192 RDVFVGAALVDMYAKCGCVVDARHVFDKIVDRDAVLWNSMLAAYAQNGHP 241 Score = 109 bits (272), Expect = 3e-22 Identities = 58/166 (34%), Positives = 94/166 (56%) Frame = +3 Query: 9 VDPGKQLHARLLLSGLGFDAILATKLVNVYSVCNHLHDAYNLFDRIPQRNVFLWNVLIRG 188 + G+ +H R++ SG D + LV++Y+ C + DA ++FD+I R+ LWN ++ Sbjct: 175 IGEGRVIHERVIRSGWERDVFVGAALVDMYAKCGCVVDARHVFDKIVDRDAVLWNSMLAA 234 Query: 189 FAWEGPHEMALSLYYRMVEAGLQPDNFTFPFVLKACSALSALEVGREIHEHVVRSRWDSD 368 +A G + +LSL M G++P T V+ + + ++ L GREIH R + + Sbjct: 235 YAQNGHPDESLSLCCEMAAKGVRPTEATLVTVISSSADIACLPHGREIHGFGWRHGFQYN 294 Query: 369 VFVGAGLIDMYAKCGCVDDARQVFDRITKRDVVLWNAMIAAYSQNG 506 V LIDMYAKCG V A +F+R+ ++ VV WNA+I Y+ +G Sbjct: 295 DKVKTALIDMYAKCGSVKVACVLFERLREKRVVSWNAIITGYAMHG 340 Score = 79.7 bits (195), Expect = 2e-13 Identities = 51/169 (30%), Positives = 91/169 (53%), Gaps = 5/169 (2%) Frame = +3 Query: 18 GKQLHARLLLSGLGFDAILATKLVNVYSVCNHLHDAYNLFDRIPQRNVFLWNVLIRGFAW 197 G+++H G ++ + T L+++Y+ C + A LF+R+ ++ V WN +I G+A Sbjct: 279 GREIHGFGWRHGFQYNDKVKTALIDMYAKCGSVKVACVLFERLREKRVVSWNAIITGYAM 338 Query: 198 EGPHEMALSLYYRMVEAGLQPDNFTFPFVLKACSALSALEVGREIHEHVVRS-RWDSDVF 374 G AL L+ RM++ QPD+ TF L ACS L+ GR ++ +VR R + V Sbjct: 339 HGLAVEALDLFERMMKEA-QPDHITFVGALAACSRGRLLDEGRALYNLMVRDCRINPTVE 397 Query: 375 VGAGLIDMYAKCGCVDDARQVFDRITKRDVV----LWNAMIAAYSQNGH 509 ++D+ CG +D+A +D I + DV+ +W A++ + +G+ Sbjct: 398 HYTCMVDLLGHCGQLDEA---YDLIRQMDVMPDSGVWGALLNSCKTHGN 443 Score = 55.1 bits (131), Expect = 6e-06 Identities = 23/82 (28%), Positives = 50/82 (60%) Frame = +3 Query: 261 DNFTFPFVLKACSALSALEVGREIHEHVVRSRWDSDVFVGAGLIDMYAKCGCVDDARQVF 440 +++ + +L++C + ALE G+++H + + ++ + L++ Y+ C + +A +F Sbjct: 57 NHYYYASLLESCISAKALEPGKQLHARLCQLGIAYNLDLATKLVNFYSVCNSLRNAHHLF 116 Query: 441 DRITKRDVVLWNAMIAAYSQNG 506 D+I K ++ LWN +I AY+ NG Sbjct: 117 DKIPKGNLFLWNVLIRAYAWNG 138 >ref|XP_004154387.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like [Cucumis sativus] gi|449522468|ref|XP_004168248.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like [Cucumis sativus] Length = 574 Score = 252 bits (643), Expect = 3e-65 Identities = 115/170 (67%), Positives = 145/170 (85%) Frame = +3 Query: 3 KAVDPGKQLHARLLLSGLGFDAILATKLVNVYSVCNHLHDAYNLFDRIPQRNVFLWNVLI 182 KA++PGKQLHAR+ G+ F+ +LATKLVN+Y +CN L +A+ LFDRI +RN+FLWNV+I Sbjct: 15 KAIEPGKQLHARICQVGISFNPLLATKLVNLYCICNSLTNAHLLFDRISKRNLFLWNVMI 74 Query: 183 RGFAWEGPHEMALSLYYRMVEAGLQPDNFTFPFVLKACSALSALEVGREIHEHVVRSRWD 362 RG+AW GP+E+A+SLYY+M + GL PD FTFPFVLKACSALSA+E G++IH+ V+RS + Sbjct: 75 RGYAWNGPYELAISLYYQMRDYGLVPDKFTFPFVLKACSALSAMEEGKKIHKDVIRSGLE 134 Query: 363 SDVFVGAGLIDMYAKCGCVDDARQVFDRITKRDVVLWNAMIAAYSQNGHP 512 SDVFVGA LIDMYAKCGCV+ ARQVFD+I +RDVV WN+M+A YSQNG P Sbjct: 135 SDVFVGAALIDMYAKCGCVESARQVFDKIDERDVVCWNSMLATYSQNGQP 184 Score = 111 bits (278), Expect = 5e-23 Identities = 57/168 (33%), Positives = 98/168 (58%) Frame = +3 Query: 6 AVDPGKQLHARLLLSGLGFDAILATKLVNVYSVCNHLHDAYNLFDRIPQRNVFLWNVLIR 185 A++ GK++H ++ SGL D + L+++Y+ C + A +FD+I +R+V WN ++ Sbjct: 117 AMEEGKKIHKDVIRSGLESDVFVGAALIDMYAKCGCVESARQVFDKIDERDVVCWNSMLA 176 Query: 186 GFAWEGPHEMALSLYYRMVEAGLQPDNFTFPFVLKACSALSALEVGREIHEHVVRSRWDS 365 ++ G + +L+L M GL+P TF + A + L G+E+H + R ++S Sbjct: 177 TYSQNGQPDESLALCRVMAFNGLKPTEGTFVISIAASADNGLLPQGKELHGYSWRHGFES 236 Query: 366 DVFVGAGLIDMYAKCGCVDDARQVFDRITKRDVVLWNAMIAAYSQNGH 509 + V L+DMYAK G V+ AR +F+ + ++ VV WNAMI Y+ +GH Sbjct: 237 NDKVKTALMDMYAKSGSVNVARSLFELLEEKRVVSWNAMITGYAMHGH 284 Score = 62.4 bits (150), Expect = 4e-08 Identities = 44/167 (26%), Positives = 84/167 (50%), Gaps = 3/167 (1%) Frame = +3 Query: 18 GKQLHARLLLSGLGFDAILATKLVNVYSVCNHLHDAYNLFDRIPQRNVFLWNVLIRGFAW 197 GK+LH G + + T L+++Y+ ++ A +LF+ + ++ V WN +I G+A Sbjct: 222 GKELHGYSWRHGFESNDKVKTALMDMYAKSGSVNVARSLFELLEEKRVVSWNAMITGYAM 281 Query: 198 EGPHEMALSLYYRMVEAGLQPDNFTFPFVLKACSALSALEVGREIHEHVVR--SRWDSDV 371 G AL L+ M + + PD+ TF VL ACS L G+ ++ + W + V Sbjct: 282 HGHANEALDLFKEM-KGKVLPDHITFVGVLAACSHGGLLNEGKMHFRSMISDFNIWPT-V 339 Query: 372 FVGAGLIDMYAKCGCVDDA-RQVFDRITKRDVVLWNAMIAAYSQNGH 509 +ID+ CG +++A + + + + D +W A++ + +G+ Sbjct: 340 QHYTCMIDLLGHCGRLEEAYKLIMEMRVEPDAGVWGALLHSCKIHGN 386 Score = 57.8 bits (138), Expect = 9e-07 Identities = 24/75 (32%), Positives = 45/75 (60%) Frame = +3 Query: 282 VLKACSALSALEVGREIHEHVVRSRWDSDVFVGAGLIDMYAKCGCVDDARQVFDRITKRD 461 +L++C A+E G+++H + + + + L+++Y C + +A +FDRI+KR+ Sbjct: 7 LLQSCVVRKAIEPGKQLHARICQVGISFNPLLATKLVNLYCICNSLTNAHLLFDRISKRN 66 Query: 462 VVLWNAMIAAYSQNG 506 + LWN MI Y+ NG Sbjct: 67 LFLWNVMIRGYAWNG 81 >emb|CBI36131.3| unnamed protein product [Vitis vinifera] Length = 550 Score = 250 bits (638), Expect = 1e-64 Identities = 116/172 (67%), Positives = 140/172 (81%), Gaps = 3/172 (1%) Frame = +3 Query: 3 KAVDPGKQLHARLLLSGLGFDAILATKLVNVYSVCNHLHDAYNLFDRIPQRNVFLWNVLI 182 KA+ PGKQLHA++ L+G GFD ++ATKLVN+Y VC+ L A LFDRIP+ N+FLWNVLI Sbjct: 89 KAIKPGKQLHAQVCLAGFGFDTVIATKLVNLYCVCDSLSSARLLFDRIPKHNIFLWNVLI 148 Query: 183 RGFAWEGPHEMALSLYYRMVEAGLQPDNFTFPFVLKACSALSALEVGREIHEHVVRSRWD 362 RG+AW GP+E A+ LYY+M + GL PDNFTFPFVLKAC+ALSA+E GREIHEHVV++ W+ Sbjct: 149 RGYAWNGPYEAAVQLYYQMFDYGLVPDNFTFPFVLKACAALSAIEHGREIHEHVVQTGWE 208 Query: 363 SDVFVGAGLIDMYAKCGCVDDARQVFDRITKRDVVL---WNAMIAAYSQNGH 509 DVFVGA LIDMYAKCGCV AR+VFD+I RD VL WNAMI Y+ +GH Sbjct: 209 KDVFVGAALIDMYAKCGCVGSAREVFDKILVRDAVLVVSWNAMITGYAMHGH 260 Score = 69.7 bits (169), Expect = 2e-10 Identities = 47/165 (28%), Positives = 83/165 (50%), Gaps = 5/165 (3%) Frame = +3 Query: 6 AVDPGKQLHARLLLSGLGFDAILATKLVNVYSVCNHLHDAYNLFDRIPQRNVFL---WNV 176 A++ G+++H ++ +G D + L+++Y+ C + A +FD+I R+ L WN Sbjct: 191 AIEHGREIHEHVVQTGWEKDVFVGAALIDMYAKCGCVGSAREVFDKILVRDAVLVVSWNA 250 Query: 177 LIRGFAWEGPHEMALSLYYRMVEAGLQPDNFTFPFVLKACSALSALEVGREIHEHVVRS- 353 +I G+A G AL L+ M +PD+ TF VL ACS LE G E ++R Sbjct: 251 MITGYAMHGHATEALDLFEEMNRVA-KPDHITFVGVLSACSHGGLLEEGWMFFETMIRDY 309 Query: 354 RWDSDVFVGAGLIDMYAKCGCVDDARQVFDRI-TKRDVVLWNAMI 485 + D V ++D+ G +D+A + ++ D +W A++ Sbjct: 310 KIDPTVQHYTCMVDLLGHSGRLDEAYNLIMQMKVLPDSGVWGALL 354 Score = 63.2 bits (152), Expect = 2e-08 Identities = 27/85 (31%), Positives = 48/85 (56%) Frame = +3 Query: 252 LQPDNFTFPFVLKACSALSALEVGREIHEHVVRSRWDSDVFVGAGLIDMYAKCGCVDDAR 431 L P + +L++C A A++ G+++H V + + D + L+++Y C + AR Sbjct: 71 LTPTYSNYASLLQSCIARKAIKPGKQLHAQVCLAGFGFDTVIATKLVNLYCVCDSLSSAR 130 Query: 432 QVFDRITKRDVVLWNAMIAAYSQNG 506 +FDRI K ++ LWN +I Y+ NG Sbjct: 131 LLFDRIPKHNIFLWNVLIRGYAWNG 155 >ref|XP_002516017.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223544922|gb|EEF46437.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 472 Score = 245 bits (625), Expect = 3e-63 Identities = 115/170 (67%), Positives = 139/170 (81%) Frame = +3 Query: 3 KAVDPGKQLHARLLLSGLGFDAILATKLVNVYSVCNHLHDAYNLFDRIPQRNVFLWNVLI 182 KA+ PGKQLHA L GL FD +LA KLVN+Y +CN L +A LFD+IP+RN+FLWNVLI Sbjct: 101 KALIPGKQLHASLCHVGLQFDRVLAPKLVNLYCICNSLCEARLLFDKIPKRNLFLWNVLI 160 Query: 183 RGFAWEGPHEMALSLYYRMVEAGLQPDNFTFPFVLKACSALSALEVGREIHEHVVRSRWD 362 RG+AW GP+E ++ LYY++ + GL PDNFTFPFVLKACSALSA+E GR IHE V+RS W+ Sbjct: 161 RGYAWYGPYEASIQLYYKIFDYGLVPDNFTFPFVLKACSALSAIEDGRLIHEQVMRSGWE 220 Query: 363 SDVFVGAGLIDMYAKCGCVDDARQVFDRITKRDVVLWNAMIAAYSQNGHP 512 DVFVGA LIDMY+KCGCVD+AR+VF + RD VLWN+M+AAYSQNG P Sbjct: 221 RDVFVGAALIDMYSKCGCVDNAREVFHKFPVRDAVLWNSMLAAYSQNGKP 270 Score = 118 bits (296), Expect = 4e-25 Identities = 57/168 (33%), Positives = 102/168 (60%) Frame = +3 Query: 6 AVDPGKQLHARLLLSGLGFDAILATKLVNVYSVCNHLHDAYNLFDRIPQRNVFLWNVLIR 185 A++ G+ +H +++ SG D + L+++YS C + +A +F + P R+ LWN ++ Sbjct: 203 AIEDGRLIHEQVMRSGWERDVFVGAALIDMYSKCGCVDNAREVFHKFPVRDAVLWNSMLA 262 Query: 186 GFAWEGPHEMALSLYYRMVEAGLQPDNFTFPFVLKACSALSALEVGREIHEHVVRSRWDS 365 ++ G + +L+L MV AG++P T V+ A + ++AL GRE+H R R++S Sbjct: 263 AYSQNGKPDKSLALCSEMVLAGVRPTEATLVTVISASADIAALPQGRELHGFAWRRRFES 322 Query: 366 DVFVGAGLIDMYAKCGCVDDARQVFDRITKRDVVLWNAMIAAYSQNGH 509 + V LIDMYAKCG + A+ +F+++ ++VV WNA+I Y+ +G+ Sbjct: 323 NDKVKTTLIDMYAKCGTMKVAQNLFEQLRDKNVVSWNAIITGYAMHGY 370 Score = 60.1 bits (144), Expect = 2e-07 Identities = 34/98 (34%), Positives = 53/98 (54%) Frame = +3 Query: 6 AVDPGKQLHARLLLSGLGFDAILATKLVNVYSVCNHLHDAYNLFDRIPQRNVFLWNVLIR 185 A+ G++LH + + T L+++Y+ C + A NLF+++ +NV WN +I Sbjct: 304 ALPQGRELHGFAWRRRFESNDKVKTTLIDMYAKCGTMKVAQNLFEQLRDKNVVSWNAIIT 363 Query: 186 GFAWEGPHEMALSLYYRMVEAGLQPDNFTFPFVLKACS 299 G+A G L L+ RM E +PD+ TF VL ACS Sbjct: 364 GYAMHGYSNEVLILFDRMREEA-KPDHITFVGVLLACS 400