BLASTX nr result
ID: Dioscorea21_contig00038729
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00038729
         (443 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_002268999.1| PREDICTED: pentatricopeptide repeat-containi...   183   1e-44
emb|CAN80769.1| hypothetical protein VITISV_013866 [Vitis vinifera]   183   1e-44
gb|AAF43948.1|AC012188_25 Contains similarity to a hypothetical ...   181   4e-44
ref|NP_172899.1| pentatricopeptide repeat-containing protein [Ar...   181   4e-44
ref|XP_002890056.1| hypothetical protein ARALYDRAFT_888825 [Arab...   173   1e-41

>ref|XP_002268999.1| PREDICTED: pentatricopeptide repeat-containing protein At1g14470-like [Vitis vinifera]
          Length = 729

 Score = 183 bits (465), Expect = 1e-44
 Identities = 87/149 (58%), Positives = 115/149 (77%), Gaps = 2/149 (1%)
 Frame = +1

Query: 1   LGLMSDRYIRNAVLCYYAKYGPFLCALSLFDEMSD--RGIVDWNSMLSGFWKWSSKEDAC 174
           LG SD ++RNAV+ YA+ GP A +FDE+ D R + DWN+M+SG+WKW S+ A
Sbjct: 124 LGHGSDAFVRNAVIDMYARLGPIGHARKVFDEIPDYERKVADWNAMVSGYWKWESEGQAQ 183

Query: 175 KVFDEMPERNVVSWTVMVSGLARAGELEEARRVFELMPERSVVSWNAVLSGYVKNGLPDE 354
           +FD MPERNV++WT MV+G A+ +LE ARR F+ MPERSVVSWNA+LSGY +NGL +E
Sbjct: 184 WLFDVMPERNVITWTAMVTGYAKVKDLEAARRYFDCMPERSVVSWNAMLSGYAQNGLAEE 243

Query: 355 CLFLFNKMMNSGIRPNETSWVTVISACAS 441
           L LF++M+N+GI P+ET+WVTVISAC+S
Sbjct: 244 ALRLFDEMVNAGIEPDETTWVTVISACSS 272

 Score = 85.1 bits (209), Expect = 5e-15
 Identities = 52/174 (29%), Positives = 86/174 (49%), Gaps = 40/174 (22%)
 Frame = +1

Query: 34  AVLCYYAKYGPFLCALSLFDEMSDRGIVDWNSMLSGFWKWSSKEDACKVFDEMPERNV-- 207
           A++ YAK A FD M +R +V WN+MLSG+ + E+A ++FDEM +
Sbjct: 199 AMVTGYAKVKDLEAARRYFDCMPERSVVSWNAMLSGYAQNGLAEEALRLFDEMVNAGIEP 258

Query: 208 --VSWTVMVSGL-----------------------------------ARAGELEEARRVF 276
           +W ++S A+ G+L+ AR++F
Sbjct: 259 DETTWVTVISACSSRGDPCLAASLVRTLHQKRIQLNCFVRTALLDMYAKFGDLDSARKLF 318

Query: 277 ELMPERSVVSWNAVLSGYVKNGLPDECLFLFNKMMNS-GIRPNETSWVTVISAC 435
           MP R+VV+WN++++GY +NG + LF +M+ + + P+E + V+VISAC
Sbjct: 319 NTMPGRNVVTWNSMIAGYAQNGQSAMAIELFKEMITAKKLTPDEVTMVSVISAC 372

 Score = 81.6 bits (200), Expect = 6e-14
 Identities = 53/179 (29%), Positives = 86/179 (48%), Gaps = 40/179 (22%)
 Frame = +1

Query: 22  YIRNAVLCYYAKYGPFLCALSLFDEMSDRGIVDWNSMLSGFWKWSSKEDACKVFDEM--- 192
           ++R A+L YAK+G A LF+ M R +V WNSM++G+ + A ++F EM
Sbjct: 296 FVRTALLDMYAKFGDLDSARKLFNTMPGRNVVTWNSMIAGYAQNGQSAMAIELFKEMITA 355

Query: 193 -----PERNVVS---------------WTV-----------------MVSGLARAGELEE 261
           E +VS W V M+ +R G +E+
Sbjct: 356 KKLTPDEVTMVSVISACGHLGALELGNWVVRFLTENQIKLSISGHNAMIFMYSRCGSMED 415

Query: 262 ARRVFELMPERSVVSWNAVLSGYVKNGLPDECLFLFNKMMNSGIRPNETSWVTVISACA 438
           A+RVF+ M R VVS+N ++SG+ +G E + L + M GI P+ +++ V++AC+
Sbjct: 416 AKRVFQEMATRDVVSYNTLISGFAAHGHGVEAINLMSTMKEGGIEPDRVTFIGVLTACS 474

>emb|CAN80769.1| hypothetical protein VITISV_013866 [Vitis vinifera]
          Length = 761

 Score = 183 bits (464), Expect = 1e-44
 Identities = 87/149 (58%), Positives = 115/149 (77%), Gaps = 2/149 (1%)
 Frame = +1

Query: 1   LGLMSDRYIRNAVLCYYAKYGPFLCALSLFDEMSD--RGIVDWNSMLSGFWKWSSKEDAC 174
           LG SD ++RNAV+ YA+ GP A +FDE+ D R + DWN+M+SG+WKW S+ A
Sbjct: 124 LGHGSDAFVRNAVIDMYARLGPIGHARKVFDEIPDYERKVADWNAMVSGYWKWESEGQAQ 183

Query: 175 KVFDEMPERNVVSWTVMVSGLARAGELEEARRVFELMPERSVVSWNAVLSGYVKNGLPDE 354
           +FD MPERNV++WT MV+G A+ +LE ARR F+ MPERSVVSWNA+LSGY +NGL +E
Sbjct: 184 WLFDVMPERNVITWTAMVTGYAKVKDLEAARRYFDCMPERSVVSWNAMLSGYAQNGLAEE 243

Query: 355 CLFLFNKMMNSGIRPNETSWVTVISACAS 441
           L LF++M+N+GI P+ET+WVTVISAC+S
Sbjct: 244 VLRLFDEMVNAGIEPDETTWVTVISACSS 272

 Score = 81.3 bits (199), Expect = 8e-14
 Identities = 57/179 (31%), Positives = 86/179 (48%), Gaps = 44/179 (24%)
 Frame = +1

Query: 31  NAVLCYYAKYGPFLCALSLFDEMSDRGI----VDWNSMLSGFWKWSSKEDAC-------- 174
           NA+L YA+ G L LFDEM + GI W +++S SS+ D C
Sbjct: 229 NAMLSGYAQNGLAEEVLRLFDEMVNAGIEPDETTWVTVISAC---SSRGDPCLAASLVRT 285

Query: 175 ------------------------------KVFDEMPE-RNVVSWTVMVSGLARAGELEE 261
           ++FDE+ RN V+W M+S R G L+
Sbjct: 286 LHQKQIQLNCFVRTALLDMYAKCGSIGAARRIFDELGAYRNSVTWNAMISAYTRVGNLDS 345

Query: 262 ARRVFELMPERSVVSWNAVLSGYVKNGLPDECLFLFNKMMNS-GIRPNETSWVTVISAC 435
           AR +F MP R+VV+WN++++GY +NG + LF +M+ + + P+E + V+VISAC
Sbjct: 346 ARELFNTMPGRNVVTWNSMIAGYAQNGQSAMAIELFKEMITAKKLTPDEVTMVSVISAC 404

 Score = 75.5 bits (184), Expect = 4e-12
 Identities = 52/186 (27%), Positives = 85/186 (45%), Gaps = 40/186 (21%)
 Frame = +1

Query: 1   LGLMSDRYIRNAVLCYYAKYGPFLCALSLFDEMSDRGIVDWNSMLSGFWKWSSKEDACKV 180
           LG + NA++ Y + G A LF+ M R +V WNSM++G+ + A ++
Sbjct: 321 LGAYRNSVTWNAMISAYTRVGNLDSARELFNTMPGRNVVTWNSMIAGYAQNGQSAMAIEL 380

Query: 181 FDEM--------PERNVVS---------------WTV-----------------MVSGLA 240
           F EM E +VS W V M+ +
Sbjct: 381 FKEMITAKKLTPDEVTMVSVISACGHLGALELGNWVVRFLTENQIKLSISGHNAMIFMYS 440

Query: 241 RAGELEEARRVFELMPERSVVSWNAVLSGYVKNGLPDECLFLFNKMMNSGIRPNETSWVT 420
           R G +E+A+RVF+ M R VVS+N ++SG+ +G E + L + M GI P+ +++
Sbjct: 441 RCGSMEDAKRVFQEMATRDVVSYNTLISGFAAHGHGVEAINLMSTMKEGGIEPDRVTFIG 500

Query: 421 VISACA 438
           V++AC+
Sbjct: 501 VLTACS 506

>gb|AAF43948.1|AC012188_25 Contains similarity to a hypothetical protein from Arabidopsis thaliana gb|AC004044.1 and contains two domains PF|01535 of unknown function [Arabidopsis thaliana]
          Length = 455

 Score = 181 bits (460), Expect = 4e-44
 Identities = 80/146 (54%), Positives = 108/146 (73%)
 Frame = +1

Query: 1   LGLMSDRYIRNAVLCYYAKYGPFLCALSLFDEMSDRGIVDWNSMLSGFWKWSSKEDACKV 180
           LG D Y+RN ++ Y K+ A +FD++S R DWN M+SG+WKW +KE+ACK+
Sbjct: 45  LGFFKDPYVRNVIMDMYVKHESVESARKVFDQISQRKGSDWNVMISGYWKWGNKEEACKL 104

Query: 181 FDEMPERNVVSWTVMVSGLARAGELEEARRVFELMPERSVVSWNAVLSGYVKNGLPDECL 360
           FD MPE +VVSWTVM++G A+ +LE AR+ F+ MPE+SVVSWNA+LSGY +NG ++ L
Sbjct: 105 FDMMPENDVVSWTVMITGFAKVKDLENARKYFDRMPEKSVVSWNAMLSGYAQNGFTEDAL 164

Query: 361 FLFNKMMNSGIRPNETSWVTVISACA 438
           LFN M+ G+RPNET+WV VISAC+
Sbjct: 165 RLFNDMLRLGVRPNETTWVIVISACS 190

 Score = 81.6 bits (200), Expect = 6e-14
 Identities = 36/93 (38%), Positives = 65/93 (69%), Gaps = 2/93 (2%)
 Frame = +1

Query: 163 EDACKVFDEM-PERNVVSWTVMVSGLARAGELEEARRVFELMPERSVVSWNAVLSGYVKN 339
           + A ++F+E+ +RN+V+W M+SG R G++ AR++F+ MP+R+VVSWN++++GY N
Sbjct: 231 QSARRIFNELGTQRNLVTWNAMISGYTRIGDMSSARQLFDTMPKRNVVSWNSLIAGYAHN 290

Query: 340 GLPDECLFLFNKMMNSG-IRPNETSWVTVISAC 435
           G + F M++ G +P+E + ++V+SAC
Sbjct: 291 GQAALAIEFFEDMIDYGDSKPDEVTMISVLSAC 323

 Score = 74.7 bits (182), Expect = 7e-12
 Identities = 51/185 (27%), Positives = 81/185 (43%), Gaps = 40/185 (21%)
 Frame = +1

Query: 1   LGLMSDRYIRNAVLCYYAKYGPFLCALSLFDEMSDRGIVDWNSMLSGFWKWSSKEDACKV 180
           LG + NA++ Y + G A LFD M R +V WNS+++G+ A +
Sbjct: 240 LGTQRNLVTWNAMISGYTRIGDMSSARQLFDTMPKRNVVSWNSLIAGYAHNGQAALAIEF 299

Query: 181 FDEM-------PER---------------------------------NVVSWTVMVSGLA 240
           F++M P+ N + ++ A
Sbjct: 300 FEDMIDYGDSKPDEVTMISVLSACGHMADLELGDCIVDYIRKNQIKLNDSGYRSLIFMYA 359

Query: 241 RAGELEEARRVFELMPERSVVSWNAVLSGYVKNGLPDECLFLFNKMMNSGIRPNETSWVT 420
           R G L EA+RVF+ M ER VVS+N + + + NG E L L +KM + GI P+ ++ +
Sbjct: 360 RGGNLWEAKRVFDEMKERDVVSYNTLFTAFAANGDGVETLNLLSKMKDEGIEPDRVTYTS 419

Query: 421 VISAC 435
           V++AC
Sbjct: 420 VLTAC 424

 Score = 63.5 bits (153), Expect = 2e-08
 Identities = 32/89 (35%), Positives = 53/89 (59%), Gaps = 1/89 (1%)
 Frame = +1

Query: 22  YIRNAVLCYYAKYGPFLCALSLFDEM-SDRGIVDWNSMLSGFWKWSSKEDACKVFDEMPE 198
           +++ A+L +AK A +F+E+ + R +V WN+M+SG+ + A ++FD MP+
Sbjct: 215 FVKTALLDMHAKCRDIQSARRIFNELGTQRNLVTWNAMISGYTRIGDMSSARQLFDTMPK 274

Query: 199 RNVVSWTVMVSGLARAGELEEARRVFELM 285
           RNVVSW +++G A G+ A FE M
Sbjct: 275 RNVVSWNSLIAGYAHNGQAALAIEFFEDM 303

>ref|NP_172899.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
 gi|193806395|sp|Q9M9R6.2|PPR43_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At1g14470
 gi|332191047|gb|AEE29168.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
          Length = 540

 Score = 181 bits (460), Expect = 4e-44
 Identities = 80/146 (54%), Positives = 108/146 (73%)
 Frame = +1

Query: 1   LGLMSDRYIRNAVLCYYAKYGPFLCALSLFDEMSDRGIVDWNSMLSGFWKWSSKEDACKV 180
           LG D Y+RN ++ Y K+ A +FD++S R DWN M+SG+WKW +KE+ACK+
Sbjct: 130 LGFFKDPYVRNVIMDMYVKHESVESARKVFDQISQRKGSDWNVMISGYWKWGNKEEACKL 189

Query: 181 FDEMPERNVVSWTVMVSGLARAGELEEARRVFELMPERSVVSWNAVLSGYVKNGLPDECL 360
           FD MPE +VVSWTVM++G A+ +LE AR+ F+ MPE+SVVSWNA+LSGY +NG ++ L
Sbjct: 190 FDMMPENDVVSWTVMITGFAKVKDLENARKYFDRMPEKSVVSWNAMLSGYAQNGFTEDAL 249

Query: 361 FLFNKMMNSGIRPNETSWVTVISACA 438
           LFN M+ G+RPNET+WV VISAC+
Sbjct: 250 RLFNDMLRLGVRPNETTWVIVISACS 275

 Score = 81.6 bits (200), Expect = 6e-14
 Identities = 36/93 (38%), Positives = 65/93 (69%), Gaps = 2/93 (2%)
 Frame = +1

Query: 163 EDACKVFDEM-PERNVVSWTVMVSGLARAGELEEARRVFELMPERSVVSWNAVLSGYVKN 339
           + A ++F+E+ +RN+V+W M+SG R G++ AR++F+ MP+R+VVSWN++++GY N
Sbjct: 316 QSARRIFNELGTQRNLVTWNAMISGYTRIGDMSSARQLFDTMPKRNVVSWNSLIAGYAHN 375

Query: 340 GLPDECLFLFNKMMNSG-IRPNETSWVTVISAC 435
           G + F M++ G +P+E + ++V+SAC
Sbjct: 376 GQAALAIEFFEDMIDYGDSKPDEVTMISVLSAC 408

 Score = 74.7 bits (182), Expect = 7e-12
 Identities = 51/185 (27%), Positives = 81/185 (43%), Gaps = 40/185 (21%)
 Frame = +1

Query: 1   LGLMSDRYIRNAVLCYYAKYGPFLCALSLFDEMSDRGIVDWNSMLSGFWKWSSKEDACKV 180
           LG + NA++ Y + G A LFD M R +V WNS+++G+ A +
Sbjct: 325 LGTQRNLVTWNAMISGYTRIGDMSSARQLFDTMPKRNVVSWNSLIAGYAHNGQAALAIEF 384

Query: 181 FDEM-------PER---------------------------------NVVSWTVMVSGLA 240
           F++M P+ N + ++ A
Sbjct: 385 FEDMIDYGDSKPDEVTMISVLSACGHMADLELGDCIVDYIRKNQIKLNDSGYRSLIFMYA 444

Query: 241 RAGELEEARRVFELMPERSVVSWNAVLSGYVKNGLPDECLFLFNKMMNSGIRPNETSWVT 420
           R G L EA+RVF+ M ER VVS+N + + + NG E L L +KM + GI P+ ++ +
Sbjct: 445 RGGNLWEAKRVFDEMKERDVVSYNTLFTAFAANGDGVETLNLLSKMKDEGIEPDRVTYTS 504

Query: 421 VISAC 435
           V++AC
Sbjct: 505 VLTAC 509

 Score = 63.5 bits (153), Expect = 2e-08
 Identities = 32/89 (35%), Positives = 53/89 (59%), Gaps = 1/89 (1%)
 Frame = +1

Query: 22  YIRNAVLCYYAKYGPFLCALSLFDEM-SDRGIVDWNSMLSGFWKWSSKEDACKVFDEMPE 198
           +++ A+L +AK A +F+E+ + R +V WN+M+SG+ + A ++FD MP+
Sbjct: 300 FVKTALLDMHAKCRDIQSARRIFNELGTQRNLVTWNAMISGYTRIGDMSSARQLFDTMPK 359

Query: 199 RNVVSWTVMVSGLARAGELEEARRVFELM 285
           RNVVSW +++G A G+ A FE M
Sbjct: 360 RNVVSWNSLIAGYAHNGQAALAIEFFEDM 388

>ref|XP_002890056.1| hypothetical protein ARALYDRAFT_888825 [Arabidopsis lyrata subsp. lyrata]
 gi|297335898|gb|EFH66315.1| hypothetical protein ARALYDRAFT_888825 [Arabidopsis lyrata subsp. lyrata]
          Length = 790

 Score = 173 bits (439), Expect = 1e-41
 Identities = 80/148 (54%), Positives = 110/148 (74%), Gaps = 2/148 (1%)
 Frame = +1

Query: 1   LGLMSDRYIRNAVLCYYAKYGPFLCALSLFDEMSDRGIVDWNSMLSGFWKWSSKEDACKV 180
           +G+ D Y+RN ++ YAK+ A +FD+++ R DWN M+SG+WK+ +KE+ACK+
Sbjct: 130 MGIFKDPYVRNVIMDMYAKHESVESARKVFDQITHRKGSDWNVMISGYWKYGNKEEACKL 189

Query: 181 FDEMPER--NVVSWTVMVSGLARAGELEEARRVFELMPERSVVSWNAVLSGYVKNGLPDE 354
           FD MPE +VVSWTVM++G A+ +LE ARR F+ MPE+SVVSWNA+LSGY +NG +E
Sbjct: 190 FDMMPEGKIDVVSWTVMITGFAKLKDLENARRCFDCMPEKSVVSWNAMLSGYSQNGFTEE 249

Query: 355 CLFLFNKMMNSGIRPNETSWVTVISACA 438
           L LFN M+ G+RPNET+WV VISAC+
Sbjct: 250 TLRLFNDMLRLGVRPNETTWVIVISACS 277

 Score = 84.0 bits (206), Expect = 1e-14
 Identities = 37/91 (40%), Positives = 65/91 (71%), Gaps = 2/91 (2%)
 Frame = +1

Query: 169 ACKVFDEM-PERNVVSWTVMVSGLARAGELEEARRVFELMPERSVVSWNAVLSGYVKNGL 345
           A ++F+E+ ++N+V+W M+SG R G++ AR++F+ MP+R+VVSWN+V++GY NG
Sbjct: 320 ARRIFNELGTQKNLVTWNAMISGYTRIGDMSSARQLFDTMPKRNVVSWNSVIAGYAHNGQ 379

Query: 346 PDECLFLFNKMMNSG-IRPNETSWVTVISAC 435
           P + F M++ G +P+E + ++V+SAC
Sbjct: 380 PALAIEFFEDMIDYGDSKPDEVTMISVLSAC 410

 Score = 74.3 bits (181), Expect = 1e-11
 Identities = 51/185 (27%), Positives = 81/185 (43%), Gaps = 40/185 (21%)
 Frame = +1

Query: 1   LGLMSDRYIRNAVLCYYAKYGPFLCALSLFDEMSDRGIVDWNSMLSGFWKWSSKEDACKV 180
           LG + NA++ Y + G A LFD M R +V WNS+++G+ A +
Sbjct: 327 LGTQKNLVTWNAMISGYTRIGDMSSARQLFDTMPKRNVVSWNSVIAGYAHNGQPALAIEF 386

Query: 181 FDEM-------PER---------------------------------NVVSWTVMVSGLA 240
           F++M P+ N + ++ A
Sbjct: 387 FEDMIDYGDSKPDEVTMISVLSACGHMGDLELGDCIVDYIGKKQIKLNDSGYRSLIFMYA 446

Query: 241 RAGELEEARRVFELMPERSVVSWNAVLSGYVKNGLPDECLFLFNKMMNSGIRPNETSWVT 420
           R G L EA+RVF+ M ER VVS+N + S + NG + L L +KM + GI P+ ++ +
Sbjct: 447 RCGNLWEAKRVFDEMKERDVVSYNTLFSAFAANGDGVKTLNLLSKMKDEGIEPDRVTYTS 506

Query: 421 VISAC 435
           V++AC
Sbjct: 507 VLTAC 511