BLASTX nr result
ID: Dioscorea21_contig00003538
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Dioscorea21_contig00003538 (1908 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002516159.1| pentatricopeptide repeat-containing protein,... 532 e-148 ref|XP_002281998.2| PREDICTED: pentatricopeptide repeat-containi... 524 e-146 ref|XP_002324203.1| predicted protein [Populus trichocarpa] gi|2... 523 e-146 ref|XP_003544580.1| PREDICTED: uncharacterized protein LOC100799... 516 e-144 ref|XP_003550222.1| PREDICTED: uncharacterized protein LOC100796... 513 e-143 >ref|XP_002516159.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223544645|gb|EEF46161.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 1439 Score = 532 bits (1370), Expect = e-148 Identities = 311/582 (53%), Positives = 375/582 (64%), Gaps = 26/582 (4%) Frame = -1 Query: 1896 TESTMANKPTALNPNAAEFIPSAFRS---PLGSAKIVDST--RLDIAGTSGKAVLDRXXX 1732 T+ + +K TALNPNAAEF+P + RS P GS +T R +G GKAVLDR Sbjct: 13 TKLNIPSKATALNPNAAEFVPFSLRSLSSPSGSTSNAAATTARFATSGPVGKAVLDRSES 72 Query: 1731 XXXXXXXXEAHRFWRHQLPDDITPDFKAMGENELQDPGNLSLDSLSIHESAETLRFPVVA 1552 EAH+FWRHQLPDDITPDFK MGE+E Q G LSL LS+H+S+E +FP Sbjct: 73 SISTTSDEEAHQFWRHQLPDDITPDFKVMGEDESQSLGGLSLAGLSLHDSSEVPKFPASV 132 Query: 1551 ASQMLGATQGRSLGSDDLN---LLENLAYSGSAYAGDHSSGV-FMTSAASAWEEQFVNGD 1384 S + T+ + +N E + Y+ ++Y D +S ++ W++Q +N D Sbjct: 133 GSGYI-LTEQQEPSPRHINGSSFSEKMRYAIASYGEDPTSAAGYLNLPTKPWDKQIINND 191 Query: 1383 QNLID-QEALLYNGDSSANFLNXXXXXXXXXXXA-INPVDFLASQFPGFAAESLAEVYYA 1210 L + +E YNG+S F+N +NP+DFLAS FPGFAAESLAEVY+A Sbjct: 192 
HLLGNGREVHPYNGNSRRGFMNDMLGEQAIVDEPDMNPLDFLASHFPGFAAESLAEVYFA 251 Query: 1209 NGCDLNLTIEILTQLELQVDTGFGQNLNSNTPASPNLSPLDFPALPLSDTQNGLSKVSRD 1030 NG DLNLTIE+LTQLELQVD GF QN+NS ++PNLS +DFPALP+ D+QN SK S D Sbjct: 252 NGYDLNLTIEMLTQLELQVDGGFNQNMNSKALSAPNLSAMDFPALPVPDSQNSPSKYSGD 311 Query: 1029 DIGQAPNTYRSA-------------TSIFGGATDFASAVRKVASQDSGQWKYERNGSGDG 889 DI Q+ N YRS+ T GGA DFASAVRK+ASQDSG WKYERNGS D Sbjct: 312 DIQQSGNPYRSSDKENILLFKSSSSTPSRGGAIDFASAVRKLASQDSGIWKYERNGSADS 371 Query: 888 RIGSSRMSQLLASS-QNGNAKLAYGDKMQSVRSSRAAPVWLETGEAVASMYSESREEARD 712 +GSSR S +LASS +GN + Y ++ Q+ S+RAAPVWLETGEAVA+MYSE REEARD Sbjct: 372 AVGSSRSSHVLASSYSSGNGRGIYSERAQNRGSARAAPVWLETGEAVANMYSELREEARD 431 Query: 711 FARLRNACFDQARQAYLIGNKALAKELSEKGQLYNIQMKAAHGKAREAIYRQRNPXXXXX 532 ARLRNA F+QARQAYLIGNKALAKELS KGQL+N+ MKAAHGKA+E+IYR RNP Sbjct: 432 HARLRNAYFEQARQAYLIGNKALAKELSVKGQLHNMHMKAAHGKAQESIYRLRNP----- 486 Query: 531 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLKHELSTLRSAARAVGRQAQVMI 352 LKHELS LRS ARA ++ QV I Sbjct: 487 --------ISSEMQGNGRGHERMIDLHGLHVSEAIHVLKHELSVLRSTARAADQRLQVYI 538 Query: 351 CVGTGHHTKGSRTPARLPVAVEQYLL-EENLHYTQPQPGLLR 229 CVGTGHHT+GSRTPARLP+AV+QYLL EE L YT+PQPGLLR Sbjct: 539 CVGTGHHTRGSRTPARLPIAVQQYLLEEEGLDYTEPQPGLLR 580 >ref|XP_002281998.2| PREDICTED: pentatricopeptide repeat-containing protein At4g33170 [Vitis vinifera] Length = 1580 Score = 524 bits (1349), Expect = e-146 Identities = 307/568 (54%), Positives = 370/568 (65%), Gaps = 18/568 (3%) Frame = -1 Query: 1875 KPTALNPNAAEFIPSAFRSPLGSAKIVD-STRLDIAGTSGKAVLDRXXXXXXXXXXXEAH 1699 K T LNPNAAEF+P A RS GS D S R +GT GKA LDR EAH Sbjct: 20 KVTTLNPNAAEFVPFALRSSSGSTSTGDASARFTPSGTLGKAKLDRSESSISNNSDEEAH 79 Query: 1698 RFWRHQLPDDITPDFKAMGENELQDPGNLSLDSLSIHESAETLRFPVVAASQMLGATQGR 1519 ++WR QLPDDITPDFK MGE+E Q G LSL LS+H+S ET RF + +L Q Sbjct: 80 QYWRCQLPDDITPDFKVMGEDESQALGGLSLAGLSLHDSTETSRFSA-GSGYILNEQQEL 138 Query: 1518 SLGSDDLN-LLENLAYSGSAYAGDHSSGVFMTSAASAWEEQFVNGDQNLID-QEALLYNG 1345 S + N + + YS S Y D SS F+ W++Q V+GDQ L + +E YNG 
Sbjct: 139 SPHHVNGNSFTDKMRYSASPYGEDPSSATFLHLPNKPWDKQIVHGDQLLSNGREGTPYNG 198 Query: 1344 DSSANFLNXXXXXXXXXXXA-INPVDFLASQFPGFAAESLAEVYYANGCDLNLTIEILTQ 1168 +S F+N +NP++FLA QFPGFAAESL EVY+AN CDLN+TIE+LTQ Sbjct: 199 NSRHGFVNDMLNEHAMVDETDMNPLEFLALQFPGFAAESLTEVYFANECDLNMTIEMLTQ 258 Query: 1167 LELQVDTGFGQNLNSNTPASPNLSPLDFPALPLSDTQNGLSKVSRDDIGQAPNTYRSA-- 994 LELQVD GF QN+NS T ++PNLS LDFP+LP+ D QNGL K + DDI Q N YRS+ Sbjct: 259 LELQVDGGFSQNMNSKTLSAPNLSGLDFPSLPVPDGQNGLPKYAGDDIQQGVNPYRSSDK 318 Query: 993 ---------TSIFG-GATDFASAVRKVASQDSGQWKYERNGSGDGRIGSSRMSQLLASSQ 844 +SI GATDFASAVRK+A+QDSG WK++RNG+ D +GSSR S +LASS Sbjct: 319 DNLLMFKSNSSIPSRGATDFASAVRKLATQDSGIWKFDRNGTADVNVGSSRSSHVLASSY 378 Query: 843 N-GNAKLAYGDKMQSVRSSRAAPVWLETGEAVASMYSESREEARDFARLRNACFDQARQA 667 N G+ + YGD++Q+ S+R AP W+ETGEAVA+MYSE REEARD ARLRNA F+QA+QA Sbjct: 379 NSGHGRGTYGDRLQNRGSARVAPAWVETGEAVANMYSELREEARDHARLRNAYFEQAQQA 438 Query: 666 YLIGNKALAKELSEKGQLYNIQMKAAHGKAREAIYRQRNPXXXXXXXXXXXXXXXXXXXX 487 YLIGNKALAKELS KG+L++I MKAAHGKA+EAIYRQRNP Sbjct: 439 YLIGNKALAKELSLKGKLHSIHMKAAHGKAQEAIYRQRNP--------------VSPELQ 484 Query: 486 XXXXXXXXXXXXXXXXXXXXXXLKHELSTLRSAARAVGRQAQVMICVGTGHHTKGSRTPA 307 LKHEL+ LRS AR+ ++ QV ICVGTGHHT+GSRTPA Sbjct: 485 GNARGERMIDLHGLHVSEAIHVLKHELNVLRSTARSADQRLQVYICVGTGHHTRGSRTPA 544 Query: 306 RLPVAVEQYLL-EENLHYTQPQPGLLRV 226 RLPVAV++YLL EE L YT+PQ GLLRV Sbjct: 545 RLPVAVQRYLLEEEGLDYTEPQAGLLRV 572 >ref|XP_002324203.1| predicted protein [Populus trichocarpa] gi|222865637|gb|EEF02768.1| predicted protein [Populus trichocarpa] Length = 583 Score = 523 bits (1346), Expect = e-146 Identities = 308/588 (52%), Positives = 376/588 (63%), Gaps = 25/588 (4%) Frame = -1 Query: 1905 SKDTESTMANKPTALNPNAAEFIPSAFRS---PLGSAKIVD--STRLDIAGTSGKAVLDR 1741 + DT+ ++ NK T LNPNAAEF+P + RS P GS +T+L +GT GK+VLDR Sbjct: 10 TNDTKLSLPNKATTLNPNAAEFVPFSLRSSSSPSGSTSNTTDAATKLATSGTVGKSVLDR 69 Query: 1740 XXXXXXXXXXXEAHRFWRHQLPDDITPDFKAMGENELQDPGNLSLDSLSIHESAETLRFP 1561 EAH+FWRHQLPDDITPDFK M E+E Q G LSL LS+H+S+E RF 
Sbjct: 70 SESSVSNASDDEAHQFWRHQLPDDITPDFKVMNEDESQGLGGLSLAGLSLHDSSEVPRFH 129 Query: 1560 VVAASQMLGATQGRSLGSDDLN---LLENLAYSGSAYAGDHSSGVFMTSAASAWEEQFVN 1390 + S + T+ + +N EN+ Y+ ++Y D +S F+ W++Q N Sbjct: 130 ASSRSGYV-LTEQQEPSPHHINGSSFSENMRYAVASYGEDPTSASFLNLPTKPWDKQIAN 188 Query: 1389 GDQNLID-QEALLYNGDSSANFLNXXXXXXXXXXXA-INPVDFLASQFPGFAAESLAEVY 1216 DQ L + +E YNG+S F + INP++FLASQFPGFAAESLAEVY Sbjct: 189 SDQLLSNGREVHPYNGNSRHGFRSEILGEHAIVDDTEINPLEFLASQFPGFAAESLAEVY 248 Query: 1215 YANGCDLNLTIEILTQLELQVDTGFGQNLNSNTPASP-NLSPLDFPALPLSDTQNGLSKV 1039 +AN CDLNLTIE+LTQLELQVD GF Q NS T ++P NLS LDFPAL + D QNG SK Sbjct: 249 FANACDLNLTIEMLTQLELQVDGGFNQTTNSKTVSAPTNLSALDFPALTVPDNQNGPSKY 308 Query: 1038 SRDDIGQAPNTYRSATS----IFG--------GATDFASAVRKVASQDSGQWKYERNGSG 895 + DD+ QA YRS+ +F GA DFASAVRK+ASQDS W ++RNGS Sbjct: 309 AGDDLQQAGIPYRSSNKDNMLVFKSGASFSSRGAVDFASAVRKLASQDSSMWNHDRNGSA 368 Query: 894 DGRIGSSRMSQLLASSQNG-NAKLAYGDKMQSVRSSRAAPVWLETGEAVASMYSESREEA 718 D +GSSR S +LAS+ +G + + Y D+ QS S +AAPVWLETGEAVASMYSE REEA Sbjct: 369 DSTVGSSRSSHVLASAYSGGHGRGIYADRSQSRGSGQAAPVWLETGEAVASMYSEMREEA 428 Query: 717 RDFARLRNACFDQARQAYLIGNKALAKELSEKGQLYNIQMKAAHGKAREAIYRQRNPXXX 538 RD AR+RNA +QARQAYLIGNKALAKELS KGQL+N+ MK AHGKA+E+IYRQRNP Sbjct: 429 RDHARIRNAYLEQARQAYLIGNKALAKELSAKGQLHNMHMKEAHGKAQESIYRQRNP--- 485 Query: 537 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLKHELSTLRSAARAVGRQAQV 358 LKHELS LRS ARA ++ QV Sbjct: 486 ----------ASLEMQGTGRGHERMIDLHGLHVTEAIHVLKHELSILRSTARAADQRLQV 535 Query: 357 MICVGTGHHTKGSRTPARLPVAVEQYLL-EENLHYTQPQPGLLRVVIY 217 ICVGTGHHT+G+RTPARLPVAV++YLL EE L YT+PQPGLLRVV+Y Sbjct: 536 YICVGTGHHTRGARTPARLPVAVQRYLLEEEGLDYTEPQPGLLRVVMY 583 >ref|XP_003544580.1| PREDICTED: uncharacterized protein LOC100799961 [Glycine max] Length = 572 Score = 516 bits (1330), Expect = e-144 Identities = 301/580 (51%), Positives = 376/580 (64%), Gaps = 19/580 (3%) Frame = -1 Query: 1899 DTESTMANKPTALNPNAAEFIPSAFRS-PLGSAKIVDST-RLDIAGTSGKAVLDRXXXXX 1726 D + + NK T LNPNAAEF+P A RS P GS +VD+T R AG+ GKAVLDR 
Sbjct: 11 DAKLSSLNKATYLNPNAAEFVPFALRSSPSGSTSLVDATARFAAAGSLGKAVLDRAESSI 70 Query: 1725 XXXXXXEAHRFWRHQLPDDITPDFKAMGENELQDPGNLSLDSLSIHESAETLRFPVVAAS 1546 EAH++WR QLPDDITPDFK MGE+E Q NLSL LSI++ E+ FP Sbjct: 71 SNNSDDEAHQYWRCQLPDDITPDFKVMGEDESQGLNNLSLAGLSINDDNESSMFPSSKGF 130 Query: 1545 QMLGATQGRSLGSDDLN---LLENLAYSGSAYAGDHSSGVFMTSAASAWEEQFVNGDQNL 1375 + + + + L LN + L +S S Y + SS + S+A W+ Q N D ++ Sbjct: 131 RYI-LNEQQELSQQHLNGNTFADKLRFSNSTYRDEPSSASILNSSAKPWDRQIRNTDLHV 189 Query: 1374 ID-QEALLYNGDSSANFLNXXXXXXXXXXXA-INPVDFLASQFPGFAAESLAEVYYANGC 1201 QEAL+Y+ ++ F N +NP++FLAS FPGFA+ESL+EV++ANGC Sbjct: 190 SSGQEALVYDDNTGHGFFNDVFAGNSLVNDTDLNPLEFLASLFPGFASESLSEVFFANGC 249 Query: 1200 DLNLTIEILTQLELQVDTGFGQNLNSNTPASPNLSPLDFPALPLSDTQNGLSKVSRDDIG 1021 DL+LTIE+LTQLE+QVD+ F QN + T +SPNLS +DFPAL S+ QN SK + D++ Sbjct: 250 DLHLTIEMLTQLEIQVDSSFNQNPSPKTLSSPNLSAMDFPALTSSNGQNA-SKYAADNVQ 308 Query: 1020 QAPNTY----------RSATSIFG-GATDFASAVRKVASQDSGQWKYERNGSGDGRIGSS 874 Q+ N Y +S +SI GA DFASAVRK+ASQDSG WKY++NGSGD GSS Sbjct: 309 QSGNPYLSSDKDMLMFKSGSSIPSRGAVDFASAVRKLASQDSGIWKYDKNGSGDASTGSS 368 Query: 873 RMSQLLASSQNGN-AKLAYGDKMQSVRSSRAAPVWLETGEAVASMYSESREEARDFARLR 697 R LAS+ NG ++ GD++Q+ S+RAAPVWLETG+AVA+MYSE REEARD ARLR Sbjct: 369 RSLNALASAYNGGQGRVNNGDRLQNRGSARAAPVWLETGDAVANMYSELREEARDHARLR 428 Query: 696 NACFDQARQAYLIGNKALAKELSEKGQLYNIQMKAAHGKAREAIYRQRNPXXXXXXXXXX 517 NA F+QARQAYL+GNKALAKELS KGQL+N+ MKAAHGKA+E+IYRQRNP Sbjct: 429 NAYFEQARQAYLVGNKALAKELSVKGQLHNVHMKAAHGKAQESIYRQRNP---------- 478 Query: 516 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLKHELSTLRSAARAVGRQAQVMICVGTG 337 LKHELS LRS ARA ++ QV ICVGTG Sbjct: 479 ------VAPENGRGPQRMIDLHGLHVSEAIHVLKHELSVLRSTARAPEQRLQVYICVGTG 532 Query: 336 HHTKGSRTPARLPVAVEQYLLEENLHYTQPQPGLLRVVIY 217 HHT+GSRTPARLP+AV++YLLEE L +T+PQPGLL VVIY Sbjct: 533 HHTRGSRTPARLPIAVQRYLLEEGLDFTEPQPGLLCVVIY 572 >ref|XP_003550222.1| PREDICTED: uncharacterized protein LOC100796128 isoform 1 [Glycine max] gi|356563956|ref|XP_003550223.1| PREDICTED: uncharacterized protein LOC100796128 
isoform 2 [Glycine max] Length = 573 Score = 513 bits (1322), Expect = e-143 Identities = 306/581 (52%), Positives = 375/581 (64%), Gaps = 20/581 (3%) Frame = -1 Query: 1899 DTESTMANKPTALNPNAAEFIPSAFRS-PLGSAKIVDST-RLDIAGTSGKAVLDRXXXXX 1726 DT+ + NK T LNPNAAEF+P A RS P GS VD+ R AG+ GKAVLDR Sbjct: 11 DTKLSSLNKATYLNPNAAEFVPFALRSSPSGSTSSVDAAARFTTAGSLGKAVLDRSESSI 70 Query: 1725 XXXXXXEAHRFWRHQLPDDITPDFKAMGENELQDPGNLSLDSLSIHESAETLRFPVVAAS 1546 EAH++WR QLPDDITPDFK MGE+E Q NLSL LSI++ E+ FP S Sbjct: 71 SNNSDDEAHQYWRCQLPDDITPDFKVMGEDESQGLNNLSLAGLSINDDNESSMFPSSKGS 130 Query: 1545 QMLGATQGRSLGSDDLN---LLENLAYSGSAYAGDHSSGVFMTSAASAWEEQFVNGDQNL 1375 + + + L LN + L +S S Y + SSG + S+A W+ Q N D ++ Sbjct: 131 RYI-LNEQLELSPQHLNGNTFADKLRFSNSTYREEPSSGSILNSSAKPWDRQIGNTDLHV 189 Query: 1374 ID-QEALLYNGDSSANFLNXXXXXXXXXXXA-INPVDFLASQFPGFAAESLAEVYYANGC 1201 QE L+Y+ +S FLN +NP++FLAS FPGFA+ESLAEV++AN C Sbjct: 190 TSGQEELVYDENSGHGFLNDVFAGNSLVNDTDLNPLEFLASLFPGFASESLAEVFFANAC 249 Query: 1200 DLNLTIEILTQLELQVDTGFGQNLNSNTPASPNLSPLDFPALPLSDTQNGLSKVSRDDIG 1021 DL+LTIE+LTQLE+QVD GF QN + T +SPNLS +DFPAL S+ QN SK + D++ Sbjct: 250 DLHLTIEMLTQLEIQVDGGFNQNPSPKTLSSPNLSAMDFPALTSSNGQN-TSKYAADNVQ 308 Query: 1020 QAPNTY----------RSATSIFG-GATDFASAVRKVASQDSGQWKYERNGSGDGRIGSS 874 Q+ Y +S +SI G+ DFASAVRK+ASQDSG WKY++NGSGD GSS Sbjct: 309 QSGIPYISSDKDMLMFKSGSSIPSRGSVDFASAVRKLASQDSGIWKYDKNGSGDASTGSS 368 Query: 873 RMSQLLASSQNGN-AKLAYGDKMQSVRSSRAAPVWLETGEAVASMYSESREEARDFARLR 697 R LAS+ NG ++ GD++QS S+RAAPVWLETG+AVA+MYSE REEARD ARLR Sbjct: 369 RGLNALASAYNGGQGRVNIGDRLQSRGSARAAPVWLETGDAVANMYSELREEARDHARLR 428 Query: 696 NACFDQARQAYLIGNKALAKELSEKGQLYNIQMKAAHGKAREAIYRQRNPXXXXXXXXXX 517 NA F+QARQAYLIGNKALAKELS KGQL+N+ MKAAHGKA+E+IYRQRNP Sbjct: 429 NAYFEQARQAYLIGNKALAKELSVKGQLHNMHMKAAHGKAQESIYRQRNP---------- 478 Query: 516 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLKHELSTLRSAARAVGRQAQVMICVGTG 337 LKHELS LRS ARA ++ QV ICVGTG Sbjct: 479 ------VAPENGRGHQRMIDLHGLHVSEAIHVLKHELSVLRSTARAAEQRLQVYICVGTG 532 Query: 336 
HHTKGSRTPARLPVAVEQYLL-EENLHYTQPQPGLLRVVIY 217 HHT+GSRTPARLP+AV++YLL EE L +T+PQPGLLRVVIY Sbjct: 533 HHTRGSRTPARLPIAVQRYLLEEEGLDFTEPQPGLLRVVIY 573
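
The per-hit statistics in the report above (bit score, E-value, identities) can be pulled out of the flattened text with a regular expression. The following is a minimal sketch, not a full BLAST parser: `HIT_RE`, `parse_evalue`, and `parse_hits` are hypothetical names introduced here, the text is assumed to be single-spaced as shown above, and a bare exponent such as `e-148` is assumed to stand for 1e-148. The sample strings are shortened copies of two hit headers from this report. The percentages printed in this report are consistent with truncation rather than rounding (301/580 is reported as 51%), so the sketch recomputes them with integer division.

```python
import re

# Hypothetical pattern for one hit header in the flattened report text.
HIT_RE = re.compile(
    r">(?P<acc>ref\|[^|]+\|).*?"
    r"Score = (?P<bits>\d+) bits \((?P<raw>\d+)\), "
    r"Expect = (?P<evalue>\S+) "
    r"Identities = (?P<ident>\d+)/(?P<length>\d+) \((?P<pct>\d+)%\)",
    re.S,
)


def parse_evalue(text):
    # Assumption: a bare exponent such as "e-148" stands for 1e-148.
    return float("1" + text if text.startswith("e") else text)


def parse_hits(report):
    """Return one dict per hit header found in the flattened report."""
    hits = []
    for m in HIT_RE.finditer(report):
        ident, length = int(m["ident"]), int(m["length"])
        hits.append({
            "accession": m["acc"],
            "bits": int(m["bits"]),
            "evalue": parse_evalue(m["evalue"]),
            "identities": (ident, length),
            "reported_pct": int(m["pct"]),
            # Integer division truncates, matching the values in this report.
            "truncated_pct": 100 * ident // length,
        })
    return hits


# Two hit headers copied (shortened) from the report above.
sample = (
    ">ref|XP_002516159.1| pentatricopeptide repeat-containing protein, "
    "putative [Ricinus communis] Length = 1439 "
    "Score = 532 bits (1370), Expect = e-148 "
    "Identities = 311/582 (53%), Positives = 375/582 (64%), "
    "Gaps = 26/582 (4%) Frame = -1 "
    ">ref|XP_003544580.1| PREDICTED: uncharacterized protein LOC100799961 "
    "[Glycine max] Length = 572 "
    "Score = 516 bits (1330), Expect = e-144 "
    "Identities = 301/580 (51%), Positives = 376/580 (64%), "
    "Gaps = 19/580 (3%) Frame = -1"
)

hits = parse_hits(sample)
for hit in hits:
    print(hit["accession"], hit["bits"], hit["evalue"], hit["truncated_pct"])
```

Keeping both `reported_pct` and `truncated_pct` makes it easy to check that a parsed record still agrees with the figures printed in the report.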