BLASTX nr result
ID: Dioscorea21_contig00029926
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Dioscorea21_contig00029926 (312 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002525999.1| pentatricopeptide repeat-containing protein,... 93 3e-17 ref|XP_004142302.1| PREDICTED: pentatricopeptide repeat-containi... 90 2e-16 ref|XP_002876773.1| pentatricopeptide repeat-containing protein ... 88 6e-16 ref|NP_178283.1| pentatricopeptide repeat-containing protein [Ar... 85 5e-15 ref|XP_002320961.1| predicted protein [Populus trichocarpa] gi|2... 81 1e-13 >ref|XP_002525999.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223534731|gb|EEF36423.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 557 Score = 92.8 bits (229), Expect = 3e-17 Identities = 41/89 (46%), Positives = 59/89 (66%) Frame = -1 Query: 270 DPVTFNTIIAALCRQSLRLPALCFLGAMHKYCKPNVVTYSTLIDMFCKMGDLDTAHKVFI 91 D V+FN + C++ ++ ++G M K C PNV+TY T ID CK+GDLDT +K F Sbjct: 41 DLVSFNALFNGFCKRKMKEEVFIYMGLMWKCCLPNVITYGTWIDTLCKVGDLDTGYKFFK 100 Query: 90 DMRFENGVLPNAVTFTCLIDGFCKNDRLD 4 +MR ++G++PN + FTCLIDG+ K LD Sbjct: 101 EMR-KDGIVPNLIAFTCLIDGYSKIGNLD 128 >ref|XP_004142302.1| PREDICTED: pentatricopeptide repeat-containing protein At2g01740-like [Cucumis sativus] gi|449521427|ref|XP_004167731.1| PREDICTED: pentatricopeptide repeat-containing protein At2g01740-like [Cucumis sativus] Length = 585 Score = 90.1 bits (222), Expect = 2e-16 Identities = 40/90 (44%), Positives = 58/90 (64%) Frame = -1 Query: 273 PDPVTFNTIIAALCRQSLRLPALCFLGAMHKYCKPNVVTYSTLIDMFCKMGDLDTAHKVF 94 PD V FN + + ++ A + G M KYC P++VTY T +DMFCKMGD+ +++F Sbjct: 125 PDLVMFNILFNGFAKVYMKNEAFMYFGLMWKYCLPSIVTYGTFVDMFCKMGDMKMGNRMF 184 Query: 93 IDMRFENGVLPNAVTFTCLIDGFCKNDRLD 4 +DM + G++PN V F+ LIDG+CK LD Sbjct: 185 LDM-MKVGIVPNLVVFSSLIDGYCKAGSLD 213 Score = 58.2 bits (139), Expect = 7e-07 Identities = 34/90 (37%), Positives = 49/90 (54%), Gaps = 1/90 (1%) Frame = -1 Query: 273 PDPVTFNTIIAALCRQS-LRLPALCFLGAMHKYCKPNVVTYSTLIDMFCKMGDLDTAHKV 97 P VT+ T + C+ +++ FL M PN+V +S+LID +CK G LD A + Sbjct: 159 PSIVTYGTFVDMFCKMGDMKMGNRMFLDMMKVGIVPNLVVFSSLIDGYCKAGSLDVAFEY 218 Query: 96 FIDMRFENGVLPNAVTFTCLIDGFCKNDRL 7 F M+ E V PN T++ LIDG K+ L Sbjct: 219 FERMK-ECSVRPNEFTYSTLIDGCSKHGML 247 >ref|XP_002876773.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297322611|gb|EFH53032.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 559 Score = 88.2 bits (217), Expect = 6e-16 Identities = 44/90 (48%), Positives = 56/90 (62%) Frame = -1 Query: 273 PDPVTFNTIIAALCRQSLRLPALCFLGAMHKYCKPNVVTYSTLIDMFCKMGDLDTAHKVF 94 PD V+FNT+ + + ++G M K C PNVVTYST IDMFCK G+L A K F Sbjct: 127 PDIVSFNTLFNGFSKMKMLDEVFVYMGVMLKCCSPNVVTYSTWIDMFCKSGELKLALKSF 186 Query: 93 IDMRFENGVLPNAVTFTCLIDGFCKNDRLD 4 M+ + + PN VTFTCLIDG+CK L+ Sbjct: 187 NCMK-RDALFPNVVTFTCLIDGYCKAGDLE 215 Score = 61.2 bits (147), Expect = 8e-08 Identities = 36/90 (40%), Positives = 50/90 (55%), Gaps = 1/90 (1%) Frame = -1 Query: 273 PDPVTFNTIIAALCRQS-LRLPALCFLGAMHKYCKPNVVTYSTLIDMFCKMGDLDTAHKV 97 P+ VT++T I C+ L+L F PNVVT++ LID +CK GDL+ + Sbjct: 161 PNVVTYSTWIDMFCKSGELKLALKSFNCMKRDALFPNVVTFTCLIDGYCKAGDLEVVVSL 220 Query: 96 FIDMRFENGVLPNAVTFTCLIDGFCKNDRL 7 + +MR L N VT+T LIDGFCK + Sbjct: 221 YEEMRRVRMSL-NVVTYTALIDGFCKKGEM 249 Score = 55.8 bits (133), Expect = 3e-06 Identities = 32/99 (32%), Positives = 57/99 (57%), Gaps = 3/99 (3%) Frame = -1 Query: 288 SLGLHPDPVTFNTIIAALCRQSLRLPALCFLGAMHKY-CKPNVVTYSTLIDMFCKMGDLD 112 S G P +FN++++ +C+ A+ + +M ++ C+P+V++Y++LID C+ GD+ Sbjct: 49 SRGYAPHRSSFNSVVSFVCKLGQVKFAVDIVHSMPRFGCEPDVISYNSLIDGHCRNGDIR 108 Query: 111 TAHKVFIDMRFENGVL--PNAVTFTCLIDGFCKNDRLDE 1 +A V +R G P+ V+F L +GF K LDE Sbjct: 109 SACLVLESLRASYGFTCKPDIVSFNTLFNGFSKMKMLDE 147 >ref|NP_178283.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75216739|sp|Q9ZUA2.1|PP141_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At2g01740 gi|4220475|gb|AAD12698.1| hypothetical protein [Arabidopsis thaliana] gi|330250397|gb|AEC05491.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 559 Score = 85.1 bits (209), Expect = 5e-15 Identities = 42/90 (46%), Positives = 55/90 (61%) Frame = -1 Query: 273 PDPVTFNTIIAALCRQSLRLPALCFLGAMHKYCKPNVVTYSTLIDMFCKMGDLDTAHKVF 94 PD V+FN++ + + ++G M K C PNVVTYST ID FCK G+L A K F Sbjct: 127 PDIVSFNSLFNGFSKMKMLDEVFVYMGVMLKCCSPNVVTYSTWIDTFCKSGELQLALKSF 186 Query: 93 IDMRFENGVLPNAVTFTCLIDGFCKNDRLD 4 M+ + + PN VTFTCLIDG+CK L+ Sbjct: 187 HSMK-RDALSPNVVTFTCLIDGYCKAGDLE 215 Score = 65.1 bits (157), Expect = 6e-09 Identities = 37/90 (41%), Positives = 51/90 (56%), Gaps = 1/90 (1%) Frame = -1 Query: 273 PDPVTFNTIIAALCRQS-LRLPALCFLGAMHKYCKPNVVTYSTLIDMFCKMGDLDTAHKV 97 P+ VT++T I C+ L+L F PNVVT++ LID +CK GDL+ A + Sbjct: 161 PNVVTYSTWIDTFCKSGELQLALKSFHSMKRDALSPNVVTFTCLIDGYCKAGDLEVAVSL 220 Query: 96 FIDMRFENGVLPNAVTFTCLIDGFCKNDRL 7 + +MR L N VT+T LIDGFCK + Sbjct: 221 YKEMRRVRMSL-NVVTYTALIDGFCKKGEM 249 Score = 57.4 bits (137), Expect = 1e-06 Identities = 32/99 (32%), Positives = 58/99 (58%), Gaps = 3/99 (3%) Frame = -1 Query: 288 SLGLHPDPVTFNTIIAALCRQSLRLPALCFLGAMHKY-CKPNVVTYSTLIDMFCKMGDLD 112 S G P +FN++++ +C+ A + +M ++ C+P+V++Y++LID C+ GD+ Sbjct: 49 SRGYTPHRSSFNSVVSFVCKLGQVKFAEDIVHSMPRFGCEPDVISYNSLIDGHCRNGDIR 108 Query: 111 TAHKVFIDMRFENGVL--PNAVTFTCLIDGFCKNDRLDE 1 +A V +R +G + P+ V+F L +GF K LDE Sbjct: 109 SASLVLESLRASHGFICKPDIVSFNSLFNGFSKMKMLDE 147 Score = 54.3 bits (129), Expect = 1e-05 Identities = 32/86 (37%), Positives = 48/86 (55%), Gaps = 1/86 (1%) Frame = -1 Query: 279 LHPDPVTFNTIIAALCRQSLRLPALCFLGAMHKY-CKPNVVTYSTLIDMFCKMGDLDTAH 103 L P+ VTF +I C+ A+ M + NVVTY+ LID FCK G++ A Sbjct: 194 LSPNVVTFTCLIDGYCKAGDLEVAVSLYKEMRRVRMSLNVVTYTALIDGFCKKGEMQRAE 253 Query: 102 KVFIDMRFENGVLPNAVTFTCLIDGF 25 +++ M E+ V PN++ +T +IDGF Sbjct: 254 EMYSRM-VEDRVEPNSLVYTTIIDGF 278 >ref|XP_002320961.1| predicted protein [Populus trichocarpa] gi|222861734|gb|EEE99276.1| predicted protein [Populus trichocarpa] Length = 474 Score = 80.9 bits (198), Expect = 1e-13 Identities = 43/95 (45%), Positives = 57/95 (60%), Gaps = 1/95 (1%) Frame = -1 Query: 282 GLHPDPVTFNTIIAALCRQSLRLPALCFLGAMHKY-CKPNVVTYSTLIDMFCKMGDLDTA 106 GL P+ T+N II +L + AL FL M K C PNVV YSTLID +C G +D A Sbjct: 162 GLQPNVYTYNVIINSLSKSGKANEALGFLKQMEKVGCVPNVVNYSTLIDGYCLRGQMDEA 221 Query: 105 HKVFIDMRFENGVLPNAVTFTCLIDGFCKNDRLDE 1 VF D+ G PN T+T L++G+CK +R++E Sbjct: 222 RSVF-DLMVSKGCTPNVYTYTSLMNGYCKIERIEE 255 Score = 65.5 bits (158), Expect = 4e-09 Identities = 40/100 (40%), Positives = 55/100 (55%), Gaps = 6/100 (6%) Frame = -1 Query: 282 GLHPDPVTFNTIIAALCRQSLRLPA------LCFLGAMHKYCKPNVVTYSTLIDMFCKMG 121 GL PD VTF TII+ LCR L A +C G PN++TY L+D CK G Sbjct: 267 GLVPDIVTFTTIISGLCRAGRPLAAQQLFRYICAHGHT-----PNIMTYGVLLDGLCKHG 321 Query: 120 DLDTAHKVFIDMRFENGVLPNAVTFTCLIDGFCKNDRLDE 1 +L+ A +F +M+ + V PN V +T LID CK ++ + Sbjct: 322 NLEEAFALFQEMQ-RSTVKPNLVIYTILIDSLCKCGKIKD 360 Score = 54.7 bits (130), Expect = 8e-06 Identities = 30/93 (32%), Positives = 52/93 (55%), Gaps = 1/93 (1%) Frame = -1 Query: 285 LGLHPDPVTFNTIIAALCRQSLRLPALCFLGAM-HKYCKPNVVTYSTLIDMFCKMGDLDT 109 +G P+ V ++T+I C + A M K C PNV TY++L++ +CK+ ++ Sbjct: 196 VGCVPNVVNYSTLIDGYCLRGQMDEARSVFDLMVSKGCTPNVYTYTSLMNGYCKIERIEE 255 Query: 108 AHKVFIDMRFENGVLPNAVTFTCLIDGFCKNDR 10 A ++ +D G++P+ VTFT +I G C+ R Sbjct: 256 AVQL-LDETLRKGLVPDIVTFTTIISGLCRAGR 287