BLASTX nr result
ID: Dioscorea21_contig00009408
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Dioscorea21_contig00009408 (613 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002528283.1| pentatricopeptide repeat-containing protein,... 94 2e-17 ref|XP_002991687.1| hypothetical protein SELMODRAFT_133908 [Sela... 89 8e-16 ref|XP_003602631.1| Pentatricopeptide repeat-containing protein ... 86 7e-15 ref|NP_179705.1| pentatricopeptide repeat-containing protein [Ar... 85 1e-14 ref|XP_003542017.1| PREDICTED: pentatricopeptide repeat-containi... 83 3e-14 >ref|XP_002528283.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223532320|gb|EEF34121.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 602 Score = 94.0 bits (232), Expect = 2e-17 Identities = 49/135 (36%), Positives = 78/135 (57%), Gaps = 1/135 (0%) Frame = -1 Query: 403 RRLVSVPKSIHALARAGLIKPTIGSLILRRSLGDPVPPDSFVAILKSVSSPKSLMCGKQL 224 +R+ + KS+ L+ G + I SL L G +P + +L+ ++ KSL GK + Sbjct: 13 KRVPCIVKSLLHLSSQGQLFQAISSLGLLSRNGIRLPSKTLAYLLQQCANTKSLKLGKWV 72 Query: 223 HAHMVVTDFNH-DTLVQNYLIVMYGKCGSLDDAHSIFRLMSRKNLHSWNTLIDCYCKFGS 47 H H+ VT +T + N+LI MY KCG A+ +F MS +NL+SWN ++ Y K G Sbjct: 73 HLHLKVTGLKRPNTFLANHLINMYSKCGDYPSAYKVFDEMSTRNLYSWNGMLSGYAKLGK 132 Query: 46 LSKAQQLFDEMPQRD 2 + A++LFD+MP++D Sbjct: 133 IKPARKLFDKMPEKD 147 Score = 63.9 bits (154), Expect = 2e-08 Identities = 35/132 (26%), Positives = 65/132 (49%) Frame = -1 Query: 397 LVSVPKSIHALARAGLIKPTIGSLILRRSLGDPVPPDSFVAILKSVSSPKSLMCGKQLHA 218 +VS + A A++G + R LG SF +L K L KQ H Sbjct: 148 VVSWNTMVIAYAKSGFCNDALRFYRELRRLGIGYNEYSFAGLLNICVKVKELELSKQAHG 207 Query: 217 HMVVTDFNHDTLVQNYLIVMYGKCGSLDDAHSIFRLMSRKNLHSWNTLIDCYCKFGSLSK 38 ++V F + ++ + ++ Y KC + DA +F M +++ +W T++ Y ++G + Sbjct: 208 QVLVAGFLSNLVISSSVLDAYAKCSEMGDARRLFDEMIIRDVLAWTTMVSGYAQWGDVEA 267 Query: 37 AQQLFDEMPQRD 2 A++LFD MP+++ Sbjct: 268 ARELFDLMPEKN 279 Score = 62.4 bits (150), Expect = 6e-08 Identities = 36/98 (36%), Positives = 57/98 (58%), Gaps = 3/98 (3%) Frame = -1 Query: 298 VPPDSFV--AILKSVSSPKSLMCGKQLHAHMVVTDFNHDTLVQNYLIVMYGKCGSLDDAH 125 + PD F + L + +S SL GKQ+H +++ T+ +T+V + LI MY KCG L+ Sbjct: 311 IRPDQFTFSSCLCASASIASLNHGKQIHGYLIRTNIRPNTIVVSSLIDMYSKCGCLEVGR 370 Query: 124 SIFRLMSRK-NLHSWNTLIDCYCKFGSLSKAQQLFDEM 14 +F LM K ++ WNT+I + G +A Q+FD+M Sbjct: 371 LVFDLMGDKWDVVLWNTIISSLAQHGRGQEAIQMFDDM 408 >ref|XP_002991687.1| hypothetical protein SELMODRAFT_133908 [Selaginella moellendorffii] gi|300140536|gb|EFJ07258.1| hypothetical protein SELMODRAFT_133908 [Selaginella moellendorffii] Length = 589 Score = 88.6 bits (218), Expect = 8e-16 Identities = 41/93 (44%), Positives = 60/93 (64%) Frame = -1 Query: 286 SFVAILKSVSSPKSLMCGKQLHAHMVVTDFNHDTLVQNYLIVMYGKCGSLDDAHSIFRLM 107 ++ +L+ + KSL GK+LHAH+V + +HDTL+ N LI MY CGS+D+AH F + Sbjct: 24 AYANLLRQCIACKSLREGKRLHAHIVASRQDHDTLLGNLLIQMYSSCGSMDEAHLAFSQI 83 Query: 106 SRKNLHSWNTLIDCYCKFGSLSKAQQLFDEMPQ 8 R N SWN L+ Y + G + A+++FD MPQ Sbjct: 84 QRSNTFSWNILLGAYVRNGDIVLAREVFDRMPQ 116 Score = 63.9 bits (154), Expect = 2e-08 Identities = 35/102 (34%), Positives = 56/102 (54%), Gaps = 3/102 (2%) Frame = -1 Query: 304 DPVPPD--SFVAILKSVSSPKSLMCGKQLHAHMVVTDFNHDTLVQNYLIVMYGKCGSLDD 131 D + PD +F+ + S + L G+ LHA + ++ + V N L+ MYGKCG+L + Sbjct: 366 DGIKPDEMAFLVAIDSCGAASDLARGRFLHAEIDAAGYDSSSTVANSLVGMYGKCGNLQE 425 Query: 130 AHSIF-RLMSRKNLHSWNTLIDCYCKFGSLSKAQQLFDEMPQ 8 A +F R +R++ WNT+I CY + G + +A L M Q Sbjct: 426 ARRLFDRGGARRSSALWNTMISCYSQAGFVREALDLLHAMEQ 467 >ref|XP_003602631.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355491679|gb|AES72882.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 705 Score = 85.5 bits (210), Expect = 7e-15 Identities = 44/122 (36%), Positives = 70/122 (57%) Frame = -1 Query: 367 LARAGLIKPTIGSLILRRSLGDPVPPDSFVAILKSVSSPKSLMCGKQLHAHMVVTDFNHD 188 +A+ GL++ +G L S D P F +L + KS+ + +HA ++ T F+ + Sbjct: 1 MAKHGLVRKVVGDL----SFLDSSP---FAKLLDTCVKSKSVFEARLVHARIIKTQFSSE 53 Query: 187 TLVQNYLIVMYGKCGSLDDAHSIFRLMSRKNLHSWNTLIDCYCKFGSLSKAQQLFDEMPQ 8 +QN L+ +YGKCG L+DA +F M ++N SWN ++ KFG+L +A LF MP+ Sbjct: 54 IFIQNRLVDVYGKCGFLEDARKVFDHMQQRNTFSWNAVLGALTKFGALDEALNLFKCMPE 113 Query: 7 RD 2 RD Sbjct: 114 RD 115 >ref|NP_179705.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75206523|sp|Q9SKQ4.1|PP167_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At2g21090 gi|4803934|gb|AAD29807.1| unknown protein [Arabidopsis thaliana] gi|330252028|gb|AEC07122.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 597 Score = 84.7 bits (208), Expect = 1e-14 Identities = 43/103 (41%), Positives = 62/103 (60%), Gaps = 1/103 (0%) Frame = -1 Query: 307 GDPVPPDSFVAILKSVSSPKSLMCGKQLHAHMVVTDFNH-DTLVQNYLIVMYGKCGSLDD 131 G +P D ++L+ KSL GK +H H+ +T F +TL+ N+LI MY KCG D Sbjct: 41 GIRLPFDLLASLLQQCGDTKSLKQGKWIHRHLKITGFKRPNTLLSNHLIGMYMKCGKPID 100 Query: 130 AHSIFRLMSRKNLHSWNTLIDCYCKFGSLSKAQQLFDEMPQRD 2 A +F M +NL+SWN ++ Y K G L +A+ +FD MP+RD Sbjct: 101 ACKVFDQMHLRNLYSWNNMVSGYVKSGMLVRARVVFDSMPERD 143 Score = 67.8 bits (164), Expect = 1e-09 Identities = 31/95 (32%), Positives = 52/95 (54%) Frame = -1 Query: 286 SFVAILKSVSSPKSLMCGKQLHAHMVVTDFNHDTLVQNYLIVMYGKCGSLDDAHSIFRLM 107 SF +L + + L +Q H ++V F + ++ +I Y KCG ++ A F M Sbjct: 181 SFAGLLTACVKSRQLQLNRQAHGQVLVAGFLSNVVLSCSIIDAYAKCGQMESAKRCFDEM 240 Query: 106 SRKNLHSWNTLIDCYCKFGSLSKAQQLFDEMPQRD 2 + K++H W TLI Y K G + A++LF EMP+++ Sbjct: 241 TVKDIHIWTTLISGYAKLGDMEAAEKLFCEMPEKN 275 Score = 57.0 bits (136), Expect = 3e-06 Identities = 38/121 (31%), Positives = 65/121 (53%), Gaps = 6/121 (4%) Frame = -1 Query: 358 AGLIKPTIGSLIL---RRSLGDPVPPDSFV--AILKSVSSPKSLMCGKQLHAHMVVTDFN 194 AG ++ G+ L R+ + V P+ F + L + +S SL GK++H +M+ T+ Sbjct: 284 AGYVRQGSGNRALDLFRKMIALGVKPEQFTFSSCLCASASIASLRHGKEIHGYMIRTNVR 343 Query: 193 HDTLVQNYLIVMYGKCGSLDDAHSIFRLMSRK-NLHSWNTLIDCYCKFGSLSKAQQLFDE 17 + +V + LI MY K GSL+ + +FR+ K + WNT+I + G KA ++ D+ Sbjct: 344 PNAIVISSLIDMYSKSGSLEASERVFRICDDKHDCVFWNTMISALAQHGLGHKALRMLDD 403 Query: 16 M 14 M Sbjct: 404 M 404 >ref|XP_003542017.1| PREDICTED: pentatricopeptide repeat-containing protein At4g37170-like [Glycine max] Length = 693 Score = 83.2 bits (204), Expect = 3e-14 Identities = 39/83 (46%), Positives = 55/83 (66%) Frame = -1 Query: 250 KSLMCGKQLHAHMVVTDFNHDTLVQNYLIVMYGKCGSLDDAHSIFRLMSRKNLHSWNTLI 71 ++L G+++HAH ++F + N L+ MY KCGSL DA +F M ++L SWNT+I Sbjct: 101 RALELGRRVHAHTKASNFVPGVFISNRLLDMYAKCGSLVDAQMLFDEMGHRDLCSWNTMI 160 Query: 70 DCYCKFGSLSKAQQLFDEMPQRD 2 Y K G L +A++LFDEMPQRD Sbjct: 161 VGYAKLGRLEQARKLFDEMPQRD 183 Score = 60.5 bits (145), Expect = 2e-07 Identities = 32/90 (35%), Positives = 53/90 (58%) Frame = -1 Query: 277 AILKSVSSPKSLMCGKQLHAHMVVTDFNHDTLVQNYLIVMYGKCGSLDDAHSIFRLMSRK 98 A+ S + P L GK++H +++ T+ N D +V + L+ +YGKCGSLD+A IF M + Sbjct: 226 ALAASAAIP-CLRLGKEIHGYLIRTELNLDEVVWSALLDLYGKCGSLDEARGIFDQMKDR 284 Query: 97 NLHSWNTLIDCYCKFGSLSKAQQLFDEMPQ 8 ++ SW T+I + G + LF ++ Q Sbjct: 285 DVVSWTTMIHRCFEDGRREEGFLLFRDLMQ 314