BLASTX nr result
ID: Dioscorea21_contig00026344
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Dioscorea21_contig00026344 (877 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002275085.1| PREDICTED: pentatricopeptide repeat-containi... 240 3e-61 ref|NP_187126.1| pentatricopeptide repeat-containing protein [Ar... 228 1e-57 ref|XP_002884467.1| pentatricopeptide repeat-containing protein ... 226 5e-57 ref|XP_003559839.1| PREDICTED: pentatricopeptide repeat-containi... 226 7e-57 gb|EAY91821.1| hypothetical protein OsI_13463 [Oryza sativa Indi... 219 5e-55 >ref|XP_002275085.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04750, mitochondrial-like [Vitis vinifera] Length = 654 Score = 240 bits (613), Expect = 3e-61 Identities = 125/292 (42%), Positives = 182/292 (62%) Frame = +2 Query: 2 SAIAHPCXXXXXXXXXXXXXXNPNLFIFNTMISALSFSVNQSLAFYKSMLHLCVCPDEHT 181 SAI HP +PNL+I+NTMISALS S+NQS AFY S+L C+ P+ T Sbjct: 73 SAITHPENLDMAVLLFRHHTPHPNLYIYNTMISALSLSLNQSFAFYNSLLSSCIYPNRST 132 Query: 182 XXXXXXXXXXXXXXXXQIHAQVIIFGFSFHAYVHNSLVKMYLENDEIGVVEKLVRPCGDK 361 QIH II G ++ Y+ N+L+K+YLEN+++G+ ++ + Sbjct: 133 FLFLLQASKFLSQVM-QIHCHAIITGSFYYGYLQNTLMKIYLENEKMGLAYQVFQQMA-A 190 Query: 362 KDIVLFNTLMSWYVKKGCYLDALEVFDELMGSGVEPDQYTIVSLLVCCGQLKGVLAGKSV 541 D V FN ++ Y KKG ++AL+ E++G G++PD++T++ LL+CCG+L GKSV Sbjct: 191 PDAVSFNIMIFGYAKKGHNIEALKFLHEMVGLGLKPDEFTMLGLLICCGRLGDAQLGKSV 250 Query: 542 HGWVVRRMGYQGWGLVLCNALLDMYVKCEEIGTAMKIFDRISEKDSVSWNIMIMGFNNVG 721 H W+ RR + L+L NALLDMYVKC+E+ A IF+ I KD++SWN MI G+ VG Sbjct: 251 HAWIERRGLIKSSNLILNNALLDMYVKCKELRIAQSIFNVIVRKDTISWNTMIAGYAKVG 310 Query: 722 EFDLAYKAFEEMPEKDLVSWNSLLSGYLQKGNYKRVIELFHFMLSQNDVKPD 877 ++A+ FE+MP +DLVSWNS+++GY QKG+ V LF M+++N + PD Sbjct: 311 NLEIAHNFFEDMPCRDLVSWNSIIAGYAQKGDCLMVQRLFENMVAEN-IWPD 361 Score = 85.9 bits (211), Expect = 1e-14 Identities = 74/311 (23%), Positives = 134/311 (43%), Gaps = 42/311 (13%) Frame = +2 Query: 68 PNLFIFNTMISALSFSVN--QSLAFYKSMLHLCVCPDEHTXXXXXXXXXXXXXXXX--QI 235 P+ FN MI + + ++L F M+ L + PDE T + Sbjct: 191 PDAVSFNIMIFGYAKKGHNIEALKFLHEMVGLGLKPDEFTMLGLLICCGRLGDAQLGKSV 250 Query: 236 HAQVIIFGF--SFHAYVHNSLVKMYLENDEIGVVEKLVR--------------------- 346 HA + G S + ++N+L+ MY++ E+ + + + Sbjct: 251 HAWIERRGLIKSSNLILNNALLDMYVKCKELRIAQSIFNVIVRKDTISWNTMIAGYAKVG 310 Query: 347 ------------PCGDKKDIVLFNTLMSWYVKKGCYLDALEVFDELMGSGVEPDQYTIVS 490 PC +D+V +N++++ Y +KG L +F+ ++ + PD TI++ Sbjct: 311 NLEIAHNFFEDMPC---RDLVSWNSIIAGYAQKGDCLMVQRLFENMVAENIWPDFVTIIN 367 Query: 491 LLVCCGQLKGVLAGKSVHGWVVRRMGYQGWGLVLCNALLDMYVKCEEIGTAMKIFDRISE 670 L+ ++ + G+ +HGWVVR L +A +DMY KC I A +F ++E Sbjct: 368 LVSAAAEIGALHHGRWIHGWVVRMQ--MKIDAFLGSAFIDMYWKCGSIKRACMVFREVTE 425 Query: 671 KDSVSWNIMIMGFNNVGEFDLAYKAFEEMPE---KDLVSWNSLLSGYLQKGNYKRVIELF 841 KD W MI GF G A + F EM E + V++ ++L+ G + + +F Sbjct: 426 KDVTVWTTMITGFAFHGYGSKALQLFYEMQEYVMPNQVTFVAVLTACSHSGFVSQGLRIF 485 Query: 842 HFMLSQNDVKP 874 + M + ++P Sbjct: 486 NSMKERYGIEP 496 >ref|NP_187126.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75207287|sp|Q9SR01.1|PP212_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At3g04750, mitochondrial; Flags: Precursor gi|6175175|gb|AAF04901.1|AC011437_16 hypothetical protein [Arabidopsis thaliana] gi|332640610|gb|AEE74131.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 661 Score = 228 bits (581), Expect = 1e-57 Identities = 124/294 (42%), Positives = 171/294 (58%), Gaps = 2/294 (0%) Frame = +2 Query: 2 SAIAHPCXXXXXXXXXXXXXXNPNLFIFNTMISALSFSVNQSLAFYKSMLHLCVCPDEHT 181 SAI +P NPN+F++NTMISA+S S N+ Y SM+ V PD T Sbjct: 76 SAITYPENLDLAKLLFLNFTPNPNVFVYNTMISAVSSSKNECFGLYSSMIRHRVSPDRQT 135 Query: 182 XXXXXXXXXXXXXXXXQIHAQVIIFG-FSFHAYVHNSLVKMYLENDEIGVVEKLVRPCGD 358 QIH +I+ G S Y+ NSLVK Y+E GV EK+ Sbjct: 136 FLYLMKASSFLSEVK-QIHCHIIVSGCLSLGNYLWNSLVKFYMELGNFGVAEKVFARM-P 193 Query: 359 KKDIVLFNTLMSWYVKKGCYLDALEVFDELMGSGVEPDQYTIVSLLVCCGQLKGVLAGKS 538 D+ FN ++ Y K+G L+AL+++ +++ G+EPD+YT++SLLVCCG L + GK Sbjct: 194 HPDVSSFNVMIVGYAKQGFSLEALKLYFKMVSDGIEPDEYTVLSLLVCCGHLSDIRLGKG 253 Query: 539 VHGWVVRRMGYQGWGLVLCNALLDMYVKCEEIGTAMKIFDRISEKDSVSWNIMIMGFNNV 718 VHGW+ RR L+L NALLDMY KC+E G A + FD + +KD SWN M++GF + Sbjct: 254 VHGWIERRGPVYSSNLILSNALLDMYFKCKESGLAKRAFDAMKKKDMRSWNTMVVGFVRL 313 Query: 719 GEFDLAYKAFEEMPEKDLVSWNSLLSGYLQKGNYKRVI-ELFHFMLSQNDVKPD 877 G+ + A F++MP++DLVSWNSLL GY +KG +R + ELF+ M VKPD Sbjct: 314 GDMEAAQAVFDQMPKRDLVSWNSLLFGYSKKGCDQRTVRELFYEMTIVEKVKPD 367 Score = 73.2 bits (178), Expect = 8e-11 Identities = 54/161 (33%), Positives = 83/161 (51%), Gaps = 2/161 (1%) Frame = +2 Query: 359 KKDIVLFNTLMSWYVKKGCYLDAL-EVFDEL-MGSGVEPDQYTIVSLLVCCGQLKGVLAG 532 K+D+V +N+L+ Y KKGC + E+F E+ + V+PD+ T+VSL+ + G Sbjct: 328 KRDLVSWNSLLFGYSKKGCDQRTVRELFYEMTIVEKVKPDRVTMVSLISGAANNGELSHG 387 Query: 533 KSVHGWVVRRMGYQGWGLVLCNALLDMYVKCEEIGTAMKIFDRISEKDSVSWNIMIMGFN 712 + VHG V+R + +G L +AL+DMY KC I A +F +EKD W MI G Sbjct: 388 RWVHGLVIR-LQLKG-DAFLSSALIDMYCKCGIIERAFMVFKTATEKDVALWTSMITGLA 445 Query: 713 NVGEFDLAYKAFEEMPEKDLVSWNSLLSGYLQKGNYKRVIE 835 G A + F M E+ + N L L ++ ++E Sbjct: 446 FHGNGQQALQLFGRMQEEGVTPNNVTLLAVLTACSHSGLVE 486 >ref|XP_002884467.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297330307|gb|EFH60726.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 657 Score = 226 bits (576), Expect = 5e-57 Identities = 124/296 (41%), Positives = 173/296 (58%), Gaps = 4/296 (1%) Frame = +2 Query: 2 SAIAHPCXXXXXXXXXXXXXXNPNLFIFNTMISALSFSVNQSLAFYKSMLHLCVCPDEHT 181 SAI +P NPN+F++NTMISA+S S N+ Y SM+ V PD T Sbjct: 75 SAITYPENLDLAKLLFLDFTPNPNVFVYNTMISAVSSSKNECFGLYSSMIRYRVSPDRQT 134 Query: 182 XXXXXXXXXXXXXXXXQIHAQVIIFG-FSFHAYVHNSLVKMYLENDEIGVVEKL--VRPC 352 QIH +I+ G S Y+ NSLVK Y+E +G EK+ + P Sbjct: 135 FLHLMKASSFLSEVK-QIHCHIIVSGCLSLGNYLWNSLVKFYMELGSLGFAEKVFAIMP- 192 Query: 353 GDKKDIVLFNTLMSWYVKKGCYLDALEVFDELMGSGVEPDQYTIVSLLVCCGQLKGVLAG 532 + D+ FN ++ Y K+G L+ALE++ +++ G+EPD+YT++ LLVCCG L + G Sbjct: 193 --QPDVSSFNVMIVGYAKQGFGLEALELYYKMVSDGIEPDEYTLLGLLVCCGHLSDIRLG 250 Query: 533 KSVHGWVVRRMGYQGWGLVLCNALLDMYVKCEEIGTAMKIFDRISEKDSVSWNIMIMGFN 712 K VHGW+ RR L+L NALLDMY KC+E G A + FD + +KD SWN M++GF Sbjct: 251 KGVHGWIERRGPVYSSNLILRNALLDMYFKCKESGLAKRAFDALKKKDMRSWNTMVVGFV 310 Query: 713 NVGEFDLAYKAFEEMPEKDLVSWNSLLSGYLQKGNYKRVI-ELFHFMLSQNDVKPD 877 +G+ + A F++MP++DLVSWNSLL Y +KG +R + ELF+ ML VKPD Sbjct: 311 RLGDMEAAQAVFDQMPQRDLVSWNSLLFCYSKKGCDQRAVRELFYEMLIVEKVKPD 366 Score = 76.6 bits (187), Expect = 7e-12 Identities = 58/179 (32%), Positives = 95/179 (53%), Gaps = 6/179 (3%) Frame = +2 Query: 359 KKDIVLFNTLMSWYVKKGCYLDAL-EVFDE-LMGSGVEPDQYTIVSLLVCCGQLKGVLAG 532 ++D+V +N+L+ Y KKGC A+ E+F E L+ V+PD+ T+VSL+ + G Sbjct: 327 QRDLVSWNSLLFCYSKKGCDQRAVRELFYEMLIVEKVKPDRVTMVSLISGAANNGELSHG 386 Query: 533 KSVHGWVVRRMGYQGWGLVLCNALLDMYVKCEEIGTAMKIFDRISEKDSVSWNIMIMGFN 712 + VHG ++R + +G L +AL+DMY KC I A +F +EKD W MI GF Sbjct: 387 RWVHGLMIR-LQLEG-DAFLSSALIDMYCKCGLIERAFMVFKTATEKDVPLWTSMITGFA 444 Query: 713 NVGEFDLAYKAFEEMPEKDL----VSWNSLLSGYLQKGNYKRVIELFHFMLSQNDVKPD 877 G A + F+ M E+D+ V+ ++L+ G + + +F+ M + P+ Sbjct: 445 FHGYGQQALQLFKRMQEEDVTPNKVTLLAVLTACSHSGLVEEGLHVFYHMKEKFGFHPE 503 >ref|XP_003559839.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04750, mitochondrial-like [Brachypodium distachyon] Length = 601 Score = 226 bits (575), Expect = 7e-57 Identities = 122/270 (45%), Positives = 167/270 (61%) Frame = +2 Query: 68 PNLFIFNTMISALSFSVNQSLAFYKSMLHLCVCPDEHTXXXXXXXXXXXXXXXXQIHAQV 247 PNL+ +N ++SALS S ++S+A YKSML PDE T Q+HA V Sbjct: 85 PNLYCYNLVLSALSSSQSRSVALYKSMLASSASPDEKTFLSLLKSVGCASVGK-QVHAHV 143 Query: 248 IIFGFSFHAYVHNSLVKMYLENDEIGVVEKLVRPCGDKKDIVLFNTLMSWYVKKGCYLDA 427 ++ G Y+ NSL+KMYL+ + E + + D+V N ++S YVK GC ++A Sbjct: 144 LVNGLHSRVYLRNSLIKMYLDAGDAETAEAMFQSV-PVPDVVSCNIMLSGYVKGGCVVNA 202 Query: 428 LEVFDELMGSGVEPDQYTIVSLLVCCGQLKGVLAGKSVHGWVVRRMGYQGWGLVLCNALL 607 L++F ++ + DQY V+LL CCG+LK L G+SVHG VVRRM + GL+L NALL Sbjct: 203 LQLFRDMASREIGVDQYAAVALLSCCGRLKNALLGRSVHGVVVRRMDIKDRGLILSNALL 262 Query: 608 DMYVKCEEIGTAMKIFDRISEKDSVSWNIMIMGFNNVGEFDLAYKAFEEMPEKDLVSWNS 787 DMY KC E+ TAM++F EKD +SWN MI GF N G DLA K F + P +DL+SWN+ Sbjct: 263 DMYAKCGEMNTAMRVFGEAKEKDDISWNTMIAGFANDGMLDLASKFFFDAPCRDLISWNT 322 Query: 788 LLSGYLQKGNYKRVIELFHFMLSQNDVKPD 877 LL+GY + + V+ELF+ MLS V+PD Sbjct: 323 LLAGYGRCREFAAVMELFNDMLSSR-VRPD 351 Score = 85.1 bits (209), Expect = 2e-14 Identities = 53/180 (29%), Positives = 91/180 (50%), Gaps = 4/180 (2%) Frame = +2 Query: 347 PCGDKKDIVLFNTLMSWYVKKGCYLDALEVFDELMGSGVEPDQYTIVSLLVCCGQLKGVL 526 PC +D++ +NTL++ Y + + +E+F++++ S V PD+ T V+L+ + Sbjct: 313 PC---RDLISWNTLLAGYGRCREFAAVMELFNDMLSSRVRPDKVTAVTLISAAVSKGALN 369 Query: 527 AGKSVHGWVVRRMGYQGWGLVLCNALLDMYVKCEEIGTAMKIFDRISEKDSVSWNIMIMG 706 GKSVHGWV++ G Q L + L+DMY KC + A +F++ +KD W MI G Sbjct: 370 LGKSVHGWVLKEHGTQ--DAFLASTLVDMYCKCGNVKLAYAVFEKALDKDVTLWTAMISG 427 Query: 707 FNNVGEFDLAYKAFEEMPEKDL----VSWNSLLSGYLQKGNYKRVIELFHFMLSQNDVKP 874 G A F M + + V+ ++LS G E+F+ M + +++P Sbjct: 428 LAFHGHGTEALDLFWNMQNEGVAPNGVTLVTVLSACSHAGLLDEGCEIFYTMQKRFNIEP 487 >gb|EAY91821.1| hypothetical protein OsI_13463 [Oryza sativa Indica Group] Length = 468 Score = 219 bits (559), Expect = 5e-55 Identities = 120/279 (43%), Positives = 161/279 (57%), Gaps = 9/279 (3%) Frame = +2 Query: 68 PNLFIFNTMIS--------ALSFSVNQSLAFYKSMLHLCVCPDEHTXXXXXXXXXXXXXX 223 PNL+I+N M+S A S + A Y SML + PDE T Sbjct: 86 PNLYIYNLMLSSAAAAAAAASSSPSRRPAALYMSMLASSIHPDEQTFLSLLKSVDAERRS 145 Query: 224 XX-QIHAQVIIFGFSFHAYVHNSLVKMYLENDEIGVVEKLVRPCGDKKDIVLFNTLMSWY 400 Q+HA V++ G Y+ NSL+KMYL+ ++ E + R C D V N ++S Y Sbjct: 146 VGKQVHAHVVVTGLHSRVYLRNSLIKMYLDAGDVEAAEAMFR-CAPTADAVSCNIMLSGY 204 Query: 401 VKKGCYLDALEVFDELMGSGVEPDQYTIVSLLVCCGQLKGVLAGKSVHGWVVRRMGYQGW 580 VK GC AL F + G+ DQYT V+LL CCG+LK + G+SVHG VVRR+G Sbjct: 205 VKGGCSGKALRFFRGMASRGIGVDQYTAVALLACCGRLKKAVLGRSVHGVVVRRIGVADR 264 Query: 581 GLVLCNALLDMYVKCEEIGTAMKIFDRISEKDSVSWNIMIMGFNNVGEFDLAYKAFEEMP 760 GL+L NALLDMY KC E+ TAM++FD E+D +SWN M+ GF N G DLA K F E+P Sbjct: 265 GLILSNALLDMYAKCGEMNTAMRVFDEAGERDGISWNTMVAGFANAGLLDLASKYFGEVP 324 Query: 761 EKDLVSWNSLLSGYLQKGNYKRVIELFHFMLSQNDVKPD 877 +D++SWN+LL+GY + + + LFH ML+ + V PD Sbjct: 325 ARDIISWNALLAGYARYEEFSATMILFHDMLA-SSVIPD 362