BLASTX nr result
ID: Dioscorea21_contig00004973
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00004973 (961 letters)

Database: ./nr
23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_002304774.1| predicted protein [Populus trichocarpa] gi|2...   337   2e-90
emb|CBI17752.3| unnamed protein product [Vitis vinifera]              328   2e-87
ref|XP_003598903.1| Pentatricopeptide repeat-containing protein ...   325   1e-86
ref|XP_003555182.1| PREDICTED: pentatricopeptide repeat-containi...   318   1e-84
ref|XP_004140286.1| PREDICTED: pentatricopeptide repeat-containi...   318   2e-84

>ref|XP_002304774.1| predicted protein [Populus trichocarpa]
 gi|222842206|gb|EEE79753.1| predicted protein [Populus trichocarpa]
 Length = 728

 Score = 337 bits (865), Expect = 2e-90
 Identities = 169/325 (52%), Positives = 225/325 (69%), Gaps = 6/325 (1%)
 Frame = +3

Query: 3 RPSQNRPIVHGGLFTNRKALXXXXXXXXXXXXXTVDFRHWDXXXXXXXXXXXXXXXXXXX 182
+PSQNRP+V GGLFTNR+ + D WD
Sbjct: 24 KPSQNRPVVRGGLFTNRQTVKPQPPKNPITPFKPFDLHKWDPQQNLPHQPQPSKPQSPRS 83

Query: 183 XXXXXXXX-----ARFISDLLRRHRH-WSPALLSELSKLRRVSPDLVAEVLRFDPNPNPS 344
ARFI D R++R+ W P +++EL KLRRV+PDLVAEVL+ + NP
Sbjct: 84 RHSLALSQRLSPIARFILDAFRKNRNQWGPEVVTELCKLRRVTPDLVAEVLKVENNPQ-- 141

Query: 345 LSTRFFHWASRQKGFHHSYSSFNAFAHSLYRSGHPRAADRVPDLMLALGKPPSETQLELL 524
L+T+FFHWA +QKGF H+++S+NAFA++L RS RAAD++P+LM A GKPP+E Q E+L
Sbjct: 142 LATKFFHWAGKQKGFKHTFASYNAFAYNLNRSNFFRAADQLPELMEAQGKPPTEKQFEIL 201

Query: 525 VRLHSQSRRGLRLFHVYKRMTSIFNIKPRVFLYNRILDSLIRTDHLDLALSVYDDMIRND 704
+R+HS + RGLR+++VY++M F +KPRVFLYNRI+DSLI+T HLDLALSVY+D R+
Sbjct: 202 IRMHSDANRGLRVYYVYQKMVK-FGVKPRVFLYNRIMDSLIKTGHLDLALSVYEDFRRDG 260

Query: 705 VKEEPITFTILAKGLCKAGRIDAALDLLEKMRRDVCKPDVFAYTAMVKVLAAEGNVDGCL 884
+ EE +T+ IL KGLCKAGRI+ +++L +MR ++CKPDVFAYTAMV+ LA EGN+D CL
Sbjct: 261 LVEESVTYMILIKGLCKAGRIEEMMEVLGRMRENLCKPDVFAYTAMVRALAGEGNLDACL 320

Query: 885 EVWNQMKMDGVQPDVMAYATLISGL 959
VW +MK DGV+PDVMAY TL++ L
Sbjct: 321 RVWEEMKRDGVEPDVMAYVTLVTAL 345

 Score = 65.5 bits (158), Expect = 2e-08
 Identities = 49/215 (22%), Positives = 93/215 (43%)
 Frame = +3

Query: 312 VLRFDPNPNPSLSTRFFHWASRQKGFHHSYSSFNAFAHSLYRSGHPRAADRVPDLMLALG 491
++R + N L + + + G +N SL ++GH A V + G
Sbjct: 201 LIRMHSDANRGLRVYYVYQKMVKFGVKPRVFLYNRIMDSLIKTGHLDLALSVYEDFRRDG 260

Query: 492 KPPSETQLELLVRLHSQSRRGLRLFHVYKRMTSIFNIKPRVFLYNRILDSLIRTDHLDLA 671
+L++ ++ R + V RM KP VF Y ++ +L +LD
Sbjct: 261 LVEESVTYMILIKGLCKAGRIEEMMEVLGRMRENL-CKPDVFAYTAMVRALAGEGNLDAC 319

Query: 672 LSVYDDMIRNDVKEEPITFTILAKGLCKAGRIDAALDLLEKMRRDVCKPDVFAYTAMVKV 851
L V+++M R+ V+ + + + L LCK GR+D ++ ++M+ D Y +V+
Sbjct: 320 LRVWEEMKRDGVEPDVMAYVTLVTALCKGGRVDKGYEVFKEMKGRRILIDRGIYGILVEA 379

Query: 852 LAAEGNVDGCLEVWNQMKMDGVQPDVMAYATLISG 956
A+G + ++ + G + D+ Y +LI G
Sbjct: 380 FVADGKIGLACDLLKDLVDSGYRADLRIYNSLIEG 414

>emb|CBI17752.3| unnamed protein product [Vitis vinifera]
 Length = 729

 Score = 328 bits (840), Expect = 2e-87
 Identities = 164/321 (51%), Positives = 221/321 (68%), Gaps = 2/321 (0%)
 Frame = +3

Query: 3 RPSQNRPIVHGGLFTNRKALXXXXXXXXXXXXXTVDFRHWDXXXXXXXXXXXXXXXXXXX 182
+PSQNRP VHGGLF+NR L + ++WD
Sbjct: 21 KPSQNRPTVHGGLFSNRTTLNPKPPTLQNPTTH-FNLQNWDPDSPKALAIPPSKTPCERF 79

Query: 183 XXXXXXXX--ARFISDLLRRHRHWSPALLSELSKLRRVSPDLVAEVLRFDPNPNPSLSTR 356
AR+I D R+HR+W P ++++L+KLRRV+P LVAEVL+ +P + ++
Sbjct: 80 FDIAKNLSPIARYICDSFRKHRNWGPPVVADLNKLRRVTPVLVAEVLKVQTDP--VICSK 137

Query: 357 FFHWASRQKGFHHSYSSFNAFAHSLYRSGHPRAADRVPDLMLALGKPPSETQLELLVRLH 536
FFHWA +QKG+ H+++S+NAFA+ L RS RAAD+VP+LM GKPPSE Q E+L+R+H
Sbjct: 138 FFHWAGKQKGYKHNFASYNAFAYCLNRSNQFRAADQVPELMNMQGKPPSEKQFEILIRMH 197

Query: 537 SQSRRGLRLFHVYKRMTSIFNIKPRVFLYNRILDSLIRTDHLDLALSVYDDMIRNDVKEE 716
+ RGLR+++VY++M F IKPRVFLYNRI+D L++T HLDLA+SVY+D + + EE
Sbjct: 198 IDANRGLRVYYVYEKMKK-FGIKPRVFLYNRIMDGLVKTGHLDLAMSVYEDFKEDGLVEE 256

Query: 717 PITFTILAKGLCKAGRIDAALDLLEKMRRDVCKPDVFAYTAMVKVLAAEGNVDGCLEVWN 896
+T+ IL KGLCKAGRID L+LL++MR ++CKPDVFAYTAMVKVL AEGN+DGCL VW
Sbjct: 257 SVTYMILVKGLCKAGRIDEVLELLDRMRGNLCKPDVFAYTAMVKVLVAEGNLDGCLRVWE 316

Query: 897 QMKMDGVQPDVMAYATLISGL 959
+M+ D V+PDVMAY TL++ L
Sbjct: 317 EMRKDKVEPDVMAYTTLVAAL 337

 Score = 60.8 bits (146), Expect = 5e-07
 Identities = 36/122 (29%), Positives = 57/122 (46%), Gaps = 1/122 (0%)
 Frame = +3

Query: 597 NIKPRVFLYNRILDSLIRTDHLDLALSVYDDMIRNDVKEEPITFTILAKGLCKAGRIDAA 776
N KP Y+ + + + A + Y+ +I + L KGLCK+ IDAA
Sbjct: 531 NFKPDSSTYSNAIICFVEVGDVQEACACYNKIIEMCQLPSVAAYRSLVKGLCKSEEIDAA 590

Query: 777 LDLLEKMRRDVCK-PDVFAYTAMVKVLAAEGNVDGCLEVWNQMKMDGVQPDVMAYATLIS 953
+ L+ +V P F YT + GN + ++V N+M +G PD + Y+ LIS
Sbjct: 591 IMLVRDCLANVTSGPMEFKYTLTILHACKSGNAEKVIDVLNEMMQEGCTPDEVTYSALIS 650

Query: 954 GL 959
G+
Sbjct: 651 GM 652

>ref|XP_003598903.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
 gi|355487951|gb|AES69154.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
 Length = 767

 Score = 325 bits (832), Expect = 1e-86
 Identities = 162/324 (50%), Positives = 218/324 (67%), Gaps = 5/324 (1%)
 Frame = +3

Query: 3 RPSQNRPIVHGGLFTNRKALXXXXXXXXXXXXXTVDFRHWDXXXXXXXXXXXXXXXXXXX 182
+PSQNRP V GGLF+NRK L + + WD
Sbjct: 20 KPSQNRPTVRGGLFSNRKTLTPPKPKSTKPTN-SFQIQKWDPHFLSQPNSPSPSPSPSPE 78

Query: 183 XXXXXXXX----ARFISDLLRRHRH-WSPALLSELSKLRRVSPDLVAEVLRFDPNPNPSL 347
ARFI D R++ + W P +++EL+KLRRV+P LVAEVL+ NP +L
Sbjct: 79 ATFSASLRLSPIARFILDAFRKNNNNWGPPVVTELNKLRRVTPTLVAEVLKVQTNP--TL 136

Query: 348 STRFFHWASRQKGFHHSYSSFNAFAHSLYRSGHPRAADRVPDLMLALGKPPSETQLELLV 527
+ +FFHW +QKG+HH+++S+NAF + L R+ H RAAD++P+LM A GKPPSE Q E+L+
Sbjct: 137 AFKFFHWVEKQKGYHHNFASYNAFTYCLNRANHFRAADQLPELMDAQGKPPSEKQFEILI 196

Query: 528 RLHSQSRRGLRLFHVYKRMTSIFNIKPRVFLYNRILDSLIRTDHLDLALSVYDDMIRNDV 707
R+HS + RGLR++HVY +M + F +KPRVFLYNRI+D+L++T HLDLALSVY+D + +
Sbjct: 197 RMHSDAGRGLRVYHVYDKMRNKFGVKPRVFLYNRIMDALVKTGHLDLALSVYNDFREDGL 256

Query: 708 KEEPITFTILAKGLCKAGRIDAALDLLEKMRRDVCKPDVFAYTAMVKVLAAEGNVDGCLE 887
EE +TF IL KGLCK G+ID L++L +MR +CKPDVFAYTA+V+++ EGN+DGCL
Sbjct: 257 VEESVTFMILIKGLCKGGKIDEMLEVLGRMREKLCKPDVFAYTALVRIMVKEGNLDGCLR 316

Query: 888 VWNQMKMDGVQPDVMAYATLISGL 959
VW +MK D V PDVMAY T+I GL
Sbjct: 317 VWKEMKRDRVDPDVMAYGTIIGGL 340

>ref|XP_003555182.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like [Glycine max]
 Length = 733

 Score = 318 bits (815), Expect = 1e-84
 Identities = 160/319 (50%), Positives = 216/319 (67%), Gaps = 1/319 (0%)
 Frame = +3

Query: 6 PSQNRPIVHGGLFTNRKALXXXXXXXXXXXXXTVDFRHWDXXXXXXXXXXXXXXXXXXXX 185
PSQNRP V GGLF+NR+ L + ++WD
Sbjct: 36 PSQNRPTVRGGLFSNRQTL-NPNPSQPKPTTKPFNIKNWD-PHFLSNPNSNPSPSTLSSA 93

Query: 186 XXXXXXXARFISDLLRRH-RHWSPALLSELSKLRRVSPDLVAEVLRFDPNPNPSLSTRFF 362
ARFI D RR+ W P + +ELSKLRR++P+LVAEVL+ N +L+++FF
Sbjct: 94 SLRLSPIARFIVDAFRRNDNKWCPNVAAELSKLRRITPNLVAEVLKV--QTNHTLASKFF 151

Query: 363 HWASRQKGFHHSYSSFNAFAHSLYRSGHPRAADRVPDLMLALGKPPSETQLELLVRLHSQ 542
HWA Q+G+HH+++S+NA A+ L R RAAD++P+LM + GKPPSE Q E+L+R+HS
Sbjct: 152 HWAGSQRGYHHNFASYNALAYCLNRHHQFRAADQLPELMESQGKPPSEKQFEILIRMHSD 211

Query: 543 SRRGLRLFHVYKRMTSIFNIKPRVFLYNRILDSLIRTDHLDLALSVYDDMIRNDVKEEPI 722
+ RGLR++HVY++M + F +KPRVFLYNR++D+L+RT HLDLALSVYDD+ + + EE +
Sbjct: 212 ANRGLRVYHVYEKMRNKFGVKPRVFLYNRVMDALVRTGHLDLALSVYDDLKEDGLVEESV 271

Query: 723 TFTILAKGLCKAGRIDAALDLLEKMRRDVCKPDVFAYTAMVKVLAAEGNVDGCLEVWNQM 902
TF +L KGLCK GRID L++L +MR +CKPDVFAYTA+VK+L GN+D CL VW +M
Sbjct: 272 TFMVLVKGLCKCGRIDEMLEVLGRMRERLCKPDVFAYTALVKILVPAGNLDACLRVWEEM 331

Query: 903 KMDGVQPDVMAYATLISGL 959
K D V+PDV AYAT+I GL
Sbjct: 332 KRDRVEPDVKAYATMIVGL 350

 Score = 73.2 bits (178), Expect = 9e-11
 Identities = 66/248 (26%), Positives = 106/248 (42%), Gaps = 3/248 (1%)
 Frame = +3

Query: 225 LLRRHRHWSPALLSELSKLRRVSP-DLVAEVL-RFDPNPNPSLSTRFFHWASRQK-GFHH 395
L R H+ + L EL + + P + E+L R + N L + R K G
Sbjct: 174 LNRHHQFRAADQLPELMESQGKPPSEKQFEILIRMHSDANRGLRVYHVYEKMRNKFGVKP 233

Query: 396 SYSSFNAFAHSLYRSGHPRAADRVPDLMLALGKPPSETQLELLVRLHSQSRRGLRLFHVY 575
+N +L R+GH A V D + G +LV+ + R + V
Sbjct: 234 RVFLYNRVMDALVRTGHLDLALSVYDDLKEDGLVEESVTFMVLVKGLCKCGRIDEMLEVL 293

Query: 576 KRMTSIFNIKPRVFLYNRILDSLIRTDHLDLALSVYDDMIRNDVKEEPITFTILAKGLCK 755
RM KP VF Y ++ L+ +LD L V+++M R+ V+ + + + GL K
Sbjct: 294 GRMRERL-CKPDVFAYTALVKILVPAGNLDACLRVWEEMKRDRVEPDVKAYATMIVGLAK 352

Query: 756 AGRIDAALDLLEKMRRDVCKPDVFAYTAMVKVLAAEGNVDGCLEVWNQMKMDGVQPDVMA 935
GR+ +L +M+ C D Y A+V+ AEG V+ ++ + G + D+
Sbjct: 353 GGRVQEGYELFREMKGKGCLVDRVIYGALVEAFVAEGKVELAFDLLKDLVSSGYRADLGI 412

Query: 936 YATLISGL 959
Y LI GL
Sbjct: 413 YICLIEGL 420

>ref|XP_004140286.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like [Cucumis sativus]
 gi|449531474|ref|XP_004172711.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like [Cucumis sativus]
 Length = 726

 Score = 318 bits (814), Expect = 2e-84
 Identities = 164/324 (50%), Positives = 222/324 (68%), Gaps = 6/324 (1%)
 Frame = +3

Query: 6 PSQNRPIVHGGLFTNRKALXXXXXXXXXXXXXTVDFRH-WDXXXXXXXXXXXXXXXXXXX 182
P Q+RP V+GG FTNR++L H WD
Sbjct: 20 PHQHRPTVYGGFFTNRRSLPPPSPHQPTSPKPQPFLLHNWDPDLPSQKRSNLPSSTSDAF 79

Query: 183 XXXXXXXX--ARFISDLLRRHRH-WSPALLSELSKLRRVSPDLVAEVLRFDP--NPNPSL 347
ARFI D+ R++++ W P ++SEL+KLRRV+PDLVAEVL+ + N L
Sbjct: 80 FSTSLRLSPIARFIVDVFRKNQNQWGPPVISELNKLRRVTPDLVAEVLKASHRRDSNSIL 139

Query: 348 STRFFHWASRQKGFHHSYSSFNAFAHSLYRSGHPRAADRVPDLMLALGKPPSETQLELLV 527
+++FF+WA +QKGFHH+++S+NAFA+ L R RAAD++P+LM + GKPPSE Q E+L+
Sbjct: 140 ASKFFYWAGKQKGFHHTFASYNAFAYCLNRHNRFRAADQIPELMDSQGKPPSEKQFEILI 199

Query: 528 RLHSQSRRGLRLFHVYKRMTSIFNIKPRVFLYNRILDSLIRTDHLDLALSVYDDMIRNDV 707
R+H + RGLR+++VY++M F + PRVFLYNRILD+L++TDHLDLAL+VY D N +
Sbjct: 200 RMHCDANRGLRVYYVYEKMKK-FGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGL 258

Query: 708 KEEPITFTILAKGLCKAGRIDAALDLLEKMRRDVCKPDVFAYTAMVKVLAAEGNVDGCLE 887
EE +TF IL KGLCKAGR+D L+LL +MR ++CKPDVFAYTAMVKVLA++ N++GCL
Sbjct: 259 VEESVTFMILIKGLCKAGRVDEMLELLARMRANLCKPDVFAYTAMVKVLASKDNLEGCLR 318

Query: 888 VWNQMKMDGVQPDVMAYATLISGL 959
VW++M+ D V+PDVMAY TLI GL
Sbjct: 319 VWDEMRADRVEPDVMAYGTLIIGL 342