BLASTX nr result
ID: Dioscorea21_contig00028524
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Dioscorea21_contig00028524 (1253 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002268109.1| PREDICTED: pentatricopeptide repeat-containi... 335 7e-90 ref|XP_002321108.1| predicted protein [Populus trichocarpa] gi|2... 327 5e-87 ref|XP_002516403.1| pentatricopeptide repeat-containing protein,... 326 9e-87 ref|XP_003552730.1| PREDICTED: pentatricopeptide repeat-containi... 318 2e-84 ref|XP_003538531.1| PREDICTED: pentatricopeptide repeat-containi... 310 4e-82 >ref|XP_002268109.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190, chloroplastic-like [Vitis vinifera] Length = 581 Score = 335 bits (859), Expect(2) = 7e-90 Identities = 165/312 (52%), Positives = 224/312 (71%), Gaps = 6/312 (1%) Frame = +2 Query: 44 FLKKVVDAGLQPMAETYHGIIRAYGTCGMYDELSKCVKQMESVGCSPNEVTYNILIVEFA 223 FL ++ L ETY G+I++YG MYDEL +CVK+MES GC P+ +TYN+LI EF+ Sbjct: 222 FLNELKANNLIADTETYDGLIQSYGKYKMYDELDECVKKMESDGCLPDHITYNLLIQEFS 281 Query: 224 QGGLVDTMEGVYRTVLSKRMNLQPSTLVAMLEAYANLGIVEKMEKVYRMVMKTNAYIKDT 403 +GGL+ ME V++TVLSK+M LQ STLV MLEAYAN GI+EKME YR V+ + +KD Sbjct: 282 RGGLLKRMERVFQTVLSKKMGLQSSTLVVMLEAYANFGIIEKMENAYRRVLNSKTSLKDD 341 Query: 404 VVRKLAAVYIENYRFARLEELGNDISARTGRTDLVWCILLLASACFLSRKGIESIIREMK 583 ++RKLA VYIENY+F+RL ++G ++++ T RTDLVWC+ LL+ AC LSRKG++SI++EM+ Sbjct: 342 LIRKLAEVYIENYKFSRLADMGLNLASVTSRTDLVWCLRLLSHACLLSRKGLDSIVKEME 401 Query: 584 VAKVEFKVTFVNILALFLLKVKDFSKLDAVLSQTGKHNRKPDIITVGILFDACRIGYDGT 763 V + T N + L LK+KDF++L +L + + KPDI+TVGILFDA RIG++GT Sbjct: 402 AKNVPWNATVANTILLAYLKMKDFTRLRILLLELSTRHVKPDIVTVGILFDANRIGFNGT 461 Query: 764 RILEIWRRSGFLETEAEMRTDPLVLNTFGKGLFMRDCERIFSSAGSNLKEKKHWTYHSLI 943 L WRR+GFL+ EM TDPLVL+ FGKG F++ CE ++SS ++KK WTY +LI Sbjct: 462 MALNTWRRTGFLDEAVEMNTDPLVLSAFGKGNFLQSCEEMYSSLEPEARKKKIWTYQNLI 521 Query: 944 ------SLVFGK 961 L+FGK Sbjct: 522 DLDGLLGLLFGK 533 Score = 23.5 bits (49), Expect(2) = 7e-90 Identities = 10/18 (55%), Positives = 11/18 (61%) Frame = +1 Query: 1 HGFARKKEFDNSACFFKE 54 H FARK EFD + F E Sbjct: 208 HCFARKGEFDRALYFLNE 225 >ref|XP_002321108.1| predicted protein [Populus trichocarpa] gi|222861881|gb|EEE99423.1| predicted protein [Populus trichocarpa] Length = 419 Score = 327 bits (837), Expect = 5e-87 Identities = 158/303 (52%), Positives = 218/303 (71%) Frame = +2 Query: 44 FLKKVVDAGLQPMAETYHGIIRAYGTCGMYDELSKCVKQMESVGCSPNEVTYNILIVEFA 223 +L ++ + L P ++TY G+I AYGT MYDE++ C+K+ME GCSP+ TYN+LI +FA Sbjct: 117 YLNQMNEMNLSPESDTYDGLIEAYGTYRMYDEMAMCLKKMELDGCSPDRYTYNLLIQKFA 176 Query: 224 QGGLVDTMEGVYRTVLSKRMNLQPSTLVAMLEAYANLGIVEKMEKVYRMVMKTNAYIKDT 403 QGGL+ ME VY+++ +KRM LQ STL++MLEAYAN GIVEKMEK+ R + +K+ Sbjct: 177 QGGLLTRMERVYQSMRTKRMKLQSSTLISMLEAYANFGIVEKMEKILRWAWNSKITVKED 236 Query: 404 VVRKLAAVYIENYRFARLEELGNDISARTGRTDLVWCILLLASACFLSRKGIESIIREMK 583 +VRKLA VYI NY F+RL +L D+++ TGRTD+VWC+ LL+ AC LSR+G+++++REM+ Sbjct: 237 LVRKLAGVYIANYMFSRLHDLAVDLTSITGRTDIVWCLHLLSHACLLSRRGMDAVVREME 296 Query: 584 VAKVEFKVTFVNILALFLLKVKDFSKLDAVLSQTGKHNRKPDIITVGILFDACRIGYDGT 763 AK + +T NI+ L LK+KDF++L +LS+ + +PDI+T GILFDA IG+DG Sbjct: 297 DAKACWNITVANIILLAYLKMKDFTRLRILLSKLPEIRVEPDIVTFGILFDAEEIGFDGK 356 Query: 764 RILEIWRRSGFLETEAEMRTDPLVLNTFGKGLFMRDCERIFSSAGSNLKEKKHWTYHSLI 943 LE+WR+ G L EM TDPL L+ FGKG F+R CE +SS N +EKK WTY I Sbjct: 357 ECLEMWRKMGLLYRRVEMNTDPLALSAFGKGSFLRSCEEGYSSLEPNAREKKRWTYVDFI 416 Query: 944 SLV 952 +LV Sbjct: 417 NLV 419 >ref|XP_002516403.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223544501|gb|EEF46020.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 502 Score = 326 bits (835), Expect = 9e-87 Identities = 157/303 (51%), Positives = 219/303 (72%) Frame = +2 Query: 44 FLKKVVDAGLQPMAETYHGIIRAYGTCGMYDELSKCVKQMESVGCSPNEVTYNILIVEFA 223 +L + + L P+++TY+G+I+AYG MYDE+ C+K+ME GCSP+ VTYN+LI E A Sbjct: 190 YLNHLKEINLSPVSDTYNGLIQAYGKYKMYDEMGMCLKKMEMEGCSPDHVTYNLLIQELA 249 Query: 224 QGGLVDTMEGVYRTVLSKRMNLQPSTLVAMLEAYANLGIVEKMEKVYRMVMKTNAYIKDT 403 + GL+ ME VY+T RM+L+ +TL AMLEAYAN GIVEKME + + + A +K+ Sbjct: 250 EAGLLTRMEKVYQTTRMNRMDLKSTTLTAMLEAYANFGIVEKMELILKRTRNSKALLKED 309 Query: 404 VVRKLAAVYIENYRFARLEELGNDISARTGRTDLVWCILLLASACFLSRKGIESIIREMK 583 +++K+A VYIEN+ F+RLE+LG+ +S R+G+ D+VWC+LLL++AC LS+KG++S++REMK Sbjct: 310 LIKKIALVYIENFMFSRLEKLGHYLSKRSGQNDMVWCLLLLSNACMLSQKGMDSVVREMK 369 Query: 584 VAKVEFKVTFVNILALFLLKVKDFSKLDAVLSQTGKHNRKPDIITVGILFDACRIGYDGT 763 VAKV + VTF+NI+ L LK+KD +L +LS H KPDI+TVG+LFDA IG+ G Sbjct: 370 VAKVSWNVTFINIILLAYLKMKDSMRLGILLSTLTNHIVKPDIVTVGVLFDANNIGFHGN 429 Query: 764 RILEIWRRSGFLETEAEMRTDPLVLNTFGKGLFMRDCERIFSSAGSNLKEKKHWTYHSLI 943 ILE WRR+G L E TDPLVL FGKG F++ CE +SS ++K+ WTY +LI Sbjct: 430 GILETWRRTGILYRCVETETDPLVLAAFGKGQFLKKCEEAYSSLEPVARQKEKWTYCNLI 489 Query: 944 SLV 952 LV Sbjct: 490 DLV 492 >ref|XP_003552730.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190, chloroplastic-like [Glycine max] Length = 509 Score = 318 bits (815), Expect = 2e-84 Identities = 152/303 (50%), Positives = 220/303 (72%) Frame = +2 Query: 44 FLKKVVDAGLQPMAETYHGIIRAYGTCGMYDELSKCVKQMESVGCSPNEVTYNILIVEFA 223 F+ ++ ++GL+ +ETY G++ AYG MYDE+ +CVK+ME GCSP+ +TYNILI E+A Sbjct: 194 FIDEMKESGLELDSETYDGLLGAYGKFQMYDEMGECVKKMELEGCSPDHITYNILIQEYA 253 Query: 224 QGGLVDTMEGVYRTVLSKRMNLQPSTLVAMLEAYANLGIVEKMEKVYRMVMKTNAYIKDT 403 + GL+ ME +Y+ ++SKRM++Q STLVAMLEAY G+VEKME YR ++ + ++D Sbjct: 254 RAGLLQRMEKLYQRMVSKRMHVQSSTLVAMLEAYTTFGMVEKMENFYRKILSSKTCLEDD 313 Query: 404 VVRKLAAVYIENYRFARLEELGNDISARTGRTDLVWCILLLASACFLSRKGIESIIREMK 583 ++RK+A VYI+NY F+RLE+L D+ G ++LVWC+ LL+ AC LS+KG++ ++REM+ Sbjct: 314 LIRKVAEVYIKNYMFSRLEDLALDLCPAFGESNLVWCLRLLSYACPLSKKGMDIVVREMR 373 Query: 584 VAKVEFKVTFVNILALFLLKVKDFSKLDAVLSQTGKHNRKPDIITVGILFDACRIGYDGT 763 AKV + VT NI+ L +K+KDF L +LSQ + +PDIIT+GILFDA RIG+DG+ Sbjct: 374 DAKVNWNVTVANIIMLAYVKMKDFRHLKILLSQLPIYRVQPDIITIGILFDATRIGFDGS 433 Query: 764 RILEIWRRSGFLETEAEMRTDPLVLNTFGKGLFMRDCERIFSSAGSNLKEKKHWTYHSLI 943 LE WRR G+L E++TD LVL FGKG F++ CE ++SS +++K WTYH LI Sbjct: 434 GALETWRRMGYLYRVVEIKTDSLVLTAFGKGHFLKSCEEVYSSLHPEDRKRKTWTYHDLI 493 Query: 944 SLV 952 +L+ Sbjct: 494 ALL 496 >ref|XP_003538531.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14190, chloroplastic-like [Glycine max] Length = 506 Score = 310 bits (795), Expect = 4e-82 Identities = 152/303 (50%), Positives = 216/303 (71%) Frame = +2 Query: 44 FLKKVVDAGLQPMAETYHGIIRAYGTCGMYDELSKCVKQMESVGCSPNEVTYNILIVEFA 223 F+ ++ ++GL+ +ETY G+I AYG MYDE+ +CVK+ME GCSP+ +TYNILI E+A Sbjct: 192 FIDEMKESGLELDSETYDGLIGAYGKFQMYDEMGECVKKMELEGCSPDPITYNILIQEYA 251 Query: 224 QGGLVDTMEGVYRTVLSKRMNLQPSTLVAMLEAYANLGIVEKMEKVYRMVMKTNAYIKDT 403 GGL+ ME +Y+ +LSKRM+++ STLVAMLEAY G+VEKMEK YR ++ + I+D Sbjct: 252 GGGLLQRMEKLYQRMLSKRMHVKSSTLVAMLEAYTTFGMVEKMEKFYRKILNSKTCIEDD 311 Query: 404 VVRKLAAVYIENYRFARLEELGNDISARTGRTDLVWCILLLASACFLSRKGIESIIREMK 583 ++RK+A VYI N+ F+RLE+L D+ G ++L WC LL+ AC LS+KG++ +++EM+ Sbjct: 312 LIRKVAEVYINNFMFSRLEDLALDLCPAFGESNLEWCFRLLSYACLLSKKGMDIVVQEMQ 371 Query: 584 VAKVEFKVTFVNILALFLLKVKDFSKLDAVLSQTGKHNRKPDIITVGILFDACRIGYDGT 763 AKV + VT NI+ L +K+K+F L +LSQ + +PDIIT+GILFDA RIG+DG+ Sbjct: 372 DAKVSWNVTVANIIMLAYVKMKEFRHLRILLSQLPIYRVQPDIITIGILFDATRIGFDGS 431 Query: 764 RILEIWRRSGFLETEAEMRTDPLVLNTFGKGLFMRDCERIFSSAGSNLKEKKHWTYHSLI 943 LE WRR G+L EM+TD LVL FGKG F++ CE ++SS +++K TYH LI Sbjct: 432 GALETWRRMGYLYRVVEMKTDSLVLTAFGKGHFLKSCEEVYSSLHPEDRKRKTCTYHDLI 491 Query: 944 SLV 952 L+ Sbjct: 492 PLL 494