BLASTX nr result
ID: Dioscorea21_contig00019491
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Dioscorea21_contig00019491 (1157 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002314482.1| predicted protein [Populus trichocarpa] gi|2... 195 2e-47 ref|XP_002883427.1| nucleic acid binding protein [Arabidopsis ly... 168 2e-39 ref|NP_188979.2| RNA recognition motif-containing protein [Arabi... 161 3e-37 ref|NP_001118682.1| RNA recognition motif-containing protein [Ar... 149 2e-33 gb|ABF59161.1| unknown protein [Arabidopsis thaliana] 124 3e-26 >ref|XP_002314482.1| predicted protein [Populus trichocarpa] gi|222863522|gb|EEF00653.1| predicted protein [Populus trichocarpa] Length = 702 Score = 195 bits (496), Expect = 2e-47 Identities = 116/296 (39%), Positives = 182/296 (61%), Gaps = 2/296 (0%) Frame = +1 Query: 271 NMLLVCFLPKTAKVKDLIQAFGDSGPISEICILPSRE--NRFNYAQVFFKTKEGLKKALS 444 N LL+ FL K D+I F + GPIS+I + S + N F+ A + F+T++GL KAL Sbjct: 405 NKLLLRFLHKDVGDGDIISCFRNCGPISKIEKVSSVKGSNLFD-AFLHFETRQGLHKALE 463 Query: 445 KTDVAVGGADVAMEAAITPSEICDRMFCTERVNNAYSPRHFLKHPSRTVVLDGLIPDDLS 624 K +V + ++ + + R+ + + +KHP+RTV + L DD+S Sbjct: 464 KPEVLIKNSNAFIH------DTASRISIPNLIGDIDISVALVKHPTRTVKIKQLT-DDIS 516 Query: 625 NYDLKCALSTWGRITSIVNGTSYHTVFVEFESEKSKERALAKATFSISGRTLSILRVDAP 804 ++ LK ALS ++ G S +VEFESE +KERALAK +SG+ LSI RVDAP Sbjct: 517 SHQLKEALSFCRSGINVFLGASSSNAYVEFESEDAKERALAKHFLQVSGKQLSIFRVDAP 576 Query: 805 KTTIIRISNVNPASGAIEVHSICDSYGKVKKVTERYIDTFDIHFKLSEWPNMLKIINSLN 984 +TT++RI N+NP + V +IC S+GK+ ++ R+ + D++FK+ EWPNML I+NSLN Sbjct: 577 RTTVVRILNINPQCRS-NVLTICKSFGKLWRMKLRHENIADVYFKIDEWPNMLNILNSLN 635 Query: 985 GLKVDQHKWIAQPATLIPVEVLQALWNKPEGQKQVHELVRNICERINDESIDTSSL 1152 GL+ D +W+AQPA++ P +LQALWN P+ ++ V ++ + +++ + +DT+ L Sbjct: 636 GLEADGSRWVAQPASIFPPIILQALWNHPDERRHVISSMQCLLKKL-EHPMDTAEL 690 >ref|XP_002883427.1| nucleic acid binding protein [Arabidopsis lyrata subsp. lyrata] gi|297329267|gb|EFH59686.1| nucleic acid binding protein [Arabidopsis lyrata subsp. lyrata] Length = 785 Score = 168 bits (426), Expect = 2e-39 Identities = 116/380 (30%), Positives = 191/380 (50%), Gaps = 10/380 (2%) Frame = +1 Query: 13 NMPYEVTESEENEDSNCLKLETILLDKDEKAVDTSDGLNSLFISDESDSDISAFEYPN-- 186 N+P + + N E + K A SDG + +ES + E + Sbjct: 396 NLPTDNGMDAQANSLNASSSEEKSISKSNSAFQKSDGFCATE-EEESKGETLVMENQSLC 454 Query: 187 ------PEPNNEVLSSMKGFHLNLNQHNEKLARNNMLLVCFLPKTAKVKDLIQAFGDSGP 348 N ++ F L+ +H+ N +L+ FL ++ + K +++ F G Sbjct: 455 SQATLAATTANPKVTKKSLFALSAGEHSP-----NKVLLRFLQESCQKKHIVEVFSQFGA 509 Query: 349 ISEICILPSRENR-FNYAQVFFKTKEGLKKALSKTDVAVGGADVAMEAAITPSEICDRMF 525 + + +PS E + A + F+T +KKAL K V V + +EA + ++ +R+ Sbjct: 510 VLHVQEIPSFEGCIYKDALLTFETNTAVKKALEKGRVTVMNNNAVVEAT-SQEDMVERIC 568 Query: 526 CTERVNNAYSPRHFLKHPSRTVVLDGLIPDDLSNYDLKCALSTWGRITSIVNGTSYHTVF 705 + + + P +K PSRTV + L D SN + I+ + G+S F Sbjct: 569 IPDLIGDPDVPVALVKEPSRTVKIHPLTHDFSSNQIKEALKFCRSNISKFILGSSRTDAF 628 Query: 706 VEFESEKSKERALAKATFSISGRTLSILRVDAPKTTIIRISNVNPASGAIEVHSICDSYG 885 VEFE+E KERALA+ + SI L I R+D P+TT+ RISN++ S +V ++C YG Sbjct: 629 VEFETEDGKERALAEHSISICNTQLFISRIDIPRTTVARISNLS-KSAMKDVRALCVPYG 687 Query: 886 KVKKVTERYIDTFDIHFKLSEWPNMLKIINSLNGLKVDQHKWIAQPA-TLIPVEVLQALW 1062 ++K+V R D+ F +SEWPNML I+NSLNG+ +D K + +PA T+IP E+L+ LW Sbjct: 688 QIKQVYIRGNGVVDVLFDVSEWPNMLNILNSLNGMGIDGKKLVVRPATTVIPPEILRVLW 747 Query: 1063 NKPEGQKQVHELVRNICERI 1122 P+ ++ V +++N+ I Sbjct: 748 KDPQEKRYVKSVIQNLVREI 767 >ref|NP_188979.2| RNA recognition motif-containing protein [Arabidopsis thaliana] gi|11994322|dbj|BAB02281.1| unnamed protein product [Arabidopsis thaliana] gi|332643236|gb|AEE76757.1| RNA recognition motif-containing protein [Arabidopsis thaliana] Length = 811 Score = 161 bits (408), Expect = 3e-37 Identities = 98/287 (34%), Positives = 161/287 (56%), Gaps = 3/287 (1%) Frame = +1 Query: 271 NMLLVCFLPKTAKVKDLIQAFGDS-GPISEICILPSRENR-FNYAQVFFKTKEGLKKALS 444 N +L+ FLP+++ K +++AF G + + +PS E + A + F+T +KKAL Sbjct: 509 NKVLLRFLPESSMKKHIVKAFSSQFGAVLHVQEIPSIEGCIYKDALLTFETNTAVKKALK 568 Query: 445 KTDVAVGGADVAMEAAITPSEICDRMFCTERVNNAYSPRHFLKHPSRTVVLDGLIPDDLS 624 K V V + +EA + ++ +R+ + + + P +K P+RTV + L D S Sbjct: 569 KGHVTVMNYNTVVEAT-SQEDMVERICIPDLIGDPDVPVALVKEPARTVKIHPLTHDFSS 627 Query: 625 NYDLKCALSTWGRITSIVNGTSYHTVFVEFESEKSKERALAKATFSISGRTLSILRVDAP 804 N + I+ G+S FVEFE+E KERALA+ + SI L I R+D P Sbjct: 628 NQIKEALKFCRSNISKFTLGSSRTDAFVEFETEDGKERALAEHSISICNTQLFISRIDIP 687 Query: 805 KTTIIRISNVNPASGAIEVHSICDSYGKVKKVTERYIDTFDIHFKLSEWPNMLKIINSLN 984 +T + RISN++ S +V ++C YG+++ V R D+ F +SEWPNML I+NS+N Sbjct: 688 RTIVARISNLS-KSAMRDVRALCVPYGQIRGVYIRGTGVADVFFDISEWPNMLAILNSMN 746 Query: 985 GLKVDQHKWIAQPA-TLIPVEVLQALWNKPEGQKQVHELVRNICERI 1122 G+++D K + +PA T+IP E+L+ LW P ++ V +++N+ I Sbjct: 747 GMEIDGKKLVVRPATTVIPPEILRVLWKDPREKRYVKSVIQNLVREI 793 >ref|NP_001118682.1| RNA recognition motif-containing protein [Arabidopsis thaliana] gi|332643237|gb|AEE76758.1| RNA recognition motif-containing protein [Arabidopsis thaliana] Length = 771 Score = 149 bits (375), Expect = 2e-33 Identities = 90/262 (34%), Positives = 145/262 (55%), Gaps = 2/262 (0%) Frame = +1 Query: 343 GPISEICILPSRENR-FNYAQVFFKTKEGLKKALSKTDVAVGGADVAMEAAITPSEICDR 519 G + + +PS E + A + F+T +KKAL K V V + +EA + ++ +R Sbjct: 494 GAVLHVQEIPSIEGCIYKDALLTFETNTAVKKALKKGHVTVMNYNTVVEAT-SQEDMVER 552 Query: 520 MFCTERVNNAYSPRHFLKHPSRTVVLDGLIPDDLSNYDLKCALSTWGRITSIVNGTSYHT 699 + + + + P +K P+RTV + L D SN + I+ G+S Sbjct: 553 ICIPDLIGDPDVPVALVKEPARTVKIHPLTHDFSSNQIKEALKFCRSNISKFTLGSSRTD 612 Query: 700 VFVEFESEKSKERALAKATFSISGRTLSILRVDAPKTTIIRISNVNPASGAIEVHSICDS 879 FVEFE+E KERALA+ + SI L I R+D P+T + RISN++ S +V ++C Sbjct: 613 AFVEFETEDGKERALAEHSISICNTQLFISRIDIPRTIVARISNLS-KSAMRDVRALCVP 671 Query: 880 YGKVKKVTERYIDTFDIHFKLSEWPNMLKIINSLNGLKVDQHKWIAQPA-TLIPVEVLQA 1056 YG+++ V R D+ F +SEWPNML I+NS+NG+++D K + +PA T+IP E+L+ Sbjct: 672 YGQIRGVYIRGTGVADVFFDISEWPNMLAILNSMNGMEIDGKKLVVRPATTVIPPEILRV 731 Query: 1057 LWNKPEGQKQVHELVRNICERI 1122 LW P ++ V +++N+ I Sbjct: 732 LWKDPREKRYVKSVIQNLVREI 753 >gb|ABF59161.1| unknown protein [Arabidopsis thaliana] Length = 226 Score = 124 bits (312), Expect = 3e-26 Identities = 68/176 (38%), Positives = 104/176 (59%), Gaps = 1/176 (0%) Frame = +1 Query: 598 DGLIPDDLSNYDLKCALSTWGRITSIVNGTSYHTVFVEFESEKSKERALAKATFSISGRT 777 DG + LSN + I+ G+S FVEFE+E KERALA+ + SI Sbjct: 34 DGRKCERLSNQIKEALKFCRSNISKFTLGSSRTDAFVEFETEDGKERALAEHSISICNTQ 93 Query: 778 LSILRVDAPKTTIIRISNVNPASGAIEVHSICDSYGKVKKVTERYIDTFDIHFKLSEWPN 957 L I R+D P+T + RISN++ S +V ++C YG+++ V R D+ F +SEWPN Sbjct: 94 LFISRIDIPRTIVARISNLSK-SAMRDVRALCVPYGQIRGVYIRGTGVADVFFDISEWPN 152 Query: 958 MLKIINSLNGLKVDQHKWIAQPA-TLIPVEVLQALWNKPEGQKQVHELVRNICERI 1122 ML I+NS+NG+++D K + +PA T+IP E+L+ LW P ++ V +++N+ I Sbjct: 153 MLAILNSMNGMEIDGKKLVVRPATTVIPPEILRVLWKDPREKRYVKSVIQNLVREI 208