BLASTX nr result
ID: Catharanthus23_contig00031126
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00031126 (554 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_004231258.1| PREDICTED: uncharacterized protein LOC101260...   101   1e-19
gb|EXB99429.1| hypothetical protein L484_016405 [Morus notabilis]      89   9e-16
ref|XP_002530358.1| conserved hypothetical protein [Ricinus comm...    87   2e-15
ref|XP_004301624.1| PREDICTED: uncharacterized protein LOC101305...    78   2e-12
gb|EMJ20406.1| hypothetical protein PRUPE_ppa017292mg [Prunus pe...    77   4e-12
ref|XP_002317716.1| hypothetical protein POPTR_0012s03820g [Popu...    75   1e-11
ref|XP_006431682.1| hypothetical protein CICLE_v10000213mg [Citr...    74   2e-11
ref|XP_004145472.1| PREDICTED: uncharacterized protein LOC101205...    72   1e-10
ref|XP_006471160.1| PREDICTED: uncharacterized protein LOC102613...    71   2e-10
gb|EOY07249.1| TATA box-binding protein-associated factor RNA po...    67   2e-09
gb|EPS74338.1| hypothetical protein M569_00424 [Genlisea aurea]        62   7e-08
ref|XP_004495159.1| PREDICTED: uncharacterized protein LOC101491...    60   3e-07
ref|XP_004166877.1| PREDICTED: uncharacterized LOC101205354 [Cuc...
56 5e-06 >ref|XP_004231258.1| PREDICTED: uncharacterized protein LOC101260775 [Solanum lycopersicum] Length = 907 Score = 101 bits (251), Expect = 1e-19 Identities = 70/187 (37%), Positives = 92/187 (49%), Gaps = 5/187 (2%) Frame = +3 Query: 3 DFSEEWKSRWPIGSVFSAPLLLRADEDGQQLPPPSKRRRRSGXXXXXXXXXNGGKTLEGL 182 D S++WK+ W I S FS+PLLL + SKRRR + Sbjct: 2 DSSDKWKALWKIWSSFSSPLLLSNSHEESS----SKRRRIDSP----------------I 41 Query: 183 GPLVFIPRSETLIELYSSPDLASRLPPPYPEISLARFLHXXXXXXXXXXXXCIASEFGPQ 362 GPL+F P ETL L SP L++R+P P P++SL RFL IA+EF PQ Sbjct: 42 GPLIFRPCEETLTPLLRSPLLSTRIPSPVPDLSLPRFLQ-TSSGMLFSTASSIATEFSPQ 100 Query: 363 LSQTVGDHDNFRSSYGHDFNCLQLLPC-----SEESTSLLLFFPTGENCDQVGYVMLRLE 527 +S T+ H+FN +Q LP + + S++ PTGEN DQVG ML E Sbjct: 101 VSDTI-----------HNFNSIQFLPLPNFGENSKPNSIIGISPTGENYDQVGLFMLCSE 149 Query: 528 DSQFSIK 548 D+QF K Sbjct: 150 DTQFVAK 156 >gb|EXB99429.1| hypothetical protein L484_016405 [Morus notabilis] Length = 1000 Score = 88.6 bits (218), Expect = 9e-16 Identities = 67/188 (35%), Positives = 93/188 (49%), Gaps = 4/188 (2%) Frame = +3 Query: 3 DFSEEWKSRWPIGSVFSAPLLLRADEDGQQLPPPSKRRRRSGXXXXXXXXXNGGKTLEGL 182 +FSEEWKS +PI +VF +PLLL PS R L Sbjct: 2 NFSEEWKSLFPISAVFKSPLLLSG---------PSARTI--------------------L 32 Query: 183 GPLVFIPRSETLIELYSSPDLASRLPP--PYPEISLARFL--HXXXXXXXXXXXXCIASE 350 GPLVF P+ T+ L+SSP L LPP P P +S RFL IAS Sbjct: 33 GPLVFNPKESTITCLFSSPSL---LPPFTPLPRLSFPRFLLTSSDDSSQLPSTSSSIASV 89 Query: 351 FGPQLSQTVGDHDNFRSSYGHDFNCLQLLPCSEESTSLLLFFPTGENCDQVGYVMLRLED 530 FGP Q D+ S++ H N LQLL C + ++FFPTG+N +QVG+++L +++ Sbjct: 90 FGPHHYQ-----DDVASAFSH--NRLQLLHC-PRTDKFIVFFPTGDNANQVGFMLLSIKN 141 Query: 531 SQFSIKIN 554 S ++++ Sbjct: 142 SCLDVRVD 149 >ref|XP_002530358.1| conserved hypothetical protein [Ricinus communis] gi|223530105|gb|EEF32019.1| conserved hypothetical protein [Ricinus communis] Length = 912 Score = 87.4 bits (215), Expect = 2e-15 Identities = 68/181 (37%), Positives = 84/181 (46%), Gaps = 3/181 (1%) Frame = +3 Query: 3 
DFSEEWKSRWPIGSVFSAPLLLRADEDGQQLPPPSKRRRRSGXXXXXXXXXNGGKTLEGL 182 D SEEWKS +PIGSVF APLLL + P SK L Sbjct: 2 DLSEEWKSLFPIGSVFDAPLLLSS--------PTSKSI---------------------L 32 Query: 183 GPLVFIPRSETLIELYSSPDLASRLPPPYPEISLARFL---HXXXXXXXXXXXXCIASEF 353 GPL F P +TL +LY SP L L P P +SL+RFL I S Sbjct: 33 GPLFFNPNRKTLTQLYKSPSLFPPLLNPPPRLSLSRFLTTSTTFDSPIPLSTASSITSRL 92 Query: 354 GPQLSQTVGDHDNFRSSYGHDFNCLQLLPCSEESTSLLLFFPTGENCDQVGYVMLRLEDS 533 G Q HDN S H N LQ L C ++ S+++FF TG N DQVG+++L + D Sbjct: 93 GSQF------HDNSASLLAH--NQLQFLNCPHDN-SVIVFFSTGCNHDQVGFLLLSVNDK 143 Query: 534 Q 536 + Sbjct: 144 R 144 >ref|XP_004301624.1| PREDICTED: uncharacterized protein LOC101305856 [Fragaria vesca subsp. vesca] Length = 914 Score = 77.8 bits (190), Expect = 2e-12 Identities = 64/182 (35%), Positives = 82/182 (45%), Gaps = 2/182 (1%) Frame = +3 Query: 12 EEWKSRWPIGSVFSAPLLLRADEDGQQLPPPSKRRRRSGXXXXXXXXXNGGKTLEGLGPL 191 EEWKS +PI SVF PLL+ PS LGPL Sbjct: 6 EEWKSLFPISSVFKPPLLISN---------PSI-----------------------LGPL 33 Query: 192 VFIPRSETLIELYSSPDLASRLPP--PYPEISLARFLHXXXXXXXXXXXXCIASEFGPQL 365 +F P++ + L+SSP L LPP P P +SL RFL +S P L Sbjct: 34 IFNPKANSTTLLFSSPTL---LPPLTPLPHLSLPRFLSTSSPESAPLPST--SSSIAPFL 88 Query: 366 SQTVGDHDNFRSSYGHDFNCLQLLPCSEESTSLLLFFPTGENCDQVGYVMLRLEDSQFSI 545 G H N L+ L C + +T +L+FFPTGEN DQVG + L L+DS F + Sbjct: 89 ----GPHQYKNDLLSSFRNRLEFLQCPKTNT-ILIFFPTGENSDQVGLLELVLKDSTFDV 143 Query: 546 KI 551 K+ Sbjct: 144 KV 145 >gb|EMJ20406.1| hypothetical protein PRUPE_ppa017292mg [Prunus persica] Length = 925 Score = 76.6 bits (187), Expect = 4e-12 Identities = 60/184 (32%), Positives = 81/184 (44%), Gaps = 2/184 (1%) Frame = +3 Query: 9 SEEWKSRWPIGSVFSAPLLLRADEDGQQLPPPSKRRRRSGXXXXXXXXXNGGKTLEGLGP 188 +EEWKS +PI SVF PLLL L P LGP Sbjct: 8 TEEWKSLFPISSVFKPPLLL----SNPSLKPI-------------------------LGP 38 Query: 189 LVFIPRSETLIELYSSPDLASRLPPPYPEISLARFL--HXXXXXXXXXXXXCIASEFGPQ 362 L+F P+ + L+SS PP P +SL RFL +AS GP Sbjct: 39 LIFNPKPNSTTLLFSSSSSLLAPLPPLPHLSLPRFLLTSPSDSAPLPSSVPSVASFLGPH 98 
Query: 363 LSQTVGDHDNFRSSYGHDFNCLQLLPCSEESTSLLLFFPTGENCDQVGYVMLRLEDSQFS 542 H S +N L+ L C + +T +++FFPTGEN DQVG++ L L+ S F Sbjct: 99 -------HPKSDVSSSLLYNRLEFLQCPQINT-VVVFFPTGENSDQVGFLQLVLKGSTFD 150 Query: 543 IKIN 554 +K++ Sbjct: 151 VKVD 154 >ref|XP_002317716.1| hypothetical protein POPTR_0012s03820g [Populus trichocarpa] gi|222858389|gb|EEE95936.1| hypothetical protein POPTR_0012s03820g [Populus trichocarpa] Length = 906 Score = 75.1 bits (183), Expect = 1e-11 Identities = 61/179 (34%), Positives = 84/179 (46%), Gaps = 3/179 (1%) Frame = +3 Query: 3 DFSEEWKSRWPIGSVFSAPLLLRADEDGQQLPPPSKRRRRSGXXXXXXXXXNGGKTLEGL 182 +FS+EWKS +PI +V APLLL SK+ S + Sbjct: 5 EFSQEWKSGFPIDTVSKAPLLL------------SKQTSESL-----------------I 35 Query: 183 GPLVFIPRSETLIELYSSPDLASRLPPPYPEISLARFLH---XXXXXXXXXXXXCIASEF 353 GPLVF P E+L L++SP L+ L P P +SL RF+ IA F Sbjct: 36 GPLVFNPIPESLAHLFTSPALSPPLLNPPPHLSLTRFISTSTLADSPLPLSTASSIAFSF 95 Query: 354 GPQLSQTVGDHDNFRSSYGHDFNCLQLLPCSEESTSLLLFFPTGENCDQVGYVMLRLED 530 GPQ D SS +N LQ L C + T +++FF TG N D+VG+++L ++D Sbjct: 96 GPQ--------DLHFSSPLLAYNRLQFLKCPHDDT-VVVFFSTGTNLDRVGFLLLSVKD 145 >ref|XP_006431682.1| hypothetical protein CICLE_v10000213mg [Citrus clementina] gi|557533804|gb|ESR44922.1| hypothetical protein CICLE_v10000213mg [Citrus clementina] Length = 910 Score = 73.9 bits (180), Expect = 2e-11 Identities = 61/183 (33%), Positives = 85/183 (46%), Gaps = 2/183 (1%) Frame = +3 Query: 3 DFSEEWKSRWPIGSVFSAPLLLRADEDGQQLPPPSKRRRRSGXXXXXXXXXNGGKTLEGL 182 DF+EE KS++PIG F P LL++ E Q Sbjct: 2 DFTEELKSQFPIGK-FLKPPLLQSSESIQ------------------------------- 29 Query: 183 GPLVFIPRSETLIELYSSPDLASR-LPPPYPEISLARFLHXXXXXXXXXXXXCIASEFGP 359 GPL F P ETL L SS L L P P ++L+RFL IAS+FG Sbjct: 30 GPLFFNPNPETLTLLSSSKTLCPHSLFSPLPRLTLSRFLSTSSSSLLPSTSTSIASQFGD 89 Query: 360 QLSQTVGDHDNFRSSYG-HDFNCLQLLPCSEESTSLLLFFPTGENCDQVGYVMLRLEDSQ 536 VG H + S D+N L+LL C +T++ FFPTG+N DQ+G++++ + S+ Sbjct: 90 -----VGTHQHPDGSLSDQDYNRLRLLYCPLNNTAIA-FFPTGDNNDQLGFLVISAKGSR 143 Query: 537 FSI 545 F + 
Sbjct: 144 FDV 146 >ref|XP_004145472.1| PREDICTED: uncharacterized protein LOC101205354 [Cucumis sativus] Length = 907 Score = 71.6 bits (174), Expect = 1e-10 Identities = 57/179 (31%), Positives = 80/179 (44%) Frame = +3 Query: 12 EEWKSRWPIGSVFSAPLLLRADEDGQQLPPPSKRRRRSGXXXXXXXXXNGGKTLEGLGPL 191 EEWKS +PIG+VF +PLL+ +G +GPL Sbjct: 4 EEWKSLFPIGTVFKSPLLI-----------------------------SGSSVKNSIGPL 34 Query: 192 VFIPRSETLIELYSSPDLASRLPPPYPEISLARFLHXXXXXXXXXXXXCIASEFGPQLSQ 371 VF P +L L+SS L L PP ++L RFL +AS FG Q Q Sbjct: 35 VFNPVPTSLTRLFSSQSLLPSLSPP-SVLNLPRFL-LTSSSVVPSTSSSVASLFGEQ--Q 90 Query: 372 TVGDHDNFRSSYGHDFNCLQLLPCSEESTSLLLFFPTGENCDQVGYVMLRLEDSQFSIK 548 D + +N LQ LPC S+S+++FFPTG N D VG++++ S ++ Sbjct: 91 CCSDPPSVLR-----YNRLQCLPCPN-SSSVVVFFPTGPNSDHVGFLVVSSNGSGLDVQ 143 >ref|XP_006471160.1| PREDICTED: uncharacterized protein LOC102613824 [Citrus sinensis] Length = 910 Score = 71.2 bits (173), Expect = 2e-10 Identities = 57/183 (31%), Positives = 83/183 (45%), Gaps = 2/183 (1%) Frame = +3 Query: 3 DFSEEWKSRWPIGSVFSAPLLLRADEDGQQLPPPSKRRRRSGXXXXXXXXXNGGKTLEGL 182 D +EE KS++PIG PLL ++ L Sbjct: 2 DLTEELKSQFPIGKFLKPPLLQSSESI--------------------------------L 29 Query: 183 GPLVFIPRSETLIELYSSPDLASR-LPPPYPEISLARFLHXXXXXXXXXXXXCIASEFGP 359 GPL F P+ ETL L SS L L P P+++L+RFL IAS+F Sbjct: 30 GPLFFNPKPETLTLLSSSKTLCPHPLFSPPPKLTLSRFLSTSSSSLLPSTSTSIASQF-- 87 Query: 360 QLSQTVGDHDNFRSSYG-HDFNCLQLLPCSEESTSLLLFFPTGENCDQVGYVMLRLEDSQ 536 VG H + S D+N L+LL C +T++ FFPTG+N DQ+G++++ + S+ Sbjct: 88 ---DDVGTHQHPNGSLSDQDYNRLRLLYCPLNNTAIA-FFPTGDNNDQLGFLVISAKGSR 143 Query: 537 FSI 545 F + Sbjct: 144 FDV 146 >gb|EOY07249.1| TATA box-binding protein-associated factor RNA polymerase I subunit C, putative [Theobroma cacao] Length = 910 Score = 67.4 bits (163), Expect = 2e-09 Identities = 55/180 (30%), Positives = 79/180 (43%), Gaps = 1/180 (0%) Frame = +3 Query: 3 DFSEEWKSRWPIGSVFSAPLLLRADEDGQQLPPPSKRRRRSGXXXXXXXXXNGGKTLEGL 182 + SEEWKS +PIG PLLL + G Sbjct: 2 
ELSEEWKSYFPIGKSLDPPLLLSSASPG-------------------------------- 29 Query: 183 GPLVFIPRSETLIE-LYSSPDLASRLPPPYPEISLARFLHXXXXXXXXXXXXCIASEFGP 359 PL FIP+ TL + L+SSP L L PP +S +RFL IAS FG Sbjct: 30 -PLFFIPKPRTLPKTLFSSPSLFPPLHPPPSRLSFSRFLSTSSVPYSASSS--IASRFGL 86 Query: 360 QLSQTVGDHDNFRSSYGHDFNCLQLLPCSEESTSLLLFFPTGENCDQVGYVMLRLEDSQF 539 + +D+ SS N L LL C +++ +++ FF TG N D++G+ + ++D+ F Sbjct: 87 E-----SFYDDAASSSFLSHNRLHLLHCPDQNIAVV-FFTTGANHDRIGFFAVHVQDNDF 140 >gb|EPS74338.1| hypothetical protein M569_00424 [Genlisea aurea] Length = 841 Score = 62.4 bits (150), Expect = 7e-08 Identities = 39/112 (34%), Positives = 57/112 (50%) Frame = +3 Query: 183 GPLVFIPRSETLIELYSSPDLASRLPPPYPEISLARFLHXXXXXXXXXXXXCIASEFGPQ 362 GPL+F P+ + L +P++A LPPPYP + L+RFL I+S GPQ Sbjct: 27 GPLIFAPKPNSSTTLIQTPEIALHLPPPYPFLPLSRFLQ--KHDCFYSSAASISSLLGPQ 84 Query: 363 LSQTVGDHDNFRSSYGHDFNCLQLLPCSEESTSLLLFFPTGENCDQVGYVML 518 + + S+ H FN LQLL + + FFP+G+N D V + +L Sbjct: 85 IPE---------YSHYHGFNTLQLLQIPNCKIA-VAFFPSGKNSDVVAFSIL 126 >ref|XP_004495159.1| PREDICTED: uncharacterized protein LOC101491542 [Cicer arietinum] Length = 185 Score = 60.5 bits (145), Expect = 3e-07 Identities = 53/178 (29%), Positives = 76/178 (42%), Gaps = 1/178 (0%) Frame = +3 Query: 3 DFSEEWKSRWPIGSVFSAPLLLRADEDGQQLPPPSKRRRRSGXXXXXXXXXNGGKTLEGL 182 +FSEEWKS +PI S +PLLL + L Sbjct: 2 EFSEEWKSLFPISSATQSPLLLTHSDSNS------------------------------L 31 Query: 183 GPLVFIPRSETLIELYSSPDLASRLPPPYPEISLARFLHXXXXXXXXXXXXCIASEFGPQ 362 GPL F P +L LYSS L L P P + RFL +AS F Sbjct: 32 GPLFFNPNPISLSLLYSSNSLFPPLHLP-PHLLTNRFLSTSDPSILPSTASTVASLF--H 88 Query: 363 LSQTVGDHDNFRSSYGHDF-NCLQLLPCSEESTSLLLFFPTGENCDQVGYVMLRLEDS 533 +++N ++ H N +QL+ + S + L+FFPTG N +++G+ ML ++DS Sbjct: 89 SPHQYNNNNNNTTNVSHFLHNRIQLIQYPD-SPNTLVFFPTGSNDEKIGFFMLGIKDS 145 >ref|XP_004166877.1| PREDICTED: uncharacterized LOC101205354 [Cucumis sativus] Length = 862 Score = 56.2 bits (134), Expect = 5e-06 Identities = 43/123 (34%), Positives = 61/123 (49%) Frame = +3 Query: 180 
LGPLVFIPRSETLIELYSSPDLASRLPPPYPEISLARFLHXXXXXXXXXXXXCIASEFGP 359 +GPLVF P +L L+SS L L PP ++L RFL +AS FG Sbjct: 26 IGPLVFNPVPTSLTRLFSSQSLLPSLSPP-SVLNLPRFL-LTSSSVVPSTSSSVASLFGE 83 Query: 360 QLSQTVGDHDNFRSSYGHDFNCLQLLPCSEESTSLLLFFPTGENCDQVGYVMLRLEDSQF 539 Q Q D + +N LQ LPC S+S+++FFPTG N D VG++++ S Sbjct: 84 Q--QCYSDPPSVLR-----YNRLQCLPCPN-SSSVVVFFPTGPNSDHVGFLVVSSNGSGL 135 Query: 540 SIK 548 ++ Sbjct: 136 DVQ 138