BLASTX nr result
ID: Catharanthus22_contig00012917
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00012917
         (1176 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|EOY29745.1| Exostosin family protein [Theobroma cacao]             86   2e-14
ref|XP_006359765.1| PREDICTED: probable glycosyltransferase At3g...   78   6e-12
gb|EMJ26353.1| hypothetical protein PRUPE_ppa002395mg [Prunus pe...   77   1e-11
ref|XP_002309546.2| hypothetical protein POPTR_0006s25530g [Popu...   71   1e-09
ref|XP_004245169.1| PREDICTED: probable glycosyltransferase At5g...   71   1e-09
ref|XP_002281263.1| PREDICTED: probable glycosyltransferase At5g...   69   4e-09
emb|CBI20855.3| unnamed protein product [Vitis vinifera]              58   7e-06

>gb|EOY29745.1| Exostosin family protein [Theobroma cacao]
          Length = 435

 Score = 86.3 bits (212), Expect = 2e-14
 Identities = 96/317 (30%), Positives = 130/317 (41%), Gaps = 29/317 (9%)
 Frame = +1

Query: 313  FLLRFPVEGRRLLCLMALVFAIVFIVQYFELPYGNVFLSLLSAGKT-KFASAG------- 468
            F L    E RRLL LMA+ FA+V  VQYFELPY  VF SL +AGK  +F + G
Sbjct: 12   FRLLCQAESRRLLLLMAITFALVLAVQYFELPYTEVFTSLFAAGKNGRFPTGGSSSKSGM 71

Query: 469  --NFTIGNXXXXXXXXXXXXXRNLTFSDSLNFSNPSAVDYMAD----------VSELSDG 612
              N T+ N              N T    LN    +A    ++          VSE + G
Sbjct: 72   VDNVTLSNGLNSTHNYADNDTENGT--AVLNIDKETAQGNESEENDRDLKNVYVSESNAG 129

Query: 613  NNSNKKEADDNSDNADPEDESPSKDLEPNHKHIMGNVGINDTLAPEKARVYDYNAPTPAV 792
            +N++     + S +  P   S S  LE     + G V      APE+    DYN P+ +
Sbjct: 130  SNNSFGLLFNGSSSDTPIAPSISSTLENGDNVVNGPV---LHAAPEQNVTQDYN-PSSSS 185

Query: 793  AATPHITVAPTSGEDEIYXXXXXXXXXXXQPLDLSPPLASPVGVRKNSGTSNLFNPNATS 972
             ++     AP                        SPPL SP  +      SN+ + NA+S
Sbjct: 186  GSSGRYFAAPA-----------------------SPPLNSPSILPDTKLRSNMSSVNASS 222

Query: 973 VNRDATKAVEKSKNSGLQMSDVSPLFNYTLAKE---------FPKMKESSLGPPGVAVSI 1125 V ++ T EK K+ +S +PL ++ K K+ S P + VSI Sbjct: 223 VGKNTTILPEKDKDPNFLIS--TPLSGNVYSENTVPAVRKNGSKKPKKKSKKQPQIFVSI 280 Query: 1126 SEMNDKLLHSLALRHPV 1176 SEMND LL S H V Sbjct: 281 SEMNDLLLQSHTSPHSV 297 >ref|XP_006359765.1| PREDICTED: probable glycosyltransferase At3g07620-like isoform X1 [Solanum tuberosum] gi|565387991|ref|XP_006359766.1| PREDICTED: probable glycosyltransferase At3g07620-like isoform X2 [Solanum tuberosum] gi|565387993|ref|XP_006359767.1| PREDICTED: probable glycosyltransferase At3g07620-like isoform X3 [Solanum tuberosum] Length = 669 Score = 78.2 bits (191), Expect = 6e-12 Identities = 82/290 (28%), Positives = 128/290 (44%), Gaps = 22/290 (7%) Frame = +1 Query: 334 EGRRLLCLMALVFAIVFIVQYFELPYGNVFLSLLSAGKTKFASA------GNFTIGNXXX 495 E RRL+ L+ +VF + ++QYF PYG SL +A + +S+ GNF+ + Sbjct: 12 ETRRLVSLLGVVFGLALMIQYFGFPYGYALSSLFTANGGQISSSQRVDQSGNFSRSD--- 68 Query: 496 XXXXXXXXXXRNLTFSDSLNFSNPSAVDYM----ADVSELSDGNNS--NKKEADDNSDNA 657 NL +N +N + ++ A+ E+ DG+ N++ D +++ Sbjct: 69 -----------NLKHGSVVNATNTNLINETKLSDANDEEVEDGSMPPMNERSGDTLTEDV 117 Query: 658 DPEDESPSKDLEPNHKHIMGNVGINDTLAPEKAR-VYDYNAPTPAVAATPHITVAPTSGE 834 DPEDESP KD + ++K + ++G N +L P+KA D + + + + + V T G Sbjct: 118 DPEDESPFKDSKLDNKSNVESLGRNSSLPPDKAADSEDDLQASNSTSESSLLRVVDTDGG 177 Query: 835 DEIYXXXXXXXXXXXQPLDLS---PPLASPVGVRKNSGTSNLFNPNATSVNRDATKAVEK 1005 I P LS PPL V NL + EK Sbjct: 178 GSISPAPTEAKLLEISPTALSIAPPPLVVTPQV-------NLDAKKEAPLISSYQNISEK 230 Query: 1006 SKNSG-LQMSDVSPLFNY-----TLAKEFPKMKESSLGPPGVAVSISEMN 1137 N+G L SD P+ T + +FP+MKES+ P VSI+EMN Sbjct: 231 EGNTGHLLESDNLPVQKRTDHAPTASHKFPEMKESN-KPIDSVVSIAEMN 279 >gb|EMJ26353.1| hypothetical protein PRUPE_ppa002395mg [Prunus persica] Length = 678 Score = 77.4 bits (189), Expect = 1e-11 Identities = 74/286 (25%), Positives = 118/286 (41%), Gaps = 10/286 (3%) Frame = +1 Query: 334 EGRRLLCLMALVFAIVFIVQYFELPYGNVFLSLLSAGKTKFASAGNFTIGNXXXXXXXXX 513 E RRLL + ++FA++ +V++ ELPYGN+ 
S+LS+ K F G Sbjct: 12 ETRRLLWIAGMLFAVILVVRHLELPYGNLLSSILSSTKVPLVGKSGFQAGYSPSNSEIVG 71 Query: 514 XXXXRNLTFSDSLNFSNPSAVDYMADVSELSD----GNNSNKKEADDNSDNADPEDESPS 681 NL+ S+ LN + A+ A + SD G+ + + + N D D +D S Sbjct: 72 -----NLSLSNDLNNTGTYAIHEKASNTRSSDSVLEGHEGSNRALEINEDEDDGKDASSG 126 Query: 682 KDLEPNHKHIMGNVGINDT-LAPEKARVYDYNAPTPAVAATPHITVAPTSGE----DEIY 846 ++ N I+ N+ +T A E R + ++ E D + Sbjct: 127 NLVKQNRTIIVENIKPLETNFAQEGGREPEVSSVEKKNTTDNTYLEGRIGNENNTVDVVN 186 Query: 847 XXXXXXXXXXXQPLDLSPPLASPVGVRKNSGTS-NLFNPNATSVNRDATKAVEKSKNSGL 1023 P+ S P +P N G + N TSV +D T EK++NS Sbjct: 187 STAGLPVSSPAPPMMNSSPSTAPAIFETNVGAPIKSVDSNVTSVEKDRTTPSEKTENSEQ 246 Query: 1024 QMSDVSPLFNYTLAKEFPKMKESSLGPPGVAVSISEMNDKLLHSLA 1161 SD++ + + P++K P SIS+MN+ LL S A Sbjct: 247 LHSDLNQTEHNSSMTRVPEVKIEPEVPILDVYSISDMNNLLLQSRA 292 >ref|XP_002309546.2| hypothetical protein POPTR_0006s25530g [Populus trichocarpa] gi|550337071|gb|EEE93069.2| hypothetical protein POPTR_0006s25530g [Populus trichocarpa] Length = 663 Score = 70.9 bits (172), Expect = 1e-09 Identities = 73/298 (24%), Positives = 122/298 (40%), Gaps = 22/298 (7%) Frame = +1 Query: 334 EGRRLLCLMALVFAIVFIVQYFELPYGNVFLSLLSAGKTKFASAGNFTIGNXXXXXXXXX 513 + RRLL L+ AIV +VQY E P V +SL SA T+ + N + G+ Sbjct: 12 KARRLLFLVGATVAIVIVVQYLEFPSSRVLVSLFSAVNTRSFMSRNSSTGSEALG----- 66 Query: 514 XXXXRNLTFSDSLNFSNPSAVDYMADVSELSDG-------NNSNKKEADDNSDNADPEDE 672 N+T S+ LN +N + D E SD + S +KE N+ N Sbjct: 67 -----NMTLSNGLNTTNTGILHETTDSDEASDDKKETAEVSKSEEKEGSPNNSNGSERKR 121 Query: 673 SPSKDLEPNHKHIMGNVGINDTLA--PEKARVYDYNAPTPAVAATPHIT-------VAPT 825 S+ ++ N +D LA + + + N A P + +AP Sbjct: 122 GSSESF-----GLVSNETTSDDLANQDKNSTLNTINGSEEEKAMAPDASYINVDKDIAPI 176 Query: 826 SGEDEIYXXXXXXXXXXXQPLDLSPPLASPVGVR-----KNSGTSNLFNPNATSVNRDAT 990 SG ++ +PP+ + + +NS + N TS+ +D Sbjct: 177 SGRNKSSDADPGYP-------SSAPPMMNTFSNKTFSTDENSSPMIFESSNTTSMRKDTA 229 Query: 991 KAVEKSKNSGLQMSDVSPLFNYTLAKEFPKMK-ESSLGPPGVAVSISEMNDKLLHSLA 1161 A+++ +NSGL ++ S + + + + K ++S PP +SI +MN+ L S A Sbjct: 230 
GALKRDENSGLLPNNYSMSTSGSFSSKVTAAKRKTSKKPPSRVISIHQMNELLRQSHA 287 >ref|XP_004245169.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform 1 [Solanum lycopersicum] gi|460399281|ref|XP_004245170.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform 2 [Solanum lycopersicum] Length = 647 Score = 70.9 bits (172), Expect = 1e-09 Identities = 79/284 (27%), Positives = 113/284 (39%), Gaps = 16/284 (5%) Frame = +1 Query: 334 EGRRLLCLMALVFAIVFIVQYFELPYGNVFLSLLSAGKTKFASAGNFTIGNXXXXXXXXX 513 E RRL+CL+ ++F + ++QYF PYG S+ +A + +S+ Sbjct: 12 ETRRLVCLLGVIFGLALMIQYFGFPYGYALSSIFTANGGQISSS---------------Q 56 Query: 514 XXXXRNLTFSDSLNFSNPSAVDYMADVSELSDGN----NSNKKEADDNSDNADPEDESPS 681 FSD+ N E+ DG+ N + D +++ DPEDESP Sbjct: 57 RVDQSGTKFSDANN-------------EEVEDGSMPPMNERSGDGDTLTEDIDPEDESPF 103 Query: 682 KDLEPNHKHIMGNVGINDTLAPEKARVYDYNAPTPAVAATPHIT---VAPTSGEDEIYXX 852 KD + ++K + +G N +L PEKA D A T + V T G I Sbjct: 104 KDSKLDNKSNVETLGRNSSLPPEKA--VDSENDLQASNGTSESSLSRVVDTDGGGSISPA 161 Query: 853 XXXXXXXXXQPLDLS---PPLASPVGVRKNSGTSNLFNPNATSVNRDATKAVEKSKNSG- 1020 P LS PPL V NL + EK N+G Sbjct: 162 PMEAKSWEISPTVLSIAPPPLVVTPQV-------NLDAKKEAPLITSYQNVSEKEGNTGH 214 Query: 1021 LQMSDVSPLFNY-----TLAKEFPKMKESSLGPPGVAVSISEMN 1137 L+ SD P+ + T+ + P MKES P VSI+EMN Sbjct: 215 LRESDNLPVQKHTDHAPTVGHKIPVMKESD-KPIDSVVSIAEMN 257 >ref|XP_002281263.1| PREDICTED: probable glycosyltransferase At5g03795 [Vitis vinifera] Length = 675 Score = 68.9 bits (167), Expect = 4e-09 Identities = 79/310 (25%), Positives = 129/310 (41%), Gaps = 23/310 (7%) Frame = +1 Query: 301 MVNKFLLRFPVEGRRLLCLMALVFAIVFIVQYFELPYGNVFLSLLSA------GKTKFAS 462 M +KF + VE R LL L+ VF++VF+VQYFELPYG+V SL SA GKT S Sbjct: 1 MGHKFRYLWQVEARHLLWLIGTVFSVVFVVQYFELPYGDVLSSLFSAGDIPAPGKTSLPS 60 Query: 463 AGNFTIGNXXXXXXXXXXXXXRNLTFSDSLNFSNPSAVDYMADVSELSDGNNSNKKE--- 633 + + + N+T + LN S+ A+ + +E +GNN K Sbjct: 61 SDSLS-----------KLGTMGNMTTAQGLNSSDVHAMHGIDSNAETMEGNNEGPKNDFA 109 Query: 634 
-----ADDNSDNADPEDESPSKDLEPNHKHIMGNVGINDTLAPEKARVYDYNAPTPAVAA 798 A D S D ++++ + + N GN + ++ +Y N + ++ Sbjct: 110 SVMNGALDKSFGLDEDNKNVTVEKVNN----SGNRSALKNASKHESSLYLENITADSNSS 165 Query: 799 TPHITVAPTSGEDEIYXXXXXXXXXXXQPLDLSPPLASPVGVRKNSGTSNLFN------- 957 I ED++ + L PL + + +S T++L N Sbjct: 166 LGKIQ------EDDM---ALLSQRSERSGVGLISPLPALPQIISSSNTTSLTNLDPHPIT 216 Query: 958 --PNATSVNRDATKAVEKSKNSGLQMSDVSPLFNYTLAKEFPKMKESSLGPPGVAVSISE 1131 P +SV DA + K + + D++ + + P ++ P +ISE Sbjct: 217 LPPERSSVEEDAAHTLNKDEKAETSQKDLT--LSNRSSISVPALETRPELP--AVTTISE 272 Query: 1132 MNDKLLHSLA 1161 MND L+ S A Sbjct: 273 MNDLLVQSRA 282 >emb|CBI20855.3| unnamed protein product [Vitis vinifera] Length = 618 Score = 58.2 bits (139), Expect = 7e-06 Identities = 32/64 (50%), Positives = 41/64 (64%), Gaps = 6/64 (9%) Frame = +1 Query: 301 MVNKFLLRFPVEGRRLLCLMALVFAIVFIVQYFELPYGNVFLSLLSA------GKTKFAS 462 M +KF + VE R LL L+ VF++VF+VQYFELPYG+V SL SA GKT S Sbjct: 1 MGHKFRYLWQVEARHLLWLIGTVFSVVFVVQYFELPYGDVLSSLFSAGDIPAPGKTSLPS 60 Query: 463 AGNF 474 + +F Sbjct: 61 SDSF 64