BLASTX nr result
ID: Cephaelis21_contig00006819
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cephaelis21_contig00006819 (1677 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002510442.1| glycosyltransferase, putative [Ricinus commu... 621 e-175 ref|XP_003527946.1| PREDICTED: uncharacterized protein LOC100791... 620 e-175 ref|XP_002327680.1| predicted protein [Populus trichocarpa] gi|2... 611 e-172 ref|XP_002281084.1| PREDICTED: uncharacterized protein LOC100257... 610 e-172 ref|NP_177675.1| UDP-glycosyltransferase-like protein [Arabidops... 610 e-172 >ref|XP_002510442.1| glycosyltransferase, putative [Ricinus communis] gi|223551143|gb|EEF52629.1| glycosyltransferase, putative [Ricinus communis] Length = 477 Score = 621 bits (1601), Expect = e-175 Identities = 318/437 (72%), Positives = 362/437 (82%), Gaps = 5/437 (1%) Frame = +3 Query: 159 CDCPQEVVTTAAAAE-KRFSK----ASPLITVAASRIRPLSFMXXXXXXXXXXXXXXXGG 323 C + TTA A RF + + P I + + PL FM GG Sbjct: 41 CHSSSNITTTATTANVDRFGEPKVDSKPQIHSSVAP-NPLDFMKSKLVLLVSHELSLSGG 99 Query: 324 PLLLMELAFLLRGVGADVRWITNQRPSEADNVIYSLEHKMLDRGVQVISAKGQEAVNTAL 503 PLLLMELAFLLRGVGA+V WITNQ+P+E D VIYSLE+KMLDRGVQV SAKGQ+A++TAL Sbjct: 100 PLLLMELAFLLRGVGAEVVWITNQKPTETDEVIYSLENKMLDRGVQVFSAKGQKAIDTAL 159 Query: 504 RADLVVLNTAVAGKWLDSVLKEKVSFVLPKTLWWIHEMRGHYFRLEYVKHLPFVAGAMID 683 +ADLVVLNTAVAGKWLD+ LKE V VLPK LWWIHEMRGHYF+LEYVKHLPFVAGAMID Sbjct: 160 KADLVVLNTAVAGKWLDATLKESVQQVLPKVLWWIHEMRGHYFKLEYVKHLPFVAGAMID 219 Query: 684 SNVTAEYWKNRTEARLRFKMPKTYVVHLGNSKELMEVAEDSVARRVLREHVRESLGVQSE 863 S+ TAEYWKNRT RL KMP+TYVVHLGNSK+LMEVAEDSVA+RVL EHVRESLGV+++ Sbjct: 220 SHTTAEYWKNRTRERLGIKMPETYVVHLGNSKDLMEVAEDSVAKRVLCEHVRESLGVRND 279 Query: 864 DILFAIINSVSRGKGQDLFLQAFYESLMHIKRQKLQVPPIHAAIVGSDVNAQSKFESELR 1043 D+LFAIINSVSRGKGQDLFL++FYESL I+ +KL+VP +HA +VGSD+NAQ+KFE ELR Sbjct: 280 DLLFAIINSVSRGKGQDLFLRSFYESLQLIQEKKLKVPSLHAVVVGSDMNAQTKFEMELR 339 Query: 1044 AFVESQKIQGQVHFVNKTLAVAPYLAAIDVLVQNSQARGECFGRITIEAMAFQLPVLGTA 1223 FV+ +KIQ +VHFVNKTL VAPYLA+IDVLVQNSQARGECFGRITIEAMAFQLPVLGTA Sbjct: 340 KFVQEKKIQDRVHFVNKTLTVAPYLASIDVLVQNSQARGECFGRITIEAMAFQLPVLGTA 399 Query: 1224 AGGTQEIVLNGTTGLLHPVGKQGVISLSRNMVKLATHVERRLTMGKIGYERVKERFLERH 1403 AGGT EIV+NGTTGLLHP GK+GV L+ N+VKLATHVERRLTMGK GY+RVKERFLE H Sbjct: 400 AGGTMEIVVNGTTGLLHPAGKEGVTPLANNIVKLATHVERRLTMGKNGYKRVKERFLEHH 459 Query: 1404 MEHRVAAVLKDVLQNAK 1454 M HR+A VLK+VL+ AK Sbjct: 460 MSHRIALVLKEVLRKAK 476 >ref|XP_003527946.1| PREDICTED: uncharacterized protein LOC100791337 [Glycine max] Length = 464 Score = 620 bits (1599), Expect = e-175 Identities = 312/408 (76%), Positives = 357/408 (87%) Frame = +3 Query: 231 ITVAASRIRPLSFMXXXXXXXXXXXXXXXGGPLLLMELAFLLRGVGADVRWITNQRPSEA 410 +T AAS PL FM GGPLLLMELAFLLRGVG+DV WI+NQ+PSE Sbjct: 58 LTNAASS--PLIFMKSKLVLMVSHELSLSGGPLLLMELAFLLRGVGSDVVWISNQKPSEH 115 Query: 411 DNVIYSLEHKMLDRGVQVISAKGQEAVNTALRADLVVLNTAVAGKWLDSVLKEKVSFVLP 590 D V+YSLE KMLDRGVQV+SAKG+ A++TAL+AD+V+LNTAVAGKWLD++LKEKV+ VLP Sbjct: 116 DRVVYSLESKMLDRGVQVLSAKGENAIDTALKADMVILNTAVAGKWLDAILKEKVAHVLP 175 Query: 591 KTLWWIHEMRGHYFRLEYVKHLPFVAGAMIDSNVTAEYWKNRTEARLRFKMPKTYVVHLG 770 K LWWIHEMRGHYF++EYVKHLPFVAGAMIDS+ TAEYWKNRT RL +MP+TYVVHLG Sbjct: 176 KVLWWIHEMRGHYFKVEYVKHLPFVAGAMIDSHTTAEYWKNRTRERLGIEMPETYVVHLG 235 Query: 771 NSKELMEVAEDSVARRVLREHVRESLGVQSEDILFAIINSVSRGKGQDLFLQAFYESLMH 950 NSKELMEVAEDSVA+RVLREHVRESLGV+++D+LFAIINSVSRGKGQDLFL++FYESL Sbjct: 236 NSKELMEVAEDSVAKRVLREHVRESLGVRNDDLLFAIINSVSRGKGQDLFLRSFYESLQL 295 Query: 951 IKRQKLQVPPIHAAIVGSDVNAQSKFESELRAFVESQKIQGQVHFVNKTLAVAPYLAAID 1130 I+ +KLQ+P +HA IVGSD+NAQ+KFE ELR FV +KIQ +VHFVNKTLAVAPYLAAID Sbjct: 296 IQEKKLQLPFLHAVIVGSDMNAQTKFEMELRKFVVEKKIQNRVHFVNKTLAVAPYLAAID 355 Query: 1131 VLVQNSQARGECFGRITIEAMAFQLPVLGTAAGGTQEIVLNGTTGLLHPVGKQGVISLSR 1310 VLVQNSQARGECFGRITIEAMAF+LPVLGTAAGGT EIV+NGTTGLLHPVGK+GV L++ Sbjct: 356 VLVQNSQARGECFGRITIEAMAFRLPVLGTAAGGTMEIVVNGTTGLLHPVGKEGVTPLAK 415 Query: 1311 NMVKLATHVERRLTMGKIGYERVKERFLERHMEHRVAAVLKDVLQNAK 1454 N+VKLA+HVE+RLTMGK GYERVKERFLE HM R+A VLK+VLQ AK Sbjct: 416 NIVKLASHVEKRLTMGKKGYERVKERFLEHHMSQRIALVLKEVLQKAK 463 >ref|XP_002327680.1| predicted protein [Populus trichocarpa] gi|222836765|gb|EEE75158.1| predicted protein [Populus trichocarpa] Length = 481 Score = 611 bits (1576), Expect = e-172 Identities = 315/432 (72%), Positives = 357/432 (82%) Frame = +3 Query: 159 CDCPQEVVTTAAAAEKRFSKASPLITVAASRIRPLSFMXXXXXXXXXXXXXXXGGPLLLM 338 CD P AA+ K S I A S PLSFM GGPLLLM Sbjct: 48 CDPPHPHNFDVAASNKPAKVFSNSIKTAPS---PLSFMKSKLVLLVSHELSLSGGPLLLM 104 Query: 339 ELAFLLRGVGADVRWITNQRPSEADNVIYSLEHKMLDRGVQVISAKGQEAVNTALRADLV 518 ELAFLLR VG +V WIT Q+PSE D V+YSLE KML RGVQV+SAKGQEA++TA +ADLV Sbjct: 105 ELAFLLRSVGTEVFWITIQKPSETDEVVYSLEQKMLVRGVQVLSAKGQEAIDTAFKADLV 164 Query: 519 VLNTAVAGKWLDSVLKEKVSFVLPKTLWWIHEMRGHYFRLEYVKHLPFVAGAMIDSNVTA 698 VLNTAVAGKWLD+VLKE V VLPK LWWIHEMRGHYF+L+YVKHLP V GAMIDS+VTA Sbjct: 165 VLNTAVAGKWLDAVLKENVPRVLPKVLWWIHEMRGHYFKLDYVKHLPLVGGAMIDSHVTA 224 Query: 699 EYWKNRTEARLRFKMPKTYVVHLGNSKELMEVAEDSVARRVLREHVRESLGVQSEDILFA 878 EYWKNRT+ RLR KMP+TYVVHLGNSKELMEVAEDSVA+RVLREH+RESLGV+ EDILFA Sbjct: 225 EYWKNRTQERLRIKMPETYVVHLGNSKELMEVAEDSVAKRVLREHIRESLGVRDEDILFA 284 Query: 879 IINSVSRGKGQDLFLQAFYESLMHIKRQKLQVPPIHAAIVGSDVNAQSKFESELRAFVES 1058 IINSVSRGKGQDLFL++FYESL I+ +KL+VP +HA IVGSD++AQ+KFE+ELR +V Sbjct: 285 IINSVSRGKGQDLFLRSFYESLQIIQVKKLKVPSMHAVIVGSDMSAQTKFETELRNYVMQ 344 Query: 1059 QKIQGQVHFVNKTLAVAPYLAAIDVLVQNSQARGECFGRITIEAMAFQLPVLGTAAGGTQ 1238 + IQ +VHF+NKTL VAPYLAAIDVLVQNSQARGECFGRITIEAMAFQLPVLGTAAGGT Sbjct: 345 KNIQDRVHFINKTLTVAPYLAAIDVLVQNSQARGECFGRITIEAMAFQLPVLGTAAGGTT 404 Query: 1239 EIVLNGTTGLLHPVGKQGVISLSRNMVKLATHVERRLTMGKIGYERVKERFLERHMEHRV 1418 EIV+NGTTGLLH VGK+GV L++N+VKLATHVERRLTMGK GYERV+E FLE HM HR+ Sbjct: 405 EIVVNGTTGLLHSVGKEGVTPLAKNIVKLATHVERRLTMGKRGYERVREMFLEHHMAHRI 464 Query: 1419 AAVLKDVLQNAK 1454 A+VLK+VL+ +K Sbjct: 465 ASVLKEVLRKSK 476 >ref|XP_002281084.1| PREDICTED: uncharacterized protein LOC100257473 [Vitis vinifera] Length = 479 Score = 610 bits (1573), Expect = e-172 Identities = 310/435 (71%), Positives = 359/435 (82%), Gaps = 3/435 (0%) Frame = +3 Query: 159 CDCPQEVVTTAAAAEKRFSKASPLITVAA---SRIRPLSFMXXXXXXXXXXXXXXXGGPL 329 C+ TT +S + I V + + PL FM GGPL Sbjct: 40 CNTNSVTTTTTITTTPHYSYENTRIQVTSQVETPSNPLRFMKSKLVLLVSHELSLSGGPL 99 Query: 330 LLMELAFLLRGVGADVRWITNQRPSEADNVIYSLEHKMLDRGVQVISAKGQEAVNTALRA 509 LLMELAFLLRGVGA+V W+T Q+P+++D VIYSLEH+MLDRGV+V AKGQEA++TAL+A Sbjct: 100 LLMELAFLLRGVGAEVVWLTIQKPTDSDEVIYSLEHRMLDRGVKVFPAKGQEAIDTALKA 159 Query: 510 DLVVLNTAVAGKWLDSVLKEKVSFVLPKTLWWIHEMRGHYFRLEYVKHLPFVAGAMIDSN 689 DLVVLNTAVAGKWLDSV+KE V +LPK LWWIHEMRGHYF+LEYVKHLP+VAGAMIDS+ Sbjct: 160 DLVVLNTAVAGKWLDSVVKENVPRILPKVLWWIHEMRGHYFKLEYVKHLPYVAGAMIDSH 219 Query: 690 VTAEYWKNRTEARLRFKMPKTYVVHLGNSKELMEVAEDSVARRVLREHVRESLGVQSEDI 869 TAEYWKNRT RL KMP+TYVVHLGNSKELME+AE++VA+RVLREHVRESLGV++ED+ Sbjct: 220 TTAEYWKNRTRERLGIKMPETYVVHLGNSKELMEIAENNVAKRVLREHVRESLGVRNEDL 279 Query: 870 LFAIINSVSRGKGQDLFLQAFYESLMHIKRQKLQVPPIHAAIVGSDVNAQSKFESELRAF 1049 LFA+INSVSRGKGQDLFL++FY+SL IK +KLQVP IHA IVGSD+NAQ+KFE+ELR F Sbjct: 280 LFAVINSVSRGKGQDLFLRSFYQSLQLIKGRKLQVPSIHAVIVGSDMNAQTKFETELRNF 339 Query: 1050 VESQKIQGQVHFVNKTLAVAPYLAAIDVLVQNSQARGECFGRITIEAMAFQLPVLGTAAG 1229 V KIQ QVHF+NKTL VAPYLA+IDVLVQNSQARGECFGRITIEAMAFQLPVLGTAAG Sbjct: 340 VVENKIQDQVHFINKTLTVAPYLASIDVLVQNSQARGECFGRITIEAMAFQLPVLGTAAG 399 Query: 1230 GTQEIVLNGTTGLLHPVGKQGVISLSRNMVKLATHVERRLTMGKIGYERVKERFLERHME 1409 GT EIV+NGTTGLLH VGK+GV L+ N+VKLAT+VERRLTMGK GYERVKERFLE HM Sbjct: 400 GTTEIVVNGTTGLLHNVGKEGVKPLANNIVKLATNVERRLTMGKRGYERVKERFLEHHMS 459 Query: 1410 HRVAAVLKDVLQNAK 1454 R+A+VLK+VL+ A+ Sbjct: 460 ERIASVLKEVLKKAE 474 >ref|NP_177675.1| UDP-glycosyltransferase-like protein [Arabidopsis thaliana] gi|30793985|gb|AAP40442.1| unknown protein [Arabidopsis thaliana] gi|110739259|dbj|BAF01543.1| hypothetical protein [Arabidopsis thaliana] gi|332197597|gb|AEE35718.1| UDP-glycosyltransferase-like protein [Arabidopsis thaliana] Length = 463 Score = 610 bits (1572), Expect = e-172 Identities = 308/409 (75%), Positives = 348/409 (85%) Frame = +3 Query: 234 TVAASRIRPLSFMXXXXXXXXXXXXXXXGGPLLLMELAFLLRGVGADVRWITNQRPSEAD 413 + A + PL FM GGPLLLMELAFLLRGVGADV WITNQ+P E D Sbjct: 52 SAAKFQSNPLDFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGADVVWITNQKPLEDD 111 Query: 414 NVIYSLEHKMLDRGVQVISAKGQEAVNTALRADLVVLNTAVAGKWLDSVLKEKVSFVLPK 593 V+YSLEHKMLDRGVQVISAKGQ+AV+T+L+ADL+VLNTAVAGKWLD+VLKE V VLPK Sbjct: 112 EVVYSLEHKMLDRGVQVISAKGQKAVDTSLKADLIVLNTAVAGKWLDAVLKENVVKVLPK 171 Query: 594 TLWWIHEMRGHYFRLEYVKHLPFVAGAMIDSNVTAEYWKNRTEARLRFKMPKTYVVHLGN 773 LWWIHEMRGHYF + VKHLPFVAGAMIDS+ TA YWKNRT+ARL KMPKTYVVHLGN Sbjct: 172 ILWWIHEMRGHYFNADLVKHLPFVAGAMIDSHATAGYWKNRTQARLGIKMPKTYVVHLGN 231 Query: 774 SKELMEVAEDSVARRVLREHVRESLGVQSEDILFAIINSVSRGKGQDLFLQAFYESLMHI 953 SKELMEVAEDSVA+RVLREHVRESLGV++ED+LF IINSVSRGKGQDLFL+AF+ESL I Sbjct: 232 SKELMEVAEDSVAKRVLREHVRESLGVRNEDLLFGIINSVSRGKGQDLFLRAFHESLERI 291 Query: 954 KRQKLQVPPIHAAIVGSDVNAQSKFESELRAFVESQKIQGQVHFVNKTLAVAPYLAAIDV 1133 K +KLQVP +HA +VGSD++ Q+KFE+ELR FV +K++ VHFVNKTL VAPY+AAIDV Sbjct: 292 KEKKLQVPTMHAVVVGSDMSKQTKFETELRNFVREKKLENFVHFVNKTLTVAPYIAAIDV 351 Query: 1134 LVQNSQARGECFGRITIEAMAFQLPVLGTAAGGTQEIVLNGTTGLLHPVGKQGVISLSRN 1313 LVQNSQARGECFGRITIEAMAF+LPVLGTAAGGT EIV+NGTTGLLH GK+GVI L++N Sbjct: 352 LVQNSQARGECFGRITIEAMAFKLPVLGTAAGGTMEIVVNGTTGLLHSAGKEGVIPLAKN 411 Query: 1314 MVKLATHVERRLTMGKIGYERVKERFLERHMEHRVAAVLKDVLQNAKGR 1460 +VKLAT VE RL MGK GYERVKE FLE HM HR+A+VLK+VLQ+AK R Sbjct: 412 IVKLATQVELRLRMGKNGYERVKEMFLEHHMSHRIASVLKEVLQHAKAR 460