BLASTX nr result
ID: Scutellaria22_contig00019226
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Scutellaria22_contig00019226 (1870 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003527954.1| PREDICTED: histone-lysine N-methyltransferas... 608 e-171 ref|XP_003522568.1| PREDICTED: histone-lysine N-methyltransferas... 607 e-171 emb|CBI18964.3| unnamed protein product [Vitis vinifera] 566 e-159 ref|XP_002307459.1| SET domain protein [Populus trichocarpa] gi|... 557 e-156 ref|XP_002889136.1| hypothetical protein ARALYDRAFT_476894 [Arab... 495 e-137 >ref|XP_003527954.1| PREDICTED: histone-lysine N-methyltransferase ASHH2-like [Glycine max] Length = 2037 Score = 608 bits (1569), Expect = e-171 Identities = 331/630 (52%), Positives = 420/630 (66%), Gaps = 9/630 (1%) Frame = -1 Query: 1864 ISPDGVRDQYDPPRNAWVLCDECQKWRRIPATLADQINETECEWTCRDNYDRDFADCSIP 1685 +S G +Q PRNAWV CD+C KWRRIPA LAD+I+ET C WTC+D+ D+ FADC+IP Sbjct: 1005 LSGVGYGEQLLSPRNAWVRCDDCHKWRRIPAVLADRIDETNCTWTCKDSSDKAFADCAIP 1064 Query: 1684 QEKSDSEINEELGISDASCEEDAC--GTDLKSIQ-DPSNTAQQSSWSLIKSNLFLSRSRK 1514 QEKS++EIN ELG+SDAS EEDA + K ++ P +Q+S+++ I +N FL RS K Sbjct: 1065 QEKSNAEINAELGLSDASGEEDAYEGSKNFKELEYRPPLVSQESTFTHILTNEFLHRSHK 1124 Query: 1513 TQTIDEVMVCHCKPPSDGRMGCGEKCLNRMLNIECVRGTCPCGERCSNQQFQKRNYSKVM 1334 TQTIDE+MVCHCKP +G++GCG++CLNR+LNIECV+GTCPCG+RCSNQQFQK Y+ + Sbjct: 1125 TQTIDEIMVCHCKPSQEGKLGCGDECLNRILNIECVQGTCPCGDRCSNQQFQKHKYASLK 1184 Query: 1333 WFKCGKKGYGLQALEDIAESHFLIEYVGEVLDVRAYEARQKEYAMNGHKHFYFMTLNGSE 1154 WFKCGKKGYGL+A+E++A+ FLIEYVGEVLD++AYEARQ+EYA+ GH+HFYFMTLNGSE Sbjct: 1185 WFKCGKKGYGLKAIENVAQGQFLIEYVGEVLDMQAYEARQREYALKGHRHFYFMTLNGSE 1244 
Query: 1153 VIDACSKGNLGRFINHSCDPNCRTEKWMVNGEVCIGLFALRDIKKGEEITFDYNFVRVFG 974 VIDA +KGNLGRFINHSCDPNCRTEKWMVNGE+CIGLFALRDIKK EE+TFDYN+VRVFG Sbjct: 1245 VIDASAKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDIKKDEELTFDYNYVRVFG 1304 Query: 973 AAAKKCVCGSPNCRGYI-GGDLTNSEVIAQDDSDDEYAEPVMICEDGEMNKDWNEIMSNT 797 AAAKKC CGSPNCRGYI GGD N+E+I Q DS++E+ EPVM+ +DGE+ + Sbjct: 1305 AAAKKCYCGSPNCRGYIGGGDPLNAELIVQSDSEEEFPEPVMLTKDGEIED--SVPTPEY 1362 Query: 796 FNNGKIESSAIEPPENIYHMKKLNCAGDENKSESHSFEFSPQKIEGVNSAQVAETREGCG 617 FNN +S+ HM K D N + + + S +K +N A Sbjct: 1363 FNNVDTQSAK--------HMLKDRDILD-NSTTAIDSDGSLEKERSMNPASAV------- 1406 Query: 616 LYNSVGNTSAAHVDEKVNMNNITGESLKGSEPAALKIESEVIWSRMHXXXXXXXXXXXXD 437 S+ ++SA D K G+ + + + E + S+ Sbjct: 1407 ---SLLHSSAEMEDSK-------GKLQSSVQVEEISQQMEDVTSKPMPAVHQGYEKESEF 1456 Query: 436 GIVNTSQEQVFNVSP----SKSFPDKVECKRKIKYASRGRRDERAKSNFVAKTNRSLSSI 269 +S +++ SP SK P+ R+ K G R KT + S+ Sbjct: 1457 ADKTSSIQRLDTTSPLTTVSKMLPNSAGSNRESKSEIIGGR----------KTPKLKGSV 1506 Query: 268 KKGKPKANVLNG-KAPPDEDKLSAAQHKSKKIPEHSLNNHVEAVEEKLNELLDPEGGISK 92 KKGK AN NG K ++L K KK+ E S N EAV+EKLNELLD +GGISK Sbjct: 1507 KKGKVHANPPNGLKTEVTANRLQVPSIKHKKV-EGSSNGRFEAVQEKLNELLDGDGGISK 1565 Query: 91 RKDASRGYLKLLFLTAATGNNGHGEAIQSN 2 RKDA++GYLKLLFLT A+G+ +GEAIQSN Sbjct: 1566 RKDATKGYLKLLFLTVASGDRINGEAIQSN 1595 >ref|XP_003522568.1| PREDICTED: histone-lysine N-methyltransferase ASHH2-like [Glycine max] Length = 2081 Score = 607 bits (1565), Expect = e-171 Identities = 332/633 (52%), Positives = 425/633 (67%), Gaps = 12/633 (1%) Frame = -1 Query: 1864 ISPDGVRDQYDPPRNAWVLCDECQKWRRIPATLADQINETECEWTCRDNYDRDFADCSIP 1685 +S G +Q PRNAWV CD+C KWRRIPA LAD+I+ET C WTC+D+ D+ FADC+IP Sbjct: 1049 LSGVGFGEQILSPRNAWVRCDDCHKWRRIPAVLADRIDETNCTWTCKDSSDKAFADCAIP 1108 Query: 1684 QEKSDSEINEELGISDASCEEDAC--GTDLKSIQD-PSNTAQQSSWSLIKSNLFLSRSRK 1514 QEKS++EIN ELG+SDAS EEDA + K ++ P +Q+S+++ I +N FL RS K Sbjct: 1109 QEKSNAEINAELGLSDASGEEDAYEGSKNFKELEYWPPIVSQESTFTNILTNEFLHRSHK 1168 Query: 1513 
TQTIDEVMVCHCKPPSDGRMGCGEKCLNRMLNIECVRGTCPCGERCSNQQFQKRNYSKVM 1334 TQTIDE+MVCHCKP G++GCG++CLNR+LNIECV+GTCPCG+RCSNQQFQK Y+ + Sbjct: 1169 TQTIDEIMVCHCKPSQGGKLGCGDECLNRILNIECVQGTCPCGDRCSNQQFQKHKYASLK 1228 Query: 1333 WFKCGKKGYGLQALEDIAESHFLIEYVGEVLDVRAYEARQKEYAMNGHKHFYFMTLNGSE 1154 WFKCGKKGYGL+A+ED+A+ FLIEYVGEVLD++ YEARQ+EYA+ GH+HFYFMTLNGSE Sbjct: 1229 WFKCGKKGYGLKAIEDVAQGQFLIEYVGEVLDMQTYEARQREYALKGHRHFYFMTLNGSE 1288 Query: 1153 VIDACSKGNLGRFINHSCDPNCRTEKWMVNGEVCIGLFALRDIKKGEEITFDYNFVRVFG 974 VIDA +KGNLGRFINHSCDPNCRTEKWMVNGE+CIGLFALR++KK EE+TFDYN+VRVFG Sbjct: 1289 VIDASAKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRNVKKDEELTFDYNYVRVFG 1348 Query: 973 AAAKKCVCGSPNCRGYI-GGDLTNSEVIAQDDSDDEYAEPVMICEDGEMNKDWNEIMSNT 797 AAAKKC CGS NCRGYI GGD N+E+I Q DS++E+ EPVM+ +DGE+ Sbjct: 1349 AAAKKCYCGSSNCRGYIGGGDPLNAELIVQSDSEEEFPEPVMLTKDGEIED--AVPTPKY 1406 Query: 796 FNNGKIESSAIEPPENIYHMKKLNCAGDENKSESHSFEFSPQKIEGVNSAQ-VAETREGC 620 FNN ES+ HM K + EN + + + SP+K +N A V+ Sbjct: 1407 FNNVDTESAK--------HMLK-DRDILENPTTAIDSDGSPEKESSMNPASAVSLLHSSA 1457 Query: 619 GLYNSVGNTSAAHVDEKVN--MNNITGESLKGSEPAALKIESEVIWSRMHXXXXXXXXXX 446 + +S G ++ DE+++ M ++T + + K ESE Sbjct: 1458 EMEDSKGKLPSSVRDEEISQQMEDVTSKPMPSVHQGYEK-ESE----------------- 1499 Query: 445 XXDGIVNTSQEQVFNVSP----SKSFPDKVECKRKIKYASRGRRDERAKSNFVAKTNRSL 278 +S +++ SP SK P+ R+ K G + KT + Sbjct: 1500 --FADKTSSIQRLETTSPPTTVSKMLPNSAGSNRESKSEIIGGK----------KTPKLN 1547 Query: 277 SSIKKGKPKANVLNG-KAPPDEDKLSAAQHKSKKIPEHSLNNHVEAVEEKLNELLDPEGG 101 S+KKGK AN NG K ++L + K KK+ E S N EAV+EKLNELLD +GG Sbjct: 1548 GSVKKGKVHANPPNGLKTEVTANRLQVSSIKHKKV-EGSSNGRFEAVQEKLNELLDGDGG 1606 Query: 100 ISKRKDASRGYLKLLFLTAATGNNGHGEAIQSN 2 ISKRKDA++GYLKLLFLT A+G+ +GEAIQSN Sbjct: 1607 ISKRKDATKGYLKLLFLTVASGDRINGEAIQSN 1639 >emb|CBI18964.3| unnamed protein product [Vitis vinifera] Length = 1958 Score = 566 bits (1458), Expect = e-159 Identities = 277/447 (61%), Positives = 324/447 (72%), Gaps = 21/447 (4%) Frame = -1 Query: 1840 
QYDPPRNAWVLCDECQKWRRIPATLADQINETECEWTCRDNYDRDFADCSIPQEKSDSEI 1661 QY PPR AWV CD+C KWRRI A LAD I ET C+W C+DN D+ FADCSIPQEKS+ EI Sbjct: 1097 QYLPPRIAWVRCDDCYKWRRIAAALADSIEETNCKWICKDNMDKAFADCSIPQEKSNGEI 1156 Query: 1660 NEELGISDASCEEDACGTDLKSI---QDPSNTAQQSSWSLIKSNLFLSRSRKTQTIDEVM 1490 N EL ISDASCEED L S Q S Q SSW LI+SNLFL RSR+TQTIDEVM Sbjct: 1157 NAELEISDASCEEDVYDAHLTSKEFGQRRSTVTQSSSWMLIRSNLFLHRSRRTQTIDEVM 1216 Query: 1489 VCHCKPPSDGRMGCGEKCLNRMLNIECVRGTCPCGERCSNQQFQKRNYSKVMWFKCGKKG 1310 VCHCK P +GR GCG++CLNRMLNIECV+GTCPCG+ CSNQQFQKR Y+K+ WFKCGKKG Sbjct: 1217 VCHCKRPVEGRFGCGDECLNRMLNIECVQGTCPCGDLCSNQQFQKRGYAKLKWFKCGKKG 1276 Query: 1309 YGLQALEDIAESHFLIEYVGEVLDVRAYEARQKEYAMNGHKHFYFMTLNGSEVIDACSKG 1130 YGLQ +DI++ FLIEYVGEVLD++ YEARQKEYA GHKHFYFMTLNGSEVIDAC+KG Sbjct: 1277 YGLQLQQDISQGQFLIEYVGEVLDLQTYEARQKEYASRGHKHFYFMTLNGSEVIDACAKG 1336 Query: 1129 NLGRFINHSCDPNCRTEKWMVNGEVCIGLFALRDIKKGEEITFDYNFVRVFGAAAKKCVC 950 NLGRFINHSCDPNCRTEKWMVNGE+CIGLFALRDIKKGEE+TFDYN+VRVFGAAAKKCVC Sbjct: 1337 NLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVFGAAAKKCVC 1396 Query: 949 GSPNCRGYIGGDLTNSEVIAQDDSDDEYAEPVMICEDGEMNKDWNEIMSNT--FNNGKIE 776 GSP CRGYIGGD ++EVI Q DSD+EY EPVM+ EDGE ++ +S T F+ +I+ Sbjct: 1397 GSPQCRGYIGGDPLSTEVIVQGDSDEEYPEPVMVNEDGETADSFDNTISTTSSFDAAEIQ 1456 Query: 775 SSAIEPPENIYH----------------MKKLNCAGDENKSESHSFEFSPQKIEGVNSAQ 644 +S+ N+ MK + + +S+S + K G+ + Sbjct: 1457 TSSDSADANVSKSETPEEKQVCSKSRLLMKASRSSSSVKRGKSNSNPVNANKPPGIGNKT 1516 Query: 643 VAETREGCGLYNSVGNTSAAHVDEKVN 563 + + L + N V EK+N Sbjct: 1517 QVLSNKPKKLLDGSANARFEAVQEKLN 1543 Score = 108 bits (271), Expect = 4e-21 Identities = 65/144 (45%), Positives = 86/144 (59%), Gaps = 1/144 (0%) Frame = -1 Query: 433 IVNTSQEQVFNVSPSKSFPDKVECKRKIKYASRGRRDERAKSNFVAKTNRSLSSIKKGKP 254 I +S NVS S++ +K C +KS + K +RS SS+K+GK Sbjct: 1455 IQTSSDSADANVSKSETPEEKQVC---------------SKSRLLMKASRSSSSVKRGKS 1499 Query: 253 KANVLNGKAPPD-EDKLSAAQHKSKKIPEHSLNNHVEAVEEKLNELLDPEGGISKRKDAS 77 +N +N PP +K +K KK+ + S N EAV+EKLNELLD GGISKRKD+S Sbjct: 
1500 NSNPVNANKPPGIGNKTQVLSNKPKKLLDGSANARFEAVQEKLNELLDANGGISKRKDSS 1559 Query: 76 RGYLKLLFLTAATGNNGHGEAIQS 5 +GYLKLL LT A+G+NG+ EAIQS Sbjct: 1560 KGYLKLLLLTVASGDNGNREAIQS 1583 >ref|XP_002307459.1| SET domain protein [Populus trichocarpa] gi|222856908|gb|EEE94455.1| SET domain protein [Populus trichocarpa] Length = 594 Score = 557 bits (1436), Expect = e-156 Identities = 302/605 (49%), Positives = 374/605 (61%), Gaps = 7/605 (1%) Frame = -1 Query: 1828 PRNAWVLCDECQKWRRIPATLADQINETECEWTCRDNYDRDFADCSIPQEKSDSEINEEL 1649 P NAWV CD+C KWRRIP L + I++T C+W C+DN ++ FADCS PQEKS++EIN EL Sbjct: 4 PDNAWVRCDDCLKWRRIPVRLVESISQTHCQWICKDNMNKAFADCSFPQEKSNAEINAEL 63 Query: 1648 GISDASCEEDACGTDLKSIQDPSNTAQQSSWSLIKSNLFLSRSRKTQTIDEVMVCHCKPP 1469 GISD +ED C D S +++ ++ I +N FL RSRKTQTIDE+MVC+CK P Sbjct: 64 GISDV--DEDGC--DAPSNYMELEFSKEYEFTRITTNQFLHRSRKTQTIDEIMVCYCKAP 119 Query: 1468 SDGRMGCGEKCLNRMLNIECVRGTCPCGERCSNQQFQKRNYSKVMWFKCGKKGYGLQALE 1289 GR+GCG++CLNRMLNIECV+GTCPCG+ CSNQQFQKRNY+K+ W +CGKKG+GL+ E Sbjct: 120 VAGRLGCGDECLNRMLNIECVQGTCPCGDHCSNQQFQKRNYAKMTWERCGKKGFGLRLDE 179 Query: 1288 DIAESHFLIEYVGEVLDVRAYEARQKEYAMNGHKHFYFMTLNGSEVIDACSKGNLGRFIN 1109 DI+ FLIEYVGEVLDV AYEARQK+YA GHKHFYFMTL+GSEVIDAC+KGNLGRFIN Sbjct: 180 DISRGQFLIEYVGEVLDVHAYEARQKDYASKGHKHFYFMTLDGSEVIDACAKGNLGRFIN 239 Query: 1108 HSCDPNCRTEKWMVNGEVCIGLFALRDIKKGEEITFDYNFVRVFGAAAKKCVCGSPNCRG 929 HSCDPNCRTEKW+VNGE+CIGLFALRDIK GEE+TFDYN+VRV GAAAK+C CGSP CRG Sbjct: 240 HSCDPNCRTEKWVVNGEICIGLFALRDIKMGEEVTFDYNYVRVVGAAAKRCYCGSPQCRG 299 Query: 928 YIGGDLTNSEVIAQDDSDDEYAEPVMICEDGEMNKDWNEIMSNTFNNGKIESSAIEPPEN 749 YIGGD T++EV+ Q DSD+E+ EPVM+ EDG + +S T G + IE Sbjct: 300 YIGGDPTSTEVVDQVDSDEEFPEPVML-EDGRVGGGLKNKISKTNFFGLSKDREIEFKTA 358 Query: 748 IYHMKKLNCAGDENKSESHSFEFSP--QKIEGV-----NSAQVAETREGCGLYNSVGNTS 590 + +++ D + + SP ++ G+ +S+Q ET V + Sbjct: 359 VGNLEVATEIKDLTSQLTPAMSLSPSASEMNGLPGDFSSSSQQVETSPKA---EDVMSQP 415 Query: 589 AAHVDEKVNMNNITGESLKGSEPAALKIESEVIWSRMHXXXXXXXXXXXXDGIVNTSQEQ 410 V ++++M +SL SE S Sbjct: 416 
TPAVQQEISMEETMNKSLYSSEKLRTSPTS------------------------------ 445 Query: 409 VFNVSPSKSFPDKVECKRKIKYASRGRRDERAKSNFVAKTNRSLSSIKKGKPKANVLNGK 230 +P+K PD V RK K A+ + KS F+ KT S IKKGK Sbjct: 446 ----TPTKILPDDVMINRKSKSAAAENKRVFVKSRFIIKTPHQSSLIKKGK--------- 492 Query: 229 APPDEDKLSAAQHKSKKIPEHSLNNHVEAVEEKLNELLDPEGGISKRKDASRGYLKLLFL 50 AV+EKLNELLD EGGISKRKDA +GYLKLL L Sbjct: 493 ---------------------------SAVQEKLNELLDSEGGISKRKDAPKGYLKLLLL 525 Query: 49 TAATG 35 TAA+G Sbjct: 526 TAASG 530 >ref|XP_002889136.1| hypothetical protein ARALYDRAFT_476894 [Arabidopsis lyrata subsp. lyrata] gi|297334977|gb|EFH65395.1| hypothetical protein ARALYDRAFT_476894 [Arabidopsis lyrata subsp. lyrata] Length = 1766 Score = 495 bits (1275), Expect = e-137 Identities = 284/611 (46%), Positives = 368/611 (60%), Gaps = 7/611 (1%) Frame = -1 Query: 1849 VRDQYDPPRNAWVLCDECQKWRRIPATLADQINETECEWTCRDNYDRDFADCSIPQEKSD 1670 + D Y +AWV CD+C KWRRIPA++ I+E+ W C +N D+ FADCS QE S+ Sbjct: 855 IEDSYST-ESAWVRCDDCFKWRRIPASVVGSIDESS-RWICLNNSDKKFADCSKSQEMSN 912 Query: 1669 SEINEELGI----SDA-SCEEDACGTDLKSIQDPSNTAQQSSWSLIKSNLFLSRSRKTQT 1505 EINEELGI +DA C+ G + + Q++ + IK+N FL R+RK+QT Sbjct: 913 EEINEELGIGQDEADAYDCDAAKRGKEKEQKSKRLTGKQKACFKAIKTNQFLHRNRKSQT 972 Query: 1504 IDEVMVCHCKPPSDGRMGCGEKCLNRMLNIECVRGTCPCGERCSNQQFQKRNYSKVMWFK 1325 IDE+MVCHCKPP DGR+GCGE+CLNRMLNIEC++GTCP G+ CSNQQFQKR Y K F+ Sbjct: 973 IDEIMVCHCKPPPDGRLGCGEECLNRMLNIECLQGTCPAGDLCSNQQFQKRKYVKFERFQ 1032 Query: 1324 CGKKGYGLQALEDIAESHFLIEYVGEVLDVRAYEARQKEYAMNGHKHFYFMTLNGSEVID 1145 GKKGYGL+ LED+ E FLIEYVGEVLD+++Y+ RQKEYA G KHFYFMTLNG+EVID Sbjct: 1033 SGKKGYGLRLLEDVREGQFLIEYVGEVLDMQSYDTRQKEYACKGQKHFYFMTLNGNEVID 1092 Query: 1144 ACSKGNLGRFINHSCDPNCRTEKWMVNGEVCIGLFALRDIKKGEEITFDYNFVRVFGAAA 965 A +KGNLGRFINHSC+PNCRTEKWMVNGE+C+G+F+++D+KKG+E+TFDYN+VRVFGAAA Sbjct: 1093 AGAKGNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMKDLKKGQELTFDYNYVRVFGAAA 1152 Query: 964 KKCVCGSPNCRGYIGGDLTNSEVIAQDDSDDEYAEPVMICEDGEMNKDWNEIMSNTFNNG 785 KKC CGS +CRGYIGGD N +VI Q DSD+EY E ++I +D E + + S TF 
Sbjct: 1153 KKCYCGSSHCRGYIGGDPLNGDVIIQSDSDEEYPE-LVILDDDESGEGILDATSRTF--- 1208 Query: 784 KIESSAIEPPENIYHMK-KLNCAGDENKSESHSFEFSPQKIEGVNSAQVAETREGCGLYN 608 I+ + + P+N + + A D +S+S P++ Q E L Sbjct: 1209 -IDDADEQMPQNSETVNGSKDLAPDNAQSQSSVSVKLPEREIPPPLLQPTEV-----LKE 1262 Query: 607 SVGNTSAAHVDEKVNMNNITGESLKGSEPAALKIESEVIWSRMHXXXXXXXXXXXXDGIV 428 + + V ++V + T K + P + + SR+ G Sbjct: 1263 LPSGIAVSAVQQEVPVEKKT----KSTSPTSSSL------SRL------------SSGGA 1300 Query: 427 NTSQEQVFNVSPSKSFPDKVECKRKIKYASRGRRDERAK-SNFVAKTNRSLSSIKKGKPK 251 NT K K R R R K S + R I G K Sbjct: 1301 NTDMTTKHGSGEDK------------KILPRPRPRPRMKTSRLSVSSKRDKGGILSGVNK 1348 Query: 250 ANVLNGKAPPDEDKLSAAQHKSKKIPEHSLNNHVEAVEEKLNELLDPEGGISKRKDASRG 71 A ++ +KL KSK E S + +E E KLNELLD GGISKR+D+++G Sbjct: 1349 AQII------PVNKLQQQPIKSKGSEEVSSSGRIETFEGKLNELLDAVGGISKRRDSAKG 1402 Query: 70 YLKLLFLTAAT 38 YLKLL LTAA+ Sbjct: 1403 YLKLLLLTAAS 1413
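The per-alignment statistics in the report above ("Score = ... bits (...), Expect = ... Identities = .../...") can be pulled out of the text programmatically. The following is a minimal sketch, not part of the BLAST distribution: the names HSP_RE and parse_hsps are hypothetical, and it assumes the legacy BLAST 2.2.x text layout shown here, where an E-value such as "e-171" is shorthand for 1e-171. It also assumes that whitespace may have been collapsed, as in this record.

```python
import re

# Hypothetical pattern for the statistics line of each alignment (HSP) in
# legacy BLAST text output. "Expect(2) = ..." style labels are tolerated.
HSP_RE = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect(?:\(\d+\))?\s*=\s*(?P<evalue>\S+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)"
)


def parse_hsps(text):
    """Return one dict of statistics per alignment found in `text`."""
    hsps = []
    for match in HSP_RE.finditer(text):
        evalue = match.group("evalue")
        # Legacy BLAST abbreviates 1e-171 as "e-171"; restore the mantissa.
        if evalue.startswith("e"):
            evalue = "1" + evalue
        identities = int(match.group("ident"))
        align_len = int(match.group("length"))
        hsps.append(
            {
                "bits": float(match.group("bits")),
                "raw_score": int(match.group("raw")),
                "evalue": float(evalue),
                "identities": identities,
                "align_len": align_len,
                # Unrounded percent identity; the report prints it as an integer.
                "pct_identity": 100.0 * identities / align_len,
            }
        )
    return hsps


if __name__ == "__main__":
    sample = (
        "Score = 608 bits (1569), Expect = e-171 "
        "Identities = 331/630 (52%), Positives = 420/630 (66%), "
        "Gaps = 9/630 (1%) Frame = -1"
    )
    for hsp in parse_hsps(sample):
        print(hsp)
```

Applied to the full report, this would yield one entry per alignment, including the second, weaker alignment (108 bits, Expect = 4e-21) reported for the Vitis vinifera hit. The pattern deliberately ignores the Positives, Gaps and Frame fields; extending it to capture them is straightforward if they are needed.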