BLASTX nr result
ID: Scutellaria22_contig00005992
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Scutellaria22_contig00005992 (2156 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CBI35923.3| unnamed protein product [Vitis vinifera] 198 4e-48 ref|XP_002521366.1| conserved hypothetical protein [Ricinus comm... 184 1e-43 ref|XP_002279061.1| PREDICTED: uncharacterized protein LOC100261... 164 1e-37 gb|AAM13859.1| unknown protein [Arabidopsis thaliana] 155 5e-35 ref|NP_177422.1| hydroxyproline-rich glycoprotein family protein... 155 5e-35 >emb|CBI35923.3| unnamed protein product [Vitis vinifera] Length = 628 Score = 198 bits (504), Expect = 4e-48 Identities = 201/664 (30%), Positives = 273/664 (41%), Gaps = 47/664 (7%) Frame = -3 Query: 2100 CNKNSKTHKKQTQKLQRAKARKQHEMEDGDGEDRPPFWLQNATHLRRGDRLRRGXXXXXX 1921 C+ ++ +KQ Q + E + +G P FW+ ++ RR R R Sbjct: 13 CDCPIQSSQKQNQNQNTRSKKNTLETMEEEGATTP-FWMPASSGHRRR-RSSRSPSSIFL 70 Query: 1920 XXXXXXXXXXXXXXXXXXXXVPSTLSFSAHIFKPNSVKKSWDSLNVVLVIFAVVFGFLSR 1741 +P LSF+++IFKPN VKKSWDSLN+VLV+FA++ GFLSR Sbjct: 71 SSGFLIIFLPLTALLFIVFVLPPILSFTSYIFKPNMVKKSWDSLNLVLVLFAIICGFLSR 130 Query: 1740 NKNEERDSYFDGFQSSPVKENGSQKSFDFERNVEQKYE---SEQKNLMLKRNSSSYPDLR 1570 S V E +Q+S N YE S + R+SSSYPDLR Sbjct: 131 GGGGGSSDMESSV--SEVPEESTQRS-----NHGHCYEERISGYGGMRRMRSSSSYPDLR 183 Query: 1569 EFSSVNWSYGDYQARFYDDINVDSGRVSDQGLIHHHRR---HRSLEQVDYLXXXXXXXXX 1399 + S+ W+ GD + R +DD +D+ RV + HR+ R E DY Sbjct: 184 QESA--WAGGDGRWRSFDDTQLDNHRV-----LGSHRQLYIRRRYEDQDYC--------- 227 Query: 1398 XXVDTLVRESKKXXXXXXXXXXXXXXXAEDEEIPKNAVARKDRLSRKINKELESLSYVAA 1219 E+ V +S K K L + Sbjct: 228 -------------------------------EVKNIDVDNTSMISPK-EKVLSHIPPRPP 255 Query: 1218 SKP--PLPPVGESPAPESQENQKRAHERVARRKERSNRK----QIKDVEAIDTVTAPXXX 1057 S P P PP P P + KR+ + VAR + R R+ + K V+A P Sbjct: 256 SPPLPPSPPPPPPPPPVVKRKVKRSFQAVAREERRETRENSSFESKRVQAAPPPPPPPPP 315 Query: 1056 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPSKEKISKSDRKRGGATGSSTKEFLN 877 + + KSDRKRGGAT KEFL Sbjct: 316 PPPPLAV-----------------------------ERRSEKSDRKRGGAT----KEFLT 342 Query: 876 SLYHXXXXXXXXK--SVDNMESLLHQA-----------EAPLSLQIXXXXPSVFQNLFST 736 SLY+ + S++N++++LH + P + SVF NLFS+ Sbjct: 343 SLYYQRNKKKKQRQKSMENLDTILHNSPHSDQPLRPPPSPPPPPPLPPPPNSVFHNLFSS 402 Query: 735 KKQKRKRTITVTLEPLPP-----QRAEARDPEPT-------PRPPEVTAGKPPQPIKMNS 592 KK K KR +TV P PP RA A + P + A KPP P K +S Sbjct: 403 KKGKSKRFLTVPPPPPPPPPPPASRAYAGKTKTKIALSRSHPYDHPLNASKPPIPEKSSS 462 Query: 591 FDKVEEASNSGGESPLNRIXXXXXXXXXAFFRSPAWKFVVQGDYVRIXXXXXXXXXSPDP 412 F+ V+ +G ES L I F+ P WKFVV GDYVRI SPD Sbjct: 463 FNSVDGNPYAGSESLL--IPVPPPPPPPPPFKMPDWKFVVHGDYVRIKSTNSSRSGSPDL 520 Query: 411 D--DTESDVTPSAAVAFHPS--------PLFCASPDVNTKAESFITNFRAKLKLEKIHSM 262 D + S PS + + PLFC SPDVNTKA++FI FRA LKLEKI+S+ Sbjct: 521 DYIGSPSSKGPSRSTSLKSETEGGDSAQPLFCPSPDVNTKADTFIARFRAGLKLEKINSI 580 Query: 261 KKRE 250 K+++ Sbjct: 581 KEKQ 584 >ref|XP_002521366.1| conserved hypothetical protein [Ricinus communis] gi|223539444|gb|EEF41034.1| conserved hypothetical protein [Ricinus communis] Length = 553 Score = 184 bits (466), Expect = 1e-43 Identities = 182/621 (29%), Positives = 250/621 (40%), Gaps = 36/621 (5%) Frame = -3 Query: 2007 EDRPPFWLQNATHLRRGDRLRRGXXXXXXXXXXXXXXXXXXXXXXXXXXVPSTLSFSAHI 1828 ED PPFWLQ RG RLRR VPS ++F++ + Sbjct: 5 EDVPPFWLQATDQHHRGRRLRRQASSIFLNSGVILIMLLVIAFVFVFVVVPSVVTFTSQV 64 Query: 1827 FKPNSVKKSWDSLNVVLVIFAVVFGFLSRNKNEERDSYFDGFQSSPVKENGSQKSFDFER 1648 FKPN +KK WDSLN VLV+FA+V GFL RN + +Q + + S S + ++ Sbjct: 65 FKPNLIKKGWDSLNFVLVLFAIVCGFLGRNSPNTSNESSTSYQR--LSSSSSASSSNVQQ 122 Query: 1647 NVEQKYES-----------------EQKNLMLKRNSSSYPDLREFSSVNWSYGDYQARFY 1519 +V++ Y S R+ SYPDLR+ S WS D + RFY Sbjct: 123 DVQRSYPSTPAYRWYDDGQYQDRTASYNTFNRLRSFRSYPDLRQESL--WSNNDERWRFY 180 Query: 1518 DDINVDSGRVSD----QGLIHHHRRHRSLEQVDYLXXXXXXXXXXXVDTLVRESKKXXXX 1351 DD V+ + S L H + ++ D +E +K Sbjct: 181 DDTRVNGYKFSSPLHQDELQDDHPPQQQQQEQD------------------QEPRK---- 218 Query: 1350 XXXXXXXXXXXAEDEEIPKNAVARKDRL--SRKINKELESLSYVAASKPPLPPVGESPA- 1180 +D+E ++ V+ KD + I+KE V PP+PP SP Sbjct: 219 ------------QDQEQEQD-VSTKDIAVDTFVIHKE----EVVQTPPPPMPPAPVSPPR 261 Query: 1179 -PESQENQKRAHERVARRKERSNRKQIKDVEAIDTVTAPXXXXXXXXXXXXXXXXXXXXX 1003 P ++RA E R++ K++E + T+ P Sbjct: 262 LPTRSTVKRRAKRTYHDLGEHEKRRENKNLE-VKTINIPPPPPPPQLIS----------- 309 Query: 1002 XXXXXXXXXXXXXXXXPSKEKISKSDRKRGGATGSSTKEFLNSLYHXXXXXXXXKSVDNM 823 SKSD++RG K+ L SL KSV+N+ Sbjct: 310 ----------------------SKSDKRRG-------KDLLISL-RRKRKKQRQKSVENL 339 Query: 822 ESLLHQAEAPLSLQIXXXXPS-----VFQNLFSTKKQKRKRTITVTL-EPLPPQRAEARD 661 ESL + P + P FQNLFS+KK K K+ + ++ +P PP R Sbjct: 340 ESLFNPEPLPSIIPPPPPPPPPPPPHFFQNLFSSKKGKTKKDHSHSVPQPQPPSRTHRS- 398 Query: 660 PEPTPRPPEVTAGKPPQPIKMNSFDKVEEASNSGGESPLNRIXXXXXXXXXAFFRSPAWK 481 T + + A KP + +K +F VEE G SPL I F+ WK Sbjct: 399 -RTTVQEATIEAYKPLKAVKTGNFSSVEENVERGNASPLIPIPPPPPPPPPPPFKMKPWK 457 Query: 480 FVVQGDYVRIXXXXXXXXXSPD-----PDDTESDVTPSAAVAFHPSPLFCASPDVNTKAE 316 F+ GDYVR+ SPD P D ES P FC SPDVNTKAE Sbjct: 458 FISDGDYVRVASFNSSRSGSPDIDSEDPSDKESSPMARNKEGDSAMPSFCPSPDVNTKAE 517 Query: 315 SFITNFRAKLKLEKIHSMKKR 253 +FI FRA LKLEKI+S+K R Sbjct: 518 NFIARFRAGLKLEKINSVKGR 538 >ref|XP_002279061.1| PREDICTED: uncharacterized protein LOC100261010 [Vitis vinifera] Length = 555 Score = 164 bits (414), Expect = 1e-37 Identities = 164/568 (28%), Positives = 237/568 (41%), Gaps = 32/568 (5%) Frame = -3 Query: 1857 PSTLSFSAHIFKPNSVKKSWDSLNVVLVIFAVVFGFLSRNKNEERDSYFDGFQSS----- 1693 PS L+F++ +PNSV+KSWDSLNV+LV+FA++ G +R +E+ D + SS Sbjct: 52 PSFLNFTSQFLRPNSVRKSWDSLNVLLVLFAILCGVFARKNDEKNDDVLENHGSSGSVVM 111 Query: 1692 -PVKENGSQKSFDFERNVEQKYESEQKNLMLKRNSSSYPDLREFSSVNWSYGDYQARFYD 1516 E+ S F+F + ++ L+R+SSSYPDLR+ S W GD + RF+D Sbjct: 112 GKSHESISHSLFEFSDRKIYDPPIQSGSVRLRRSSSSYPDLRQESL--WGAGDDRRRFFD 169 Query: 1515 DINVDSGRVSDQGLIHHHRRHRSLEQVDYLXXXXXXXXXXXVDTLVRESKKXXXXXXXXX 1336 D V++ R + RRHR E L R+ Sbjct: 170 DFEVNNYRSPASS--DYVRRHRRSE-------------------LERDDS---------- 198 Query: 1335 XXXXXXAEDEEIPKNAVARKDRLSRKINKELESLSYVAASKPPLPPVGESPAPESQENQK 1156 E + IP + A + S S PP PP P P Q + Sbjct: 199 -------EVKVIPVDTFAVRSS---------PSPSPAPPRTPPPPP--PPPPPIVQRKPR 240 Query: 1155 RAHERVARRKERSNRKQIKDVEAIDTVTAPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 976 R++E VAR+++ SN D + +P Sbjct: 241 RSYETVARKEKLSN----SDADQFKKSRSPPAPPPPPPPPPPPRVPGGHLP--------- 287 Query: 975 XXXXXXXPSKEKISKSDRKRGGATGSSTKEFLNSLYHXXXXXXXXKSVDNMESLLHQAEA 796 ++K KS R+ GGAT F+ SLY+ ++ + E+ + + Sbjct: 288 ---------EQKSRKSARRMGGATKDIATVFV-SLYNQTRKKKKQRTKNIHENAVQSPPS 337 Query: 795 -----PLSLQIXXXXPSVFQNLFSTKKQKRKRTITVTLEPLPPQ--------RAEARD-- 661 P PS+ NLF K K KR +V+ P PP R+ R Sbjct: 338 ATTPTPPPPPPPPPPPSMLHNLFR-KGSKSKRIHSVSAPPPPPPPPPRPPPPRSSKRKTH 396 Query: 660 -----PEPTPRPPEVT-----AGKPPQPIKMNSFDKVEEASNSGGESPLNRIXXXXXXXX 511 P P P PP T AGKPP P + +SF ++ NSGG+SPL + Sbjct: 397 IPPAPPTPPPPPPPDTSRRRAAGKPPLPARKSSFYNRDDNVNSGGQSPLIPMPPPPPP-- 454 Query: 510 XAFFRSPAWKFVVQGDYVRIXXXXXXXXXSPDPDDTESDVTPSAAVAFHP-SPLFCASPD 334 FR P K+VV+GD+VRI SP+ DD + SA FC SPD Sbjct: 455 ---FRMPELKYVVRGDFVRIRSTHSSRCSSPELDDVDLSSNKSAMDGGDAIGATFCPSPD 511 Query: 333 VNTKAESFITNFRAKLKLEKIHSMKKRE 250 VN KA++FI R + +LEKI+S+++R+ Sbjct: 512 VNVKADTFIARLRGEWRLEKINSLRERK 539 >gb|AAM13859.1| unknown protein [Arabidopsis thaliana] Length = 535 Score = 155 bits (391), Expect = 5e-35 Identities = 173/619 (27%), Positives = 253/619 (40%), Gaps = 29/619 (4%) Frame = -3 Query: 2025 MEDGDGEDRPPFWLQ---NATHLRRGDRLRRGXXXXXXXXXXXXXXXXXXXXXXXXXXVP 1855 ME+ DG+ PFWLQ N T+ RR L +P Sbjct: 1 MEEDDGDASTPFWLQSRRNNTYFRRTASL----GGRTTTIATQIFFAGTAAILIVVFIIP 56 Query: 1854 STLSFSAHIFKPNSVKKSWDSLNVVLVIFAVVFGFLSRNKNEERDSYF------DGFQSS 1693 S + IF+P+ V+KSWD LN VLV+FAV+ GFLSRN N + ++ + F +S Sbjct: 57 PFFSSVSQIFRPHLVRKSWDYLNFVLVLFAVLCGFLSRNTNNDESNHHKEEDIRNKFSTS 116 Query: 1692 P--------VKENGSQKSF-DFERNVEQKYESEQKNLMLKRNSSSYPDLREFSSVNWSYG 1540 P V +G+ + + +R ++ K R+ SSYPDLR + Sbjct: 117 PSIIDRRSRVSNSGTTPRYWNDDRGGGGGDQTVYKRFSRLRSVSSYPDLR----LREYEA 172 Query: 1539 DYQARFYDDINVDSGRVSDQGLIHHHRRHRSLEQVDYLXXXXXXXXXXXVDTLVRESKKX 1360 D + RFYDD V R D I+ ++ +R+ + V +++ Sbjct: 173 DERWRFYDDTRVSQCRYEDVDPIYPNQSYRNWHE-----------EGKPPPEDVDQTEDG 221 Query: 1359 XXXXXXXXXXXXXXAEDEEIPKNAVARKDRLSRKINKELE--SLSYVAASKPPLPPVGES 1186 E E+ A A ++ +EL+ S S PP PP Sbjct: 222 DNGEGSKVRNGGSETEKVEVVATAEA-------EVVEELKVPSAPPYIPSPPPSPP-RPP 273 Query: 1185 PAPESQENQKRAHERVARRKERSNRKQIKDVEAIDTVTAPXXXXXXXXXXXXXXXXXXXX 1006 PA +++ R ++ V+ ++E+ R D A T P Sbjct: 274 PAKQAKRKTNRVYQDVSPQEEKKERD---DFVATTTPIPPPATVY--------------- 315 Query: 1005 XXXXXXXXXXXXXXXXXPSKEKISKSDRKRGGATGSSTKEFLNSLYHXXXXXXXXKSVDN 826 +K +K ++K+GGAT K+FL +L +S+D Sbjct: 316 --------------------QKSNKQEKKKGGAT----KDFLIAL-RRKKKKQRQQSIDG 350 Query: 825 MESLLHQAEAPLSLQIXXXXPS---VFQNLFSTKKQKRKRTITVTLEPLPP----QRAEA 667 ++ LL ++ PL P FQ LFS+KK K K+ + P PP +R E+ Sbjct: 351 LD-LLFGSDPPLVYSPPPPPPPPPPFFQGLFSSKKGKSKKNNSNPPPPPPPPPPERRYES 409 Query: 666 RDPEPTPR--PPEVTAGKPPQPIKMNSFDKVEEASNSGGESPLNRIXXXXXXXXXAFFRS 493 R R P E KP P K+ + +G ESPL I F+ Sbjct: 410 RASTSKLRKAPVESRTSKPNPPAKVTQY------VGTGSESPLMPIPPPPPPPP---FKM 460 Query: 492 PAWKFVVQGDYVRIXXXXXXXXXSPDPDDTESDVTPSAAVAFHPSPLFCASPDVNTKAES 313 PAWKFV +GDYVR+ PD + DV SA +FC SPDV+TKA+ Sbjct: 461 PAWKFVKRGDYVRMASDISISSDEPD----DPDVAQSAGSKEAAGSMFCPSPDVDTKADD 516 Query: 312 FITNFRAKLKLEKIHSMKK 256 FI FRA LKLEK++S+K+ Sbjct: 517 FIARFRAGLKLEKMNSVKR 535 >ref|NP_177422.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] gi|12323765|gb|AAG51845.1|AC010926_8 unknown protein; 15669-13984 [Arabidopsis thaliana] gi|24030251|gb|AAN41301.1| unknown protein [Arabidopsis thaliana] gi|332197252|gb|AEE35373.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] Length = 561 Score = 155 bits (391), Expect = 5e-35 Identities = 173/619 (27%), Positives = 253/619 (40%), Gaps = 29/619 (4%) Frame = -3 Query: 2025 MEDGDGEDRPPFWLQ---NATHLRRGDRLRRGXXXXXXXXXXXXXXXXXXXXXXXXXXVP 1855 ME+ DG+ PFWLQ N T+ RR L +P Sbjct: 1 MEEDDGDASTPFWLQSRRNNTYFRRTASL----GGRTTTIATQIFFAGTAAILIVVFIIP 56 Query: 1854 STLSFSAHIFKPNSVKKSWDSLNVVLVIFAVVFGFLSRNKNEERDSYF------DGFQSS 1693 S + IF+P+ V+KSWD LN VLV+FAV+ GFLSRN N + ++ + F +S Sbjct: 57 PFFSSVSQIFRPHLVRKSWDYLNFVLVLFAVLCGFLSRNTNNDESNHHKEEDIRNKFSTS 116 Query: 1692 P--------VKENGSQKSF-DFERNVEQKYESEQKNLMLKRNSSSYPDLREFSSVNWSYG 1540 P V +G+ + + +R ++ K R+ SSYPDLR + Sbjct: 117 PSIIDRRSRVSNSGTTPRYWNDDRGGGGGDQTVYKRFSRLRSVSSYPDLR----LREYEA 172 Query: 1539 DYQARFYDDINVDSGRVSDQGLIHHHRRHRSLEQVDYLXXXXXXXXXXXVDTLVRESKKX 1360 D + RFYDD V R D I+ ++ +R+ + V +++ Sbjct: 173 DERWRFYDDTRVSQCRYEDVDPIYPNQSYRNWHE-----------EGKPPPEDVDQTEDG 221 Query: 1359 XXXXXXXXXXXXXXAEDEEIPKNAVARKDRLSRKINKELE--SLSYVAASKPPLPPVGES 1186 E E+ A A ++ +EL+ S S PP PP Sbjct: 222 DNGEGSKVRNGGSETEKVEVVATAEA-------EVVEELKVPSAPPYIPSPPPSPP-RPP 273 Query: 1185 PAPESQENQKRAHERVARRKERSNRKQIKDVEAIDTVTAPXXXXXXXXXXXXXXXXXXXX 1006 PA +++ R ++ V+ ++E+ R D A T P Sbjct: 274 PAKQAKRKTNRVYQDVSPQEEKKERD---DFVATTTPIPPPATVY--------------- 315 Query: 1005 XXXXXXXXXXXXXXXXXPSKEKISKSDRKRGGATGSSTKEFLNSLYHXXXXXXXXKSVDN 826 +K +K ++K+GGAT K+FL +L +S+D Sbjct: 316 --------------------QKSNKQEKKKGGAT----KDFLIAL-RRKKKKQRQQSIDG 350 Query: 825 MESLLHQAEAPLSLQIXXXXPS---VFQNLFSTKKQKRKRTITVTLEPLPP----QRAEA 667 ++ LL ++ PL P FQ LFS+KK K K+ + P PP +R E+ Sbjct: 351 LD-LLFGSDPPLVYSPPPPPPPPPPFFQGLFSSKKGKSKKNNSNPPPPPPPPPPERRYES 409 Query: 666 RDPEPTPR--PPEVTAGKPPQPIKMNSFDKVEEASNSGGESPLNRIXXXXXXXXXAFFRS 493 R R P E KP P K+ + +G ESPL I F+ Sbjct: 410 RASTSKLRKAPVESRTSKPNPPAKVTQY------VGTGSESPLMPIPPPPPPPP---FKM 460 Query: 492 PAWKFVVQGDYVRIXXXXXXXXXSPDPDDTESDVTPSAAVAFHPSPLFCASPDVNTKAES 313 PAWKFV +GDYVR+ PD + DV SA +FC SPDV+TKA+ Sbjct: 461 PAWKFVKRGDYVRMASDISISSDEPD----DPDVAQSAGSKEAAGSMFCPSPDVDTKADD 516 Query: 312 FITNFRAKLKLEKIHSMKK 256 FI FRA LKLEK++S+K+ Sbjct: 517 FIARFRAGLKLEKMNSVKR 535