BLASTX nr result
ID: Scutellaria23_contig00033943
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Scutellaria23_contig00033943
         (444 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003519723.1| PREDICTED: pentatricopeptide repeat-containi...   145   3e-33
ref|XP_003559913.1| PREDICTED: pentatricopeptide repeat-containi...   114   1e-23
ref|XP_002528839.1| pentatricopeptide repeat-containing protein,...   107   1e-21
ref|XP_003524358.1| PREDICTED: putative pentatricopeptide repeat...   106   2e-21
ref|XP_002461091.1| hypothetical protein SORBIDRAFT_02g040530 [S...   104   7e-21

>ref|XP_003519723.1| PREDICTED: pentatricopeptide repeat-containing protein At3g57430, chloroplastic-like [Glycine max]
          Length = 727

 Score = 145 bits (366), Expect = 3e-33
 Identities = 72/146 (49%), Positives = 100/146 (68%)
 Frame = +3

Query: 3   NLGAKPCGIVLSSILPAFGKLNLLEQGKVMHGFIIKHGLDFDVVVGSALVDMYSSCGLTA 182
           N+G     IV +S+LPA GKL LL+QGK MH F++K GL  DVVVGSAL+ MY++CG
Sbjct: 327 NVGLATNAIVATSVLPALGKLELLKQGKEMHNFVLKEGLMSDVVVGSALIVMYANCGSIK 386

Query: 183 ETEILLSIWSEWDLMIWNSAIAGHASAENYGFGLSIFRRIWKSKLKPSSITLMSILPICT 362
           E E +    S+ D+M+WNS I G+    ++      FRRIW ++ +P+ IT++SILPICT
Sbjct: 387 EAESIFECTSDKDIMVWNSMIVGYNLVGDFESAFFTFRRIWGAEHRPNFITVVSILPICT 446

Query: 363 KLGALKQGMEIHCHAIRSSLEMVVSV 440
           ++GAL+QG EIH +  +S L + VSV
Sbjct: 447 QMGALRQGKEIHGYVTKSGLGLNVSV 472

 Score = 75.5 bits (184), Expect = 4e-12
 Identities = 41/138 (29%), Positives = 72/138 (52%)
 Frame = +3

Query: 9   GAKPCGIVLSSILPAFGKLNLLEQGKVMHGFIIKHGLDFDVVVGSALVDMYSSCGLTAET 188
           G  P  ++++SILPA G+L  ++ G  +    ++ G + D+ V +A++DMY  CG   E
Sbjct: 228 GLMPDSVIVASILPACGRLEAVKLGMALQVCAVRSGFESDLYVSNAVIDMYCKCGDPLEA 287

Query: 189 EILLSIWSEWDLMIWNSAIAGHASAENYGFGLSIFRRIWKSKLKPSSITLMSILPICTKL 368
             + S     D++ W++ IAG++    Y     ++  +    L  ++I   S+LP   KL
Sbjct: 288 HRVFSHMVYSDVVSWSTLIAGYSQNCLYQESYKLYIGMINVGLATNAIVATSVLPALGKL 347

Query: 369 GALKQGMEIHCHAIRSSL 422
             LKQG E+H   ++  L
Sbjct: 348 ELLKQGKEMHNFVLKEGL 365

 Score = 73.6 bits (179), Expect = 2e-11
 Identities = 39/124 (31%), Positives = 68/124 (54%)
 Frame = +3

Query: 15  KPCGIVLSSILPAFGKLNLLEQGKVMHGFIIKHGLDFDVVVGSALVDMYSSCGLTAETEI 194
           +P  I + SILP   ++  L QGK +HG++ K GL  +V VG++L+DMYS CG     E
Sbjct: 432 RPNFITVVSILPICTQMGALRQGKEIHGYVTKSGLGLNVSVGNSLIDMYSKCGFLELGEK 491

Query: 195 LLSIWSEWDLMIWNSAIAGHASAENYGFGLSIFRRIWKSKLKPSSITLMSILPICTKLGA 374
           +       ++  +N+ I+   S      GL+ + ++ +   +P+ +T +S+L  C+  G
Sbjct: 492 VFKQMMVRNVTTYNTMISACGSHGQGEKGLAFYEQMKEEGNRPNKVTFISLLSACSHAGL 551

Query: 375 LKQG 386
           L +G
Sbjct: 552 LDRG 555

 Score = 65.5 bits (158), Expect = 4e-09
 Identities = 44/146 (30%), Positives = 69/146 (47%), Gaps = 1/146 (0%)
 Frame = +3

Query: 9   GAKPCGIVLSSILPAFGKLNLLEQGKVMHGFIIKHG-LDFDVVVGSALVDMYSSCGLTAE 185
           G  P       +L A   L+ L+ G+ +H  +  HG    +V V  A++DM++ CG   +
Sbjct: 128 GVTPDNYTYPLVLKACSSLHALQLGRWVHETM--HGKTKANVYVQCAVIDMFAKCGSVED 185

Query: 186 TEILLSIWSEWDLMIWNSAIAGHASAENYGFGLSIFRRIWKSKLKPSSITLMSILPICTK 365
              +     + DL  W + I G          L +FR++    L P S+ + SILP C +
Sbjct: 186 ARRMFEEMPDRDLASWTALICGTMWNGECLEALLLFRKMRSEGLMPDSVIVASILPACGR 245

Query: 366 LGALKQGMEIHCHAIRSSLEMVVSVS 443
           L A+K GM +   A+RS  E  + VS
Sbjct: 246 LEAVKLGMALQVCAVRSGFESDLYVS 271

>ref|XP_003559913.1| PREDICTED: pentatricopeptide repeat-containing protein At3g57430, chloroplastic-like [Brachypodium distachyon]
          Length = 692

 Score = 114 bits (284), Expect = 1e-23
 Identities = 56/145 (38%), Positives = 84/145 (57%)
 Frame = +3

Query: 9   GAKPCGIVLSSILPAFGKLNLLEQGKVMHGFIIKHGLDFDVVVGSALVDMYSSCGLTAET 188
           G KP    ++SILP+  ++ L   GK +HGF +++G D    +GSA +D YS  G   E
Sbjct: 343 GLKPNSNTMASILPSLSEMKLFRHGKEIHGFSLRNGFDQSKFLGSAFIDFYSRQGSIREA 402

Query: 189 EILLSIWSEWDLMIWNSAIAGHASAENYGFGLSIFRRIWKSKLKPSSITLMSILPICTKL 368
           EI+L +  + DL+IWNS +AG+A   N    L  FR + K   +P  +T++S+LP+C
Sbjct: 403 EIVLELMPKRDLVIWNSMVAGYAVNGNTDSALCAFRALQKVGFRPDHVTVVSVLPVCNHH 462

Query: 369 GALKQGMEIHCHAIRSSLEMVVSVS 443
             L QG E+H + +R  +  V SVS
Sbjct: 463 SRLIQGKELHAYVVRHYMSSVCSVS 487

 Score = 80.1 bits (196), Expect = 2e-13
 Identities = 39/133 (29%), Positives = 75/133 (56%)
 Frame = +3

Query: 27  IVLSSILPAFGKLNLLEQGKVMHGFIIKHGLDFDVVVGSALVDMYSSCGLTAETEILLSI 206
           +++++++PA G+   L  G V+HG  ++ G+  D  V +ALVDMY  CG     + +
Sbjct: 248 VIIATVIPACGRAKELRTGMVLHGCAVRCGVGDDTCVSNALVDMYCKCGCLGMADRVFWS 307

Query: 207 WSEWDLMIWNSAIAGHASAENYGFGLSIFRRIWKSKLKPSSITLMSILPICTKLGALKQG 386
               D++ W++ IAG++        +++F  +  + LKP+S T+ SILP  +++   + G
Sbjct: 308 IGFKDVVSWSTLIAGYSQNGKDHVSVNLFTEMVTAGLKPNSNTMASILPSLSEMKLFRHG 367

Query: 387 MEIHCHAIRSSLE 425
            EIH  ++R+  +
Sbjct: 368 KEIHGFSLRNGFD 380

 Score = 62.0 bits (149), Expect = 5e-08
 Identities = 37/139 (26%), Positives = 65/139 (46%), Gaps = 9/139 (6%)
 Frame = +3

Query: 24  GIVLSSILPAFGKLNLLEQGKVMHGFIIKHGLDFDVVVG---------SALVDMYSSCGL 176
           G     ++ A   L ++EQG++     ++  ++ DVV G          ALVDM++ CG
Sbjct: 142 GFTYPPVIKACAALGVVEQGRM-----VRENVEADVVRGVVAPSVFVQCALVDMFAKCGC 196

Query: 177 TAETEILLSIWSEWDLMIWNSAIAGHASAENYGFGLSIFRRIWKSKLKPSSITLMSILPI 356
             E   +     E DL  W + I G   A ++   +S+F R+        S+ + +++P
Sbjct: 197 LGEARSVFESMLERDLAAWTAMIGGAVHAGDWLDAMSLFSRMRSEGFLADSVIIATVIPA 256

Query: 357 CTKLGALKQGMEIHCHAIR 413
           C +   L+ GM +H  A+R
Sbjct: 257 CGRAKELRTGMVLHGCAVR 275

 Score = 61.6 bits (148), Expect = 6e-08
 Identities = 28/131 (21%), Positives = 67/131 (51%)
 Frame = +3

Query: 6   LGAKPCGIVLSSILPAFGKLNLLEQGKVMHGFIIKHGLDFDVVVGSALVDMYSSCGLTAE 185
           +G +P  + + S+LP     + L QGK +H ++++H +     V +AL+DMY  C    +
Sbjct: 443 VGFRPDHVTVVSVLPVCNHHSRLIQGKELHAYVVRHYMSSVCSVSNALIDMYCKCCCLEK 502

Query: 186 TEILLSIWSEWDLMIWNSAIAGHASAENYGFGLSIFRRIWKSKLKPSSITLMSILPICTK 365
            + +  + ++ D   +N+ I+           + +F  + +  + P  +T +++L  C+
Sbjct: 503 GKEIFQLVTDRDTATYNTLISSFGKHGHEDEAIMLFDLMKRDGIAPDKVTFVALLSSCSH 562

Query: 366 LGALKQGMEIH 398
            G +++G+  +
Sbjct: 563 AGLIEKGLHFY 573

>ref|XP_002528839.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis]
 gi|223531751|gb|EEF33573.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis]
          Length = 393

 Score = 107 bits (267), Expect = 1e-21
 Identities = 52/137 (37%), Positives = 81/137 (59%)
 Frame = +3

Query: 3   NLGAKPCGIVLSSILPAFGKLNLLEQGKVMHGFIIKHGLDFDVVVGSALVDMYSSCGLTA 182
           N+  KP    LSS+LP F +   +++GK +HG+ I+HGLD DV +GS+L+DMY+ C
Sbjct: 180 NVNLKPDSFTLSSVLPIFAEYVNVDKGKEIHGYAIRHGLDGDVFIGSSLIDMYAKCTRVE 239

Query: 183 ETEILLSIWSEWDLMIWNSAIAGHASAENYGFGLSIFRRIWKSKLKPSSITLMSILPICT 362
           ++  + S+    D + WNS IAG      +  GL  FR++ K+ +KP  ++  SILP C
Sbjct: 240 DSLRVFSLLPRRDDISWNSIIAGCVQNSLFDEGLRFFRQMLKANVKPRQVSFSSILPACA 299

Query: 363 KLGALKQGMEIHCHAIR 413
            L  L  G ++H + +R
Sbjct: 300 HLTTLNLGRQLHGYILR 316

 Score = 62.4 bits (150), Expect = 4e-08
 Identities = 33/106 (31%), Positives = 56/106 (52%)
 Frame = +3

Query: 15  KPCGIVLSSILPAFGKLNLLEQGKVMHGFIIKHGLDFDVVVGSALVDMYSSCGLTAETEI 194
           KP  +  SSILPA   L  L  G+ +HG+I++   D +V + S+LVDMY+ CG
Sbjct: 285 KPRQVSFSSILPACAHLTTLNLGRQLHGYILRVRFDNNVFIASSLVDMYAKCGNVKVARW 344

Query: 195 LLSIWSEWDLMIWNSAIAGHASAENYGFGLSIFRRIWKSKLKPSSI 332
           +     + D++ W + I G+A   +    +S+F ++    +K  +I
Sbjct: 345 IFDKMKQHDMVSWTAMIMGYALHGHARDAISLFEQMEMRGMKLGNI 390

>ref|XP_003524358.1| PREDICTED: putative pentatricopeptide repeat-containing protein At3g23330-like [Glycine max]
          Length = 674

 Score = 106 bits (264), Expect = 2e-21
 Identities = 52/133 (39%), Positives = 76/133 (57%)
 Frame = +3

Query: 15  KPCGIVLSSILPAFGKLNLLEQGKVMHGFIIKHGLDFDVVVGSALVDMYSSCGLTAETEI 194
           +P    LSSILP F +   + +GK +HG+ I+HG D DV +GS+L+DMY+ C      +
Sbjct: 196 RPDSFTLSSILPIFTEHANVTKGKEIHGYAIRHGFDKDVFIGSSLIDMYAKCTQVELSVC 255

Query: 195 LLSIWSEWDLMIWNSAIAGHASAENYGFGLSIFRRIWKSKLKPSSITLMSILPICTKLGA 374
              + S  D + WNS IAG      +  GL  FRR+ K K+KP  ++  S++P C  L A
Sbjct: 256 AFHLLSNRDAISWNSIIAGCVQNGRFDQGLGFFRRMLKEKVKPMQVSFSSVIPACAHLTA 315

Query: 375 LKQGMEIHCHAIR 413
           L  G ++H + IR
Sbjct: 316 LNLGKQLHAYIIR 328

 Score = 67.4 bits (163), Expect = 1e-09
 Identities = 36/126 (28%), Positives = 64/126 (50%), Gaps = 2/126 (1%)
 Frame = +3

Query: 15  KPCGIVLSSILPAFGKLNLLEQGKVMHGFIIKHGLDFDVVVGSALVDMYSSCGLTAETEI 194
           KP  +  SS++PA   L  L  GK +H +II+ G D +  + S+L+DMY+ CG
Sbjct: 297 KPMQVSFSSVIPACAHLTALNLGKQLHAYIIRLGFDDNKFIASSLLDMYAKCGNIKMARY 356

Query: 195 LLS--IWSEWDLMIWNSAIAGHASAENYGFGLSIFRRIWKSKLKPSSITLMSILPICTKL 368
           + +     + D++ W + I G A   +    +S+F  +    +KP  +  M++L  C+
Sbjct: 357 IFNKIEMCDRDMVSWTAIIMGCAMHGHALDAVSLFEEMLVDGVKPCYVAFMAVLTACSHA 416

Query: 369 GALKQG 386
           G + +G
Sbjct: 417 GLVDEG 422

 Score = 62.4 bits (150), Expect = 4e-08
 Identities = 44/168 (26%), Positives = 76/168 (45%), Gaps = 22/168 (13%)
 Frame = +3

Query: 3   NLGAKPCGIVLSSILPAFGKLNLLEQGKVMHGFIIKHGLDFDVVVGSALVDMYSSCG--- 173
           + G  P   +  S+L A          + +H  +I+ G  FD+   +AL++MYS
Sbjct: 69  SFGISPDRHLFPSLLRASTLFKHFNLAQSLHAAVIRLGFHFDLYTANALMNMYSKFHPHL 128

Query: 174 ----------------LTAETEILLSIWSEW---DLMIWNSAIAGHASAENYGFGLSIFR 296
                            + + + +  ++      D++ WN+ IAG+A    Y   L++ +
Sbjct: 129 SPLHEFPQARHNHNNKYSVKIDSVRKLFDRMPVRDVVSWNTVIAGNAQNGMYEEALNMVK 188

Query: 297 RIWKSKLKPSSITLMSILPICTKLGALKQGMEIHCHAIRSSLEMVVSV 440
            + K  L+P S TL SILPI T+   + +G EIH +AIR   +  V +
Sbjct: 189 EMGKENLRPDSFTLSSILPIFTEHANVTKGKEIHGYAIRHGFDKDVFI 236

>ref|XP_002461091.1| hypothetical protein SORBIDRAFT_02g040530 [Sorghum bicolor]
 gi|241924468|gb|EER97612.1| hypothetical protein SORBIDRAFT_02g040530 [Sorghum bicolor]
          Length = 695

 Score = 104 bits (260), Expect = 7e-21
 Identities = 54/145 (37%), Positives = 83/145 (57%)
 Frame = +3

Query: 9   GAKPCGIVLSSILPAFGKLNLLEQGKVMHGFIIKHGLDFDVVVGSALVDMYSSCGLTAET 188
           G KP    L+SILP+  +L L   GK +H F +++GL+    + SAL+D YS  G   E
Sbjct: 346 GVKPNSTTLASILPSLSELRLFRYGKEIHCFSLRNGLEHSEFLASALIDFYSRQGSIKEA 405

Query: 189 EILLSIWSEWDLMIWNSAIAGHASAENYGFGLSIFRRIWKSKLKPSSITLMSILPICTKL 368
           EI+     + DL++ NS I G+   E+    L + R + K  L+P  +T++S+LP+C +
Sbjct: 406 EIVFEFTPKNDLVVSNSMIGGYVVNEDSESALRLLRALLKEGLRPDRVTVVSVLPLCNQH 465

Query: 369 GALKQGMEIHCHAIRSSLEMVVSVS 443
             L QG E+H +AIR ++    SVS
Sbjct: 466 SRLLQGKELHAYAIRHNISSCCSVS 490

 Score = 101 bits (251), Expect = 7e-20
 Identities = 51/139 (36%), Positives = 80/139 (57%)
 Frame = +3

Query: 9   GAKPCGIVLSSILPAFGKLNLLEQGKVMHGFIIKHGLDFDVVVGSALVDMYSSCGLTAET 188
           G +P  ++L++++PA GK+  L  G  +HG ++K G+  D  V +ALVDMY  C
Sbjct: 245 GFRPDSMILATVIPACGKVKELRTGTALHGCVVKCGVGVDTCVLNALVDMYCKCARLDFA 304

Query: 189 EILLSIWSEWDLMIWNSAIAGHASAENYGFGLSIFRRIWKSKLKPSSITLMSILPICTKL 368
             L       D++ W++ IAGH+    Y   +S+F  +  S +KP+S TL SILP  ++L
Sbjct: 305 ASLFWSIDHKDVISWSTIIAGHSQNRRYHVSVSLFSEMVASGVKPNSTTLASILPSLSEL 364

Query: 369 GALKQGMEIHCHAIRSSLE 425
              + G EIHC ++R+ LE
Sbjct: 365 RLFRYGKEIHCFSLRNGLE 383

 Score = 59.7 bits (143), Expect = 2e-07
 Identities = 34/130 (26%), Positives = 64/130 (49%), Gaps = 3/130 (2%)
 Frame = +3

Query: 9   GAKPCGIVLSSILPAFGKLNLLEQGKVMHGFIIKHGLDFDVVVGSALVDMYSSCGLTAET 188
           G +P  + + S+LP   + + L QGK +H + I+H +     V +AL DMY  CG
Sbjct: 447 GLRPDRVTVVSVLPLCNQHSRLLQGKELHAYAIRHNISSCCSVSNALTDMYCKCGCLELA 506

Query: 189 EILLSIWSEWDLMIWN---SAIAGHASAENYGFGLSIFRRIWKSKLKPSSITLMSILPIC 359
             +  + +E + + +N   S++  H  AE   F   + +R     + P  +T +++L  C
Sbjct: 507 FEIFLLMTERNTVTYNTLISSLGKHGHAEQAFFLFDLMKR---DGVSPDKVTFVALLSCC 563

Query: 360 TKLGALKQGM 389
           +  G + +G+
Sbjct: 564 SHEGLIDKGL 573

 Score = 55.5 bits (132), Expect = 5e-06
 Identities = 33/123 (26%), Positives = 60/123 (48%), Gaps = 4/123 (3%)
 Frame = +3

Query: 42  ILPAFGKLNLLEQGKVMHGFI---IKHGL-DFDVVVGSALVDMYSSCGLTAETEILLSIW 209
           +L A   L ++EQG+ +   +   I  G+   +V V  ALVDM++ CG   E   +
Sbjct: 151 VLKACAALGVVEQGRKVQENVEADIARGIAKCNVFVQCALVDMFAKCGCLGEARNVFESM 210

Query: 210 SEWDLMIWNSAIAGHASAENYGFGLSIFRRIWKSKLKPSSITLMSILPICTKLGALKQGM 389
              DL  W + I G     ++   +++ +R+     +P S+ L +++P C K+  L+ G
Sbjct: 211 EVRDLAAWTAMIGGTVHGGDWLEVMTLLKRMKSEGFRPDSMILATVIPACGKVKELRTGT 270

Query: 390 EIH 398
            +H
Sbjct: 271 ALH 273