BLASTX nr result
ID: Scutellaria23_contig00016093
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Scutellaria23_contig00016093
         (560 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_002268999.1| PREDICTED: pentatricopeptide repeat-containi...   228   5e-58
emb|CAN80769.1| hypothetical protein VITISV_013866 [Vitis vinifera]   226   1e-57
ref|XP_002519368.1| pentatricopeptide repeat-containing protein,...   221   5e-56
gb|AAF43948.1|AC012188_25 Contains similarity to a hypothetical ...   218   5e-55
ref|NP_172899.1| pentatricopeptide repeat-containing protein [Ar...   218   5e-55

>ref|XP_002268999.1| PREDICTED: pentatricopeptide repeat-containing protein At1g14470-like [Vitis vinifera]
          Length = 729

 Score = 228 bits (581), Expect = 5e-58
 Identities = 108/183 (59%), Positives = 140/183 (76%), Gaps = 2/183 (1%)
 Frame = -1

Query: 560 SAGKNGFLFHSWLIKVGHDCGKYTRNALMNAYGKYGPVVGARQLFDEMS--ERSVADWNA 387
           SAG G FH+ ++K+GH + RNA+++ Y + GP+ AR++FDE+ ER VADWNA
Sbjct: 109 SAGTGGIGFHAHVLKLGHGSDAFVRNAVIDMYARLGPIGHARKVFDEIPDYERKVADWNA 168

Query: 386 VISGYWNWGVEGEAKRLFDSMPDKNVITWTTMVTGYSKMKDLESARSYFDRTPKKSVVSW 207
           ++SGYW W EG+A+ LFD MP++NVITWT MVTGY+K+KDLE+AR YFD P++SVVSW
Sbjct: 169 MVSGYWKWESEGQAQWLFDVMPERNVITWTAMVTGYAKVKDLEAARRYFDCMPERSVVSW 228

Query: 206 NAMLSGYARNGFSEEALCLFREMVSVGICPNETTWVAVISACSSHGDPTLVDSVVMMIED 27
           NAMLSGYA+NG +EEAL LF EMV+ GI P+ETTWV VISACSS GDP L S+V +
Sbjct: 229 NAMLSGYAQNGLAEEALRLFDEMVNAGIEPDETTWVTVISACSSRGDPCLAASLVRTLHQ 288

Query: 26  YKV 18
           ++
Sbjct: 289 KRI 291

 Score = 112 bits (280), Expect = 4e-23
 Identities = 71/210 (33%), Positives = 106/210 (50%), Gaps = 40/210 (19%)
 Frame = -1

Query: 527 WLIKVGHDCGKYTRNALMNAYGKYGPVVGARQLFDEMSERSVADWNAVISGYWNWGVEGE 348
           WL V + T A++ Y K + AR+ FD M ERSV WNA++SGY G+ E
Sbjct: 184 WLFDVMPERNVITWTAMVTGYAKVKDLEAARRYFDCMPERSVVSWNAMLSGYAQNGLAEE 243

Query: 347 AKRLFDSMPDKNV----ITWTTMVTG---------------------------------- 282
           A RLFD M + + TW T+++
Sbjct: 244 ALRLFDEMVNAGIEPDETTWVTVISACSSRGDPCLAASLVRTLHQKRIQLNCFVRTALLD 303

Query: 281 -YSKMKDLESARSYFDRTPKKSVVSWNAMLSGYARNGFSEEALCLFREMVSV-GICPNET 108
           Y+K DL+SAR F+ P ++VV+WN+M++GYA+NG S A+ LF+EM++ + P+E
Sbjct: 304 MYAKFGDLDSARKLFNTMPGRNVVTWNSMIAGYAQNGQSAMAIELFKEMITAKKLTPDEV 363

Query: 107 TWVAVISACSSHGDPTLVDSVVMMIEDYKV 18
           T V+VISAC G L + VV + + ++
Sbjct: 364 TMVSVISACGHLGALELGNWVVRFLTENQI 393

 Score = 80.9 bits (198), Expect = 1e-13
 Identities = 55/182 (30%), Positives = 85/182 (46%), Gaps = 40/182 (21%)
 Frame = -1

Query: 494 YTRNALMNAYGKYGPVVGARQLFDEMSERSVADWNAVISGYWNWGVEGEAKRLFDSM--- 324
           + R AL++ Y K+G + AR+LF+ M R+V WN++I+GY G A LF M
Sbjct: 296 FVRTALLDMYAKFGDLDSARKLFNTMPGRNVVTWNSMIAGYAQNGQSAMAIELFKEMITA 355

Query: 323 ----PDK----NVIT------------W-----------------TTMVTGYSKMKDLES 255
           PD+ +VI+ W M+ YS+ +E
Sbjct: 356 KKLTPDEVTMVSVISACGHLGALELGNWVVRFLTENQIKLSISGHNAMIFMYSRCGSMED 415

Query: 254 ARSYFDRTPKKSVVSWNAMLSGYARNGFSEEALCLFREMVSVGICPNETTWVAVISACSS 75
           A+ F + VVS+N ++SG+A +G EA+ L M GI P+ T++ V++ACS
Sbjct: 416 AKRVFQEMATRDVVSYNTLISGFAAHGHGVEAINLMSTMKEGGIEPDRVTFIGVLTACSH 475

Query: 74  HG 69
           G
Sbjct: 476 AG 477

>emb|CAN80769.1| hypothetical protein VITISV_013866 [Vitis vinifera]
          Length = 761

 Score = 226 bits (577), Expect = 1e-57
 Identities = 107/183 (58%), Positives = 139/183 (75%), Gaps = 2/183 (1%)
 Frame = -1

Query: 560 SAGKNGFLFHSWLIKVGHDCGKYTRNALMNAYGKYGPVVGARQLFDEMS--ERSVADWNA 387
           SAG G FH+ ++K+GH + RNA+++ Y + GP+ AR++FDE+ ER VADWNA
Sbjct: 109 SAGNGGIGFHAHVLKLGHGSDAFVRNAVIDMYARLGPIGHARKVFDEIPDYERKVADWNA 168

Query: 386 VISGYWNWGVEGEAKRLFDSMPDKNVITWTTMVTGYSKMKDLESARSYFDRTPKKSVVSW 207
           ++SGYW W EG+A+ LFD MP++NVITWT MVTGY+K+KDLE+AR YFD P++SVVSW
Sbjct: 169 MVSGYWKWESEGQAQWLFDVMPERNVITWTAMVTGYAKVKDLEAARRYFDCMPERSVVSW 228

Query: 206 NAMLSGYARNGFSEEALCLFREMVSVGICPNETTWVAVISACSSHGDPTLVDSVVMMIED 27
           NAMLSGYA+NG +EE L LF EMV+ GI P+ETTWV VISACSS GDP L S+V +
Sbjct: 229 NAMLSGYAQNGLAEEVLRLFDEMVNAGIEPDETTWVTVISACSSRGDPCLAASLVRTLHQ 288

Query: 26  YKV 18
           ++
Sbjct: 289 KQI 291

 Score = 93.2 bits (230), Expect = 3e-17
 Identities = 48/126 (38%), Positives = 82/126 (65%), Gaps = 2/126 (1%)
 Frame = -1

Query: 389 AVISGYWNWGVEGEAKRLFDSMPD-KNVITWTTMVTGYSKMKDLESARSYFDRTPKKSVV 213
           A++ Y G G A+R+FD + +N +TW M++ Y+++ +L+SAR F+ P ++VV
Sbjct: 300 ALLDMYAKCGSIGAARRIFDELGAYRNSVTWNAMISAYTRVGNLDSARELFNTMPGRNVV 359

Query: 212 SWNAMLSGYARNGFSEEALCLFREMVSV-GICPNETTWVAVISACSSHGDPTLVDSVVMM 36
           +WN+M++GYA+NG S A+ LF+EM++ + P+E T V+VISAC G L + VV
Sbjct: 360 TWNSMIAGYAQNGQSAMAIELFKEMITAKKLTPDEVTMVSVISACGHLGALELGNWVVRF 419

Query: 35  IEDYKV 18
           + + ++
Sbjct: 420 LTENQI 425

 Score = 78.2 bits (191), Expect = 8e-13
 Identities = 55/181 (30%), Positives = 85/181 (46%), Gaps = 40/181 (22%)
 Frame = -1

Query: 491 TRNALMNAYGKYGPVVGARQLFDEMSERSVADWNAVISGYWNWGVEGEAKRLFDSM---- 324
           T NA+++AY + G + AR+LF+ M R+V WN++I+GY G A LF M
Sbjct: 329 TWNAMISAYTRVGNLDSARELFNTMPGRNVVTWNSMIAGYAQNGQSAMAIELFKEMITAK 388

Query: 323 ---PDK----NVIT------------W-----------------TTMVTGYSKMKDLESA 252
           PD+ +VI+ W M+ YS+ +E A
Sbjct: 389 KLTPDEVTMVSVISACGHLGALELGNWVVRFLTENQIKLSISGHNAMIFMYSRCGSMEDA 448

Query: 251 RSYFDRTPKKSVVSWNAMLSGYARNGFSEEALCLFREMVSVGICPNETTWVAVISACSSH 72
           + F + VVS+N ++SG+A +G EA+ L M GI P+ T++ V++ACS
Sbjct: 449 KRVFQEMATRDVVSYNTLISGFAAHGHGVEAINLMSTMKEGGIEPDRVTFIGVLTACSHA 508

Query: 71  G 69
           G
Sbjct: 509 G 509

>ref|XP_002519368.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis]
 gi|223541435|gb|EEF42985.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis]
          Length = 524

 Score = 221 bits (564), Expect = 5e-56
 Identities = 100/181 (55%), Positives = 136/181 (75%)
 Frame = -1

Query: 560 SAGKNGFLFHSWLIKVGHDCGKYTRNALMNAYGKYGPVVGARQLFDEMSERSVADWNAVI 381
           S+GK+G LFH+ ++K+GH Y RN +++ Y K+ + AR+LFDEM+ERS+ADWN++I
Sbjct: 38  SSGKDGILFHAHILKLGHQSDPYIRNVILDMYAKHSLIENARKLFDEMTERSLADWNSMI 97

Query: 380 SGYWNWGVEGEAKRLFDSMPDKNVITWTTMVTGYSKMKDLESARSYFDRTPKKSVVSWNA 201
           GYW G E EA LF P++NVITWT MVTG+SK+K+L+SAR YFD P K++VSWNA
Sbjct: 98  CGYWKCGNETEACSLFSMTPERNVITWTAMVTGFSKIKELDSARKYFDDMPVKNIVSWNA 157

Query: 200 MLSGYARNGFSEEALCLFREMVSVGICPNETTWVAVISACSSHGDPTLVDSVVMMIEDYK 21
           ++SGYA+NGF EEAL LF M+ +G+ PNETTW VIS+CSS GDP +S V +++ K
Sbjct: 158 IISGYAQNGFVEEALKLFNHMIRLGVQPNETTWATVISSCSSCGDPCRAESFVKLLDKRK 217

Query: 20  V 18
           +
Sbjct: 218 I 218

 Score = 87.8 bits (216), Expect = 1e-15
 Identities = 45/112 (40%), Positives = 70/112 (62%), Gaps = 2/112 (1%)
 Frame = -1

Query: 347 AKRLFDSMP-DKNVITWTTMVTGYSKMKDLESARSYFDRTPKKSVVSWNAMLSGYARNGF 171
           A+ +F+ + +N TW M++ Y+++ DL SAR FD+ P++ VSWN M+SGYA+NG
Sbjct: 241 ARGIFNELGVSRNSSTWNAMISAYTRVGDLLSARDLFDKMPERDAVSWNTMISGYAQNGQ 300

Query: 170 SEEALCLFREMVSV-GICPNETTWVAVISACSSHGDPTLVDSVVMMIEDYKV 18
           S A+ LF+EM+ P+E T V++ISAC G L +V I +Y++
Sbjct: 301 SAMAIELFKEMIDAKDSQPDEVTMVSIISACGHLGALELGTWIVNFISEYRI 352

 Score = 85.9 bits (211), Expect = 4e-15
 Identities = 54/181 (29%), Positives = 84/181 (46%), Gaps = 40/181 (22%)
 Frame = -1

Query: 491 TRNALMNAYGKYGPVVGARQLFDEMSERSVADWNAVISGYWNWGVEGEAKRLF------- 333
           T NA+++AY + G ++ AR LFD+M ER WN +ISGY G A LF
Sbjct: 256 TWNAMISAYTRVGDLLSARDLFDKMPERDAVSWNTMISGYAQNGQSAMAIELFKEMIDAK 315

Query: 332 DSMPDKNVI----------------TW-----------------TTMVTGYSKMKDLESA 252
           DS PD+ + TW ++ YSK +++ A
Sbjct: 316 DSQPDEVTMVSIISACGHLGALELGTWIVNFISEYRIELTISGYNALIFMYSKCGNMKEA 375

Query: 251 RSYFDRTPKKSVVSWNAMLSGYARNGFSEEALCLFREMVSVGICPNETTWVAVISACSSH 72
           + F + VVS+N+++ G+A +G EA+ L M G+ P+ T++ V++ACS
Sbjct: 376 QRIFQEMETRDVVSYNSLIGGFAAHGEGNEAIKLLLSMKEEGVDPDHVTYIGVLTACSHA 435

Query: 71  G 69
           G
Sbjct: 436 G 436

>gb|AAF43948.1|AC012188_25 Contains similarity to a hypothetical protein from Arabidopsis thaliana gb|AC004044.1 and contains two domains PF|01535 of unknown function [Arabidopsis thaliana]
          Length = 455

 Score = 218 bits (555), Expect = 5e-55
 Identities = 100/181 (55%), Positives = 139/181 (76%)
 Frame = -1

Query: 560 SAGKNGFLFHSWLIKVGHDCGKYTRNALMNAYGKYGPVVGARQLFDEMSERSVADWNAVI 381
           SAG+ G LF + + K+G Y RN +M+ Y K+ V AR++FD++S+R +DWN +I
Sbjct: 30  SAGRFGILFQALVEKLGFFKDPYVRNVIMDMYVKHESVESARKVFDQISQRKGSDWNVMI 89

Query: 380 SGYWNWGVEGEAKRLFDSMPDKNVITWTTMVTGYSKMKDLESARSYFDRTPKKSVVSWNA 201
           SGYW WG + EA +LFD MP+ +V++WT M+TG++K+KDLE+AR YFDR P+KSVVSWNA
Sbjct: 90  SGYWKWGNKEEACKLFDMMPENDVVSWTVMITGFAKVKDLENARKYFDRMPEKSVVSWNA 149

Query: 200 MLSGYARNGFSEEALCLFREMVSVGICPNETTWVAVISACSSHGDPTLVDSVVMMIEDYK 21
           MLSGYA+NGF+E+AL LF +M+ +G+ PNETTWV VISACS DP+L S+V +I++ +
Sbjct: 150 MLSGYAQNGFTEDALRLFNDMLRLGVRPNETTWVIVISACSFRADPSLTRSLVKLIDEKR 209

Query: 20  V 18
           V
Sbjct: 210 V 210

 Score = 92.4 bits (228), Expect = 4e-17
 Identities = 43/112 (38%), Positives = 73/112 (65%), Gaps = 2/112 (1%)
 Frame = -1

Query: 347 AKRLFDSM-PDKNVITWTTMVTGYSKMKDLESARSYFDRTPKKSVVSWNAMLSGYARNGF 171
           A+R+F+ + +N++TW M++GY+++ D+ SAR FD PK++VVSWN++++GYA NG
Sbjct: 233 ARRIFNELGTQRNLVTWNAMISGYTRIGDMSSARQLFDTMPKRNVVSWNSLIAGYAHNGQ 292

Query: 170 SEEALCLFREMVSVGIC-PNETTWVAVISACSSHGDPTLVDSVVMMIEDYKV 18
           + A+ F +M+ G P+E T ++V+SAC D L D +V I ++
Sbjct: 293 AALAIEFFEDMIDYGDSKPDEVTMISVLSACGHMADLELGDCIVDYIRKNQI 344

 Score = 77.8 bits (190), Expect = 1e-12
 Identities = 51/181 (28%), Positives = 83/181 (45%), Gaps = 40/181 (22%)
 Frame = -1

Query: 491 TRNALMNAYGKYGPVVGARQLFDEMSERSVADWNAVISGYWNWGVEGEAKRLFDSMPD-- 318
           T NA+++ Y + G + ARQLFD M +R+V WN++I+GY + G A F+ M D
Sbjct: 248 TWNAMISGYTRIGDMSSARQLFDTMPKRNVVSWNSLIAGYAHNGQAALAIEFFEDMIDYG 307

Query: 317 ---KNVITWTTMVTGYSKMKDLE-----------------------------------SA 252
           + +T ++++ M DLE A
Sbjct: 308 DSKPDEVTMISVLSACGHMADLELGDCIVDYIRKNQIKLNDSGYRSLIFMYARGGNLWEA 367

Query: 251 RSYFDRTPKKSVVSWNAMLSGYARNGFSEEALCLFREMVSVGICPNETTWVAVISACSSH 72
           + FD ++ VVS+N + + +A NG E L L +M GI P+ T+ +V++AC+
Sbjct: 368 KRVFDEMKERDVVSYNTLFTAFAANGDGVETLNLLSKMKDEGIEPDRVTYTSVLTACNRA 427

Query: 71  G 69
           G
Sbjct: 428 G 428

>ref|NP_172899.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
 gi|193806395|sp|Q9M9R6.2|PPR43_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At1g14470
 gi|332191047|gb|AEE29168.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
          Length = 540

 Score = 218 bits (555), Expect = 5e-55
 Identities = 100/181 (55%), Positives = 139/181 (76%)
 Frame = -1

Query: 560 SAGKNGFLFHSWLIKVGHDCGKYTRNALMNAYGKYGPVVGARQLFDEMSERSVADWNAVI 381
           SAG+ G LF + + K+G Y RN +M+ Y K+ V AR++FD++S+R +DWN +I
Sbjct: 115 SAGRFGILFQALVEKLGFFKDPYVRNVIMDMYVKHESVESARKVFDQISQRKGSDWNVMI 174

Query: 380 SGYWNWGVEGEAKRLFDSMPDKNVITWTTMVTGYSKMKDLESARSYFDRTPKKSVVSWNA 201
           SGYW WG + EA +LFD MP+ +V++WT M+TG++K+KDLE+AR YFDR P+KSVVSWNA
Sbjct: 175 SGYWKWGNKEEACKLFDMMPENDVVSWTVMITGFAKVKDLENARKYFDRMPEKSVVSWNA 234

Query: 200 MLSGYARNGFSEEALCLFREMVSVGICPNETTWVAVISACSSHGDPTLVDSVVMMIEDYK 21
           MLSGYA+NGF+E+AL LF +M+ +G+ PNETTWV VISACS DP+L S+V +I++ +
Sbjct: 235 MLSGYAQNGFTEDALRLFNDMLRLGVRPNETTWVIVISACSFRADPSLTRSLVKLIDEKR 294

Query: 20  V 18
           V
Sbjct: 295 V 295

 Score = 92.4 bits (228), Expect = 4e-17
 Identities = 43/112 (38%), Positives = 73/112 (65%), Gaps = 2/112 (1%)
 Frame = -1

Query: 347 AKRLFDSM-PDKNVITWTTMVTGYSKMKDLESARSYFDRTPKKSVVSWNAMLSGYARNGF 171
           A+R+F+ + +N++TW M++GY+++ D+ SAR FD PK++VVSWN++++GYA NG
Sbjct: 318 ARRIFNELGTQRNLVTWNAMISGYTRIGDMSSARQLFDTMPKRNVVSWNSLIAGYAHNGQ 377

Query: 170 SEEALCLFREMVSVGIC-PNETTWVAVISACSSHGDPTLVDSVVMMIEDYKV 18
           + A+ F +M+ G P+E T ++V+SAC D L D +V I ++
Sbjct: 378 AALAIEFFEDMIDYGDSKPDEVTMISVLSACGHMADLELGDCIVDYIRKNQI 429

 Score = 77.8 bits (190), Expect = 1e-12
 Identities = 51/181 (28%), Positives = 83/181 (45%), Gaps = 40/181 (22%)
 Frame = -1

Query: 491 TRNALMNAYGKYGPVVGARQLFDEMSERSVADWNAVISGYWNWGVEGEAKRLFDSMPD-- 318
           T NA+++ Y + G + ARQLFD M +R+V WN++I+GY + G A F+ M D
Sbjct: 333 TWNAMISGYTRIGDMSSARQLFDTMPKRNVVSWNSLIAGYAHNGQAALAIEFFEDMIDYG 392

Query: 317 ---KNVITWTTMVTGYSKMKDLE-----------------------------------SA 252
           + +T ++++ M DLE A
Sbjct: 393 DSKPDEVTMISVLSACGHMADLELGDCIVDYIRKNQIKLNDSGYRSLIFMYARGGNLWEA 452

Query: 251 RSYFDRTPKKSVVSWNAMLSGYARNGFSEEALCLFREMVSVGICPNETTWVAVISACSSH 72
           + FD ++ VVS+N + + +A NG E L L +M GI P+ T+ +V++AC+
Sbjct: 453 KRVFDEMKERDVVSYNTLFTAFAANGDGVETLNLLSKMKDEGIEPDRVTYTSVLTACNRA 512

Query: 71  G 69
           G
Sbjct: 513 G 513