BLASTX nr result
ID: Salvia21_contig00026837
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Salvia21_contig00026837 (1446 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002306075.1| predicted protein [Populus trichocarpa] gi|2... 452 e-124 ref|XP_002519113.1| pentatricopeptide repeat-containing protein,... 450 e-124 ref|XP_002281132.1| PREDICTED: pentatricopeptide repeat-containi... 446 e-123 ref|XP_003537906.1| PREDICTED: pentatricopeptide repeat-containi... 434 e-119 ref|NP_179197.1| pentatricopeptide repeat-containing protein [Ar... 399 e-108 >ref|XP_002306075.1| predicted protein [Populus trichocarpa] gi|222849039|gb|EEE86586.1| predicted protein [Populus trichocarpa] Length = 498 Score = 452 bits (1162), Expect = e-124 Identities = 236/465 (50%), Positives = 317/465 (68%), Gaps = 5/465 (1%) Frame = +1 Query: 58 THSLR-SFSSSP---HPADLVSAAASILKHHRSKSRWSHLRSLLATTKDNRLTPSQFSQV 225 T+SL S ++SP H + L +A S+L HHRSKSRWSHLRSLL TT L P FS + Sbjct: 20 TYSLPFSTTTSPPPNHHSPLTTAIISLLTHHRSKSRWSHLRSLLTTTTSTPLAPGHFSLI 79 Query: 226 ALQLRNNPHLVLRFFHFTLQHS-LTSHSPSSYATAIHILSRSRVKLHALNLIKSAIVAFS 402 L+L++NPHL L FFHFTL +S L SH+ SYAT IHILSR+R+K HA +I++ + + Sbjct: 80 TLKLKSNPHLALSFFHFTLHNSSLCSHNLRSYATIIHILSRARLKAHAQEIIRAGLRSQI 139 Query: 403 DAQPGSAIVLLEALVKAYRACDSAPFVFDLLVKACLESKKIDPALQIYSILKSKNVLLKT 582 + E LVK+YR CDSAPFVFDLL+K+CLE KKID +++I +L+SK + Sbjct: 140 LYHLLKEVRFFEVLVKSYRECDSAPFVFDLLIKSCLELKKIDGSIEIVKMLRSKGISPSI 199 Query: 583 STLNSLIELVCKTRSCFAGYDLYKEIFHNDVDNEDKSRPRGKENFPISNTLNVVMVGFYR 762 ST N+LI V + + F GY ++KE+F + + RG P ++ N +MVGFYR Sbjct: 200 STCNALISEVSRCKGSFVGYGVFKEVFGLESCELGEKMRRGFRVRPNVHSFNELMVGFYR 259 Query: 763 EGMVDKVEEVWQEHVRVGCVPNMYSFNVLMATYCDHGRMEDAIRVWEEMEDKGLKRDAVA 942 G V+ VEE+W E R GCV N +S+ VL+A +C+ GR+ +A R+W+EM KG+ D VA Sbjct: 260 NGEVEMVEEIWSEMERFGCVANGFSYGVLIAVFCEGGRLSEAERLWDEMRVKGIMPDVVA 319 Query: 943 YNTIIGGFCRAGDVERAEEIYREMVMQGVESTCVTFEYLINGYCEIGDVDSVMMLYKNMC 1122 YNTIIGGFC+AG+VE+AE ++REM + G+ES+CVTFE+LI GYC IGDV+S +++YK+M Sbjct: 320 YNTIIGGFCKAGEVEKAEGLFREMGLSGIESSCVTFEHLIEGYCRIGDVNSAILVYKDMR 379 Query: 1123 RKKFSPSSSTVNVIIRLLCGKNEISAASDFWWRAAKKHEAALERENYENLVKGLCGEGKM 1302 R+ F + T+ V+I LC + + A A + ++YE L+ GLC +GKM Sbjct: 380 RRDFRLEALTMEVLIGGLCEQKRVFEALKIMRSAMRDVSFHPNGKSYELLINGLCEDGKM 439 Query: 1303 EEALKLQAEMVVKGFEPNVGIYGAFIDGYDKLGNEAMASKLRKEM 1437 EEALKLQ+EMV KGF+PN IYGAFI+GY KLGNE MA+ LRKEM Sbjct: 440 EEALKLQSEMVGKGFDPNSAIYGAFIEGYVKLGNEEMAAMLRKEM 484 >ref|XP_002519113.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223541776|gb|EEF43324.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 486 Score = 450 bits (1158), Expect = e-124 Identities = 235/458 (51%), Positives = 317/458 (69%), Gaps = 5/458 (1%) Frame = +1 Query: 79 SSSPHPAD--LVSAAASILKHHRSKSRWSHLRSLLATTKDNRLTPSQFSQVALQLRNNPH 252 ++SP +D L++ S+L HHRSKSRW+HLRSL+ T+ + LTP+ FSQ+ L L++NP Sbjct: 24 TASPPSSDQQLITTITSLLIHHRSKSRWTHLRSLILTS-NKTLTPTHFSQIILLLKSNPR 82 Query: 253 LVLRFFHFTLQH-SLTSHSPSSYATAIHILSRSRVKLHALNLIKSAIVA--FSDAQPGSA 423 L LRFFHFTL++ S SH S +T HILSR+R+K A ++I A + D G A Sbjct: 83 LALRFFHFTLRNPSFCSHDLRSISTITHILSRARLKPQAQSIIHLAFTSPVLVDDSNGQA 142 Query: 424 IVLLEALVKAYRACDSAPFVFDLLVKACLESKKIDPALQIYSILKSKNVLLKTSTLNSLI 603 + E LVK YR CDSAPFVFDLL+K+CLE KKID L+I +L+S+ + ST N L+ Sbjct: 143 LKFFEILVKTYRECDSAPFVFDLLIKSCLELKKIDDGLKIVRLLRSRGISPLISTCNFLV 202 Query: 604 ELVCKTRSCFAGYDLYKEIFHNDVDNEDKSRPRGKENFPISNTLNVVMVGFYREGMVDKV 783 V K + C+AGY +++E+F + DNE K + + N +T N +M+GFYR+G ++ V Sbjct: 203 SWVSKCKGCYAGYGVFREVFEVN-DNEGKRVIKVRPNV---HTFNELMMGFYRDGELEMV 258 Query: 784 EEVWQEHVRVGCVPNMYSFNVLMATYCDHGRMEDAIRVWEEMEDKGLKRDAVAYNTIIGG 963 EEVW E R CVPN +S++VLM + D GR ++ ++WEEM KG+K D VAYNT+IGG Sbjct: 259 EEVWSEMERFECVPNGFSYSVLMTVFLDVGRTKEIEKLWEEMRAKGIKGDVVAYNTVIGG 318 Query: 964 FCRAGDVERAEEIYREMVMQGVESTCVTFEYLINGYCEIGDVDSVMMLYKNMCRKKFSPS 1143 FC+ G++E+AEE+ REM + GVE+ CVTFE+LINGYC +GDVDS ++++K+M RK F Sbjct: 319 FCKIGEIEKAEELSREMELNGVEANCVTFEHLINGYCSVGDVDSAILVFKHMVRKGFRAE 378 Query: 1144 SSTVNVIIRLLCGKNEISAASDFWWRAAKKHEAALERENYENLVKGLCGEGKMEEALKLQ 1323 S ++V+I LC K +S A + A + L ++YE L+KGLC +GKM+EALKLQ Sbjct: 379 GSVMDVLIGGLCEKRRVSEALEIMRIAMRNDGFRLSGKSYELLIKGLCKDGKMDEALKLQ 438 Query: 1324 AEMVVKGFEPNVGIYGAFIDGYDKLGNEAMASKLRKEM 1437 AEMV GFEPN IYGAFIDGY KLGNE MA+ LRKEM Sbjct: 439 AEMVGGGFEPNFEIYGAFIDGYMKLGNEEMAAMLRKEM 476 >ref|XP_002281132.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15980-like [Vitis vinifera] Length = 492 Score = 446 bits (1148), Expect = e-123 Identities = 235/467 (50%), Positives = 316/467 (67%), Gaps = 7/467 (1%) Frame = +1 Query: 64 SLRSFSSSPHPAD-LVSAAASILKHHRSKSRWSHLRSLLATTKDNRLTPSQFSQVALQLR 240 SL S S +P L+S A SIL+H RSKSRWSHL+SL TP++ SQ+ LQ++ Sbjct: 22 SLSSLPSDQNPTKTLISTAVSILRHQRSKSRWSHLQSLFP----KGFTPTEASQIVLQIK 77 Query: 241 NNPHLVLRFFHFTLQHSLTSHSPSSYATAIHILSRSRVKLHALNLIKSAIVAFSDAQPGS 420 NNPHL L FF + SL +H+ SY+T IHIL+R+R+K AL LI++AI F D+ S Sbjct: 78 NNPHLALSFFLWCHHKSLCNHTLLSYSTIIHILARARLKSQALGLIRTAIRVFDDSDECS 137 Query: 421 AIV--LLEALVKAYRACDSAPFVFDLLVKACLESKKIDPALQIYSILKSKNVLLKTSTLN 594 + + E+LVK Y +C SAPFVFDLL+KACL SK+I+ ++ I +L+S+ + ST N Sbjct: 138 SQPPKIFESLVKTYNSCGSAPFVFDLLIKACLNSKRIEQSISIVKMLRSRGISPTISTCN 197 Query: 595 SLIELVCKTRSCFAGYDLYKEIFHN-DVDNEDKSRPRGKENF---PISNTLNVVMVGFYR 762 +LI V + R C AGY++Y+E+F + D + +K R R + P +T N +MV FYR Sbjct: 198 ALIWQVSRGRGCDAGYEIYREVFGSWDDEINEKVRVRVRVRVRVCPNVHTFNALMVCFYR 257 Query: 763 EGMVDKVEEVWQEHVRVGCVPNMYSFNVLMATYCDHGRMEDAIRVWEEMEDKGLKRDAVA 942 +G V+KVEE+W E C PN YS++VLMA +CD GRM + ++WEEM K ++ D +A Sbjct: 258 DGGVEKVEEIWAEMGEWDCNPNAYSYSVLMAAFCDEGRMGEVEKLWEEMRMKKMEHDIMA 317 Query: 943 YNTIIGGFCRAGDVERAEEIYREMVMQGVESTCVTFEYLINGYCEIGDVDSVMMLYKNMC 1122 YNTIIGGFCR G++ER EE++REM + G++STCVT+E+LINGYCEIGDVDS ++LYK+MC Sbjct: 318 YNTIIGGFCRIGEIERGEELFREMELSGIQSTCVTYEHLINGYCEIGDVDSAVLLYKDMC 377 Query: 1123 RKKFSPSSSTVNVIIRLLCGKNEISAASDFWWRAAKKHEAALERENYENLVKGLCGEGKM 1302 RK F + TV+ +I LLC + A A E A ++YE L+KG C EGKM Sbjct: 378 RKGFRAEARTVDGMILLLCNNRRVHEALKLLRVAMGNVEFAPRGKSYETLIKGFCEEGKM 437 Query: 1303 EEALKLQAEMVVKGFEPNVGIYGAFIDGYDKLGNEAMASKLRKEMLD 1443 EEA KLQ+EMV KGF+P + IY AFIDGY K GN+ +A LRKEM + Sbjct: 438 EEASKLQSEMVGKGFKPTLEIYSAFIDGYMKQGNKEIAETLRKEMFE 484 >ref|XP_003537906.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15980-like [Glycine max] Length = 487 Score = 434 bits (1116), Expect = e-119 Identities = 229/464 (49%), Positives = 317/464 (68%), Gaps = 8/464 (1%) Frame = +1 Query: 73 SFSSSPHPAD-LVSAAASILKHHRSKSRWSHLRSLLATTKDNRLTPSQFSQVALQLRNNP 249 SFS S + LV+ A SIL HHRSKSRWS+LRS N +TP++FS++ L ++N P Sbjct: 23 SFSCSNDASQSLVTDAVSILTHHRSKSRWSNLRSACP----NGITPAEFSEITLHIKNKP 78 Query: 250 HLVLRFFHFTLQHSLTSHSPSSYATAIHILSRSRVKLHALNLIKSAIVAFSDAQPGSA-- 423 L LRFF +T SL +H+ +SY++ IH+L+R+R+ HA +LI++AI A + Sbjct: 79 QLALRFFLWTKSKSLCNHNLASYSSIIHLLARARLSSHAYDLIRTAIRASHQNDEENCRF 138 Query: 424 ----IVLLEALVKAYRACDSAPFVFDLLVKACLESKKIDPALQIYSILKSKNVLLKTSTL 591 + L E LVK YR SAPFVFDLL+KACL+SKK+DP+++I +L S+ + K STL Sbjct: 139 NSRPLNLFETLVKTYRDSGSAPFVFDLLIKACLDSKKLDPSIEIVRMLLSRGISPKVSTL 198 Query: 592 NSLIELVCKTRSCFAGYDLYKEIFHNDVDNEDKS-RPRGKENFPISNTLNVVMVGFYREG 768 NSLI VCK+R GY +Y+E F D +N + S R G P +T N +M+ Y++G Sbjct: 199 NSLISRVCKSRGVDEGYAIYREFFRLDEENNEISKRGSGFRVTPNVHTYNDLMLCCYQDG 258 Query: 769 MVDKVEEVWQEHVRVGCVPNMYSFNVLMATYCDHGRMEDAIRVWEEMEDKGLKRDAVAYN 948 +V++VE++W E ++ PN YS++VLMAT+CD GRM DA ++WEE+ + ++ D V+YN Sbjct: 259 LVERVEKIWIE-MKCNYKPNAYSYSVLMATFCDEGRMGDAEKLWEELRSEKIEPDVVSYN 317 Query: 949 TIIGGFCRAGDVERAEEIYREMVMQGVESTCVTFEYLINGYCEIGDVDSVMMLYKNMCRK 1128 TIIGGFC GDV RAEE +REM + GV +T T+E+L+ GYC IGDVDS +++YK+M R Sbjct: 318 TIIGGFCTIGDVGRAEEFFREMAVAGVGTTASTYEHLVKGYCNIGDVDSAVLVYKDMARS 377 Query: 1129 KFSPSSSTVNVIIRLLCGKNEISAASDFWWRAAKKHEAALERENYENLVKGLCGEGKMEE 1308 P +ST++V+IRLLC K + + +F A K + ++YE L+KGLC +G+MEE Sbjct: 378 DLRPDASTLDVMIRLLCDKGRVRESLEFVRCAVGKFDLIPMEKSYEALIKGLCFDGRMEE 437 Query: 1309 ALKLQAEMVVKGFEPNVGIYGAFIDGYDKLGNEAMASKLRKEML 1440 ALK+QAEMV KGF+PN IYGAF+DGY + GNE MA LRKEML Sbjct: 438 ALKVQAEMVGKGFQPNSEIYGAFVDGYVRHGNEEMAEALRKEML 481 >ref|NP_179197.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75267579|sp|Q9XIM8.1|PP155_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At2g15980 gi|5306237|gb|AAD41970.1| hypothetical protein [Arabidopsis thaliana] gi|330251359|gb|AEC06453.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 498 Score = 399 bits (1024), Expect = e-108 Identities = 214/466 (45%), Positives = 303/466 (65%), Gaps = 7/466 (1%) Frame = +1 Query: 67 LRSFSSSPHPAD--LVSAAASILKHHRSKSRWSHLRSLLATTKDNRLTPSQFSQVALQLR 240 L + SS P P L+S A SIL HHRSKSRWS LRSL + + TPSQFS++ L LR Sbjct: 27 LTTVSSPPSPPSDPLISDAVSILTHHRSKSRWSTLRSL----QPSGFTPSQFSEITLCLR 82 Query: 241 NNPHLVLRFFHFTLQHSLTSHSPSSYATAIHILSRSRVKLHALNLIKSAI-VAFSDAQPG 417 NNPHL LRFF FT ++SL SH S +T IHILSRSR+K HA +I+ A+ +A +D Sbjct: 83 NNPHLSLRFFLFTRRYSLCSHDTHSCSTLIHILSRSRLKSHASEIIRLALRLAATDEDED 142 Query: 418 SAIVLLEALVKAYRACDSAPFVFDLLVKACLESKKIDPALQIYSILKSKNVLLKTSTLNS 597 + + +L+K+Y C SAPFVFDLL+K+CL+SK+ID A+ + L+S+ + + ST N+ Sbjct: 143 RVLKVFRSLIKSYNRCGSAPFVFDLLIKSCLDSKEIDGAVMVMRKLRSRGINAQISTCNA 202 Query: 598 LIELVCKTRSCFAGYDLYKEIFHNDVDNEDKSRPRGKENFPISNTLNVVMVGFYREGMVD 777 LI V + R GY +Y+E+F D + D+++ + P + T N +MV FYREG + Sbjct: 203 LITEVSRRRGASNGYKMYREVFGLDDVSVDEAKKMIGKIKPNATTFNSMMVSFYREGETE 262 Query: 778 KVEEVWQE-HVRVGCVPNMYSFNVLMATYCDHGRMEDAIRVWEEMEDKGLKRDAVAYNTI 954 VE +W+E VGC PN+YS+NVLM YC G M +A +VWEEM+ +G+ D VAYNT+ Sbjct: 263 MVERIWREMEEEVGCSPNVYSYNVLMEAYCARGLMSEAEKVWEEMKVRGVVYDIVAYNTM 322 Query: 955 IGGFCRAGDVERAEEIYREMVMQGVESTCVTFEYLINGYCEIGDVDSVMMLYKNMCRKKF 1134 IGG C +V +A+E++R+M ++G+E TC+T+E+L+NGYC+ GDVDS +++Y+ M RK F Sbjct: 323 IGGLCSNFEVVKAKELFRDMGLKGIECTCLTYEHLVNGYCKAGDVDSGLVVYREMKRKGF 382 Query: 1135 SPSSSTVNVIIRLLCGKNE---ISAASDFWWRAAKKHEAALERENYENLVKGLCGEGKME 1305 T+ ++ LC + + A+D A ++ R YE LVK LC +GKM+ Sbjct: 383 EADGLTIEALVEGLCDDRDGQRVVEAADIVKDAVREAMFYPSRNCYELLVKRLCEDGKMD 442 Query: 1306 EALKLQAEMVVKGFEPNVGIYGAFIDGYDKLGNEAMASKLRKEMLD 1443 AL +QAEMV KGF+P+ Y AFIDGY +G+E ++ L EM + Sbjct: 443 RALNIQAEMVGKGFKPSQETYRAFIDGYGIVGDEETSALLAIEMAE 488 Score = 60.5 bits (145), Expect = 1e-06 Identities = 50/242 (20%), Positives = 104/242 (42%), Gaps = 18/242 (7%) Frame = +1 Query: 775 DKVEEVWQEHV----RVGCVPNMYSFNVLMATYCDHGRMEDAIRVWEEMEDKGLKRDAVA 942 D+V +V++ + R G P + F++L+ + D ++ A+ V ++ +G+ Sbjct: 142 DRVLKVFRSLIKSYNRCGSAP--FVFDLLIKSCLDSKEIDGAVMVMRKLRSRGINAQIST 199 Query: 943 YNTIIGGFCRAGDVERAEEIYREM-------------VMQGVESTCVTFEYLINGYCEIG 1083 N +I R ++YRE+ ++ ++ TF ++ + G Sbjct: 200 CNALITEVSRRRGASNGYKMYREVFGLDDVSVDEAKKMIGKIKPNATTFNSMMVSFYREG 259 Query: 1084 DVDSVMMLYKNMCRKK-FSPSSSTVNVIIRLLCGKNEISAASDFWWRAAKKHEAALEREN 1260 + + V +++ M + SP+ + NV++ C + +S A W K + Sbjct: 260 ETEMVERIWREMEEEVGCSPNVYSYNVLMEAYCARGLMSEAEKVW-EEMKVRGVVYDIVA 318 Query: 1261 YENLVKGLCGEGKMEEALKLQAEMVVKGFEPNVGIYGAFIDGYDKLGNEAMASKLRKEML 1440 Y ++ GLC ++ +A +L +M +KG E Y ++GY K G+ + +EM Sbjct: 319 YNTMIGGLCSNFEVVKAKELFRDMGLKGIECTCLTYEHLVNGYCKAGDVDSGLVVYREMK 378 Query: 1441 DK 1446 K Sbjct: 379 RK 380