BLASTX nr result
ID: Scutellaria24_contig00001021
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Scutellaria24_contig00001021 (2776 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002319806.1| predicted protein [Populus trichocarpa] gi|2... 642 0.0 emb|CBI20513.3| unnamed protein product [Vitis vinifera] 629 e-177 ref|XP_002279701.2| PREDICTED: pentatricopeptide repeat-containi... 628 e-177 ref|XP_002522838.1| pentatricopeptide repeat-containing protein,... 594 e-167 ref|NP_192388.3| pentatricopeptide repeat-containing protein [Ar... 512 e-142 >ref|XP_002319806.1| predicted protein [Populus trichocarpa] gi|222858182|gb|EEE95729.1| predicted protein [Populus trichocarpa] Length = 784 Score = 642 bits (1655), Expect = 0.0 Identities = 327/577 (56%), Positives = 422/577 (73%) Frame = -2 Query: 2769 PDLLPRLAYYEMLLWISINNEEKIQELCLSVATVDAKEKLYLRESYLMALCESNYKEEFL 2590 P RL YY+MLL+I +N+EEKIQELC + D + LRE+YL+ALCES+ K L Sbjct: 188 PSKNARLGYYDMLLYIGVNDEEKIQELCNYICIDDGDNNISLRENYLLALCESDQKNYLL 247 Query: 2589 LLFKTLDISKVTSVAYLESIFKALGKFLLETYAERFLSVLKGRGDIGAENISNFILQHTI 2410 L +T+DI+K +S+ +L SIFK+LG+ LE++A++FL VLK D GAE+IS I + Sbjct: 248 QLLETMDITKFSSLDHLASIFKSLGRLSLESFAKKFLLVLKSC-DYGAEDISTLIFSYAT 306 Query: 2409 NLPNLAVEDIIKKFQKLLAKFEVLPTSAQYEKLIEYCCGFFKVHEALDVVDEAFGSGIAL 2230 ++PNL VED++ KF+ L ++ P+S YEKL+ Y C KVH ALD+VD+ G+ + Sbjct: 307 SIPNLVVEDVVSKFKTLHMIMKMSPSSTSYEKLVVYNCNLLKVHLALDIVDQMCKEGLTI 366 Query: 2229 SLEAFHSILDACDRSCEFNLVHQINLRISHQNLKPNNETFRRMILLCVKMKDFEGAYGLI 2050 S+ HSIL+A + S +FNLV +I I H +L PNNETFR MI L VKMKDFEGAYGL+ Sbjct: 367 SINTIHSILNASEESFDFNLVRRIYSLIYHLDLTPNNETFRSMISLSVKMKDFEGAYGLL 426 Query: 2049 GDLQKMNLTPTASMYNSILAGYFREKNIKSARNVLKQMEDANVKPDASTYSYLIANSRSE 1870 DL+K+NL PTASMYN+I+ GYFREKNI+ A VLKQM+ A+VKPD+S+YSYLI+N +E Sbjct: 427 DDLKKLNLAPTASMYNAIMGGYFREKNIRGALMVLKQMKLADVKPDSSSYSYLISNCNNE 486 Query: 1869 KDLTKFYEDLIESGIPPTKQVFMALINAYASFGQFEKAQQVMLDKRVPVKNLNEIKSVLI 1690 +++ K+YE++ +GI +KQ+FMALINAYA+ GQFEKA+QV+LDK P+K+LNEI+SVL+ Sbjct: 487 EEIIKYYEEMKVAGIQVSKQIFMALINAYATCGQFEKAKQVLLDKEFPIKHLNEIRSVLV 546 Query: 1689 GALASNGKFSCALELYEEMKGAKCELDPKAVKTLIEYFQSEGXXXXXXXXXXXXXXSPYW 1510 ALAS+G+ + AL LYEEMK A L+PKAV +LIE+ SEG YW Sbjct: 547 SALASHGQMTDALNLYEEMKQAGSNLEPKAVISLIEHVDSEGEQSRLLKLLEELDDHNYW 606 Query: 1509 TDACFRVISHCVRQEDLRSTVDLLKQLKDRFIDAEVALEVLFDEVFCLFADQEPTDMHFG 1330 D CFRVI +C+R +DLRS VDLLKQLKDRF D E+A+EVLFDEVF A+ EP ++ G Sbjct: 607 VDGCFRVILYCIRNKDLRSAVDLLKQLKDRFSDDELAMEVLFDEVFSQVAETEPANVRIG 666 Query: 1329 LKLLQAIKEELGVRPSRKSLDFLLSACVITKDSKTSFLIWKEYVTSGLPYNILSFVRMYQ 1150 + LLQAIK+ELG PSRK LDFLL+ACV KD S L+WKEY +GLPYN+ S++RMYQ Sbjct: 667 MDLLQAIKDELGASPSRKCLDFLLTACVNAKDLGNSLLVWKEYQAAGLPYNVTSYLRMYQ 726 Query: 1149 ALLASGDTKSAAKLLNKIPKDDVHVRQVIRACQETYV 1039 ALLASG SA +LNKIPKDD HVR VI+ CQ TY+ Sbjct: 727 ALLASGGHVSAKVMLNKIPKDDPHVRIVIQECQRTYI 763 >emb|CBI20513.3| unnamed protein product [Vitis vinifera] Length = 618 Score = 629 bits (1623), Expect = e-177 Identities = 327/580 (56%), Positives = 416/580 (71%), Gaps = 5/580 (0%) Frame = -2 Query: 2760 LPRLAYYEMLLWISINNEEKIQELCLSVATVDAKEKLYLRESYLMALCESNYKEEFLLLF 2581 L RL YYEMLLW+ +NNEEKIQELC +A D +K L E+Y++ALCES KEE L + Sbjct: 20 LSRLGYYEMLLWVRVNNEEKIQELCNGIAADDGADKPNLTENYVLALCESGRKEELLKVL 79 Query: 2580 KTLDISKVTSVAYLESIFKALGKFLLETYAERFLSVLKGRGDI-----GAENISNFILQH 2416 + +DI+KV+SV Y+ SIFK+LG+ L ++ E+F+S K G + GAE IS+FI + Sbjct: 80 EIIDITKVSSVDYVASIFKSLGRLSLASFMEKFVSAFKACGTLMIMYYGAEEISDFIFYY 139 Query: 2415 TINLPNLAVEDIIKKFQKLLAKFEVLPTSAQYEKLIEYCCGFFKVHEALDVVDEAFGSGI 2236 N+PNLAVED+I KF+ L A+ V P+S Y KL+ YCC FKVH ALD+VD+ +G+ Sbjct: 140 ASNMPNLAVEDVILKFKDLHAQLVVTPSSTSYNKLVTYCCDSFKVHAALDIVDQMCEAGL 199 Query: 2235 ALSLEAFHSILDACDRSCEFNLVHQINLRISHQNLKPNNETFRRMILLCVKMKDFEGAYG 2056 LS+E FHSIL A + S EFNLVH+I I HQ+L+PN ETFR MI L VKMKDF+GAY Sbjct: 200 TLSIEMFHSILRASEESFEFNLVHRIYSVICHQSLEPNCETFRIMINLHVKMKDFDGAYD 259 Query: 2055 LIGDLQKMNLTPTASMYNSILAGYFREKNIKSARNVLKQMEDANVKPDASTYSYLIANSR 1876 L+ D++K+NLTPTA +YN+I+ GYFREKNI VLKQM DA+VKPD+ T+ YL+ N Sbjct: 260 LLKDMKKINLTPTAGIYNAIMGGYFREKNIYGGLMVLKQMGDADVKPDSQTFCYLLNNCE 319 Query: 1875 SEKDLTKFYEDLIESGIPPTKQVFMALINAYASFGQFEKAQQVMLDKRVPVKNLNEIKSV 1696 E+D+ K+YE L +G+ TK VFMALINAYAS GQFEKA+QV+LDK VP+K+LNEIKSV Sbjct: 320 CEEDIIKYYEKLKCAGVQVTKHVFMALINAYASCGQFEKAKQVVLDKGVPIKSLNEIKSV 379 Query: 1695 LIGALASNGKFSCALELYEEMKGAKCELDPKAVKTLIEYFQSEGXXXXXXXXXXXXXXSP 1516 L+ ALA +G+ S A ++YEE+K + L+PKA+ LIEY QSEG Sbjct: 380 LVSALALHGQISDAFDIYEEIKNSGFNLEPKAIILLIEYHQSEGDLTRLLQLLEELNDPD 439 Query: 1515 YWTDACFRVISHCVRQEDLRSTVDLLKQLKDRFIDAEVALEVLFDEVFCLFADQEPTDMH 1336 Y + R++ +CVR L S +DLLKQLKD F D E+A+E + DEVF L A+ EP ++ Sbjct: 440 YRVEGSCRILLYCVRYNHLSSAIDLLKQLKDTFHDNELAMEAILDEVFSLIAEIEPVNLK 499 Query: 1335 FGLKLLQAIKEELGVRPSRKSLDFLLSACVITKDSKTSFLIWKEYVTSGLPYNILSFVRM 1156 GL LL AIKEELG+RPSRKSLDFLL+ACV KD S LIW+EY T+G +N+LSF+RM Sbjct: 500 IGLDLLTAIKEELGLRPSRKSLDFLLAACVNGKDLDNSRLIWREYQTAGFTHNVLSFLRM 559 Query: 1155 YQALLASGDTKSAAKLLNKIPKDDVHVRQVIRACQETYVE 1036 YQA LA GD KSAA +L+KIPKDD HV VI+ACQ TY + Sbjct: 560 YQACLACGDHKSAANILHKIPKDDPHVCCVIKACQTTYAK 599 >ref|XP_002279701.2| PREDICTED: pentatricopeptide repeat-containing protein At4g04790, mitochondrial-like [Vitis vinifera] Length = 848 Score = 628 bits (1619), Expect = e-177 Identities = 327/581 (56%), Positives = 417/581 (71%), Gaps = 6/581 (1%) Frame = -2 Query: 2760 LPRLAYYEMLLWISINNEEKIQELCLSVATVDAKEK------LYLRESYLMALCESNYKE 2599 L RL YYEMLLW+ +NNEEKIQELC +A D +K +Y+ E+Y++ALCES KE Sbjct: 250 LSRLGYYEMLLWVRVNNEEKIQELCNGIAADDGADKPNLTDTVYIAENYVLALCESGRKE 309 Query: 2598 EFLLLFKTLDISKVTSVAYLESIFKALGKFLLETYAERFLSVLKGRGDIGAENISNFILQ 2419 E L + + +DI+KV+SV Y+ SIFK+LG+ L ++ E+F+S K D GAE IS+FI Sbjct: 310 ELLKVLEIIDITKVSSVDYVASIFKSLGRLSLASFMEKFVSAFKAC-DYGAEEISDFIFY 368 Query: 2418 HTINLPNLAVEDIIKKFQKLLAKFEVLPTSAQYEKLIEYCCGFFKVHEALDVVDEAFGSG 2239 + N+PNLAVED+I KF+ L A+ V P+S Y KL+ YCC FKVH ALD+VD+ +G Sbjct: 369 YASNMPNLAVEDVILKFKDLHAQLVVTPSSTSYNKLVTYCCDSFKVHAALDIVDQMCEAG 428 Query: 2238 IALSLEAFHSILDACDRSCEFNLVHQINLRISHQNLKPNNETFRRMILLCVKMKDFEGAY 2059 + LS+E FHSIL A + S EFNLVH+I I HQ+L+PN ETFR MI L VKMKDF+GAY Sbjct: 429 LTLSIEMFHSILRASEESFEFNLVHRIYSVICHQSLEPNCETFRIMINLHVKMKDFDGAY 488 Query: 2058 GLIGDLQKMNLTPTASMYNSILAGYFREKNIKSARNVLKQMEDANVKPDASTYSYLIANS 1879 L+ D++K+NLTPTA +YN+I+ GYFREKNI VLKQM DA+VKPD+ T+ YL+ N Sbjct: 489 DLLKDMKKINLTPTAGIYNAIMGGYFREKNIYGGLMVLKQMGDADVKPDSQTFCYLLNNC 548 Query: 1878 RSEKDLTKFYEDLIESGIPPTKQVFMALINAYASFGQFEKAQQVMLDKRVPVKNLNEIKS 1699 E+D+ K+YE L +G+ TK VFMALINAYAS GQFEKA+QV+LDK VP+K+LNEIKS Sbjct: 549 ECEEDIIKYYEKLKCAGVQVTKHVFMALINAYASCGQFEKAKQVVLDKGVPIKSLNEIKS 608 Query: 1698 VLIGALASNGKFSCALELYEEMKGAKCELDPKAVKTLIEYFQSEGXXXXXXXXXXXXXXS 1519 VL+ ALA +G+ S A ++YEE+K + L+PKA+ LIEY QSEG Sbjct: 609 VLVSALALHGQISDAFDIYEEIKNSGFNLEPKAIILLIEYHQSEGDLTRLLQLLEELNDP 668 Query: 1518 PYWTDACFRVISHCVRQEDLRSTVDLLKQLKDRFIDAEVALEVLFDEVFCLFADQEPTDM 1339 Y + R++ +CVR L S +DLLKQLKD F D E+A+E + DEVF L A+ EP ++ Sbjct: 669 DYRVEGSCRILLYCVRYNHLSSAIDLLKQLKDTFHDNELAMEAILDEVFSLIAEIEPVNL 728 Query: 1338 HFGLKLLQAIKEELGVRPSRKSLDFLLSACVITKDSKTSFLIWKEYVTSGLPYNILSFVR 1159 GL LL AIKEELG+RPSRKSLDFLL+ACV KD S LIW+EY T+G +N+LSF+R Sbjct: 729 KIGLDLLTAIKEELGLRPSRKSLDFLLAACVNGKDLDNSRLIWREYQTAGFTHNVLSFLR 788 Query: 1158 MYQALLASGDTKSAAKLLNKIPKDDVHVRQVIRACQETYVE 1036 MYQA LA GD KSAA +L+KIPKDD HV VI+ACQ TY + Sbjct: 789 MYQACLACGDHKSAANILHKIPKDDPHVCCVIKACQTTYAK 829 >ref|XP_002522838.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223537922|gb|EEF39536.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 867 Score = 594 bits (1531), Expect = e-167 Identities = 314/575 (54%), Positives = 405/575 (70%) Frame = -2 Query: 2760 LPRLAYYEMLLWISINNEEKIQELCLSVATVDAKEKLYLRESYLMALCESNYKEEFLLLF 2581 L RL YYEMLL I +NNEEKI+ELC +A + E L E+YL+ALCE + K E L L Sbjct: 285 LARLGYYEMLLCIRVNNEEKIRELCSYIANNNLNETFDLLENYLLALCERDRKNELLQLL 344 Query: 2580 KTLDISKVTSVAYLESIFKALGKFLLETYAERFLSVLKGRGDIGAENISNFILQHTINLP 2401 + +DI+KV+S+ ++ SIF +LG+ LLE+ A++FLS K D ENIS I + ++P Sbjct: 345 EIVDITKVSSLEHMVSIFNSLGRLLLESLAKKFLSTFK-ECDYDTENISTLIFSYATSVP 403 Query: 2400 NLAVEDIIKKFQKLLAKFEVLPTSAQYEKLIEYCCGFFKVHEALDVVDEAFGSGIALSLE 2221 NLAVED I KF+ L E+ P+S YEKLI Y C KVH ALD+VD+ + + LS++ Sbjct: 404 NLAVEDAILKFKNLHVMLEMPPSSKSYEKLIIYSCDLLKVHAALDIVDQMCKADLTLSID 463 Query: 2220 AFHSILDACDRSCEFNLVHQINLRISHQNLKPNNETFRRMILLCVKMKDFEGAYGLIGDL 2041 +SIL AC+ S EFNLV QI I H NL PNNETFR MI L VKMKDF GA+ ++ DL Sbjct: 464 VLNSILRACEESFEFNLVQQIYSLICHHNLTPNNETFRSMIKLRVKMKDFCGAHDMLDDL 523 Query: 2040 QKMNLTPTASMYNSILAGYFREKNIKSARNVLKQMEDANVKPDASTYSYLIANSRSEKDL 1861 +K LTPTASMYN+I+AG FREKNI VLK+ME A+VKPD+ TYS LIAN SE + Sbjct: 524 KKFKLTPTASMYNAIMAGCFREKNINGGLMVLKKMELADVKPDSQTYSNLIANCNSENQI 583 Query: 1860 TKFYEDLIESGIPPTKQVFMALINAYASFGQFEKAQQVMLDKRVPVKNLNEIKSVLIGAL 1681 +K+YE+L GI +KQ+FMALINAYA+ GQFEKA+QV+LDK +P++N+ EIKS L+ AL Sbjct: 584 SKYYEELKFVGIHVSKQIFMALINAYATCGQFEKAKQVLLDKGIPIENVIEIKSALVSAL 643 Query: 1680 ASNGKFSCALELYEEMKGAKCELDPKAVKTLIEYFQSEGXXXXXXXXXXXXXXSPYWTDA 1501 AS+G+ S AL +YEE+K A ++PK+V LIE++QSEG Y D Sbjct: 644 ASHGQMSDALVVYEEIKEAGGNMEPKSVICLIEHYQSEGELSRLLKLLEELQDPNYCVDG 703 Query: 1500 CFRVISHCVRQEDLRSTVDLLKQLKDRFIDAEVALEVLFDEVFCLFADQEPTDMHFGLKL 1321 C RV+ C+R + L S V+LLKQLKDR E+ ++V+FDEVF L A+ EPTD+ GL L Sbjct: 704 CCRVMLWCIRNKHLSSAVNLLKQLKDRLSSDELTMQVIFDEVFSLIAEMEPTDLLIGLDL 763 Query: 1320 LQAIKEELGVRPSRKSLDFLLSACVITKDSKTSFLIWKEYVTSGLPYNILSFVRMYQALL 1141 LQ IK+EL V PSRKSLDFLLSAC KD S IWKEY +G PYN++S++RMYQALL Sbjct: 764 LQVIKDELCVCPSRKSLDFLLSACAKAKDLTNSLFIWKEYHAAGYPYNVISYLRMYQALL 823 Query: 1140 ASGDTKSAAKLLNKIPKDDVHVRQVIRACQETYVE 1036 +SGD +SA +L +I KDD HVR++I+ACQ+TY++ Sbjct: 824 SSGDYRSAKVILAEIQKDDPHVRRMIQACQKTYIQ 858 >ref|NP_192388.3| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|223635640|sp|Q6NQ81.2|PP304_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g04790, mitochondrial; Flags: Precursor gi|332657026|gb|AEE82426.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 821 Score = 512 bits (1319), Expect = e-142 Identities = 264/576 (45%), Positives = 383/576 (66%) Frame = -2 Query: 2769 PDLLPRLAYYEMLLWISINNEEKIQELCLSVATVDAKEKLYLRESYLMALCESNYKEEFL 2590 P + RL YYEMLLWI + + EKI+ELC ++ + + L+E+YL+ALC+ + K Sbjct: 228 PGSVERLGYYEMLLWIHLGDGEKIEELCSTIDGDNGESLSVLQENYLLALCKKDQKYHLE 287 Query: 2589 LLFKTLDISKVTSVAYLESIFKALGKFLLETYAERFLSVLKGRGDIGAENISNFILQHTI 2410 L + +DI+KV S L +IF+ LG+F L++ A RFL L+ D G +N+S+ I ++ Sbjct: 288 RLLEIVDITKVRSSDLLANIFEYLGRFSLDSVASRFLWELR-ESDEGVKNVSDLISIYST 346 Query: 2409 NLPNLAVEDIIKKFQKLLAKFEVLPTSAQYEKLIEYCCGFFKVHEALDVVDEAFGSGIAL 2230 PN VED I KF K+ + +V+P+S YEKL++Y C +V ALDVV++ +G+ + Sbjct: 347 CTPNPTVEDTILKFNKMHEELDVMPSSTSYEKLVKYSCDSNEVVTALDVVEKMGEAGLMI 406 Query: 2229 SLEAFHSILDACDRSCEFNLVHQINLRISHQNLKPNNETFRRMILLCVKMKDFEGAYGLI 2050 S + HS+L A D EF+LV +I+ + +++KPN E FR +I LC ++KDFEGAY ++ Sbjct: 407 SADILHSLLHAIDEVLEFDLVRRIHSIMCTKSVKPNTENFRSIIRLCTRIKDFEGAYNML 466 Query: 2049 GDLQKMNLTPTASMYNSILAGYFREKNIKSARNVLKQMEDANVKPDASTYSYLIANSRSE 1870 G+L+ NL P +SM+N ILAGYFREKN+ SA V+KQM++A VKPD+ T+ YLI N E Sbjct: 467 GNLKNFNLEPNSSMFNCILAGYFREKNVSSALMVVKQMKEAGVKPDSITFGYLINNCTQE 526 Query: 1869 KDLTKFYEDLIESGIPPTKQVFMALINAYASFGQFEKAQQVMLDKRVPVKNLNEIKSVLI 1690 +TK+YE++ ++G+ TK+++M+LI+AYA+ G+FEKA+QV++D VP N NE+KSVLI Sbjct: 527 DAITKYYEEMKQAGVQATKRIYMSLIDAYAASGKFEKAKQVLVDPDVPAINQNELKSVLI 586 Query: 1689 GALASNGKFSCALELYEEMKGAKCELDPKAVKTLIEYFQSEGXXXXXXXXXXXXXXSPYW 1510 ALAS GK++ AL +YEEM+ A+C +DPK++ +LIEY S+G W Sbjct: 587 SALASRGKWADALHIYEEMRKAECHVDPKSIISLIEYSDSKGELSTLVQLADDLQDDTSW 646 Query: 1509 TDACFRVISHCVRQEDLRSTVDLLKQLKDRFIDAEVALEVLFDEVFCLFADQEPTDMHFG 1330 D FR+I VR + VDLLK+ K R + + +E FDEVF A+ EP+ +H G Sbjct: 647 IDGFFRMILFAVRNKKSSDIVDLLKRNKVRLLKKGIPVEAHFDEVFWAIAETEPSKVHLG 706 Query: 1329 LKLLQAIKEELGVRPSRKSLDFLLSACVITKDSKTSFLIWKEYVTSGLPYNILSFVRMYQ 1150 + LL+ +K+ELG PSRK LDFLL ACV KD + L+WKEY ++ P N+LSF+RMYQ Sbjct: 707 MDLLRFMKDELGFVPSRKCLDFLLHACVNAKDLEHGLLVWKEYQSAAFPCNVLSFLRMYQ 766 Query: 1149 ALLASGDTKSAAKLLNKIPKDDVHVRQVIRACQETY 1042 LLA+GD++ A L++KIPKDD V+ +I Q + Sbjct: 767 VLLAAGDSEGAKALVSKIPKDDKDVQHIIEESQSAF 802