BLASTX nr result

ID: Cephaelis21_contig00019926 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00019926
         (2474 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002332130.1| predicted protein [Populus trichocarpa] gi|2...   657   0.0  
ref|XP_002274185.2| PREDICTED: uncharacterized protein LOC100242...   656   0.0  
ref|XP_002869971.1| hypothetical protein ARALYDRAFT_914700 [Arab...   550   e-154
ref|NP_193700.2| Mitochondrial transcription termination factor ...   548   e-153
ref|XP_003545205.1| PREDICTED: uncharacterized protein LOC100803...   539   e-150

>ref|XP_002332130.1| predicted protein [Populus trichocarpa] gi|222875180|gb|EEF12311.1|
            predicted protein [Populus trichocarpa]
          Length = 577

 Score =  657 bits (1695), Expect = 0.0
 Identities = 327/573 (57%), Positives = 423/573 (73%), Gaps = 2/573 (0%)
 Frame = +2

Query: 362  MISQLNNLLKFPPLLERGSARHRPQPFPSTRRILCFKSSVPRSNNLKVSLVES-QKTVSL 538
            MI+Q+N  L F P+    S + +  PF S+ ++ CF SS   S N KVS   S Q  V+L
Sbjct: 1    MIAQVNKSLVFSPVDIEISYK-KQNPFLSSLKVQCFCSS-RSSQNAKVSHSGSVQSLVTL 58

Query: 539  PVNKISRVARNDAQAALFDYLHHTRGFNYLDAEHICKNSPHFIQNLVAKVENEQDVSRAL 718
               ++SRVAR DAQ  LFDYLH TR F++ DAEHI KNSPHFI+NL+ K++N++DV R L
Sbjct: 59   HSTRVSRVARTDAQRVLFDYLHCTRNFDFNDAEHISKNSPHFIENLLTKIDNDKDVVRLL 118

Query: 719  SKLLRYHPINEFEPFLESLGLRPSELPSLLPANLMFLEDDRMLLHNFHVLCDYGIPRSEI 898
            +K LRY+PINEFEPF ESLGLRPSE+PS+LP +LM+L D+ MLL NFHVLC+YGIPRS+I
Sbjct: 119  NKFLRYNPINEFEPFFESLGLRPSEVPSVLPPHLMYLGDNDMLLENFHVLCNYGIPRSKI 178

Query: 899  GKIYKEVVEIFGYGFGILSTKIMAYENLGLSRPTVIKVVTCCSSILVGDMDKELIEILDK 1078
            G++YKE +EIFGY +G+L  K++AYENLGLS+ TV+K+V+CC S+L+G +D+E + +L +
Sbjct: 179  GRMYKEAIEIFGYNYGVLKLKLLAYENLGLSKTTVVKLVSCCPSLLIGGVDREFVNVLGR 238

Query: 1079 LAGLGFQRDWFGGYLSSKSTYSWNKMLATIAFLVEVGYDDRQIGDLIRKSPAILFEGSGK 1258
            L   G + D  GGYLS+K +Y W +++ TI FL +VGY + Q  DL++ +P ++FEGSGK
Sbjct: 239  LNRAGLKNDLIGGYLSAKESYDWKRLIDTIYFLDKVGYSEEQFRDLLKTNPVLVFEGSGK 298

Query: 1259 QIYILVGGFLKLGLNVKEVYPLFVENPQILSPKCVQNIWKALHFLFEVGLETDYIAKIVC 1438
            ++Y+L G  LKLGL V E+Y LF + PQILS K  +N+ + +H L  +G+  + IA I+ 
Sbjct: 299  KVYLLFGRLLKLGLKVNEIYSLFTQYPQILSAKRAKNLLRGIHILLGIGMGVEDIANIIS 358

Query: 1439 THIHVLASHSLKGPKTLLRNFNGDRQRLLEVIKKNPSKFFTLASKSNVSSIEQLHANN-L 1615
            T + +L S +LKGP TL R F   +  L +++ +NP + F L SKS V S + L +    
Sbjct: 359  TQMELLCSAALKGPVTLRRQFKDKKDSLCQILMENPLELFHLDSKSEVESSKMLSSQGPT 418

Query: 1616 GLLEKTTFLLKLGYVENSDEMGKALKKFRGRGDQLQERFDCLVQAGLDSNVVSDMIRRAP 1795
              LEKT FLL+LGYVENSDEM +ALK FRGRGDQLQERFDC VQAGLD NVVS  I++AP
Sbjct: 419  NKLEKTAFLLRLGYVENSDEMARALKMFRGRGDQLQERFDCPVQAGLDCNVVSSFIKQAP 478

Query: 1796 TTMNQSKDVLEKKIACLRSLGYPIESLASFPSYLCYDIERIIRRFNMYMWLKEKGAANPM 1975
              +NQ+KDV+EKKI CL +LG  + SL +FPSYLCYD+ERI  RF MY WLKEKGAA P 
Sbjct: 479  MVLNQTKDVIEKKIDCLTNLGCSVNSLVAFPSYLCYDMERINLRFRMYTWLKEKGAAKPK 538

Query: 1976 LSPSTILTCSDARFVKYFVDVHPEGPTKWESLK 2074
            LS STIL CSDARF+KYFVDVHPEGP  WESL+
Sbjct: 539  LSLSTILACSDARFIKYFVDVHPEGPAMWESLR 571


>ref|XP_002274185.2| PREDICTED: uncharacterized protein LOC100242606 [Vitis vinifera]
          Length = 564

 Score =  656 bits (1692), Expect = 0.0
 Identities = 316/516 (61%), Positives = 410/516 (79%), Gaps = 2/516 (0%)
 Frame = +2

Query: 533  SLPVNKISRVARNDAQAALFDYLHHTRGFNYLDAEHICKNSPHFIQNLVAKVENEQDVSR 712
            S+P  ++SRV R +AQ  LFDYLH TR F+  DAEH+ KNSPHF+Q L++KVENEQDV+R
Sbjct: 45   SIPGRRVSRVVRTEAQDVLFDYLHCTRSFHLTDAEHMSKNSPHFLQKLLSKVENEQDVAR 104

Query: 713  ALSKLLRYHPINEFEPFLESLGLRPSELPSLLPANLMFLEDDRMLLHNFHVLCDYGIPRS 892
            +LSK LRY+PINEFEPF ESLGL PSE+ +LLP NLMFL DD +++ N+HVLCDYGI RS
Sbjct: 105  SLSKFLRYNPINEFEPFFESLGLAPSEISALLPRNLMFLSDDCVMIENYHVLCDYGIARS 164

Query: 893  EIGKIYKEVVEIFGYGFGILSTKIMAYENLGLSRPTVIKVVTCCSSILVGDMDKELIEIL 1072
             IG++YKEV  IF Y  G+L +K+ AYE LGLSR TVIK+V+CC  +LVG ++ + + +L
Sbjct: 165  SIGRMYKEVQAIFRYELGLLGSKVRAYEGLGLSRSTVIKLVSCCPWLLVGGVNSQFVMVL 224

Query: 1073 DKLAGLGFQRDWFGGYLSSKSTYSWNKMLATIAFLVEVGYDDRQIGDLIRKSPAILFEGS 1252
             ++ GLGF+ DW GGYLS KS+Y+W +M  TI FL +VGY + Q+  L + +P +LFEGS
Sbjct: 225  KRVKGLGFESDWIGGYLSGKSSYNWKRMHDTIDFLEKVGYSEEQMVSLFKTNPELLFEGS 284

Query: 1253 GKQIYILVGGFLKLGLNVKEVYPLFVENPQILSPKCVQNIWKALHFLFEVGLETDYIAKI 1432
            GK+ Y+L+G  LKLG  +K V  LF++NPQILS KCV+N+W+A+ FLFE+G++ + I  I
Sbjct: 285  GKKFYVLIGRLLKLGFKMKGVLSLFLQNPQILSKKCVKNLWQAVGFLFEIGMKVEDIVSI 344

Query: 1433 VCTHIHVLASHSLKGPKTLLRNFNGDRQRLLEVIKKNPSKFFTLASKSNVSSIEQLHANN 1612
            V +H+ +L S SLKGP+T+LR+    R+ L ++IK++PS+  +LASKS ++S+E +   +
Sbjct: 345  VSSHVQLLCSCSLKGPRTVLRSLKVGREGLCQIIKEDPSELLSLASKSKINSMEHVTCQS 404

Query: 1613 LGL-LEKTTFLLKLGYVENSDEMGKALKKFRGRGDQLQERFDCLVQAGLDSNVVSDMIRR 1789
                LEKTTFLL+LGYVENSDEM KALK FRGRGDQLQERFDCLVQAGLD NVVS+MI++
Sbjct: 405  PSKHLEKTTFLLRLGYVENSDEMFKALKLFRGRGDQLQERFDCLVQAGLDCNVVSNMIKQ 464

Query: 1790 APTTMNQSKDVLEKKIACLRS-LGYPIESLASFPSYLCYDIERIIRRFNMYMWLKEKGAA 1966
            AP+ +NQ+K V+EKKI CLR+ LGYP++S+ +FPSYLCYDIERI  RF+MY+WL++KGAA
Sbjct: 465  APSVLNQTKYVIEKKIDCLRNCLGYPLQSVVAFPSYLCYDIERINLRFSMYVWLRDKGAA 524

Query: 1967 NPMLSPSTILTCSDARFVKYFVDVHPEGPTKWESLK 2074
               LS STIL CSDARFVKYFVDVHPEGP +WE L+
Sbjct: 525  KSNLSLSTILACSDARFVKYFVDVHPEGPAQWERLR 560


>ref|XP_002869971.1| hypothetical protein ARALYDRAFT_914700 [Arabidopsis lyrata subsp.
            lyrata] gi|297315807|gb|EFH46230.1| hypothetical protein
            ARALYDRAFT_914700 [Arabidopsis lyrata subsp. lyrata]
          Length = 550

 Score =  550 bits (1418), Expect = e-154
 Identities = 293/588 (49%), Positives = 379/588 (64%), Gaps = 4/588 (0%)
 Frame = +2

Query: 320  AMLISRISKRVP--CTMISQLNNLLKFPPLLERGSARHRPQPFPSTRRILCFKSSVPRSN 493
            AML+ R  +RV     MIS LNN + F P                          +PR N
Sbjct: 6    AMLLCREQRRVHKLLNMISNLNNCIAFSP--------------------------IPRQN 39

Query: 494  NLKVSLVESQKTVSLPVNKISRVARNDAQAALFDYLHHTRGFNYLDAEHICKNSPHFIQN 673
                  V+  K V + +N      R   Q                  EHI KNSP F+  
Sbjct: 40   Q-----VQRLKAVFVRINLSYNNTRLTYQL-----------------EHISKNSPCFMST 77

Query: 674  LVAKVE-NEQDVSRALSKLLRYHPINEFEPFLESLGLRPSELPSLLPANLMFLEDDRMLL 850
            L++K++ N +DVSR L+K LRY+PINEFEPF ESLGL P E  + LP  LMFL DD ++ 
Sbjct: 78   LLSKIDDNHKDVSRGLTKFLRYNPINEFEPFFESLGLCPYEFETFLPQKLMFLSDDGIMF 137

Query: 851  HNFHVLCDYGIPRSEIGKIYKEVVEIFGYGFGILSTKIMAYENLGLSRPTVIKVVTCCSS 1030
             NFH LC+YGIPR +IG +YKE  EIF Y  G+L+ K+  YENLGLS+ TVIK+VT C  
Sbjct: 138  ENFHALCNYGIPRGKIGHMYKEAREIFRYESGLLAMKLRDYENLGLSKATVIKLVTSCPL 197

Query: 1031 ILVGDMDKELIEILDKLAGLGFQRDWFGGYLSSKSTYSWNKMLATIAFLVEVGYDDRQIG 1210
            +LVG +D E   ++DKL GL    DW G YLS + TYSW ++L TI FL +VG  D  + 
Sbjct: 198  LLVGGIDAEFASVVDKLKGLQVGCDWLGRYLSDRRTYSWRRILETIEFLDKVGCKDENLS 257

Query: 1211 DLIRKSPAILFEGSGKQIYILVGGFLKLGLNVKEVYPLFVENPQILSPKCVQNIWKALHF 1390
             L++  PA++ EGSGK+ Y+L G   K+GL V E+Y LF++NP++LS KCV+NI K L F
Sbjct: 258  SLLKTYPALVIEGSGKKFYVLFGRLFKVGLQVNEIYRLFIDNPEMLSDKCVKNIQKTLDF 317

Query: 1391 LFEVGLETDYIAKIVCTHIHVLASHSLKGPKTLLRNFNGDRQRLLEVIKKNPSKFFTLAS 1570
            L  + +ET +I KI+ +H+ ++ S SL  P+T   + N  +  L +++KK P + F+  S
Sbjct: 318  LIAIRMETQFITKILLSHMELIGSCSLPAPRTACLSLNVRQDELCQLLKKEPLRLFSFVS 377

Query: 1571 KSNVSSIEQLHANNLGLLEKTTFLLKLGYVENSDEMGKALKKFRGRGDQLQERFDCLVQA 1750
             +     + L  ++   LEKT FLL+LGYVENSDEM KALK+FRGRGDQLQERFDCLV+A
Sbjct: 378  TTKKRKSKPLSEDSRKYLEKTAFLLRLGYVENSDEMVKALKQFRGRGDQLQERFDCLVKA 437

Query: 1751 GLDSNVVSDMIRRAPTTMNQSKDVLEKKIACLRS-LGYPIESLASFPSYLCYDIERIIRR 1927
            GL+ NVV+++IR AP  +N SKDV+EKKI  L   LGYPIESL  FP+YLCYD++RI  R
Sbjct: 438  GLNHNVVTEIIRHAPMILNLSKDVIEKKIHSLTELLGYPIESLVRFPAYLCYDMQRIHHR 497

Query: 1928 FNMYMWLKEKGAANPMLSPSTILTCSDARFVKYFVDVHPEGPTKWESL 2071
            F+MY+WL+E+ AA PMLSPSTILTC DARFVKYFV+VHPEGP  WES+
Sbjct: 498  FSMYLWLRERDAAKPMLSPSTILTCGDARFVKYFVNVHPEGPAIWESI 545


>ref|NP_193700.2| Mitochondrial transcription termination factor family protein
            [Arabidopsis thaliana] gi|332658810|gb|AEE84210.1|
            Mitochondrial transcription termination factor family
            protein [Arabidopsis thaliana]
          Length = 575

 Score =  548 bits (1411), Expect = e-153
 Identities = 292/588 (49%), Positives = 381/588 (64%), Gaps = 4/588 (0%)
 Frame = +2

Query: 320  AMLISRISKRVP--CTMISQLNNLLKFPPLLERGSARHRPQPFPSTRRILCFKSSVPRSN 493
            AML+SR  +RV     MIS LN  + F                          SS+PR N
Sbjct: 31   AMLLSRNQRRVHKLLNMISNLNYCITF--------------------------SSIPRQN 64

Query: 494  NLKVSLVESQKTVSLPVNKISRVARNDAQAALFDYLHHTRGFNYLDAEHICKNSPHFIQN 673
                  V+  K V + +N      R   Q                  EHI KNSP F+  
Sbjct: 65   P-----VQRLKAVFVRINLSYNNTRLTYQL-----------------EHISKNSPCFMST 102

Query: 674  LVAKVE-NEQDVSRALSKLLRYHPINEFEPFLESLGLRPSELPSLLPANLMFLEDDRMLL 850
            L++K++ N++DVS+ L+K LRY+PINEFEPF ESLGL P E  + LP  LMFL DD ++ 
Sbjct: 103  LLSKIDDNQKDVSKGLTKFLRYNPINEFEPFFESLGLCPYEFETFLPRKLMFLSDDGIMF 162

Query: 851  HNFHVLCDYGIPRSEIGKIYKEVVEIFGYGFGILSTKIMAYENLGLSRPTVIKVVTCCSS 1030
             NFH LC+YGIPR +IG++YKE  EIF Y  G+L+ K+  YENLGLS+ TVIK+VT C  
Sbjct: 163  ENFHALCNYGIPRGKIGRMYKEAREIFRYESGMLAMKLRGYENLGLSKATVIKLVTSCPL 222

Query: 1031 ILVGDMDKELIEILDKLAGLGFQRDWFGGYLSSKSTYSWNKMLATIAFLVEVGYDDRQIG 1210
            +LVG +D E   ++DKL GL    DW G YLS + TYSW ++L TI FL +VG  + ++ 
Sbjct: 223  LLVGGIDAEFSSVVDKLKGLQVGCDWLGRYLSDRKTYSWRRILETIEFLDKVGCKEEKLS 282

Query: 1211 DLIRKSPAILFEGSGKQIYILVGGFLKLGLNVKEVYPLFVENPQILSPKCVQNIWKALHF 1390
             L++  PA++ EGSGK+ Y+L G   K GL V E+Y LF++NP++LS KCV+NI K L F
Sbjct: 283  SLLKTYPALVIEGSGKKFYVLFGRLFKAGLQVNEIYRLFIDNPEMLSDKCVKNIQKTLDF 342

Query: 1391 LFEVGLETDYIAKIVCTHIHVLASHSLKGPKTLLRNFNGDRQRLLEVIKKNPSKFFTLAS 1570
            L  + +ET +I KI+ +H+ ++ S SL  P+T   + N  +  L +++KK P + F   S
Sbjct: 343  LIAIRMETQFITKILLSHMELIGSCSLPAPRTACLSLNVKQDELCKILKKEPLRLFCFVS 402

Query: 1571 KSNVSSIEQLHANNLGLLEKTTFLLKLGYVENSDEMGKALKKFRGRGDQLQERFDCLVQA 1750
             +     + L  ++   LEKT FLL+LGYVENSDEM KALK+FRGRGDQLQERFDCLV+A
Sbjct: 403  TTKKRKSKPLSEDSRKYLEKTEFLLRLGYVENSDEMVKALKQFRGRGDQLQERFDCLVKA 462

Query: 1751 GLDSNVVSDMIRRAPTTMNQSKDVLEKKIACLRS-LGYPIESLASFPSYLCYDIERIIRR 1927
            GL+ NVV+++IR AP  +N SKDV+EKKI  L   LGYPIESL  FP+YLCYD++RI  R
Sbjct: 463  GLNYNVVTEIIRHAPMILNLSKDVIEKKIHSLTELLGYPIESLVRFPAYLCYDMQRIHHR 522

Query: 1928 FNMYMWLKEKGAANPMLSPSTILTCSDARFVKYFVDVHPEGPTKWESL 2071
            F+MY+WL+E+ AA PMLSPSTILTC DARFVKYFV+VHPEGP  WES+
Sbjct: 523  FSMYLWLRERDAAKPMLSPSTILTCGDARFVKYFVNVHPEGPAIWESI 570


>ref|XP_003545205.1| PREDICTED: uncharacterized protein LOC100803162 [Glycine max]
          Length = 564

 Score =  539 bits (1389), Expect = e-150
 Identities = 263/550 (47%), Positives = 382/550 (69%), Gaps = 4/550 (0%)
 Frame = +2

Query: 437  PFPSTRRILCFKSS--VPRSNNLKVSLVESQKTVSLPVNKISRVARNDAQAALFDYLHHT 610
            P   + ++ CF++S  +P  ++  +S V S ++      ++SR+ R++AQ AL DY+H T
Sbjct: 11   PSSISLKVRCFRASEPIPLPSSSPLSSVLSSRSPK----RVSRLLRSEAQHALMDYMHST 66

Query: 611  RGFNYLDAEHICKNSPHFIQNLVAKVENEQDVSRALSKLLRYHPINEFEPFLESLGLRPS 790
            RG+ + DAE+I +NSP FI++LV+ ++++ DV R+L + LRY+PINEFEPF ESLG+ PS
Sbjct: 67   RGYTFSDAEYISENSPRFIESLVSMIDDKDDVLRSLERFLRYNPINEFEPFFESLGIDPS 126

Query: 791  ELPSLLPANLMFLEDDRMLLHNFHVLCDYGIPRSEIGKIYKEVVEIFGYGFGILSTKIMA 970
            EL   LP  + FL DD +LL NFH LC+YG+PR+ +GK +KE  EIFGY  G+L +K+ A
Sbjct: 127  ELYLFLPHGMFFLADDHVLLQNFHALCNYGVPRNRMGKFFKEAKEIFGYASGVLLSKLEA 186

Query: 971  YENLGLSRPTVIKVVTCCSSILVGDMDKELIEILDKLAGLGFQRDWFGGYLSSKSTYSWN 1150
            YENLGL + TV+K+V CC  +LVGD++ E + +LD L  +G + DW   YLS   TYSW 
Sbjct: 187  YENLGLRKSTVVKLVVCCPLLLVGDVNFEFVSVLDWLKRIGIESDWMVNYLSCSRTYSWK 246

Query: 1151 KMLATIAFLVEVGYDDRQIGDLIRKSPAILFEGSGKQIYILVGGFLKLGLNVKEVYPLFV 1330
            +ML  + FL +VGY + Q+ +L R++P +L EG G+++Y++ G  LK+G+ +  VY  FV
Sbjct: 247  RMLDAMLFLHKVGYSEEQMHNLFRENPKLLLEGFGRKVYLVFGRLLKVGVEMNVVYSYFV 306

Query: 1331 ENPQILSPKCVQNIWKALHFLFEVGLETDYIAKIVCTHIHVLASHSLKGPKTLLRNFNGD 1510
            E P IL  KC  ++ + + FL  +G+  D I  I+  ++H+L + SLKG KT+ +     
Sbjct: 307  EYPNILLNKCANDMLRVIDFLGAIGMGKDDITHILSKYMHLLITRSLKGHKTVCQELKVG 366

Query: 1511 RQRLLEVIKKNPSKFFTLASKSNVSSIEQLHANN-LGLLEKTTFLLKLGYVENSDEMGKA 1687
            +  L ++IK +P K  +LASK       ++ +++    LEKTTFLLKLGY+ENS+EM KA
Sbjct: 367  KADLYQIIKDDPLKLISLASKQEQKGNGKVDSHDPRNYLEKTTFLLKLGYIENSEEMAKA 426

Query: 1688 LKKFRGRGDQLQERFDCLVQAGLDSNVVSDMIRRAPTTMNQSKDVLEKKIACLRS-LGYP 1864
            LK FRGRGDQLQERFDCLV+AGLD N V +MI+RAP  ++Q+K V++KKI  L++ L YP
Sbjct: 427  LKMFRGRGDQLQERFDCLVEAGLDYNSVIEMIKRAPMILSQNKAVIQKKIDFLKNVLDYP 486

Query: 1865 IESLASFPSYLCYDIERIIRRFNMYMWLKEKGAANPMLSPSTILTCSDARFVKYFVDVHP 2044
            +E L  FP+Y C+D+++I+ R +MY WLKE+ A NP L+ STI+  +D RFVKYFV+VHP
Sbjct: 487  LEGLVGFPTYFCHDLDKIVERLSMYAWLKERNAVNPTLTLSTIIASNDKRFVKYFVNVHP 546

Query: 2045 EGPTKWESLK 2074
            +G   W+ LK
Sbjct: 547  QGSAIWKGLK 556

