BLASTX nr result
ID: Cephaelis21_contig00013299
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00013299 (3456 letters)

Database: ./nr
23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_003631380.1| PREDICTED: LOW QUALITY PROTEIN: protein NLP7...   176   5e-41
ref|XP_002890396.1| RWP-RK domain-containing protein [Arabidopsi...   145   7e-32
ref|XP_003606812.1| Nodule inception protein [Medicago truncatul...   140   2e-30
ref|XP_003546980.1| PREDICTED: protein NLP8-like [Glycine max]        139   4e-30
ref|XP_002530298.1| transcription factor, putative [Ricinus comm...   139   4e-30

>ref|XP_003631380.1| PREDICTED: LOW QUALITY PROTEIN: protein NLP7-like [Vitis vinifera] Length = 982 Score = 176 bits (445), Expect = 5e-41 Identities = 137/432 (31%), Positives = 200/432 (46%), Gaps = 47/432 (10%) Frame = -1 Query: 3369 SDENSLLQIWAPVTIGDKQYLTTSDQPFAMKELVKGLCSYRKHCLDYLIPVDDDLSTEGG 3190 ++++ L Q+WAPV GD+ LTT QPF + GL YR L Y VD + G Sbjct: 150 TEQHVLAQVWAPVKNGDRCLLTTYGQPFVLDPHSNGLHQYRMISLTYTFSVDGE---SDG 206 Query: 3189 LVGPPGRVLRNGMPEVCLDVGHYTCTEYPLRDKALACGIYYYLALPM------SCGRVLE 3028 + P RV R +PE +V +Y+ EY + AL + LALP+ SC VLE Sbjct: 207 ALRLPARVFRQKLPEWTPNVQYYSSREYSRLNHALHYNVRGTLALPVFEPSGPSCVGVLE 266 Query: 3027 LAHDGPP----PRSWRDFRLM----------FEDPELSL-----GPTIPEIKEMLQAVCR 2905 L P + + + E P+ + + EI E+ VC Sbjct: 267 LIMTSQKINYAPEVDKVCKALEAVNLKSSEILEHPKAQICNEGRQNALAEILEIFTVVCE 326 Query: 2904 THGLPLAQTWVSMPPDSTRFENEAVKYKNGAFYAVNDDYDGD---------DEEGSFMVD 2752 T+ LPLAQTWV + +V G +DG + ++VD Sbjct: 327 TYKLPLAQTWVPC-------RHRSVLAGGGGLRKSCSSFDGSCMGQVCMSTTDVAFYVVD 379 Query: 2751 YDFRDFILACHFCCIKTGRGVVGKAFSSKGACFCKDVCQLSINEYPLVSSARKAGVTGCF 2572 F AC ++ G+GV G+AF
S +C+C ++ Q EYPLV AR G+T CF Sbjct: 380 AHMWGFREACAEHHLQKGQGVAGRAFESHNSCYCSNITQFCKTEYPLVHYARMFGLTCCF 439 Query: 2571 AICLQHTQLADFVYIVEFFLPSNEAESVDPRTLIEMLLTTMKLHLKNFKVASGQEL-GGK 2395 AICL+ T + YI+EFFLP + +S D +TL++ LL TMK H ++ +VASG+E + Sbjct: 440 AICLRSTHTGNDDYILEFFLPPSITDSRDQQTLLDSLLATMKQHFQSLRVASGKEFEEEE 499 Query: 2394 LLFEVIKTSPNNEFDS--SGILDHTNFDSTPNSVGLTYGGQTAKLDNT----------MS 2251 E+IK N + DS I + S P L G+ +LD+T + Sbjct: 500 KSVEIIKLPMNGKLDSRLESIQISQSTPSPPGPDILPSRGEMQQLDSTKHQLMVEFDAIK 559 Query: 2250 NADNVLGGGAGK 2215 + +NV+G G + Sbjct: 560 DRENVVGAGVSQ 571 Score = 114 bits (285), Expect = 2e-22 Identities = 83/262 (31%), Positives = 132/262 (50%), Gaps = 31/262 (11%) Frame = -1 Query: 2055 KNQDNALNEIKKGLSAVCQTDQVHYAQTWL------------------ASFSADCLNVVI 1930 + + NAL EI + + VC+T ++ AQTW+ +SF C+ V Sbjct: 308 EGRQNALAEILEIFTVVCETYKLPLAQTWVPCRHRSVLAGGGGLRKSCSSFDGSCMGQVC 367 Query: 1929 KAQGGNHCRNSDAEL--FFKACKTAQIQSGHAVVGKAFSSLGACFCKNISQLGESDYCLA 1756 + DA + F +AC +Q G V G+AF S +C+C NI+Q +++Y L Sbjct: 368 MSTTDVAFYVVDAHMWGFREACAEHHLQKGQGVAGRAFESHNSCYCSNITQFCKTEYPLV 427 Query: 1755 PDSRKLGFTGCFAISLQSLHTVDIVYILEFF---SAIGSQDPRKMVCMILRKLRQELQSF 1585 +R G T CFAI L+S HT + YILEFF S S+D + ++ +L ++Q QS Sbjct: 428 HYARMFGLTCCFAICLRSTHTGNDDYILEFFLPPSITDSRDQQTLLDSLLATMKQHFQSL 487 Query: 1584 KVASGQEL-GEKLFVEVLKI----SPDDELESFEIQENSVSTSAFEFKELLDESEMMQVD 1420 +VASG+E E+ VE++K+ D LES +I +++ S + L EM Q+D Sbjct: 488 RVASGKEFEEEEKSVEIIKLPMNGKLDSRLESIQISQSTPSPPGPDI--LPSRGEMQQLD 545 Query: 1419 A---HIVNNVTSIGERNEVAGS 1363 + ++ +I +R V G+ Sbjct: 546 STKHQLMVEFDAIKDRENVVGA 567 Score = 62.4 bits (150), Expect = 8e-07 Identities = 27/81 (33%), Positives = 50/81 (61%) Frame = -1 Query: 423 DNSDIMVKASFRDDILKFGVSVSSSKMDLELEIVKRLNVPIQRCKIKYRDEDDSWILIAC 244 D + +KA++RDDI++F + ++S ++L+ E+ KRL + + IKY D+D W+LIAC Sbjct: 884 DVRTMTIKATYRDDIIRFRIPLTSGIVELKEEVAKRLKLEVGTFDIKYLDDDHEWVLIAC 943 Query: 243 DEDLRTAVSTLRSLGKTTMEM 181 + DL+ + + G + + Sbjct: 944 NADLQECMDISWTTGSNIIRL 964 >ref|XP_002890396.1| 
RWP-RK domain-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297336238|gb|EFH66655.1| RWP-RK domain-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 842 Score = 145 bits (366), Expect = 7e-32 Identities = 117/406 (28%), Positives = 181/406 (44%), Gaps = 40/406 (9%) Frame = -1 Query: 3378 NHVSDENSLLQIWAPVTIGDKQYLTTSDQPFAMKELVKGLCSYRKHCLDYLIPVDDDLST 3199 ++ ++ SL+Q+W PV G K+ LTT +QPF+ L + L +YR+ + Y + D S Sbjct: 118 DYTTERGSLIQLWVPVNRGGKRVLTTKEQPFSHDPLCQRLANYREISVKYQFSAEQDDSK 177 Query: 3198 EGGLVGPPGRVLRNGMPEVCLDVGHYTCTEYPLRDKALACGIYYYLALPM------SCGR 3037 L G PGRV +PE DV + EYP A CG+ LA+P+ C Sbjct: 178 --ALTGLPGRVFLGKLPEWTPDVRFFKSEEYPRVHHAQDCGVRGTLAIPVFEQGSKICLG 235 Query: 3036 VLELAHDGPPPRSWRDFRLM--------FEDPELSLGPT-----------IPEIKEMLQA 2914 V+E+ + + + EL + PT +PEI+ +L+ Sbjct: 236 VIEVVMTTEMVKLRPELESICRALQAVDLRSTELPIPPTLKGCDLSYQAALPEIRNLLRC 295 Query: 2913 VCRTHGLPLAQTWVSMPPDSTRFENEAVKYKNGAFYAVNDDYDGDDEEGSFMVDYDFRDF 2734 C TH LPLAQTWVS + ++ + + D + ++ D R+F Sbjct: 296 ACETHKLPLAQTWVSCQQQN----KSGCRHNDENYIHCVSTID----DACYVGDPTVREF 347 Query: 2733 ILACHFCCIKTGRGVVGKAFSSKGACFCKDVCQLSINEYPLVSSARKAGVTGCFAICLQ- 2557 AC + G+GV G+AF + G CF DV +EYPL A G+ G AI L+ Sbjct: 348 HEACSEHHLLKGQGVAGQAFLTNGPCFSSDVSNYKKSEYPLSHHANMYGLHGAVAIRLRC 407 Query: 2556 -HTQLADFVYIVEFFLPSNEAESVDPRTLIEMLLTTMKLHLKNFKVASGQELGGKLLF-- 2386 HT ADFV +EFFLP + + RT++ L T M ++ + + +EL + Sbjct: 408 IHTGPADFV--LEFFLPKECDDLEEQRTMLNALSTIMAHVPRSLRTVTDKELEEESEVIE 465 Query: 2385 --EVIKTSPNNEFDSSG---------ILDHTNFDSTPNSVGLTYGG 2281 E++ N + G + +N S P ++GL + G Sbjct: 466 REEIVTPKIENASELHGNSPWNASLEEIQRSNNTSNPQNLGLVFDG 511 Score = 67.4 bits (163), Expect = 3e-08 Identities = 77/327 (23%), Positives = 142/327 (43%), Gaps = 20/327 (6%) Frame = -1 Query: 2130 RAGGALDLEGDDNFLPNGHILIQSKKNQDNALNEIKKGLSAVCQTDQVHYAQTWLASFSA 1951 RA A+DL + LP L + AL EI+ L C+T ++ AQTW++ Sbjct: 257 RALQAVDLRSTE--LPIPPTLKGCDLSYQAALPEIRNLLRCACETHKLPLAQTWVSCQQQ 314 Query: 1950 
DCLNVVIKAQGGNHCRNSDAEL----------FFKACKTAQIQSGHAVVGKAFSSLGACF 1801 + + HC ++ + F +AC + G V G+AF + G CF Sbjct: 315 NKSGCRHNDENYIHCVSTIDDACYVGDPTVREFHEACSEHHLLKGQGVAGQAFLTNGPCF 374 Query: 1800 CKNISQLGESDYCLAPDSRKLGFTGCFAISLQSLHTVDIVYILEFF---SAIGSQDPRKM 1630 ++S +S+Y L+ + G G AI L+ +HT ++LEFF ++ R M Sbjct: 375 SSDVSNYKKSEYPLSHHANMYGLHGAVAIRLRCIHTGPADFVLEFFLPKECDDLEEQRTM 434 Query: 1629 VCMILRKLRQELQSFKVASGQELGEKLFVEVLK----ISPDDELESFEIQENSVSTSAFE 1462 + + + +S + + +EL E+ EV++ ++P E S E+ NS ++ E Sbjct: 435 LNALSTIMAHVPRSLRTVTDKELEEE--SEVIEREEIVTPKIENAS-ELHGNSPWNASLE 491 Query: 1461 FKELLDESEMMQVDAHIVNNVTSIGERNEVAGSRSGREHFLAVRQSDSTSY---GFLLDQ 1291 + + + Q + V GE ++ G + G ++ + ++S+++ GF + Sbjct: 492 EIQRSNNTSNPQ----NLGLVFDGGEPHDGFGLKRGFDYTMDSNVNESSTFSSGGFSMMA 547 Query: 1290 TSRELDAVNGPTHTTLDQSFVCNIADS 1210 + A T L Q F ++ D+ Sbjct: 548 EKKRTKADKTITLDVLRQYFAGSLKDA 574 >ref|XP_003606812.1| Nodule inception protein [Medicago truncatula] gi|355507867|gb|AES89009.1| Nodule inception protein [Medicago truncatula] Length = 912 Score = 140 bits (353), Expect = 2e-30 Identities = 110/374 (29%), Positives = 175/374 (46%), Gaps = 30/374 (8%) Frame = -1 Query: 3429 ITPTPTGEIDQDFSIWLNHVSDENS---LLQIWAPVTIGDKQYLTTSDQPFAMKELVKGL 3259 I+ +P+ +D+ L+ + L Q+WAP+ GD LTTSDQP+ + + + G Sbjct: 139 ISKSPSWSLDERMMSALSFFKESAGGGILAQVWAPIKYGDDFILTTSDQPYLLDQKLAG- 197 Query: 3258 CSYRKHCLDYLIPVDDDLSTEGGLVGPPGRVLRNGMPEVCLDVGHYTCTEYPLRDKALAC 3079 YR+ + + + + G PGRV + +PE +VG+Y +EY D A++ Sbjct: 198 --YREVSRSFTFSAEMKMGS--CCAGLPGRVFNSHVPEWTSNVGYYHKSEYLRLDHAISH 253 Query: 3078 GIYYYLALPMS-------CGRVLELAHDGPPPRSWRDF------------------RLMF 2974 + +ALP+S C VLEL P ++ RL+ Sbjct: 254 EVRGSIALPISDMNSEVSCCAVLELVTTKEKPNFDKELEFVSHALQRVNLRTIMPPRLLP 313 Query: 2973 EDPELSLGPTIPEIKEMLQAVCRTHGLPLAQTWVSMP-PDSTRFENEAVKYKNGAFYAVN 2797 + + + EI ++L+AVC H LPLA TW+ + E+E ++ K G + N Sbjct: 314 QCVSSNKRAALTEITDVLRAVCHAHSLPLALTWIPCCYSEGKGEESERIRIKEGHITSSN 373 Query: 2796 DDYDGDDEEGS-FMVDYDFRDFILACHFCCIKTGRGVVGKAFSSKGACFCKDVCQLSINE 
2620 + EE + ++ D F+ AC ++ G+G+ GKA S F DV ++E Sbjct: 374 EKCVLCIEESACYINDKMVGGFVHACSEHHLEEGQGISGKALQSNHPFFYTDVKAYDVSE 433 Query: 2619 YPLVSSARKAGVTGCFAICLQHTQLADFVYIVEFFLPSNEAESVDPRTLIEMLLTTMKLH 2440 YPLV ARK + AI L+ T D Y++EFFLP N S + + L++ L TM+ Sbjct: 434 YPLVHHARKYNLNAAVAIRLRSTYTNDDDYVLEFFLPINMIGSSEQQLLLDNLSDTMRRI 493 Query: 2439 LKNFKVASGQELGG 2398 K+ + S EL G Sbjct: 494 CKSLRTVSEAELRG 507 >ref|XP_003546980.1| PREDICTED: protein NLP8-like [Glycine max] Length = 973 Score = 139 bits (351), Expect = 4e-30 Identities = 102/345 (29%), Positives = 161/345 (46%), Gaps = 26/345 (7%) Frame = -1 Query: 3354 LLQIWAPVTIGDKQYLTTSDQPFAMKELVKGLCSYRKHCLDYLIPVDDDLSTEGGLVGPP 3175 L Q+W P+ GD+ L+TSDQP+ + +++ G YR+ + + G +G P Sbjct: 189 LAQVWVPIKHGDQFILSTSDQPYLLDQMLAG---YREVSRTFTFSTE---GKSGCFLGLP 242 Query: 3174 GRVLRNGMPEVCLDVGHYTCTEYPLRDKALACGIYYYLALPM-------SCGRVLELAHD 3016 GRV + +PE +VG+Y+ +EY + A+ + +A+P+ C VLEL Sbjct: 243 GRVFTSKVPEWTSNVGYYSMSEYLRFEHAINHKVRGSIAIPIFDLHSEFPCCAVLELVTT 302 Query: 3015 GPPP------------------RSWRDFRLMFEDPELSLGPTIPEIKEMLQAVCRTHGLP 2890 P R+ + R + + + T+ EI ++L++VC H LP Sbjct: 303 KEKPDFDRELEIVRHALQLVNLRTVKTLRCLPQSLSNNKKATLTEIVDVLRSVCHAHRLP 362 Query: 2889 LAQTWVSMP-PDSTRFENEAVKYKNGAFYAVNDDYDGDDEEGSFMVDYDFRDFILACHFC 2713 LA TW+ + +R E ++ K G + +E ++ D FI AC Sbjct: 363 LALTWIPCGYTECSRGEASRIRIKGGHSTSSEKSVLCLEESACYITDRAMAGFIRACMEH 422 Query: 2712 CIKTGRGVVGKAFSSKGACFCKDVCQLSINEYPLVSSARKAGVTGCFAICLQHTQLADFV 2533 ++ G+G+ GKA S F DV I+EYPLV ARK + AI L+ T D Sbjct: 423 HLEEGKGIAGKALQSNHPFFYPDVKTYDISEYPLVHHARKYNLNAAVAIRLRSTYTNDDD 482 Query: 2532 YIVEFFLPSNEAESVDPRTLIEMLLTTMKLHLKNFKVASGQELGG 2398 YI+EFFLP N S + + L++ L TM+ + + S EL G Sbjct: 483 YILEFFLPVNMRGSSEQQLLLDNLSGTMQRICSSLRTVSETELSG 527 Score = 58.9 bits (141), Expect = 9e-06 Identities = 51/193 (26%), Positives = 81/193 (41%), Gaps = 23/193 (11%) Frame = -1 Query: 2070 LIQSKKNQDNA-LNEIKKGLSAVCQTDQVHYAQTWLASFSADCLNVV---IKAQGGNHCR 1903 L QS N A L EI L +VC ++ A TW+ +C I+ +GG+ Sbjct: 333 
LPQSLSNNKKATLTEIVDVLRSVCHAHRLPLALTWIPCGYTECSRGEASRIRIKGGHSTS 392 Query: 1902 NSDAEL----------------FFKACKTAQIQSGHAVVGKAFSSLGACFCKNISQLGES 1771 + + L F +AC ++ G + GKA S F ++ S Sbjct: 393 SEKSVLCLEESACYITDRAMAGFIRACMEHHLEEGKGIAGKALQSNHPFFYPDVKTYDIS 452 Query: 1770 DYCLAPDSRKLGFTGCFAISLQSLHTVDIVYILEFFSAI---GSQDPRKMVCMILRKLRQ 1600 +Y L +RK AI L+S +T D YILEFF + GS + + ++ + +++ Sbjct: 453 EYPLVHHARKYNLNAAVAIRLRSTYTNDDDYILEFFLPVNMRGSSEQQLLLDNLSGTMQR 512 Query: 1599 ELQSFKVASGQEL 1561 S + S EL Sbjct: 513 ICSSLRTVSETEL 525 >ref|XP_002530298.1| transcription factor, putative [Ricinus communis] gi|223530154|gb|EEF32065.1| transcription factor, putative [Ricinus communis] Length = 985 Score = 139 bits (351), Expect = 4e-30 Identities = 113/360 (31%), Positives = 163/360 (45%), Gaps = 26/360 (7%) Frame = -1 Query: 3381 LNHVSDENSLLQIWAPVTIGDKQYLTTSDQPFAMKELVKGLCSYRKHCLDYLIPVDDDLS 3202 L S L Q+W P+ GD+ +TT +QP+ + + + G YR+ Y + Sbjct: 176 LKESSGGGILAQVWIPIQHGDQYIMTTFEQPYLLDQSLAG---YREVSRTYTFSAE---- 228 Query: 3201 TEGGL-VGPPGRVLRNGMPEVCLDVGHYTCTEYPLRDKALACGIYYYLALP------MSC 3043 + GL +G PGRV + +PE +V +Y+ EY AL + +ALP MSC Sbjct: 229 VKPGLPLGLPGRVFISKVPEWTSNVAYYSNAEYLRVKHALHHRVQGSIALPVFQPPEMSC 288 Query: 3042 GRVLELAHDGPPP------------------RSWRDFRLMFEDPELSLGPTIPEIKEMLQ 2917 VLEL P RS RL+ + + + EI ++L+ Sbjct: 289 CAVLELVTVKEKPDFDSEMESVCLALQTVNLRSTAPPRLLPQSLSRNQKAALAEISDVLR 348 Query: 2916 AVCRTHGLPLAQTWVSMP-PDSTRFENEAVKYKNGAFYAVNDDYDGDDEEGSFMVDYDFR 2740 AVC H LPLA TWV + T E V+ ++G + ++ D Sbjct: 349 AVCHAHRLPLALTWVPCNYAEGTVDEIIKVRVRDGNSRPAEKSVLCIWRQACYVKDGKME 408 Query: 2739 DFILACHFCCIKTGRGVVGKAFSSKGACFCKDVCQLSINEYPLVSSARKAGVTGCFAICL 2560 F+ AC CI+ G+G+ GKA S F DV I EYPLV ARK G+ AI L Sbjct: 409 GFVHACSEHCIEEGQGIAGKALQSNHPFFFPDVKAYDITEYPLVHHARKYGLNAAVAIRL 468 Query: 2559 QHTQLADFVYIVEFFLPSNEAESVDPRTLIEMLLTTMKLHLKNFKVASGQELGGKLLFEV 2380 + T D YI+EFFLP N S + + L+ L TM+ + + S +LGG+ F+V Sbjct: 469 RSTYTGDDDYILEFFLPVNIKGSSEQQLLLNNLSGTMQKICISLRTVSDADLGGRETFKV 528 Score = 67.8 bits (164), 
Expect = 2e-08 Identities = 61/241 (25%), Positives = 107/241 (44%), Gaps = 23/241 (9%) Frame = -1 Query: 2055 KNQDNALNEIKKGLSAVCQTDQVHYAQTWL-ASFSADCLNVVIKAQ--GGN--------- 1912 +NQ AL EI L AVC ++ A TW+ +++ ++ +IK + GN Sbjct: 334 RNQKAALAEISDVLRAVCHAHRLPLALTWVPCNYAEGTVDEIIKVRVRDGNSRPAEKSVL 393 Query: 1911 -------HCRNSDAELFFKACKTAQIQSGHAVVGKAFSSLGACFCKNISQLGESDYCLAP 1753 + ++ E F AC I+ G + GKA S F ++ ++Y L Sbjct: 394 CIWRQACYVKDGKMEGFVHACSEHCIEEGQGIAGKALQSNHPFFFPDVKAYDITEYPLVH 453 Query: 1752 DSRKLGFTGCFAISLQSLHTVDIVYILEFFSAI---GSQDPRKMVCMILRKLRQELQSFK 1582 +RK G AI L+S +T D YILEFF + GS + + ++ + +++ S + Sbjct: 454 HARKYGLNAAVAIRLRSTYTGDDDYILEFFLPVNIKGSSEQQLLLNNLSGTMQKICISLR 513 Query: 1581 VASGQELGEKLFVEVLKIS-PDDELESFEIQENSVSTSAFEFKELLDESEMMQVDAHIVN 1405 S +LG + E K++ + SF S+S+ + L+ ++ + +DA Sbjct: 514 TVSDADLGGR---ETFKVNFQKGAVPSFPPMSASISSQTTLSEANLNSTDKIPLDASSSR 570 Query: 1404 N 1402 N Sbjct: 571 N 571
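
The per-HSP statistics in the report above (Score, Expect, Identities) can be pulled out of the flattened text with a regular expression. What follows is a minimal sketch, not part of the BLAST output: the helper name parse_hsps and the dictionary keys are hypothetical, and the pattern assumes the exact "Score = ... bits (...), Expect = ... Identities = a/b (p%)" wording seen in this report. Other BLAST versions print variants such as "Expect(2) =", which this pattern does not cover.

```python
import re

# One HSP statistics block, as worded in this flattened BLASTX 2.2.25 report.
# Assumption: fields are separated by single spaces, as in the text above.
HSP_RE = re.compile(
    r"Score = (?P<bits>[\d.]+) bits \((?P<raw>\d+)\), "
    r"Expect = (?P<evalue>[\d.eE+-]+) "
    r"Identities = (?P<ident>\d+)/(?P<length>\d+) \(\d+%\)"
)


def parse_hsps(text):
    """Return one dict per HSP statistics block found in `text`.

    Hypothetical helper for illustration only.
    """
    hsps = []
    for m in HSP_RE.finditer(text):
        evalue = m["evalue"]
        # Legacy BLAST may print very small E-values as "e-100"; make it parseable.
        if evalue.startswith("e"):
            evalue = "1" + evalue
        ident, length = int(m["ident"]), int(m["length"])
        hsps.append({
            "bits": float(m["bits"]),
            "raw_score": int(m["raw"]),
            "evalue": float(evalue),
            "identities": ident,
            "align_len": length,
            # Recomputed from the counts rather than taken from the rounded "(31%)".
            "pct_identity": 100.0 * ident / length,
        })
    return hsps


# Statistics line of the top hit (XP_003631380.1), copied from the report.
sample = (
    "Score = 176 bits (445), Expect = 5e-41 "
    "Identities = 137/432 (31%), Positives = 200/432 (46%), "
    "Gaps = 47/432 (10%) Frame = -1"
)
hsps = parse_hsps(sample)
print(len(hsps), hsps[0]["bits"], hsps[0]["evalue"])
```

Recomputing the identity percentage from the raw counts avoids the rounding in the printed value; for the top hit, 137/432 is about 31.7%, which BLAST prints as 31%.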