BLASTX nr result
ID: Cephaelis21_contig00015720
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00015720 (2137 letters)

Database: ./nr
23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

emb|CCA66009.1| hypothetical protein [Beta vulgaris subsp. vulga...   217   1e-53
gb|AAC63844.1| putative non-LTR retroelement reverse transcripta...   207   9e-51
emb|CAB78008.1| putative protein [Arabidopsis thaliana] gi|73210...   204   8e-50
emb|CCA66040.1| hypothetical protein [Beta vulgaris subsp. vulga...   188   4e-45
emb|CAD41785.1| OSJNBa0035M09.1 [Oryza sativa Japonica Group] gi...   186   2e-44

>emb|CCA66009.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
Length = 1378

Score = 217 bits (552), Expect = 1e-53
Identities = 167/697 (23%), Positives = 289/697 (41%), Gaps = 30/697 (4%)
Frame = +1

Query: 118 SGIFVPFALPRNRLPITHLTFADDLVVFTNGSRHAIRVLFNFLQNYESASGQRINRNKSS 297
+G + P RN PI++L FADDL++F+ S +V+ L + ASG ++N +KS
Sbjct: 667 NGNWKPVKASRNGPPISNLAFADDLILFSEASVEQAQVMKWCLDRFCEASGSKVNEDKSK 726

Query: 298 FYVSKRLPTSRSHYIATFTGIHKGTFPFKYLGCPIAKGRIRTRNFQFLVDKVDSRIQGWQ 477
Y S + + KYLG P GR R +Q+LVD+++ ++ GW+
Sbjct: 727 IYFSANTHLDIRDAVCNTLAMEATADFGKYLGVPTINGRSSKREYQYLVDRINGKLAGWK 786

Query: 478 NKLLSSGGRLILVRHVLSTLSVYVFSSMLVPKTITSAMESCMASFLWGSFNGESKRIWRR 657
K LS GR L++ S++ Y S +P++ ++ SFLWG G+ +
Sbjct: 787 TKTLSIAGRATLIQSAFSSIPYYTMQSTKLPRSTCDDIDRKSRSFLWGEQEGKRRVHLVA 846

Query: 658 WERLALPIEENGLGVRRLQDVLHSFTCKLWWKIKTE-TGLWAKF-----------VSSYR 801
WE ++ +E GLG+R ++ +F KL W++ E + LW++ + ++
Sbjct: 847 WENISKSKKEGGLGIRSMRQANSAFLVKLGWRLLAEPSSLWSRILRAKYCDNRCDIDMFK 906

Query: 802 NGVQDSYAWPRI-RRVQERMEAATTLVGRSGNSSFWCSNWNGSGLLLDRCSTIP-----D 963
S W I + + + VG + FW W S L+ S IP D
Sbjct: 907 EKSNASSTWRGILSSIDVVRKGINSAVGNGAKTLFWHHRWATSEPLISLASPIPPIELQD 966

Query: 964 TTLSLNQVFINGCWRLDLFQDYLSAEDVQKVTDFQ-FEFLEGRDIYMWTPTQHGKFTVAS 1140
T+ ++G W++D+F +YL ++ + + + E D W + G FT+ S
Sbjct: 967 ATVKEMWDLVSG-WKVDVFANYLPEATLKLIAAHELIDDEEAIDDIYWNGSPSGGFTIGS 1025

Query: 1141 AYEELR----YKATPCPSLKYVWHKFIPLKLSFCMWRXXXXXXXXXXXXMSMGFNMPSKC 1308
A R P VW P ++ F +W +C
Sbjct: 1026 AMNITRNAELANMDAHPKWSAVWKIPTPQRVRFFIWLAIQDRLMTNSNRFLRRLTDDPRC 1085

Query: 1309 IFCDNI-ETVQHLFFDCSEASFIWKQFFLCLGIVFPHISSLWDFCQYCWTIRIASVSEII 1485
+ C + E H+ C A +W++ LG++ H + + W + S ++
Sbjct: 1086 LVCGEVEENTDHILRRCPVARILWRK----LGMLGEH--NREEINLGSWITKNLSADTMM 1139

Query: 1486 ----LRILLTLGVWHIWKARCQFFFEGTLPSARHIVSNMCTYLSQLSSGHKFRAITSADF 1653
LR+ + W +W+ R F VS + + ++
Sbjct: 1140 GSEWLRV-FAVSCWWLWRWRNDRCFNRNPSIPIDQVSFIFARVKEIKEA----------- 1187

Query: 1654 SFVSNGVLSRLLNPKNRKILIIRWMFPPHAKFKINVDGSSRGNPGMAASGVIIRDCMGGF 1833
+ ++ + RK +++RW P K+N DG+S+GNPG A G +IR G
Sbjct: 1188 --MDRNDTNKSQHSGRRKEILVRWQCPKEGWVKLNTDGASKGNPGPAGGGGLIRGPRGEI 1245

Query: 1834 VAASSTFFGVQTNLYAEIFALREGILLCRSLGISSAIFETDSQLLVHMVHGHSSWPWKYN 2013
+ G T AE+ A+ G+++ I DS+L+ ++ ++ Y
Sbjct: 1246 HEVFAINCGSCTCTKAELLAVLRGLMIAWEGNHKQVIVSVDSELVAKLLISNAPPSSPYI 1305

Query: 2014 SIISRITQMV--NTGGFTVQHVFREANRVADSLARWG 2118
II+R ++ ++H +RE NR AD LA G
Sbjct: 1306 HIINRCLSLIARKEWKIVIEHCYRETNRAADRLANMG 1342

>gb|AAC63844.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana]
Length = 1231

Score = 207 bits (527), Expect = 9e-51
Identities = 177/688 (25%), Positives = 296/688 (43%), Gaps = 27/688 (3%)
Frame = +1

Query: 133 PFALPRNRLPITHLTFADDLVVFTNGSRHAIRVLFNFLQNYESASGQRINRNKSSFYVSK 312
P A+ ++H+ FADDL++F S IR++ L+ + ASGQ+++ KS + S
Sbjct: 529 PIAVSCGGSKLSHVCFADDLILFAEASVAQIRIIRRVLERFCEASGQKVSLEKSKIFFSH 588

Query: 313 RLPTSRSHYIATFTGIHKGTFPFKYLGCPIAKGRIRTRNFQFLVDKVDSRIQGWQNKLLS 492
+ I+ +GI KYLG PI + R+ F ++++V +R+ GW+ + LS
Sbjct: 589 NVSREMEQLISEESGIGCTKELGKYLGMPILQKRMNKETFGEVLERVSARLAGWKGRSLS 648

Query: 493 SGGRLILVRHVLSTLSVYVFSSMLVPKTITSAMESCMASFLWGSFNGESKRIWRRWERLA 672
GR+ L + VLS++ V+V S++L+P + ++ +FLWGS + K+ W ++
Sbjct: 649 LAGRITLTKAVLSSIPVHVMSAILLPVSTLDTLDRYSRTFLWGSTMEKKKQHLLSWRKIC 708

Query: 673 LPIEENGLGVRRLQDVLHSFTCKLWWK-IKTETGLWAKFV-SSYR-NGVQD-SYAWPRIR 840
P E G+G+R +D+ + K+ W+ ++ + LWA+ V Y+ GVQD S+ P+ R
Sbjct: 709 KPKAEGGIGLRSARDMNKALVAKVGWRLLQDKESLWARVVRKKYKVGGVQDTSWLKPQPR 768

Query: 841 RVQERMEAATTL-----------VGRSGNSSFWCSNWNGSGLLLD-RCSTIPD---TTLS 975
A L G FW W L++ IP+ ++
Sbjct: 769 WSSTWRSVAVGLREVVVKGVGWVPGDGCTIRFWLDRWLLQEPLVELGTDMIPEGERIKVA 828

Query: 976 LNQVFINGCWRLDLFQDYLSAEDVQKVTDFQFE-FLEGRDIYMWTPTQHGKFTVASAYEE 1152
+ W L++ YL +++ + FL D W TQ G FTV SAY
Sbjct: 829 ADYWLPGSGWNLEILGLYLPETVKRRLLSVVVQVFLGNGDEISWKGTQDGAFTVRSAYSL 888

Query: 1153 LRYKATPCPSL----KYVWHKFIPLKLSFCMWRXXXXXXXXXXXXMSMGFNMPSKCIFCD 1320
L+ P++ +W P ++ +W + + + C C+
Sbjct: 889 LQGDVGDRPNMGSFFNRIWKLITPERVRVFIWLVSQNVIMTNVERVRRHLSENAICSVCN 948

Query: 1321 NI-ETVQHLFFDCSEASFIWKQFFLCLGIVFPHISSLWDFCQYCWTIRIASVSEIILRIL 1497
ET+ H+ DC IW++ + + W + I L
Sbjct: 949 GAEETILHVLRDCPAMEPIWRRL-----LPLRRHHEFFSQSLLEWLFTNMDPVKGIWPTL 1003

Query: 1498 LTLGVWHIWKARCQFFFEGTLPSARHIVSNMCTYLSQLSSGHKFRAITSADFSFVSNGVL 1677
+G+W WK RC F R I + ++ ++ + R + NGV
Sbjct: 1004 FGMGIWWAWKWRCCDVF-----GERKICRDRLKFIKDMA--EEVRRVHVGAVGNRPNGV- 1055

Query: 1678 SRLLNPKNRKILIIRWMFPPHAKFKINVDGSSRGNPGMAASGVIIRDCMGGFVAASSTFF 1857
R +IRW P KI DG+SRGN G+AA+G IR+ G ++ +
Sbjct: 1056 --------RVERMIRWQVPSDGWVKITTDGASRGNHGLAAAGGAIRNGQGEWLGGFALNI 1107

Query: 1858 GVQTNLYAEIFALREGILLCRSLGISSAIFETDSQLLVHMVHGHSSWPWKYNSIISRITQ 2037
G AE++ G+L+ G + D +L+V + S S + R+ Q
Sbjct: 1108 GSCAAPLAELWGAYYGLLIAWDKGFRRVELDLDCKLVVGFLSTGVSNAHPL-SFLVRLCQ 1166

Query: 2038 MVNTGGFTVQ--HVFREANRVADSLARW 2115
T + V+ HV+REANR+AD LA +
Sbjct: 1167 GFFTRDWLVRVSHVYREANRLADGLANY 1194

>emb|CAB78008.1| putative protein [Arabidopsis thaliana]
gi|7321072|emb|CAB82119.1| putative protein [Arabidopsis thaliana]
Length = 947

Score = 204 bits (519), Expect = 8e-50
Identities = 169/626 (26%), Positives = 275/626 (43%), Gaps = 23/626 (3%)
Frame = +1

Query: 163 ITHLTFADDLVVFTNGSRHAIRVLFNFLQNYESASGQRINRNKSSFYVSKRLPTSRSHYI 342
I+H+ FADDL++F S IRV+ L+ + ASGQ+++ +KS + SK + I
Sbjct: 353 ISHICFADDLILFAEASVSQIRVIRRILETFCIASGQKVSLDKSKIFFSKNVSRDLEKLI 412

Query: 343 ATFTGIHKGTFPFKYLGCPIAKGRIRTRNFQFLVDKVDSRIQGWQNKLLSSGGRLILVRH 522
+ +GI KYLG PI + RI F ++++V SR+ GW+ + LS GRL L +
Sbjct: 413 SKESGIKSTRELGKYLGMPILQRRINKDTFGEVLERVSSRLAGWKGRSLSFAGRLTLTKS 472

Query: 523 VLSTLSVYVFSSMLVPKTITSAMESCMASFLWGSFNGESKRIWRRWERLALPIEENGLGV 702
VLS + ++ S++ +P++ ++ FL GS + K W+R+ LP E GLG+
Sbjct: 473 VLSLIPIHTMSTISLPQSTLEGLDKLARVFLLGSSAEKKKLHLVAWDRVCLPKSEGGLGI 532

Query: 703 RRLQDVLHSFTCKLWWK-IKTETGLWAKFV-SSYRNGVQDSYAWPRIRRVQERMEAATTL 876
R + + + K+ W+ I LWA+ + S YR G +R V R + +
Sbjct: 533 RTSKCMNKALVSKVGWRLINDRYSLWARILRSKYRVG---------LREVVSR--GSRWV 581

Query: 877 VGRSGNSSFWCSNWNGSGLLLDRC-STIPDT--TLSLNQVFINGC-WRLDLFQDYLSAED 1044
VG + FW NW L++R IP++ L + ++ NG W+LD + Y+S
Sbjct: 582 VGNGRDILFWSDNWLSHEALINRAVIEIPNSEKELRVKDLWANGLGWKLDKIEPYISYHT 641

Query: 1045 VQKVTDFQFEFLEG-RDIYMWTPTQHGKFTVASAYEELRYKATPCPSL----KYVWHKFI 1209
++ + + G RD W + G FTV SAY L P P++ +W
Sbjct: 642 RLELAAVVVDSVTGARDRLSWGYSADGVFTVKSAYRLLTEDHDPRPNMAAFFDRLWRVVA 701

Query: 1210 PLKLSFCMWRXXXXXXXXXXXXMSMGFNMPSKCIFC-DNIETVQHLFFDCSEASFIWKQ- 1383
++ +W S C C ET+ H+ DC + IW++
Sbjct: 702 LERVKTFLWH----------------IGDTSVCQVCKGGDETILHVLKDCPSIAGIWRRL 745

Query: 1384 --------FF--LCLGIVFPHISSLWDFCQYCWTIRIASVSEIILRILLTLGVWHIWKAR 1533
FF G ++ ++ Y W A V VW WK R
Sbjct: 746 VQVQRSYDFFNGSLFGWLYVNLGMKNAETGYAWATLFAIV------------VWWSWKWR 793

Query: 1534 CQFFFEGTLPSARHIVSNMCTYLSQLSSGHKFRAITSADFSFVSNGVLSRLLNPKNRKIL 1713
C + F G + R V +++S H + NG L + R
Sbjct: 794 CGYVF-GEVGKCRDRVKFFRDLAAEVSHAHAIHS---------QNGGL------RTRVER 837

Query: 1714 IIRWMFPPHAKFKINVDGSSRGNPGMAASGVIIRDCMGGFVAASSTFFGVQTNLYAEIFA 1893
++ W P K+N DG+SRGN G+A +G ++RD +G + + GV + AE++
Sbjct: 838 LVAWKPPDGEWVKLNTDGASRGNLGLATTGGVLRDGIGHWCGGFALDIGVCSAPLAELWG 897

Query: 1894 LREGILLCRSLGISSAIFETDSQLLV 1971
+ G+ + + E DS+L+V
Sbjct: 898 VYYGLYMAWERRFTRVELEVDSELVV 923

>emb|CCA66040.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
Length = 1362

Score = 188 bits (478), Expect = 4e-45
Identities = 171/691 (24%), Positives = 272/691 (39%), Gaps = 40/691 (5%)
Frame = +1

Query: 163 ITHLTFADDLVVFTNGSRHAIRVLFNFLQNYESASGQRINRNKSSFYVSKRLPTSRSHYI 342
++HL FADD ++FT S ++ + + YE ASGQ++N +K+ S+ + R I
Sbjct: 679 VSHLFFADDSILFTKASVQECSMVADIISKYERASGQQVNLSKTEVVFSRSVDRERRSAI 738

Query: 343 ATFTGIHKGTFPFKYLGCPIAKGRIRTRNFQFLVDKVDSRIQGWQNKLLSSGGRLILVRH 522
G+ + KYLG P GR + F + +++ ++QGW+ KLLS G+ +L++
Sbjct: 739 VNVLGVKEVDRQEKYLGLPTIIGRSKKVTFACIKERIWKKLQGWKEKLLSRPGKEVLIKS 798

Query: 523 VLSTLSVYVFSSMLVPKTITSAMESCMASFLWGSFNGESKRIWRRWERLALPIEENGLGV 702
V + Y+ S +P + + S +A F WGS + K W W+ L P GLG
Sbjct: 799 VAQAIPTYMMSVFSLPSGLIDEIHSLLARFWWGSSDTNRKMHWHSWDTLCYPKSMGGLGF 858

Query: 703 RRLQDVLHSFTCKLWWKIKT--ETGLWAKFVSSY---------RNGVQDSYAWPRIRRVQ 849
R L S K W++ T +T L+ + Y R G S+ W I +
Sbjct: 859 RDLHCFNQSLLAKQAWRLCTGDQTLLYRLLQARYFKSSELLEARRGYNPSFTWRSIWGSK 918

Query: 850 E-RMEAATTLVGRSGNSSFWCSNWNGSGLLLDRCSTIP----DTTLSLNQVFI----NGC 1002
+E VG W W +L + +P D+ L L + G
Sbjct: 919 SLLLEGLKWCVGSGERIRVWEDAW----ILGEGAHMVPTPQADSNLDLKVCDLIDVARGA 974

Query: 1003 WRLDLFQDYLSAEDVQKVTDFQFEFLEGRDIYMWTPTQHGKFTVASAY----------EE 1152
W ++ Q E+ + V D W P+++G F+V S Y +
Sbjct: 975 WNIESVQQTFVEEEWELVLSIPLSRFLPDDHRYWWPSRNGIFSVRSCYWLGRLGPVRTWQ 1034

Query: 1153 LRYKATPCPSLKYVWHKFIPLKLSFCMWRXXXXXXXXXXXXMSMGFNMPSKCIFC-DNIE 1329
L++ + VW P KLS +WR S ++ + C C D E
Sbjct: 1035 LQHGERETELWRRVWQLQGPPKLSHFLWRACKGSLAVKGRLFSRHISVDATCSVCGDPDE 1094

Query: 1330 TVQHLFFDCSEASFIWKQFFLCLGIVFPHISSLWDFCQYCWTIRIASVSEIILRILLTLG 1509
++ H FDC+ A IW+ ++ +SS + + W + A+ E R + +
Sbjct: 1095 SINHALFDCTFARAIWQVSGFASLMMNAPLSSFSERLE--WLAKHATKEE--FRTMCSF- 1149

Query: 1510 VWHIWKARCQFFFEGTLPSA-------RHIVSNMCTYLSQLSSGHKFRAITSADFSFVSN 1668
+W W R + FE L A +V++ C Y + G +SA
Sbjct: 1150 MWAGWFCRNKLIFENELSDAPLVAKRFSKLVADYCEYAGSVFRGSGGGCGSSA------- 1202

Query: 1669 GVLSRLLNPKNRKILIIRWMFPPHAKFKINVDGSSRGNPGMAASGVIIRDCMGGFVAASS 1848
W PP FK+N D N G GV+IR GG
Sbjct: 1203 -----------------LWSPPPTGMFKVNFDAHLSPN-GEVGLGVVIRANDGGIKMLGV 1244

Query: 1849 TFFGVQ-TNLYAEIFALREGILLCRSLGISSAIFETDSQLLVHMVHGHSSWPWKYNSIIS 2025
+ T + AE A + + LG + E D+ ++++ V I +
Sbjct: 1245 KRVAARWTAVMAEAMAALFAVEVAHRLGFGRIVLEGDAMMVINAVKHKCEGVAPMFRIFN 1304

Query: 2026 RITQM-VNTGGFTVQHVFREANRVADSLARW 2115
I+ + F+V HV R N VA LARW
Sbjct: 1305 DISSLGACLDVFSVSHVRRAGNTVAHLLARW 1335

>emb|CAD41785.1| OSJNBa0035M09.1 [Oryza sativa Japonica Group]
gi|38346911|emb|CAE03883.2| OSJNBb0015N08.11 [Oryza sativa Japonica Group]
Length = 1026

Score = 186 bits (472), Expect = 2e-44
Identities = 147/497 (29%), Positives = 228/497 (45%), Gaps = 29/497 (5%)
Frame = +1

Query: 163 ITHLTFADDLVVFTNGSRHAIRVLFNFLQNYESASGQRINRNKSSFYVSKRLPTSRSHYI 342
I L +ADD + N + L L +E SG +IN NKS + + Y
Sbjct: 483 IAILQYADDTIFLINDKLDHAKNLKYILCLFEQLSGLKINFNKSEVFCFGEAKEKQDLYS 542

Query: 343 ATFTGIHKGTFPFKYLGCPIAKGRIRTRNFQFLVDKVDSRIQGWQNKLLSSGGRLILVRH 522
FT G+ P KYLG PI + RI ++++ +K++ ++ WQ +L S GGRLIL+
Sbjct: 543 NIFT-CKVGSLPLKYLGIPIDQKRILNKDWKLAENKMEHKLGCWQGRLQSIGGRLILLNS 601

Query: 523 VLSTLSVYVFSSMLVPKTITSAMESCMASFLWGSFNGESKRIWRRWERLALPIEENGLGV 702
LS++ +Y+ S +PK + ++ FLW G K W + P ++ GLGV
Sbjct: 602 TLSSVPMYMISFYRLPKGVQERIDYFRKRFLWQEDQGIRKYHLVNWPLVCSPRDQGGLGV 661

Query: 703 RRLQDVLHSFTCKLWWKIKTETGLWAKFV----------SSYRNGVQDSYAWPRIRRVQE 852
L+ + + K W+++ E G W + + S R S+ W + V++
Sbjct: 662 LDLEAMNKAMLGKWIWRLENEEGWWQEIIYAKYCSDKPLSGLRLKAGSSHFWQGVMEVKD 721

Query: 853 R-MEAATTLVGRSGNSSFWCSNWNGS-----------GLLLDRCSTIPDTTLSLNQVFIN 996
T +VG + FW +W G G+++ + TI D LN+ I+
Sbjct: 722 DFFSFCTKIVGNGEKTLFWEDSWLGGKPLAIQFPSLYGIVITKRITIAD----LNRKGID 777

Query: 997 GC--WRLDLFQDYLSAEDVQKVTDFQFEFL----EGRDIYMWTPTQHGKFTVASAYEELR 1158
C +R DL D L D +K+ + +E L +D WT ++ GKFTV S Y L+
Sbjct: 778 -CMKFRRDLHGDKL--RDWRKIVN-SWEGLNLVENCKDKLWWTLSKDGKFTVRSFYRALK 833

Query: 1159 YKATPCPSLKYVWHKFIPLKLSFCMWRXXXXXXXXXXXXMSMGFNM-PSKCIFCDNIETV 1335
+ T P+ K +W +PLK+ +W + G+ +KC FCD +ETV
Sbjct: 834 LQQTSFPN-KKIWKFRVPLKIRIFIWFFTKNKILTKDNLLKRGWRKGDNKCQFCDKVETV 892

Query: 1336 QHLFFDCSEASFIWKQFFLCLGIVFPHISSLWDFCQYCWTIRIASVSEIILRILLTLGVW 1515
QHLFFDC A IW C V P +S F + ++ + + +I+ I L W
Sbjct: 893 QHLFFDCPLARLIW-NIIACALNVKPVLSRQDLFGSWIQSMDKFTKNLVIVGIAAVL--W 949

Query: 1516 HIWKARCQFFFEGTLPS 1566
IWK R + FE LP+
Sbjct: 950 SIWKCRNKACFERKLPN 966