BLASTX nr result
ID: Cephaelis21_contig00014500
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00014500
         (1445 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits)  Value

emb|CBI17281.3| unnamed protein product [Vitis vinifera]              354   4e-95
emb|CAN75046.1| hypothetical protein VITISV_023142 [Vitis vinifera]   339   9e-91
ref|XP_002518041.1| conserved hypothetical protein [Ricinus comm...   310   8e-82
ref|XP_003601650.1| Small subunit processome component-like prot...   296   7e-78
ref|XP_004167386.1| PREDICTED: LOW QUALITY PROTEIN: small subuni...   291   4e-76

>emb|CBI17281.3| unnamed protein product [Vitis vinifera]
          Length = 2629

 Score = 354 bits (908), Expect = 4e-95
 Identities = 194/379 (51%), Positives = 258/379 (68%), Gaps = 18/379 (4%)
 Frame = +2

Query: 362  SLFFALLINPLQTTSEGVDVTHKYHWSLPESIKDEF------------DIKGLSWMKRFG 505
            +LFFA+L+ PL + S+G D T + WS E+ ++F +I LSW KR+G
Sbjct: 1010 ALFFAMLLKPLLSISKGSDTTADWFWSSHENYMNDFQAFNVLKFFTVDNINSLSWKKRYG 1069

Query: 506  FLHVVEDVLVTFDESHLNPFLDLLMGCVVRILESCTGTLVCTKFKEVSLADFGFNSGASQ 685
            FLHV+EDVL FDE H+ PFLDLLMGCVVR+L SCT +L K SL + N +
Sbjct: 1070 FLHVIEDVLEVFDEFHVIPFLDLLMGCVVRVLGSCTSSLESAKSCGYSLVENYSNVNLNV 1129

Query: 686  DDIDG--EHDIMTSTAVKQFKELRSLCLKIISSALGKYENHDFGSDFWDRFFVALRPLIA 859
            + DG + IMTSTAVKQ K+LR+L LKIIS AL KYE+HDFG +FWD FF +++PL+
Sbjct: 1130 PEKDGVVANPIMTSTAVKQLKDLRALTLKIISLALNKYEDHDFGYEFWDLFFTSVKPLVD 1189

Query: 860  GFKLEGASSKKPSSLFSCFLAMSRSFKFVHILFREKDLVPDIFSMFTITTASDAIISSVF 1039
            GFK EG+SS+KPSSLFSCF+AMSRS V +L+REK+LV DIFS+ T+TTAS+AIIS V
Sbjct: 1190 GFKQEGSSSEKPSSLFSCFVAMSRSHNLVSLLYREKNLVADIFSILTVTTASEAIISCVL 1249

Query: 1040 KFVXXXXXXXXXXXXXXXSVREVLHPHVNILVNSLQSLFTHGSRTKRKL----AENELRV 1207
            KF+ ++++VL P++ L+ SL LF + TKRKL E ELR+
Sbjct: 1250 KFIENLLNLDSELDDEDVTIKKVLLPNIETLICSLHCLFQSCNATKRKLVKYPGETELRI 1309

Query: 1208 FKLLSHHIKEPLEAKKFIDILLPLLAKRYWSSDSYVDLLQMIQHVIEVTGVENSTRILIS 1387
            FKLLS +IK+PL+A+KFID LLP L K+ +SD+ V+ LQ+I+ +I V+G E S +IL +
Sbjct: 1310 FKLLSKYIKDPLQARKFIDNLLPFLGKKAQNSDACVEALQVIRDIIPVSGSETSPKILNA 1369

Query: 1388 VYPLLTFAELEVRKSICDV 1444
            V PLL A L++R +ICD+
Sbjct: 1370 VSPLLISAGLDMRLAICDL 1388

 Score = 137 bits (344), Expect = 9e-30
 Identities = 72/134 (53%), Positives = 88/134 (65%)
 Frame = +3

Query: 3    SQLLKDVLEYRLLDETDAELQLKVLDCLLIWKDEFLVPYGQHLMDLINPRNLREELTTWS 182
            SQ LKDVL+ RLLDE DAE+Q++VLDCLL WKD FL+PY QHL +LI+ +NLREELTTWS
Sbjct: 890  SQFLKDVLQNRLLDENDAEIQMQVLDCLLFWKDNFLLPYDQHLKNLISSKNLREELTTWS 949

Query: 183  LSRESNQIDYEHXXXXXXXXXXXXXXXXXXPKGPPSRKHVSVQKRREILGFLAELDVEEL 362
            LSRESN ++ +H K SRKH SV R+ +L F+A+LDV EL
Sbjct: 950  LSRESNLVEEQHRTCLVPVVIRLLVPKVRKLKTLASRKHTSVHHRKAVLAFIAQLDVNEL 1009

Query: 363  PYFLPC**ILCKPL 404
            F +L KPL
Sbjct: 1010 ALFFA---MLLKPL 1020

>emb|CAN75046.1| hypothetical protein VITISV_023142 [Vitis vinifera]
          Length = 2461

 Score = 339 bits (870), Expect = 9e-91
 Identities = 194/406 (47%), Positives = 258/406 (63%), Gaps = 45/406 (11%)
 Frame = +2

Query: 362  SLFFALLINPLQTTSEGVDVTHKYHWSLPESIKDEF------------DIKGLSWMKRFG 505
            +LFFA+L+ PL + S+G D T + WS E+ ++F +I LSW KR+G
Sbjct: 734  ALFFAMLLKPLLSISKGSDTTADWFWSSHENYMNDFQAFNVLKFFTVDNINSLSWKKRYG 793

Query: 506  FLHVVEDVLVTFDESHLNPFLDLLMGCVVRILESCTGTLVCTKFKEVSLADFGFNSGASQ 685
            FLHV+EDVL FDE H+ PFLDLLMGCVVR+L SCT +L K SL + N +
Sbjct: 794  FLHVIEDVLEVFDEFHVIPFLDLLMGCVVRVLGSCTSSLESAKSCGYSLVENYSNVNLNV 853

Query: 686  DDIDG--EHDIMTSTAVKQFKELRSLCLKIISSALGKYENHDFGSDFWDRFFVALRPLIA 859
            + DG + IMTSTAVKQ K+LR+L LKIIS AL KYE+HDFG +FWD FF +++PL+
Sbjct: 854  PEKDGVVANPIMTSTAVKQLKDLRALTLKIISLALNKYEDHDFGYEFWDLFFTSVKPLVD 913

Query: 860  GFKLEGASSKKPSSLFSCFLAMSRSFKFVHILFREKDLVPDIFSMFTITTASDAIISSVF 1039
            GFK EG+SS+KPSSLFSCF+AMSRS V +L+REK+LV DIFS+ T+TTAS+AIIS V
Sbjct: 914  GFKQEGSSSEKPSSLFSCFVAMSRSHNLVSLLYREKNLVADIFSILTVTTASEAIISCVL 973

Query: 1040 KFVXXXXXXXXXXXXXXXSVREVLHPHVNILVNSLQSLFTHGSRTK-------------- 1177
            KF+ ++++VL P++ L+ SL LF + TK
Sbjct: 974  KFIENLLNLDSELDDEDVTIKKVLLPNIETLICSLHCLFQSCNATKSDISCAYGIMILWL 1033

Query: 1178 -------------RKL----AENELRVFKLLSHHIKEPLEAKKFIDILLPLLAKRYWSSD 1306
            RKL E ELR+FKLLS +IK+PL+A+KFID LLP L K+ +SD
Sbjct: 1034 NELSLWLTFLDGNRKLVKYPGETELRIFKLLSKYIKDPLQARKFIDNLLPFLGKKAQNSD 1093

Query: 1307 SYVDLLQMIQHVIEVTGVENSTRILISVYPLLTFAELEVRKSICDV 1444
            + V+ LQ+I+ +I V+G E S +IL +V PLL A L++R +ICD+
Sbjct: 1094 ACVEALQVIRDIIPVSGSETSPKILNAVSPLLISAGLDMRLAICDL 1139

 Score = 124 bits (312), Expect = 5e-26
 Identities = 65/124 (52%), Positives = 80/124 (64%)
 Frame = +3

Query: 33   RLLDETDAELQLKVLDCLLIWKDEFLVPYGQHLMDLINPRNLREELTTWSLSRESNQIDY 212
            RLLDE DAE+Q++VLDCLL WKD FL+PY QHL +LI+ +NLREELTTWSLSRESN ++
Sbjct: 624  RLLDENDAEIQMQVLDCLLFWKDNFLLPYDQHLKNLISSKNLREELTTWSLSRESNLVEE 683

Query: 213  EHXXXXXXXXXXXXXXXXXXPKGPPSRKHVSVQKRREILGFLAELDVEELPYFLPC**IL 392
            +H K SRKH SV R+ +L F+A+LDV EL F +L
Sbjct: 684  QHRTCLVPVVIRLLVPKVRKLKTLASRKHTSVHHRKAVLAFIAQLDVNELALFFA---ML 740

Query: 393  CKPL 404
            KPL
Sbjct: 741  LKPL 744

>ref|XP_002518041.1| conserved hypothetical protein [Ricinus communis]
 gi|223542637|gb|EEF44174.1| conserved hypothetical protein [Ricinus communis]
          Length = 2535

 Score = 310 bits (793), Expect = 8e-82
 Identities = 175/379 (46%), Positives = 238/379 (62%), Gaps = 19/379 (5%)
 Frame = +2

Query: 362  SLFFALLINPLQTTSEGVDVTHKYHWSLPESIKDEF------------DIKGLSWMKRFG 505
            SLFFALLI PL S G + T WSLP++ E +I LSW K++G
Sbjct: 1004 SLFFALLIKPLHIISNGANSTMGMFWSLPKNSTVELQPLNILKYFTLENIMALSWKKKYG 1063

Query: 506  FLHVVEDVLVTFDESHLNPFLDLLMGCVVRILESCTGTLVCTKFKEVSLADFGFNSGASQ 685
            FLHV+ED+L FDESH+ PFLDLLMGCV+R+L+SCT +L K N +
Sbjct: 1064 FLHVIEDILGVFDESHIRPFLDLLMGCVIRMLKSCTSSLDVAKATGTE-GHSSVNVQLHK 1122

Query: 686  DDIDGEHDIMTSTAVKQFKELRSLCLKIISSALGKYENHDFGSDFWDRFFVALRPLIAGF 865
            DD + + TA+KQ ++LRSLCLKI+S L KY++HDFG D WD FF +++ L+ GF
Sbjct: 1123 DDSAAVNKSLVITALKQLRDLRSLCLKIVSVVLNKYDDHDFGCDLWDMFFASVKSLVDGF 1182

Query: 866  KLEGASSKKPSSLFSCFLAMSRSFKFVHILFREKDLVPDIFSMFTITTASDAIISSVFKF 1045
            K EG SS+KPSSLFSCFLAMS S V +L RE +LVPDIFS+ T+TTAS+AI S V KF
Sbjct: 1183 KQEGCSSEKPSSLFSCFLAMSSSHHLVPLLSREMNLVPDIFSILTVTTASEAIRSCVLKF 1242

Query: 1046 VXXXXXXXXXXXXXXXSVREVLHPHVNILVNSLQSLFTHGSRTKR---KLA----ENELR 1204
            + V++VL P+++ L++SL F TK KLA E +R
Sbjct: 1243 I-DNLLNLDEELDEDNKVKDVLLPNLDQLISSLHCFFQGNRATKSYTGKLAKYPEEIHIR 1301

Query: 1205 VFKLLSHHIKEPLEAKKFIDILLPLLAKRYWSSDSYVDLLQMIQHVIEVTGVENSTRILI 1384
            +FK+LS +I++ L++ KF+D+LLP LAKR S + V+ LQ+I+ +I V G E++ +IL
Sbjct: 1302 MFKMLSKYIRDQLQSNKFLDVLLPSLAKRSKDSGASVECLQVIRDIIPVLGNESTAKILN 1361

Query: 1385 SVYPLLTFAELEVRKSICD 1441
            ++ PLL EL+ R +IC+
Sbjct: 1362 AISPLLISVELDTRLNICE 1380

 Score = 116 bits (291), Expect = 1e-23
 Identities = 61/124 (49%), Positives = 78/124 (62%)
 Frame = +3

Query: 33   RLLDETDAELQLKVLDCLLIWKDEFLVPYGQHLMDLINPRNLREELTTWSLSRESNQIDY 212
            +L+DE DAE+Q++VLDCLL WKD+FL+PY HL +LI+ ++LREELTTWSLSRES I+
Sbjct: 894  QLMDENDAEIQMRVLDCLLTWKDDFLLPYEGHLRNLISSKHLREELTTWSLSRESLLIEE 953

Query: 213  EHXXXXXXXXXXXXXXXXXXPKGPPSRKHVSVQKRREILGFLAELDVEELPYFLPC**IL 392
            H PK SRKH S R+ +L F+AELDV E+ F +L
Sbjct: 954  SHRANLLPLIIFLLIPKVRKPKTLASRKHTSAHHRKAVLRFIAELDVNEISLFFA---LL 1010

Query: 393  CKPL 404
            KPL
Sbjct: 1011 IKPL 1014

>ref|XP_003601650.1| Small subunit processome component-like protein [Medicago truncatula]
 gi|355490698|gb|AES71901.1| Small subunit processome component-like protein [Medicago truncatula]
          Length = 2733

 Score = 296 bits (759), Expect = 7e-78
 Identities = 171/380 (45%), Positives = 232/380 (61%), Gaps = 20/380 (5%)
 Frame = +2

Query: 365  LFFALLINPLQTTSEGVDVTHKYHWSLPESIKDEF------------DIKGLSWMKRFGF 508
            LFFALLI PLQ E D W+LP EF +I LSW K++GF
Sbjct: 1048 LFFALLIKPLQIV-EKTDGPANLFWTLPIGCTSEFQASSLLEYFTLDNIATLSWKKKYGF 1106

Query: 509  LHVVEDVLVTFDESHLNPFLDLLMGCVVRILESCTGTLVCTKFKEVSLADFGFNSGASQD 688
            LHV+ED++ FDE H+ PFLDLL+GCVVR+LESCT +L VS NS S
Sbjct: 1107 LHVIEDIVGVFDELHIRPFLDLLVGCVVRLLESCTLSLDNVNLNGVSSNQH--NSSTSPI 1164

Query: 689  DIDGE----HDIMTSTAVKQFKELRSLCLKIISSALGKYENHDFGSDFWDRFFVALRPLI 856
            + GE + I+ Q K++RSLCLKI+S + KYE+H+FGSDFWDRFF + +PLI
Sbjct: 1165 TLSGESVPENQILIGNTSNQLKDMRSLCLKIVSRVVHKYEDHEFGSDFWDRFFSSAKPLI 1224

Query: 857  AGFKLEGASSKKPSSLFSCFLAMSRSFKFVHILFREKDLVPDIFSMFTITTASDAIISSV 1036
            FK E ASS+KPSSL SCFLAMS + K V +L RE+ L+PDIFS+ ++ +AS+AI+ V
Sbjct: 1225 NKFKHEAASSEKPSSLLSCFLAMSANHKLVALLCREESLIPDIFSIVSVNSASEAIVYCV 1284

Query: 1037 FKFVXXXXXXXXXXXXXXXSVREVLHPHVNILVNSLQSLFTHGSRTKRKL----AENELR 1204
            KFV S +VL ++ +L++S+ LF + KRKL E +R
Sbjct: 1285 LKFVENLLSLDNQLDYEDSSAHKVLLSNIEVLMDSICCLFGSDNAAKRKLIKSPGETVIR 1344

Query: 1205 VFKLLSHHIKEPLEAKKFIDILLPLLAKRYWSSDSYVDLLQMIQHVIEVTGVENSTRILI 1384
            +FK L +IKE AK+F+DILL L K+ SSD +++LQ+IQ++I + G ++ +IL
Sbjct: 1345 IFKFLPKYIKEAEFAKRFVDILLLFLEKKTQSSDVCIEVLQVIQNIIPILGNGSTAKILS 1404

Query: 1385 SVYPLLTFAELEVRKSICDV 1444
            +V PL AEL++R ICD+
Sbjct: 1405 AVSPLYISAELDMRLRICDL 1424

 Score = 112 bits (280), Expect = 2e-22
 Identities = 63/135 (46%), Positives = 82/135 (60%)
 Frame = +3

Query: 3    SQLLKDVLEYRLLDETDAELQLKVLDCLLIWKDEFLVPYGQHLMDLINPRNLREELTTWS 182
            SQ LK++L L++E D E+Q +VLDCLLIWKD++ +PY +HL++LI+ + REELTTWS
Sbjct: 930  SQFLKEIL---LIEEDDPEIQFRVLDCLLIWKDDYFLPYTEHLINLISYKITREELTTWS 986

Query: 183  LSRESNQIDYEHXXXXXXXXXXXXXXXXXXPKGPPSRKHVSVQKRREILGFLAELDVEEL 362
            LSRES I+ H KG SRK S+ R+ IL F+A LD EL
Sbjct: 987  LSRESKMIEECHRAYLVPLVIRLLMPKVRKLKGLASRKKASICHRKAILSFIAGLDTTEL 1046

Query: 363  PYFLPC**ILCKPLQ 407
            P F +L KPLQ
Sbjct: 1047 PLFFA---LLIKPLQ 1058

>ref|XP_004167386.1| PREDICTED: LOW QUALITY PROTEIN: small subunit processome component 20 homolog, partial [Cucumis sativus]
          Length = 2538

 Score = 291 bits (744), Expect = 4e-76
 Identities = 163/377 (43%), Positives = 237/377 (62%), Gaps = 17/377 (4%)
 Frame = +2

Query: 365  LFFALLINPLQTTSEGVDVTHKYHWSL--------PESIKDEFD---IKGLSWMKRFGFL 511
            LFF+LL+ PL D T + +L +I F I LSW K++GF+
Sbjct: 915  LFFSLLLKPLNIIPREADATANWFSNLHLVSMKASATNILKYFSTESIVALSWKKKYGFM 974

Query: 512  HVVEDVLVTFDESHLNPFLDLLMGCVVRILESCTGTLVCTKFKEVSLADFGFNSGASQDD 691
            HV+E+VL FDE ++PFL++++GCVVRIL SCT +L + E+SL++ G + +
Sbjct: 975  HVIEEVLAVFDEMLISPFLNIILGCVVRILASCTSSLHAARHNEMSLSEIGKTCNKNSLE 1034

Query: 692  IDGEHDI--MTSTAVKQFKELRSLCLKIISSALGKYENHDFGSDFWDRFFVALRPLIAGF 865
            ++ E +T TAVKQ K+LRSLCL++IS L KYE+ DF +FWD FF +++ I F
Sbjct: 1035 MNKEAAFPGLTYTAVKQHKDLRSLCLRVISVVLYKYEDFDFEMEFWDLFFTSVKSSIESF 1094

Query: 866  KLEGASSKKPSSLFSCFLAMSRSFKFVHILFREKDLVPDIFSMFTITTASDAIISSVFKF 1045
            K EG+SS+KPSSL SCFLAMSRS K V +L RE++LVPDIF + TI+ AS II V +F
Sbjct: 1095 KHEGSSSEKPSSLCSCFLAMSRSHKLVPLLARERNLVPDIFFILTISAASQPIILFVLQF 1154

Query: 1046 VXXXXXXXXXXXXXXXSVREVLHPHVNILVNSLQSLFTHGSRTKRKLAEN----ELRVFK 1213
            + +VR +LHP+++ LV SL LF G KRKL E+ +R+FK
Sbjct: 1155 IENLLSFDGELDGNDSAVRSILHPNLDSLVQSLHVLFQSGDAKKRKLIEHLNGPMIRIFK 1214

Query: 1214 LLSHHIKEPLEAKKFIDILLPLLAKRYWSSDSYVDLLQMIQHVIEVTGVENSTRILISVY 1393
            LLS +++ L AKKF++I+LP L++ SS+ Y + LQ++Q+V+ + E++T+IL +V
Sbjct: 1215 LLSKVVRDQLHAKKFVEIILPCLSQTGRSSEFYANTLQVVQNVVPILRSESTTKILKAVS 1274

Query: 1394 PLLTFAELEVRKSICDV 1444
            PLL E ++R +CD+
Sbjct: 1275 PLLISVEQDLRLLVCDL 1291

 Score = 125 bits (314), Expect = 3e-26
 Identities = 67/134 (50%), Positives = 84/134 (62%)
 Frame = +3

Query: 3    SQLLKDVLEYRLLDETDAELQLKVLDCLLIWKDEFLVPYGQHLMDLINPRNLREELTTWS 182
            S LK+VLE RLLD+ DAE+Q KVLDCLL+WKD+FL+ + QHL ++I+P+ LREELT WS
Sbjct: 794  SDFLKEVLEQRLLDDNDAEIQSKVLDCLLMWKDDFLISHEQHLKNIISPKTLREELTRWS 853

Query: 183  LSRESNQIDYEHXXXXXXXXXXXXXXXXXXPKGPPSRKHVSVQKRREILGFLAELDVEEL 362
            LS+E NQID H K SRK SV R+ +L F+A+LD EL
Sbjct: 854  LSKEKNQIDERHRPKLVPLVTRLLMPKVRKLKVLGSRKQASVNLRKAVLQFIAQLDTVEL 913

Query: 363  PYFLPC**ILCKPL 404
            P F +L KPL
Sbjct: 914  PLFFS---LLLKPL 924