BLASTX nr result
ID: Cephaelis21_contig00021548
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cephaelis21_contig00021548 (1845 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003607398.1| hypothetical protein MTR_4g077590 [Medicago ... 97 2e-17 ref|NP_186963.1| uncharacterized protein [Arabidopsis thaliana] ... 94 1e-16 gb|AAU44469.1| hypothetical protein AT3G03130 [Arabidopsis thali... 94 1e-16 ref|XP_002520009.1| conserved hypothetical protein [Ricinus comm... 91 8e-16 ref|XP_003538933.1| PREDICTED: uncharacterized protein LOC100793... 84 1e-13 >ref|XP_003607398.1| hypothetical protein MTR_4g077590 [Medicago truncatula] gi|355508453|gb|AES89595.1| hypothetical protein MTR_4g077590 [Medicago truncatula] Length = 666 Score = 96.7 bits (239), Expect = 2e-17 Identities = 118/539 (21%), Positives = 227/539 (42%), Gaps = 40/539 (7%) Frame = -1 Query: 1692 FHTLNRRQLQTLCKKNQIPANITNVAMADALSALPSVQGMEELSIDTP----ATPQLITT 1525 FHTL+R++LQ L K N+IPANITNVAMADALSALP V+G++E+ TP + Sbjct: 3 FHTLSRKELQALSKMNKIPANITNVAMADALSALPHVEGLDEILNQREGGDIGTPAVQPR 62 Query: 1524 TARRTRSFKHESDDFLHPPTTIRTTRHSSARDAHHQHQPHPTSLNI-------------- 1387 TARRT + + + + R R A + + N+ Sbjct: 63 TARRTTTQRKPVKEAESTKVSTRVNRGGRGGVAEGEVEQENLDANVDAGTPAVVPTSRRR 122 Query: 1386 ---INTALDFNLVEEEQQDENDANIRNEKQNLSHYTPAAVPSSRQRAALTLK-----SQE 1231 ++T ++ E +D+ + ++ + +++ TPAA PSSR RA +++ S Sbjct: 123 VPAVSTRRKKEVIVIEDEDDVVSEVQGKATDVAK-TPAAAPSSRTRAGRSVRNKTEISDG 181 Query: 1230 TKVQ---GTRRSARLAARNKQQENNTNDQISQTVLNMDLGQVMEINVKGSVDADVDSVAK 1060 T VQ TRRS RL ++ + + + + ++ N D+ + M ++ + ++ A Sbjct: 182 TSVQKAYSTRRSVRLVGKSLSKMSLADTEDMESTKNDDVSEEMSVSQNEGGSIETENGAS 241 Query: 1059 SFGEVDLXXXXXXXXXXXNIAAVVSDCSAAKDNIKFETGFNEVSKDGYGCKTENNSFENG 880 S E ++ ++ +DC + + E + ++D + EN Sbjct: 242 SQTESNVVSQNTDEVEVSSLNK--ADCESQSHDSGSEVKSTD-AEDVLQADPKEEGSENV 298 Query: 879 KELDINDVVTTDDVQEILTD--QSGDAFSIDNGVDLNMSKEELESKQDF--GQSHGMDLF 712 ++++ ++ ++Q+ S +A S + E+E+K+ F Q M+L Sbjct: 299 NHVEVSREDSSLNLQDSFETCADSNEAGSEQLEPEKTSDSAEIENKECFVAEQDQAMELA 358 Query: 711 QTEAKNL-----IEEGIALDDEKDTNLEDLNPLRICNAVSQRD-ESVDTEIEPKAEKQDA 550 +E ++ E + + D+ +L P V +D + E +A K+ A Sbjct: 359 ASEEVSVEIAASEEVSVEIADQTIASLTVAEPEDAFVDVPNQDVAGLSLEASEEAYKEIA 418 Query: 549 WIIKPAIEPKVEDDLVNNKLAETLSAMELNTSNSKYELSWVKATGFGLEPSMDIKEEMVK 370 ++ + V DD + L + ++ M + + EE+ Sbjct: 419 DLVIAPLNVVVPDDACGDDLDQDVADMSVVLPE-------------------ESSEEITH 459 Query: 369 DLSGDPAVSVNDVTLEQLDGDYEVLEI-QADPREEEDLKNNSSVEFSNKKQDATAVDQV 196 V + T+E +++V E+ + +P++ E + + VE D+ A ++V Sbjct: 460 HAIAPETAVVPNGTIETSSEEHQVEEVFEPEPKKVECVSSAILVEKDGTSGDSGAENEV 518 >ref|NP_186963.1| uncharacterized protein [Arabidopsis thaliana] gi|6714423|gb|AAF26111.1|AC012328_14 hypothetical protein [Arabidopsis thaliana] gi|61742693|gb|AAX55167.1| hypothetical protein At3g03130 [Arabidopsis thaliana] gi|332640384|gb|AEE73905.1| uncharacterized protein [Arabidopsis thaliana] Length = 520 Score = 94.0 bits (232), Expect = 1e-16 Identities = 119/474 (25%), Positives = 193/474 (40%), Gaps = 56/474 (11%) Frame = -1 Query: 1692 FHTLNRRQLQTLCKKNQIPANITNVAMADALSALPSVQGMEELSIDTPATPQLITTTARR 1513 FH+L RR LQ CK+N+IPAN+TN+AMADAL L V+GM+E P+ Q T+ AR Sbjct: 3 FHSLPRRDLQFFCKRNKIPANMTNIAMADALRDLEIVEGMDEFM--DPSRDQSPTSVARN 60 Query: 1512 TRSFKHESDDFLHPPTTIRTTRHSSARDAHHQHQPHPTSLNIINTAL---------DFNL 1360 S T RTTR S +D + S +++ +L D N+ Sbjct: 61 LPSAAR---------TAARTTRRKSTKDETQSSELVTRSCYVVSKSLAGEMDQENKDMNM 111 Query: 1359 VEEEQQDEN-----DANIRNEKQNLSHYTPAAVPSSRQRAALTLKSQET--KVQGTRRSA 1201 ++ ++ D N + N+S TPAA + R +AA + K E+ +V TRRS Sbjct: 112 LQNPSVPQSRAVKLDVNDIMPEANVSK-TPAARSTRRAQAAASSKKDESVQRVYSTRRSV 170 Query: 1200 RLAARN--------------------------KQQENNTNDQISQTVLNMDLGQVMEINV 1099 RL + K EN+ N + DL +E Sbjct: 171 RLLEESMADLSLKTNVPVKKHEDSPAGSKFQAKSDENSENTDKGGVMSGRDLNDSLEKEW 230 Query: 1098 KGSV-DADVDSVAKSFGEVDLXXXXXXXXXXXNIAAVVSDCSAAKDNIKFETGFNEVSKD 922 GS D D+D + G++ ++ S +A D+ +D Sbjct: 231 DGSKNDPDLDILYGDLGDITF---FDASTSKEHLNRTDSSTVSASDSFVLVNEHETSQED 287 Query: 921 GY------GCKTENNSFENGKELDINDVVTTDDVQEILTDQSGDAFSIDN-GVDLNMSKE 763 G+ T N+ KE + + + + T+ D + D+ GV ++ ++E Sbjct: 288 GFVVVDHATSTTTTNTLACNKESEPEQMKIDSESESEETEYETDPWEGDDFGVAVHTNQE 347 Query: 762 ELESKQDFGQSHGMDLFQTEAKNLIEEGIALDDEKDTNLEDLNPLRICNAVSQRDESVDT 583 ESK S + + A LI D+ K+ + +PL + DE D Sbjct: 348 AFESK--VSASDNVSKVDSVATVLI-----ADESKELDFSS-SPLAVEELEEDSDEWSDY 399 Query: 582 EIEPKAEKQDAWIIKPAIEPKVEDDLVNNK------LAETLSAMELNTSNSKYE 439 EI ++++ + +IE + E+ V++K + +L+ E TS S +E Sbjct: 400 EIGEVELEENSCGSEESIEIESEEAPVSDKKTPASSSSSSLAGNETRTSLSPFE 453 >gb|AAU44469.1| hypothetical protein AT3G03130 [Arabidopsis thaliana] Length = 520 Score = 94.0 bits (232), Expect = 1e-16 Identities = 119/474 (25%), Positives = 193/474 (40%), Gaps = 56/474 (11%) Frame = -1 Query: 1692 FHTLNRRQLQTLCKKNQIPANITNVAMADALSALPSVQGMEELSIDTPATPQLITTTARR 1513 FH+L RR LQ CK+N+IPAN+TN+AMADAL L V+GM+E P+ Q T+ AR Sbjct: 3 FHSLPRRDLQFFCKRNKIPANMTNIAMADALRDLEIVEGMDEFM--DPSRDQSPTSVARN 60 Query: 1512 TRSFKHESDDFLHPPTTIRTTRHSSARDAHHQHQPHPTSLNIINTAL---------DFNL 1360 S T RTTR S +D + S +++ +L D N+ Sbjct: 61 LPSAAR---------TAARTTRRKSTKDETQSSELVTRSCYVVSKSLAGEMDQENKDMNM 111 Query: 1359 VEEEQQDEN-----DANIRNEKQNLSHYTPAAVPSSRQRAALTLKSQET--KVQGTRRSA 1201 ++ ++ D N + N+S TPAA + R +AA + K E+ +V TRRS Sbjct: 112 LQNPSVPQSRAVKLDVNDIMPEANVSK-TPAARXTRRAQAAASSKKDESVQRVYSTRRSV 170 Query: 1200 RLAARN--------------------------KQQENNTNDQISQTVLNMDLGQVMEINV 1099 RL + K EN+ N + DL +E Sbjct: 171 RLLEESMADLSLKTNVPVKKHEDSPAGSKFQAKSDENSENTDKGGVMSGRDLNDSLEKEW 230 Query: 1098 KGSV-DADVDSVAKSFGEVDLXXXXXXXXXXXNIAAVVSDCSAAKDNIKFETGFNEVSKD 922 GS D D+D + G++ ++ S +A D+ +D Sbjct: 231 DGSKNDPDLDILYGDLGDITF---FDASTSKEHLNRTDSSTVSASDSFVLVNEHETSQED 287 Query: 921 GY------GCKTENNSFENGKELDINDVVTTDDVQEILTDQSGDAFSIDN-GVDLNMSKE 763 G+ T N+ KE + + + + T+ D + D+ GV ++ ++E Sbjct: 288 GFVVVDHATSTTTTNTLACNKESEPEQMKIDSESESEETEYETDPWEGDDFGVAVHTNQE 347 Query: 762 ELESKQDFGQSHGMDLFQTEAKNLIEEGIALDDEKDTNLEDLNPLRICNAVSQRDESVDT 583 ESK S + + A LI D+ K+ + +PL + DE D Sbjct: 348 AFESK--VSASDNVSKVDSVATVLI-----ADESKELDFSS-SPLAVEELEEDSDEWSDY 399 Query: 582 EIEPKAEKQDAWIIKPAIEPKVEDDLVNNK------LAETLSAMELNTSNSKYE 439 EI ++++ + +IE + E+ V++K + +L+ E TS S +E Sbjct: 400 EIGEVELEENSCGSEESIEIESEEAPVSDKKTPASSSSSSLAGNETRTSLSPFE 453 >ref|XP_002520009.1| conserved hypothetical protein [Ricinus communis] gi|223540773|gb|EEF42333.1| conserved hypothetical protein [Ricinus communis] Length = 737 Score = 91.3 bits (225), Expect = 8e-16 Identities = 129/571 (22%), Positives = 243/571 (42%), Gaps = 65/571 (11%) Frame = -1 Query: 1692 FHTLNRRQLQTLCKKNQIPANITNVAMADALSALPSVQGMEELSIDTPAT---------- 1543 FH+L R++LQ LCKKN+IPAN+TNVAMADAL AL V G++E+ I+ P + Sbjct: 3 FHSLARKELQALCKKNKIPANMTNVAMADALKALEKVDGLDEV-INAPRSDPQQSPEKTG 61 Query: 1542 ---PQLITTTARRTRSFKHESDDFLHPPTTIRTTRHSSARDAHHQHQPHPTSLN--IINT 1378 P+ + T+ R + E + P T RTT+ +SA + Q + L ++T Sbjct: 62 NPEPRTVCRTSTRRKPINVEPESSQLPTRTRRTTKKTSAAEEAEQENNNENLLETPAVST 121 Query: 1377 A------------LDFNLVEEEQQDENDANIRNEKQNLSHYTPAAVPSSRQRAAL--TLK 1240 + +D L+E + ++ A + EK ++ TP A+ SSR +A + T K Sbjct: 122 SRRRVTAASARRKIDTQLMESVEDEK--AAVGEEKSDVPE-TP-AIRSSRSKAPVVSTKK 177 Query: 1239 SQETK----VQGTRRSARLAARN----KQQENNTNDQISQTVLNMDLGQVMEINVKGSVD 1084 E K V GTR S RL ++ +E T + + L + V + D Sbjct: 178 KIEEKSVQRVYGTRHSVRLLEKSLADLSVKEKRTVEVVKIEGLCEETDHVEQQKGVPGGD 237 Query: 1083 ADVDSVAKSFGEVDLXXXXXXXXXXXNIA---AVVSDCSAAKDNIKFETGFNEVSK--DG 919 +++D ++ GE+ + A + S + N+ +G + K D Sbjct: 238 SEIDESLENEGELKHEFQEENKTITDHEVTDYAKLEIGSESCTNLDSHSGLDAEDKDDDS 297 Query: 918 YGCKTENNSFENGKELDINDVVTTDDVQEILTDQSGDAFSIDNGVDLNMSKEELESKQDF 739 G + + LD+ND ++ +++ + ++ S+ ++ +E +++ Sbjct: 298 SGESLLRQVETSDRALDMNDEPIHENGPDVVITE--NSHSVTAALEPETEREVTDNQDSL 355 Query: 738 GQSHGMD--LFQTEAKNL---------IEEGIALDDEKDTNLEDLNPLRI---------C 619 D F EA ++ +E + L K + +E + + C Sbjct: 356 VAQVSDDSVAFIMEADHISIVNATDEVSDEVVDLVTPKVSEVEGQVSMEVRNLSEVVSEC 415 Query: 618 NAVSQRDESVDTEIEPKAEKQDAWIIKPAIEPKVEDDLVNNKLAETLSAMELNTSNSKYE 439 + ++ +++ V + E + I A+EP++E +++ N+ + + A + + +++ Sbjct: 416 SKMNSKEDEVHGSYDMVTENSETVI--AALEPEIEKEMIENRDSLVVQASDDSAMETEH- 472 Query: 438 LSWVKATGFGLEPSMDIKEEMVKDLSGDPAVSVNDVTL---EQLDGDYEVLEIQADPREE 268 +S V A +D+ V ++ G V V D++ E + + + D E Sbjct: 473 ISIVNAATEVSVEVVDLLNPKVSEVEGQVCVEVMDLSAVVGESSEMNSMEDKQHLDAASE 532 Query: 267 EDLKNNSSVEFSNKKQDATAVDQVVTTPKLS 175 ED + E S+ + + D VT K S Sbjct: 533 EDSDGDDIEEESDGYETDSICDSNVTEAKES 563 >ref|XP_003538933.1| PREDICTED: uncharacterized protein LOC100793550 [Glycine max] Length = 722 Score = 84.3 bits (207), Expect = 1e-13 Identities = 74/237 (31%), Positives = 111/237 (46%), Gaps = 22/237 (9%) Frame = -1 Query: 1692 FHTLNRRQLQTLCKKNQIPANITNVAMADALSALPSVQGMEEL------SIDTPATPQLI 1531 FHTL+R+QLQ LCKKN+IPANITNVAMADAL+AL V+G+++ + TP+ Sbjct: 3 FHTLSRKQLQALCKKNKIPANITNVAMADALAALNQVEGLDDFFNPSEGDVGTPSVNHRT 62 Query: 1530 TTTARRTRSFKHESDDFLHPPTTIRTTR------HSSARDAHHQHQPHPTSLNIINTALD 1369 R E + L T+ R R +DA + P + TA+ Sbjct: 63 VVRTSTQRKAAIEEAEGLKVKTSTRRVRVAEEVVEQENKDA-NAPPITPAASRRRATAVS 121 Query: 1368 FNLVEEEQQDENDANIRNEKQNLSHYTPAAV-PSSRQRAAL---------TLKSQETKVQ 1219 +E + E DA ++ + TPAAV P SR+RA T + T V Sbjct: 122 TRRKKEVEMVEEDAGVQGNPK-----TPAAVAPVSRRRATSRSVCTTKIETPGAHGTSVY 176 Query: 1218 GTRRSARLAARNKQQENNTNDQISQTVLNMDLGQVMEINVKGSVDADVDSVAKSFGE 1048 TRRS RL ++ + + + + + ++ +D G V + + S + DS G+ Sbjct: 177 NTRRSVRLLEKDLSKMSLLDTEDTTGLVKID-GDVSQDSSNVSHQLEEDSSGNEKGD 232