BLASTX nr result

ID: Cephaelis21_contig00021548 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00021548
         (1845 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003607398.1| hypothetical protein MTR_4g077590 [Medicago ...    97   2e-17
ref|NP_186963.1| uncharacterized protein [Arabidopsis thaliana] ...    94   1e-16
gb|AAU44469.1| hypothetical protein AT3G03130 [Arabidopsis thali...    94   1e-16
ref|XP_002520009.1| conserved hypothetical protein [Ricinus comm...    91   8e-16
ref|XP_003538933.1| PREDICTED: uncharacterized protein LOC100793...    84   1e-13

>ref|XP_003607398.1| hypothetical protein MTR_4g077590 [Medicago truncatula]
            gi|355508453|gb|AES89595.1| hypothetical protein
            MTR_4g077590 [Medicago truncatula]
          Length = 666

 Score = 96.7 bits (239), Expect = 2e-17
 Identities = 118/539 (21%), Positives = 227/539 (42%), Gaps = 40/539 (7%)
 Frame = -1

Query: 1692 FHTLNRRQLQTLCKKNQIPANITNVAMADALSALPSVQGMEELSIDTP----ATPQLITT 1525
            FHTL+R++LQ L K N+IPANITNVAMADALSALP V+G++E+          TP +   
Sbjct: 3    FHTLSRKELQALSKMNKIPANITNVAMADALSALPHVEGLDEILNQREGGDIGTPAVQPR 62

Query: 1524 TARRTRSFKHESDDFLHPPTTIRTTRHSSARDAHHQHQPHPTSLNI-------------- 1387
            TARRT + +    +      + R  R      A  + +      N+              
Sbjct: 63   TARRTTTQRKPVKEAESTKVSTRVNRGGRGGVAEGEVEQENLDANVDAGTPAVVPTSRRR 122

Query: 1386 ---INTALDFNLVEEEQQDENDANIRNEKQNLSHYTPAAVPSSRQRAALTLK-----SQE 1231
               ++T     ++  E +D+  + ++ +  +++  TPAA PSSR RA  +++     S  
Sbjct: 123  VPAVSTRRKKEVIVIEDEDDVVSEVQGKATDVAK-TPAAAPSSRTRAGRSVRNKTEISDG 181

Query: 1230 TKVQ---GTRRSARLAARNKQQENNTNDQISQTVLNMDLGQVMEINVKGSVDADVDSVAK 1060
            T VQ    TRRS RL  ++  + +  + +  ++  N D+ + M ++       + ++ A 
Sbjct: 182  TSVQKAYSTRRSVRLVGKSLSKMSLADTEDMESTKNDDVSEEMSVSQNEGGSIETENGAS 241

Query: 1059 SFGEVDLXXXXXXXXXXXNIAAVVSDCSAAKDNIKFETGFNEVSKDGYGCKTENNSFENG 880
            S  E ++           ++    +DC +   +   E    + ++D      +    EN 
Sbjct: 242  SQTESNVVSQNTDEVEVSSLNK--ADCESQSHDSGSEVKSTD-AEDVLQADPKEEGSENV 298

Query: 879  KELDINDVVTTDDVQEILTD--QSGDAFSIDNGVDLNMSKEELESKQDF--GQSHGMDLF 712
              ++++   ++ ++Q+       S +A S     +      E+E+K+ F   Q   M+L 
Sbjct: 299  NHVEVSREDSSLNLQDSFETCADSNEAGSEQLEPEKTSDSAEIENKECFVAEQDQAMELA 358

Query: 711  QTEAKNL-----IEEGIALDDEKDTNLEDLNPLRICNAVSQRD-ESVDTEIEPKAEKQDA 550
             +E  ++      E  + + D+   +L    P      V  +D   +  E   +A K+ A
Sbjct: 359  ASEEVSVEIAASEEVSVEIADQTIASLTVAEPEDAFVDVPNQDVAGLSLEASEEAYKEIA 418

Query: 549  WIIKPAIEPKVEDDLVNNKLAETLSAMELNTSNSKYELSWVKATGFGLEPSMDIKEEMVK 370
             ++   +   V DD   + L + ++ M +                       +  EE+  
Sbjct: 419  DLVIAPLNVVVPDDACGDDLDQDVADMSVVLPE-------------------ESSEEITH 459

Query: 369  DLSGDPAVSVNDVTLEQLDGDYEVLEI-QADPREEEDLKNNSSVEFSNKKQDATAVDQV 196
                     V + T+E    +++V E+ + +P++ E + +   VE      D+ A ++V
Sbjct: 460  HAIAPETAVVPNGTIETSSEEHQVEEVFEPEPKKVECVSSAILVEKDGTSGDSGAENEV 518


>ref|NP_186963.1| uncharacterized protein [Arabidopsis thaliana]
            gi|6714423|gb|AAF26111.1|AC012328_14 hypothetical protein
            [Arabidopsis thaliana] gi|61742693|gb|AAX55167.1|
            hypothetical protein At3g03130 [Arabidopsis thaliana]
            gi|332640384|gb|AEE73905.1| uncharacterized protein
            [Arabidopsis thaliana]
          Length = 520

 Score = 94.0 bits (232), Expect = 1e-16
 Identities = 119/474 (25%), Positives = 193/474 (40%), Gaps = 56/474 (11%)
 Frame = -1

Query: 1692 FHTLNRRQLQTLCKKNQIPANITNVAMADALSALPSVQGMEELSIDTPATPQLITTTARR 1513
            FH+L RR LQ  CK+N+IPAN+TN+AMADAL  L  V+GM+E     P+  Q  T+ AR 
Sbjct: 3    FHSLPRRDLQFFCKRNKIPANMTNIAMADALRDLEIVEGMDEFM--DPSRDQSPTSVARN 60

Query: 1512 TRSFKHESDDFLHPPTTIRTTRHSSARDAHHQHQPHPTSLNIINTAL---------DFNL 1360
              S            T  RTTR  S +D     +    S  +++ +L         D N+
Sbjct: 61   LPSAAR---------TAARTTRRKSTKDETQSSELVTRSCYVVSKSLAGEMDQENKDMNM 111

Query: 1359 VEEEQQDEN-----DANIRNEKQNLSHYTPAAVPSSRQRAALTLKSQET--KVQGTRRSA 1201
            ++     ++     D N    + N+S  TPAA  + R +AA + K  E+  +V  TRRS 
Sbjct: 112  LQNPSVPQSRAVKLDVNDIMPEANVSK-TPAARSTRRAQAAASSKKDESVQRVYSTRRSV 170

Query: 1200 RLAARN--------------------------KQQENNTNDQISQTVLNMDLGQVMEINV 1099
            RL   +                          K  EN+ N      +   DL   +E   
Sbjct: 171  RLLEESMADLSLKTNVPVKKHEDSPAGSKFQAKSDENSENTDKGGVMSGRDLNDSLEKEW 230

Query: 1098 KGSV-DADVDSVAKSFGEVDLXXXXXXXXXXXNIAAVVSDCSAAKDNIKFETGFNEVSKD 922
             GS  D D+D +    G++             ++    S   +A D+           +D
Sbjct: 231  DGSKNDPDLDILYGDLGDITF---FDASTSKEHLNRTDSSTVSASDSFVLVNEHETSQED 287

Query: 921  GY------GCKTENNSFENGKELDINDVVTTDDVQEILTDQSGDAFSIDN-GVDLNMSKE 763
            G+         T  N+    KE +   +    + +   T+   D +  D+ GV ++ ++E
Sbjct: 288  GFVVVDHATSTTTTNTLACNKESEPEQMKIDSESESEETEYETDPWEGDDFGVAVHTNQE 347

Query: 762  ELESKQDFGQSHGMDLFQTEAKNLIEEGIALDDEKDTNLEDLNPLRICNAVSQRDESVDT 583
              ESK     S  +    + A  LI      D+ K+ +    +PL +       DE  D 
Sbjct: 348  AFESK--VSASDNVSKVDSVATVLI-----ADESKELDFSS-SPLAVEELEEDSDEWSDY 399

Query: 582  EIEPKAEKQDAWIIKPAIEPKVEDDLVNNK------LAETLSAMELNTSNSKYE 439
            EI     ++++   + +IE + E+  V++K       + +L+  E  TS S +E
Sbjct: 400  EIGEVELEENSCGSEESIEIESEEAPVSDKKTPASSSSSSLAGNETRTSLSPFE 453


>gb|AAU44469.1| hypothetical protein AT3G03130 [Arabidopsis thaliana]
          Length = 520

 Score = 94.0 bits (232), Expect = 1e-16
 Identities = 119/474 (25%), Positives = 193/474 (40%), Gaps = 56/474 (11%)
 Frame = -1

Query: 1692 FHTLNRRQLQTLCKKNQIPANITNVAMADALSALPSVQGMEELSIDTPATPQLITTTARR 1513
            FH+L RR LQ  CK+N+IPAN+TN+AMADAL  L  V+GM+E     P+  Q  T+ AR 
Sbjct: 3    FHSLPRRDLQFFCKRNKIPANMTNIAMADALRDLEIVEGMDEFM--DPSRDQSPTSVARN 60

Query: 1512 TRSFKHESDDFLHPPTTIRTTRHSSARDAHHQHQPHPTSLNIINTAL---------DFNL 1360
              S            T  RTTR  S +D     +    S  +++ +L         D N+
Sbjct: 61   LPSAAR---------TAARTTRRKSTKDETQSSELVTRSCYVVSKSLAGEMDQENKDMNM 111

Query: 1359 VEEEQQDEN-----DANIRNEKQNLSHYTPAAVPSSRQRAALTLKSQET--KVQGTRRSA 1201
            ++     ++     D N    + N+S  TPAA  + R +AA + K  E+  +V  TRRS 
Sbjct: 112  LQNPSVPQSRAVKLDVNDIMPEANVSK-TPAARXTRRAQAAASSKKDESVQRVYSTRRSV 170

Query: 1200 RLAARN--------------------------KQQENNTNDQISQTVLNMDLGQVMEINV 1099
            RL   +                          K  EN+ N      +   DL   +E   
Sbjct: 171  RLLEESMADLSLKTNVPVKKHEDSPAGSKFQAKSDENSENTDKGGVMSGRDLNDSLEKEW 230

Query: 1098 KGSV-DADVDSVAKSFGEVDLXXXXXXXXXXXNIAAVVSDCSAAKDNIKFETGFNEVSKD 922
             GS  D D+D +    G++             ++    S   +A D+           +D
Sbjct: 231  DGSKNDPDLDILYGDLGDITF---FDASTSKEHLNRTDSSTVSASDSFVLVNEHETSQED 287

Query: 921  GY------GCKTENNSFENGKELDINDVVTTDDVQEILTDQSGDAFSIDN-GVDLNMSKE 763
            G+         T  N+    KE +   +    + +   T+   D +  D+ GV ++ ++E
Sbjct: 288  GFVVVDHATSTTTTNTLACNKESEPEQMKIDSESESEETEYETDPWEGDDFGVAVHTNQE 347

Query: 762  ELESKQDFGQSHGMDLFQTEAKNLIEEGIALDDEKDTNLEDLNPLRICNAVSQRDESVDT 583
              ESK     S  +    + A  LI      D+ K+ +    +PL +       DE  D 
Sbjct: 348  AFESK--VSASDNVSKVDSVATVLI-----ADESKELDFSS-SPLAVEELEEDSDEWSDY 399

Query: 582  EIEPKAEKQDAWIIKPAIEPKVEDDLVNNK------LAETLSAMELNTSNSKYE 439
            EI     ++++   + +IE + E+  V++K       + +L+  E  TS S +E
Sbjct: 400  EIGEVELEENSCGSEESIEIESEEAPVSDKKTPASSSSSSLAGNETRTSLSPFE 453


>ref|XP_002520009.1| conserved hypothetical protein [Ricinus communis]
            gi|223540773|gb|EEF42333.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 737

 Score = 91.3 bits (225), Expect = 8e-16
 Identities = 129/571 (22%), Positives = 243/571 (42%), Gaps = 65/571 (11%)
 Frame = -1

Query: 1692 FHTLNRRQLQTLCKKNQIPANITNVAMADALSALPSVQGMEELSIDTPAT---------- 1543
            FH+L R++LQ LCKKN+IPAN+TNVAMADAL AL  V G++E+ I+ P +          
Sbjct: 3    FHSLARKELQALCKKNKIPANMTNVAMADALKALEKVDGLDEV-INAPRSDPQQSPEKTG 61

Query: 1542 ---PQLITTTARRTRSFKHESDDFLHPPTTIRTTRHSSARDAHHQHQPHPTSLN--IINT 1378
               P+ +  T+ R +    E +    P  T RTT+ +SA +   Q   +   L    ++T
Sbjct: 62   NPEPRTVCRTSTRRKPINVEPESSQLPTRTRRTTKKTSAAEEAEQENNNENLLETPAVST 121

Query: 1377 A------------LDFNLVEEEQQDENDANIRNEKQNLSHYTPAAVPSSRQRAAL--TLK 1240
            +            +D  L+E  + ++  A +  EK ++   TP A+ SSR +A +  T K
Sbjct: 122  SRRRVTAASARRKIDTQLMESVEDEK--AAVGEEKSDVPE-TP-AIRSSRSKAPVVSTKK 177

Query: 1239 SQETK----VQGTRRSARLAARN----KQQENNTNDQISQTVLNMDLGQVMEINVKGSVD 1084
              E K    V GTR S RL  ++      +E  T + +    L  +   V +       D
Sbjct: 178  KIEEKSVQRVYGTRHSVRLLEKSLADLSVKEKRTVEVVKIEGLCEETDHVEQQKGVPGGD 237

Query: 1083 ADVDSVAKSFGEVDLXXXXXXXXXXXNIA---AVVSDCSAAKDNIKFETGFNEVSK--DG 919
            +++D   ++ GE+             +     A +   S +  N+   +G +   K  D 
Sbjct: 238  SEIDESLENEGELKHEFQEENKTITDHEVTDYAKLEIGSESCTNLDSHSGLDAEDKDDDS 297

Query: 918  YGCKTENNSFENGKELDINDVVTTDDVQEILTDQSGDAFSIDNGVDLNMSKEELESKQDF 739
             G         + + LD+ND    ++  +++  +  ++ S+   ++    +E  +++   
Sbjct: 298  SGESLLRQVETSDRALDMNDEPIHENGPDVVITE--NSHSVTAALEPETEREVTDNQDSL 355

Query: 738  GQSHGMD--LFQTEAKNL---------IEEGIALDDEKDTNLEDLNPLRI---------C 619
                  D   F  EA ++          +E + L   K + +E    + +         C
Sbjct: 356  VAQVSDDSVAFIMEADHISIVNATDEVSDEVVDLVTPKVSEVEGQVSMEVRNLSEVVSEC 415

Query: 618  NAVSQRDESVDTEIEPKAEKQDAWIIKPAIEPKVEDDLVNNKLAETLSAMELNTSNSKYE 439
            + ++ +++ V    +   E  +  I   A+EP++E +++ N+ +  + A + +   +++ 
Sbjct: 416  SKMNSKEDEVHGSYDMVTENSETVI--AALEPEIEKEMIENRDSLVVQASDDSAMETEH- 472

Query: 438  LSWVKATGFGLEPSMDIKEEMVKDLSGDPAVSVNDVTL---EQLDGDYEVLEIQADPREE 268
            +S V A        +D+    V ++ G   V V D++    E  + +    +   D   E
Sbjct: 473  ISIVNAATEVSVEVVDLLNPKVSEVEGQVCVEVMDLSAVVGESSEMNSMEDKQHLDAASE 532

Query: 267  EDLKNNSSVEFSNKKQDATAVDQVVTTPKLS 175
            ED   +   E S+  +  +  D  VT  K S
Sbjct: 533  EDSDGDDIEEESDGYETDSICDSNVTEAKES 563


>ref|XP_003538933.1| PREDICTED: uncharacterized protein LOC100793550 [Glycine max]
          Length = 722

 Score = 84.3 bits (207), Expect = 1e-13
 Identities = 74/237 (31%), Positives = 111/237 (46%), Gaps = 22/237 (9%)
 Frame = -1

Query: 1692 FHTLNRRQLQTLCKKNQIPANITNVAMADALSALPSVQGMEEL------SIDTPATPQLI 1531
            FHTL+R+QLQ LCKKN+IPANITNVAMADAL+AL  V+G+++        + TP+     
Sbjct: 3    FHTLSRKQLQALCKKNKIPANITNVAMADALAALNQVEGLDDFFNPSEGDVGTPSVNHRT 62

Query: 1530 TTTARRTRSFKHESDDFLHPPTTIRTTR------HSSARDAHHQHQPHPTSLNIINTALD 1369
                   R    E  + L   T+ R  R          +DA +     P +     TA+ 
Sbjct: 63   VVRTSTQRKAAIEEAEGLKVKTSTRRVRVAEEVVEQENKDA-NAPPITPAASRRRATAVS 121

Query: 1368 FNLVEEEQQDENDANIRNEKQNLSHYTPAAV-PSSRQRAAL---------TLKSQETKVQ 1219
                +E +  E DA ++   +     TPAAV P SR+RA           T  +  T V 
Sbjct: 122  TRRKKEVEMVEEDAGVQGNPK-----TPAAVAPVSRRRATSRSVCTTKIETPGAHGTSVY 176

Query: 1218 GTRRSARLAARNKQQENNTNDQISQTVLNMDLGQVMEINVKGSVDADVDSVAKSFGE 1048
             TRRS RL  ++  + +  + + +  ++ +D G V + +   S   + DS     G+
Sbjct: 177  NTRRSVRLLEKDLSKMSLLDTEDTTGLVKID-GDVSQDSSNVSHQLEEDSSGNEKGD 232


Top