BLASTX nr result

ID: Cephaelis21_contig00031669

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00031669
         (1110 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAF97969.1|AC000103_19 F21J9.30 [Arabidopsis thaliana]             174   5e-41
gb|ABA97040.1| retrotransposon protein, putative, unclassified [...   170   5e-40
gb|EEC83784.1| hypothetical protein OsI_29682 [Oryza sativa Indi...   170   7e-40
gb|EEC74955.1| hypothetical protein OsI_10942 [Oryza sativa Indi...   170   7e-40
gb|EEC81552.1| hypothetical protein OsI_24974 [Oryza sativa Indi...   168   2e-39

>gb|AAF97969.1|AC000103_19 F21J9.30 [Arabidopsis thaliana]
          Length = 1270

 Score =  174 bits (440), Expect = 5e-41
 Identities = 119/375 (31%), Positives = 178/375 (47%), Gaps = 15/375 (4%)
 Frame = -3

Query: 1108 SCLKIPAVVCHSIEQLAAKFWWGAQKDNKRKLHWKAWNKLAIDKDQGGFAFRNLQDFNQA 929
            SC K+P   C ++E   A FWW +  D+ RK+HW++W +L + KD GG  FR++Q FNQA
Sbjct: 769  SCFKLPITTCENLESAMASFWWDSC-DHSRKIHWQSWERLCLPKDSGGLGFRDIQSFNQA 827

Query: 928  LLAKQLWRLLTQPGLLMSRVLKGKYFPGGGLMNIKIKSQDSWLWKSWMGARKTXXXXXXX 749
            LLAKQ WRLL  P  L+SR+LK +YF     ++  +  + S+ W+S +  R+        
Sbjct: 828  LLAKQAWRLLHFPDCLLSRLLKSRYFDATDFLDAALSQRPSFGWRSILFGRELLSKGLQK 887

Query: 748  XXXXGKSIKIWESPWLPNCPNFRPSSQKP--QGCSLRWVAELMCSDGKAWNRDLIKSIFL 575
                G S+ +W  PW+ +   FR   +K      +L+  A L    G  W+ +++  +FL
Sbjct: 888  RVGDGASLFVWIDPWIDD-NGFRAPWRKNLIYDVTLKVKALLNPRTG-FWDEEVLHDLFL 945

Query: 574  PQDADDILQL-PINQLG*RDKLVWHHTSNGRYTVNSGYKWTADQRLCFLDRAECSNRREV 398
            P+D   I  + P+  +   D  VW    +G ++V S Y W A Q      R+E S +   
Sbjct: 946  PEDILRIKAIKPV--ISQADFFVWKLNKSGDFSVKSAY-WLAYQTKSQNLRSEVSMQPST 1002

Query: 397  EQ*KGGYKGLQVDPPC-----KTCGEHDETLEHLLFHCPMASLVWKLAPVQWP--GIHQL 239
               K     LQ DP       K CGE  E+  H LF CP++  +W L+   +P  G    
Sbjct: 1003 LGLKTQVWNLQTDPKIKIFLWKVCGELGESTNHTLFLCPLSRQIWALSDYPFPPDGFSNG 1062

Query: 238  SI--NFQHWWRQXXXXXXXXXXXSRLELSAYLLWSIWNARNSWCFNATKLTAVQIVEKAQ 65
            SI  N  H                  ++  ++LW IW  RNS+ F      A   V K +
Sbjct: 1063 SIYSNINHLLENKDNKEWPINLR---KIFPWILWRIWKNRNSFIFEGISYPATDTVIKIR 1119

Query: 64   ---IEWNEFKESDSQ 29
               +EW E +  D +
Sbjct: 1120 DDVVEWFEAQCLDGE 1134


>gb|ABA97040.1| retrotransposon protein, putative, unclassified [Oryza sativa
            Japonica Group]
          Length = 1913

 Score =  170 bits (431), Expect = 5e-40
 Identities = 108/358 (30%), Positives = 161/358 (44%), Gaps = 29/358 (8%)
 Frame = -3

Query: 1108 SCLKIPAVVCHSIEQLAAKFWWGAQKDNKRKLHWKAWNKLAIDKDQGGFAFRNLQDFNQA 929
            S  ++P  VC  + +LA  FWWGA K  KRK HW+AW+ L   K  GG AFR+ + FNQA
Sbjct: 1381 SIFRLPESVCEDLNKLARNFWWGADK-GKRKTHWRAWSCLTKPKHNGGLAFRDFRLFNQA 1439

Query: 928  LLAKQLWRLLTQPGLLMSRVLKGKYFPGGGLMNIKIKSQDSWLWKSWMGARKTXXXXXXX 749
            LLA+Q WRLL +P  L +RV+K KY+P G L++       S  W++              
Sbjct: 1440 LLARQAWRLLDKPDSLCARVMKAKYYPNGSLVDTAFGGNASPGWRAIEHGLALLKKGIVW 1499

Query: 748  XXXXGKSIKIWESPWLPNCPNFRPSSQKPQGCSLRWVAELMCSDGKAWNRDLIKSIFLPQ 569
                G+S++IW  PW+P   + RP + K + C ++WV++L+  DG +W+   +  +FLP 
Sbjct: 1500 RIGNGRSVRIWRDPWIPRDHSCRPITTK-RNCRVKWVSDLLGQDG-SWDVQKVSRVFLPI 1557

Query: 568  DADDILQLPINQLG*RDKLVWHHTSNGRYTVNSGYKWTADQRLCFLDRAECS-------- 413
            DAD+IL++  +     D L WH    G+++V S YK      L + D +  S        
Sbjct: 1558 DADEILKIRTSVRLEEDFLSWHPDRLGQFSVRSAYKLAIS--LDYADESSSSSGQNPQKI 1615

Query: 412  ---------------------NRREVEQ*KGGYKGLQVDPPCKTCGEHDETLEHLLFHCP 296
                                 N     Q     + L+    C  CG   E + H L  CP
Sbjct: 1616 WDLIWKCNVPQKVKVFCWRAANNCLANQENKKKRNLERSEICCICGNETEDVSHALSRCP 1675

Query: 295  MASLVWKLAPVQWPGIHQLSINFQHWWRQXXXXXXXXXXXSRLELSAYLLWSIWNARN 122
             A  +W+   ++  G   L++                       +   +LW IW  RN
Sbjct: 1676 HAVHLWE--AMKAAGSLSLNVAGNATGGSWFFDQIENTPAEERMMLCMMLWRIWFVRN 1731


>gb|EEC83784.1| hypothetical protein OsI_29682 [Oryza sativa Indica Group]
          Length = 666

 Score =  170 bits (430), Expect = 7e-40
 Identities = 110/371 (29%), Positives = 159/371 (42%), Gaps = 45/371 (12%)
 Frame = -3

Query: 1099 KIPAVVCHSIEQLAAKFWWGAQKDNKRKLHWKAWNKLAIDKDQGGFAFRNLQDFNQALLA 920
            K+P  VC  + ++   FWWG+  + KRK+HWKAW++L   K+ GG  FR+++ FNQALLA
Sbjct: 163  KLPDGVCEELTKIIRNFWWGSD-NGKRKVHWKAWSQLTKSKNAGGLGFRDMKAFNQALLA 221

Query: 919  KQLWRLLTQPGLLMSRVLKGKYFPGGGLMNIKIKSQDSWLWKSWMGARKTXXXXXXXXXX 740
            +Q WRL+  P  L +RVLK KY+P G LM+       S  W++     +           
Sbjct: 222  RQAWRLIDNPVSLCARVLKAKYYPNGCLMDTVFSGNASPTWRAIEHGLELLKEGLVWRIG 281

Query: 739  XGKSIKIWESPWLPNCPNFRPSSQK---PQG-CSLRWVAELMCSDGKAWNRDLIKSIFLP 572
             G  ++IW  PW+P     R S++K    QG C ++WVA+L+ ++   WN  L++ IFLP
Sbjct: 282  NGTRVRIWRDPWIP-----RSSTRKVITSQGRCRIKWVADLLDANTN-WNEQLVRQIFLP 335

Query: 571  QDADDILQLPINQLG*RDKLVWHHTSNGRYTVNSGYKWTADQRL---------------- 440
             DAD IL +  ++ G  D L WH   +G +TV + Y+   + +L                
Sbjct: 336  MDADAILSIRTSRQGEDDFLAWHLEKSGIFTVKTAYRLAIENKLNSKNSNASGSSIEGSK 395

Query: 439  -------------------------CFLDRAECSNRREVEQ*KGGYKGLQVDPPCKTCGE 335
                                     C   R     RR           L+    C  CG 
Sbjct: 396  SLWNTIWSCPVPPKVRIFAWRVASDCLATRVNKKGRR-----------LEALDTCTLCGT 444

Query: 334  HDETLEHLLFHCPMASLVWKLAPVQWPGIHQLSINFQHWWRQXXXXXXXXXXXSRLELSA 155
              ET  H L  C  A  +W      W    Q +  +Q    +                  
Sbjct: 445  ESETAFHALCRCTYARALWAALREVWQIPDQTTWTYQ--GTKWLLLTLVKLSEMERMFIL 502

Query: 154  YLLWSIWNARN 122
             LLW IW+ RN
Sbjct: 503  MLLWRIWHVRN 513


>gb|EEC74955.1| hypothetical protein OsI_10942 [Oryza sativa Indica Group]
          Length = 961

 Score =  170 bits (430), Expect = 7e-40
 Identities = 98/314 (31%), Positives = 147/314 (46%), Gaps = 30/314 (9%)
 Frame = -3

Query: 1108 SCLKIPAVVCHSIEQLAAKFWWGAQKDNKRKLHWKAWNKLAIDKDQGGFAFRNLQDFNQA 929
            S  ++PA +C    QL  KFWWG  ++N RK+HW +W +L   K QGG  FR+L+ FNQA
Sbjct: 546  SVFRLPASLCEEYMQLIRKFWWGEDQNN-RKVHWISWQQLIKPKGQGGIGFRDLKLFNQA 604

Query: 928  LLAKQLWRLLTQPGLLMSRVLKGKYFPGGGLMNIKIKSQDSWLWKSWMGARKTXXXXXXX 749
            LLA+Q WRL+  P  L ++VLK KYFP G L++       S  WK  M   +        
Sbjct: 605  LLARQAWRLIQYPSSLCAQVLKAKYFPSGDLIDTAFPVDSSETWKGIMHGLELLKKGLIW 664

Query: 748  XXXXGKSIKIWESPWLPNCPNFRPSSQKPQGCSLRWVAELMCSDGKAWNRDLIKSIFLPQ 569
                G  +KIW   W+P   N +   +K +   L+WV++L+  D + WN +L++++F P 
Sbjct: 665  RISDGSKVKIWRDNWIPREHNLKVIGRKTRS-RLKWVSDLLLPDRQQWNEELVRNMFYPP 723

Query: 568  DADDILQLPINQLG*RDKLVWHHTSNGRYTVNSGYKWTADQRLCFLDRAECSNRREVE-- 395
            DA+ IL++ +      D + WH+  +G +TV S YK   D  L   +  E S+    E  
Sbjct: 724  DAEGILRIQLPHSPGEDIVAWHYDKSGIFTVRSAYKVALDALLRSDNTGESSSAPNGERT 783

Query: 394  ----------------------------Q*KGGYKGLQVDPPCKTCGEHDETLEHLLFHC 299
                                        Q K   + +   P C+ CG+ +E   H    C
Sbjct: 784  LWKNIWNTQVPQKVRIFAWRLARDCLATQAKKKRRNIVKSPVCEICGKCEEDGFHATVAC 843

Query: 298  PMASLVWKLAPVQW 257
              A  +     + W
Sbjct: 844  TKARALRSEMRIYW 857


>gb|EEC81552.1| hypothetical protein OsI_24974 [Oryza sativa Indica Group]
          Length = 1015

 Score =  168 bits (426), Expect = 2e-39
 Identities = 107/358 (29%), Positives = 160/358 (44%), Gaps = 29/358 (8%)
 Frame = -3

Query: 1108 SCLKIPAVVCHSIEQLAAKFWWGAQKDNKRKLHWKAWNKLAIDKDQGGFAFRNLQDFNQA 929
            S  ++P  VC  + +LA  FWWGA K  KRK HW+AW+ L   K  GG  FR+ + FNQA
Sbjct: 639  SIFRLPESVCEDLNKLARNFWWGADK-GKRKTHWQAWSCLTKPKHNGGLGFRDFRLFNQA 697

Query: 928  LLAKQLWRLLTQPGLLMSRVLKGKYFPGGGLMNIKIKSQDSWLWKSWMGARKTXXXXXXX 749
            LLA+Q WRLL +P  L +RV+K KY+P G L++       S  W++              
Sbjct: 698  LLARQAWRLLDKPDSLCARVMKAKYYPNGSLVDTAFGGNASPGWRAIEHGLALLKKGIVW 757

Query: 748  XXXXGKSIKIWESPWLPNCPNFRPSSQKPQGCSLRWVAELMCSDGKAWNRDLIKSIFLPQ 569
                G+S++IW  PW+P   + RP + K + C ++WV++L+  DG +W+   +  +FLP 
Sbjct: 758  RIGNGRSVRIWRDPWIPRDHSRRPITTK-RNCRVKWVSDLLGQDG-SWDVQNVSRVFLPI 815

Query: 568  DADDILQLPINQLG*RDKLVWHHTSNGRYTVNSGYKWTADQRLCFLDRAECS-------- 413
            DAD+IL++  +     D L WH    G+++V S YK      L + D +  S        
Sbjct: 816  DADEILKIRTSVRLEEDFLAWHPDRLGQFSVRSAYKLAIS--LDYADESSSSSGQNPQKI 873

Query: 412  ---------------------NRREVEQ*KGGYKGLQVDPPCKTCGEHDETLEHLLFHCP 296
                                 N     Q     + L+    C  CG   E + H L  CP
Sbjct: 874  WDLIWKCNVPQKVKVFCWRAANNCLANQENKKKRNLERSEICCICGNETEDVSHALSRCP 933

Query: 295  MASLVWKLAPVQWPGIHQLSINFQHWWRQXXXXXXXXXXXSRLELSAYLLWSIWNARN 122
             A  +W+   ++  G   L++                       +   +LW IW  RN
Sbjct: 934  HAVHLWE--AMKAAGSLSLNVAGNATGGSWFFDQIENTPAEERMMLCMMLWRIWFVRN 989

