BLASTX nr result

ID: Cephaelis21_contig00022408

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00022408
         (1241 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAD32950.1| putative non-LTR retroelement reverse transcripta...   102   2e-19
ref|XP_002446679.1| hypothetical protein SORBIDRAFT_06g020406 [S...    82   3e-13
ref|NP_194638.1| Ribonuclease H-like protein [Arabidopsis thalia...    82   3e-13
gb|EEE59920.1| hypothetical protein OsJ_12548 [Oryza sativa Japo...    82   4e-13
gb|AAB82639.1| putative non-LTR retroelement reverse transcripta...    81   5e-13

>gb|AAD32950.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 773

 Score =  102 bits (254), Expect = 2e-19
 Identities = 71/228 (31%), Positives = 109/228 (47%), Gaps = 4/228 (1%)
 Frame = +2

Query: 17   WNSDLIYKTFCKPDAEAILILPTSALGSQDKLHKIHSKIGAYTVKSTSTWLAKLRHQHPH 196
            WN DL+ K   + D   I  +  S  G+ D +  I++  G Y+VKS    L KL  Q   
Sbjct: 425  WNEDLLCKLIHQNDIPHIRAIRPSITGANDAITWIYTHDGNYSVKSGYHLLRKLSQQQHA 484

Query: 197  QLESSSRLELTKKIWKRTWEMQIKGKIKHFRLRTYHLLLPTGHQLQLSEIHLDSTCKICR 376
             L S + +   + ++   W+     KIKHF  R+ H  LPT   L+   +  D TC+ C 
Sbjct: 485  SLPSPNEVS-AQTVFTNIWKQNAPPKIKHFWWRSAHNALPTAGNLKRRRLITDDTCQRCG 543

Query: 377  EAPEAIEHLLFHCTKV*RIWELSLVS-WPGLQKFTDHFQGWWEQICSIRMLSINQDRIEF 553
            EA E + HLLF C     IWE + +   PG    ++ F    E I  +   S  +D +  
Sbjct: 544  EASEDVNHLLFQCRVSKEIWEQAHIKLCPGDSLMSNSFNQNLESIQKLNQ-SARKD-VSL 601

Query: 554  TAYLL*SIWKTRNDFTFNAVMVSVAKIVERA---R*EWQEFLSIHEQK 688
              ++   IWK RND  FN    S+   +++A   + +W+E L+ +EQ+
Sbjct: 602  FPFIGWRIWKMRNDLIFNNKRWSIPDSIQKALIDQQQWKESLNCNEQQ 649


>ref|XP_002446679.1| hypothetical protein SORBIDRAFT_06g020406 [Sorghum bicolor]
            gi|241937862|gb|EES11007.1| hypothetical protein
            SORBIDRAFT_06g020406 [Sorghum bicolor]
          Length = 395

 Score = 82.0 bits (201), Expect = 3e-13
 Identities = 94/402 (23%), Positives = 159/402 (39%), Gaps = 18/402 (4%)
 Frame = +2

Query: 23   SDLIYKTFCKPDAEAILILPTSALGSQDKLHKIHSKIGAYTVKSTSTWLAKLRHQHPHQL 202
            S+L+ + F   D+EAIL +P S    +D     H K G +TV+S    L +L+       
Sbjct: 1    SELVKRVFYPIDSEAILQMPLSMRKQKDCWAWHHEKNGLFTVRSAYRMLIELKKSREDYF 60

Query: 203  E---SSSRLELTKKIWKRTWEMQIKGKIKHFRLRTYHLLLPTGHQLQLSEIHLDSTCKIC 373
            E   + S    ++K WK+ W M++  KIK F  R     +PT   L+   +   S CKIC
Sbjct: 61   EGRANCSDFATSQKEWKKLWSMKLPSKIKVFCWRLALNSIPTASVLKSRNLASTSHCKIC 120

Query: 374  REAPEAIEHLLFHCTKV*RIWEL------SLVSWPGLQKFTDHFQGWWEQICSIRMLSIN 535
                +  EH L  CT    +W L      +L+S   +          W       +   +
Sbjct: 121  GAVDDTWEHSLLFCTMSKCVWALLDEDITNLISHLRISN-----PKHWITFMCCNIPQAD 175

Query: 536  QDRIEFTAYLL*SIWKTRNDFTFNAVMVSVAKIVERAR*EWQEFLSIH--EQKTSTRLVG 709
              R+  T +   +IW+ R       V  S   I+     + +E   I   E K   +   
Sbjct: 176  GIRVLVTCW---AIWQARRKAIHEGVFQSPFSIMVTINRQIEELQMIRGMELKGGNQNQS 232

Query: 710  RNQLPL*QAPHHNDWMVGYETMRISAIFKKNIACGSYGLLTEDRHGITQQASAIFYEREV 889
            + +  L +AP       G   + + A   +  + G+ G++  +  G     SA+      
Sbjct: 233  KQKTRLWKAPDQ-----GKCKINVDAAVNRVGSKGAVGVVCRNDRGEFIAPSAMIIPNIT 287

Query: 890  SALTLSLRAIRDTQLRTYKLRWNKAILLSDEMDIVRHLQDTSPIFTDIFLLTNL------ 1051
               TL   A  +           K I+ SD ++IVR++ +  P+ T + +L ++      
Sbjct: 288  EPETLEGMACLEALALAEDCGIRKIIVASDCLNIVRNISE-MPLCTYVMILKDIQERAKS 346

Query: 1052 FQDCKFVLISKDANKGCCRLAQFALQ-SSSSETWNTSFPTWL 1174
            F   +F    ++ N+   RL ++A         W  S P +L
Sbjct: 347  FDYVRFAHEGRECNREADRLVKYACSLEDGRHVWLGSPPVFL 388


>ref|NP_194638.1| Ribonuclease H-like protein [Arabidopsis thaliana]
            gi|4972055|emb|CAB43923.1| putative protein [Arabidopsis
            thaliana] gi|7269807|emb|CAB79667.1| putative protein
            [Arabidopsis thaliana] gi|67633766|gb|AAY78807.1|
            putative reverse transcriptase/RNA-dependent DNA
            polymerase [Arabidopsis thaliana]
            gi|332660185|gb|AEE85585.1| Ribonuclease H-like protein
            [Arabidopsis thaliana]
          Length = 575

 Score = 82.0 bits (201), Expect = 3e-13
 Identities = 89/394 (22%), Positives = 154/394 (39%), Gaps = 19/394 (4%)
 Frame = +2

Query: 2    DNGTIWNSDLIYKTFCKPDAEAILILPTSALGSQDKLHKIHSKIGAYTVKSTSTWLAKL- 178
            ++G  W  D+I   F + + + I  L        D     ++  G YTVKS    L ++ 
Sbjct: 177  ESGREWRKDVIEMLFPEVERKLIGELRPGGRRILDSYTWDYTSSGDYTVKSGYWVLTQII 236

Query: 179  -RHQHPHQLESSSRLELTKKIWKRTWEMQIKGKIKHFRLRTYHLLLPTGHQLQLSEIHLD 355
             +   P ++   S   + +KIWK     Q   KI+HF  +     LP    L    +  +
Sbjct: 237  NKRSSPQEVSEPSLNPIYQKIWKS----QTSPKIQHFLWKCLSNSLPVAGALAYRHLSKE 292

Query: 356  STCKICREAPEAIEHLLFHCTKV*RIWELSLVSWPGLQKFTDHFQGWWEQICSIRMLSIN 535
            S C  C    E + HLLF CT     W +S +  P          G W     + +  + 
Sbjct: 293  SACIRCPSCKETVNHLLFKCTFARLTWAISSIPIP--------LGGEWADSIYVNLYWVF 344

Query: 536  ---------QDRIEFTAYLL*SIWKTRNDFTFNAVMVSVAKIVERAR*EWQEF-LSIHEQ 685
                     +   +   +LL  +WK RN+  F     +  +++ RA  + +E+ +    +
Sbjct: 345  NLGNGNPQWEKASQLVPWLLWRLWKNRNELVFRGREFNAQEVLRRAEDDLEEWRIRTEAE 404

Query: 686  KTSTR-LVGRNQLPL*QAPHHNDWMVGYETMRISAIFKKNIACGSYGLLTEDRHGITQQA 862
               T+  V R+     + P H  W+   +    +   + N  CG  G +  +  G  +  
Sbjct: 405  SCGTKPQVNRSSCGRWRPPPH-QWV---KCNTDATWNRDNERCG-IGWVLRNEKGEVKWM 459

Query: 863  SAIFYEREVSALTLSLRAIRDTQLRTYKLRWNKAILLSDEMDIVRHLQD------TSPIF 1024
             A    +  S L   L A+R   L   + ++N  I  SD   ++  L +        P  
Sbjct: 460  GARALPKLKSVLEAELEAMRWAVLSLSRFQYNYVIFESDSQVLIEILNNDEIWPSLKPTI 519

Query: 1025 TDIFLLTNLFQDCKFVLISKDANKGCCRLAQFAL 1126
             D+  L + F + KFV I ++ N    R+A+ +L
Sbjct: 520  QDLQRLLSQFTEVKFVFIPREGNTLAERVARESL 553


>gb|EEE59920.1| hypothetical protein OsJ_12548 [Oryza sativa Japonica Group]
          Length = 1076

 Score = 81.6 bits (200), Expect = 4e-13
 Identities = 98/409 (23%), Positives = 152/409 (37%), Gaps = 26/409 (6%)
 Frame = +2

Query: 17   WNSDLIYKTFCKPDAEAILI--LPTSALGSQDKLHKIHSKIGAYTVKSTSTWLAKLRHQH 190
            WN  L+     + DA  +L   LP   +      H  H K G +TVKS       L  + 
Sbjct: 664  WNETLVRHVLKEEDANEVLKIRLPNHQMDDFPAWH--HEKSGLFTVKSAYKLAWNLSGKG 721

Query: 191  PHQLESSSRLELTKKIWKRTWEMQIKGKIKHFRLRTYHLLLPTGHQLQLSEIHLDSTCKI 370
              Q  SS+     +KIW R W  +++ K+K F  +     LPT    +  +I ++ TC +
Sbjct: 722  VVQSSSSTATSGERKIWSRVWNAKVQAKVKIFIWKLAQDKLPTWENKRRRKIEMNGTCPV 781

Query: 371  CREAPEAIEHLLFHCTKV*RIWELSLVSW--PGLQKFTDHFQGWWEQICSIRMLSINQDR 544
            C    E   H    CTK   + E     W  PG  KF      W      I +  +N+++
Sbjct: 782  CGTKGENSYHATVECTKARALREALRAVWHLPGEDKFLWTGPDW----LLILLDGVNEEQ 837

Query: 545  IEFTAYLL*SIWKTRNDFTFNAVMVSVAKIVERAR*EWQEFLSIHEQ-----KTSTRLVG 709
                 Y+L   W  RND        S+A  V      ++E L  + Q     K    +  
Sbjct: 838  RTHIMYMLWRAWYLRNDLIHGDGRCSIAGSVSFLT-SYEEVLLPNRQMPDDIKGKKPMYS 896

Query: 710  RNQLPL*QAPHHND-WMV---GYETMRISAIFKKNIACGSYGLLTEDRHGITQQASAIFY 877
              Q     A   +  W+    G   + + A F+      S G++  D  G+   A+    
Sbjct: 897  EGQKEKHMAEKQSSGWIAPPDGAAKINVDAGFRMETGEASAGIVIRDCRGLILLAACKTL 956

Query: 878  ----EREVSALTLSLRAIRDTQLRTYKLRW--NKAILLSDEMDIVRHLQ---DTSPIFTD 1030
                  E +    SL  IR        L+W     IL +D  ++V  L+    +  ++  
Sbjct: 957  HPCSSAEQAEALASLEGIR------CALQWIHMPVILETDNAEVVARLKTKHSSRSVWEG 1010

Query: 1031 IFLLTNL----FQDCKFVLISKDANKGCCRLAQFALQSSSSETWNTSFP 1165
            + +         Q  +   I +D+NK    LAQ AL S +   W    P
Sbjct: 1011 VIMEAKAAMQGLQAVEVAHIKRDSNKVAHTLAQMALSSGNCLEWRLCAP 1059


>gb|AAB82639.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1374

 Score = 81.3 bits (199), Expect = 5e-13
 Identities = 85/400 (21%), Positives = 156/400 (39%), Gaps = 10/400 (2%)
 Frame = +2

Query: 5    NGTIWNSDLIYKTFCKPDAEAILILPTSALGSQDKLHKIHSKIGAYTVKSTSTWLAKLRH 184
            +G  WN +L+   F     E IL L      ++D+    +S+ G Y+VKS    + ++ +
Sbjct: 975  DGRDWNWNLVSLLFPDNTQENILALRPGGKETRDRFTWEYSRSGHYSVKSGYWVMTEIIN 1034

Query: 185  Q--HPHQLESSSRLELTKKIWKRTWEMQIKGKIKHFRLRTYHLLLPTGHQLQLSEIHLDS 358
            Q  +P ++   S   + ++IWK    + +  KI HF  R  +  L     L    +  + 
Sbjct: 1035 QRNNPQEVLQPSLDPIFQQIWK----LDVPPKIHHFLWRCVNNCLSVASNLAYRHLAREK 1090

Query: 359  TCKICREAPEAIEHLLFHCTKV*RIWELS-LVSWPGLQKFTDHFQGWWEQICSIRMLSIN 535
            +C  C    E + HLLF C      W +S L + PG +     F+     +   +     
Sbjct: 1091 SCVRCPSHGETVNHLLFKCPFARLTWAISPLPAPPGGEWAESLFRNMHHVLSVHKSQPEE 1150

Query: 536  QDRIEFTAYLL*SIWKTRNDFTFNAVMVSVAKIVERAR*EWQEFLSIHEQKTSTRLVGRN 715
             D      ++L  +WK RND  F     +  +++ +A  +   + +  E +       R+
Sbjct: 1151 SDHHALIPWILWRLWKNRNDLVFKGREFTAPQVILKATEDMDAWNNRKEPQPQVTSSTRD 1210

Query: 716  QLPL*QAPHHNDWMVGYETMRISAIFKKNIACGSYGLLTEDRHGITQQASAIFYEREVSA 895
            +    Q P H     G+        + K++     G +  +  G            + S 
Sbjct: 1211 RCVKWQPPSH-----GWVKCNTDGAWSKDLGNCGVGWVLRNHTGRLLWLGLRALPSQQSV 1265

Query: 896  LTLSLRAIRDTQLRTYKLRWNKAILLSDEMDIVRHLQD------TSPIFTDIFLLTNLFQ 1057
            L   + A+R   L   +  + + I  SD   +V  +Q+       +P   DI  L   F+
Sbjct: 1266 LETEVEALRWAVLSLSRFNYRRVIFESDSQYLVSLIQNEMDIPSLAPRIQDIRNLLRHFE 1325

Query: 1058 DCKFVLISKDANKGCCRLAQFALQSSSSETWNTSF-PTWL 1174
            + KF    ++ N    R A+ +L   + +    S  P W+
Sbjct: 1326 EVKFQFTRREGNNVADRTARESLSLMNYDPKMYSITPDWI 1365

