BLASTX nr result

ID: Cephaelis21_contig00033995 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00033995
         (1369 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAV88076.1| putative retrotransposon polyprotein [Ipomoea bat...   178   3e-63
ref|XP_003548662.1| PREDICTED: uncharacterized protein LOC100800...   177   7e-62
gb|ABE60891.1| putative polyprotein [Oryza sativa Japonica Group]     152   1e-57
gb|AAK91332.1|AC090441_14 Putative gag-pol polyprotein [Oryza sa...   156   3e-57
gb|AAQ56388.1| putative gag-pol polyprotein [Oryza sativa Japoni...   155   8e-57

>gb|AAV88076.1| putative retrotransposon polyprotein [Ipomoea batatas]
          Length = 1358

 Score =  178 bits (452), Expect(2) = 3e-63
 Identities = 102/265 (38%), Positives = 155/265 (58%), Gaps = 4/265 (1%)
 Frame = +1

Query: 25   DYESESDREDFYADMPPLKGESDGEDREMISPCDDHIM----VMMRSLNIQINANKTLQR 192
            +Y S ++ +D   ++ P+ GE   +D       +D  +    V+ ++L+  +  ++  QR
Sbjct: 345  EYMSANESDD--EELEPI-GERQKDDHSEEEVQEDDALHFNCVVHKALSTLVVLDQEEQR 401

Query: 193  QNIFHTKCTFKEKVCLLVIDSGSCTNVVSTLLIAKL*LPLIEHPNPYRLQWLNDHGELKI 372
            +NIF+ KC      C  +ID GSCTNV+S  ++  + +P I+HP PY+LQWLND GELK+
Sbjct: 402  ENIFYGKCKIPGATCSFIIDGGSCTNVISEDVVNAMKIPTIQHPQPYKLQWLNDDGELKV 461

Query: 373  TKQVLTSFEIGKYKDEALCDVVPMNAGHILLGRLWEFDRDVTHNGRSNRYSLTCNGCKFI 552
             KQ L S  IGKY+D+ LCDV+PM+A HILLGR W++DRD  H+G++N+Y++   G K+ 
Sbjct: 462  HKQALISISIGKYQDDVLCDVIPMHACHILLGRPWQYDRDTLHHGKTNKYTIHKGGKKYT 521

Query: 553  LLPMAPMQVYEAQLHLKDECDKRYTSLNSEAIKEHTAESLNRSTKAESKK*KKVVLSGIK 732
            L P+AP +VY  Q+  K +  +       EA+KE T+   N  T A  KK +K    G+K
Sbjct: 522  LTPLAPKEVYNLQVQSK-KLREELAQKAKEAMKETTSGKQN--TIAHEKKQRK---EGMK 575

Query: 733  VLNDCKVLSASNAKSEKEEIEEINR 807
                    S+ N    K E+E+  R
Sbjct: 576  ---KDTTQSSHNLLMTKREVEQALR 597



 Score = 91.7 bits (226), Expect(2) = 3e-63
 Identities = 50/107 (46%), Positives = 66/107 (61%)
 Frame = +3

Query: 894  ILYVLIFCDSYFSEADLPSNLPVEISKLLQDFHNVFPEELSSGLPPLRGIEHQIHFMLGA 1073
            +LY + FC +      +PS    ++S LL +F +VFPEEL  GLPP+RGIEHQI  + GA
Sbjct: 604  LLYPIDFCLNVIKSEIIPS----DVSALLSEFADVFPEELPKGLPPIRGIEHQIDLIPGA 659

Query: 1074 VILNRPAYRSNPKKTKELQR*VEELLSRGQDMLVASEKAAPTVSFEK 1214
             + NRPAYR+NP + KE+QR V+ELL  G      S  A P +   K
Sbjct: 660  SLPNRPAYRTNPDEAKEIQRQVDELLQAGFIQESLSPCAVPVLLVPK 706


>ref|XP_003548662.1| PREDICTED: uncharacterized protein LOC100800169, partial [Glycine
           max]
          Length = 973

 Score =  177 bits (448), Expect(2) = 7e-62
 Identities = 106/270 (39%), Positives = 144/270 (53%), Gaps = 1/270 (0%)
 Frame = +1

Query: 97  EDREMISPCDDHIMVMMRSL-NIQINANKTLQRQNIFHTKCTFKEKVCLLVIDSGSCTNV 273
           E  E + P ++  ++M+R L   Q       QR+NIFHT+C   +K C L++DSGSC N 
Sbjct: 287 ESSEEVYPHEEGDLLMVRRLLGGQSCDLSQSQRENIFHTRCKILDKTCSLIVDSGSCCNC 346

Query: 274 VSTLLIAKL*LPLIEHPNPYRLQWLNDHGELKITKQVLTSFEIGKYKDEALCDVVPMNAG 453
            ST L++KL L +I HP PY+LQWLN+ GE+ + +QV   F IG YKDE  CD+VPM AG
Sbjct: 347 CSTRLVSKLNLTIIPHPKPYKLQWLNEQGEMIVNQQVKVPFSIGTYKDEVHCDIVPMEAG 406

Query: 454 HILLGRLWEFDRDVTHNGRSNRYSLTCNGCKFILLPMAPMQVYEAQLHLKDECDKRYTSL 633
           HILLGR W+FDR + +NG +N  +LT  G KF+L P  P QV + QL +KD+ D+     
Sbjct: 407 HILLGRPWQFDRKIIYNGLTNEITLTHLGTKFVLHPQTPSQVAKDQLTMKDKRDE----- 461

Query: 634 NSEAIKEHTAESLNRSTKAESKK*KKVVLSGIKVLNDCKVLSASNAKSEKEEIEEINRHP 813
                            K E +K KK          D K LS+     EKE  +   +  
Sbjct: 462 ---------------EEKLEKQKKKK----------DSKALSSKARGKEKEGKDSSKKI- 495

Query: 814 PTYPGKK*LNFYVGIRDISRYIYMKIEFYM 903
                 K  N +    DI R + +K  FY+
Sbjct: 496 -----VKKENHFATKGDIKRALLLKQSFYL 520



 Score = 88.6 bits (218), Expect(2) = 7e-62
 Identities = 47/117 (40%), Positives = 74/117 (63%), Gaps = 2/117 (1%)
 Frame = +3

Query: 900  YVLIFCDSYFSEADLPS--NLPVEISKLLQDFHNVFPEELSSGLPPLRGIEHQIHFMLGA 1073
            Y+L+  ++  S A +P+   LP ++ +LL +F ++FP+E+   LPPLRGIEHQI  + GA
Sbjct: 519  YLLLSRETSLSTATIPTFETLPPKVQELLHEFGDIFPKEIPPRLPPLRGIEHQIDLVPGA 578

Query: 1074 VILNRPAYRSNPKKTKELQR*VEELLSRGQDMLVASEKAAPTVSFEKGWISHNLCNN 1244
             + NRPAYR+NP++TKE++  V+ELL +G      S  A P +   K   +  +C +
Sbjct: 579  SLPNRPAYRTNPQETKEIESQVKELLEKGWVQESLSPCAVPVLLVPKKDGTWRMCTD 635


>gb|ABE60891.1| putative polyprotein [Oryza sativa Japonica Group]
          Length = 1713

 Score =  152 bits (385), Expect(2) = 1e-57
 Identities = 86/264 (32%), Positives = 146/264 (55%)
 Frame = +1

Query: 22   GDYESESDREDFYADMPPLKGESDGEDREMISPCDDHIMVMMRSLNIQINANKTLQRQNI 201
            G+YES S+ E   ++      E +  ++++        +V+ + L++Q++  +  QR N+
Sbjct: 413  GEYESTSEEEQEDSE------EENNLEKDICEFESGAALVVTQILSVQMSDAENGQRHNL 466

Query: 202  FHTKCTFKEKVCLLVIDSGSCTNVVSTLLIAKL*LPLIEHPNPYRLQWLNDHGELKITKQ 381
            F T+   ++KV  ++ID GSC N+ S  ++ KL L L++HP+PY +QWLN+ G +KI ++
Sbjct: 467  FQTRAKVQDKVVKVIIDGGSCHNLASKEMVEKLGLKLLKHPHPYHVQWLNNSGSIKIAQR 526

Query: 382  VLTSFEIGKYKDEALCDVVPMNAGHILLGRLWEFDRDVTHNGRSNRYSLTCNGCKFILLP 561
            V   F+IG+Y D   CDV PM   H+LLGR W++DR   H GR+N+Y++   G + IL P
Sbjct: 527  VKVPFKIGEYIDTMECDVAPMTVCHMLLGRPWQYDRSSLHCGRTNQYTIKWKGKELILKP 586

Query: 562  MAPMQVYEAQLHLKDECDKRYTSLNSEAIKEHTAESLNRSTKAESKK*KKVVLSGIKVLN 741
            M P Q+    L    E       + +E+ KE    +L+   K+ S+  K  +    K   
Sbjct: 587  MTPQQILAEHLQKSSE-------VRNESAKEGQKNNLSAPHKSVSESHKPNMRDNKKREG 639

Query: 742  DCKVLSASNAKSEKEEIEEINRHP 813
            +  V+ A+     K E+ ++ R+P
Sbjct: 640  ENLVMIAT-----KSEMRDVRRNP 658



 Score = 98.6 bits (244), Expect(2) = 1e-57
 Identities = 52/115 (45%), Positives = 75/115 (65%)
 Frame = +3

Query: 894  ILYVLIFCDSYFSEADLPSNLPVEISKLLQDFHNVFPEELSSGLPPLRGIEHQIHFMLGA 1073
            +L++L+  D+  S  DL S +P  ++++LQ++ +VFPEE   GLPPLRGIEHQI  + GA
Sbjct: 661  VLFILVCKDTLLSANDLTS-VPSVVARVLQEYEDVFPEETPVGLPPLRGIEHQIDLIPGA 719

Query: 1074 VILNRPAYRSNPKKTKELQR*VEELLSRGQDMLVASEKAAPTVSFEKGWISHNLC 1238
             + NRPAYR+NP++TKE+QR V+ LL +G      S  A P +   K   S  +C
Sbjct: 720  TLPNRPAYRTNPEETKEIQRQVQALLDKGYVRESLSPCAVPVILVPKKDGSWRMC 774


>gb|AAK91332.1|AC090441_14 Putative gag-pol polyprotein [Oryza sativa Japonica Group]
            gi|15217296|gb|AAK92640.1|AC079634_1 Putative
            retroelement [Oryza sativa Japonica Group]
            gi|31431373|gb|AAP53161.1| retrotransposon protein,
            putative, Ty3-gypsy subclass [Oryza sativa Japonica
            Group]
          Length = 1708

 Score =  156 bits (395), Expect(2) = 3e-57
 Identities = 98/267 (36%), Positives = 142/267 (53%), Gaps = 9/267 (3%)
 Frame = +1

Query: 22   GDYESESDR-EDFYADMPPL---KGESDGEDREMI-SPCDDHI--MVMMRSLNIQINANK 180
            G Y S SD  E+ YA +      KG++  +D E I +   +H   +V+ R L+ Q+   +
Sbjct: 424  GGYSSASDLDEETYALLATNNAGKGDAPHQDEEHIGAEAAEHYESLVVQRVLSAQMERAE 483

Query: 181  TLQRQNIFHTKCTFKEKVCLLVIDSGSCTNVVSTLLIAKL*LPLIEHPNPYRLQWLNDHG 360
              QR  +F TKC  KE+ C ++ID GSC N+ S  ++ KL L    HP PY +QWLN  G
Sbjct: 484  QNQRHTLFQTKCVIKERSCRVIIDRGSCNNLASAEMVEKLALSTQPHPQPYYIQWLNSSG 543

Query: 361  ELKITKQVLTSFEIGKYKDEALCDVVPMNAGHILLGRLWEFDRDVTHNGRSNRYSLTCNG 540
            ++K+T+ V   F IG Y D   CDVVPM A  I LGR W+FD+D  H G+SN+YS   NG
Sbjct: 544  KVKVTRLVRVHFAIGSYHDSINCDVVPMQACSIFLGRPWQFDKDSLHFGKSNQYSFVHNG 603

Query: 541  CKFILLPMAPMQVYEAQLHLKDECDKRYTSLNSEAIKEHTAESLNRSTKAESKK*KKVV- 717
             K +L PM+P      ++ LKDE  +     N E  +     + N   K + K    V  
Sbjct: 604  KKLVLHPMSP------EVILKDELARASKQKNQEHTRSEHLIAANELEKHKKKPTNSVQN 657

Query: 718  -LSGIKVLNDCKVLSASNAKSEKEEIE 795
              + IK+   C + +    KS+ +E++
Sbjct: 658  NKNEIKLKGSCFIAT----KSDLDEVD 680



 Score = 93.2 bits (230), Expect(2) = 3e-57
 Identities = 47/115 (40%), Positives = 72/115 (62%)
 Frame = +3

Query: 894  ILYVLIFCDSYFSEADLPSNLPVEISKLLQDFHNVFPEELSSGLPPLRGIEHQIHFMLGA 1073
            + Y L+  ++ F   D P +LP  ++ LLQ++ ++FP+E+  GLPP+RGIEHQI  + GA
Sbjct: 685  VCYALVCKETLFPIEDTPISLPPPVTNLLQEYADIFPKEVPPGLPPIRGIEHQIDLIPGA 744

Query: 1074 VILNRPAYRSNPKKTKELQR*VEELLSRGQDMLVASEKAAPTVSFEKGWISHNLC 1238
             + NR  YR+NP++TKE+QR V+ELL +G      S  + P +   K   S  +C
Sbjct: 745  SLPNRAPYRTNPEETKEIQRQVQELLDKGYVRESLSPCSIPVLLVPKKDGSWRMC 799


>gb|AAQ56388.1| putative gag-pol polyprotein [Oryza sativa Japonica Group]
            gi|91795218|gb|ABE60890.1| putative polyprotein [Oryza
            sativa Japonica Group]
          Length = 1616

 Score =  155 bits (391), Expect(2) = 8e-57
 Identities = 95/267 (35%), Positives = 140/267 (52%), Gaps = 9/267 (3%)
 Frame = +1

Query: 22   GDYESESDRED----FYADMPPLKGESDGEDREMI-SPCDDHI--MVMMRSLNIQINANK 180
            G Y S SD +       A     +G++  +D E I +   +H   +V+ R L+ Q+   +
Sbjct: 424  GGYSSASDLDGETYALLATNNAREGDAPHQDEEHIGAEAAEHYESLVVQRVLSAQMERAE 483

Query: 181  TLQRQNIFHTKCTFKEKVCLLVIDSGSCTNVVSTLLIAKL*LPLIEHPNPYRLQWLNDHG 360
              QR  +F TKC  KE+ C ++ID GSC N+ S  ++ KL L    HP PY +QWLN  G
Sbjct: 484  QNQRHTLFQTKCVIKERSCRVIIDGGSCNNLASAEMVEKLALSTQPHPQPYYIQWLNSSG 543

Query: 361  ELKITKQVLTSFEIGKYKDEALCDVVPMNAGHILLGRLWEFDRDVTHNGRSNRYSLTCNG 540
            ++K+T+ V   F IG Y D   CDVVPM A  +LLGR W+FD+D  H G+SN+YS   NG
Sbjct: 544  KVKVTRLVRVHFAIGSYHDSINCDVVPMQACSMLLGRPWQFDKDSLHFGKSNQYSFVHNG 603

Query: 541  CKFILLPMAPMQVYEAQLHLKDECDKRYTSLNSEAIKEHTAESLNRSTKAESKK*KKVV- 717
             K +L PM+P      ++ LKDE  +     N E  +     + N   K + K    V  
Sbjct: 604  KKLVLHPMSP------EVILKDELARASKQKNQEHTRSEHLIAANELEKHKKKPTNSVQN 657

Query: 718  -LSGIKVLNDCKVLSASNAKSEKEEIE 795
              + IK+   C + +    KS+ +E++
Sbjct: 658  NKNEIKLKGSCFIAT----KSDLDEVD 680



 Score = 93.6 bits (231), Expect(2) = 8e-57
 Identities = 47/115 (40%), Positives = 72/115 (62%)
 Frame = +3

Query: 894  ILYVLIFCDSYFSEADLPSNLPVEISKLLQDFHNVFPEELSSGLPPLRGIEHQIHFMLGA 1073
            + Y L+  ++ F   D P +LP  ++ LLQ++ ++FP+E+  GLPP+RGIEHQI  + GA
Sbjct: 685  VCYALVCKETLFPIEDTPISLPPPVTNLLQEYADIFPKEVPPGLPPIRGIEHQIDLIPGA 744

Query: 1074 VILNRPAYRSNPKKTKELQR*VEELLSRGQDMLVASEKAAPTVSFEKGWISHNLC 1238
             + NR  YR+NP++TKE+QR V+ELL +G      S  + P +   K   S  +C
Sbjct: 745  SLPNRAPYRTNPEETKEIQRQVQELLDKGYVRESLSPCSVPVLLVPKKDGSWRMC 799


Top