BLASTX nr result

ID: Cephaelis21_contig00014467 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00014467
         (2139 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004173142.1| PREDICTED: uncharacterized protein LOC101232...    85   6e-20
gb|AFN88198.1| retropepsin-like protein [Phaseolus vulgaris]           80   3e-12
emb|CAN69639.1| hypothetical protein VITISV_040272 [Vitis vinifera]    52   8e-09
pir||S66306 hypothetical protein 1 - Arabidopsis thaliana retrot...    56   1e-08
gb|AAF79809.1|AC020646_32 T32E20.9 [Arabidopsis thaliana]              56   2e-08

>ref|XP_004173142.1| PREDICTED: uncharacterized protein LOC101232801 [Cucumis sativus]
          Length = 573

 Score = 85.1 bits (209), Expect(2) = 6e-20
 Identities = 53/136 (38%), Positives = 77/136 (56%), Gaps = 5/136 (3%)
 Frame = -1

Query: 534 LEYIPEIEKLARRLRKETKEKGASSEPPEEL*VDLNV---VGVSLFD--GDESMAAAADR 370
           LE+  +I    RRLRKE K+  +SS  P     + ++   +  SL +   D     A ++
Sbjct: 10  LEFEEDITLHERRLRKEKKKLVSSSLEPHIFTFEPSLEEPLEPSLENHLNDLMGEEAPEK 69

Query: 369 TLKELVVPDLNQQPLCITFPALEVAFELKSGLICLLSTFNGLTGEDPYKHLKEFNVVCSS 190
           TL++L  PD++Q+P+ I  P     F L+S LI  L  F G  GEDP+KHL++F+  C  
Sbjct: 70  TLRKLYEPDIHQRPIGIVIPPTTTNFHLRSVLISNLPIFRGSNGEDPHKHLRDFSWACDL 129

Query: 189 FKPQGVTEEHVKLRAF 142
            +P GVTEE + LRAF
Sbjct: 130 LRPHGVTEEQLNLRAF 145



 Score = 40.8 bits (94), Expect(2) = 6e-20
 Identities = 21/44 (47%), Positives = 29/44 (65%)
 Frame = -3

Query: 139 SLKDKAKR*LLDFPEGSITT*EVMKRLFLEKKFPASKATNIRKE 8
           SL D AKR L++    +ITT E +K+ FLEK FP ++A  + KE
Sbjct: 148 SLADSAKRWLINLESQTITTWEQLKKRFLEKFFPITRAQTVLKE 191


>gb|AFN88198.1| retropepsin-like protein [Phaseolus vulgaris]
          Length = 725

 Score = 79.7 bits (195), Expect = 3e-12
 Identities = 36/63 (57%), Positives = 50/63 (79%)
 Frame = -1

Query: 327 LCITFPALEVAFELKSGLICLLSTFNGLTGEDPYKHLKEFNVVCSSFKPQGVTEEHVKLR 148
           +CI +P  +   ELKSGLI LL  F+GL GEDP+KHLK+F+ +C++ +P GVTE+H+KL+
Sbjct: 1   MCIQYP--DGQCELKSGLIHLLPKFHGLAGEDPHKHLKKFHTMCTTMRPAGVTEKHIKLK 58

Query: 147 AFP 139
           AFP
Sbjct: 59  AFP 61


>emb|CAN69639.1| hypothetical protein VITISV_040272 [Vitis vinifera]
          Length = 437

 Score = 51.6 bits (122), Expect(2) = 8e-09
 Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 7/139 (5%)
 Frame = -1

Query: 534 LEYIP---EIEKLARRLRKETKEKGASSEPPEEL*VDLNVVGVSLFDGDESMAAAADRTL 364
           L+ +P   +I K  R+L ++ K++     P             ++ D +       +R L
Sbjct: 177 LDLVPIDLDINKTLRKLNRKCKQQVIQEVP-------------AMGDENHGDNGVPNRAL 223

Query: 363 KELVVPDLNQQPLCITFPALEVA-FELKSGLICLLST---FNGLTGEDPYKHLKEFNVVC 196
           K+  +P++    L I  P ++   FE+K  +I ++ +   F GL  +DP  H+  F  +C
Sbjct: 224 KDYSIPNVGV--LSIQRPPIQANNFEIKLAIIQMIRSSVQFGGLANDDPNLHIANFLEIC 281

Query: 195 SSFKPQGVTEEHVKLRAFP 139
            +FK  GV ++ ++LR FP
Sbjct: 282 DTFKHNGVIDDAIRLRLFP 300



 Score = 36.6 bits (83), Expect(2) = 8e-09
 Identities = 18/44 (40%), Positives = 27/44 (61%)
 Frame = -3

Query: 139 SLKDKAKR*LLDFPEGSITT*EVMKRLFLEKKFPASKATNIRKE 8
           SL +KAK  L+  P G+ITT + +   FL K FP +K+  +R +
Sbjct: 302 SLNNKAKAWLISLPPGTITTWDGLVNAFLTKYFPPAKSIKMRND 345


>pir||S66306 hypothetical protein 1 - Arabidopsis thaliana retrotransposon
           Athila gi|806535|emb|CAA57397.1| unnamed protein product
           [Arabidopsis thaliana]
          Length = 935

 Score = 55.8 bits (133), Expect(2) = 1e-08
 Identities = 46/143 (32%), Positives = 69/143 (48%), Gaps = 3/143 (2%)
 Frame = -1

Query: 558 HTSQQG-EPLEYIPEIEKLARRLRKETKEKGASSEPPEEL*VDLNVVGVSLFDGDESMAA 382
           HT  QG + L +   I+++AR+LR++T E    ++  +E     N+ G   F  +     
Sbjct: 2   HTRSQGNQNLLFNDNIDRIARQLREQT-ETDTMADVVDEQEQPTNI-GAGDFPHNH---- 55

Query: 381 AADRTLKELVVPDLNQQPLCITFPALEVAFELKSGLICLL--STFNGLTGEDPYKHLKEF 208
                         NQ+   +  P     FE+KSGLI ++  + F+GL  EDP  HL EF
Sbjct: 56  --------------NQRHGIVPPPVQNNNFEIKSGLIAMVQGNKFHGLLMEDPLDHLDEF 101

Query: 207 NVVCSSFKPQGVTEEHVKLRAFP 139
             +C   K  GV+E+  KLR FP
Sbjct: 102 ERLCRLTKINGVSEDGFKLRLFP 124



 Score = 32.0 bits (71), Expect(2) = 1e-08
 Identities = 19/44 (43%), Positives = 23/44 (52%)
 Frame = -3

Query: 139 SLKDKAKR*LLDFPEGSITT*EVMKRLFLEKKFPASKATNIRKE 8
           SL DKA       P GSITT +  K+ FL K F  S+   +R E
Sbjct: 126 SLGDKAHLWEKTLPHGSITTWDDCKKAFLAKFFSNSRTARLRNE 169


>gb|AAF79809.1|AC020646_32 T32E20.9 [Arabidopsis thaliana]
          Length = 1586

 Score = 55.8 bits (133), Expect(2) = 2e-08
 Identities = 46/144 (31%), Positives = 71/144 (49%), Gaps = 2/144 (1%)
 Frame = -1

Query: 564 MTHTSQQGEPLEYIPEIEKLARRLRKETKEKGASSEPPEEL*VDLNVVGVSLFDGDESMA 385
           M   S+  + L +   I+++AR+LR +T+    ++   E+  V  N +G     GD    
Sbjct: 1   MQTRSRGNQNLLFNDNIDRIARQLRTQTETDTMAAVVDEQ--VQPNNIGA----GD---- 50

Query: 384 AAADRTLKELVVPDLNQQPLCITFPALEVAFELKSGLICLLST--FNGLTGEDPYKHLKE 211
           A  +   +  +VP           P     FE+KSGLI ++ +  F+GL  EDP  HL E
Sbjct: 51  APRNHNQRNGIVPP----------PVQNNNFEIKSGLIAMVQSNKFHGLPMEDPLDHLDE 100

Query: 210 FNVVCSSFKPQGVTEEHVKLRAFP 139
           F+ +CS  K   V+E+  KLR FP
Sbjct: 101 FDRLCSLTKINRVSEDGFKLRLFP 124



 Score = 30.8 bits (68), Expect(2) = 2e-08
 Identities = 17/44 (38%), Positives = 24/44 (54%)
 Frame = -3

Query: 139 SLKDKAKR*LLDFPEGSITT*EVMKRLFLEKKFPASKATNIRKE 8
           SL DKA +     P+GSIT+    K+ FL K F  S+   +R +
Sbjct: 126 SLGDKAHQWEKSLPQGSITSWNDCKKAFLAKFFSNSRTARLRND 169


Top