BLASTX nr result

ID: Cocculus22_contig00013941 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus22_contig00013941
         (1155 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CCA66188.1| hypothetical protein [Beta vulgaris subsp. vulga...    83   5e-18
ref|XP_007207799.1| hypothetical protein PRUPE_ppa024472mg, part...    83   3e-13
ref|XP_006491472.1| PREDICTED: uncharacterized protein LOC102626...    82   4e-13
ref|XP_006483194.1| PREDICTED: putative ribonuclease H protein A...    82   4e-13
ref|XP_007207609.1| hypothetical protein PRUPE_ppa018907mg, part...    79   3e-12
gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana]               79   3e-12
emb|CCA66222.1| hypothetical protein [Beta vulgaris subsp. vulga...    77   4e-12
ref|XP_004289367.1| PREDICTED: putative ribonuclease H protein A...    75   4e-11
ref|NP_567266.1| RNA-directed DNA polymerase (reverse transcript...    75   5e-11
gb|ABK28243.1| unknown [Arabidopsis thaliana]                          75   5e-11
gb|ABE65512.1| hypothetical protein At4g04650 [Arabidopsis thali...    75   5e-11
gb|AAD26953.1| putative non-LTR retrolelement reverse transcript...    75   7e-11
gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thal...    73   1e-10
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...    74   2e-10
ref|NP_197389.1| RNA-directed DNA polymerase (reverse transcript...    74   2e-10
gb|AAF79357.1|AC007887_16 F15O4.34 [Arabidopsis thaliana]              72   3e-10
emb|CCA65997.1| hypothetical protein [Beta vulgaris subsp. vulga...    72   3e-10
ref|XP_006584439.1| PREDICTED: putative ribonuclease H protein A...    60   4e-10
ref|XP_007018598.1| Uncharacterized protein TCM_034780 [Theobrom...    72   4e-10
gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00...    72   4e-10

>emb|CCA66188.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1381

 Score = 82.8 bits (203), Expect(2) = 5e-18
 Identities = 78/284 (27%), Positives = 122/284 (42%), Gaps = 18/284 (6%)
 Frame = -1

Query: 816  KIQSRVAWHNIIWFLEHIPNHSITAWRALIGKLPVASRLAARNIIPSPD--CCLGHTCDE 643
            K QS++     +W     P   + +W AL+GKL    +LA  NIIP  D  C + +   E
Sbjct: 1041 KPQSKIRIWGRLWRGLIPPRIEVFSWVALLGKLNSRQKLATLNIIPPDDAVCIMCNGAPE 1100

Query: 642  TENHLFFECQFSSALWSQVLRSIWPHSITIFPILLEAQ*VAEKFQGKSILSSLGKIAFYS 463
            T +HL   C F+S++W   L  IW  S      L EA       +       +    F  
Sbjct: 1101 TSDHLLLHCPFASSIWLWWL-GIWNVSWVFPKNLFEAFEQWYCHKKNPFFRKVWCSIFSI 1159

Query: 462  TIHYVWVERNNMIFRGKRSSVKSLWKKIADILAFKVDG----------EMISHPPPLHSP 313
             I  +W ERN  IFRG   S   L   +   L + + G          E++ HP  L S 
Sbjct: 1160 IIWTIWKERNARIFRGISCSSNKLQDLVIIRLMWWIKGWGEAFPYSIVEVLRHPQCL-SW 1218

Query: 312  NNFIATRWKISISHNVGISSWWSPPSQGMAKLNTDGSLKGHNMGYGGIIRDCRGSPILAY 133
            +   A     ++S +      WSPP+ G+ K N D S+       GG++R+ +G  +  +
Sbjct: 1219 DYLKAAPAATAVSVD---GMLWSPPNDGVMKWNVDASVNAGRSAIGGVLRNSQGIFVCVF 1275

Query: 132  AGQK*GNLVIEAECFALFRGL------SFLRQAGFDSAIVESDA 19
            +       +  AE  A++R +       FL++A     ++ESD+
Sbjct: 1276 SCPIPSIEINSAEIIAIYRAMQICYSFEFLKRA---PLVLESDS 1316



 Score = 36.2 bits (82), Expect(2) = 5e-18
 Identities = 29/110 (26%), Positives = 47/110 (42%), Gaps = 12/110 (10%)
 Frame = -3

Query: 1144 SLRDLVEPLILHLVGKGNGTRLWVDRWHPDGILLWPFDKNFAKTFDLNLHLTRGRGENCF 965
            S R  V+  +   VG G  T  W+D W  D  L   F + F    D  +      G  C 
Sbjct: 917  SARSFVKTKLRKAVGNGVKTLFWLDTWLGDSPLKLRFPRLFT-IVDNPMAYIASCGSWCG 975

Query: 964  QDILTDLGLSNL--------WHDIE----TICKLYANEDDRVIWTPTANG 851
            ++ + +   S +        W +++    ++C L  + DDR+IWTP  +G
Sbjct: 976  REWVWNFSWSRVFRPRDAEEWEELQGLLGSVC-LSPSTDDRLIWTPHKSG 1024


>ref|XP_007207799.1| hypothetical protein PRUPE_ppa024472mg, partial [Prunus persica]
            gi|462403441|gb|EMJ08998.1| hypothetical protein
            PRUPE_ppa024472mg, partial [Prunus persica]
          Length = 920

 Score = 82.8 bits (203), Expect = 3e-13
 Identities = 68/273 (24%), Positives = 112/273 (41%), Gaps = 6/273 (2%)
 Frame = -1

Query: 804  RVAWHNIIWFLEHIPNHSITAWRALIGKLPVASRLAARNIIPSPDCCLGHTCDETENHLF 625
            R  W +I W    +P      WR ++  LP    L  R II SP C + +  +E+E H  
Sbjct: 601  RGVWKDI-WASPTLPKVKFFLWRMMVRALPTKLNLYRRRIISSPFCPICNQYEESEEHAI 659

Query: 624  FECQFSSALW--SQVLRSIWPHSITIFPILLEAQ*VAEKFQGKSILSSLGKIAFYSTIHY 451
            F C ++ A+W  S +   + P SIT F         ++ F     +  L  ++F S    
Sbjct: 660  FLCPWTQAVWFGSPLNYRVNPQSITTFDRWFTGLLNSQMFSKSERVWVLSLVSFISW--E 717

Query: 450  VWVERNNMIFRGKRSSVKSLWKKIADILAFKVDGEMISHPPPLHSPNNFIATRWKISISH 271
            +W  R   +F       + + ++ A   A + D                +  R +IS  +
Sbjct: 718  IWKARCKFLFEDITIDPRCVVERAASA-AEEFD----------------VLRRHEISTRN 760

Query: 270  NVGISSW----WSPPSQGMAKLNTDGSLKGHNMGYGGIIRDCRGSPILAYAGQK*GNLVI 103
              G+ S     W PP  G  K+N D + K H  G G ++R+        +A ++  N  +
Sbjct: 761  GAGVFSQPTDIWKPPVNGAIKINFDAAWKNHEAGLGVVMRNHNKDFCYGFASKRCCNSAL 820

Query: 102  EAECFALFRGLSFLRQAGFDSAIVESDAKIVMD 4
             AE  A    L      G+    +ESD+K+++D
Sbjct: 821  NAETEAAIEALRCASLRGYSKIEMESDSKVLID 853


>ref|XP_006491472.1| PREDICTED: uncharacterized protein LOC102626455 [Citrus sinensis]
          Length = 1452

 Score = 82.0 bits (201), Expect = 4e-13
 Identities = 74/268 (27%), Positives = 108/268 (40%), Gaps = 6/268 (2%)
 Frame = -1

Query: 786  IIWFLEHIPNHSITAWRALIGKLPVASRLAARNIIPSPDCCLGHTCDETENHLFFECQFS 607
            I W L+      I  WRAL   LP A  L  R  +  P C       ET +H+  EC+ +
Sbjct: 1135 IPWMLDLPEKVKIFMWRALKNILPTAENLWKRRSLQEPICQRCKLQVETVSHVLIECKAA 1194

Query: 606  SALWSQVLRSIWP---HSITIFPILLEAQ*VAEKFQGKSILSSLGKIAFYSTIHYVWVER 436
              +W      + P   H+   F  +       ++   +S  +    +  Y  +  +W  R
Sbjct: 1195 RKIWDLAPLIVQPSKDHNQDFFSAI-------QEMWSRSSTAEAELMIVYCWV--IWSAR 1245

Query: 435  NNMIFRGKRSSVKSLWKKIADILAFKVDGEMISHPPPLHSPNNFIATRWKISISHNVGIS 256
            N  IF GK+S  + L  K   +L      + +S P  +H   +    + K          
Sbjct: 1246 NKFIFEGKKSDSRFLAAKADSVLKAY---QRVSKPGNVHGAKDRGIDQQK---------- 1292

Query: 255  SWWSPPSQGMAKLNTDG--SLKGHNMGYGGIIRDCRGSPILAYAGQ-K*GNLVIEAECFA 85
              W PPSQ + KLN D   S K   +G G I+RD  G  +     Q +    V  AE  A
Sbjct: 1293 --WKPPSQNVLKLNVDAAVSTKDQKVGLGAIVRDAEGKILAVGIKQAQFRERVSLAEAEA 1350

Query: 84   LFRGLSFLRQAGFDSAIVESDAKIVMDV 1
            +  GL    Q    S IVESD K V+++
Sbjct: 1351 IHWGLQVANQISSSSLIVESDCKEVVEL 1378


>ref|XP_006483194.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Citrus
            sinensis]
          Length = 765

 Score = 82.0 bits (201), Expect = 4e-13
 Identities = 74/268 (27%), Positives = 108/268 (40%), Gaps = 6/268 (2%)
 Frame = -1

Query: 786  IIWFLEHIPNHSITAWRALIGKLPVASRLAARNIIPSPDCCLGHTCDETENHLFFECQFS 607
            I W L+      I  WRAL   LP A  L  R  +  P C       ET +H+  EC+ +
Sbjct: 448  IPWMLDLPEKVKIFMWRALKNILPTAENLWKRRSLQEPICQRCKLQVETVSHVLIECKAA 507

Query: 606  SALWSQVLRSIWP---HSITIFPILLEAQ*VAEKFQGKSILSSLGKIAFYSTIHYVWVER 436
              +W      + P   H+   F  +       ++   +S  +    +  Y  +  +W  R
Sbjct: 508  RKIWDLAPLIVQPSKDHNQDFFSAI-------QEMWSRSSTAEAELMIVYCWV--IWSAR 558

Query: 435  NNMIFRGKRSSVKSLWKKIADILAFKVDGEMISHPPPLHSPNNFIATRWKISISHNVGIS 256
            N  IF GK+S  + L  K   +L      + +S P  +H   +    + K          
Sbjct: 559  NKFIFEGKKSDSRFLAAKADSVLKAY---QRVSKPGNVHGAKDRGIDQQK---------- 605

Query: 255  SWWSPPSQGMAKLNTDG--SLKGHNMGYGGIIRDCRGSPILAYAGQ-K*GNLVIEAECFA 85
              W PPSQ + KLN D   S K   +G G I+RD  G  +     Q +    V  AE  A
Sbjct: 606  --WKPPSQNVLKLNVDAAVSTKXQKVGLGAIVRDAEGKILAVGIKQAQFRERVSLAEAEA 663

Query: 84   LFRGLSFLRQAGFDSAIVESDAKIVMDV 1
            +  GL    Q    S IVESD K V+++
Sbjct: 664  IHWGLQVANQISSSSLIVESDCKEVVEL 691


>ref|XP_007207609.1| hypothetical protein PRUPE_ppa018907mg, partial [Prunus persica]
            gi|462403251|gb|EMJ08808.1| hypothetical protein
            PRUPE_ppa018907mg, partial [Prunus persica]
          Length = 1566

 Score = 79.3 bits (194), Expect = 3e-12
 Identities = 72/248 (29%), Positives = 108/248 (43%), Gaps = 5/248 (2%)
 Frame = -1

Query: 732  LIGKLPVASRLAA--RNIIPSPDCCLGHTCDETENHLFFECQFSSALWSQVLR--SIWPH 565
            ++ KL V SRL     NI P    C  H   ET NHLFFECQF+  +W  ++   +  PH
Sbjct: 1276 MLKKLQVRSRLYKFLPNIDPECPLCKNHM--ETINHLFFECQFAVNIWRCIIEWLASLPH 1333

Query: 564  SITIFPILLEAQ*VAEKFQGKSILSSLGKIAFYSTIHYVWVERNNMIFRGKRSSVKSLWK 385
            +              +   G +ILS    + +      +W  RNN IF+     +     
Sbjct: 1334 T--------------KAADGPNILSKALLLCW-----QIWEARNNCIFK----DIDPHPV 1370

Query: 384  KIADILA-FKVDGEMISHPPPLHSPNNFIATRWKISISHNVGISSWWSPPSQGMAKLNTD 208
            ++ ++     +D   I+  PP  S         K++I         W PP     K+N D
Sbjct: 1371 RVLNVAGRIGLDYWKINSCPPQKSTG-------KVNIK--------WEPPPLDWVKVNFD 1415

Query: 207  GSLKGHNMGYGGIIRDCRGSPILAYAGQK*GNLVIEAECFALFRGLSFLRQAGFDSAIVE 28
            GS++G+    G +IRD  G+  LA         +  AECFAL  GL+     G+    VE
Sbjct: 1416 GSMRGNLAATGFVIRDWNGNVRLAGTKNSGQVSITVAECFALRDGLAHAIHKGWRKIFVE 1475

Query: 27   SDAKIVMD 4
             D+K+++D
Sbjct: 1476 GDSKLIID 1483


>gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana]
          Length = 1161

 Score = 79.3 bits (194), Expect = 3e-12
 Identities = 43/156 (27%), Positives = 75/156 (48%)
 Frame = -1

Query: 819  QKIQSRVAWHNIIWFLEHIPNHSITAWRALIGKLPVASRLAARNIIPSPDCCLGHTCDET 640
            Q   + V WH  +WF +H+P  +   W     +L    RL        P C L +  DE+
Sbjct: 978  QPSSTSVLWHKAVWFKDHVPKQAFICWVVAHNRLHTRDRLRRWGFSIPPTCVLCNDLDES 1037

Query: 639  ENHLFFECQFSSALWSQVLRSIWPHSITIFPILLEAQ*VAEKFQGKSILSSLGKIAFYST 460
              HLFF CQFSS +WS  +R++  +    F   L     A + +  ++++   K+ F+++
Sbjct: 1038 REHLFFRCQFSSEIWSFFMRALNLNPPPQFMHCLLWTLTASRDRNITLIT---KLLFHAS 1094

Query: 459  IHYVWVERNNMIFRGKRSSVKSLWKKIADILAFKVD 352
            ++++W ERN  I          + K+I  I+  ++D
Sbjct: 1095 VYFIWRERNLRIHSNSVRPAHLIIKEIQLIVRARLD 1130


>emb|CCA66222.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1383

 Score = 76.6 bits (187), Expect(2) = 4e-12
 Identities = 68/243 (27%), Positives = 105/243 (43%), Gaps = 16/243 (6%)
 Frame = -1

Query: 750  ITAWRALIGKLPVASRLAARNIIPSPD--CCLGHTCDETENHLFFECQFSSALWSQVLRS 577
            I  W AL+ K+   S+L    IIP  D  C   +   ET NHL   C+FS  LW+  L +
Sbjct: 1061 IFCWLALLEKINTKSKLGRIGIIPIEDAVCVFCNIGLETTNHLLLHCEFSWKLWTWWL-N 1119

Query: 576  IWPHSITIFPILLEAQ*VAEKFQGK-SILSSLGKIAFYSTIHYVWVERNNMIFRGKRSSV 400
            IW +S   FP  ++      +  G+ +    +    F+  I  +W ERN+ IF    SS+
Sbjct: 1120 IWGYS-WAFPKSIKNAFAQWQIYGRGAFFKKIWHAIFFIIIWSLWKERNSRIFNNSNSSL 1178

Query: 399  KSLWKKIADILAFKV----DGEMISHPPPLHSPNNFIATRWKISISHNVG-------ISS 253
            + +   I   L + V    DG   +    + +P      +W  S   N G       + +
Sbjct: 1179 EEIQDLILTRLCWWVKAWDDGFPFACSEVIRNP---ACLKWTQSKGCNFGTIGPTNLLKA 1235

Query: 252  WWSPPSQGMAKLNTDGSLKG--HNMGYGGIIRDCRGSPILAYAGQK*GNLVIEAECFALF 79
             WSPP     + N D S K    +   GG++RD  G  +  ++       +  AE +A+F
Sbjct: 1236 AWSPPPSNHLQWNVDASFKPGLEHAAVGGVLRDENGCFVCLFSSPIPRLEINSAEIYAIF 1295

Query: 78   RGL 70
            R L
Sbjct: 1296 RAL 1298



 Score = 22.3 bits (46), Expect(2) = 4e-12
 Identities = 23/95 (24%), Positives = 39/95 (41%), Gaps = 10/95 (10%)
 Frame = -3

Query: 1105 VGKGNGTRLWVDRWHPDGILLWPFDKNFAKTFD-----LNLHLTRGRGENC---FQDILT 950
            VGKG  T  W + W  +  L   F + +  T +      +L +  G   +    +Q  L 
Sbjct: 930  VGKGTQTAFWQEIWIGELPLKTLFPRLYRLTINPLATISSLGIWDGHEWHWVLPWQRALR 989

Query: 949  --DLGLSNLWHDIETICKLYANEDDRVIWTPTANG 851
              D+   +  H++     L    DD ++WTP  +G
Sbjct: 990  PRDIEERDALHELLKDVVLDLTNDDYLVWTPNKSG 1024


>ref|XP_004289367.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria
            vesca subsp. vesca]
          Length = 1152

 Score = 75.5 bits (184), Expect = 4e-11
 Identities = 74/272 (27%), Positives = 113/272 (41%), Gaps = 10/272 (3%)
 Frame = -1

Query: 807  SRVAWHNI---IWFLEHIPNHSITAWRALIGKLPVASRLAARNI-IPSPDCCLGHTCDET 640
            S V W  +   +W  +  P   + AWR + G LP  + L  + + +P  +C    T  E 
Sbjct: 795  SDVQWSRLWCKLWRTQVPPKVRMHAWRLVKGTLPSRAALVKKQVQLPDVNCVFCSTNVED 854

Query: 639  ENHLFFECQFSSALWSQVLRSIWPHSITIFPILLEAQ*VAEKFQGKSILSSLGKIAFYST 460
              HLF  C+     W Q +  I P +     + +    + E   G+ +        F   
Sbjct: 855  SLHLFKNCEALQPFWQQGMVQIHPRTHPSISVEVWFWDMVEMLSGEKLEG------FLMA 908

Query: 459  IHYVWVERNNMIFRGKRSSVKSL--WKKIADILAFKVDGEMISHPPPLHSPNNFIATRWK 286
            +  +WVERNNM++RG+  ++ ++  W   + +L +K            H     + TR K
Sbjct: 909  LWVIWVERNNMVWRGQFYNITNMMDWSS-SLLLEYK------------HCHQRSVGTRKK 955

Query: 285  ISISHNVGISSWWSPPSQGMAKLNTDGSLKGHNMGYGG---IIRDCRGSPILAYAGQ-K* 118
                     S W  PPS G  ++N DGS   H  G GG   +IRD +G+ + + A     
Sbjct: 956  -------NKSKWTCPPS-GRLRVNIDGSF-AHEEGRGGVGVVIRDHKGACVASLARPFPN 1006

Query: 117  GNLVIEAECFALFRGLSFLRQAGFDSAIVESD 22
                I  E  AL  GL    Q G+    VESD
Sbjct: 1007 AASAIHMEVEALRAGLLVCVQQGWRDVEVESD 1038


>ref|NP_567266.1| RNA-directed DNA polymerase (reverse transcriptase)-related family
           protein [Arabidopsis thaliana]
           gi|5732057|gb|AAD48956.1|AF149414_5 contains similarity
           to a family of Arabidopsis thaliana predicted proteins,
           which have similarity to reverse transcriptases; see
           T14P8.10 (GB:AF069298) [Arabidopsis thaliana]
           gi|7267223|emb|CAB80830.1| AT4g04650 [Arabidopsis
           thaliana] gi|332657009|gb|AEE82409.1| RNA-directed DNA
           polymerase (reverse transcriptase)-related family
           protein [Arabidopsis thaliana]
          Length = 332

 Score = 75.1 bits (183), Expect = 5e-11
 Identities = 41/150 (27%), Positives = 70/150 (46%)
 Frame = -1

Query: 801 VAWHNIIWFLEHIPNHSITAWRALIGKLPVASRLAARNIIPSPDCCLGHTCDETENHLFF 622
           V WH  +WF  H+P H+   W     +L    RL    +    +C L +  D++  HLFF
Sbjct: 127 VPWHKAVWFKNHVPKHAFICWVVAWNRLHTRDRLQNWGLSIPAECLLCNAHDDSRAHLFF 186

Query: 621 ECQFSSALWSQVLRSIWPHSITIFPILLEAQ*VAEKFQGKSILSSLGKIAFYSTIHYVWV 442
           ECQFS  +W     S    ++     L++          +  +  + ++AF+S ++ +W 
Sbjct: 187 ECQFSGVVWRFFTAST---NLNPPAQLMDCLNWLLSPSREKNICLIIRLAFHSCVYAIWR 243

Query: 441 ERNNMIFRGKRSSVKSLWKKIADILAFKVD 352
           ERN  +  G   S +S+ K I  I+  ++D
Sbjct: 244 ERNQRLHSGVSRSTESILKDIQLIIRARLD 273


>gb|ABK28243.1| unknown [Arabidopsis thaliana]
          Length = 297

 Score = 75.1 bits (183), Expect = 5e-11
 Identities = 41/150 (27%), Positives = 70/150 (46%)
 Frame = -1

Query: 801 VAWHNIIWFLEHIPNHSITAWRALIGKLPVASRLAARNIIPSPDCCLGHTCDETENHLFF 622
           V WH  +WF  H+P H+   W     +L    RL    +    +C L +  D++  HLFF
Sbjct: 127 VPWHKAVWFKNHVPKHAFICWVVAWNRLHTRDRLQNWGLSIPAECLLCNAHDDSRAHLFF 186

Query: 621 ECQFSSALWSQVLRSIWPHSITIFPILLEAQ*VAEKFQGKSILSSLGKIAFYSTIHYVWV 442
           ECQFS  +W     S    ++     L++          +  +  + ++AF+S ++ +W 
Sbjct: 187 ECQFSGVVWRFFTAST---NLNPPAQLMDCLNWLLSPSREKNICLIIRLAFHSCVYAIWR 243

Query: 441 ERNNMIFRGKRSSVKSLWKKIADILAFKVD 352
           ERN  +  G   S +S+ K I  I+  ++D
Sbjct: 244 ERNQRLHSGVSRSTESILKDIQLIIRARLD 273


>gb|ABE65512.1| hypothetical protein At4g04650 [Arabidopsis thaliana]
          Length = 296

 Score = 75.1 bits (183), Expect = 5e-11
 Identities = 41/150 (27%), Positives = 70/150 (46%)
 Frame = -1

Query: 801 VAWHNIIWFLEHIPNHSITAWRALIGKLPVASRLAARNIIPSPDCCLGHTCDETENHLFF 622
           V WH  +WF  H+P H+   W     +L    RL    +    +C L +  D++  HLFF
Sbjct: 127 VPWHKAVWFKNHVPKHAFICWVVAWNRLHTRDRLQNWGLSIPAECLLCNAHDDSRAHLFF 186

Query: 621 ECQFSSALWSQVLRSIWPHSITIFPILLEAQ*VAEKFQGKSILSSLGKIAFYSTIHYVWV 442
           ECQFS  +W     S    ++     L++          +  +  + ++AF+S ++ +W 
Sbjct: 187 ECQFSGVVWRFFTAST---NLNPPAQLMDCLNWLLSPSREKNICLIIRLAFHSCVYAIWR 243

Query: 441 ERNNMIFRGKRSSVKSLWKKIADILAFKVD 352
           ERN  +  G   S +S+ K I  I+  ++D
Sbjct: 244 ERNQRLHSGVSRSTESILKDIQLIIRARLD 273


>gb|AAD26953.1| putative non-LTR retrolelement reverse transcriptase [Arabidopsis
           thaliana]
          Length = 323

 Score = 74.7 bits (182), Expect = 7e-11
 Identities = 48/174 (27%), Positives = 84/174 (48%), Gaps = 1/174 (0%)
 Frame = -1

Query: 801 VAWHNIIWFLEHIPNHSITAWRALIGKLPVASRLAARNIIPSPDCCLGHTCDETENHLFF 622
           V W   +WF + IP H+   W A   +L    RL    +     C L +  DET +HLFF
Sbjct: 152 VPWQKSVWFKDRIPKHAFICWVAAWKRLHTRDRLTQWGLNIPTVCVLCNVVDETHDHLFF 211

Query: 621 ECQFSSALWS-QVLRSIWPHSITIFPILLEAQ*VAEKFQGKSILSSLGKIAFYSTIHYVW 445
           +CQFS+ +WS  ++R+         PILL      +       LS + K+ F ++++ +W
Sbjct: 212 QCQFSNEIWSFFMIRAGMTPPHLFGPILL----WLKSASSSKNLSLIIKLLFQASVYLIW 267

Query: 444 VERNNMIFRGKRSSVKSLWKKIADILAFKVDGEMISHPPPLHSPNNFIATRWKI 283
            ERN  I      +  ++ K++  ++  ++D      P  L S ++ +AT +++
Sbjct: 268 RERNCRIHTTHSRTPPTIIKEVQQLIRARLDPICRERPVGL-SRSSLLATWFEL 320


>gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thaliana]
          Length = 629

 Score = 72.8 bits (177), Expect(2) = 1e-10
 Identities = 41/135 (30%), Positives = 61/135 (45%), Gaps = 5/135 (3%)
 Frame = -1

Query: 819 QKIQSRVAWHNIIWFLEHIPNHSITAWRALIGKLPVASRLAARNIIPSPDCCLGHTCDET 640
           +K  + VAW+  +WF    P +    W AL  +L    R+   N      C    T  ET
Sbjct: 454 RKKSNEVAWYKGVWFSHSTPKYQFCTWLALRNRLSTGYRMQLWNNGSDVKCTFCSTSIET 513

Query: 639 ENHLFFECQFSSALWSQVLRSIWPHSI-----TIFPILLEAQ*VAEKFQGKSILSSLGKI 475
            +HLFF C ++SA+W+ + +++  H       TI   + E        Q   I S L + 
Sbjct: 514 RDHLFFSCSYASAIWTAIAKNVLQHRFSTDWQTIVNYISET-------QTDRIRSFLSRY 566

Query: 474 AFYSTIHYVWVERNN 430
            F  T+H VW ERN+
Sbjct: 567 IFQLTVHTVWKERND 581



 Score = 21.2 bits (43), Expect(2) = 1e-10
 Identities = 9/16 (56%), Positives = 10/16 (62%)
 Frame = -2

Query: 848 FSFKFVWNVVRKSNPE 801
           FS K  WN VRK + E
Sbjct: 444 FSTKDTWNQVRKKSNE 459


>dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 1223

 Score = 73.6 bits (179), Expect = 2e-10
 Identities = 42/126 (33%), Positives = 61/126 (48%), Gaps = 1/126 (0%)
 Frame = -1

Query: 807  SRVAWHNIIWFLEHIPNHSITAWRALIGKLPVASRLAARNIIPSPDCCLGHTCDETENHL 628
            +RV WH +IWF    P +S  +W A  G+LP   R+       + DC       ET +HL
Sbjct: 1052 ARVPWHKVIWFSHATPKYSFCSWLAAHGRLPTGDRMINWANGIATDCIFCQGTLETRDHL 1111

Query: 627  FFECQFSSALWSQVLRSIWPHSITI-FPILLEAQ*VAEKFQGKSILSSLGKIAFYSTIHY 451
            FF C F+S +W  + R I+    T  +  ++EA       Q   +   L +  F +TI+ 
Sbjct: 1112 FFTCSFTSVIWVDLARGIFKTQYTSHWQSIIEA---ITNSQHHRVEWFLRRYVFQATIYI 1168

Query: 450  VWVERN 433
            VW ERN
Sbjct: 1169 VWRERN 1174


>ref|NP_197389.1| RNA-directed DNA polymerase (reverse transcriptase)-related family
           protein [Arabidopsis thaliana]
           gi|332005241|gb|AED92624.1| RNA-directed DNA polymerase
           (reverse transcriptase)-related family protein
           [Arabidopsis thaliana]
          Length = 295

 Score = 73.6 bits (179), Expect = 2e-10
 Identities = 44/137 (32%), Positives = 62/137 (45%)
 Frame = -1

Query: 801 VAWHNIIWFLEHIPNHSITAWRALIGKLPVASRLAARNIIPSPDCCLGHTCDETENHLFF 622
           V W  ++WF E+IP  S+  W + + +LP   RL    +       L    DET  HLFF
Sbjct: 125 VPWAKVVWFKEYIPRFSLITWMSFLERLPTRDRLRGWGMNIPSSWVLCSNGDETHAHLFF 184

Query: 621 ECQFSSALWSQVLRSIWPHSITIFPILLEAQ*VAEKFQGKSILSSLGKIAFYSTIHYVWV 442
           EC FS A+W        P      P    A     +   +S  +++ K+   S +++VW 
Sbjct: 185 ECSFSLAIWEFFASKFRPSPPFGLP---AASSWILQLPLRSHSTTILKLLLQSAVYHVWK 241

Query: 441 ERNNMIFRGKRSSVKSL 391
           ERN  IF    SS  SL
Sbjct: 242 ERNARIFTSISSSASSL 258


>gb|AAF79357.1|AC007887_16 F15O4.34 [Arabidopsis thaliana]
          Length = 236

 Score = 72.4 bits (176), Expect = 3e-10
 Identities = 44/144 (30%), Positives = 68/144 (47%), Gaps = 1/144 (0%)
 Frame = -1

Query: 813 IQSRVAWHNIIWFLEHIPNHSITAWRALIGKLPVASRLAARNIIPSPDCCLGHTCDETEN 634
           I   V W+  +WF    P +S   W A++ +L    R+   N   S  C L H   ET +
Sbjct: 96  ISMDVDWYKGVWFGHSTPKYSFCVWLAVLNRLSTGDRMTHWNGGQSAACVLCHNAPETRD 155

Query: 633 HLFFECQFSSALWSQVLRSIWPHSI-TIFPILLEAQ*VAEKFQGKSILSSLGKIAFYSTI 457
           HLFF C F+S +WS + R I+     T +  L++A  ++  +    + S   +  F +T+
Sbjct: 156 HLFFSCDFASIVWSNLARGIYGDRFSTHWQDLIQA--ISGSWM-TPLDSFFARYLFQATV 212

Query: 456 HYVWVERNNMIFRGKRSSVKSLWK 385
           H +W ERN      K +S   L K
Sbjct: 213 HTIWRERNGRNHGEKPNSAALLIK 236


>emb|CCA65997.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1363

 Score = 72.4 bits (176), Expect = 3e-10
 Identities = 73/283 (25%), Positives = 111/283 (39%), Gaps = 7/283 (2%)
 Frame = -1

Query: 828  ECGQKIQSRVAWHNIIWFLEHIPNHSITAWRALIGKLPVASRLAARNIIPSPDCCLGHTC 649
            E G K   R  W   I F      + +  W  +   LP A  LA R    +P C      
Sbjct: 1031 ETGGKGSWRGLWRKNIPF-----KYKLLIWNGIHNILPTALFLAKRIHNFNPQCVACDHP 1085

Query: 648  DETENHLFFECQFSSALWSQVLRSIWPHSITIFPILLEAQ*VAEKFQGKSILSSLGKIAF 469
             E   HLF +C  +S++W ++L+   P++  +F  L   + +           +    AF
Sbjct: 1086 IEDMIHLFRDCCVASSVWIEILKHHKPNNQNLFFNLEWEEWIDFNLNQHDYWVTKFTTAF 1145

Query: 468  YSTIHYVWVERNNMIFRGKRSSVKSLWKKI-----ADILAFKVDGEMISHPPPLHSPNNF 304
            +    ++W  RN  +F    +  K  + ++      +I AF+V              NN 
Sbjct: 1146 W----HIWCSRNKTVFECAVNHPKFTYNRVVADFFTNIRAFQV--------------NNT 1187

Query: 303  IATRWKISISHNVGISSWWSPPSQGMAKLNTDGSLKG--HNMGYGGIIRDCRGSPILAYA 130
                 K+ +         W PP QG  KLNTDG+ K    N G GG+ RD  G+  L +A
Sbjct: 1188 QGNGSKVVLR--------WKPPHQGFLKLNTDGAWKADWENAGIGGVFRDAVGNWELGFA 1239

Query: 129  GQK*GNLVIEAECFALFRGLSFLRQAGFDSAIVESDAKIVMDV 1
             +        AE  A+  GL       +    VE DAK V+ +
Sbjct: 1240 KRVDAGSPEAAELMAIREGLQVAWDCNYHKLEVECDAKGVVQL 1282


>ref|XP_006584439.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Glycine
           max]
          Length = 396

 Score = 59.7 bits (143), Expect(2) = 4e-10
 Identities = 56/236 (23%), Positives = 96/236 (40%), Gaps = 2/236 (0%)
 Frame = -1

Query: 783 IWFLEHIPNHSITAWRALIGKLPVASRLAARNIIPSPDCCLGHTCDETENHLFFECQFSS 604
           +W ++   N     W      LP    L  R++  +P CC      E+  HL  +C  + 
Sbjct: 189 MWLMKIPQNIKFFLWLTSHKSLPTKFFLVYRHLSSNPFCCRCSNQVESVLHLLRDCDKAC 248

Query: 603 ALWSQVLRSIWPHSITIFPILLEAQ*VAEKFQGKSILSSLGKIAFYSTIHYVWVERNNMI 424
           ++WS       P  +  F     A+  +  +  K    + G + F     ++W +RN M 
Sbjct: 249 SVWSM----FQPTLVVDF-----AEHDSSVWLHKHATCATGAL-FCLICWFIWRDRNAMT 298

Query: 423 FRGKRSSVKSLWKKIADILAFKVDGEMISHPPPLHSPNNFIATRWKISISHNVGISSWWS 244
           F  +       W++    +A +V+  +           N I+ + +    +   +   W 
Sbjct: 299 FSNEN------WQEW--FIASQVNNML-----------NIISNQQECQPRNRYTVQVAWK 339

Query: 243 PPSQGMAKLNTDGS--LKGHNMGYGGIIRDCRGSPILAYAGQK*GNLVIEAECFAL 82
           PP   + KLNTDGS  +     G+GG+IRD +   I+ Y G       ++AE FAL
Sbjct: 340 PPPPTVLKLNTDGSSLVNPGQSGFGGVIRDSQRDWIIGYTGSCGVTTSLQAELFAL 395



 Score = 32.7 bits (73), Expect(2) = 4e-10
 Identities = 29/114 (25%), Positives = 51/114 (44%)
 Frame = -3

Query: 1135 DLVEPLILHLVGKGNGTRLWVDRWHPDGILLWPFDKNFAKTFDLNLHLTRGRGENCFQDI 956
            +L+EP     VG G+   +W DRW+P+G L    D    + F+L L      G   + ++
Sbjct: 69   ELLEPGFRFRVGTGD-MPVWYDRWNPNGFLCDMVDYVNIQDFNLTLKDVYENGMWLWNNM 127

Query: 955  LTDLGLSNLWHDIETICKLYANEDDRVIWTPTANGNSLSSLCGMWSENPIPSSL 794
             T +  S +  +  ++  L +   D VIW+   N   ++     W ++    SL
Sbjct: 128  ATIIP-SQVPQEFNSLF-LNSTIADTVIWSAAQNHVFMAKTAYWWLQSQANVSL 179


>ref|XP_007018598.1| Uncharacterized protein TCM_034780 [Theobroma cacao]
           gi|508723926|gb|EOY15823.1| Uncharacterized protein
           TCM_034780 [Theobroma cacao]
          Length = 398

 Score = 72.0 bits (175), Expect = 4e-10
 Identities = 85/297 (28%), Positives = 126/297 (42%), Gaps = 25/297 (8%)
 Frame = -1

Query: 831 VECGQKIQSRVAWHNIIWFLEHIPNHSITAWRALIGKLPVASRLAARNIIP--SPDCCLG 658
           + C   I +       +W     P   +  W+ L+GK+ V   L  R +I   +  C L 
Sbjct: 58  IHCQSNIWASQPHWRQLWKGHAPPKIEVFTWQVLLGKVAVKHELFKRGLIDINTSFCTLC 117

Query: 657 HTCDETENHLFFECQFSSALWSQVLRSIWPHSITIFPILLEAQ*VAEKF-----QGKSIL 493
           +   ET +HLFF C   S  W+     IW H+ +++ +       A  F       K   
Sbjct: 118 NAELETSSHLFFTC---SVAWN-----IWMHNCSLWGLSWVHPGDATSFFVSWQNNKPPY 169

Query: 492 SS--LGKIAFYSTIHYVWVERNNMIFRGKRSSVKSLWKKIADILAFKVDGEM-ISHPPPL 322
            S  +  + F+ST+  +W+ RN ++F+GK   V  L   I   LA    G+  ++H P  
Sbjct: 170 GSPEIWHMLFFSTLWSIWLCRNEILFQGKHLDVNQLQDIILVRLAHWCKGKWPVNHIPAS 229

Query: 321 HSPNNFIATRWKISIS----HNVGISSWWSPPSQGMAKLNTDGSLKGH--NMGYGGIIRD 160
           H    F+    +I I+        + SW  PP+ G  KLN DGS  G     G  G IRD
Sbjct: 230 H----FLFEPSRICINSRKCKTKVVCSWMRPPT-GSFKLNVDGSALGKPGPTGIRGAIRD 284

Query: 159 CR-------GSPILAYAGQK*GNLVIEAECFALFRGLSFLRQAGFDSAI--VESDAK 16
                     +PI    G +  N    AE  A+  GLSF   + + S+   VESD+K
Sbjct: 285 HESFIKGVFSTPI----GMEDSNY---AEFLAIKEGLSFFFSSPWASSTLHVESDSK 334


>gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00078 [Arabidopsis
            thaliana]
          Length = 1253

 Score = 72.0 bits (175), Expect = 4e-10
 Identities = 38/124 (30%), Positives = 61/124 (49%)
 Frame = -1

Query: 795  WHNIIWFLEHIPNHSITAWRALIGKLPVASRLAARNIIPSPDCCLGHTCDETENHLFFEC 616
            W   +WF   +P H+   W + + +LP   RLAA  +  + DCCL  +  E+ +HL   C
Sbjct: 922  WTKSVWFKGSVPKHAFNMWVSHLNRLPTRQRLAAWGVTTTTDCCLCSSRPESRDHLLLYC 981

Query: 615  QFSSALWSQVLRSIWPHSITIFPILLEAQ*VAEKFQGKSILSSLGKIAFYSTIHYVWVER 436
             FS+ +W  V   + P S  IF    E      +       S L KIA  +++ ++W +R
Sbjct: 982  VFSAVIWKLVFFRLTP-SQAIFNSWAELL-SWTRINSSKAPSLLRKIAAQASVFHLWKQR 1039

Query: 435  NNMI 424
            NN++
Sbjct: 1040 NNVL 1043

