BLASTX nr result

ID: Cocculus23_contig00030146 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00030146
         (1191 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|NP_197389.1| RNA-directed DNA polymerase (reverse transcript...    94   1e-16
ref|NP_567266.1| RNA-directed DNA polymerase (reverse transcript...    86   3e-14
gb|ABK28243.1| unknown [Arabidopsis thaliana]                          86   3e-14
gb|ABE65512.1| hypothetical protein At4g04650 [Arabidopsis thali...    86   3e-14
gb|AAC33226.1| putative non-LTR retroelement reverse transcripta...    85   7e-14
gb|AAC63678.1| putative non-LTR retroelement reverse transcripta...    84   1e-13
ref|XP_003621690.1| Cytochrome c biogenesis protein ccsA [Medica...    80   1e-13
gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]                83   2e-13
ref|XP_004309472.1| PREDICTED: putative ribonuclease H protein A...    83   3e-13
emb|CAB39942.1| putative protein [Arabidopsis thaliana] gi|72678...    82   3e-13
gb|ABE65413.1| hypothetical protein At1g62890 [Arabidopsis thali...    82   5e-13
gb|ABK28152.1| unknown [Arabidopsis thaliana]                          82   5e-13
ref|XP_004242524.1| PREDICTED: uncharacterized protein LOC101258...    60   5e-13
gb|AAD26953.1| putative non-LTR retrolelement reverse transcript...    81   8e-13
gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thal...    81   1e-12
ref|XP_007213453.1| hypothetical protein PRUPE_ppa024777mg, part...    68   3e-12
gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana]               79   4e-12
gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana]              79   5e-12
ref|XP_004309110.1| PREDICTED: putative ribonuclease H protein A...    71   7e-12
ref|XP_004228797.1| PREDICTED: putative ribonuclease H protein A...    53   1e-11

>ref|NP_197389.1| RNA-directed DNA polymerase (reverse transcriptase)-related family
           protein [Arabidopsis thaliana]
           gi|332005241|gb|AED92624.1| RNA-directed DNA polymerase
           (reverse transcriptase)-related family protein
           [Arabidopsis thaliana]
          Length = 295

 Score = 93.6 bits (231), Expect = 1e-16
 Identities = 58/147 (39%), Positives = 75/147 (51%), Gaps = 6/147 (4%)
 Frame = +3

Query: 12  FSYKSA*DLIRKKSPFVSWWQLVWFPDYIPRNSMTLWKVFWERLATGSRLQNLGLNVPQA 191
           FS +   + IR  SP V W ++VWF +YIPR S+  W  F ERL T  RL+  G+N+P +
Sbjct: 109 FSSRDTWEQIRVHSPTVPWAKVVWFKEYIPRFSLITWMSFLERLPTRDRLRGWGMNIPSS 168

Query: 192 CSLCWNGIEDINHLFFKCEFSSRILQKVMRKCFSR------VSLSWKQLADLI*RSHGPN 353
             LC NG E   HLFF+C FS  I +    K           + SW  +  L  RSH   
Sbjct: 169 WVLCSNGDETHAHLFFECSFSLAIWEFFASKFRPSPPFGLPAASSW--ILQLPLRSHSTT 226

Query: 354 SLAGKLLRLAFGSTVAHIWWERNMRRF 434
                +L+L   S V H+W ERN R F
Sbjct: 227 -----ILKLLLQSAVYHVWKERNARIF 248


>ref|NP_567266.1| RNA-directed DNA polymerase (reverse transcriptase)-related family
           protein [Arabidopsis thaliana]
           gi|5732057|gb|AAD48956.1|AF149414_5 contains similarity
           to a family of Arabidopsis thaliana predicted proteins,
           which have similarity to reverse transcriptases; see
           T14P8.10 (GB:AF069298) [Arabidopsis thaliana]
           gi|7267223|emb|CAB80830.1| AT4g04650 [Arabidopsis
           thaliana] gi|332657009|gb|AEE82409.1| RNA-directed DNA
           polymerase (reverse transcriptase)-related family
           protein [Arabidopsis thaliana]
          Length = 332

 Score = 85.9 bits (211), Expect = 3e-14
 Identities = 57/174 (32%), Positives = 87/174 (50%), Gaps = 3/174 (1%)
 Frame = +3

Query: 3   SGKFSYKSA*DLIRKKSPFVSWWQLVWFPDYIPRNSMTLWKVFWERLATGSRLQNLGLNV 182
           S +FS       +  +S  V W + VWF +++P+++   W V W RL T  RLQN GL++
Sbjct: 108 SNRFSAPRTWSALHPQSHTVPWHKAVWFKNHVPKHAFICWVVAWNRLHTRDRLQNWGLSI 167

Query: 183 PQACSLCWNGIEDINHLFFKCEFSSRILQKVMRKCFSRVSLS-WKQLADLI*RSHGPNSL 359
           P  C LC    +   HLFF+C+FS      V R   +  +L+   QL D +     P+  
Sbjct: 168 PAECLLCNAHDDSRAHLFFECQFSG----VVWRFFTASTNLNPPAQLMDCLNWLLSPSRE 223

Query: 360 AG--KLLRLAFGSTVAHIWWERNMRRFQRKNRSILQIVGVVMEEVKTKVEDCSR 515
                ++RLAF S V  IW ERN R     +RS   I+  +   ++ +++  SR
Sbjct: 224 KNICLIIRLAFHSCVYAIWRERNQRLHSGVSRSTESILKDIQLIIRARLDPLSR 277


>gb|ABK28243.1| unknown [Arabidopsis thaliana]
          Length = 297

 Score = 85.9 bits (211), Expect = 3e-14
 Identities = 57/174 (32%), Positives = 87/174 (50%), Gaps = 3/174 (1%)
 Frame = +3

Query: 3   SGKFSYKSA*DLIRKKSPFVSWWQLVWFPDYIPRNSMTLWKVFWERLATGSRLQNLGLNV 182
           S +FS       +  +S  V W + VWF +++P+++   W V W RL T  RLQN GL++
Sbjct: 108 SNRFSAPRTWSALHPQSHTVPWHKAVWFKNHVPKHAFICWVVAWNRLHTRDRLQNWGLSI 167

Query: 183 PQACSLCWNGIEDINHLFFKCEFSSRILQKVMRKCFSRVSLS-WKQLADLI*RSHGPNSL 359
           P  C LC    +   HLFF+C+FS      V R   +  +L+   QL D +     P+  
Sbjct: 168 PAECLLCNAHDDSRAHLFFECQFSG----VVWRFFTASTNLNPPAQLMDCLNWLLSPSRE 223

Query: 360 AG--KLLRLAFGSTVAHIWWERNMRRFQRKNRSILQIVGVVMEEVKTKVEDCSR 515
                ++RLAF S V  IW ERN R     +RS   I+  +   ++ +++  SR
Sbjct: 224 KNICLIIRLAFHSCVYAIWRERNQRLHSGVSRSTESILKDIQLIIRARLDPLSR 277


>gb|ABE65512.1| hypothetical protein At4g04650 [Arabidopsis thaliana]
          Length = 296

 Score = 85.9 bits (211), Expect = 3e-14
 Identities = 57/174 (32%), Positives = 87/174 (50%), Gaps = 3/174 (1%)
 Frame = +3

Query: 3   SGKFSYKSA*DLIRKKSPFVSWWQLVWFPDYIPRNSMTLWKVFWERLATGSRLQNLGLNV 182
           S +FS       +  +S  V W + VWF +++P+++   W V W RL T  RLQN GL++
Sbjct: 108 SNRFSAPRTWSALHPQSHTVPWHKAVWFKNHVPKHAFICWVVAWNRLHTRDRLQNWGLSI 167

Query: 183 PQACSLCWNGIEDINHLFFKCEFSSRILQKVMRKCFSRVSLS-WKQLADLI*RSHGPNSL 359
           P  C LC    +   HLFF+C+FS      V R   +  +L+   QL D +     P+  
Sbjct: 168 PAECLLCNAHDDSRAHLFFECQFSG----VVWRFFTASTNLNPPAQLMDCLNWLLSPSRE 223

Query: 360 AG--KLLRLAFGSTVAHIWWERNMRRFQRKNRSILQIVGVVMEEVKTKVEDCSR 515
                ++RLAF S V  IW ERN R     +RS   I+  +   ++ +++  SR
Sbjct: 224 KNICLIIRLAFHSCVYAIWRERNQRLHSGVSRSTESILKDIQLIIRARLDPLSR 277


>gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1529

 Score = 84.7 bits (208), Expect = 7e-14
 Identities = 48/165 (29%), Positives = 84/165 (50%), Gaps = 1/165 (0%)
 Frame = +3

Query: 9    KFSYKSA*DLIRKKSPFVSWWQLVWFPDYIPRNSMTLWKVFWERLATGSRLQNLGLNVPQ 188
            +F  K   + +R   P  +W++ VWFP   P+ S  LW     RL+TG R++        
Sbjct: 1335 RFITKVTWNNVRTHQPQQNWYKGVWFPYSTPKYSFLLWLTVQNRLSTGDRIKAWNSGQLV 1394

Query: 189  ACSLCWNGIEDINHLFFKCEFSSRILQKVMRKCFS-RVSLSWKQLADLI*RSHGPNSLAG 365
             C+LC N  E  +HLFF C+++S + + + ++  S   S  W +L  L+  S+ P     
Sbjct: 1395 TCTLCNNAEETRDHLFFSCQYTSYVWEALTQRLLSTNYSRDWNRLFTLLCTSNLPRDHL- 1453

Query: 366  KLLRLAFGSTVAHIWWERNMRRFQRKNRSILQIVGVVMEEVKTKV 500
             L R  F +++ HIW ERN RR    +    +++ ++ + V+ ++
Sbjct: 1454 FLFRYVFQASIYHIWRERNARRHGEISSPTNRLIKLIDKTVRNRI 1498


>gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1216

 Score = 84.0 bits (206), Expect = 1e-13
 Identities = 49/165 (29%), Positives = 79/165 (47%), Gaps = 1/165 (0%)
 Frame = +3

Query: 9    KFSYKSA*DLIRKKSPFVSWWQLVWFPDYIPRNSMTLWKVFWERLATGSRLQNLGLNVPQ 188
            +FS K   + IR  S   +W + VWF    P+ S   W     RL+TG R+       P 
Sbjct: 759  RFSTKDTWNHIRTSSNQRAWHKGVWFAHATPKFSFCAWLAIRNRLSTGDRMMTWNNGTPT 818

Query: 189  ACSLCWNGIEDINHLFFKCEFSSRILQKVMRKCF-SRVSLSWKQLADLI*RSHGPNSLAG 365
             C  C + +E  +HLFF+C +SS I   + +  +  R S  W  + + I  S  P+ +  
Sbjct: 819  TCVFCSSPMETRDHLFFQCCYSSEIWTSIAKNVYKDRFSTKWSAVVNYISDSQ-PDRIQS 877

Query: 366  KLLRLAFGSTVAHIWWERNMRRFQRKNRSILQIVGVVMEEVKTKV 500
             L R  F  ++  IW ERN RR   K+RS   ++  + + ++ ++
Sbjct: 878  FLSRYTFQVSIHSIWRERNSRRHGEKSRSASNLIRQIDKTIRNQL 922


>ref|XP_003621690.1| Cytochrome c biogenesis protein ccsA [Medicago truncatula]
           gi|355496705|gb|AES77908.1| Cytochrome c biogenesis
           protein ccsA [Medicago truncatula]
          Length = 666

 Score = 80.5 bits (197), Expect(2) = 1e-13
 Identities = 69/282 (24%), Positives = 123/282 (43%), Gaps = 4/282 (1%)
 Frame = +3

Query: 6   GKFSYKSA*DLIRKKSPFVSWWQLVWFPDYIPRNSMTLWKVFWERLATGSRLQNLGLNVP 185
           G  + + A   I +    V W + +W     P  S   W++   +L T   L+  G  + 
Sbjct: 16  GDLTNQLAYKFINETGNHVLWDKFLWNSYIPPSRSFITWRLLHNKLPTDENLRKRGCLIV 75

Query: 186 QACSLCWNGIEDINHLFFKCEFSSRILQKVMRKCFSRVSLSWKQLADLI*RSHGPNS-LA 362
             C  C    E   H+FF+C  +SR+   + +     +  S      L+ R+ G  S L 
Sbjct: 76  SICCFCMKSAESSQHIFFECHVTSRLWDWLGKGTDKLLDCS--SCLQLLIRNWGSGSKLV 133

Query: 363 GKLLRLAFGSTVAHIWWERNMRRFQRKNRSILQIVGVVMEEVKTKVEDCSRPAPCTKCNF 542
             +L  A   T+  IW ERN R F  K++++  +  +++ EVK     C         ++
Sbjct: 134 NNILNSAIIHTIWSIWIERNQRCFHNKHQAMTTLFNIILAEVKMSFSLCMIKGNSAMQDY 193

Query: 543 FISRNWHMGFD-WQAKSNLLVSWQPPPYG*ICLNNSDGSYSPLR--AGFGAVLRCPNGLP 713
            +++ +++ F   +   +L + W+ PP G I   N DGS          G V+R  N   
Sbjct: 194 KVAKLFNIPFKVKRVTPHLDIIWK-PPIGDIVKINCDGSSVGRHPCGSIGIVIRDSNHHF 252

Query: 714 LLAIAGVEAPISVVDAEAKALLEGILLAISLGVINLELQTDS 839
           L AI+      + ++AE  A +  +  A  + ++++ L+TDS
Sbjct: 253 LGAISSNIGNATPLEAEFCAGMMAMEKAQEMQLMHVCLETDS 294



 Score = 23.5 bits (49), Expect(2) = 1e-13
 Identities = 9/18 (50%), Positives = 10/18 (55%)
 Frame = +2

Query: 944 CKHVYREANSPVDWLASH 997
           C H+ RE N   D LA H
Sbjct: 329 CVHILREGNMVADALAKH 346


>gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]
          Length = 872

 Score = 83.2 bits (204), Expect = 2e-13
 Identities = 49/166 (29%), Positives = 81/166 (48%), Gaps = 3/166 (1%)
 Frame = +3

Query: 12   FSYKSA*DLIRKKSPFVSWWQLVWFPDYIPRNSMTLWKVFWERLATGSRL--QNLGLNVP 185
            FS +    LI+  S  VSW + VWF    P+ ++  W     RL TG R+   N   +V 
Sbjct: 685  FSTRDTWHLIKATSSTVSWHKGVWFRHATPKYALCTWLAIHNRLPTGDRMLKWNSSGSVS 744

Query: 186  QACSLCWNGIEDINHLFFKCEFSSRILQKVMRKCF-SRVSLSWKQLADLI*RSHGPNSLA 362
              C LC N  + + HLFF C ++S +   + +  + +R S  W  L   I  +H  + + 
Sbjct: 745  GNCVLCTNNSKTLEHLFFSCSYASTVWAALAKGIWKTRYSTRWSHLLTHI-STHFQDRVE 803

Query: 363  GKLLRLAFGSTVAHIWWERNMRRFQRKNRSILQIVGVVMEEVKTKV 500
            G L R  F +T+ H+W ERN RR      +   ++G + ++ + ++
Sbjct: 804  GFLTRYIFQATIYHVWRERNGRRHDAAPNTPATVIGWIDKQTRNQI 849


>ref|XP_004309472.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria
           vesca subsp. vesca]
          Length = 364

 Score = 82.8 bits (203), Expect = 3e-13
 Identities = 75/287 (26%), Positives = 127/287 (44%), Gaps = 8/287 (2%)
 Frame = +3

Query: 3   SGKFSYKSA*DLIRKKSPFVSWWQLVWFPDYIPRNSMTLWKVFWERLATGSRLQNLGLNV 182
           SG+ S K A   +R + P + W +L+W    IPR S+  WKV   R+ +   LQ  G+ +
Sbjct: 12  SGELSAKEAFQFLRPRLPSLDWGKLIWSKFIIPRISLHSWKVLRGRVLSEDLLQRRGIAL 71

Query: 183 PQACSLCWNGIEDINHLFFKCEFSSRILQKVMRKCFSRVSLSWKQLADLI*RSH-GPNSL 359
              C LC    E + H+F  C F++ +     R     +    + L DL+     G +  
Sbjct: 72  ASRCVLCGRDGESLPHIFLTCSFAASLWNN--RAGLFELGCLPQNLVDLLYYGGVGRSHQ 129

Query: 360 AGKLLRLAFGSTVAHIWWERNMRRFQRKNRSILQIVGVVMEEVKTKVEDCSRPAPCTKCN 539
             ++  + + +T+  IW  RN  R       +  +  ++M  VKT     S+ A     N
Sbjct: 130 LKEIWLICYTTTLWFIWKARNKMRHDNCTIVVDAVRQLIMGHVKT----ASKLALGCMSN 185

Query: 540 FFISRNWHMGFDWQAKSNLL-----VSWQPPPYG*ICLNNSDGSYSPL--RAGFGAVLRC 698
                     F    + +       V+W PP +G I +N +DG++     ++G+G + R 
Sbjct: 186 SLTELRVLKKFGLLCRPHRAPRITEVNWHPPLFGWIKVN-TDGAWQKTTGKSGYGGIFRD 244

Query: 699 PNGLPLLAIAGVEAPISVVDAEAKALLEGILLAISLGVINLELQTDS 839
            +G  L A A     ++ VDAE  A+++ I LA      ++ L+ DS
Sbjct: 245 FHGSFLGAFASNLEILNSVDAEVMAVIQAIELAWVRDWEHIWLEVDS 291


>emb|CAB39942.1| putative protein [Arabidopsis thaliana] gi|7267871|emb|CAB78214.1|
           putative protein [Arabidopsis thaliana]
          Length = 473

 Score = 82.4 bits (202), Expect = 3e-13
 Identities = 45/161 (27%), Positives = 83/161 (51%), Gaps = 1/161 (0%)
 Frame = +3

Query: 12  FSYKSA*DLIRKKSPFVSWWQLVWFPDYIPRNSMTLWKVFWERLATGSRLQNLGLNVPQA 191
           FS K   + IR  S  V+W++ VWF   IP+++  +W     RL+TG R+    + V   
Sbjct: 287 FSTKDTWNHIRTVSNKVAWYKGVWFAQAIPKHAFCMWLAVHNRLSTGDRMTLWNMGVDAT 346

Query: 192 CSLCWNGIEDINHLFFKCEFSSRILQKVMRKCFSRVSLS-WKQLADLI*RSHGPNSLAGK 368
           C LC   +E  +HLFF C F++ I + + +  ++    + W+ + + + R + P+ +AG 
Sbjct: 347 CILCNKALESRDHLFFSCPFATEIWEPLAKTIYNTCFYTDWQTIINNVSR-NWPDRIAGF 405

Query: 369 LLRLAFGSTVAHIWWERNMRRFQRKNRSILQIVGVVMEEVK 491
           L R     T+  +W ERN R+      S  +++  + + ++
Sbjct: 406 LARCILQVTIYTLWRERNERKHGASPNSSSRLISWIDKHIR 446


>gb|ABE65413.1| hypothetical protein At1g62890 [Arabidopsis thaliana]
          Length = 195

 Score = 82.0 bits (201), Expect = 5e-13
 Identities = 45/150 (30%), Positives = 75/150 (50%), Gaps = 1/150 (0%)
 Frame = +3

Query: 54  PFVSWWQLVWFPDYIPRNSMTLWKVFWERLATGSRLQNLGLNVPQACSLCWNGIEDINHL 233
           P  +W++ VWFP   P+ S  LW     RL+TG  ++         C+LC N  E  N L
Sbjct: 17  PQTNWYKGVWFPYSTPKYSFLLWLTIQNRLSTGDHIKAWNSGQQVTCTLCGNAEETRNLL 76

Query: 234 FFKCEFSSRILQKVMRKCFSR-VSLSWKQLADLI*RSHGPNSLAGKLLRLAFGSTVAHIW 410
           FF C ++S + + + ++  S   S  W +L  L+  ++ P  L   L R  F ++V HIW
Sbjct: 77  FFSCHYTSEVWRTLTQRLLSNDYSRDWNRLLPLLCNTNMPTDLL-FLFRYVFQASVYHIW 135

Query: 411 WERNMRRFQRKNRSILQIVGVVMEEVKTKV 500
            ERN RR    +    +++  + + V+ ++
Sbjct: 136 RERNARRHGEISSPPNRLIKFIDKNVRNRI 165


>gb|ABK28152.1| unknown [Arabidopsis thaliana]
          Length = 196

 Score = 82.0 bits (201), Expect = 5e-13
 Identities = 45/150 (30%), Positives = 75/150 (50%), Gaps = 1/150 (0%)
 Frame = +3

Query: 54  PFVSWWQLVWFPDYIPRNSMTLWKVFWERLATGSRLQNLGLNVPQACSLCWNGIEDINHL 233
           P  +W++ VWFP   P+ S  LW     RL+TG  ++         C+LC N  E  N L
Sbjct: 17  PQTNWYKGVWFPYSTPKYSFLLWLTIQNRLSTGDHIKAWNSGQQVTCTLCGNAEETRNLL 76

Query: 234 FFKCEFSSRILQKVMRKCFSR-VSLSWKQLADLI*RSHGPNSLAGKLLRLAFGSTVAHIW 410
           FF C ++S + + + ++  S   S  W +L  L+  ++ P  L   L R  F ++V HIW
Sbjct: 77  FFSCHYTSEVWRTLTQRLLSNDYSRDWNRLLPLLCNTNMPTDLL-FLFRYVFQASVYHIW 135

Query: 411 WERNMRRFQRKNRSILQIVGVVMEEVKTKV 500
            ERN RR    +    +++  + + V+ ++
Sbjct: 136 RERNARRHGEISSPPNRLIKFIDKNVRNRI 165


>ref|XP_004242524.1| PREDICTED: uncharacterized protein LOC101258077 [Solanum
            lycopersicum]
          Length = 1454

 Score = 59.7 bits (143), Expect(2) = 5e-13
 Identities = 73/289 (25%), Positives = 122/289 (42%), Gaps = 9/289 (3%)
 Frame = +3

Query: 3    SGKFSYKSA*DLIRKKSPFVSWWQLVWFPDYIPRNSMTLWKVFWERLATGSRLQNLGLNV 182
            SG+F+  SA D IRKK        ++W      + S  +W+    +L T   LQ +G N+
Sbjct: 1077 SGQFTISSAWDSIRKKRNKDPINNIIWHKQIPFKVSFFIWRALRGKLPTNENLQRIGKNL 1136

Query: 183  PQACSLCWN-GIEDINHLFFKCEFSSRILQKVMRKCFSRVSLSWKQLADLI*R---SHGP 350
               C  C+N G +DINH+     F ++ + K+       + ++   L DL+ +       
Sbjct: 1137 SD-CYCCYNKGKDDINHILINGNF-AKYIWKIYSSAVGVLPIN-TTLRDLLLQWRNQQYT 1193

Query: 351  NSLAGKLLRLAFGSTVAHIWWERNMRRFQRKNRSILQIVGVVMEEVKTKVEDCSRPAPCT 530
            N +   L+ +       ++W  R   ++  KN SI ++   + + +   +       P  
Sbjct: 1194 NEVHKLLIHILPNFICWNLWKNRCAVKYGLKNSSIYRVQYGIFKNIMQVITIVFPSIP-- 1251

Query: 531  KCNFFISRNWHMGFDWQAKSN---LLVSWQPPPYG*ICLNNSDGS--YSPLRAGFGAVLR 695
               +  S N  +    Q K +   L+V W  P  G   L N+DGS   +  + G G +LR
Sbjct: 1252 ---WQTSWNNLINIVEQCKQHYKILIVKWNKPDLGKYKL-NTDGSALQNSGKIGGGGILR 1307

Query: 696  CPNGLPLLAIAGVEAPISVVDAEAKALLEGILLAISLGVINLELQTDSK 842
               G  + A +      +   AE KA L G+      G   +EL+ DSK
Sbjct: 1308 DNQGKIIYAFSLPFGFGTNNFAEIKAALHGLDWCEQHGYKKIELEVDSK 1356



 Score = 42.4 bits (98), Expect(2) = 5e-13
 Identities = 17/46 (36%), Positives = 27/46 (58%)
 Frame = +2

Query: 854  WINGRSSTPWRIKPILDQIFINLEFLVVWKCKHVYREANSPVDWLA 991
            WIN   + PWR + ++ QI   +  +  ++C H+YREAN   D L+
Sbjct: 1361 WINSNINIPWRYEELIQQIHQIIRKMDQFQCHHIYREANCTADLLS 1406


>gb|AAD26953.1| putative non-LTR retrolelement reverse transcriptase [Arabidopsis
           thaliana]
          Length = 323

 Score = 81.3 bits (199), Expect = 8e-13
 Identities = 54/175 (30%), Positives = 82/175 (46%), Gaps = 1/175 (0%)
 Frame = +3

Query: 3   SGKFSYKSA*DLIRKKSPFVSWWQLVWFPDYIPRNSMTLWKVFWERLATGSRLQNLGLNV 182
           S  FS       +      V W + VWF D IP+++   W   W+RL T  RL   GLN+
Sbjct: 133 SNIFSASKTWTALNPDGVLVPWQKSVWFKDRIPKHAFICWVAAWKRLHTRDRLTQWGLNI 192

Query: 183 PQACSLCWNGIEDINHLFFKCEFSSRILQKVM-RKCFSRVSLSWKQLADLI*RSHGPNSL 359
           P  C LC    E  +HLFF+C+FS+ I    M R   +   L    L  L  +S   +  
Sbjct: 193 PTVCVLCNVVDETHDHLFFQCQFSNEIWSFFMIRAGMTPPHLFGPILLWL--KSASSSKN 250

Query: 360 AGKLLRLAFGSTVAHIWWERNMRRFQRKNRSILQIVGVVMEEVKTKVEDCSRPAP 524
              +++L F ++V  IW ERN R     +R+   I+  V + ++ +++   R  P
Sbjct: 251 LSLIIKLLFQASVYLIWRERNCRIHTTHSRTPPTIIKEVQQLIRARLDPICRERP 305


>gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thaliana]
          Length = 629

 Score = 80.9 bits (198), Expect = 1e-12
 Identities = 47/164 (28%), Positives = 79/164 (48%), Gaps = 1/164 (0%)
 Frame = +3

Query: 12  FSYKSA*DLIRKKSPFVSWWQLVWFPDYIPRNSMTLWKVFWERLATGSRLQNLGLNVPQA 191
           FS K   + +RKKS  V+W++ VWF    P+     W     RL+TG R+Q         
Sbjct: 444 FSTKDTWNQVRKKSNEVAWYKGVWFSHSTPKYQFCTWLALRNRLSTGYRMQLWNNGSDVK 503

Query: 192 CSLCWNGIEDINHLFFKCEFSSRILQKVMRKCFS-RVSLSWKQLADLI*RSHGPNSLAGK 368
           C+ C   IE  +HLFF C ++S I   + +     R S  W+ + + I  +   + +   
Sbjct: 504 CTFCSTSIETRDHLFFSCSYASAIWTAIAKNVLQHRFSTDWQTIVNYISETQ-TDRIRSF 562

Query: 369 LLRLAFGSTVAHIWWERNMRRFQRKNRSILQIVGVVMEEVKTKV 500
           L R  F  TV  +W ERN RR   + R+   ++  + ++++ ++
Sbjct: 563 LSRYIFQLTVHTVWKERNDRRHGEEPRTSANLISWMDKQIRNQL 606


>ref|XP_007213453.1| hypothetical protein PRUPE_ppa024777mg, partial [Prunus persica]
           gi|462409318|gb|EMJ14652.1| hypothetical protein
           PRUPE_ppa024777mg, partial [Prunus persica]
          Length = 465

 Score = 67.8 bits (164), Expect(2) = 3e-12
 Identities = 84/294 (28%), Positives = 122/294 (41%), Gaps = 15/294 (5%)
 Frame = +3

Query: 3   SGKFSYKSA*DLIRKKSPFVSWWQLVWFPDYIPRNSMTLWKVFWERLATGSRLQNLGLNV 182
           SG FS K A +  R K   V W +L+W P   P  S   WKV   RL T   LQ      
Sbjct: 114 SGGFSAKDAYEFTRPKFAKVPWCKLIWKPFIEPWKSFLAWKVMHGRLLTEDFLQ------ 167

Query: 183 PQACSLCWNGIEDINHLFFKCEFSSRILQKVMRKCFSRVSLSWKQLADLI*RSHGPNSLA 362
                  W   E+INHLF +C F+  I   +    F    L +      +  S G ++  
Sbjct: 168 ----KRAWMAPENINHLFSECPFTCSIWSSM----FIVFGLHFTSGPLAVILSSGLSAHF 219

Query: 363 GKLLR----LAFGSTVAHIWWERNMRRFQRKNRSILQIVGVVMEEVKTKVEDCSRPAPCT 530
              L     L F + V  IW  RN  RF+ K       V  V    +T +      +P  
Sbjct: 220 SPQLMDLWLLMFRTIVWLIWDLRNKLRFEEK-------VSTVSSNCRTIINHVPASSPLA 272

Query: 531 KCNFF-------ISRNWHMGFDWQAKSNLL-VSWQPPPYG*ICLNNSDGS--YSPLRAGF 680
           + +         I R+  + +  +  S ++ V+W PP +G + +   DG+      +AG 
Sbjct: 273 RGHILNKVHDLCIIRSIGVHYRPRPNSKIVEVTWHPPCFGFVKI-KIDGACKRDSGKAGS 331

Query: 681 GAVLRCPNGLPLLAI-AGVEAPISVVDAEAKALLEGILLAISLGVINLELQTDS 839
           G V R   G  L A  A ++ P S V AE  A+++ I LA      N+ ++TDS
Sbjct: 332 GGVFRNYQGHVLGAFSANLDVP-SGVHAEVLAVIKAIELAWLHAWHNIWIETDS 384



 Score = 32.0 bits (71), Expect(2) = 3e-12
 Identities = 14/40 (35%), Positives = 23/40 (57%)
 Frame = +2

Query: 878 PWRIKPILDQIFINLEFLVVWKCKHVYREANSPVDWLASH 997
           PWR++       + L+ +  +K  H++RE N  VD LA+H
Sbjct: 398 PWRLRVDWQNCLLRLQHMS-FKISHIFREGNHDVDALANH 436


>gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana]
          Length = 1161

 Score = 79.0 bits (193), Expect = 4e-12
 Identities = 58/203 (28%), Positives = 87/203 (42%), Gaps = 1/203 (0%)
 Frame = +3

Query: 3    SGKFSYKSA*DLIRKKSPFVSWWQLVWFPDYIPRNSMTLWKVFWERLATGSRLQNLGLNV 182
            S +FS       ++  S  V W + VWF D++P+ +   W V   RL T  RL+  G ++
Sbjct: 965  SNRFSTADTWSYLQPSSTSVLWHKAVWFKDHVPKQAFICWVVAHNRLHTRDRLRRWGFSI 1024

Query: 183  PQACSLCWNGIEDINHLFFKCEFSSRILQKVMRKCFSRVSLSWKQ-LADLI*RSHGPNSL 359
            P  C LC +  E   HLFF+C+FSS I    MR         +   L   +  S   N  
Sbjct: 1025 PPTCVLCNDLDESREHLFFRCQFSSEIWSFFMRALNLNPPPQFMHCLLWTLTASRDRNIT 1084

Query: 360  AGKLLRLAFGSTVAHIWWERNMRRFQRKNRSILQIVGVVMEEVKTKVEDCSRPAPCTKCN 539
               + +L F ++V  IW ERN+R      R    I+  +   V+ +++  SR +      
Sbjct: 1085 L--ITKLLFHASVYFIWRERNLRIHSNSVRPAHLIIKEIQLIVRARLDPLSRSSRVVS-- 1140

Query: 540  FFISRNWHMGFDWQAKSNLLVSW 608
                         Q  S+LL SW
Sbjct: 1141 -------------QPGSSLLASW 1150


>gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana]
          Length = 653

 Score = 78.6 bits (192), Expect = 5e-12
 Identities = 44/165 (26%), Positives = 79/165 (47%), Gaps = 1/165 (0%)
 Frame = +3

Query: 9   KFSYKSA*DLIRKKSPFVSWWQLVWFPDYIPRNSMTLWKVFWERLATGSRLQNLGLNVPQ 188
           KFS +   +  R  S  V+W   +WF    P+ S   W     RL+TG ++      +  
Sbjct: 467 KFSTRDTWNQTRNTSTPVTWHMGIWFAHATPKFSFCAWLAVQNRLSTGDKMLQWNRRLSP 526

Query: 189 ACSLCWNGIEDINHLFFKCEFSSRILQKVMRKCF-SRVSLSWKQLADLI*RSHGPNSLAG 365
            C LC N IE  NHLFF C +++ I + + +  + ++ S +W  +   +  +   N    
Sbjct: 527 TCVLCNNNIETRNHLFFSCCYTAEIWENLAKNIYKAKFSTNWSTILTSV-STTWRNRTES 585

Query: 366 KLLRLAFGSTVAHIWWERNMRRFQRKNRSILQIVGVVMEEVKTKV 500
            L R  F +T+  IW ERN RR   ++ S   ++  + ++++ ++
Sbjct: 586 FLARYIFQATIHTIWHERNGRRHGERSNSATHLIWWLDKQMRNQI 630


>ref|XP_004309110.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria
            vesca subsp. vesca]
          Length = 872

 Score = 71.2 bits (173), Expect(2) = 7e-12
 Identities = 69/275 (25%), Positives = 119/275 (43%), Gaps = 10/275 (3%)
 Frame = +3

Query: 3    SGKFSYKSA*DLIRKKSPFVSWWQLVWFPDYIPRNSMTLWKVFWERLATGSRLQNLGLNV 182
            +G+ + K A   +++ SP V W + +W    +PR S+  WKV    + +   LQ  G+ +
Sbjct: 506  TGELTAKQAFLFLQQASPVVPWGKPLWSKFILPRMSLHAWKVMRGTVISYHLLQRRGVAL 565

Query: 183  PQACSLCWNGIEDINHLFFKCEFSSRILQKVMRKCFSRVSLSWKQLADL--I*RSHGPNS 356
               C  C N  E ++H+F  C F++ +    +      + L    +A++  +  +   + 
Sbjct: 566  VSRCEFCGNSTESLDHIFLHCSFAASVWNHFI--YIFEIGLVPNTIAEVFSLGLAMDRSP 623

Query: 357  LAGKLLRLAFGSTVAHIWWERNMRRFQRKNRSILQIVGVVMEEVKTKVEDCSRPAPCTKC 536
               +L  + F S + +IW  RN  RF  +  S+  +  +V   ++      SR A     
Sbjct: 624  QLKELWLICFTSILWYIWHARNQIRFDSRTFSVAGVCRLVSRHIQAS----SRLATGHMH 679

Query: 537  NFFISRNWHMGFDWQAKSNLL-----VSWQPPPYG*ICLNNSDGSYSPLR--AGFGAVLR 695
            N          F    +S  +     V W PP  G I + NSDG++       GFGAV R
Sbjct: 680  NTIHDLCILKSFGACCRSRRIPRMVEVIWHPPSIGWIKI-NSDGAWKHEEGIGGFGAVFR 738

Query: 696  CPNGLPLLAIAG-VEAPISVVDAEAKALLEGILLA 797
               G  + A A  ++ P S+  A+   ++  I LA
Sbjct: 739  YYKGQFVGAFASHIDIPSSIA-AKVMVVITAIELA 772



 Score = 26.9 bits (58), Expect(2) = 7e-12
 Identities = 13/48 (27%), Positives = 23/48 (47%)
 Frame = +2

Query: 854 WINGRSSTPWRIKPILDQIFINLEFLVVWKCKHVYREANSPVDWLASH 997
           +I   S  PW+++         +  +  +K  H++RE N   D LA+H
Sbjct: 792 YIRSPSLVPWQLRVRWLNCLYRISTMT-FKSSHIFREGNRVADALANH 838


>ref|XP_004228797.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum
           lycopersicum]
          Length = 389

 Score = 52.8 bits (125), Expect(2) = 1e-11
 Identities = 66/295 (22%), Positives = 119/295 (40%), Gaps = 16/295 (5%)
 Frame = +3

Query: 6   GKFSYKSA*DLIRKKSPFVSWWQLVWFPDYIPRNSMTLWKVFWERLATGSRLQNLGLNVP 185
           G+F+  SA D+IRKK         VW  +   + S  +W+    +L T   L   G    
Sbjct: 13  GQFTIFSAWDIIRKKKDPDPIHNCVWHKNVPFKTSFFIWRALRSKLPTNENLLKFGKEEL 72

Query: 186 QACSLCW-NGIEDINHLFFKCEFSSRILQ--------KVMRKCFSRVSLSWKQLADLI*R 338
           + C  C+  G +D+ H+     F+  I +         ++        LSW++L      
Sbjct: 73  E-CYCCYRKGKDDLKHILITGNFAKYIWKIHTKRLGIAIVNTNLRSTLLSWRRLTSY--- 128

Query: 339 SHGPNSLAGKLLRLAFGSTVAHIWWERNMRRFQRKNRSILQIVGVVMEEVKTKVEDCSRP 518
               N +   +L +       ++W  R   ++  K  SI ++   + +++   ++     
Sbjct: 129 ----NEVHKLILHILPNIICWNLWKNRCSAKYGNKPSSIYRVESGIFKDIMQIIKAVYPN 184

Query: 519 APCTKCNFFISRNWHMGFDW--QAKSNL---LVSWQPPPYG*ICLNNSDGS--YSPLRAG 677
            P          +W   F+   Q + +L   +V+W+ PP G I   N+DGS  ++  + G
Sbjct: 185 IPW-------QSSWERLFNLVEQCQQHLKVTMVNWERPPEG-IHKLNTDGSAKHNTGKIG 236

Query: 678 FGAVLRCPNGLPLLAIAGVEAPISVVDAEAKALLEGILLAISLGVINLELQTDSK 842
            G +LR   G  + A A      +   AE +A L G+      G   + L+ DS+
Sbjct: 237 GGGILRDHQGKLIYAFAIPLGFGTNNFAEIQAALHGLQWCQQHGFEKIILEVDSE 291



 Score = 45.1 bits (105), Expect(2) = 1e-11
 Identities = 21/46 (45%), Positives = 27/46 (58%)
 Frame = +2

Query: 854 WINGRSSTPWRIKPILDQIFINLEFLVVWKCKHVYREANSPVDWLA 991
           WI  +SS PWR    + QI      + V++CKH+YREAN   D LA
Sbjct: 296 WIINKSSVPWRCLHYIQQIQNISNKMEVFQCKHIYREANGTADLLA 341


Top