BLASTX nr result

ID: Rehmannia22_contig00000772 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00000772
         (1315 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...   336   1e-89
ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664...   328   2e-87
gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana]             328   2e-87
emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...   327   9e-87
ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665...   313   8e-83
ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663...   296   1e-77
ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268...   293   1e-76
ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661...   286   1e-74
gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thali...   279   2e-72
emb|CAB72467.1| putative protein [Arabidopsis thaliana]               277   6e-72
ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298...   271   3e-70
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...   270   1e-69
gb|AAF99785.1|AC012463_2 T2E6.4 [Arabidopsis thaliana]                270   1e-69
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   269   2e-69
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...   265   2e-68
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]           265   2e-68
dbj|BAB08270.1| non-LTR retroelement reverse transcriptase-like ...   264   7e-68
gb|ABD96948.1| hypothetical protein [Cleome spinosa]                  262   2e-67
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       261   3e-67
gb|AAC33226.1| putative non-LTR retroelement reverse transcripta...   261   5e-67

>emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1114

 Score =  336 bits (862), Expect = 1e-89
 Identities = 174/448 (38%), Positives = 261/448 (58%), Gaps = 12/448 (2%)
 Frame = +3

Query: 6    FLQDVLIGLGFPNLFISWIMECVTSASFSISLNGSLHGKFPGKRGLRQGDPMSPALFLLC 185
            FL+ +L  LGFP++FI WIM CV + S+SI LNG     F  ++GLRQGDP+SP LF L 
Sbjct: 599  FLESMLKELGFPSMFIRWIMACVKTVSYSILLNGIPSIPFDAQKGLRQGDPLSPFLFALS 658

Query: 186  MEYLSRLINVRTSTHTFKYHPRCQELKISHLIFADDLMLFAKGDTQSIKILIDCLDEFKI 365
            MEYLSR +        F +HP+C+ +K++HL+FADDL++FA+ D  SI  ++   + F  
Sbjct: 659  MEYLSRCMGNMCKDPEFNFHPKCERIKLTHLMFADDLLMFARADASSISKIMAAFNSFSK 718

Query: 366  VSGLNINSSKSNIFTAGIFGKDLDDILDLVNYPKGTLPVRYLGVPLAAQKLNCVHYAPLH 545
             SGL  +  KS I+  G+  ++ + + D +  P G+LP RYLGVPLA++KLN     PL 
Sbjct: 719  ASGLQASIEKSCIYFGGVCHEEAEQLADRIQMPIGSLPFRYLGVPLASKKLNFSQCKPLI 778

Query: 546  DRIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSVIKRIYKLCRAFLW-- 719
            D+I      W A+ LSYAGRL L+K++L  ++ +W QIFPLPK +IK +   CR FLW  
Sbjct: 779  DKITTRAQGWVAHLLSYAGRLQLVKTILYSMQNYWGQIFPLPKKLIKAVETTCRKFLWTG 838

Query: 720  ---NKKRHPIAWHDVCLPVEEGGLGIRDVYAWNKALLSKILWKFHQGTETLWVRWVHGFY 890
                  + P+AW  +  P   GGL + ++  WNKA + K+LW      + LWVRWV+ +Y
Sbjct: 839  TVDTSYKAPVAWDFLQQPKSTGGLNVTNMVLWNKAAILKLLWAITFKQDKLWVRWVNAYY 898

Query: 891  LRNQSIWTWNPKKDDSTLLKRICDIRDELIQKFGSQQSAIDALTPLVNDNGLNSSKMYDI 1070
            ++ Q+I       + S +L++I + R EL+ + G  ++       + N    +  K Y +
Sbjct: 899  IKRQNIENVTVSSNTSWILRKIFESR-ELLTRTGGWEA-------VSNHMNFSIKKTYKL 950

Query: 1071 FRNEGPRHFWHNAIWKQFI-----PPKFSFCTWLACKDRLSTLDNLS--YIDTDPLCKLC 1229
             + +     + N +WK+ I      PK  F  WLA  +RL+T + +S    D  PLCK+C
Sbjct: 951  LQED-----YENVVWKRLICNNKATPKSQFILWLAMLNRLATAERVSRWNRDVSPLCKMC 1005

Query: 1230 KNELESAPHLFFMCTVTNSLWRRIKNWL 1313
             NE+E+  HLFF C  +  +W ++  +L
Sbjct: 1006 GNEIETIQHLFFNCIYSKEIWGKVLLYL 1033


>ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664824 [Glycine max]
          Length = 939

 Score =  328 bits (842), Expect = 2e-87
 Identities = 167/442 (37%), Positives = 251/442 (56%), Gaps = 7/442 (1%)
 Frame = +3

Query: 9    LQDVLIGLGFPNLFISWIMECVTSASFSISLNGSLHGKFPGKRGLRQGDPMSPALFLLCM 188
            L+ +L  LGFP+ FI WIM  V S ++  ++NG    +   +RG+RQGDP+SP LF+L M
Sbjct: 425  LEHILRELGFPDQFIKWIMIAVRSVTYVFNINGRFTRRLEARRGIRQGDPISPLLFILVM 484

Query: 189  EYLSRLINVRTSTHTFKYHPRCQELKISHLIFADDLMLFAKGDTQSIKILIDCLDEFKIV 368
            EYL+R+++       F YH +C+++KI++L FADDL+LF++GD  S++I++D  + F   
Sbjct: 485  EYLNRILSQLDKIPNFNYHSKCEKMKITNLCFADDLLLFSRGDIGSVQIMLDKFNTFLRS 544

Query: 369  SGLNINSSKSNIFTAGIFGKDLDDILDLVNYPKGTLPVRYLGVPLAAQKLNCVHYAPLHD 548
             GL++N SK NI+   +     + +L +  + +G +P RYLG+PL+++KLN  HY  L D
Sbjct: 545  MGLHVNPSKCNIYCGSVDINVKEQLLLISGFKEGKMPFRYLGIPLSSKKLNIKHYQVLID 604

Query: 549  RIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSVIKRIYKLCRAFLW--- 719
            +I   I  W+A  LSYAGR+ LI+SV+     FW+Q  PLPK VI RI  +CR+FLW   
Sbjct: 605  KIVGRITHWSAGLLSYAGRVQLIQSVIFATINFWMQCLPLPKFVIMRINAICRSFLWIGN 664

Query: 720  --NKKRHPIAWHDVCLPVEEGGLGIRDVYAWNKALLSKILWKFHQGTETLWVRWVHGFYL 893
                ++ PIAW  VC P   GGL I ++  WNK  + K+LW     ++ LW++W+H +Y+
Sbjct: 665  SNISRKSPIAWEKVCSPKINGGLNIINLAIWNKISILKLLWNVCNKSDNLWIKWLHTYYI 724

Query: 894  RNQSIWTWNPKKDDSTLLKRICDIRDELIQKFGSQQSAIDALTPLVNDNGLNSSKMYDIF 1073
            R QSIW+   KK  S ++  +  +R  L+Q     Q                  K+Y   
Sbjct: 725  RGQSIWSMVLKKSHSWIMSSMMKLRPLLLQYQSRMQDV------------FKMKKIYLAL 772

Query: 1074 RNEGPRHFWHNAIWKQFIPPKFSFCTWLACKDRLSTLDNLSY--IDTDPLCKLCKNELES 1247
              E  +  W   +      P+  FC W AC  RL++ D L    ++ D  C  C + +ES
Sbjct: 773  FEESEKMSWRTLMCNNLARPRALFCLWQACHFRLASKDRLIKFGLNVDANCAFC-SSMES 831

Query: 1248 APHLFFMCTVTNSLWRRIKNWL 1313
              HLFF C    ++W  + NWL
Sbjct: 832  HEHLFFGCIELKTIWTAVLNWL 853


>gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana]
          Length = 653

 Score =  328 bits (842), Expect = 2e-87
 Identities = 172/440 (39%), Positives = 248/440 (56%), Gaps = 7/440 (1%)
 Frame = +3

Query: 3    SFLQDVLIGLGFPNLFISWIMECVTSASFSISLNGSLHGKFPGKRGLRQGDPMSPALFLL 182
            SF++++L+ + FP  F+ WIM C+++ASFS+ +NG L G F  KRGLRQG  +SP LF++
Sbjct: 137  SFIRNILLSMDFPMEFVHWIMLCISTASFSVQVNGELVGFFQSKRGLRQGCSLSPYLFVM 196

Query: 183  CMEYLSRLINVRTSTHTFKYHPRCQELKISHLIFADDLMLFAKGDTQSIKILIDCLDEFK 362
             M+ LS+L++   S   F YH RC+EL ++HL FADDLM+ + G  +SI  +++  D F 
Sbjct: 197  SMDVLSKLLDQAASAKKFGYHSRCKELSLTHLSFADDLMVLSDGKVRSIDGIVEVFDIFA 256

Query: 363  IVSGLNINSSKSNIFTAGIFGKDLDDILDLVNYPKGTLPVRYLGVPLAAQKLNCVHYAPL 542
              SGL I+  KS I+ AG+      +I +   +  G LPVRYLG+PL  ++L    Y+PL
Sbjct: 257  KFSGLKISMEKSTIYLAGVTEDVYHEIQNRYQFDVGQLPVRYLGLPLVTKRLTATDYSPL 316

Query: 543  HDRIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSVIKRIYKLCRAFLW- 719
             + I   I  WT   LSYAGRL LI SVL  I  FWL  F LP+  I+ I K+C AFLW 
Sbjct: 317  LEHIKKKIGTWTTRYLSYAGRLNLITSVLWSICNFWLAAFRLPRECIREIDKICSAFLWS 376

Query: 720  ----NKKRHPIAWHDVCLPVEEGGLGIRDVYAWNKALLSKILWKFHQGTETLWVRWVHGF 887
                N ++  + W DVC P +EGGLG+R +   N+    K++W+    T +LWVRW+  +
Sbjct: 377  GPDLNPRKTRVCWGDVCKPKQEGGLGLRSLKEMNEVSCLKLIWRIVSHTNSLWVRWIEQY 436

Query: 888  YLRNQSIWTWNPKKDDSTLLKRICDIRDELIQKFGSQQSAIDALTPLVNDNGLNSSKMYD 1067
             L++ + W+     +  ++L R     DE + KF ++ +                   ++
Sbjct: 437  LLKHDTFWSVQTTTNMDSVLWR--GRNDEYMPKFSTRDT-------------------WN 475

Query: 1068 IFRNEGPRHFWHNAIWKQFIPPKFSFCTWLACKDRLSTLDNLSYID--TDPLCKLCKNEL 1241
              RN      WH  IW     PKFSFC WLA ++RLST D +   +    P C LC N +
Sbjct: 476  QTRNTSTPVTWHMGIWFAHATPKFSFCAWLAVQNRLSTGDKMLQWNRRLSPTCVLCNNNI 535

Query: 1242 ESAPHLFFMCTVTNSLWRRI 1301
            E+  HLFF C  T  +W  +
Sbjct: 536  ETRNHLFFSCCYTAEIWENL 555


>emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1110

 Score =  327 bits (837), Expect = 9e-87
 Identities = 167/440 (37%), Positives = 243/440 (55%), Gaps = 7/440 (1%)
 Frame = +3

Query: 3    SFLQDVLIGLGFPNLFISWIMECVTSASFSISLNGSLHGKFPGKRGLRQGDPMSPALFLL 182
            SFL+ +L   GFP+ F+ WIMECV++ S+S+ +NG     F  ++GLRQGDPMSP LF L
Sbjct: 595  SFLETLLYEFGFPSRFVGWIMECVSTVSYSVLVNGIPTQPFQARKGLRQGDPMSPFLFAL 654

Query: 183  CMEYLSRLINVRTSTHTFKYHPRCQELKISHLIFADDLMLFAKGDTQSIKILIDCLDEFK 362
            CMEYLSR +     +  F +HP+C+ L I+HL+FADDL++F + D  S+  +     +F 
Sbjct: 655  CMEYLSRCLEELKGSPDFNFHPKCERLNITHLMFADDLLMFCRADKSSLDHMNVAFQKFS 714

Query: 363  IVSGLNINSSKSNIFTAGIFGKDLDDILDLVNYPKGTLPVRYLGVPLAAQKLNCVHYAPL 542
              SGL  +  KSNI+  G+  +   ++ D V+   G LP RYLGVPL ++KL      PL
Sbjct: 715  HASGLAASHEKSNIYFCGVDDETARELADYVHMQLGELPFRYLGVPLTSKKLTYAQCKPL 774

Query: 543  HDRIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSVIKRIYKLCRAFLW- 719
             + I      W A  LSYAGRL LIKS+L  ++ +W  IFPL K VI+ + K+CR FLW 
Sbjct: 775  VEMITNRAQTWMAKLLSYAGRLQLIKSILSSMQNYWAHIFPLSKKVIQAVEKVCRKFLWT 834

Query: 720  ----NKKRHPIAWHDVCLPVEEGGLGIRDVYAWNKALLSKILWKFHQGTETLWVRWVHGF 887
                  K+ P+AW  +  P   GG  + ++  WN+A + K+LW      + LWVRW+H +
Sbjct: 835  GKTEETKKAPVAWATIQRPKSRGGWNVINMKYWNRAAMLKLLWAIEFKRDKLWVRWIHSY 894

Query: 888  YLRNQSIWTWNPKKDDSTLLKRICDIRDELIQKFGSQQSAIDALTPLVNDNGLNSSKMYD 1067
            Y++ Q I T N     + +L++I   RD L        S I     +   +  +  K Y 
Sbjct: 895  YIKRQDILTVNISNQTTWILRKIVKARDHL--------SNIGDWDEICIGDKFSMKKAYK 946

Query: 1068 IFRNEGPRHFWHNAIWKQFIPPKFSFCTWLACKDRLSTLDNLSY--IDTDPLCKLCKNEL 1241
                 G R  W   I   +  PK  F  W+   +RL T+D +S   +  D   +LC+N+ 
Sbjct: 947  KISENGERVRWRRLICNNYATPKSKFILWMMLHERLPTVDRISRWGVQCDLNYRLCRNDG 1006

Query: 1242 ESAPHLFFMCTVTNSLWRRI 1301
            E+  HLFF C+ +  +W +I
Sbjct: 1007 ETIQHLFFSCSYSAGVWSKI 1026


>ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665746 [Glycine max]
          Length = 506

 Score =  313 bits (803), Expect = 8e-83
 Identities = 161/425 (37%), Positives = 242/425 (56%), Gaps = 8/425 (1%)
 Frame = +3

Query: 63   MECVTSASFSISLNGSLHGKFPGKRGLRQGDPMSPALFLLCMEYLSRLINVRTSTHTFKY 242
            M  V++ S+  ++NG        +RGLRQGDP+SP LF++ ME L+R +        F Y
Sbjct: 1    MIAVSTVSYRFNVNGYKTEIMGARRGLRQGDPISPMLFVIVMECLNRYLYKMQKDGDFNY 60

Query: 243  HPRCQELKISHLIFADDLMLFAKGDTQSIKILIDCLDEFKIVSGLNINSSKSNIFTAGIF 422
            HP+C +LKI++L FADDL+LF++GD  S+ +++   + F   +GL +N  K ++  AGI 
Sbjct: 61   HPKCDKLKITNLCFADDLLLFSRGDKISVGMMMRAYESFSKATGLLVNPQKCSLLCAGID 120

Query: 423  GKDLDDILDLVNYPKGTLPVRYLGVPLAAQKLNCVHYAPLHDRIAAYIDKWTANSLSYAG 602
                 +IL++  + +G LP +YLGVP+ ++KL+ +HY+PL D+I   I  WTA  LSYAG
Sbjct: 121  AVTKREILEVSGFQEGQLPFKYLGVPVTSKKLSTIHYSPLIDKIVGKIKHWTARLLSYAG 180

Query: 603  RLLLIKSVLQGIECFWLQIFPLPKSVIKRIYKLCRAFLW-----NKKRHPIAWHDVCLPV 767
            RL L+ SV+  +  +WL  FP PKSV+++I  +CR FLW       ++ P+AW  +C P 
Sbjct: 181  RLQLVNSVMFALTNYWLNCFPFPKSVLQKIEAICRIFLWTGGFEGSRKSPVAWKQICSPR 240

Query: 768  EEGGLGIRDVYAWNKALLSKILWKFHQGTETLWVRWVHGFYLRNQSIWTWNPKKDDSTLL 947
              GGL I D+  WNKA L K+LW      ++LWV+W+  +Y++   +     K  DS ++
Sbjct: 241  SCGGLNIIDIDIWNKANLMKLLWNLSSKEDSLWVKWIQAYYVKRSELMHIEMKNTDSWIM 300

Query: 948  KRICDIRDELIQKFGSQQSAIDALTPLVNDNGLNSSKMYDIFRNEGPRHFWHNAIWKQFI 1127
            K I   R++L          ID +  L+    +N  K+Y   ++ G R  W N ++    
Sbjct: 301  KAILKQREDL--------EKIDNMEELMIRGSINMGKLYRKLQDCGQRKEWKNLLYGNTA 352

Query: 1128 PPKFSFCTWLACKDRLSTLDNL---SYIDTDPLCKLCKNELESAPHLFFMCTVTNSLWRR 1298
             P+ +F  WLAC  RLST D L     ID D  C  C  E ES  HLFF+C  +  +W  
Sbjct: 353  RPRANFILWLACHGRLSTKDRLCKYGMID-DKSCCFCSEE-ESMNHLFFVCDNSKRVWME 410

Query: 1299 IKNWL 1313
            +  W+
Sbjct: 411  VLQWV 415


>ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663533 [Glycine max]
          Length = 514

 Score =  296 bits (759), Expect = 1e-77
 Identities = 147/436 (33%), Positives = 242/436 (55%), Gaps = 8/436 (1%)
 Frame = +3

Query: 9    LQDVLIGLGFPNLFISWIMECVTSASFSISLNGSLHGKFPGKRGLRQGDPMSPALFLLCM 188
            L+ VL   G P  FI W+M+ +T+ ++  ++NG L      K G+ QGDP+SP LF+L M
Sbjct: 86   LEGVLTEFGLPKKFIGWVMKVITTVNYRFNINGELSNVLETKIGIWQGDPISPLLFVLMM 145

Query: 189  EYLSRLINVRTSTHTFKYHPRCQELKISHLIFADDLMLFAKGDTQSIKILIDCLDEFKIV 368
            EY +R++       +F +H +C+ L I+HL FADD+ L  +GD +SIK++I     F   
Sbjct: 146  EYFNRIMVKMQRNPSFNHHSQCERLGITHLSFADDVFLLCRGDKKSIKMIIKAFSFFSKS 205

Query: 369  SGLNINSSKSNIFTAGIFGKDLDDILDLVNYPKGTLPVRYLGVPLAAQKLNCVHYAPLHD 548
            +GL IN +K  +F  G+    +  I  +  + +GTLPVRYLGVPL+ +KLN  HY PL +
Sbjct: 206  TGLQINPAKCKVFCGGLNCDSIQVITKITGFEEGTLPVRYLGVPLSCKKLNVHHYLPLVE 265

Query: 549  RIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSVIKRIYKLCRAFLWN-- 722
            +I   I  W++  LS AGR+ L++S++  I  +W+ +FP+PK VI++I  +CR+F+W+  
Sbjct: 266  KIVGKIRHWSSKLLSIAGRIQLVRSIITAIAQYWMSVFPMPKKVIQKIDSICRSFIWSGS 325

Query: 723  ---KKRHPIAWHDVCLPVEEGGLGIRDVYAWNKALLSKILWKFHQGTETLWVRWVHGFYL 893
               K++  +AW  VC P   GGL + ++  WN   + K LW      + LWV+W+H ++L
Sbjct: 326  AEVKRKSLVAWKQVCKPARCGGLNLINLELWNVTAMLKCLWNICSKEDNLWVKWIHAYFL 385

Query: 894  RNQSIWTWNPKKDDSTLLKRICDIRDELIQKFGSQQSAIDALTPLVNDNGLNSSKMYDIF 1073
            +  ++ +   K + + +LK +   R ++        +       ++     +  ++Y   
Sbjct: 386  KGDNVMSATIKSNSTWILKSVMKQRPQV-------NNLQLVWIEMLRKRKFSMKQVYMEL 438

Query: 1074 RNEGPRHFWHNAIWKQFIPPKFSFCTWLACKDRLST---LDNLSYIDTDPLCKLCKNELE 1244
              +  +  W   +      P+ +   WLAC++RL+T   L N++ I    LC LCK + E
Sbjct: 439  VEDHNKIDWFRLLRYNRARPRANVTLWLACQNRLATKTRLKNMNMIQCS-LCSLCKEQDE 497

Query: 1245 SAPHLFFMCTVTNSLW 1292
               HL F C VT ++W
Sbjct: 498  DLDHLMFSCRVTKAIW 513


>ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268376 [Solanum
            lycopersicum]
          Length = 717

 Score =  293 bits (749), Expect = 1e-76
 Identities = 135/307 (43%), Positives = 200/307 (65%), Gaps = 5/307 (1%)
 Frame = +3

Query: 6    FLQDVLIGLGFPNLFISWIMECVTSASFSISLNGSLHGKFPGKRGLRQGDPMSPALFLLC 185
            FL+ V+ GLGFP+LF  W+M+CV + +++I +NG    +F   +GLRQGDPMSP LF + 
Sbjct: 404  FLEQVMEGLGFPDLFTKWVMKCVKTVNYTIVVNGQNTQRFDAAKGLRQGDPMSPFLFAIA 463

Query: 186  MEYLSRLINVRTSTHTFKYHPRCQELKISHLIFADDLMLFAKGDTQSIKILIDCLDEFKI 365
            MEYLSRL+       +FKYHP+  +L ++HL FADDL+LF++GD  SIK L  C  EF  
Sbjct: 464  MEYLSRLLKGLKEDKSFKYHPKYAKLDVTHLCFADDLLLFSRGDLNSIKALQKCFTEFSQ 523

Query: 366  VSGLNINSSKSNIFTAGIFGKDLDDILDLVNYPKGTLPVRYLGVPLAAQKLNCVHYAPLH 545
             SGL  N +KS+I+  G+  +    I+  + Y    LP +YLGVPL+++KLN + + PL 
Sbjct: 524  ASGLQANLNKSSIYCGGVQMEVRQQIIQQLGYTIEELPFKYLGVPLSSKKLNTIQWYPLI 583

Query: 546  DRIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSVIKRIYKLCRAFLWN- 722
            +++ A I+ WTA  LSYAGR  L+K+VL G++  W Q+F +P  +IK I  LCR++LW+ 
Sbjct: 584  EKVMARINSWTAKKLSYAGRAQLVKTVLFGVQALWAQLFIIPAKIIKLIEGLCRSYLWSG 643

Query: 723  ----KKRHPIAWHDVCLPVEEGGLGIRDVYAWNKALLSKILWKFHQGTETLWVRWVHGFY 890
                 K+  IAW  VC P  EGGLG+ ++  WN++ ++K+ W      + LW++W+H +Y
Sbjct: 644  VGYVTKKALIAWDKVCSPKYEGGLGLINLKIWNRSAVTKLCWDLANKEDKLWIKWIHAYY 703

Query: 891  LRNQSIW 911
            ++ Q  W
Sbjct: 704  IKGQREW 710


>ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661523 [Glycine max]
          Length = 947

 Score =  286 bits (732), Expect = 1e-74
 Identities = 147/403 (36%), Positives = 222/403 (55%), Gaps = 11/403 (2%)
 Frame = +3

Query: 132  KRGLRQGDPMSPALFLLCMEYLSRLINVRTSTHTFKYHPRCQELKISHLIFADDLMLFAK 311
            KRG+RQGDP+SP LF++ MEYL+RL+        F +H +C++L I+HL FADD++LF +
Sbjct: 466  KRGIRQGDPISPLLFVVMMEYLNRLLVKLQLDLNFNHHAKCEKLGITHLTFADDVLLFCR 525

Query: 312  GDTQSIKILIDCLDEFKIVSGLNINSSKSNIFTAGIFGKDLDDILDLVNYPKGTLPVRYL 491
            GD  S+++++  +++F   +GL +N +K  I+  G+ G   + I  + +Y +G LPVRYL
Sbjct: 526  GDVMSVEMMLHVINKFSATTGLVVNPNKCRIYFGGVDGTTKNKIQQISSYEEGQLPVRYL 585

Query: 492  GVPLAAQKLNCVHYAPLHDRIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLP 671
            GVPL ++KLN  +Y PL D+I   I  WT+  L+  GR+ ++   +  I  FW+Q  P+P
Sbjct: 586  GVPLTSKKLNIKYYLPLIDKITTRIRHWTSKLLNMTGRVQMVNCTITAIVQFWMQCLPIP 645

Query: 672  KSVIKRIYKLCRAFLWNK-----KRHPIAWHDVCLPVEEGGLGIRDVYAWNKALLSKILW 836
             SVIK+I  +CR+F+W++     ++ PIAW+ VC P  +GGL I ++  WN   +   LW
Sbjct: 646  MSVIKKIDSMCRSFVWSRSTEITRKSPIAWNSVCRPKGQGGLNIFNLKVWNHITVLNCLW 705

Query: 837  KFHQGTETLWVRWVHGFYLRNQSIWTWNPKKDDSTLLKRICDIRDELIQKFGSQQSAIDA 1016
               +  + LWV+W+H  Y++N S+       + S +LK +            SQ+  I  
Sbjct: 706  NLCKKVDNLWVKWIHAHYIKNSSVMNTMVTNNFSWVLKNVL-----------SQREYIHT 754

Query: 1017 LTP----LVNDNGLNSSKMYDIFRNEGPRHFWHNAIWKQFIPPKFSFCTWLACKDRLSTL 1184
            L P    L+N       K YD    E  R  W   + K    P+    TWLAC  RL T 
Sbjct: 755  LQPVWDELLNSERFKMKKAYDKMM-EADRVHWSGLMRKNCARPRAIHTTWLACHGRLGTK 813

Query: 1185 DNLSYID--TDPLCKLCKNELESAPHLFFMCTVTNSLWRRIKN 1307
            D L      TD +  LCK   E+  H+ F C V   +W  + N
Sbjct: 814  DRLVRFGMITDKIWSLCKEVEETQNHILFSCKVATDIWSNVLN 856


>gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thaliana]
            gi|20197043|gb|AAM14892.1| putative reverse transcriptase
            [Arabidopsis thaliana]
          Length = 1412

 Score =  279 bits (714), Expect = 2e-72
 Identities = 159/454 (35%), Positives = 233/454 (51%), Gaps = 22/454 (4%)
 Frame = +3

Query: 6    FLQDVLIGLGFPNLFISWIMECVTSASFSISLNGSLHGKFPGKRGLRQGDPMSPALFLLC 185
            FL + L  L  P  FI WI  C+++ASFS+ +NG           LRQG  +SP LF++C
Sbjct: 877  FLLNTLAALDIPEKFIHWINLCISTASFSVQVNG-----------LRQGCSLSPYLFVIC 925

Query: 186  MEYLSRLINVRTSTHTFKYHPRCQELKISHLIFADDLMLFAKGDTQSIKILIDCLDEFKI 365
            M  LS +++       F YHPRC+ + ++HL FADD+M+F+ G   S++ ++    +F  
Sbjct: 926  MNVLSAMLDKGAVEKRFGYHPRCRNMGLTHLCFADDIMVFSAGSAHSLEGVLAIFKDFAA 985

Query: 366  VSGLNINSSKSNIFTAGIFGKDLDDILDLVNYPKGTLPVRYLGVPLAAQKLNCVHYAPLH 545
             SGLNI+  KS +F A I  +    IL    +  G+LPVRYLG+PL  +++      PL 
Sbjct: 986  FSGLNISLEKSTLFMASISSETCASILARFPFDSGSLPVRYLGLPLMTKRMTLADCLPLL 1045

Query: 546  DRIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSVIKRIYKLCRAFLW-- 719
            ++I + I  W    LSYAGRL L+ SV+  +  FW+  F LP++ I+ I ++  AFLW  
Sbjct: 1046 EKIRSRISSWKNRFLSYAGRLQLLNSVISSLTKFWISAFRLPRACIREIEQISAAFLWSG 1105

Query: 720  ---NKKRHPIAWHDVCLPVEEGGLGIRDVYAWNKALLSKILWKFHQGTETLWVRWVHGFY 890
               N  +  +AWHDVC P  EGGLG+R +   NK    K++W+      +LWV W+    
Sbjct: 1106 TDLNPHKAKVAWHDVCKPKSEGGLGLRSLVDANKICCFKLIWRLVSAKHSLWVNWIQNNL 1165

Query: 891  LRN----QSIWTWNPKKDD----------STLLKRICDIRD-ELIQKFGSQQSAIDALTP 1025
            +R      S       +DD            L + IC  +D  L +  G Q  A      
Sbjct: 1166 IRTVAEALSSHRRRSHRDDILNDIEEELEKLLCRGICTEQDRSLCRSIGGQFKA------ 1219

Query: 1026 LVNDNGLNSSKMYDIFRNEGPRHFWHNAIWKQFIPPKFSFCTWLACKDRLSTLDNLSYID 1205
                    S +++   R +G    WH AIW     PKF+F +WLA  DRL+T D ++  +
Sbjct: 1220 -----KFFSPEIWHQIREQGLVKQWHKAIWFSGATPKFTFISWLAAHDRLTTGDKMASWN 1274

Query: 1206 --TDPLCKLCKNELESAPHLFFMCTVTNSLWRRI 1301
                 +C LC    ES  HLFF C  ++ +W R+
Sbjct: 1275 RGISSVCVLCNISAESRDHLFFSCNFSSHIWDRL 1308


>emb|CAB72467.1| putative protein [Arabidopsis thaliana]
          Length = 762

 Score =  277 bits (709), Expect = 6e-72
 Identities = 168/507 (33%), Positives = 244/507 (48%), Gaps = 85/507 (16%)
 Frame = +3

Query: 3    SFLQDVLIGLGFPNLFISWIMECVTSASFSISLNGSLHGKFPGKRGLRQGDPMSPALFLL 182
            SFL + L  + FP +FI WI  C+T+ SFS+ +NG L G F   RGLRQG  +SP LF++
Sbjct: 163  SFLINTLTAMHFPEMFIHWIRLCITTPSFSVQVNGELAGFFQSSRGLRQGCALSPYLFVI 222

Query: 183  CMEYLSRLINVRTSTHTFKYHPRCQELKISHLIFADDLMLFAKGDTQSIKILIDCLDEFK 362
            CM+ LS+L++         YHP C+ + ++HL FADDLM+   G  +SI+ +I+  D F 
Sbjct: 223  CMDVLSKLLDKVVGIGRIGYHPHCKRMGLTHLSFADDLMILTDGQCRSIEGIIEVFDLFS 282

Query: 363  IVSGLNINSSKSNIFTAGIFGKDLDDILDLVNYPKGTLPVRYLGVPLAAQKLNCVHYAPL 542
              SGL I+  KS IF+AG+       +     +  G LP+RYLG+PL  ++L+ V YAPL
Sbjct: 283  KWSGLKISMEKSTIFSAGLSSTSRAQLHTHFPFEVGELPIRYLGLPLVTKRLSSVDYAPL 342

Query: 543  HDRIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSVIKRIYKLCRAFLW- 719
             ++I   I  W++  LS+AGR  LI S++     FWL  F LP++ I+ I KLC +FLW 
Sbjct: 343  IEQIRKRIGSWSSRFLSFAGRFNLISSIIWSSCNFWLSAFQLPRACIQEIEKLCSSFLWS 402

Query: 720  ----NKKRHPIAWHDVCLPVEEGGLGIRDVYAWNKALLSKILWKFHQGTETLWVRWVHGF 887
                N K+  I+W+ VC P  EGGLG+R +   N     K++W+     ++LWV+WV   
Sbjct: 403  GTNLNSKKAKISWNQVCKPKSEGGLGLRSLKEANDVCCLKLVWRIISHGDSLWVKWVEHN 462

Query: 888  YLRNQSIW---------TWNPKK--------------------------DDSTLLKRICD 962
             L+ +  W         +W  KK                          DD +LL R+ D
Sbjct: 463  LLKREIFWIVKENANLGSWIWKKILKYRGVAKRFCKAEVGNGESTSFWFDDWSLLGRLID 522

Query: 963  I-----------------------------RDELI-----------QKFGSQQSAIDALT 1022
            +                             R E++           QK   QQ     L 
Sbjct: 523  VAGIRGTIDMGISRTMSVADAWTSRRRRHHRQEILNTIEEVLSTQHQKRTQQQQQGRVLW 582

Query: 1023 PLVND---NGLNSSKMYDIFRNEGPRHFWHNAIWKQFIPPKFSFCTWLACKDRLSTLDNL 1193
               ND   +  ++   ++  R       WH  +W     PK+SFC WLA  DRL+T   +
Sbjct: 583  KGKNDIYKDKFSTKNTWNYLRTTSNEVAWHKGVWFPHATPKYSFCLWLAAHDRLATGARM 642

Query: 1194 SYIDTDPL--CKLCKNELESAPHLFFM 1268
               +      C  C+  +E+  HLFFM
Sbjct: 643  IKWNRGETGDCTFCRQGIETRDHLFFM 669


>ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298394 [Fragaria vesca
            subsp. vesca]
          Length = 958

 Score =  271 bits (694), Expect = 3e-70
 Identities = 161/442 (36%), Positives = 227/442 (51%), Gaps = 10/442 (2%)
 Frame = +3

Query: 6    FLQDVLIGLGFPNLFISWIMECVTSASFSISLNGSLHGKFPGKRGLRQGDPMSPALFLLC 185
            F+   L     P+  I WI  C++SA FS+ +NG L G F  +RGLRQGDP+SP LF++ 
Sbjct: 432  FIIATLQAFNIPSTLIGWIKSCISSAKFSVCVNGELAGFFARRRGLRQGDPLSPYLFVIA 491

Query: 186  MEYLSRLINVRTS-THTFKYHPRCQELKISHLIFADDLMLFAKGDTQSIKILIDCLDEFK 362
            ME LS  I  R + +  F+YH RC +L +SHL FADDL++F  GD  S++ L D    F+
Sbjct: 492  MEVLSLCIQRRINCSPCFRYHWRCDQLNLSHLCFADDLLMFCNGDENSVRTLHDAFSNFE 551

Query: 363  IVSGLNINSSKSNIFTAGIFGKDLDDILDLVNYPKGTLPVRYLGVPLAAQKLNCVHYAPL 542
             +S L  N S+S IF AG+ G   D +L + N+  GT PVRYLG+PL   KL     +PL
Sbjct: 552  SLSSLKANVSESKIFLAGVDGNSSDSVLQVTNFSLGTCPVRYLGIPLITSKLRMQDCSPL 611

Query: 543  HDRIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSVIKRIYKLCRAFLW- 719
             DRI   I  W    LS+AGRL LI+SVL  I+ +W     LPK V+K I K  R FLW 
Sbjct: 612  LDRIETRIKSWENKVLSFAGRLQLIQSVLSSIQVYWASHLILPKKVLKDIEKRLRCFLWA 671

Query: 720  ----NKKRHPIAWHDVCLPVEEGGLGIRDVYAWNKALLSKILWKFHQGTETLWVRWVHGF 887
                 +    +AW ++CLP  EGGLGI+D++ WNKAL+   +W     +   W  WV  +
Sbjct: 672  GNCSGRAATKVAWSEICLPKCEGGLGIKDLHCWNKALMISHIWNLVSSSSNFWTDWVKVY 731

Query: 888  YLRNQSIWTWNPKKDDSTLLKRICDIRDELIQKFGSQQSAIDALTPLVNDNGLNSSKMYD 1067
             L+  S W        +  L  IC      + K   ++        ++ D G  +S  +D
Sbjct: 732  LLKGNSFW--------NAPLPSICSWNWRKLLKI--RELCCSFFVNIIGD-GRATSLWFD 780

Query: 1068 IFRNEGPRHF-WHNAIWKQFIPPKFSFCT---WLACKDRLSTLDNLSYIDTDPLCKLCKN 1235
             +   GP    W + I  +    K +  T   + +     +TL    +I   P  +L   
Sbjct: 781  NWHPLGPLTLRWSSNIIGESGLSKSAMLTPNGFYSTSSAWNTLRPSRFI--VPWYRLVWF 838

Query: 1236 ELESAPHLFFMCTVTNSLWRRI 1301
              E+  HLFF C  +  +W  +
Sbjct: 839  VAETHNHLFFDCAYSFGIWTHV 860


>gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus]
          Length = 1214

 Score =  270 bits (690), Expect = 1e-69
 Identities = 147/366 (40%), Positives = 212/366 (57%), Gaps = 5/366 (1%)
 Frame = +3

Query: 6    FLQDVLIGLGFPNLFISWIMECVTSASFSISLNGSLHGKFPGKRGLRQGDPMSPALFLLC 185
            F+ + L     P  F++WI +C+TS SFSI+++GSL G F G +GLRQGDP+SP+LF++ 
Sbjct: 604  FIIETLKAANAPPRFVNWIKQCITSTSFSINVSGSLCGYFKGSKGLRQGDPLSPSLFVIA 663

Query: 186  MEYLSRLINVRTSTHTFKYHPRCQELKISHLIFADDLMLFAKGDTQSIKILIDCLDEFKI 365
            ME LSRL+  + S  +  YHP+  E++IS L FADDLM+F  G   S++ +   L+ FK 
Sbjct: 664  MEILSRLLENKFSDGSIGYHPKASEVRISSLAFADDLMIFYDGKASSLRGIKSVLESFKN 723

Query: 366  VSGLNINSSKSNIFTAGIFGKDLDDILDLVNYPKGTLPVRYLGVPLAAQKLNCVHYAPLH 545
            +SGL +N+ KS ++TAG+   D +D L    +  GT P RYLG+PL  +KL    Y+ L 
Sbjct: 724  LSGLEMNTEKSAVYTAGLEDTDKEDTL-AFGFVNGTFPFRYLGLPLLHRKLRRSDYSQLI 782

Query: 546  DRIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSVIKRIYKLCRAFLW-- 719
            D+IAA  + W   +LS+AGRL LI SV+     FWL  F LPK  +K I ++C  FLW  
Sbjct: 783  DKIAARFNHWATKTLSFAGRLQLISSVIYSTVNFWLSSFILPKCCLKTIEQMCNRFLWGN 842

Query: 720  ---NKKRHPIAWHDVCLPVEEGGLGIRDVYAWNKALLSKILWKFHQGTETLWVRWVHGFY 890
                +    ++W + CLP  EGGLG+R+ + WNK L  +++W      ++LWV W H   
Sbjct: 843  DITRRGDIKVSWQNSCLPKAEGGLGLRNFWTWNKTLNLRLIWMLFARRDSLWVAWNHANR 902

Query: 891  LRNQSIWTWNPKKDDSTLLKRICDIRDELIQKFGSQQSAIDALTPLVNDNGLNSSKMYDI 1070
            LR+ + W        S + K I  +R  L ++F   + A+         NG   S  YD 
Sbjct: 903  LRHVNFWNAEAASHHSWIWKAILGLR-PLAKRF--LRGAV--------GNGQLLSYWYDH 951

Query: 1071 FRNEGP 1088
            + N GP
Sbjct: 952  WSNLGP 957


>gb|AAF99785.1|AC012463_2 T2E6.4 [Arabidopsis thaliana]
          Length = 740

 Score =  270 bits (689), Expect = 1e-69
 Identities = 136/328 (41%), Positives = 196/328 (59%), Gaps = 6/328 (1%)
 Frame = +3

Query: 6    FLQDVLIGLGFPNLFISWIMECVTSASFSISLNGSLHGKFPGKRGLRQGDPMSPALFLLC 185
            FL + L  L FP  F  WI  C+++A+FS+ +NG L G F  KRGLRQG  +SP LF++C
Sbjct: 184  FLLNTLEALNFPENFCHWIKLCISTATFSVQVNGELAGFFGSKRGLRQGCALSPYLFVIC 243

Query: 186  MEYLSRLINVRTSTHTFKYHPRCQELKISHLIFADDLMLFAKGDTQSIKILIDCLDEFKI 365
            M  LS +I+V        YHP+C++L ++HL FADDLM+F  G  +S++ +I+   EF  
Sbjct: 244  MNVLSHMIDVAAVHRNIGYHPKCKKLSLTHLCFADDLMVFIDGQQRSVEGVINIFKEFAG 303

Query: 366  VSGLNINSSKSNIFTAGIFGKDLDDILDLVNYPKGTLPVRYLGVPLAAQKLNCVHYAPLH 545
             SGL+I+  KS ++ AG+   + ++IL    +  G LPVRYLG+PL  +++    Y+PL 
Sbjct: 304  KSGLHISLEKSTLYLAGVSELNRNNILSAFPFASGQLPVRYLGLPLLTKQMTTADYSPLL 363

Query: 546  DRIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSVIKRIYKLCRAFLW-- 719
            D++ + I  WTA SLSYAGRL LI SV+  +  FW+  + LP   IK I KLC AFLW  
Sbjct: 364  DKVRSKISSWTARSLSYAGRLALINSVIVSLSNFWMSAYRLPAGCIKEIEKLCSAFLWSG 423

Query: 720  ---NKKRHPIAWHDVCLPVEEGGLGIRDVYAWNKALLSKILWKFHQGTETLWVRWVHGFY 890
               N K+  I W  +C   +EGGLGI+ +   NK    K++W+      +LWV WV  + 
Sbjct: 424  PELNPKKAKITWTSLCKLKQEGGLGIKSLLEANKVSCLKLIWRLVSRQSSLWVNWVWTYI 483

Query: 891  LRNQSIWTWNPKKD-DSTLLKRICDIRD 971
            +R  S W+ N +    S + K++   RD
Sbjct: 484  IRKGSFWSANDRSSLGSWMWKKLLKYRD 511


>dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 1223

 Score =  269 bits (687), Expect = 2e-69
 Identities = 134/307 (43%), Positives = 188/307 (61%), Gaps = 5/307 (1%)
 Frame = +3

Query: 6    FLQDVLIGLGFPNLFISWIMECVTSASFSISLNGSLHGKFPGKRGLRQGDPMSPALFLLC 185
            FL +V   LGFP  FI WI  C+T+ASFS+ +NG L G F   RGLRQG  +SP LF++C
Sbjct: 611  FLINVFTILGFPREFIHWINICITTASFSVQVNGELAGYFQSSRGLRQGCALSPYLFVIC 670

Query: 186  MEYLSRLINVRTSTHTFKYHPRCQELKISHLIFADDLMLFAKGDTQSIKILIDCLDEFKI 365
            M+ LS++++   +   F YHP+C+ + ++HL FADDLM+ + G  +SI+ +I   DEF  
Sbjct: 671  MDVLSKMLDKAAAARHFGYHPKCKTMGLTHLSFADDLMVLSDGKIRSIERIIKVFDEFAK 730

Query: 366  VSGLNINSSKSNIFTAGIFGKDLDDILDLVNYPKGTLPVRYLGVPLAAQKLNCVHYAPLH 545
             SGL I+  KS ++ AG+     +++ D   +  G LPVRYLG+PL  ++L+     PL 
Sbjct: 731  WSGLRISLEKSTVYLAGLSATARNEVADRFPFSSGQLPVRYLGLPLITKRLSTTDCLPLL 790

Query: 546  DRIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSVIKRIYKLCRAFLW-- 719
            +++   I  WT+  LSYAGRL LI SVL  I  FWL  F LP+  I+ + K+C AFLW  
Sbjct: 791  EQVRKRIGSWTSRFLSYAGRLNLISSVLWSICNFWLAAFRLPRKCIRELEKMCSAFLWSG 850

Query: 720  ---NKKRHPIAWHDVCLPVEEGGLGIRDVYAWNKALLSKILWKFHQGTETLWVRWVHGFY 890
               N  +  I+WH VC P +EGGLG+R +   N     K++WK    + +LWV+WV    
Sbjct: 851  TEMNSNKAKISWHMVCKPKDEGGLGLRSLKEANDVCCLKLVWKIVSHSNSLWVKWVDQHL 910

Query: 891  LRNQSIW 911
            LRN S W
Sbjct: 911  LRNASFW 917



 Score = 63.9 bits (154), Expect = 1e-07
 Identities = 32/77 (41%), Positives = 42/77 (54%), Gaps = 4/77 (5%)
 Frame = +3

Query: 1074 RNEGPRHFWHNAIWKQFIPPKFSFCTWLACKDRLSTLDNL----SYIDTDPLCKLCKNEL 1241
            R+   R  WH  IW     PK+SFC+WLA   RL T D +    + I TD  C  C+  L
Sbjct: 1048 RSTSARVPWHKVIWFSHATPKYSFCSWLAAHGRLPTGDRMINWANGIATD--CIFCQGTL 1105

Query: 1242 ESAPHLFFMCTVTNSLW 1292
            E+  HLFF C+ T+ +W
Sbjct: 1106 ETRDHLFFTCSFTSVIW 1122


>dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis
            thaliana]
          Length = 1072

 Score =  265 bits (678), Expect = 2e-68
 Identities = 137/335 (40%), Positives = 196/335 (58%), Gaps = 7/335 (2%)
 Frame = +3

Query: 6    FLQDVLIGLGFPNLFISWIMECVTSASFSISLNGSLHGKFPGKRGLRQGDPMSPALFLLC 185
            F+   L  L  P  +I+WI +C+T+ SF+IS+NG+  G F   +GLRQGDP+SP LF+L 
Sbjct: 465  FVTAALRALAIPERYINWIHQCITTPSFTISVNGATGGFFRSTKGLRQGDPLSPYLFVLA 524

Query: 186  MEYLSRLINVRTSTHTFKYHPRCQELKISHLIFADDLMLFAKGDTQSIKILIDCLDEFKI 365
            ME  S+L+  R  +    YHP+  +L ISHL+FADD+M+F  G + S+  + + LD+F  
Sbjct: 525  MEVFSKLLYSRYDSGYIHYHPKAGDLSISHLMFADDVMIFFDGGSSSMHGICETLDDFAD 584

Query: 366  VSGLNINSSKSNIFTAGIFGKDLDDILDLVNY--PKGTLPVRYLGVPLAAQKLNCVHYAP 539
             SGL +N  KS +F AG+   DL + +    Y  P GT P+RYLG+PL  +KL    Y P
Sbjct: 585  WSGLKVNKDKSQLFQAGL---DLSERITSAAYGFPAGTFPIRYLGLPLMCRKLRIADYGP 641

Query: 540  LHDRIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSVIKRIYKLCRAFLW 719
            L ++++A +  W + +LS+AGR  LI SV+ G+  FW+  F LPK  IK+I  LC  FLW
Sbjct: 642  LLEKLSARLRSWVSKALSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLW 701

Query: 720  -----NKKRHPIAWHDVCLPVEEGGLGIRDVYAWNKALLSKILWKFHQGTETLWVRWVHG 884
                  +K   ++W D CLP  EGGLG R    WNK LL +++W       +LW +W   
Sbjct: 702  AGSIDGRKSSKVSWVDCCLPKSEGGLGFRSFGEWNKTLLLRLIWVLFDRDTSLWAQWQRH 761

Query: 885  FYLRNQSIWTWNPKKDDSTLLKRICDIRDELIQKF 989
              L + S W  N  + D    K + ++R  L +KF
Sbjct: 762  HRLGHASFWQVNALQTDPWTWKMLLNLR-PLAEKF 795


>dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1072

 Score =  265 bits (678), Expect = 2e-68
 Identities = 137/335 (40%), Positives = 196/335 (58%), Gaps = 7/335 (2%)
 Frame = +3

Query: 6    FLQDVLIGLGFPNLFISWIMECVTSASFSISLNGSLHGKFPGKRGLRQGDPMSPALFLLC 185
            F+   L  L  P  +I+WI +C+T+ SF+IS+NG+  G F   +GLRQGDP+SP LF+L 
Sbjct: 465  FVTAALRALAIPERYINWIHQCITTPSFTISVNGATGGFFRSTKGLRQGDPLSPYLFVLA 524

Query: 186  MEYLSRLINVRTSTHTFKYHPRCQELKISHLIFADDLMLFAKGDTQSIKILIDCLDEFKI 365
            ME  S+L+  R  +    YHP+  +L ISHL+FADD+M+F  G + S+  + + LD+F  
Sbjct: 525  MEVFSKLLYSRYDSGYIHYHPKAGDLSISHLMFADDVMIFFDGGSSSMHGICETLDDFAD 584

Query: 366  VSGLNINSSKSNIFTAGIFGKDLDDILDLVNY--PKGTLPVRYLGVPLAAQKLNCVHYAP 539
             SGL +N  KS +F AG+   DL + +    Y  P GT P+RYLG+PL  +KL    Y P
Sbjct: 585  WSGLKVNKDKSQLFQAGL---DLSERITSAAYGFPAGTFPIRYLGLPLMCRKLRIADYGP 641

Query: 540  LHDRIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSVIKRIYKLCRAFLW 719
            L ++++A +  W + +LS+AGR  LI SV+ G+  FW+  F LPK  IK+I  LC  FLW
Sbjct: 642  LLEKLSARLRSWVSKALSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLW 701

Query: 720  -----NKKRHPIAWHDVCLPVEEGGLGIRDVYAWNKALLSKILWKFHQGTETLWVRWVHG 884
                  +K   ++W D CLP  EGGLG R    WNK LL +++W       +LW +W   
Sbjct: 702  AGSIDGRKSSKVSWVDCCLPKSEGGLGFRSFGEWNKTLLLRLIWVLFDRDTSLWAQWQRH 761

Query: 885  FYLRNQSIWTWNPKKDDSTLLKRICDIRDELIQKF 989
              L + S W  N  + D    K + ++R  L +KF
Sbjct: 762  HRLGHASFWQVNALQTDPWTWKMLLNLR-PLAEKF 795


>dbj|BAB08270.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 489

 Score =  264 bits (674), Expect = 7e-68
 Identities = 141/356 (39%), Positives = 205/356 (57%), Gaps = 6/356 (1%)
 Frame = +3

Query: 36   FPNLFISWIMECVTSASFSISLNGSLHGKFPGKRGLRQGDPMSPALFLLCMEYLSRLINV 215
            FP +FI WIM CVT+ASF + +NG L G F   RGLRQG  +SP LF++ M  LS+L++ 
Sbjct: 3    FPPVFIHWIMLCVTTASFLVQVNGELAGYFNSTRGLRQGCSLSPYLFVVSMNVLSKLLDK 62

Query: 216  RTSTHTFKYHPRCQELKISHLIFADDLMLFAKGDTQSIKILIDCLDEFKIVSGLNINSSK 395
             T    F YHPRC+++ ++HL FADDLM+ + G  +SI+ +++  + F   SGL I+  K
Sbjct: 63   ATGQRRFGYHPRCKQMGLTHLSFADDLMVLSDGKVRSIEGIVEVFETFAKCSGLRISMEK 122

Query: 396  SNIFTAGIFGKDLDDILDLVNYPKGTLPVRYLGVPLAAQKLNCVHYAPLHDRIAAYIDKW 575
            S ++ AG+      +++    +  GTLPVRYLG+PL  ++L+   Y PL + I   I  W
Sbjct: 123  STVYFAGLSHTSPQEVMAHFPFAVGTLPVRYLGLPLVTKQLSSTDYLPLIEHIKKKIGSW 182

Query: 576  TANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSVIKRIYKLCRAFLW-----NKKRHPI 740
            +A  LSYAGRL LI SVL  I  FW+  F LP+  I+ I K+C A+LW     N  +  I
Sbjct: 183  SARFLSYAGRLNLISSVLWSICNFWMGAFRLPRECIREIDKMCSAYLWSGGDLNTSKAKI 242

Query: 741  AWHDVCLPVEEGGLGIRDVYAWNKALLSKILWKFHQGTETLWVRWVHGFYLRNQSIWTWN 920
            AW DVC P +EGGLG+R +   N     K++W+     ++LWV+W+H   L+  S W   
Sbjct: 243  AWTDVCKPKDEGGLGLRSLKEANDVSCLKLIWRIISHADSLWVKWIHATLLKQVSFWAVR 302

Query: 921  PKKD-DSTLLKRICDIRDELIQKFGSQQSAIDALTPLVNDNGLNSSKMYDIFRNEG 1085
                  S + K++   RD  IQ   ++ +   A T    DN  +  ++ DI  + G
Sbjct: 303  ENTSLGSWMWKKVLKFRDAAIQLCKAEVNN-GAHTFFWYDNWSDMGRLIDIAGDRG 357


>gb|ABD96948.1| hypothetical protein [Cleome spinosa]
          Length = 539

 Score =  262 bits (670), Expect = 2e-67
 Identities = 157/453 (34%), Positives = 239/453 (52%), Gaps = 23/453 (5%)
 Frame = +3

Query: 6    FLQDVLIGLGFPNLFISWIMECVTSASFSISLNGSLHGKFPGKRGLRQGDPMSPALFLLC 185
            F+  ++  L  P  F++W+  C+ +  FS+S+NG L G F G+RGLRQGDP+SP LF++ 
Sbjct: 58   FITKIMQALNLPRTFVTWVKVCMETPKFSVSINGELAGYFKGRRGLRQGDPLSPYLFIMS 117

Query: 186  MEYLSRLINVRTSTHTFKYHPRCQELKISHLIFADDLMLFAKGDTQSIKILIDCLDEFKI 365
            ME LSR+++   +      HP+C    I+HL FADD+M+F  G+T+S+  + + LD F  
Sbjct: 118  MEVLSRMLDRCAAESRLSLHPKCHSPVITHLAFADDIMIFTSGETRSLLEVKNTLDSFSR 177

Query: 366  VSGLNINSSKSNIFTAGIFGKDLDDILDLVNYPKGTLPVRYLGVPLAAQKLNCVHYAPLH 545
             SGL +N+ K+ IF  G+ G +   +  ++ + +G LPVRYLGV L+  +L    Y PL 
Sbjct: 178  ASGLYLNTEKTEIFLRGLNGTEASTLCAVIGFTRGYLPVRYLGVSLSPVRLTKSDYQPLL 237

Query: 546  DRIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSVIKRIYKLCRAFLWNK 725
            DR+ A I+ WT   LSYAGRL L+ +V+ G+   W  IF LPK   K++ +LC  FLW  
Sbjct: 238  DRVKAKINSWTTRYLSYAGRLQLVGTVIYGMVNAWGMIFMLPKFFTKQVDRLCAGFLWGA 297

Query: 726  -KRHPIAWHDVCLPVEEGGLGIRDVYAWNKALLSKILWKFHQGTETLWVRWVHGFYLR-- 896
               H ++W   C P +EGGLG+R +  +N+       W  + G+   +V       LR  
Sbjct: 298  GTTHRVSWDTCCRPRKEGGLGLRKIAEFNQD-----PWTIY-GSLLRYVGLTGPRSLRIP 351

Query: 897  -----NQSIWTWNPKKDDSTLLKRICDIRDELIQKFGSQQSAIDALTP-LVNDNGL---- 1046
                 +Q++        DS +      +R + +Q+  +  S I   +P   +D+ L    
Sbjct: 352  LPSSVSQAV------AGDSWIFP---GVRSDRLQQVLAHISTIPPPSPDGPSDSALWKYK 402

Query: 1047 --------NSSKMYDIFRNEGPRHFWHNAIWKQFIPPKFSFCTWLACKDRLSTLDNLSY- 1199
                    +SS+ +++ R       W + +W     P+ +F  W     RL T D L   
Sbjct: 403  EEDFRPYFSSSRTWNLTRTVHVIAPWSSIVWFPLAIPRHAFLHWQVMLFRLPTKDRLQQW 462

Query: 1200 -IDTDPLCKLCKNELESAPHLFFMCTVTNSLWR 1295
             I +D  C+LC  E ES  HLFF CT  + LWR
Sbjct: 463  GITSDATCRLCDGEDESHQHLFFGCTYASHLWR 495


>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score =  261 bits (668), Expect = 3e-67
 Identities = 142/371 (38%), Positives = 204/371 (54%), Gaps = 10/371 (2%)
 Frame = +3

Query: 6    FLQDVLIGLGFPNLFISWIMECVTSASFSISLNGSLHGKFPGKRGLRQGDPMSPALFLLC 185
            F+   L  L  P  FI+WI +C+++ +F++S+NG   G F   +GLRQGDP+SP LF+L 
Sbjct: 605  FVIAALRALAIPEKFINWISQCISTPTFTVSINGGNGGFFKSTKGLRQGDPLSPYLFVLA 664

Query: 186  MEYLSRLINVRTSTHTFKYHPRCQELKISHLIFADDLMLFAKGDTQSIKILIDCLDEFKI 365
            ME  S L++ R  +    YHP+   L ISHL+FADD+M+F  G + S+  + + LD+F  
Sbjct: 665  MEAFSNLLHSRYESGLIHYHPKASNLSISHLMFADDVMIFFDGGSFSLHGICETLDDFAS 724

Query: 366  VSGLNINSSKSNIFTAGIFGKDLDDILDLV-NYPKGTLPVRYLGVPLAAQKLNCVHYAPL 542
             SGL +N  KS+++ AG+    L+   +    +P GTLP+RYLG+PL  +KL    Y PL
Sbjct: 725  WSGLKVNKDKSHLYLAGL--NQLESNANAAYGFPIGTLPIRYLGLPLMNRKLRIAEYEPL 782

Query: 543  HDRIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSVIKRIYKLCRAFLWN 722
             ++I A    W    LS+AGR+ LI SV+ G   FW+  F LPK  IKRI  LC  FLW+
Sbjct: 783  LEKITARFRSWVNKCLSFAGRIQLISSVIFGSINFWMSTFLLPKGCIKRIESLCSRFLWS 842

Query: 723  -----KKRHPIAWHDVCLPVEEGGLGIRDVYAWNKALLSKILWKFHQGTETLWVRWVHGF 887
                  K   ++W  +CLP  EGGLG+R +  WNK L  +++W+     ++LW  W H  
Sbjct: 843  GNIEQAKGIKVSWAALCLPKSEGGLGLRRLLEWNKTLSMRLIWRLFVAKDSLWADWQHLH 902

Query: 888  YLRNQSIWTWNPKKDDSTLLKRICDIR----DELIQKFGSQQSAIDALTPLVNDNGLNSS 1055
            +L   S W     + DS   KR+  +R      L+ K G               NGL + 
Sbjct: 903  HLSRGSFWAVEGGQSDSWTWKRLLSLRPLAHQFLVCKVG---------------NGLKAD 947

Query: 1056 KMYDIFRNEGP 1088
              YD + + GP
Sbjct: 948  YWYDNWTSLGP 958


>gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1529

 Score =  261 bits (667), Expect = 5e-67
 Identities = 133/328 (40%), Positives = 189/328 (57%), Gaps = 6/328 (1%)
 Frame = +3

Query: 6    FLQDVLIGLGFPNLFISWIMECVTSASFSISLNGSLHGKFPGKRGLRQGDPMSPALFLLC 185
            FL + L  L FP  F  WI  C+++A+FS+ +NG L G F   RGLRQG  +SP LF++C
Sbjct: 908  FLLNTLEALNFPETFRHWIKLCISTATFSVQVNGELAGFFGSSRGLRQGCALSPYLFVIC 967

Query: 186  MEYLSRLINVRTSTHTFKYHPRCQELKISHLIFADDLMLFAKGDTQSIKILIDCLDEFKI 365
            M  LS +I+         YHP+C+++ ++HL FADDLM+F  G   SI+ +I+   EF  
Sbjct: 968  MNVLSHMIDEAAVHRNIGYHPKCEKIGLTHLCFADDLMVFVDGHQWSIEGVINVFKEFAG 1027

Query: 366  VSGLNINSSKSNIFTAGIFGKDLDDILDLVNYPKGTLPVRYLGVPLAAQKLNCVHYAPLH 545
             SGL I+  KS I+ AG+   D    L    +  G LPVRYLG+PL  +++    Y+PL 
Sbjct: 1028 RSGLQISLEKSTIYLAGVSASDRVQTLSSFPFANGQLPVRYLGLPLLTKQMTTADYSPLI 1087

Query: 546  DRIAAYIDKWTANSLSYAGRLLLIKSVLQGIECFWLQIFPLPKSVIKRIYKLCRAFLW-- 719
            + +   I  WTA SLSYAGRL L+ SV+  I  FW+  + LP   I+ I KLC AFLW  
Sbjct: 1088 EAVKTKISSWTARSLSYAGRLALLNSVIVSIANFWMSAYRLPAGCIREIEKLCSAFLWSG 1147

Query: 720  ---NKKRHPIAWHDVCLPVEEGGLGIRDVYAWNKALLSKILWKFHQGTETLWVRWVHGFY 890
               N K+  IAW  +C P +EGGLGI+ +   NK    K++W+      +LWV W+  F 
Sbjct: 1148 PVLNPKKAKIAWSSICQPKKEGGLGIKSLAEANKVSCLKLIWRLLSTQPSLWVTWIWTFI 1207

Query: 891  LRNQSIWTWNPKKD-DSTLLKRICDIRD 971
            +R  + W+ N +    S + K++   R+
Sbjct: 1208 IRKGTFWSANERSSLGSWMWKKLLKYRE 1235



 Score = 63.2 bits (152), Expect = 2e-07
 Identities = 28/78 (35%), Positives = 42/78 (53%), Gaps = 2/78 (2%)
 Frame = +3

Query: 1074 RNEGPRHFWHNAIWKQFIPPKFSFCTWLACKDRLSTLDNLSYIDTDPL--CKLCKNELES 1247
            R   P+  W+  +W  +  PK+SF  WL  ++RLST D +   ++  L  C LC N  E+
Sbjct: 1346 RTHQPQQNWYKGVWFPYSTPKYSFLLWLTVQNRLSTGDRIKAWNSGQLVTCTLCNNAEET 1405

Query: 1248 APHLFFMCTVTNSLWRRI 1301
              HLFF C  T+ +W  +
Sbjct: 1406 RDHLFFSCQYTSYVWEAL 1423


Top