BLASTX nr result

ID: Rehmannia22_contig00004633 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00004633
         (2120 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...   342   3e-91
ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664...   328   7e-87
ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665...   322   5e-85
emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...   295   4e-77
gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana]             276   3e-71
ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663...   275   4e-71
ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661...   270   2e-69
ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268...   266   3e-68
ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298...   263   2e-67
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   260   1e-66
ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670...   254   1e-64
gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thali...   254   1e-64
ref|XP_006584390.1| PREDICTED: putative ribonuclease H protein A...   251   9e-64
ref|XP_006590026.1| PREDICTED: uncharacterized protein LOC102660...   250   2e-63
gb|AAC95175.1| putative non-LTR retroelement reverse transcripta...   249   3e-63
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...   246   3e-62
gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thal...   240   2e-60
gb|AAF99785.1|AC012463_2 T2E6.4 [Arabidopsis thaliana]                237   1e-59
gb|AAC63678.1| putative non-LTR retroelement reverse transcripta...   236   3e-59
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       235   5e-59

>emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1114

 Score =  342 bits (878), Expect = 3e-91
 Identities = 178/479 (37%), Positives = 263/479 (54%), Gaps = 7/479 (1%)
 Frame = -3

Query: 2118 INGSLKGKFPGERGLRQGDPMSPGLFILCMEYLSRLINIKTSHSNFKFHPRCGELKISHL 1939
            +NG     F  ++GLRQGDP+SP LF L MEYLSR +        F FHP+C  +K++HL
Sbjct: 630  LNGIPSIPFDAQKGLRQGDPLSPFLFALSMEYLSRCMGNMCKDPEFNFHPKCERIKLTHL 689

Query: 1938 IFADDLMLFAKGDPPSVKILMDCLSEFKKVSGLDINSAKSNVFTAGVFGPDLDALLNLLN 1759
            +FADDL++FA+ D  S+  +M   + F K SGL  +  KS ++  GV   + + L + + 
Sbjct: 690  MFADDLLMFARADASSISKIMAAFNSFSKASGLQASIEKSCIYFGGVCHEEAEQLADRIQ 749

Query: 1758 FPSGSLPVRYLGVPLAAQKLNSVHYAPLYDRIAAYINKWTANSLTYAGRLLLIKSVLQGV 1579
             P GSLP RYLGVPLA++KLN     PL D+I      W A+ L+YAGRL L+K++L  +
Sbjct: 750  MPIGSLPFRYLGVPLASKKLNFSQCKPLIDKITTRAQGWVAHLLSYAGRLQLVKTILYSM 809

Query: 1578 ECFWLQIFPLPKSVVKRIYMLCRTFLW-----GKKRPPISWHKICMPSDEGGLGIRNVYA 1414
            + +W QIFPLPK ++K +   CR FLW        + P++W  +  P   GGL + N+  
Sbjct: 810  QNYWGQIFPLPKKLIKAVETTCRKFLWTGTVDTSYKAPVAWDFLQQPKSTGGLNVTNMVL 869

Query: 1413 WNKALLSKNLWNFHLKTDSLWVKWVHAFYLKRQSIWDWNPKKDDSTLLKRINDVKNELLC 1234
            WNKA + K LW    K D LWV+WV+A+Y+KRQ+I +     + S +L++I +   ELL 
Sbjct: 870  WNKAAILKLLWAITFKQDKLWVRWVNAYYIKRQNIENVTVSSNTSWILRKIFE-SRELLT 928

Query: 1233 KFGNQNAVIANLLAFSNHKGLISSKIYDIFRDHGEKNFWKAAVWKSFIPPKYSFCAWMAF 1054
            + G   AV       SNH      K Y + ++  E   WK  +  +   PK  F  W+A 
Sbjct: 929  RTGGWEAV-------SNHMNFSIKKTYKLLQEDYENVVWKRLICNNKATPKSQFILWLAM 981

Query: 1053 NDRLATINNLT--YTDINPMCKLCSQQLESAPHLFFTCPITNLLWNRIKAWLKIHRSMST 880
             +RLAT   ++    D++P+CK+C  ++E+  HLFF C  +  +W ++  +L +      
Sbjct: 982  LNRLATAERVSRWNRDVSPLCKMCGNEIETIQHLFFNCIYSKEIWGKVLLYLNLQPQADA 1041

Query: 879  LASAIKWIRKDKADPILKKARAVAFCCSIYHIWKARNAHVFDGDPFSYEAVFKKIQFHV 703
             A     I+K ++     K   + F  S+Y IW  RNA VF G   +     K I F +
Sbjct: 1042 QAKKELAIKKARSTKDRNKLYVMMFTESVYAIWLLRNAKVFRGIEINQNQAVKSIIFRI 1100


>ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664824 [Glycine max]
          Length = 939

 Score =  328 bits (840), Expect = 7e-87
 Identities = 181/483 (37%), Positives = 268/483 (55%), Gaps = 8/483 (1%)
 Frame = -3

Query: 2118 INGSLKGKFPGERGLRQGDPMSPGLFILCMEYLSRLINIKTSHSNFKFHPRCGELKISHL 1939
            ING    +    RG+RQGDP+SP LFIL MEYL+R+++      NF +H +C ++KI++L
Sbjct: 455  INGRFTRRLEARRGIRQGDPISPLLFILVMEYLNRILSQLDKIPNFNYHSKCEKMKITNL 514

Query: 1938 IFADDLMLFAKGDPPSVKILMDCLSEFKKVSGLDINSAKSNVFTAGVFGPDLDALLNLLN 1759
             FADDL+LF++GD  SV+I++D  + F +  GL +N +K N++   V     + LL +  
Sbjct: 515  CFADDLLLFSRGDIGSVQIMLDKFNTFLRSMGLHVNPSKCNIYCGSVDINVKEQLLLISG 574

Query: 1758 FPSGSLPVRYLGVPLAAQKLNSVHYAPLYDRIAAYINKWTANSLTYAGRLLLIKSVLQGV 1579
            F  G +P RYLG+PL+++KLN  HY  L D+I   I  W+A  L+YAGR+ LI+SV+   
Sbjct: 575  FKEGKMPFRYLGIPLSSKKLNIKHYQVLIDKIVGRITHWSAGLLSYAGRVQLIQSVIFAT 634

Query: 1578 ECFWLQIFPLPKSVVKRIYMLCRTFLW-----GKKRPPISWHKICMPSDEGGLGIRNVYA 1414
              FW+Q  PLPK V+ RI  +CR+FLW       ++ PI+W K+C P   GGL I N+  
Sbjct: 635  INFWMQCLPLPKFVIMRINAICRSFLWIGNSNISRKSPIAWEKVCSPKINGGLNIINLAI 694

Query: 1413 WNKALLSKNLWNFHLKTDSLWVKWVHAFYLKRQSIWDWNPKKDDSTLLKRINDVKNELLC 1234
            WNK  + K LWN   K+D+LW+KW+H +Y++ QSIW    KK  S ++  +  ++  LL 
Sbjct: 695  WNKISILKLLWNVCNKSDNLWIKWLHTYYIRGQSIWSMVLKKSHSWIMSSMMKLR-PLLL 753

Query: 1233 KFGNQNAVIANLLAFSNHKGLISSKIYDIFRDHGEKNFWKAAVWKSFIPPKYSFCAWMAF 1054
            ++ ++   +  +            KIY    +  EK  W+  +  +   P+  FC W A 
Sbjct: 754  QYQSRMQDVFKM-----------KKIYLALFEESEKMSWRTLMCNNLARPRALFCLWQAC 802

Query: 1053 NDRLATINNLTYTDIN--PMCKLCSQQLESAPHLFFTCPITNLLWNRIKAWLKIHRSMST 880
            + RLA+ + L    +N    C  CS  +ES  HLFF C     +W  +  WL+I    ST
Sbjct: 803  HFRLASKDRLIKFGLNVDANCAFCS-SMESHEHLFFGCIELKTIWTAVLNWLQIIHMPST 861

Query: 879  LASAIKWI-RKDKADPILKKARAVAFCCSIYHIWKARNAHVFDGDPFSYEAVFKKIQFHV 703
             +  + WI RK K           AF  +IYHIW  RN  VF G+  + +     I   +
Sbjct: 862  WSEELNWITRKCKGKGWRAMLLKCAFTETIYHIWAYRNHRVFGGNVNNRKVEDSIINTII 921

Query: 702  YQV 694
            Y+V
Sbjct: 922  YRV 924


>ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665746 [Glycine max]
          Length = 506

 Score =  322 bits (824), Expect = 5e-85
 Identities = 173/476 (36%), Positives = 260/476 (54%), Gaps = 8/476 (1%)
 Frame = -3

Query: 2118 INGSLKGKFPGERGLRQGDPMSPGLFILCMEYLSRLINIKTSHSNFKFHPRCGELKISHL 1939
            +NG         RGLRQGDP+SP LF++ ME L+R +       +F +HP+C +LKI++L
Sbjct: 13   VNGYKTEIMGARRGLRQGDPISPMLFVIVMECLNRYLYKMQKDGDFNYHPKCDKLKITNL 72

Query: 1938 IFADDLMLFAKGDPPSVKILMDCLSEFKKVSGLDINSAKSNVFTAGVFGPDLDALLNLLN 1759
             FADDL+LF++GD  SV ++M     F K +GL +N  K ++  AG+       +L +  
Sbjct: 73   CFADDLLLFSRGDKISVGMMMRAYESFSKATGLLVNPQKCSLLCAGIDAVTKREILEVSG 132

Query: 1758 FPSGSLPVRYLGVPLAAQKLNSVHYAPLYDRIAAYINKWTANSLTYAGRLLLIKSVLQGV 1579
            F  G LP +YLGVP+ ++KL+++HY+PL D+I   I  WTA  L+YAGRL L+ SV+  +
Sbjct: 133  FQEGQLPFKYLGVPVTSKKLSTIHYSPLIDKIVGKIKHWTARLLSYAGRLQLVNSVMFAL 192

Query: 1578 ECFWLQIFPLPKSVVKRIYMLCRTFLW-----GKKRPPISWHKICMPSDEGGLGIRNVYA 1414
              +WL  FP PKSV+++I  +CR FLW     G ++ P++W +IC P   GGL I ++  
Sbjct: 193  TNYWLNCFPFPKSVLQKIEAICRIFLWTGGFEGSRKSPVAWKQICSPRSCGGLNIIDIDI 252

Query: 1413 WNKALLSKNLWNFHLKTDSLWVKWVHAFYLKRQSIWDWNPKKDDSTLLKRINDVKNELLC 1234
            WNKA L K LWN   K DSLWVKW+ A+Y+KR  +     K  DS ++K I   + +L  
Sbjct: 253  WNKANLMKLLWNLSSKEDSLWVKWIQAYYVKRSELMHIEMKNTDSWIMKAILKQREDL-- 310

Query: 1233 KFGNQNAVIANLLAFSNHKGLISSKIYDIFRDHGEKNFWKAAVWKSFIPPKYSFCAWMAF 1054
                    I N+        +   K+Y   +D G++  WK  ++ +   P+ +F  W+A 
Sbjct: 311  ------EKIDNMEELMIRGSINMGKLYRKLQDCGQRKEWKNLLYGNTARPRANFILWLAC 364

Query: 1053 NDRLATINNLTYTDI--NPMCKLCSQQLESAPHLFFTCPITNLLWNRIKAWLKIHRSMST 880
            + RL+T + L    +  +  C  CS++ ES  HLFF C  +  +W  +  W++I    S 
Sbjct: 365  HGRLSTKDRLCKYGMIDDKSCCFCSEE-ESMNHLFFVCDNSKRVWMEVLQWVQIRHDPSD 423

Query: 879  LASAIKWI-RKDKADPILKKARAVAFCCSIYHIWKARNAHVFDGDPFSYEAVFKKI 715
              + + W+    K          +A   +IY IW  RN  +F G       V KKI
Sbjct: 424  WPNELHWLTHHTKGKGTRAAVLKMAIAETIYEIWNIRNNKIF-GQAIDINTVGKKI 478


>emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1110

 Score =  295 bits (756), Expect = 4e-77
 Identities = 167/485 (34%), Positives = 249/485 (51%), Gaps = 13/485 (2%)
 Frame = -3

Query: 2118 INGSLKGKFPGERGLRQGDPMSPGLFILCMEYLSRLINIKTSHSNFKFHPRCGELKISHL 1939
            +NG     F   +GLRQGDPMSP LF LCMEYLSR +       +F FHP+C  L I+HL
Sbjct: 627  VNGIPTQPFQARKGLRQGDPMSPFLFALCMEYLSRCLEELKGSPDFNFHPKCERLNITHL 686

Query: 1938 IFADDLMLFAKGDPPSVKILMDCLSEFKKVSGLDINSAKSNVFTAGVFGPDLDALLNLLN 1759
            +FADDL++F + D  S+  +     +F   SGL  +  KSN++  GV       L + ++
Sbjct: 687  MFADDLLMFCRADKSSLDHMNVAFQKFSHASGLAASHEKSNIYFCGVDDETARELADYVH 746

Query: 1758 FPSGSLPVRYLGVPLAAQKLNSVHYAPLYDRIAAYINKWTANSLTYAGRLLLIKSVLQGV 1579
               G LP RYLGVPL ++KL      PL + I      W A  L+YAGRL LIKS+L  +
Sbjct: 747  MQLGELPFRYLGVPLTSKKLTYAQCKPLVEMITNRAQTWMAKLLSYAGRLQLIKSILSSM 806

Query: 1578 ECFWLQIFPLPKSVVKRIYMLCRTFLW-GK----KRPPISWHKICMPSDEGGLGIRNVYA 1414
            + +W  IFPL K V++ +  +CR FLW GK    K+ P++W  I  P   GG  + N+  
Sbjct: 807  QNYWAHIFPLSKKVIQAVEKVCRKFLWTGKTEETKKAPVAWATIQRPKSRGGWNVINMKY 866

Query: 1413 WNKALLSKNLWNFHLKTDSLWVKWVHAFYLKRQSIWDWNPKKDDSTLLKRINDVKNELLC 1234
            WN+A + K LW    K D LWV+W+H++Y+KRQ I   N     + +L++I   ++  L 
Sbjct: 867  WNRAAMLKLLWAIEFKRDKLWVRWIHSYYIKRQDILTVNISNQTTWILRKIVKARDH-LS 925

Query: 1233 KFGNQNAVIANLLAFSNHKGLISSKIYDIFRDHGEKNFWKAAVWKSFIPPKYSFCAWMAF 1054
              G+ + +                K Y    ++GE+  W+  +  ++  PK  F  WM  
Sbjct: 926  NIGDWDEICIG-------DKFSMKKAYKKISENGERVRWRRLICNNYATPKSKFILWMML 978

Query: 1053 NDRLATINNLT----YTDINPMCKLCSQQLESAPHLFFTCPITNLLWNRIKAWLKIHRS- 889
            ++RL T++ ++      D+N   +LC    E+  HLFF+C  +  +W++I   ++   S 
Sbjct: 979  HERLPTVDRISRWGVQCDLN--YRLCRNDGETIQHLFFSCSYSAGVWSKICYIMRFPNSG 1036

Query: 888  ---MSTLASAIKWIRKDKADPILKKARAVAFCCSIYHIWKARNAHVFDGDPFSYEAVFKK 718
                  ++S     RK K      K   + +   +Y IWK RN   F G+      V +K
Sbjct: 1037 VSHQEIISSVCGQARKKKG-----KLIVMLYTEFVYAIWKQRNKRTFTGENKDENEVLRK 1091

Query: 717  IQFHV 703
            I F V
Sbjct: 1092 ILFAV 1096


>gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana]
          Length = 653

 Score =  276 bits (706), Expect = 3e-71
 Identities = 154/461 (33%), Positives = 240/461 (52%), Gaps = 11/461 (2%)
 Frame = -3

Query: 2118 INGSLKGKFPGERGLRQGDPMSPGLFILCMEYLSRLINIKTSHSNFKFHPRCGELKISHL 1939
            +NG L G F  +RGLRQG  +SP LF++ M+ LS+L++   S   F +H RC EL ++HL
Sbjct: 169  VNGELVGFFQSKRGLRQGCSLSPYLFVMSMDVLSKLLDQAASAKKFGYHSRCKELSLTHL 228

Query: 1938 IFADDLMLFAKGDPPSVKILMDCLSEFKKVSGLDINSAKSNVFTAGVFGPDLDALLNLLN 1759
             FADDLM+ + G   S+  +++    F K SGL I+  KS ++ AGV       + N   
Sbjct: 229  SFADDLMVLSDGKVRSIDGIVEVFDIFAKFSGLKISMEKSTIYLAGVTEDVYHEIQNRYQ 288

Query: 1758 FPSGSLPVRYLGVPLAAQKLNSVHYAPLYDRIAAYINKWTANSLTYAGRLLLIKSVLQGV 1579
            F  G LPVRYLG+PL  ++L +  Y+PL + I   I  WT   L+YAGRL LI SVL  +
Sbjct: 289  FDVGQLPVRYLGLPLVTKRLTATDYSPLLEHIKKKIGTWTTRYLSYAGRLNLITSVLWSI 348

Query: 1578 ECFWLQIFPLPKSVVKRIYMLCRTFLW-----GKKRPPISWHKICMPSDEGGLGIRNVYA 1414
              FWL  F LP+  ++ I  +C  FLW       ++  + W  +C P  EGGLG+R++  
Sbjct: 349  CNFWLAAFRLPRECIREIDKICSAFLWSGPDLNPRKTRVCWGDVCKPKQEGGLGLRSLKE 408

Query: 1413 WNKALLSKNLWNFHLKTDSLWVKWVHAFYLKRQSIWDWNPKKD-DSTLLKRINDVKNELL 1237
             N+    K +W     T+SLWV+W+  + LK  + W      + DS L +  ND   E +
Sbjct: 409  MNEVSCLKLIWRIVSHTNSLWVRWIEQYLLKHDTFWSVQTTTNMDSVLWRGRND---EYM 465

Query: 1236 CKFGNQNAVIANLLAFSNHKGLISSKIYDIFRDHGEKNFWKAAVWKSFIPPKYSFCAWMA 1057
             KF  ++                    ++  R+      W   +W +   PK+SFCAW+A
Sbjct: 466  PKFSTRDT-------------------WNQTRNTSTPVTWHMGIWFAHATPKFSFCAWLA 506

Query: 1056 FNDRLATINNLTYTD--INPMCKLCSQQLESAPHLFFTCPITNLLWNRIKAWL---KIHR 892
              +RL+T + +   +  ++P C LC+  +E+  HLFF+C  T  +W  +   +   K   
Sbjct: 507  VQNRLSTGDKMLQWNRRLSPTCVLCNNNIETRNHLFFSCCYTAEIWENLAKNIYKAKFST 566

Query: 891  SMSTLASAIKWIRKDKADPILKKARAVAFCCSIYHIWKARN 769
            + ST+ +++    +++ +  L +     F  +I+ IW  RN
Sbjct: 567  NWSTILTSVSTTWRNRTESFLAR---YIFQATIHTIWHERN 604


>ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663533 [Glycine max]
          Length = 514

 Score =  275 bits (704), Expect = 4e-71
 Identities = 138/405 (34%), Positives = 225/405 (55%), Gaps = 7/405 (1%)
 Frame = -3

Query: 2118 INGSLKGKFPGERGLRQGDPMSPGLFILCMEYLSRLINIKTSHSNFKFHPRCGELKISHL 1939
            ING L      + G+ QGDP+SP LF+L MEY +R++     + +F  H +C  L I+HL
Sbjct: 116  INGELSNVLETKIGIWQGDPISPLLFVLMMEYFNRIMVKMQRNPSFNHHSQCERLGITHL 175

Query: 1938 IFADDLMLFAKGDPPSVKILMDCLSEFKKVSGLDINSAKSNVFTAGVFGPDLDALLNLLN 1759
             FADD+ L  +GD  S+K+++   S F K +GL IN AK  VF  G+    +  +  +  
Sbjct: 176  SFADDVFLLCRGDKKSIKMIIKAFSFFSKSTGLQINPAKCKVFCGGLNCDSIQVITKITG 235

Query: 1758 FPSGSLPVRYLGVPLAAQKLNSVHYAPLYDRIAAYINKWTANSLTYAGRLLLIKSVLQGV 1579
            F  G+LPVRYLGVPL+ +KLN  HY PL ++I   I  W++  L+ AGR+ L++S++  +
Sbjct: 236  FEEGTLPVRYLGVPLSCKKLNVHHYLPLVEKIVGKIRHWSSKLLSIAGRIQLVRSIITAI 295

Query: 1578 ECFWLQIFPLPKSVVKRIYMLCRTFLWG-----KKRPPISWHKICMPSDEGGLGIRNVYA 1414
              +W+ +FP+PK V+++I  +CR+F+W      K++  ++W ++C P+  GGL + N+  
Sbjct: 296  AQYWMSVFPMPKKVIQKIDSICRSFIWSGSAEVKRKSLVAWKQVCKPARCGGLNLINLEL 355

Query: 1413 WNKALLSKNLWNFHLKTDSLWVKWVHAFYLKRQSIWDWNPKKDDSTLLKRINDVKNELLC 1234
            WN   + K LWN   K D+LWVKW+HA++LK  ++     K + + +LK +   + ++  
Sbjct: 356  WNVTAMLKCLWNICSKEDNLWVKWIHAYFLKGDNVMSATIKSNSTWILKSVMKQRPQV-- 413

Query: 1233 KFGNQNAVIANLLAFSNHKGLISSKIYDIFRDHGEKNFWKAAVWKSFIPPKYSFCAWMAF 1054
                 N  +  +      K  +     ++  DH  K  W   +  +   P+ +   W+A 
Sbjct: 414  ----NNLQLVWIEMLRKRKFSMKQVYMELVEDH-NKIDWFRLLRYNRARPRANVTLWLAC 468

Query: 1053 NDRLATINNLTYTDI--NPMCKLCSQQLESAPHLFFTCPITNLLW 925
             +RLAT   L   ++    +C LC +Q E   HL F+C +T  +W
Sbjct: 469  QNRLATKTRLKNMNMIQCSLCSLCKEQDEDLDHLMFSCRVTKAIW 513


>ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661523 [Glycine max]
          Length = 947

 Score =  270 bits (689), Expect = 2e-69
 Identities = 147/460 (31%), Positives = 244/460 (53%), Gaps = 12/460 (2%)
 Frame = -3

Query: 2085 ERGLRQGDPMSPGLFILCMEYLSRLINIKTSHSNFKFHPRCGELKISHLIFADDLMLFAK 1906
            +RG+RQGDP+SP LF++ MEYL+RL+       NF  H +C +L I+HL FADD++LF +
Sbjct: 466  KRGIRQGDPISPLLFVVMMEYLNRLLVKLQLDLNFNHHAKCEKLGITHLTFADDVLLFCR 525

Query: 1905 GDPPSVKILMDCLSEFKKVSGLDINSAKSNVFTAGVFGPDLDALLNLLNFPSGSLPVRYL 1726
            GD  SV++++  +++F   +GL +N  K  ++  GV G   + +  + ++  G LPVRYL
Sbjct: 526  GDVMSVEMMLHVINKFSATTGLVVNPNKCRIYFGGVDGTTKNKIQQISSYEEGQLPVRYL 585

Query: 1725 GVPLAAQKLNSVHYAPLYDRIAAYINKWTANSLTYAGRLLLIKSVLQGVECFWLQIFPLP 1546
            GVPL ++KLN  +Y PL D+I   I  WT+  L   GR+ ++   +  +  FW+Q  P+P
Sbjct: 586  GVPLTSKKLNIKYYLPLIDKITTRIRHWTSKLLNMTGRVQMVNCTITAIVQFWMQCLPIP 645

Query: 1545 KSVVKRIYMLCRTFLWGK-----KRPPISWHKICMPSDEGGLGIRNVYAWNKALLSKNLW 1381
             SV+K+I  +CR+F+W +     ++ PI+W+ +C P  +GGL I N+  WN   +   LW
Sbjct: 646  MSVIKKIDSMCRSFVWSRSTEITRKSPIAWNSVCRPKGQGGLNIFNLKVWNHITVLNCLW 705

Query: 1380 NFHLKTDSLWVKWVHAFYLKRQSIWDWNPKKDDSTLLKRINDVKNELLCKFGNQNAVIAN 1201
            N   K D+LWVKW+HA Y+K  S+ +     + S +LK +   +  +         V   
Sbjct: 706  NLCKKVDNLWVKWIHAHYIKNSSVMNTMVTNNFSWVLKNVLSQREYI----HTLQPVWDE 761

Query: 1200 LLAFSNHKGLISSKIYDIFRDHGEKNFWKAAVWKSFIPPKYSFCAWMAFNDRLATINNLT 1021
            LL   N +     K YD   +  ++  W   + K+   P+     W+A + RL T + L 
Sbjct: 762  LL---NSERFKMKKAYDKMME-ADRVHWSGLMRKNCARPRAIHTTWLACHGRLGTKDRLV 817

Query: 1020 YTDI--NPMCKLCSQQLESAPHLFFTCPITNLLWNRIKAWLKIHRSMSTLASAIKWI--- 856
               +  + +  LC +  E+  H+ F+C +   +W+ +   + I          + W+   
Sbjct: 818  RFGMITDKIWSLCKEVEETQNHILFSCKVATDIWSNVLNRIGIDHVPQEWPLELDWLLNL 877

Query: 855  --RKDKADPILKKARAVAFCCSIYHIWKARNAHVFDGDPF 742
              RK     +LK    ++   +IY IW  RN+ +F  + +
Sbjct: 878  TNRKGWRAYLLK----LSVTETIYGIWINRNSKIFGDNTY 913


>ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268376 [Solanum
            lycopersicum]
          Length = 717

 Score =  266 bits (679), Expect = 3e-68
 Identities = 122/276 (44%), Positives = 179/276 (64%), Gaps = 5/276 (1%)
 Frame = -3

Query: 2118 INGSLKGKFPGERGLRQGDPMSPGLFILCMEYLSRLINIKTSHSNFKFHPRCGELKISHL 1939
            +NG    +F   +GLRQGDPMSP LF + MEYLSRL+       +FK+HP+  +L ++HL
Sbjct: 435  VNGQNTQRFDAAKGLRQGDPMSPFLFAIAMEYLSRLLKGLKEDKSFKYHPKYAKLDVTHL 494

Query: 1938 IFADDLMLFAKGDPPSVKILMDCLSEFKKVSGLDINSAKSNVFTAGVFGPDLDALLNLLN 1759
             FADDL+LF++GD  S+K L  C +EF + SGL  N  KS+++  GV       ++  L 
Sbjct: 495  CFADDLLLFSRGDLNSIKALQKCFTEFSQASGLQANLNKSSIYCGGVQMEVRQQIIQQLG 554

Query: 1758 FPSGSLPVRYLGVPLAAQKLNSVHYAPLYDRIAAYINKWTANSLTYAGRLLLIKSVLQGV 1579
            +    LP +YLGVPL+++KLN++ + PL +++ A IN WTA  L+YAGR  L+K+VL GV
Sbjct: 555  YTIEELPFKYLGVPLSSKKLNTIQWYPLIEKVMARINSWTAKKLSYAGRAQLVKTVLFGV 614

Query: 1578 ECFWLQIFPLPKSVVKRIYMLCRTFLWG-----KKRPPISWHKICMPSDEGGLGIRNVYA 1414
            +  W Q+F +P  ++K I  LCR++LW       K+  I+W K+C P  EGGLG+ N+  
Sbjct: 615  QALWAQLFIIPAKIIKLIEGLCRSYLWSGVGYVTKKALIAWDKVCSPKYEGGLGLINLKI 674

Query: 1413 WNKALLSKNLWNFHLKTDSLWVKWVHAFYLKRQSIW 1306
            WN++ ++K  W+   K D LW+KW+HA+Y+K Q  W
Sbjct: 675  WNRSAVTKLCWDLANKEDKLWIKWIHAYYIKGQREW 710


>ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298394 [Fragaria vesca
            subsp. vesca]
          Length = 958

 Score =  263 bits (672), Expect = 2e-67
 Identities = 166/485 (34%), Positives = 242/485 (49%), Gaps = 17/485 (3%)
 Frame = -3

Query: 2118 INGSLKGKFPGERGLRQGDPMSPGLFILCMEYLSRLINIKTSHSN-FKFHPRCGELKISH 1942
            +NG L G F   RGLRQGDP+SP LF++ ME LS  I  + + S  F++H RC +L +SH
Sbjct: 463  VNGELAGFFARRRGLRQGDPLSPYLFVIAMEVLSLCIQRRINCSPCFRYHWRCDQLNLSH 522

Query: 1941 LIFADDLMLFAKGDPPSVKILMDCLSEFKKVSGLDINSAKSNVFTAGVFGPDLDALLNLL 1762
            L FADDL++F  GD  SV+ L D  S F+ +S L  N ++S +F AGV G   D++L + 
Sbjct: 523  LCFADDLLMFCNGDENSVRTLHDAFSNFESLSSLKANVSESKIFLAGVDGNSSDSVLQVT 582

Query: 1761 NFPSGSLPVRYLGVPLAAQKLNSVHYAPLYDRIAAYINKWTANSLTYAGRLLLIKSVLQG 1582
            NF  G+ PVRYLG+PL   KL     +PL DRI   I  W    L++AGRL LI+SVL  
Sbjct: 583  NFSLGTCPVRYLGIPLITSKLRMQDCSPLLDRIETRIKSWENKVLSFAGRLQLIQSVLSS 642

Query: 1581 VECFWLQIFPLPKSVVKRIYMLCRTFLW-----GKKRPPISWHKICMPSDEGGLGIRNVY 1417
            ++ +W     LPK V+K I    R FLW     G+    ++W +IC+P  EGGLGI++++
Sbjct: 643  IQVYWASHLILPKKVLKDIEKRLRCFLWAGNCSGRAATKVAWSEICLPKCEGGLGIKDLH 702

Query: 1416 AWNKALLSKNLWNFHLKTDSLWVKWVHAFYLKRQSIWD--------WNPKKDDSTLLKRI 1261
             WNKAL+  ++WN    + + W  WV  + LK  S W+        WN +K    LLK  
Sbjct: 703  CWNKALMISHIWNLVSSSSNFWTDWVKVYLLKGNSFWNAPLPSICSWNWRK----LLK-- 756

Query: 1260 NDVKNELLCKFGNQNAVIANLLAFSNHKGLISSKIYDIFRDHGEKNF-WKAAVWKSFIPP 1084
                 EL C F        N++      G  +S  +D +   G     W + +       
Sbjct: 757  ---IRELCCSF------FVNIIG----DGRATSLWFDNWHPLGPLTLRWSSNIIGESGLS 803

Query: 1083 KYSFCAWMAFNDRLATINNLTYTD-INPMCKLCSQQLESAPHLFFTCPITNLLWNRIKAW 907
            K +      F    +  N L  +  I P  +L     E+  HLFF C  +  +W  + + 
Sbjct: 804  KSAMLTPNGFYSTSSAWNTLRPSRFIVPWYRLVWFVAETHNHLFFDCAYSFGIWTHVLSK 863

Query: 906  LKIHRSMSTLASAIKWIRKD-KADPILKKARAVAFCCSIYHIWKARNAHVFDGDPFSYEA 730
              + + +   +  I W+  + K + +      +A    +Y IW+ RN   F  +      
Sbjct: 864  CDVSKPLLPWSDFIFWVATNWKGNSLPVVILKLALQAVVYAIWRERNNRRFRNESLPPAV 923

Query: 729  VFKKI 715
            VFK I
Sbjct: 924  VFKGI 928


>dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 1223

 Score =  260 bits (665), Expect = 1e-66
 Identities = 162/533 (30%), Positives = 249/533 (46%), Gaps = 83/533 (15%)
 Frame = -3

Query: 2118 INGSLKGKFPGERGLRQGDPMSPGLFILCMEYLSRLINIKTSHSNFKFHPRCGELKISHL 1939
            +NG L G F   RGLRQG  +SP LF++CM+ LS++++   +  +F +HP+C  + ++HL
Sbjct: 642  VNGELAGYFQSSRGLRQGCALSPYLFVICMDVLSKMLDKAAAARHFGYHPKCKTMGLTHL 701

Query: 1938 IFADDLMLFAKGDPPSVKILMDCLSEFKKVSGLDINSAKSNVFTAGVFGPDLDALLNLLN 1759
             FADDLM+ + G   S++ ++    EF K SGL I+  KS V+ AG+     + + +   
Sbjct: 702  SFADDLMVLSDGKIRSIERIIKVFDEFAKWSGLRISLEKSTVYLAGLSATARNEVADRFP 761

Query: 1758 FPSGSLPVRYLGVPLAAQKLNSVHYAPLYDRIAAYINKWTANSLTYAGRLLLIKSVLQGV 1579
            F SG LPVRYLG+PL  ++L++    PL +++   I  WT+  L+YAGRL LI SVL  +
Sbjct: 762  FSSGQLPVRYLGLPLITKRLSTTDCLPLLEQVRKRIGSWTSRFLSYAGRLNLISSVLWSI 821

Query: 1578 ECFWLQIFPLPKSVVKRIYMLCRTFLWG-----KKRPPISWHKICMPSDEGGLGIRNVYA 1414
              FWL  F LP+  ++ +  +C  FLW        +  ISWH +C P DEGGLG+R++  
Sbjct: 822  CNFWLAAFRLPRKCIRELEKMCSAFLWSGTEMNSNKAKISWHMVCKPKDEGGLGLRSLKE 881

Query: 1413 WNKALLSKNLWNFHLKTDSLWVKWVHAFYLKRQSI-----------WDWNPKKDDSTLLK 1267
             N     K +W     ++SLWVKWV    L+  S            W W        + K
Sbjct: 882  ANDVCCLKLVWKIVSHSNSLWVKWVDQHLLRNASFWEVKQTVSQGSWIWKKLLKYREVAK 941

Query: 1266 RINDVK--NELLCKFGNQN-AVIANLLAFSNHKGLIS----------------------S 1162
             ++ V+  N     F   N + +  LL  +  +GLI                       +
Sbjct: 942  TLSKVEVGNGKQTSFWYDNWSDLGQLLERTGDRGLIDLGISRRMTVEEAWTNRRQRRHRN 1001

Query: 1161 KIYDIFRDHGEKNF----------------------------------------WKAAVW 1102
             +Y++  D  +K++                                        W   +W
Sbjct: 1002 DVYNVIEDALKKSWDTRTETEDKVLWRGKSDVFRTTFSTRDTWHHTRSTSARVPWHKVIW 1061

Query: 1101 KSFIPPKYSFCAWMAFNDRLATINNLT--YTDINPMCKLCSQQLESAPHLFFTCPITNLL 928
             S   PKYSFC+W+A + RL T + +      I   C  C   LE+  HLFFTC  T+++
Sbjct: 1062 FSHATPKYSFCSWLAAHGRLPTGDRMINWANGIATDCIFCQGTLETRDHLFFTCSFTSVI 1121

Query: 927  WNRIKAWLKIHRSMSTLASAIKWIRKDKADPILKKARAVAFCCSIYHIWKARN 769
            W  +   +   +  S   S I+ I   +   +    R   F  +IY +W+ RN
Sbjct: 1122 WVDLARGIFKTQYTSHWQSIIEAITNSQHHRVEWFLRRYVFQATIYIVWRERN 1174


>ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670237 [Glycine max]
          Length = 383

 Score =  254 bits (648), Expect = 1e-64
 Identities = 143/431 (33%), Positives = 215/431 (49%), Gaps = 5/431 (1%)
 Frame = -3

Query: 2118 INGSLKGKFPGERGLRQGDPMSPGLFILCMEYLSRLINIKTSHSNFKFHPRCGELKISHL 1939
            +NGS+ G F G+RGLRQ DP+SP LF+L +EY +R I     ++NF+F+P C   ++SHL
Sbjct: 13   VNGSIYGHFKGQRGLRQWDPLSPYLFVLYIEYFARDIQSLKDNANFQFNPNCAVTQLSHL 72

Query: 1938 IFADDLMLFAKGDPPSVKILMDCLSEFKKVSGLDINSAKSNVFTAGVFGPDLDALLNLLN 1759
             FADD+ML ++GD PSV  +   L  F  VSGL I+S  S                    
Sbjct: 73   TFADDIMLLSRGDLPSVSAIYAKLQHFCNVSGLSISSRWSR------------------- 113

Query: 1758 FPSGSLPVRYLGVPLAAQKLNSVHYAPLYDRIAAYINKWTANSLTYAGRLLLIKSVLQGV 1579
                                 S+ YA   + I A I                     QG+
Sbjct: 114  --------------------KSLSYAGKVELIRAVI---------------------QGI 132

Query: 1578 ECFWLQIFPLPKSVVKRIYMLCRTFLWGKK-----RPPISWHKICMPSDEGGLGIRNVYA 1414
              FW+ IFPLP+SV+  I   CR FLWGK      +P ++W ++C P  EGGLG+ N+  
Sbjct: 133  ANFWMSIFPLPQSVLDTIIATCRNFLWGKADGGKIKPLVAWSEVCTPKKEGGLGLFNLKD 192

Query: 1413 WNKALLSKNLWNFHLKTDSLWVKWVHAFYLKRQSIWDWNPKKDDSTLLKRINDVKNELLC 1234
            WN ALLS  LW+ H K DSLWV+ VH +Y K  ++WD+     DS  +     +++ ++ 
Sbjct: 193  WNIALLSCILWDLHSKKDSLWVRLVHHYYFKGGNVWDFISSSSDSVFI----HIRDIIIS 248

Query: 1233 KFGNQNAVIANLLAFSNHKGLISSKIYDIFRDHGEKNFWKAAVWKSFIPPKYSFCAWMAF 1054
            K  N       L ++  ++  ++ K+YD  R       W + +W   IP K SF  W+A 
Sbjct: 249  KEENIEVAKLMLNSWGCNEQTLAGKMYDYIRGTRPVVHWSSIIWNPVIPSKMSFILWLAT 308

Query: 1053 NDRLATINNLTYTDINPMCKLCSQQLESAPHLFFTCPITNLLWNRIKAWLKIHRSMSTLA 874
             +RL  ++   + +   +C LC+ + ES  HLFF+C  +  +W  I+ W+ + R   +L 
Sbjct: 309  KNRLLALDRAAFLNKGFLCPLCTNEAESHAHLFFSCRTSLRVWAHIRDWIPLKRQSISLQ 368

Query: 873  SAIKWIRKDKA 841
             +I  + + +A
Sbjct: 369  HSISALIRRRA 379


>gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thaliana]
            gi|20197043|gb|AAM14892.1| putative reverse transcriptase
            [Arabidopsis thaliana]
          Length = 1412

 Score =  254 bits (648), Expect = 1e-64
 Identities = 139/466 (29%), Positives = 236/466 (50%), Gaps = 11/466 (2%)
 Frame = -3

Query: 2079 GLRQGDPMSPGLFILCMEYLSRLINIKTSHSNFKFHPRCGELKISHLIFADDLMLFAKGD 1900
            GLRQG  +SP LF++CM  LS +++       F +HPRC  + ++HL FADD+M+F+ G 
Sbjct: 910  GLRQGCSLSPYLFVICMNVLSAMLDKGAVEKRFGYHPRCRNMGLTHLCFADDIMVFSAGS 969

Query: 1899 PPSVKILMDCLSEFKKVSGLDINSAKSNVFTAGVFGPDLDALLNLLNFPSGSLPVRYLGV 1720
              S++ ++    +F   SGL+I+  KS +F A +      ++L    F SGSLPVRYLG+
Sbjct: 970  AHSLEGVLAIFKDFAAFSGLNISLEKSTLFMASISSETCASILARFPFDSGSLPVRYLGL 1029

Query: 1719 PLAAQKLNSVHYAPLYDRIAAYINKWTANSLTYAGRLLLIKSVLQGVECFWLQIFPLPKS 1540
            PL  +++      PL ++I + I+ W    L+YAGRL L+ SV+  +  FW+  F LP++
Sbjct: 1030 PLMTKRMTLADCLPLLEKIRSRISSWKNRFLSYAGRLQLLNSVISSLTKFWISAFRLPRA 1089

Query: 1539 VVKRIYMLCRTFLW-----GKKRPPISWHKICMPSDEGGLGIRNVYAWNKALLSKNLWNF 1375
             ++ I  +   FLW        +  ++WH +C P  EGGLG+R++   NK    K +W  
Sbjct: 1090 CIREIEQISAAFLWSGTDLNPHKAKVAWHDVCKPKSEGGLGLRSLVDANKICCFKLIWRL 1149

Query: 1374 HLKTDSLWVKWVHAFYLK--RQSIWDWNPKKDDSTLLKRINDVKNELLCK--FGNQNAVI 1207
                 SLWV W+    ++   +++     +     +L  I +   +LLC+     Q+  +
Sbjct: 1150 VSAKHSLWVNWIQNNLIRTVAEALSSHRRRSHRDDILNDIEEELEKLLCRGICTEQDRSL 1209

Query: 1206 ANLLAFSNHKGLISSKIYDIFRDHGEKNFWKAAVWKSFIPPKYSFCAWMAFNDRLATINN 1027
               +         S +I+   R+ G    W  A+W S   PK++F +W+A +DRL T + 
Sbjct: 1210 CRSIGGQFKAKFFSPEIWHQIREQGLVKQWHKAIWFSGATPKFTFISWLAAHDRLTTGDK 1269

Query: 1026 LTYTD--INPMCKLCSQQLESAPHLFFTCPITNLLWNRIKAWLKIHRSMSTLASAIKWIR 853
            +   +  I+ +C LC+   ES  HLFF+C  ++ +W+R+   L + R  +   + +  + 
Sbjct: 1270 MASWNRGISSVCVLCNISAESRDHLFFSCNFSSHIWDRLTRRLLLCRYTTNFPALLLLLS 1329

Query: 852  KDKADPILKKARAVAFCCSIYHIWKARNAHVFDGDPFSYEAVFKKI 715
                    +      F  +I+ +W+ RN       P   + + K I
Sbjct: 1330 GQDFSGTKRFLLRYVFQATIHTLWRERNKRRHGDLPIPSDHIIKFI 1375


>ref|XP_006584390.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Glycine
            max]
          Length = 316

 Score =  251 bits (641), Expect = 9e-64
 Identities = 120/274 (43%), Positives = 173/274 (63%), Gaps = 5/274 (1%)
 Frame = -3

Query: 2037 LCMEYLSRLINIKTSHSNFKFHPRCGELKISHLIFADDLMLFAKGDPPSVKILMDCLSEF 1858
            LC  + +R ++     +NFKFHP C  +++SHL FADD+ML ++GD P +  +   L  F
Sbjct: 25   LCFVWSTRDMSSFKDDANFKFHPNCAGIQLSHLAFADDIMLLSRGDIPYMSTMFAKLQHF 84

Query: 1857 KKVSGLDINSAKSNVFTAGVFGPDLDALLNLLNFPSGSLPVRYLGVPLAAQKLNSVHYAP 1678
             +VSGL I+S KS +++AG+   +L  +  L  F  G  P RYLG PL + +LN  HYAP
Sbjct: 85   CRVSGLSISSDKSAIYSAGIRPYELSHIQQLTGFSLGGFPFRYLGAPLLSSRLNVCHYAP 144

Query: 1677 LYDRIAAYINKWTANSLTYAGRLLLIKSVLQGVECFWLQIFPLPKSVVKRIYMLCRTFLW 1498
            L  +I   I  W   SL+Y G+L LIK+V+QG+  FW++IFPLP+SV+ RI   C  FLW
Sbjct: 145  LLYKIVGLIQGWNKKSLSYVGKLELIKAVIQGIMNFWMRIFPLPQSVLDRINASCCNFLW 204

Query: 1497 -----GKKRPPISWHKICMPSDEGGLGIRNVYAWNKALLSKNLWNFHLKTDSLWVKWVHA 1333
                 GK +P ++W  +C P  EGGLG+ N+  WN ALLS  LW+FH K DSL V+WVH 
Sbjct: 205  SKADIGKNKPLVAWPVVCSPKQEGGLGLFNLKDWNLALLSHILWDFHCKKDSLRVRWVHH 264

Query: 1332 FYLKRQSIWDWNPKKDDSTLLKRINDVKNELLCK 1231
            +Y +R   W++N    +S L+K+I  +++ ++ K
Sbjct: 265  YYFRRSDEWNYNISSSNSVLIKKIIQIRDFIISK 298


>ref|XP_006590026.1| PREDICTED: uncharacterized protein LOC102660482 [Glycine max]
          Length = 303

 Score =  250 bits (638), Expect = 2e-63
 Identities = 120/288 (41%), Positives = 179/288 (62%), Gaps = 5/288 (1%)
 Frame = -3

Query: 1989 SNFKFHPRCGELKISHLIFADDLMLFAKGDPPSVKILMDCLSEFKKVSGLDINSAKSNVF 1810
            +NFKFHP C  +++SHL F DD+ML ++GD PS+  +   L  F +V GL I+S KS+++
Sbjct: 8    ANFKFHPNCAGIQLSHLAFVDDIMLLSRGDIPSMSTMFAKLQHFCRVLGLSISSDKSSIY 67

Query: 1809 TAGVFGPDLDALLNLLNFPSGSLPVRYLGVPLAAQKLNSVHYAPLYDRIAAYINKWTANS 1630
            ++ +   +L  +  L  F  G  P RYLGVPL + +LN  HYAPL  +I   I  W+  S
Sbjct: 68   SSSIRTHELSHIQQLTGFSLGGFPFRYLGVPLLSSRLNVCHYAPLLSKITGLIQGWSRKS 127

Query: 1629 LTYAGRLLLIKSVLQGVECFWLQIFPLPKSVVKRIYMLCRTFLW-----GKKRPPISWHK 1465
            L+YAG+L LI++V+QG+  FW+ IFPLP+SV+ RI   CR FLW     GKK+P ++W  
Sbjct: 128  LSYAGKLELIRAVIQGIVNFWIGIFPLPQSVLDRINASCRNFLWGKADIGKKKPLVAWSV 187

Query: 1464 ICMPSDEGGLGIRNVYAWNKALLSKNLWNFHLKTDSLWVKWVHAFYLKRQSIWDWNPKKD 1285
            +C P  EGGLG+ N+  WN ALLS  LW+FH K DSL   WVH +Y +R  +W++N    
Sbjct: 188  VCSPKREGGLGLFNLKDWNLALLSCILWDFHCKKDSL---WVHHYYFRRSDVWNYNTSSS 244

Query: 1284 DSTLLKRINDVKNELLCKFGNQNAVIANLLAFSNHKGLISSKIYDIFR 1141
             S L+K+I  +++ ++ K  +       + ++  +  L+  K+Y+  R
Sbjct: 245  YSVLIKKIIQIRDFIISKELSTEEAKKRIQSWRTNGQLLVGKVYEYIR 292


>gb|AAC95175.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1352

 Score =  249 bits (637), Expect = 3e-63
 Identities = 160/534 (29%), Positives = 251/534 (47%), Gaps = 84/534 (15%)
 Frame = -3

Query: 2118 INGSLKGKFPGERGLRQGDPMSPGLFILCMEYLSRLINIKTSHSNFKFHPRCGELKISHL 1939
            +NG L G F  ERGLRQG  +SP L+++CM  LS +++         +HPRC  + ++HL
Sbjct: 789  VNGELSGFFRSERGLRQGCSLSPYLYVICMNVLSCMLDKAAVEKKISYHPRCRNMNLTHL 848

Query: 1938 IFADDLMLFAKGDPPSVKILMDCLSEFKKVSGLDINSAKSNVFTAGVFGPDLDALLNLLN 1759
             FADD+M+F+ G   S++  +    +F  +S L I+  KS +F AG+      ++L    
Sbjct: 849  CFADDIMVFSDGTSKSIQGTLAIFEKFAAMSWLKISLEKSTIFMAGISPNAKTSILQQFP 908

Query: 1758 FPSGSLPVRYLGVPLAAQKLNSVHYAPLYDRIAAYINKWTANSLTYAGRLLLIKSVLQGV 1579
            F  G+LPV+YLG+PL  +++    Y PL ++I A I  WT   L++AGRL LIKSVL  +
Sbjct: 909  FELGTLPVKYLGLPLLTKRMTQSDYLPLVEKIRARITSWTNRFLSFAGRLQLIKSVLSSI 968

Query: 1578 ECFWLQIFPLPKSVVKRIYMLCRTFLW-----GKKRPPISWHKICMPSDEGGLGIRNVYA 1414
              FWL +F LPK+ ++ I  +   FLW       K+  I+W ++C   +EGGLG++ +  
Sbjct: 969  TNFWLSVFRLPKACLQEIEKMFSAFLWSGPDLNTKKAKIAWSEVCKLKEEGGLGLKPLKE 1028

Query: 1413 WNKALLSKNLWNFHLKTDSLWVKWVHAFYLKRQSIWD-----------WNP--KKDDSTL 1273
             N+  L K +W      DSLWVKWV+   +++++ W            W    K+ D   
Sbjct: 1029 ANEVSLLKLIWRILSARDSLWVKWVNKHLIRKETFWSVKENTGLGSWLWRKILKQRDKAR 1088

Query: 1272 LKRINDVKNELLCKFGN------------------------QNAVIANLLAFSNHK---- 1177
            L    +V++     F +                         NA +A ++     K    
Sbjct: 1089 LFHRMEVRSGTFTSFWHDHWCPLGRLHQHMGSRGTIDLGIPNNATVAEVMNTHRRKRHRA 1148

Query: 1176 ---GLISSKIYDIFRDH---GEKNFWK-----------------------------AAVW 1102
                 I S+I    +D    G+++ WK                               VW
Sbjct: 1149 DFLNQIKSQIELARQDRSTDGDRSLWKQKEDTFKSSFSSSKTWQQIRSISLRCDWYRGVW 1208

Query: 1101 KSFIPPKYSFCAWMAFNDRLATINNLTYTDINPM--CKLCSQQLESAPHLFFTCPITNLL 928
             S   PKYSF  W+AF++RL T + +   +      C  C ++LE+  HLFF+CP ++ +
Sbjct: 1209 FSASTPKYSFVTWLAFHNRLTTSDKICKWNSGARYDCVFCGEELETRDHLFFSCPYSSHV 1268

Query: 927  WNRIKAWLKIHRSMSTLASAIKWIRKDKADPILKK-ARAVAFCCSIYHIWKARN 769
            W  +   L   R++    + I     D + P L       AF  SI+ +W+ RN
Sbjct: 1269 WFSLTKGLLNGRNILNW-NLITPHLLDSSRPYLHVFTLRYAFQASIHSLWRERN 1321


>gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus]
          Length = 1214

 Score =  246 bits (628), Expect = 3e-62
 Identities = 126/291 (43%), Positives = 173/291 (59%), Gaps = 5/291 (1%)
 Frame = -3

Query: 2118 INGSLKGKFPGERGLRQGDPMSPGLFILCMEYLSRLINIKTSHSNFKFHPRCGELKISHL 1939
            ++GSL G F G +GLRQGDP+SP LF++ ME LSRL+  K S  +  +HP+  E++IS L
Sbjct: 635  VSGSLCGYFKGSKGLRQGDPLSPSLFVIAMEILSRLLENKFSDGSIGYHPKASEVRISSL 694

Query: 1938 IFADDLMLFAKGDPPSVKILMDCLSEFKKVSGLDINSAKSNVFTAGVFGPDLDALLNLLN 1759
             FADDLM+F  G   S++ +   L  FK +SGL++N+ KS V+TAG+   D +  L    
Sbjct: 695  AFADDLMIFYDGKASSLRGIKSVLESFKNLSGLEMNTEKSAVYTAGLEDTDKEDTL-AFG 753

Query: 1758 FPSGSLPVRYLGVPLAAQKLNSVHYAPLYDRIAAYINKWTANSLTYAGRLLLIKSVLQGV 1579
            F +G+ P RYLG+PL  +KL    Y+ L D+IAA  N W   +L++AGRL LI SV+   
Sbjct: 754  FVNGTFPFRYLGLPLLHRKLRRSDYSQLIDKIAARFNHWATKTLSFAGRLQLISSVIYST 813

Query: 1578 ECFWLQIFPLPKSVVKRIYMLCRTFLWG-----KKRPPISWHKICMPSDEGGLGIRNVYA 1414
              FWL  F LPK  +K I  +C  FLWG     +    +SW   C+P  EGGLG+RN + 
Sbjct: 814  VNFWLSSFILPKCCLKTIEQMCNRFLWGNDITRRGDIKVSWQNSCLPKAEGGLGLRNFWT 873

Query: 1413 WNKALLSKNLWNFHLKTDSLWVKWVHAFYLKRQSIWDWNPKKDDSTLLKRI 1261
            WNK L  + +W    + DSLWV W HA  L+  + W+       S + K I
Sbjct: 874  WNKTLNLRLIWMLFARRDSLWVAWNHANRLRHVNFWNAEAASHHSWIWKAI 924



 Score = 78.2 bits (191), Expect = 1e-11
 Identities = 47/156 (30%), Positives = 77/156 (49%), Gaps = 5/156 (3%)
 Frame = -3

Query: 1167 SSKI-YDIFRDHGEKNFWKAAVWKSFIPPKYSFCAWMAFNDRLATINNLTYTDIN--PMC 997
            SSK+ ++  R       W AAVW     PKY+F  W+A  +RL      T+   N   +C
Sbjct: 1034 SSKLTWECLRQRDTTKLWAAAVWYKGCIPKYAFNFWVAHLNRLPVRARTTHWSTNRPSLC 1093

Query: 996  KLCSQQLESAPHLFFTCPITNLLWNRIKAWLKIHRSMSTLASAIKWIRKDKA--DPILKK 823
             +C ++ E+  HLF  C + +L+W ++ A     +        I+W+  ++      LKK
Sbjct: 1094 CVCQRETETRDHLFIHCTLGSLIWQQVLARFGRSQMFREWKDIIEWMLSNQGSFSGTLKK 1153

Query: 822  ARAVAFCCSIYHIWKARNAHVFDGDPFSYEAVFKKI 715
               +A   +I+HIWK RN+ +      S+ A+FK+I
Sbjct: 1154 ---LAVQTAIFHIWKERNSRLHSAMSASHTAIFKQI 1186


>gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thaliana]
          Length = 629

 Score =  240 bits (613), Expect = 2e-60
 Identities = 154/535 (28%), Positives = 241/535 (45%), Gaps = 85/535 (15%)
 Frame = -3

Query: 2118 INGSLKGKFPGERGLRQGDPMSPGLFILCMEYLSRLINIKTSHSNFKFHPRCGELKISHL 1939
            +NG L G F   RG+RQG  +SP LF++ ME LS++++       F FHP+C  L ++HL
Sbjct: 47   VNGELAGYFRSARGIRQGCALSPYLFVISMEVLSKMLDQAAGGKRFGFHPKCKNLGLTHL 106

Query: 1938 IFADDLMLFAKGDPPSVKILMDCLSEFKKVSGLDINSAKSNVFTAGVFGPDLDALLNLLN 1759
             FADDLM+   G   SV  +++ ++ F K SGL IN  K+ ++TAGV   +   +++   
Sbjct: 107  CFADDLMILTDGKVRSVDGIVEVMNLFAKRSGLQINMEKTTLYTAGVSDHNRYMMISRYP 166

Query: 1758 FPSGSLPVRYLGVPLAAQKLNSVHYAPLYDRIAAYINKWTANSLTYAGRLLLIKSVLQGV 1579
            F  G LPVRYLG+PL  ++L     +PL+++I   I  WT+  L++AGRL LI SVL   
Sbjct: 167  FGLGQLPVRYLGLPLVTKRLTKEDLSPLFEQIRNRIGTWTSRYLSFAGRLNLISSVLWST 226

Query: 1578 ECFWLQIFPLPKSVVKRIYMLCRTFLWG-----KKRPPISWHKICMPSDEGGLGIRNV-- 1420
              FW+  F LP + +K I  +C  FLW      +++  +SW  IC P  EGGLG+R++  
Sbjct: 227  MNFWMSAFRLPSACLKEINSICSAFLWSGPELHRRKAKVSWDDICKPKQEGGLGLRSLTE 286

Query: 1419 --------YAWNKALLSKNLW------NFHLKTDSLWV--------KWVHAFYLK----- 1321
                      W       +LW      N  LK +S W          W+    LK     
Sbjct: 287  ANVVSVLKLIWRVTSNDDSLWVKWSKMNL-LKQESFWSLTPNSSLGSWMWKKMLKYRETA 345

Query: 1320 ------------RQSIW--------------------------------DWNPKKDDSTL 1273
                        R S W                                 W+ ++     
Sbjct: 346  KPFSRVEVNNGARTSFWFDNWSGMGHLMDVTGQRGQIDLGISRNKTVAEAWSNRRRRKHR 405

Query: 1272 LKRINDVKNELLCKFGNQNAVIANLLAFSNHKGLISSKI-----YDIFRDHGEKNFWKAA 1108
             +++ND++  L  K+  +N +  +   +     +  +       ++  R    +  W   
Sbjct: 406  TEQLNDIEAALNQKYQTRNLLREDATLWRGKGDVFKTSFSTKDTWNQVRKKSNEVAWYKG 465

Query: 1107 VWKSFIPPKYSFCAWMAFNDRLAT--INNLTYTDINPMCKLCSQQLESAPHLFFTCPITN 934
            VW S   PKY FC W+A  +RL+T     L     +  C  CS  +E+  HLFF+C   +
Sbjct: 466  VWFSHSTPKYQFCTWLALRNRLSTGYRMQLWNNGSDVKCTFCSTSIETRDHLFFSCSYAS 525

Query: 933  LLWNRIKAWLKIHRSMSTLASAIKWIRKDKADPILKKARAVAFCCSIYHIWKARN 769
             +W  I   +  HR  +   + + +I + + D I        F  +++ +WK RN
Sbjct: 526  AIWTAIAKNVLQHRFSTDWQTIVNYISETQTDRIRSFLSRYIFQLTVHTVWKERN 580


>gb|AAF99785.1|AC012463_2 T2E6.4 [Arabidopsis thaliana]
          Length = 740

 Score =  237 bits (605), Expect = 1e-59
 Identities = 115/281 (40%), Positives = 169/281 (60%), Gaps = 5/281 (1%)
 Frame = -3

Query: 2118 INGSLKGKFPGERGLRQGDPMSPGLFILCMEYLSRLINIKTSHSNFKFHPRCGELKISHL 1939
            +NG L G F  +RGLRQG  +SP LF++CM  LS +I++   H N  +HP+C +L ++HL
Sbjct: 215  VNGELAGFFGSKRGLRQGCALSPYLFVICMNVLSHMIDVAAVHRNIGYHPKCKKLSLTHL 274

Query: 1938 IFADDLMLFAKGDPPSVKILMDCLSEFKKVSGLDINSAKSNVFTAGVFGPDLDALLNLLN 1759
             FADDLM+F  G   SV+ +++   EF   SGL I+  KS ++ AGV   + + +L+   
Sbjct: 275  CFADDLMVFIDGQQRSVEGVINIFKEFAGKSGLHISLEKSTLYLAGVSELNRNNILSAFP 334

Query: 1758 FPSGSLPVRYLGVPLAAQKLNSVHYAPLYDRIAAYINKWTANSLTYAGRLLLIKSVLQGV 1579
            F SG LPVRYLG+PL  +++ +  Y+PL D++ + I+ WTA SL+YAGRL LI SV+  +
Sbjct: 335  FASGQLPVRYLGLPLLTKQMTTADYSPLLDKVRSKISSWTARSLSYAGRLALINSVIVSL 394

Query: 1578 ECFWLQIFPLPKSVVKRIYMLCRTFLW-----GKKRPPISWHKICMPSDEGGLGIRNVYA 1414
              FW+  + LP   +K I  LC  FLW       K+  I+W  +C    EGGLGI+++  
Sbjct: 395  SNFWMSAYRLPAGCIKEIEKLCSAFLWSGPELNPKKAKITWTSLCKLKQEGGLGIKSLLE 454

Query: 1413 WNKALLSKNLWNFHLKTDSLWVKWVHAFYLKRQSIWDWNPK 1291
             NK    K +W    +  SLWV WV  + +++ S W  N +
Sbjct: 455  ANKVSCLKLIWRLVSRQSSLWVNWVWTYIIRKGSFWSANDR 495


>gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1216

 Score =  236 bits (602), Expect = 3e-59
 Identities = 155/536 (28%), Positives = 249/536 (46%), Gaps = 85/536 (15%)
 Frame = -3

Query: 2118 INGSLKGKFPGERGLRQGDPMSPGLFILCMEYLSRLINIKTSHSNFKFHPRCGELKISHL 1939
            +NG L G F   RGLRQG  +SP LF++ M+ LSR+++       F +HPRC  L ++HL
Sbjct: 363  VNGELAGYFRSARGLRQGCSLSPYLFVISMDVLSRMLDKAAGAREFGYHPRCKTLGLTHL 422

Query: 1938 IFADDLMLFAKGDPPSVKILMDCLSEFKKVSGLDINSAKSNVFTAGVFGPDLDALLNLLN 1759
             FADDLM+   G   SV  ++  L++F    GL I   K+ ++ AGV       + +  +
Sbjct: 423  CFADDLMILTDGKIRSVDGIVKVLNQFAAKLGLKICMEKTTLYLAGVSDHSRQLMSSRYS 482

Query: 1758 FPSGSLPVRYLGVPLAAQKLNSVHYAPLYDRIAAYINKWTANSLTYAGRLLLIKSVLQGV 1579
            F  G LPVRYLG+PL  ++L +  Y+PL D+I   I  WT+  L++AGRL LI SVL  +
Sbjct: 483  FGVGKLPVRYLGLPLVTKRLTTSDYSPLIDQIRRRIGMWTSRYLSFAGRLSLINSVLWSI 542

Query: 1578 ECFWLQIFPLPKSVVKRIYMLCRTFLWG-----KKRPPISWHKICMPSDEGGLGIRNVYA 1414
              FW+  F LP+  +  I  +    LW       K+  +SW +IC P  EGGLG++++  
Sbjct: 543  TNFWMNAFRLPRECINEINRISSALLWSGPELNPKKAKVSWDEICKPKKEGGLGLQSLRE 602

Query: 1413 WNKA--------LLS--KNLW------NFHLKTDSLWV--------KWVHAFYLKRQSI- 1309
             NK         LLS   +LW      N  LK +S W          W+    LK + + 
Sbjct: 603  ANKVSSLKLIWRLLSCQDSLWVKWTRMNL-LKKESFWSIGTHSTLGSWIWRRLLKHREVA 661

Query: 1308 --------------------WD----------------------------WNPKKDDSTL 1273
                                W                             W+ ++     
Sbjct: 662  KSFCKIEVNNGVNTSFWFDNWSEKGPLINLTGARGAIDMGISRHMTLAEAWSRRRRKRHR 721

Query: 1272 LKRINDVKNELLCKFGNQNAVIANLLAFSNHKGLISSKI-----YDIFRDHGEKNFWKAA 1108
            ++ +N+ +  LL K+ ++N  + + + +   + +  ++      ++  R    +  W   
Sbjct: 722  VEILNEFEEILLQKYQHRNIELEDAILWRGKEDVFKARFSTKDTWNHIRTSSNQRAWHKG 781

Query: 1107 VWKSFIPPKYSFCAWMAFNDRLATINNL-TYTDINPM-CKLCSQQLESAPHLFFTCPITN 934
            VW +   PK+SFCAW+A  +RL+T + + T+ +  P  C  CS  +E+  HLFF C  ++
Sbjct: 782  VWFAHATPKFSFCAWLAIRNRLSTGDRMMTWNNGTPTTCVFCSSPMETRDHLFFQCCYSS 841

Query: 933  LLWNRIKAWLKIHRSMSTLASAIKWIRKDKADPILKKARAVAFCCSIYHIWKARNA 766
             +W  I   +   R  +  ++ + +I   + D I        F  SI+ IW+ RN+
Sbjct: 842  EIWTSIAKNVYKDRFSTKWSAVVNYISDSQPDRIQSFLSRYTFQVSIHSIWRERNS 897


>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score =  235 bits (600), Expect = 5e-59
 Identities = 125/309 (40%), Positives = 173/309 (55%), Gaps = 10/309 (3%)
 Frame = -3

Query: 2118 INGSLKGKFPGERGLRQGDPMSPGLFILCMEYLSRLINIKTSHSNFKFHPRCGELKISHL 1939
            ING   G F   +GLRQGDP+SP LF+L ME  S L++ +       +HP+   L ISHL
Sbjct: 636  INGGNGGFFKSTKGLRQGDPLSPYLFVLAMEAFSNLLHSRYESGLIHYHPKASNLSISHL 695

Query: 1938 IFADDLMLFAKGDPPSVKILMDCLSEFKKVSGLDINSAKSNVFTAGVFGPDLDALLN-LL 1762
            +FADD+M+F  G   S+  + + L +F   SGL +N  KS+++ AG+    L++  N   
Sbjct: 696  MFADDVMIFFDGGSFSLHGICETLDDFASWSGLKVNKDKSHLYLAGL--NQLESNANAAY 753

Query: 1761 NFPSGSLPVRYLGVPLAAQKLNSVHYAPLYDRIAAYINKWTANSLTYAGRLLLIKSVLQG 1582
             FP G+LP+RYLG+PL  +KL    Y PL ++I A    W    L++AGR+ LI SV+ G
Sbjct: 754  GFPIGTLPIRYLGLPLMNRKLRIAEYEPLLEKITARFRSWVNKCLSFAGRIQLISSVIFG 813

Query: 1581 VECFWLQIFPLPKSVVKRIYMLCRTFLWG-----KKRPPISWHKICMPSDEGGLGIRNVY 1417
               FW+  F LPK  +KRI  LC  FLW       K   +SW  +C+P  EGGLG+R + 
Sbjct: 814  SINFWMSTFLLPKGCIKRIESLCSRFLWSGNIEQAKGIKVSWAALCLPKSEGGLGLRRLL 873

Query: 1416 AWNKALLSKNLWNFHLKTDSLWVKWVHAFYLKRQSIWDWNPKKDDSTLLKRINDVK---- 1249
             WNK L  + +W   +  DSLW  W H  +L R S W     + DS   KR+  ++    
Sbjct: 874  EWNKTLSMRLIWRLFVAKDSLWADWQHLHHLSRGSFWAVEGGQSDSWTWKRLLSLRPLAH 933

Query: 1248 NELLCKFGN 1222
              L+CK GN
Sbjct: 934  QFLVCKVGN 942



 Score = 63.9 bits (154), Expect = 3e-07
 Identities = 41/170 (24%), Positives = 80/170 (47%), Gaps = 5/170 (2%)
 Frame = -3

Query: 1179 KGLISSKIYDIFRDHGEKNFWKAAVWKSFIPPKYSFCAWMAFNDRLATINNL-TYTDI-N 1006
            +G  ++K ++  R       W +++W     PKY+F  W++  +RL T   L ++  I +
Sbjct: 1031 QGFSAAKTWEAIRPKATVKSWASSIWFKGAVPKYAFNMWVSHLNRLLTRQRLASWGHIQS 1090

Query: 1005 PMCKLCSQQLESAPHLFFTCPITNLLWNRI-KAWLKIHRSMSTLASAIKWIRKD--KADP 835
              C LCS   ES  HL   C  +  +W  + +      R  S+ +  + W+R+   +A P
Sbjct: 1091 DACVLCSFASESRDHLLLICEFSAQVWRLVFRRICPRQRLFSSWSELLSWVRQSSPEAPP 1150

Query: 834  ILKKARAVAFCCSIYHIWKARNAHVFDGDPFSYEAVFKKIQFHVYQVIYS 685
            +L+K   +     +Y++W+ RN  + +    +   +FK +   +  +I S
Sbjct: 1151 LLRK---IVSQVVVYNLWRQRNNLLHNSLRLAPAVIFKLVDREIRNIISS 1197


Top