BLASTX nr result

ID: Rehmannia22_contig00005714 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00005714
         (900 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670...   211   4e-52
ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664...   200   5e-49
ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665...   198   3e-48
emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...   183   7e-44
gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana]             178   3e-42
emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...   168   3e-39
ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663...   163   7e-38
ref|XP_006590026.1| PREDICTED: uncharacterized protein LOC102660...   163   9e-38
ref|XP_006584390.1| PREDICTED: putative ribonuclease H protein A...   162   2e-37
ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661...   155   2e-35
ref|XP_006584200.1| PREDICTED: putative ribonuclease H protein A...   148   3e-33
gb|AAC95175.1| putative non-LTR retroelement reverse transcripta...   143   8e-32
gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thali...   142   2e-31
ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268...   132   2e-28
ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298...   131   3e-28
ref|XP_006586426.1| PREDICTED: putative ribonuclease H protein A...   129   1e-27
emb|CAB45965.1| putative reverse transcriptase [Arabidopsis thal...   127   6e-27
ref|XP_004173733.1| PREDICTED: uncharacterized protein LOC101232...   126   1e-26
gb|AAC63678.1| putative non-LTR retroelement reverse transcripta...   125   3e-26
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       122   2e-25

>ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670237 [Glycine max]
          Length = 383

 Score =  211 bits (536), Expect = 4e-52
 Identities = 106/273 (38%), Positives = 158/273 (57%), Gaps = 12/273 (4%)
 Frame = -3

Query: 865 SYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCRVFLWGKNTSP 686
           S  ++W+  SLSYAG++ LI++V+QG+  FW+ IFPLP +V+D I   CR FLWGK    
Sbjct: 106 SISSRWSRKSLSYAGKVELIRAVIQGIANFWMSIFPLPQSVLDTIIATCRNFLWGKADGG 165

Query: 685 -----IKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVRWIHSFYLKNQ 521
                + WS+VC P  EGGLGL ++  WN ALL+ ILW++HSK DSLWVR +H +Y K  
Sbjct: 166 KIKPLVAWSEVCTPKKEGGLGLFNLKDWNIALLSCILWDLHSKKDSLWVRLVHHYYFKGG 225

Query: 520 SIWTWDPKKSDSCLLKRICDIRNEILAKFGSPDLAISNLSLFANSKGLCSSKMYDLFREA 341
           ++W +    SDS  +     IR+ I++K  + ++A   L+ +  ++   + KMYD  R  
Sbjct: 226 NVWDFISSSSDSVFI----HIRDIIISKEENIEVAKLMLNSWGCNEQTLAGKMYDYIRGT 281

Query: 340 GPKTFWHSAVWKNFIPPKYSFCVWLAFNDRLATINNLKYLDVDPTCKLCGNYLENASHLF 161
            P   W S +W   IP K SF +WLA  +RL  ++   +L+    C LC N  E+ +HLF
Sbjct: 282 RPVVHWSSIIWNPVIPSKMSFILWLATKNRLLALDRAAFLNKGFLCPLCTNEAESHAHLF 341

Query: 160 FDCIVTRLLWDRVKKW-------LKFSHSMSTI 83
           F C  +  +W  ++ W       +   HS+S +
Sbjct: 342 FSCRTSLRVWAHIRDWIPLKRQSISLQHSISAL 374


>ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664824 [Glycine max]
          Length = 939

 Score =  200 bits (509), Expect = 5e-49
 Identities = 108/288 (37%), Positives = 161/288 (55%), Gaps = 7/288 (2%)
 Frame = -3

Query: 895  HYAPLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCR 716
            HY  L D+I   I  W+A  LSYAGR+ LI+SV+     FW+Q  PLP  VI RIN +CR
Sbjct: 598  HYQVLIDKIVGRITHWSAGLLSYAGRVQLIQSVIFATINFWMQCLPLPKFVIMRINAICR 657

Query: 715  VFLWGKNT-----SPIKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVR 551
             FLW  N+     SPI W KVC P   GGL + ++  WNK  + K+LWN+ +K+D+LW++
Sbjct: 658  SFLWIGNSNISRKSPIAWEKVCSPKINGGLNIINLAIWNKISILKLLWNVCNKSDNLWIK 717

Query: 550  WIHSFYLKNQSIWTWDPKKSDSCLLKRICDIRNEILAKFGSPDLAISNLSLFANSKGLCS 371
            W+H++Y++ QSIW+   KKS S ++  +  +R  +L ++ S    +  +           
Sbjct: 718  WLHTYYIRGQSIWSMVLKKSHSWIMSSMMKLR-PLLLQYQSRMQDVFKM----------- 765

Query: 370  SKMYDLFREAGPKTFWHSAVWKNFIPPKYSFCVWLAFNDRLATINNL-KY-LDVDPTCKL 197
             K+Y    E   K  W + +  N   P+  FC+W A + RLA+ + L K+ L+VD  C  
Sbjct: 766  KKIYLALFEESEKMSWRTLMCNNLARPRALFCLWQACHFRLASKDRLIKFGLNVDANCAF 825

Query: 196  CGNYLENASHLFFDCIVTRLLWDRVKKWLKFSHSMSTIAGALKWTKKE 53
            C + +E+  HLFF CI  + +W  V  WL+  H  ST +  L W  ++
Sbjct: 826  CSS-MESHEHLFFGCIELKTIWTAVLNWLQIIHMPSTWSEELNWITRK 872


>ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665746 [Glycine max]
          Length = 506

 Score =  198 bits (503), Expect = 3e-48
 Identities = 102/284 (35%), Positives = 156/284 (54%), Gaps = 7/284 (2%)
 Frame = -3

Query: 895 HYAPLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCR 716
           HY+PL D+I   I  WTA  LSYAGRL L+ SV+  +  +WL  FP P +V+ +I  +CR
Sbjct: 156 HYSPLIDKIVGKIKHWTARLLSYAGRLQLVNSVMFALTNYWLNCFPFPKSVLQKIEAICR 215

Query: 715 VFLW-----GKNTSPIKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVR 551
           +FLW     G   SP+ W ++C P   GGL + D+  WNKA L K+LWN+ SK DSLWV+
Sbjct: 216 IFLWTGGFEGSRKSPVAWKQICSPRSCGGLNIIDIDIWNKANLMKLLWNLSSKEDSLWVK 275

Query: 550 WIHSFYLKNQSIWTWDPKKSDSCLLKRICDIRNEILAKFGSPDLAISNLSLFANSKGLCS 371
           WI ++Y+K   +   + K +DS ++K I   R ++          I N+        +  
Sbjct: 276 WIQAYYVKRSELMHIEMKNTDSWIMKAILKQREDL--------EKIDNMEELMIRGSINM 327

Query: 370 SKMYDLFREAGPKTFWHSAVWKNFIPPKYSFCVWLAFNDRLATINNL-KY-LDVDPTCKL 197
            K+Y   ++ G +  W + ++ N   P+ +F +WLA + RL+T + L KY +  D +C  
Sbjct: 328 GKLYRKLQDCGQRKEWKNLLYGNTARPRANFILWLACHGRLSTKDRLCKYGMIDDKSCCF 387

Query: 196 CGNYLENASHLFFDCIVTRLLWDRVKKWLKFSHSMSTIAGALKW 65
           C    E+ +HLFF C  ++ +W  V +W++  H  S     L W
Sbjct: 388 CSEE-ESMNHLFFVCDNSKRVWMEVLQWVQIRHDPSDWPNELHW 430


>emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1114

 Score =  183 bits (465), Expect = 7e-44
 Identities = 99/266 (37%), Positives = 147/266 (55%), Gaps = 7/266 (2%)
 Frame = -3

Query: 886  PLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCRVFL 707
            PL D+I +    W A+ LSYAGRL L+K++L  ++ +W QIFPLP  +I  +   CR FL
Sbjct: 776  PLIDKITTRAQGWVAHLLSYAGRLQLVKTILYSMQNYWGQIFPLPKKLIKAVETTCRKFL 835

Query: 706  WGKNT-----SPIKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVRWIH 542
            W         +P+ W  +  P   GGL + ++  WNKA + K+LW I  K D LWVRW++
Sbjct: 836  WTGTVDTSYKAPVAWDFLQQPKSTGGLNVTNMVLWNKAAILKLLWAITFKQDKLWVRWVN 895

Query: 541  SFYLKNQSIWTWDPKKSDSCLLKRICDIRNEILAKFGSPDLAISNLSLFANSKGLCSSKM 362
            ++Y+K Q+I       + S +L++I + R E+L + G  + A+SN   F+        K 
Sbjct: 896  AYYIKRQNIENVTVSSNTSWILRKIFESR-ELLTRTGGWE-AVSNHMNFS------IKKT 947

Query: 361  YDLFREAGPKTFWHSAVWKNFIPPKYSFCVWLAFNDRLATINNLK--YLDVDPTCKLCGN 188
            Y L +E      W   +  N   PK  F +WLA  +RLAT   +     DV P CK+CGN
Sbjct: 948  YKLLQEDYENVVWKRLICNNKATPKSQFILWLAMLNRLATAERVSRWNRDVSPLCKMCGN 1007

Query: 187  YLENASHLFFDCIVTRLLWDRVKKWL 110
             +E   HLFF+CI ++ +W +V  +L
Sbjct: 1008 EIETIQHLFFNCIYSKEIWGKVLLYL 1033


>gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana]
          Length = 653

 Score =  178 bits (451), Expect = 3e-42
 Identities = 103/287 (35%), Positives = 144/287 (50%), Gaps = 10/287 (3%)
 Frame = -3

Query: 892  YAPLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCRV 713
            Y+PL + I   I  WT   LSYAGRL LI SVL  +  FWL  F LP   I  I+++C  
Sbjct: 313  YSPLLEHIKKKIGTWTTRYLSYAGRLNLITSVLWSICNFWLAAFRLPRECIREIDKICSA 372

Query: 712  FLW-GKNTSPIK----WSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVRW 548
            FLW G + +P K    W  VC P  EGGLGLR +   N+    K++W I S  +SLWVRW
Sbjct: 373  FLWSGPDLNPRKTRVCWGDVCKPKQEGGLGLRSLKEMNEVSCLKLIWRIVSHTNSLWVRW 432

Query: 547  IHSFYLKNQSIWTWDPKKSDSCLLKRICDIRNEILAKFGSPDLAISNLSLFANSKGLCSS 368
            I  + LK+ + W+     +   +L R     +E + KF + D                  
Sbjct: 433  IEQYLLKHDTFWSVQTTTNMDSVLWR--GRNDEYMPKFSTRD------------------ 472

Query: 367  KMYDLFREAGPKTFWHSAVWKNFIPPKYSFCVWLAFNDRLATINNLKYLD--VDPTCKLC 194
              ++  R       WH  +W     PK+SFC WLA  +RL+T + +   +  + PTC LC
Sbjct: 473  -TWNQTRNTSTPVTWHMGIWFAHATPKFSFCAWLAVQNRLSTGDKMLQWNRRLSPTCVLC 531

Query: 193  GNYLENASHLFFDCIVTRLLWDRVKKWL---KFSHSMSTIAGALKWT 62
             N +E  +HLFF C  T  +W+ + K +   KFS + STI  ++  T
Sbjct: 532  NNNIETRNHLFFSCCYTAEIWENLAKNIYKAKFSTNWSTILTSVSTT 578


>emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1110

 Score =  168 bits (425), Expect = 3e-39
 Identities = 91/271 (33%), Positives = 138/271 (50%), Gaps = 7/271 (2%)
 Frame = -3

Query: 886  PLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCRVFL 707
            PL + I +    W A  LSYAGRL LIKS+L  ++ +W  IFPL   VI  + ++CR FL
Sbjct: 773  PLVEMITNRAQTWMAKLLSYAGRLQLIKSILSSMQNYWAHIFPLSKKVIQAVEKVCRKFL 832

Query: 706  WGKNT-----SPIKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVRWIH 542
            W   T     +P+ W+ +  P   GG  + ++  WN+A + K+LW I  K D LWVRWIH
Sbjct: 833  WTGKTEETKKAPVAWATIQRPKSRGGWNVINMKYWNRAAMLKLLWAIEFKRDKLWVRWIH 892

Query: 541  SFYLKNQSIWTWDPKKSDSCLLKRICDIRNEILAKFGSPDLAISNLSLFANSKGLCSSKM 362
            S+Y+K Q I T +     + +L++I   R+  L+  G  D       +    K     K 
Sbjct: 893  SYYIKRQDILTVNISNQTTWILRKIVKARDH-LSNIGDWD------EICIGDK-FSMKKA 944

Query: 361  YDLFREAGPKTFWHSAVWKNFIPPKYSFCVWLAFNDRLATINNLKYLDV--DPTCKLCGN 188
            Y    E G +  W   +  N+  PK  F +W+  ++RL T++ +    V  D   +LC N
Sbjct: 945  YKKISENGERVRWRRLICNNYATPKSKFILWMMLHERLPTVDRISRWGVQCDLNYRLCRN 1004

Query: 187  YLENASHLFFDCIVTRLLWDRVKKWLKFSHS 95
              E   HLFF C  +  +W ++   ++F +S
Sbjct: 1005 DGETIQHLFFSCSYSAGVWSKICYIMRFPNS 1035


>ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663533 [Glycine max]
          Length = 514

 Score =  163 bits (413), Expect = 7e-38
 Identities = 86/267 (32%), Positives = 139/267 (52%), Gaps = 11/267 (4%)
 Frame = -3

Query: 898  NHYAPLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLC 719
            +HY PL ++I   I  W++  LS AGR+ L++S++  +  +W+ +FP+P  VI +I+ +C
Sbjct: 258  HHYLPLVEKIVGKIRHWSSKLLSIAGRIQLVRSIITAIAQYWMSVFPMPKKVIQKIDSIC 317

Query: 718  RVFLWG-----KNTSPIKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWV 554
            R F+W      K  S + W +VC P   GGL L ++  WN   + K LWNI SK D+LWV
Sbjct: 318  RSFIWSGSAEVKRKSLVAWKQVCKPARCGGLNLINLELWNVTAMLKCLWNICSKEDNLWV 377

Query: 553  RWIHSFYLKNQSIWTWDPKKSDSCLLKRICDIRNEILAKFGSPDLAISNLSL----FANS 386
            +WIH+++LK  ++ +   K + + +LK +   R +           ++NL L        
Sbjct: 378  KWIHAYFLKGDNVMSATIKSNSTWILKSVMKQRPQ-----------VNNLQLVWIEMLRK 426

Query: 385  KGLCSSKMYDLFREAGPKTFWHSAVWKNFIPPKYSFCVWLAFNDRLATINNLKYLDV--D 212
            +     ++Y    E   K  W   +  N   P+ +  +WLA  +RLAT   LK +++   
Sbjct: 427  RKFSMKQVYMELVEDHNKIDWFRLLRYNRARPRANVTLWLACQNRLATKTRLKNMNMIQC 486

Query: 211  PTCKLCGNYLENASHLFFDCIVTRLLW 131
              C LC    E+  HL F C VT+ +W
Sbjct: 487  SLCSLCKEQDEDLDHLMFSCRVTKAIW 513


>ref|XP_006590026.1| PREDICTED: uncharacterized protein LOC102660482 [Glycine max]
          Length = 303

 Score =  163 bits (412), Expect = 9e-38
 Identities = 83/188 (44%), Positives = 114/188 (60%), Gaps = 5/188 (2%)
 Frame = -3

Query: 895 HYAPLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCR 716
           HYAPL  +I   I  W+  SLSYAG+L LI++V+QG+  FW+ IFPLP +V+DRIN  CR
Sbjct: 108 HYAPLLSKITGLIQGWSRKSLSYAGKLELIRAVIQGIVNFWIGIFPLPQSVLDRINASCR 167

Query: 715 VFLW-----GKNTSPIKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVR 551
            FLW     GK    + WS VC P  EGGLGL ++  WN ALL+ ILW+ H K DSLWV 
Sbjct: 168 NFLWGKADIGKKKPLVAWSVVCSPKREGGLGLFNLKDWNLALLSCILWDFHCKKDSLWV- 226

Query: 550 WIHSFYLKNQSIWTWDPKKSDSCLLKRICDIRNEILAKFGSPDLAISNLSLFANSKGLCS 371
             H +Y +   +W ++   S S L+K+I  IR+ I++K  S + A   +  +  +  L  
Sbjct: 227 --HHYYFRRSDVWNYNTSSSYSVLIKKIIQIRDFIISKELSTEEAKKRIQSWRTNGQLLV 284

Query: 370 SKMYDLFR 347
            K+Y+  R
Sbjct: 285 GKVYEYIR 292


>ref|XP_006584390.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Glycine
           max]
          Length = 316

 Score =  162 bits (409), Expect = 2e-37
 Identities = 77/158 (48%), Positives = 102/158 (64%), Gaps = 5/158 (3%)
 Frame = -3

Query: 895 HYAPLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCR 716
           HYAPL  +I   I  W   SLSY G+L LIK+V+QG+  FW++IFPLP +V+DRIN  C 
Sbjct: 141 HYAPLLYKIVGLIQGWNKKSLSYVGKLELIKAVIQGIMNFWMRIFPLPQSVLDRINASCC 200

Query: 715 VFLW-----GKNTSPIKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVR 551
            FLW     GKN   + W  VC P  EGGLGL ++  WN ALL+ ILW+ H K DSL VR
Sbjct: 201 NFLWSKADIGKNKPLVAWPVVCSPKQEGGLGLFNLKDWNLALLSHILWDFHCKKDSLRVR 260

Query: 550 WIHSFYLKNQSIWTWDPKKSDSCLLKRICDIRNEILAK 437
           W+H +Y +    W ++   S+S L+K+I  IR+ I++K
Sbjct: 261 WVHHYYFRRSDEWNYNISSSNSVLIKKIIQIRDFIISK 298


>ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661523 [Glycine max]
          Length = 947

 Score =  155 bits (392), Expect = 2e-35
 Identities = 86/273 (31%), Positives = 132/273 (48%), Gaps = 7/273 (2%)
 Frame = -3

Query: 895  HYAPLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCR 716
            +Y PL D+I + I  WT+  L+  GR+ ++   +  +  FW+Q  P+P +VI +I+ +CR
Sbjct: 598  YYLPLIDKITTRIRHWTSKLLNMTGRVQMVNCTITAIVQFWMQCLPIPMSVIKKIDSMCR 657

Query: 715  VFLWGKNT-----SPIKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVR 551
             F+W ++T     SPI W+ VC P  +GGL + ++  WN   +   LWN+  K D+LWV+
Sbjct: 658  SFVWSRSTEITRKSPIAWNSVCRPKGQGGLNIFNLKVWNHITVLNCLWNLCKKVDNLWVK 717

Query: 550  WIHSFYLKNQSIWTWDPKKSDSCLLKRICDIRNEILAKFGSPDLAISNLSLFANSKGLCS 371
            WIH+ Y+KN S+       + S +LK +   R  I       D  +       NS+    
Sbjct: 718  WIHAHYIKNSSVMNTMVTNNFSWVLKNVLSQREYIHTLQPVWDELL-------NSERFKM 770

Query: 370  SKMYDLFREAGPKTFWHSAVWKNFIPPKYSFCVWLAFNDRLATINNLKYLDV--DPTCKL 197
             K YD   EA  +  W   + KN   P+     WLA + RL T + L    +  D    L
Sbjct: 771  KKAYDKMMEA-DRVHWSGLMRKNCARPRAIHTTWLACHGRLGTKDRLVRFGMITDKIWSL 829

Query: 196  CGNYLENASHLFFDCIVTRLLWDRVKKWLKFSH 98
            C    E  +H+ F C V   +W  V   +   H
Sbjct: 830  CKEVEETQNHILFSCKVATDIWSNVLNRIGIDH 862


>ref|XP_006584200.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Glycine
           max]
          Length = 239

 Score =  148 bits (373), Expect = 3e-33
 Identities = 68/132 (51%), Positives = 88/132 (66%), Gaps = 5/132 (3%)
 Frame = -3

Query: 895 HYAPLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCR 716
           HYAPL  +I   I  W+  SLSYAG+L LI++V+QG+  FW++IFPL  +V+DRIN  C 
Sbjct: 108 HYAPLLSKITGLIQGWSRKSLSYAGKLELIRAVIQGIVNFWMKIFPLSQSVLDRINASCC 167

Query: 715 VFLW-----GKNTSPIKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVR 551
            FLW     GKN S I WS VC P  EGGLGL ++  WN  LL++ILW+ H K D LWVR
Sbjct: 168 NFLWGKADIGKNKSLIAWSVVCSPKKEGGLGLFNLKDWNLTLLSRILWDFHCKKDFLWVR 227

Query: 550 WIHSFYLKNQSI 515
           W+H +Y +   +
Sbjct: 228 WVHHYYFRASDV 239


>gb|AAC95175.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1352

 Score =  143 bits (361), Expect = 8e-32
 Identities = 99/344 (28%), Positives = 140/344 (40%), Gaps = 83/344 (24%)
 Frame = -3

Query: 892  YAPLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCRV 713
            Y PL ++I + I  WT   LS+AGRL LIKSVL  +  FWL +F LP   +  I ++   
Sbjct: 933  YLPLVEKIRARITSWTNRFLSFAGRLQLIKSVLSSITNFWLSVFRLPKACLQEIEKMFSA 992

Query: 712  FLWGK---NTSPIK--WSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVRW 548
            FLW     NT   K  WS+VC   +EGGLGL+ +   N+  L K++W I S  DSLWV+W
Sbjct: 993  FLWSGPDLNTKKAKIAWSEVCKLKEEGGLGLKPLKEANEVSLLKLIWRILSARDSLWVKW 1052

Query: 547  IHSFYLKNQSI-----------WTWDP--KKSDSCLLKRICDIRNEILAKF--------- 434
            ++   ++ ++            W W    K+ D   L    ++R+     F         
Sbjct: 1053 VNKHLIRKETFWSVKENTGLGSWLWRKILKQRDKARLFHRMEVRSGTFTSFWHDHWCPLG 1112

Query: 433  ---------GSPDLAISNLSLFAN------------------------------------ 389
                     G+ DL I N +  A                                     
Sbjct: 1113 RLHQHMGSRGTIDLGIPNNATVAEVMNTHRRKRHRADFLNQIKSQIELARQDRSTDGDRS 1172

Query: 388  ---------SKGLCSSKMYDLFREAGPKTFWHSAVWKNFIPPKYSFCVWLAFNDRLATIN 236
                          SSK +   R    +  W+  VW +   PKYSF  WLAF++RL T +
Sbjct: 1173 LWKQKEDTFKSSFSSSKTWQQIRSISLRCDWYRGVWFSASTPKYSFVTWLAFHNRLTTSD 1232

Query: 235  NLKYLDVDP--TCKLCGNYLENASHLFFDCIVTRLLWDRVKKWL 110
             +   +      C  CG  LE   HLFF C  +  +W  + K L
Sbjct: 1233 KICKWNSGARYDCVFCGEELETRDHLFFSCPYSSHVWFSLTKGL 1276


>gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thaliana]
            gi|20197043|gb|AAM14892.1| putative reverse transcriptase
            [Arabidopsis thaliana]
          Length = 1412

 Score =  142 bits (358), Expect = 2e-31
 Identities = 93/275 (33%), Positives = 129/275 (46%), Gaps = 16/275 (5%)
 Frame = -3

Query: 886  PLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCRVFL 707
            PL ++I S I+ W    LSYAGRL L+ SV+  +  FW+  F LP   I  I ++   FL
Sbjct: 1043 PLLEKIRSRISSWKNRFLSYAGRLQLLNSVISSLTKFWISAFRLPRACIREIEQISAAFL 1102

Query: 706  W-GKNTSP----IKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVRWIH 542
            W G + +P    + W  VC P  EGGLGLR +   NK    K++W + S   SLWV WI 
Sbjct: 1103 WSGTDLNPHKAKVAWHDVCKPKSEGGLGLRSLVDANKICCFKLIWRLVSAKHSLWVNWI- 1161

Query: 541  SFYLKNQSIWTWDPKKSDSCLLKRICDIRNEI---LAKFGSPDLAISNLSLFANSKG--- 380
                +N  I T     S         DI N+I   L K     +          S G   
Sbjct: 1162 ----QNNLIRTVAEALSSHRRRSHRDDILNDIEEELEKLLCRGICTEQDRSLCRSIGGQF 1217

Query: 379  ---LCSSKMYDLFREAGPKTFWHSAVWKNFIPPKYSFCVWLAFNDRLATINNLKYLD--V 215
                 S +++   RE G    WH A+W +   PK++F  WLA +DRL T + +   +  +
Sbjct: 1218 KAKFFSPEIWHQIREQGLVKQWHKAIWFSGATPKFTFISWLAAHDRLTTGDKMASWNRGI 1277

Query: 214  DPTCKLCGNYLENASHLFFDCIVTRLLWDRVKKWL 110
               C LC    E+  HLFF C  +  +WDR+ + L
Sbjct: 1278 SSVCVLCNISAESRDHLFFSCNFSSHIWDRLTRRL 1312


>ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268376 [Solanum
           lycopersicum]
          Length = 717

 Score =  132 bits (332), Expect = 2e-28
 Identities = 60/139 (43%), Positives = 89/139 (64%), Gaps = 5/139 (3%)
 Frame = -3

Query: 886 PLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCRVFL 707
           PL +++ + IN WTA  LSYAGR  L+K+VL GV+  W Q+F +P+ +I  I  LCR +L
Sbjct: 581 PLIEKVMARINSWTAKKLSYAGRAQLVKTVLFGVQALWAQLFIIPAKIIKLIEGLCRSYL 640

Query: 706 WG-----KNTSPIKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVRWIH 542
           W         + I W KVC P  EGGLGL ++  WN++ + K+ W++ +K D LW++WIH
Sbjct: 641 WSGVGYVTKKALIAWDKVCSPKYEGGLGLINLKIWNRSAVTKLCWDLANKEDKLWIKWIH 700

Query: 541 SFYLKNQSIWTWDPKKSDS 485
           ++Y+K Q  W    KKS++
Sbjct: 701 AYYIKGQREW----KKSNT 715


>ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298394 [Fragaria vesca
            subsp. vesca]
          Length = 958

 Score =  131 bits (330), Expect = 3e-28
 Identities = 85/265 (32%), Positives = 123/265 (46%), Gaps = 9/265 (3%)
 Frame = -3

Query: 889  APLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCRVF 710
            +PL DRI + I  W    LS+AGRL LI+SVL  ++ +W     LP  V+  I +  R F
Sbjct: 609  SPLLDRIETRIKSWENKVLSFAGRLQLIQSVLSSIQVYWASHLILPKKVLKDIEKRLRCF 668

Query: 709  LWGKNTS-----PIKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVRWI 545
            LW  N S      + WS++CLP  EGGLG++D+H WNKAL+   +WN+ S + + W  W+
Sbjct: 669  LWAGNCSGRAATKVAWSEICLPKCEGGLGIKDLHCWNKALMISHIWNLVSSSSNFWTDWV 728

Query: 544  HSFYLKNQSIWTWDPKKSDSCLLKRICDIRNEILAKFGSPDLAISNLSLFAN--SKGLCS 371
              + LK  S W        S   +++  IR E+   F            F N    G  +
Sbjct: 729  KVYLLKGNSFWNAPLPSICSWNWRKLLKIR-ELCCSF------------FVNIIGDGRAT 775

Query: 370  SKMYDLFREAGPKTF-WHSAVWKNFIPPKYSFCVWLAFNDRLATINNLK-YLDVDPTCKL 197
            S  +D +   GP T  W S +       K +      F    +  N L+    + P  +L
Sbjct: 776  SLWFDNWHPLGPLTLRWSSNIIGESGLSKSAMLTPNGFYSTSSAWNTLRPSRFIVPWYRL 835

Query: 196  CGNYLENASHLFFDCIVTRLLWDRV 122
                 E  +HLFFDC  +  +W  V
Sbjct: 836  VWFVAETHNHLFFDCAYSFGIWTHV 860


>ref|XP_006586426.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Glycine
           max]
          Length = 192

 Score =  129 bits (325), Expect = 1e-27
 Identities = 62/118 (52%), Positives = 79/118 (66%), Gaps = 5/118 (4%)
 Frame = -3

Query: 895 HYAPLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCR 716
           HYA L  +I   I  W+  SLSYAG+L LI++V+QG+  FW++IF LP +V+D IN  CR
Sbjct: 65  HYALLLSKITGLIQGWSKKSLSYAGKLELIRAVIQGIVNFWMEIFSLPQSVMDWINASCR 124

Query: 715 VFLW-----GKNTSPIKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLW 557
            FLW     GKN   + WS VC P  EGGLGL ++  WN ALL++ILW+ H K DSLW
Sbjct: 125 NFLWGKADIGKNKPLVAWSVVCSPKKEGGLGLLNLKDWNLALLSRILWDFHCKKDSLW 182


>emb|CAB45965.1| putative reverse transcriptase [Arabidopsis thaliana]
           gi|7267919|emb|CAB78261.1| putative reverse
           transcriptase [Arabidopsis thaliana]
          Length = 662

 Score =  127 bits (319), Expect = 6e-27
 Identities = 68/190 (35%), Positives = 97/190 (51%), Gaps = 5/190 (2%)
 Frame = -3

Query: 892 YAPLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCRV 713
           Y+PL ++I   I  WTA  LSYAGRL L+ SVL  +  FWL  F LP   +  I++LC  
Sbjct: 304 YSPLLEQIKRRIGTWTARFLSYAGRLNLVSSVLWSICNFWLSAFRLPRECVREIDKLCSA 363

Query: 712 FLWG-----KNTSPIKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVRW 548
           FLW       N + I W  VC P  EGGLGL+ +   N     K++W I S+ DSLWV+W
Sbjct: 364 FLWSGPELSTNKAKIAWETVCRPKREGGLGLQSIKEANDVCCLKLIWRIVSQGDSLWVQW 423

Query: 547 IHSFYLKNQSIWTWDPKKSDSCLLKRICDIRNEILAKFGSPDLAISNLSLFANSKGLCSS 368
           I ++ LK  + W++      S + K++   R+   A F   D+     + F         
Sbjct: 424 IRTYLLKRNTFWSFRSASQGSWMWKKLLKYRDTAKA-FSKVDIRNGETASFWYDDWSSKG 482

Query: 367 KMYDLFREAG 338
           ++ D+  E G
Sbjct: 483 RLIDVLGERG 492


>ref|XP_004173733.1| PREDICTED: uncharacterized protein LOC101232446, partial [Cucumis
           sativus]
          Length = 382

 Score =  126 bits (317), Expect = 1e-26
 Identities = 58/132 (43%), Positives = 84/132 (63%), Gaps = 5/132 (3%)
 Frame = -3

Query: 886 PLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCRVFL 707
           PL  RI S I  W+A  LS+AGRL L++SVL+ ++ +W  +F LP  V   ++++ R +L
Sbjct: 55  PLIQRITSRIRSWSARVLSFAGRLQLVRSVLRSLQVYWASVFMLPMKVHRDVDKILRSYL 114

Query: 706 W-----GKNTSPIKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVRWIH 542
           W     G+  + + W +VCLPFDEGGL +RD  SWN A   KILW +  K+ SLWV W+ 
Sbjct: 115 WRGKEEGRGGAKVAWDEVCLPFDEGGLAIRDGSSWNIASTLKILWLLLVKSGSLWVAWVE 174

Query: 541 SFYLKNQSIWTW 506
           ++ LK +S+  W
Sbjct: 175 AYILKGRSMLGW 186


>gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1216

 Score =  125 bits (313), Expect = 3e-26
 Identities = 87/343 (25%), Positives = 137/343 (39%), Gaps = 84/343 (24%)
 Frame = -3

Query: 892  YAPLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCRV 713
            Y+PL D+I   I  WT+  LS+AGRL LI SVL  +  FW+  F LP   I+ INR+   
Sbjct: 507  YSPLIDQIRRRIGMWTSRYLSFAGRLSLINSVLWSITNFWMNAFRLPRECINEINRISSA 566

Query: 712  FLW-GKNTSP----IKWSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHS--------- 575
             LW G   +P    + W ++C P  EGGLGL+ +   NK    K++W + S         
Sbjct: 567  LLWSGPELNPKKAKVSWDEICKPKKEGGLGLQSLREANKVSSLKLIWRLLSCQDSLWVKW 626

Query: 574  ------KADSLWV--------RWIHSFYLK-----------------NQSIW-------- 512
                  K +S W          WI    LK                 N S W        
Sbjct: 627  TRMNLLKKESFWSIGTHSTLGSWIWRRLLKHREVAKSFCKIEVNNGVNTSFWFDNWSEKG 686

Query: 511  ------------------------TWDPKKSDSCLLKRICDIRNEILAKFGSPDLAISNL 404
                                     W  ++     ++ + +    +L K+   ++ + + 
Sbjct: 687  PLINLTGARGAIDMGISRHMTLAEAWSRRRRKRHRVEILNEFEEILLQKYQHRNIELEDA 746

Query: 403  SLFANSKGLCSSKM-----YDLFREAGPKTFWHSAVWKNFIPPKYSFCVWLAFNDRLATI 239
             L+   + +  ++      ++  R +  +  WH  VW     PK+SFC WLA  +RL+T 
Sbjct: 747  ILWRGKEDVFKARFSTKDTWNHIRTSSNQRAWHKGVWFAHATPKFSFCAWLAIRNRLSTG 806

Query: 238  NNLKYLD--VDPTCKLCGNYLENASHLFFDCIVTRLLWDRVKK 116
            + +   +     TC  C + +E   HLFF C  +  +W  + K
Sbjct: 807  DRMMTWNNGTPTTCVFCSSPMETRDHLFFQCCYSSEIWTSIAK 849


>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score =  122 bits (306), Expect = 2e-25
 Identities = 61/151 (40%), Positives = 80/151 (52%), Gaps = 5/151 (3%)
 Frame = -3

Query: 892  YAPLYDRIASYINKWTANSLSYAGRLLLIKSVLQGVECFWLQIFPLPSTVIDRINRLCRV 713
            Y PL ++I +    W    LS+AGR+ LI SV+ G   FW+  F LP   I RI  LC  
Sbjct: 779  YEPLLEKITARFRSWVNKCLSFAGRIQLISSVIFGSINFWMSTFLLPKGCIKRIESLCSR 838

Query: 712  FLWGKNTSPIK-----WSKVCLPFDEGGLGLRDVHSWNKALLAKILWNIHSKADSLWVRW 548
            FLW  N    K     W+ +CLP  EGGLGLR +  WNK L  +++W +    DSLW  W
Sbjct: 839  FLWSGNIEQAKGIKVSWAALCLPKSEGGLGLRRLLEWNKTLSMRLIWRLFVAKDSLWADW 898

Query: 547  IHSFYLKNQSIWTWDPKKSDSCLLKRICDIR 455
             H  +L   S W  +  +SDS   KR+  +R
Sbjct: 899  QHLHHLSRGSFWAVEGGQSDSWTWKRLLSLR 929


Top