BLASTX nr result

ID: Atropa21_contig00038819 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00038819
         (908 letters)

Database: nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665...   237   5e-60
emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...   216   7e-54
ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664...   215   2e-53
emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...   213   1e-52
ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663...   192   2e-46
ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661...   175   2e-41
ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268...   144   3e-32
gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana]             129   2e-27
gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thali...   127   6e-27
ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670...   123   8e-26
gb|ABD33260.1| non-LTR retroelement reverse transcriptase-like p...   114   4e-23
gb|AAD39277.1|AC007203_9 Hypothetical protein [Arabidopsis thali...   114   5e-23
ref|XP_002331075.1| predicted protein [Populus trichocarpa]           112   1e-22
ref|XP_006584390.1| PREDICTED: putative ribonuclease H protein A...   110   9e-22
ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298...   109   1e-21
ref|XP_006598659.1| PREDICTED: uncharacterized protein LOC102659...   108   3e-21
ref|XP_004173733.1| PREDICTED: uncharacterized protein LOC101232...   107   8e-21
ref|XP_006590026.1| PREDICTED: uncharacterized protein LOC102660...   104   5e-20
ref|XP_006584200.1| PREDICTED: putative ribonuclease H protein A...   103   9e-20
gb|AAC95175.1| putative non-LTR retroelement reverse transcripta...   101   3e-19

>ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665746 [Glycine max]
          Length = 506

 Score =  237 bits (604), Expect = 5e-60
 Identities = 120/301 (39%), Positives = 188/301 (62%), Gaps = 2/301 (0%)
 Frame = +3

Query: 6    PLLDKMLGRIKS*TTKFLSYAGRAQLIISILFSIQVF*AQVFVLSKKIIQAIEATYRNFL 185
            PL+DK++G+IK  T + LSYAGR QL+ S++F++  +    F   K ++Q IEA  R FL
Sbjct: 159  PLIDKIVGKIKHWTARLLSYAGRLQLVNSVMFALTNYWLNCFPFPKSVLQKIEAICRIFL 218

Query: 186  WTGGSEMSKSVLIAWDKICYPRLSRGLNILDVEMWNKAAICKLLWNICAKQDKLWVKWIH 365
            WTGG E S+   +AW +IC PR   GLNI+D+++WNKA + KLLWN+ +K+D LWVKWI 
Sbjct: 219  WTGGFEGSRKSPVAWKQICSPRSCGGLNIIDIDIWNKANLMKLLWNLSSKEDSLWVKWIQ 278

Query: 366  SYYERNRDILN-DISK*ASRIIQRILKATEYVIQAGYSMTDVQKMQQFSTKKLYIKLRDQ 542
            +YY +  ++++ ++    S I++ ILK  E + +   +M ++      +  KLY KL+D 
Sbjct: 279  AYYVKRSELMHIEMKNTDSWIMKAILKQREDLEKID-NMEELMIRGSINMGKLYRKLQDC 337

Query: 543  FQKVTWRRLVRNNIGPPR*NFMLLLSANWRLATRDRLHKWGVKNKQQCPLCETTDESLNH 722
             Q+  W+ L+  N   PR NF+L L+ + RL+T+DRL K+G+ + + C  C + +ES+NH
Sbjct: 338  GQRKEWKNLLYGNTARPRANFILWLACHGRLSTKDRLCKYGMIDDKSCCFC-SEEESMNH 396

Query: 723  L-FLCDYSSQLWGKLLAWFGIRRAAKGWDEEVRGAVHNARGKTPQAEVYRMLLVAAVYYF 899
            L F+CD S ++W ++L W  IR     W  E+    H+ +GK  +A V +M +   +Y  
Sbjct: 397  LFFVCDNSKRVWMEVLQWVQIRHDPSDWPNELHWLTHHTKGKGTRAAVLKMAIAETIYEI 456

Query: 900  W 902
            W
Sbjct: 457  W 457


>emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1114

 Score =  216 bits (551), Expect = 7e-54
 Identities = 114/302 (37%), Positives = 179/302 (59%), Gaps = 2/302 (0%)
 Frame = +3

Query: 3    QPLLDKMLGRIKS*TTKFLSYAGRAQLIISILFSIQVF*AQVFVLSKKIIQAIEATYRNF 182
            +PL+DK+  R +      LSYAGR QL+ +IL+S+Q +  Q+F L KK+I+A+E T R F
Sbjct: 775  KPLIDKITTRAQGWVAHLLSYAGRLQLVKTILYSMQNYWGQIFPLPKKLIKAVETTCRKF 834

Query: 183  LWTGGSEMSKSVLIAWDKICYPRLSRGLNILDVEMWNKAAICKLLWNICAKQDKLWVKWI 362
            LWTG  + S    +AWD +  P+ + GLN+ ++ +WNKAAI KLLW I  KQDKLWV+W+
Sbjct: 835  LWTGTVDTSYKAPVAWDFLQQPKSTGGLNVTNMVLWNKAAILKLLWAITFKQDKLWVRWV 894

Query: 363  HSYYERNRDILN-DISK*ASRIIQRILKATEYVIQAGYSMTDVQKMQQFSTKKLYIKLRD 539
            ++YY + ++I N  +S   S I+++I ++ E + + G     V     FS KK Y  L++
Sbjct: 895  NAYYIKRQNIENVTVSSNTSWILRKIFESRELLTRTG-GWEAVSNHMNFSIKKTYKLLQE 953

Query: 540  QFQKVTWRRLVRNNIGPPR*NFMLLLSANWRLATRDRLHKWGVKNKQQCPLCETTDESLN 719
             ++ V W+RL+ NN   P+  F+L L+   RLAT +R+ +W       C +C    E++ 
Sbjct: 954  DYENVVWKRLICNNKATPKSQFILWLAMLNRLATAERVSRWNRDVSPLCKMCGNEIETIQ 1013

Query: 720  HLFL-CDYSSQLWGKLLAWFGIRRAAKGWDEEVRGAVHNARGKTPQAEVYRMLLVAAVYY 896
            HLF  C YS ++WGK+L +  ++  A     +   A+  AR    + ++Y M+   +VY 
Sbjct: 1014 HLFFNCIYSKEIWGKVLLYLNLQPQADA-QAKKELAIKKARSTKDRNKLYVMMFTESVYA 1072

Query: 897  FW 902
             W
Sbjct: 1073 IW 1074


>ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664824 [Glycine max]
          Length = 939

 Score =  215 bits (548), Expect = 2e-53
 Identities = 115/302 (38%), Positives = 175/302 (57%), Gaps = 2/302 (0%)
 Frame = +3

Query: 3    QPLLDKMLGRIKS*TTKFLSYAGRAQLIISILFSIQVF*AQVFVLSKKIIQAIEATYRNF 182
            Q L+DK++GRI   +   LSYAGR QLI S++F+   F  Q   L K +I  I A  R+F
Sbjct: 600  QVLIDKIVGRITHWSAGLLSYAGRVQLIQSVIFATINFWMQCLPLPKFVIMRINAICRSF 659

Query: 183  LWTGGSEMSKSVLIAWDKICYPRLSRGLNILDVEMWNKAAICKLLWNICAKQDKLWVKWI 362
            LW G S +S+   IAW+K+C P+++ GLNI+++ +WNK +I KLLWN+C K D LW+KW+
Sbjct: 660  LWIGNSNISRKSPIAWEKVCSPKINGGLNIINLAIWNKISILKLLWNVCNKSDNLWIKWL 719

Query: 363  HSYYERNRDILNDISK*A-SRIIQRILKATEYVIQAGYSMTDVQKMQQFSTKKLYIKLRD 539
            H+YY R + I + + K + S I+  ++K    ++Q    M DV KM     KK+Y+ L +
Sbjct: 720  HTYYIRGQSIWSMVLKKSHSWIMSSMMKLRPLLLQYQSRMQDVFKM-----KKIYLALFE 774

Query: 540  QFQKVTWRRLVRNNIGPPR*NFMLLLSANWRLATRDRLHKWGVKNKQQCPLCETTDESLN 719
            + +K++WR L+ NN+  PR  F L  + ++RLA++DRL K+G+     C  C +  ES  
Sbjct: 775  ESEKMSWRTLMCNNLARPRALFCLWQACHFRLASKDRLIKFGLNVDANCAFCSSM-ESHE 833

Query: 720  HLFL-CDYSSQLWGKLLAWFGIRRAAKGWDEEVRGAVHNARGKTPQAEVYRMLLVAAVYY 896
            HLF  C     +W  +L W  I      W EE+       +GK  +A + +      +Y+
Sbjct: 834  HLFFGCIELKTIWTAVLNWLQIIHMPSTWSEELNWITRKCKGKGWRAMLLKCAFTETIYH 893

Query: 897  FW 902
             W
Sbjct: 894  IW 895


>emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1110

 Score =  213 bits (541), Expect = 1e-52
 Identities = 117/304 (38%), Positives = 179/304 (58%), Gaps = 2/304 (0%)
 Frame = +3

Query: 3    QPLLDKMLGRIKS*TTKFLSYAGRAQLIISILFSIQVF*AQVFVLSKKIIQAIEATYRNF 182
            +PL++ +  R ++   K LSYAGR QLI SIL S+Q + A +F LSKK+IQA+E   R F
Sbjct: 772  KPLVEMITNRAQTWMAKLLSYAGRLQLIKSILSSMQNYWAHIFPLSKKVIQAVEKVCRKF 831

Query: 183  LWTGGSEMSKSVLIAWDKICYPRLSRGLNILDVEMWNKAAICKLLWNICAKQDKLWVKWI 362
            LWTG +E +K   +AW  I  P+   G N+++++ WN+AA+ KLLW I  K+DKLWV+WI
Sbjct: 832  LWTGKTEETKKAPVAWATIQRPKSRGGWNVINMKYWNRAAMLKLLWAIEFKRDKLWVRWI 891

Query: 363  HSYYERNRDILN-DISK*ASRIIQRILKATEYVIQAGYSMTDVQKMQQFSTKKLYIKLRD 539
            HSYY + +DIL  +IS   + I+++I+KA +++   G    ++    +FS KK Y K+ +
Sbjct: 892  HSYYIKRQDILTVNISNQTTWILRKIVKARDHLSNIG-DWDEICIGDKFSMKKAYKKISE 950

Query: 540  QFQKVTWRRLVRNNIGPPR*NFMLLLSANWRLATRDRLHKWGVKNKQQCPLCETTDESLN 719
              ++V WRRL+ NN   P+  F+L +  + RL T DR+ +WGV+      LC    E++ 
Sbjct: 951  NGERVRWRRLICNNYATPKSKFILWMMLHERLPTVDRISRWGVQCDLNYRLCRNDGETIQ 1010

Query: 720  HLFL-CDYSSQLWGKLLAWFGIRRAAKGWDEEVRGAVHNARGKTPQAEVYRMLLVAAVYY 896
            HLF  C YS+ +W K+        +     E +      AR K  + ++  ML    VY 
Sbjct: 1011 HLFFSCSYSAGVWSKICYIMRFPNSGVSHQEIISSVCGQARKK--KGKLIVMLYTEFVYA 1068

Query: 897  FWRE 908
             W++
Sbjct: 1069 IWKQ 1072


>ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663533 [Glycine max]
          Length = 514

 Score =  192 bits (487), Expect = 2e-46
 Identities = 93/252 (36%), Positives = 158/252 (62%), Gaps = 2/252 (0%)
 Frame = +3

Query: 6    PLLDKMLGRIKS*TTKFLSYAGRAQLIISILFSIQVF*AQVFVLSKKIIQAIEATYRNFL 185
            PL++K++G+I+  ++K LS AGR QL+ SI+ +I  +   VF + KK+IQ I++  R+F+
Sbjct: 262  PLVEKIVGKIRHWSSKLLSIAGRIQLVRSIITAIAQYWMSVFPMPKKVIQKIDSICRSFI 321

Query: 186  WTGGSEMSKSVLIAWDKICYPRLSRGLNILDVEMWNKAAICKLLWNICAKQDKLWVKWIH 365
            W+G +E+ +  L+AW ++C P    GLN++++E+WN  A+ K LWNIC+K+D LWVKWIH
Sbjct: 322  WSGSAEVKRKSLVAWKQVCKPARCGGLNLINLELWNVTAMLKCLWNICSKEDNLWVKWIH 381

Query: 366  SYYERNRDILN-DISK*ASRIIQRILKATEYVIQAGYSMTDVQKMQQFSTKKLYIKLRDQ 542
            +Y+ +  ++++  I   ++ I++ ++K    V        ++ + ++FS K++Y++L + 
Sbjct: 382  AYFLKGDNVMSATIKSNSTWILKSVMKQRPQVNNLQLVWIEMLRKRKFSMKQVYMELVED 441

Query: 543  FQKVTWRRLVRNNIGPPR*NFMLLLSANWRLATRDRLHKWGVKNKQQCPLCETTDESLNH 722
              K+ W RL+R N   PR N  L L+   RLAT+ RL    +     C LC+  DE L+H
Sbjct: 442  HNKIDWFRLLRYNRARPRANVTLWLACQNRLATKTRLKNMNMIQCSLCSLCKEQDEDLDH 501

Query: 723  L-FLCDYSSQLW 755
            L F C  +  +W
Sbjct: 502  LMFSCRVTKAIW 513


>ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661523 [Glycine max]
          Length = 947

 Score =  175 bits (443), Expect = 2e-41
 Identities = 90/301 (29%), Positives = 164/301 (54%), Gaps = 2/301 (0%)
 Frame = +3

Query: 6    PLLDKMLGRIKS*TTKFLSYAGRAQLIISILFSIQVF*AQVFVLSKKIIQAIEATYRNFL 185
            PL+DK+  RI+  T+K L+  GR Q++   + +I  F  Q   +   +I+ I++  R+F+
Sbjct: 601  PLIDKITTRIRHWTSKLLNMTGRVQMVNCTITAIVQFWMQCLPIPMSVIKKIDSMCRSFV 660

Query: 186  WTGGSEMSKSVLIAWDKICYPRLSRGLNILDVEMWNKAAICKLLWNICAKQDKLWVKWIH 365
            W+  +E+++   IAW+ +C P+   GLNI ++++WN   +   LWN+C K D LWVKWIH
Sbjct: 661  WSRSTEITRKSPIAWNSVCRPKGQGGLNIFNLKVWNHITVLNCLWNLCKKVDNLWVKWIH 720

Query: 366  SYYERNRDILND-ISK*ASRIIQRILKATEYVIQAGYSMTDVQKMQQFSTKKLYIKLRDQ 542
            ++Y +N  ++N  ++   S +++ +L   EY+        ++   ++F  KK Y K+ + 
Sbjct: 721  AHYIKNSSVMNTMVTNNFSWVLKNVLSQREYIHTLQPVWDELLNSERFKMKKAYDKMMEA 780

Query: 543  FQKVTWRRLVRNNIGPPR*NFMLLLSANWRLATRDRLHKWGVKNKQQCPLCETTDESLNH 722
              +V W  L+R N   PR      L+ + RL T+DRL ++G+   +   LC+  +E+ NH
Sbjct: 781  -DRVHWSGLMRKNCARPRAIHTTWLACHGRLGTKDRLVRFGMITDKIWSLCKEVEETQNH 839

Query: 723  -LFLCDYSSQLWGKLLAWFGIRRAAKGWDEEVRGAVHNARGKTPQAEVYRMLLVAAVYYF 899
             LF C  ++ +W  +L   GI    + W  E+   ++    K  +A + ++ +   +Y  
Sbjct: 840  ILFSCKVATDIWSNVLNRIGIDHVPQEWPLELDWLLNLTNRKGWRAYLLKLSVTETIYGI 899

Query: 900  W 902
            W
Sbjct: 900  W 900


>ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268376 [Solanum
           lycopersicum]
          Length = 717

 Score =  144 bits (364), Expect = 3e-32
 Identities = 60/123 (48%), Positives = 93/123 (75%)
 Frame = +3

Query: 6   PLLDKMLGRIKS*TTKFLSYAGRAQLIISILFSIQVF*AQVFVLSKKIIQAIEATYRNFL 185
           PL++K++ RI S T K LSYAGRAQL+ ++LF +Q   AQ+F++  KII+ IE   R++L
Sbjct: 581 PLIEKVMARINSWTAKKLSYAGRAQLVKTVLFGVQALWAQLFIIPAKIIKLIEGLCRSYL 640

Query: 186 WTGGSEMSKSVLIAWDKICYPRLSRGLNILDVEMWNKAAICKLLWNICAKQDKLWVKWIH 365
           W+G   ++K  LIAWDK+C P+   GL ++++++WN++A+ KL W++  K+DKLW+KWIH
Sbjct: 641 WSGVGYVTKKALIAWDKVCSPKYEGGLGLINLKIWNRSAVTKLCWDLANKEDKLWIKWIH 700

Query: 366 SYY 374
           +YY
Sbjct: 701 AYY 703


>gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana]
          Length = 653

 Score =  129 bits (323), Expect = 2e-27
 Identities = 79/302 (26%), Positives = 143/302 (47%), Gaps = 1/302 (0%)
 Frame = +3

Query: 6    PLLDKMLGRIKS*TTKFLSYAGRAQLIISILFSIQVF*AQVFVLSKKIIQAIEATYRNFL 185
            PLL+ +  +I + TT++LSYAGR  LI S+L+SI  F    F L ++ I+ I+     FL
Sbjct: 315  PLLEHIKKKIGTWTTRYLSYAGRLNLITSVLWSICNFWLAAFRLPRECIREIDKICSAFL 374

Query: 186  WTGGSEMSKSVLIAWDKICYPRLSRGLNILDVEMWNKAAICKLLWNICAKQDKLWVKWIH 365
            W+G     +   + W  +C P+   GL +  ++  N+ +  KL+W I +  + LWV+WI 
Sbjct: 375  WSGPDLNPRKTRVCWGDVCKPKQEGGLGLRSLKEMNEVSCLKLIWRIVSHTNSLWVRWIE 434

Query: 366  SYYERNRDILNDISK*ASRIIQRILKATEYVIQAGYSMTDVQKMQQFSTKKLYIKLRDQF 545
             Y  ++    +             ++ T  +    +   + + M +FST+  + + R+  
Sbjct: 435  QYLLKHDTFWS-------------VQTTTNMDSVLWRGRNDEYMPKFSTRDTWNQTRNTS 481

Query: 546  QKVTWRRLVRNNIGPPR*NFMLLLSANWRLATRDRLHKWGVKNKQQCPLCETTDESLNHL 725
              VTW   +      P+ +F   L+   RL+T D++ +W  +    C LC    E+ NHL
Sbjct: 482  TPVTWHMGIWFAHATPKFSFCAWLAVQNRLSTGDKMLQWNRRLSPTCVLCNNNIETRNHL 541

Query: 726  FL-CDYSSQLWGKLLAWFGIRRAAKGWDEEVRGAVHNARGKTPQAEVYRMLLVAAVYYFW 902
            F  C Y++++W  L       + +  W   +       R +T ++ + R +  A ++  W
Sbjct: 542  FFSCCYTAEIWENLAKNIYKAKFSTNWSTILTSVSTTWRNRT-ESFLARYIFQATIHTIW 600

Query: 903  RE 908
             E
Sbjct: 601  HE 602


>gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thaliana]
            gi|20197043|gb|AAM14892.1| putative reverse transcriptase
            [Arabidopsis thaliana]
          Length = 1412

 Score =  127 bits (319), Expect = 6e-27
 Identities = 83/268 (30%), Positives = 130/268 (48%), Gaps = 15/268 (5%)
 Frame = +3

Query: 6    PLLDKMLGRIKS*TTKFLSYAGRAQLIISILFSIQVF*AQVFVLSKKIIQAIEATYRNFL 185
            PLL+K+  RI S   +FLSYAGR QL+ S++ S+  F    F L +  I+ IE     FL
Sbjct: 1043 PLLEKIRSRISSWKNRFLSYAGRLQLLNSVISSLTKFWISAFRLPRACIREIEQISAAFL 1102

Query: 186  WTGGSEMSKSVLIAWDKICYPRLSRGLNILDVEMWNKAAICKLLWNICAKQDKLWVKWI- 362
            W+G         +AW  +C P+   GL +  +   NK    KL+W + + +  LWV WI 
Sbjct: 1103 WSGTDLNPHKAKVAWHDVCKPKSEGGLGLRSLVDANKICCFKLIWRLVSAKHSLWVNWIQ 1162

Query: 363  -------------HSYYERNRDILNDISK*ASRIIQRILKATEYVIQAGYSMTDVQKMQQ 503
                         H       DILNDI +   +++ R +  TE       S+    K + 
Sbjct: 1163 NNLIRTVAEALSSHRRRSHRDDILNDIEEELEKLLCRGI-CTEQDRSLCRSIGGQFKAKF 1221

Query: 504  FSTKKLYIKLRDQFQKVTWRRLVRNNIGPPR*NFMLLLSANWRLATRDRLHKWGVKNKQQ 683
            FS  +++ ++R+Q     W + +  +   P+  F+  L+A+ RL T D++  W       
Sbjct: 1222 FS-PEIWHQIREQGLVKQWHKAIWFSGATPKFTFISWLAAHDRLTTGDKMASWNRGISSV 1280

Query: 684  CPLCETTDESLNHLFL-CDYSSQLWGKL 764
            C LC  + ES +HLF  C++SS +W +L
Sbjct: 1281 CVLCNISAESRDHLFFSCNFSSHIWDRL 1308


>ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670237 [Glycine max]
          Length = 383

 Score =  123 bits (309), Expect = 8e-26
 Identities = 78/252 (30%), Positives = 129/252 (51%), Gaps = 6/252 (2%)
 Frame = +3

Query: 51  KFLSYAGRAQLIISILFSIQVF*AQVFVLSKKIIQAIEATYRNFLWTGGSEMSKSVLIAW 230
           K LSYAG+ +LI +++  I  F   +F L + ++  I AT RNFLW          L+AW
Sbjct: 114 KSLSYAGKVELIRAVIQGIANFWMSIFPLPQSVLDTIIATCRNFLWGKADGGKIKPLVAW 173

Query: 231 DKICYPRLSRGLNILDVEMWNKAAICKLLWNICAKQDKLWVKWIHSYYERNRDILNDISK 410
            ++C P+   GL + +++ WN A +  +LW++ +K+D LWV+ +H YY +  ++ + IS 
Sbjct: 174 SEVCTPKKEGGLGLFNLKDWNIALLSCILWDLHSKKDSLWVRLVHHYYFKGGNVWDFISS 233

Query: 411 *ASRI---IQRILKATEYVIQAGYSMTDVQKM-QQFSTKKLYIKLRDQFQKVTWRRLVRN 578
            +  +   I+ I+ + E  I+    M +     +Q    K+Y  +R     V W  ++ N
Sbjct: 234 SSDSVFIHIRDIIISKEENIEVAKLMLNSWGCNEQTLAGKMYDYIRGTRPVVHWSSIIWN 293

Query: 579 NIGPPR*NFMLLLSANWRLATRDRLHKWGVKNKQ-QCPLCETTDESLNHLFL-CDYSSQL 752
            + P + +F+L L+   RL   DR       NK   CPLC    ES  HLF  C  S ++
Sbjct: 294 PVIPSKMSFILWLATKNRLLALDRA---AFLNKGFLCPLCTNEAESHAHLFFSCRTSLRV 350

Query: 753 WGKLLAWFGIRR 788
           W  +  W  ++R
Sbjct: 351 WAHIRDWIPLKR 362


>gb|ABD33260.1| non-LTR retroelement reverse transcriptase-like protein, related
           [Medicago truncatula]
          Length = 120

 Score =  114 bits (286), Expect = 4e-23
 Identities = 52/100 (52%), Positives = 78/100 (78%)
 Frame = +3

Query: 18  KMLGRIKS*TTKFLSYAGRAQLIISILFSIQVF*AQVFVLSKKIIQAIEATYRNFLWTGG 197
           +++ RI++ ++KFLSYAGR QLI S+LF +Q + +QVFVL +K+++ I+   R FLWTG 
Sbjct: 21  RIIYRIENWSSKFLSYAGRLQLIKSVLFGVQTYWSQVFVLPQKVLKLIQTACRVFLWTGK 80

Query: 198 SEMSKSVLIAWDKICYPRLSRGLNILDVEMWNKAAICKLL 317
           S  SK  LIAW++IC P+ + G N++D+++WN+AAICKLL
Sbjct: 81  SGTSKRALIAWERICLPKTAGGWNVIDLKVWNQAAICKLL 120


>gb|AAD39277.1|AC007203_9 Hypothetical protein [Arabidopsis thaliana]
          Length = 355

 Score =  114 bits (285), Expect = 5e-23
 Identities = 77/269 (28%), Positives = 117/269 (43%), Gaps = 1/269 (0%)
 Frame = +3

Query: 6   PLLDKMLGRIKS*TTKFLSYAGRAQLIISILFSIQVF*AQVFVLSKKIIQAIEATYRNFL 185
           PL++K+  RI S T +FLSY GR QLI S+L SI  F +  F L    ++ IE     FL
Sbjct: 33  PLIEKIRKRISSWTGRFLSYCGRLQLIKSVLMSITNFWSSAFRLPGNCMKEIERLCSAFL 92

Query: 186 WTGGSEMSKSVLIAWDKICYPRLSRGLNILDVEMWNKAAICKLLWNICAKQDKLWVKWIH 365
           W+G    + +  IAW K+C P    GL +  ++  N     KL+W + A Q  LW +W+ 
Sbjct: 93  WSGPDLKTHNAKIAWSKVCLPMCEGGLGLRPLKEINTVCGLKLIWRLLASQTSLWGQWVQ 152

Query: 366 SYYERNRDILNDISK*ASRIIQRILKATEYVIQAGYSMTDVQKMQQFSTKKLYIKLRDQF 545
           +Y  R  +                +KA+ Y               Q S   +  +   +F
Sbjct: 153 TYLIRRNNFW-------------AIKASSY---------------QGSWMCMVPQATPKF 184

Query: 546 QKVTWRRLVRNNIGPPR*NFMLLLSANWRLATRDRLHKWGVKNKQQCPLCETTDESLNHL 725
             +TW                  L  + RL+T DR+ KW  +    C  C+   E+ +HL
Sbjct: 185 AFITW------------------LGMHNRLSTGDRMQKWNGQADSTCVFCQDPLETRDHL 226

Query: 726 FL-CDYSSQLWGKLLAWFGIRRAAKGWDE 809
           F  C Y++Q+W  +   F   +    WD+
Sbjct: 227 FFHCHYANQIWEIIAKGFMGVQYTSNWDQ 255


>ref|XP_002331075.1| predicted protein [Populus trichocarpa]
          Length = 517

 Score =  112 bits (281), Expect = 1e-22
 Identities = 51/128 (39%), Positives = 81/128 (63%), Gaps = 1/128 (0%)
 Frame = +3

Query: 9   LLDKMLGRIKS*TTKFLSYAGRAQLIISILFSIQVF*AQVFVLSKKIIQAIEATYRNFLW 188
           L+D++  +++  T + LSYAGR QLI S+LFSIQV+ A +F+L  ++I+ +E   ++FLW
Sbjct: 76  LVDRITSKVRHWTCRTLSYAGRVQLINSVLFSIQVYWASLFLLPGQVIKNVEQIMKSFLW 135

Query: 189 TGGSEMSKSVLIAWDKICYPRLSRGLNILDVEMWNKAAICKLLWNICAKQD-KLWVKWIH 365
           +G    +    +AWD++C P+   GL I  ++ WNK A+ K +WN+C   D  +W  WI 
Sbjct: 136 SGSDMRTTGAKVAWDQVCLPKKEGGLGIKSIKEWNKIALLKHIWNLCNDSDGSIWSTWIR 195

Query: 366 SYYERNRD 389
           S   R R+
Sbjct: 196 SNLLRGRN 203



 Score = 87.0 bits (214), Expect = 9e-15
 Identities = 46/137 (33%), Positives = 72/137 (52%), Gaps = 1/137 (0%)
 Frame = +3

Query: 501 QFSTKKLYIKLRDQFQKVTWRRLVRNNIGPPR*NFMLLLSANWRLATRDRLHKWGVKNKQ 680
           +FS K  + +LR   Q V W  +V      PR +F+L ++   +L T+D+LH++G+    
Sbjct: 323 RFSVKVAWEQLRRHRQMVEWHDIVWFKNAVPRHSFLLWMAVQQKLTTQDKLHRFGIHGPN 382

Query: 681 QCPLCETTDESLNHLFL-CDYSSQLWGKLLAWFGIRRAAKGWDEEVRGAVHNARGKTPQA 857
           +C LC   +E  NHLF  C Y+  +W  +     I R  KGWDE +R A  +  GK+   
Sbjct: 383 RCSLCLRNNEDHNHLFFECSYTKAIWWDVCDRCDIPRMTKGWDEWIRWATVSWHGKSFVN 442

Query: 858 EVYRMLLVAAVYYFWRE 908
              ++   A VY+ W+E
Sbjct: 443 FSCKLSFAATVYHVWQE 459


>ref|XP_006584390.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Glycine
           max]
          Length = 316

 Score =  110 bits (274), Expect = 9e-22
 Identities = 58/173 (33%), Positives = 100/173 (57%), Gaps = 2/173 (1%)
 Frame = +3

Query: 6   PLLDKMLGRIKS*TTKFLSYAGRAQLIISILFSIQVF*AQVFVLSKKIIQAIEATYRNFL 185
           PLL K++G I+    K LSY G+ +LI +++  I  F  ++F L + ++  I A+  NFL
Sbjct: 144 PLLYKIVGLIQGWNKKSLSYVGKLELIKAVIQGIMNFWMRIFPLPQSVLDRINASCCNFL 203

Query: 186 WTGGSEMSKSVLIAWDKICYPRLSRGLNILDVEMWNKAAICKLLWNICAKQDKLWVKWIH 365
           W+         L+AW  +C P+   GL + +++ WN A +  +LW+   K+D L V+W+H
Sbjct: 204 WSKADIGKNKPLVAWPVVCSPKQEGGLGLFNLKDWNLALLSHILWDFHCKKDSLRVRWVH 263

Query: 366 SYYERNRDILN-DISK*ASRIIQRILKATEYVIQAGYSMTDVQK-MQQFSTKK 518
            YY R  D  N +IS   S +I++I++  +++I    SM + +K +Q +ST +
Sbjct: 264 HYYFRRSDEWNYNISSSNSVLIKKIIQIRDFIISKELSMEETKKRIQSWSTNE 316


>ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298394 [Fragaria vesca
            subsp. vesca]
          Length = 958

 Score =  109 bits (273), Expect = 1e-21
 Identities = 53/131 (40%), Positives = 73/131 (55%)
 Frame = +3

Query: 6    PLLDKMLGRIKS*TTKFLSYAGRAQLIISILFSIQVF*AQVFVLSKKIIQAIEATYRNFL 185
            PLLD++  RIKS   K LS+AGR QLI S+L SIQV+ A   +L KK+++ IE   R FL
Sbjct: 610  PLLDRIETRIKSWENKVLSFAGRLQLIQSVLSSIQVYWASHLILPKKVLKDIEKRLRCFL 669

Query: 186  WTGGSEMSKSVLIAWDKICYPRLSRGLNILDVEMWNKAAICKLLWNICAKQDKLWVKWIH 365
            W G      +  +AW +IC P+   GL I D+  WNKA +   +WN+ +     W  W+ 
Sbjct: 670  WAGNCSGRAATKVAWSEICLPKCEGGLGIKDLHCWNKALMISHIWNLVSSSSNFWTDWVK 729

Query: 366  SYYERNRDILN 398
             Y  +     N
Sbjct: 730  VYLLKGNSFWN 740


>ref|XP_006598659.1| PREDICTED: uncharacterized protein LOC102659749 [Glycine max]
          Length = 686

 Score =  108 bits (270), Expect = 3e-21
 Identities = 67/254 (26%), Positives = 117/254 (46%), Gaps = 2/254 (0%)
 Frame = +3

Query: 147  IIQAIEATYRNFLWTGGSEMSKSVLIAWDKICYPRLSRGLNILDVEMWNKAAICKLLWNI 326
            +IQ I++  R+F+W+G +E+ +   +AW +                              
Sbjct: 427  VIQKIDSICRSFIWSGSAEVKRKSPVAWKQ------------------------------ 456

Query: 327  CAKQDKLWVKWIHSYYERNRDILNDISK*ASR-IIQRILKATEYVIQAGYSMTDVQKMQQ 503
                    VKWIH+Y+ +  ++++   K  S  I++ I+K    V        ++ + ++
Sbjct: 457  --------VKWIHAYFLKGNNVMSATVKSNSTWILKSIMKQRPQVNNLQQIWIEMLRKRK 508

Query: 504  FSTKKLYIKLRDQFQKVTWRRLVRNNIGPPR*NFMLLLSANWRLATRDRLHKWGVKNKQQ 683
            FS K++Y++L +   +  W RL+R N   PR N  L L+   RLAT+ RL    +     
Sbjct: 509  FSMKQVYMELVEDHNRADWFRLLRYNRARPRANVTLWLACQNRLATKTRLKNMNLIQCSL 568

Query: 684  CPLCETTDESLNHL-FLCDYSSQLWGKLLAWFGIRRAAKGWDEEVRGAVHNARGKTPQAE 860
            C LC+  DE L+HL F C  +  +W ++L W  I    + W +EVR  +   +GK  + E
Sbjct: 569  CSLCKEQDEDLDHLMFSCRVTKAIWLEVLKWMDIDHTPQMWRDEVRWVMQYTKGKGWKKE 628

Query: 861  VYRMLLVAAVYYFW 902
            + ++     VY  W
Sbjct: 629  ILKLAFSKVVYGTW 642


>ref|XP_004173733.1| PREDICTED: uncharacterized protein LOC101232446, partial [Cucumis
           sativus]
          Length = 382

 Score =  107 bits (266), Expect = 8e-21
 Identities = 50/130 (38%), Positives = 78/130 (60%)
 Frame = +3

Query: 6   PLLDKMLGRIKS*TTKFLSYAGRAQLIISILFSIQVF*AQVFVLSKKIIQAIEATYRNFL 185
           PL+ ++  RI+S + + LS+AGR QL+ S+L S+QV+ A VF+L  K+ + ++   R++L
Sbjct: 55  PLIQRITSRIRSWSARVLSFAGRLQLVRSVLRSLQVYWASVFMLPMKVHRDVDKILRSYL 114

Query: 186 WTGGSEMSKSVLIAWDKICYPRLSRGLNILDVEMWNKAAICKLLWNICAKQDKLWVKWIH 365
           W G  E      +AWD++C P    GL I D   WN A+  K+LW +  K   LWV W+ 
Sbjct: 115 WRGKEEGRGGAKVAWDEVCLPFDEGGLAIRDGSSWNIASTLKILWLLLVKSGSLWVAWVE 174

Query: 366 SYYERNRDIL 395
           +Y  + R +L
Sbjct: 175 AYILKGRSML 184


>ref|XP_006590026.1| PREDICTED: uncharacterized protein LOC102660482 [Glycine max]
          Length = 303

 Score =  104 bits (259), Expect = 5e-20
 Identities = 57/167 (34%), Positives = 93/167 (55%), Gaps = 1/167 (0%)
 Frame = +3

Query: 6   PLLDKMLGRIKS*TTKFLSYAGRAQLIISILFSIQVF*AQVFVLSKKIIQAIEATYRNFL 185
           PLL K+ G I+  + K LSYAG+ +LI +++  I  F   +F L + ++  I A+ RNFL
Sbjct: 111 PLLSKITGLIQGWSRKSLSYAGKLELIRAVIQGIVNFWIGIFPLPQSVLDRINASCRNFL 170

Query: 186 WTGGSEMSKSVLIAWDKICYPRLSRGLNILDVEMWNKAAICKLLWNICAKQDKLWVKWIH 365
           W       K  L+AW  +C P+   GL + +++ WN A +  +LW+   K+D LWV   H
Sbjct: 171 WGKADIGKKKPLVAWSVVCSPKREGGLGLFNLKDWNLALLSCILWDFHCKKDSLWV---H 227

Query: 366 SYYERNRDILN-DISK*ASRIIQRILKATEYVIQAGYSMTDVQKMQQ 503
            YY R  D+ N + S   S +I++I++  +++I    S  + +K  Q
Sbjct: 228 HYYFRRSDVWNYNTSSSYSVLIKKIIQIRDFIISKELSTEEAKKRIQ 274


>ref|XP_006584200.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Glycine
           max]
          Length = 239

 Score =  103 bits (257), Expect = 9e-20
 Identities = 48/129 (37%), Positives = 76/129 (58%)
 Frame = +3

Query: 6   PLLDKMLGRIKS*TTKFLSYAGRAQLIISILFSIQVF*AQVFVLSKKIIQAIEATYRNFL 185
           PLL K+ G I+  + K LSYAG+ +LI +++  I  F  ++F LS+ ++  I A+  NFL
Sbjct: 111 PLLSKITGLIQGWSRKSLSYAGKLELIRAVIQGIVNFWMKIFPLSQSVLDRINASCCNFL 170

Query: 186 WTGGSEMSKSVLIAWDKICYPRLSRGLNILDVEMWNKAAICKLLWNICAKQDKLWVKWIH 365
           W          LIAW  +C P+   GL + +++ WN   + ++LW+   K+D LWV+W+H
Sbjct: 171 WGKADIGKNKSLIAWSVVCSPKKEGGLGLFNLKDWNLTLLSRILWDFHCKKDFLWVRWVH 230

Query: 366 SYYERNRDI 392
            YY R  D+
Sbjct: 231 HYYFRASDV 239


>gb|AAC95175.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1352

 Score =  101 bits (252), Expect = 3e-19
 Identities = 51/125 (40%), Positives = 76/125 (60%)
 Frame = +3

Query: 6    PLLDKMLGRIKS*TTKFLSYAGRAQLIISILFSIQVF*AQVFVLSKKIIQAIEATYRNFL 185
            PL++K+  RI S T +FLS+AGR QLI S+L SI  F   VF L K  +Q IE  +  FL
Sbjct: 935  PLVEKIRARITSWTNRFLSFAGRLQLIKSVLSSITNFWLSVFRLPKACLQEIEKMFSAFL 994

Query: 186  WTGGSEMSKSVLIAWDKICYPRLSRGLNILDVEMWNKAAICKLLWNICAKQDKLWVKWIH 365
            W+G    +K   IAW ++C  +   GL +  ++  N+ ++ KL+W I + +D LWVKW++
Sbjct: 995  WSGPDLNTKKAKIAWSEVCKLKEEGGLGLKPLKEANEVSLLKLIWRILSARDSLWVKWVN 1054

Query: 366  SYYER 380
             +  R
Sbjct: 1055 KHLIR 1059

