BLASTX nr result

ID: Atropa21_contig00013282 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00013282
         (1214 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...   153   2e-50
emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...   139   3e-48
ref|XP_006598323.1| PREDICTED: uncharacterized protein LOC102660...   144   8e-47
ref|XP_004240779.1| PREDICTED: uncharacterized protein LOC101256...   144   1e-42
gb|ABD33261.1| RNA-directed DNA polymerase (Reverse transcriptas...   115   4e-41
ref|XP_004252692.1| PREDICTED: uncharacterized protein LOC101261...   109   1e-38
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...   100   3e-33
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]           100   3e-33
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   101   3e-32
ref|XP_006586520.1| PREDICTED: uncharacterized protein LOC102662...   112   4e-32
ref|XP_004247247.1| PREDICTED: uncharacterized protein LOC101256...    88   4e-32
gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana]               91   1e-30
ref|XP_006577697.1| PREDICTED: uncharacterized protein LOC102664...   140   1e-30
dbj|BAF00918.1| putative reverse transcriptase [Arabidopsis thal...    90   2e-30
gb|AAD12028.1| putative non-LTR retroelement reverse transcripta...    99   3e-30
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...    90   4e-30
gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcrip...    94   4e-30
gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00...    96   1e-29
emb|CCA66235.1| hypothetical protein [Beta vulgaris subsp. vulga...    92   2e-29
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]        87   4e-29

>emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1114

 Score =  153 bits (387), Expect(2) = 2e-50
 Identities = 100/310 (32%), Positives = 158/310 (50%), Gaps = 1/310 (0%)
 Frame = +1

Query: 43   GCLFSWSNKRDANVGIYSLID*AFGNDHWFMAYSNLEAYYQTPQFFDHDPSIIRTEIEKQ 222
            G  +SW+NK      I S ID +F N  W   Y ++   Y+     DH P I     +  
Sbjct: 180  GLFYSWNNKSIGADRISSRIDKSFVNVAWINQYPDVVVEYREAGISDHSPLIFNLATQHD 239

Query: 223  *LPRPCKMLNVLLSQAVF*NTVLEGLRKEIPVHIMFKICKKLTKIQQAS*QMN-R*MTSL 399
               RP K LN L  Q  F   V E          M  I  +L  +++A    + +  +  
Sbjct: 240  EGGRPFKFLNFLADQNGFVEVVKEAWGSANHRFKMKNIWVRLQAVKRALKSFHSKKFSKA 299

Query: 400  DVKLENIQEKVDTIQRALKDDKFSLQLIQEEKKLIMQLEHWSHIHERVLRQKPRATWIGH 579
              ++E ++ K+  +Q AL +     +L +EEK LI QL  WS I E +L+QK R  W+  
Sbjct: 300  HCQVEELRRKLAAVQ-ALPEVSQVSELQEEEKDLIAQLRKWSTIDESILKQKSRIQWLSL 358

Query: 580  GNSNTKYFHASLKVRQSKNMVSSICNNQDILLTDPQLVQYEFVQFF*GLLGQATKELPCI 759
            G+SN+K+F  ++KVR+++N +  + N++   LT+   +Q E   F+  LLG ++ +L  I
Sbjct: 359  GDSNSKFFFTAIKVRKARNKIVLLQNDRGDQLTENTEIQNEICNFYRRLLGTSSSQLEAI 418

Query: 760  DVKIARDGH*LSYDKKQ*ILQPVTSSDIVGVLKELPADKSPSIDGFPTDFFKEFWDILGD 939
            D+ + R G  LS      ++QP+T  +I   L ++   K+P +DGF + FFK+ W ++  
Sbjct: 419  DLHVVRVGAKLSATSCAQLVQPITIQEIDQALADIDDTKAPGLDGFNSVFFKKSWLVIKQ 478

Query: 940  EITTVILDFF 969
            EI   ILDFF
Sbjct: 479  EIYEGILDFF 488



 Score = 73.9 bits (180), Expect(2) = 2e-50
 Identities = 35/84 (41%), Positives = 57/84 (67%)
 Frame = +3

Query: 963  FF*NRKLIKEINCTTITLVSKVSNPTSVKKFRLIACCTSVYKIIAKVLTVNLKLVVDSLV 1142
            FF N  + K INCT +TL+ K+      K +R IACC+++YKII+K+LT  L+ V+  +V
Sbjct: 487  FFENGFMHKPINCTAVTLIPKIDEAKHAKDYRPIACCSTLYKIISKILTKRLQAVITEVV 546

Query: 1143 YPSQSAFIK*KNILNNVIVAYELV 1214
              +Q+ FI  ++I +N+++A EL+
Sbjct: 547  DCAQTGFIPERHIGDNILLATELI 570


>emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1110

 Score =  139 bits (350), Expect(2) = 3e-48
 Identities = 91/308 (29%), Positives = 148/308 (48%), Gaps = 2/308 (0%)
 Frame = +1

Query: 52   FSWSNKRDANVGIYSLID*AFGNDHWFMAYSNLEAYYQTPQFFDHDPSIIRTEIEKQ*LP 231
            +SWSN       + S ID A+ N  W   Y+ +   Y  P   DH P +      +    
Sbjct: 180  YSWSNSSIGRDRVLSRIDKAYVNLVWLGMYAEVSVQYLPPGISDHSPLLFNLMTGRPQGG 239

Query: 232  RPCKMLNVLLSQAVF*NTVLEGLRKEIPVHIMFKICKKLTKIQQAS*QMNR*MTSL-DVK 408
            +P K +NV+  Q  F  TV +          +  I   L  +++   QM      L   K
Sbjct: 240  KPFKFMNVMAEQGEFLETVEKAWNSVNGRFKLQAIWLNLKAVKRELKQMKTQKIGLAHEK 299

Query: 409  LENIQEKVDTIQRALKDDKFSLQLIQEEKKLIMQ-LEHWSHIHERVLRQKPRATWIGHGN 585
            ++N++ ++  +Q   +DD     ++Q + K IM  L HWSHI + +L+QK R TW+  G+
Sbjct: 300  VKNLRHQLQDLQS--QDDFDHNDIMQTDAKSIMNDLRHWSHIEDSILQQKSRITWLQQGD 357

Query: 586  SNTKYFHASLKVRQSKNMVSSICNNQDILLTDPQLVQYEFVQFF*GLLGQATKELPCIDV 765
            +N+K F  ++K R + N +  +      ++ D   VQ E ++F+  LLG     L  +D+
Sbjct: 358  TNSKLFFTAVKARHAINRIDMLNTEDGRVIQDADEVQEEILEFYKKLLGTRASTLMGVDL 417

Query: 766  KIARDGH*LSYDKKQ*ILQPVTSSDIVGVLKELPADKSPSIDGFPTDFFKEFWDILGDEI 945
               R G  LS   K+ +++ V S++I   L  +  DK+P +DGF   FFK+ W  +  EI
Sbjct: 418  NTVRGGKCLSAQAKESLIREVASTEIDEALAGIGNDKAPGLDGFNAYFFKKSWGSIKQEI 477

Query: 946  TTVILDFF 969
               I +FF
Sbjct: 478  YAGIQEFF 485



 Score = 81.3 bits (199), Expect(2) = 3e-48
 Identities = 38/84 (45%), Positives = 60/84 (71%)
 Frame = +3

Query: 963  FF*NRKLIKEINCTTITLVSKVSNPTSVKKFRLIACCTSVYKIIAKVLTVNLKLVVDSLV 1142
            FF N ++ + INC  +TL+ KV + T VK+FR IACCT +YKII+K+LT  +K ++  +V
Sbjct: 484  FFNNSRMHRPINCIVVTLLPKVQHATRVKEFRPIACCTVIYKIISKMLTNRMKGIIGEVV 543

Query: 1143 YPSQSAFIK*KNILNNVIVAYELV 1214
              +QS FI  ++I +N+++A EL+
Sbjct: 544  NEAQSGFIPGRHIADNILLASELI 567


>ref|XP_006598323.1| PREDICTED: uncharacterized protein LOC102660513 [Glycine max]
          Length = 543

 Score =  144 bits (364), Expect(2) = 8e-47
 Identities = 89/317 (28%), Positives = 154/317 (48%), Gaps = 3/317 (0%)
 Frame = +1

Query: 28   QLNMKGCLFSWSNKRDANVGIYSLID*AFGNDHWFMAYSNLEAYYQTPQFFDHDPSIIRT 207
            +++ KG  ++WSNK+  N+ IYS ID   GN  WF    NL     TP   DH    +R 
Sbjct: 142  EMDSKGDYYTWSNKQSENI-IYSRIDRILGNTEWFSKNLNLSLTNMTPGISDHAMLCLRD 200

Query: 208  EIEKQ*LPRPCKMLNVLLSQAVF*NTVLEG---LRKEIPVHIMFKICKKLTKIQQAS*QM 378
            +          K  N +     F  TV       R+  P   M  +  KL K+Q     +
Sbjct: 201  DSVPVKRKARFKYANCVSGMDNFTETVANSWNSARRGGPP--MKMLWHKLKKLQPVINNL 258

Query: 379  NR*MTSLDVKLENIQEKVDTIQRALKDDKFSLQLIQEEKKLIMQLEHWSHIHERVLRQKP 558
            ++ +  + VKL+  +EK+   Q  L  D+ +   I         +  W+ + E++L+Q+ 
Sbjct: 259  SKPLIGIKVKLQEAREKLTHAQMELTLDRLNKDKIDRTNDCTEAVIKWTEMEEQMLQQRA 318

Query: 559  RATWIGHGNSNTKYFHASLKVRQSKNMVSSICNNQDILLTDPQLVQYEFVQFF*GLLGQA 738
            +  W+  G+ N  YFHASLK + ++  +  +  N    +T  + ++ E ++F+  L+G+ 
Sbjct: 319  KIRWLRLGDGNNAYFHASLKAKYNQTSIKKLYMNDGNFVTTQKEIEDEIMRFYGDLMGRE 378

Query: 739  TKELPCIDVKIARDGH*LSYDKKQ*ILQPVTSSDIVGVLKELPADKSPSIDGFPTDFFKE 918
               L  +D+ I R G  L++D+++ ++  +T  +I   LK +   K+P IDG+   FFK+
Sbjct: 379  EPNLDSVDINIMRKGCQLNFDQRKYLIGRITDEEIDKALKSIGDLKAPGIDGYGAKFFKD 438

Query: 919  FWDILGDEITTVILDFF 969
             W I+  + T  I +FF
Sbjct: 439  AWSIIKSDFTDAIREFF 455



 Score = 70.9 bits (172), Expect(2) = 8e-47
 Identities = 32/84 (38%), Positives = 55/84 (65%)
 Frame = +3

Query: 963  FF*NRKLIKEINCTTITLVSKVSNPTSVKKFRLIACCTSVYKIIAKVLTVNLKLVVDSLV 1142
            FF   K+ + IN + + L+ K       + +R I+CCT++YK+I+KVLT  L  V+ S+V
Sbjct: 454  FFEKGKMYEPINTSLVILIPKNQEAKYARDYRPISCCTTIYKVISKVLTTRLSRVIKSIV 513

Query: 1143 YPSQSAFIK*KNILNNVIVAYELV 1214
            + SQ+AF+  + I + +++AYEL+
Sbjct: 514  HQSQAAFVPGQKIHDQILLAYELI 537


>ref|XP_004240779.1| PREDICTED: uncharacterized protein LOC101256493 [Solanum
            lycopersicum]
          Length = 441

 Score =  144 bits (364), Expect(2) = 1e-42
 Identities = 95/315 (30%), Positives = 154/315 (48%), Gaps = 1/315 (0%)
 Frame = +1

Query: 28   QLNMKGCLFSWSNKRDANVGIYSLID*AFGNDHWFMAYSNLEAYYQTPQFFDHDPSIIRT 207
            +L  KG  ++WSNK+  N  +   ID AFGND W   + ++   Y  P   DH    +  
Sbjct: 80   ELQWKGSYYTWSNKQIGNARVSRRIDRAFGNDEWMDKWGHVILEYGNPGVSDHSTMQLVL 139

Query: 208  EIEKQ*LPRPCKMLNVLLSQAVF*NTVLEGLRKEIPVHIMFKICKKLTKIQQAS*QMNR* 387
                Q +    K  N+     +F + V +  ++E     + K+  KL  +Q    Q+NR 
Sbjct: 140  HQSNQHVRASFKFFNIWTEHDLFLDLVEKVWKQEKDRDAIKKVWYKLKALQPVLKQLNRK 199

Query: 388  -MTSLDVKLENIQEKVDTIQRALKDDKFSLQLIQEEKKLIMQLEHWSHIHERVLRQKPRA 564
                +  ++E  + ++  IQ  L       +L+ +EK+L+ +LE  S I E  LRQK RA
Sbjct: 200  EFKYISNQIEEARNELIDIQNQLCHQAKD-ELVTKEKELLTKLEKLSLIKESALRQKVRA 258

Query: 565  TWIGHGNSNTKYFHASLKVRQSKNMVSSICNNQDILLTDPQLVQYEFVQFF*GLLGQATK 744
             WI  G++N KY  + +K R  K  +  + +     L++PQ +Q EFV F   L+G A  
Sbjct: 259  KWIKLGDANNKYLSSVIKERNHKKNIRILMSLDGRKLSEPQEIQDEFVLFDKSLMGTAAN 318

Query: 745  ELPCIDVKIARDGH*LSYDKKQ*ILQPVTSSDIVGVLKELPADKSPSIDGFPTDFFKEFW 924
             L  I+V++ + G  LS   +  +   +T  +IV  LK +  +K+P IDG+   FFK  W
Sbjct: 319  NLSAINVQVMKRGPVLSRQHRIQLCATITDQEIVEALKSIGNEKAPGIDGYNALFFKHTW 378

Query: 925  DILGDEITTVILDFF 969
             I+  ++   +  FF
Sbjct: 379  KIIEHDVIDAVKSFF 393



 Score = 57.0 bits (136), Expect(2) = 1e-42
 Identities = 25/48 (52%), Positives = 35/48 (72%)
 Frame = +3

Query: 963  FF*NRKLIKEINCTTITLVSKVSNPTSVKKFRLIACCTSVYKIIAKVL 1106
            FF   KL K  NCT +T++ KV +P +VK++R IACC  +YKII+KV+
Sbjct: 392  FFTTGKLFKPFNCTLVTVIPKVHSPKNVKEYRPIACCRVLYKIISKVI 439


>gb|ABD33261.1| RNA-directed DNA polymerase (Reverse transcriptase) [Medicago
           truncatula]
          Length = 402

 Score =  115 bits (287), Expect(2) = 4e-41
 Identities = 58/165 (35%), Positives = 91/165 (55%)
 Frame = +1

Query: 478 LIQEEKKLIMQLEHWSHIHERVLRQKPRATWIGHGNSNTKYFHASLKVRQSKNMVSSICN 657
           LI+ EK  +  LE WS I E++  QK RA WI  G+SNTK+FHA  K R+ +N +  +  
Sbjct: 5   LIEAEKICLSSLEKWSTIEEKIWMQKSRANWIQLGDSNTKFFHAYAKERRCQNNIKFLIT 64

Query: 658 NQDILLTDPQLVQYEFVQFF*GLLGQATKELPCIDVKIARDGH*LSYDKKQ*ILQPVTSS 837
                +    L++ E   F+  L+G +   LP +D  + + G  LS  ++  +    T+ 
Sbjct: 65  EDGTRIDKHNLIKEEIRGFYLKLMGSSVDSLPMVDKNVVKRGPMLSQHQQDLLCSKFTAV 124

Query: 838 DIVGVLKELPADKSPSIDGFPTDFFKEFWDILGDEITTVILDFFR 972
           ++  VL  + + K+P IDG+   FFK  W+I+GD +   ILDFF+
Sbjct: 125 EVKNVLFSMDSSKAPGIDGYNVHFFKCSWNIIGDSVIDAILDFFK 169



 Score = 81.6 bits (200), Expect(2) = 4e-41
 Identities = 42/84 (50%), Positives = 62/84 (73%)
 Frame = +3

Query: 963  FF*NRKLIKEINCTTITLVSKVSNPTSVKKFRLIACCTSVYKIIAKVLTVNLKLVVDSLV 1142
            FF    + K INCT +TL+ K  N TSVK FR IACC+ +YKII+K+LT  ++ V++S+V
Sbjct: 167  FFKTGFMPKIINCTYMTLLPKEVNVTSVKNFRPIACCSVIYKIISKILTSRMQGVLNSVV 226

Query: 1143 YPSQSAFIK*KNILNNVIVAYELV 1214
              +QSAF+K + I +N+I+++ELV
Sbjct: 227  SENQSAFVKGRVIFDNIILSHELV 250


>ref|XP_004252692.1| PREDICTED: uncharacterized protein LOC101261795 [Solanum
           lycopersicum]
          Length = 413

 Score =  109 bits (273), Expect(2) = 1e-38
 Identities = 82/318 (25%), Positives = 143/318 (44%), Gaps = 4/318 (1%)
 Frame = +1

Query: 28  QLNMKGCLFSWSNKRDANVGIYSLID*AFGNDHWFMAYSNLEAYYQTPQFFDHDPSIIRT 207
           ++  KG  ++W+NK+ +N  I S ID AFGN  W   + +       P   DH P     
Sbjct: 5   EVQWKGNYYTWTNKQISNARIASRIDRAFGNVTWMDKWGHAAIESGNPGVSDHIPMHFLL 64

Query: 208 EIEKQ*LPRPCKMLNVLLSQAVF*NTVLEGLRKEIPVHIMFKICKKLTKIQQAS*QMNR* 387
                 +    K+ NVL+    F   V +  +++    +M +I   L ++Q    Q+NR 
Sbjct: 65  HQSYHQIKVSFKLFNVLIEHKSFLELVDKVWKQKHGSEVMKEIWYNLKELQPVLRQLNR- 123

Query: 388 MTSLDVKLENIQEK-VDTIQRALKDDKFSL---QLIQEEKKLIMQLEHWSHIHERVLRQK 555
                   +NI++K ++ ++  L++  +S    +L  +EK L+++++ WS I E  LRQK
Sbjct: 124 -KEFQYIGQNIEKKRIELVE--LQEQLYSQASDELFTKEKDLLIKVDKWSMIEESALRQK 180

Query: 556 PRATWIGHGNSNTKYFHASLKVRQSKNMVSSICNNQDILLTDPQLVQYEFVQFF*GLLGQ 735
            RA WI  G++  KYF + +K R  K  + S                             
Sbjct: 181 ARARWITLGDAKNKYFSSVIKERNQKKHIRS----------------------------- 211

Query: 736 ATKELPCIDVKIARDGH*LSYDKKQ*ILQPVTSSDIVGVLKELPADKSPSIDGFPTDFFK 915
              +LP I+ ++ + G   S  ++  +   +T  +I   L+    DK+P IDG+   FFK
Sbjct: 212 ---KLPAINAQVMKRGPVSSRQQRIQLCTDITEQEIYSTLQSYGNDKAPGIDGYNALFFK 268

Query: 916 EFWDILGDEITTVILDFF 969
             W I+  ++   + +FF
Sbjct: 269 HTWKIIKKDVIEAVKNFF 286



 Score = 78.6 bits (192), Expect(2) = 1e-38
 Identities = 38/84 (45%), Positives = 57/84 (67%)
 Frame = +3

Query: 963  FF*NRKLIKEINCTTITLVSKVSNPTSVKKFRLIACCTSVYKIIAKVLTVNLKLVVDSLV 1142
            FF   KL K  NCT ++L+ KV  P +VK++  IACCT +YKII+KV+T  +  V+  ++
Sbjct: 285  FFTTGKLFKPFNCTLVSLIPKVQCPKTVKEYTPIACCTVLYKIISKVITRRMHDVIHDVI 344

Query: 1143 YPSQSAFIK*KNILNNVIVAYELV 1214
              SQ+ FI  + I +N+I+A+ELV
Sbjct: 345  CESQAGFIPGRKIADNIILAHELV 368


>dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis
            thaliana]
          Length = 1072

 Score =  100 bits (249), Expect(2) = 3e-33
 Identities = 84/331 (25%), Positives = 147/331 (44%), Gaps = 8/331 (2%)
 Frame = +1

Query: 1    SKLPKVCTGQLNMKGCLFSWSNKRDANVGIYSLID*AFGNDHWFMAYSNLEAYYQTPQFF 180
            S L ++    L  KG  F+W NK      I   +D    ND W   Y +    +    F 
Sbjct: 31   SCLSEMELSDLVFKGNSFTWWNKSSIRP-IAKKLDRILANDSWCNLYPSSHGLFGNLDFS 89

Query: 181  DHDPSIIRTEIEKQ*LPRPCKMLNVLLSQAVF*NTVLEG-LRKEIPVHIMFKICKKLTKI 357
            DH    +  E       RP K  N LL    F N V++      +    M+++ KKL  +
Sbjct: 90   DHVSCGVVLEANGISAKRPFKFFNFLLKNEDFLNVVMDNWFSTNVVGSSMYRVSKKLKAM 149

Query: 358  QQAS*QMNR*MTS-LDVKLENIQEKVDTIQR-ALKDDKFS---LQLIQEEKKLIMQLEHW 522
            ++     +R   S ++++ +   E + T Q   L +   S   L+L  + K +++     
Sbjct: 150  KKPIKDFSRLNYSGIELRTKEAHELLITCQNLTLANPSVSNAALELEAQRKWVLLSCAEE 209

Query: 523  SHIHERVLRQKPRATWIGHGNSNTKYFHASLKVRQSKNMVSSICNNQDILLTDPQLVQYE 702
            S  H+R      R +W   G+SNT YFH  +  R+S N ++S+ ++  +L+   Q +   
Sbjct: 210  SFFHQR-----SRVSWFAEGDSNTHYFHRMVDSRKSFNTINSLVDSNGLLIDSQQGILDH 264

Query: 703  FVQFF*GLLG--QATKELPCIDVKIARDGH*LSYDKKQ*ILQPVTSSDIVGVLKELPADK 876
             V ++  LLG  ++   +   D+ +    +  S D+   + +  T  +I    K LP +K
Sbjct: 265  CVTYYERLLGSIESPFSMEQEDMNLLLT-YRCSQDQCSELEKSFTDDEIKAAFKSLPRNK 323

Query: 877  SPSIDGFPTDFFKEFWDILGDEITTVILDFF 969
            +   DG+  +FF++ W I+G E+   I +FF
Sbjct: 324  TSGPDGYSVEFFRDTWSIIGPEVLAAIHEFF 354



 Score = 69.7 bits (169), Expect(2) = 3e-33
 Identities = 32/84 (38%), Positives = 60/84 (71%)
 Frame = +3

Query: 963  FF*NRKLIKEINCTTITLVSKVSNPTSVKKFRLIACCTSVYKIIAKVLTVNLKLVVDSLV 1142
            FF + +L+K+ N TT+ L+ K SN  ++ +FR I+C  ++YK+I+K+LT  L+ ++ +++
Sbjct: 353  FFDSGQLLKQWNATTLVLIPKTSNACTISEFRPISCLNTLYKVISKLLTSRLQGLLSAVI 412

Query: 1143 YPSQSAFIK*KNILNNVIVAYELV 1214
              SQSAF+  +++  NV++A E+V
Sbjct: 413  GHSQSAFLPGRSLAENVLLATEMV 436


>dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1072

 Score =  100 bits (249), Expect(2) = 3e-33
 Identities = 84/331 (25%), Positives = 147/331 (44%), Gaps = 8/331 (2%)
 Frame = +1

Query: 1    SKLPKVCTGQLNMKGCLFSWSNKRDANVGIYSLID*AFGNDHWFMAYSNLEAYYQTPQFF 180
            S L ++    L  KG  F+W NK      I   +D    ND W   Y +    +    F 
Sbjct: 31   SCLSEMELSDLVFKGNSFTWWNKSSIRP-IAKKLDRILANDSWCNLYPSSHGLFGNLDFS 89

Query: 181  DHDPSIIRTEIEKQ*LPRPCKMLNVLLSQAVF*NTVLEG-LRKEIPVHIMFKICKKLTKI 357
            DH    +  E       RP K  N LL    F N V++      +    M+++ KKL  +
Sbjct: 90   DHVSCGVVLEANGISAKRPFKFFNFLLKNEDFLNVVMDNWFSTNVVGSSMYRVSKKLKAM 149

Query: 358  QQAS*QMNR*MTS-LDVKLENIQEKVDTIQR-ALKDDKFS---LQLIQEEKKLIMQLEHW 522
            ++     +R   S ++++ +   E + T Q   L +   S   L+L  + K +++     
Sbjct: 150  KKPIKDFSRLNYSGIELRTKEAHELLITCQNLTLANPSVSNAALELEAQRKWVLLSCAEE 209

Query: 523  SHIHERVLRQKPRATWIGHGNSNTKYFHASLKVRQSKNMVSSICNNQDILLTDPQLVQYE 702
            S  H+R      R +W   G+SNT YFH  +  R+S N ++S+ ++  +L+   Q +   
Sbjct: 210  SFFHQR-----SRVSWFAEGDSNTHYFHRMVDSRKSFNTINSLVDSNGLLIDSQQGILDH 264

Query: 703  FVQFF*GLLG--QATKELPCIDVKIARDGH*LSYDKKQ*ILQPVTSSDIVGVLKELPADK 876
             V ++  LLG  ++   +   D+ +    +  S D+   + +  T  +I    K LP +K
Sbjct: 265  CVTYYERLLGSIESPFSMEQEDMNLLLT-YRCSQDQCSELEKSFTDDEIKAAFKSLPRNK 323

Query: 877  SPSIDGFPTDFFKEFWDILGDEITTVILDFF 969
            +   DG+  +FF++ W I+G E+   I +FF
Sbjct: 324  TSGPDGYSVEFFRDTWSIIGPEVLAAIHEFF 354



 Score = 69.7 bits (169), Expect(2) = 3e-33
 Identities = 32/84 (38%), Positives = 60/84 (71%)
 Frame = +3

Query: 963  FF*NRKLIKEINCTTITLVSKVSNPTSVKKFRLIACCTSVYKIIAKVLTVNLKLVVDSLV 1142
            FF + +L+K+ N TT+ L+ K SN  ++ +FR I+C  ++YK+I+K+LT  L+ ++ +++
Sbjct: 353  FFDSGQLLKQWNATTLVLIPKTSNACTISEFRPISCLNTLYKVISKLLTSRLQGLLSAVI 412

Query: 1143 YPSQSAFIK*KNILNNVIVAYELV 1214
              SQSAF+  +++  NV++A E+V
Sbjct: 413  GHSQSAFLPGRSLAENVLLATEMV 436


>dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 1223

 Score =  101 bits (252), Expect(2) = 3e-32
 Identities = 83/319 (26%), Positives = 143/319 (44%), Gaps = 9/319 (2%)
 Frame = +1

Query: 40   KGCLFSWSNKRDANVGIYSLID*AFGNDHWFMAYSNLEAYYQTPQFFDHDPSIIRTEIE- 216
            +G LF+W NKR+  + I   +D    ND W   +S   + ++     DH    I    E 
Sbjct: 183  QGPLFTWCNKREHGL-IMKKLDRVLINDCWNQTFSQSYSVFEAGGCSDHLRCRISLNSEA 241

Query: 217  --KQ*LPRPCKMLNVLLSQAVF*NTVLEGLRKEIPVHI----MFKICKKLTKIQQAS*QM 378
              K    +P K +N L     F   V    +   P+ +    +F+  K L  ++     M
Sbjct: 242  GNKVQGLKPFKFVNALTDMEDFKPMVSTYWKDTEPLILSTSTLFRFSKNLKGLKPKIRSM 301

Query: 379  NR*MTSLDVKLENIQEKVDTIQRALKDDKFSLQLIQEEKKLIMQLEHWSHIHERVLRQKP 558
             R       K  N   K+   ++ +     S   ++EE     + +  + + E+ L+QK 
Sbjct: 302  ARDRLGNLSKKANEAYKILCAKQHVNLTNPSSMAMEEENAAYSRWDRVAILEEKYLKQKS 361

Query: 559  RATWIGHGNSNTKYFHASLKVRQSKNMVSSICNNQDILLTDPQLVQYEFVQFF*GLLGQA 738
            +  W   G+ NTK FH +   R++ N +  I +N  I+ T    ++ E  +FF   L   
Sbjct: 362  KLHWCQVGDQNTKAFHRAAAAREAHNTIREILSNDGIVKTKGDEIKAEAERFFREFLQLI 421

Query: 739  TKELPCIDVKIARDGH*L--SYDKKQ*ILQPVTSSDIVGVLKELPADKSPSIDGFPTDFF 912
              +   + +   +    +  S   +Q +++PVT+ +I  VL  +P+DKSP  DG+ ++FF
Sbjct: 422  PNDFEGVTITELQQLLPVRCSDADQQSLIRPVTAEEIRKVLFRMPSDKSPGPDGYTSEFF 481

Query: 913  KEFWDILGDEITTVILDFF 969
            K  W+I+GDE T  +  FF
Sbjct: 482  KATWEIIGDEFTLAVQSFF 500



 Score = 65.1 bits (157), Expect(2) = 3e-32
 Identities = 32/84 (38%), Positives = 53/84 (63%)
 Frame = +3

Query: 963  FF*NRKLIKEINCTTITLVSKVSNPTSVKKFRLIACCTSVYKIIAKVLTVNLKLVVDSLV 1142
            FF    L K IN T + L+ K +    +K +R I+CC  +YK+I+K++   LKLV+   +
Sbjct: 499  FFTKGFLPKGINSTILALIPKKTEAREMKDYRPISCCNVLYKVISKIIANRLKLVLPKFI 558

Query: 1143 YPSQSAFIK*KNILNNVIVAYELV 1214
              +QSAF+K + ++ N+++A ELV
Sbjct: 559  AGNQSAFVKDRLLIENLLLATELV 582


>ref|XP_006586520.1| PREDICTED: uncharacterized protein LOC102662200 [Glycine max]
          Length = 490

 Score =  112 bits (279), Expect(2) = 4e-32
 Identities = 77/314 (24%), Positives = 146/314 (46%)
 Frame = +1

Query: 28  QLNMKGCLFSWSNKRDANVGIYSLID*AFGNDHWFMAYSNLEAYYQTPQFFDHDPSIIRT 207
           +++  G +F+W+NK+  N  IYS ID    N  WF  +S+       P  +    S I  
Sbjct: 46  EMDSSGEIFTWTNKQADNP-IYSRIDRILANIDWFQTHSDANLTILPPPMYLIITSFI-- 102

Query: 208 EIEKQ*LPRPCKMLNVLLSQAVF*NTVLEGLRKEIPVHIMFKICKKLTKIQQAS*QMNR* 387
                     C  L  +L + +  ++   G  + + + +   +   L +  Q        
Sbjct: 103 ----------CLSL-FMLEKEISSDSTTAGWMQWVFIVLWKDVGTNLLEALQCR------ 145

Query: 388 MTSLDVKLENIQEKVDTIQRALKDDKFSLQLIQEEKKLIMQLEHWSHIHERVLRQKPRAT 567
                      +EK+D  Q+ L+++      I+E K+L  ++ HW+ + E++L Q+ +  
Sbjct: 146 --------GCAREKLDQAQQDLRNNIMDAPRIEEVKRLTDEVIHWNEMEEKMLMQRSKID 197

Query: 568 WIGHGNSNTKYFHASLKVRQSKNMVSSICNNQDILLTDPQLVQYEFVQFF*GLLGQATKE 747
           WI  G+ N  +FHA LK RQ+   +  I  +   +LT  + +  E + F+  L+G  +  
Sbjct: 198 WIRAGDGNNAFFHAYLKSRQNAKRIKVIHKDDGTILTTHKEITQEVLAFYGKLMGHDSIS 257

Query: 748 LPCIDVKIARDGH*LSYDKKQ*ILQPVTSSDIVGVLKELPADKSPSIDGFPTDFFKEFWD 927
           L  +D+   R G  L+  +++ +++PVT  +I   L  +   K P +DG+ + FFK  W+
Sbjct: 258 LQHVDIYALRRGDHLTMVQREDLVRPVTVKEIEDALNGISDLKLPEVDGYSSKFFKSCWN 317

Query: 928 ILGDEITTVILDFF 969
           I+ +++     +FF
Sbjct: 318 IVKEDVVNAAQEFF 331



 Score = 54.3 bits (129), Expect(2) = 4e-32
 Identities = 28/65 (43%), Positives = 42/65 (64%)
 Frame = +3

Query: 963  FF*NRKLIKEINCTTITLVSKVSNPTSVKKFRLIACCTSVYKIIAKVLTVNLKLVVDSLV 1142
            FF   +L    N T +TLV K  N ++VK+++ IA CT+ YKI++K+LT  L  V+ S+V
Sbjct: 330  FFAQDQLFLPFNQTVVTLVPKSDNASTVKEYKPIAVCTTFYKIMSKILTARLNKVLPSVV 389

Query: 1143 YPSQS 1157
              SQ+
Sbjct: 390  SLSQA 394


>ref|XP_004247247.1| PREDICTED: uncharacterized protein LOC101256917 [Solanum
           lycopersicum]
          Length = 421

 Score = 88.2 bits (217), Expect(2) = 4e-32
 Identities = 53/166 (31%), Positives = 81/166 (48%)
 Frame = +1

Query: 475 QLIQEEKKLIMQLEHWSHIHERVLRQKPRATWIGHGNSNTKYFHASLKVRQSKNMVSSIC 654
           +LI +E++L ++LE WS I E   RQK RA WI  G++N KYF + +K R     + +I 
Sbjct: 17  KLITKEEELPIKLEKWSMIEESAQRQKARAKWIQLGDANNKYFSSVIKERTQNKHIRNIL 76

Query: 655 NNQDILLTDPQLVQYEFVQFF*GLLGQATKELPCIDVKIARDGH*LSYDKKQ*ILQPVTS 834
           +    +L +PQ +Q E V F+  L+G +                             VT 
Sbjct: 77  SIHGRMLYEPQEIQDEVVLFYKSLMGTSA----------------------------VTE 108

Query: 835 SDIVGVLKELPADKSPSIDGFPTDFFKEFWDILGDEITTVILDFFR 972
             I   L+ +  DK+P IDG+   FFK  W I+ ++I  V+  FF+
Sbjct: 109 EKIFAALQSIGNDKAPGIDGYNAFFFKYTWKIIKNDIIEVVQSFFK 154



 Score = 78.2 bits (191), Expect(2) = 4e-32
 Identities = 37/84 (44%), Positives = 57/84 (67%)
 Frame = +3

Query: 963  FF*NRKLIKEINCTTITLVSKVSNPTSVKKFRLIACCTSVYKIIAKVLTVNLKLVVDSLV 1142
            FF   KL K  NCT ++L+ KV +P +VK++R I CCT +YKII+KV+T  +  V+ +++
Sbjct: 152  FFKPGKLFKPFNCTLVSLIPKVQSPKNVKEYRTITCCTVLYKIISKVITNRMHDVIHNVI 211

Query: 1143 YPSQSAFIK*KNILNNVIVAYELV 1214
              SQ  FI  + I  N+++A+ELV
Sbjct: 212  CDSQVGFILGRKISENILLAHELV 235


>gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana]
          Length = 1161

 Score = 90.5 bits (223), Expect(2) = 1e-30
 Identities = 74/326 (22%), Positives = 141/326 (43%), Gaps = 13/326 (3%)
 Frame = +1

Query: 31   LNMKGCLFSWSNKRDANVGIYSLID*AFGNDHWFMAYSNLEAYYQTPQFFDHDPSIIRTE 210
            L  +G  F+WSN +  N  I   +D A  N  WF  + +  A +  P   DH P II  +
Sbjct: 223  LPSRGVFFTWSNHQQDNP-ILRKLDRALANGEWFAVFPSALAVFDPPGDSDHAPCIILID 281

Query: 211  IEKQ*LPRPCKMLNVLLSQAVF*NTVLEGLRKEIPVHI-MFKICKKLTKIQQAS*QMNR* 387
             +     +  K  + L S   +   +     +   V   MF + + L   +     +NR 
Sbjct: 282  NQPPPSKKSFKYFSFLSSHPSYLAALSTAWEENTLVGSHMFSLRQHLKVAKLCCRTLNR- 340

Query: 388  MTSLDVKLENIQEKVDTIQRALKDDKFSLQLIQEEKKLIMQLEH--------WSHIHERV 543
                 ++  NIQ++  T Q   + +   ++L+      + + EH        ++   E  
Sbjct: 341  -----LRFSNIQQR--TAQSLTRLEDIQVELLTSPSDTLFRREHVARKQWIFFAAALESF 393

Query: 544  LRQKPRATWIGHGNSNTKYFHASLKVRQSKNMVSSICNNQDILLTDPQLVQYEFVQFF*G 723
             RQK R  W+  G++NT++FH ++   Q+ N++  +  +    + +   ++   + ++  
Sbjct: 394  FRQKSRIRWLHEGDANTRFFHRAVIAHQATNLIKFLRGDDGFRVENVDQIKGMLIAYYSH 453

Query: 724  LLGQATKELPCIDVKIARDGH*LSYDKKQ*ILQPVTS----SDIVGVLKELPADKSPSID 891
            LLG  ++ +    V+  +    L +     +   +T+     +I  VL  +P +K+P  D
Sbjct: 454  LLGIPSENVTPFSVEKIKGL--LPFRCDSFLASQLTTIPSEEEITQVLFSMPRNKAPGPD 511

Query: 892  GFPTDFFKEFWDILGDEITTVILDFF 969
            GFP +FF E W I+   +   I +FF
Sbjct: 512  GFPVEFFIEAWAIVKSSVVAAIREFF 537



 Score = 71.2 bits (173), Expect(2) = 1e-30
 Identities = 34/84 (40%), Positives = 54/84 (64%)
 Frame = +3

Query: 963  FF*NRKLIKEINCTTITLVSKVSNPTSVKKFRLIACCTSVYKIIAKVLTVNLKLVVDSLV 1142
            FF +  L +  N T ITL+ KV+    + +FR +ACCT++YK+I ++++  LKL +D  V
Sbjct: 536  FFISGNLPRGFNATAITLIPKVTGADRLTQFRPVACCTTIYKVITRIISRRLKLFIDQAV 595

Query: 1143 YPSQSAFIK*KNILNNVIVAYELV 1214
              +Q  FIK + +  NV++A ELV
Sbjct: 596  QANQVGFIKGRLLCENVLLASELV 619


>ref|XP_006577697.1| PREDICTED: uncharacterized protein LOC102664381 [Glycine max]
          Length = 515

 Score =  140 bits (352), Expect = 1e-30
 Identities = 86/321 (26%), Positives = 158/321 (49%), Gaps = 7/321 (2%)
 Frame = +1

Query: 28   QLNMKGCLFSWSNKRDANVGIYSLID*AFGNDHWFMAYSNLEAYYQTPQFFDHDPSIIRT 207
            +++  G  F+W+NK+  N  IYS ID   GN +W   + +       P   DH    +  
Sbjct: 174  EMDTCGDFFTWTNKQADNT-IYSRIDRFLGNLNWLQMHIDSTLKILAPSVSDHALMFLSC 232

Query: 208  EIEKQ*LPRPCKMLNVLLSQAVF*NTVLEGLRKEIPVHIMFKICKKLTKIQQAS*QMNR* 387
            + +   L    K  N L     F + V +     +  + M+K+  KL+++Q         
Sbjct: 233  KDQSSRLRGRFKYRNSLARLNGFHDEVKKNWNLGVHGNPMYKLWTKLSRLQSV------- 285

Query: 388  MTSLDVKLENIQEKVDTIQRALKD-------DKFSLQLIQEEKKLIMQLEHWSHIHERVL 546
            + +L   L  ++EK+D  +R L+        D+F++  I   K    +L   + + +  L
Sbjct: 286  LKNLSSPLNGLREKIDEARRNLQQAHEDLCRDRFNVDNINRVKDRTSELLQLNELEDNDL 345

Query: 547  RQKPRATWIGHGNSNTKYFHASLKVRQSKNMVSSICNNQDILLTDPQLVQYEFVQFF*GL 726
            RQK +  WI  G+ N  YFHA++K R   N + S+       +T  + ++ E ++F+  L
Sbjct: 346  RQKAKINWIRQGDGNNSYFHATIKGRYKHNAIRSLIKEDGSCITSHEDIEEEVLKFYSAL 405

Query: 727  LGQATKELPCIDVKIARDGH*LSYDKKQ*ILQPVTSSDIVGVLKELPADKSPSIDGFPTD 906
            LG +   L  +++   R+G+ L+  ++  ++ PV++++I   +K +  +K+P IDG+   
Sbjct: 406  LGSSESNLAGLNIPAIRNGNTLNQFQRDMLIGPVSNAEIDTTIKGMDVNKTPGIDGYGVG 465

Query: 907  FFKEFWDILGDEITTVILDFF 969
            FFK+ W I+G ++   ILDFF
Sbjct: 466  FFKDAWSIVGSDVREAILDFF 486


>dbj|BAF00918.1| putative reverse transcriptase [Arabidopsis thaliana]
          Length = 910

 Score = 89.7 bits (221), Expect(2) = 2e-30
 Identities = 74/326 (22%), Positives = 140/326 (42%), Gaps = 13/326 (3%)
 Frame = +1

Query: 31   LNMKGCLFSWSNKRDANVGIYSLID*AFGNDHWFMAYSNLEAYYQTPQFFDHDPSIIRTE 210
            L  +G  F+WSN +  N  I   +D A  N  WF  + +  A +  P   DH P II  +
Sbjct: 180  LPSRGVFFTWSNHQQDNP-ILRKLDRALANGEWFAVFPSALAVFDPPGDSDHAPCIILID 238

Query: 211  IEKQ*LPRPCKMLNVLLSQAVF*NTVLEGLRKEIPVHI-MFKICKKLTKIQQAS*QMNR* 387
             +     +  K  + L S   +   +         V   MF + + L   +     +NR 
Sbjct: 239  NQPPPSKKSFKYFSFLSSHPSYLAALSTAWEANTLVGSHMFSLRQHLKVAKLCCRTLNR- 297

Query: 388  MTSLDVKLENIQEKVDTIQRALKDDKFSLQLIQEEKKLIMQLEH--------WSHIHERV 543
                 ++  NIQ++  T Q   + +   ++L+      + + EH        ++   E  
Sbjct: 298  -----LRFSNIQQR--TAQSLTRLEDIQVELLTSPSDTLFRREHVARKQWIFFAAALESF 350

Query: 544  LRQKPRATWIGHGNSNTKYFHASLKVRQSKNMVSSICNNQDILLTDPQLVQYEFVQFF*G 723
             RQK R  W+  G++NT++FH ++   Q+ N++  +  +    + +   ++   + ++  
Sbjct: 351  FRQKSRIRWLHEGDANTRFFHRAVIAHQATNLIKFLRGDDGFRVENVDQIKGMLIAYYSH 410

Query: 724  LLGQATKELPCIDVKIARDGH*LSYDKKQ*ILQPVTS----SDIVGVLKELPADKSPSID 891
            LLG  ++ +    V+  +    L +     +   +T+     +I  VL  +P +K+P  D
Sbjct: 411  LLGIPSENVTPFSVEKIKGL--LPFRCDSFLASQLTTIPSEEEITQVLFSMPRNKAPGPD 468

Query: 892  GFPTDFFKEFWDILGDEITTVILDFF 969
            GFP +FF E W I+   +   I +FF
Sbjct: 469  GFPVEFFIEAWAIVKSSVVAAIREFF 494



 Score = 71.2 bits (173), Expect(2) = 2e-30
 Identities = 34/84 (40%), Positives = 54/84 (64%)
 Frame = +3

Query: 963  FF*NRKLIKEINCTTITLVSKVSNPTSVKKFRLIACCTSVYKIIAKVLTVNLKLVVDSLV 1142
            FF +  L +  N T ITL+ KV+    + +FR +ACCT++YK+I ++++  LKL +D  V
Sbjct: 493  FFISGNLPRGFNATAITLIPKVTGADRLTQFRPVACCTTIYKVITRIISRRLKLFIDQAV 552

Query: 1143 YPSQSAFIK*KNILNNVIVAYELV 1214
              +Q  FIK + +  NV++A ELV
Sbjct: 553  QANQVGFIKGRLLCENVLLASELV 576


>gb|AAD12028.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1447

 Score = 99.0 bits (245), Expect(2) = 3e-30
 Identities = 84/325 (25%), Positives = 152/325 (46%), Gaps = 16/325 (4%)
 Frame = +1

Query: 43   GCLFSWSNKRDANVGIYSLID*AFGNDHWFMAYSNLEAYYQTPQFFDHDPSIIRTEIEKQ 222
            G L++WSNKR+ ++ I   +D    ND W  ++    + ++     DH    +R  I   
Sbjct: 592  GPLYTWSNKREHDL-IAKKLDRVMVNDVWTQSFPQSYSVFEAGGCLDH----LRGRINLN 646

Query: 223  *LP-------RPCKMLNVLLSQAVF*NTVLEGLRKEIPVHI----MFKICKKLTKIQQAS 369
              P       RP K +NVL     F  TV    ++  P+ +    +F+  KKL  ++   
Sbjct: 647  DGPGSIVRGKRPFKFVNVLTEMEDFKPTVDSYWKETEPIFLSTSSLFRFSKKLKSLKPLL 706

Query: 370  *QMNR*MTSLDVKLENIQEKVDTIQRALKD--DKFSLQLIQEEKKLIMQLEHWSHIHERV 543
              + +    L   ++  +E  DT+ +  +   +  +   ++EE +   + EH + + E+ 
Sbjct: 707  RNLAK--ERLGNLVKKTREAYDTLCKKQESTLNNPTPNAMKEEVEAHDRWEHVAGLEEKF 764

Query: 544  LRQKPRATWIGHGNSNTKYFHASLKVRQSKNMVSSI-CNNQDILLTDPQLVQYEFVQFF* 720
            L++K +  W+  G+ N K FH ++  R+++N +S I C +  +     ++  Y   +FF 
Sbjct: 765  LKKKSKLHWLDGGDKNNKAFHRAVVTREAQNSISEIQCQDGSVTAKGDEIKAYA-ERFFR 823

Query: 721  GLLGQATKELPCIDVKIARD--GH*LSYDKKQ*ILQPVTSSDIVGVLKELPADKSPSIDG 894
              L     E   + +   +D      S  + + + + VT+ +I  VL  +P DKSP  DG
Sbjct: 824  EFLQLIPNEYEGVTMADLQDLLPFRCSETEHELLTRVVTAEEIKKVLFSMPNDKSPGPDG 883

Query: 895  FPTDFFKEFWDILGDEITTVILDFF 969
            F ++FFK  W+ILG+E    I  FF
Sbjct: 884  FTSEFFKATWEILGNEFILAIQSFF 908



 Score = 61.2 bits (147), Expect(2) = 3e-30
 Identities = 30/80 (37%), Positives = 48/80 (60%)
 Frame = +3

Query: 963  FF*NRKLIKEINCTTITLVSKVSNPTSVKKFRLIACCTSVYKIIAKVLTVNLKLVVDSLV 1142
            FF    L K IN T + L+ K      +K +R I+CC  +YK+I+K++   LKLV    +
Sbjct: 907  FFAKGFLPKGINTTILALIPKKKEAKEMKDYRPISCCNVIYKVISKIIANRLKLVPPKFI 966

Query: 1143 YPSQSAFIK*KNILNNVIVA 1202
              +QSAF+K + ++ NV++A
Sbjct: 967  AGNQSAFVKDRLLIENVLLA 986


>gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus]
          Length = 1214

 Score = 90.1 bits (222), Expect(2) = 4e-30
 Identities = 80/318 (25%), Positives = 136/318 (42%), Gaps = 4/318 (1%)
 Frame = +1

Query: 31   LNMKGCLFSWSNKRDANVGIYSLID*AFGNDHWFMAYSNLEAYYQTPQFFDHDPSIIRTE 210
            L  +G  ++W N ++ N  I   ID    ND W +A       +   +F DH PS +   
Sbjct: 179  LPFRGNHYTWWNNQENNP-IAKKIDRILVNDSWLIASPLSYGSFCAMEFSDHCPSCVNIS 237

Query: 211  IEKQ*LPRPCKMLNVLLSQAVF*NTV-LEGLRKEIPVHIMFKICKKLTKIQQAS*QMNR* 387
             +     +P K+ N L+    F   + +   R       MF + KK   ++      NR 
Sbjct: 238  NQSGGRNKPFKLSNFLMHHPEFIEKIRVTWDRLAYQGSAMFTLSKKSKFLKGTIRTFNRE 297

Query: 388  MTS-LDVKLENIQEKVDTIQRALKDDKFSLQLIQEEKKLIMQLEHWSHIHERVLRQKPRA 564
              S L+ ++    + + T Q  L     S  L   EK+        +   ER L QK R 
Sbjct: 298  HYSGLEKRVVQAAQNLKTCQNNLLAAPSSY-LAGLEKEAHRSWAELALAEERFLCQKSRV 356

Query: 565  TWIGHGNSNTKYFHASLKVRQSKNMVSSICNNQDILLTDPQLVQYEFVQFF*GLLGQATK 744
             W+  G+SNT +FH  +  R++ N +  + +     + +   +Q   V FF  L G ++ 
Sbjct: 357  LWLKCGDSNTTFFHRMMTARRAINEIHYLLDQTGRRIENTDELQTHCVDFFKELFGSSSH 416

Query: 745  ELPCIDVKIARDGH*LSYDK--KQ*ILQPVTSSDIVGVLKELPADKSPSIDGFPTDFFKE 918
             +    +           D+  +Q +   V+ +DI      LP++KSP  DG+ ++FFK+
Sbjct: 417  LISAEGISQINSLTRFKCDENTRQLLEAEVSEADIKSEFFALPSNKSPGPDGYTSEFFKK 476

Query: 919  FWDILGDEITTVILDFFR 972
             W I+G  +   + +FFR
Sbjct: 477  TWSIVGPSLIAAVQEFFR 494



 Score = 69.7 bits (169), Expect(2) = 4e-30
 Identities = 33/84 (39%), Positives = 56/84 (66%)
 Frame = +3

Query: 963  FF*NRKLIKEINCTTITLVSKVSNPTSVKKFRLIACCTSVYKIIAKVLTVNLKLVVDSLV 1142
            FF + +L+ + N T +T+V K  N   + +FR I+CC ++YK+I+K+L   L+ ++   +
Sbjct: 492  FFRSGRLLGQWNSTAVTMVPKKPNADRITEFRPISCCNAIYKVISKLLARRLENILPLWI 551

Query: 1143 YPSQSAFIK*KNILNNVIVAYELV 1214
             PSQSAF+K + +  NV++A ELV
Sbjct: 552  SPSQSAFVKGRLLTENVLLATELV 575


>gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcript_fact.hmm, score:
            72.31) [Arabidopsis thaliana]
          Length = 928

 Score = 94.4 bits (233), Expect(2) = 4e-30
 Identities = 77/320 (24%), Positives = 148/320 (46%), Gaps = 11/320 (3%)
 Frame = +1

Query: 43   GCLFSWSNKRDANVGIYSLID*AFGNDHWFMAYSNLEAYYQTPQFFDHDPSIIRTEIEKQ 222
            G LF+WSNKR+ ++ I   +D    ND W  ++    + ++     DH    I   +   
Sbjct: 72   GPLFTWSNKRENDL-IAKKLDRVLVNDVWLQSFPRSYSVFEAGGCSDHLRCRINLNVGAG 130

Query: 223  *L---PRPCKMLNVLLSQAVF*NTVLEGLRKEIPVHI----MFKICKKLTKIQQAS*QMN 381
             +    RP K +NV+     F  TV     +   + +    +F+  KKL  ++     + 
Sbjct: 131  AVVKGKRPFKFVNVITEMEHFIPTVESYWNETEAIFMSTSSLFRFSKKLKGLKPLLRNLG 190

Query: 382  R*MTSLDVKLENIQEKVDTI--QRALKDDKFSLQLIQEEKKLIMQLEHWSHIHERVLRQK 555
            +    L   ++  +E  +T+  ++A+K    S   +QEE +   + +H + + E+ L+Q+
Sbjct: 191  K--ERLGNLVKQTKEAFETLCQKQAMKMANPSPSSMQEENEAYAKWDHIAVLEEKFLKQR 248

Query: 556  PRATWIGHGNSNTKYFHASLKVRQSKNMVSSICNNQDILLTDPQLVQYEFVQFF*GLLGQ 735
             +  W+  G+ N K FH ++  R+++N +  I  +   + +  + ++ E    F   L  
Sbjct: 249  SKLHWLDIGDRNNKAFHRAVVAREAQNSIREIICHDGSVASQEEKIKTEAEHHFREFLQL 308

Query: 736  ATKELPCIDVKIARD--GH*LSYDKKQ*ILQPVTSSDIVGVLKELPADKSPSIDGFPTDF 909
               +   I V+  +D   +  S   K+ +   V++ +I  V+  +P DKSP  DG+  +F
Sbjct: 309  IPNDFEGIAVEELQDLLPYRCSDSDKEMLTNHVSAEEIHKVVFSMPNDKSPGPDGYTAEF 368

Query: 910  FKEFWDILGDEITTVILDFF 969
            +K  W+I+G E    I  FF
Sbjct: 369  YKGAWNIIGAEFILAIQSFF 388



 Score = 65.5 bits (158), Expect(2) = 4e-30
 Identities = 32/84 (38%), Positives = 52/84 (61%)
 Frame = +3

Query: 963  FF*NRKLIKEINCTTITLVSKVSNPTSVKKFRLIACCTSVYKIIAKVLTVNLKLVVDSLV 1142
            FF    L K IN T + L+ K      +K +R I+CC  +YK+I+K++   LKLV+   +
Sbjct: 387  FFAKGFLPKGINSTILALIPKKKEAKEMKDYRPISCCNVLYKVISKIIANRLKLVLPKFI 446

Query: 1143 YPSQSAFIK*KNILNNVIVAYELV 1214
              +QSAF+K + ++ NV++A E+V
Sbjct: 447  VGNQSAFVKDRLLIENVLLATEIV 470


>gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00078 [Arabidopsis
            thaliana]
          Length = 1253

 Score = 95.9 bits (237), Expect(2) = 1e-29
 Identities = 87/327 (26%), Positives = 144/327 (44%), Gaps = 14/327 (4%)
 Frame = +1

Query: 31   LNMKGCLFSWSNKRDANVGIYSLID*AFGNDHWFMAYSNLEAYYQTPQFFDHDPS--IIR 204
            L  KG  F+W NK  A   +   +D    N+ W   + +  A +  P F DH     II 
Sbjct: 130  LVFKGNTFTWWNK-SATRPVAKKLDRILVNESWCSRFPSAYAVFGEPDFSDHASCGVIIN 188

Query: 205  TEIEKQ*LPRPCKMLNVLLSQAVF*NTVLEGLRKEIPV--HIMFKICKKLTKIQQAS*QM 378
              + ++   RP +  N LL    F + V E L   I V    MFK+ KKL  ++      
Sbjct: 189  PLMHRE--KRPFRFYNFLLQNPDFISLVGE-LWYSINVVGSSMFKMSKKLKALKNPIRTF 245

Query: 379  N-R*MTSLDVKLENIQEKVDTIQRALKDD----KFSLQLIQEEKKLIMQLEHWSHIHERV 543
            +    ++L+ +++     V   Q     D      +L++  + K LI+         E  
Sbjct: 246  SMENFSNLEKRVKEAHNLVLYRQNKTLSDPTIPNAALEMEAQRKWLILV-----KAEESF 300

Query: 544  LRQKPRATWIGHGNSNTKYFHASLKVRQSKNMVSSICNNQDILLTDPQLVQYEFVQFF*G 723
              Q+ R TW+G G+SNT YFH     R++ N +  I ++  + +     ++   +++F  
Sbjct: 301  FCQRSRVTWMGEGDSNTSYFHRMADSRKAVNTIHIIIDDNGVKIDTQLGIKEHCIEYFSN 360

Query: 724  LLGQATKELPCIDVKIARDGH*L-----SYDKKQ*ILQPVTSSDIVGVLKELPADKSPSI 888
            LLG          + I  D   L     S+D+K+ +    +  DI       P++K+   
Sbjct: 361  LLGGEVGP----PMLIQEDFDLLLPFRCSHDQKKELAMSFSRQDIKSAFFSFPSNKTSGP 416

Query: 889  DGFPTDFFKEFWDILGDEITTVILDFF 969
            DGFP +FFKE W ++G E+T  + +FF
Sbjct: 417  DGFPVEFFKETWSVIGTEVTDAVSEFF 443



 Score = 62.0 bits (149), Expect(2) = 1e-29
 Identities = 32/88 (36%), Positives = 57/88 (64%), Gaps = 4/88 (4%)
 Frame = +3

Query: 963  FF*NRKLIKEINCTTITLVSKVSNPTSVKKFRLIACCT----SVYKIIAKVLTVNLKLVV 1130
            FF +  L+K+ N TT+ L+ K++N + +  FR I+C      ++YK+IA++LT  L+ ++
Sbjct: 442  FFTSSVLLKQWNATTLVLIPKITNASKMNDFRPISCNDFGPITLYKVIARLLTNRLQCLL 501

Query: 1131 DSLVYPSQSAFIK*KNILNNVIVAYELV 1214
              ++ P QSAF+  + +  NV++A ELV
Sbjct: 502  SQVISPFQSAFLPGRFLAENVLLATELV 529


>emb|CCA66235.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1380

 Score = 92.4 bits (228), Expect(2) = 2e-29
 Identities = 63/293 (21%), Positives = 136/293 (46%), Gaps = 1/293 (0%)
 Frame = +1

Query: 94   SLID*AFGNDHWFMAYSNLEAYYQTPQFFDHDPSIIRTEIEKQ*LPRPCKMLNVLLSQAV 273
            S +D  F N  W   Y  L+         DH P ++ + +     P+P K  N  LS   
Sbjct: 192  SKLDRCFVNPEWLTHYPTLKLSLLNRGLSDHCPLLLNSSVRNWG-PKPFKFQNCWLSDPR 250

Query: 274  F*NTVLEGLRKEIPVHIMFKICKKLTKIQQAS*QMN-R*MTSLDVKLENIQEKVDTIQRA 450
                V +  +K  P+ ++    +KL  +++     N +   +++  ++ ++ +++ + + 
Sbjct: 251  CMRLVKDTWQKSSPMGLV----QKLKTVKKDLKDWNEKVFGNIEANIKQLEHEINQLDKI 306

Query: 451  LKDDKFSLQLIQEEKKLIMQLEHWSHIHERVLRQKPRATWIGHGNSNTKYFHASLKVRQS 630
              +       ++++KK  + L  W    E    Q+ R  W+  G+ NTK+FH    +R+ 
Sbjct: 307  SNERDLDSFELEKKKKAQVDLWSWMKTKESYWSQQSRIKWLKQGDRNTKFFHVVASIRKH 366

Query: 631  KNMVSSICNNQDILLTDPQLVQYEFVQFF*GLLGQATKELPCIDVKIARDGH*LSYDKKQ 810
            +N ++SI  N D  +++P+ ++ E +++F     + +   P ++     D   L+  +  
Sbjct: 367  RNSITSIEVNGD-KISEPEKIKLEAMKYFRKAFKEESYNRPLLE---GLDFKHLTEAQSA 422

Query: 811  *ILQPVTSSDIVGVLKELPADKSPSIDGFPTDFFKEFWDILGDEITTVILDFF 969
             ++ P +  +I   +    +DK+P  DGF   F K+ WD++ +EI   + +F+
Sbjct: 423  DLIAPFSHEEIDKAVASCSSDKAPGPDGFNFTFIKKAWDVIKEEIYETVQEFW 475



 Score = 65.1 bits (157), Expect(2) = 2e-29
 Identities = 34/84 (40%), Positives = 57/84 (67%)
 Frame = +3

Query: 963  FF*NRKLIKEINCTTITLVSKVSNPTSVKKFRLIACCTSVYKIIAKVLTVNLKLVVDSLV 1142
            F+ + +L K  N   I L+ K  +P   + FR I+    VYKI+AK+LT+ L+ V++SLV
Sbjct: 474  FWNSSRLPKGCNMAFIALIPKTDSPKGFQDFRPISMVGCVYKIVAKLLTMRLQKVMNSLV 533

Query: 1143 YPSQSAFIK*KNILNNVIVAYELV 1214
             P+QS+FI+ ++IL++ ++A EL+
Sbjct: 534  GPAQSSFIEGRHILDSALIAGELI 557


>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score = 86.7 bits (213), Expect(2) = 4e-29
 Identities = 86/321 (26%), Positives = 133/321 (41%), Gaps = 8/321 (2%)
 Frame = +1

Query: 31   LNMKGCLFSWSNKRDANVGIYSLID*AFGNDHWFMAYSNLEAYYQTPQFFDHDPSIIRTE 210
            L  KG  F+W NK      +   ID    ND W   + +    + +  F DH    +  E
Sbjct: 181  LRYKGNTFTWWNKSHTTP-VAKKIDRILVNDSWNALFPSSLGIFGSLDFSDHVSCGVVLE 239

Query: 211  IEKQ*LPRPCKMLNVLLSQAVF*NTVLEG-LRKEIPVHIMFKICKKLTKIQQAS*QMNR* 387
                   RP K  N LL    F N V +      +    MF++ KKL  +++     +R 
Sbjct: 240  ETSIKAKRPFKFFNYLLKNLDFLNLVRDNWFTLNVVGSSMFRVSKKLKALKKPIKDFSRL 299

Query: 388  MTS-LDVKLENIQEKVDTIQ-RALKDD---KFSLQLIQEEKKLIMQLEHWSHIHERVLRQ 552
              S L+ + +   + +   Q R L D      S +L  E K  I+     +   E   RQ
Sbjct: 300  NYSELEKRTKEAHDFLIGCQDRTLADPTPINASFELEAERKWHIL-----TAAEESFFRQ 354

Query: 553  KPRATWIGHGNSNTKYFHASLKVRQSKNMVSSICNNQDILLTDPQLVQYEFVQFF*GLLG 732
            K R +W   G+ NTKYFH     R S N +S++ +    L+   + +      +F  LLG
Sbjct: 355  KSRISWFAEGDGNTKYFHRMADARNSSNSISALYDGNGKLVDSQEGILDLCASYFGSLLG 414

Query: 733  QATKE--LPCIDVKIARDGH*LSYDKKQ*ILQPVTSSDIVGVLKELPADKSPSIDGFPTD 906
                   +   D+ +    +  S  +   +    ++ DI   L  LP +KS   DGF  +
Sbjct: 415  DEVDPYLMEQNDMNLLL-SYRCSPAQVCELESTFSNEDIRAALFSLPRNKSCGPDGFTAE 473

Query: 907  FFKEFWDILGDEITTVILDFF 969
            FF + W I+G E+T  I +FF
Sbjct: 474  FFIDSWSIVGAEVTDAIKEFF 494



 Score = 69.7 bits (169), Expect(2) = 4e-29
 Identities = 33/84 (39%), Positives = 57/84 (67%)
 Frame = +3

Query: 963  FF*NRKLIKEINCTTITLVSKVSNPTSVKKFRLIACCTSVYKIIAKVLTVNLKLVVDSLV 1142
            FF +  L+K+ N TTI L+ K+ NPT    FR I+C  ++YK+IA++LT  L+ ++  ++
Sbjct: 493  FFSSGCLLKQWNATTIVLIPKIVNPTCTSDFRPISCLNTLYKVIARLLTDRLQRLLSGVI 552

Query: 1143 YPSQSAFIK*KNILNNVIVAYELV 1214
              +QSAF+  +++  NV++A +LV
Sbjct: 553  SSAQSAFLPGRSLAENVLLATDLV 576

