BLASTX nr result

ID: Mentha23_contig00035812 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha23_contig00035812
         (739 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...   109   1e-21
emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...    99   2e-18
ref|XP_006576082.1| PREDICTED: uncharacterized protein LOC102659...    96   2e-17
ref|XP_004240779.1| PREDICTED: uncharacterized protein LOC101256...    94   6e-17
ref|XP_004239563.1| PREDICTED: uncharacterized protein LOC101259...    93   1e-16
dbj|BAF00918.1| putative reverse transcriptase [Arabidopsis thal...    91   4e-16
gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana]               89   1e-15
ref|XP_006595271.1| PREDICTED: uncharacterized protein LOC100781...    89   2e-15
gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm,...    88   3e-15
ref|XP_007207581.1| hypothetical protein PRUPE_ppa018489mg, part...    87   8e-15
ref|XP_004305774.1| PREDICTED: uncharacterized protein LOC101293...    86   1e-14
emb|CAA18234.1| putative protein [Arabidopsis thaliana] gi|72694...    85   2e-14
ref|XP_006582542.1| PREDICTED: uncharacterized protein LOC102668...    84   4e-14
gb|AAF63113.1|AC006423_14 Hypothetical protein [Arabidopsis thal...    84   5e-14
gb|AAF63129.1|AC009526_14 Similar to reverse transcriptase [Arab...    84   5e-14
ref|NP_175044.1| DNAse I-like superfamily protein [Arabidopsis t...    84   5e-14
ref|XP_006598323.1| PREDICTED: uncharacterized protein LOC102660...    81   3e-13
ref|XP_004253414.1| PREDICTED: uncharacterized protein LOC101253...    81   3e-13
dbj|BAB01845.1| non-LTR retroelement reverse transcriptase-like ...    81   4e-13
emb|CAA66812.1| non-ltr retrotransposon reverse transcriptase-li...    81   4e-13

>emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1110

 Score =  109 bits (272), Expect = 1e-21
 Identities = 77/238 (32%), Positives = 114/238 (47%), Gaps = 1/238 (0%)
 Frame = +3

Query: 27  SDHSACIASLFEQVPQFRKTFKFCNFWMDNPRFKEILDSAWSCNNPRAQGQYLLSSKLKT 206
           SDHS  + +L    PQ  K FKF N   +   F E ++ AW+  N R + Q +  + LK 
Sbjct: 222 SDHSPLLFNLMTGRPQGGKPFKFMNVMAEQGEFLETVEKAWNSVNGRFKLQAIWLN-LKA 280

Query: 207 LRKALWELKTKEYNNLSEKVEASRAALQHTQQMCDLDPLNREIRHQEISTRKHCQFLEKA 386
           +++ L ++KT++     EKV+  R  LQ  Q   D D  N  ++    S     +     
Sbjct: 281 VKRELKQMKTQKIGLAHEKVKNLRHQLQDLQSQDDFDH-NDIMQTDAKSIMNDLRHWSHI 339

Query: 387 ERSFFAQRNKTKHLTLFDKNSKFFYSKVNRANARNSIPFLCREDGSVTRDINIIIEDFTM 566
           E S   Q+++   L   D NSK F++ V   +A N I  L  EDG V +D + + E+   
Sbjct: 340 EDSILQQKSRITWLQQGDTNSKLFFTAVKARHAINRIDMLNTEDGRVIQDADEVQEEILE 399

Query: 567 FYNNLFGASVPR-SPLDWNAARSGPMLNCTEQSALVVPVDTKEIKDVLFSIGNDKAPG 737
           FY  L G        +D N  R G  L+   + +L+  V + EI + L  IGNDKAPG
Sbjct: 400 FYKKLLGTRASTLMGVDLNTVRGGKCLSAQAKESLIREVASTEIDEALAGIGNDKAPG 457


>emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1114

 Score = 98.6 bits (244), Expect = 2e-18
 Identities = 68/238 (28%), Positives = 115/238 (48%), Gaps = 1/238 (0%)
 Frame = +3

Query: 27  SDHSACIASLFEQVPQFRKTFKFCNFWMDNPRFKEILDSAWSCNNPRAQGQYLLSSKLKT 206
           SDHS  I +L  Q  +  + FKF NF  D   F E++  AW   N R + + +   +L+ 
Sbjct: 225 SDHSPLIFNLATQHDEGGRPFKFLNFLADQNGFVEVVKEAWGSANHRFKMKNIWV-RLQA 283

Query: 207 LRKALWELKTKEYNNLSEKVEASRAALQHTQQMCDLDPLNREIRHQEISTRKHCQFLEKA 386
           +++AL    +K+++    +VE  R  L   Q + ++  ++ E++ +E       +     
Sbjct: 284 VKRALKSFHSKKFSKAHCQVEELRRKLAAVQALPEVSQVS-ELQEEEKDLIAQLRKWSTI 342

Query: 387 ERSFFAQRNKTKHLTLFDKNSKFFYSKVNRANARNSIPFLCREDGSVTRDINIIIEDFTM 566
           + S   Q+++ + L+L D NSKFF++ +    ARN I  L  + G    +   I  +   
Sbjct: 343 DESILKQKSRIQWLSLGDSNSKFFFTAIKVRKARNKIVLLQNDRGDQLTENTEIQNEICN 402

Query: 567 FYNNLFGASVPR-SPLDWNAARSGPMLNCTEQSALVVPVDTKEIKDVLFSIGNDKAPG 737
           FY  L G S  +   +D +  R G  L+ T  + LV P+  +EI   L  I + KAPG
Sbjct: 403 FYRRLLGTSSSQLEAIDLHVVRVGAKLSATSCAQLVQPITIQEIDQALADIDDTKAPG 460


>ref|XP_006576082.1| PREDICTED: uncharacterized protein LOC102659506 [Glycine max]
          Length = 964

 Score = 95.5 bits (236), Expect = 2e-17
 Identities = 67/237 (28%), Positives = 105/237 (44%)
 Frame = +3

Query: 27   SDHSACIASLFEQVPQFRKTFKFCNFWMDNPRFKEILDSAWSCNNPRAQGQYLLSSKLKT 206
            SDH+  + +    VP+    FKF N  +D+P F  I+   W   N      + +  KLK 
Sbjct: 593  SDHTPLVVTTELVVPRGNSPFKFNNLIVDHPNFLRIVADGWK-QNIHGCSMFKVCKKLKA 651

Query: 207  LRKALWELKTKEYNNLSEKVEASRAALQHTQQMCDLDPLNREIRHQEISTRKHCQFLEKA 386
            L+  L  L  +E++N+S +VE + A           +P +  +      TR     L KA
Sbjct: 652  LKAPLKNLFKQEFSNISNRVELAEAEYNSVLNSIKQNPQDPSLLALANRTRGQTIMLRKA 711

Query: 387  ERSFFAQRNKTKHLTLFDKNSKFFYSKVNRANARNSIPFLCREDGSVTRDINIIIEDFTM 566
            E   FAQ  K K+L   DK SKFF++ + R      I  +  EDG  T   + I   F  
Sbjct: 712  ESMKFAQLIKNKYLLQADKCSKFFHALIKRNKHSRFIAAIRLEDGHNTSSQDEIALAFVN 771

Query: 567  FYNNLFGASVPRSPLDWNAARSGPMLNCTEQSALVVPVDTKEIKDVLFSIGNDKAPG 737
             + N F A         +    GP +     +AL+ P   +++ +++  + N+KAPG
Sbjct: 772  HFRNFFSAHELTQTPSISICNRGPKVPTDCFAALLCPTSKQKVWNIISVMANNKAPG 828


>ref|XP_004240779.1| PREDICTED: uncharacterized protein LOC101256493 [Solanum
           lycopersicum]
          Length = 441

 Score = 93.6 bits (231), Expect = 6e-17
 Identities = 66/239 (27%), Positives = 113/239 (47%), Gaps = 2/239 (0%)
 Frame = +3

Query: 27  SDHSACIASLFEQVPQFRKTFKFCNFWMDNPRFKEILDSAWSCNNPRAQGQYLLSSKLKT 206
           SDHS     L +     R +FKF N W ++  F ++++  W     R      +  KLK 
Sbjct: 130 SDHSTMQLVLHQSNQHVRASFKFFNIWTEHDLFLDLVEKVWKQEKDR-DAIKKVWYKLKA 188

Query: 207 LRKALWELKTKEYNNLSEKVEASRAALQHTQ-QMCDLDPLNREIRHQEISTRKHCQFLEK 383
           L+  L +L  KE+  +S ++E +R  L   Q Q+C         + +E+ T+   + L  
Sbjct: 189 LQPVLKQLNRKEFKYISNQIEEARNELIDIQNQLCHQAKDELVTKEKELLTK--LEKLSL 246

Query: 384 AERSFFAQRNKTKHLTLFDKNSKFFYSKVNRANARNSIPFLCREDGSVTRDINIIIEDFT 563
            + S   Q+ + K + L D N+K+  S +   N + +I  L   DG    +   I ++F 
Sbjct: 247 IKESALRQKVRAKWIKLGDANNKYLSSVIKERNHKKNIRILMSLDGRKLSEPQEIQDEFV 306

Query: 564 MFYNNLFGASVPR-SPLDWNAARSGPMLNCTEQSALVVPVDTKEIKDVLFSIGNDKAPG 737
           +F  +L G +    S ++    + GP+L+   +  L   +  +EI + L SIGN+KAPG
Sbjct: 307 LFDKSLMGTAANNLSAINVQVMKRGPVLSRQHRIQLCATITDQEIVEALKSIGNEKAPG 365


>ref|XP_004239563.1| PREDICTED: uncharacterized protein LOC101259634 [Solanum
           lycopersicum]
          Length = 425

 Score = 92.8 bits (229), Expect = 1e-16
 Identities = 65/235 (27%), Positives = 115/235 (48%), Gaps = 5/235 (2%)
 Frame = +3

Query: 27  SDHSACIASLFEQVPQFRKTFKFCNFWMDNPRFKEILDSAWSCNNPRAQGQYLLSSKLKT 206
           SDHS+    L +   Q R +FKF N W ++  F E++++ W  N  R     ++  KLK 
Sbjct: 197 SDHSSMQLLLHQNYQQVRASFKFFNVWTEHESFLELVETVWKQNKGR-DAMKMVWYKLKA 255

Query: 207 LRKALWELKTKEYNNLSEKVEASRAALQHTQ-QMCDLDPLNREIRHQEISTRKHCQFLEK 383
           L+  L +L  +E+  + +++E +R  L   Q Q+C+    +   + +++ T+     LEK
Sbjct: 256 LQPVLKQLNRREFKYIGKQIEEARNDLADIQNQLCNQANDDLVTKEKDLLTK-----LEK 310

Query: 384 ---AERSFFAQRNKTKHLTLFDKNSKFFYSKVNRANARNSIPFLCREDGSVTRDINIIIE 554
               E S   Q+ + K + L D N+K+F S +   N +  I  L   DG +  D   I +
Sbjct: 311 WSLIEESSLRQKARAKWIKLGDANNKYFSSVIKERNYKKHIRSLMSIDGKMLYDPQEIQD 370

Query: 555 DFTMFYNNLFGASVPRSP-LDWNAARSGPMLNCTEQSALVVPVDTKEIKDVLFSI 716
           +F +FY +L G +    P ++    + G +L+   +  L   +  +EI + L SI
Sbjct: 371 EFVLFYKSLMGTAADNLPAINVRVMKRGHVLSRQHRIQLCATITDQEIAEALKSI 425


>dbj|BAF00918.1| putative reverse transcriptase [Arabidopsis thaliana]
          Length = 910

 Score = 90.9 bits (224), Expect = 4e-16
 Identities = 66/240 (27%), Positives = 106/240 (44%), Gaps = 3/240 (1%)
 Frame = +3

Query: 27  SDHSACIASLFEQVPQFRKTFKFCNFWMDNPRFKEILDSAWSCNNPRAQGQYLLSSKLKT 206
           SDH+ CI  +  Q P  +K+FK+ +F   +P +   L +AW  N       + L   LK 
Sbjct: 228 SDHAPCIILIDNQPPPSKKSFKYFSFLSSHPSYLAALSTAWEANTLVGSHMFSLRQHLKV 287

Query: 207 LRKALWELKTKEYNNLSEKVEASRAALQHTQQMCDLDPLNREIRHQEISTRKHCQFLEKA 386
            +     L    ++N+ ++   S   L+  Q      P +   R + ++ RK   F   A
Sbjct: 288 AKLCCRTLNRLRFSNIQQRTAQSLTRLEDIQVELLTSPSDTLFRREHVA-RKQWIFFAAA 346

Query: 387 ERSFFAQRNKTKHLTLFDKNSKFFYSKVNRANARNSIPFLCREDGSVTRDINIIIEDFTM 566
             SFF Q+++ + L   D N++FF+  V    A N I FL  +DG    +++ I      
Sbjct: 347 LESFFRQKSRIRWLHEGDANTRFFHRAVIAHQATNLIKFLRGDDGFRVENVDQIKGMLIA 406

Query: 567 FYNNLFG-ASVPRSPLDWNAARSGPMLNCTE--QSALVVPVDTKEIKDVLFSIGNDKAPG 737
           +Y++L G  S   +P      +      C     S L      +EI  VLFS+  +KAPG
Sbjct: 407 YYSHLLGIPSENVTPFSVEKIKGLLPFRCDSFLASQLTTIPSEEEITQVLFSMPRNKAPG 466


>gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana]
          Length = 1161

 Score = 89.4 bits (220), Expect = 1e-15
 Identities = 66/240 (27%), Positives = 106/240 (44%), Gaps = 3/240 (1%)
 Frame = +3

Query: 27  SDHSACIASLFEQVPQFRKTFKFCNFWMDNPRFKEILDSAWSCNNPRAQGQYLLSSKLKT 206
           SDH+ CI  +  Q P  +K+FK+ +F   +P +   L +AW  N       + L   LK 
Sbjct: 271 SDHAPCIILIDNQPPPSKKSFKYFSFLSSHPSYLAALSTAWEENTLVGSHMFSLRQHLKV 330

Query: 207 LRKALWELKTKEYNNLSEKVEASRAALQHTQQMCDLDPLNREIRHQEISTRKHCQFLEKA 386
            +     L    ++N+ ++   S   L+  Q      P +   R + ++ RK   F   A
Sbjct: 331 AKLCCRTLNRLRFSNIQQRTAQSLTRLEDIQVELLTSPSDTLFRREHVA-RKQWIFFAAA 389

Query: 387 ERSFFAQRNKTKHLTLFDKNSKFFYSKVNRANARNSIPFLCREDGSVTRDINIIIEDFTM 566
             SFF Q+++ + L   D N++FF+  V    A N I FL  +DG    +++ I      
Sbjct: 390 LESFFRQKSRIRWLHEGDANTRFFHRAVIAHQATNLIKFLRGDDGFRVENVDQIKGMLIA 449

Query: 567 FYNNLFG-ASVPRSPLDWNAARSGPMLNCTE--QSALVVPVDTKEIKDVLFSIGNDKAPG 737
           +Y++L G  S   +P      +      C     S L      +EI  VLFS+  +KAPG
Sbjct: 450 YYSHLLGIPSENVTPFSVEKIKGLLPFRCDSFLASQLTTIPSEEEITQVLFSMPRNKAPG 509


>ref|XP_006595271.1| PREDICTED: uncharacterized protein LOC100781932 [Glycine max]
          Length = 952

 Score = 88.6 bits (218), Expect = 2e-15
 Identities = 63/237 (26%), Positives = 102/237 (43%)
 Frame = +3

Query: 27   SDHSACIASLFEQVPQFRKTFKFCNFWMDNPRFKEILDSAWSCNNPRAQGQYLLSSKLKT 206
            SDH+  + +    VP+    FKF N  +D+P F  I+   W   N      + +  KLK 
Sbjct: 668  SDHTPLVVTTKLVVPRGNSPFKFNNAIVDHPNFSRIVADGWK-QNIHGCSMFKVCKKLKV 726

Query: 207  LRKALWELKTKEYNNLSEKVEASRAALQHTQQMCDLDPLNREIRHQEISTRKHCQFLEKA 386
            L+ +L  L  +E++N+S +VE +             +P +  +      TR       K 
Sbjct: 727  LKASLKNLFKQEFSNISNRVELAEVEYNSVLNSLKQNPQDHSLLALANRTRGQTIMFRKV 786

Query: 387  ERSFFAQRNKTKHLTLFDKNSKFFYSKVNRANARNSIPFLCREDGSVTRDINIIIEDFTM 566
            E   FAQ  K ++L   D  SKFF++ + R      I  +  EDG  T   + I   F  
Sbjct: 787  ESMKFAQLIKNRYLLQVDICSKFFHALIKRNRHSRFIAAIRLEDGHNTSSQDEIALAFVN 846

Query: 567  FYNNLFGASVPRSPLDWNAARSGPMLNCTEQSALVVPVDTKEIKDVLFSIGNDKAPG 737
             + NLF A         +    G  +     + ++ P   +E+ +V+F + N+KAPG
Sbjct: 847  HFRNLFSAHELTQTPSISICNRGLKVPTDCFATILCPTSKQEVWNVIFVMDNNKAPG 903


>gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm, score: 60.13)
           [Arabidopsis thaliana]
          Length = 1164

 Score = 88.2 bits (217), Expect = 3e-15
 Identities = 69/243 (28%), Positives = 113/243 (46%), Gaps = 6/243 (2%)
 Frame = +3

Query: 27  SDHSACIASLFEQVPQFRKTFKFCNFWMDNPRFKEILDSAWSCNNPRAQGQYLLSSKLKT 206
           SDHS+C  SL    P+ +K F+F NF + +  F  ++   W   +      Y +S KLK 
Sbjct: 126 SDHSSCELSLMSASPRSKKPFRFNNFLLKDENFLSLICLKWFSTSVTGSAMYRVSVKLKA 185

Query: 207 LRKALWELKTKEYNNLSEKVEASRAALQHTQQ--MCDLDPLNREIRHQEISTRKHCQFLE 380
           L+K + +     Y+++ ++ + +  AL   Q   +    P N  I   E  T++  + L 
Sbjct: 186 LKKVIRDFSRDNYSDIEKRTKEAHDALLLAQSVLLASPCPSNAAI---EAETQRKWRILA 242

Query: 381 KAERSFFAQRNKTKHLTLFDKNSKFFYSKVNRANARNSIPFLCREDGS-VTRDINIIIED 557
           +AE SFF QR++   L   D NS +F+   +   + N I FL    G  +    N+    
Sbjct: 243 EAEASFFYQRSRVNWLREGDMNSSYFHKMASARQSLNHIHFLSDPVGDRIEGQQNLENHC 302

Query: 558 FTMFYNNLFGASVPRSPLDWNAARSGPM-LNCT--EQSALVVPVDTKEIKDVLFSIGNDK 728
              F +NL   S    PL   A  S  +   C+  +Q +L  P  +++IK+  FS+  +K
Sbjct: 303 VEYFQSNL--GSEQGLPLFEQADISNLLSYRCSPAQQVSLDTPFSSEQIKNAFFSLPRNK 360

Query: 729 APG 737
           A G
Sbjct: 361 ASG 363


>ref|XP_007207581.1| hypothetical protein PRUPE_ppa018489mg, partial [Prunus persica]
           gi|462403223|gb|EMJ08780.1| hypothetical protein
           PRUPE_ppa018489mg, partial [Prunus persica]
          Length = 1146

 Score = 86.7 bits (213), Expect = 8e-15
 Identities = 64/244 (26%), Positives = 107/244 (43%), Gaps = 7/244 (2%)
 Frame = +3

Query: 27  SDHSACIASLFEQVPQFRKTFKFCNFWMDNPRFKEILDSAWSCNNPRAQGQYLLSSKLKT 206
           SDHS  +     ++ +    FKF  +W D      I+   W  N+        LS+ L  
Sbjct: 128 SDHSPLVLYFAPKIQRRAGGFKFEAYWADEHDCGTIIQRGWK-NDIVGDSFAALSANLGV 186

Query: 207 LRKALWELKTKEYNNLSEKVEASRAALQHTQQMCDLDPLNREIRHQEISTRKHCQFLEKA 386
            R+ L +   +++ N   ++     +L + Q      PL    RHQE +       L   
Sbjct: 187 CREELQKWSKEKFPNNLSRINLLMKSLSNLQS----GPLEENYRHQESAIWDEMSVLWSR 242

Query: 387 ERSFFAQRNKTKHLTLFDKNSKFFYSKVNRANARNSIPFLCREDGSVTRDINIIIEDFTM 566
           E +++ QR++   L+  D N+KFF++   +   RN I  L + +G+       I E+F +
Sbjct: 243 EETYWKQRSRLNWLSAGDANTKFFHTTTLQRRQRNKIETLLKSEGNCISGDQAIREEFGI 302

Query: 567 FYNNLFGASVPRSPLDWNAARSGPMLNCTEQS-------ALVVPVDTKEIKDVLFSIGND 725
           F+ NLF +  PR   +W     G +LNC   S        L  P   +E++  +  +G+ 
Sbjct: 303 FFGNLFKSGGPR---NW-----GGILNCVHASITEAQNKRLTDPFSMEEVRTAVKQLGSL 354

Query: 726 KAPG 737
           KAPG
Sbjct: 355 KAPG 358


>ref|XP_004305774.1| PREDICTED: uncharacterized protein LOC101293221 [Fragaria vesca
           subsp. vesca]
          Length = 461

 Score = 86.3 bits (212), Expect = 1e-14
 Identities = 60/238 (25%), Positives = 110/238 (46%), Gaps = 2/238 (0%)
 Frame = +3

Query: 30  DHSACIASLFEQVPQFRKTFKFCNFWMDNPRFKEILDSAWSCNNPRAQGQYLLSSKLKTL 209
           DH+  I S  +  P   K F+F + W+++P F++ + + W+ +       Y++  KLK L
Sbjct: 91  DHTPLIFSASKLSPCGPKPFRFQSMWLNHPTFRDTIATCWTSSKFWGWPMYVIVQKLKAL 150

Query: 210 RKALWELKTKEYNNLSEKVEASRAALQHTQQMCDLDPLNREIRHQEISTRKHCQFLEKAE 389
           +  L       + ++ + V  +R AL   QQ   +  +  +    E+  +       K +
Sbjct: 151 KSCLRNWNKMVFGDVHQNVNKAREALSAIQQDIAIHGMTDQKFEDEVDAKFRVLNAVKMQ 210

Query: 390 RSFFAQRNKTKHLTLFDKNSKFF--YSKVNRANARNSIPFLCREDGSVTRDINIIIEDFT 563
            S++  R + K LT  D+++ FF  Y+KV  A+AR    F   +   +  + + I+    
Sbjct: 211 ESYWKDRARVKWLTDGDRSTSFFHAYAKVRSASAR---MFSIHDGERILFEPSDIVAHVV 267

Query: 564 MFYNNLFGASVPRSPLDWNAARSGPMLNCTEQSALVVPVDTKEIKDVLFSIGNDKAPG 737
            FY NL+ +S     LD   +    ++   E   L V   T+EIK+ +F++    APG
Sbjct: 268 GFYQNLYSSSSTPRNLDEVCSVIPSLVTNAENDWLTVIPSTEEIKNAVFAMDASSAPG 325


>emb|CAA18234.1| putative protein [Arabidopsis thaliana] gi|7269488|emb|CAB79491.1|
           putative protein [Arabidopsis thaliana]
          Length = 1141

 Score = 85.1 bits (209), Expect = 2e-14
 Identities = 59/239 (24%), Positives = 106/239 (44%), Gaps = 2/239 (0%)
 Frame = +3

Query: 27  SDHSACIASLFEQVPQFRKTFKFCNFWMDNPRFKEILDSAWSCNNPRAQGQYLLSSKLKT 206
           SDH++C   L     + ++ FKF NF + NP F  ++   W   N      + +S KLK 
Sbjct: 221 SDHASCGVVLELDPIKAKRPFKFFNFLLKNPEFLNLVWDVWYSTNVVGSSMFRVSKKLKA 280

Query: 207 LRKALWELKTKEYNNLSEKVEASRAALQHTQQMCDLDPLNREIRHQEISTRKHCQFLEKA 386
           L+K + +     Y+NL ++ E +   L   Q +  LD  + E    E+  ++  Q L  A
Sbjct: 281 LKKPIKDFSRLNYSNLEKRTEEAHETLLSFQNL-TLDNPSLENAAHELEAQRKWQILATA 339

Query: 387 ERSFFAQRNKTKHLTLFDKNSKFFYSKVNRANARNSIPFLCREDGSVTRDINIIIEDFTM 566
           E SFF QR++       D N+++F+   +   + N+I  L  + G+       I +   +
Sbjct: 340 EESFFRQRSRVTWFAEGDGNTRYFHRMADSRKSVNTITTLVDDSGTQIDSQQGIADHCAL 399

Query: 567 FYNNLFGASVPRSPLDWNAARSGPMLNC--TEQSALVVPVDTKEIKDVLFSIGNDKAPG 737
           ++ NL         L+ +         C  ++ + L      ++IK   F + ++KA G
Sbjct: 400 YFENLLSDDNDPYSLEQDDMNLLLTYRCPYSQVADLEAMFSDEDIKAAFFGLPSNKACG 458


>ref|XP_006582542.1| PREDICTED: uncharacterized protein LOC102668030 [Glycine max]
          Length = 411

 Score = 84.3 bits (207), Expect = 4e-14
 Identities = 59/188 (31%), Positives = 84/188 (44%)
 Frame = +3

Query: 27  SDHSACIASLFEQVPQFRKTFKFCNFWMDNPRFKEILDSAWSCNNPRAQGQYLLSSKLKT 206
           SDH+  + +    VP+    FKF N  MD+P F  I+  +W   N      + +  KLK 
Sbjct: 221 SDHTPLVVTTELVVPRGNSPFKFNNAIMDHPNFLRIVADSWK-QNIHGYSMFKVCKKLKA 279

Query: 207 LRKALWELKTKEYNNLSEKVEASRAALQHTQQMCDLDPLNREIRHQEISTRKHCQFLEKA 386
           L+  L  L  +E+ N+S +VE + A           +P +  +      TR     L KA
Sbjct: 280 LKAPLKNLFKQEFRNISNRVELAEAEYNSVLNSLKQNPQDPSLLALANRTRGQTIMLRKA 339

Query: 387 ERSFFAQRNKTKHLTLFDKNSKFFYSKVNRANARNSIPFLCREDGSVTRDINIIIEDFTM 566
           E   FAQ  K K+L   DK SKFF++ + R      I  +  EDG  T   + I   F  
Sbjct: 340 ESMKFAQLIKNKYLLQADKCSKFFHALIKRNRHSRFIAAIRLEDGHNTSSQDEISLAFVN 399

Query: 567 FYNNLFGA 590
            + NLF A
Sbjct: 400 HFRNLFSA 407


>gb|AAF63113.1|AC006423_14 Hypothetical protein [Arabidopsis thaliana]
          Length = 668

 Score = 84.0 bits (206), Expect = 5e-14
 Identities = 64/240 (26%), Positives = 106/240 (44%), Gaps = 3/240 (1%)
 Frame = +3

Query: 27  SDHSACIASLFEQVPQFRKTFKFCNFWMDNPRFKEILDSAWSCNNPRAQGQYLLSSKLKT 206
           SDHS CI  L     + +K F++ +F   +P F   L  AW    P     + L   LK 
Sbjct: 248 SDHSPCIIILENLPKRSKKCFRYFSFLSTHPTFLVSLTVAWEEQIPVGSHMFSLGEHLKA 307

Query: 207 LRKALWELKTKEYNNLSEKVEASRAALQHTQQMCDLDPLNREIRHQEISTRKHCQFLEKA 386
            +K    L  + + N+  K + +  +L+  Q     +P +   R + ++ RK   F   A
Sbjct: 308 AKKCCKLLNRQGFGNIQHKTKEALDSLESIQSQLLTNPSDSLFRVEHVA-RKKWNFFAAA 366

Query: 387 ERSFFAQRNKTKHLTLFDKNSKFFYSKVNRANARNSIPFLCREDGSVTRDINIIIEDFTM 566
             SF+ Q+++ K L   D N++FF+  +    A+N I FL  +D     ++  + E    
Sbjct: 367 LESFYRQKSRIKWLQDGDANTRFFHKVILANQAKNLIKFLRMDDDVRVENVTQVKEMIVA 426

Query: 567 FYNNLFGA-SVPRSPLDWNAARSGPMLNC--TEQSALVVPVDTKEIKDVLFSIGNDKAPG 737
           +Y +L G+ S   +P      +      C  T  S L      KEI   +F++  +KAPG
Sbjct: 427 YYTHLLGSDSDILTPDSVQRIKDIHPFRCNDTLASRLSALPSDKEITAAVFAMPRNKAPG 486


>gb|AAF63129.1|AC009526_14 Similar to reverse transcriptase [Arabidopsis thaliana]
          Length = 602

 Score = 84.0 bits (206), Expect = 5e-14
 Identities = 64/240 (26%), Positives = 106/240 (44%), Gaps = 3/240 (1%)
 Frame = +3

Query: 27  SDHSACIASLFEQVPQFRKTFKFCNFWMDNPRFKEILDSAWSCNNPRAQGQYLLSSKLKT 206
           SDHS CI  L     + +K F++ +F   +P F   L  AW    P     + L   LK 
Sbjct: 248 SDHSPCIIILENLPKRSKKCFRYFSFLSTHPTFLVSLTVAWEEQIPVGSHMFSLGEHLKA 307

Query: 207 LRKALWELKTKEYNNLSEKVEASRAALQHTQQMCDLDPLNREIRHQEISTRKHCQFLEKA 386
            +K    L  + + N+  K + +  +L+  Q     +P +   R + ++ RK   F   A
Sbjct: 308 AKKCCKLLNRQGFGNIQHKTKEALDSLESIQSQLLTNPSDSLFRVEHVA-RKKWNFFAAA 366

Query: 387 ERSFFAQRNKTKHLTLFDKNSKFFYSKVNRANARNSIPFLCREDGSVTRDINIIIEDFTM 566
             SF+ Q+++ K L   D N++FF+  +    A+N I FL  +D     ++  + E    
Sbjct: 367 LESFYRQKSRIKWLQDGDANTRFFHKVILANQAKNLIKFLRMDDDVRVENVTQVKEMIVA 426

Query: 567 FYNNLFGA-SVPRSPLDWNAARSGPMLNC--TEQSALVVPVDTKEIKDVLFSIGNDKAPG 737
           +Y +L G+ S   +P      +      C  T  S L      KEI   +F++  +KAPG
Sbjct: 427 YYTHLLGSDSDILTPDSVQRIKDIHPFRCNDTLASRLSALPSDKEITAAVFAMPRNKAPG 486


>ref|NP_175044.1| DNAse I-like superfamily protein [Arabidopsis thaliana]
            gi|332193872|gb|AEE31993.1| DNAse I-like superfamily
            protein [Arabidopsis thaliana]
          Length = 626

 Score = 84.0 bits (206), Expect = 5e-14
 Identities = 64/240 (26%), Positives = 106/240 (44%), Gaps = 3/240 (1%)
 Frame = +3

Query: 27   SDHSACIASLFEQVPQFRKTFKFCNFWMDNPRFKEILDSAWSCNNPRAQGQYLLSSKLKT 206
            SDHS CI  L     + +K F++ +F   +P F   L  AW    P     + L   LK 
Sbjct: 312  SDHSPCIIILENLPKRSKKCFRYFSFLSTHPTFLVSLTVAWEEQIPVGSHMFSLGEHLKA 371

Query: 207  LRKALWELKTKEYNNLSEKVEASRAALQHTQQMCDLDPLNREIRHQEISTRKHCQFLEKA 386
             +K    L  + + N+  K + +  +L+  Q     +P +   R + ++ RK   F   A
Sbjct: 372  AKKCCKLLNRQGFGNIQHKTKEALDSLESIQSQLLTNPSDSLFRVEHVA-RKKWNFFAAA 430

Query: 387  ERSFFAQRNKTKHLTLFDKNSKFFYSKVNRANARNSIPFLCREDGSVTRDINIIIEDFTM 566
              SF+ Q+++ K L   D N++FF+  +    A+N I FL  +D     ++  + E    
Sbjct: 431  LESFYRQKSRIKWLQDGDANTRFFHKVILANQAKNLIKFLRMDDDVRVENVTQVKEMIVA 490

Query: 567  FYNNLFGA-SVPRSPLDWNAARSGPMLNC--TEQSALVVPVDTKEIKDVLFSIGNDKAPG 737
            +Y +L G+ S   +P      +      C  T  S L      KEI   +F++  +KAPG
Sbjct: 491  YYTHLLGSDSDILTPDSVQRIKDIHPFRCNDTLASRLSALPSDKEITAAVFAMPRNKAPG 550


>ref|XP_006598323.1| PREDICTED: uncharacterized protein LOC102660513 [Glycine max]
          Length = 543

 Score = 81.3 bits (199), Expect = 3e-13
 Identities = 70/246 (28%), Positives = 109/246 (44%), Gaps = 3/246 (1%)
 Frame = +3

Query: 9   LTRGTSSDHSACIASLFEQVPQFRKT-FKFCNFWMDNPRFKEILDSAWSCNNPRAQGQYL 185
           +T G S     C+    + VP  RK  FK+ N       F E + ++W+          +
Sbjct: 186 MTPGISDHAMLCLRD--DSVPVKRKARFKYANCVSGMDNFTETVANSWNSARRGGPPMKM 243

Query: 186 LSSKLKTLRKALWELKTKEYNNLSEKVEASRAALQHTQQMCDLDPLNRE-IRHQEISTRK 362
           L  KLK L+  +  L +K    +  K++ +R  L H Q    LD LN++ I      T  
Sbjct: 244 LWHKLKKLQPVINNL-SKPLIGIKVKLQEAREKLTHAQMELTLDRLNKDKIDRTNDCTEA 302

Query: 363 HCQFLEKAERSFFAQRNKTKHLTLFDKNSKFFYSKVNRANARNSIPFLCREDGSVTRDIN 542
             ++ E  E+    QR K + L L D N+ +F++ +     + SI  L   DG+      
Sbjct: 303 VIKWTEMEEQ-MLQQRAKIRWLRLGDGNNAYFHASLKAKYNQTSIKKLYMNDGNFVTTQK 361

Query: 543 IIIEDFTMFYNNLFGASVPR-SPLDWNAARSGPMLNCTEQSALVVPVDTKEIKDVLFSIG 719
            I ++   FY +L G   P    +D N  R G  LN  ++  L+  +  +EI   L SIG
Sbjct: 362 EIEDEIMRFYGDLMGREEPNLDSVDINIMRKGCQLNFDQRKYLIGRITDEEIDKALKSIG 421

Query: 720 NDKAPG 737
           + KAPG
Sbjct: 422 DLKAPG 427


>ref|XP_004253414.1| PREDICTED: uncharacterized protein LOC101253574 [Solanum
           lycopersicum]
          Length = 258

 Score = 81.3 bits (199), Expect = 3e-13
 Identities = 59/214 (27%), Positives = 101/214 (47%), Gaps = 8/214 (3%)
 Frame = +3

Query: 27  SDHSACIASLFEQVPQFRKTFKFCNFWMDNPRFKEILDSAWSCNNPRAQGQYLLSS---K 197
           SDHS  + +L +     + +FKF N W ++ RF EI+++AW     R  G   +     K
Sbjct: 51  SDHSTMMLTLEKTQQHGKCSFKFFNVWTEHERFMEIVETAWK----RQYGYDAMKKVWCK 106

Query: 198 LKTLRKALWELKTKEYNNLSEKVEASRAALQHTQQMCDLDPLNREIRHQEISTRKHCQF- 374
           LK L+  L +L  KE+  + +K+E +R  + + Q       LN +   + I   K     
Sbjct: 107 LKDLQHRLQQLNMKEFKYIGKKIEQARIDVANVQNQ-----LNEQATDELIMKEKELLIN 161

Query: 375 LEK---AERSFFAQRNKTKHLTLFDKNSKFFYSKVNRANARNSIPFLCREDGSVTRDINI 545
           LEK    E++   Q+++ K + L D N+K+F + +        +  +    G +  D   
Sbjct: 162 LEKWSLIEKNALRQKSRIKWIQLGDANNKYFSAVIKERTQEKQVRSIMTLSGQMIYDPQE 221

Query: 546 IIEDFTMFYNNLFGASVPRSP-LDWNAARSGPML 644
           I E+F +FY +L G S  + P ++    + GP L
Sbjct: 222 IQEEFVIFYKSLMGTSAGKLPAVNVKVMKRGPAL 255


>dbj|BAB01845.1| non-LTR retroelement reverse transcriptase-like protein
           [Arabidopsis thaliana]
          Length = 893

 Score = 80.9 bits (198), Expect = 4e-13
 Identities = 47/167 (28%), Positives = 84/167 (50%), Gaps = 2/167 (1%)
 Frame = +3

Query: 27  SDHSACIASLFEQVPQFRKTFKFCNFWMDNPRFKEILDSAWSCNNPRAQGQYLLSSKLKT 206
           SDHS+C   L   V + ++ F+F N+++ NP F +++   W   N      Y +S KLK 
Sbjct: 226 SDHSSCEVVLDPAVLKAKRPFRFFNYFLHNPDFLQLIRENWYSCNVSGSAMYRVSKKLKH 285

Query: 207 LRKALWELKTKEYNNLSEKVEASRAALQHTQQMCDLDPLNREIRHQ--EISTRKHCQFLE 380
           L+  +     + Y+++ ++V  + A + H Q++   +P    + H   E+   +  Q L 
Sbjct: 286 LKLPICCFSRENYSDIEKRVSEAHAIVLHRQRITLTNP---SVVHATLELEATRKWQILA 342

Query: 381 KAERSFFAQRNKTKHLTLFDKNSKFFYSKVNRANARNSIPFLCREDG 521
           KAE SFF Q++    L   D N+ +F+   +   + N+I FL  + G
Sbjct: 343 KAEESFFCQKSSISWLYEGDNNTAYFHKMADMRKSINTINFLIDDFG 389


>emb|CAA66812.1| non-ltr retrotransposon reverse transcriptase-like protein
           [Arabidopsis thaliana]
          Length = 893

 Score = 80.9 bits (198), Expect = 4e-13
 Identities = 47/167 (28%), Positives = 84/167 (50%), Gaps = 2/167 (1%)
 Frame = +3

Query: 27  SDHSACIASLFEQVPQFRKTFKFCNFWMDNPRFKEILDSAWSCNNPRAQGQYLLSSKLKT 206
           SDHS+C   L   V + ++ F+F N+++ NP F +++   W   N      Y +S KLK 
Sbjct: 226 SDHSSCEVVLDPAVLKAKRPFRFFNYFLHNPDFLQLIRENWYSCNVSGSAMYRVSKKLKH 285

Query: 207 LRKALWELKTKEYNNLSEKVEASRAALQHTQQMCDLDPLNREIRHQ--EISTRKHCQFLE 380
           L+  +     + Y+++ ++V  + A + H Q++   +P    + H   E+   +  Q L 
Sbjct: 286 LKLPICCFSRENYSDIEKRVSEAHAIVLHRQRITLTNP---SVVHATLELEATRKWQILA 342

Query: 381 KAERSFFAQRNKTKHLTLFDKNSKFFYSKVNRANARNSIPFLCREDG 521
           KAE SFF Q++    L   D N+ +F+   +   + N+I FL  + G
Sbjct: 343 KAEESFFCQKSSISWLYEGDNNTAYFHKMADMRKSINTINFLIDDFG 389


Top