BLASTX nr result

ID: Atropa21_contig00032352

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00032352
         (1293 letters)

Database: nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670...    72   6e-18
ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664...    84   9e-18
ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665...    97   1e-17
ref|XP_006586426.1| PREDICTED: putative ribonuclease H protein A...    72   3e-17
ref|XP_006590026.1| PREDICTED: uncharacterized protein LOC102660...    71   1e-16
emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...    90   2e-15
gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm,...    52   3e-15
emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...    88   9e-15
ref|XP_006584390.1| PREDICTED: putative ribonuclease H protein A...    64   4e-14
ref|XP_004149382.1| PREDICTED: putative ribonuclease H protein A...    54   2e-13
ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663...    83   3e-13
ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661...    81   8e-13
ref|XP_006344520.1| PREDICTED: uncharacterized protein LOC102602...    72   2e-12
gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcrip...    55   4e-12
ref|NP_189068.2| RNA-directed DNA polymerase (reverse transcript...    59   1e-11
gb|AAC24057.1| Contains similarity to reverse transcriptase-like...    49   1e-11
gb|ABD33260.1| non-LTR retroelement reverse transcriptase-like p...    76   3e-11
gb|ABV21212.1| Ty1_Copia-element protein [Arabidopsis thaliana]        50   3e-10
ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268...    72   4e-10
ref|XP_002331075.1| predicted protein [Populus trichocarpa]            72   7e-10

>ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670237 [Glycine max]
          Length = 383

 Score = 72.4 bits (176), Expect(2) = 6e-18
 Identities = 37/88 (42%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
 Frame = +2

Query: 194 SNMNSKHLSYAGRLQILNAILFSIKNFWCSIFIMPQSIKGG-R*IYRNYL*GNKEEKRKV 370
           S  + K LSYAG+++++ A++  I NFW SIF +PQS+        RN+L G  +  +  
Sbjct: 109 SRWSRKSLSYAGKVELIRAVIQGIANFWMSIFPLPQSVLDTIIATCRNFLWGKADGGKIK 168

Query: 371 PLVAWERICCPTKFGGPNVQGCKSWNIA 454
           PLVAW  +C P K GG  +   K WNIA
Sbjct: 169 PLVAWSEVCTPKKEGGLGLFNLKDWNIA 196



 Score = 46.6 bits (109), Expect(2) = 6e-18
 Identities = 36/141 (25%), Positives = 60/141 (42%), Gaps = 3/141 (2%)
 Frame = +3

Query: 459 LIILFWQL*EKQDSLWVRWVHGLYMKQDLSIWDHLPLHDCSWYWRKINTLKLDMKN*YTQ 638
           L  + W L  K+DSLWVR VH  Y K   ++WD +     S +   I  + +  +     
Sbjct: 198 LSCILWDLHSKKDSLWVRLVHHYYFKGG-NVWDFISSSSDSVFIH-IRDIIISKEENIEV 255

Query: 639 GRYKLTP---SRQYSIIASYNELLGGQTRLKIADLIWTSVAQPSHRMIVWLVSQGQLLTK 809
            +  L     + Q      Y+ + G +  +  + +IW  V       I+WL ++ +LL  
Sbjct: 256 AKLMLNSWGCNEQTLAGKMYDYIRGTRPVVHWSSIIWNPVIPSKMSFILWLATKNRLLAL 315

Query: 810 ERMIHFNIPIDNVNCCLCSNQ 872
           +R    N       C LC+N+
Sbjct: 316 DRAAFLN---KGFLCPLCTNE 333


>ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664824 [Glycine max]
          Length = 939

 Score = 84.3 bits (207), Expect(2) = 9e-18
 Identities = 77/301 (25%), Positives = 128/301 (42%), Gaps = 12/301 (3%)
 Frame = +3

Query: 3    GLVANIEKSSLFLAGVEDGIKEQLLTMTGFSLGQFPIRYLGLALSSKKWSKLECQQLIDK 182
            GL  N  K +++   V+  +KEQLL ++GF  G+ P RYLG+ LSSKK +    Q LIDK
Sbjct: 546  GLHVNPSKCNIYCGSVDINVKEQLLLISGFKEGKMPFRYLGIPLSSKKLNIKHYQVLIDK 605

Query: 183  MTQNQT*ILSTCLMLEGCKY*MQYCFQ*RTSGAQYSSCPKVLREVDKFTETIFRAIRRKR 362
            +    T   +  L   G    +Q      T        P     + KF      AI R  
Sbjct: 606  IVGRITHWSAGLLSYAGRVQLIQSVIF-ATINFWMQCLP-----LPKFVIMRINAICRS- 658

Query: 363  GRFHWWHGREYVVQQS--LEDLMCKGVKVGTLH----------QLIILFWQL*EKQDSLW 506
                 W G   + ++S    + +C     G L+           ++ L W +  K D+LW
Sbjct: 659  ---FLWIGNSNISRKSPIAWEKVCSPKINGGLNIINLAIWNKISILKLLWNVCNKSDNLW 715

Query: 507  VRWVHGLYMKQDLSIWDHLPLHDCSWYWRKINTLKLDMKN*YTQGRYKLTPSRQYSIIAS 686
            ++W+H  Y++   SIW  +     SW    +  L+  +       +Y+      + +   
Sbjct: 716  IKWLHTYYIRGQ-SIWSMVLKKSHSWIMSSMMKLRPLLL------QYQSRMQDVFKMKKI 768

Query: 687  YNELLGGQTRLKIADLIWTSVAQPSHRMIVWLVSQGQLLTKERMIHFNIPIDNVNCCLCS 866
            Y  L     ++    L+  ++A+P     +W     +L +K+R+I F + +D  NC  CS
Sbjct: 769  YLALFEESEKMSWRTLMCNNLARPRALFCLWQACHFRLASKDRLIKFGLNVD-ANCAFCS 827

Query: 867  N 869
            +
Sbjct: 828  S 828



 Score = 33.9 bits (76), Expect(2) = 9e-18
 Identities = 17/75 (22%), Positives = 36/75 (48%), Gaps = 4/75 (5%)
 Frame = +2

Query: 890  HLLSECDWFRQVKNGIMQWAGV-QVPSG---EVKQVIERIKRKHWKQFYKEVVVAIWSGV 1057
            HL   C   + +   ++ W  +  +PS    E+  +  + K K W+     ++   ++  
Sbjct: 834  HLFFGCIELKTIWTAVLNWLQIIHMPSTWSEELNWITRKCKGKGWRAM---LLKCAFTET 890

Query: 1058 VYHVWRARN*KLYGG 1102
            +YH+W  RN +++GG
Sbjct: 891  IYHIWAYRNHRVFGG 905


>ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665746 [Glycine max]
          Length = 506

 Score = 97.4 bits (241), Expect = 1e-17
 Identities = 79/295 (26%), Positives = 135/295 (45%), Gaps = 5/295 (1%)
 Frame = +3

Query: 3   GLVANIEKSSLFLAGVEDGIKEQLLTMTGFSLGQFPIRYLGLALSSKKWSKLECQQLIDK 182
           GL+ N +K SL  AG++   K ++L ++GF  GQ P +YLG+ ++SKK S +    LIDK
Sbjct: 104 GLLVNPQKCSLLCAGIDAVTKREILEVSGFQEGQLPFKYLGVPVTSKKLSTIHYSPLIDK 163

Query: 183 MTQNQT*ILSTCLMLEGCKY*MQYCFQ*RTSGAQYSSCPKVLREVDKFTETIFRAIRRKR 362
           +        +  L   G    +       T+   + +C    + V +  E I R I    
Sbjct: 164 IVGKIKHWTARLLSYAGRLQLVNSVMFALTN--YWLNCFPFPKSVLQKIEAICR-IFLWT 220

Query: 363 GRFHWWHGREYVVQQSLEDLMCKGVKVGTLH-----QLIILFWQL*EKQDSLWVRWVHGL 527
           G F          +Q      C G+ +  +       L+ L W L  K+DSLWV+W+   
Sbjct: 221 GGFEGSRKSPVAWKQICSPRSCGGLNIIDIDIWNKANLMKLLWNLSSKEDSLWVKWIQAY 280

Query: 528 YMKQDLSIWDHLPLHDCSWYWRKINTLKLDMKN*YTQGRYKLTPSRQYSIIASYNELLGG 707
           Y+K+   +   +   D SW  + I   + D++        +L      ++   Y +L   
Sbjct: 281 YVKRSELMHIEMKNTD-SWIMKAILKQREDLEK--IDNMEELMIRGSINMGKLYRKLQDC 337

Query: 708 QTRLKIADLIWTSVAQPSHRMIVWLVSQGQLLTKERMIHFNIPIDNVNCCLCSNQ 872
             R +  +L++ + A+P    I+WL   G+L TK+R+  + + ID+ +CC CS +
Sbjct: 338 GQRKEWKNLLYGNTARPRANFILWLACHGRLSTKDRLCKYGM-IDDKSCCFCSEE 391


>ref|XP_006586426.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Glycine
           max]
          Length = 192

 Score = 71.6 bits (174), Expect(2) = 3e-17
 Identities = 35/85 (41%), Positives = 50/85 (58%), Gaps = 1/85 (1%)
 Frame = +2

Query: 203 NSKHLSYAGRLQILNAILFSIKNFWCSIFIMPQSIKGG-R*IYRNYL*GNKEEKRKVPLV 379
           + K LSYAG+L+++ A++  I NFW  IF +PQS+        RN+L G  +  +  PLV
Sbjct: 81  SKKSLSYAGKLELIRAVIQGIVNFWMEIFSLPQSVMDWINASCRNFLWGKADIGKNKPLV 140

Query: 380 AWERICCPTKFGGPNVQGCKSWNIA 454
           AW  +C P K GG  +   K WN+A
Sbjct: 141 AWSVVCSPKKEGGLGLLNLKDWNLA 165



 Score = 45.1 bits (105), Expect(2) = 3e-17
 Identities = 22/62 (35%), Positives = 35/62 (56%)
 Frame = +3

Query: 3   GLVANIEKSSLFLAGVEDGIKEQLLTMTGFSLGQFPIRYLGLALSSKKWSKLECQQLIDK 182
           GL  + +KS+++  G+       +  +TGFSLG FP RYLG+ L S + +      L+ K
Sbjct: 13  GLSISSDKSAIYSTGIRPHELSHIQQLTGFSLGDFPFRYLGVPLLSSRLNVCHYALLLSK 72

Query: 183 MT 188
           +T
Sbjct: 73  IT 74


>ref|XP_006590026.1| PREDICTED: uncharacterized protein LOC102660482 [Glycine max]
          Length = 303

 Score = 70.9 bits (172), Expect(2) = 1e-16
 Identities = 35/83 (42%), Positives = 50/83 (60%), Gaps = 1/83 (1%)
 Frame = +2

Query: 209 KHLSYAGRLQILNAILFSIKNFWCSIFIMPQSIKGG-R*IYRNYL*GNKEEKRKVPLVAW 385
           K LSYAG+L+++ A++  I NFW  IF +PQS+        RN+L G  +  +K PLVAW
Sbjct: 126 KSLSYAGKLELIRAVIQGIVNFWIGIFPLPQSVLDRINASCRNFLWGKADIGKKKPLVAW 185

Query: 386 ERICCPTKFGGPNVQGCKSWNIA 454
             +C P + GG  +   K WN+A
Sbjct: 186 SVVCSPKREGGLGLFNLKDWNLA 208



 Score = 43.9 bits (102), Expect(2) = 1e-16
 Identities = 22/62 (35%), Positives = 35/62 (56%)
 Frame = +3

Query: 3   GLVANIEKSSLFLAGVEDGIKEQLLTMTGFSLGQFPIRYLGLALSSKKWSKLECQQLIDK 182
           GL  + +KSS++ + +       +  +TGFSLG FP RYLG+ L S + +      L+ K
Sbjct: 56  GLSISSDKSSIYSSSIRTHELSHIQQLTGFSLGGFPFRYLGVPLLSSRLNVCHYAPLLSK 115

Query: 183 MT 188
           +T
Sbjct: 116 IT 117


>emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1110

 Score = 90.1 bits (222), Expect = 2e-15
 Identities = 70/309 (22%), Positives = 140/309 (45%), Gaps = 11/309 (3%)
 Frame = +3

Query: 3    GLVANIEKSSLFLAGVEDGIKEQLLTMTGFSLGQFPIRYLGLALSSKKWSKLECQQLIDK 182
            GL A+ EKS+++  GV+D    +L       LG+ P RYLG+ L+SKK +  +C+ L++ 
Sbjct: 718  GLAASHEKSNIYFCGVDDETARELADYVHMQLGELPFRYLGVPLTSKKLTYAQCKPLVEM 777

Query: 183  MTQNQT*ILSTCLMLEGCKY*MQYCFQ*RTSGAQYSSCPKVLREVDKFTETIFRAIRRKR 362
            +T      ++  L          Y  + +   +  SS       +   ++ + +A+ +  
Sbjct: 778  ITNRAQTWMAKLL---------SYAGRLQLIKSILSSMQNYWAHIFPLSKKVIQAVEKVC 828

Query: 363  GRFHWWHGREY-----VVQQSLEDLMCKG------VKVGTLHQLIILFWQL*EKQDSLWV 509
             +F W    E      V   +++    +G      +K      ++ L W +  K+D LWV
Sbjct: 829  RKFLWTGKTEETKKAPVAWATIQRPKSRGGWNVINMKYWNRAAMLKLLWAIEFKRDKLWV 888

Query: 510  RWVHGLYMKQDLSIWDHLPLHDCSWYWRKINTLKLDMKN*YTQGRYKLTPSRQYSIIASY 689
            RW+H  Y+K+   +  ++  +  +W  RKI   +  + N       ++    ++S+  +Y
Sbjct: 889  RWIHSYYIKRQDILTVNIS-NQTTWILRKIVKARDHLSN--IGDWDEICIGDKFSMKKAY 945

Query: 690  NELLGGQTRLKIADLIWTSVAQPSHRMIVWLVSQGQLLTKERMIHFNIPIDNVNCCLCSN 869
             ++     R++   LI  + A P  + I+W++   +L T +R+  + +  D +N  LC N
Sbjct: 946  KKISENGERVRWRRLICNNYATPKSKFILWMMLHERLPTVDRISRWGVQCD-LNYRLCRN 1004

Query: 870  QVTRISSIY 896
                I  ++
Sbjct: 1005 DGETIQHLF 1013


>gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm, score: 60.13)
           [Arabidopsis thaliana]
          Length = 1164

 Score = 52.4 bits (124), Expect(3) = 3e-15
 Identities = 26/69 (37%), Positives = 42/69 (60%), Gaps = 1/69 (1%)
 Frame = +2

Query: 215 LSYAGRLQILNAILFSIKNFWCSIFIMPQS-IKGGR*IYRNYL*GNKEEKRKVPLVAWER 391
           LS+AGR+Q+L +++  I NFW S FI+P   IK    +   +L  ++ +K+ +  VAW +
Sbjct: 695 LSFAGRVQLLASVISGIVNFWISSFILPLGCIKKIESLCSRFLWSSRIDKKGIAKVAWSQ 754

Query: 392 ICCPTKFGG 418
           +C P   GG
Sbjct: 755 VCLPKAEGG 763



 Score = 47.8 bits (112), Expect(3) = 3e-15
 Identities = 25/62 (40%), Positives = 37/62 (59%)
 Frame = +3

Query: 3   GLVANIEKSSLFLAGVEDGIKEQLLTMTGFSLGQFPIRYLGLALSSKKWSKLECQQLIDK 182
           GL+ N  K+ L+ AG+     + + +  GF LG  P+RYLGL L S+K +  E   LI+K
Sbjct: 624 GLLMNTNKTQLYHAGLSQSESDSMASY-GFKLGSLPVRYLGLPLMSRKLTIAEYAPLIEK 682

Query: 183 MT 188
           +T
Sbjct: 683 IT 684



 Score = 29.3 bits (64), Expect(3) = 3e-15
 Identities = 13/49 (26%), Positives = 21/49 (42%)
 Frame = +3

Query: 468 LFWQL*EKQDSLWVRWVHGLYMKQDLSIWDHLPLHDCSWYWRKINTLKL 614
           + W L     SLWV W     + +  S W+       SW W+ +  L++
Sbjct: 780 MIWLLFSNSGSLWVAWHKQHSLGKSTSFWNQPEKPHDSWNWKCLLRLRV 828


>emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1114

 Score = 87.8 bits (216), Expect = 9e-15
 Identities = 80/300 (26%), Positives = 133/300 (44%), Gaps = 2/300 (0%)
 Frame = +3

Query: 3    GLVANIEKSSLFLAGVEDGIKEQLLTMTGFSLGQFPIRYLGLALSSKKWSKLECQQLIDK 182
            GL A+IEKS ++  GV     EQL       +G  P RYLG+ L+SKK +  +C+ LIDK
Sbjct: 721  GLQASIEKSCIYFGGVCHEEAEQLADRIQMPIGSLPFRYLGVPLASKKLNFSQCKPLIDK 780

Query: 183  MTQNQT*ILSTCLMLEG-CKY*MQYCFQ*RTSGAQYSSCPKVLREVDKFTETIFRAIRRK 359
            +T      ++  L   G  +      +  +    Q    PK L +  + T   F      
Sbjct: 781  ITTRAQGWVAHLLSYAGRLQLVKTILYSMQNYWGQIFPLPKKLIKAVETTCRKFLWTGTV 840

Query: 360  RGRFHWWHGREYVVQ-QSLEDLMCKGVKVGTLHQLIILFWQL*EKQDSLWVRWVHGLYMK 536
               +      +++ Q +S   L    + +     ++ L W +  KQD LWVRWV+  Y+K
Sbjct: 841  DTSYKAPVAWDFLQQPKSTGGLNVTNMVLWNKAAILKLLWAITFKQDKLWVRWVNAYYIK 900

Query: 537  QDLSIWDHLPLHDCSWYWRKINTLKLDMKN*YTQGRYKLTPSRQYSIIASYNELLGGQTR 716
            +  +I +     + SW  RKI   +  +    T G   ++    +SI  +Y  L      
Sbjct: 901  RQ-NIENVTVSSNTSWILRKIFESRELLTR--TGGWEAVSNHMNFSIKKTYKLLQEDYEN 957

Query: 717  LKIADLIWTSVAQPSHRMIVWLVSQGQLLTKERMIHFNIPIDNVNCCLCSNQVTRISSIY 896
            +    LI  + A P  + I+WL    +L T ER+  +N  +  + C +C N++  I  ++
Sbjct: 958  VVWKRLICNNKATPKSQFILWLAMLNRLATAERVSRWNRDVSPL-CKMCGNEIETIQHLF 1016


>ref|XP_006584390.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Glycine
           max]
          Length = 316

 Score = 63.9 bits (154), Expect(2) = 4e-14
 Identities = 32/91 (35%), Positives = 50/91 (54%), Gaps = 1/91 (1%)
 Frame = +2

Query: 203 NSKHLSYAGRLQILNAILFSIKNFWCSIFIMPQSIKGG-R*IYRNYL*GNKEEKRKVPLV 379
           N K LSY G+L+++ A++  I NFW  IF +PQS+         N+L    +  +  PLV
Sbjct: 157 NKKSLSYVGKLELIKAVIQGIMNFWMRIFPLPQSVLDRINASCCNFLWSKADIGKNKPLV 216

Query: 380 AWERICCPTKFGGPNVQGCKSWNIASVDNFV 472
           AW  +C P + GG  +   K WN+A + + +
Sbjct: 217 AWPVVCSPKQEGGLGLFNLKDWNLALLSHIL 247



 Score = 42.0 bits (97), Expect(2) = 4e-14
 Identities = 22/61 (36%), Positives = 34/61 (55%)
 Frame = +3

Query: 3   GLVANIEKSSLFLAGVEDGIKEQLLTMTGFSLGQFPIRYLGLALSSKKWSKLECQQLIDK 182
           GL  + +KS+++ AG+       +  +TGFSLG FP RYLG  L S + +      L+ K
Sbjct: 89  GLSISSDKSAIYSAGIRPYELSHIQQLTGFSLGGFPFRYLGAPLLSSRLNVCHYAPLLYK 148

Query: 183 M 185
           +
Sbjct: 149 I 149


>ref|XP_004149382.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Cucumis
           sativus]
          Length = 268

 Score = 53.5 bits (127), Expect(2) = 2e-13
 Identities = 28/87 (32%), Positives = 49/87 (56%), Gaps = 1/87 (1%)
 Frame = +2

Query: 257 FSIKNFWCSIFIMPQSI-KGGR*IYRNYL*GNKEEKRKVPLVAWERICCPTKFGGPNVQG 433
           +S++ +W S+F++P  + +    I R YL   KEE R    VAW+ +C P   GG +++ 
Sbjct: 130 WSLQVYWASVFMLPMKVHRDVDKILRAYLWRGKEEGRGGAKVAWDEVCLPFDEGGLDIRD 189

Query: 434 CKSWNIASVDNFVLAIMRKARFSMGEV 514
             SWNIA+    +  ++ K+  S+ E+
Sbjct: 190 GSSWNIATTLKILWLLLVKSGRSLWEI 216



 Score = 50.1 bits (118), Expect(2) = 2e-13
 Identities = 26/64 (40%), Positives = 35/64 (54%)
 Frame = +3

Query: 3   GLVANIEKSSLFLAGVEDGIKEQLLTMTGFSLGQFPIRYLGLALSSKKWSKLECQQLIDK 182
           GL AN  KSS+FL GV       L     FS+G  P+R+LGL L S +    +C  LI +
Sbjct: 63  GLFANRGKSSIFLVGVNSSKASWLAANMDFSIGHLPVRHLGLPLLSGRLRSSDCDPLIQR 122

Query: 183 MTQN 194
           +T +
Sbjct: 123 ITSH 126


>ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663533 [Glycine max]
          Length = 514

 Score = 82.8 bits (203), Expect = 3e-13
 Identities = 75/305 (24%), Positives = 134/305 (43%), Gaps = 17/305 (5%)
 Frame = +3

Query: 3    GLVANIEKSSLFLAGVEDGIKEQLLTMTGFSLGQFPIRYLGLALSSKKWSKLECQQLIDK 182
            GL  N  K  +F  G+     + +  +TGF  G  P+RYLG+ LS KK +      L++K
Sbjct: 207  GLQINPAKCKVFCGGLNCDSIQVITKITGFEEGTLPVRYLGVPLSCKKLNVHHYLPLVEK 266

Query: 183  MTQNQT*ILSTCLMLEGCKY*MQYCFQ*RTSGAQY--SSCP---KVLREVDKFTETIFRA 347
            +        S  L + G    +Q      T+ AQY  S  P   KV++++D    +    
Sbjct: 267  IVGKIRHWSSKLLSIAGR---IQLVRSIITAIAQYWMSVFPMPKKVIQKIDSICRSFI-- 321

Query: 348  IRRKRGRFHWWHGREYVVQQSLE--DLMCKGVKVGTLH----------QLIILFWQL*EK 491
                      W G   V ++SL     +CK  + G L+           ++   W +  K
Sbjct: 322  ----------WSGSAEVKRKSLVAWKQVCKPARCGGLNLINLELWNVTAMLKCLWNICSK 371

Query: 492  QDSLWVRWVHGLYMKQDLSIWDHLPLHDCSWYWRKINTLKLDMKN*YTQGRYKLTPSRQY 671
            +D+LWV+W+H  ++K D ++       + +W  + +   +  + N       ++   R++
Sbjct: 372  EDNLWVKWIHAYFLKGD-NVMSATIKSNSTWILKSVMKQRPQVNN-LQLVWIEMLRKRKF 429

Query: 672  SIIASYNELLGGQTRLKIADLIWTSVAQPSHRMIVWLVSQGQLLTKERMIHFNIPIDNVN 851
            S+   Y EL+    ++    L+  + A+P   + +WL  Q +L TK R+ + N+    + 
Sbjct: 430  SMKQVYMELVEDHNKIDWFRLLRYNRARPRANVTLWLACQNRLATKTRLKNMNM----IQ 485

Query: 852  CCLCS 866
            C LCS
Sbjct: 486  CSLCS 490


>ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661523 [Glycine max]
          Length = 947

 Score = 81.3 bits (199), Expect = 8e-13
 Identities = 75/303 (24%), Positives = 137/303 (45%), Gaps = 21/303 (6%)
 Frame = +3

Query: 3    GLVANIEKSSLFLAGVEDGIKEQLLTMTGFSLGQFPIRYLGLALSSKKWSKLECQQLIDK 182
            GLV N  K  ++  GV+   K ++  ++ +  GQ P+RYLG+ L+SKK +      LIDK
Sbjct: 546  GLVVNPNKCRIYFGGVDGTTKNKIQQISSYEEGQLPVRYLGVPLTSKKLNIKYYLPLIDK 605

Query: 183  MTQNQT*ILSTCLMLEGCKY*MQYCFQ*RTSGAQY-SSCPKVLREVDKFTETIFRAIRRK 359
            +T       S  L + G +  M  C    T+  Q+   C  +   V K  +++ R+    
Sbjct: 606  ITTRIRHWTSKLLNMTG-RVQMVNCT--ITAIVQFWMQCLPIPMSVIKKIDSMCRS---- 658

Query: 360  RGRFHWWHGREYVVQQSLE-DLMCK----------GVKVGTLHQLIILFWQL*EKQDSLW 506
               F W    E   +  +  + +C+           +KV     ++   W L +K D+LW
Sbjct: 659  ---FVWSRSTEITRKSPIAWNSVCRPKGQGGLNIFNLKVWNHITVLNCLWNLCKKVDNLW 715

Query: 507  VRWVHGLYMKQDLSIWDHLPLHDCSWYWRKINTLKLDMKN*YTQGRY---------KLTP 659
            V+W+H  Y+K   S+ + +  ++ SW           +KN  +Q  Y         +L  
Sbjct: 716  VKWIHAHYIKNS-SVMNTMVTNNFSWV----------LKNVLSQREYIHTLQPVWDELLN 764

Query: 660  SRQYSIIASYNELLGGQTRLKIADLIWTSVAQPSHRMIVWLVSQGQLLTKERMIHFNIPI 839
            S ++ +  +Y++++    R+  + L+  + A+P      WL   G+L TK+R++ F +  
Sbjct: 765  SERFKMKKAYDKMMEAD-RVHWSGLMRKNCARPRAIHTTWLACHGRLGTKDRLVRFGMIT 823

Query: 840  DNV 848
            D +
Sbjct: 824  DKI 826


>ref|XP_006344520.1| PREDICTED: uncharacterized protein LOC102602464 [Solanum tuberosum]
          Length = 1041

 Score = 72.0 bits (175), Expect(2) = 2e-12
 Identities = 26/74 (35%), Positives = 45/74 (60%)
 Frame = +2

Query: 887  LHLLSECDWFRQVKNGIMQWAGVQVPSGEVKQVIERIKRKHWKQFYKEVVVAIWSGVVYH 1066
            +HL  +CDW + +   + QW G+++ +  +KQ ++RI   HWK+F KE + A    ++YH
Sbjct: 523  MHLFVKCDWAQAIWKEVKQWIGIELQNNGIKQTLKRIMLTHWKKFKKETIAATCGAILYH 582

Query: 1067 VWRARN*KLYGGTY 1108
             WR R+ K + G +
Sbjct: 583  TWRVRSWKKFKGKF 596



 Score = 28.5 bits (62), Expect(2) = 2e-12
 Identities = 9/25 (36%), Positives = 17/25 (68%)
 Frame = +3

Query: 801 LTKERMIHFNIPIDNVNCCLCSNQV 875
           L+ ER +  +I +++ +CCLC  +V
Sbjct: 494 LSSERKLKLHIQVEDTDCCLCDKKV 518


>gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcript_fact.hmm, score:
           72.31) [Arabidopsis thaliana]
          Length = 928

 Score = 54.7 bits (130), Expect(2) = 4e-12
 Identities = 23/61 (37%), Positives = 43/61 (70%)
 Frame = +3

Query: 3   GLVANIEKSSLFLAGVEDGIKEQLLTMTGFSLGQFPIRYLGLALSSKKWSKLECQQLIDK 182
           GL  ++EKS+++LAGV+  + ++++    F +G+ P+RYLGL L SK+ +  +C  LI++
Sbjct: 621 GLKISMEKSTMYLAGVQASVYQEIVQKFSFDVGKLPVRYLGLPLVSKRLTASDCLPLIEQ 680

Query: 183 M 185
           +
Sbjct: 681 L 681



 Score = 44.7 bits (104), Expect(2) = 4e-12
 Identities = 25/69 (36%), Positives = 40/69 (57%), Gaps = 1/69 (1%)
 Frame = +2

Query: 206 SKHLSYAGRLQILNAILFSIKNFWCSIFIMPQS-IKGGR*IYRNYL*GNKEEKRKVPLVA 382
           S+ LS+AGRL ++++ L+SI NFW + F +P++ I+    +   +L    E       V+
Sbjct: 690 SRFLSFAGRLNLISSTLWSICNFWMAAFRLPRACIREIDKLCSAFLWSGTELSSNKAKVS 749

Query: 383 WERICCPTK 409
           WE IC P K
Sbjct: 750 WEAICKPKK 758


>ref|NP_189068.2| RNA-directed DNA polymerase (reverse transcriptase)-related protein
           [Arabidopsis thaliana] gi|332643359|gb|AEE76880.1|
           RNA-directed DNA polymerase (reverse
           transcriptase)-related protein [Arabidopsis thaliana]
          Length = 746

 Score = 58.5 bits (140), Expect(2) = 1e-11
 Identities = 31/85 (36%), Positives = 48/85 (56%), Gaps = 1/85 (1%)
 Frame = +2

Query: 206 SKHLSYAGRLQILNAILFSIKNFWCSIFIMPQS-IKGGR*IYRNYL*GNKEEKRKVPLVA 382
           ++HLS+AGRLQ++++++ S+ NFW S F +P + IK    I  ++L    E   K   VA
Sbjct: 58  ARHLSFAGRLQLISSVIHSLTNFWMSAFRLPSACIKEIDSICSSFLWSGPELNTKKAKVA 117

Query: 383 WERICCPTKFGGPNVQGCKSWNIAS 457
           W  +C P   GG  ++  K  N  S
Sbjct: 118 WSDVCTPKDEGGLGIRSLKEANKGS 142



 Score = 39.3 bits (90), Expect(2) = 1e-11
 Identities = 19/49 (38%), Positives = 30/49 (61%)
 Frame = +3

Query: 39  LAGVEDGIKEQLLTMTGFSLGQFPIRYLGLALSSKKWSKLECQQLIDKM 185
           +AGV+D  K  +L    F+ G  P+RYLGL L +KK +  +   L++K+
Sbjct: 1   MAGVKDNDKADILHSFPFASGALPVRYLGLPLLTKKMTTSDYGPLVEKI 49


>gb|AAC24057.1| Contains similarity to reverse transcriptase-like protein
           gb|2244803 from A. thaliana chromosome 4 contig
           gb|Z97336 [Arabidopsis thaliana]
          Length = 404

 Score = 48.9 bits (115), Expect(2) = 1e-11
 Identities = 25/63 (39%), Positives = 39/63 (61%)
 Frame = +3

Query: 3   GLVANIEKSSLFLAGVEDGIKEQLLTMTGFSLGQFPIRYLGLALSSKKWSKLECQQLIDK 182
           GL  ++EKS+LFLAG      + + +   FS+G  P+RYLGL L +K+ S  +   L++K
Sbjct: 26  GLRISMEKSTLFLAGTSATAHQTITSTYPFSVGTLPVRYLGLPLVNKRLSAADYLPLLEK 85

Query: 183 MTQ 191
           + Q
Sbjct: 86  IRQ 88



 Score = 48.9 bits (115), Expect(2) = 1e-11
 Identities = 26/72 (36%), Positives = 43/72 (59%), Gaps = 1/72 (1%)
 Frame = +2

Query: 206 SKHLSYAGRLQILNAILFSIKNFWCSIFIMPQS-IKGGR*IYRNYL*GNKEEKRKVPLVA 382
           ++ LS+AGRL +++++L+SI NFW + F +P+  I+    I   +L   +E K     V+
Sbjct: 95  ARFLSFAGRLNLISSVLWSICNFWMAAFRLPRGCIREVDKICSAFLWSGQEMKTTKAKVS 154

Query: 383 WERICCPTKFGG 418
           W+ IC P   GG
Sbjct: 155 WQEICKPKHEGG 166


>gb|ABD33260.1| non-LTR retroelement reverse transcriptase-like protein, related
           [Medicago truncatula]
          Length = 120

 Score = 76.3 bits (186), Expect = 3e-11
 Identities = 38/89 (42%), Positives = 55/89 (61%), Gaps = 1/89 (1%)
 Frame = +2

Query: 197 NMNSKHLSYAGRLQILNAILFSIKNFWCSIFIMPQSI-KGGR*IYRNYL*GNKEEKRKVP 373
           N +SK LSYAGRLQ++ ++LF ++ +W  +F++PQ + K  +   R +L   K    K  
Sbjct: 28  NWSSKFLSYAGRLQLIKSVLFGVQTYWSQVFVLPQKVLKLIQTACRVFLWTGKSGTSKRA 87

Query: 374 LVAWERICCPTKFGGPNVQGCKSWNIASV 460
           L+AWERIC P   GG NV   K WN A++
Sbjct: 88  LIAWERICLPKTAGGWNVIDLKVWNQAAI 116


>gb|ABV21212.1| Ty1_Copia-element protein [Arabidopsis thaliana]
          Length = 438

 Score = 49.7 bits (117), Expect(2) = 3e-10
 Identities = 27/83 (32%), Positives = 43/83 (51%), Gaps = 1/83 (1%)
 Frame = +2

Query: 209 KHLSYAGRLQILNAILFSIKNFWCSIFIMPQS-IKGGR*IYRNYL*GNKEEKRKVPLVAW 385
           K LS+AGRLQ+L++++  I  FW S F +P+  I+    +   +L     ++     V+W
Sbjct: 67  KSLSFAGRLQLLSSVISGIVVFWMSTFRLPKGCIREIESMCARFLWSGGTDEHHKAKVSW 126

Query: 386 ERICCPTKFGGPNVQGCKSWNIA 454
             +C P   GG  V+    WN A
Sbjct: 127 STVCLPKAEGGLGVRKFTEWNTA 149



 Score = 43.1 bits (100), Expect(2) = 3e-10
 Identities = 24/57 (42%), Positives = 34/57 (59%)
 Frame = +3

Query: 15  NIEKSSLFLAGVEDGIKEQLLTMTGFSLGQFPIRYLGLALSSKKWSKLECQQLIDKM 185
           N +K+ LFLAGV+D ++   +   GF     PIRYLGL L S+K    E + L+ K+
Sbjct: 2   NSDKTELFLAGVDD-VEYAAVYAYGFPFANLPIRYLGLPLMSRKLKISEFEPLVVKI 57


>ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268376 [Solanum
            lycopersicum]
          Length = 717

 Score = 72.4 bits (176), Expect = 4e-10
 Identities = 51/191 (26%), Positives = 91/191 (47%), Gaps = 13/191 (6%)
 Frame = +3

Query: 3    GLVANIEKSSLFLAGVEDGIKEQLLTMTGFSLGQFPIRYLGLALSSKKWSKLECQQLIDK 182
            GL AN+ KSS++  GV+  +++Q++   G+++ + P +YLG+ LSSKK + ++   LI+K
Sbjct: 526  GLQANLNKSSIYCGGVQMEVRQQIIQQLGYTIEELPFKYLGVPLSSKKLNTIQWYPLIEK 585

Query: 183  MTQNQT*ILSTCLMLEG-CKY*MQYCFQ*RTSGAQYSSCPKVLREVDKFTETIFRAIRRK 359
            +        +  L   G  +      F  +   AQ    P    ++ K  E + R+    
Sbjct: 586  VMARINSWTAKKLSYAGRAQLVKTVLFGVQALWAQLFIIP---AKIIKLIEGLCRS---- 638

Query: 360  RGRFHWWHGREYVVQQSL--EDLMCK----------GVKVGTLHQLIILFWQL*EKQDSL 503
                + W G  YV +++L   D +C            +K+     +  L W L  K+D L
Sbjct: 639  ----YLWSGVGYVTKKALIAWDKVCSPKYEGGLGLINLKIWNRSAVTKLCWDLANKEDKL 694

Query: 504  WVRWVHGLYMK 536
            W++W+H  Y+K
Sbjct: 695  WIKWIHAYYIK 705


>ref|XP_002331075.1| predicted protein [Populus trichocarpa]
          Length = 517

 Score = 71.6 bits (174), Expect = 7e-10
 Identities = 32/79 (40%), Positives = 52/79 (65%), Gaps = 1/79 (1%)
 Frame = +2

Query: 215 LSYAGRLQILNAILFSIKNFWCSIFIMP-QSIKGGR*IYRNYL*GNKEEKRKVPLVAWER 391
           LSYAGR+Q++N++LFSI+ +W S+F++P Q IK    I +++L    + +     VAW++
Sbjct: 92  LSYAGRVQLINSVLFSIQVYWASLFLLPGQVIKNVEQIMKSFLWSGSDMRTTGAKVAWDQ 151

Query: 392 ICCPTKFGGPNVQGCKSWN 448
           +C P K GG  ++  K WN
Sbjct: 152 VCLPKKEGGLGIKSIKEWN 170



 Score = 62.4 bits (150), Expect = 4e-07
 Identities = 51/209 (24%), Positives = 92/209 (44%), Gaps = 6/209 (2%)
 Frame = +3

Query: 3   GLVANIEKSSLFLAGVEDGIKEQLLTMTGFSLGQFPIRYLGLALSSKKWSKLECQQLIDK 182
           GL  N  KS +FL+GV +  +EQ++ + GF  G+ P++YLG+ L S +   + C+ L+D+
Sbjct: 20  GLYPNPNKSDIFLSGVLNAEREQIIHILGFREGELPMKYLGVPLLSSRLKAIYCKGLVDR 79

Query: 183 MTQNQT*ILSTCLMLEG-CKY*MQYCFQ*RTSGAQYSSCP-KVLREVDKFTETIFRA--- 347
           +T          L   G  +      F  +   A     P +V++ V++  ++   +   
Sbjct: 80  ITSKVRHWTCRTLSYAGRVQLINSVLFSIQVYWASLFLLPGQVIKNVEQIMKSFLWSGSD 139

Query: 348 IRRKRGRFHWWHGREYVVQQSLEDLMCKGVKVGTLHQLIILFWQL*EKQD-SLWVRWVHG 524
           +R    +  W    +  + +    L  K +K      L+   W L    D S+W  W+  
Sbjct: 140 MRTTGAKVAW---DQVCLPKKEGGLGIKSIKEWNKIALLKHIWNLCNDSDGSIWSTWIRS 196

Query: 525 LYMKQDLSIWDHLPLHDCSWYWRKINTLK 611
             ++   + W      +CSW W KI  L+
Sbjct: 197 NLLR-GRNFWTIKTPQNCSWAWGKILKLR 224

