BLASTX nr result

ID: Rehmannia28_contig00020160 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia28_contig00020160
         (2482 letters)

Database: ./nr 
           84,704,028 sequences; 31,038,470,784 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAF02855.1|AC009324_4 Similar to retrotransposon proteins [Ar...   268   5e-72
gb|AAC61290.1| putative retroelement pol polyprotein [Arabidopsi...   262   2e-70
gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 proteas...   263   2e-70
ref|XP_008367623.1| PREDICTED: uncharacterized protein LOC103431...   261   3e-70
emb|CAN74381.1| hypothetical protein VITISV_007944 [Vitis vinifera]   261   1e-69
ref|XP_010051209.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   261   2e-69
emb|CAA19715.1| putative protein [Arabidopsis thaliana] gi|72695...   259   5e-69
gb|AAC67200.1| putative retroelement pol polyprotein [Arabidopsi...   259   6e-69
emb|CAB43904.1| putative protein [Arabidopsis thaliana] gi|72697...   257   2e-68
gb|ACP30598.1| disease resistance protein [Brassica rapa subsp. ...   255   1e-67
gb|KYP59043.1| Retrovirus-related Pol polyprotein from transposo...   239   1e-67
gb|AAL78658.1|AF405555_1 Hopscotch polyprotein [Fagus sylvatica]      232   2e-67
ref|XP_013658035.1| PREDICTED: uncharacterized protein LOC106362...   251   7e-67
emb|CAN68489.1| hypothetical protein VITISV_037543 [Vitis vinifera]   252   1e-66
ref|XP_008374284.1| PREDICTED: uncharacterized protein LOC103437...   241   2e-66
gb|AAC35532.1| contains similarity to proteases [Arabidopsis tha...   249   7e-66
emb|CAB40035.1| retrotransposon like protein [Arabidopsis thalia...   249   8e-66
gb|KYP75940.1| Retrovirus-related Pol polyprotein from transposo...   249   1e-65
emb|CAN79884.1| hypothetical protein VITISV_002539 [Vitis vinifera]   249   1e-65
emb|CAN81099.1| hypothetical protein VITISV_017741 [Vitis vinifera]   248   3e-65

>gb|AAF02855.1|AC009324_4 Similar to retrotransposon proteins [Arabidopsis thaliana]
          Length = 1522

 Score =  268 bits (685), Expect = 5e-72
 Identities = 122/242 (50%), Positives = 170/242 (70%), Gaps = 1/242 (0%)
 Frame = +3

Query: 501  LFGAYTRTAPLEIIHCDVWGPISSASFSGFRYFIIFIDEFSRFTWFYLMRTKSESVNCFL 680
            +   +  + PLE IHCD+WGP  ++S  GFRY+++FID +SRFTWFY ++ KS+  + F+
Sbjct: 506  MLSTFNASRPLERIHCDLWGPSPTSSVQGFRYYVVFIDHYSRFTWFYPLKLKSDFFSTFV 565

Query: 681  HFKALVENQLSTTIKKFQCDGARELIFGQFKSIIQSCGIQYRISCPHTPEQNGMAERKIR 860
             F+ LVENQL   IK FQCDG  E I  QF   +Q  GIQ  +SCP+TP+QNGMAERK R
Sbjct: 566  MFQKLVENQLGHKIKIFQCDGGGEFISSQFLKHLQDHGIQQNMSCPYTPQQNGMAERKHR 625

Query: 861  HITEIGNTLMFQASLPKTFWVESFSTAVFLINRLPTPIL-NQTSPFQKLFPTPPDYSSLR 1037
            HI E+G +++FQ+ LP  +W+ESF TA F+IN LPT  L N  SP+QKL+   P+YS+LR
Sbjct: 626  HIVELGLSMIFQSKLPLKYWLESFFTANFVINLLPTSSLDNNESPYQKLYGKAPEYSALR 685

Query: 1038 VYGCPCFPFLGATRTDKLSPKSVSCAFIGYSLDHCGYRCFDLQSGKVHLSRHVIFNETSF 1217
            V+GC C+P L    + K  P+S+ C F+GY+  + GYRC    +G++++SRHV+F+E + 
Sbjct: 686  VFGCACYPTLRDYASTKFDPRSLKCVFLGYNEKYKGYRCLYPPTGRIYISRHVVFDENTH 745

Query: 1218 PF 1223
            PF
Sbjct: 746  PF 747



 Score =  154 bits (388), Expect = 1e-34
 Identities = 85/193 (44%), Positives = 116/193 (60%)
 Frame = +2

Query: 1619 YPMVTRNRDNTRLAKKISDFIIYTTTVRPSVSTLALDFEPTSYSIASKHYQWRDAMNTEY 1798
            + MVTR ++   ++K    +++ T  V           EP + + A KH  W +AM  E 
Sbjct: 887  HAMVTRGKEG--ISKPNKRYVLLTHKVSIP--------EPKTVTEALKHPGWNNAMQEEM 936

Query: 1799 NALLQNNTWTLIPKPANANIIGCKWVFRIKRKADGSVERYKARLVAKGFNQEEGIDYSET 1978
                +  TWTL+P   N N++G  WVFR K  ADGS+++ KARLVAKGF QEEGIDY ET
Sbjct: 937  GNCKETETWTLVPYSPNMNVLGSMWVFRTKLHADGSLDKLKARLVAKGFKQEEGIDYLET 996

Query: 1979 FSPVVKPTTIRIILALAISRGWPIRQVDVNNAFLNGLLSEDVYMSQLPGFVHPQFPTHVW 2158
            +SPVV+  T+R+IL +A    W ++Q+DV NAFL+G L+E VYM Q  GFV    P HV 
Sbjct: 997  YSPVVRTPTVRLILHVATVLKWELKQMDVKNAFLHGDLTETVYMRQPAGFVDKSKPDHV- 1055

Query: 2159 MMVKKVVYGFYPS 2197
             ++ K +YG   S
Sbjct: 1056 CLLHKSLYGLKQS 1068


>gb|AAC61290.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1149

 Score =  262 bits (669), Expect = 2e-70
 Identities = 126/245 (51%), Positives = 168/245 (68%), Gaps = 1/245 (0%)
 Frame = +3

Query: 513  YTRTAPLEIIHCDVWGPISSASFSGFRYFIIFIDEFSRFTWFYLMRTKSESVNCFLHFKA 692
            +  + PLE +HCD+WGP   +S  GF+Y++IFID  SRF WFY ++ KS+  + F+ F++
Sbjct: 498  FIASRPLERVHCDLWGPAPVSSIQGFQYYVIFIDNRSRFCWFYPLKHKSDFCSLFMKFQS 557

Query: 693  LVENQLSTTIKKFQCDGARELIFGQFKSIIQSCGIQYRISCPHTPEQNGMAERKIRHITE 872
             VEN L T I  FQ DG  E    +F   +Q  GIQ+ ISCPHTP+QNG+AERK R +TE
Sbjct: 558  FVENLLQTKIGTFQSDGGGEFTSNRFLQHLQESGIQHYISCPHTPQQNGLAERKHRQLTE 617

Query: 873  IGNTLMFQASLPKTFWVESFSTAVFLINRLPTPIL-NQTSPFQKLFPTPPDYSSLRVYGC 1049
             G TLMFQ+  P+ FWVE+F TA FL N LPT  L + T+P+Q LF   PDYS+LR +GC
Sbjct: 618  RGLTLMFQSKAPQRFWVEAFFTANFLSNLLPTSALDSSTTPYQVLFGKAPDYSALRTFGC 677

Query: 1050 PCFPFLGATRTDKLSPKSVSCAFIGYSLDHCGYRCFDLQSGKVHLSRHVIFNETSFPFIM 1229
             CFP L A   +K  P+S+ C F+GY+  + GYRCF   + +V+LSRHV+F+E+SFPFI 
Sbjct: 678  ACFPTLRAYARNKFDPRSLKCIFLGYTEKYKGYRCFFPPTNRVYLSRHVLFDESSFPFID 737

Query: 1230 SPESL 1244
            +  SL
Sbjct: 738  TYTSL 742



 Score =  111 bits (277), Expect = 2e-21
 Identities = 53/102 (51%), Positives = 72/102 (70%)
 Frame = +2

Query: 1880 RIKRKADGSVERYKARLVAKGFNQEEGIDYSETFSPVVKPTTIRIILALAISRGWPIRQV 2059
            R+K   DGS+++YKARLVA+GF QEEGIDY ET+SPVV+  T+R +L L+    W ++Q+
Sbjct: 902  RVKLNVDGSLDKYKARLVAQGFKQEEGIDYLETYSPVVRSATVRAVLHLSTIMNWELKQM 961

Query: 2060 DVNNAFLNGLLSEDVYMSQLPGFVHPQFPTHVWMMVKKVVYG 2185
            DV N FL+G L+E VYM Q  GF+    P HV  ++ K +YG
Sbjct: 962  DVKNGFLHGDLTETVYMKQPAGFIDKAHPDHV-CLLHKALYG 1002


>gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 protease homolog from
            Arabidopsis thaliana BAC gb|AF080119 and is a member of
            the reverse transcriptase family PF|00078 [Arabidopsis
            thaliana]
          Length = 1415

 Score =  263 bits (672), Expect = 2e-70
 Identities = 122/256 (47%), Positives = 175/256 (68%)
 Frame = +3

Query: 480  LVSSHKVLFGAYTRTAPLEIIHCDVWGPISSASFSGFRYFIIFIDEFSRFTWFYLMRTKS 659
            L+S  +VL        PL+ IHCD+WGP    S  G +Y+ IF+D++SR++WFY +  KS
Sbjct: 502  LISDSRVLH-------PLDRIHCDLWGPSPVVSNQGLKYYAIFVDDYSRYSWFYPLHNKS 554

Query: 660  ESVNCFLHFKALVENQLSTTIKKFQCDGARELIFGQFKSIIQSCGIQYRISCPHTPEQNG 839
            E ++ F+ F+ LVENQL+T IK FQ DG  E +  + K+ +   GI +RISCP+TP+QNG
Sbjct: 555  EFLSVFISFQKLVENQLNTKIKVFQSDGGGEFVSNKLKTHLSEHGIHHRISCPYTPQQNG 614

Query: 840  MAERKIRHITEIGNTLMFQASLPKTFWVESFSTAVFLINRLPTPILNQTSPFQKLFPTPP 1019
            +AERK RH+ E+G +++F +  P+ FWVESF TA ++INRLP+ +L   SP++ LF   P
Sbjct: 615  LAERKHRHLVELGLSMLFHSHTPQKFWVESFFTANYIINRLPSSVLKNLSPYEALFGEKP 674

Query: 1020 DYSSLRVYGCPCFPFLGATRTDKLSPKSVSCAFIGYSLDHCGYRCFDLQSGKVHLSRHVI 1199
            DYSSLRV+G  C+P L     +K  P+S+ C F+GY+  + GYRCF   +GKV++SR+VI
Sbjct: 675  DYSSLRVFGSACYPCLRPLAQNKFDPRSLQCVFLGYNSQYKGYRCFYPPTGKVYISRNVI 734

Query: 1200 FNETSFPFIMSPESLL 1247
            FNE+  PF    +SL+
Sbjct: 735  FNESELPFKEKYQSLV 750



 Score =  167 bits (423), Expect = 6e-39
 Identities = 94/246 (38%), Positives = 139/246 (56%)
 Frame = +2

Query: 1448 SPTRLASSPINPELSTTSESTPIPHIVSPTLMXXXXXXXXXXXXXTFPLRNSQSHPSYPM 1627
            +P +L S PI+      S+ T       PT                      Q   S+ M
Sbjct: 774  APVQLFSKPIDLNTYAGSQVTEQLTDPEPTSNNEGSDEEVNPVAEEIAANQEQVINSHAM 833

Query: 1628 VTRNRDNTRLAKKISDFIIYTTTVRPSVSTLALDFEPTSYSIASKHYQWRDAMNTEYNAL 1807
             TR++    + K  + + + T+ +  +        EP + + A KH  W +A++ E N +
Sbjct: 834  TTRSKAG--IQKPNTRYALITSRMNTA--------EPKTLASAMKHPGWNEAVHEEINRV 883

Query: 1808 LQNNTWTLIPKPANANIIGCKWVFRIKRKADGSVERYKARLVAKGFNQEEGIDYSETFSP 1987
               +TW+L+P   + NI+  KWVF+ K   DGS+++ KARLVAKGF+QEEG+DY ETFSP
Sbjct: 884  HMLHTWSLVPPTDDMNILSSKWVFKTKLHPDGSIDKLKARLVAKGFDQEEGVDYLETFSP 943

Query: 1988 VVKPTTIRIILALAISRGWPIRQVDVNNAFLNGLLSEDVYMSQLPGFVHPQFPTHVWMMV 2167
            VV+  TIR++L ++ S+GWPI+Q+DV+NAFL+G L E V+M Q  GF+ PQ PTHV  + 
Sbjct: 944  VVRTATIRLVLDVSTSKGWPIKQLDVSNAFLHGELQEPVFMYQPSGFIDPQKPTHVCRLT 1003

Query: 2168 KKVVYG 2185
             K +YG
Sbjct: 1004 -KAIYG 1008


>ref|XP_008367623.1| PREDICTED: uncharacterized protein LOC103431265 [Malus domestica]
          Length = 1110

 Score =  261 bits (666), Expect = 3e-70
 Identities = 119/259 (45%), Positives = 177/259 (68%), Gaps = 2/259 (0%)
 Frame = +3

Query: 453  TNTIGANKVLVSSHKV--LFGAYTRTAPLEIIHCDVWGPISSASFSGFRYFIIFIDEFSR 626
            T +  ++  L  S K+  +F + T + PLE++H DVWGP    S  GF Y+IIF+D+F+R
Sbjct: 732  TKSFCSDCALAKSSKLPFVFSSCTTSKPLELVHTDVWGPSPQXSSQGFXYYIIFVDDFTR 791

Query: 627  FTWFYLMRTKSESVNCFLHFKALVENQLSTTIKKFQCDGARELIFGQFKSIIQSCGIQYR 806
            ++WFY ++ KS+ ++ F+ +K+LVEN L T I   + D   E +   F   + + GI ++
Sbjct: 792  YSWFYPLKCKSDVLSTFMEYKSLVENSLCTKIXTLRSDSGXEYLSTSFSQFLATHGIHHQ 851

Query: 807  ISCPHTPEQNGMAERKIRHITEIGNTLMFQASLPKTFWVESFSTAVFLINRLPTPILNQT 986
            ++CPHTPEQNG AERK RH+ E   TL+  + +P  +WVE+F+TA++LINRLPT   ++ 
Sbjct: 852  LTCPHTPEQNGCAERKHRHLVETARTLLTASKVPHIYWVEAFNTAIYLINRLPT--ASRQ 909

Query: 987  SPFQKLFPTPPDYSSLRVYGCPCFPFLGATRTDKLSPKSVSCAFIGYSLDHCGYRCFDLQ 1166
            SP++ L+   P+Y  L+V+GC CFP+L    + KL PKS +C F+GYSL+H GYRC D  
Sbjct: 910  SPWESLYHRAPNYDLLKVFGCACFPWLKPYTSSKLDPKSRACVFLGYSLNHKGYRCLDPA 969

Query: 1167 SGKVHLSRHVIFNETSFPF 1223
            + +V++SRHVIFNE+SFPF
Sbjct: 970  THRVYISRHVIFNESSFPF 988


>emb|CAN74381.1| hypothetical protein VITISV_007944 [Vitis vinifera]
          Length = 1884

 Score =  261 bits (668), Expect = 1e-69
 Identities = 125/250 (50%), Positives = 168/250 (67%), Gaps = 2/250 (0%)
 Frame = +3

Query: 480  LVSSHKVLFGAYTRTA--PLEIIHCDVWGPISSASFSGFRYFIIFIDEFSRFTWFYLMRT 653
            L  S K+ FG   + +  PL+ IHCD+WG   + S  G++Y+ +FID+ +R+TW Y +R 
Sbjct: 443  LGKSCKLPFGLRNKISSNPLDKIHCDLWGXAPNNSTQGYKYYXVFIDDHTRYTWLYPLRR 502

Query: 654  KSESVNCFLHFKALVENQLSTTIKKFQCDGARELIFGQFKSIIQSCGIQYRISCPHTPEQ 833
            KS+   CFL F+ LVENQL   IK FQ DG  E    +F++ +  CGI  ++SCP TPEQ
Sbjct: 503  KSDFFECFLKFQILVENQLERRIKIFQSDGGGEFQSIKFQNHLSKCGILQQVSCPGTPEQ 562

Query: 834  NGMAERKIRHITEIGNTLMFQASLPKTFWVESFSTAVFLINRLPTPILNQTSPFQKLFPT 1013
            NG+AERK RHI E+G T++F A LP + WV++F TAV+LINRLP+ +L   SPF  LF  
Sbjct: 563  NGVAERKHRHIVEMGLTMLFNAKLPLSLWVDAFLTAVYLINRLPSTVLKMESPFFMLFKQ 622

Query: 1014 PPDYSSLRVYGCPCFPFLGATRTDKLSPKSVSCAFIGYSLDHCGYRCFDLQSGKVHLSRH 1193
             P+Y SLR++GC CFP+L     +K SPK+  C FIGYS  H GYRC    + +V++SRH
Sbjct: 623  YPEYRSLRIFGCQCFPYLRDYGKNKFSPKTYPCVFIGYSSLHKGYRCLHPSTKRVYISRH 682

Query: 1194 VIFNETSFPF 1223
            VIFNE  FP+
Sbjct: 683  VIFNENCFPY 692



 Score =  172 bits (436), Expect = 2e-40
 Identities = 80/151 (52%), Positives = 107/151 (70%)
 Frame = +2

Query: 1733 EPTSYSIASKHYQWRDAMNTEYNALLQNNTWTLIPKPANANIIGCKWVFRIKRKADGSVE 1912
            EP +Y  A K   W  AM  E  AL+QN TW L+P+P   NI+G KWVF+ K K DG+++
Sbjct: 920  EPKTYRTALKIPHWLKAMQEEIKALIQNRTWDLVPRPPTTNIVGSKWVFKTKLKEDGTID 979

Query: 1913 RYKARLVAKGFNQEEGIDYSETFSPVVKPTTIRIILALAISRGWPIRQVDVNNAFLNGLL 2092
            RYKARLVA+GF+Q  G+D+ ETFSPV+K TTIR+I +LA+  GW +RQ+DV NAFL+G L
Sbjct: 980  RYKARLVARGFSQIPGLDFGETFSPVIKHTTIRMIFSLAVILGWKMRQLDVKNAFLHGFL 1039

Query: 2093 SEDVYMSQLPGFVHPQFPTHVWMMVKKVVYG 2185
             E+V+M Q PGF++   P HV   + + +YG
Sbjct: 1040 KEEVFMEQPPGFINEVLPNHV-CKLNRSLYG 1069


>ref|XP_010051209.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC104439906
            [Eucalyptus grandis]
          Length = 2021

 Score =  261 bits (666), Expect = 2e-69
 Identities = 113/234 (48%), Positives = 162/234 (69%)
 Frame = +3

Query: 522  TAPLEIIHCDVWGPISSASFSGFRYFIIFIDEFSRFTWFYLMRTKSESVNCFLHFKALVE 701
            + PLE +HCD+WGP    SF  F+Y++I +D+FSRF+W Y MR K++  + F+ F+  VE
Sbjct: 584  STPLEKVHCDIWGPSPVISFQNFKYYVILVDDFSRFSWIYPMRAKADFYDIFILFQKQVE 643

Query: 702  NQLSTTIKKFQCDGARELIFGQFKSIIQSCGIQYRISCPHTPEQNGMAERKIRHITEIGN 881
            NQL   IK FQCDG  E I  +F++ + S GI+ ++SCPHTPEQNG++ERK RH+ E+G 
Sbjct: 644  NQLDCKIKTFQCDGGGEFISHKFQTHLHSHGIKQQVSCPHTPEQNGVSERKHRHVVELGL 703

Query: 882  TLMFQASLPKTFWVESFSTAVFLINRLPTPILNQTSPFQKLFPTPPDYSSLRVYGCPCFP 1061
             +++ A +P  +WV++F TA+++IN LPTP L   SP+ +L+   P Y  LRV+GC CFP
Sbjct: 704  AMLYHAEVPLKYWVDAFLTAIYIINMLPTPKLQMQSPYSRLYQKEPSYDHLRVFGCACFP 763

Query: 1062 FLGATRTDKLSPKSVSCAFIGYSLDHCGYRCFDLQSGKVHLSRHVIFNETSFPF 1223
             L      KL  +S++C FIGYS  H GYRC    +G++++S+HVIF+E  FPF
Sbjct: 764  CLRPYAQHKLDSRSLTCIFIGYSARHKGYRCLVPTTGRIYISKHVIFDERFFPF 817



 Score =  176 bits (445), Expect = 1e-41
 Identities = 87/169 (51%), Positives = 115/169 (68%)
 Frame = +2

Query: 1679 IIYTTTVRPSVSTLALDFEPTSYSIASKHYQWRDAMNTEYNALLQNNTWTLIPKPANANI 1858
            II T      ++   +  EP S   A  +  W  AM  E +AL  N TW LIP+  + N+
Sbjct: 949  IIKTNPKYACLAEYKVPAEPRSVKSALANQGWYTAMKEEMDALHHNQTWILIPRTTDMNV 1008

Query: 1859 IGCKWVFRIKRKADGSVERYKARLVAKGFNQEEGIDYSETFSPVVKPTTIRIILALAISR 2038
            IGCKWVF+ K  +DGS++R KARLVAKGF+QEEG+D++ETFSPVVK  TIR++L++A+  
Sbjct: 1009 IGCKWVFKTKLASDGSLDRLKARLVAKGFHQEEGVDFTETFSPVVKHATIRLVLSVAMMN 1068

Query: 2039 GWPIRQVDVNNAFLNGLLSEDVYMSQLPGFVHPQFPTHVWMMVKKVVYG 2185
             WPI Q+DV NAFL+G L+E VYM Q PGF+  Q P HV  ++KK +YG
Sbjct: 1069 QWPIHQLDVKNAFLHGTLTETVYMEQPPGFIQSQTPNHV-CLLKKSLYG 1116


>emb|CAA19715.1| putative protein [Arabidopsis thaliana] gi|7269574|emb|CAB79576.1|
            putative protein [Arabidopsis thaliana]
          Length = 1318

 Score =  259 bits (661), Expect = 5e-69
 Identities = 119/233 (51%), Positives = 165/233 (70%), Gaps = 1/233 (0%)
 Frame = +3

Query: 528  PLEIIHCDVWGPISSASFSGFRYFIIFIDEFSRFTWFYLMRTKSESVNCFLHFKALVENQ 707
            PLE +HCD+WGP +  S  GFRY+ +FID +SRF+W Y ++ KS+  N FL F  LVENQ
Sbjct: 326  PLERVHCDLWGPTTITSVQGFRYYAVFIDHYSRFSWIYPLKLKSDFYNIFLAFHKLVENQ 385

Query: 708  LSTTIKKFQCDGARELIFGQFKSIIQSCGIQYRISCPHTPEQNGMAERKIRHITEIGNTL 887
            LS  I  FQCDG  E +  +F   +QS GIQ ++SCPHTP+QNG+AERK RH+ E+G ++
Sbjct: 386  LSQKISVFQCDGGGEFVSHKFLQHLQSHGIQQQLSCPHTPQQNGLAERKHRHLVELGLSM 445

Query: 888  MFQASLPKTFWVESFSTAVFLINRLPTPILNQT-SPFQKLFPTPPDYSSLRVYGCPCFPF 1064
            +FQ+ +P  FWVE+F TA FLIN LPT  L ++ SP++KL+   PDY+SLR +G  CFP 
Sbjct: 446  LFQSHVPHKFWVEAFFTANFLINLLPTSALKESISPYEKLYDKKPDYTSLRSFGSACFPT 505

Query: 1065 LGATRTDKLSPKSVSCAFIGYSLDHCGYRCFDLQSGKVHLSRHVIFNETSFPF 1223
            L     +K +P S+ C F+GY+  + GYRC    +G++++SRHVIF+E+ +PF
Sbjct: 506  LRDYAENKFNPCSLKCVFLGYNEKYKGYRCLYPPTGRLYISRHVIFDESVYPF 558



 Score =  154 bits (389), Expect = 8e-35
 Identities = 76/155 (49%), Positives = 103/155 (66%)
 Frame = +2

Query: 1733 EPTSYSIASKHYQWRDAMNTEYNALLQNNTWTLIPKPANANIIGCKWVFRIKRKADGSVE 1912
            EP + + A KH  W  AM  E     +  TW+L+P  ++ +++G KWVFR K  ADG++ 
Sbjct: 713  EPKTVTAALKHPGWTGAMTEEIGNCSETQTWSLVPYKSDMHVLGSKWVFRTKLHADGTLN 772

Query: 1913 RYKARLVAKGFNQEEGIDYSETFSPVVKPTTIRIILALAISRGWPIRQVDVNNAFLNGLL 2092
            + KAR+VAKGF QEEGIDY ET+SPVV+  T+R++L LA +  W I+Q+DV NAFL+G L
Sbjct: 773  KLKARIVAKGFLQEEGIDYLETYSPVVRTPTVRLVLHLATALNWDIKQMDVKNAFLHGDL 832

Query: 2093 SEDVYMSQLPGFVHPQFPTHVWMMVKKVVYGFYPS 2197
             E VYM+Q  GFV P  P HV  ++ K +YG   S
Sbjct: 833  KETVYMTQPAGFVDPSKPDHV-CLLHKSIYGLKQS 866


>gb|AAC67200.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1402

 Score =  259 bits (661), Expect = 6e-69
 Identities = 126/271 (46%), Positives = 177/271 (65%), Gaps = 12/271 (4%)
 Frame = +3

Query: 447  LSTNTIGANKV---------LVSSHKVLF--GAYTRTAPLEIIHCDVWGPISSASFSGFR 593
            + TN+I  NK          L  S ++ F   ++T   PLE +HCD+WGP    S  GFR
Sbjct: 483  VKTNSISINKTSKSLCEACQLGKSTRLPFVSSSFTSNRPLERVHCDLWGPSPITSVQGFR 542

Query: 594  YFIIFIDEFSRFTWFYLMRTKSESVNCFLHFKALVENQLSTTIKKFQCDGARELIFGQFK 773
            Y+ +FID +SRF+W Y ++ KS+  N F+ F  LVENQL+  I  FQCDG  E +  +F 
Sbjct: 543  YYAVFIDHYSRFSWIYPLKLKSDFYNIFVAFHKLVENQLNHKISVFQCDGGGEFVNHKFL 602

Query: 774  SIIQSCGIQYRISCPHTPEQNGMAERKIRHITEIGNTLMFQASLPKTFWVESFSTAVFLI 953
              +Q+ GIQ  IS PHTP+QNG+AERK RH+ E+G +++FQ+ +P  FWVE+F TA FLI
Sbjct: 603  QHLQNHGIQQHISYPHTPQQNGLAERKHRHLVELGLSMLFQSKVPLKFWVEAFFTANFLI 662

Query: 954  NRLPTPILNQT-SPFQKLFPTPPDYSSLRVYGCPCFPFLGATRTDKLSPKSVSCAFIGYS 1130
            N LPT  +    SP++KL  T PDY++LR +GC CFP +     +K  P+S+ C F+GY+
Sbjct: 663  NLLPTSAVEDAISPYEKLHQTTPDYTALRSFGCACFPTMRDYAMNKFDPRSLKCVFLGYN 722

Query: 1131 LDHCGYRCFDLQSGKVHLSRHVIFNETSFPF 1223
              + GYRC    +G+V++SRHVIF+ET++PF
Sbjct: 723  DKYKGYRCLYPPTGRVYISRHVIFDETAYPF 753



 Score =  149 bits (377), Expect = 2e-33
 Identities = 83/206 (40%), Positives = 119/206 (57%), Gaps = 3/206 (1%)
 Frame = +2

Query: 1589 PLRNSQSHP---SYPMVTRNRDNTRLAKKISDFIIYTTTVRPSVSTLALDFEPTSYSIAS 1759
            P+   Q+ P   ++PM+TR +           F+ +  T            EP + + A 
Sbjct: 883  PVPTQQAPPPTNTHPMITRAKVGITKPNPRYVFLSHKVTYP----------EPKTVTAAL 932

Query: 1760 KHYQWRDAMNTEYNALLQNNTWTLIPKPANANIIGCKWVFRIKRKADGSVERYKARLVAK 1939
            KH  W  AM  E     + NTW+L+P   N +++G KWVFR K  ADG++ + KAR+VAK
Sbjct: 933  KHPGWTGAMTEEMGNCSETNTWSLVPYTPNMHVLGSKWVFRTKLHADGTLNKLKARIVAK 992

Query: 1940 GFNQEEGIDYSETFSPVVKPTTIRIILALAISRGWPIRQVDVNNAFLNGLLSEDVYMSQL 2119
             F QEEGI Y ET+SPVV+  T++++L LA +  W ++Q+DV NAFL+G L+E VYM+Q 
Sbjct: 993  CFLQEEGIGYLETYSPVVRTPTVQLVLHLATALNWELKQMDVKNAFLHGDLNETVYMTQP 1052

Query: 2120 PGFVHPQFPTHVWMMVKKVVYGFYPS 2197
             GFV    PTHV  ++ K +YG   S
Sbjct: 1053 AGFVDKSKPTHV-CLLHKSIYGLKQS 1077


>emb|CAB43904.1| putative protein [Arabidopsis thaliana] gi|7269745|emb|CAB81478.1|
            putative protein [Arabidopsis thaliana]
          Length = 1415

 Score =  257 bits (657), Expect = 2e-68
 Identities = 118/232 (50%), Positives = 163/232 (70%), Gaps = 1/232 (0%)
 Frame = +3

Query: 531  LEIIHCDVWGPISSASFSGFRYFIIFIDEFSRFTWFYLMRTKSESVNCFLHFKALVENQL 710
            LE +HCD+WGP    S  GFRY++IFID +SRFTWFY +R KS+  + FL F+ +VENQ 
Sbjct: 482  LERVHCDLWGPAPVVSSQGFRYYVIFIDNYSRFTWFYPLRLKSDFFSVFLTFQKMVENQC 541

Query: 711  STTIKKFQCDGARELIFGQFKSIIQSCGIQYRISCPHTPEQNGMAERKIRHITEIGNTLM 890
               I  FQCDG  E I  QF S +  CGI+  ISCP+TP+QNG+AERK RHITE+G+++M
Sbjct: 542  QQKIASFQCDGGGEFISNQFVSHLAECGIRQLISCPYTPQQNGIAERKHRHITELGSSMM 601

Query: 891  FQASLPKTFWVESFSTAVFLINRLPTPIL-NQTSPFQKLFPTPPDYSSLRVYGCPCFPFL 1067
            FQ  +P+  WVE+F T+ FL N LP+ +L +Q SP++ L    P Y+SLRV+GC C+P L
Sbjct: 602  FQGKVPQFLWVEAFYTSNFLCNLLPSSVLKDQKSPYEVLMGKAPVYTSLRVFGCACYPNL 661

Query: 1068 GATRTDKLSPKSVSCAFIGYSLDHCGYRCFDLQSGKVHLSRHVIFNETSFPF 1223
                ++K  PKS+ C F GY+  + GY+CF   +GK++++RHV+F+E+ F F
Sbjct: 662  RPYASNKFDPKSLLCVFTGYNEKYKGYKCFHPPTGKIYINRHVLFDESKFLF 713



 Score =  154 bits (390), Expect = 6e-35
 Identities = 84/196 (42%), Positives = 117/196 (59%)
 Frame = +2

Query: 1598 NSQSHPSYPMVTRNRDNTRLAKKISDFIIYTTTVRPSVSTLALDFEPTSYSIASKHYQWR 1777
            N QSHP   M+TR++    + K    + ++T      V        P +   A K   W 
Sbjct: 845  NDQSHP---MITRSKSG--IFKPNPKYAMFTVKSNYPV--------PKTVKTALKDPGWT 891

Query: 1778 DAMNTEYNALLQNNTWTLIPKPANANIIGCKWVFRIKRKADGSVERYKARLVAKGFNQEE 1957
            DAM  EY++  + +TW L+P  +    +GC+WVF+ K KADG+++R KARLVAKG+ QEE
Sbjct: 892  DAMGEEYDSFEETHTWDLVPPDSFITPLGCRWVFKTKLKADGTLDRLKARLVAKGYEQEE 951

Query: 1958 GIDYSETFSPVVKPTTIRIILALAISRGWPIRQVDVNNAFLNGLLSEDVYMSQLPGFVHP 2137
            G+DY ET+SPVV+  T+R IL +A    W I+Q+DV NAFL+G L E VYM Q PGF + 
Sbjct: 952  GVDYMETYSPVVRTATVRTILHVATINKWEIKQLDVKNAFLHGDLKETVYMYQPPGFENQ 1011

Query: 2138 QFPTHVWMMVKKVVYG 2185
              P +V   + K +YG
Sbjct: 1012 DRPDYV-CKLNKAIYG 1026


>gb|ACP30598.1| disease resistance protein [Brassica rapa subsp. pekinensis]
          Length = 2301

 Score =  255 bits (652), Expect = 1e-67
 Identities = 121/251 (48%), Positives = 167/251 (66%), Gaps = 3/251 (1%)
 Frame = +3

Query: 480  LVSSHKVLFGA--YTRTAPLEIIHCDVWGPISSASFSGFRYFIIFIDEFSRFTWFYLMRT 653
            +  S ++ F A  +  T PLE IHCDVWGP    S   F+Y+++ ID +SR+ W Y M+ 
Sbjct: 501  MAKSSRLPFSASQFVATRPLERIHCDVWGPSPVVSVQEFKYYVVLIDNYSRYCWMYPMKK 560

Query: 654  KSESVNCFLHFKALVENQLSTTIKKFQCDGARELIFGQFKSIIQSCGIQYRISCPHTPEQ 833
            KS+  + F+ F++LV+NQ  TTI  FQCDG  E I  QF   +Q  GIQ  +SCPHTP+Q
Sbjct: 561  KSDFHSIFIAFQSLVQNQFHTTIGTFQCDGGGEFISNQFLLHLQKNGIQQLLSCPHTPQQ 620

Query: 834  NGMAERKIRHITEIGNTLMFQASLPKTFWVESFSTAVFLINRLP-TPILNQTSPFQKLFP 1010
            NG+AER+ RHI E+G +L+FQ+  P+ +WVE+F TA FL N LP +   N  SP++KL  
Sbjct: 621  NGLAERRHRHIVELGLSLLFQSRAPQKYWVEAFMTANFLSNLLPHSANTNTASPYEKLHN 680

Query: 1011 TPPDYSSLRVYGCPCFPFLGATRTDKLSPKSVSCAFIGYSLDHCGYRCFDLQSGKVHLSR 1190
              P Y +LR++GC CFP L     +KL P+S+ C F+GYS  + GYRC    +G+V++SR
Sbjct: 681  KSPSYDALRIFGCACFPMLRPYTQNKLDPRSLQCVFLGYSEKYKGYRCLLPATGRVYISR 740

Query: 1191 HVIFNETSFPF 1223
            HVIF+E+ FPF
Sbjct: 741  HVIFDESKFPF 751



 Score =  136 bits (342), Expect = 5e-29
 Identities = 72/154 (46%), Positives = 98/154 (63%)
 Frame = +2

Query: 1736 PTSYSIASKHYQWRDAMNTEYNALLQNNTWTLIPKPANANIIGCKWVFRIKRKADGSVER 1915
            P + + A KH  W ++M  E        TW+L+P   + ++IG  WVFR K  ADG+V+ 
Sbjct: 893  PRTVAEALKHPGWNNSMKEEIGNCELTKTWSLVPYTPDMHVIGNGWVFREKLNADGTVKS 952

Query: 1916 YKARLVAKGFNQEEGIDYSETFSPVVKPTTIRIILALAISRGWPIRQVDVNNAFLNGLLS 2095
             ++RLVA+G +QEEGIDY ET+SPVV+  T+RI+L +A    W I+Q+DV NAFL+G L 
Sbjct: 953  LRSRLVAQGCSQEEGIDYLETYSPVVRTATVRIVLHIATVLQWDIKQMDVANAFLHGDLH 1012

Query: 2096 EDVYMSQLPGFVHPQFPTHVWMMVKKVVYGFYPS 2197
            E VYMSQ  GFV    P HV  ++ K +YG   S
Sbjct: 1013 ETVYMSQPKGFVDESKPDHV-CLLHKSLYGLKQS 1045


>gb|KYP59043.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 426

 Score =  239 bits (611), Expect = 1e-67
 Identities = 121/243 (49%), Positives = 156/243 (64%)
 Frame = +3

Query: 528  PLEIIHCDVWGPISSASFSGFRYFIIFIDEFSRFTWFYLMRTKSESVNCFLHFKALVENQ 707
            PL+++  D+WGP   AS SG RY+IIFID FS+++W +L+ TKS++ + F  FK  VE Q
Sbjct: 19   PLQLVFSDIWGPSPVASSSGARYYIIFIDAFSKYSWIFLLHTKSQAFDAFNKFKTNVELQ 78

Query: 708  LSTTIKKFQCDGARELIFGQFKSIIQSCGIQYRISCPHTPEQNGMAERKIRHITEIGNTL 887
            L   +K  Q D A+E +  +F   +   GIQ+R+SCPHT EQNG  ERK RHITE   TL
Sbjct: 79   LGYRLKAIQTDNAKEYL--KFTKYLTENGIQHRLSCPHTHEQNGRPERKHRHITETSLTL 136

Query: 888  MFQASLPKTFWVESFSTAVFLINRLPTPILNQTSPFQKLFPTPPDYSSLRVYGCPCFPFL 1067
            +  ASLP  FW E+F TA  LIN LPTP L   SP+Q LF  PPDY+ L+ +GC C+P L
Sbjct: 137  LANASLPLHFWGEAFLTATTLINLLPTPSLQNISPYQILFHKPPDYNFLKTFGCGCYPLL 196

Query: 1068 GATRTDKLSPKSVSCAFIGYSLDHCGYRCFDLQSGKVHLSRHVIFNETSFPFIMSPESLL 1247
                  KL  KS  C F+GYS  H GY+C    +GK ++SRHV FNE+ FP+  SP   L
Sbjct: 197  RPYNQHKLDFKSHQCLFLGYSNQHKGYKCLS-PTGKCYISRHVKFNESFFPYKHSPNPFL 255

Query: 1248 LSN 1256
             S+
Sbjct: 256  SSS 258


>gb|AAL78658.1|AF405555_1 Hopscotch polyprotein [Fagus sylvatica]
          Length = 226

 Score =  232 bits (591), Expect = 2e-67
 Identities = 109/217 (50%), Positives = 144/217 (66%)
 Frame = +3

Query: 573  ASFSGFRYFIIFIDEFSRFTWFYLMRTKSESVNCFLHFKALVENQLSTTIKKFQCDGARE 752
            AS   F+Y++IF+D+ +R+TW Y ++ KS+  N FL F+ +VENQ    I+ FQCDG  E
Sbjct: 4    ASVQNFKYYVIFVDDHTRYTWLYPLKHKSDFFNTFLTFQRMVENQFERKIQIFQCDGGGE 63

Query: 753  LIFGQFKSIIQSCGIQYRISCPHTPEQNGMAERKIRHITEIGNTLMFQASLPKTFWVESF 932
                 F + +  CGI   +SCP TPEQNG+AERK RHI E G T++F A LPK  W+E+F
Sbjct: 64   FSLQAFLTHLNMCGIVQHVSCPGTPEQNGVAERKHRHIVETGLTMLFHARLPKNLWIEAF 123

Query: 933  STAVFLINRLPTPILNQTSPFQKLFPTPPDYSSLRVYGCPCFPFLGATRTDKLSPKSVSC 1112
             TAV+LINRLP+  L   +PF KL    PDY+SL+V+GC CFP+L     +K  PKS  C
Sbjct: 124  MTAVYLINRLPSSKLAHDTPFFKLHGVHPDYNSLKVFGCRCFPYLRDYAKNKFEPKSYPC 183

Query: 1113 AFIGYSLDHCGYRCFDLQSGKVHLSRHVIFNETSFPF 1223
             FIGYS  H GYRC    + +V+LSRHV+F+E   P+
Sbjct: 184  IFIGYSPLHKGYRCLHPPTKRVYLSRHVVFDEGILPY 220


>ref|XP_013658035.1| PREDICTED: uncharacterized protein LOC106362726 [Brassica napus]
          Length = 1062

 Score =  251 bits (640), Expect = 7e-67
 Identities = 125/264 (47%), Positives = 171/264 (64%), Gaps = 1/264 (0%)
 Frame = +3

Query: 435  KFYFLSTNTIGANKVLVSSHKVLFGAYTRTAPLEIIHCDVWGPISSASFSGFRYFIIFID 614
            +F F S N +  +K    +H++L     R    E IHCDVWGP    S  GFRY+IIFID
Sbjct: 409  EFTFDSQNVLVKDK---QTHQLLSQGSKRK---ERIHCDVWGPAPVTSVQGFRYYIIFID 462

Query: 615  EFSRFTWFYLMRTKSESVNCFLHFKALVENQLSTTIKKFQCDGARELIFGQFKSIIQSCG 794
              SRF+W Y ++ KSE    F  F+ LVENQ    I+  Q DG  + +  QF S + +CG
Sbjct: 463  NHSRFSWLYPLKLKSEVFATFKSFQCLVENQFERKIQILQTDGGGDFVNNQFASHLTNCG 522

Query: 795  IQYRISCPHTPEQNGMAERKIRHITEIGNTLMFQASLPKTFWVESFSTAVFLINRLPTPI 974
            I++ +SCPHTPEQNG+AERK RH+ E+G ++MF+A +P++ WVE+  TA FL N LP   
Sbjct: 523  IKHYLSCPHTPEQNGLAERKHRHLVELGMSMMFEAKMPQSLWVEALFTANFLTNLLPHTA 582

Query: 975  LNQT-SPFQKLFPTPPDYSSLRVYGCPCFPFLGATRTDKLSPKSVSCAFIGYSLDHCGYR 1151
            L +T SP++KL    P YS+LRV+GC C+P+L     +K  PKS+ C F+GY+    GYR
Sbjct: 583  LGKTSSPYEKLNGFTPHYSALRVFGCCCYPYLRPYSQNKFDPKSLLCVFLGYTEKQKGYR 642

Query: 1152 CFDLQSGKVHLSRHVIFNETSFPF 1223
            C    +G+V+LSRHV+F+ET FP+
Sbjct: 643  CLHPPTGRVYLSRHVLFDETRFPY 666



 Score = 96.7 bits (239), Expect = 7e-17
 Identities = 51/135 (37%), Positives = 82/135 (60%)
 Frame = +2

Query: 1604 QSHPSYPMVTRNRDNTRLAKKISDFIIYTTTVRPSVSTLALDFEPTSYSIASKHYQWRDA 1783
            Q   ++ MVTR +   R  K    ++++T  V+  V+      +P + + A  H  W  A
Sbjct: 938  QEDNTHTMVTRAKAGVR--KPNPRYVLHT--VKSEVT------KPKNLAEALHHPGWTAA 987

Query: 1784 MNTEYNALLQNNTWTLIPKPANANIIGCKWVFRIKRKADGSVERYKARLVAKGFNQEEGI 1963
            M  EY+   +  TW+L+P P++A+++GC WV +IK  ADG+ E+ ++RLVA+G  QEEG+
Sbjct: 988  MVDEYDTCQEMKTWSLVPPPSHAHVLGCGWVHKIKLNADGTAEKLRSRLVARGNEQEEGV 1047

Query: 1964 DYSETFSPVVKPTTI 2008
            DY ET+ PVV+  T+
Sbjct: 1048 DYLETYIPVVRTGTV 1062



 Score = 90.9 bits (224), Expect = 4e-15
 Identities = 39/80 (48%), Positives = 56/80 (70%)
 Frame = +3

Query: 984  TSPFQKLFPTPPDYSSLRVYGCPCFPFLGATRTDKLSPKSVSCAFIGYSLDHCGYRCFDL 1163
            +SP++KL    P YS+LRV+GC C+P+L     +K  PKS+ C F+GY+    GYRC   
Sbjct: 773  SSPYEKLNGFTPHYSALRVFGCCCYPYLRPYSQNKFDPKSLLCVFLGYTEKQKGYRCLHP 832

Query: 1164 QSGKVHLSRHVIFNETSFPF 1223
             +G+V+LSRHV+F+ET FP+
Sbjct: 833  PTGRVYLSRHVLFDETRFPY 852


>emb|CAN68489.1| hypothetical protein VITISV_037543 [Vitis vinifera]
          Length = 1449

 Score =  252 bits (643), Expect = 1e-66
 Identities = 118/252 (46%), Positives = 172/252 (68%), Gaps = 2/252 (0%)
 Frame = +3

Query: 489  SHKVLFGAYTRTA--PLEIIHCDVWGPISSASFSGFRYFIIFIDEFSRFTWFYLMRTKSE 662
            SHK+ F      A  PL ++H D+WGP S  S +G RYFI+F+D+FSRF+W Y + +K +
Sbjct: 561  SHKLPFNVXVSRASHPLALLHADLWGPXSIPSTTGARYFILFVDDFSRFSWIYPLHSKDQ 620

Query: 663  SVNCFLHFKALVENQLSTTIKKFQCDGARELIFGQFKSIIQSCGIQYRISCPHTPEQNGM 842
            +++ F+ FK+LVENQ ++ I+  + D   E  F  F S + + GI+ + SCP+TPEQNG 
Sbjct: 621  ALSVFIKFKSLVENQFNSRIQCLRSDNGGE--FKAFSSYLATHGIKSQFSCPYTPEQNGR 678

Query: 843  AERKIRHITEIGNTLMFQASLPKTFWVESFSTAVFLINRLPTPILNQTSPFQKLFPTPPD 1022
            AERK+RHI E G  L+  ASLP  FW+ +F TA+FLINRLPT +LN  SPFQ LF   P+
Sbjct: 679  AERKLRHIIETGLALLATASLPFKFWLYAFHTAIFLINRLPTKVLNYQSPFQILFGKSPN 738

Query: 1023 YSSLRVYGCPCFPFLGATRTDKLSPKSVSCAFIGYSLDHCGYRCFDLQSGKVHLSRHVIF 1202
            Y   +++GC C+P++     +KLS +S  C F+GYS +H GY C +  +G+++++RHV+F
Sbjct: 739  YHIFKIFGCLCYPYIRPYNKNKLSYRSSQCVFLGYSSNHKGYMCLNPLTGRLYVTRHVVF 798

Query: 1203 NETSFPFIMSPE 1238
            +ET FPF  +P+
Sbjct: 799  HETVFPFQSTPD 810



 Score =  143 bits (360), Expect = 3e-31
 Identities = 85/200 (42%), Positives = 120/200 (60%), Gaps = 5/200 (2%)
 Frame = +2

Query: 1463 ASSPINPELSTTSEST---PIPHIVSPTLMXXXXXXXXXXXXXTFPLRNSQSHPS--YPM 1627
            +S P++   S T+ ST   P+ ++ S T+              T     S+ HP+  +PM
Sbjct: 827  SSPPVSSLRSHTTXSTSSPPLTNMPSSTISLPDLIQVPFADIST-----SEPHPTNQHPM 881

Query: 1628 VTRNRDNTRLAKKISDFIIYTTTVRPSVSTLALDFEPTSYSIASKHYQWRDAMNTEYNAL 1807
            VTR ++   ++KK   F  + +             EPT+++ A K   W  AM  E++AL
Sbjct: 882  VTRAKNG--ISKKKVYFSSHIS-------------EPTTFTQAVKDSNWVLAMEKEFSAL 926

Query: 1808 LQNNTWTLIPKPANANIIGCKWVFRIKRKADGSVERYKARLVAKGFNQEEGIDYSETFSP 1987
             +NNTW L+P P+N NIIGCKWV+++K K DG+V+RYKARLVA+GF Q  G+DY ETFSP
Sbjct: 927  QRNNTWHLVPPPSNGNIIGCKWVYKLKYKPDGTVDRYKARLVAQGFTQTLGLDYFETFSP 986

Query: 1988 VVKPTTIRIILALAISRGWP 2047
            VVK + IRIILA+A+S  WP
Sbjct: 987  VVKASXIRIILAVALSFNWP 1006


>ref|XP_008374284.1| PREDICTED: uncharacterized protein LOC103437576 [Malus domestica]
          Length = 581

 Score =  241 bits (614), Expect = 2e-66
 Identities = 115/255 (45%), Positives = 168/255 (65%), Gaps = 2/255 (0%)
 Frame = +3

Query: 465  GANKVLVSSHKVLFGAYTRTAP--LEIIHCDVWGPISSASFSGFRYFIIFIDEFSRFTWF 638
            G +  L  S K+ F + + T+   LE++H DVWGP    S +G+RY++IF+D+++++TWF
Sbjct: 323  GLDCALGKSSKLSFASVSCTSSKLLELLHTDVWGPAPLISVNGYRYYLIFVDDYTKYTWF 382

Query: 639  YLMRTKSESVNCFLHFKALVENQLSTTIKKFQCDGARELIFGQFKSIIQSCGIQYRISCP 818
            + +++KS   + F+ FK LVE  LS  I   + D   E +  +F   +Q  GI +++SCP
Sbjct: 383  FSLKSKSNVFDTFVQFKVLVETLLSAKIVILRSDSGGEFLSLKFIKFLQEHGISHQLSCP 442

Query: 819  HTPEQNGMAERKIRHITEIGNTLMFQASLPKTFWVESFSTAVFLINRLPTPILNQTSPFQ 998
            HTPEQNG  ERK RH+ E   TL+  + +P ++WVE+ +TA++LINR+PT I    SP++
Sbjct: 443  HTPEQNGCDERKHRHLVETTQTLLAASKVPYSYWVEAIATAIYLINRMPTTI--NFSPWE 500

Query: 999  KLFPTPPDYSSLRVYGCPCFPFLGATRTDKLSPKSVSCAFIGYSLDHCGYRCFDLQSGKV 1178
             LF    DY SL  +GC CFP+L    T KL PKS  C F+GYSL+H GY+C D  +G+V
Sbjct: 501  LLFHRSTDYISLNFFGCRCFPWLKPYTTSKLDPKSKECVFLGYSLNHKGYKCLDPSNGRV 560

Query: 1179 HLSRHVIFNETSFPF 1223
            +LSRHVIF+E +F F
Sbjct: 561  YLSRHVIFDEDTFLF 575


>gb|AAC35532.1| contains similarity to proteases [Arabidopsis thaliana]
          Length = 1392

 Score =  249 bits (637), Expect = 7e-66
 Identities = 114/238 (47%), Positives = 163/238 (68%), Gaps = 1/238 (0%)
 Frame = +3

Query: 513  YTRTAPLEIIHCDVWGPISSASFSGFRYFIIFIDEFSRFTWFYLMRTKSESVNCFLHFKA 692
            +  + PLE IHCD+WGP    S  GF+Y++IFID +SRFTWFY ++ KS+  + F+ F+ 
Sbjct: 510  FVSSRPLERIHCDLWGPAPVTSAQGFQYYVIFIDNYSRFTWFYPLKLKSDFFSVFVLFQQ 569

Query: 693  LVENQLSTTIKKFQCDGARELIFGQFKSIIQSCGIQYRISCPHTPEQNGMAERKIRHITE 872
            LVENQ    I  FQCDG  E +  +F + + SCGI+  ISCPHTP+QNG+AER+ R++TE
Sbjct: 570  LVENQYQHKIAMFQCDGGGEFVSYKFVAHLASCGIKQLISCPHTPQQNGIAERRHRYLTE 629

Query: 873  IGNTLMFQASLPKTFWVESFSTAVFLINRLPTPILNQT-SPFQKLFPTPPDYSSLRVYGC 1049
            +G +LMF + +P   WVE+F T+ FL N LP+  L+   SP++ L  TPP Y++LRV+G 
Sbjct: 630  LGLSLMFHSKVPHKLWVEAFFTSNFLSNLLPSSTLSDNKSPYEMLHGTPPVYTALRVFGS 689

Query: 1050 PCFPFLGATRTDKLSPKSVSCAFIGYSLDHCGYRCFDLQSGKVHLSRHVIFNETSFPF 1223
             C+P+L     +K  PKS+ C F+GY+  + GYRC    +GKV++ RHV+F+E  FP+
Sbjct: 690  ACYPYLRPYAKNKFDPKSLLCVFLGYNNKYKGYRCLHPPTGKVYICRHVLFDERKFPY 747



 Score =  144 bits (363), Expect = 1e-31
 Identities = 70/150 (46%), Positives = 97/150 (64%)
 Frame = +2

Query: 1733 EPTSYSIASKHYQWRDAMNTEYNALLQNNTWTLIPKPANANIIGCKWVFRIKRKADGSVE 1912
            EP S   A K   W +AM  E   + + +TW L+P      ++GCKWVF+ K  +DGS++
Sbjct: 796  EPKSVKEALKDEGWTNAMGEEMGTMHETDTWDLVPPEMVDRLLGCKWVFKTKLNSDGSLD 855

Query: 1913 RYKARLVAKGFNQEEGIDYSETFSPVVKPTTIRIILALAISRGWPIRQVDVNNAFLNGLL 2092
            R KARLVA+G+ QEEG+DY ET+SPVV+  T+R IL +A    W ++Q+DV NAFL+  L
Sbjct: 856  RLKARLVARGYEQEEGVDYVETYSPVVRSATVRSILHVATINKWSLKQLDVKNAFLHDEL 915

Query: 2093 SEDVYMSQLPGFVHPQFPTHVWMMVKKVVY 2182
             E V+M+Q PGF  P  P +V   +KK +Y
Sbjct: 916  KETVFMTQPPGFEDPSRPDYV-CKLKKAIY 944


>emb|CAB40035.1| retrotransposon like protein [Arabidopsis thaliana]
            gi|7267767|emb|CAB81170.1| retrotransposon like protein
            [Arabidopsis thaliana]
          Length = 1515

 Score =  249 bits (637), Expect = 8e-66
 Identities = 114/238 (47%), Positives = 163/238 (68%), Gaps = 1/238 (0%)
 Frame = +3

Query: 513  YTRTAPLEIIHCDVWGPISSASFSGFRYFIIFIDEFSRFTWFYLMRTKSESVNCFLHFKA 692
            +  + PLE IHCD+WGP    S  GF+Y++IFID +SRFTWFY ++ KS+  + F+ F+ 
Sbjct: 507  FVSSRPLERIHCDLWGPAPVTSAQGFQYYVIFIDNYSRFTWFYPLKLKSDFFSVFVLFQQ 566

Query: 693  LVENQLSTTIKKFQCDGARELIFGQFKSIIQSCGIQYRISCPHTPEQNGMAERKIRHITE 872
            LVENQ    I  FQCDG  E +  +F + + SCGI+  ISCPHTP+QNG+AER+ R++TE
Sbjct: 567  LVENQYQHKIAMFQCDGGGEFVSYKFVAHLASCGIKQLISCPHTPQQNGIAERRHRYLTE 626

Query: 873  IGNTLMFQASLPKTFWVESFSTAVFLINRLPTPILNQT-SPFQKLFPTPPDYSSLRVYGC 1049
            +G +LMF + +P   WVE+F T+ FL N LP+  L+   SP++ L  TPP Y++LRV+G 
Sbjct: 627  LGLSLMFHSKVPHKLWVEAFFTSNFLSNLLPSSTLSDNKSPYEMLHGTPPVYTALRVFGS 686

Query: 1050 PCFPFLGATRTDKLSPKSVSCAFIGYSLDHCGYRCFDLQSGKVHLSRHVIFNETSFPF 1223
             C+P+L     +K  PKS+ C F+GY+  + GYRC    +GKV++ RHV+F+E  FP+
Sbjct: 687  ACYPYLRPYAKNKFDPKSLLCVFLGYNNKYKGYRCLHPPTGKVYICRHVLFDERKFPY 744



 Score =  148 bits (373), Expect = 7e-33
 Identities = 89/254 (35%), Positives = 131/254 (51%), Gaps = 9/254 (3%)
 Frame = +2

Query: 1448 SPTRLASSPINPELST-------TSESTPIPHIVSPTLMXXXXXXXXXXXXXTFPLRNSQ 1606
            SP    S P  PE ST       T   T I   ++P  +                + ++ 
Sbjct: 825  SPITSTSLPTQPEESTSDQNHYSTDSETAISSAMTPQSINVSLFEDSDFPPLQSVISSTT 884

Query: 1607 SHP--SYPMVTRNRDNTRLAKKISDFIIYTTTVRPSVSTLALDFEPTSYSIASKHYQWRD 1780
            + P  S+PM+TR +    + K    + +++              EP S   A K   W +
Sbjct: 885  AAPETSHPMITRAKSG--ITKPNPKYALFSVKSNYP--------EPKSVKEALKDEGWTN 934

Query: 1781 AMNTEYNALLQNNTWTLIPKPANANIIGCKWVFRIKRKADGSVERYKARLVAKGFNQEEG 1960
            AM  E   + + +TW L+P      ++GCKWVF+ K  +DGS++R KARLVA+G+ QEEG
Sbjct: 935  AMGEEMGTMHETDTWDLVPPEMVDRLLGCKWVFKTKLNSDGSLDRLKARLVARGYEQEEG 994

Query: 1961 IDYSETFSPVVKPTTIRIILALAISRGWPIRQVDVNNAFLNGLLSEDVYMSQLPGFVHPQ 2140
            +DY ET+SPVV+  T+R IL +A    W ++Q+DV NAFL+  L E V+M+Q PGF  P 
Sbjct: 995  VDYVETYSPVVRSATVRSILHVATINKWSLKQLDVKNAFLHDELKETVFMTQPPGFEDPS 1054

Query: 2141 FPTHVWMMVKKVVY 2182
             P +V   +KK +Y
Sbjct: 1055 RPDYV-CKLKKAIY 1067


>gb|KYP75940.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial
            [Cajanus cajan]
          Length = 1403

 Score =  249 bits (635), Expect = 1e-65
 Identities = 121/246 (49%), Positives = 165/246 (67%), Gaps = 2/246 (0%)
 Frame = +3

Query: 489  SHKVLFG-AYT-RTAPLEIIHCDVWGPISSASFSGFRYFIIFIDEFSRFTWFYLMRTKSE 662
            SH + F  +YT   +PLE+++ DVWGP   AS  GFRY++ F D FS++TW Y M  KSE
Sbjct: 523  SHTLPFSDSYTVYNSPLELVYSDVWGPSHYASREGFRYYVHFTDAFSKYTWIYFMHNKSE 582

Query: 663  SVNCFLHFKALVENQLSTTIKKFQCDGARELIFGQFKSIIQSCGIQYRISCPHTPEQNGM 842
            + + F+HFK++VENQ    IK FQ DG +E  F     +    GI +R+SCPH+ +QNG 
Sbjct: 583  TAHHFIHFKSMVENQFGHKIKMFQSDGGKE--FTCLTKLFNENGITHRLSCPHSHQQNGT 640

Query: 843  AERKIRHITEIGNTLMFQASLPKTFWVESFSTAVFLINRLPTPILNQTSPFQKLFPTPPD 1022
            AERK RHITE+G TL+  + LP  FW ++FSTA+ +INRLPTPIL   SP++ LF   PD
Sbjct: 641  AERKHRHITEVGLTLLSSSGLPHIFWSDAFSTAIHVINRLPTPILGHKSPYEVLFNKIPD 700

Query: 1023 YSSLRVYGCPCFPFLGATRTDKLSPKSVSCAFIGYSLDHCGYRCFDLQSGKVHLSRHVIF 1202
            Y  LR +GC C+P L    + K+  +S  C FIGYS+ H GY+CF   +GK+ +SR+V+F
Sbjct: 701  YLLLRPFGCACYPHLRPFNSHKMDFRSSQCVFIGYSMQHKGYKCF-TTNGKIIISRNVVF 759

Query: 1203 NETSFP 1220
            +E +FP
Sbjct: 760  DEDTFP 765



 Score =  174 bits (440), Expect = 5e-41
 Identities = 84/151 (55%), Positives = 111/151 (73%)
 Frame = +2

Query: 1733 EPTSYSIASKHYQWRDAMNTEYNALLQNNTWTLIPKPANANIIGCKWVFRIKRKADGSVE 1912
            EP S + A  +  W+ AM  EYNALL+N+TW L+  P     I CKW+F+ K K DGS+ 
Sbjct: 883  EPISVAQALDNKHWKQAMQDEYNALLKNHTWDLVSLPPGRTAITCKWIFKNKYKQDGSIL 942

Query: 1913 RYKARLVAKGFNQEEGIDYSETFSPVVKPTTIRIILALAISRGWPIRQVDVNNAFLNGLL 2092
            R+KARLVA+GF+Q++G+DY++TFSPVVKP +IRI+LALA+S+GWPI Q+DV+NAFLNG L
Sbjct: 943  RHKARLVARGFSQQQGLDYTDTFSPVVKPVSIRIVLALAVSKGWPIHQIDVDNAFLNGDL 1002

Query: 2093 SEDVYMSQLPGFVHPQFPTHVWMMVKKVVYG 2185
             ED+YM Q  GF H   P  V   ++K +YG
Sbjct: 1003 KEDIYMLQPQGF-HNGAPNTV-CKLRKALYG 1031


>emb|CAN79884.1| hypothetical protein VITISV_002539 [Vitis vinifera]
          Length = 1453

 Score =  249 bits (635), Expect = 1e-65
 Identities = 116/261 (44%), Positives = 168/261 (64%), Gaps = 2/261 (0%)
 Frame = +3

Query: 447  LSTNTIGANKVLVSSHKVLFGAYTRTAP--LEIIHCDVWGPISSASFSGFRYFIIFIDEF 620
            L T ++ +   L  SH++ F + T  +   L ++HCD+WG     S  GF Y+++FID++
Sbjct: 439  LPTPSLCSTCQLAKSHRLPFSSNTTRSNVVLGLVHCDIWGLAPVKSNLGFNYYVLFIDDY 498

Query: 621  SRFTWFYLMRTKSESVNCFLHFKALVENQLSTTIKKFQCDGARELIFGQFKSIIQSCGIQ 800
            SRFTW Y ++ KS+  + FL F+ LVENQ ST IK FQ DG  E    +F+S +Q  GI 
Sbjct: 499  SRFTWLYPLKLKSDFFDIFLQFQKLVENQYSTKIKIFQSDGGAEFTSNRFQSHLQQFGIH 558

Query: 801  YRISCPHTPEQNGMAERKIRHITEIGNTLMFQASLPKTFWVESFSTAVFLINRLPTPILN 980
            +++SCP+TP QNG AERK RH+TE G  L+F + +P  +WV++FSTA ++INRLP P+L 
Sbjct: 559  HQMSCPYTPSQNGRAERKHRHVTETGLALLFHSHVPPRYWVDAFSTATYIINRLPLPVLG 618

Query: 981  QTSPFQKLFPTPPDYSSLRVYGCPCFPFLGATRTDKLSPKSVSCAFIGYSLDHCGYRCFD 1160
              SPF+ LF   P+Y +   +GC  +P L      K SP+S+ C F+GYS  H G+RCFD
Sbjct: 619  GLSPFEVLFGKSPNYENFHPFGCRVYPCLRDYAPHKFSPRSLPCIFLGYSSSHKGFRCFD 678

Query: 1161 LQSGKVHLSRHVIFNETSFPF 1223
              + + +++RH  F+E  FPF
Sbjct: 679  TTTSRTYITRHARFDEHFFPF 699



 Score =  154 bits (389), Expect = 8e-35
 Identities = 87/231 (37%), Positives = 123/231 (53%)
 Frame = +2

Query: 1490 STTSESTPIPHIVSPTLMXXXXXXXXXXXXXTFPLRNSQSHPSYPMVTRNRDNTRLAKKI 1669
            S+  EST     VSP                      S +  S+PM+TR +      +  
Sbjct: 758  SSAPESTSSSAAVSPVPASMTTLVPFAAPMDPIHTTTSATPASHPMITRAKSGIFKPRHP 817

Query: 1670 SDFIIYTTTVRPSVSTLALDFEPTSYSIASKHYQWRDAMNTEYNALLQNNTWTLIPKPAN 1849
            S      ++  P +  L    EP  +  A+K+  W  AM+ E   L  N+TW L+P+P+N
Sbjct: 818  SHLSFVQSS--PLIHALLATSEPKGFKSAAKNPAWLAAMDDEIKVLQTNHTWDLVPRPSN 875

Query: 1850 ANIIGCKWVFRIKRKADGSVERYKARLVAKGFNQEEGIDYSETFSPVVKPTTIRIILALA 2029
             NI+G KWVFR K  +DGS+ER+KARLVAKG+ Q             +  +T+R++L+L 
Sbjct: 876  TNIVGSKWVFRTKFLSDGSIERFKARLVAKGYTQ-------------LPASTVRVVLSLV 922

Query: 2030 ISRGWPIRQVDVNNAFLNGLLSEDVYMSQLPGFVHPQFPTHVWMMVKKVVY 2182
            +S  WP+ Q+DV NAFLNG+L E VYM Q PG+V P+ P HV   +KK +Y
Sbjct: 923  VSHKWPLCQLDVKNAFLNGILHETVYMEQPPGYVDPRHPLHV-CKLKKALY 972


>emb|CAN81099.1| hypothetical protein VITISV_017741 [Vitis vinifera]
          Length = 1455

 Score =  248 bits (632), Expect = 3e-65
 Identities = 129/251 (51%), Positives = 163/251 (64%), Gaps = 2/251 (0%)
 Frame = +3

Query: 516  TRTAPLEIIHCDVWGPISSASFSGFRYFIIFIDEFSRFTWFYLMRTKSESVNCFLHFKAL 695
            T T PLE+IH D+WGP    S SG+RY+I F+D FSRF+W +L+R KSE++  F++FK  
Sbjct: 575  TYTKPLELIHLDLWGPTLVLSNSGYRYYIHFVDAFSRFSWIFLLRNKSEAIKTFVNFKTQ 634

Query: 696  VENQLSTTIKKFQCDGARELIFGQFKSIIQSCGIQYRISCPHTPEQNGMAERKIRHITEI 875
            VE Q    IK  Q D   E  F  F+S +   GI +R+SCPHT +QNG+AERK R I E 
Sbjct: 635  VELQFDLKIKSLQTDWGGE--FRAFQSYLAENGIVHRVSCPHTQQQNGVAERKHRTIVEH 692

Query: 876  GNTLMFQASLPKTFWVESFSTAVFLINRLPTPILNQTSPFQKLFPTPPDYSSLRVYGCPC 1055
            G TL+  ASLP  FW ESF T V+L NRLPT IL+   P + LF + PDYS L+V+GC C
Sbjct: 693  GLTLLHTASLPLKFWDESFRTVVYLSNRLPTAILHHKCPIEVLFKSIPDYSFLKVFGCSC 752

Query: 1056 FPFLGATRTDKLSPKSVSCAFIGYSLDHCGYRCFDLQSGKVHLSRHVIFNETSFPF--IM 1229
            FP L    T KL  +S  C F+GYSL H GY+C    +G+V++S  VIFNETSFP+   +
Sbjct: 753  FPNLRPYNTHKLQYRSEECTFLGYSLKHKGYKCMS-SNGRVYISHDVIFNETSFPYSKTI 811

Query: 1230 SPESLLLSNSS 1262
               S LLS  S
Sbjct: 812  QVSSCLLSTVS 822



 Score =  181 bits (459), Expect = 2e-43
 Identities = 98/246 (39%), Positives = 142/246 (57%)
 Frame = +2

Query: 1448 SPTRLASSPINPELSTTSESTPIPHIVSPTLMXXXXXXXXXXXXXTFPLRNSQSHPSYPM 1627
            +P ++ S+P+         +TP+ H+VS                 T           +PM
Sbjct: 877  TPAQVVSNPV---------ATPVQHVVSSIADASVTRTIAKDADNT-----------HPM 916

Query: 1628 VTRNRDNTRLAKKISDFIIYTTTVRPSVSTLALDFEPTSYSIASKHYQWRDAMNTEYNAL 1807
            +TR +                  V+P +   A+  EP+S S A +  +W+ AM  EY+AL
Sbjct: 917  ITRAKSGI---------------VKPKIFIAAIR-EPSSVSAALQQDEWKKAMVAEYDAL 960

Query: 1808 LQNNTWTLIPKPANANIIGCKWVFRIKRKADGSVERYKARLVAKGFNQEEGIDYSETFSP 1987
             +NNTW+L+P PA    IGCKWV++ K   DG+V++YKARLVAKGF+Q+ G D++ETFSP
Sbjct: 961  QRNNTWSLVPLPAGRQAIGCKWVYKTKENPDGTVQKYKARLVAKGFHQQAGFDFTETFSP 1020

Query: 1988 VVKPTTIRIILALAISRGWPIRQVDVNNAFLNGLLSEDVYMSQLPGFVHPQFPTHVWMMV 2167
            VVKP+T+R++  +A+SR W I+Q+DVNNAFLNG L E+V+M Q  GF+  Q P  V   +
Sbjct: 1021 VVKPSTVRVVFTIALSRNWAIKQLDVNNAFLNGDLQEEVFMQQPQGFIDEQNPNLV-CRL 1079

Query: 2168 KKVVYG 2185
             K +YG
Sbjct: 1080 HKALYG 1085



 Score = 68.9 bits (167), Expect = 3e-08
 Identities = 47/150 (31%), Positives = 71/150 (47%), Gaps = 16/150 (10%)
 Frame = -3

Query: 2477 VSFEYFPDSCLIKDLNTGTPLLWGHIDSNLYS-----LPVSPPLSHASPMALAVSQLPSA 2313
            V FE+  DSC +KD  T   L+ G +   LY+     L + P  S +   ++  S   S 
Sbjct: 452  VFFEFHSDSCFVKDQVTQAVLMVGKVRDGLYAFDSSHLALRPTQSLSKSPSVVASSFSSK 511

Query: 2312 V-----------WHQRLGHPSQSTLALLKSRNFVNFVDSPISCSSCNSCMMGKSHKLPFS 2166
            V           WH+RLGHPS +T+  + S+  V  ++  +  + C+SC +GK H+ PFS
Sbjct: 512  VCTTSLSSTFDLWHKRLGHPSAATIKNVLSKCNVAHINK-MDSNFCSSCCLGKIHRFPFS 570

Query: 2165 PSSTRE*EIVDEQNQEVETYRHPQKVIHLE 2076
             S T              TY  P ++IHL+
Sbjct: 571  LSHT--------------TYTKPLELIHLD 586

