BLASTX nr result

ID: Alisma22_contig00017425 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Alisma22_contig00017425
         (2356 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

OMO61427.1 Zinc finger, CCCH-type [Corchorus capsularis]              728   0.0  
OMO89257.1 hypothetical protein CCACVL1_07965 [Corchorus capsula...   660   0.0  
AAK51235.1 polyprotein [Arabidopsis thaliana]                         659   0.0  
CAA19715.1 putative protein [Arabidopsis thaliana] CAB79576.1 pu...   655   0.0  
AAD21687.1 Strong similarity to gi|3600044 T12H20.12 protease ho...   647   0.0  
AAD43604.1 T3P18.3 [Arabidopsis thaliana]                             643   0.0  
JAU34057.1 Retrovirus-related Pol polyprotein from transposon TN...   614   0.0  
AAF02855.1 Similar to retrotransposon proteins [Arabidopsis thal...   634   0.0  
CAN67588.1 hypothetical protein VITISV_036280 [Vitis vinifera]        619   0.0  
CAB40035.1 retrotransposon like protein [Arabidopsis thaliana] C...   614   0.0  
CAA19714.1 putative protein [Arabidopsis thaliana] CAB79575.1 pu...   591   0.0  
ACP30598.1 disease resistance protein [Brassica rapa subsp. peki...   626   0.0  
BAH94406.1 Os08g0544300 [Oryza sativa Japonica Group]                 582   0.0  
CAN81099.1 hypothetical protein VITISV_017741 [Vitis vinifera]        589   0.0  
GAU44375.1 hypothetical protein TSUD_243070 [Trifolium subterran...   581   0.0  
JAU83197.1 Copia protein, partial [Noccaea caerulescens]              576   0.0  
AAR88589.1 putative copia-like retrotransposon protein [Oryza sa...   578   0.0  
AAC67200.1 putative retroelement pol polyprotein [Arabidopsis th...   577   0.0  
JAU04955.1 Copia protein, partial [Noccaea caerulescens]              558   0.0  
AAT85031.1 putative polyprotein [Oryza sativa Japonica Group] AB...   575   0.0  

>OMO61427.1 Zinc finger, CCCH-type [Corchorus capsularis]
          Length = 1996

 Score =  728 bits (1880), Expect = 0.0
 Identities = 391/785 (49%), Positives = 504/785 (64%), Gaps = 8/785 (1%)
 Frame = -2

Query: 2355 DLQSPLSQLYIKDPDYNSLRTFGCLCFPCMRDYNKHKLEPRSLPCVFLGYSDQYKGYRCL 2176
            D +SP   LY K PDY+ LR FG  CFP +R ++K+KLEPRSLPC+FLGYS+ +KGYRCL
Sbjct: 638  DWKSPFELLYNKSPDYSCLRVFGSKCFPFLRSHSKNKLEPRSLPCIFLGYSELHKGYRCL 697

Query: 2175 HVSSGRVYFSRHVIFNEDVFPYKD--KVVSQVPTGDVVHFNEFLNPKLADDVSESCTTIT 2002
            H  SGRVY SRHV F+E VFP+KD   + +   T D   F ++ +   AD+ +    T  
Sbjct: 698  HPPSGRVYISRHVTFDEKVFPFKDHGSLFAPSDTCDFTEFIDWFSGSPADEDTLGKPTTF 757

Query: 2001 SPTYVSTNSDFSSEPVTEHSFLDTLIVSHSDQPPPANSAHISETSTTDSPCDDSHESISA 1822
            +P +VS         +   SF D   VS++  P        S +ST +S           
Sbjct: 758  TPLHVSETQSLEDVSLGASSFPD---VSYATTP--------SHSSTPESA---------- 796

Query: 1821 NVDLSDITEDTPASTCANSDSHMDEQHDITAXXXXXXXXXXXSTRDMTEELNTNVHPMLT 1642
              D + I E+      +   +H+ ++  +                   +  +TN  P+ T
Sbjct: 797  -TDFTLINEEVHNPDPSIPMNHVQQEEVVI--------------NSEPQVPSTNSSPIAT 841

Query: 1641 RSKTGIVKPNPKY-----ALATYTIPQPPKTIRTALQHPGWFEAMTHEL*ALHSNNTWTL 1477
            R   GI KPNPKY          +IP  PK+++TAL+HP W  AM  E+ AL  N+TW L
Sbjct: 842  RQSHGIAKPNPKYFNDDFCFTATSIPIEPKSVKTALKHPDWKTAMEEEIHALMQNDTWEL 901

Query: 1476 VPKPTNANVIGCKWVYKTKIKADGSLDRLKARLVAKGFSQEEGVDYFETFSPVVKSATVR 1297
            VP+  + N++GCKWV+KTK KADGSL+RLKARLVAKGF+Q  GVD+ ETFSPVVK AT+R
Sbjct: 902  VPQSNSMNIVGCKWVFKTKTKADGSLERLKARLVAKGFNQVPGVDFLETFSPVVKPATIR 961

Query: 1296 VVLTLAVSCGWKLIQVDICNAFLNGELNEPVYMEQPPGFISKSGPQLVCKLHKALYGLKQ 1117
            VVLT+A++  W++ Q+D+ NAFL+G LNEPV+M QPPGF +   P  VCKL+KALYGL+Q
Sbjct: 962  VVLTIALARDWEIRQLDVKNAFLHGFLNEPVFMTQPPGFQNSQHPNYVCKLNKALYGLRQ 1021

Query: 1116 APRAWFNRLSEFLACSGFTCSTADSSLFVYQHNGXXXXXXXXXXXXXLTGNNAAIVVDFQ 937
            APRAWF+R S FL   GFTCS ADSSLFV Q +              LTG+N+  + DF 
Sbjct: 1022 APRAWFDRFSTFLLSFGFTCSVADSSLFVLQSSRGTILLLLYVDDIILTGSNSHFLRDFI 1081

Query: 936  EKLSAEFNLRILGPLSYFLGIQVLPNQSGILLSQTRYITDILMEAGLQDCKPASTTIATN 757
              L  EF+++ LGPL YFLG+ V P   GILL Q +Y  ++L  A + +CKP ST +AT 
Sbjct: 1082 AALGREFSMKDLGPLHYFLGVSVTPFDGGILLHQAQYARELLDRALMHNCKPISTPMAT- 1140

Query: 756  FNKMLSSTEYAEPLADSHIYRHIVGCLQYATITRPDISYAVNTACQFLASPTSAHYILVK 577
              K  SS       +D+  YR IVG LQY T TRPDI Y+VN  CQF+  PT+ H+ LVK
Sbjct: 1141 --KSGSSPNDDALYSDAPFYRSIVGGLQYLTFTRPDICYSVNYLCQFMHQPTNLHFRLVK 1198

Query: 576  HILRYLKGTLHTGLHIRPSSSLAIRAFSDSDWAGCPSTRRSTTGYCIFLGDTCVSWSAKK 397
             +LRY++GT+  G+ +     L +  FSD+DWAGC  TRRSTTGYC +LG  C+SWSAKK
Sbjct: 1199 RLLRYVQGTIDYGIRLLRHQPLELCGFSDADWAGCSLTRRSTTGYCTYLGGNCISWSAKK 1258

Query: 396  QSTVARSSCEAEYRALATTAAEMTWLAYLLRDVGVHLQQPAVLFCDNTSALHLTVNLVFH 217
            Q TVARSS +AEYRALA+ AAEMTWL+++LRD+GV+L++P VLF DN SALH+T+N VFH
Sbjct: 1259 QPTVARSSTKAEYRALASAAAEMTWLSFVLRDIGVYLKKPPVLFSDNISALHMTINPVFH 1318

Query: 216  ARTKHIEIDYHFIREKVALGHIVTAHIASDQQPADILTKALPTPSHAALRTKLGL-TTPT 40
            ARTKHIEIDYHF+REKV+ G +VT  ++S  Q AD+ TKALP  +   LR KLGL   P 
Sbjct: 1319 ARTKHIEIDYHFVREKVSAGSLVTQFVSSSNQVADVFTKALPRHALLLLRVKLGLCQIPQ 1378

Query: 39   PSLRG 25
            PSLRG
Sbjct: 1379 PSLRG 1383


>OMO89257.1 hypothetical protein CCACVL1_07965 [Corchorus capsularis]
          Length = 1215

 Score =  660 bits (1702), Expect = 0.0
 Identities = 360/732 (49%), Positives = 465/732 (63%), Gaps = 8/732 (1%)
 Frame = -2

Query: 2196 YKGYRCLHVSSGRVYFSRHVIFNEDVFPYKD--KVVSQVPTGDVVHFNEFLNPKLADDVS 2023
            +KGYRCLH  SGRVY SRHV F+E VFP+KD   + +   T D+  F ++ +   AD+ +
Sbjct: 493  HKGYRCLHPPSGRVYISRHVTFDEKVFPFKDPGSLFAPSDTCDLTEFIDWFSGSPADEDT 552

Query: 2022 ESCTTITSPTYVSTNSDFSSEPVTEHSFLDTLIVSHSDQPPPANSAHISETSTTDSPCDD 1843
                T ++P +VS         +   SF D   VS +  P        S +ST +S    
Sbjct: 553  LGKPTTSTPLHVSETQSLEDVSLGASSFPD---VSCATTP--------SHSSTPESA--- 598

Query: 1842 SHESISANVDLSDITEDTPASTCANSDSHMDEQHDITAXXXXXXXXXXXSTRDMTEELNT 1663
                     D++ I E+      +   +H+ ++  +                   +  +T
Sbjct: 599  --------TDVTLINEEVHNPDPSIPMNHVQQEEVVI--------------NSEPQVPST 636

Query: 1662 NVHPMLTRSKTGIVKPNPKY-----ALATYTIPQPPKTIRTALQHPGWFEAMTHEL*ALH 1498
            N  P+ TR   GI KPNPKY          +IP  PK+++TAL+HP W  AM  E+ AL 
Sbjct: 637  NSSPIATRQSHGIAKPNPKYFNDDFCFTATSIPIEPKSVKTALKHPDWKAAMEEEIHALM 696

Query: 1497 SNNTWTLVPKPTNANVIGCKWVYKTKIKADGSLDRLKARLVAKGFSQEEGVDYFETFSPV 1318
             N+TW LVP   + N++GCKWV+KTK KADGSL+RLKARLVAKGF+Q  GVD+ ETFSPV
Sbjct: 697  QNDTWELVPPSNSMNIVGCKWVFKTKTKADGSLERLKARLVAKGFNQVPGVDFLETFSPV 756

Query: 1317 VKSATVRVVLTLAVSCGWKLIQVDICNAFLNGELNEPVYMEQPPGFISKSGPQLVCKLHK 1138
            VK AT+RVVLT+A++  W++ Q+D+ NAFL+G LNEPV+M QPPGF +   P  VCKL+K
Sbjct: 757  VKPATIRVVLTIALARDWEIRQLDVKNAFLHGFLNEPVFMTQPPGFQNSQHPNYVCKLNK 816

Query: 1137 ALYGLKQAPRAWFNRLSEFLACSGFTCSTADSSLFVYQHNGXXXXXXXXXXXXXLTGNNA 958
            ALYGL+QAPRAWF+R S FL   GFTCS ADSSLFV Q +              LTG+N+
Sbjct: 817  ALYGLRQAPRAWFDRFSTFLLSFGFTCSVADSSLFVLQSSRGTILLLLYVDDIILTGSNS 876

Query: 957  AIVVDFQEKLSAEFNLRILGPLSYFLGIQVLPNQSGILLSQTRYITDILMEAGLQDCKPA 778
              + DF   L  EF+++ LGPL YFLG+ V P   GILL Q +Y  ++L  A + +CKP 
Sbjct: 877  HFLRDFIAALGREFSMKDLGPLHYFLGVSVTPFDGGILLHQAQYARELLDRALMHNCKPI 936

Query: 777  STTIATNFNKMLSSTEYAEPLADSHIYRHIVGCLQYATITRPDISYAVNTACQFLASPTS 598
            ST +AT   K  SS       +D+  YR IVG LQY T TRPDI Y+VN  CQF+  PT+
Sbjct: 937  STPMAT---KSGSSPNDDALYSDAPFYRSIVGGLQYLTFTRPDICYSVNYLCQFMHQPTN 993

Query: 597  AHYILVKHILRYLKGTLHTGLHIRPSSSLAIRAFSDSDWAGCPSTRRSTTGYCIFLGDTC 418
             H+ LVK +LRY++GT+  G+ +     L +  FSD+DWAGC  TRRSTTGYC +LG  C
Sbjct: 994  LHFRLVKRLLRYVQGTIDYGIRLLRHQPLELCGFSDADWAGCSLTRRSTTGYCTYLGGNC 1053

Query: 417  VSWSAKKQSTVARSSCEAEYRALATTAAEMTWLAYLLRDVGVHLQQPAVLFCDNTSALHL 238
            +SWSAKKQ TVARSS EAEYRALA+ AAEMTWL+++LRD+GV+L++P VLF DN SALH+
Sbjct: 1054 ISWSAKKQPTVARSSTEAEYRALASAAAEMTWLSFVLRDIGVYLKKPPVLFSDNISALHM 1113

Query: 237  TVNLVFHARTKHIEIDYHFIREKVALGHIVTAHIASDQQPADILTKALPTPSHAALRTKL 58
            T+N VFHARTKHIEIDYHF+REKV+ G +VT  ++S  Q AD+ TKALP  +   LR KL
Sbjct: 1114 TINPVFHARTKHIEIDYHFVREKVSAGSLVTQFVSSSNQVADVFTKALPRHALLLLRVKL 1173

Query: 57   GL-TTPTPSLRG 25
            GL   P PSLRG
Sbjct: 1174 GLCQIPQPSLRG 1185


>AAK51235.1 polyprotein [Arabidopsis thaliana]
          Length = 1453

 Score =  659 bits (1700), Expect = 0.0
 Identities = 348/778 (44%), Positives = 479/778 (61%), Gaps = 2/778 (0%)
 Frame = -2

Query: 2346 SPLSQLYIKDPDYNSLRTFGCLCFPCMRDYNKHKLEPRSLPCVFLGYSDQYKGYRCLHVS 2167
            SPL  L  + P+Y  LR FG  C+PC+R   +HK EPRSL CVFLGY+ QYKGYRCL+  
Sbjct: 667  SPLEALLKQKPNYAMLRVFGTACYPCLRPLGEHKFEPRSLQCVFLGYNSQYKGYRCLYPP 726

Query: 2166 SGRVYFSRHVIFNEDVFPYKDKVVSQVPTGDVVHFNEFLNPKLADDVSESCTTITSPTYV 1987
            +GRVY SRHVIF+E+ FP+K K              +FL P+    +  +  +       
Sbjct: 727  TGRVYISRHVIFDEETFPFKQKY-------------QFLVPQYESSLLSAWQSSIP---- 769

Query: 1986 STNSDFSSEPVTEHSFLDTLIVSHSDQPPPANSAHISETSTTDSPCDDSHESISANVDLS 1807
               +D S  P  E   +++L      +PP      I +T+T  +      E +    +  
Sbjct: 770  --QADQSLIPQAEEGKIESLA-----KPPSIQKNTIQDTTTQPAILT---EGVLNEEEEE 819

Query: 1806 DITEDTPASTCANSDSHMDEQHDITAXXXXXXXXXXXSTRDMTEELNTNVHPMLTRSKTG 1627
            D  E+T   +        +++ ++T               +  ++   N HPM TRSK G
Sbjct: 820  DSFEETETESLNEETHTQNDEAEVTV-------------EEEVQQEPENTHPMTTRSKAG 866

Query: 1626 IVKPNPKYALATYTIP-QPPKTIRTALQHPGWFEAMTHEL*ALHSNNTWTLVPKPTNANV 1450
            I K N +YAL T     + PK+I  AL HPGW  A+  E+  +H  +TW+LV    + N+
Sbjct: 867  IHKSNTRYALLTSKFSVEEPKSIDEALNHPGWNNAVNDEMRTIHMLHTWSLVQPTEDMNI 926

Query: 1449 IGCKWVYKTKIKADGSLDRLKARLVAKGFSQEEGVDYFETFSPVVKSATVRVVLTLAVSC 1270
            +GC+WV+KTK+K DGS+D+LKARLVAKGF QEEG+DY ETFSPVV++AT+R+VL +A + 
Sbjct: 927  LGCRWVFKTKLKPDGSVDKLKARLVAKGFHQEEGLDYLETFSPVVRTATIRLVLDVATAK 986

Query: 1269 GWKLIQVDICNAFLNGELNEPVYMEQPPGFISKSGPQLVCKLHKALYGLKQAPRAWFNRL 1090
            GW + Q+D+ NAFL+GEL EPVYM QPPGF+ +  P  VC+L KALYGLKQAPRAWF+ +
Sbjct: 987  GWNIKQLDVSNAFLHGELKEPVYMLQPPGFVDQEKPSYVCRLTKALYGLKQAPRAWFDTI 1046

Query: 1089 SEFLACSGFTCSTADSSLFVYQHNGXXXXXXXXXXXXXLTGNNAAIVVDFQEKLSAEFNL 910
            S +L   GF+CS +D SLF Y  NG             LTG++  ++ +    L+  F++
Sbjct: 1047 SNYLLDFGFSCSKSDPSLFTYHKNGKTLVLLLYVDDILLTGSDHNLLQELLMSLNKRFSM 1106

Query: 909  RILGPLSYFLGIQVLPNQSGILLSQTRYITDILMEAGLQDCKPASTTIATNFNKMLSSTE 730
            + LG  SYFLG+++  +  G+ L QT Y  DIL +A + +C    T +  +    L+S  
Sbjct: 1107 KDLGAPSYFLGVEIESSPEGLFLHQTAYAKDILHQAAMSNCNSMPTPLPQHIEN-LNSDL 1165

Query: 729  YAEPLADSHIYRHIVGCLQYATITRPDISYAVNTACQFLASPTSAHYILVKHILRYLKGT 550
            + EP      +R + G LQY TITRPDI +AVN  CQ + SPT+A + L+K ILRY+KGT
Sbjct: 1166 FPEPT----YFRSLAGKLQYLTITRPDIQFAVNFICQRMHSPTTADFGLLKRILRYVKGT 1221

Query: 549  LHTGLHIRPSSSLAIRAFSDSDWAGCPSTRRSTTGYCIFLGDTCVSWSAKKQSTVARSSC 370
            +H GLHI+ + +L++ A+SDSDWAGC  TRRSTTG+C  LG   +SWSAK+Q TV++SS 
Sbjct: 1222 IHLGLHIKKNQNLSLVAYSDSDWAGCKETRRSTTGFCTLLGCNLISWSAKRQETVSKSST 1281

Query: 369  EAEYRALATTAAEMTWLAYLLRDVGVHLQQPAVLFCDNTSALHLTVNLVFHARTKHIEID 190
            EAEYRAL   A E+TWL++LLRD+GV    P ++ CDN SA++L+ N   H R+KH + D
Sbjct: 1282 EAEYRALTAVAQELTWLSFLLRDIGVTQTHPTLVKCDNLSAVYLSANPALHNRSKHFDTD 1341

Query: 189  YHFIREKVALGHIVTAHIASDQQPADILTKALPTPSHAALRTKLGLT-TPTPSLRGGI 19
            YH+IRE+VALG + T HI++  Q ADI TK LP  +   LR KLG+   PT SLRG +
Sbjct: 1342 YHYIREQVALGLVETKHISATLQLADIFTKPLPRRAFIDLRIKLGVAEPPTTSLRGNV 1399


>CAA19715.1 putative protein [Arabidopsis thaliana] CAB79576.1 putative protein
            [Arabidopsis thaliana]
          Length = 1318

 Score =  655 bits (1689), Expect = 0.0
 Identities = 346/781 (44%), Positives = 478/781 (61%), Gaps = 5/781 (0%)
 Frame = -2

Query: 2346 SPLSQLYIKDPDYNSLRTFGCLCFPCMRDYNKHKLEPRSLPCVFLGYSDQYKGYRCLHVS 2167
            SP  +LY K PDY SLR+FG  CFP +RDY ++K  P SL CVFLGY+++YKGYRCL+  
Sbjct: 480  SPYEKLYDKKPDYTSLRSFGSACFPTLRDYAENKFNPCSLKCVFLGYNEKYKGYRCLYPP 539

Query: 2166 SGRVYFSRHVIFNEDVFPYKDKVVSQVPTGDVVHFNEFLNPKLADDVSESCTTITSPTYV 1987
            +GR+Y SRHVIF+E V+P+        P          L   L    S + +T TSP+  
Sbjct: 540  TGRLYISRHVIFDESVYPFSHTYKHLHPQPRT----PLLAAWLRSSDSPAPSTSTSPSSR 595

Query: 1986 S---TNSDFSSEPVTEHSFLDTLIVSHSDQPPPANSAHISETSTTDSPCDDSHESISANV 1816
            S   T++DF   P  +   L TL+       P ++ +H S  +T  SP  DS  +   + 
Sbjct: 596  SPLFTSADFPPLPQRKTPLLPTLV-------PISSVSHASNITTQQSPDFDSERT--TDF 646

Query: 1815 DLSDITEDTPASTCANSDSHMDEQHDITAXXXXXXXXXXXSTRDMTEELNTNVHPMLTRS 1636
            D + I + + +S   +      +Q  +                      +TNVHPM+TR+
Sbjct: 647  DSASIGDSSHSSQAGSDSEETIQQASVNVHQT---------------HASTNVHPMVTRA 691

Query: 1635 KTGIVKPNPKYALATYTIPQP-PKTIRTALQHPGWFEAMTHEL*ALHSNNTWTLVPKPTN 1459
            K GI KPNP+Y   ++ +  P PKT+  AL+HPGW  AMT E+       TW+LVP  ++
Sbjct: 692  KVGISKPNPRYVFLSHKVSYPEPKTVTAALKHPGWTGAMTEEIGNCSETQTWSLVPYKSD 751

Query: 1458 ANVIGCKWVYKTKIKADGSLDRLKARLVAKGFSQEEGVDYFETFSPVVKSATVRVVLTLA 1279
             +V+G KWV++TK+ ADG+L++LKAR+VAKGF QEEG+DY ET+SPVV++ TVR+VL LA
Sbjct: 752  MHVLGSKWVFRTKLHADGTLNKLKARIVAKGFLQEEGIDYLETYSPVVRTPTVRLVLHLA 811

Query: 1278 VSCGWKLIQVDICNAFLNGELNEPVYMEQPPGFISKSGPQLVCKLHKALYGLKQAPRAWF 1099
             +  W + Q+D+ NAFL+G+L E VYM QP GF+  S P  VC LHK++YGLKQ+PRAWF
Sbjct: 812  TALNWDIKQMDVKNAFLHGDLKETVYMTQPAGFVDPSKPDHVCLLHKSIYGLKQSPRAWF 871

Query: 1098 NRLSEFLACSGFTCSTADSSLFVYQHNGXXXXXXXXXXXXXLTGNNAAIVVDFQEKLSAE 919
            ++ S FL   GF CS +D SLF+Y HN              +TGN++  +      L+ E
Sbjct: 872  DKFSTFLLEFGFFCSKSDPSLFIYAHNNNLILLLLYVDDMVITGNSSQTLTSLLAALNKE 931

Query: 918  FNLRILGPLSYFLGIQVLPNQSGILLSQTRYITDILMEAGLQDCKPASTTIATNFNKMLS 739
            F +  +G L YFLGIQV   Q+G+ +SQ +Y  D+L+ A ++ C P  T +    +++  
Sbjct: 932  FRMTDMGQLHYFLGIQVQRQQNGLFMSQQKYAEDLLIAASMEHCTPLPTPLPVQLDRV-- 989

Query: 738  STEYAEPLADSHIYRHIVGCLQYATITRPDISYAVNTACQFLASPTSAHYILVKHILRYL 559
                 E  +D   +R I G LQY T+TRPDI +AVN  CQ +  PT + + L+K ILRY+
Sbjct: 990  -PHQEELFSDPTYFRSIAGKLQYLTLTRPDIQFAVNFVCQKMHQPTISDFHLLKRILRYI 1048

Query: 558  KGTLHTGLHIRPSSSLAIRAFSDSDWAGCPSTRRSTTGYCIFLGDTCVSWSAKKQSTVAR 379
            KGT+  G+     S   ++A+SDSDW  C  TRRS  G C F+G   VSWS+KK  TV+R
Sbjct: 1049 KGTITMGISYSRDSPTLLQAYSDSDWGNCKQTRRSVGGLCTFMGTNLVSWSSKKHPTVSR 1108

Query: 378  SSCEAEYRALATTAAEMTWLAYLLRDVGVHLQQPAVLFCDNTSALHLTVNLVFHARTKHI 199
            SS EAEY++L+  A+E+ WL+ LLR++ + L     LFCDN SA++LT N  FHARTKH 
Sbjct: 1109 SSTEAEYKSLSDAASEILWLSTLLRELRIPLPDTPELFCDNLSAVYLTANPAFHARTKHF 1168

Query: 198  EIDYHFIREKVALGHIVTAHIASDQQPADILTKALPTPSHAALRTKLGLT-TPTPSLRGG 22
            +ID+HF+RE+VAL  +V  HI   +Q ADI TK+LP  +   LR KLG+T +PTPSLRG 
Sbjct: 1169 DIDFHFVRERVALKALVVKHIPGSEQIADIFTKSLPYEAFIHLRGKLGVTLSPTPSLRGT 1228

Query: 21   I 19
            I
Sbjct: 1229 I 1229


>AAD21687.1 Strong similarity to gi|3600044 T12H20.12 protease homolog from
            Arabidopsis thaliana BAC gb|AF080119 and is a member of
            the reverse transcriptase family PF|00078 [Arabidopsis
            thaliana]
          Length = 1415

 Score =  647 bits (1670), Expect = 0.0
 Identities = 347/778 (44%), Positives = 475/778 (61%), Gaps = 2/778 (0%)
 Frame = -2

Query: 2346 SPLSQLYIKDPDYNSLRTFGCLCFPCMRDYNKHKLEPRSLPCVFLGYSDQYKGYRCLHVS 2167
            SP   L+ + PDY+SLR FG  C+PC+R   ++K +PRSL CVFLGY+ QYKGYRC +  
Sbjct: 664  SPYEALFGEKPDYSSLRVFGSACYPCLRPLAQNKFDPRSLQCVFLGYNSQYKGYRCFYPP 723

Query: 2166 SGRVYFSRHVIFNEDVFPYKDKVVSQVPTGDVVHFNEFLNPKLADDVSESCTTITSPTYV 1987
            +G+VY SR+VIFNE   P+K+K  S VP        ++  P L        + I+ P   
Sbjct: 724  TGKVYISRNVIFNESELPFKEKYQSLVP--------QYSTPLLQAWQHNKISEISVP--- 772

Query: 1986 STNSDFSSEPVTEHSFLDTLIVSHSDQPPPANSAHISETSTTDSPCDDSHESISANVDLS 1807
            +      S+P+  +++  + +      P P      S    +D   +   E I+AN    
Sbjct: 773  AAPVQLFSKPIDLNTYAGSQVTEQLTDPEPT-----SNNEGSDEEVNPVAEEIAAN---- 823

Query: 1806 DITEDTPASTCANSDSHMDEQHDITAXXXXXXXXXXXSTRDMTEELNTNVHPMLTRSKTG 1627
                                                       +E   N H M TRSK G
Sbjct: 824  -------------------------------------------QEQVINSHAMTTRSKAG 840

Query: 1626 IVKPNPKYALATYTI-PQPPKTIRTALQHPGWFEAMTHEL*ALHSNNTWTLVPKPTNANV 1450
            I KPN +YAL T  +    PKT+ +A++HPGW EA+  E+  +H  +TW+LVP   + N+
Sbjct: 841  IQKPNTRYALITSRMNTAEPKTLASAMKHPGWNEAVHEEINRVHMLHTWSLVPPTDDMNI 900

Query: 1449 IGCKWVYKTKIKADGSLDRLKARLVAKGFSQEEGVDYFETFSPVVKSATVRVVLTLAVSC 1270
            +  KWV+KTK+  DGS+D+LKARLVAKGF QEEGVDY ETFSPVV++AT+R+VL ++ S 
Sbjct: 901  LSSKWVFKTKLHPDGSIDKLKARLVAKGFDQEEGVDYLETFSPVVRTATIRLVLDVSTSK 960

Query: 1269 GWKLIQVDICNAFLNGELNEPVYMEQPPGFISKSGPQLVCKLHKALYGLKQAPRAWFNRL 1090
            GW + Q+D+ NAFL+GEL EPV+M QP GFI    P  VC+L KA+YGLKQAPRAWF+  
Sbjct: 961  GWPIKQLDVSNAFLHGELQEPVFMYQPSGFIDPQKPTHVCRLTKAIYGLKQAPRAWFDTF 1020

Query: 1089 SEFLACSGFTCSTADSSLFVYQHNGXXXXXXXXXXXXXLTGNNAAIVVDFQEKLSAEFNL 910
            S FL   GF CS +D SLFV   +G             LTG++ +++ D  + L   F++
Sbjct: 1021 SNFLLDYGFVCSKSDPSLFVCHQDGKILYLLLYVDDILLTGSDQSLLEDLLQALKNRFSM 1080

Query: 909  RILGPLSYFLGIQVLPNQSGILLSQTRYITDILMEAGLQDCKPASTTIATNFNKMLSSTE 730
            + LGP  YFLGIQ+    +G+ L QT Y TDIL +AG+ DC P  T +    +  L+S  
Sbjct: 1081 KDLGPPRYFLGIQIEDYANGLFLHQTAYATDILQQAGMSDCNPMPTPLPQQLDN-LNSEL 1139

Query: 729  YAEPLADSHIYRHIVGCLQYATITRPDISYAVNTACQFLASPTSAHYILVKHILRYLKGT 550
            +AEP      +R + G LQY TITRPDI +AVN  CQ + SPT++ + L+K ILRY+KGT
Sbjct: 1140 FAEPT----YFRSLAGKLQYLTITRPDIQFAVNFICQRMHSPTTSDFGLLKRILRYIKGT 1195

Query: 549  LHTGLHIRPSSSLAIRAFSDSDWAGCPSTRRSTTGYCIFLGDTCVSWSAKKQSTVARSSC 370
            +  GL I+ +S+L + A+SDSD AGC +TRRSTTG+CI LG   +SWSAK+Q TV+ SS 
Sbjct: 1196 IGMGLPIKRNSTLTLSAYSDSDHAGCKNTRRSTTGFCILLGSNLISWSAKRQPTVSNSST 1255

Query: 369  EAEYRALATTAAEMTWLAYLLRDVGVHLQQPAVLFCDNTSALHLTVNLVFHARTKHIEID 190
            EAEYRAL   A E+TW+++LLRD+G+    P  ++CDN SA++L+ N   H R+KH + D
Sbjct: 1256 EAEYRALTYAAREITWISFLLRDLGIPQYLPTQVYCDNLSAVYLSANPALHNRSKHFDTD 1315

Query: 189  YHFIREKVALGHIVTAHIASDQQPADILTKALPTPSHAALRTKLGLT-TPTPSLRGGI 19
            YH+IRE+VALG I T HI++  Q AD+ TK+LP  +   LR+KLG++ +PTPSLRG +
Sbjct: 1316 YHYIREQVALGLIETQHISATFQLADVFTKSLPRRAFVDLRSKLGVSGSPTPSLRGSV 1373


>AAD43604.1 T3P18.3 [Arabidopsis thaliana]
          Length = 1309

 Score =  643 bits (1658), Expect = 0.0
 Identities = 345/781 (44%), Positives = 477/781 (61%), Gaps = 5/781 (0%)
 Frame = -2

Query: 2346 SPLSQLYIKDPDYNSLRTFGCLCFPCMRDYNKHKLEPRSLPCVFLGYSDQYKGYRCLHVS 2167
            SP   L+ +  DY  LR FG  C+PC+R   K+K +PRSL CVFLGY +QYKGYRCL+  
Sbjct: 509  SPYETLFQQKVDYTPLRVFGTACYPCLRPLAKNKFDPRSLQCVFLGYHNQYKGYRCLYPP 568

Query: 2166 SGRVYFSRHVIFNEDVFPYKDKVVSQVPTGDVVHFNEFLNPKLADDVSESCTTITSPTYV 1987
            +G+VY SRHVIF+E  FP+K+K  S VP         + +           T +T P+  
Sbjct: 569  TGKVYISRHVIFDEAQFPFKEKYHSLVPKYQTTLLQAWQH-----------TDLTPPSVP 617

Query: 1986 STNSDFSSEPVTEHSFLDTLIVSHSDQPPPANSAHISETSTTDSPCDDSHESISANVDLS 1807
            S+     +  VT         ++ S+  P  N               ++ E+++ N++ S
Sbjct: 618  SSQLQPLARQVTP--------MATSENQPMMNY--------------ETEEAVNVNMETS 655

Query: 1806 DITEDTPASTCANSDSHMDEQHDITAXXXXXXXXXXXSTRDMTEELNTNVHPMLTRSKTG 1627
               E         ++S+ +  H++                 +      N+HPM+TRSK G
Sbjct: 656  SDEE---------TESNDEFDHEVAPVLNDQNEDNALGQGSLE-----NLHPMITRSKDG 701

Query: 1626 IVKPNPKYAL-ATYTIPQPPKTIRTALQHPGWFEAMTHEL*ALHSNNTWTLVPKPTNANV 1450
            I KPNP+YAL  + +    PKTI TA++HPGW  A+  E+  +H  NTW+LVP   + N+
Sbjct: 702  IQKPNPRYALIVSKSSFDEPKTITTAMKHPGWNAAVMDEIDRIHMLNTWSLVPATEDMNI 761

Query: 1449 IGCKWVYKTKIKADGSLDRLKARLVAKGFSQEEGVDYFETFSPVVKSATVRVVLTLAVSC 1270
            +  KWV+KTK+K DG++D+LKARLVAKGF QEEGVDY ETFSPVV++AT+R+VL  A + 
Sbjct: 762  LTSKWVFKTKLKPDGTIDKLKARLVAKGFDQEEGVDYLETFSPVVRTATIRLVLDTATAN 821

Query: 1269 GWKLIQVDICNAFLNGELNEPVYMEQPPGFISKSGPQLVCKLHKALYGLKQAPRAWFNRL 1090
             W L Q+D+ NAFL+GEL EPV+M QP GF+  + P  VC+L KALYGLKQAPRAWF+  
Sbjct: 822  EWPLKQLDVSNAFLHGELQEPVFMFQPSGFVDPNKPNHVCRLTKALYGLKQAPRAWFDTF 881

Query: 1089 SEFLACSGFTCSTADSSLFVYQHNGXXXXXXXXXXXXXLTGNNAAIVVDFQEKLSAEFNL 910
            S FL   GF CST+D SLFV   NG             LTG++  ++    + L+  F++
Sbjct: 882  SNFLLDFGFECSTSDPSLFVCHQNGQSLILLLYVDDILLTGSDQLLMDKLLQALNNRFSM 941

Query: 909  RILGPLSYFLGIQVLPNQSGILLSQTRYITDILMEAGLQDCKPASTTIATNFNKMLSSTE 730
            + LGP  YFLGI++    +G+ L Q  Y +DIL +AG+ +C P  T +  +   + S   
Sbjct: 942  KDLGPPRYFLGIEIESYNNGLFLHQHAYASDILHQAGMTECNPMPTPLPQHLEDLNS--- 998

Query: 729  YAEPLADSHIYRHIVGCLQYATITRPDISYAVNTACQFLASPTSAHYILVKHILRYLKGT 550
              EP  +   +R + G LQY TITRPDI YAVN  CQ + +PT++ + L+K ILRY+KGT
Sbjct: 999  --EPFEEPTYFRSLAGKLQYLTITRPDIQYAVNFICQRMHAPTNSDFGLLKRILRYVKGT 1056

Query: 549  LHTGLHIRPSSSLAIRAFSDSDWAGCPSTRRSTTGYCIFLGDTCVSWSAKKQSTVARSSC 370
            ++ GL IR   +  +  F DSD+AGC  TRRSTTG+CI LG T +SWSAK+Q T++ SS 
Sbjct: 1057 INMGLPIRKHHNPVLSGFCDSDYAGCKDTRRSTTGFCILLGSTLISWSAKRQPTISHSST 1116

Query: 369  EAEYRALATTAAEMTWLAYLLRDVGVHLQQPAVLFCDNTSALHLTVNLVFHARTKHIEID 190
            EAEYRAL+ TA E+TW++ LLRD+G+   QP  +FCDN SA++L+ N   H R+KH + D
Sbjct: 1117 EAEYRALSDTAREITWISSLLRDLGISQHQPTRVFCDNLSAVYLSANPALHKRSKHFDKD 1176

Query: 189  YHFIREKVALGHIVTAHIASDQQPADILTKALPTPSHAALRTKLGLT----TPTPSLRGG 22
            +H+IRE+VALG I T HI +  Q AD+ TK+LP      LR KLG++    +PTPSL+ G
Sbjct: 1177 FHYIRERVALGLIETQHIPATIQLADVFTKSLPRRPFITLRAKLGVSASPVSPTPSLKEG 1236

Query: 21   I 19
            +
Sbjct: 1237 V 1237


>JAU34057.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial
            [Noccaea caerulescens]
          Length = 872

 Score =  614 bits (1584), Expect = 0.0
 Identities = 333/776 (42%), Positives = 464/776 (59%), Gaps = 2/776 (0%)
 Frame = -2

Query: 2346 SPLSQLYIKDPDYNSLRTFGCLCFPCMRDYNKHKLEPRSLPCVFLGYSDQYKGYRCLHVS 2167
            SP   LY + PDY  LRTFG  C+P +R    HKLEPRSL CVF+GYS Q+KGYRCL+  
Sbjct: 105  SPFQVLYKQKPDYTMLRTFGAACYPYLRPLADHKLEPRSLQCVFVGYSAQHKGYRCLYPP 164

Query: 2166 SGRVYFSRHVIFNEDVFPYKDKVVSQVPTGDVVHFNEFLNPKLADDVSESCTTITSPTYV 1987
            +G+VY  RHV+F+E++FP++ +  S VP     + ++ L                   + 
Sbjct: 165  TGKVYLCRHVVFDEELFPFRLQYESLVPR----YHSKLLK-----------------AWQ 203

Query: 1986 STNSDFSSEPVTEHSFLDTLIVSHSDQPPPANSAHISETSTTDSPCDDSHESISANVDLS 1807
            ++ +  S+E   ++  +  L +    Q PPA    ++E           HE +       
Sbjct: 204  ASTATHSTEKRQDNQVVRALPL----QSPPAT---VTEQQNDADGGFQVHEQVG------ 250

Query: 1806 DITEDTPASTCANSDSHMDEQHDITAXXXXXXXXXXXSTRDMTEELNTNVHPMLTRSKTG 1627
                D  + + A  ++ ++  H +T                             TR+K G
Sbjct: 251  ----DVSSGSEAGDEAQVENIHPMT-----------------------------TRAKAG 277

Query: 1626 IVKPNPKYALAT-YTIPQPPKTIRTALQHPGWFEAMTHEL*ALHSNNTWTLVPKPTNANV 1450
            I KPN +Y L T  ++P+ PK+I  A++HPGW  A+  E+  +H  NTWTLVP+  + NV
Sbjct: 278  IHKPNTRYVLLTSKSVPEVPKSIAAAMKHPGWNLAVMDEIGRIHMLNTWTLVPQTEDMNV 337

Query: 1449 IGCKWVYKTKIKADGSLDRLKARLVAKGFSQEEGVDYFETFSPVVKSATVRVVLTLAVSC 1270
            +  KWV+  K+  +G L++LKARLVAKGF QEEG+DY ETFSPVV++AT+R++L +A S 
Sbjct: 338  LSNKWVFTPKMNPNGELNKLKARLVAKGFDQEEGLDYLETFSPVVRTATIRMILDIATSK 397

Query: 1269 GWKLIQVDICNAFLNGELNEPVYMEQPPGFISKSGPQLVCKLHKALYGLKQAPRAWFNRL 1090
             W + Q+D+ NAFL+GEL EPVYM QP GF     P  VCKL KALYGLKQAPRAWF+  
Sbjct: 398  EWSIKQLDVSNAFLHGELKEPVYMFQPAGFEDAEKPDHVCKLTKALYGLKQAPRAWFDTF 457

Query: 1089 SEFLACSGFTCSTADSSLFVYQHNGXXXXXXXXXXXXXLTGNNAAIVVDFQEKLSAEFNL 910
            S ++   GFTCS AD SLF Y  NG             LTG++++++ +  + L+  F++
Sbjct: 458  SNYIIEFGFTCSKADPSLFTYYKNGKTMALLMYVDDMLLTGSDSSLLQELLDCLNKRFSM 517

Query: 909  RILGPLSYFLGIQVLPNQSGILLSQTRYITDILMEAGLQDCKPASTTIATNFNKMLSSTE 730
            + LG   YFLG+++     G+ L QT Y TDIL +A + DC P  T +    +  LSS  
Sbjct: 518  KDLGKPHYFLGVEIETYDGGMFLHQTAYATDILKQAAMFDCNPMPTPLPLQLDD-LSSEA 576

Query: 729  YAEPLADSHIYRHIVGCLQYATITRPDISYAVNTACQFLASPTSAHYILVKHILRYLKGT 550
            + EP      +R + G LQY TITRPDI +AVN  CQ +  PT + + L+K +LRY++GT
Sbjct: 577  FPEPT----YFRSLAGKLQYLTITRPDIQFAVNFICQRMHLPTVSDFSLLKRVLRYIRGT 632

Query: 549  LHTGLHIRPSSSLAIRAFSDSDWAGCPSTRRSTTGYCIFLGDTCVSWSAKKQSTVARSSC 370
            L  G+HIR    L + A+ DSD+AGC  TRRST+G+C  LG   +SWSAK+Q TV++SS 
Sbjct: 633  LTMGMHIRKDQELILHAYCDSDYAGCKETRRSTSGFCTMLGPNLLSWSAKRQQTVSKSST 692

Query: 369  EAEYRALATTAAEMTWLAYLLRDVGVHLQQPAVLFCDNTSALHLTVNLVFHARTKHIEID 190
            EAEYRAL  TA E+TWL+ LL+D+G+   Q  VL CDN SA++L+ N   H+R+KH + D
Sbjct: 693  EAEYRALTATAQELTWLSLLLKDLGIEQHQATVLKCDNLSAVYLSTNPALHSRSKHFDTD 752

Query: 189  YHFIREKVALGHIVTAHIASDQQPADILTKALPTPSHAALRTKLGLT-TPTPSLRG 25
            YH++RE+VALG I T H+ ++ Q ADI TK+LP      LR+KLG+   PT SLRG
Sbjct: 753  YHYVREQVALGFIETQHVPAELQLADIFTKSLPKGPFCDLRSKLGVAGPPTLSLRG 808


>AAF02855.1 Similar to retrotransposon proteins [Arabidopsis thaliana]
          Length = 1522

 Score =  634 bits (1634), Expect = 0.0
 Identities = 346/802 (43%), Positives = 473/802 (58%), Gaps = 25/802 (3%)
 Frame = -2

Query: 2349 QSPLSQLYIKDPDYNSLRTFGCLCFPCMRDYNKHKLEPRSLPCVFLGYSDQYKGYRCLHV 2170
            +SP  +LY K P+Y++LR FGC C+P +RDY   K +PRSL CVFLGY+++YKGYRCL+ 
Sbjct: 668  ESPYQKLYGKAPEYSALRVFGCACYPTLRDYASTKFDPRSLKCVFLGYNEKYKGYRCLYP 727

Query: 2169 SSGRVYFSRHVIFNEDVFPYKDKVVSQVPTGDVVHFNE-------FLNPKLADDVSESCT 2011
             +GR+Y SRHV+F+E+  P+ + + S +   D     E        + P   D      +
Sbjct: 728  PTGRIYISRHVVFDENTHPF-ESIYSHLHPQDKTPLLEAWFKSFHHVTPTQPDQSRYPVS 786

Query: 2010 TITSPTYVSTNSDFSSEPVTEHSFLDTLIVSHSDQPPPAN------SAHISETSTTDSPC 1849
            +I  P      +D S+ P +  +  +T   + SD     N      S     T+  DS  
Sbjct: 787  SIPQPE----TTDLSAAPASVAA--ETAGPNASDDTSQDNETISVVSGSPERTTGLDSAS 840

Query: 1848 -DDSHESISANVDLSDITEDTPASTCANSDSHMDEQHDITAXXXXXXXXXXXSTRDMTEE 1672
              DS+ S +A+         +PAS+   S   M     + A                   
Sbjct: 841  IGDSYHSPTADSSHPSPARSSPASSPQGSPIQMAPAQQVQAPV----------------- 883

Query: 1671 LNTNVHPMLTRSKTGIVKPNPKYALATYTIPQP-PKTIRTALQHPGWFEAMTHEL*ALHS 1495
              TN H M+TR K GI KPN +Y L T+ +  P PKT+  AL+HPGW  AM  E+     
Sbjct: 884  --TNEHAMVTRGKEGISKPNKRYVLLTHKVSIPEPKTVTEALKHPGWNNAMQEEMGNCKE 941

Query: 1494 NNTWTLVPKPTNANVIGCKWVYKTKIKADGSLDRLKARLVAKGFSQEEGVDYFETFSPVV 1315
              TWTLVP   N NV+G  WV++TK+ ADGSLD+LKARLVAKGF QEEG+DY ET+SPVV
Sbjct: 942  TETWTLVPYSPNMNVLGSMWVFRTKLHADGSLDKLKARLVAKGFKQEEGIDYLETYSPVV 1001

Query: 1314 KSATVRVVLTLAVSCGWKLIQVDICNAFLNGELNEPVYMEQPPGFISKSGPQLVCKLHKA 1135
            ++ TVR++L +A    W+L Q+D+ NAFL+G+L E VYM QP GF+ KS P  VC LHK+
Sbjct: 1002 RTPTVRLILHVATVLKWELKQMDVKNAFLHGDLTETVYMRQPAGFVDKSKPDHVCLLHKS 1061

Query: 1134 LYGLKQAPRAWFNRLSEFLACSGFTCSTADSSLFVYQHNGXXXXXXXXXXXXXLTGNNAA 955
            LYGLKQ+PRAWF+R S FL   GF CS  D SLFVY  N              +TGNN+ 
Sbjct: 1062 LYGLKQSPRAWFDRFSNFLLEFGFICSLFDPSLFVYSSNNDVILLLLYVDDMVITGNNSQ 1121

Query: 954  IVVDFQEKLSAEFNLRILGPLSYFLGIQVLPNQSGILLSQTRYITDILMEAGLQDCKPAS 775
             +      L+ EF ++ +G + YFLGIQ+     G+ +SQ +Y  D+L+ A + +C P  
Sbjct: 1122 SLTHLLAALNKEFRMKDMGQVHYFLGIQIQTYDGGLFMSQQKYAEDLLITASMANCSPMP 1181

Query: 774  TTIATNFNKMLSSTEYAEPLADSHIYRHIVGCLQYATITRPDISYAVNTACQFLASPTSA 595
            T +    +++ +  E     +D   +R + G LQY T+TRPDI +AVN  CQ +  P+ +
Sbjct: 1182 TPLPLQLDRVSNQDEV---FSDPTYFRSLAGKLQYLTLTRPDIQFAVNFVCQKMHQPSVS 1238

Query: 594  HYILVKHILRYLKGTLHTGLHIRPSSSLAI---------RAFSDSDWAGCPSTRRSTTGY 442
             + L+K ILRY+KGT+  G+    +SS  +          A+SDSD+A C  TRRS  GY
Sbjct: 1239 DFNLLKRILRYIKGTVSMGIQYNSNSSSVVSAYESDYDLSAYSDSDYANCKETRRSVGGY 1298

Query: 441  CIFLGDTCVSWSAKKQSTVARSSCEAEYRALATTAAEMTWLAYLLRDVGVHLQQPAVLFC 262
            C F+G   +SWS+KKQ TV+RSS EAEYR+L+ TA+E+ W++ +LR++GV L     LFC
Sbjct: 1299 CTFMGQNIISWSSKKQPTVSRSSTEAEYRSLSETASEIKWMSSILREIGVSLPDTPELFC 1358

Query: 261  DNTSALHLTVNLVFHARTKHIEIDYHFIREKVALGHIVTAHIASDQQPADILTKALPTPS 82
            DN SA++LT N  FHARTKH ++D+H+IRE+VAL  +V  HI    Q ADI TK+LP  +
Sbjct: 1359 DNLSAVYLTANPAFHARTKHFDVDHHYIRERVALKTLVVKHIPGHLQLADIFTKSLPFEA 1418

Query: 81   HAALRTKLGLT-TPTPSLRGGI 19
               LR KLG+   PTPSLRG I
Sbjct: 1419 FTRLRFKLGVDFPPTPSLRGCI 1440


>CAN67588.1 hypothetical protein VITISV_036280 [Vitis vinifera]
          Length = 1379

 Score =  619 bits (1597), Expect = 0.0
 Identities = 350/791 (44%), Positives = 466/791 (58%), Gaps = 15/791 (1%)
 Frame = -2

Query: 2346 SPLSQLYIKDPDYNSLRTFGCLCFPCMRDYNKHKLEPRSLPCVFLGYSDQYKGYRCLHVS 2167
            SP   L+ K P+Y +   FGC  +PC+RDY  HK  PRSLPC+FLGYS  +KG+RC   +
Sbjct: 61   SPFEVLFGKSPNYENFHPFGCRVYPCLRDYAPHKFSPRSLPCIFLGYSSSHKGFRCFDTT 120

Query: 2166 SGRVYFSRHVIFNEDVFPYKDKVVSQVPTGDVVHFNEFLNPKLADDVSESCTTIT----S 1999
            + R Y +RH  F+E  FP+ +   S     D+   N F    L    S S  T T    S
Sbjct: 121  TSRTYITRHARFDEHFFPFSN-TSSATSIADIGLSNFFEPCALEPSPSTSSPTTTRVPPS 179

Query: 1998 PTYVSTNSDFSSEPV-TEHSFLDTLIVSHSDQPPPANSAHISETSTTDSPCDDSHESISA 1822
            P       DF+ EP+    S  ++   S +  P PA++  +   +   +P D  H + SA
Sbjct: 180  PPCHFCADDFAVEPLQVSSSAPESTSSSAAVSPVPASATTLVPFA---APMDPIHTTTSA 236

Query: 1821 NVDLSDITEDTPASTCANSDSHMDEQHDITAXXXXXXXXXXXSTRDMTEELNTNVHPMLT 1642
                       PAS                                         HPM+T
Sbjct: 237  ----------APAS-----------------------------------------HPMIT 245

Query: 1641 RSKTGIVKP-NPKYALATYTIP--------QPPKTIRTALQHPGWFEAMTHEL*ALHSNN 1489
            R+K+GI KP +P +     + P          PK  ++A ++P W  AM  ++ AL +N+
Sbjct: 246  RAKSGIFKPRHPAHLSFVQSSPLIHALLATSEPKGFKSAAKNPAWLAAMDDKIKALQTNH 305

Query: 1488 TWTLVPKPTNANVIGCKWVYKTKIKADGSLDRLKARLVAKGFSQEEGVDYFETFSPVVKS 1309
            TW LVP+P+N N++G KWV++TK  + GS++R KARLVAKG++Q  G+DY +TFSPVVK+
Sbjct: 306  TWDLVPRPSNTNIVGSKWVFRTKFLSYGSIERFKARLVAKGYTQLPGLDYKDTFSPVVKA 365

Query: 1308 ATVRVVLTLAVSCGWKLIQVDICNAFLNGELNEPVYMEQPPGFISKSGPQLVCKLHKALY 1129
            +TVRVVL+LAVS  W L Q+D+ NAFLNG L+E VYMEQP G++    P  VCKL KALY
Sbjct: 366  STVRVVLSLAVSHKWPLRQLDVKNAFLNGILHETVYMEQPLGYVDPRHPLHVCKLKKALY 425

Query: 1128 GLKQAPRAWFNRLSEFLACSGFTCSTADSSLFVYQHNGXXXXXXXXXXXXXLTGNNAAIV 949
            GLKQAPRAWF R S FL   GF CS AD+SLFV+                 LTGNN A++
Sbjct: 426  GLKQAPRAWFQRFSSFLLKLGFFCSRADTSLFVFTKKDDLIYLLLYVDDIILTGNNPALI 485

Query: 948  VDFQEKLSAEFNLRILGPLSYFLGIQVLPNQSGILLSQTRYITDILMEAGLQDCKPASTT 769
              F  +L +EF ++ LGPLSYFLG++V     G  LSQ +Y TDIL  A L D KP  T 
Sbjct: 486  NRFISQLHSEFAVKDLGPLSYFLGLEVSYIPDGFFLSQVKYATDILARAQLLDSKPVPTP 545

Query: 768  IATNFNKMLSSTEYAEPLADSHIYRHIVGCLQYATITRPDISYAVNTACQFLASPTSAHY 589
            +  +  + LSS     P AD  +YR +VG LQY TITRP++++++N+  QFL +PT  H+
Sbjct: 546  MIVS--QRLSSE--GTPFADPTLYRSLVGALQYLTITRPNLAHSINSVSQFLHAPTEVHF 601

Query: 588  ILVKHILRYLKGTLHTGLHIRPSSSLAIRAFSDSDWAGCPSTRRSTTGYCIFLGDTCVSW 409
              VK ILRY++GTLH GL     SS+ + A+SD+DWAGCP TRRST+GY IFLG+  VSW
Sbjct: 602  QAVKRILRYVQGTLHFGLKFTSCSSMGLVAYSDADWAGCPDTRRSTSGYSIFLGNNLVSW 661

Query: 408  SAKKQSTVARSSCEAEYRALATTAAEMTWLAYLLRDVGVHLQQPAVLFCDNTSALHLTVN 229
            SAKKQ TV+RSSCE+EYRALA TAA++ WL +LLRD+ V L    +L CDN SA+ L+ N
Sbjct: 662  SAKKQPTVSRSSCESEYRALALTAAKVLWLTHLLRDLRVTLTHRPLLLCDNKSAIFLSSN 721

Query: 228  LVFHARTKHIEIDYHFIREKVALGHIVTAHIASDQQPADILTKALPTPSHAALRTKLGL- 52
             V H R KH+++DYHF+RE +  G +   HI S  Q AD+ TK++    +   R+KL + 
Sbjct: 722  PVSHKRAKHVDLDYHFLRELIVAGTLRIQHIPSHLQLADVFTKSVSRDLYVFFRSKLRVC 781

Query: 51   TTPTPSLRGGI 19
              PT SLRG +
Sbjct: 782  VNPTLSLRGAV 792


>CAB40035.1 retrotransposon like protein [Arabidopsis thaliana] CAB81170.1
            retrotransposon like protein [Arabidopsis thaliana]
          Length = 1515

 Score =  614 bits (1583), Expect = 0.0
 Identities = 334/797 (41%), Positives = 468/797 (58%), Gaps = 18/797 (2%)
 Frame = -2

Query: 2355 DLQSPLSQLYIKDPDYNSLRTFGCLCFPCMRDYNKHKLEPRSLPCVFLGYSDQYKGYRCL 2176
            D +SP   L+   P Y +LR FG  C+P +R Y K+K +P+SL CVFLGY+++YKGYRCL
Sbjct: 663  DNKSPYEMLHGTPPVYTALRVFGSACYPYLRPYAKNKFDPKSLLCVFLGYNNKYKGYRCL 722

Query: 2175 HVSSGRVYFSRHVIFNEDVFPYKDKVVSQVPT--------------GDVVHFNEFLNPKL 2038
            H  +G+VY  RHV+F+E  FPY D + SQ  T                     E  +  +
Sbjct: 723  HPPTGKVYICRHVLFDERKFPYSD-IYSQFQTISGSPLFTAWQKGFSSTALSRETPSTNV 781

Query: 2037 ADDVSESCTTITS-PTYVSTN-SDFSSEPVTEHSFLDTLIVSHSDQPPPANSAHISETST 1864
             D +  S T  +S PT  + N ++ ++ P  + +    ++V     P P  S  +  T  
Sbjct: 782  EDIIFPSATVSSSVPTGCAPNIAETATAPDVDVAAAHDMVVP----PSPITSTSLP-TQP 836

Query: 1863 TDSPCDDSHESISANVDLSDITEDTPASTCANSDSHMDEQHDITAXXXXXXXXXXXSTRD 1684
             +S  D +H S  +   +S        +     DS       + +               
Sbjct: 837  EESTSDQNHYSTDSETAISSAMTPQSINVSLFEDSDFPPLQSVISS-------------- 882

Query: 1683 MTEELNTNVHPMLTRSKTGIVKPNPKYALATYTIPQP-PKTIRTALQHPGWFEAMTHEL* 1507
             T       HPM+TR+K+GI KPNPKYAL +     P PK+++ AL+  GW  AM  E+ 
Sbjct: 883  -TTAAPETSHPMITRAKSGITKPNPKYALFSVKSNYPEPKSVKEALKDEGWTNAMGEEMG 941

Query: 1506 ALHSNNTWTLVPKPTNANVIGCKWVYKTKIKADGSLDRLKARLVAKGFSQEEGVDYFETF 1327
             +H  +TW LVP      ++GCKWV+KTK+ +DGSLDRLKARLVA+G+ QEEGVDY ET+
Sbjct: 942  TMHETDTWDLVPPEMVDRLLGCKWVFKTKLNSDGSLDRLKARLVARGYEQEEGVDYVETY 1001

Query: 1326 SPVVKSATVRVVLTLAVSCGWKLIQVDICNAFLNGELNEPVYMEQPPGFISKSGPQLVCK 1147
            SPVV+SATVR +L +A    W L Q+D+ NAFL+ EL E V+M QPPGF   S P  VCK
Sbjct: 1002 SPVVRSATVRSILHVATINKWSLKQLDVKNAFLHDELKETVFMTQPPGFEDPSRPDYVCK 1061

Query: 1146 LHKALYGLKQAPRAWFNRLSEFLACSGFTCSTADSSLFVYQHNGXXXXXXXXXXXXXLTG 967
            L KA+Y LKQAPRAWF++ S +L   GF CS +D SLFVY                 LTG
Sbjct: 1062 LKKAIYDLKQAPRAWFDKFSSYLLKYGFICSFSDPSLFVYLKGRDVMFLLLYVDDMILTG 1121

Query: 966  NNAAIVVDFQEKLSAEFNLRILGPLSYFLGIQVLPNQSGILLSQTRYITDILMEAGLQDC 787
            NN  ++      LS EF ++ +G L YFLGIQ   +  G+ LSQ +Y +D+L+ AG+ DC
Sbjct: 1122 NNDVLLQQLLNILSTEFRMKDMGALHYFLGIQAHYHNDGLFLSQEKYTSDLLVNAGMSDC 1181

Query: 786  KPASTTIATNFNKMLSSTEYAEPLADSHIYRHIVGCLQYATITRPDISYAVNTACQFLAS 607
                T +  +   +L      +P  +   +R + G LQY T+TRPDI +AVN  CQ + +
Sbjct: 1182 SSMPTPLQLD---LLQGNN--KPFPEPTYFRRLAGKLQYLTLTRPDIQFAVNFVCQKMHA 1236

Query: 606  PTSAHYILVKHILRYLKGTLHTGLHIRPSSSLAIRAFSDSDWAGCPSTRRSTTGYCIFLG 427
            PT + + L+K IL YLKGT+  G+++  ++   +R +SDSDWAGC  TRRST G+C FLG
Sbjct: 1237 PTMSDFHLLKRILHYLKGTMTMGINLSSNTDSVLRCYSDSDWAGCKDTRRSTGGFCTFLG 1296

Query: 426  DTCVSWSAKKQSTVARSSCEAEYRALATTAAEMTWLAYLLRDVGVHLQQPAVLFCDNTSA 247
               +SWSAK+  TV++SS EAEYR L+  A+E++W+ +LL+++G+  QQ   ++CDN SA
Sbjct: 1297 YNIISWSAKRHPTVSKSSTEAEYRTLSFAASEVSWIGFLLQEIGLPQQQIPEMYCDNLSA 1356

Query: 246  LHLTVNLVFHARTKHIEIDYHFIREKVALGHIVTAHIASDQQPADILTKALPTPSHAALR 67
            ++L+ N   H+R+KH ++DY+++RE+VALG +   HI + QQ ADI TK+LP      LR
Sbjct: 1357 VYLSANPALHSRSKHFQVDYYYVRERVALGALTVKHIPASQQLADIFTKSLPQAPFCDLR 1416

Query: 66   TKLGLT-TPTPSLRGGI 19
             KLG+   P  SLRG I
Sbjct: 1417 FKLGVVLPPDTSLRGCI 1433


>CAA19714.1 putative protein [Arabidopsis thaliana] CAB79575.1 putative protein
            [Arabidopsis thaliana]
          Length = 819

 Score =  591 bits (1523), Expect = 0.0
 Identities = 325/763 (42%), Positives = 452/763 (59%), Gaps = 6/763 (0%)
 Frame = -2

Query: 2289 GCLCFPCMRDYNKHKLEPRSLPCVFLGYSDQYKGYRCLHVSSGRVYFSRHVIFNEDVFPY 2110
            G  CFP +RDY ++K  P SL CVFLGY+++YKGYRCL+  +GR+Y SRHVIF+E V+P+
Sbjct: 15   GSACFPTLRDYAENKFNPCSLKCVFLGYNEKYKGYRCLYPPTGRLYISRHVIFDESVYPF 74

Query: 2109 KDKVVSQVPTGDVVHFNEFLNPKLADDVSESCTTITSPTYVS---TNSDFSSEPVTEHSF 1939
                    P          L   L    S + +T TSP+  S   T++DF   P  +   
Sbjct: 75   SHTYKHLHPQPRT----PLLAAWLRSSDSPAPSTSTSPSSRSPLFTSADFPPLPQRKTPL 130

Query: 1938 LDTLIVSHSDQPPPANSAHISETSTTDSPCDDSHESISANVDLSDITEDTPASTCANSDS 1759
            L TL+       P ++ +H S  +T  SP  DS  +   + D + I + + +S   +   
Sbjct: 131  LPTLV-------PISSVSHASNITTQQSPDFDSERT--TDFDSASIGDSSHSSQAGSDSE 181

Query: 1758 HMDEQHDITAXXXXXXXXXXXSTRDMTEELNTNVHPMLTRSKTGIVKPNPKYALATYTIP 1579
               +Q  +                      +TNVHPM+TR+K GI KPNP+Y   ++ + 
Sbjct: 182  ETIQQASVNVHQTPA---------------STNVHPMVTRAKVGISKPNPRYVFLSHKVS 226

Query: 1578 QP-PKTIRTALQHPGWFEAMTHEL*ALHSNNTWTLVPKPTNANVIGCKWVYKTKIKADGS 1402
             P PKT+  AL+HPGW  AMT E+       TW+LVP  ++ +V+G KWV++TK+ ADG+
Sbjct: 227  YPEPKTVTAALKHPGWTGAMTEEIGNCSETQTWSLVPYKSDMHVLGSKWVFRTKLHADGT 286

Query: 1401 LDRLKARLVAKGFSQEEGVDYFETFSPVVKSATVRVVLTLAVSCGWKLIQVDICNAFLNG 1222
            L++LKAR+VAKGF QEEG+DY ET+SPVV++ TVR+VL LA +  W + Q+D+ NAFL+G
Sbjct: 287  LNKLKARIVAKGFLQEEGIDYLETYSPVVRTPTVRLVLHLATALNWDIKQMDVKNAFLHG 346

Query: 1221 ELNEPVYMEQPPGFISKSGPQLVCKLHKALYGLKQAPRAWFNRLSEFLACSGFTCSTADS 1042
            +L E VYM QP      +    VC LHK++YGLKQ+PRAWF++ S FL   GF C  +D 
Sbjct: 347  DLKETVYMTQPA-----ANRDHVCLLHKSIYGLKQSPRAWFDKFSTFLLEFGFFCRKSDP 401

Query: 1041 SLFVYQHNGXXXXXXXXXXXXXLTGNNAAIVVDFQEKLSAEFNLRILGPLSY-FLGIQVL 865
            SLF+Y HN                   +  +      L+ EF +  +G  S  FLGIQV 
Sbjct: 402  SLFIYAHNNNLILLLL-----------SQTLTSLLAALNKEFRMTDMGQHSLTFLGIQVQ 450

Query: 864  PNQSGILLSQTRYITDILMEAGLQDCKPASTTIATNFNKMLSSTEYAEPLADSHIYRHIV 685
              Q+G+ +SQ +Y  D+L+ A ++ C P  T +    +++    E     +D   +R I 
Sbjct: 451  RQQNGLFMSQQKYAEDLLIAASMEHCTPLPTPLPVQLDRVPHQEEL---FSDPTYFRSIA 507

Query: 684  GCLQYATITRPDISYAVNTACQFLASPTSAHYILVKHILRYLKGTLHTGLHIRPSSSLAI 505
            G LQY T+TRPDI +AVN  CQ +  PT + + L+K ILRY+KGT+  G+     S   +
Sbjct: 508  GKLQYLTLTRPDIQFAVNFVCQKMHQPTISDFHLLKRILRYIKGTITMGISYSRDSPTLL 567

Query: 504  RAFSDSDWAGCPSTRRSTTGYCIFLGDTCVSWSAKKQSTVARSSCEAEYRALATTAAEMT 325
            +A+SDSDW  C  TRRS  G C F+G   VSWS+KK  TV+RSS EAEY++L+  A+E+ 
Sbjct: 568  QAYSDSDWGNCKQTRRSVGGLCTFMGTNLVSWSSKKHPTVSRSSTEAEYKSLSDAASEIL 627

Query: 324  WLAYLLRDVGVHLQQPAVLFCDNTSALHLTVNLVFHARTKHIEIDYHFIREKVALGHIVT 145
            WL+ LLR++ + L     LFCDN SA++LT N  FHARTKH +ID+HF+RE+VAL  +V 
Sbjct: 628  WLSTLLRELRIPLPDTPELFCDNLSAVYLTANPAFHARTKHFDIDFHFVRERVALKALVV 687

Query: 144  AHIASDQQPADILTKALPTPSHAALRTKLGLT-TPTPSLRGGI 19
             HI   +Q ADI TK+LP  +   LR KLG+T  PTPSLRG I
Sbjct: 688  KHIPGSEQIADIFTKSLPYEAFIHLRGKLGVTLPPTPSLRGTI 730


>ACP30598.1 disease resistance protein [Brassica rapa subsp. pekinensis]
          Length = 2301

 Score =  626 bits (1615), Expect = 0.0
 Identities = 335/783 (42%), Positives = 480/783 (61%), Gaps = 3/783 (0%)
 Frame = -2

Query: 2346 SPLSQLYIKDPDYNSLRTFGCLCFPCMRDYNKHKLEPRSLPCVFLGYSDQYKGYRCLHVS 2167
            SP  +L+ K P Y++LR FGC CFP +R Y ++KL+PRSL CVFLGYS++YKGYRCL  +
Sbjct: 673  SPYEKLHNKSPSYDALRIFGCACFPMLRPYTQNKLDPRSLQCVFLGYSEKYKGYRCLLPA 732

Query: 2166 SGRVYFSRHVIFNEDVFPYKDKVVSQVPTGDVVHFNEFLNPKLADDVSESCTTITSPTYV 1987
            +GRVY SRHVIF+E  FP+ D                 L+P     + E+    ++ + V
Sbjct: 733  TGRVYISRHVIFDESKFPFADVY-------------GHLHPPALTPLMEAWLQ-SNRSAV 778

Query: 1986 STNSDFSSEPVTEHSFLDTLIVSHSDQPPPANSAHISETSTTDSPCDDSHESISANVDLS 1807
            S +        T    L  +   H   P  +++   S  S        S E++S ++ ++
Sbjct: 779  SQSQSTQGRQETMQPRLCVIKPQHFVAPNSSSTGSCSVIS--------SSETMSTSLPIT 830

Query: 1806 DITEDTPASTCANSDSHMDEQHDITAXXXXXXXXXXXSTRDMTEELNTNVHPMLTRSKTG 1627
            D T        +NS      +H+ TA                   +  N H M TR K G
Sbjct: 831  DGTSQRLIDRESNSPQ---VEHNETALPRA--------------NMPVNNHQMTTRLKAG 873

Query: 1626 IVKPNPKYALATYTIPQP-PKTIRTALQHPGWFEAMTHEL*ALHSNNTWTLVPKPTNANV 1450
            I KPNP+YAL T  +  P P+T+  AL+HPGW  +M  E+       TW+LVP   + +V
Sbjct: 874  ITKPNPRYALLTQKVLCPRPRTVAEALKHPGWNNSMKEEIGNCELTKTWSLVPYTPDMHV 933

Query: 1449 IGCKWVYKTKIKADGSLDRLKARLVAKGFSQEEGVDYFETFSPVVKSATVRVVLTLAVSC 1270
            IG  WV++ K+ ADG++  L++RLVA+G SQEEG+DY ET+SPVV++ATVR+VL +A   
Sbjct: 934  IGNGWVFREKLNADGTVKSLRSRLVAQGCSQEEGIDYLETYSPVVRTATVRIVLHIATVL 993

Query: 1269 GWKLIQVDICNAFLNGELNEPVYMEQPPGFISKSGPQLVCKLHKALYGLKQAPRAWFNRL 1090
             W + Q+D+ NAFL+G+L+E VYM QP GF+ +S P  VC LHK+LYGLKQ+PRAWF++ 
Sbjct: 994  QWDIKQMDVANAFLHGDLHETVYMSQPKGFVDESKPDHVCLLHKSLYGLKQSPRAWFDKF 1053

Query: 1089 SEFLACSGFTCSTADSSLFVYQHNGXXXXXXXXXXXXXLTGNNAAIVVDFQEKLSAEFNL 910
            S +L   GF CS  D SLF+Y+                +TGN++ ++    ++L+ +F +
Sbjct: 1054 STYLIEFGFVCSIKDPSLFIYRRGKDIIMLLLYVDDMLITGNSSTVLAKLLDELNKQFRM 1113

Query: 909  RILGPLSYFLGIQVLPNQSGILLSQTRYITDILMEAGLQDCKPASTTIATNFNKMLSSTE 730
            + LG + YFLGIQ   + SG+ LSQ RY  D+L  AG+ +C    TT+AT     LS   
Sbjct: 1114 KDLGRMHYFLGIQATFHSSGMFLSQERYAKDLLATAGMSEC----TTVATPLPLQLSKVP 1169

Query: 729  YAEP-LADSHIYRHIVGCLQYATITRPDISYAVNTACQFLASPTSAHYILVKHILRYLKG 553
            + +    D   +R + G LQY T+TRPD+ Y+VN  CQ +  PT + ++L+K ILRY++G
Sbjct: 1170 HQDKKFEDPTYFRSLAGKLQYLTLTRPDLQYSVNYVCQKMHEPTVSDFMLLKRILRYVQG 1229

Query: 552  TLHTGLHIRPSSSLAIRAFSDSDWAGCPSTRRSTTGYCIFLGDTCVSWSAKKQSTVARSS 373
            TL  G++I   +   +RA+SDSDWAGC +TRRST G+C +LG   +SWS+KKQ TV+RSS
Sbjct: 1230 TLDYGVNIFKDTDFTLRAYSDSDWAGCHNTRRSTGGFCTYLGLNIISWSSKKQPTVSRSS 1289

Query: 372  CEAEYRALATTAAEMTWLAYLLRDVGVHLQQPAVLFCDNTSALHLTVNLVFHARTKHIEI 193
             EAEYR+L+ TA+E++W+  +LR++GV +Q    L+CDN SA++LT N  +H R+KH E+
Sbjct: 1290 TEAEYRSLSETASELSWMCSILREIGVPIQTTPELYCDNLSAVYLTANPAYHKRSKHFEL 1349

Query: 192  DYHFIREKVALGHIVTAHIASDQQPADILTKALPTPSHAALRTKLGL-TTPTPSLRGGIG 16
            DYH++RE+VALG ++  HI +  Q ADI TK L   +  +LR KLG+ ++PTPSLRG + 
Sbjct: 1350 DYHYVRERVALGALLVKHIPAHLQLADIFTKPLTFKAFDSLRYKLGVDSSPTPSLRGAVE 1409

Query: 15   *RA 7
             RA
Sbjct: 1410 DRA 1412


>BAH94406.1 Os08g0544300 [Oryza sativa Japonica Group]
          Length = 821

 Score =  582 bits (1501), Expect = 0.0
 Identities = 340/801 (42%), Positives = 461/801 (57%), Gaps = 37/801 (4%)
 Frame = -2

Query: 2343 PLSQLYIKDPDYNSLRTFGCLCFPCMRDYNKHKLEPRSLPCVFLGYSDQYKGYRCLHVSS 2164
            PL QL+ + P+Y +LR FGC  +P +R YNKHKL  RS  CVFLGYS+ +KG++CL +++
Sbjct: 30   PLEQLFKEKPNYTALRIFGCAVWPNLRPYNKHKLAFRSKRCVFLGYSNLHKGFKCLEIAT 89

Query: 2163 GRVYFSRHVIFNEDVFPYKD----------KVVSQVPTGDVVHF--------NEFLN--P 2044
            GRVY SR V F+E +FP+ +            +S +P   V H         N  LN  P
Sbjct: 90   GRVYVSRDVTFDESIFPFSELHSNAGACLRAEISLLPPSLVPHLSSLGGEQNNHVLNYPP 149

Query: 2043 KLADDVSESCTTITSPTYVSTNSDFSSEPVTEHSFL--------DTLIVSHSDQP----P 1900
             + D   E    I     V+   + ++    E++          D   V++   P    P
Sbjct: 150  NVTDQFGEENAEI-GEEIVANGEENAAAAADENAAAAANGGAQDDVHGVAYDASPEHSSP 208

Query: 1899 PANSAHISETSTTDSPCDDSHESISANVDLSDITEDTPASTCANSDSHMDEQHDITAXXX 1720
              + A  S      +P  + H  + A+   +  T  + AS+    D    +Q D T    
Sbjct: 209  VTDDATASAAEQHGNPIQEEH-LVQASPQTASSTSPSVASSAGVHDDVTTDQSDQT---- 263

Query: 1719 XXXXXXXXSTRDMTEELNTNVHPMLTRSKTGIVKPNPKYALAT-----YTIPQPPKTIRT 1555
                      + M E     + P  TR ++GI K    Y   T     +T    P+++  
Sbjct: 264  ---------DQAMPEAAVAPIRPK-TRLQSGIRKEKV-YTDGTVKWLNFTSSGEPQSLEE 312

Query: 1554 ALQHPGWFEAMTHEL*ALHSNNTWTLVPKPTNANVIGCKWVYKTKIKADGSLDRLKARLV 1375
            A+ +  W EAM  E  AL  N TW LVP     NVI CKWVYK K KADGSLDR KARLV
Sbjct: 313  AVNNKHWKEAMDAEYMALIENKTWHLVPPQKGRNVIDCKWVYKVKRKADGSLDRYKARLV 372

Query: 1374 AKGFSQEEGVDYFETFSPVVKSATVRVVLTLAVSCGWKLIQVDICNAFLNGELNEPVYME 1195
            AKGF Q  G+DY +TFSPVVK+AT+R+VL+LAVS GW L Q+D+ NAFL+G L E VYME
Sbjct: 373  AKGFKQRYGIDYEDTFSPVVKAATIRIVLSLAVSRGWSLRQLDVKNAFLHGVLEEEVYME 432

Query: 1194 QPPGFISKSGPQLVCKLHKALYGLKQAPRAWFNRLSEFLACSGFTCSTADSSLFVYQHNG 1015
            QPPG+  KS P  VCKL KALYGLKQAPRAW++RLS  L+  GF  S AD+SLF Y+   
Sbjct: 433  QPPGYEKKSMPNYVCKLDKALYGLKQAPRAWYSRLSTKLSELGFVPSKADTSLFFYKKGQ 492

Query: 1014 XXXXXXXXXXXXXLTGNNAAIVVDFQEKLSAEFNLRILGPLSYFLGIQVLPNQSGILLSQ 835
                         +  +         ++LS +F L+ LG L YFLGI+V   + G++LSQ
Sbjct: 493  VSIFLLIYVDDIIMASSVPDATSTLLQELSKDFALKDLGDLHYFLGIEVHKVKDGLMLSQ 552

Query: 834  TRYITDILMEAGLQDCKPASTTIATNFNKMLSSTEYAEPLADSHIYRHIVGCLQYATITR 655
             +Y +D+L   G+ +CKP ST ++T+    ++      P  DS  YR +VG LQY T+TR
Sbjct: 553  EKYASDLLRRVGMYECKPVSTPLSTSEKLSVNEGTLLGP-QDSTQYRSVVGALQYLTLTR 611

Query: 654  PDISYAVNTACQFLASPTSAHYILVKHILRYLKGTLHTGLHIRPSSSLAIRAFSDSDWAG 475
            PDIS+++N  CQFL +PT+ H+  VK ILRY+K T+ TGL    + SL +  FSD+DWAG
Sbjct: 612  PDISFSINKVCQFLHAPTTTHWAAVKRILRYVKYTVDTGLKFCRNPSLLVSGFSDADWAG 671

Query: 474  CPSTRRSTTGYCIFLGDTCVSWSAKKQSTVARSSCEAEYRALATTAAEMTWLAYLLRDVG 295
             P  RRST G+ +FLG   VSWSA+KQ+TV+RSS EAEY+ALA   AE+ W+  LL+++G
Sbjct: 672  SPDDRRSTGGFAVFLGPNLVSWSARKQATVSRSSTEAEYKALANATAEIMWVQTLLQELG 731

Query: 294  VHLQQPAVLFCDNTSALHLTVNLVFHARTKHIEIDYHFIREKVALGHIVTAHIASDQQPA 115
            V   + A L+CDN  A +L+ N +FHARTKHIE+D+HF+RE+VA   +  A+I++  Q A
Sbjct: 732  VESPRAAKLWCDNLGAKYLSANPIFHARTKHIEVDFHFVRERVARKLLEIAYISTKDQVA 791

Query: 114  DILTKALPTPSHAALRTKLGL 52
            D  TKA+P       +  L L
Sbjct: 792  DGFTKAIPVRQMEMFKNNLNL 812


>CAN81099.1 hypothetical protein VITISV_017741 [Vitis vinifera]
          Length = 1455

 Score =  589 bits (1519), Expect = 0.0
 Identities = 333/777 (42%), Positives = 457/777 (58%), Gaps = 2/777 (0%)
 Frame = -2

Query: 2343 PLSQLYIKDPDYNSLRTFGCLCFPCMRDYNKHKLEPRSLPCVFLGYSDQYKGYRCLHVSS 2164
            P+  L+   PDY+ L+ FGC CFP +R YN HKL+ RS  C FLGYS ++KGY+C+  S+
Sbjct: 731  PIEVLFKSIPDYSFLKVFGCSCFPNLRPYNTHKLQYRSEECTFLGYSLKHKGYKCMS-SN 789

Query: 2163 GRVYFSRHVIFNEDVFPYKDKV-VSQVPTGDVVHFNEFLNPKLADDVSESCTTITSPTYV 1987
            GRVY S  VIFNE  FPY   + VS      V      L+P  +  V  S T + +PT  
Sbjct: 790  GRVYISHDVIFNETSFPYSKTIQVSSCLLSTVSPSTSHLSPSASPPVL-SPTMLPTPTSP 848

Query: 1986 STNSDFSSEPVTEHSFLDTLIVSHSDQPPPANSAHISETSTTDSPCDDSHESISANVDLS 1807
             +    S+ P++E   +D ++ +H   P  A+                            
Sbjct: 849  IS----SARPISE---MDNIVSTHPHAPNSAD---------------------------- 873

Query: 1806 DITEDTPASTCANSDSHMDEQHDITAXXXXXXXXXXXSTRDMTEELNTNVHPMLTRSKTG 1627
              T  TPA   +N  +    QH +++            TR + ++ + N HPM+TR+K+G
Sbjct: 874  --TTLTPAQVVSNPVA-TPVQHVVSSIADASV------TRTIAKDAD-NTHPMITRAKSG 923

Query: 1626 IVKPNPKYALATYTIPQPPKTIRTALQHPGWFEAMTHEL*ALHSNNTWTLVPKPTNANVI 1447
            IVKP  K  +A     + P ++  ALQ   W +AM  E  AL  NNTW+LVP P     I
Sbjct: 924  IVKP--KIFIAAI---REPSSVSAALQQDEWKKAMVAEYDALQRNNTWSLVPLPAGRQAI 978

Query: 1446 GCKWVYKTKIKADGSLDRLKARLVAKGFSQEEGVDYFETFSPVVKSATVRVVLTLAVSCG 1267
            GCKWVYKTK   DG++ + KARLVAKGF Q+ G D+ ETFSPVVK +TVRVV T+A+S  
Sbjct: 979  GCKWVYKTKENPDGTVQKYKARLVAKGFHQQAGFDFTETFSPVVKPSTVRVVFTIALSRN 1038

Query: 1266 WKLIQVDICNAFLNGELNEPVYMEQPPGFISKSGPQLVCKLHKALYGLKQAPRAWFNRLS 1087
            W + Q+D+ NAFLNG+L E V+M+QP GFI +  P LVC+LHKALYGLKQAPRAWF +L 
Sbjct: 1039 WAIKQLDVNNAFLNGDLQEEVFMQQPQGFIDEQNPNLVCRLHKALYGLKQAPRAWFEKLH 1098

Query: 1086 EFLACSGFTCSTADSSLFVYQHNGXXXXXXXXXXXXXLTGNNAAIVVDFQEKLSAEFNLR 907
              L   GF  + +D SLF+                  + G++ A +     +L++EF+L+
Sbjct: 1099 RALLSFGFVSAKSDQSLFLRFTPNHITYVLVYVDDILVIGSDTAAITSLIAQLNSEFSLK 1158

Query: 906  ILGPLSYFLGIQVLPNQSGILLSQTRYITDILMEAGLQDCKPASTTIATNFNKMLSSTEY 727
             LG + YFLGIQV    +G+ LSQT+YI D+L +  +  CKPA T + T     +     
Sbjct: 1159 DLGEVHYFLGIQVSHTNNGLHLSQTKYIRDLLQKTKMVHCKPARTPLPTGLKLRVGD--- 1215

Query: 726  AEPLADSHIYRHIVGCLQYATITRPDISYAVNTACQFLASPTSAHYILVKHILRYLKGTL 547
             +P+ D H YR  VG LQY TITRP++S++VN  CQF+ +PT  H+ +VK ILRYL+GTL
Sbjct: 1216 GDPVEDLHGYRSTVGALQYVTITRPELSFSVNKVCQFMQNPTEEHWKVVKRILRYLQGTL 1275

Query: 546  HTGLHIRPSSSLAIRAFSDSDWAGCPSTRRSTTGYCIFLGDTCVSWSAKKQSTVARSSCE 367
              GLH++ SS+L +  F D+DWA     RRST+G+C+FLG   +SW +KKQ  V+RSS E
Sbjct: 1276 QHGLHLKKSSNLDLIGFCDADWASDLDDRRSTSGHCVFLGPNLISWQSKKQHIVSRSSIE 1335

Query: 366  AEYRALATTAAEMTWLAYLLRDVGVHLQQPAVLFCDNTSALHLTVNLVFHARTKHIEIDY 187
             EYR+LA   AE+TWL  LL ++ + L +P +++CDN S + L+ N V HARTKHIE+D 
Sbjct: 1336 IEYRSLAGLVAEITWLRSLLSELQLPLAKPPLVWCDNLSTVLLSANPVLHARTKHIELDL 1395

Query: 186  HFIREKVALGHIVTAHIASDQQPADILTKALPTPSHAALRTKLGL-TTPTPSLRGGI 19
            +F+REKV    +   H+ S  Q AD+LTK + +      R KL +    T SLRG +
Sbjct: 1396 YFVREKVIRKEVEVRHVPSADQLADVLTKTVSSTQFIEFRHKLRIENLSTLSLRGDV 1452


>GAU44375.1 hypothetical protein TSUD_243070 [Trifolium subterraneum]
          Length = 1244

 Score =  581 bits (1498), Expect = 0.0
 Identities = 337/797 (42%), Positives = 455/797 (57%), Gaps = 21/797 (2%)
 Frame = -2

Query: 2349 QSPLSQLYIKDPDYNSLRTFGCLCFPCMRDYNKHKLEPRSLPCVFLGYSDQYKGYRCLHV 2170
            ++P S+L+ K+PDY+ +R                             YS  +KGYRCL  
Sbjct: 487  ETPYSKLFGKNPDYSGIR-----------------------------YSPLHKGYRCLDP 517

Query: 2169 SSGRVYFSRHVIFNEDVFPYKDKVVSQVPTGDVVHFNEFLNPKLADDVSESCTTITSPTY 1990
             + RVY SRHV+FNE+ FPY  +  S            F  PK  +  SE    +T    
Sbjct: 518  HTHRVYISRHVVFNENHFPYSPQNNSMTTFSHDSSITTF--PKFDEWFSEKVKDVTIH-- 573

Query: 1989 VSTNSDFSSEPVTEHSFLDTLIVSHSDQPPPANSAHISETSTTDSPCD----DSHESISA 1822
               + D +     +  FL T        PPPA    +  +S T SP      D++ S   
Sbjct: 574  -DDHLDDTPPKYLDFDFLAT--------PPPA----LDPSSRTPSPIQNQILDNNSSPIQ 620

Query: 1821 NVDLSDITEDTPASTCANSDSHMDEQHDITAXXXXXXXXXXXSTRDMTEELNTNVHPMLT 1642
            N +L +++ D  +S   N +   D    I                D +  + T   P+  
Sbjct: 621  NQNLDNVS-DNDSSPIQNQNIDNDSSPPIETTNLPIHID-----NDFSPPIETTNLPIPP 674

Query: 1641 RSKTGIVKPNPKYALATYTIP----------------QPPKTIRTALQHPGWFEAMTHEL 1510
             +++   K  P Y +  Y  P                + PKT +TAL++  W  AM  E+
Sbjct: 675  PTRSSRDKRPPAYLVKDYHCPTITNISPPHNTLIVSIEEPKTYKTALKYSNWQAAMQDEI 734

Query: 1509 *ALHSNNTWTLVPKPTNANVIGCKWVYKTKIKADGSLDRLKARLVAKGFSQEEGVDYFET 1330
             ALHSNNTWTLV +P +ANVIG KWV++TK+  DGS+DR KARLVAKG++Q  G+D+ ET
Sbjct: 735  DALHSNNTWTLVQRPLDANVIGSKWVFRTKLNEDGSIDRFKARLVAKGYTQIPGLDFGET 794

Query: 1329 FSPVVKSATVRVVLTLAVSCGWKLIQVDICNAFLNGELNEPVYMEQPPGFISKSGPQLVC 1150
            FSPV+K+ T+R++L+LAV   W L Q+D+ NAFL+G LNE VYMEQPPGF     P  VC
Sbjct: 795  FSPVIKAPTIRIILSLAVHFKWPLKQLDVKNAFLHGTLNERVYMEQPPGFEHPHLPNHVC 854

Query: 1149 KLHKALYGLKQAPRAWFNRLSEFLACSGFTCSTADSSLFVYQHNGXXXXXXXXXXXXXLT 970
            +LHK+LYGLKQAPRAWF +LS  L   GF CS AD SLF+++++              LT
Sbjct: 855  QLHKSLYGLKQAPRAWFEKLSACLISLGFICSKADPSLFIHRYDTNFTLLLVYVDDIILT 914

Query: 969  GNNAAIVVDFQEKLSAEFNLRILGPLSYFLGIQVLPNQSGILLSQTRYITDILMEAGLQD 790
            GN  + +    ++L  +F L+ LG L YFLGI++     GI +SQT+Y  D+L  A +  
Sbjct: 915  GNAPSFISHLVKQLHEKFALKDLGQLHYFLGIEIKHFCGGITISQTKYAHDLLKRAHMLG 974

Query: 789  CKPASTTIATNFNKMLSSTEYAEPLADSHIYRHIVGCLQYATITRPDISYAVNTACQFLA 610
                +T IA+  N++           D+  YR + G LQY T TRPD+++AVN  CQ   
Sbjct: 975  ASKINTPIASKPNELPDDNNPV----DATEYRRLCGSLQYLTFTRPDLTHAVNLVCQHFQ 1030

Query: 609  SPTSAHYILVKHILRYLKGTLHTGLHIRPSSSLAIRAFSDSDWAGCPSTRRSTTGYCIFL 430
            +PT      VK ILRY+KGTL  GL     SSL + AF D+DWAGCP+TRRSTTG+CI+L
Sbjct: 1031 NPTQKDLQAVKRILRYIKGTLTHGLRYLNQSSLNLTAFCDADWAGCPTTRRSTTGFCIYL 1090

Query: 429  GDTCVSWSAKKQSTVARSSCEAEYRALATTAAEMTWLAYLLRDVGVHLQQPAVLFCDNTS 250
            G  C+SW++KKQ TV+RSS EAEY+ALATTAAE+TWL YLL D+G+ L++  ++FCDN S
Sbjct: 1091 GSHCISWASKKQPTVSRSSAEAEYKALATTAAELTWLQYLLHDLGISLERRPLIFCDNQS 1150

Query: 249  ALHLTVNLVFHARTKHIEIDYHFIREKVALGHIVTAHIASDQQPADILTKALPTPSHAAL 70
            A+H++ N VFHARTKHI IDYHFIREKV  G +   ++ + QQ AD+ TK+LP  S +  
Sbjct: 1151 AIHMSHNPVFHARTKHIAIDYHFIREKVTAGDLRLRYLLTTQQIADVFTKSLPKDSFSTF 1210

Query: 69   RTKLGL-TTPTPSLRGG 22
            R KLG+     PSL+GG
Sbjct: 1211 RRKLGVHCLSLPSLKGG 1227


>JAU83197.1 Copia protein, partial [Noccaea caerulescens]
          Length = 1080

 Score =  576 bits (1485), Expect = 0.0
 Identities = 329/806 (40%), Positives = 451/806 (55%), Gaps = 33/806 (4%)
 Frame = -2

Query: 2352 LQSPLSQLYIKDPDYNSLRTFGCLCFPCMRDYNKHKLEPRSLPCVFLGYSDQYKGYRCLH 2173
            L SP   L+  DP+Y+ L+ FGCLCFP +R Y K+KLE RS PCVFLGYS     Y CL 
Sbjct: 305  LNSPFKTLFQSDPNYSKLKIFGCLCFPWLRPYTKNKLESRSAPCVFLGYSLTQSAYLCLE 364

Query: 2172 VSSGRVYFSRHVIFNEDVFPYKDKVVSQVPTGDVVHFNEFLNPKLADDVSESCTTITSPT 1993
              S R+Y SRHV F+E +FP++  + +   T +    + F++P     +S +      P 
Sbjct: 365  PKSSRLYISRHVRFDESIFPFQSILSTTPATSNTKAPSSFVSPI---PLSHTPLITAPPA 421

Query: 1992 YVSTNSDFSSEPVTEHSFLDTLIVSHSDQPP----PANSAHISE---TSTTDSPCDDSHE 1834
             + + S F+ EP  EH      + S S + P    PA+S  +++   TS++DS    S  
Sbjct: 422  PLESPSPFT-EPSPEH------LTSTSTETPTARLPADSPSLADRRSTSSSDS-MPPSTP 473

Query: 1833 SISAN-----VDLSDITEDTPASTCA-------NSDSHMDEQHDITAXXXXXXXXXXXST 1690
            SIS N      DL+ I    P  T +       NS S   +Q                 T
Sbjct: 474  SISVNSGDSAADLNPIPSPDPGPTSSTNLSPPPNSTSQAQQQSP---------------T 518

Query: 1689 RDMTEELNTNVHP-------------MLTRSKTGIVKPNPKYAL-ATYTIPQPPKTIRTA 1552
               TE  N N  P             M TRSK  I KPN KY L AT  I   P+ +  A
Sbjct: 519  HSQTENQNLNAQPENQNPPPPENRHQMQTRSKNNISKPNTKYGLTATTAIESEPQNLTQA 578

Query: 1551 LQHPGWFEAMTHEL*ALHSNNTWTLVPKPTNANVIGCKWVYKTKIKADGSLDRLKARLVA 1372
            L+   W  AM+ E      + TWTLVP   + +++GC+WV++ K   +G+LD+ KAR VA
Sbjct: 579  LKSKYWRAAMSTEFNDQLRHGTWTLVPPEPHQHIVGCRWVFRLKQLPNGALDKYKARFVA 638

Query: 1371 KGFSQEEGVDYFETFSPVVKSATVRVVLTLAVSCGWKLIQVDICNAFLNGELNEPVYMEQ 1192
            KG+SQ+ G+D+ ETFSPV+KS T+R +L +A    W L Q+D+ NAFL G L+E VY++Q
Sbjct: 639  KGYSQQPGIDFAETFSPVIKSTTIRTILKVAACRDWCLRQIDVNNAFLQGTLDEEVYVQQ 698

Query: 1191 PPGFISKSGPQLVCKLHKALYGLKQAPRAWFNRLSEFLACSGFTCSTADSSLFVYQHNGX 1012
            PPGF+    P  VCKL KALYGLKQAPRAW+  L  FL   GF  S  D+SLFV      
Sbjct: 699  PPGFVDPDRPDYVCKLQKALYGLKQAPRAWYMELKHFLLSLGFKNSATDTSLFVLHRGTT 758

Query: 1011 XXXXXXXXXXXXLTGNNAAIVVDFQEKLSAEFNLRILGPLSYFLGIQVLPNQSGILLSQT 832
                         TGN++  V D    LSA F+++ +G LSYFLGI+V+ +  G+ L+Q 
Sbjct: 759  LVYLLVYVDDIVATGNDSGAVEDILATLSARFSVKDMGALSYFLGIEVIRSTKGLHLNQR 818

Query: 831  RYITDILMEAGLQDCKPASTTIATNFNKMLSSTEYAEPLADSHIYRHIVGCLQYATITRP 652
            +YI D+L    + D KP S+ +AT+    LS   ++ P      YR +VG LQY   TRP
Sbjct: 819  KYIHDLLKRMHMMDAKPVSSPMATSPKLTLSGETHSNPTE----YRTLVGSLQYLAFTRP 874

Query: 651  DISYAVNTACQFLASPTSAHYILVKHILRYLKGTLHTGLHIRPSSSLAIRAFSDSDWAGC 472
            DI++ VN   QF+  PT  H+   K +LRYL GT   GL I   + L + AFSD+DW   
Sbjct: 875  DIAFVVNRLSQFMHKPTVDHWQAAKRVLRYLAGTSTHGLFISKHTDLTLHAFSDADWGTN 934

Query: 471  PSTRRSTTGYCIFLGDTCVSWSAKKQSTVARSSCEAEYRALATTAAEMTWLAYLLRDVGV 292
                 ST  Y ++LGD  +SWS+KKQ +VARSS EAEYR++A TA+E+ W+  LL ++G+
Sbjct: 935  TDDYISTNAYIVYLGDQAISWSSKKQKSVARSSTEAEYRSVANTASEINWVRNLLSEIGI 994

Query: 291  HLQQPAVLFCDNTSALHLTVNLVFHARTKHIEIDYHFIREKVALGHIVTAHIASDQQPAD 112
             L +P V++CDN  A +L+ N VFH+R KH+ +DYHFIRE+V    +  +H+++  Q AD
Sbjct: 995  PLSKPPVIYCDNVGATYLSANPVFHSRMKHVALDYHFIREQVQSNQLRVSHVSTHDQLAD 1054

Query: 111  ILTKALPTPSHAALRTKLGLTTPTPS 34
             LTK L       LR K+G++   PS
Sbjct: 1055 ALTKPLSRARFQLLRDKIGVSQAPPS 1080


>AAR88589.1 putative copia-like retrotransposon protein [Oryza sativa Japonica
            Group]
          Length = 1399

 Score =  578 bits (1490), Expect = 0.0
 Identities = 325/782 (41%), Positives = 454/782 (58%), Gaps = 18/782 (2%)
 Frame = -2

Query: 2343 PLSQLYIKDPDYNSLRTFGCLCFPCMRDYNKHKLEPRSLPCVFLGYSDQYKGYRCLHVSS 2164
            PL QL+ + P+Y +LR FGC  +P +R YNKHKL  RS  CVFLGYS+ +KG++CL +++
Sbjct: 627  PLEQLFKEKPNYTALRIFGCAVWPNLRPYNKHKLAFRSKRCVFLGYSNLHKGFKCLEIAT 686

Query: 2163 GRVYFSRHVIFNEDVFPYKD----------KVVSQVPTGDVVHFNEFLNPKLADDVSESC 2014
            GRVY SR V F+E +FP+ +            +S +P   V H +  L  +  + V    
Sbjct: 687  GRVYVSRDVTFDESIFPFSELHSNAGARLRAEISLLPPSLVPHLSS-LGGEQNNHVLNYP 745

Query: 2013 TTITSPTYVSTNSDFSSEPVTEHSFLDTLIVSHSDQPPPANSAHISETSTTDSPCDDSHE 1834
              +T   +   N++   E           IV++ ++   A +   +  +      DD H 
Sbjct: 746  PNVTDQ-FGEENAEIGEE-----------IVANGEENAAAAADENAAAAANGGAQDDVHG 793

Query: 1833 SI--SANVDLSDITEDTPASTCANSDSHMDEQHDITAXXXXXXXXXXXSTR------DMT 1678
                ++    S +T+D  AS      + + E+H + A                    D+T
Sbjct: 794  VAYDASPEHSSPVTDDAMASAAEQHGNPIQEEHLVQASPQTASSTSPSVASSAGVHDDVT 853

Query: 1677 EELNTNVHPMLTRSKTGIVKPNPKYALATYTIPQPPKTIRTALQHPGWFEAMTHEL*ALH 1498
             + +      +  +    ++P  +         +  K++  A+ +  W EAM  E  AL 
Sbjct: 854  TDQSDQTDQAMPEAAVAPIRPKTRLQSGI----RKEKSLEEAVNNKHWKEAMDAEYMALI 909

Query: 1497 SNNTWTLVPKPTNANVIGCKWVYKTKIKADGSLDRLKARLVAKGFSQEEGVDYFETFSPV 1318
             N TW LVP     NVI CKWVYK K KADGSLDR KARLVAKGF Q  G+DY +TFSPV
Sbjct: 910  ENKTWHLVPPQKGRNVIDCKWVYKVKRKADGSLDRYKARLVAKGFKQRYGIDYEDTFSPV 969

Query: 1317 VKSATVRVVLTLAVSCGWKLIQVDICNAFLNGELNEPVYMEQPPGFISKSGPQLVCKLHK 1138
            VK+AT+R+VL+LAVS GW L Q+D+ NAFL+G L E VYM+QPPG+  KS P  VCKL K
Sbjct: 970  VKAATIRIVLSLAVSRGWSLRQLDVKNAFLHGVLEEEVYMKQPPGYEKKSMPNYVCKLDK 1029

Query: 1137 ALYGLKQAPRAWFNRLSEFLACSGFTCSTADSSLFVYQHNGXXXXXXXXXXXXXLTGNNA 958
            ALYGLKQAPRAW++RLS  L+  GF  S AD+SLF Y+                +  +  
Sbjct: 1030 ALYGLKQAPRAWYSRLSTKLSELGFVPSKADTSLFFYKKGQVSIFLLIYVDDIIVASSVP 1089

Query: 957  AIVVDFQEKLSAEFNLRILGPLSYFLGIQVLPNQSGILLSQTRYITDILMEAGLQDCKPA 778
                   ++LS +F L+ LG L YFLGI+V   + G++LSQ +Y +D+L   G+ +CKP 
Sbjct: 1090 DATSTLLQELSKDFALKDLGDLHYFLGIEVHKVKDGLMLSQEKYASDLLRRVGMYECKPV 1149

Query: 777  STTIATNFNKMLSSTEYAEPLADSHIYRHIVGCLQYATITRPDISYAVNTACQFLASPTS 598
            ST ++T+    ++      P  DS  YR +VG LQY T+TRPDIS+++N  CQFL +PT+
Sbjct: 1150 STPLSTSEKLSVNEGTLLGP-QDSTQYRSVVGALQYLTLTRPDISFSINKVCQFLHAPTT 1208

Query: 597  AHYILVKHILRYLKGTLHTGLHIRPSSSLAIRAFSDSDWAGCPSTRRSTTGYCIFLGDTC 418
             H+  VK ILRY+K T+ TGL    + SL +  FSD+DWAG P  RRST G+ +FLG   
Sbjct: 1209 THWAAVKRILRYVKYTVDTGLKFCRNPSLLVSGFSDADWAGSPDDRRSTGGFAVFLGPNL 1268

Query: 417  VSWSAKKQSTVARSSCEAEYRALATTAAEMTWLAYLLRDVGVHLQQPAVLFCDNTSALHL 238
            VSWSA+KQ+TV+RSS EAEY+ALA   AE+ W+  LL+++GV   + A L+CDN  A +L
Sbjct: 1269 VSWSARKQATVSRSSIEAEYKALANATAEIMWVQTLLQELGVESPRAAKLWCDNLGAKYL 1328

Query: 237  TVNLVFHARTKHIEIDYHFIREKVALGHIVTAHIASDQQPADILTKALPTPSHAALRTKL 58
            + N +FHARTKHIE+D+HF+RE+VA   +  A+I++  Q AD  TKA+P       +  L
Sbjct: 1329 SANPIFHARTKHIEVDFHFVRERVARKLLEIAYISTKDQVADGFTKAIPVRQMEMFKNNL 1388

Query: 57   GL 52
             L
Sbjct: 1389 NL 1390


>AAC67200.1 putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1402

 Score =  577 bits (1486), Expect = 0.0
 Identities = 313/732 (42%), Positives = 434/732 (59%), Gaps = 18/732 (2%)
 Frame = -2

Query: 2355 DLQSPLSQLYIKDPDYNSLRTFGCLCFPCMRDYNKHKLEPRSLPCVFLGYSDQYKGYRCL 2176
            D  SP  +L+   PDY +LR+FGC CFP MRDY  +K +PRSL CVFLGY+D+YKGYRCL
Sbjct: 672  DAISPYEKLHQTTPDYTALRSFGCACFPTMRDYAMNKFDPRSLKCVFLGYNDKYKGYRCL 731

Query: 2175 HVSSGRVYFSRHVIFNEDVFPYKD--KVVSQVPTGDVVHFNEFLNPKLADDVSESCTTIT 2002
            +  +GRVY SRHVIF+E  +P+    K +   PT           P LA       ++++
Sbjct: 732  YPPTGRVYISRHVIFDETAYPFSHHYKHLHSQPT----------TPLLAAWFKGFESSVS 781

Query: 2001 -SPTYVSTNSDFSSEPVTEHSFLDTL-IVSHSDQPP-PANSAHISETSTTDSPCDDSHES 1831
             +P  VS      ++P    + L T  + + +D PP P  S  +S+ S        S  +
Sbjct: 782  QAPPKVSP-----AQPPQRKATLPTPPLFTAADFPPLPRRSPQLSQNSAAALVSQPSTTT 836

Query: 1830 ISANVDLSDITEDTPASTCANSDSHMDEQHDITAXXXXXXXXXXXSTRDMTEELN----- 1666
            I++    + + E +  +   +S S  D  H                  D  E+L      
Sbjct: 837  INSTHPPAVVNESSERTINFDSASIGDSSHS-----------SQLLVDDTVEDLMAAPVP 885

Query: 1665 -------TNVHPMLTRSKTGIVKPNPKYALATYTIPQP-PKTIRTALQHPGWFEAMTHEL 1510
                   TN HPM+TR+K GI KPNP+Y   ++ +  P PKT+  AL+HPGW  AMT E+
Sbjct: 886  TQQAPPPTNTHPMITRAKVGITKPNPRYVFLSHKVTYPEPKTVTAALKHPGWTGAMTEEM 945

Query: 1509 *ALHSNNTWTLVPKPTNANVIGCKWVYKTKIKADGSLDRLKARLVAKGFSQEEGVDYFET 1330
                  NTW+LVP   N +V+G KWV++TK+ ADG+L++LKAR+VAK F QEEG+ Y ET
Sbjct: 946  GNCSETNTWSLVPYTPNMHVLGSKWVFRTKLHADGTLNKLKARIVAKCFLQEEGIGYLET 1005

Query: 1329 FSPVVKSATVRVVLTLAVSCGWKLIQVDICNAFLNGELNEPVYMEQPPGFISKSGPQLVC 1150
            +SPVV++ TV++VL LA +  W+L Q+D+ NAFL+G+LNE VYM QP GF+ KS P  VC
Sbjct: 1006 YSPVVRTPTVQLVLHLATALNWELKQMDVKNAFLHGDLNETVYMTQPAGFVDKSKPTHVC 1065

Query: 1149 KLHKALYGLKQAPRAWFNRLSEFLACSGFTCSTADSSLFVYQHNGXXXXXXXXXXXXXLT 970
             LHK++YGLKQ+PRAWF++ S FL   GF CS +D SLF+Y HN              +T
Sbjct: 1066 LLHKSIYGLKQSPRAWFDKFSTFLLEFGFFCSKSDPSLFIYAHNNNLILLLLYVDDMVIT 1125

Query: 969  GNNAAIVVDFQEKLSAEFNLRILGPLSYFLGIQVLPNQSGILLSQTRYITDILMEAGLQD 790
            GN++  +      L+ EF +  +G L YFLGIQV  NQ G+ +SQ +Y  D+L+ + +++
Sbjct: 1126 GNSSQTLSSLLAALNKEFRMTDMGQLHYFLGIQVQRNQHGLFMSQQKYAEDLLVASAMEN 1185

Query: 789  CKPASTTIATNFNKMLSSTEYAEPLADSHIYRHIVGCLQYATITRPDISYAVNTACQFLA 610
            C P  T +    +++       EP  D   +R I G LQY T+TRPDI +AVN  CQ + 
Sbjct: 1186 CTPLPTPLPVQLDRV---PHQEEPFTDPTYFRSIAGKLQYLTLTRPDIHFAVNFVCQKMH 1242

Query: 609  SPTSAHYILVKHILRYLKGTLHTGLHIRPSSSLAIRAFSDSDWAGCPSTRRSTTGYCIFL 430
             PT + + L+K ILRY+KGT+  G+    +S   ++A+SDSDW  C  TRRS  G C F+
Sbjct: 1243 QPTMSDFHLLKRILRYIKGTITMGISYNQNSPTLLQAYSDSDWGNCKLTRRSVGGLCTFM 1302

Query: 429  GDTCVSWSAKKQSTVARSSCEAEYRALATTAAEMTWLAYLLRDVGVHLQQPAVLFCDNTS 250
                VSWS+KK  TV+RSS EAEYR L+  A+E+ WL+ LLR++G+ L     LFCDN S
Sbjct: 1303 ATNLVSWSSKKHPTVSRSSTEAEYRTLSDAASEILWLSTLLRELGIPLPDTPELFCDNLS 1362

Query: 249  ALHLTVNLVFHA 214
            A++ T N  FHA
Sbjct: 1363 AVYHTANPAFHA 1374


>JAU04955.1 Copia protein, partial [Noccaea caerulescens]
          Length = 817

 Score =  558 bits (1438), Expect = 0.0
 Identities = 305/765 (39%), Positives = 431/765 (56%)
 Frame = -2

Query: 2343 PLSQLYIKDPDYNSLRTFGCLCFPCMRDYNKHKLEPRSLPCVFLGYSDQYKGYRCLHVSS 2164
            P ++L+ K   Y  LR FGCLC+P +     +KL PRS  C+FLGY   +KGYRCL +S+
Sbjct: 74   PFTRLFNKPVSYEHLRVFGCLCYPNLLPTAPNKLSPRSARCIFLGYPTNHKGYRCLDLST 133

Query: 2163 GRVYFSRHVIFNEDVFPYKDKVVSQVPTGDVVHFNEFLNPKLADDVSESCTTITSPTYVS 1984
             R+  SRHV+F+E+ FP+   +          H   +  P L           T+P    
Sbjct: 134  RRIIISRHVVFDENSFPFTSTLSPSSSPPAAPH--SYPQPLLITTSPPPLPVTTTPPASP 191

Query: 1983 TNSDFSSEPVTEHSFLDTLIVSHSDQPPPANSAHISETSTTDSPCDDSHESISANVDLSD 1804
             N   S+ P    S   +  +  +  P  + SA    +    SP     + +S++ D+ D
Sbjct: 192  AN---SASPQHSSSLNGSTALPSTISPVSSPSASHQPSPILSSPAQS--QPVSSSHDIPD 246

Query: 1803 ITEDTPASTCANSDSHMDEQHDITAXXXXXXXXXXXSTRDMTEELNTNVHPMLTRSKTGI 1624
            +T  + + + ++S S        T+               +TE+       M TRS++GI
Sbjct: 247  VTSSSTSISHSSSSSPTPPPSPPTSTA-------------VTEQAPPPPTRMTTRSQSGI 293

Query: 1623 VKPNPKYALATYTIPQPPKTIRTALQHPGWFEAMTHEL*ALHSNNTWTLVPKPTNANVIG 1444
            +K     +L T  +   P++   A + P W  AM  E   L   +TWTLVP+P N N+I 
Sbjct: 294  IKAKKIISLHTALVSPLPRSHIDAARDPNWNPAMNDEYDTLKMRDTWTLVPRPPNTNIIR 353

Query: 1443 CKWVYKTKIKADGSLDRLKARLVAKGFSQEEGVDYFETFSPVVKSATVRVVLTLAVSCGW 1264
              W++  K KADG+L R KARLVA G SQE GVD  ETFSPVVK  T+R VL LA+S  W
Sbjct: 354  SMWLFTHKFKADGTLSRYKARLVANGKSQEVGVDCDETFSPVVKPTTIRTVLHLALSRDW 413

Query: 1263 KLIQVDICNAFLNGELNEPVYMEQPPGFISKSGPQLVCKLHKALYGLKQAPRAWFNRLSE 1084
             + Q+D+ NAFL G L E VYM QPPGF   + P  VC L K+LYGLKQAPRAW+ R S+
Sbjct: 414  PIHQLDVKNAFLYGNLEETVYMHQPPGFTDPTKPDHVCLLKKSLYGLKQAPRAWYQRFSQ 473

Query: 1083 FLACSGFTCSTADSSLFVYQHNGXXXXXXXXXXXXXLTGNNAAIVVDFQEKLSAEFNLRI 904
              +  GF  S +D+SLF+ +                LT ++  ++      L  EF +  
Sbjct: 474  AASKIGFKNSKSDASLFILRQGSDIAYLLLYVDDIVLTSSSPTLLRSILTFLKTEFQMTD 533

Query: 903  LGPLSYFLGIQVLPNQSGILLSQTRYITDILMEAGLQDCKPASTTIATNFNKMLSSTEYA 724
            LG L +FLGI V  N++G+ LSQ  Y  DIL  A + +CKP ST + T+        +  
Sbjct: 534  LGSLHFFLGISVSRNKNGMTLSQHNYAADILHRANMSNCKPCSTPVDTSAK---LHADAG 590

Query: 723  EPLADSHIYRHIVGCLQYATITRPDISYAVNTACQFLASPTSAHYILVKHILRYLKGTLH 544
            +P +D  +YR + G LQY T TRPDI+YAV   C ++  P   H+  +K ILRY+KGT+ 
Sbjct: 591  QPFSDPTMYRRLAGALQYLTFTRPDIAYAVQQICLYMHDPREPHFNALKRILRYVKGTIT 650

Query: 543  TGLHIRPSSSLAIRAFSDSDWAGCPSTRRSTTGYCIFLGDTCVSWSAKKQSTVARSSCEA 364
             GLH+  S+S  + A++D+DWAGCP+TRRST+G+C++LGD  +SWS+K+Q TV+RSS EA
Sbjct: 651  HGLHLHRSTSTTLTAYTDADWAGCPNTRRSTSGFCVYLGDNLISWSSKRQPTVSRSSAEA 710

Query: 363  EYRALATTAAEMTWLAYLLRDVGVHLQQPAVLFCDNTSALHLTVNLVFHARTKHIEIDYH 184
            EYR +A   AE TW+  LL ++   ++   +++CDN SA++L+ N + H RTKHIE+D  
Sbjct: 711  EYRGVANAVAETTWIRNLLLELQCPIKTATLVYCDNVSAVYLSTNPIQHQRTKHIELDIL 770

Query: 183  FIREKVALGHIVTAHIASDQQPADILTKALPTPSHAALRTKLGLT 49
            F+RE+VALG +   H+ S  Q ADI TK LPT      R+ L +T
Sbjct: 771  FVRERVALGQVRVLHVPSSHQYADIFTKGLPTTLFNDFRSSLHVT 815


>AAT85031.1 putative polyprotein [Oryza sativa Japonica Group] ABF96679.1
            retrotransposon protein, putative, Ty1-copia subclass
            [Oryza sativa Japonica Group]
          Length = 1437

 Score =  575 bits (1483), Expect = 0.0
 Identities = 334/777 (42%), Positives = 447/777 (57%), Gaps = 12/777 (1%)
 Frame = -2

Query: 2346 SPLSQLYIKDPDYNSLRTFGCLCFPCMRDYNKHKLEPRSLPCVFLGYSDQYKGYRCLHVS 2167
            SPL +L    PDYN+LR FGC C+P +R YNKHKL+ RS  C FLGYS  +KG++CL  S
Sbjct: 662  SPLERLLGHKPDYNALRVFGCACWPNLRPYNKHKLQFRSTTCTFLGYSTLHKGFKCLDPS 721

Query: 2166 SGRVYFSRHVIFNEDVFPYKDKVVSQVPTGDVVHFNEFLNPKLADDVSESCTTITSPTYV 1987
            +GRVY SR V+F+E  FP+  K+   V  G  +     L P+LA  +      I+S    
Sbjct: 722  TGRVYISRDVVFDETQFPFT-KLHPNV--GAKLRAEIALVPELAASLPRGLQQISSVINT 778

Query: 1986 STNSDFSSEPVTEHSFLDTLIVSHSDQPPPANSAHISETSTTDSPCDDSHESISANVDLS 1807
              N++ S+E + + S  D    + +D  P   SA+    S+   P ++           S
Sbjct: 779  PENANVSNENMQQDSTYDNEPETETDGAPDTVSANAPAESSGSPPINEPASPFGE----S 834

Query: 1806 DITEDTPASTCANSDSHMDEQHDITAXXXXXXXXXXXST------RDMTEELNTNVHPML 1645
            D    +PAS   NS  H D     ++            +         T           
Sbjct: 835  DSATASPASAPVNSAPHPDAAASGSSAPRGSTSQGGTPSVAIDDPHPATTVTGQEAQRPR 894

Query: 1644 TRSKTGIVKPNP------KYALATYTIPQPPKTIRTALQHPGWFEAMTHEL*ALHSNNTW 1483
            TR ++GI K         K+ + T T    P+ ++ ALQ+  W  AM  E  AL  NNTW
Sbjct: 895  TRLQSGIRKEKVYTDGTVKWGMLTST--GEPENLQDALQNNNWKCAMDAEYMALIKNNTW 952

Query: 1482 TLVPKPTNANVIGCKWVYKTKIKADGSLDRLKARLVAKGFSQEEGVDYFETFSPVVKSAT 1303
             LVP     NVI CKWVYK K K DGSLDR KARLVAKGF Q  G+DY +TFSPVVK+AT
Sbjct: 953  HLVPPQQGRNVIDCKWVYKIKRKQDGSLDRYKARLVAKGFKQRYGIDYEDTFSPVVKAAT 1012

Query: 1302 VRVVLTLAVSCGWKLIQVDICNAFLNGELNEPVYMEQPPGFISKSGPQLVCKLHKALYGL 1123
            +R++L++AVS GW L Q+D+ NAFL+G L E VYM+QPPG+ + S P  VCKL KALYGL
Sbjct: 1013 IRIILSIAVSRGWCLRQLDVQNAFLHGVLEEEVYMKQPPGYENPSTPDYVCKLDKALYGL 1072

Query: 1122 KQAPRAWFNRLSEFLACSGFTCSTADSSLFVYQHNGXXXXXXXXXXXXXLTGNNAAIVVD 943
            KQAPRAW++RLS  L   GF  S AD+SLF Y                 +  +    V  
Sbjct: 1073 KQAPRAWYSRLSGKLHDLGFKGSKADTSLFFYNKGSLTIFLLIYVDDIIVVSSRKEAVSA 1132

Query: 942  FQEKLSAEFNLRILGPLSYFLGIQVLPNQSGILLSQTRYITDILMEAGLQDCKPASTTIA 763
              + L  EF L+ LG L YFLGI+V     GIL+SQ +Y +D+L    + DCK  +T ++
Sbjct: 1133 LLQDLQKEFALKDLGDLHYFLGIEVTKIPGGILMSQEKYASDLLKRVNMSDCKSVATPLS 1192

Query: 762  TNFNKMLSSTEYAEPLADSHIYRHIVGCLQYATITRPDISYAVNTACQFLASPTSAHYIL 583
             +  K+++         D+  YR IVG LQY T+TR DI+++VN  CQFL +PT+ H+  
Sbjct: 1193 AS-EKLIAGKGTILGPNDATQYRSIVGALQYLTLTRLDIAFSVNKVCQFLHNPTTEHWAA 1251

Query: 582  VKHILRYLKGTLHTGLHIRPSSSLAIRAFSDSDWAGCPSTRRSTTGYCIFLGDTCVSWSA 403
            VK ILRY+K     GL I  SSS+ +  +SD+DWAGC   RRST G+ ++LGD  VSW+A
Sbjct: 1252 VKRILRYIKQCTGLGLRICKSSSMIVSGYSDADWAGCLDDRRSTGGFAVYLGDNLVSWNA 1311

Query: 402  KKQSTVARSSCEAEYRALATTAAEMTWLAYLLRDVGVHLQQPAVLFCDNTSALHLTVNLV 223
            KKQ+TV+RSS EAEY+ALA   AE+ W+  LL+++ +     A L+CDN  A +L+ N V
Sbjct: 1312 KKQATVSRSSTEAEYKALANATAEIMWVQTLLQELNIVSPAMAQLWCDNMGAKYLSFNPV 1371

Query: 222  FHARTKHIEIDYHFIREKVALGHIVTAHIASDQQPADILTKALPTPSHAALRTKLGL 52
            FHARTKHIE+DYHF+RE+VA   +   +++++ Q AD  TKALP       +  L L
Sbjct: 1372 FHARTKHIEVDYHFVRERVARKLLQVDYVSTNDQVADGFTKALPVKQLENFKYNLNL 1428

