BLASTX nr result

ID: Cocculus22_contig00019876 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus22_contig00019876
         (2150 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAC63678.1| putative non-LTR retroelement reverse transcripta...   183   7e-50
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   182   9e-50
gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thal...   178   5e-49
gb|AAC95175.1| putative non-LTR retroelement reverse transcripta...   184   6e-47
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...   162   1e-44
gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]               176   1e-44
gb|AAC33226.1| putative non-LTR retroelement reverse transcripta...   170   2e-43
emb|CAB39942.1| putative protein [Arabidopsis thaliana] gi|72678...   182   4e-43
emb|CAB72467.1| putative protein [Arabidopsis thaliana]               169   5e-43
gb|ABW81051.1| tn7 reverse transcriptase [Arabidopsis lyrata sub...   175   7e-41
dbj|BAD95408.1| hypothetical protein [Arabidopsis thaliana]           171   1e-40
gb|AAG51098.1|AC025295_6 hypothetical protein [Arabidopsis thali...   160   9e-40
gb|AAC19278.1| T14P8.10 [Arabidopsis thaliana] gi|7269009|emb|CA...   145   1e-39
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       145   1e-37
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...   138   5e-36
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]           138   5e-36
gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm,...   131   7e-36
gb|AAF99785.1|AC012463_2 T2E6.4 [Arabidopsis thaliana]                137   6e-35
ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobrom...   124   2e-30
dbj|BAB08692.1| non-LTR retroelement reverse transcriptase-like ...   139   5e-30

>gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1216

 Score =  183 bits (465), Expect(2) = 7e-50
 Identities = 110/423 (26%), Positives = 194/423 (45%), Gaps = 14/423 (3%)
 Frame = +2

Query: 374  SYILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKF 553
            S ++D++   +    +   S A R+ L+  V+ ++  +W   F +P   + +   + +  
Sbjct: 508  SPLIDQIRRRIGMWTSRYLSFAGRLSLINSVLWSITNFWMNAFRLPRECINEINRISSAL 567

Query: 554  L----NFNSRIISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWI 721
            L      N +   +SW+ +CKP++EGGLGL+ L+E +  + LKL W + S +D LW+KW 
Sbjct: 568  LWSGPELNPKKAKVSWDEICKPKKEGGLGLQSLREANKVSSLKLIWRLLSCQDSLWVKWT 627

Query: 722  HSQYIRSDTYWVSQAHMS-SSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGV 898
                ++ +++W    H +  SWI +RL+K R   ++     +   + T  W D WS  G 
Sbjct: 628  RMNLLKKESFWSIGTHSTLGSWIWRRLLKHREVAKSFCKIEVNNGVNTSFWFDNWSEKGP 687

Query: 899  IARDCGKNARRGSDLRRDATIDDLSKCSSLSPIVVELKDKLNEV-----QRISGDHADRL 1063
            +    G        + R  T+ +           VE+ ++  E+     Q  + +  D +
Sbjct: 688  LINLTGARGAIDMGISRHMTLAEAWSRRRRKRHRVEILNEFEEILLQKYQHRNIELEDAI 747

Query: 1064 IWRLEP---SGEFSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKIAD 1234
            +WR +       FS K TW  I+   +   W   +W+ H  P+ S   W     RL   D
Sbjct: 748  LWRGKEDVFKARFSTKDTWNHIRTSSNQRAWHKGVWFAHATPKFSFCAWLAIRNRLSTGD 807

Query: 1235 RLPKLNIGVDANCCLCWNALETNNHLFFLMHFCSDYMKPIASMLFVDGRRHLFWETLGSK 1414
            R+   N G    C  C + +ET +HLFF   + S+    IA  ++ D R    W  + + 
Sbjct: 808  RMMTWNNGTPTTCVFCSSPMETRDHLFFQCCYSSEIWTSIAKNVYKD-RFSTKWSAVVNY 866

Query: 1415 MSSYDFAGTKLNTSV-KLSFAATIHQIWWERNCRRIQNKLRPVEQIVEAIIFYVRNIISN 1591
            +S  D    ++ + + + +F  +IH IW ERN RR   K R    ++  I   +RN +S 
Sbjct: 867  IS--DSQPDRIQSFLSRYTFQVSIHSIWRERNSRRHGEKSRSASNLIRQIDKTIRNQLST 924

Query: 1592 VSQ 1600
            + +
Sbjct: 925  IKK 927



 Score = 43.5 bits (101), Expect(2) = 7e-50
 Identities = 31/108 (28%), Positives = 58/108 (53%), Gaps = 3/108 (2%)
 Frame = +3

Query: 66  FGHNERTCLKSAGILTYIIFADDVLVFIYLRVSYLEEFNKLLRDIERKTGLAMNPEKSRI 245
           FG++ R   K+ G LT++ FADD+++    ++  ++   K+L     K GL +  EK+ +
Sbjct: 408 FGYHPRC--KTLG-LTHLCFADDLMILTDGKIRSVDGIVKVLNQFAAKLGLKICMEKTTL 464

Query: 246 VASWVRGH---LFW**VPLGIQKPSFPLKYLGLPIVSRKRFVNECLPI 380
             + V  H   L       G+ K   P++YLGLP+V+++   ++  P+
Sbjct: 465 YLAGVSDHSRQLMSSRYSFGVGK--LPVRYLGLPLVTKRLTTSDYSPL 510


>dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 1223

 Score =  182 bits (463), Expect(2) = 9e-50
 Identities = 119/423 (28%), Positives = 191/423 (45%), Gaps = 18/423 (4%)
 Frame = +2

Query: 380  ILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKFL- 556
            +L++V   +    +   S A R+ L+  V+ ++  +W   F +P   +++ E + + FL 
Sbjct: 789  LLEQVRKRIGSWTSRFLSYAGRLNLISSVLWSICNFWLAAFRLPRKCIRELEKMCSAFLW 848

Query: 557  ---NFNSRIISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWIHS 727
                 NS    +SW  VCKP++EGGLGLR LKE +    LKL W + S  + LW+KW+  
Sbjct: 849  SGTEMNSNKAKISWHMVCKPKDEGGLGLRSLKEANDVCCLKLVWKIVSHSNSLWVKWVDQ 908

Query: 728  QYIRSDTYW-VSQAHMSSSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGVIA 904
              +R+ ++W V Q     SWI K+L+K R   +      +G    T  W D WS LG + 
Sbjct: 909  HLLRNASFWEVKQTVSQGSWIWKKLLKYREVAKTLSKVEVGNGKQTSFWYDNWSDLGQLL 968

Query: 905  RDCGKNARRGSDLRRDATIDDL----SKCSSLSPIVVELKDKLNEVQRISGDHADRLIWR 1072
               G        + R  T+++      +    + +   ++D L +      +  D+++WR
Sbjct: 969  ERTGDRGLIDLGISRRMTVEEAWTNRRQRRHRNDVYNVIEDALKKSWDTRTETEDKVLWR 1028

Query: 1073 LEPS---GEFSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKIADRLP 1243
             +       FS + TW   +       W  +IW+ H  P++S   W  A+ RL   DR+ 
Sbjct: 1029 GKSDVFRTTFSTRDTWHHTRSTSARVPWHKVIWFSHATPKYSFCSWLAAHGRLPTGDRMI 1088

Query: 1244 KLNIGVDANCCLCWNALETNNHLFFLMHFCSDYMKPIASMLFVDGRRHLFWETLGSKMSS 1423
                G+  +C  C   LET +HLFF   F         S+++VD  R +F     S   S
Sbjct: 1089 NWANGIATDCIFCQGTLETRDHLFFTCSF--------TSVIWVDLARGIFKTQYTSHWQS 1140

Query: 1424 YDFAGTKLNTS------VKLSFAATIHQIWWERNCRRIQNKLRPVEQIVEAIIFYVRNII 1585
               A T            +  F ATI+ +W ERN RR         Q+V  I   +RN +
Sbjct: 1141 IIEAITNSQHHRVEWFLRRYVFQATIYIVWRERNGRRHGEPPNTASQLVGWIDKQIRNQL 1200

Query: 1586 SNV 1594
            S++
Sbjct: 1201 SSI 1203



 Score = 43.9 bits (102), Expect(2) = 9e-50
 Identities = 28/109 (25%), Positives = 59/109 (54%), Gaps = 4/109 (3%)
 Frame = +3

Query: 66  FGHNERTCLKSAGILTYIIFADDVLVFIYLRVSYLEEFNKLLRDIERKTGLAMNPEKSRI 245
           FG++ +   K+ G LT++ FADD++V    ++  +E   K+  +  + +GL ++ EKS +
Sbjct: 687 FGYHPKC--KTMG-LTHLSFADDLMVLSDGKIRSIERIIKVFDEFAKWSGLRISLEKSTV 743

Query: 246 ----VASWVRGHLFW**VPLGIQKPSFPLKYLGLPIVSRKRFVNECLPI 380
               +++  R  +              P++YLGLP+++++    +CLP+
Sbjct: 744 YLAGLSATARNEVA---DRFPFSSGQLPVRYLGLPLITKRLSTTDCLPL 789


>gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thaliana]
          Length = 629

 Score =  178 bits (452), Expect(2) = 5e-49
 Identities = 110/421 (26%), Positives = 192/421 (45%), Gaps = 16/421 (3%)
 Frame = +2

Query: 374  SYILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKF 553
            S + +++ + +    +   S A R+ L+  V+ +   +W   F +PS+ LK+  S+ + F
Sbjct: 192  SPLFEQIRNRIGTWTSRYLSFAGRLNLISSVLWSTMNFWMSAFRLPSACLKEINSICSAF 251

Query: 554  L----NFNSRIISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWI 721
            L      + R   +SW+++CKP++EGGLGLR L E ++ + LKL W V S  D LW+KW 
Sbjct: 252  LWSGPELHRRKAKVSWDDICKPKQEGGLGLRSLTEANVVSVLKLIWRVTSNDDSLWVKWS 311

Query: 722  HSQYIRSDTYWVSQAHMS-SSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGV 898
                ++ +++W    + S  SW+ K+++K R   +      +     T  W D WS +G 
Sbjct: 312  KMNLLKQESFWSLTPNSSLGSWMWKKMLKYRETAKPFSRVEVNNGARTSFWFDNWSGMGH 371

Query: 899  IARDCGKNARRGSDLRRDATIDDL--------SKCSSLSPIVVELKDKLNEVQRISGDHA 1054
            +    G+  +    + R+ T+ +          +   L+ I   L  K      +     
Sbjct: 372  LMDVTGQRGQIDLGISRNKTVAEAWSNRRRRKHRTEQLNDIEAALNQKYQTRNLL---RE 428

Query: 1055 DRLIWRLEP---SGEFSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLK 1225
            D  +WR +       FS K TW  +++K +   W   +W+ H  P++    W     RL 
Sbjct: 429  DATLWRGKGDVFKTSFSTKDTWNQVRKKSNEVAWYKGVWFSHSTPKYQFCTWLALRNRLS 488

Query: 1226 IADRLPKLNIGVDANCCLCWNALETNNHLFFLMHFCSDYMKPIASMLFVDGRRHLFWETL 1405
               R+   N G D  C  C  ++ET +HLFF   + S     IA  + +  R    W+T+
Sbjct: 489  TGYRMQLWNNGSDVKCTFCSTSIETRDHLFFSCSYASAIWTAIAKNV-LQHRFSTDWQTI 547

Query: 1406 GSKMSSYDFAGTKLNTSVKLSFAATIHQIWWERNCRRIQNKLRPVEQIVEAIIFYVRNII 1585
             + +S       +   S +  F  T+H +W ERN RR   + R    ++  +   +RN +
Sbjct: 548  VNYISETQTDRIRSFLS-RYIFQLTVHTVWKERNDRRHGEEPRTSANLISWMDKQIRNQL 606

Query: 1586 S 1588
            S
Sbjct: 607  S 607



 Score = 45.4 bits (106), Expect(2) = 5e-49
 Identities = 25/95 (26%), Positives = 55/95 (57%), Gaps = 3/95 (3%)
 Frame = +3

Query: 108 LTYIIFADDVLVFIYLRVSYLEEFNKLLRDIERKTGLAMNPEKSRIVASWVRGHLFW**V 287
           LT++ FADD+++    +V  ++   +++    +++GL +N EK+ +  + V  H  +  +
Sbjct: 103 LTHLCFADDLMILTDGKVRSVDGIVEVMNLFAKRSGLQINMEKTTLYTAGVSDHNRYMMI 162

Query: 288 ---PLGIQKPSFPLKYLGLPIVSRKRFVNECLPIF 383
              P G+ +   P++YLGLP+V+++    +  P+F
Sbjct: 163 SRYPFGLGQ--LPVRYLGLPLVTKRLTKEDLSPLF 195


>gb|AAC95175.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1352

 Score =  184 bits (466), Expect(2) = 6e-47
 Identities = 106/392 (27%), Positives = 192/392 (48%), Gaps = 13/392 (3%)
 Frame = +2

Query: 380  ILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKFL- 556
            +++K+   +        S A R+ L+K V+ ++  +W   F +P + L++ E + + FL 
Sbjct: 936  LVEKIRARITSWTNRFLSFAGRLQLIKSVLSSITNFWLSVFRLPKACLQEIEKMFSAFLW 995

Query: 557  ---NFNSRIISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWIHS 727
               + N++   ++W  VCK +EEGGLGL+ LKE +  + LKL W + S +D LW+KW++ 
Sbjct: 996  SGPDLNTKKAKIAWSEVCKLKEEGGLGLKPLKEANEVSLLKLIWRILSARDSLWVKWVNK 1055

Query: 728  QYIRSDTYW-VSQAHMSSSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGVIA 904
              IR +T+W V +     SW+ ++++K R          +    +T  W D W PLG + 
Sbjct: 1056 HLIRKETFWSVKENTGLGSWLWRKILKQRDKARLFHRMEVRSGTFTSFWHDHWCPLGRLH 1115

Query: 905  RDCGKNARRGSDLRRDATIDDL----SKCSSLSPIVVELKDKLNEVQRISGDHADRLIWR 1072
            +  G        +  +AT+ ++     +    +  + ++K ++   ++      DR +W+
Sbjct: 1116 QHMGSRGTIDLGIPNNATVAEVMNTHRRKRHRADFLNQIKSQIELARQDRSTDGDRSLWK 1175

Query: 1073 LEP---SGEFSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKIADRLP 1243
             +       FS   TW+ I+       W   +W+    P++S   W   + RL  +D++ 
Sbjct: 1176 QKEDTFKSSFSSSKTWQQIRSISLRCDWYRGVWFSASTPKYSFVTWLAFHNRLTTSDKIC 1235

Query: 1244 KLNIGVDANCCLCWNALETNNHLFFLMHFCSDYMKPIASMLFVDGRRHLFWETLGSKMSS 1423
            K N G   +C  C   LET +HLFF   + S     +   L ++GR  L W  +   +  
Sbjct: 1236 KWNSGARYDCVFCGEELETRDHLFFSCPYSSHVWFSLTKGL-LNGRNILNWNLITPHL-- 1292

Query: 1424 YDFAGTKLNT-SVKLSFAATIHQIWWERNCRR 1516
             D +   L+  +++ +F A+IH +W ERNCRR
Sbjct: 1293 LDSSRPYLHVFTLRYAFQASIHSLWRERNCRR 1324



 Score = 33.1 bits (74), Expect(2) = 6e-47
 Identities = 22/92 (23%), Positives = 47/92 (51%), Gaps = 1/92 (1%)
 Frame = +3

Query: 108  LTYIIFADDVLVFIYLRVSYLEEFNKLLRDIERKTGLAMNPEKSRIVASWVRGHLFW**V 287
            LT++ FADD++VF       ++    +       + L ++ EKS I  + +  +     +
Sbjct: 845  LTHLCFADDIMVFSDGTSKSIQGTLAIFEKFAAMSWLKISLEKSTIFMAGISPNAKTSIL 904

Query: 288  P-LGIQKPSFPLKYLGLPIVSRKRFVNECLPI 380
                 +  + P+KYLGLP+++++   ++ LP+
Sbjct: 905  QQFPFELGTLPVKYLGLPLLTKRMTQSDYLPL 936


>gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus]
          Length = 1214

 Score =  162 bits (411), Expect(2) = 1e-44
 Identities = 108/397 (27%), Positives = 175/397 (44%), Gaps = 17/397 (4%)
 Frame = +2

Query: 374  SYILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKF 553
            S ++DK+    +       S A R+ L+  V+ +   +W   F +P   LK  E + N+F
Sbjct: 779  SQLIDKIAARFNHWATKTLSFAGRLQLISSVIYSTVNFWLSSFILPKCCLKTIEQMCNRF 838

Query: 554  LNFNSRI----ISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWI 721
            L  N       I +SW+N C P+ EGGLGLR     +    L+L W + +++D LW+ W 
Sbjct: 839  LWGNDITRRGDIKVSWQNSCLPKAEGGLGLRNFWTWNKTLNLRLIWMLFARRDSLWVAWN 898

Query: 722  HSQYIRSDTYWVSQAHMSSSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGVI 901
            H+  +R   +W ++A    SWI K ++ +R   +  +   +G       W D WS LG +
Sbjct: 899  HANRLRHVNFWNAEAASHHSWIWKAILGLRPLAKRFLRGAVGNGQLLSYWYDHWSNLGPL 958

Query: 902  ARDCGKNARRGSDLRRDATIDDLSKCS--------SLSPIVVELKDKLNEVQRISGDHA- 1054
                G +  + + +   A + + S  +        + +  +  L+  L      SGD   
Sbjct: 959  IEAIGASGPQLTGIHESAVVTEASSSTGWILPSARTRNASLANLRSTLLNSPAPSGDRGE 1018

Query: 1055 DRLIWRLEPSG--EFSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKI 1228
            D   W +E S    FS K TWE ++++  T  W   +WY   IP+++   W     RL +
Sbjct: 1019 DTYTWYIEGSSSTSFSSKLTWECLRQRDTTKLWAAAVWYKGCIPKYAFNFWVAHLNRLPV 1078

Query: 1229 ADRLPKLNIGVDANCCLCWNALETNNHLFFLMHFCSDYMKPIASMLFVDGRRHLF--WET 1402
              R    +    + CC+C    ET +HLF      S   + + +     GR  +F  W+ 
Sbjct: 1079 RARTTHWSTNRPSLCCVCQRETETRDHLFIHCTLGSLIWQQVLARF---GRSQMFREWKD 1135

Query: 1403 LGSKMSSYDFAGTKLNTSVKLSFAATIHQIWWERNCR 1513
            +   M S    G+   T  KL+    I  IW ERN R
Sbjct: 1136 IIEWMLSNQ--GSFSGTLKKLAVQTAIFHIWKERNSR 1170



 Score = 47.0 bits (110), Expect(2) = 1e-44
 Identities = 25/82 (30%), Positives = 43/82 (52%)
 Frame = +3

Query: 108 LTYIIFADDVLVFIYLRVSYLEEFNKLLRDIERKTGLAMNPEKSRIVASWVRGHLFW**V 287
           ++ + FADD+++F   + S L     +L   +  +GL MN EKS +  + +        +
Sbjct: 691 ISSLAFADDLMIFYDGKASSLRGIKSVLESFKNLSGLEMNTEKSAVYTAGLEDTDKEDTL 750

Query: 288 PLGIQKPSFPLKYLGLPIVSRK 353
             G    +FP +YLGLP++ RK
Sbjct: 751 AFGFVNGTFPFRYLGLPLLHRK 772


>gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]
          Length = 872

 Score =  176 bits (446), Expect(2) = 1e-44
 Identities = 121/405 (29%), Positives = 188/405 (46%), Gaps = 14/405 (3%)
 Frame = +2

Query: 428  FSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKFLNFNSRIIS----MSWEN 595
            FS A R  L+K V+ ++  +W   F +P   +++ + L + FL   S + S    +SW+ 
Sbjct: 452  FSFAGRFNLIKSVLWSICNFWLAAFRLPRQCIREIDKLCSSFLWSGSEMSSHKAKISWDI 511

Query: 596  VCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWIHSQYIRSDTYW-VSQAHM 772
            VCKP+ EGGLGLR LKE +  + LKL W + S  + LW KW+    IR  + W + Q+  
Sbjct: 512  VCKPKAEGGLGLRNLKEANDVSCLKLVWRIISNSNSLWTKWVAEYLIRKKSIWSLKQSTS 571

Query: 773  SSSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGVIARDCGKNARRGSDLRRD 952
              SWI ++++KIR   ++     +G       W D WS  G +    G        + R+
Sbjct: 572  MGSWIWRKILKIRDVAKSFSRVEVGNGESASFWYDHWSAHGRLIDTVGDKGTIDLGIPRE 631

Query: 953  ATIDDLSKCSSLSPIVVELKDKLNEV---QRI-SGDHADRLIWRLEP---SGEFSMKSTW 1111
            A++ D     S       L +++ E+   QRI   D  D ++WR +       FS + TW
Sbjct: 632  ASVADAWTRRSRRRHRTSLLNEIEEMMAYQRIHHSDAEDTVLWRGKNDVFKPHFSTRDTW 691

Query: 1112 EFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKIADRLPKLNI--GVDANCCLCW 1285
              I+    T  W   +W+ H  P+++L  W   + RL   DR+ K N    V  NC LC 
Sbjct: 692  HLIKATSSTVSWHKGVWFRHATPKYALCTWLAIHNRLPTGDRMLKWNSSGSVSGNCVLCT 751

Query: 1286 NALETNNHLFFLMHFCSDYMKPIASMLFVDGRRHLFWETLGSKMSSYDFAGTKLNTSVKL 1465
            N  +T  HLFF   + S     +A  ++   R    W  L + +S++ F         + 
Sbjct: 752  NNSKTLEHLFFSCSYASTVWAALAKGIW-KTRYSTRWSHLLTHISTH-FQDRVEGFLTRY 809

Query: 1466 SFAATIHQIWWERNCRRIQNKLRPVEQIVEAIIFYVRNIISNVSQ 1600
             F ATI+ +W ERN RR          ++  I    RN I+ + Q
Sbjct: 810  IFQATIYHVWRERNGRRHDAAPNTPATVIGWIDKQTRNQITIIRQ 854



 Score = 33.1 bits (74), Expect(2) = 1e-44
 Identities = 23/91 (25%), Positives = 47/91 (51%), Gaps = 9/91 (9%)
 Frame = +3

Query: 108 LTYIIFADDVLVFIYLRVSYLEEFNKLLRDIERKTGLAMNPEKSRIVASWVRGHLFW**V 287
           LT++ FADD++V    +   +E   ++  +  +++GL ++ EKS +  + V         
Sbjct: 345 LTHLSFADDLMVLSDGKTRSIEGILEVFDEFCKRSGLRISLEKSTLYMAGVS-------- 396

Query: 288 PLGIQK---------PSFPLKYLGLPIVSRK 353
           P+  Q+            P++YLGLP+V+++
Sbjct: 397 PIIKQEIAAKFLFDVGQLPVRYLGLPLVTKR 427


>gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1529

 Score =  170 bits (430), Expect(2) = 2e-43
 Identities = 111/420 (26%), Positives = 194/420 (46%), Gaps = 13/420 (3%)
 Frame = +2

Query: 374  SYILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKF 553
            S +++ V   +    A   S A R+ LL  V+ ++  +W   + +P+  +++ E L + F
Sbjct: 1084 SPLIEAVKTKISSWTARSLSYAGRLALLNSVIVSIANFWMSAYRLPAGCIREIEKLCSAF 1143

Query: 554  L----NFNSRIISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWI 721
            L      N +   ++W ++C+P++EGGLG++ L E +  + LKL W + S +  LW+ WI
Sbjct: 1144 LWSGPVLNPKKAKIAWSSICQPKKEGGLGIKSLAEANKVSCLKLIWRLLSTQPSLWVTWI 1203

Query: 722  HSQYIRSDTYWVSQAHMS-SSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGV 898
             +  IR  T+W +    S  SW+ K+L+K R   ++     +     T  W D WS LG 
Sbjct: 1204 WTFIIRKGTFWSANERSSLGSWMWKKLLKYRELAKSMHKVEVRNGSSTSFWYDHWSHLGR 1263

Query: 899  IARDCGKNARRGSDLRRDATIDDLSKCSSLSPIVVELKDKLN-EVQRISGDHA----DRL 1063
            +    G        +  +  ++ + +          + +++N E+QR+         D  
Sbjct: 1264 LLDITGTRRVIDLGIPLETNLETVLRTHQHRQHRAAIYNRINAEIQRLQQQEREAGPDIS 1323

Query: 1064 IWRL---EPSGEFSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKIAD 1234
            +WR    + +  F  K TW  ++  +    W   +W+P+  P++S  LW     RL   D
Sbjct: 1324 LWRSLKNDFNKRFITKVTWNNVRTHQPQQNWYKGVWFPYSTPKYSFLLWLTVQNRLSTGD 1383

Query: 1235 RLPKLNIGVDANCCLCWNALETNNHLFFLMHFCSDYMKPIASMLFVDGRRHLFWETLGSK 1414
            R+   N G    C LC NA ET +HLFF   + S Y+    +   +       W  L + 
Sbjct: 1384 RIKAWNSGQLVTCTLCNNAEETRDHLFFSCQYTS-YVWEALTQRLLSTNYSRDWNRLFTL 1442

Query: 1415 MSSYDFAGTKLNTSVKLSFAATIHQIWWERNCRRIQNKLRPVEQIVEAIIFYVRNIISNV 1594
            + + +     L    +  F A+I+ IW ERN RR      P  ++++ I   VRN IS++
Sbjct: 1443 LCTSNLPRDHL-FLFRYVFQASIYHIWRERNARRHGEISSPTNRLIKLIDKTVRNRISSI 1501



 Score = 35.4 bits (80), Expect(2) = 2e-43
 Identities = 23/92 (25%), Positives = 46/92 (50%), Gaps = 1/92 (1%)
 Frame = +3

Query: 108  LTYIIFADDVLVFIYLRVSYLEEFNKLLRDIERKTGLAMNPEKSRIVASWV-RGHLFW** 284
            LT++ FADD++VF+      +E    + ++   ++GL ++ EKS I  + V         
Sbjct: 995  LTHLCFADDLMVFVDGHQWSIEGVINVFKEFAGRSGLQISLEKSTIYLAGVSASDRVQTL 1054

Query: 285  VPLGIQKPSFPLKYLGLPIVSRKRFVNECLPI 380
                      P++YLGLP+++++    +  P+
Sbjct: 1055 SSFPFANGQLPVRYLGLPLLTKQMTTADYSPL 1086


>emb|CAB39942.1| putative protein [Arabidopsis thaliana] gi|7267871|emb|CAB78214.1|
            putative protein [Arabidopsis thaliana]
          Length = 473

 Score =  182 bits (463), Expect = 4e-43
 Identities = 108/406 (26%), Positives = 193/406 (47%), Gaps = 12/406 (2%)
 Frame = +2

Query: 419  ASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKFL----NFNSRIISMS 586
            A   S A R+ L+  V+ ++  +W   F +P   +++ + + + +L      N+    ++
Sbjct: 51   ARFLSYAGRLNLISSVLWSICNFWMGAFRLPRDCIREIDKMCSAYLWSGGELNTSKAKIT 110

Query: 587  WENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWIHSQYIRSDTYWVSQA 766
            W  VCKP+EEGGLGLR LKE +    LKL W + S  D LW+KWI S  ++  ++W  + 
Sbjct: 111  WAFVCKPKEEGGLGLRSLKEANDVCCLKLIWRIISHADSLWVKWIQSSLLKKVSFWAVRE 170

Query: 767  HMS-SSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGVIARDCGKNARRGSDL 943
            + S  SW+ ++++K R          I     T  W D WS LG +    G        +
Sbjct: 171  NTSLGSWMWRKILKFRDIARTLCKVEINNGARTSFWYDDWSDLGRLIDSAGDRGAIDLGI 230

Query: 944  RRDATIDDL----SKCSSLSPIVVELKDKLNEVQRISGDHADRLIWRLEPS---GEFSMK 1102
             + AT+ +      +    +  +  ++++L           DR +W+ + +     FS K
Sbjct: 231  NKHATVVEAWGNRRRRRHRTNFLNRVEERLILSWNSRNQAEDRALWKGKENRFRSIFSTK 290

Query: 1103 STWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKIADRLPKLNIGVDANCCLC 1282
             TW  I+   +   W   +W+   IP+H+  +W   + RL   DR+   N+GVDA C LC
Sbjct: 291  DTWNHIRTVSNKVAWYKGVWFAQAIPKHAFCMWLAVHNRLSTGDRMTLWNMGVDATCILC 350

Query: 1283 WNALETNNHLFFLMHFCSDYMKPIASMLFVDGRRHLFWETLGSKMSSYDFAGTKLNTSVK 1462
              ALE+ +HLFF   F ++  +P+A  ++ +   +  W+T+ + +S  ++         +
Sbjct: 351  NKALESRDHLFFSCPFATEIWEPLAKTIY-NTCFYTDWQTIINNVSR-NWPDRIAGFLAR 408

Query: 1463 LSFAATIHQIWWERNCRRIQNKLRPVEQIVEAIIFYVRNIISNVSQ 1600
                 TI+ +W ERN R+         +++  I  ++RN +  + Q
Sbjct: 409  CILQVTIYTLWRERNERKHGASPNSSSRLISWIDKHIRNHLMAIKQ 454


>emb|CAB72467.1| putative protein [Arabidopsis thaliana]
          Length = 762

 Score =  169 bits (429), Expect(2) = 5e-43
 Identities = 92/328 (28%), Positives = 158/328 (48%), Gaps = 14/328 (4%)
 Frame = +2

Query: 380  ILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKFL- 556
            +++++   +    +   S A R  L+  ++ +   +W   F +P + +++ E L + FL 
Sbjct: 342  LIEQIRKRIGSWSSRFLSFAGRFNLISSIIWSSCNFWLSAFQLPRACIQEIEKLCSSFLW 401

Query: 557  ---NFNSRIISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWIHS 727
               N NS+   +SW  VCKP+ EGGLGLR LKE +    LKL W + S  D LW+KW+  
Sbjct: 402  SGTNLNSKKAKISWNQVCKPKSEGGLGLRSLKEANDVCCLKLVWRIISHGDSLWVKWVEH 461

Query: 728  QYIRSDTYWVSQAHMS-SSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGVIA 904
              ++ + +W+ + + +  SWI K+++K R   +      +G    T  W D WS LG + 
Sbjct: 462  NLLKREIFWIVKENANLGSWIWKKILKYRGVAKRFCKAEVGNGESTSFWFDDWSLLGRLI 521

Query: 905  RDCGKNARRGSDLRRDATIDDLSKCSSLSPIVVELKDKLNEV------QRISGDHADRLI 1066
               G        + R  ++ D            E+ + + EV      +R       R++
Sbjct: 522  DVAGIRGTIDMGISRTMSVADAWTSRRRRHHRQEILNTIEEVLSTQHQKRTQQQQQGRVL 581

Query: 1067 WRLEP---SGEFSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKIADR 1237
            W+ +      +FS K+TW +++   +   W   +W+PH  P++S  LW  A+ RL    R
Sbjct: 582  WKGKNDIYKDKFSTKNTWNYLRTTSNEVAWHKGVWFPHATPKYSFCLWLAAHDRLATGAR 641

Query: 1238 LPKLNIGVDANCCLCWNALETNNHLFFL 1321
            + K N G   +C  C   +ET +HLFF+
Sbjct: 642  MIKWNRGETGDCTFCRQGIETRDHLFFM 669



 Score = 34.3 bits (77), Expect(2) = 5e-43
 Identities = 23/86 (26%), Positives = 45/86 (52%), Gaps = 4/86 (4%)
 Frame = +3

Query: 108 LTYIIFADDVLVFIYLRVSYLEEFNKLLRDIERKTGLAMNPEKSRI----VASWVRGHLF 275
           LT++ FADD+++    +   +E   ++     + +GL ++ EKS I    ++S  R  L 
Sbjct: 251 LTHLSFADDLMILTDGQCRSIEGIIEVFDLFSKWSGLKISMEKSTIFSAGLSSTSRAQLH 310

Query: 276 W**VPLGIQKPSFPLKYLGLPIVSRK 353
                   +    P++YLGLP+V+++
Sbjct: 311 ---THFPFEVGELPIRYLGLPLVTKR 333


>gb|ABW81051.1| tn7 reverse transcriptase [Arabidopsis lyrata subsp. lyrata]
          Length = 441

 Score =  175 bits (444), Expect = 7e-41
 Identities = 116/422 (27%), Positives = 193/422 (45%), Gaps = 17/422 (4%)
 Frame = +2

Query: 380  ILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKFL- 556
            +++++   +    A   S A R+ L+  V+ +L  +W   F +P++ +K+ + L + FL 
Sbjct: 9    LIERIRERISCWTARHLSFAGRLQLISSVIHSLTNFWMSAFRLPNACIKEIDGLCSAFLW 68

Query: 557  ---NFNSRIISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWIHS 727
                 N +   +SW +VC P+EEGGLGLR L E +    LKL W + S   L W++W+  
Sbjct: 69   SGPELNRKKAKVSWNDVCMPKEEGGLGLRSLTEANKVCCLKLIWRLLSSSSL-WVQWLRQ 127

Query: 728  QYIRSDTYW-VSQAHMSSSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGVIA 904
              IR  ++W +       SW+ ++L+K R        Y I        W D WSPLG + 
Sbjct: 128  YVIRKGSFWSLRDTSTLGSWMWRKLLKYRHLASGFTQYEIRNGKGVSFWHDNWSPLGPLI 187

Query: 905  RDCGKNARRGSDLRRDATIDDL---SKCSSLSPIVVELKDKLNEVQRISG--DHADRLIW 1069
               G        +   AT+ +     +    +  + +++ +L E+ R  G  +  D ++W
Sbjct: 188  AISGTRGCIDMGIDIHATVAEALTHRRRRHRADHLNQMEAQLEEL-RTKGLVETEDVVLW 246

Query: 1070 -----RLEPSGEFSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKIAD 1234
                 R +PS  FS K TW   + +K    W   IW+ H  P++S   W     RL   D
Sbjct: 247  KGKGGRFKPS--FSTKETWADTREQKPRNEWYQGIWFSHATPKYSFITWLATKNRLSTGD 304

Query: 1235 RLPKLNIGVDANCCLCWNALETNNHLFFLMHFCSDYMKPIASMLFVDGRRHLF--WETLG 1408
            R+   N GV+ +C  C    ET NHLFF   +  +    + S L     RH    W T+ 
Sbjct: 305  RMMSWNAGVNLSCVFCQEQTETRNHLFFTCRYSREVWSGLTSKLLT---RHYSTDWTTIL 361

Query: 1409 SKMSSYDFAGTKLNTSVKLSFAATIHQIWWERNCRRIQNKLRPVEQIVEAIIFYVRNIIS 1588
              ++       +L   ++ +F   ++ IW ERN RR   +  P   +++ +   VRN +S
Sbjct: 362  KLLTDKTLGNNRL-FLLRYAFQILVYSIWKERNSRRHGEEPLPSALLLKRLDKEVRNKLS 420

Query: 1589 NV 1594
             +
Sbjct: 421  TI 422


>dbj|BAD95408.1| hypothetical protein [Arabidopsis thaliana]
          Length = 478

 Score =  171 bits (433), Expect(2) = 1e-40
 Identities = 112/422 (26%), Positives = 201/422 (47%), Gaps = 17/422 (4%)
 Frame = +2

Query: 380  ILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKFL- 556
            +++K+   +    A   S A R+ L+  V+ +L  +W   F +PS+ +K+ +S+ + FL 
Sbjct: 45   LVEKIRVRIGKWTARHLSFAGRLQLISSVIHSLTNFWMSAFRLPSACIKEIDSICSSFLW 104

Query: 557  ---NFNSRIISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWIHS 727
                 N++   ++W +VC P++EGGLG+R LKE +  + LKL W + S   L W++W+  
Sbjct: 105  SGPELNTKKAKVAWSDVCTPKDEGGLGIRSLKEANKVSLLKLIWRMLSSTSL-WVQWLRL 163

Query: 728  QYIRSDTYW-VSQAHMSSSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGVIA 904
              +R  ++W +S      SW+ K+++K R      V + I     T  W D WS +G + 
Sbjct: 164  YLLRKGSFWSISGNTTLGSWMWKKILKHRALASGFVKHDIHNGSNTSFWFDNWSKIGRLI 223

Query: 905  RDCGKNARRGSDLRRDATIDDL----SKCSSLSPIVVELKDKLNEVQR---ISGDHADRL 1063
               G        +   A++ +              ++ ++D + EV+     SG+  D +
Sbjct: 224  DVTGHRGCIDMGITLHASVAEAVVNHRPRRHRHDTLLRIEDVIAEVRHQGLTSGE--DTV 281

Query: 1064 IWRLEPSGE-----FSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKI 1228
             W+   +G+     F+ K TW   +  K    W   +W+ H  P++S+  W     RL  
Sbjct: 282  RWK--GNGDIFKPCFNTKETWAATREPKLKVNWYKGVWFSHATPKYSVLAWIAIKNRLTT 339

Query: 1229 ADRLPKLNIGVDANCCLCWNALETNNHLFFLMHFCSDYMKPIASMLFVDGRRHLFWETLG 1408
             DR+   N G D++C LC + +ET +HLFF   + ++    +   L      +  WE + 
Sbjct: 340  GDRMLSWNAGADSSCVLCHHLVETRDHLFFTCPYSAEVWSTLTRKLLSQHFTNR-WEAI- 397

Query: 1409 SKMSSYDFAGTKLNTSVKLSFAATIHQIWWERNCRRIQNKLRPVEQIVEAIIFYVRNIIS 1588
             K+ +    G ++    + +F  T+H +W ERN RR     +   Q+V  +   VRN IS
Sbjct: 398  LKLLTNKSLGHEVPFLTRYTFQLTLHSLWKERNGRRHGEVPQAAAQMVRFLDKQVRNRIS 457

Query: 1589 NV 1594
            ++
Sbjct: 458  SI 459



 Score = 25.0 bits (53), Expect(2) = 1e-40
 Identities = 8/24 (33%), Positives = 18/24 (75%)
 Frame = +3

Query: 309 SFPLKYLGLPIVSRKRFVNECLPI 380
           + P++YLGLP++++K   ++  P+
Sbjct: 22  ALPVRYLGLPLLTKKMTTSDYGPL 45


>gb|AAG51098.1|AC025295_6 hypothetical protein [Arabidopsis thaliana]
          Length = 504

 Score =  160 bits (405), Expect(2) = 9e-40
 Identities = 88/323 (27%), Positives = 152/323 (47%), Gaps = 12/323 (3%)
 Frame = +2

Query: 380  ILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKFL- 556
            ++D +   +    A   S   R+ L+  ++ ++  +W   F +P   +++ + + + +L 
Sbjct: 119  LIDHIKQKICSWSARFLSYTGRLNLISSILWSICNFWMGAFRLPRDCIREIDKMCSAYLW 178

Query: 557  ---NFNSRIISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWIHS 727
                 N+    ++W  VCKP+EEGGLGLR LKE +    LKL W + S  D LW+KWI S
Sbjct: 179  SGGELNTSKAKIAWAFVCKPKEEGGLGLRSLKEANDVCCLKLIWRIISHADSLWVKWIQS 238

Query: 728  QYIRSDTYWVSQAHMS-SSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGVIA 904
              ++   +W  + + S  SW+ ++++K R          I     T  W D WS LG + 
Sbjct: 239  SLLKKVFFWAVRENTSLGSWMWRKILKFRDIARTLCKVEINNGAQTSFWYDDWSDLGRLI 298

Query: 905  RDCGKNARRGSDLRRDATIDDL----SKCSSLSPIVVELKDKLNEVQRISGDHADRLIWR 1072
               G        + + AT+ +      +    +  +  ++++L           D  +W+
Sbjct: 299  ESAGDRGAIDLGINKHATVVEAWGNRRRRRHRANFLNRVEERLVLSWNSRNQAEDCALWK 358

Query: 1073 LEPS---GEFSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKIADRLP 1243
             + +     FS K TW  I+   +   W   +W+   IP+H+  +W   + RL   DR+ 
Sbjct: 359  GKENRFRSIFSTKDTWNHIRTVSNKVAWYKGVWFAQAIPKHAFCMWLAVHNRLSTGDRMT 418

Query: 1244 KLNIGVDANCCLCWNALETNNHL 1312
              N+GVDA C LC NALE+ +HL
Sbjct: 419  LWNMGVDATCILCNNALESRDHL 441



 Score = 32.7 bits (73), Expect(2) = 9e-40
 Identities = 32/110 (29%), Positives = 57/110 (51%), Gaps = 5/110 (4%)
 Frame = +3

Query: 66  FGHNERTCLKSAGILTYIIFADDVLVFIYLRVSYLEEFNKLLRDIERKTGLAMNPEKSRI 245
           FG++ R   K  G LT++ FADD++V    +V  +E    +     + + L ++ EKS +
Sbjct: 17  FGYHPRC--KQIG-LTHLSFADDLMVLSDGKVRSIEGIVDVFDTFAKCSDLKISMEKSTV 73

Query: 246 VASWVRGHLFW**VPLGIQKPSF-----PLKYLGLPIVSRKRFVNECLPI 380
             + +  H     V   I + SF     P++YLGLP+V+++    + LP+
Sbjct: 74  YLAGL-SHTTRQEV---IDRFSFAVGTLPVRYLGLPLVTKQFSSTDYLPL 119


>gb|AAC19278.1| T14P8.10 [Arabidopsis thaliana] gi|7269009|emb|CAB80742.1| AT4g02490
            [Arabidopsis thaliana]
          Length = 657

 Score =  145 bits (365), Expect(2) = 1e-39
 Identities = 94/335 (28%), Positives = 150/335 (44%), Gaps = 16/335 (4%)
 Frame = +2

Query: 380  ILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKFL- 556
            ++D+++       A   S A R+ LLK V+ +   +W   F +P+  L K E + N FL 
Sbjct: 330  LVDRINSRFTSWTARHLSFAGRLQLLKSVIYSTINFWASIFILPNQCLHKLEQMCNAFLW 389

Query: 557  ---NFNSRIISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWIHS 727
                 ++R   +SW+ VC  +E GGLGL+RL   +    LKL W + +    LW+ W+  
Sbjct: 390  SGAPNSAREAKISWDIVCSSKESGGLGLKRLSSWNKVLALKLIWLLFTASGSLWVSWVR- 448

Query: 728  QYIRSDTYWVSQAHMSSSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGVIAR 907
                              W+ ++L K+R      V   +G  +  + W D W+  G +  
Sbjct: 449  ------------------WVWRKLCKLREVARPFVICEVGSGITARFWQDNWTGHGPLIH 490

Query: 908  DCGKNARRGSDLR-----RDATIDD---LSKCSSLSPIVVELKDKLNEVQR-ISGDHADR 1060
              G    +   L      RDA  +D   ++   S +P+++ LK  L  V   +  +H D 
Sbjct: 491  LTGLTGPQLVGLSITSVVRDAIRNDDWWIASSRSRNPVILLLKSLLPPVGNLVDCEHDDS 550

Query: 1061 LIWRLE---PSGEFSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKIA 1231
             +W++    PS +FS   TW  +Q    +  W   +W+ + +P+H+   W  A  RL   
Sbjct: 551  YLWKVGDRVPSSKFSTADTWRALQPFSVSVSWHKAVWFTNQVPKHAFISWVTAWNRLHTR 610

Query: 1232 DRLPKLNIGVDANCCLCWNALETNNHLFFLMHFCS 1336
            DRL    + V A C LC    ET +HLFF   F S
Sbjct: 611  DRLRSWGLIVPAECVLCNLVDETRDHLFFACRFSS 645



 Score = 47.4 bits (111), Expect(2) = 1e-39
 Identities = 28/97 (28%), Positives = 57/97 (58%), Gaps = 3/97 (3%)
 Frame = +3

Query: 99  AGILTYIIFADDVLVFIYLRVSYLEEFNKLLRDIERKTGLAMNPEKSRIV---ASWVRGH 269
           A ++T++ FADD+LVF    +S L     +L   ++ +GL +N +K+ ++    ++ R  
Sbjct: 236 APMITHLSFADDILVFCDGSLSSLVAILDILDVFKKGSGLGINLQKTALLLDGGNFERNR 295

Query: 270 LFW**VPLGIQKPSFPLKYLGLPIVSRKRFVNECLPI 380
           +      LG+ + S P++YLG+P++S+K   ++  P+
Sbjct: 296 IMA--ASLGVSQGSLPVRYLGVPLMSQKMKKHDYQPL 330


>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score =  145 bits (366), Expect(2) = 1e-37
 Identities = 112/419 (26%), Positives = 180/419 (42%), Gaps = 15/419 (3%)
 Frame = +2

Query: 380  ILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKFL- 556
            +L+K+          C S A R+ L+  V+     +W   F +P   +K+ ESL ++FL 
Sbjct: 782  LLEKITARFRSWVNKCLSFAGRIQLISSVIFGSINFWMSTFLLPKGCIKRIESLCSRFLW 841

Query: 557  --NFN-SRIISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWIHS 727
              N   ++ I +SW  +C P+ EGGLGLRRL E +    ++L W +   KD LW  W H 
Sbjct: 842  SGNIEQAKGIKVSWAALCLPKSEGGLGLRRLLEWNKTLSMRLIWRLFVAKDSLWADWQHL 901

Query: 728  QYIRSDTYWVSQAHMSSSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGVIAR 907
             ++   ++W  +   S SW  KRL+ +R      +   +G  L    W D W+ LG + R
Sbjct: 902  HHLSRGSFWAVEGGQSDSWTWKRLLSLRPLAHQFLVCKVGNGLKADYWYDNWTSLGPLFR 961

Query: 908  ---DCGKNARRGSDLRRDATI---DDLSKCSSLSPIVVELKDKL--NEVQRISGDHADRL 1063
               D G ++ R   L + A+    D      S S     + D L    V   + +  DR 
Sbjct: 962  IIGDIGPSSLRVPLLAKVASAFSEDGWRLPVSRSAPAKGIHDHLCTVPVPSTAQEDVDRY 1021

Query: 1064 IWRLEP--SGEFSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKIADR 1237
             W +       FS   TWE I+ K     W + IW+   +P+++  +W     RL    R
Sbjct: 1022 EWSVNGFLCQGFSAAKTWEAIRPKATVKSWASSIWFKGAVPKYAFNMWVSHLNRLLTRQR 1081

Query: 1238 LPKLNIGVDANCCLCWNALETNNHLFFLMHFCSDYMKPIASMLFVDGRRHLFWETLGSKM 1417
            L          C LC  A E+ +HL  +  F +   + +   +    R    W  L S +
Sbjct: 1082 LASWGHIQSDACVLCSFASESRDHLLLICEFSAQVWRLVFRRICPRQRLFSSWSELLSWV 1141

Query: 1418 SSYDFAGTKLNTSVKLSFAATIHQIWWERNCRRIQNKLRPVEQIVEAII-FYVRNIISN 1591
                     L    K+     ++ +W +RN   + N LR    ++  ++   +RNIIS+
Sbjct: 1142 RQSSPEAPPLLR--KIVSQVVVYNLWRQRN-NLLHNSLRLAPAVIFKLVDREIRNIISS 1197



 Score = 40.8 bits (94), Expect(2) = 1e-37
 Identities = 25/91 (27%), Positives = 47/91 (51%)
 Frame = +3

Query: 108 LTYIIFADDVLVFIYLRVSYLEEFNKLLRDIERKTGLAMNPEKSRIVASWVRGHLFW**V 287
           +++++FADDV++F       L    + L D    +GL +N +KS +  + +         
Sbjct: 692 ISHLMFADDVMIFFDGGSFSLHGICETLDDFASWSGLKVNKDKSHLYLAGLNQLESNANA 751

Query: 288 PLGIQKPSFPLKYLGLPIVSRKRFVNECLPI 380
             G    + P++YLGLP+++RK  + E  P+
Sbjct: 752 AYGFPIGTLPIRYLGLPLMNRKLRIAEYEPL 782


>dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis
            thaliana]
          Length = 1072

 Score =  138 bits (347), Expect(2) = 5e-36
 Identities = 99/390 (25%), Positives = 155/390 (39%), Gaps = 14/390 (3%)
 Frame = +2

Query: 380  ILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKFLN 559
            +L+K+   L    +   S A R  L+  V+  L  +W   F +P   +KK ESL +KFL 
Sbjct: 642  LLEKLSARLRSWVSKALSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLW 701

Query: 560  FNS----RIISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWIHS 727
              S    +   +SW + C P+ EGGLG R   E +    L+L W +  +   LW +W   
Sbjct: 702  AGSIDGRKSSKVSWVDCCLPKSEGGLGFRSFGEWNKTLLLRLIWVLFDRDTSLWAQWQRH 761

Query: 728  QYIRSDTYWVSQAHMSSSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGVIAR 907
              +   ++W   A  +  W  K L+ +R   E  +   +G       W D W+ LG + +
Sbjct: 762  HRLGHASFWQVNALQTDPWTWKMLLNLRPLAEKFIKAKVGNGGTVSFWFDCWTSLGPLIK 821

Query: 908  DCGKNARRGSDLRRDATIDD----------LSKCSSLSPIVVELKDKLNEVQRISGDHAD 1057
              G    R   +   A + D          LS+  +   I+  L         +  D   
Sbjct: 822  YLGDVGSRPLRIPFSAKVADAIDGSGWRLPLSRSLTADSILSHLASLPPPSPLMVSDSYS 881

Query: 1058 RLIWRLEPSGEFSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKIADR 1237
              +  ++  G FS   TWE ++ ++   RW   +W+   +P+H+   W     RL    R
Sbjct: 882  WCVDDVDCQG-FSAAKTWEVLRPRRPVKRWAKSVWFKGAVPKHAFNFWTAQLNRLPTRQR 940

Query: 1238 LPKLNIGVDANCCLCWNALETNNHLFFLMHFCSDYMKPIASMLFVDGRRHLFWETLGSKM 1417
            L    +   A CCLC    ET +HL  L  F S   + +   L    R    W  L S  
Sbjct: 941  LVSWGLVSSAECCLCSFDTETRDHLLLLCDFSSQVWRMVFLRLCPRQRLLCTWAELLSWT 1000

Query: 1418 SSYDFAGTKLNTSVKLSFAATIHQIWWERN 1507
                 A   L   V       ++ +W +RN
Sbjct: 1001 RQSTAAAPSLLRKVVAQL--VVYNLWRQRN 1028



 Score = 42.4 bits (98), Expect(2) = 5e-36
 Identities = 25/91 (27%), Positives = 49/91 (53%)
 Frame = +3

Query: 108 LTYIIFADDVLVFIYLRVSYLEEFNKLLRDIERKTGLAMNPEKSRIVASWVRGHLFW**V 287
           +++++FADDV++F     S +    + L D    +GL +N +KS++  + +         
Sbjct: 552 ISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDKSQLFQAGLDLSERITSA 611

Query: 288 PLGIQKPSFPLKYLGLPIVSRKRFVNECLPI 380
             G    +FP++YLGLP++ RK  + +  P+
Sbjct: 612 AYGFPAGTFPIRYLGLPLMCRKLRIADYGPL 642


>dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1072

 Score =  138 bits (347), Expect(2) = 5e-36
 Identities = 99/390 (25%), Positives = 155/390 (39%), Gaps = 14/390 (3%)
 Frame = +2

Query: 380  ILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKFLN 559
            +L+K+   L    +   S A R  L+  V+  L  +W   F +P   +KK ESL +KFL 
Sbjct: 642  LLEKLSARLRSWVSKALSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLW 701

Query: 560  FNS----RIISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWIHS 727
              S    +   +SW + C P+ EGGLG R   E +    L+L W +  +   LW +W   
Sbjct: 702  AGSIDGRKSSKVSWVDCCLPKSEGGLGFRSFGEWNKTLLLRLIWVLFDRDTSLWAQWQRH 761

Query: 728  QYIRSDTYWVSQAHMSSSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGVIAR 907
              +   ++W   A  +  W  K L+ +R   E  +   +G       W D W+ LG + +
Sbjct: 762  HRLGHASFWQVNALQTDPWTWKMLLNLRPLAEKFIKAKVGNGGTVSFWFDCWTSLGPLIK 821

Query: 908  DCGKNARRGSDLRRDATIDD----------LSKCSSLSPIVVELKDKLNEVQRISGDHAD 1057
              G    R   +   A + D          LS+  +   I+  L         +  D   
Sbjct: 822  YLGDVGSRPLRIPFSAKVADAIDGSGWRLPLSRSLTADSILSHLASLPPPSPLMVSDSYS 881

Query: 1058 RLIWRLEPSGEFSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKIADR 1237
              +  ++  G FS   TWE ++ ++   RW   +W+   +P+H+   W     RL    R
Sbjct: 882  WCVDDVDCQG-FSAAKTWEVLRPRRPVKRWARSVWFKGAVPKHAFNFWTAQLNRLPTRQR 940

Query: 1238 LPKLNIGVDANCCLCWNALETNNHLFFLMHFCSDYMKPIASMLFVDGRRHLFWETLGSKM 1417
            L    +   A CCLC    ET +HL  L  F S   + +   L    R    W  L S  
Sbjct: 941  LVSWGLVSSAECCLCSFDTETRDHLLLLCDFSSQVWRMVFLRLCPRQRLLCTWAELLSWT 1000

Query: 1418 SSYDFAGTKLNTSVKLSFAATIHQIWWERN 1507
                 A   L   V       ++ +W +RN
Sbjct: 1001 RQSTAAAPSLLRKVVAQL--VVYNLWRQRN 1028



 Score = 42.4 bits (98), Expect(2) = 5e-36
 Identities = 25/91 (27%), Positives = 49/91 (53%)
 Frame = +3

Query: 108 LTYIIFADDVLVFIYLRVSYLEEFNKLLRDIERKTGLAMNPEKSRIVASWVRGHLFW**V 287
           +++++FADDV++F     S +    + L D    +GL +N +KS++  + +         
Sbjct: 552 ISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDKSQLFQAGLDLSERITSA 611

Query: 288 PLGIQKPSFPLKYLGLPIVSRKRFVNECLPI 380
             G    +FP++YLGLP++ RK  + +  P+
Sbjct: 612 AYGFPAGTFPIRYLGLPLMCRKLRIADYGPL 642


>gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm, score: 60.13)
            [Arabidopsis thaliana]
          Length = 1164

 Score =  131 bits (329), Expect(2) = 7e-36
 Identities = 89/335 (26%), Positives = 143/335 (42%), Gaps = 16/335 (4%)
 Frame = +2

Query: 380  ILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKFLN 559
            +++K+    +       S A R+ LL  V+  +  +W   F +P   +KK ESL ++FL 
Sbjct: 679  LIEKITARFNSWVVRLLSFAGRVQLLASVISGIVNFWISSFILPLGCIKKIESLCSRFL- 737

Query: 560  FNSRI-----ISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWIH 724
            ++SRI       ++W  VC P+ EGG+GLRR    +    L++ W + S    LW+ W H
Sbjct: 738  WSSRIDKKGIAKVAWSQVCLPKAEGGIGLRRFAVSNRTLYLRMIWLLFSNSGSLWVAW-H 796

Query: 725  SQYI--RSDTYWVSQAHMSSSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGV 898
             Q+   +S ++W        SW  K L+++R   E  +   +G       W D W+P G 
Sbjct: 797  KQHSLGKSTSFWNQPEKPHDSWNWKCLLRLRVVAERFIRCNVGNGRDASFWFDNWTPFGP 856

Query: 899  IARDCGKNARRGSDLRRDATIDDL------SKCSSLSPIVVELKDKLNEVQRIS-GDHAD 1057
            + +  G    R   +  +A I D+      S     S   + L   L  +   S     D
Sbjct: 857  LIKFLGNEGPRDLRVHLNAKISDVCTSEGWSIADPRSDQALSLHTHLTNISMPSDAQDLD 916

Query: 1058 RLIWRLEPS--GEFSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKIA 1231
               W ++      FS  +TW  ++       W   +W+    P+H+  LW     RL   
Sbjct: 917  SYDWVVDNKVCQGFSAAATWSALRPSSAPVPWARAVWFKGATPKHAFHLWTAHLDRLPTK 976

Query: 1232 DRLPKLNIGVDANCCLCWNALETNNHLFFLMHFCS 1336
             RL    + +D  C LC    ET +HLF    F +
Sbjct: 977  VRLASWGMQIDTTCGLCSLHPETRDHLFLSCDFAN 1011



 Score = 48.9 bits (115), Expect(2) = 7e-36
 Identities = 28/91 (30%), Positives = 50/91 (54%)
 Frame = +3

Query: 108 LTYIIFADDVLVFIYLRVSYLEEFNKLLRDIERKTGLAMNPEKSRIVASWVRGHLFW**V 287
           +++++FADDV++F   + S L    + L D    +GL MN  K+++  + +         
Sbjct: 589 ISHLMFADDVMIFFDGKSSSLHGIVESLEDFAGWSGLLMNTNKTQLYHAGLSQSESDSMA 648

Query: 288 PLGIQKPSFPLKYLGLPIVSRKRFVNECLPI 380
             G +  S P++YLGLP++SRK  + E  P+
Sbjct: 649 SYGFKLGSLPVRYLGLPLMSRKLTIAEYAPL 679


>gb|AAF99785.1|AC012463_2 T2E6.4 [Arabidopsis thaliana]
          Length = 740

 Score =  137 bits (346), Expect(2) = 6e-35
 Identities = 108/397 (27%), Positives = 171/397 (43%), Gaps = 31/397 (7%)
 Frame = +2

Query: 374  SYILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKF 553
            S +LDKV   +    A   S A R+ L+  V+ +L  +W   + +P+  +K+ E L + F
Sbjct: 360  SPLLDKVRSKISSWTARSLSYAGRLALINSVIVSLSNFWMSAYRLPAGCIKEIEKLCSAF 419

Query: 554  L----NFNSRIISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWI 721
            L      N +   ++W ++CK ++EGGLG++ L E +  + LKL W + S++  LW+ W+
Sbjct: 420  LWSGPELNPKKAKITWTSLCKLKQEGGLGIKSLLEANKVSCLKLIWRLVSRQSSLWVNWV 479

Query: 722  HSQYIRSDTYWVSQAHMS-SSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGV 898
             +  IR  ++W +    S  SW+ K+L+K R   ++     I     T  W D WS LG 
Sbjct: 480  WTYIIRKGSFWSANDRSSLGSWMWKKLLKYRDVAKSMCKVEIKSGSSTSFWYDNWSQLGQ 539

Query: 899  IARDCGKNARRGSD--LRRDATIDDLSKCSSLSPIVVELKDKLNE-----VQRISGDHAD 1057
            +      NARR  D  +   AT+  +            + +K+       +QR      D
Sbjct: 540  LVD--VTNARRTIDMGIPLAATVATVLASHRTKHHRTAIYNKIEAEIQSILQRERSGAPD 597

Query: 1058 RLIWRLEPSG---EFSMKSTWEFIQRKKHTFR-WVNLIWYPHHIPRHSLTLWKLANQRLK 1225
              +WR         F  K TW  I R  HT R W   +W+ ++ P++S  LW   + RL 
Sbjct: 598  IFLWRSSGDNFRQSFITKVTWHNI-RVIHTHRQWYKGVWFSYNTPKYSFLLWLAIHDRLS 656

Query: 1226 IADRLPKLNIGVDA---------------NCCLCWNALETNNHLFFLMHFCSDYMKPIAS 1360
              DR+ K N G                   C  C N +      F+L  F S   KPI+ 
Sbjct: 657  TGDRIKKWNSGQQTFSTPLSIFTLKFLRNRCIFCNNMISK----FYLTIFDS-LSKPIS- 710

Query: 1361 MLFVDGRRHLFWETLGSKMSSYDFAGTKLNTSVKLSF 1471
                      F + L +K     F  + +NT   L +
Sbjct: 711  ----------FIDCLTNKSHKLSFTESSINTICPLEY 737



 Score = 39.3 bits (90), Expect(2) = 6e-35
 Identities = 25/95 (26%), Positives = 50/95 (52%), Gaps = 4/95 (4%)
 Frame = +3

Query: 108 LTYIIFADDVLVFIYLRVSYLEEFNKLLRDIERKTGLAMNPEKSRIVASWV----RGHLF 275
           LT++ FADD++VFI  +   +E    + ++   K+GL ++ EKS +  + V    R ++ 
Sbjct: 271 LTHLCFADDLMVFIDGQQRSVEGVINIFKEFAGKSGLHISLEKSTLYLAGVSELNRNNIL 330

Query: 276 W**VPLGIQKPSFPLKYLGLPIVSRKRFVNECLPI 380
                        P++YLGLP+++++    +  P+
Sbjct: 331 ---SAFPFASGQLPVRYLGLPLLTKQMTTADYSPL 362


>ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobroma cacao]
            gi|508715063|gb|EOY06960.1| Uncharacterized protein
            TCM_021522 [Theobroma cacao]
          Length = 3503

 Score =  124 bits (312), Expect(2) = 2e-30
 Identities = 132/522 (25%), Positives = 226/522 (43%), Gaps = 22/522 (4%)
 Frame = +2

Query: 380  ILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKFLN 559
            ++ K+   + G +    S   R+ LL+  + +L +Y  +    P  +L++   L N FL 
Sbjct: 2902 LVAKIEERITGWENKILSPGGRITLLRSTLSSLPIYLLQVLKPPIIVLERINRLFNNFLW 2961

Query: 560  FNS----RIISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWIHS 727
              S    RI   SW  +  P  EGGL +R L++V  A  +KL WW     + LW++++ +
Sbjct: 2962 GGSASSKRIHWASWGKIALPIAEGGLDIRNLEDVFKAFSMKL-WWRFRTTNSLWMQFMRA 3020

Query: 728  QYIRSD--TYWVSQAHMSSSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPW---SPL 892
            +Y      T+   + H S +W  KR++ I    E N+ + +G       W D W    PL
Sbjct: 3021 KYCGGQLPTHVQPKLHDSQTW--KRMVTISSITEQNIRWRVGHGKLF-FWHDCWMGEEPL 3077

Query: 893  GVIARDCGKNARRGSDLRRDATIDDLSKCSSLSPIVVELKDKLNEVQRISGDHADRLIWR 1072
             +  ++   +  + SD   + + D     S L   VVE   K+     I+    DR  W 
Sbjct: 3078 VIRNQEFASSMAQVSDFFLNNSWDIEKLKSVLQQEVVEEIAKIP----INASSNDRAYWT 3133

Query: 1073 LEPSGEFSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKIADRLPKLN 1252
              P+G+FS KS W+  + +K      N IW+       S  LW+L +  + +  ++    
Sbjct: 3134 PTPNGDFSTKSAWQLSRERKVVNPTYNYIWHKSVPLTTSFFLWRLLHDWVPVELKMKSKG 3193

Query: 1253 IGVDANCCLCWNALETNNHLFFLMHFCSDYMKPIASMLFVDGRR----HLFWE-TLGSKM 1417
              + A+ C C  + E+      LMH   D   P+A+ ++    +    H+    T+   +
Sbjct: 3194 FQL-ASRCRCCKSEES------LMHVMWD--NPVANQVWSYFAKVFQIHIINPCTINHII 3244

Query: 1418 SSYDFAGT-----KLNTSVKLSFAATIHQIWWERNCRRIQNKLRPVEQIVEAIIFYVRNI 1582
            S++ ++G       + T V L     +  +W ERN  + +N      +IV  I+  +  +
Sbjct: 3245 SAWFYSGDYSKPGHIRTLVPLFI---LWFLWVERNDAKHRNLGMYPNRIVWKILKLIHQL 3301

Query: 1583 ISNVSQDSVCSKEALYMARQWQINLTWKAKS-FFLISWSPPHYGWICLNVDAS--YSQFR 1753
                       +    +A++W I L   A S   L+ W+ P  G   LNVD S  Y+   
Sbjct: 3302 FQGKQLQKWQWQGDKQIAQEWGIILKAVAPSPPKLLFWNKPSIGEFKLNVDGSSKYNLQT 3361

Query: 1754 LGFGGLLRDHLGTPLVAFAGAQDPSSVILAEITDMLEGVQAC 1879
               GGLLRDH G+ +  F+        + AE+  +  G+  C
Sbjct: 3362 AAGGGLLRDHTGSMIFGFSENFGSQDSLQAELMALHRGLLLC 3403



 Score = 37.0 bits (84), Expect(2) = 2e-30
 Identities = 25/83 (30%), Positives = 41/83 (49%), Gaps = 5/83 (6%)
 Frame = +3

Query: 108  LTYIIFADDVLVFIYLRVSYLEEFNKLLRDIERKTGLAMNPEKSRIV-----ASWVRGHL 272
            ++++ FADDV++F     S L+     L++ E  +G  +NP+KS +V     AS  R  +
Sbjct: 2810 ISHLAFADDVIIFANGSKSALQRILAFLQEYEELSGQRINPQKSCVVTHTNMASSRRQII 2869

Query: 273  FW**VPLGIQKPSFPLKYLGLPI 341
                   G      P+ YLG P+
Sbjct: 2870 L---QATGFSHRPLPITYLGAPL 2889



 Score =  117 bits (293), Expect = 2e-23
 Identities = 127/521 (24%), Positives = 228/521 (43%), Gaps = 21/521 (4%)
 Frame = +2

Query: 380  ILDKVHHHLHG*KASCFSTARRMVLLKHVMQALHLYWEKCFAIPSSILKKAESLMNKFLN 559
            ++ K+   + G +    S   R+ LL+ V+ +  +Y  +    P ++++K E L N FL 
Sbjct: 1108 LISKIRDRISGWENKILSPGGRITLLRSVLSSQPMYLLQVLKPPVTVIEKIERLFNSFLW 1167

Query: 560  FNS----RIISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWIHS 727
             +S    ++   +W  +  P  EGGL +R L++V  A  LKL WW     + LW +++ +
Sbjct: 1168 GDSCDGKKLHWTAWSKITFPVSEGGLDIRNLRDVFEAFSLKL-WWRFQTCNSLWTRFLRT 1226

Query: 728  QYIRSDTYWVSQAHMSSSWIQKRLMKIRRDLEANVSYLIGK-DLYTKVWLDPW---SPLG 895
            +Y       + Q  +  S + KR++  R     N+ + IGK +L+   W D W    PL 
Sbjct: 1227 KYCLGRIPHLVQPKLHDSQVWKRMIVGRDVALQNIRWRIGKGELF--FWHDCWMGDQPLA 1284

Query: 896  VIARDCGKNARRGSDLRRDATID--DLSKCSSLSPIVVELKDKLNEVQRISGDHA--DRL 1063
             +      +    S + +    D  D+ K +S  P  +     ++E+ +I  D +  D  
Sbjct: 1285 TLFPSFHNDM---SHVHKFYNGDEWDIVKLNSYLPTSL-----VDEILQIPFDRSQEDVA 1336

Query: 1064 IWRLEPSGEFSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKIADRLP 1243
             W L  +GEFS  S WE I++++     ++  W+       S  LW++ N  + +  R+ 
Sbjct: 1337 YWALTSNGEFSFWSAWEIIRQRQTPNALLSFNWHRSIPLSISFFLWRVLNNWIPVELRMK 1396

Query: 1244 KLNIGVDANCCLCWNALETNNHLFFLMHFCSDYMKPIAS--MLFVDGRRHLFWETLGSKM 1417
               I + A+ C+C  + E+  H+ +            A    ++V   +H+  + + +  
Sbjct: 1397 DKGIHL-ASKCVCCRSEESLIHVLWENPVAKQVWNFFAKSFQIYVSKPKHIS-QIIWAWF 1454

Query: 1418 SSYDFAGTKLNTSVKLSFAATI-HQIWWERNCRRIQNKLRPVEQIVEAIIFYVRNIISNV 1594
             S D+     N  +++     I   +W ERN      K R +      +I+ +  +++ +
Sbjct: 1455 FSGDYT---RNGHIRILIPLFICWFLWLERN----DAKHRHMGMYPNRVIWRIMKLLNQL 1507

Query: 1595 SQDSVCS----KEALYMARQWQINLTWK-AKSFFLISWSPPHYGWICLNVD-ASYSQFRL 1756
               S+      K    +A  W      K  +S  +ISW  P  G   LNVD +S S    
Sbjct: 1508 HAGSLLKQWQWKGDTDIATMWGFKYPPKYCQSPQIISWIKPFIGEYKLNVDGSSKSSQNA 1567

Query: 1757 GFGGLLRDHLGTPLVAFAGAQDPSSVILAEITDMLEGVQAC 1879
              GG+LRDH G    AF+    P   + AE+  +L G+  C
Sbjct: 1568 AGGGVLRDHTGKLAFAFSENLGPLPSLQAELHALLRGLLLC 1608


>dbj|BAB08692.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana] gi|93007380|gb|ABE97193.1| hypothetical protein
            At5g13655 [Arabidopsis thaliana]
          Length = 385

 Score =  139 bits (350), Expect = 5e-30
 Identities = 95/358 (26%), Positives = 161/358 (44%), Gaps = 12/358 (3%)
 Frame = +2

Query: 557  NFNSRIISMSWENVCKPREEGGLGLRRLKEVSLAAGLKLAWWVASKKDLLWIKWIHSQYI 736
            + N+R   ++W  VC P+ EGGLGLR ++E +    LKL W + S K  LW+ W+    +
Sbjct: 11   SLNARKTKVAWSVVCTPKSEGGLGLRAVEETNKVCMLKLIWRILSAKGSLWVDWVKKHLL 70

Query: 737  RSDTYW-VSQAHMSSSWIQKRLMKIRRDLEANVSYLIGKDLYTKVWLDPWSPLGVIARDC 913
            R  + W V +     SWI K+L+K R   +      +     T  W D WS LG +    
Sbjct: 71   RGGSLWAVKETSSRGSWIWKKLLKYRDKAKCFHKVDVRNGESTSFWYDSWSSLGCLYDKF 130

Query: 914  GKNARRGSDLRRDATIDD----LSKCSSLSPIV--VELKDKLNEVQRISGDHADRLIWRL 1075
            G+       + +D+T+        +     P++  VE + +  +  RI  +  D  +W+ 
Sbjct: 131  GERGCIDMGIPKDSTLSSAIMTTRRRKHRQPLLNAVETEIQKQKQSRIVTER-DVALWKG 189

Query: 1076 EPSG---EFSMKSTWEFIQRKKHTFRWVNLIWYPHHIPRHSLTLWKLANQRLKIADRLPK 1246
            +  G    F  K TW  I+  +   +    IW+ +  P+++L  W +   R+   +++  
Sbjct: 190  KEDGFHPTFLSKETWSQIRNTQPEMQGYRGIWFSNATPKYALLTWLMVRNRIATGEKMGL 249

Query: 1247 LNIGVDANCCLCWNALETNNHLFFLMHFCSDYMKPIASMLFVDGRRHLFWETLGSKMSSY 1426
             N   D +C  C N  ET  HLFF   +       +   L +D +    W+ +   ++  
Sbjct: 250  WNQNTDTSCIFCKNPNETREHLFFQCVYTRKVWNGLIKGLLLD-KYSDRWQDIILMLTRK 308

Query: 1427 DFAGTKLNTSVKLSFAA--TIHQIWWERNCRRIQNKLRPVEQIVEAIIFYVRNIISNV 1594
            DF  TK   S  L +    +IH IW ER+ RR        E++++ I   +RN +S +
Sbjct: 309  DFDTTK---SFILGYVLQNSIHSIWRERDDRRHGEDPSNEERLIKFIDKNIRNRLSTL 363


Top