BLASTX nr result

ID: Catharanthus23_contig00011222 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00011222
         (3671 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...   267   3e-89
gb|AAF99785.1|AC012463_2 T2E6.4 [Arabidopsis thaliana]                268   3e-79
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...   261   6e-79
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]           261   6e-79
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   244   4e-78
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       256   7e-78
gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]               253   2e-77
gb|AAC33226.1| putative non-LTR retroelement reverse transcripta...   257   5e-77
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...   252   1e-76
emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...   248   3e-76
ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298...   259   3e-76
emb|CAB72467.1| putative protein [Arabidopsis thaliana]               251   9e-76
gb|AAD15471.1| putative non-LTR retroelement reverse transcripta...   261   5e-74
gb|AAC63678.1| putative non-LTR retroelement reverse transcripta...   248   2e-73
dbj|BAB01845.1| non-LTR retroelement reverse transcriptase-like ...   234   4e-70
gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcrip...   234   2e-69
gb|AAC95175.1| putative non-LTR retroelement reverse transcripta...   236   2e-69
emb|CAA66812.1| non-ltr retrotransposon reverse transcriptase-li...   231   2e-69
gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm,...   237   2e-67
gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00...   214   9e-65

>emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1110

 Score =  267 bits (682), Expect(4) = 3e-89
 Identities = 135/359 (37%), Positives = 208/359 (57%), Gaps = 5/359 (1%)
 Frame = +3

Query: 2142 PISCCNVFYKVIVELLGSRLGETLPFFIGKVQ*AFVKGRSMVENIILAQVLMWGYTRKRT 2321
            PI+CC V YK+I ++L +R+   +   + + Q  F+ GR + +NI+LA  L+ GYTRK  
Sbjct: 516  PIACCTVIYKIISKMLTNRMKGIIGEVVNEAQSGFIPGRHIADNILLASELIRGYTRKHM 575

Query: 2322 SPKCTLKIDLRKAYYTISWEFLQYVLLALNFLKKLIRWVMACVTTPSFSLRVNGENFGFL 2501
            SP+C +K+D+RKAY ++ W FL+ +L    F  + + W+M CV+T S+S+ VNG      
Sbjct: 576  SPRCIMKVDIRKAYDSVEWSFLETLLYEFGFPSRFVGWIMECVSTVSYSVLVNGIPTQPF 635

Query: 2502 ESPRELRQGDPLCPFLFMICMKYLLRSLRKTTS*DNFNYHPKCEKLKICLLVFVDDLMIS 2681
            ++ + LRQGDP+ PFLF +CM+YL R L +     +FN+HPKCE+L I  L+F DDL++ 
Sbjct: 636  QARKGLRQGDPMSPFLFALCMEYLSRCLEELKGSPDFNFHPKCERLNITHLMFADDLLMF 695

Query: 2682 TRGDNPFVRIVCDVLRNFVHASSLKANCLKSNILLAGMDDLEKSRISTLTGFSHGSMPFR 2861
             R D   +  +    + F HAS L A+  KSNI   G+DD     ++       G +PFR
Sbjct: 696  CRADKSSLDHMNVAFQKFSHASGLAASHEKSNIYFCGVDDETARELADYVHMQLGELPFR 755

Query: 2862 YLAIPSVGVYLNVADYGPLVDKVSNTIQGWA*LHLSYAGRLDVIRTMVQGIESFLLGILH 3041
            YL +P     L  A   PLV+ ++N  Q W    LSYAGRL +I++++  ++++   I  
Sbjct: 756  YLGVPLTSKKLTYAQCKPLVEMITNRAQTWMAKLLSYAGRLQLIKSILSSMQNYWAHIFP 815

Query: 3042 ISAAV*DRIISLCRRFLWDG-----KQSRVTWKTLYLYKVHSGLGIRDTRCWNVSYYLK 3203
            +S  V   +  +CR+FLW G     K++ V W T+   K   G  + + + WN +  LK
Sbjct: 816  LSKKVIQAVEKVCRKFLWTGKTEETKKAPVAWATIQRPKSRGGWNVINMKYWNRAAMLK 874



 Score = 61.6 bits (148), Expect(4) = 3e-89
 Identities = 37/127 (29%), Positives = 66/127 (51%), Gaps = 1/127 (0%)
 Frame = +2

Query: 1676 SAQLFHSLANTNAKKHFVAALIKEDGTITTTSFGDI*EQFLCLYKELLGTREE-IISIDE 1852
            +++LF +        + +  L  EDG +   +  ++ E+ L  YK+LLGTR   ++ +D 
Sbjct: 359  NSKLFFTAVKARHAINRIDMLNTEDGRVIQDA-DEVQEEILEFYKKLLGTRASTLMGVDL 417

Query: 1853 TIVDMGLKISTSQADFFVHEFSINDIRTALFDIEDDKSPGPDGYSSTFFKKAWNIVGTNS 2032
              V  G  +S    +  + E +  +I  AL  I +DK+PG DG+++ FFKK+W  +    
Sbjct: 418  NTVRGGKCLSAQAKESLIREVASTEIDEALAGIGNDKAPGLDGFNAYFFKKSWGSIKQEI 477

Query: 2033 SSALMEF 2053
             + + EF
Sbjct: 478  YAGIQEF 484



 Score = 37.0 bits (84), Expect(4) = 3e-89
 Identities = 17/51 (33%), Positives = 26/51 (50%)
 Frame = +2

Query: 3200 KVLWNIHTKKDSLWYRWIDHVDMKNSTILRPKHKEGFPLFYQKVIEYQDNL 3352
            K+LW I  K+D LW RWI    +K   IL            +K+++ +D+L
Sbjct: 874  KLLWAIEFKRDKLWVRWIHSYYIKRQDILTVNISNQTTWILRKIVKARDHL 924



 Score = 36.2 bits (82), Expect(4) = 3e-89
 Identities = 32/110 (29%), Positives = 45/110 (40%), Gaps = 5/110 (4%)
 Frame = +1

Query: 3352 DYNGGFGNWHCF*A*CLAYEKFLCTTADYGYFSTKCHRKLWAKIAWNTISPPKFSFTLWQ 3531
            D+    G+W      C+  +KF    A Y   S    R  W ++  N  + PK  F LW 
Sbjct: 922  DHLSNIGDWDEI---CIG-DKFSMKKA-YKKISENGERVRWRRLICNNYATPKSKFILWM 976

Query: 3532 AVLGR-PTMDRLYFQNVDKGCKWGKRRG----LITLIFYCSLSKQVWKQI 3666
             +  R PT+DR+    V     +   R     +  L F CS S  VW +I
Sbjct: 977  MLHERLPTVDRISRWGVQCDLNYRLCRNDGETIQHLFFSCSYSAGVWSKI 1026


>gb|AAF99785.1|AC012463_2 T2E6.4 [Arabidopsis thaliana]
          Length = 740

 Score =  268 bits (685), Expect(3) = 3e-79
 Identities = 139/347 (40%), Positives = 202/347 (58%), Gaps = 5/347 (1%)
 Frame = +3

Query: 2142 PISCCNVFYKVIVELLGSRLGETLPFFIGKVQ*AFVKGRSMVENIILAQVLMWGYTRKRT 2321
            PISCCNV YKVI +++ +RL   LP FI + Q AFV+ R ++EN++LA  L+  Y +   
Sbjct: 104  PISCCNVIYKVISKIIANRLKVMLPTFILQNQSAFVRERLLIENVLLATELVKDYHKDSI 163

Query: 2322 SPKCTLKIDLRKAYYTISWEFLQYVLLALNFLKKLIRWVMACVTTPSFSLRVNGENFGFL 2501
            SP+C +KID+ KA+ ++ W+FL   L ALNF +    W+  C++T +FS++VNGE  GF 
Sbjct: 164  SPRCAMKIDISKAFDSVQWQFLLNTLEALNFPENFCHWIKLCISTATFSVQVNGELAGFF 223

Query: 2502 ESPRELRQGDPLCPFLFMICMKYLLRSLRKTTS*DNFNYHPKCEKLKICLLVFVDDLMIS 2681
             S R LRQG  L P+LF+ICM  L   +       N  YHPKC+KL +  L F DDLM+ 
Sbjct: 224  GSKRGLRQGCALSPYLFVICMNVLSHMIDVAAVHRNIGYHPKCKKLSLTHLCFADDLMVF 283

Query: 2682 TRGDNPFVRIVCDVLRNFVHASSLKANCLKSNILLAGMDDLEKSRISTLTGFSHGSMPFR 2861
              G    V  V ++ + F   S L  +  KS + LAG+ +L ++ I +   F+ G +P R
Sbjct: 284  IDGQQRSVEGVINIFKEFAGKSGLHISLEKSTLYLAGVSELNRNNILSAFPFASGQLPVR 343

Query: 2862 YLAIPSVGVYLNVADYGPLVDKVSNTIQGWA*LHLSYAGRLDVIRTMVQGIESFLLGILH 3041
            YL +P +   +  ADY PL+DKV + I  W    LSYAGRL +I +++  + +F +    
Sbjct: 344  YLGLPLLTKQMTTADYSPLLDKVRSKISSWTARSLSYAGRLALINSVIVSLSNFWMSAYR 403

Query: 3042 ISAAV*DRIISLCRRFLWDG-----KQSRVTWKTLYLYKVHSGLGIR 3167
            + A     I  LC  FLW G     K++++TW +L   K   GLGI+
Sbjct: 404  LPAGCIKEIEKLCSAFLWSGPELNPKKAKITWTSLCKLKQEGGLGIK 450



 Score = 50.8 bits (120), Expect(3) = 3e-79
 Identities = 23/72 (31%), Positives = 39/72 (54%)
 Frame = +2

Query: 1838 ISIDETIVDMGLKISTSQADFFVHEFSINDIRTALFDIEDDKSPGPDGYSSTFFKKAWNI 2017
            ++++E    M  + S +  D    E +  + +  LF +  +K PGPDGY+S FFK  W+I
Sbjct: 1    MTVEELQNLMSFRCSATDQDMLTREVTSEENQKVLFAMPSNKFPGPDGYTSEFFKATWSI 60

Query: 2018 VGTNSSSALMEF 2053
             G +  +A+  F
Sbjct: 61   TGQDFIAAIKSF 72



 Score = 28.1 bits (61), Expect(3) = 3e-79
 Identities = 14/70 (20%), Positives = 33/70 (47%), Gaps = 1/70 (1%)
 Frame = +2

Query: 3200 KVLWNIHTKKDSLWYRWI-DHVDMKNSTILRPKHKEGFPLFYQKVIEYQDNLITMEGLAT 3376
            K++W + +++ SLW  W+  ++  K S              ++K+++Y+D   +M  +  
Sbjct: 462  KLIWRLVSRQSSLWVNWVWTYIIRKGSFWSANDRSSLGSWMWKKLLKYRDVAKSMCKVEI 521

Query: 3377 GTASRLDVWH 3406
             + S    W+
Sbjct: 522  KSGSSTSFWY 531


>dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis
            thaliana]
          Length = 1072

 Score =  261 bits (668), Expect(2) = 6e-79
 Identities = 145/367 (39%), Positives = 207/367 (56%), Gaps = 5/367 (1%)
 Frame = +3

Query: 2142 PISCCNVFYKVIVELLGSRLGETLPFFIGKVQ*AFVKGRSMVENIILAQVLMWGYTRKRT 2321
            PISC N  YKVI +LL SRL   L   IG  Q AF+ GRS+ EN++LA  ++ GY R   
Sbjct: 385  PISCLNTLYKVISKLLTSRLQGLLSAVIGHSQSAFLPGRSLAENVLLATEMVHGYNRLNI 444

Query: 2322 SPKCTLKIDLRKAYYTISWEFLQYVLLALNFLKKLIRWVMACVTTPSFSLRVNGENFGFL 2501
            SP+  LK+DL+KA+ ++ WEF+   L AL   ++ I W+  C+TTPSF++ VNG   GF 
Sbjct: 445  SPRGMLKVDLKKAFDSVKWEFVTAALRALAIPERYINWIHQCITTPSFTISVNGATGGFF 504

Query: 2502 ESPRELRQGDPLCPFLFMICMKYLLRSLRKTTS*DNFNYHPKCEKLKICLLVFVDDLMIS 2681
             S + LRQGDPL P+LF++ M+   + L         +YHPK   L I  L+F DD+MI 
Sbjct: 505  RSTKGLRQGDPLSPYLFVLAMEVFSKLLYSRYDSGYIHYHPKAGDLSISHLMFADDVMIF 564

Query: 2682 TRGDNPFVRIVCDVLRNFVHASSLKANCLKSNILLAGMDDLEKSRISTLTGFSHGSMPFR 2861
              G +  +  +C+ L +F   S LK N  KS +  AG+ DL +   S   GF  G+ P R
Sbjct: 565  FDGGSSSMHGICETLDDFADWSGLKVNKDKSQLFQAGL-DLSERITSAAYGFPAGTFPIR 623

Query: 2862 YLAIPSVGVYLNVADYGPLVDKVSNTIQGWA*LHLSYAGRLDVIRTMVQGIESFLLGILH 3041
            YL +P +   L +ADYGPL++K+S  ++ W    LS+AGR  +I +++ G+ +F +    
Sbjct: 624  YLGLPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGRTQLISSVIFGLINFWMSTFL 683

Query: 3042 ISAAV*DRIISLCRRFLWDG-----KQSRVTWKTLYLYKVHSGLGIRDTRCWNVSYYLKC 3206
            +      +I SLC +FLW G     K S+V+W    L K   GLG R    WN +  L+ 
Sbjct: 684  LPKGCIKKIESLCSKFLWAGSIDGRKSSKVSWVDCCLPKSEGGLGFRSFGEWNKTLLLRL 743

Query: 3207 YGIFIQR 3227
              +   R
Sbjct: 744  IWVLFDR 750



 Score = 63.2 bits (152), Expect(2) = 6e-79
 Identities = 38/143 (26%), Positives = 75/143 (52%), Gaps = 5/143 (3%)
 Frame = +2

Query: 1688 FHSLANTNAKKHFVAALIKEDGTITTTSFGDI*EQFLCLYKELLGTREEIISIDETIVDM 1867
            FH + ++    + + +L+  +G +  +  G I +  +  Y+ LLG+ E   S+++  +++
Sbjct: 231  FHRMVDSRKSFNTINSLVDSNGLLIDSQQG-ILDHCVTYYERLLGSIESPFSMEQEDMNL 289

Query: 1868 GL--KISTSQADFFVHEFSINDIRTALFDIEDDKSPGPDGYSSTFFKKAWNIVGTNSSSA 2041
             L  + S  Q       F+ ++I+ A   +  +K+ GPDGYS  FF+  W+I+G    +A
Sbjct: 290  LLTYRCSQDQCSELEKSFTDDEIKAAFKSLPRNKTSGPDGYSVEFFRDTWSIIGPEVLAA 349

Query: 2042 LMEFL---RLVCCWNKLIMLLCP 2101
            + EF    +L+  WN   ++L P
Sbjct: 350  IHEFFDSGQLLKQWNATTLVLIP 372


>dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1072

 Score =  261 bits (668), Expect(2) = 6e-79
 Identities = 145/367 (39%), Positives = 207/367 (56%), Gaps = 5/367 (1%)
 Frame = +3

Query: 2142 PISCCNVFYKVIVELLGSRLGETLPFFIGKVQ*AFVKGRSMVENIILAQVLMWGYTRKRT 2321
            PISC N  YKVI +LL SRL   L   IG  Q AF+ GRS+ EN++LA  ++ GY R   
Sbjct: 385  PISCLNTLYKVISKLLTSRLQGLLSAVIGHSQSAFLPGRSLAENVLLATEMVHGYNRLNI 444

Query: 2322 SPKCTLKIDLRKAYYTISWEFLQYVLLALNFLKKLIRWVMACVTTPSFSLRVNGENFGFL 2501
            SP+  LK+DL+KA+ ++ WEF+   L AL   ++ I W+  C+TTPSF++ VNG   GF 
Sbjct: 445  SPRGMLKVDLKKAFDSVKWEFVTAALRALAIPERYINWIHQCITTPSFTISVNGATGGFF 504

Query: 2502 ESPRELRQGDPLCPFLFMICMKYLLRSLRKTTS*DNFNYHPKCEKLKICLLVFVDDLMIS 2681
             S + LRQGDPL P+LF++ M+   + L         +YHPK   L I  L+F DD+MI 
Sbjct: 505  RSTKGLRQGDPLSPYLFVLAMEVFSKLLYSRYDSGYIHYHPKAGDLSISHLMFADDVMIF 564

Query: 2682 TRGDNPFVRIVCDVLRNFVHASSLKANCLKSNILLAGMDDLEKSRISTLTGFSHGSMPFR 2861
              G +  +  +C+ L +F   S LK N  KS +  AG+ DL +   S   GF  G+ P R
Sbjct: 565  FDGGSSSMHGICETLDDFADWSGLKVNKDKSQLFQAGL-DLSERITSAAYGFPAGTFPIR 623

Query: 2862 YLAIPSVGVYLNVADYGPLVDKVSNTIQGWA*LHLSYAGRLDVIRTMVQGIESFLLGILH 3041
            YL +P +   L +ADYGPL++K+S  ++ W    LS+AGR  +I +++ G+ +F +    
Sbjct: 624  YLGLPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGRTQLISSVIFGLINFWMSTFL 683

Query: 3042 ISAAV*DRIISLCRRFLWDG-----KQSRVTWKTLYLYKVHSGLGIRDTRCWNVSYYLKC 3206
            +      +I SLC +FLW G     K S+V+W    L K   GLG R    WN +  L+ 
Sbjct: 684  LPKGCIKKIESLCSKFLWAGSIDGRKSSKVSWVDCCLPKSEGGLGFRSFGEWNKTLLLRL 743

Query: 3207 YGIFIQR 3227
              +   R
Sbjct: 744  IWVLFDR 750



 Score = 63.2 bits (152), Expect(2) = 6e-79
 Identities = 38/143 (26%), Positives = 75/143 (52%), Gaps = 5/143 (3%)
 Frame = +2

Query: 1688 FHSLANTNAKKHFVAALIKEDGTITTTSFGDI*EQFLCLYKELLGTREEIISIDETIVDM 1867
            FH + ++    + + +L+  +G +  +  G I +  +  Y+ LLG+ E   S+++  +++
Sbjct: 231  FHRMVDSRKSFNTINSLVDSNGLLIDSQQG-ILDHCVTYYERLLGSIESPFSMEQEDMNL 289

Query: 1868 GL--KISTSQADFFVHEFSINDIRTALFDIEDDKSPGPDGYSSTFFKKAWNIVGTNSSSA 2041
             L  + S  Q       F+ ++I+ A   +  +K+ GPDGYS  FF+  W+I+G    +A
Sbjct: 290  LLTYRCSQDQCSELEKSFTDDEIKAAFKSLPRNKTSGPDGYSVEFFRDTWSIIGPEVLAA 349

Query: 2042 LMEFL---RLVCCWNKLIMLLCP 2101
            + EF    +L+  WN   ++L P
Sbjct: 350  IHEFFDSGQLLKQWNATTLVLIP 372


>dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 1223

 Score =  244 bits (623), Expect(3) = 4e-78
 Identities = 130/350 (37%), Positives = 198/350 (56%), Gaps = 5/350 (1%)
 Frame = +3

Query: 2142 PISCCNVFYKVIVELLGSRLGETLPFFIGKVQ*AFVKGRSMVENIILAQVLMWGYTRKRT 2321
            PISCCNV YKVI +++ +RL   LP FI   Q AFVK R ++EN++LA  L+  Y +   
Sbjct: 531  PISCCNVLYKVISKIIANRLKLVLPKFIAGNQSAFVKDRLLIENLLLATELVKDYHKDTI 590

Query: 2322 SPKCTLKIDLRKAYYTISWEFLQYVLLALNFLKKLIRWVMACVTTPSFSLRVNGENFGFL 2501
            S +C +KID+ KA+ ++ W FL  V   L F ++ I W+  C+TT SFS++VNGE  G+ 
Sbjct: 591  STRCAIKIDISKAFDSVQWPFLINVFTILGFPREFIHWINICITTASFSVQVNGELAGYF 650

Query: 2502 ESPRELRQGDPLCPFLFMICMKYLLRSLRKTTS*DNFNYHPKCEKLKICLLVFVDDLMIS 2681
            +S R LRQG  L P+LF+ICM  L + L K  +  +F YHPKC+ + +  L F DDLM+ 
Sbjct: 651  QSSRGLRQGCALSPYLFVICMDVLSKMLDKAAAARHFGYHPKCKTMGLTHLSFADDLMVL 710

Query: 2682 TRGDNPFVRIVCDVLRNFVHASSLKANCLKSNILLAGMDDLEKSRISTLTGFSHGSMPFR 2861
            + G    +  +  V   F   S L+ +  KS + LAG+    ++ ++    FS G +P R
Sbjct: 711  SDGKIRSIERIIKVFDEFAKWSGLRISLEKSTVYLAGLSATARNEVADRFPFSSGQLPVR 770

Query: 2862 YLAIPSVGVYLNVADYGPLVDKVSNTIQGWA*LHLSYAGRLDVIRTMVQGIESFLLGILH 3041
            YL +P +   L+  D  PL+++V   I  W    LSYAGRL++I +++  I +F L    
Sbjct: 771  YLGLPLITKRLSTTDCLPLLEQVRKRIGSWTSRFLSYAGRLNLISSVLWSICNFWLAAFR 830

Query: 3042 ISAAV*DRIISLCRRFLWDG-----KQSRVTWKTLYLYKVHSGLGIRDTR 3176
            +       +  +C  FLW G      +++++W  +   K   GLG+R  +
Sbjct: 831  LPRKCIRELEKMCSAFLWSGTEMNSNKAKISWHMVCKPKDEGGLGLRSLK 880



 Score = 62.0 bits (149), Expect(3) = 4e-78
 Identities = 47/159 (29%), Positives = 75/159 (47%), Gaps = 6/159 (3%)
 Frame = +2

Query: 1595 WDPRLATVEVSKYTPQVMHTSITCKV--ISAQLFHSLANTNAKKHFVAALIKEDGTITTT 1768
            WD R+A +E  KY  Q       C+V   + + FH  A      + +  ++  DG + T 
Sbjct: 346  WD-RVAILE-EKYLKQKSKLH-WCQVGDQNTKAFHRAAAAREAHNTIREILSNDGIVKTK 402

Query: 1769 SFGDI*----EQFLCLYKELLGTREEIISIDETIVDMGLKISTSQADFFVHEFSINDIRT 1936
              GD      E+F   + +L+    E ++I E    + ++ S +     +   +  +IR 
Sbjct: 403  --GDEIKAEAERFFREFLQLIPNDFEGVTITELQQLLPVRCSDADQQSLIRPVTAEEIRK 460

Query: 1937 ALFDIEDDKSPGPDGYSSTFFKKAWNIVGTNSSSALMEF 2053
             LF +  DKSPGPDGY+S FFK  W I+G   + A+  F
Sbjct: 461  VLFRMPSDKSPGPDGYTSEFFKATWEIIGDEFTLAVQSF 499



 Score = 37.0 bits (84), Expect(3) = 4e-78
 Identities = 16/71 (22%), Positives = 38/71 (53%), Gaps = 2/71 (2%)
 Frame = +2

Query: 3200 KVLWNIHTKKDSLWYRWIDHVDMKNSTILRPKH--KEGFPLFYQKVIEYQDNLITMEGLA 3373
            K++W I +  +SLW +W+D   ++N++    K    +G    ++K+++Y++   T+  + 
Sbjct: 889  KLVWKIVSHSNSLWVKWVDQHLLRNASFWEVKQTVSQG-SWIWKKLLKYREVAKTLSKVE 947

Query: 3374 TGTASRLDVWH 3406
             G   +   W+
Sbjct: 948  VGNGKQTSFWY 958


>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score =  256 bits (654), Expect(2) = 7e-78
 Identities = 142/372 (38%), Positives = 212/372 (56%), Gaps = 6/372 (1%)
 Frame = +3

Query: 2142 PISCCNVFYKVIVELLGSRLGETLPFFIGKVQ*AFVKGRSMVENIILAQVLMWGYTRKRT 2321
            PISC N  YKVI  LL  RL   L   I   Q AF+ GRS+ EN++LA  L+ GY     
Sbjct: 525  PISCLNTLYKVIARLLTDRLQRLLSGVISSAQSAFLPGRSLAENVLLATDLVHGYNWSNI 584

Query: 2322 SPKCTLKIDLRKAYYTISWEFLQYVLLALNFLKKLIRWVMACVTTPSFSLRVNGENFGFL 2501
            SP+  LK+DL+KA+ ++ WEF+   L AL   +K I W+  C++TP+F++ +NG N GF 
Sbjct: 585  SPRGMLKVDLKKAFDSVRWEFVIAALRALAIPEKFINWISQCISTPTFTVSINGGNGGFF 644

Query: 2502 ESPRELRQGDPLCPFLFMICMKYLLRSLRKTTS*DNFNYHPKCEKLKICLLVFVDDLMIS 2681
            +S + LRQGDPL P+LF++ M+     L         +YHPK   L I  L+F DD+MI 
Sbjct: 645  KSTKGLRQGDPLSPYLFVLAMEAFSNLLHSRYESGLIHYHPKASNLSISHLMFADDVMIF 704

Query: 2682 TRGDNPFVRIVCDVLRNFVHASSLKANCLKSNILLAGMDDLEKSRISTLTGFSHGSMPFR 2861
              G +  +  +C+ L +F   S LK N  KS++ LAG++ LE S  +   GF  G++P R
Sbjct: 705  FDGGSFSLHGICETLDDFASWSGLKVNKDKSHLYLAGLNQLE-SNANAAYGFPIGTLPIR 763

Query: 2862 YLAIPSVGVYLNVADYGPLVDKVSNTIQGWA*LHLSYAGRLDVIRTMVQGIESFLLGILH 3041
            YL +P +   L +A+Y PL++K++   + W    LS+AGR+ +I +++ G  +F +    
Sbjct: 764  YLGLPLMNRKLRIAEYEPLLEKITARFRSWVNKCLSFAGRIQLISSVIFGSINFWMSTFL 823

Query: 3042 ISAAV*DRIISLCRRFLWDG-----KQSRVTWKTLYLYKVHSGLGIRDTRCWNVSYYLK- 3203
            +      RI SLC RFLW G     K  +V+W  L L K   GLG+R    WN +  ++ 
Sbjct: 824  LPKGCIKRIESLCSRFLWSGNIEQAKGIKVSWAALCLPKSEGGLGLRRLLEWNKTLSMRL 883

Query: 3204 CYGIFIQRRILY 3239
             + +F+ +  L+
Sbjct: 884  IWRLFVAKDSLW 895



 Score = 65.1 bits (157), Expect(2) = 7e-78
 Identities = 43/147 (29%), Positives = 73/147 (49%), Gaps = 5/147 (3%)
 Frame = +2

Query: 1676 SAQLFHSLANTNAKKHFVAALIKEDGTITTTSFGDI*EQFLCLYKELLGTREEIISIDET 1855
            + + FH +A+     + ++AL   +G +  +  G I +     +  LLG   +   +++ 
Sbjct: 367  NTKYFHRMADARNSSNSISALYDGNGKLVDSQEG-ILDLCASYFGSLLGDEVDPYLMEQN 425

Query: 1856 IVDMGLKISTSQADFFVHE--FSINDIRTALFDIEDDKSPGPDGYSSTFFKKAWNIVGTN 2029
             +++ L    S A     E  FS  DIR ALF +  +KS GPDG+++ FF  +W+IVG  
Sbjct: 426  DMNLLLSYRCSPAQVCELESTFSNEDIRAALFSLPRNKSCGPDGFTAEFFIDSWSIVGAE 485

Query: 2030 SSSALMEFLRLVCC---WNKLIMLLCP 2101
             + A+ EF    C    WN   ++L P
Sbjct: 486  VTDAIKEFFSSGCLLKQWNATTIVLIP 512


>gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]
          Length = 872

 Score =  253 bits (647), Expect(3) = 2e-77
 Identities = 131/350 (37%), Positives = 200/350 (57%), Gaps = 5/350 (1%)
 Frame = +3

Query: 2142 PISCCNVFYKVIVELLGSRLGETLPFFIGKVQ*AFVKGRSMVENIILAQVLMWGYTRKRT 2321
            PISCCNV YKVI +++ +RL   LP FI + Q AFVK R ++EN++LA  L+  Y +   
Sbjct: 178  PISCCNVLYKVISKIIANRLKLLLPRFIAENQSAFVKDRLLIENLLLATELVKDYHKDSI 237

Query: 2322 SPKCTLKIDLRKAYYTISWEFLQYVLLALNFLKKLIRWVMACVTTPSFSLRVNGENFGFL 2501
            S +C +KID+ KA+ ++ W FL   L+A+NF    I W+  C+TT SFS++VNG+  G+ 
Sbjct: 238  SARCAIKIDISKAFDSVQWSFLTNTLVAMNFSPTFIHWINLCITTASFSVQVNGDLVGYF 297

Query: 2502 ESPRELRQGDPLCPFLFMICMKYLLRSLRKTTS*DNFNYHPKCEKLKICLLVFVDDLMIS 2681
            +S R LRQG  L P+LF+ICM  L + L K      F +HPKC++L +  L F DDLM+ 
Sbjct: 298  QSKRGLRQGCSLSPYLFVICMDVLSKMLDKAAGVRKFGFHPKCQRLGLTHLSFADDLMVL 357

Query: 2682 TRGDNPFVRIVCDVLRNFVHASSLKANCLKSNILLAGMDDLEKSRISTLTGFSHGSMPFR 2861
            + G    +  + +V   F   S L+ +  KS + +AG+  + K  I+    F  G +P R
Sbjct: 358  SDGKTRSIEGILEVFDEFCKRSGLRISLEKSTLYMAGVSPIIKQEIAAKFLFDVGQLPVR 417

Query: 2862 YLAIPSVGVYLNVADYGPLVDKVSNTIQGWA*LHLSYAGRLDVIRTMVQGIESFLLGILH 3041
            YL +P V   L  ADY PL++++   I  W     S+AGR ++I++++  I +F L    
Sbjct: 418  YLGLPLVTKRLTSADYSPLLEQIKKRIATWTFRFFSFAGRFNLIKSVLWSICNFWLAAFR 477

Query: 3042 ISAAV*DRIISLCRRFLWDGKQ-----SRVTWKTLYLYKVHSGLGIRDTR 3176
            +       I  LC  FLW G +     ++++W  +   K   GLG+R+ +
Sbjct: 478  LPRQCIREIDKLCSSFLWSGSEMSSHKAKISWDIVCKPKAEGGLGLRNLK 527



 Score = 57.8 bits (138), Expect(3) = 2e-77
 Identities = 39/144 (27%), Positives = 66/144 (45%), Gaps = 6/144 (4%)
 Frame = +2

Query: 1688 FHSLANTNAKKHFVAALIKEDGTITTTSFGDI*EQFLCLYKELLGT-REEIISIDETIVD 1864
            FH        K+ +  +   DG +      DI  +    +KE L    E+ + ++   + 
Sbjct: 24   FHRAVIERETKNMIKEIYCTDGRVVQGD--DIMVEAEKFFKEFLQLIPEDFVGVEVRELQ 81

Query: 1865 --MGLKISTSQADFFVHEFSINDIRTALFDIEDDKSPGPDGYSSTFFKKAWNIVGTNSSS 2038
              +  + + S  +    E S  +I+T LF +  DKSPGPDGY+S F+K  W+I+G   + 
Sbjct: 82   DLLQFRCTNSDNEMLTREVSSEEIKTVLFSMPKDKSPGPDGYTSEFYKATWDIIGQEFTL 141

Query: 2039 ALMEFLR---LVCCWNKLIMLLCP 2101
             +  F +   L    N +I+ L P
Sbjct: 142  PVQSFFQKGFLPKGINSIILALIP 165



 Score = 30.0 bits (66), Expect(3) = 2e-77
 Identities = 13/70 (18%), Positives = 31/70 (44%), Gaps = 1/70 (1%)
 Frame = +2

Query: 3200 KVLWNIHTKKDSLWYRWIDHVDMKNSTILRPKHKEGF-PLFYQKVIEYQDNLITMEGLAT 3376
            K++W I +  +SLW +W+    ++  +I   K         ++K+++ +D   +   +  
Sbjct: 536  KLVWRIISNSNSLWTKWVAEYLIRKKSIWSLKQSTSMGSWIWRKILKIRDVAKSFSRVEV 595

Query: 3377 GTASRLDVWH 3406
            G       W+
Sbjct: 596  GNGESASFWY 605


>gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1529

 Score =  257 bits (657), Expect(3) = 5e-77
 Identities = 135/347 (38%), Positives = 201/347 (57%), Gaps = 5/347 (1%)
 Frame = +3

Query: 2142 PISCCNVFYKVIVELLGSRLGETLPFFIGKVQ*AFVKGRSMVENIILAQVLMWGYTRKRT 2321
            PISCCNV YKVI ++L +RL   LP FI + Q AFVK R ++EN++LA  L+  Y ++  
Sbjct: 828  PISCCNVLYKVISKILANRLKLLLPSFILQNQSAFVKERLLMENVLLATELVKDYHKESV 887

Query: 2322 SPKCTLKIDLRKAYYTISWEFLQYVLLALNFLKKLIRWVMACVTTPSFSLRVNGENFGFL 2501
            +P+C +KID+ KA+ ++ W+FL   L ALNF +    W+  C++T +FS++VNGE  GF 
Sbjct: 888  TPRCAMKIDISKAFDSVQWQFLLNTLEALNFPETFRHWIKLCISTATFSVQVNGELAGFF 947

Query: 2502 ESPRELRQGDPLCPFLFMICMKYLLRSLRKTTS*DNFNYHPKCEKLKICLLVFVDDLMIS 2681
             S R LRQG  L P+LF+ICM  L   + +     N  YHPKCEK+ +  L F DDLM+ 
Sbjct: 948  GSSRGLRQGCALSPYLFVICMNVLSHMIDEAAVHRNIGYHPKCEKIGLTHLCFADDLMVF 1007

Query: 2682 TRGDNPFVRIVCDVLRNFVHASSLKANCLKSNILLAGMDDLEKSRISTLTGFSHGSMPFR 2861
              G    +  V +V + F   S L+ +  KS I LAG+   ++ +  +   F++G +P R
Sbjct: 1008 VDGHQWSIEGVINVFKEFAGRSGLQISLEKSTIYLAGVSASDRVQTLSSFPFANGQLPVR 1067

Query: 2862 YLAIPSVGVYLNVADYGPLVDKVSNTIQGWA*LHLSYAGRLDVIRTMVQGIESFLLGILH 3041
            YL +P +   +  ADY PL++ V   I  W    LSYAGRL ++ +++  I +F +    
Sbjct: 1068 YLGLPLLTKQMTTADYSPLIEAVKTKISSWTARSLSYAGRLALLNSVIVSIANFWMSAYR 1127

Query: 3042 ISAAV*DRIISLCRRFLWDG-----KQSRVTWKTLYLYKVHSGLGIR 3167
            + A     I  LC  FLW G     K++++ W ++   K   GLGI+
Sbjct: 1128 LPAGCIREIEKLCSAFLWSGPVLNPKKAKIAWSSICQPKKEGGLGIK 1174



 Score = 53.9 bits (128), Expect(3) = 5e-77
 Identities = 24/72 (33%), Positives = 42/72 (58%)
 Frame = +2

Query: 1838 ISIDETIVDMGLKISTSQADFFVHEFSINDIRTALFDIEDDKSPGPDGYSSTFFKKAWNI 2017
            IS+++    M  + S +  +    E +  +I+  LF + ++KSPGPDGY+S FFK  W++
Sbjct: 725  ISVEDLRNLMSYRCSVTDQNILTREVTGEEIQKVLFAMPNNKSPGPDGYTSEFFKATWSL 784

Query: 2018 VGTNSSSALMEF 2053
             G +  +A+  F
Sbjct: 785  TGPDFIAAIQSF 796



 Score = 28.5 bits (62), Expect(3) = 5e-77
 Identities = 13/70 (18%), Positives = 31/70 (44%), Gaps = 1/70 (1%)
 Frame = +2

Query: 3200 KVLWNIHTKKDSLWYRWIDHVDMKNSTILRPKHKEGF-PLFYQKVIEYQDNLITMEGLAT 3376
            K++W + + + SLW  WI    ++  T      +       ++K+++Y++   +M  +  
Sbjct: 1186 KLIWRLLSTQPSLWVTWIWTFIIRKGTFWSANERSSLGSWMWKKLLKYRELAKSMHKVEV 1245

Query: 3377 GTASRLDVWH 3406
               S    W+
Sbjct: 1246 RNGSSTSFWY 1255


>gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus]
          Length = 1214

 Score =  252 bits (643), Expect(2) = 1e-76
 Identities = 142/369 (38%), Positives = 205/369 (55%), Gaps = 6/369 (1%)
 Frame = +3

Query: 2142 PISCCNVFYKVIVELLGSRLGETLPFFIGKVQ*AFVKGRSMVENIILAQVLMWGYTRKRT 2321
            PISCCN  YKVI +LL  RL   LP +I   Q AFVKGR + EN++LA  L+ G+ +   
Sbjct: 524  PISCCNAIYKVISKLLARRLENILPLWISPSQSAFVKGRLLTENVLLATELVQGFGQANI 583

Query: 2322 SPKCTLKIDLRKAYYTISWEFLQYVLLALNFLKKLIRWVMACVTTPSFSLRVNGENFGFL 2501
            S +  LK+DLRKA+ ++ W F+   L A N   + + W+  C+T+ SFS+ V+G   G+ 
Sbjct: 584  SSRGVLKVDLRKAFDSVGWGFIIETLKAANAPPRFVNWIKQCITSTSFSINVSGSLCGYF 643

Query: 2502 ESPRELRQGDPLCPFLFMICMKYLLRSLRKTTS*DNFNYHPKCEKLKICLLVFVDDLMIS 2681
            +  + LRQGDPL P LF+I M+ L R L    S  +  YHPK  +++I  L F DDLMI 
Sbjct: 644  KGSKGLRQGDPLSPSLFVIAMEILSRLLENKFSDGSIGYHPKASEVRISSLAFADDLMIF 703

Query: 2682 TRGDNPFVRIVCDVLRNFVHASSLKANCLKSNILLAGMDDLEKSRISTLT-GFSHGSMPF 2858
              G    +R +  VL +F + S L+ N  KS +  AG++D +K    TL  GF +G+ PF
Sbjct: 704  YDGKASSLRGIKSVLESFKNLSGLEMNTEKSAVYTAGLEDTDKE--DTLAFGFVNGTFPF 761

Query: 2859 RYLAIPSVGVYLNVADYGPLVDKVSNTIQGWA*LHLSYAGRLDVIRTMVQGIESFLLGIL 3038
            RYL +P +   L  +DY  L+DK++     WA   LS+AGRL +I +++    +F L   
Sbjct: 762  RYLGLPLLHRKLRRSDYSQLIDKIAARFNHWATKTLSFAGRLQLISSVIYSTVNFWLSSF 821

Query: 3039 HISAAV*DRIISLCRRFLWDGKQSR-----VTWKTLYLYKVHSGLGIRDTRCWNVSYYLK 3203
             +       I  +C RFLW    +R     V+W+   L K   GLG+R+   WN +  L+
Sbjct: 822  ILPKCCLKTIEQMCNRFLWGNDITRRGDIKVSWQNSCLPKAEGGLGLRNFWTWNKTLNLR 881

Query: 3204 CYGIFIQRR 3230
               +   RR
Sbjct: 882  LIWMLFARR 890



 Score = 65.1 bits (157), Expect(2) = 1e-76
 Identities = 38/105 (36%), Positives = 55/105 (52%), Gaps = 6/105 (5%)
 Frame = +2

Query: 1805 YKELLGTREEIISIDETIVDMGL---KISTSQADFFVHEFSINDIRTALFDIEDDKSPGP 1975
            +KEL G+   +IS +       L   K   +       E S  DI++  F +  +KSPGP
Sbjct: 407  FKELFGSSSHLISAEGISQINSLTRFKCDENTRQLLEAEVSEADIKSEFFALPSNKSPGP 466

Query: 1976 DGYSSTFFKKAWNIVGTNSSSALMEFL---RLVCCWNKLIMLLCP 2101
            DGY+S FFKK W+IVG +  +A+ EF    RL+  WN   + + P
Sbjct: 467  DGYTSEFFKKTWSIVGPSLIAAVQEFFRSGRLLGQWNSTAVTMVP 511


>emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1114

 Score =  248 bits (634), Expect(2) = 3e-76
 Identities = 130/359 (36%), Positives = 202/359 (56%), Gaps = 5/359 (1%)
 Frame = +3

Query: 2142 PISCCNVFYKVIVELLGSRLGETLPFFIGKVQ*AFVKGRSMVENIILAQVLMWGYTRKRT 2321
            PI+CC+  YK+I ++L  RL   +   +   Q  F+  R + +NI+LA  L+ GY R+  
Sbjct: 519  PIACCSTLYKIISKILTKRLQAVITEVVDCAQTGFIPERHIGDNILLATELIRGYNRRHV 578

Query: 2322 SPKCTLKIDLRKAYYTISWEFLQYVLLALNFLKKLIRWVMACVTTPSFSLRVNGENFGFL 2501
            SP+C +K+D+RKAY ++ W FL+ +L  L F    IRW+MACV T S+S+ +NG      
Sbjct: 579  SPRCVIKVDIRKAYDSVEWVFLESMLKELGFPSMFIRWIMACVKTVSYSILLNGIPSIPF 638

Query: 2502 ESPRELRQGDPLCPFLFMICMKYLLRSLRKTTS*DNFNYHPKCEKLKICLLVFVDDLMIS 2681
            ++ + LRQGDPL PFLF + M+YL R +        FN+HPKCE++K+  L+F DDL++ 
Sbjct: 639  DAQKGLRQGDPLSPFLFALSMEYLSRCMGNMCKDPEFNFHPKCERIKLTHLMFADDLLMF 698

Query: 2682 TRGDNPFVRIVCDVLRNFVHASSLKANCLKSNILLAGMDDLEKSRISTLTGFSHGSMPFR 2861
             R D   +  +     +F  AS L+A+  KS I   G+   E  +++       GS+PFR
Sbjct: 699  ARADASSISKIMAAFNSFSKASGLQASIEKSCIYFGGVCHEEAEQLADRIQMPIGSLPFR 758

Query: 2862 YLAIPSVGVYLNVADYGPLVDKVSNTIQGWA*LHLSYAGRLDVIRTMVQGIESFLLGILH 3041
            YL +P     LN +   PL+DK++   QGW    LSYAGRL +++T++  ++++   I  
Sbjct: 759  YLGVPLASKKLNFSQCKPLIDKITTRAQGWVAHLLSYAGRLQLVKTILYSMQNYWGQIFP 818

Query: 3042 ISAAV*DRIISLCRRFLWDGK-----QSRVTWKTLYLYKVHSGLGIRDTRCWNVSYYLK 3203
            +   +   + + CR+FLW G      ++ V W  L   K   GL + +   WN +  LK
Sbjct: 819  LPKKLIKAVETTCRKFLWTGTVDTSYKAPVAWDFLQQPKSTGGLNVTNMVLWNKAAILK 877



 Score = 67.4 bits (163), Expect(2) = 3e-76
 Identities = 41/131 (31%), Positives = 68/131 (51%), Gaps = 11/131 (8%)
 Frame = +2

Query: 1694 SLANTNAKKHFVA----------ALIKEDGTITTTSFGDI*EQFLCLYKELLGTRE-EII 1840
            SL ++N+K  F A           L++ D     T   +I  +    Y+ LLGT   ++ 
Sbjct: 357  SLGDSNSKFFFTAIKVRKARNKIVLLQNDRGDQLTENTEIQNEICNFYRRLLGTSSSQLE 416

Query: 1841 SIDETIVDMGLKISTSQADFFVHEFSINDIRTALFDIEDDKSPGPDGYSSTFFKKAWNIV 2020
            +ID  +V +G K+S +     V   +I +I  AL DI+D K+PG DG++S FFKK+W ++
Sbjct: 417  AIDLHVVRVGAKLSATSCAQLVQPITIQEIDQALADIDDTKAPGLDGFNSVFFKKSWLVI 476

Query: 2021 GTNSSSALMEF 2053
                   +++F
Sbjct: 477  KQEIYEGILDF 487


>ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298394 [Fragaria vesca
            subsp. vesca]
          Length = 958

 Score =  259 bits (662), Expect(3) = 3e-76
 Identities = 142/354 (40%), Positives = 199/354 (56%), Gaps = 6/354 (1%)
 Frame = +3

Query: 2142 PISCCNVFYKVIVELLGSRLGETLPFFIGKVQ*AFVKGRSMVENIILAQVLMWGYTRKRT 2321
            PISCCN FYK+I +LL +RL  TL   +G  Q  F+ GR + +NI+LAQ ++  Y +   
Sbjct: 352  PISCCNTFYKIIAKLLANRLKGTLHLIVGPSQSTFIPGRRIGDNILLAQEIICDYHKADG 411

Query: 2322 SPKCTLKIDLRKAYYTISWEFLQYVLLALNFLKKLIRWVMACVTTPSFSLRVNGENFGFL 2501
             P+CT  +D+ KA  T+ W+F+   L A N    LI W+ +C+++  FS+ VNGE  GF 
Sbjct: 412  QPRCTFMVDMMKANDTVEWDFIIATLQAFNIPSTLIGWIKSCISSAKFSVCVNGELAGFF 471

Query: 2502 ESPRELRQGDPLCPFLFMICMKYL-LRSLRKTTS*DNFNYHPKCEKLKICLLVFVDDLMI 2678
               R LRQGDPL P+LF+I M+ L L   R+      F YH +C++L +  L F DDL++
Sbjct: 472  ARRRGLRQGDPLSPYLFVIAMEVLSLCIQRRINCSPCFRYHWRCDQLNLSHLCFADDLLM 531

Query: 2679 STRGDNPFVRIVCDVLRNFVHASSLKANCLKSNILLAGMDDLEKSRISTLTGFSHGSMPF 2858
               GD   VR + D   NF   SSLKAN  +S I LAG+D      +  +T FS G+ P 
Sbjct: 532  FCNGDENSVRTLHDAFSNFESLSSLKANVSESKIFLAGVDGNSSDSVLQVTNFSLGTCPV 591

Query: 2859 RYLAIPSVGVYLNVADYGPLVDKVSNTIQGWA*LHLSYAGRLDVIRTMVQGIESFLLGIL 3038
            RYL IP +   L + D  PL+D++   I+ W    LS+AGRL +I++++  I+ +    L
Sbjct: 592  RYLGIPLITSKLRMQDCSPLLDRIETRIKSWENKVLSFAGRLQLIQSVLSSIQVYWASHL 651

Query: 3039 HISAAV*DRIISLCRRFLWDGKQS-----RVTWKTLYLYKVHSGLGIRDTRCWN 3185
             +   V   I    R FLW G  S     +V W  + L K   GLGI+D  CWN
Sbjct: 652  ILPKKVLKDIEKRLRCFLWAGNCSGRAATKVAWSEICLPKCEGGLGIKDLHCWN 705



 Score = 54.3 bits (129), Expect(3) = 3e-76
 Identities = 28/69 (40%), Positives = 43/69 (62%), Gaps = 4/69 (5%)
 Frame = +2

Query: 1907 HEFSINDIRTALFDIEDDKSPGPDGYSSTFFKKAWNIVGTN-SSSALMEFL---RLVCCW 2074
            +EF+ +DIR   F +  +KSPGPDG++  FF+KAW ++G N  ++A+ EF     L+   
Sbjct: 271  NEFTHDDIRAVFFSMNPNKSPGPDGFNGCFFQKAWLVIGDNVVAAAVKEFFSYGSLLMEL 330

Query: 2075 NKLIMLLCP 2101
            N  I+ L P
Sbjct: 331  NSTIITLVP 339



 Score = 23.5 bits (49), Expect(3) = 3e-76
 Identities = 16/87 (18%), Positives = 35/87 (40%), Gaps = 7/87 (8%)
 Frame = +2

Query: 3167 GYKMLEC---VLLSKVLWNIHTKKDSLWYRWIDHVDMKNSTILRPKHKEGFPLFYQKVIE 3337
            G K L C    L+   +WN+ +   + W  W+    +K ++             ++K+++
Sbjct: 697  GIKDLHCWNKALMISHIWNLVSSSSNFWTDWVKVYLLKGNSFWNAPLPSICSWNWRKLLK 756

Query: 3338 YQD----NLITMEGLATGTASRLDVWH 3406
             ++      + + G    T+   D WH
Sbjct: 757  IRELCCSFFVNIIGDGRATSLWFDNWH 783


>emb|CAB72467.1| putative protein [Arabidopsis thaliana]
          Length = 762

 Score =  251 bits (642), Expect(3) = 9e-76
 Identities = 135/350 (38%), Positives = 198/350 (56%), Gaps = 5/350 (1%)
 Frame = +3

Query: 2142 PISCCNVFYKVIVELLGSRLGETLPFFIGKVQ*AFVKGRSMVENIILAQVLMWGYTRKRT 2321
            PISCCNV YKVI ++L +RL   LP FI   Q +FVK R ++EN++LA  L+  Y +   
Sbjct: 84   PISCCNVMYKVISKILANRLKLLLPQFIAGNQSSFVKDRLLIENVLLATDLVKDYHKDSI 143

Query: 2322 SPKCTLKIDLRKAYYTISWEFLQYVLLALNFLKKLIRWVMACVTTPSFSLRVNGENFGFL 2501
            S +C +KID+ KA  ++ W FL   L A++F +  I W+  C+TTPSFS++VNGE  GF 
Sbjct: 144  SERCAIKIDISKASDSVQWSFLINTLTAMHFPEMFIHWIRLCITTPSFSVQVNGELAGFF 203

Query: 2502 ESPRELRQGDPLCPFLFMICMKYLLRSLRKTTS*DNFNYHPKCEKLKICLLVFVDDLMIS 2681
            +S R LRQG  L P+LF+ICM  L + L K        YHP C+++ +  L F DDLMI 
Sbjct: 204  QSSRGLRQGCALSPYLFVICMDVLSKLLDKVVGIGRIGYHPHCKRMGLTHLSFADDLMIL 263

Query: 2682 TRGDNPFVRIVCDVLRNFVHASSLKANCLKSNILLAGMDDLEKSRISTLTGFSHGSMPFR 2861
            T G    +  + +V   F   S LK +  KS I  AG+    ++++ T   F  G +P R
Sbjct: 264  TDGQCRSIEGIIEVFDLFSKWSGLKISMEKSTIFSAGLSSTSRAQLHTHFPFEVGELPIR 323

Query: 2862 YLAIPSVGVYLNVADYGPLVDKVSNTIQGWA*LHLSYAGRLDVIRTMVQGIESFLLGILH 3041
            YL +P V   L+  DY PL++++   I  W+   LS+AGR ++I +++    +F L    
Sbjct: 324  YLGLPLVTKRLSSVDYAPLIEQIRKRIGSWSSRFLSFAGRFNLISSIIWSSCNFWLSAFQ 383

Query: 3042 ISAAV*DRIISLCRRFLWDG-----KQSRVTWKTLYLYKVHSGLGIRDTR 3176
            +  A    I  LC  FLW G     K+++++W  +   K   GLG+R  +
Sbjct: 384  LPRACIQEIEKLCSSFLWSGTNLNSKKAKISWNQVCKPKSEGGLGLRSLK 433



 Score = 52.4 bits (124), Expect(3) = 9e-76
 Identities = 22/49 (44%), Positives = 32/49 (65%)
 Frame = +2

Query: 1916 SINDIRTALFDIEDDKSPGPDGYSSTFFKKAWNIVGTNSSSALMEFLRL 2062
            S  +I+  LF + +DKSPGPDG++S FFK++W I+G     A+  F  L
Sbjct: 7    SAEEIKKVLFSMPNDKSPGPDGFTSEFFKESWEILGPEFILAIQSFFAL 55



 Score = 31.6 bits (70), Expect(3) = 9e-76
 Identities = 13/49 (26%), Positives = 26/49 (53%), Gaps = 1/49 (2%)
 Frame = +2

Query: 3200 KVLWNIHTKKDSLWYRWIDHVDMKNSTILRPKHKEGF-PLFYQKVIEYQ 3343
            K++W I +  DSLW +W++H  +K       K         ++K+++Y+
Sbjct: 442  KLVWRIISHGDSLWVKWVEHNLLKREIFWIVKENANLGSWIWKKILKYR 490


>gb|AAD15471.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1277

 Score =  261 bits (668), Expect(3) = 5e-74
 Identities = 137/347 (39%), Positives = 200/347 (57%), Gaps = 5/347 (1%)
 Frame = +3

Query: 2142 PISCCNVFYKVIVELLGSRLGETLPFFIGKVQ*AFVKGRSMVENIILAQVLMWGYTRKRT 2321
            PISCCNV YKVI +++ +RL   LP FI + Q AFV+ R ++EN++LA  L+  Y +   
Sbjct: 673  PISCCNVIYKVISKIIANRLKVMLPTFILQNQSAFVRERLLMENVLLATELVKDYHKDSI 732

Query: 2322 SPKCTLKIDLRKAYYTISWEFLQYVLLALNFLKKLIRWVMACVTTPSFSLRVNGENFGFL 2501
            SP+C +KID+ KA+ ++ W+FL   L AL F +K   W+  C++T +FS++VN E  GF 
Sbjct: 733  SPRCAMKIDISKAFDSVQWQFLLNTLEALKFPEKFRHWIKLCISTATFSVQVNSEQAGFF 792

Query: 2502 ESPRELRQGDPLCPFLFMICMKYLLRSLRKTTS*DNFNYHPKCEKLKICLLVFVDDLMIS 2681
             S R LRQG  L P+LF+ICM  L   +       N  YHPKC+KL +  L F DDLM+ 
Sbjct: 793  GSKRGLRQGCALSPYLFVICMNVLSHMIDVAAVHRNIGYHPKCKKLSLTHLCFADDLMVF 852

Query: 2682 TRGDNPFVRIVCDVLRNFVHASSLKANCLKSNILLAGMDDLEKSRISTLTGFSHGSMPFR 2861
              G    V  V ++ ++F   S L  +  KS + LA + +L ++ I +   F+ G +P R
Sbjct: 853  IDGQQRSVEGVINIFKDFAGKSGLHISLEKSTLYLAEVSELNRNNILSAFPFASGQLPVR 912

Query: 2862 YLAIPSVGVYLNVADYGPLVDKVSNTIQGWA*LHLSYAGRLDVIRTMVQGIESFLLGILH 3041
            YL  P +   +  ADY PL+DKV + I  W    LSYAGRL +I +++  + +F +    
Sbjct: 913  YLGFPLLTKQMTTADYSPLLDKVRSKISSWTARSLSYAGRLALINSVIVSLSNFWMSAYR 972

Query: 3042 ISAAV*DRIISLCRRFLWDG-----KQSRVTWKTLYLYKVHSGLGIR 3167
            + A     I  LC  FLW G     K++++TW +L   K   GLGI+
Sbjct: 973  LPAGCIKEIEKLCSAFLWSGPELNPKKAKITWTSLCKLKQEGGLGIK 1019



 Score = 39.7 bits (91), Expect(3) = 5e-74
 Identities = 28/131 (21%), Positives = 59/131 (45%), Gaps = 3/131 (2%)
 Frame = +2

Query: 1688 FHSLANTNAKKHFVAALIKEDGTITTTSFGDI*EQFLCLYKELLGTRE---EIISIDETI 1858
            FH  A     ++ +  +   +G +  TS  +I  +    ++E L  +    + ++++E  
Sbjct: 548  FHKAAQVRRMQNSIREIQGPNGVVLQTS-EEIKGEAERFFQEFLNHQPSDFQGMTVEELQ 606

Query: 1859 VDMGLKISTSQADFFVHEFSINDIRTALFDIEDDKSPGPDGYSSTFFKKAWNIVGTNSSS 2038
              M  + S +  D    E +  +I+  LF +  +KSPGPDGY+         I   + ++
Sbjct: 607  NLMSFRCSATDQDMLTREVTSEEIQKVLFAMPSNKSPGPDGYTRLNATILALIPKKDEAT 666

Query: 2039 ALMEFLRLVCC 2071
             + ++  + CC
Sbjct: 667  LMRDYRPISCC 677



 Score = 28.5 bits (62), Expect(3) = 5e-74
 Identities = 18/85 (21%), Positives = 38/85 (44%), Gaps = 1/85 (1%)
 Frame = +2

Query: 3200 KVLWNIHTKKDSLWYRWI-DHVDMKNSTILRPKHKEGFPLFYQKVIEYQDNLITMEGLAT 3376
            K++W + +++ SLW  W+  ++  K S              ++K++ Y+D   +M  +  
Sbjct: 1031 KLIWRLVSRQSSLWVNWVWTYIIRKGSFWSANDRSSLGSWMWKKLLNYRDVAKSMCKVEI 1090

Query: 3377 GTASRLDVWHMRSSFVQLLTMATLA 3451
             + S    W+   S ++ L   T A
Sbjct: 1091 KSGSSTSFWYDNWSQLRQLVDVTNA 1115


>gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1216

 Score =  248 bits (633), Expect(3) = 2e-73
 Identities = 136/350 (38%), Positives = 196/350 (56%), Gaps = 5/350 (1%)
 Frame = +3

Query: 2142 PISCCNVFYKVIVELLGSRLGETLPFFIGKVQ*AFVKGRSMVENIILAQVLMWGYTRKRT 2321
            PISCCNV YK I ++L +RL   LP FI   Q AFVK R ++EN++LA  L+  Y +   
Sbjct: 252  PISCCNVLYKAISKILANRLKRILPKFIVGNQSAFVKDRLLIENVLLATELVKDYHKDSI 311

Query: 2322 SPKCTLKIDLRKAYYTISWEFLQYVLLALNFLKKLIRWVMACVTTPSFSLRVNGENFGFL 2501
            S +C +KID+ KA+ ++ W FL +VL A+NF  + I W+  C++T SFS++VNGE  G+ 
Sbjct: 312  STRCAMKIDISKAFDSLQWSFLTHVLAAMNFPGEFIHWISLCMSTASFSIQVNGELAGYF 371

Query: 2502 ESPRELRQGDPLCPFLFMICMKYLLRSLRKTTS*DNFNYHPKCEKLKICLLVFVDDLMIS 2681
             S R LRQG  L P+LF+I M  L R L K      F YHP+C+ L +  L F DDLMI 
Sbjct: 372  RSARGLRQGCSLSPYLFVISMDVLSRMLDKAAGAREFGYHPRCKTLGLTHLCFADDLMIL 431

Query: 2682 TRGDNPFVRIVCDVLRNFVHASSLKANCLKSNILLAGMDDLEKSRISTLTGFSHGSMPFR 2861
            T G    V  +  VL  F     LK    K+ + LAG+ D  +  +S+   F  G +P R
Sbjct: 432  TDGKIRSVDGIVKVLNQFAAKLGLKICMEKTTLYLAGVSDHSRQLMSSRYSFGVGKLPVR 491

Query: 2862 YLAIPSVGVYLNVADYGPLVDKVSNTIQGWA*LHLSYAGRLDVIRTMVQGIESFLLGILH 3041
            YL +P V   L  +DY PL+D++   I  W   +LS+AGRL +I +++  I +F +    
Sbjct: 492  YLGLPLVTKRLTTSDYSPLIDQIRRRIGMWTSRYLSFAGRLSLINSVLWSITNFWMNAFR 551

Query: 3042 ISAAV*DRIISLCRRFLWDG-----KQSRVTWKTLYLYKVHSGLGIRDTR 3176
            +     + I  +    LW G     K+++V+W  +   K   GLG++  R
Sbjct: 552  LPRECINEINRISSALLWSGPELNPKKAKVSWDEICKPKKEGGLGLQSLR 601



 Score = 53.9 bits (128), Expect(3) = 2e-73
 Identities = 32/125 (25%), Positives = 58/125 (46%), Gaps = 3/125 (2%)
 Frame = +2

Query: 1688 FHSLANTNAKKHFVAALIKEDGTITTTSFGDI*EQFLCLYKELLGTRE---EIISIDETI 1858
            FH    T    + +  ++  DG + T+   DI  + +  +++ L T     E + ++E  
Sbjct: 97   FHRAITTREAVNSIREIVTRDGLVVTSQ-QDIQTEAVNYFQDFLQTIPADYEGMCVEELE 155

Query: 1859 VDMGLKISTSQADFFVHEFSINDIRTALFDIEDDKSPGPDGYSSTFFKKAWNIVGTNSSS 2038
              +  + S           +  +I+  +F +  DKSPGPDGY+S F+K +W I+G     
Sbjct: 156  NLLPFRCSEDDHRLLTRVVTGEEIKKVIFSMPKDKSPGPDGYTSEFYKASWEIIGDEVII 215

Query: 2039 ALMEF 2053
            A+  F
Sbjct: 216  AIQSF 220



 Score = 25.4 bits (54), Expect(3) = 2e-73
 Identities = 7/17 (41%), Positives = 13/17 (76%)
 Frame = +2

Query: 3200 KVLWNIHTKKDSLWYRW 3250
            K++W + + +DSLW +W
Sbjct: 610  KLIWRLLSCQDSLWVKW 626


>dbj|BAB01845.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 893

 Score =  234 bits (596), Expect(2) = 4e-70
 Identities = 132/372 (35%), Positives = 208/372 (55%), Gaps = 6/372 (1%)
 Frame = +3

Query: 2142 PISCCNVFYKVIVELLGSRLGETLPFFIGKVQ*AFVKGRSMVENIILAQVLMWGYTRKRT 2321
            PISC N  YKVI +LL SRL + L   I   Q AF+ GR + EN++LA  ++ GY  K  
Sbjct: 526  PISCLNTLYKVIAKLLTSRLKKLLNEVISPSQSAFLPGRLLSENVLLATEIVHGYNTKNI 585

Query: 2322 SPKCTLKIDLRKAYYTISWEFLQYVLLALNFLKKLIRWVMACVTTPSFSLRVNGENFGFL 2501
            S +  LK+DLRKA+ ++ W+F+     AL   +K + W+  C++TP FS+ VNG + GF 
Sbjct: 586  SSRGMLKVDLRKAFDSVRWDFIISAFRALAVPEKFVCWINQCISTPYFSVMVNGSSSGFF 645

Query: 2502 ESPRELRQGDPLCPFLFMICMKYLLRSLRKTTS*DNFNYHPKCEKLKICLLVFVDDLMIS 2681
            +S + LRQGDPL P+LF++ M+     L+        +YHPK   L I  L+F DD+M+ 
Sbjct: 646  KSNKGLRQGDPLSPYLFVLAMEVFSSLLKARFDAGYIHYHPKTADLSISHLMFADDVMVF 705

Query: 2682 TRGDNPFVRIVCDVLRNFVHASSLKANCLKSNILLAGMDDLEKSRISTLTGFSHGSMPFR 2861
              G +  +  + + L +F   S L  N  K+N+ LAG D++E   IS   GF   ++P R
Sbjct: 706  FDGGSSSLHGISEALDDFASWSGLHVNKDKTNLYLAGTDEVEALAISHY-GFPISTLPIR 764

Query: 2862 YLAIPSVGVYLNVADYGPLVDKVSNTIQGWA*LHLSYAGRLDVIRTMVQGIESFLLGILH 3041
            YL +P +   L +++Y     ++    + WA   LS+AGR+ +I +++ G+ +F +    
Sbjct: 765  YLGLPLMSRKLKISEY-----ELVKRFRSWAVKSLSFAGRVQLITSVITGLVNFWMSTFV 819

Query: 3042 ISAAV*DRIISLCRRFLWDG-----KQSRVTWKTLYLYKVHSGLGIRDTRCWNVSYYLK- 3203
            +      +I SLC RFLW G     K +++ W  + L K   G+G+R    WN ++YL+ 
Sbjct: 820  LLLGCVKKIESLCSRFLWSGSIDASKGAKIAWSGVCLPKNEGGVGLRRFTPWNKTFYLRF 879

Query: 3204 CYGIFIQRRILY 3239
             + +F    +L+
Sbjct: 880  IWPLFADNDVLW 891



 Score = 61.6 bits (148), Expect(2) = 4e-70
 Identities = 46/146 (31%), Positives = 71/146 (48%), Gaps = 8/146 (5%)
 Frame = +2

Query: 1688 FHSLANTNAKKHFVAALIKEDGTITTTSFG---DI*EQFLCLYKELLGTREEIISIDETI 1858
            FH +A+     + +  LI + G    T  G    I E     ++ LL   E   S+ ++ 
Sbjct: 368  FHKMADMRKSINTINFLIDDFGERIETQQGIKEGIKEHSCNFFESLLCGVEGENSLAQSD 427

Query: 1859 VDMGL--KISTSQADFFVHEFSINDIRTALFDIEDDKSPGPDGYSSTFFKKAWNIVGTNS 2032
            +++ L  + S  Q +     FS  DI+ A F +  +K+ GPDGYSS FFK  W +VG   
Sbjct: 428  MNLLLSFRCSVDQINDLERSFSDLDIQEAFFSLPRNKASGPDGYSSEFFKGVWFVVGPEV 487

Query: 2033 SSALMEFLR---LVCCWNKLIMLLCP 2101
            + A+ EF R   L+  WN   ++L P
Sbjct: 488  TEAVQEFFRSGQLLKQWNATTLVLIP 513


>gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcript_fact.hmm, score:
            72.31) [Arabidopsis thaliana]
          Length = 928

 Score =  234 bits (598), Expect(2) = 2e-69
 Identities = 131/335 (39%), Positives = 187/335 (55%), Gaps = 5/335 (1%)
 Frame = +3

Query: 2142 PISCCNVFYKVIVELLGSRLGETLPFFIGKVQ*AFVKGRSMVENIILAQVLMWGYTRKRT 2321
            PISCCNV YKVI +++ +RL   LP FI   Q AFVK R ++EN++LA  ++  Y +   
Sbjct: 419  PISCCNVLYKVISKIIANRLKLVLPKFIVGNQSAFVKDRLLIENVLLATEIVKDYHKDSV 478

Query: 2322 SPKCTLKIDLRKAYYTISWEFLQYVLLALNFLKKLIRWVMACVTTPSFSLRVNGENFGFL 2501
            S +C LKID+ KA+ ++ W+FL  VL A+NF  +   W+  C+TT SFS++VNGE  G  
Sbjct: 479  SSRCALKIDISKAFDSVQWKFLINVLEAMNFPPEFTHWITLCITTASFSVQVNGELAGVF 538

Query: 2502 ESPRELRQGDPLCPFLFMICMKYLLRSLRKTTS*DNFNYHPKCEKLKICLLVFVDDLMIS 2681
             S RELRQG  L P+LF+I M  L + L K      F YHPKC  + +  L F DDLMI 
Sbjct: 539  SSARELRQGCSLSPYLFVISMDVLSKMLDKAVGARQFGYHPKCRAIGLTHLSFADDLMIL 598

Query: 2682 TRGDNPFVRIVCDVLRNFVHASSLKANCLKSNILLAGMDDLEKSRISTLTGFSHGSMPFR 2861
            + G    +  +  VL  F   S LK +  KS + LAG+       I     F  G +P R
Sbjct: 599  SDGKVRSIDGIVKVLYEFAKWSGLKISMEKSTMYLAGVQASVYQEIVQKFSFDVGKLPVR 658

Query: 2862 YLAIPSVGVYLNVADYGPLVDKVSNTIQGWA*LHLSYAGRLDVIRTMVQGIESFLLGILH 3041
            YL +P V   L  +D  PL++++   I+ W    LS+AGRL++I + +  I +F +    
Sbjct: 659  YLGLPLVSKRLTASDCLPLIEQLRKKIEAWTSRFLSFAGRLNLISSTLWSICNFWMAAFR 718

Query: 3042 ISAAV*DRIISLCRRFLWDG-----KQSRVTWKTL 3131
            +  A    I  LC  FLW G      +++V+W+ +
Sbjct: 719  LPRACIREIDKLCSAFLWSGTELSSNKAKVSWEAI 753



 Score = 58.5 bits (140), Expect(2) = 2e-69
 Identities = 33/124 (26%), Positives = 59/124 (47%), Gaps = 2/124 (1%)
 Frame = +2

Query: 1688 FHSLANTNAKKHFVAALIKEDGTITTTS--FGDI*EQFLCLYKELLGTREEIISIDETIV 1861
            FH        ++ +  +I  DG++ +         E     + +L+    E I+++E   
Sbjct: 264  FHRAVVAREAQNSIREIICHDGSVASQEEKIKTEAEHHFREFLQLIPNDFEGIAVEELQD 323

Query: 1862 DMGLKISTSQADFFVHEFSINDIRTALFDIEDDKSPGPDGYSSTFFKKAWNIVGTNSSSA 2041
             +  + S S  +   +  S  +I   +F + +DKSPGPDGY++ F+K AWNI+G     A
Sbjct: 324  LLPYRCSDSDKEMLTNHVSAEEIHKVVFSMPNDKSPGPDGYTAEFYKGAWNIIGAEFILA 383

Query: 2042 LMEF 2053
            +  F
Sbjct: 384  IQSF 387


>gb|AAC95175.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1352

 Score =  236 bits (602), Expect(3) = 2e-69
 Identities = 129/359 (35%), Positives = 197/359 (54%), Gaps = 5/359 (1%)
 Frame = +3

Query: 2142 PISCCNVFYKVIVELLGSRLGETLPFFIGKVQ*AFVKGRSMVENIILAQVLMWGYTRKRT 2321
            PISCCNV YK++ +L+ +RL E LP  I   Q AF+K R M+EN++LA  L+  Y ++  
Sbjct: 678  PISCCNVLYKIVSKLMANRLKEILPASIAPNQSAFIKDRLMMENLLLASELVKDYHKESI 737

Query: 2322 SPKCTLKIDLRKAYYTISWEFLQYVLLALNFLKKLIRWVMACVTTPSFSLRVNGENFGFL 2501
            S +  LKID+ KA+  + W FL  VL A++  +  I W+  C+ T SFS++VNGE  GF 
Sbjct: 738  SSRSALKIDISKAFDFVQWPFLINVLKAIHLPEMFIHWIELCIGTASFSVQVNGELSGFF 797

Query: 2502 ESPRELRQGDPLCPFLFMICMKYLLRSLRKTTS*DNFNYHPKCEKLKICLLVFVDDLMIS 2681
             S R LRQG  L P+L++ICM  L   L K       +YHP+C  + +  L F DD+M+ 
Sbjct: 798  RSERGLRQGCSLSPYLYVICMNVLSCMLDKAAVEKKISYHPRCRNMNLTHLCFADDIMVF 857

Query: 2682 TRGDNPFVRIVCDVLRNFVHASSLKANCLKSNILLAGMDDLEKSRISTLTGFSHGSMPFR 2861
            + G +  ++    +   F   S LK +  KS I +AG+    K+ I     F  G++P +
Sbjct: 858  SDGTSKSIQGTLAIFEKFAAMSWLKISLEKSTIFMAGISPNAKTSILQQFPFELGTLPVK 917

Query: 2862 YLAIPSVGVYLNVADYGPLVDKVSNTIQGWA*LHLSYAGRLDVIRTMVQGIESFLLGILH 3041
            YL +P +   +  +DY PLV+K+   I  W    LS+AGRL +I++++  I +F L +  
Sbjct: 918  YLGLPLLTKRMTQSDYLPLVEKIRARITSWTNRFLSFAGRLQLIKSVLSSITNFWLSVFR 977

Query: 3042 ISAAV*DRIISLCRRFLWDG-----KQSRVTWKTLYLYKVHSGLGIRDTRCWNVSYYLK 3203
            +  A    I  +   FLW G     K++++ W  +   K   GLG++  +  N    LK
Sbjct: 978  LPKACLQEIEKMFSAFLWSGPDLNTKKAKIAWSEVCKLKEEGGLGLKPLKEANEVSLLK 1036



 Score = 40.8 bits (94), Expect(3) = 2e-69
 Identities = 22/54 (40%), Positives = 32/54 (59%), Gaps = 4/54 (7%)
 Frame = +2

Query: 1904 VHEFSINDIR--TALFDIEDD--KSPGPDGYSSTFFKKAWNIVGTNSSSALMEF 2053
            + E    D R  T+  DI+++  KSPGPDGY+  FFK AW ++G +   A+  F
Sbjct: 593  IREIQCTDGRVCTSHDDIKEEAHKSPGPDGYTVEFFKTAWPVLGRDLVIAIQSF 646



 Score = 37.4 bits (85), Expect(3) = 2e-69
 Identities = 17/74 (22%), Positives = 35/74 (47%), Gaps = 1/74 (1%)
 Frame = +2

Query: 3188 VLLSKVLWNIHTKKDSLWYRWIDHVDMKNSTILRPKHKEGF-PLFYQKVIEYQDNLITME 3364
            V L K++W I + +DSLW +W++   ++  T    K   G     ++K+++ +D      
Sbjct: 1032 VSLLKLIWRILSARDSLWVKWVNKHLIRKETFWSVKENTGLGSWLWRKILKQRDKARLFH 1091

Query: 3365 GLATGTASRLDVWH 3406
             +   + +    WH
Sbjct: 1092 RMEVRSGTFTSFWH 1105


>emb|CAA66812.1| non-ltr retrotransposon reverse transcriptase-like protein
            [Arabidopsis thaliana]
          Length = 893

 Score =  231 bits (589), Expect(2) = 2e-69
 Identities = 131/372 (35%), Positives = 206/372 (55%), Gaps = 6/372 (1%)
 Frame = +3

Query: 2142 PISCCNVFYKVIVELLGSRLGETLPFFIGKVQ*AFVKGRSMVENIILAQVLMWGYTRKRT 2321
            PISC N  YKVI +LL SRL + L   I   Q AF+ GR + EN++LA  ++ GY  K  
Sbjct: 526  PISCLNTLYKVIAKLLTSRLKKLLNEVISPSQSAFLPGRLLSENVLLATEIVHGYNTKNI 585

Query: 2322 SPKCTLKIDLRKAYYTISWEFLQYVLLALNFLKKLIRWVMACVTTPSFSLRVNGENFGFL 2501
            S +  LK+DLRKA+ ++ W+F+     AL   +K + W+  C++TP FS+ VNG + GF 
Sbjct: 586  SSRGMLKVDLRKAFDSVRWDFIISAFRALAVPEKFVCWINQCISTPYFSVMVNGSSSGFF 645

Query: 2502 ESPRELRQGDPLCPFLFMICMKYLLRSLRKTTS*DNFNYHPKCEKLKICLLVFVDDLMIS 2681
            +S + LRQGDPL P+LF++ M+     L+         YHPK   L I  L+F DD+M+ 
Sbjct: 646  KSNKGLRQGDPLSPYLFVLAMEVFSSLLKARFDAGYIQYHPKTADLSISHLMFADDVMVF 705

Query: 2682 TRGDNPFVRIVCDVLRNFVHASSLKANCLKSNILLAGMDDLEKSRISTLTGFSHGSMPFR 2861
              G +  +  + + L +F   S L  N  K+N+ LAG D++E   IS   GF   ++P R
Sbjct: 706  FDGGSSSLHGISEALDDFASWSGLHVNKDKTNLYLAGTDEVEALAISHY-GFPISTLPIR 764

Query: 2862 YLAIPSVGVYLNVADYGPLVDKVSNTIQGWA*LHLSYAGRLDVIRTMVQGIESFLLGILH 3041
            YL +P +   L +++Y     ++    + WA   LS+AGR+ +I +++ G+ +F +    
Sbjct: 765  YLGLPLMSRKLKISEY-----ELVKRFRSWAVKSLSFAGRVQLITSVITGLVNFWMSTFV 819

Query: 3042 ISAAV*DRIISLCRRFLWDG-----KQSRVTWKTLYLYKVHSGLGIRDTRCWNVSYYLK- 3203
            +      +I SLC RFLW G     K +++ W  + L K   G+ +R    WN ++YL+ 
Sbjct: 820  LLLGCVKKIESLCSRFLWSGSIDASKGAKIAWSGVCLPKNEGGVALRRFTPWNKTFYLRF 879

Query: 3204 CYGIFIQRRILY 3239
             + +F    +L+
Sbjct: 880  IWPLFADNDVLW 891



 Score = 61.6 bits (148), Expect(2) = 2e-69
 Identities = 46/146 (31%), Positives = 71/146 (48%), Gaps = 8/146 (5%)
 Frame = +2

Query: 1688 FHSLANTNAKKHFVAALIKEDGTITTTSFG---DI*EQFLCLYKELLGTREEIISIDETI 1858
            FH +A+     + +  LI + G    T  G    I E     ++ LL   E   S+ ++ 
Sbjct: 368  FHKMADMRKSINTINFLIDDFGERIETQQGIKEGIKEHSCNFFESLLCGVEGENSLAQSD 427

Query: 1859 VDMGL--KISTSQADFFVHEFSINDIRTALFDIEDDKSPGPDGYSSTFFKKAWNIVGTNS 2032
            +++ L  + S  Q +     FS  DI+ A F +  +K+ GPDGYSS FFK  W +VG   
Sbjct: 428  MNLLLSFRCSVDQINDLERSFSDLDIQEAFFSLPRNKASGPDGYSSEFFKGVWFVVGPEV 487

Query: 2033 SSALMEFLR---LVCCWNKLIMLLCP 2101
            + A+ EF R   L+  WN   ++L P
Sbjct: 488  TEAVQEFFRSGQLLKQWNATTLVLIP 513


>gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm, score: 60.13)
            [Arabidopsis thaliana]
          Length = 1164

 Score =  237 bits (605), Expect(3) = 2e-67
 Identities = 129/359 (35%), Positives = 203/359 (56%), Gaps = 5/359 (1%)
 Frame = +3

Query: 2142 PISCCNVFYKVIVELLGSRLGETLPFFIGKVQ*AFVKGRSMVENIILAQVLMWGYTRKRT 2321
            PISC N  YKVI +LL  RL + LP  I   Q AF+ GR  +EN++LA  L+ GY +K  
Sbjct: 422  PISCLNTVYKVISKLLTDRLKDFLPAAISHSQSAFMPGRLFLENVLLATELVHGYNKKNI 481

Query: 2322 SPKCTLKIDLRKAYYTISWEFLQYVLLALNFLKKLIRWVMACVTTPSFSLRVNGENFGFL 2501
            +P   LK+DLRKA+ ++ W+F+   L ALN  +K   W++ C++T SFS+ +NG + G  
Sbjct: 482  APSSMLKVDLRKAFDSVRWDFIVSALRALNVPEKFTCWILECLSTASFSVILNGHSAGHF 541

Query: 2502 ESPRELRQGDPLCPFLFMICMKYLLRSLRKTTS*DNFNYHPKCEKLKICLLVFVDDLMIS 2681
             S + LRQGDP+ P+LF++ M+     L+   +     YHPK  +L+I  L+F DD+MI 
Sbjct: 542  WSSKGLRQGDPMSPYLFVLAMEVFSGLLQSRYTSGYIAYHPKTSQLEISHLMFADDVMIF 601

Query: 2682 TRGDNPFVRIVCDVLRNFVHASSLKANCLKSNILLAGMDDLEKSRISTLTGFSHGSMPFR 2861
              G +  +  + + L +F   S L  N  K+ +  AG+   E   +++  GF  GS+P R
Sbjct: 602  FDGKSSSLHGIVESLEDFAGWSGLLMNTNKTQLYHAGLSQSESDSMASY-GFKLGSLPVR 660

Query: 2862 YLAIPSVGVYLNVADYGPLVDKVSNTIQGWA*LHLSYAGRLDVIRTMVQGIESFLLGILH 3041
            YL +P +   L +A+Y PL++K++     W    LS+AGR+ ++ +++ GI +F +    
Sbjct: 661  YLGLPLMSRKLTIAEYAPLIEKITARFNSWVVRLLSFAGRVQLLASVISGIVNFWISSFI 720

Query: 3042 ISAAV*DRIISLCRRFLWDGK-----QSRVTWKTLYLYKVHSGLGIRDTRCWNVSYYLK 3203
            +      +I SLC RFLW  +      ++V W  + L K   G+G+R     N + YL+
Sbjct: 721  LPLGCIKKIESLCSRFLWSSRIDKKGIAKVAWSQVCLPKAEGGIGLRRFAVSNRTLYLR 779



 Score = 48.1 bits (113), Expect(3) = 2e-67
 Identities = 24/66 (36%), Positives = 36/66 (54%), Gaps = 3/66 (4%)
 Frame = +2

Query: 1913 FSINDIRTALFDIEDDKSPGPDGYSSTFFKKAWNIVGTNSSSALMEFL---RLVCCWNKL 2083
            FS   I+ A F +  +K+ GPDG+S  FF   W I+G   + A+ EF    +L+  WN  
Sbjct: 344  FSSEQIKNAFFSLPRNKASGPDGFSPEFFCACWPIIGGEVTEAIHEFFTSGKLLKQWNAT 403

Query: 2084 IMLLCP 2101
             ++L P
Sbjct: 404  NLVLIP 409



 Score = 21.9 bits (45), Expect(3) = 2e-67
 Identities = 8/30 (26%), Positives = 14/30 (46%)
 Frame = +2

Query: 3191 LLSKVLWNIHTKKDSLWYRWIDHVDMKNST 3280
            L  +++W + +   SLW  W     +  ST
Sbjct: 776  LYLRMIWLLFSNSGSLWVAWHKQHSLGKST 805


>gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00078 [Arabidopsis
            thaliana]
          Length = 1253

 Score =  214 bits (546), Expect(2) = 9e-65
 Identities = 124/351 (35%), Positives = 189/351 (53%), Gaps = 9/351 (2%)
 Frame = +3

Query: 2142 PISCCN----VFYKVIVELLGSRLGETLPFFIGKVQ*AFVKGRSMVENIILAQVLMWGYT 2309
            PISC +      YKVI  LL +RL   L   I   Q AF+ GR + EN++LA  L+ GY 
Sbjct: 474  PISCNDFGPITLYKVIARLLTNRLQCLLSQVISPFQSAFLPGRFLAENVLLATELVQGYN 533

Query: 2310 RKRTSPKCTLKIDLRKAYYTISWEFLQYVLLALNFLKKLIRWVMACVTTPSFSLRVNGEN 2489
            R+   P+  LK+DLRKA+ +I W+F+   L A+    + + W+  C++TP+FS+ VNG  
Sbjct: 534  RQNIDPRGMLKVDLRKAFDSIRWDFIISALKAIGIPDRFVYWITQCISTPTFSVCVNGNT 593

Query: 2490 FGFLESPRELRQGDPLCPFLFMICMKYLLRSLRKTTS*DNFNYHPKCEKLKICLLVFVDD 2669
             GF +S R LRQG+PL PFLF++ M+     L         +YHPK   L I  L+F DD
Sbjct: 594  GGFFKSTRGLRQGNPLSPFLFVLAMEVFSSLLNSRFQAGYIHYHPKTSPLSISHLMFADD 653

Query: 2670 LMISTRGDNPFVRIVCDVLRNFVHASSLKANCLKSNILLAGMDDLEKSRISTLTGFSHGS 2849
            +M+   G +  +  + + L +F   S L  N  K+++ LAG+D +E S I+         
Sbjct: 654  IMVFFDGGSSSLHGISEALEDFAFWSGLVLNREKTHLYLAGLDRIEASTIAR-------- 705

Query: 2850 MPFRYLAIPSVGVYLNVADYGPLVDKVSNTIQGWA*LHLSYAGRLDVIRTMVQGIESFLL 3029
                          L +A+YGPL++K++   + W+   LS+AGR+ +I +++ GI +F +
Sbjct: 706  -------------KLRIAEYGPLLEKLAKRFRSWSVKCLSFAGRVQLIASVISGIINFWI 752

Query: 3030 GILHISAAV*DRIISLCRRFLWDG-----KQSRVTWKTLYLYKVHSGLGIR 3167
                +      RI +LC RFLW G     K ++V W  + L K   G+G+R
Sbjct: 753  STFILPKGCVKRIEALCARFLWSGNIDVKKGAKVAWSEVCLPKEEGGVGLR 803



 Score = 62.8 bits (151), Expect(2) = 9e-65
 Identities = 41/143 (28%), Positives = 69/143 (48%), Gaps = 5/143 (3%)
 Frame = +2

Query: 1688 FHSLANTNAKKHFVAALIKEDGTITTTSFGDI*EQFLCLYKELLGTREEIISIDETIVDM 1867
            FH +A++    + +  +I ++G    T  G I E  +  +  LLG       + +   D+
Sbjct: 320  FHRMADSRKAVNTIHIIIDDNGVKIDTQLG-IKEHCIEYFSNLLGGEVGPPMLIQEDFDL 378

Query: 1868 GL--KISTSQADFFVHEFSINDIRTALFDIEDDKSPGPDGYSSTFFKKAWNIVGTNSSSA 2041
             L  + S  Q       FS  DI++A F    +K+ GPDG+   FFK+ W+++GT  + A
Sbjct: 379  LLPFRCSHDQKKELAMSFSRQDIKSAFFSFPSNKTSGPDGFPVEFFKETWSVIGTEVTDA 438

Query: 2042 LMEFLR---LVCCWNKLIMLLCP 2101
            + EF     L+  WN   ++L P
Sbjct: 439  VSEFFTSSVLLKQWNATTLVLIP 461


Top