BLASTX nr result

ID: Catharanthus22_contig00014972 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00014972
         (2648 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN79321.1| hypothetical protein VITISV_018984 [Vitis vinifera]   489   e-135
emb|CAN69233.1| hypothetical protein VITISV_003380 [Vitis vinifera]   435   e-119
gb|EOX95569.1| DNA/RNA polymerases superfamily protein [Theobrom...   429   e-117
emb|CAN71532.1| hypothetical protein VITISV_018180 [Vitis vinifera]   427   e-117
gb|EOY19305.1| Uncharacterized protein TCM_044370 [Theobroma cacao]   397   e-107
gb|EMJ22494.1| hypothetical protein PRUPE_ppa024499mg, partial [...   395   e-107
gb|AAV88076.1| putative retrotransposon polyprotein [Ipomoea bat...   390   e-105
gb|ABE60891.1| putative polyprotein [Oryza sativa Japonica Group]     387   e-104
gb|ADP20179.1| gag-pol polyprotein [Silene latifolia]                 377   e-101
gb|AAM94350.1| gag-pol polyprotein [Zea mays]                         376   e-101
gb|AAX95495.1| Retrotransposon gag protein, putative [Oryza sati...   375   e-101
gb|AAX96717.1| retrotransposon protein, putative, Ty3-gypsy sub-...   375   e-101
ref|XP_004309164.1| PREDICTED: uncharacterized protein LOC101300...   372   e-100
gb|EMJ11389.1| hypothetical protein PRUPE_ppa017790mg [Prunus pe...   370   2e-99
gb|EMJ08431.1| hypothetical protein PRUPE_ppa026856mg [Prunus pe...   370   2e-99
gb|EMJ00160.1| hypothetical protein PRUPE_ppa020671mg, partial [...   369   3e-99
gb|AAK51582.1|AC022352_18 Putative retroelement [Oryza sativa Ja...   369   4e-99
emb|CAE04927.2| OSJNBa0017P10.4 [Oryza sativa Japonica Group] gi...   368   8e-99
gb|ADP20180.1| mutant gag-pol polyprotein [Pisum sativum]             367   1e-98
gb|AAQ56338.1| putative gag-pol polyprotein [Oryza sativa Japoni...   366   3e-98

>emb|CAN79321.1| hypothetical protein VITISV_018984 [Vitis vinifera]
          Length = 1521

 Score =  489 bits (1260), Expect = e-135
 Identities = 270/537 (50%), Positives = 347/537 (64%), Gaps = 10/537 (1%)
 Frame = -1

Query: 1583 QPTISLIMADRNRSQMTES---KRQDANLECFNCGLRGHYAWECLKKKNLHIGVE-PNDE 1416
            QPT ++   + N+ + + S   ++ DA   CF CG  GHYA  C   K LH  VE P  E
Sbjct: 297  QPTSNVAHQNGNKGKNSMSNGDRKVDATPLCFKCGGHGHYAVVC-PTKGLHFCVEEPESE 355

Query: 1415 QETEEGKEVDFIERIXXXXXXXXXXXXXDTTFLSVVRRILSTPKQQ-KKDWRGTTILQTL 1239
             E+   KE  + E                 +   VVR +L+ PK + ++DWR  +I QT 
Sbjct: 356  LESYLKKEETYNEDEVSEECDYYDGMTEGHSL--VVRPLLTIPKVKGEEDWRRISIFQTR 413

Query: 1238 VCCGNVTRKLIIDGGSSMNVVSEATVEKLNLLTEPHPDPYKVAWIDSSGIPVSKRCLVTF 1059
            + C      +IIDGGSS+N+ S+  VEKLNL TE HP+P++VAW++ + IPVS RCLVTF
Sbjct: 414  ISCHGRLCTMIIDGGSSLNIASQELVEKLNLKTERHPNPFRVAWVNDTSIPVSFRCLVTF 473

Query: 1058 THGT-YTDSIWCDVILMTITHILLG*PWLYDREVKHDGKESTYSFNFNKKQIVLRPLSSE 882
              G  + +S+WC+V+ + ++HILLG PWL+DR+V+HDG E+TY+   N ++ +LRP    
Sbjct: 474  LFGKDFEESVWCEVLPIKVSHILLGRPWLFDRKVQHDGYENTYALIHNGRKKILRP---- 529

Query: 881  AMNNKRATKDKRRNQEETTSNSR*EIV*ERSKGGLIFMAVVKQVKNLLNTNNEDYSLELK 702
             M      K    N +     +  +   E  +  +IF  + ++V+     + E Y    +
Sbjct: 530  -MKEVPPIKKSNENAQPKKVLTMCQFENESKETXVIFALMARKVEEFKEQDKE-YPANAR 587

Query: 701  QLLVDL*DV----APEDLPPMREIQHAIDFVLGSQLPNLLGYRMSLAEHEELKRQVEELL 534
            ++L D  D+     P +LPPMR+IQHAID + G+ LPNL  YRM+  EH ELKRQV+ELL
Sbjct: 588  KILDDFSDLWPVELPNELPPMRDIQHAIDLIPGASLPNLPAYRMNPTEHAELKRQVDELL 647

Query: 533  DDGLIRESLSPCAVPALLTPKKDETWRMCCDCRTINKITVKYRFPIPRLDDRLDMMTDST 354
              G IRESLSPC VPALLTPKKD +WRMC D R INKIT+KYRFPIPRLDD LDMM  S 
Sbjct: 648  TKGFIRESLSPCGVPALLTPKKDGSWRMCVDSRAINKITIKYRFPIPRLDDMLDMMVGSV 707

Query: 353  IYSKIDLTK*YYRLRIRLGDEWKTAFKTKDGFYEWLVMPFGLTNAPSTFMRFMTQVLQPF 174
            I+SKIDL   Y+++RIR GDEWKT+FKTKDG YEWLVMPFGLTNAPSTFMR MTQVL+PF
Sbjct: 708  IFSKIDLRSGYHQIRIRPGDEWKTSFKTKDGLYEWLVMPFGLTNAPSTFMRIMTQVLKPF 767

Query: 173  IGRFLVVYF*GHT*I*QDGRRTIDHLQQVMRVLRREKLYINLKKCSFMCSSVVFLGF 3
            IGRF+VVYF       +      +HL+QVMR LR EK YINLKKC+FM  SVVFLGF
Sbjct: 768  IGRFVVVYFDDILIYSRSCEDHEEHLKQVMRTLRAEKFYINLKKCTFMSPSVVFLGF 824



 Score = 88.2 bits (217), Expect(2) = 2e-18
 Identities = 42/111 (37%), Positives = 68/111 (61%), Gaps = 1/111 (0%)
 Frame = -2

Query: 2179 EKCAIDVPNFDGKIDPRAFTDWFVTLKRFFDW*DMSDERKVRYTVMKLVGQAQIWWSGKE 2000
            +K  ++V  F GK++P AF DW ++++ +FDW  M + RKVR+   KL G A++WW   E
Sbjct: 86   KKVRLEVAEFYGKLNPTAFLDWIMSMEDYFDWYAMPENRKVRFVKAKLKGAARLWWHNIE 145

Query: 1999 FDLQLAGNYSV-TWEEMKLELKRKNLLRYYQQELFDELTNLRQRSMTVIEY 1850
                  G   + TW+EMKL++K   L   Y+Q ++ +L +L+Q + +V EY
Sbjct: 146  NQAHRTGQPPIDTWDEMKLKMKEHFLPTDYEQLMYTKLFSLKQGTKSVEEY 196



 Score = 33.9 bits (76), Expect(2) = 2e-18
 Identities = 24/90 (26%), Positives = 44/90 (48%), Gaps = 13/90 (14%)
 Frame = -3

Query: 1782 FKQGLKPEIWN*MLTHQVNNVDDAFQLAYMMES--QKQPAKRFSSQVGE----------- 1642
            +K GL+ EI   M+      VDD +QLA  +E   + + ++  SSQ+G            
Sbjct: 220  YKAGLRMEIQLEMIAAHTYTVDDVYQLALKIEEGLKFRVSRHPSSQIGSTFSNRTTSKPL 279

Query: 1641 ATNTRKFTANIRGTNSTAATANNKFNNGRQ 1552
            +T+  + + ++ G ++T  T+N    NG +
Sbjct: 280  STSNFRTSIHVNGGDNTQPTSNVAHQNGNK 309


>emb|CAN69233.1| hypothetical protein VITISV_003380 [Vitis vinifera]
          Length = 1292

 Score =  435 bits (1118), Expect = e-119
 Identities = 229/430 (53%), Positives = 299/430 (69%), Gaps = 6/430 (1%)
 Frame = -1

Query: 1313 VVRRILSTPK-QQKKDWRGTTILQTLVCCGNVTRKLIIDGGSSMNVVSEATVEKLNLLTE 1137
            VVR +L+ PK ++++DWR T+I QT + C      +IIDGGSS+N+ S+  VEKLNL TE
Sbjct: 306  VVRPLLTVPKVKREEDWRRTSIFQTRISCQGRLCTMIIDGGSSLNIASQELVEKLNLKTE 365

Query: 1136 PHPDPYKVAWIDSSGIPVSKRCLVTFTHGT-YTDSIWCDVILMTITHILLG*PWLYDREV 960
             HP+P++VAW++ + IPVS RCLVTF  G  + +S+WC+V+ + ++HILLG PWL+DR V
Sbjct: 366  RHPNPFRVAWVNDTSIPVSFRCLVTFLFGKDFEESVWCEVLPIKVSHILLGRPWLFDRXV 425

Query: 959  KHDGKESTYSFNFNKKQIVLRPLSSEAMNNKRATKDKRRNQEETTSNSR*EIV*ERSKGG 780
            +HDG E+TY+   N  + +LRP+   +   K    D+    ++  S  + E   + +K  
Sbjct: 426  QHDGYENTYALIHNGCKTILRPMKEVSPIKK---SDENAQPKKVLSMCQFENESKETK-- 480

Query: 779  LIFMAVVKQVKNLLNTNNEDYSLELKQLLVDL*DV----APEDLPPMREIQHAIDFVLGS 612
            +IF  + ++V+     + E Y   ++++L D  D      P  LPPMR++QHAID + G+
Sbjct: 481  VIFALMARKVEESKEQDKE-YPANVRKILDDFSDFWPTELPNQLPPMRDVQHAIDLIPGA 539

Query: 611  QLPNLLGYRMSLAEHEELKRQVEELLDDGLIRESLSPCAVPALLTPKKDETWRMCCDCRT 432
             LPNL  YRM+  EH ELKRQV+ELL  G IRESLSP  VPALLTPKKD +WRMC D R 
Sbjct: 540  SLPNLPAYRMNPTEHAELKRQVDELLTKGFIRESLSPYGVPALLTPKKDGSWRMCVDSRA 599

Query: 431  INKITVKYRFPIPRLDDRLDMMTDSTIYSKIDLTK*YYRLRIRLGDEWKTAFKTKDGFYE 252
            +NKIT+KYRFPIPRLDD LDMM  S I+SKIDL   Y+++RIR GDEWKT+FKTKDG YE
Sbjct: 600  MNKITIKYRFPIPRLDDMLDMMVRSVIFSKIDLRSGYHQIRIRPGDEWKTSFKTKDGLYE 659

Query: 251  WLVMPFGLTNAPSTFMRFMTQVLQPFIGRFLVVYF*GHT*I*QDGRRTIDHLQQVMRVLR 72
            WLVM FGLTNAPSTFMR MTQVL+PFIGRF+VVYF       +      +HL+QVM  L+
Sbjct: 660  WLVMLFGLTNAPSTFMRIMTQVLKPFIGRFVVVYFDDILIYSRSCEDHEEHLKQVMCTLK 719

Query: 71   REKLYINLKK 42
             EK YINLKK
Sbjct: 720  AEKFYINLKK 729


>gb|EOX95569.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 1452

 Score =  429 bits (1103), Expect = e-117
 Identities = 239/531 (45%), Positives = 326/531 (61%), Gaps = 14/531 (2%)
 Frame = -1

Query: 1553 RNRSQMTESKR-QDANLECFNCGLRGHYAWECLKKK----NLHIGVEPNDEQETEEGKEV 1389
            +N S  + +KR  ++++ CF CG +GH ++ C ++K     L   +EP  ++  EE +E+
Sbjct: 234  QNSSGSSTNKRGSNSHIRCFTCGEKGHTSFACPQRKVNLAELGEELEPVYDEYKEEVEEI 293

Query: 1388 DFIERIXXXXXXXXXXXXXDTTFLSVVRRILSTP-KQQKKDWRGTTILQTLVCCGNVTRK 1212
            D                        VVRRI++T   ++ +DW+  +I +T V C      
Sbjct: 294  DVYPAQGESL---------------VVRRIMTTTVNEEAEDWKRRSIFRTRVVCEGKVCD 338

Query: 1211 LIIDGGSSMNVVSEATVEKLNLLTEPHPDPYKVAWIDSSG-IPVSKRCLVTFTHGTYTDS 1035
            L+IDGGS  N++S+  V KL L T  HP PYK+ W+     +PV+ +CLV FT G  +D 
Sbjct: 339  LVIDGGSMENIISKEAVNKLKLPTNKHPYPYKIGWLKKGHEVPVTTQCLVKFTMGDNSDD 398

Query: 1034 -IWCDVILMTITHILLG*PWLYDREVKHDGKESTYSFNFNKKQIVLRPLSSEAMN--NKR 864
               CDV+ M + HIL+G PWLYD ++ H  K +TYSF  N K+  L PL  E     N +
Sbjct: 399  EALCDVVPMDVGHILVGRPWLYDHDMVHKTKPNTYSFYKNNKRYTLYPLREETKKSANHK 458

Query: 863  ATKDKRRNQEETTSNSR*EIV*ERSKGGLIFMAVVKQVKNLLNTNNEDYSLELKQLLVDL 684
             +K  R    E           E S+ G+++  V K +K+   + +  Y  E++QLL + 
Sbjct: 459  ISKITRYLSAENFEA-------EGSEMGIMYALVTKHLKSDQMSKSPQYPTEIQQLLKEF 511

Query: 683  *DVAPEDLP----PMREIQHAIDFVLGSQLPNLLGYRMSLAEHEELKRQVEELLDDGLIR 516
             ++  EDLP    P+R IQHAID V G+ LPNL  YRM   +  E++RQVEEL + GL+R
Sbjct: 512  GELFNEDLPKSLPPLRSIQHAIDLVPGAALPNLPAYRMPPMQRAEVQRQVEELFEKGLVR 571

Query: 515  ESLSPCAVPALLTPKKDETWRMCCDCRTINKITVKYRFPIPRLDDRLDMMTDSTIYSKID 336
            ES SPCA PALL PKKD +WRMC D R INKIT+KYRFPIPRLD+ LD +  S ++SKID
Sbjct: 572  ESKSPCACPALLAPKKDGSWRMCVDSRAINKITIKYRFPIPRLDEMLDQLVGSRVFSKID 631

Query: 335  LTK*YYRLRIRLGDEWKTAFKTKDGFYEWLVMPFGLTNAPSTFMRFMTQVLQPFIGRFLV 156
            L   Y+++R+R GDEWKTAFKT DG +EWLVMPFGL+NAPSTFMR M +VL+PF+  F+V
Sbjct: 632  LKSGYHQIRMRDGDEWKTAFKTPDGLFEWLVMPFGLSNAPSTFMRVMAEVLKPFLNSFVV 691

Query: 155  VYF*GHT*I*QDGRRTIDHLQQVMRVLRREKLYINLKKCSFMCSSVVFLGF 3
            VYF           + + HL+QV+ VL++E+LYINLKKCSFM   VVFLGF
Sbjct: 692  VYFDDILIYSHTKEKHLKHLRQVLEVLQKEQLYINLKKCSFMQPEVVFLGF 742


>emb|CAN71532.1| hypothetical protein VITISV_018180 [Vitis vinifera]
          Length = 1323

 Score =  427 bits (1099), Expect = e-117
 Identities = 234/480 (48%), Positives = 299/480 (62%), Gaps = 8/480 (1%)
 Frame = -1

Query: 1562 MADRNRSQMTES-----KRQDANLECFNCGLRGHYAWECLKKKNLHIGVE-PNDEQETEE 1401
            +A +N ++ T S     ++ D    CF CG  GHYA  C   K LH  VE P  E E+  
Sbjct: 205  VAHKNGNKGTNSMSNGDRKVDVTPLCFKCGGHGHYAVVC-PTKGLHFRVEEPESELESYP 263

Query: 1400 GKEVDFIERIXXXXXXXXXXXXXDTTFLSVVRRILSTPKQQ-KKDWRGTTILQTLVCCGN 1224
             +E  + E                 +   VVR +L+ PK + +KDWR T+I QT + C  
Sbjct: 264  KEEETYNEDEVSEECDYYDGMTEGHSL--VVRPLLTVPKVKGEKDWRXTSIFQTRISCQG 321

Query: 1223 VTRKLIIDGGSSMNVVSEATVEKLNLLTEPHPDPYKVAWIDSSGIPVSKRCLVTFTHGT- 1047
                +IIDGGSS+N+ S+  VEKLNL TE HP+P++VAW++ + IP S RCL TF  G  
Sbjct: 322  RLCTMIIDGGSSLNIASQELVEKLNLKTERHPNPFRVAWVNDTSIPXSFRCLXTFLFGKD 381

Query: 1046 YTDSIWCDVILMTITHILLG*PWLYDREVKHDGKESTYSFNFNKKQIVLRPLSSEAMNNK 867
            + + +WC+V+ + ++HILLG PWL+DR V+HDG E+TY+   N ++ +LRP     M   
Sbjct: 382  FEEFVWCEVLPIKVSHILLGRPWLFDRRVQHDGYENTYALIHNXRKKILRP-----MKEV 436

Query: 866  RATKDKRRNQEETTSNSR*EIV*ERSKGGLIFMAVVKQVKNLLNTNNEDYSLELKQLLVD 687
               K    N +     +  +   E  +  +IF  + ++V+     + E Y   L      
Sbjct: 437  PPIKKSNENAQPKKVLTMCQFENESKETKVIFALMARKVEEFKEQDKE-YPANL------ 489

Query: 686  L*DVAPEDLPPMREIQHAIDFVLGSQLPNLLGYRMSLAEHEELKRQVEELLDDGLIRESL 507
                 P  LPPMR++QHAID + G+ LPNL  YRM+  EH ELKRQV+ELL    IRESL
Sbjct: 490  -----PNQLPPMRDVQHAIDLIPGASLPNLXAYRMNPTEHXELKRQVDELLTKCFIRESL 544

Query: 506  SPCAVPALLTPKKDETWRMCCDCRTINKITVKYRFPIPRLDDRLDMMTDSTIYSKIDLTK 327
            SPC VP LLTPKKD +WRMC D R INKIT KY+FPIPRLDD LDMM  S I+SKIDL  
Sbjct: 545  SPCGVPTLLTPKKDGSWRMCVDSRAINKITTKYQFPIPRLDDMLDMMVGSVIFSKIDLRS 604

Query: 326  *YYRLRIRLGDEWKTAFKTKDGFYEWLVMPFGLTNAPSTFMRFMTQVLQPFIGRFLVVYF 147
             Y+++R RLGDEWKT+FKTKDG YEWLVMPFGLTNAPSTFMR MTQVL+PFIGRF VVYF
Sbjct: 605  GYHQIRXRLGDEWKTSFKTKDGLYEWLVMPFGLTNAPSTFMRIMTQVLKPFIGRFFVVYF 664


>gb|EOY19305.1| Uncharacterized protein TCM_044370 [Theobroma cacao]
          Length = 1306

 Score =  397 bits (1020), Expect = e-107
 Identities = 220/503 (43%), Positives = 306/503 (60%), Gaps = 7/503 (1%)
 Frame = -1

Query: 1517 DANLECFNCGLRGHYAWECLKKKNLHIGVEPNDEQETEEGKEVDFIERIXXXXXXXXXXX 1338
            ++++ CF CG  GH ++   +++     +    E   +E +E++ I+             
Sbjct: 249  NSHIRCFTCGENGHTSFAGPQRRVNLAELREELEPVYDEYEEIEEIDVYPAQGESL---- 304

Query: 1337 XXDTTFLSVVRRILSTP-KQQKKDWRGTTILQTLVCCGNVTRKLIIDGGSSMNVVSEATV 1161
                    VVRR+++T   ++ +DW+  +I +T V C      L+IDGGS  N++S+  V
Sbjct: 305  --------VVRRVMTTTVNEEAEDWKRRSIFRTRVVCEGKVCDLVIDGGSMENIISKEAV 356

Query: 1160 EKLNLLTEPHPDPYKVAWIDSSG-IPVSKRCLVTFTHG-TYTDSIWCDVILMTITHILLG 987
             KL L T  HP PYK+ W+     +PV+ + LV FT G    D   CDV+ M + HIL+G
Sbjct: 357  NKLKLPTNKHPYPYKIGWLKKGHEVPVTTQYLVKFTMGDNLDDEALCDVVPMDVGHILVG 416

Query: 986  *PWLYDREVKHDGKESTYSFNFNKKQIVLRPLSSEAMNNKRATKDKRRNQEETTSNSR*E 807
             PWLYD ++ H  + +TYSF  + K+    PL  E    K++   K        S    E
Sbjct: 417  RPWLYDHDMVHKTEPNTYSFYNDNKRYTSYPLKEET---KKSANSKINKITGYLSVENFE 473

Query: 806  IV*ERSKGGLIFMAVVKQVKNLLNTNNEDYSLELKQLLVDL*DVAPEDLP----PMREIQ 639
               E S+ G+++  V K +K+     +  Y  E++QLL +  ++  EDLP    P+R IQ
Sbjct: 474  A--EGSEMGIMYALVTKHLKSDQMGKSPQYPTEIQQLLKEFGELFNEDLPKSLPPLRSIQ 531

Query: 638  HAIDFVLGSQLPNLLGYRMSLAEHEELKRQVEELLDDGLIRESLSPCAVPALLTPKKDET 459
            HAID V G+ LPNL  YRM   +  E++RQVEELL+ GL+RES SPCA PALL PKKD +
Sbjct: 532  HAIDLVPGAALPNLPAYRMPPMQRVEVQRQVEELLEKGLVRESKSPCACPALLAPKKDGS 591

Query: 458  WRMCCDCRTINKITVKYRFPIPRLDDRLDMMTDSTIYSKIDLTK*YYRLRIRLGDEWKTA 279
            WRMC D R INKIT+KYRFPIPRLD+ LD +  S ++SKIDL   Y+++R+R GDEWKTA
Sbjct: 592  WRMCVDSRAINKITIKYRFPIPRLDEMLDQLVGSRVFSKIDLKSEYHQIRMRDGDEWKTA 651

Query: 278  FKTKDGFYEWLVMPFGLTNAPSTFMRFMTQVLQPFIGRFLVVYF*GHT*I*QDGRRTIDH 99
            FKT DG +EWLVMPFGL+NAPSTFMR M +VL+PF+  F+VVYF           + + H
Sbjct: 652  FKTPDGLFEWLVMPFGLSNAPSTFMRVMAEVLKPFLNSFVVVYFDDILIYSHTKEKHLKH 711

Query: 98   LQQVMRVLRREKLYINLKKCSFM 30
            L+QV+ VL++E+LYINLKKCSFM
Sbjct: 712  LRQVLEVLQKEQLYINLKKCSFM 734



 Score = 61.6 bits (148), Expect = 2e-06
 Identities = 32/93 (34%), Positives = 51/93 (54%), Gaps = 1/93 (1%)
 Frame = -2

Query: 2125 FTDWFVTLKRFFDW*DMSDERKVRYTVMKLVGQAQIWWSGKEFDLQLAGNYSV-TWEEMK 1949
            + DW  +L+ +F+W  M++ RKV +  +KL G A  W    E          + TWE MK
Sbjct: 53   YLDWEASLENYFEWKPMAENRKVLFVKLKLKGTALQWLKRVEEQRARQSKLKISTWEHMK 112

Query: 1948 LELKRKNLLRYYQQELFDELTNLRQRSMTVIEY 1850
             +L+++ L   Y  EL+++   L+Q +MTV EY
Sbjct: 113  SKLRKQFLPADYTMELYEKFHCLKQNNMTVEEY 145


>gb|EMJ22494.1| hypothetical protein PRUPE_ppa024499mg, partial [Prunus persica]
          Length = 1364

 Score =  395 bits (1014), Expect = e-107
 Identities = 223/490 (45%), Positives = 295/490 (60%), Gaps = 2/490 (0%)
 Frame = -1

Query: 1508 LECFNCGLRGHYAWECLKKKNLHIGVEPNDEQETEEGKEVDFIERIXXXXXXXXXXXXXD 1329
            +ECF+C  +GH A  C  ++ L I    +D  + E    VD +E +              
Sbjct: 287  IECFHCHAKGHIASRC-PQRTLTISASTDDHCDVEI---VDPLEGVYDPEIDDCFDDDIL 342

Query: 1328 TTFLSVVRRILSTPKQQKKDWRGTTILQTLVCCGNVTRKLIIDGGSSMNVVSEATVEKLN 1149
               +SV+R I S        W+ T+I  T V C N T KL+ID GS+MNV+S++ V +LN
Sbjct: 343  HQ-VSVMRCIYSA-LALLDSWKRTSIFHTYVPCNNQTCKLVIDSGSTMNVISKSAVTRLN 400

Query: 1148 LLTEPHPDPYKVAWIDSSGIPVSKRCLVTFTHGTYTDSIWCDVILMTITHILLG*PWLYD 969
            L  EPHP P+ VAW+D + +PV++RCLV+   GT  + I+ D++ M + H+LLG PWLYD
Sbjct: 401  LKPEPHPHPFHVAWVDKTKLPVTERCLVSLKLGTCDEDIYLDLLPMNVAHVLLGRPWLYD 460

Query: 968  REVKHDGKESTYSFNFNKKQIVLRPLSSEAMNNKRATKDKRRNQEETTSNSR*EIV*--E 795
              V++ G+E+TY+F    K I LRP +      K        +Q    S  +  ++   E
Sbjct: 461  HCVQNCGRENTYTFQHEGKSITLRPANPAIKPTKTNITTSSPSQTGNVSGHQLALLSYGE 520

Query: 794  RSKGGLIFMAVVKQVKNLLNTNNEDYSLELKQLLVDL*DVAPEDLPPMREIQHAIDFVLG 615
              K  +      +Q + L    NE   + L  L        P +LPPMR+IQHAID V G
Sbjct: 521  FEKEKISAAPSYQQPEPLHQLLNEFSDVMLDDL--------PNELPPMRDIQHAIDLVPG 572

Query: 614  SQLPNLLGYRMSLAEHEELKRQVEELLDDGLIRESLSPCAVPALLTPKKDETWRMCCDCR 435
            SQL NL  YRM+ +E  EL  Q++ LLD G IR SLS CAVP LLTPKKD +WRMC D R
Sbjct: 573  SQLLNLPHYRMNSSERAELNTQIQGLLDKGFIRHSLSSCAVPVLLTPKKDGSWRMCVDSR 632

Query: 434  TINKITVKYRFPIPRLDDRLDMMTDSTIYSKIDLTK*YYRLRIRLGDEWKTAFKTKDGFY 255
             INKITVKYRFPIPRL+  L+ +  S  +SKIDL   Y+++RIR GDEWKTAFKT DG Y
Sbjct: 633  AINKITVKYRFPIPRLEAMLEELAGSKWFSKIDLRSGYHQIRIREGDEWKTAFKTPDGLY 692

Query: 254  EWLVMPFGLTNAPSTFMRFMTQVLQPFIGRFLVVYF*GHT*I*QDGRRTIDHLQQVMRVL 75
            EWLVMPFG++NAPSTFMR MT VL+P+IG+FLVVYF             + HL+ +  +L
Sbjct: 693  EWLVMPFGMSNAPSTFMRVMTHVLRPYIGKFLVVYFDDILIYSHSKEDHLQHLRTIFHML 752

Query: 74   RREKLYINLK 45
            R+EKL++NLK
Sbjct: 753  RQEKLFVNLK 762


>gb|AAV88076.1| putative retrotransposon polyprotein [Ipomoea batatas]
          Length = 1358

 Score =  390 bits (1002), Expect = e-105
 Identities = 236/556 (42%), Positives = 312/556 (56%), Gaps = 47/556 (8%)
 Frame = -1

Query: 1529 SKRQDANLECFNCGLRGHYAWECLK-KKNLHIGVEPNDEQETEEGKEVDFI---ERIXXX 1362
            SK++ + + C+ C  RGHYA EC   KK L  G +  +     E  + +     ER    
Sbjct: 308  SKQKVSTVTCYRCQGRGHYARECPNTKKILTTGKDEREYMSANESDDEELEPIGERQKDD 367

Query: 1361 XXXXXXXXXXDTTFLSVVRRILSTPKQQKKDWRGTTILQTLVCCGNVTRKLIIDGGSSMN 1182
                         F  VV + LST     ++ +   I          T   IIDGGS  N
Sbjct: 368  HSEEEVQEDDALHFNCVVHKALSTLVVLDQEEQRENIFYGKCKIPGATCSFIIDGGSCTN 427

Query: 1181 VVSEATVEKLNLLTEPHPDPYKVAWIDSSG-IPVSKRCLVTFTHGTYTDSIWCDVILMTI 1005
            V+SE  V  + + T  HP PYK+ W++  G + V K+ L++ + G Y D + CDVI M  
Sbjct: 428  VISEDVVNAMKIPTIQHPQPYKLQWLNDDGELKVHKQALISISIGKYQDDVLCDVIPMHA 487

Query: 1004 THILLG*PWLYDREVKHDGKESTYSFNFNKKQIVLRPLSSEAMNNKRATKDKRRNQ---- 837
             HILLG PW YDR+  H GK + Y+ +   K+  L PL+ + + N +    K R +    
Sbjct: 488  CHILLGRPWQYDRDTLHHGKTNKYTIHKGGKKYTLTPLAPKEVYNLQVQSKKLREELAQK 547

Query: 836  -----EETTSNSR*EIV*ERS--KGGLIFMAVVKQVKNLLNTNNE--------------- 723
                 +ETTS  +  I  E+   K G+      +   NLL T  E               
Sbjct: 548  AKEAMKETTSGKQNTIAHEKKQRKEGMK-KDTTQSSHNLLMTKREVEQALRRGEGVFLLY 606

Query: 722  --DYSL----------ELKQLLVDL*DVAPEDLP----PMREIQHAIDFVLGSQLPNLLG 591
              D+ L          ++  LL +  DV PE+LP    P+R I+H ID + G+ LPN   
Sbjct: 607  PIDFCLNVIKSEIIPSDVSALLSEFADVFPEELPKGLPPIRGIEHQIDLIPGASLPNRPA 666

Query: 590  YRMSLAEHEELKRQVEELLDDGLIRESLSPCAVPALLTPKKDETWRMCCDCRTINKITVK 411
            YR +  E +E++RQV+ELL  G I+ESLSPCAVP LL PKKD TWRMC DCR IN ITVK
Sbjct: 667  YRTNPDEAKEIQRQVDELLQAGFIQESLSPCAVPVLLVPKKDGTWRMCVDCRAINNITVK 726

Query: 410  YRFPIPRLDDRLDMMTDSTIYSKIDLTK*YYRLRIRLGDEWKTAFKTKDGFYEWLVMPFG 231
            YR+PIPRLDD LD +  + I+SKIDL + Y+++R++ GDEWKTAFKTK+G YEWLVMPFG
Sbjct: 727  YRYPIPRLDDMLDELHGAKIFSKIDLRRGYHQIRMQKGDEWKTAFKTKNGLYEWLVMPFG 786

Query: 230  LTNAPSTFMRFMTQVLQPFIGRFLVVYF*GHT*I*QDGRRTIDHLQQVMRVLRREKLYIN 51
            LTNAPSTFMR M  VL+ FIG+F+VVYF       +D ++ I HL++V  VLRRE+LY N
Sbjct: 787  LTNAPSTFMRLMNHVLRNFIGKFVVVYFDDILIYSKDPQKHIIHLKEVFLVLRREQLYAN 846

Query: 50   LKKCSFMCSSVVFLGF 3
            L+KC F   SVVFLGF
Sbjct: 847  LEKCYFGVESVVFLGF 862


>gb|ABE60891.1| putative polyprotein [Oryza sativa Japonica Group]
          Length = 1713

 Score =  387 bits (995), Expect = e-104
 Identities = 224/563 (39%), Positives = 324/563 (57%), Gaps = 44/563 (7%)
 Frame = -1

Query: 1559 ADRNRSQMTESKRQDANLECFNCGLRGHYAWECLKKKNLHIG----VEPNDEQETEEGKE 1392
            A    S    S  + + ++CF CG RGH A EC   + + +      E   E+E E+ +E
Sbjct: 369  AANTSSTSVGSSTKSSGIQCFKCGGRGHVARECPNNRTIVVNDQGEYESTSEEEQEDSEE 428

Query: 1391 VDFIERIXXXXXXXXXXXXXDTTFLSVVRRILSTPKQQKKDWRGTTILQTLVCCGNVTRK 1212
             + +E+              ++    VV +ILS      ++ +   + QT     +   K
Sbjct: 429  ENNLEK---------DICEFESGAALVVTQILSVQMSDAENGQRHNLFQTRAKVQDKVVK 479

Query: 1211 LIIDGGSSMNVVSEATVEKLNLLTEPHPDPYKVAWIDSSG-IPVSKRCLVTFTHGTYTDS 1035
            +IIDGGS  N+ S+  VEKL L    HP PY V W+++SG I +++R  V F  G Y D+
Sbjct: 480  VIIDGGSCHNLASKEMVEKLGLKLLKHPHPYHVQWLNNSGSIKIAQRVKVPFKIGEYIDT 539

Query: 1034 IWCDVILMTITHILLG*PWLYDREVKHDGKESTYSFNFNKKQIVLRPLS----------- 888
            + CDV  MT+ H+LLG PW YDR   H G+ + Y+  +  K+++L+P++           
Sbjct: 540  MECDVAPMTVCHMLLGRPWQYDRSSLHCGRTNQYTIKWKGKELILKPMTPQQILAEHLQK 599

Query: 887  SEAMNNKRATKDKRRN---QEETTSNSR*EIV*ERSKG---GLIFMAVVKQVKN------ 744
            S  + N+ A + ++ N     ++ S S    + +  K     L+ +A   ++++      
Sbjct: 600  SSEVRNESAKEGQKNNLSAPHKSVSESHKPNMRDNKKREGENLVMIATKSEMRDVRRNPE 659

Query: 743  -----------LLNTNN-EDYSLELKQLLVDL*DVAPED----LPPMREIQHAIDFVLGS 612
                       LL+ N+       + ++L +  DV PE+    LPP+R I+H ID + G+
Sbjct: 660  QVLFILVCKDTLLSANDLTSVPSVVARVLQEYEDVFPEETPVGLPPLRGIEHQIDLIPGA 719

Query: 611  QLPNLLGYRMSLAEHEELKRQVEELLDDGLIRESLSPCAVPALLTPKKDETWRMCCDCRT 432
             LPN   YR +  E +E++RQV+ LLD G +RESLSPCAVP +L PKKD +WRMC DCR 
Sbjct: 720  TLPNRPAYRTNPEETKEIQRQVQALLDKGYVRESLSPCAVPVILVPKKDGSWRMCVDCRA 779

Query: 431  INKITVKYRFPIPRLDDRLDMMTDSTIYSKIDLTK*YYRLRIRLGDEWKTAFKTKDGFYE 252
            IN ITV+YR PIPRLDD LD ++ S I+SKIDL   ++++R+++GDEWKTAFKTK G YE
Sbjct: 780  INNITVRYRHPIPRLDDMLDELSGSMIFSKIDLRSGFHQIRMKIGDEWKTAFKTKFGLYE 839

Query: 251  WLVMPFGLTNAPSTFMRFMTQVLQPFIGRFLVVYF*GHT*I*QDGRRTIDHLQQVMRVLR 72
            WLVMPFGLTNAPSTFMR M  VL+ FIG+F+VVYF       +     + H+QQV+ VLR
Sbjct: 840  WLVMPFGLTNAPSTFMRLMNHVLRAFIGKFVVVYFDDILIYSKTLEEHVAHIQQVLDVLR 899

Query: 71   REKLYINLKKCSFMCSSVVFLGF 3
            +E+LY NL+KC+F    VVFLGF
Sbjct: 900  KEQLYANLEKCTFCTDQVVFLGF 922



 Score = 73.6 bits (179), Expect = 4e-10
 Identities = 36/112 (32%), Positives = 64/112 (57%), Gaps = 1/112 (0%)
 Frame = -2

Query: 2182 YEKCAIDVPNFDGKIDPRAFTDWFVTLKRFFDW*DMSDERKVRYTVMKLVGQAQIWWSGK 2003
            + K    +P F+G  DP  +  W + + + F   + S+ +KV    ++  G A IWW   
Sbjct: 137  FGKLKFTMPKFEGGSDPEVYLTWELKVDKIFRLHNYSERKKVAMAALEFDGYALIWWEQM 196

Query: 2002 EFDLQLAGNYSV-TWEEMKLELKRKNLLRYYQQELFDELTNLRQRSMTVIEY 1850
              + + AG   V +W EMK E++ + + ++Y+++LFD+L NL+Q S++V EY
Sbjct: 197  LNEREEAGQGDVRSWAEMKREMRARFVPKHYRRDLFDKLQNLKQGSLSVDEY 248


>gb|ADP20179.1| gag-pol polyprotein [Silene latifolia]
          Length = 1475

 Score =  377 bits (969), Expect = e-101
 Identities = 221/536 (41%), Positives = 308/536 (57%), Gaps = 18/536 (3%)
 Frame = -1

Query: 1556 DRNRSQMTESKRQDANLECFNCGLRGHYAWECLKKKNL------HIGVEP----NDEQET 1407
            D+ ++  T  K+     +C+ C   GH+A EC  K+ L      H G +     ++E E 
Sbjct: 291  DKGKAAETSQKKTMPLKKCYQCQGYGHFAKECPTKRALSSFEVVHWGDDEILVCDEEVEG 350

Query: 1406 EEGKEVDFIERIXXXXXXXXXXXXXDTTFLSVVR-RILST-PKQQKKDWRGTTILQTLVC 1233
             + +E D +                    LS+V  R++ T P+  + D R          
Sbjct: 351  TDHEEDDVV---------------MPDAGLSLVTWRVMHTQPQPLEMDQRQQIFRSRCTI 395

Query: 1232 CGNVTRKLIIDGGSSMNVVSEATVEKLNLLTEPHPDPYKVAWIDSSG-IPVSKRCLVTFT 1056
             G V   LIIDGGS  NV S   +EKL+L T+ HP PYK+ W++    + V K+CLVTF+
Sbjct: 396  KGRVCN-LIIDGGSCTNVASSTLIEKLSLPTQDHPSPYKLRWLNKGAEVRVDKQCLVTFS 454

Query: 1055 HG-TYTDSIWCDVILMTITHILLG*PWLYDREVKHDGKESTYSFNFNKKQIVLRPLSSEA 879
             G  Y+D   CDV+ M   H+LLG PW +DR+  H G+++TY+F F  ++++L PL    
Sbjct: 455  IGKNYSDEALCDVLPMDACHLLLGRPWEFDRDSVHHGRDNTYTFKFRSRKVILTPLPPVL 514

Query: 878  MNNKRATKDKRRNQEETTSNSR*EIV*ERSKGGLIFMAVVKQVKNLLNTNNEDYSLELKQ 699
             +            +E    +  E++ E      ++  + K V   +   N     E+++
Sbjct: 515  KHT--TPPSMLEPSKEVLLINEAEMLQELKGDEDVYALIAKDV---VFGQNVSLPKEVQE 569

Query: 698  LLVDL*DVAPEDLP----PMREIQHAIDFVLGSQLPNLLGYRMSLAEHEELKRQVEELLD 531
            LL    DV P +LP    P+R I+H IDF+ G+ LPN   YR      +EL++Q+ EL+ 
Sbjct: 570  LLQSYEDVFPNELPSGLPPLRGIEHQIDFIPGATLPNKAAYRSDPKATQELQQQIGELVS 629

Query: 530  DGLIRESLSPCAVPALLTPKKDETWRMCCDCRTINKITVKYRFPIPRLDDRLDMMTDSTI 351
             G +RESLSPC+VPALL PKKD +WRMC D R IN IT+KYRFPIPRLDD LD ++ + +
Sbjct: 630  KGFVRESLSPCSVPALLVPKKDGSWRMCTDSRAINNITIKYRFPIPRLDDILDELSGAQL 689

Query: 350  YSKIDLTK*YYRLRIRLGDEWKTAFKTKDGFYEWLVMPFGLTNAPSTFMRFMTQVLQPFI 171
            +SKIDL + Y+++RI+ GDEWKTAFKTK G YEWLVMPFGL+NAPSTFMR MT+VL+P++
Sbjct: 690  FSKIDLRQGYHQVRIKEGDEWKTAFKTKHGLYEWLVMPFGLSNAPSTFMRLMTEVLRPYL 749

Query: 170  GRFLVVYF*GHT*I*QDGRRTIDHLQQVMRVLRREKLYINLKKCSFMCSSVVFLGF 3
            GRF+VVYF             + HLQ +   LR  KLY  L+KCSFM + V FLGF
Sbjct: 750  GRFVVVYFDDILVYSPSKEEHLKHLQVLFETLREHKLYGKLEKCSFMQNEVQFLGF 805



 Score = 66.6 bits (161), Expect = 5e-08
 Identities = 30/107 (28%), Positives = 60/107 (56%), Gaps = 1/107 (0%)
 Frame = -2

Query: 2167 IDVPNFDGKIDPRAFTDWFVTLKRFFDW*DMSDERKVRYTVMKLVGQAQIWWSGKEFDLQ 1988
            +++P+F G ++P    DWF T++R F++   SD +  +  ++KL G A +W+   +   +
Sbjct: 90   VEIPDFHGSLNPEDLLDWFRTIERVFEFKGYSDGKAFKVAILKLKGYASLWYENLKNQRR 149

Query: 1987 LAGNYSV-TWEEMKLELKRKNLLRYYQQELFDELTNLRQRSMTVIEY 1850
              G   + +W ++K +L  K + + Y Q++F +LT L+Q    +  Y
Sbjct: 150  RDGKEPIKSWLKLKKKLNEKFIPKEYTQDIFIKLTQLKQDQQPLESY 196


>gb|AAM94350.1| gag-pol polyprotein [Zea mays]
          Length = 1618

 Score =  376 bits (965), Expect = e-101
 Identities = 200/464 (43%), Positives = 280/464 (60%), Gaps = 27/464 (5%)
 Frame = -1

Query: 1313 VVRRILSTPKQQKKDWRGTTILQTLVCCGNVTRKLIIDGGSSMNVVSEATVEKLNLLTEP 1134
            +V+R+LS   ++ +  +  T+ QT       + +LIIDGGS  N+ S   VEKL L T+P
Sbjct: 462  IVQRVLSAQMEKAEQNQRHTLFQTKCVIKERSCRLIIDGGSCNNLASSDMVEKLALTTKP 521

Query: 1133 HPDPYKVAWIDSSG-IPVSKRCLVTFTHGTYTDSIWCDVILMTITHILLG*PWLYDREVK 957
            HP PY + W+++SG + V+K   + F  G+Y D + CDV+ M   +ILLG PW +D +  
Sbjct: 522  HPHPYHIQWLNNSGKVKVTKLVRINFAIGSYRDVVDCDVVPMDACNILLGRPWQFDSDCM 581

Query: 956  HDGKESTYSFNFNKKQIVLRPLSSEAMNNKRATKDKRRNQEETTSNSR*EIV*ERSKG-- 783
            H G+ + YS   + K+I+L P+S EA+      K     + +T +N   ++V     G  
Sbjct: 582  HHGRSNQYSLIHHDKKIILLPMSPEAIVRDDVAK---ATKAKTENNKNIKVVGNNKDGIK 638

Query: 782  --GLIFMAVVKQVKNLLNTNNEDYSLELKQLLVDL*DVA--------------------- 672
              G   +A    V  L  +    Y+L  K  L+ + D+                      
Sbjct: 639  LKGHCLLATKTDVNELFASTTVAYALVCKDALISIQDMQHSLPPVITNILQEYSDVFPSE 698

Query: 671  -PEDLPPMREIQHAIDFVLGSQLPNLLGYRMSLAEHEELKRQVEELLDDGLIRESLSPCA 495
             PE LPP+R I+H ID + G+ LPN   YR +  E +E++RQV+ELLD G +RESLSPCA
Sbjct: 699  IPEGLPPIRGIEHQIDLIPGASLPNRAPYRTNPEETKEIQRQVQELLDKGYVRESLSPCA 758

Query: 494  VPALLTPKKDETWRMCCDCRTINKITVKYRFPIPRLDDRLDMMTDSTIYSKIDLTK*YYR 315
            VP +L PKKD TWRMC DCR IN IT++YR PIPRLDD LD ++ + ++SK+DL   Y++
Sbjct: 759  VPVILVPKKDGTWRMCVDCRAINNITIRYRHPIPRLDDMLDELSGAIVFSKVDLRSGYHQ 818

Query: 314  LRIRLGDEWKTAFKTKDGFYEWLVMPFGLTNAPSTFMRFMTQVLQPFIGRFLVVYF*GHT 135
            +R++LGDEWKTAFKTK G YEWLVMPFGLTNAPSTFMR M +VL+ FIG+F+VVYF    
Sbjct: 819  IRMKLGDEWKTAFKTKFGLYEWLVMPFGLTNAPSTFMRLMNEVLRAFIGKFVVVYFDDIL 878

Query: 134  *I*QDGRRTIDHLQQVMRVLRREKLYINLKKCSFMCSSVVFLGF 3
               +     +DH++ V   LR  +L+ NL+KC+F    V FLG+
Sbjct: 879  IYSKSMDEHVDHMRAVFNALRDARLFGNLEKCTFCTDRVSFLGY 922



 Score = 60.8 bits (146), Expect = 3e-06
 Identities = 30/111 (27%), Positives = 55/111 (49%)
 Frame = -2

Query: 2182 YEKCAIDVPNFDGKIDPRAFTDWFVTLKRFFDW*DMSDERKVRYTVMKLVGQAQIWWSGK 2003
            + K    +P FDGK DP A+  W + + + F   +  +  +VR    +    A +WW   
Sbjct: 144  FSKVKFKIPPFDGKYDPDAYITWEIAVDQKFACHEFPENARVRAATSEFTEFASVWWI-- 201

Query: 2002 EFDLQLAGNYSVTWEEMKLELKRKNLLRYYQQELFDELTNLRQRSMTVIEY 1850
            E   +   N   TW+ +K  ++ + +  YY +++ ++L  LRQ + +V EY
Sbjct: 202  EHGKKNPNNMPQTWDALKRVMRARFVPSYYARDMLNKLQQLRQGTKSVEEY 252


>gb|AAX95495.1| Retrotransposon gag protein, putative [Oryza sativa Japonica Group]
          Length = 1739

 Score =  375 bits (964), Expect = e-101
 Identities = 214/560 (38%), Positives = 317/560 (56%), Gaps = 11/560 (1%)
 Frame = -1

Query: 1649 SGRLQTPENSQPTLEALTRQLQQPTISLIMADRNRSQMTESKRQDANLECFNCGLRGHYA 1470
            +GR  +P ++  T  A        +++        +    S  +  +++C  C   GH  
Sbjct: 701  AGRTASPSSTPTTSRAAPPPSSDKSVTKAAQPAPSASSMVSTGRMRDVQCHRCKGFGHVQ 760

Query: 1469 WECLKKKNLHIGVEPNDEQETEEGKEVDFIERIXXXXXXXXXXXXXDTTFLS------VV 1308
             +C  K+ L   V+ + E  +    + D +  +                  +      +V
Sbjct: 761  RDCPSKRVLV--VKNDGEYSSASDFDDDTLALLAADHADNEPPEEHIGAAFADHYESLIV 818

Query: 1307 RRILSTPKQQKKDWRGTTILQTLVCCGNVTRKLIIDGGSSMNVVSEATVEKLNLLTEPHP 1128
            +R+LS   ++ +  +  T+ QT         ++IIDGGS  N+ S   VEKL L T+PHP
Sbjct: 819  QRVLSAQMEKAEQNQRHTLFQTKCVLKERCCRMIIDGGSCNNLASSEMVEKLALSTKPHP 878

Query: 1127 DPYKVAWIDSSG-IPVSKRCLVTFTHGTYTDSIWCDVILMTITHILLG*PWLYDREVKHD 951
             PY + W+++SG + V+K   + F  G Y D + CDV+ M   +ILLG PW +DR+  H 
Sbjct: 879  HPYYIQWLNNSGKVKVTKLVHINFAIGNYHDVVECDVVPMQACNILLGRPWQFDRDSMHH 938

Query: 950  GKESTYSFNFNKKQIVLRPLSSEAM---NNKRATKDKRRNQEETTSNSR*-EIV*ERSKG 783
            G+ + YSF ++ K+IVL P+SSE +   +  +A K K  + ++  S+ +  E +  + K 
Sbjct: 939  GRSNQYSFLYHDKKIVLHPMSSEDILRDDVAKAAKSKCESDKKAQSDGKKPETINLKPK- 997

Query: 782  GLIFMAVVKQVKNLLNTNNEDYSLELKQLLVDL*DVAPEDLPPMREIQHAIDFVLGSQLP 603
                +A    +  L+ + +  Y+LE   +        P  LPP+R I+H ID + G+ LP
Sbjct: 998  --CLLATKSDITELIASPSVAYALEYSDVFPK---EVPPGLPPVRGIEHQIDLIPGASLP 1052

Query: 602  NLLGYRMSLAEHEELKRQVEELLDDGLIRESLSPCAVPALLTPKKDETWRMCCDCRTINK 423
            N   YR +  E +E++RQV ELLD G +RESLSPCAVP +L PKKD +WRMC DCR IN 
Sbjct: 1053 NRAPYRTNPEETKEIQRQVHELLDKGYVRESLSPCAVPVILVPKKDGSWRMCVDCRAINN 1112

Query: 422  ITVKYRFPIPRLDDRLDMMTDSTIYSKIDLTK*YYRLRIRLGDEWKTAFKTKDGFYEWLV 243
            IT++YR PIPRLDD LD ++ S ++SK+DL   Y+++R++LGDEWKT FKTK G YEWLV
Sbjct: 1113 ITIRYRHPIPRLDDMLDELSGSIVFSKVDLRSGYHQIRMKLGDEWKTTFKTKFGLYEWLV 1172

Query: 242  MPFGLTNAPSTFMRFMTQVLQPFIGRFLVVYF*GHT*I*QDGRRTIDHLQQVMRVLRREK 63
            MPFGLTNAPSTFMR M +VL+PFIG+F+VVYF       +      +HL+ V   LR  +
Sbjct: 1173 MPFGLTNAPSTFMRLMNEVLRPFIGKFVVVYFDDILIYSKSMGEHFNHLRAVFNALRDAR 1232

Query: 62   LYINLKKCSFMCSSVVFLGF 3
            L+ NL+KC+F    V FLG+
Sbjct: 1233 LFGNLEKCTFCTDRVSFLGY 1252



 Score = 64.7 bits (156), Expect = 2e-07
 Identities = 32/111 (28%), Positives = 55/111 (49%)
 Frame = -2

Query: 2182 YEKCAIDVPNFDGKIDPRAFTDWFVTLKRFFDW*DMSDERKVRYTVMKLVGQAQIWWSGK 2003
            + K    +P FDGK DP AF  W + + + F + +  +  +VR    +    A +WW   
Sbjct: 508  FSKIKFKIPPFDGKYDPDAFLSWEIAVDQKFAYHEFPENTRVRAATSEFTDFASVWWI-- 565

Query: 2002 EFDLQLAGNYSVTWEEMKLELKRKNLLRYYQQELFDELTNLRQRSMTVIEY 1850
            E   +   N   TW+ +K  ++ + +  YY ++L + L  LRQ + +V EY
Sbjct: 566  EHGKKNPNNMPQTWDALKRVMRARFVPSYYARDLLNRLQQLRQGAKSVEEY 616


>gb|AAX96717.1| retrotransposon protein, putative, Ty3-gypsy sub-class [Oryza sativa
            Japonica Group] gi|108864301|gb|ABA93040.2|
            retrotransposon protein, putative, Ty3-gypsy subclass
            [Oryza sativa Japonica Group]
          Length = 1748

 Score =  375 bits (964), Expect = e-101
 Identities = 214/560 (38%), Positives = 317/560 (56%), Gaps = 11/560 (1%)
 Frame = -1

Query: 1649 SGRLQTPENSQPTLEALTRQLQQPTISLIMADRNRSQMTESKRQDANLECFNCGLRGHYA 1470
            +GR  +P ++  T  A        +++        +    S  +  +++C  C   GH  
Sbjct: 710  AGRTASPSSTPTTSRAAPPPSSDKSVTKAAQPAPSASSMVSTGRMRDVQCHRCKGFGHVQ 769

Query: 1469 WECLKKKNLHIGVEPNDEQETEEGKEVDFIERIXXXXXXXXXXXXXDTTFLS------VV 1308
             +C  K+ L   V+ + E  +    + D +  +                  +      +V
Sbjct: 770  RDCPSKRVLV--VKNDGEYSSASDFDDDTLALLAADHADNEPPEEHIGAAFADHYESLIV 827

Query: 1307 RRILSTPKQQKKDWRGTTILQTLVCCGNVTRKLIIDGGSSMNVVSEATVEKLNLLTEPHP 1128
            +R+LS   ++ +  +  T+ QT         ++IIDGGS  N+ S   VEKL L T+PHP
Sbjct: 828  QRVLSAQMEKAEQNQRHTLFQTKCVLKERCCRMIIDGGSCNNLASSEMVEKLALSTKPHP 887

Query: 1127 DPYKVAWIDSSG-IPVSKRCLVTFTHGTYTDSIWCDVILMTITHILLG*PWLYDREVKHD 951
             PY + W+++SG + V+K   + F  G Y D + CDV+ M   +ILLG PW +DR+  H 
Sbjct: 888  HPYYIQWLNNSGKVKVTKLVHINFAIGNYHDVVECDVVPMQACNILLGRPWQFDRDSMHH 947

Query: 950  GKESTYSFNFNKKQIVLRPLSSEAM---NNKRATKDKRRNQEETTSNSR*-EIV*ERSKG 783
            G+ + YSF ++ K+IVL P+SSE +   +  +A K K  + ++  S+ +  E +  + K 
Sbjct: 948  GRSNQYSFLYHDKKIVLHPMSSEDILRDDVAKAAKSKCESDKKAQSDGKKPETINLKPK- 1006

Query: 782  GLIFMAVVKQVKNLLNTNNEDYSLELKQLLVDL*DVAPEDLPPMREIQHAIDFVLGSQLP 603
                +A    +  L+ + +  Y+LE   +        P  LPP+R I+H ID + G+ LP
Sbjct: 1007 --CLLATKSDITELIASPSVAYALEYSDVFPK---EVPPGLPPVRGIEHQIDLIPGASLP 1061

Query: 602  NLLGYRMSLAEHEELKRQVEELLDDGLIRESLSPCAVPALLTPKKDETWRMCCDCRTINK 423
            N   YR +  E +E++RQV ELLD G +RESLSPCAVP +L PKKD +WRMC DCR IN 
Sbjct: 1062 NRAPYRTNPEETKEIQRQVHELLDKGYVRESLSPCAVPVILVPKKDGSWRMCVDCRAINN 1121

Query: 422  ITVKYRFPIPRLDDRLDMMTDSTIYSKIDLTK*YYRLRIRLGDEWKTAFKTKDGFYEWLV 243
            IT++YR PIPRLDD LD ++ S ++SK+DL   Y+++R++LGDEWKT FKTK G YEWLV
Sbjct: 1122 ITIRYRHPIPRLDDMLDELSGSIVFSKVDLRSGYHQIRMKLGDEWKTTFKTKFGLYEWLV 1181

Query: 242  MPFGLTNAPSTFMRFMTQVLQPFIGRFLVVYF*GHT*I*QDGRRTIDHLQQVMRVLRREK 63
            MPFGLTNAPSTFMR M +VL+PFIG+F+VVYF       +      +HL+ V   LR  +
Sbjct: 1182 MPFGLTNAPSTFMRLMNEVLRPFIGKFVVVYFDDILIYSKSMGEHFNHLRAVFNALRDAR 1241

Query: 62   LYINLKKCSFMCSSVVFLGF 3
            L+ NL+KC+F    V FLG+
Sbjct: 1242 LFGNLEKCTFCTDRVSFLGY 1261



 Score = 64.7 bits (156), Expect = 2e-07
 Identities = 32/111 (28%), Positives = 55/111 (49%)
 Frame = -2

Query: 2182 YEKCAIDVPNFDGKIDPRAFTDWFVTLKRFFDW*DMSDERKVRYTVMKLVGQAQIWWSGK 2003
            + K    +P FDGK DP AF  W + + + F + +  +  +VR    +    A +WW   
Sbjct: 517  FSKIKFKIPPFDGKYDPDAFLSWEIAVDQKFAYHEFPENTRVRAATSEFTDFASVWWI-- 574

Query: 2002 EFDLQLAGNYSVTWEEMKLELKRKNLLRYYQQELFDELTNLRQRSMTVIEY 1850
            E   +   N   TW+ +K  ++ + +  YY ++L + L  LRQ + +V EY
Sbjct: 575  EHGKKNPNNMPQTWDALKRVMRARFVPSYYARDLLNRLQQLRQGAKSVEEY 625


>ref|XP_004309164.1| PREDICTED: uncharacterized protein LOC101300012 [Fragaria vesca
            subsp. vesca]
          Length = 1034

 Score =  372 bits (954), Expect = e-100
 Identities = 216/491 (43%), Positives = 294/491 (59%), Gaps = 14/491 (2%)
 Frame = -1

Query: 1433 VEPNDEQETEEGKEVDFIERIXXXXXXXXXXXXXDTTFLSVVRRILSTPKQQKKDWRGTT 1254
            +E +DEQ  EE +E + +E                  +  V +R+L + KQ+ +     +
Sbjct: 397  IEGDDEQHEEE-EEDEVVEEAEEYSGDDRE-------YNLVTQRLLCSTKQENQRH---S 445

Query: 1253 ILQTLVCCGNVTRKLIIDGGSSMNVVSEATVEKLNLLTEPHPDPYKVAWIDSS-GIPVSK 1077
            I ++          LIID GS  N VS+  VE  NLLT  H  PY + WI     + +++
Sbjct: 446  IFRSTCTIKEKPMSLIIDSGSCENFVSKKVVEHFNLLTMKHRAPYAIGWIKKGLEVRITE 505

Query: 1076 RCLVTFTHGT-YTDSIWCDVILMTITHILLG*PWLYDREVKHDGKESTYSFNFNKKQIVL 900
             C V+ + G  Y D + CDV+ M  +H+LLG PW +D    H+G+E+T SF + K  I L
Sbjct: 506  TCKVSISIGKFYQDEVECDVVDMDASHVLLGKPWQHDVNTIHNGRENTVSFIWEKHHITL 565

Query: 899  RP------LSSEAMNNKRATKDKRRNQEETTSNSR*EIV*ERSKGGLIFMAVVKQVKNLL 738
            +P      L S   +N     +     EE   ++             I+  VV++V  + 
Sbjct: 566  KPKTKPTNLVSPKESNFLIVAEPCEKVEELVKDAE-----------AIYPLVVREVM-VA 613

Query: 737  NTNNEDYSL--ELKQLLVD----L*DVAPEDLPPMREIQHAIDFVLGSQLPNLLGYRMSL 576
              N E+  +  E++QLL D    L D  P +LPPMR+IQH ID V G+ LPNL  YRMS 
Sbjct: 614  EDNKEEKKIPKEVQQLLQDFEELLADDLPNELPPMRDIQHQIDLVSGASLPNLPHYRMSP 673

Query: 575  AEHEELKRQVEELLDDGLIRESLSPCAVPALLTPKKDETWRMCCDCRTINKITVKYRFPI 396
             E+E LK ++EELL  G IRES+SPCAVP LL PKKD +WRMC D R INKIT+KYRFPI
Sbjct: 674  KENEILKEKIEELLRKGHIRESMSPCAVPVLLVPKKDRSWRMCVDSRAINKITIKYRFPI 733

Query: 395  PRLDDRLDMMTDSTIYSKIDLTK*YYRLRIRLGDEWKTAFKTKDGFYEWLVMPFGLTNAP 216
            P+L+D LD++  S ++SKIDL   Y+++RI+LGDEWKTAFK+KDG YEWLVMPFGL+NAP
Sbjct: 734  PQLEDMLDVLGGSVVFSKIDLRSGYHQIRIKLGDEWKTAFKSKDGLYEWLVMPFGLSNAP 793

Query: 215  STFMRFMTQVLQPFIGRFLVVYF*GHT*I*QDGRRTIDHLQQVMRVLRREKLYINLKKCS 36
            STFMR M QVL+P+IG  +VVYF       +     + HL++V+ VL+  KLY+NLKKCS
Sbjct: 794  STFMRVMNQVLKPYIGTCVVVYFDDILIYSKSKEEHLQHLRKVLEVLQENKLYVNLKKCS 853

Query: 35   FMCSSVVFLGF 3
            FM   ++FLG+
Sbjct: 854  FMTKKLLFLGY 864


>gb|EMJ11389.1| hypothetical protein PRUPE_ppa017790mg [Prunus persica]
          Length = 1485

 Score =  370 bits (950), Expect = 2e-99
 Identities = 216/519 (41%), Positives = 302/519 (58%), Gaps = 2/519 (0%)
 Frame = -1

Query: 1553 RNRSQMTESKRQDANLECFNCGLRGHYAWECLKKKNLHIGVEPNDEQETEEGKEVDFIER 1374
            RN+SQ   +K       C+ C   GH +  C + K  +   E ++++E +E  E D+   
Sbjct: 336  RNQSQNLYAKPMTDI--CYRCQKPGHRSNVCPELKQANFIEEADEDEENDEVGENDYA-- 391

Query: 1373 IXXXXXXXXXXXXXDTTFLSVVRRILSTPKQQKKDWRGTTILQTLVCCGNVTRKLIIDGG 1194
                                V++R+L  P+++ +     +I ++L    N    +I+D G
Sbjct: 392  -----GAEFAVEEGMEKITLVLQRVLLAPREEGQRH---SIFRSLCSIKNKVCDVIVDNG 443

Query: 1193 SSMNVVSEATVEKLNLLTEPHPDPYKVAWIDSS-GIPVSKRCLVTFTHGT-YTDSIWCDV 1020
            S  N VS+  VE L L TEPH  PY + W+     + V++ C V  + G  Y D + CDV
Sbjct: 444  SCENFVSKKLVEYLQLSTEPHVSPYSLGWVKKGPSVRVAETCRVPLSIGKHYRDEVLCDV 503

Query: 1019 ILMTITHILLG*PWLYDREVKHDGKESTYSFNFNKKQIVLRPLSSEAMNNKRATKDKRRN 840
            I M   HILLG PW +D +    G+++   F++N ++I +    +    +K + + K R+
Sbjct: 504  IDMDACHILLGRPWQFDVDATFKGRDNVILFSWNNRKIAM----TTTQPSKPSVEVKTRS 559

Query: 839  QEETTSNSR*EIV*ERSKGGLIFMAVVKQVKNLLNTNNEDYSLELKQLLVDL*DVAPEDL 660
                T  S  + + E  K       + + V+ +L+   E +S  L           P +L
Sbjct: 560  SSFLTLISNEQELNEAVKEAEGEGDIPQDVQQILSQFQELFSENL-----------PNEL 608

Query: 659  PPMREIQHAIDFVLGSQLPNLLGYRMSLAEHEELKRQVEELLDDGLIRESLSPCAVPALL 480
            PPMR+IQH ID V G+ L NL  YRMS  E++ L+ Q+EELL  G IRESLSPCAVP LL
Sbjct: 609  PPMRDIQHRIDLVPGASLQNLPHYRMSPKENDILREQIEELLRKGFIRESLSPCAVPVLL 668

Query: 479  TPKKDETWRMCCDCRTINKITVKYRFPIPRLDDRLDMMTDSTIYSKIDLTK*YYRLRIRL 300
             PKKD+TWRMC D R INKITVKYRFPIPRL+D LD+++ S ++SKIDL   Y+++RIR 
Sbjct: 669  VPKKDKTWRMCVDSRAINKITVKYRFPIPRLEDMLDVLSGSKVFSKIDLRSGYHQIRIRP 728

Query: 299  GDEWKTAFKTKDGFYEWLVMPFGLTNAPSTFMRFMTQVLQPFIGRFLVVYF*GHT*I*QD 120
            GDEWKTAFK+KDG +EWLVMPFGL+N PSTFMR M QVL+PFIG F+VVYF         
Sbjct: 729  GDEWKTAFKSKDGLFEWLVMPFGLSNTPSTFMRLMNQVLRPFIGSFVVVYFDDILIYSTT 788

Query: 119  GRRTIDHLQQVMRVLRREKLYINLKKCSFMCSSVVFLGF 3
                + HL+QV+ VLR  KL++NLKKC+F  + ++FLGF
Sbjct: 789  KEEHLVHLRQVLDVLRENKLFVNLKKCTFCTNKLLFLGF 827


>gb|EMJ08431.1| hypothetical protein PRUPE_ppa026856mg [Prunus persica]
          Length = 1493

 Score =  370 bits (949), Expect = 2e-99
 Identities = 217/528 (41%), Positives = 301/528 (57%), Gaps = 11/528 (2%)
 Frame = -1

Query: 1553 RNRSQMTESKRQDANLECFNCGLRGHYAWECLKKKNLHIGVEPNDEQETEEGKEVDFIER 1374
            RN+SQ   +K       C+ C   GH +  C ++K  +   E ++++E +E  E D+   
Sbjct: 347  RNQSQNPYAKPMTDI--CYRCQKPGHRSNVCPERKQANFIEEADEDEEKDEVGENDYA-- 402

Query: 1373 IXXXXXXXXXXXXXDTTFLSVVRRILSTPKQQKKDWRGTTILQTLVCCGNVTRKLIIDGG 1194
                                V++R+L  PK++ +      I ++L    N    +I+D G
Sbjct: 403  -----GAEFAVEEGIEKITLVLQRVLLAPKEEGQRHN---IFRSLCSIKNKVCDVIVDNG 454

Query: 1193 SSMNVVSEATVEKLNLLTEPHPDPYKVAWIDSS-GIPVSKRCLVTFTHGT-YTDSIWCDV 1020
            S  N VS+  VE L L TEPH  PY + W+     + V++ C V  + G  Y D + CDV
Sbjct: 455  SCENFVSKKLVEYLQLSTEPHVSPYSLGWVKKGPSVRVAETCRVPLSIGKHYRDDVLCDV 514

Query: 1019 ILMTITHILLG*PWLYDREVKHDGKESTYSFNFNKKQIVLRPLSSEAMNNKRATKDKRRN 840
            I M   HILLG PW +D +    G+++   F++N ++I +            AT    R 
Sbjct: 515  IDMDACHILLGRPWQFDVDATFKGRDNVILFSWNNRKIAM------------ATTQPSRK 562

Query: 839  QEETTSNSR*EIV*ERSKGGLIFMAVVKQVKNLLNTNNE-----DYSLELKQLLVDL*DV 675
            QE  +S+               F+ ++   + L     E     D   +++Q+L    ++
Sbjct: 563  QELRSSS---------------FLTLISNEQELNEAVKEAEGEGDIPQDVQQILSQFQEL 607

Query: 674  A----PEDLPPMREIQHAIDFVLGSQLPNLLGYRMSLAEHEELKRQVEELLDDGLIRESL 507
                 P +LPPMR+IQH ID V G+ LPNL  YRMS  E++ L+ Q+EELL  G IRESL
Sbjct: 608  LSENLPNELPPMRDIQHRIDLVHGASLPNLPHYRMSPKENDILREQIEELLRKGFIRESL 667

Query: 506  SPCAVPALLTPKKDETWRMCCDCRTINKITVKYRFPIPRLDDRLDMMTDSTIYSKIDLTK 327
            SPCAVP LL PKKD+TWRMC D R +NKI VKYRF IPRL+D LD+++ S ++SKIDL  
Sbjct: 668  SPCAVPVLLVPKKDKTWRMCVDSRAVNKIKVKYRFSIPRLEDILDVLSGSKVFSKIDLRS 727

Query: 326  *YYRLRIRLGDEWKTAFKTKDGFYEWLVMPFGLTNAPSTFMRFMTQVLQPFIGRFLVVYF 147
             Y+++RIR GDEWKTAFK+KDG +EWLVMPFGL+NAPSTFMR M QVL+PFIG F+VVYF
Sbjct: 728  GYHQIRIRPGDEWKTAFKSKDGLFEWLVMPFGLSNAPSTFMRLMNQVLRPFIGSFVVVYF 787

Query: 146  *GHT*I*QDGRRTIDHLQQVMRVLRREKLYINLKKCSFMCSSVVFLGF 3
                         + HL+QV+ VLR  KLY+NLKKC+F  + ++FLGF
Sbjct: 788  DDILIYSTTKEEHLVHLRQVLDVLRENKLYVNLKKCTFCTNKLLFLGF 835


>gb|EMJ00160.1| hypothetical protein PRUPE_ppa020671mg, partial [Prunus persica]
          Length = 1460

 Score =  369 bits (948), Expect = 3e-99
 Identities = 197/408 (48%), Positives = 257/408 (62%), Gaps = 10/408 (2%)
 Frame = -1

Query: 1196 GSSMNVVSEATVEKLNLLTEPHPDPYKVAWIDSSGIPVSKRCLVTFTHGTYTDSIWCDVI 1017
            GS+MNV+S++ V +LNL  EPHP P+ VAW+D + +PV++ CLV+   GT  + I+ D +
Sbjct: 436  GSTMNVISKSAVTRLNLKPEPHPHPFHVAWVDKTKLPVTEWCLVSLKLGTCDEDIYLDQL 495

Query: 1016 LMTITHILLG*PWLYDREVKHDGKESTYSFNFNKKQIVLRPLSSEAMNNKRATKDKRRNQ 837
             M + H+LLG PWLYD  V++ G+E+TY+F    K I+LRP +      K        +Q
Sbjct: 496  PMNVAHVLLGRPWLYDHRVQNCGRENTYTFQHEGKSIMLRPANPAIKPTKTNITTSSPSQ 555

Query: 836  EETTSNSR*------EIV*ERSKGGLIFMAVVKQVKNLLNTNNEDYSLELKQLLVDL*DV 675
                S  R       E   E  + G++F  V+K++    +    +    L Q L +  DV
Sbjct: 556  TGNMSGHRLALLSYGEFEKESLETGVVFALVIKEISAAPSYQQPE---PLHQFLNEFSDV 612

Query: 674  APEDLP----PMREIQHAIDFVLGSQLPNLLGYRMSLAEHEELKRQVEELLDDGLIRESL 507
             P+DLP    PMR+IQHAID V GSQLPNL  YRM+ +EH EL  Q++ LLD G IR SL
Sbjct: 613  MPDDLPNELPPMRDIQHAIDLVPGSQLPNLPHYRMNSSEHAELNTQIQGLLDKGFIRHSL 672

Query: 506  SPCAVPALLTPKKDETWRMCCDCRTINKITVKYRFPIPRLDDRLDMMTDSTIYSKIDLTK 327
            SPCAVP L TPKKD +WRMC D R INKIT           D LD +  S  +SKIDL  
Sbjct: 673  SPCAVPVLFTPKKDGSWRMCVDSRAINKIT-----------DMLDELAGSKWFSKIDLHS 721

Query: 326  *YYRLRIRLGDEWKTAFKTKDGFYEWLVMPFGLTNAPSTFMRFMTQVLQPFIGRFLVVYF 147
             Y+++RIR GDEWKTAFKT DG YEWLVMPFG++NAPSTFMR MT V +P+IG+FLVVYF
Sbjct: 722  GYHQIRIREGDEWKTAFKTPDGLYEWLVMPFGMSNAPSTFMRVMTHVFRPYIGKFLVVYF 781

Query: 146  *GHT*I*QDGRRTIDHLQQVMRVLRREKLYINLKKCSFMCSSVVFLGF 3
                         + HL+ +  +LR+EKL++NLKKCSF+   V+FLGF
Sbjct: 782  DDILIYSHSKEDHLQHLRTIFHMLRQEKLFVNLKKCSFLQEQVLFLGF 829



 Score = 79.7 bits (195), Expect(3) = 7e-16
 Identities = 35/88 (39%), Positives = 55/88 (62%), Gaps = 1/88 (1%)
 Frame = -2

Query: 2158 PNFDGKIDPRAFTDWFVTLKRFFDW*DMSDERKVRYTVMKLVGQAQIWWSGKEFDLQLAG 1979
            P+FDG+ DP  F DW   ++ +F+W DMSD +++R+  +KLVG  + +W   E  LQ  G
Sbjct: 198  PDFDGRGDPTLFVDWISAMEDYFEWDDMSDAQRIRFAKLKLVGAVKQYWKATEHHLQQLG 257

Query: 1978 NYSV-TWEEMKLELKRKNLLRYYQQELF 1898
               V  W+EMKL+L+ + L  +Y Q+ +
Sbjct: 258  QTPVILWDEMKLKLREQYLPSFYLQDYY 285



 Score = 28.5 bits (62), Expect(3) = 7e-16
 Identities = 21/91 (23%), Positives = 42/91 (46%)
 Frame = -3

Query: 1845 KFKEMKICF*GAEDSR*TLS*FKQGLKPEIWN*MLTHQVNNVDDAFQLAYMMESQKQPAK 1666
            +F E K+     E+   T+S F  GL+ +I   +   + + ++DA+  A   E+  +P +
Sbjct: 287  RFVEHKLHSALQEELAVTVSRFIHGLRIDIKREVSRSRPDVLEDAYCQALEAETYLRPQR 346

Query: 1665 RFSSQVGEATNTRKFTANIRGTNSTAATANN 1573
            R+    G+ T T +      G  +  +  +N
Sbjct: 347  RYPGYPGQPTTTNQARTTTSGLKTEFSEPSN 377



 Score = 24.6 bits (52), Expect(3) = 7e-16
 Identities = 11/38 (28%), Positives = 21/38 (55%)
 Frame = -1

Query: 1517 DANLECFNCGLRGHYAWECLKKKNLHIGVEPNDEQETE 1404
            ++++E F+C  +GH A  C  ++ L I    +D  + E
Sbjct: 391  NSHIEWFHCHAKGHIASRC-PQRTLTISASTDDHCDVE 427


>gb|AAK51582.1|AC022352_18 Putative retroelement [Oryza sativa Japonica Group]
            gi|31431012|gb|AAP52850.1| retrotransposon protein,
            putative, Ty3-gypsy subclass [Oryza sativa Japonica
            Group]
          Length = 2447

 Score =  369 bits (947), Expect = 4e-99
 Identities = 200/464 (43%), Positives = 282/464 (60%), Gaps = 27/464 (5%)
 Frame = -1

Query: 1313 VVRRILSTPKQQKKDWRGTTILQTLVCCGNVTRKLIIDGGSSMNVVSEATVEKLNLLTEP 1134
            +V+R+LS   ++ +  +  T+ QT         ++IIDGGS  N+ S   VEKL L T+P
Sbjct: 459  IVQRVLSAQMEKAEQNQRHTLFQTKCVVKERCCRMIIDGGSCNNLASSEMVEKLALSTKP 518

Query: 1133 HPDPYKVAWIDSSG-IPVSKRCLVTFTHGTYTDSIWCDVILMTITHILLG*PWLYDREVK 957
            HP PY + W+++SG   V+K   + F  G Y D + CDV+ M   +ILLG PW +DR+  
Sbjct: 519  HPHPYYIQWLNNSGKAKVTKLVHINFAIGNYHDVVECDVVPMQACNILLGRPWQFDRDSM 578

Query: 956  HDGKESTYSFNFNKKQIVLRPLSSEAM---NNKRATKDKRRNQEETTSNSR*-EIV*ERS 789
            H G+ + YSF ++ K+IVL  +S E +   +  +A K K  + ++  S+ +  E +  + 
Sbjct: 579  HHGRSNQYSFLYHDKKIVLHSMSPEDILRDDVAKAAKSKCESDKKAQSDGKKPETINLKP 638

Query: 788  KGGLIFMAVVKQVKNLLNTNNEDYSLELKQLLVDL*DV---------------------- 675
            K     +A    +  L+ + +  Y+L  K  L+ L D+                      
Sbjct: 639  K---CLLATKSDINELIASPSVAYALVCKDALISLHDMQHSLPPAVANILQEYSDVFPKE 695

Query: 674  APEDLPPMREIQHAIDFVLGSQLPNLLGYRMSLAEHEELKRQVEELLDDGLIRESLSPCA 495
             P  LPP+R I+H ID + G+ LPN   YR +  E +E++RQV ELLD G +RESLSPCA
Sbjct: 696  VPPGLPPVRGIEHQIDLIPGASLPNRAPYRTNPEETKEIQRQVHELLDKGYVRESLSPCA 755

Query: 494  VPALLTPKKDETWRMCCDCRTINKITVKYRFPIPRLDDRLDMMTDSTIYSKIDLTK*YYR 315
            VP +L PKKD +WRMC DCR IN IT++YR PIPRLDD LD ++ S ++SK+DL   Y++
Sbjct: 756  VPVILVPKKDGSWRMCVDCRAINNITIRYRHPIPRLDDMLDELSGSIVFSKVDLRSGYHQ 815

Query: 314  LRIRLGDEWKTAFKTKDGFYEWLVMPFGLTNAPSTFMRFMTQVLQPFIGRFLVVYF*GHT 135
            +R++LGDEWKTAFKTK G YEWLVMPFGLTNAPSTFMR M +VL+PFIG+F+VVYF    
Sbjct: 816  IRMKLGDEWKTAFKTKFGLYEWLVMPFGLTNAPSTFMRLMNEVLRPFIGKFVVVYFDDIL 875

Query: 134  *I*QDGRRTIDHLQQVMRVLRREKLYINLKKCSFMCSSVVFLGF 3
               +      +HL+ V   LR  +L+ NL+KC+F    V FLG+
Sbjct: 876  IYSKSMGEHFNHLRAVFNALRDARLFGNLEKCTFCTDRVSFLGY 919



 Score = 68.2 bits (165), Expect = 2e-08
 Identities = 49/151 (32%), Positives = 72/151 (47%), Gaps = 5/151 (3%)
 Frame = -1

Query: 443  DCRTINKIT---VKYRFPIPRLDDRLDMMTDSTIYSKIDLTK*YYRLRIRLGDEWKT--A 279
            D  TI+  T   +++  PI R   R     D     ++D    +++    +GD+  T  +
Sbjct: 1563 DINTIDTSTSPHIQHDGPITRARARQLNYQDECPRGRVDA---HHKFAATIGDDRATNPS 1619

Query: 278  FKTKDGFYEWLVMPFGLTNAPSTFMRFMTQVLQPFIGRFLVVYF*GHT*I*QDGRRTIDH 99
                 G YE+ VM FGLTNAP+ FM  M +V   ++ +F+VV+        Q       H
Sbjct: 1620 GFASYGLYEFTVMSFGLTNAPAFFMNLMNKVFMEYLDKFVVVFIDDILVYSQSEEDHQHH 1679

Query: 98   LQQVMRVLRREKLYINLKKCSFMCSSVVFLG 6
            L+ V+  LR  +LY  L KC F  S V FLG
Sbjct: 1680 LRLVLGKLREHQLYAKLSKCEFWLSEVKFLG 1710



 Score = 62.0 bits (149), Expect = 1e-06
 Identities = 31/111 (27%), Positives = 54/111 (48%)
 Frame = -2

Query: 2182 YEKCAIDVPNFDGKIDPRAFTDWFVTLKRFFDW*DMSDERKVRYTVMKLVGQAQIWWSGK 2003
            + K    +P FDGK DP A+  W + + + F   +  +  +VR    +    A +WW   
Sbjct: 150  FSKIKFKIPPFDGKYDPDAYLSWEIAVDQKFACHEFPESTRVRAATSEFTDFASVWWI-- 207

Query: 2002 EFDLQLAGNYSVTWEEMKLELKRKNLLRYYQQELFDELTNLRQRSMTVIEY 1850
            E   +   N   TW+ +K  ++ + +  YY ++L + L  LRQ + +V EY
Sbjct: 208  EHGKKNPNNMPQTWDALKRVMRARFVPSYYARDLLNRLQQLRQGAKSVEEY 258


>emb|CAE04927.2| OSJNBa0017P10.4 [Oryza sativa Japonica Group]
            gi|38345441|emb|CAE03293.2| OSJNBb0046P18.9 [Oryza sativa
            Japonica Group]
          Length = 1134

 Score =  368 bits (944), Expect = 8e-99
 Identities = 200/464 (43%), Positives = 286/464 (61%), Gaps = 27/464 (5%)
 Frame = -1

Query: 1313 VVRRILSTPKQQKKDWRGTTILQTLVCCGNVTRKLIIDGGSSMNVVSEATVEKLNLLTEP 1134
            +V+R+LST  ++ +  +  T+ QT         ++IIDGGS  N+ S   VEKL L T+P
Sbjct: 642  IVQRVLSTQMEKAEQNQRHTLFQTKCVVKERCCRMIIDGGSCNNLASSEMVEKLALSTKP 701

Query: 1133 HPDPYKVAWIDSSG-IPVSKRCLVTFTHGTYTDSIWCDVILMTITHILLG*PWLYDREVK 957
            HP PY + W+++SG   V+K   + F  G Y D + CDV+ M   +ILLG PW +D++  
Sbjct: 702  HPHPYYIQWLNNSGKAKVTKLVHINFAIGNYHDVVECDVVPMQACNILLGRPWQFDKDSL 761

Query: 956  HDGKESTYSFNFNKKQIVLRPLSSEAMNNK---RATKDKRRNQEETTSNSR*-EIV*ERS 789
            H G+ + YSF ++ K+IVL P+SSE + +    +A K K  + ++  S+ +  E +  + 
Sbjct: 762  HHGRSNQYSFLYHDKKIVLHPMSSEDILHDDVAKAAKSKCESDKKAQSDGKKPETINLKP 821

Query: 788  KGGLIFMAVVKQVKNLLNTNNEDYSLELKQLLVDL*D------------------VAPED 663
            K     +A    +  L+ + +  Y+L  K  L+ L D                  V P++
Sbjct: 822  K---CLLATKSDINELIASPSVAYALVCKDALISLHDMQHSLPPAIANILQEYSDVFPKE 878

Query: 662  LPP----MREIQHAIDFVLGSQLPNLLGYRMSLAEHEELKRQVEELLDDGLIRESLSPCA 495
            +PP    +  I+H ID +LG+ LPN   YR +  E +E++RQV ELLD G +RESLSPCA
Sbjct: 879  VPPGLLPVHGIEHQIDLILGASLPNRAPYRTNPEETKEIQRQVHELLDKGYVRESLSPCA 938

Query: 494  VPALLTPKKDETWRMCCDCRTINKITVKYRFPIPRLDDRLDMMTDSTIYSKIDLTK*YYR 315
            VP +L PKKD +WRMC DCR IN IT++YR PIPRLDD LD ++ S ++SK+DL   Y++
Sbjct: 939  VPVILVPKKDGSWRMCVDCRAINNITIRYRHPIPRLDDMLDELSGSIVFSKVDLRSGYHQ 998

Query: 314  LRIRLGDEWKTAFKTKDGFYEWLVMPFGLTNAPSTFMRFMTQVLQPFIGRFLVVYF*GHT 135
            +R++LGDEWKTAFKTK G YEWLVMPFGLTNAP+TFMR M +VL+PFI +F+VVYF    
Sbjct: 999  IRMKLGDEWKTAFKTKFGLYEWLVMPFGLTNAPNTFMRLMNEVLRPFIEKFVVVYFDDIL 1058

Query: 134  *I*QDGRRTIDHLQQVMRVLRREKLYINLKKCSFMCSSVVFLGF 3
               +      +HL+ V   LR  +L+ NL+KC+F    V FLG+
Sbjct: 1059 IYSKSMGEHFNHLRAVFNALRDARLFGNLEKCTFCIDRVSFLGY 1102



 Score = 63.9 bits (154), Expect = 3e-07
 Identities = 32/111 (28%), Positives = 55/111 (49%)
 Frame = -2

Query: 2182 YEKCAIDVPNFDGKIDPRAFTDWFVTLKRFFDW*DMSDERKVRYTVMKLVGQAQIWWSGK 2003
            + K    +P FDGK DP A+  W + + + F   +  +  +VR    +    A +WW   
Sbjct: 333  FSKIKFKIPPFDGKYDPDAYLSWEIAVDQKFACHEFPENTRVRAATSEFTDFASVWWI-- 390

Query: 2002 EFDLQLAGNYSVTWEEMKLELKRKNLLRYYQQELFDELTNLRQRSMTVIEY 1850
            E   +   N S TW+ +K  ++ + +  YY ++L + L  LRQ + +V EY
Sbjct: 391  EHGKKNPNNMSQTWDALKRVMRARFVPSYYARDLLNRLQQLRQGAKSVEEY 441


>gb|ADP20180.1| mutant gag-pol polyprotein [Pisum sativum]
          Length = 1004

 Score =  367 bits (943), Expect = 1e-98
 Identities = 226/586 (38%), Positives = 323/586 (55%), Gaps = 39/586 (6%)
 Frame = -1

Query: 1643 RLQTPENSQPTLEALTRQLQQPTISLIMADRNRSQMTESKRQDAN--LECFNCGLRGHYA 1470
            R  T  NSQ   +   ++    +    + ++ ++  + S     N  ++CF C  +GH A
Sbjct: 246  RNSTTFNSQSWKDKTKKEGASSSKEATVENKGKTITSSSSSVSTNKSVKCFKCQGQGHIA 305

Query: 1469 WECLKKKNLHIGVEPNDEQETEEGKEVD--FIERIXXXXXXXXXXXXXDTTFLSVVRRIL 1296
             +C  K+ + +  E N+E   EE  + D  F E I                 L +VRR+L
Sbjct: 306  SQCPTKRTMLM--EENEEIVEEEDGDYDKEFGEEIPSGD-------------LLMVRRML 350

Query: 1295 STPKQQKKDWRGTTILQTLVCCGNVTRKLIIDGGSSMNVVSEATVEKLNLLTEPHPDPYK 1116
             +  +++   +   +             LIIDGGS  NV S   V +L L T+PHP PYK
Sbjct: 351  GSQIKEEDTSQRENLFHIRCFVQGKVCSLIIDGGSCTNVASTRLVSRLKLETKPHPKPYK 410

Query: 1115 VAWIDSS-GIPVSKRCLVTFTHGTYTDSIWCDVILMTITHILLG*PWLYDREVKHDGKES 939
            + W++ S  + V+K+  + F  G Y D + CDV+ M  +H+LLG PW +DR+  HDG  +
Sbjct: 411  LQWLNESVEMLVNKQVEICFKIGKYEDVVLCDVVPMEASHLLLGRPWQFDRKANHDGYSN 470

Query: 938  TYSFNFNKKQIVLRPLS-----------SEAMNNKRATKDKRRNQEETTSNSR*EIV*ER 792
             YSF ++ ++I L PL+           SE  + +R  K+K + + E   N + E     
Sbjct: 471  KYSFMYHDQKINLVPLNPSEVREDQRKMSEKYDQERKEKEKEKEKNEKKKNDKRE----- 525

Query: 791  SKGGLIFMAVVKQVKNLLNTNNEDYSLELKQ-------------------LLVDL*DVAP 669
             K  LI  A ++ VK  + ++   Y L  K+                   LL +  ++ P
Sbjct: 526  KKQSLI--AKIRDVKEAIVSHQPLYLLFCKEVPLLTTISNEKKLPNCIESLLQEFKELFP 583

Query: 668  ED----LPPMREIQHAIDFVLGSQLPNLLGYRMSLAEHEELKRQVEELLDDGLIRESLSP 501
            E+    LPP+R I+H ID   G+ LPN   YR +  + +E++RQV EL+  G +RESLSP
Sbjct: 584  EEVPSGLPPIRGIEHHIDLNPGASLPNRPAYRSNPQQTQEIQRQVAELISKGWVRESLSP 643

Query: 500  CAVPALLTPKKDETWRMCCDCRTINKITVKYRFPIPRLDDRLDMMTDSTIYSKIDLTK*Y 321
            CAVP +L PKKD +WRMC DCR I+ IT+KYR PIPRLDD LD +  + ++SKIDL   Y
Sbjct: 644  CAVPIILVPKKDGSWRMCTDCRAISNITIKYRHPIPRLDDLLDELFGACLFSKIDLKSGY 703

Query: 320  YRLRIRLGDEWKTAFKTKDGFYEWLVMPFGLTNAPSTFMRFMTQVLQPFIGRFLVVYF*G 141
            +++RIR GDEWKTAFKTK G YEW+VMPFGLTNAPSTFMR M  VL+ F+G+F+VVYF  
Sbjct: 704  HQIRIREGDEWKTAFKTKFGLYEWMVMPFGLTNAPSTFMRLMNHVLREFLGKFVVVYFDD 763

Query: 140  HT*I*QDGRRTIDHLQQVMRVLRREKLYINLKKCSFMCSSVVFLGF 3
                 ++      HL+ V++VLR E LY NL+KC F    V+FLGF
Sbjct: 764  ILIYSKNLDDHCIHLKAVLQVLRYENLYANLEKCVFCTDHVIFLGF 809



 Score = 67.0 bits (162), Expect = 4e-08
 Identities = 37/107 (34%), Positives = 58/107 (54%), Gaps = 1/107 (0%)
 Frame = -2

Query: 2167 IDVPNFDGKIDPRAFTDWFVTLKRFFDW*DMSDERKVRYTVMKLVGQAQIWWSGKEFDLQ 1988
            I VP F GK DP A+ +W   L++ F+  + S+  KV+   ++    A +WW     D +
Sbjct: 74   IKVPTFVGKSDPEAYLEWETKLEQIFNCHNYSNLEKVQVASIEFKEYALVWWDQLTKDRR 133

Query: 1987 LAGNYSV-TWEEMKLELKRKNLLRYYQQELFDELTNLRQRSMTVIEY 1850
                  + TWEEMK  ++R+ +  YY +EL ++L  L Q S +V EY
Sbjct: 134  RYAERPIDTWEEMKRIMRRRFVPSYYHRELHNKLQRLTQGSKSVEEY 180


>gb|AAQ56338.1| putative gag-pol polyprotein [Oryza sativa Japonica Group]
          Length = 1619

 Score =  366 bits (939), Expect = 3e-98
 Identities = 217/596 (36%), Positives = 321/596 (53%), Gaps = 33/596 (5%)
 Frame = -1

Query: 1691 WKARNNQLRDFLLKSGRLQTPENSQPTLEALTRQLQQPTISLIMADRNRSQMTESKRQDA 1512
            W+ R   L      +GR  +P ++  T  A        + +        +    S  +  
Sbjct: 314  WQTRTTPL------AGRTASPSSTPTTSRAAPPPSSDKSATKAAQPAPSASSMASTGRMR 367

Query: 1511 NLECFNCGLRGHYAWECLKKKNLHIGVEPNDEQETEEGKEVDFIERIXXXXXXXXXXXXX 1332
            +++C  C   GH   +C  K+ L   V+ + E  +    + D +  +             
Sbjct: 368  DVQCHRCKGFGHVQRDCPSKRVLV--VKNDGEYSSASDFDDDTLALLAADHADNEPPEEH 425

Query: 1331 DTTFLS------VVRRILSTPKQQKKDWRGTTILQTLVCCGNVTRKLIIDGGSSMNVVSE 1170
                 +      +V+R+LS   ++ +  +  T+ QT         ++IIDGGS  N+ S 
Sbjct: 426  IGAAFADHYESLIVQRVLSAQMEKAEQNQRHTLFQTKCVVKERCCRMIIDGGSCNNLASS 485

Query: 1169 ATVEKLNLLTEPHPDPYKVAWIDSSG-IPVSKRCLVTFTHGTYTDSIWCDVILMTITHIL 993
              VEKL L T+PHP  Y + W+++SG   V+K   + F  G Y D + CDV+ M   +IL
Sbjct: 486  EMVEKLALSTKPHPHSYYIQWLNNSGKAKVTKLVHINFAIGNYHDVVECDVVPMQACNIL 545

Query: 992  LG*PWLYDREVKHDGKESTYSFNFNKKQIVLRPLSSEAM---NNKRATKDKRRNQEETTS 822
            LG PW +DR+  H G+ + YSF ++ K+IVL P+S E +   +  +A K K  + ++  S
Sbjct: 546  LGRPWQFDRDSMHHGRSNQYSFLYHDKKIVLHPMSPEDILRDDVAKAAKSKCESDKKAQS 605

Query: 821  NSR*-EIV*ERSKGGLIFMAVVKQVKNLLNTNNEDYSLELKQLLVDL*DV---------- 675
            + +  E +  + K     +A    +  L+ + +  Y+L  K  L+ L D+          
Sbjct: 606  DGKKPETINLKPK---CLLATKSDINELIASPSVAYALVCKDALISLHDMQHSLPPAVAN 662

Query: 674  ------------APEDLPPMREIQHAIDFVLGSQLPNLLGYRMSLAEHEELKRQVEELLD 531
                         P  LPP+R I+H ID + G+ LPN   YR +  E +E++RQV ELLD
Sbjct: 663  ILQEYSDVFPKEVPPGLPPVRGIEHQIDLIPGASLPNRAPYRTNPEETKEIQRQVHELLD 722

Query: 530  DGLIRESLSPCAVPALLTPKKDETWRMCCDCRTINKITVKYRFPIPRLDDRLDMMTDSTI 351
             G +RESLSPCAVP +L PKKD +WRMC DCR IN IT++YR PIPRLDD LD ++ S +
Sbjct: 723  KGYVRESLSPCAVPVILVPKKDGSWRMCVDCRAINNITIRYRHPIPRLDDMLDELSGSIV 782

Query: 350  YSKIDLTK*YYRLRIRLGDEWKTAFKTKDGFYEWLVMPFGLTNAPSTFMRFMTQVLQPFI 171
            +SK++L   Y+++ ++LGDEWKTAFKTK G YEWLVMPFGLTNAPSTFMR M +VL+PFI
Sbjct: 783  FSKVELRSGYHQIHMKLGDEWKTAFKTKFGLYEWLVMPFGLTNAPSTFMRLMNEVLRPFI 842

Query: 170  GRFLVVYF*GHT*I*QDGRRTIDHLQQVMRVLRREKLYINLKKCSFMCSSVVFLGF 3
            G+F+VVYF       +      +HL+ V   LR  +L+ NL+KC+F    V FLG+
Sbjct: 843  GKFVVVYFDDILIYSKSMGEHFNHLRAVFNALRDARLFGNLEKCTFCTDRVSFLGY 898

