BLASTX nr result

ID: Sinomenium22_contig00046101 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00046101
         (1017 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007206823.1| hypothetical protein PRUPE_ppa025991mg [Prun...   105   4e-29
ref|XP_007221295.1| hypothetical protein PRUPE_ppa024499mg, part...    97   5e-28
gb|ABF97027.1| retrotransposon protein, putative, Ty3-gypsy subc...   108   8e-27
gb|AAK91332.1|AC090441_14 Putative gag-pol polyprotein [Oryza sa...   107   1e-26
dbj|BAA89466.1| gag-pol polyprotein [Oryza sativa Indica Group]       106   2e-26
gb|AAQ56388.1| putative gag-pol polyprotein [Oryza sativa Japoni...   105   7e-26
ref|XP_007207981.1| hypothetical protein PRUPE_ppa015715mg, part...   105   2e-25
emb|CAN79389.1| hypothetical protein VITISV_004909 [Vitis vinifera]   109   8e-24
emb|CAN77045.1| hypothetical protein VITISV_035256 [Vitis vinifera]   115   2e-23
emb|CAN79339.1| hypothetical protein VITISV_044312 [Vitis vinifera]   106   3e-23
emb|CAN71532.1| hypothetical protein VITISV_018180 [Vitis vinifera]   103   3e-22
gb|AAK51582.1|AC022352_18 Putative retroelement [Oryza sativa Ja...   111   5e-22
ref|XP_007221749.1| hypothetical protein PRUPE_ppb022800mg, part...    97   7e-22
gb|AAK94516.1| gag-pol polyprotein [Hordeum vulgare]                  110   1e-21
ref|XP_007048683.1| DNA/RNA polymerases superfamily protein [The...    95   2e-21
gb|AAM94350.1| gag-pol polyprotein [Zea mays]                         109   2e-21
gb|ABI96971.1| putative gag-pol polyprotein [Triticum monococcum...   109   2e-21
ref|XP_007051412.1| DNA/RNA polymerases superfamily protein [The...    95   3e-21
gb|AAK94517.1| gag-pol polyprotein [Hordeum vulgare]                  108   3e-21
ref|XP_007019612.1| Uncharacterized protein TCM_035725 [Theobrom...    94   4e-21

>ref|XP_007206823.1| hypothetical protein PRUPE_ppa025991mg [Prunus persica]
            gi|462402465|gb|EMJ08022.1| hypothetical protein
            PRUPE_ppa025991mg [Prunus persica]
          Length = 1274

 Score =  105 bits (263), Expect(3) = 4e-29
 Identities = 57/142 (40%), Positives = 77/142 (54%), Gaps = 2/142 (1%)
 Frame = +2

Query: 185  FSRSMWKMVNMRLDFSSAYDPQTDGQTEVINLSLGELFHSLVGDNVKRWDQKLCQTEFAH 364
            F +++WK+    L FSSA+ PQTDGQTEV+N SLG+L H LVGD    WD  L   EF +
Sbjct: 962  FWKTLWKLFGTTLKFSSAFHPQTDGQTEVVNRSLGDLLHCLVGDKPGNWDLLLPVAEFTY 1021

Query: 365  NHAMNCITGFSPFTIVYGIVLRCPLGSVTDHIQLHC--KAEDFVALLQQIHQTTPDHLVA 538
            N+++N  TG SPF +V+G   R P+  V   +       A  F   ++Q+H      +  
Sbjct: 1022 NNSVNRSTGKSPFEVVHGFSPRSPVDLVALPVAARSSDSATSFAEHIRQLHDDVRRQISM 1081

Query: 539  ATRKYKAAVDKRRHHVEFEVGD 604
             T  YK A +  R   EF  GD
Sbjct: 1082 HTDTYKLAANAHRRQQEFREGD 1103



 Score = 38.5 bits (88), Expect(3) = 4e-29
 Identities = 19/60 (31%), Positives = 34/60 (56%)
 Frame = +1

Query: 598  G*FVWAILTKDCFPTHEYNKLVTRKIGLVEIIEKINPNAYRLKLLSTVRALRMSSISNIS 777
            G FV   +  + FP H + KL  R +G   II+K+  NAY ++L + +    + ++S++S
Sbjct: 1102 GDFVMVRVCPERFPKHSFKKLHARSMGPYRIIKKLGSNAYLIELPANMHISPIFNVSDLS 1161



 Score = 32.0 bits (71), Expect(3) = 4e-29
 Identities = 15/26 (57%), Positives = 19/26 (73%)
 Frame = +3

Query: 111  FWEIYRLHGLPTSIVFYRDTRFLSHF 188
            F E+ RLHGL  SIV  RD +F+S+F
Sbjct: 937  FKEVVRLHGLLVSIVSDRDFKFVSYF 962


>ref|XP_007221295.1| hypothetical protein PRUPE_ppa024499mg, partial [Prunus persica]
            gi|462417929|gb|EMJ22494.1| hypothetical protein
            PRUPE_ppa024499mg, partial [Prunus persica]
          Length = 1364

 Score = 96.7 bits (239), Expect(3) = 5e-28
 Identities = 51/117 (43%), Positives = 69/117 (58%)
 Frame = +2

Query: 185  FSRSMWKMVNMRLDFSSAYDPQTDGQTEVINLSLGELFHSLVGDNVKRWDQKLCQTEFAH 364
            F +++WK+    L FSSA+ PQTDGQTEV+N SLG+L   LVGD    WD  L   EFA+
Sbjct: 1144 FWKTLWKLFGTSLKFSSAFHPQTDGQTEVVNRSLGDLLRCLVGDKQGNWDLILPVAEFAY 1203

Query: 365  NHAMNCITGFSPFTIVYGIVLRCPLGSVTDHIQLHCKAEDFVALLQQIHQTTPDHLV 535
            N++ N  TG SPF IVYG++ R P+      I   C +E      + IH    D+++
Sbjct: 1204 NNSANRTTGKSPFEIVYGVMPRPPIDLAPLPIDA-CPSESATTFAEHIHFQEGDYVM 1259



 Score = 39.3 bits (90), Expect(3) = 5e-28
 Identities = 23/63 (36%), Positives = 34/63 (53%)
 Frame = +1

Query: 598  G*FVWAILTKDCFPTHEYNKLVTRKIGLVEIIEKINPNAYRLKLLSTVRALRMSSISNIS 777
            G +V   +  + FP H + KL  R +GL  I+ K+  NAY ++L S V    +S I N+S
Sbjct: 1255 GDYVMVRVCPERFPKHSFKKLHARSMGLYRILRKLGANAYLVELPSDV---HISPIFNVS 1311

Query: 778  VRF 786
              F
Sbjct: 1312 DLF 1314



 Score = 36.6 bits (83), Expect(3) = 5e-28
 Identities = 16/26 (61%), Positives = 20/26 (76%)
 Frame = +3

Query: 111  FWEIYRLHGLPTSIVFYRDTRFLSHF 188
            F E+ RLHGLP SIV  RD +F+S+F
Sbjct: 1119 FKEVIRLHGLPVSIVSDRDVKFVSYF 1144


>gb|ABF97027.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa
            Japonica Group]
          Length = 889

 Score =  108 bits (269), Expect(2) = 8e-27
 Identities = 57/163 (34%), Positives = 93/163 (57%), Gaps = 2/163 (1%)
 Frame = +2

Query: 185  FSRSMWKMVNMRLDFSSAYDPQTDGQTEVINLSLGELFHSLVGDNVKRWDQKLCQTEFAH 364
            F R++W  +  +L FS+   PQTDGQTEV+N +L  +  +++  N+K W++ L   EFA+
Sbjct: 681  FWRTLWAKLGTKLLFSTTCHPQTDGQTEVVNRTLSTMLRAVLKKNIKMWEECLPHVEFAY 740

Query: 365  NHAMNCITGFSPFTIVYGIVLRCPLG--SVTDHIQLHCKAEDFVALLQQIHQTTPDHLVA 538
            NH+ +  T   PF IVYG++ R P+    +    +++  A+    L+ ++H+TT +++  
Sbjct: 741  NHSQHSTTKKCPFEIVYGLLPRAPIDLLPLPTSERVNFDAKHRAELMLKLHETTKENIER 800

Query: 539  ATRKYKAAVDKRRHHVEFEVGDLFGLS*LRTVFPPMNTTNLLP 667
               KYK A  K + HV FE GDL  L   +  FP +  + L P
Sbjct: 801  MNIKYKLAGSKGKKHVAFEPGDLVWLHLRKDRFPNLRKSKLQP 843



 Score = 40.0 bits (92), Expect(2) = 8e-27
 Identities = 18/26 (69%), Positives = 21/26 (80%)
 Frame = +3

Query: 111 FWEIYRLHGLPTSIVFYRDTRFLSHF 188
           F EI RLHG+P +IV  RDT+FLSHF
Sbjct: 656 FREIVRLHGVPNTIVSDRDTKFLSHF 681


>gb|AAK91332.1|AC090441_14 Putative gag-pol polyprotein [Oryza sativa Japonica Group]
            gi|15217296|gb|AAK92640.1|AC079634_1 Putative
            retroelement [Oryza sativa Japonica Group]
            gi|31431373|gb|AAP53161.1| retrotransposon protein,
            putative, Ty3-gypsy subclass [Oryza sativa Japonica
            Group]
          Length = 1708

 Score =  107 bits (267), Expect(2) = 1e-26
 Identities = 57/163 (34%), Positives = 93/163 (57%), Gaps = 2/163 (1%)
 Frame = +2

Query: 185  FSRSMWKMVNMRLDFSSAYDPQTDGQTEVINLSLGELFHSLVGDNVKRWDQKLCQTEFAH 364
            F R++W  +  +L FS+   PQTDGQTEV+N +L  +  +++  N+K W++ L   EFA+
Sbjct: 1368 FWRTLWAKLGTKLLFSTTCHPQTDGQTEVVNRTLSTMLRAVLKKNIKMWEECLPHVEFAY 1427

Query: 365  NHAMNCITGFSPFTIVYGIVLRCPLG--SVTDHIQLHCKAEDFVALLQQIHQTTPDHLVA 538
            N + +  T   PF IVYG++ R P+    +    +++  A+    L+ ++H+TT +++  
Sbjct: 1428 NRSQHSTTKKCPFEIVYGLLPRAPIDLLPLPTSERVNFDAKYHAELMLKLHETTKENIER 1487

Query: 539  ATRKYKAAVDKRRHHVEFEVGDLFGLS*LRTVFPPMNTTNLLP 667
               KYK A  K + HV FE GDL  L   +  FP +  + LLP
Sbjct: 1488 MNIKYKLAGSKGKKHVAFEPGDLVWLHLRKDRFPNLRKSKLLP 1530



 Score = 40.0 bits (92), Expect(2) = 1e-26
 Identities = 18/26 (69%), Positives = 21/26 (80%)
 Frame = +3

Query: 111  FWEIYRLHGLPTSIVFYRDTRFLSHF 188
            F EI RLHG+P +IV  RDT+FLSHF
Sbjct: 1343 FREIVRLHGVPNTIVSDRDTKFLSHF 1368


>dbj|BAA89466.1| gag-pol polyprotein [Oryza sativa Indica Group]
          Length = 1587

 Score =  106 bits (265), Expect(2) = 2e-26
 Identities = 57/163 (34%), Positives = 93/163 (57%), Gaps = 2/163 (1%)
 Frame = +2

Query: 185  FSRSMWKMVNMRLDFSSAYDPQTDGQTEVINLSLGELFHSLVGDNVKRWDQKLCQTEFAH 364
            F R++W  +  +L FS+   PQTDGQTEV+N +L  +  +++  N+K W++ L   EFA+
Sbjct: 1368 FWRTLWAKLGTKLLFSTTCHPQTDGQTEVVNRTLSTMLRAVLKKNIKMWEECLPHVEFAY 1427

Query: 365  NHAMNCITGFSPFTIVYGIVLRCPLG--SVTDHIQLHCKAEDFVALLQQIHQTTPDHLVA 538
            N + +  T   PF IVYG++ R P+    +    +++  A+    L+ ++H+TT +++  
Sbjct: 1428 NRSQHSTTKKCPFEIVYGLLPRAPIDLLPLPTSERVNFDAKYRAELMLKLHETTKENIER 1487

Query: 539  ATRKYKAAVDKRRHHVEFEVGDLFGLS*LRTVFPPMNTTNLLP 667
               KYK A  K + HV FE GDL  L   +  FP +  + LLP
Sbjct: 1488 MNIKYKLAGSKGKKHVAFEPGDLVWLHLRKDRFPNLRKSKLLP 1530



 Score = 40.0 bits (92), Expect(2) = 2e-26
 Identities = 18/26 (69%), Positives = 21/26 (80%)
 Frame = +3

Query: 111  FWEIYRLHGLPTSIVFYRDTRFLSHF 188
            F EI RLHG+P +IV  RDT+FLSHF
Sbjct: 1343 FREIVRLHGVPNTIVSDRDTKFLSHF 1368


>gb|AAQ56388.1| putative gag-pol polyprotein [Oryza sativa Japonica Group]
            gi|91795218|gb|ABE60890.1| putative polyprotein [Oryza
            sativa Japonica Group]
          Length = 1616

 Score =  105 bits (261), Expect(2) = 7e-26
 Identities = 56/163 (34%), Positives = 92/163 (56%), Gaps = 2/163 (1%)
 Frame = +2

Query: 185  FSRSMWKMVNMRLDFSSAYDPQTDGQTEVINLSLGELFHSLVGDNVKRWDQKLCQTEFAH 364
            F R++W  +  +  FS+   PQTDGQTEV+N +L  +  +++  N+K W++ L   EFA+
Sbjct: 1368 FWRTLWAKLGTKFLFSTTCHPQTDGQTEVVNRTLSTMLRAVLKKNIKMWEECLPHVEFAY 1427

Query: 365  NHAMNCITGFSPFTIVYGIVLRCPLGSVTDHI--QLHCKAEDFVALLQQIHQTTPDHLVA 538
            N + +  T   PF IVYG++ R P+  +      +++  A+    L+ ++H+TT +++  
Sbjct: 1428 NRSQHSTTKKCPFEIVYGLLPRAPIDLLPHPTSERVNFDAKYRAELMLKLHETTKENIER 1487

Query: 539  ATRKYKAAVDKRRHHVEFEVGDLFGLS*LRTVFPPMNTTNLLP 667
               KYK A  K + HV FE GDL  L   +  FP +  + LLP
Sbjct: 1488 MNIKYKLAGSKGKKHVAFEPGDLVWLHLRKDRFPNLRKSKLLP 1530



 Score = 40.0 bits (92), Expect(2) = 7e-26
 Identities = 18/26 (69%), Positives = 21/26 (80%)
 Frame = +3

Query: 111  FWEIYRLHGLPTSIVFYRDTRFLSHF 188
            F EI RLHG+P +IV  RDT+FLSHF
Sbjct: 1343 FREIVRLHGVPNTIVSDRDTKFLSHF 1368


>ref|XP_007207981.1| hypothetical protein PRUPE_ppa015715mg, partial [Prunus persica]
            gi|462403623|gb|EMJ09180.1| hypothetical protein
            PRUPE_ppa015715mg, partial [Prunus persica]
          Length = 1445

 Score =  105 bits (262), Expect(2) = 2e-25
 Identities = 71/184 (38%), Positives = 91/184 (49%), Gaps = 7/184 (3%)
 Frame = +2

Query: 74   KTTDAINVAILFF-----LGNLSTSWASYFHCFLP*HAFP*PFSRSMWKMVNMRLDFSSA 238
            K TDA  VA LFF     L  L  S  S          F   F +++WK+    L FSSA
Sbjct: 1096 KNTDASYVAKLFFKEVVRLHGLPVSIVSDRDV-----KFVSYFWKTLWKLFGTTLKFSSA 1150

Query: 239  YDPQTDGQTEVINLSLGELFHSLVGDNVKRWDQKLCQTEFAHNHAMNCITGFSPFTIVYG 418
            + PQTDGQTEV+N SLG+L   LVGD    WD  L   EFA+N+++N  TG SPF +V+G
Sbjct: 1151 FHPQTDGQTEVVNRSLGDLLRCLVGDKPGNWDLLLPVAEFAYNNSVNRSTGKSPFEVVHG 1210

Query: 419  IVLRCPLGSVTDHIQLHC--KAEDFVALLQQIHQTTPDHLVAATRKYKAAVDKRRHHVEF 592
               R P+  V   +       A  F   ++Q+H      +   T  YK A +  R   EF
Sbjct: 1211 FSPRSPVDLVALPVAARTSDSATSFAEHIRQLHDDVRRQISMHTDTYKLAANAHRRQQEF 1270

Query: 593  EVGD 604
              GD
Sbjct: 1271 REGD 1274



 Score = 38.1 bits (87), Expect(2) = 2e-25
 Identities = 19/60 (31%), Positives = 34/60 (56%)
 Frame = +1

Query: 598  G*FVWAILTKDCFPTHEYNKLVTRKIGLVEIIEKINPNAYRLKLLSTVRALRMSSISNIS 777
            G FV   +  + FP H + KL  R +G   II+K+  NAY ++L + +    + ++S++S
Sbjct: 1273 GDFVMVRVCPERFPKHSFKKLHARSMGPYRIIKKLGSNAYLIELPADMHISPIFNVSDLS 1332


>emb|CAN79389.1| hypothetical protein VITISV_004909 [Vitis vinifera]
          Length = 895

 Score =  109 bits (272), Expect(2) = 8e-24
 Identities = 61/181 (33%), Positives = 94/181 (51%), Gaps = 2/181 (1%)
 Frame = +2

Query: 71   KKTTDAINVAILFFLGNLSTSWASYFHCFLP*HAFP*PFSRSMWKMVNMRLDFSSAYDPQ 250
            KK +DA  VA LFF   +          F     F   F +++W  +  +L FSS++ PQ
Sbjct: 549  KKASDASYVAALFFKEVVRLHGLPQSIVFYRDVNFMSYFWKTLWAKLGAQLKFSSSFHPQ 608

Query: 251  TDGQTEVINLSLGELFHSLVGDNVKRWDQKLCQTEFAHNHAMNCITGFSPFTIVYGIVLR 430
            TDGQTEV+N SLG L   +V D ++ WD  L Q EFA N + N   G SPF + YG+  +
Sbjct: 609  TDGQTEVVNRSLGNLLRCIVRDQLRNWDNVLPQAEFAFNSSTNRTIGHSPFEVAYGLKPK 668

Query: 431  CPLGSV--TDHIQLHCKAEDFVALLQQIHQTTPDHLVAATRKYKAAVDKRRHHVEFEVGD 604
             P+  +  +  ++     + F   ++ IH+   + +  +   YK A D  R +++F+ GD
Sbjct: 669  QPIDLIPLSTSVRTSQDGDAFARHIRDIHEKVREKIKISNENYKEAADAHRRYIQFQEGD 728

Query: 605  L 607
            L
Sbjct: 729  L 729



 Score = 28.9 bits (63), Expect(2) = 8e-24
 Identities = 20/59 (33%), Positives = 28/59 (47%)
 Frame = +1

Query: 598 G*FVWAILTKDCFPTHEYNKLVTRKIGLVEIIEKINPNAYRLKLLSTVRALRMSSISNI 774
           G  V A L  + F    Y KL  +K G   +++ +  NAY L+L S    L  S I N+
Sbjct: 727 GDLVMARLRPERFHPSTYQKLQAKKAGPFRVLKWLGENAYLLELPSN---LHFSPIFNV 782


>emb|CAN77045.1| hypothetical protein VITISV_035256 [Vitis vinifera]
          Length = 665

 Score =  115 bits (289), Expect = 2e-23
 Identities = 70/193 (36%), Positives = 103/193 (53%), Gaps = 15/193 (7%)
 Frame = +2

Query: 74   KTTDAINVAILFF-------------LGNLSTSWASYFHCFLP*HAFP*PFSRSMWKMVN 214
            KT DA++VA LFF             + +    + SYF              RS+WKM+N
Sbjct: 461  KTLDAVHVAKLFFKEIVRLHGLPKTIVSDQDAKFMSYFW-------------RSLWKMLN 507

Query: 215  MRLDFSSAYDPQTDGQTEVINLSLGELFHSLVGDNVKRWDQKLCQTEFAHNHAMNCITGF 394
             +L FSSA+ PQT+GQTEV+N SLG+L   LVG++V  WDQ L   EFA+N ++N  TG 
Sbjct: 508  TKLKFSSAFHPQTEGQTEVVNRSLGDLLRCLVGEHVSNWDQILPMAEFAYNSSVNRSTGH 567

Query: 395  SPFTIVYGIVLRCPLGSVTDHIQLH--CKAEDFVALLQQIHQTTPDHLVAATRKYKAAVD 568
            SPF IV G++ R P+  V   ++     +A+ F   +  +H+     +  +   YKA  D
Sbjct: 568  SPFEIVTGLLPRKPIDLVPLPMEARPSVEADAFSKHILDLHKDVQRKIALSNENYKAQAD 627

Query: 569  KRRHHVEFEVGDL 607
             +R   +F+  D+
Sbjct: 628  LKRKVADFKERDM 640


>emb|CAN79339.1| hypothetical protein VITISV_044312 [Vitis vinifera]
          Length = 354

 Score =  106 bits (264), Expect(2) = 3e-23
 Identities = 64/186 (34%), Positives = 95/186 (51%), Gaps = 7/186 (3%)
 Frame = +2

Query: 71  KKTTDAINVAILFF-----LGNLSTSWASYFHCFLP*HAFP*PFSRSMWKMVNMRLDFSS 235
           KK +DA  V+ LFF     L  L  S  S          F   F +++W  +  +L FSS
Sbjct: 8   KKASDASYVSALFFKEVVRLHGLPQSIVSDRDV-----KFMSYFWKTLWAKLGTQLKFSS 62

Query: 236 AYDPQTDGQTEVINLSLGELFHSLVGDNVKRWDQKLCQTEFAHNHAMNCITGFSPFTIVY 415
           ++ PQTDGQ EV+N SLG L   +V D ++ WD  L Q EFA N + N  TG+SPF + Y
Sbjct: 63  SFHPQTDGQIEVVNRSLGNLLRCIVRDQLRNWDNVLPQAEFAFNSSTNRTTGYSPFEVAY 122

Query: 416 GIVLRCPLGSVTDHIQLHCK--AEDFVALLQQIHQTTPDHLVAATRKYKAAVDKRRHHVE 589
           G+  +  +  +     +H     + F   +Q IH+   + +  +   YK A D  R +++
Sbjct: 123 GLKPKQLVDLIPLPTSVHTSQDGDAFTRHIQDIHENVREKIKISNENYKEAADAHRRYIQ 182

Query: 590 FEVGDL 607
           F+ GDL
Sbjct: 183 FQEGDL 188



 Score = 30.0 bits (66), Expect(2) = 3e-23
 Identities = 19/59 (32%), Positives = 29/59 (49%)
 Frame = +1

Query: 598 G*FVWAILTKDCFPTHEYNKLVTRKIGLVEIIEKINPNAYRLKLLSTVRALRMSSISNI 774
           G  V   L  + F    Y KL  +K G  ++++++  NAY L+L S    L  S I N+
Sbjct: 186 GDLVMVRLRPERFHPSTYQKLQAKKAGPFQVLKRLGENAYLLELPSN---LHFSPIFNV 241


>emb|CAN71532.1| hypothetical protein VITISV_018180 [Vitis vinifera]
          Length = 1323

 Score =  103 bits (258), Expect(2) = 3e-22
 Identities = 60/194 (30%), Positives = 95/194 (48%), Gaps = 15/194 (7%)
 Frame = +2

Query: 71   KKTTDAINVAILFF-------------LGNLSTSWASYFHCFLP*HAFP*PFSRSMWKMV 211
            KKT++A  V  LFF             + N    + SYF              +++W  +
Sbjct: 976  KKTSNASYVTALFFKEVVQLHGLPQSIVSNRDVKFMSYFW-------------KTLWVKL 1022

Query: 212  NMRLDFSSAYDPQTDGQTEVINLSLGELFHSLVGDNVKRWDQKLCQTEFAHNHAMNCITG 391
              +L FSS++ PQTDGQTEV+N SLG L   +V D ++ WD  L Q EFA N + N  TG
Sbjct: 1023 GTQLKFSSSFHPQTDGQTEVVNRSLGNLLRCIVRDQLRNWDNVLPQAEFAFNSSTNRTTG 1082

Query: 392  FSPFTIVYGIVLRCPLGSV--TDHIQLHCKAEDFVALLQQIHQTTPDHLVAATRKYKAAV 565
            + PF + YG+  + P+  +     ++     + F   ++ IH+   + +  +   YK A 
Sbjct: 1083 YLPFEVAYGLKPKQPVDLIPLPTSVRTSQDGDAFARHIRDIHEKVREKIKISNENYKEAX 1142

Query: 566  DKRRHHVEFEVGDL 607
            D  R +++F+ G L
Sbjct: 1143 DAHRRYIQFQEGGL 1156



 Score = 28.9 bits (63), Expect(2) = 3e-22
 Identities = 19/59 (32%), Positives = 28/59 (47%)
 Frame = +1

Query: 598  G*FVWAILTKDCFPTHEYNKLVTRKIGLVEIIEKINPNAYRLKLLSTVRALRMSSISNI 774
            G  V   L  + F    Y KL  +K G   +++++  NAY L+L S    L  S I N+
Sbjct: 1154 GGLVMVRLRPERFHPSTYQKLQAKKAGPFRVLKRLGENAYLLELPSN---LXFSPIFNV 1209


>gb|AAK51582.1|AC022352_18 Putative retroelement [Oryza sativa Japonica Group]
            gi|31431012|gb|AAP52850.1| retrotransposon protein,
            putative, Ty3-gypsy subclass [Oryza sativa Japonica
            Group]
          Length = 2447

 Score =  111 bits (277), Expect = 5e-22
 Identities = 68/213 (31%), Positives = 110/213 (51%), Gaps = 15/213 (7%)
 Frame = +2

Query: 74   KTTDAINVAILFF-------------LGNLSTSWASYFHCFLP*HAFP*PFSRSMWKMVN 214
            KT DA ++A LFF             + +  T + S+F              R++W  + 
Sbjct: 1303 KTDDASHIADLFFREIVRLHGVPNTIVSDRDTKFLSHFW-------------RTLWAKLG 1349

Query: 215  MRLDFSSAYDPQTDGQTEVINLSLGELFHSLVGDNVKRWDQKLCQTEFAHNHAMNCITGF 394
             +L FS+   PQTDGQTEV+N +L  +  +++  N+K W++ L   EFA+N +++  T  
Sbjct: 1350 TKLLFSTTCHPQTDGQTEVVNRTLSTMLRAVLKKNIKMWEECLPHIEFAYNRSLHSTTKM 1409

Query: 395  SPFTIVYGIVLRCP--LGSVTDHIQLHCKAEDFVALLQQIHQTTPDHLVAATRKYKAAVD 568
             PF IVYG++ R P  L  +    +L+  A+    L+ ++H+TT +++     KYK A D
Sbjct: 1410 CPFQIVYGLLPRAPIDLMPLPSSEKLNFDAKQRAELMLKLHETTKENIERMNAKYKFAGD 1469

Query: 569  KRRHHVEFEVGDLFGLS*LRTVFPPMNTTNLLP 667
            K R  + FE GDL  L   +  FP +  + L+P
Sbjct: 1470 KGRRELTFEPGDLVWLHLRKERFPDLRKSKLMP 1502



 Score = 79.3 bits (194), Expect(2) = 8e-16
 Identities = 49/145 (33%), Positives = 76/145 (52%), Gaps = 5/145 (3%)
 Frame = +2

Query: 185  FSRSMWKMVN----MRLDFSSAYDPQTDGQTEVINLSLGELFHSLVGDNVKRWDQKLCQT 352
            F+   WK +      RL+FS+AY PQTDGQTE +N  L ++ H+ V D  K WD+ L   
Sbjct: 2158 FTSHFWKKLQEELGTRLNFSTAYHPQTDGQTERLNQILEDMLHACVLDFGKTWDKSLPYA 2217

Query: 353  EFAHNHAMNCITGFSPFTIVYGIVLRCPLGSVTDHI-QLHCKAEDFVALLQQIHQTTPDH 529
            EF++N++       +P+  +YG   R PL  + D + +      D +   +   +T  D+
Sbjct: 2218 EFSYNNSYQASIQMAPYEALYGRKCRTPL--LWDQVGESQVFGTDILREAEAKVRTIWDN 2275

Query: 530  LVAATRKYKAAVDKRRHHVEFEVGD 604
            L  A  + K+  D RR ++EF V D
Sbjct: 2276 LKVAQSRQKSYADNRRRNLEFAVDD 2300



 Score = 32.0 bits (71), Expect(2) = 8e-16
 Identities = 16/45 (35%), Positives = 27/45 (60%), Gaps = 5/45 (11%)
 Frame = +3

Query: 69   ARRLPMRSMWLSSSFWEIY-----RLHGLPTSIVFYRDTRFLSHF 188
            AR +P+++ +  +   E+Y      LHG+P  IV  R+++F SHF
Sbjct: 2118 ARFIPVKTTYGGNKLAELYFARIVSLHGVPKKIVSDRESQFTSHF 2162


>ref|XP_007221749.1| hypothetical protein PRUPE_ppb022800mg, partial [Prunus persica]
           gi|462418685|gb|EMJ22948.1| hypothetical protein
           PRUPE_ppb022800mg, partial [Prunus persica]
          Length = 722

 Score = 97.1 bits (240), Expect(2) = 7e-22
 Identities = 62/172 (36%), Positives = 87/172 (50%)
 Frame = +2

Query: 185 FSRSMWKMVNMRLDFSSAYDPQTDGQTEVINLSLGELFHSLVGDNVKRWDQKLCQTEFAH 364
           F +++WK+    L FSSA+ PQTDGQTEV+N SL +L   LVGD    WD  L   EFA+
Sbjct: 417 FWKTLWKLFGTSLKFSSAFHPQTDGQTEVVNRSLRDLLRCLVGDKQGNWDLILPVAEFAY 476

Query: 365 NHAMNCITGFSPFTIVYGIVLRCPLGSVTDHIQLHCKAEDFVALLQQIHQTTPDHLVAAT 544
           N++ N  TG SPF IVYG++ R P+      I     +E      + I Q     +  +T
Sbjct: 477 NNSANRTTGKSPFEIVYGVMPRPPIDLAPLPIDAR-PSESATTFAEHIRQ----KISLST 531

Query: 545 RKYKAAVDKRRHHVEFEVGDLFGLS*LRTVFPPMNTTNLLPARLV*SRLLRK 700
             Y+ A +  R   +F+ GD   +      FP  +   L    +   R+LRK
Sbjct: 532 NTYQLAANTHRRTQDFQEGDYVMVRVCPERFPKHSFKKLHARSMGPYRILRK 583



 Score = 34.7 bits (78), Expect(2) = 7e-22
 Identities = 15/26 (57%), Positives = 19/26 (73%)
 Frame = +3

Query: 111 FWEIYRLHGLPTSIVFYRDTRFLSHF 188
           F E+  LHGLP SIV  RD +F+S+F
Sbjct: 392 FKEVIHLHGLPVSIVSDRDVKFVSYF 417


>gb|AAK94516.1| gag-pol polyprotein [Hordeum vulgare]
          Length = 1720

 Score =  110 bits (274), Expect = 1e-21
 Identities = 66/200 (33%), Positives = 105/200 (52%), Gaps = 2/200 (1%)
 Frame = +2

Query: 74   KTTDAINVAILFFLGNLSTSWASYFHCFLP*HAFP*PFSRSMWKMVNMRLDFSSAYDPQT 253
            K+ DA NVA LFF   +                F   F R +W  +  +L FS+   PQT
Sbjct: 1362 KSDDAANVADLFFREIIRLHGVPNTIVSDRDAKFLSHFWRCLWAKLGTKLLFSTTCHPQT 1421

Query: 254  DGQTEVINLSLGELFHSLVGDNVKRWDQKLCQTEFAHNHAMNCITGFSPFTIVYGIVLRC 433
            DGQTEV+N SL  +  +++ +N+K W++ L   EFA+N +++  T   PF IVYG + R 
Sbjct: 1422 DGQTEVVNRSLSTMLRAVLKNNIKLWEECLPHIEFAYNRSLHSTTKMCPFEIVYGFLPRA 1481

Query: 434  PLG--SVTDHIQLHCKAEDFVALLQQIHQTTPDHLVAATRKYKAAVDKRRHHVEFEVGDL 607
            P+    +    +++  A++   L+ ++H+ T +++     +YK A DK R HV F  GDL
Sbjct: 1482 PIDLLPIPSSEKVNFDAKERAELILKMHELTKENIERMNARYKLAGDKGRKHVVFAPGDL 1541

Query: 608  FGLS*LRTVFPPMNTTNLLP 667
              L   +  FP +  + L+P
Sbjct: 1542 VWLHLRKDRFPDLRKSKLMP 1561


>ref|XP_007048683.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
           gi|508700944|gb|EOX92840.1| DNA/RNA polymerases
           superfamily protein [Theobroma cacao]
          Length = 647

 Score = 94.7 bits (234), Expect(2) = 2e-21
 Identities = 62/184 (33%), Positives = 94/184 (51%), Gaps = 7/184 (3%)
 Frame = +2

Query: 74  KTTDAINVAILFF-----LGNLSTSWASYFHCFLP*HAFP*PFSRSMWKMVNMRLDFSSA 238
           KT+DA ++A LFF     L  + TS  S        H     F R++W+     L +SS 
Sbjct: 395 KTSDATHIAELFFCEVVRLHGIPTSIVSDRDVKFMGH-----FWRTLWRKFGTELKYSST 449

Query: 239 YDPQTDGQTEVINLSLGELFHSLVGDNVKRWDQKLCQTEFAHNHAMNCITGFSPFTIVYG 418
             PQTDGQTEV+N SLG +   L+ +N K WD  + Q EFA+N+++N     +PF + YG
Sbjct: 450 CHPQTDGQTEVVNRSLGNMLRCLIQNNPKTWDLVIPQAEFAYNNSVNRSIKKTPFEVAYG 509

Query: 419 IVLRCPLGSV--TDHIQLHCKAEDFVALLQQIHQTTPDHLVAATRKYKAAVDKRRHHVEF 592
           +  +  L  V      ++  + E F   +++IH+     L A+  +Y    ++ R   EF
Sbjct: 510 LKPQHVLDLVPLPQEARVSNEGELFAYHIRKIHEEVKAALKASNAEYSFTANQHRRKQEF 569

Query: 593 EVGD 604
           E GD
Sbjct: 570 EEGD 573



 Score = 35.8 bits (81), Expect(2) = 2e-21
 Identities = 20/52 (38%), Positives = 32/52 (61%)
 Frame = +1

Query: 619 LTKDCFPTHEYNKLVTRKIGLVEIIEKINPNAYRLKLLSTVRALRMSSISNI 774
           L ++ FP   Y+KL +RK G  ++I+KI+ NAY   L+     L++S I N+
Sbjct: 579 LRQERFPKGTYHKLKSRKFGPCKVIKKISSNAY---LIELPPELQISPIFNV 627


>gb|AAM94350.1| gag-pol polyprotein [Zea mays]
          Length = 1618

 Score =  109 bits (273), Expect = 2e-21
 Identities = 70/206 (33%), Positives = 108/206 (52%), Gaps = 8/206 (3%)
 Frame = +2

Query: 74   KTTDAINVAILFFL------GNLSTSWASYFHCFLP*HAFP*PFSRSMWKMVNMRLDFSS 235
            KT DA ++A LFF       G  +T  +     FL        F R++W  +  +L FS+
Sbjct: 1306 KTDDATHIADLFFREIVRLHGVPNTIVSDRDAKFLS------HFWRTLWAKLGTKLLFST 1359

Query: 236  AYDPQTDGQTEVINLSLGELFHSLVGDNVKRWDQKLCQTEFAHNHAMNCITGFSPFTIVY 415
               PQTDGQTEV+N +L  +  +++  N+K W+  L   EFA+N +++  T   PF IVY
Sbjct: 1360 TCHPQTDGQTEVVNRTLSTMLRAVLKKNIKMWEDCLPHIEFAYNRSLHSTTKMCPFQIVY 1419

Query: 416  GIVLRCP--LGSVTDHIQLHCKAEDFVALLQQIHQTTPDHLVAATRKYKAAVDKRRHHVE 589
            G++ R P  L  +    +L+  A     L+ ++H+TT +++     +YK A DK R  + 
Sbjct: 1420 GLLPRAPIDLMPLPSSEKLNFDATRRAELMLKLHETTKENIERMNARYKFASDKGRKEIN 1479

Query: 590  FEVGDLFGLS*LRTVFPPMNTTNLLP 667
            FE GDL  L   +  FP +  + LLP
Sbjct: 1480 FEPGDLVWLHLRKERFPELRKSKLLP 1505


>gb|ABI96971.1| putative gag-pol polyprotein [Triticum monococcum subsp.
            aegilopoides]
          Length = 1704

 Score =  109 bits (272), Expect = 2e-21
 Identities = 68/213 (31%), Positives = 109/213 (51%), Gaps = 15/213 (7%)
 Frame = +2

Query: 74   KTTDAINVAILFF-------------LGNLSTSWASYFHCFLP*HAFP*PFSRSMWKMVN 214
            K+ DA+NVA LFF             + +  T + S+F              R +W  + 
Sbjct: 1347 KSDDAVNVADLFFREIIRLHGVPNTIVSDRDTKFLSHFW-------------RCLWAKLG 1393

Query: 215  MRLDFSSAYDPQTDGQTEVINLSLGELFHSLVGDNVKRWDQKLCQTEFAHNHAMNCITGF 394
             +L FS+   PQTDGQTEV+N +L  +  +++ +N K W++ L   EFA+N +++  T  
Sbjct: 1394 NKLLFSTTCHPQTDGQTEVVNRTLSTMLRAVLKNNKKMWEECLPHIEFAYNRSLHSTTKM 1453

Query: 395  SPFTIVYGIVLRCPLG--SVTDHIQLHCKAEDFVALLQQIHQTTPDHLVAATRKYKAAVD 568
             PF IVYG + R P+    +    +++  A++   L+ +IH+ T +++     KYK A D
Sbjct: 1454 CPFEIVYGFLPRAPIDLLPLPSSEKVNFDAKERSELILKIHELTKENIERMNAKYKLARD 1513

Query: 569  KRRHHVEFEVGDLFGLS*LRTVFPPMNTTNLLP 667
            K R HV F  GDL  L   +  FP +  + L+P
Sbjct: 1514 KGRKHVVFAPGDLVWLHLRKDRFPNLRKSKLMP 1546


>ref|XP_007051412.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
            gi|508703673|gb|EOX95569.1| DNA/RNA polymerases
            superfamily protein [Theobroma cacao]
          Length = 1452

 Score = 94.7 bits (234), Expect(2) = 3e-21
 Identities = 62/184 (33%), Positives = 94/184 (51%), Gaps = 7/184 (3%)
 Frame = +2

Query: 74   KTTDAINVAILFF-----LGNLSTSWASYFHCFLP*HAFP*PFSRSMWKMVNMRLDFSSA 238
            +T+DA ++A LFF     L  + TS  S  H       F   F R++W+     L +SS 
Sbjct: 1127 RTSDATHIAELFFREIVILHGIPTSIVSDRHV-----KFMGYFWRTLWRKFGTELKYSST 1181

Query: 239  YDPQTDGQTEVINLSLGELFHSLVGDNVKRWDQKLCQTEFAHNHAMNCITGFSPFTIVYG 418
              PQTDGQTEV+N SLG +   L+ +N K WD  + Q EFA+N+++N     +PF   YG
Sbjct: 1182 CHPQTDGQTEVVNRSLGNMLRCLIQNNPKTWDLVIPQAEFAYNNSVNRSIKKTPFEAAYG 1241

Query: 419  IVLRCPLGSV--TDHIQLHCKAEDFVALLQQIHQTTPDHLVAATRKYKAAVDKRRHHVEF 592
            +  +  L  V      ++  + E F   +++IH+     L A+  +Y    ++ R   EF
Sbjct: 1242 LKPQHVLDLVPLPQEARVSNEGELFADQIRKIHEEVKAALKASNAEYSFTANQHRRKQEF 1301

Query: 593  EVGD 604
            E GD
Sbjct: 1302 EEGD 1305



 Score = 34.7 bits (78), Expect(2) = 3e-21
 Identities = 15/37 (40%), Positives = 26/37 (70%)
 Frame = +1

Query: 619  LTKDCFPTHEYNKLVTRKIGLVEIIEKINPNAYRLKL 729
            L ++ FP   Y+KL +RK G  ++++KI+ NAY ++L
Sbjct: 1311 LRQERFPKGTYHKLKSRKFGPCKVLKKISSNAYLIEL 1347


>gb|AAK94517.1| gag-pol polyprotein [Hordeum vulgare]
          Length = 1717

 Score =  108 bits (270), Expect = 3e-21
 Identities = 66/200 (33%), Positives = 104/200 (52%), Gaps = 2/200 (1%)
 Frame = +2

Query: 74   KTTDAINVAILFFLGNLSTSWASYFHCFLP*HAFP*PFSRSMWKMVNMRLDFSSAYDPQT 253
            K+ DA NVA LFF   +                F   F R +W  +  +L FS+   PQT
Sbjct: 1359 KSDDAANVADLFFREIIRLHGVPNTIVSDRDAKFLSHFWRCLWAKLGTKLLFSTTCHPQT 1418

Query: 254  DGQTEVINLSLGELFHSLVGDNVKRWDQKLCQTEFAHNHAMNCITGFSPFTIVYGIVLRC 433
            DGQTEV+N SL  +  +++  N+K W++ L   EFA+N +++  T   PF IVYG + R 
Sbjct: 1419 DGQTEVVNRSLSTMLRAVLKTNLKLWEECLPHIEFAYNRSLHSTTKMCPFEIVYGFLPRA 1478

Query: 434  PLG--SVTDHIQLHCKAEDFVALLQQIHQTTPDHLVAATRKYKAAVDKRRHHVEFEVGDL 607
            P+    +    +++  A++   L+ ++H+ T +++     +YK A DK R HV F  GDL
Sbjct: 1479 PIDLLPIPSSEKVNFDAKERAELILKMHELTKENIERMNARYKLAGDKGRKHVVFAPGDL 1538

Query: 608  FGLS*LRTVFPPMNTTNLLP 667
              L   +  FP +  + L+P
Sbjct: 1539 VWLHLRKDRFPDLRKSKLMP 1558


>ref|XP_007019612.1| Uncharacterized protein TCM_035725 [Theobroma cacao]
           gi|508724940|gb|EOY16837.1| Uncharacterized protein
           TCM_035725 [Theobroma cacao]
          Length = 499

 Score = 93.6 bits (231), Expect(2) = 4e-21
 Identities = 61/184 (33%), Positives = 94/184 (51%), Gaps = 7/184 (3%)
 Frame = +2

Query: 74  KTTDAINVAILFF-----LGNLSTSWASYFHCFLP*HAFP*PFSRSMWKMVNMRLDFSSA 238
           +T+DA ++A LFF     L  + TS  S        H     F R++W+     L +SS 
Sbjct: 174 RTSDATHIAELFFREIVRLHGIPTSIVSDRDVKFMGH-----FWRTLWRKFGTELKYSST 228

Query: 239 YDPQTDGQTEVINLSLGELFHSLVGDNVKRWDQKLCQTEFAHNHAMNCITGFSPFTIVYG 418
             PQTDGQTEV+N SLG +   L+ +N K WD  + Q EFA+N+++N     +PF + YG
Sbjct: 229 CHPQTDGQTEVVNRSLGNMLRCLIQNNPKTWDLVIPQAEFAYNNSVNRSIKKTPFEVAYG 288

Query: 419 IVLRCPLGSV--TDHIQLHCKAEDFVALLQQIHQTTPDHLVAATRKYKAAVDKRRHHVEF 592
           +  +  L  V      ++  + E F   +++IH+     L A+  +Y    ++ R   EF
Sbjct: 289 LKPQHVLDLVPLPQEARVSNEGELFADHIRKIHEEVKAALKASNAEYSFTANQHRRKQEF 348

Query: 593 EVGD 604
           E GD
Sbjct: 349 EEGD 352



 Score = 35.4 bits (80), Expect(2) = 4e-21
 Identities = 20/52 (38%), Positives = 32/52 (61%)
 Frame = +1

Query: 619 LTKDCFPTHEYNKLVTRKIGLVEIIEKINPNAYRLKLLSTVRALRMSSISNI 774
           L ++ FP   Y+KL +RK G  ++++KI+ NAY   L+     L++S I NI
Sbjct: 358 LRQERFPKGTYHKLKSRKFGPCKVLKKISSNAY---LIELPPELQISHIFNI 406

