BLASTX nr result

ID: Papaver25_contig00037232 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Papaver25_contig00037232
         (730 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004296004.1| PREDICTED: putative ribonuclease H protein A...   189   6e-46
gb|ABD28670.2| RNA-directed DNA polymerase (Reverse transcriptas...   184   4e-44
ref|XP_004309110.1| PREDICTED: putative ribonuclease H protein A...   160   5e-37
gb|ABE87589.2| RNA-directed DNA polymerase (Reverse transcriptas...   148   2e-33
ref|XP_004301440.1| PREDICTED: putative ribonuclease H protein A...   135   1e-29
emb|CCA66188.1| hypothetical protein [Beta vulgaris subsp. vulga...   135   1e-29
emb|CCA66140.1| hypothetical protein [Beta vulgaris subsp. vulga...   135   1e-29
emb|CCA66178.1| hypothetical protein [Beta vulgaris subsp. vulga...   134   2e-29
ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobrom...   134   3e-29
emb|CCA66054.1| hypothetical protein [Beta vulgaris subsp. vulga...   134   3e-29
ref|XP_004292011.1| PREDICTED: uncharacterized protein LOC101291...   133   7e-29
ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobrom...   132   9e-29
ref|XP_006357717.1| PREDICTED: uncharacterized protein LOC102595...   131   3e-28
ref|XP_007010390.1| Retrotransposon, unclassified-like protein [...   131   3e-28
gb|AAC33961.1| contains similarity to reverse trancriptase (Pfam...   131   3e-28
emb|CAB40051.1| putative protein [Arabidopsis thaliana] gi|72677...   131   3e-28
gb|EPS72636.1| hypothetical protein M569_02121, partial [Genlise...   130   4e-28
ref|XP_006358721.1| PREDICTED: uncharacterized protein LOC102596...   130   5e-28
ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobrom...   130   6e-28
emb|CAN60483.1| hypothetical protein VITISV_033959 [Vitis vinifera]   129   8e-28

>ref|XP_004296004.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria
           vesca subsp. vesca]
          Length = 751

 Score =  189 bits (481), Expect = 6e-46
 Identities = 96/245 (39%), Positives = 146/245 (59%), Gaps = 4/245 (1%)
 Frame = +1

Query: 1   GFSQSWC*WIIIILKSARISVLVNGSPEGFFSIQRGMRQGDPLSPLIFLLIEDVLSRNIT 180
           GF   +   ++I+L SA +S+L+NGSP GFFS  +G+RQGDPLSP++F + E+ LSR +T
Sbjct: 44  GFGSRFTDLMLILLNSAHLSILINGSPHGFFSCTKGVRQGDPLSPILFCIAEEALSRGLT 103

Query: 181 KLFQQGKMKTMVTRKGISPTHLFFADDIMIFCKGNMRSLRNLVDLLSSYQRASGQTVSRE 360
            LF   K++++   +G S TH+ +ADD+ IFC+G+ +SLR L   L +Y  ASGQ V+++
Sbjct: 104 ALFSSKKVRSISLPRGCSLTHVLYADDLFIFCRGDTKSLRQLQSFLDNYGAASGQLVNKD 163

Query: 361 KSNLFYGGGSLGRRATISKFLGMNVVSFPDTYLGVRAMPG--AVKYNQ--VKKIKGQLAG 528
           KS  + G     RR  + K LG  + + P +YLGV    G    K+ Q  V K K +LAG
Sbjct: 164 KSTFYLGASHFHRRHQVKKILGFKLGTSPFSYLGVPIFKGKPCRKHLQALVDKAKARLAG 223

Query: 529 *KGRMLSF*DMVVLVKSVIASYSIHNMAVYKWPKKFIQQCEVAIHNFLWSGDSQVSRSFM 708
            KG++LS    V LV  V  S  +H+ ++Y W    +        NF+WSGD  + +   
Sbjct: 224 WKGKLLSMAGRVQLVHDVFQSMLLHSFSIYLWATSLLSHLSACARNFIWSGDLAIRKLVT 283

Query: 709 VAYDK 723
           +++ +
Sbjct: 284 ISWQQ 288


>gb|ABD28670.2| RNA-directed DNA polymerase (Reverse transcriptase) [Medicago
            truncatula]
          Length = 642

 Score =  184 bits (466), Expect = 4e-44
 Identities = 100/246 (40%), Positives = 147/246 (59%), Gaps = 5/246 (2%)
 Frame = +1

Query: 1    GFSQSWC*WIIIILKSARISVLVNGSPEGFFSIQRGMRQGDPLSPLIFLLIEDVLSRNIT 180
            GF++ +C WI  IL S+++ + +NG+  GFF+  RG+RQGDPLSPL+F ++E+VLSR+I+
Sbjct: 344  GFNELFCNWIKTILHSSKMFISMNGAQHGFFNCNRGVRQGDPLSPLLFCIVEEVLSRSIS 403

Query: 181  KLFQQGKMKTM-VTRKGISPTHLFFADDIMIFCKGNMRSLRNLVDLLSSYQRASGQTVSR 357
             L  +G +  +  +R    P H F+ DD+M+FCK  M SL  L  L + Y   SGQ ++ 
Sbjct: 404  ILADKGLIDLIAASRNNCLPFHCFYVDDLMVFCKAKMSSLIVLKSLFTRYADCSGQIMNI 463

Query: 358  EKSNLFYGGGSLGRRATISKFLGMNVVSFPDTYLGV---RAMPGAVKYNQV-KKIKGQLA 525
             KS +F GG +  R   I   LG NV S P TYLG    +  P  + +  +  K+K +LA
Sbjct: 464  RKSFIFAGGITDTRMNNIVNILGFNVGSLPFTYLGAPIFKGKPKGIHFQPIADKVKAKLA 523

Query: 526  G*KGRMLSF*DMVVLVKSVIASYSIHNMAVYKWPKKFIQQCEVAIHNFLWSGDSQVSRSF 705
              K  +LS    + LVKSV+ S  +H M++Y WP K +++ E  I NF+WSGD    +  
Sbjct: 524  KWKASLLSIAGRIQLVKSVVQSMLVHTMSIYSWPIKILKEMEKWIKNFIWSGDVTKRKMV 583

Query: 706  MVAYDK 723
             VA+ K
Sbjct: 584  TVAWRK 589


>ref|XP_004309110.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria
           vesca subsp. vesca]
          Length = 872

 Score =  160 bits (404), Expect = 5e-37
 Identities = 91/247 (36%), Positives = 148/247 (59%), Gaps = 5/247 (2%)
 Frame = +1

Query: 1   GFSQSWC*WIIIILKSARISVLVNGSPEGFFSIQRGMRQGDPLSPLIFLLIEDVLSRNIT 180
           GF +S+   + ++L SAR+S+L+NG   G+FS  +G+RQGDPLSPL+F L E+VLSR I+
Sbjct: 105 GFHESFV-QVRVLLLSARLSLLINGRTYGYFSCGQGVRQGDPLSPLLFCLAEEVLSRGIS 163

Query: 181 KLFQQGKMKTMVTRKG-ISPTHLFFADDIMIFCKGNMRSLRNLVDLLSSYQRASGQTVSR 357
            L   G++K + + +G +SP+++ FA D+++FC+GN ++L  ++     Y   SGQ +++
Sbjct: 164 MLVSSGQVKRIHSPRGTLSPSYVLFAGDVIVFCRGNRQNLLRVMSFFYEYGSVSGQIINK 223

Query: 358 EKSNLFYGGGSLGRRATISKFLGMNVVSFPDTYLGVRAMPGAVKYNQ----VKKIKGQLA 525
           +KS +F G  +  RR +IS  LG+ + + P  YLG     G  +       V K++ +L+
Sbjct: 224 DKSQVFIGKHN-RRRHSISDCLGIPLGTAPFMYLGAPIFHGKPRVAHFQAIVDKVRLKLS 282

Query: 526 G*KGRMLSF*DMVVLVKSVIASYSIHNMAVYKWPKKFIQQCEVAIHNFLWSGDSQVSRSF 705
              G  LS    + L+KSVI S  ++   VY+WP   +++ E    NFLWSGD       
Sbjct: 283 SWVGSFLSMAGRLQLIKSVIYSMFVYTFQVYEWPVSLLRKVERWCRNFLWSGDIDKRGIP 342

Query: 706 MVAYDKC 726
           +V++  C
Sbjct: 343 LVSWTSC 349


>gb|ABE87589.2| RNA-directed DNA polymerase (Reverse transcriptase); Ribonuclease H;
            Endonuclease/exonuclease/phosphatase [Medicago
            truncatula]
          Length = 1246

 Score =  148 bits (374), Expect = 2e-33
 Identities = 75/164 (45%), Positives = 103/164 (62%), Gaps = 1/164 (0%)
 Frame = +1

Query: 1    GFSQSWC*WIIIILKSARISVLVNGSPEGFFSIQRGMRQGDPLSPLIFLLIEDVLSRNIT 180
            GF + +  WI++IL+SAR+SVLVNG   GFF+   G+RQGDPLSPL+F L+E+VLSR ++
Sbjct: 609  GFDEKFVHWILVILQSARLSVLVNGKAVGFFTCSHGVRQGDPLSPLLFCLVEEVLSRALS 668

Query: 181  KLFQQGKMKTMVTRKGIS-PTHLFFADDIMIFCKGNMRSLRNLVDLLSSYQRASGQTVSR 357
                 G++  M   +G+S PTH+ +ADD++IFC G  R++R L+ + S Y   SGQ ++ 
Sbjct: 669  MAATDGQLIPMSYCRGVSFPTHILYADDVLIFCTGTKRNIRRLIKIFSQYSEVSGQLINN 728

Query: 358  EKSNLFYGGGSLGRRATISKFLGMNVVSFPDTYLGVRAMPGAVK 489
             KS  F    +  R   IS  LG NV S P TYLG     G  K
Sbjct: 729  AKSRFFTSAMTGSRVQMISSLLGFNVGSLPFTYLGCPIFRGKPK 772


>ref|XP_004301440.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria
           vesca subsp. vesca]
          Length = 786

 Score =  135 bits (341), Expect = 1e-29
 Identities = 79/239 (33%), Positives = 132/239 (55%), Gaps = 6/239 (2%)
 Frame = +1

Query: 1   GFSQSWC*WIIIILKSARISVLVNGSPEGFFSIQRGMRQGDPLSPLIFLLIEDVLSRNIT 180
           GF+QSW   I+  + S   S+++NG P   F   RG+RQGDPLSP +FL+I +VLS  +T
Sbjct: 36  GFAQSWVNLIMACVSSVSFSIVLNGCPGKSFFPGRGLRQGDPLSPYLFLIISEVLSVRLT 95

Query: 181 KLFQ-QGKMKTMVTRKGISPTHLFFADDIMIFCKGNMRSLRNLVDLLSSYQRASGQTVSR 357
           +  Q +  +   + R+G + +H+FFADD + F K  + ++  L+++   Y +ASGQ ++ 
Sbjct: 96  RAVQDKSLLGIKLCRRGPTLSHMFFADDALFFLKATLGNVCRLMEIFKEYCKASGQLING 155

Query: 358 EKSNLFYGGGSLGRRA-TISKFLGMNVVSFPDTYLGVRAMPGAVKYNQ----VKKIKGQL 522
           EKS+ ++   +  + +  + + +G   V  P  YLG+  + G  K       V +++ +L
Sbjct: 156 EKSSAYFSPNTPDQMSRLLGELMGFAEVEDPGKYLGLPTLWGRSKKEAVGYIVDRVQRKL 215

Query: 523 AG*KGRMLSF*DMVVLVKSVIASYSIHNMAVYKWPKKFIQQCEVAIHNFLWSGDSQVSR 699
            G K R LS+    +L+KSV  +   + MA +K+PK        A+ NF W   S  +R
Sbjct: 216 VGWKQRSLSWAGKEILIKSVATAIPAYPMACFKFPKGVCDTINSALSNFWWGSTSTGNR 274


>emb|CCA66188.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1381

 Score =  135 bits (341), Expect = 1e-29
 Identities = 89/246 (36%), Positives = 131/246 (53%), Gaps = 5/246 (2%)
 Frame = +1

Query: 1    GFSQSWC*WIIIILKSARISVLVNGSPEGFFSIQRGMRQGDPLSPLIFLLIEDVLSRNIT 180
            GF   W  WI   + SA  S+L+NGSP   F + RG+RQGDPLSP +F L+ + LS  I 
Sbjct: 594  GFPPRWRMWISSCITSAAASILINGSPTAPFKLHRGLRQGDPLSPFLFDLVVETLSLVIQ 653

Query: 181  KLFQQGKMKTM-VTRKGISPTHLFFADDIMIFCKGNMRSLRNLVDLLSSYQRASGQTVSR 357
            K    G  + + VT+ G   THL +ADD +IFC  N+  L N+   L  +Q ASG  V+ 
Sbjct: 654  KASHLGLWEGVEVTKNGEKITHLQYADDTIIFCPPNLDYLLNIKKTLILFQLASGLQVNF 713

Query: 358  EKSNLFYGGGSLGRRATISKFLGMNVVSFPDTYLGVRAMPGAVKYNQ----VKKIKGQLA 525
             KS++             +  L   V   P TYLG+       +       +KKI+G+LA
Sbjct: 714  HKSSIMGIHVDEIWLQEAANALLCKVGRLPFTYLGLPIGGNISRLAHWDPIIKKIEGKLA 773

Query: 526  G*KGRMLSF*DMVVLVKSVIASYSIHNMAVYKWPKKFIQQCEVAIHNFLWSGDSQVSRSF 705
              KGRMLS    + L+K+ I+S  ++ M+++  P+  I+       NFLWSG+ + S   
Sbjct: 774  SWKGRMLSIAGRITLIKASISSLPLYYMSLFPAPRGVIEAINKLQRNFLWSGELRKSSLA 833

Query: 706  MVAYDK 723
            +VA+++
Sbjct: 834  LVAWNQ 839


>emb|CCA66140.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1381

 Score =  135 bits (341), Expect = 1e-29
 Identities = 87/244 (35%), Positives = 131/244 (53%), Gaps = 6/244 (2%)
 Frame = +1

Query: 4    FSQSWC*WIIIILKSARISVLVNGSPEGFFSIQRGMRQGDPLSPLIFLLIEDVLSRNITK 183
            F   WC WI+  + +A +S+LVNGSP   F +QRG+RQGDPLS  +F+LI + L++ I K
Sbjct: 595  FPDQWCKWIMNCVSTAAVSILVNGSPCAPFKLQRGLRQGDPLSSFLFVLIAESLNQIIMK 654

Query: 184  LFQQGKMKTMVTRKG-ISPTHLFFADDIMIFCKGNMRSLRNLVDLLSSYQRASGQTVSRE 360
               Q   K +   +G I  THL +ADD +IFC  N+ SL+N+   L  +Q ASG  ++  
Sbjct: 655  ATSQNLWKGVEVGQGEIIVTHLQYADDTLIFCDANIESLKNVKKALILFQLASGLQINFH 714

Query: 361  KSNLFYGGGSLGRRATISKFLGMNVVSFPDTYLGVRAMPGAVKYNQ-----VKKIKGQLA 525
            KS+L     S G     ++ L   +   P TYLGV  + G     Q     + KI  +LA
Sbjct: 715  KSSLIGLNTSSGWIKVAAEALLCKIGEIPFTYLGV-PIGGQCSRIQLWDPIIAKISRRLA 773

Query: 526  G*KGRMLSF*DMVVLVKSVIASYSIHNMAVYKWPKKFIQQCEVAIHNFLWSGDSQVSRSF 705
              K +MLS    + L+KS + S  ++ M++Y  P+  + +       FLW+G    +   
Sbjct: 774  TWKCKMLSIGGRLTLIKSSLISLPVYFMSIYPMPQDVVNKIIGLARQFLWAGSDGKNAMP 833

Query: 706  MVAY 717
            +VA+
Sbjct: 834  LVAW 837


>emb|CCA66178.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1381

 Score =  134 bits (338), Expect = 2e-29
 Identities = 83/243 (34%), Positives = 128/243 (52%), Gaps = 5/243 (2%)
 Frame = +1

Query: 4    FSQSWC*WIIIILKSARISVLVNGSPEGFFSIQRGMRQGDPLSPLIFLLIEDVLSRNITK 183
            F + WC WI+  + +A  S+LVNGSP   F ++RG+RQGDPLSP +F+LI + L++ I K
Sbjct: 595  FPEQWCQWIMTCVTTASASILVNGSPSTPFKLKRGLRQGDPLSPFLFVLIGEALNQVILK 654

Query: 184  LFQQGKMKTM-VTRKGISPTHLFFADDIMIFCKGNMRSLRNLVDLLSSYQRASGQTVSRE 360
                G    + V R G+  THL +ADD ++F    + SL+N+   L  +  ASG  V+  
Sbjct: 655  ATNMGLWSGVEVCRNGLKITHLQYADDTLVFSDARLESLKNIKMALILFHLASGLQVNFH 714

Query: 361  KSNLFYGGGSLGRRATISKFLGMNVVSFPDTYLGVRAMPGAVKYNQ----VKKIKGQLAG 528
            KS++     S       +  L       P TYLG+       K       + KI  +LA 
Sbjct: 715  KSSIIGMNTSKTWLNEAANSLLCKTGDIPFTYLGLPIGENIHKIKAWDPIINKISMKLAT 774

Query: 529  *KGRMLSF*DMVVLVKSVIASYSIHNMAVYKWPKKFIQQCEVAIHNFLWSGDSQVSRSFM 708
             KGRMLS    + L+KS +++  ++ M+++  PK  +++       FLWSGD +     +
Sbjct: 775  WKGRMLSIGGRLTLIKSSLSNLPLYFMSLFPIPKGVVEKINKITRRFLWSGDMEKRSIPL 834

Query: 709  VAY 717
            VA+
Sbjct: 835  VAW 837


>ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
            gi|508710341|gb|EOY02238.1| Uncharacterized protein
            TCM_016762 [Theobroma cacao]
          Length = 2214

 Score =  134 bits (337), Expect = 3e-29
 Identities = 79/246 (32%), Positives = 135/246 (54%), Gaps = 5/246 (2%)
 Frame = +1

Query: 1    GFSQSWC*WIIIILKSARISVLVNGSPEGFFSIQRGMRQGDPLSPLIFLLIEDVLSRNIT 180
            GF+  W   I   + +   S+L+NGS  G+F  +RG+RQGD +SP +F+L  + LSR + 
Sbjct: 1445 GFNALWINMIKACISNCWFSLLINGSLVGYFKSERGLRQGDSISPSLFILAAEYLSRGLN 1504

Query: 181  KLFQQGKMKTMVTRKGISPTHLFFADDIMIFCKGNMRSLRNLVDLLSSYQRASGQTVSRE 360
            +LF +      ++   +S +HL FADDI+IF  G   +L+ ++  L  Y++ SGQ V+ +
Sbjct: 1505 QLFSRYNSLHYLSGCSMSVSHLAFADDIVIFTNGCHSALQKILVFLQEYEQVSGQQVNHQ 1564

Query: 361  KSNLFYGGG-SLGRRATISKFLGMNVVSFPDTYLGVRAMPGAVKY----NQVKKIKGQLA 525
            KS      G  L RR  I++  G    + P TYLG     G  K     + + KI+ +++
Sbjct: 1565 KSCFITANGCPLSRRQIIAQVTGFQHKTLPVTYLGAPLHKGPKKVFLFDSLISKIRDRIS 1624

Query: 526  G*KGRMLSF*DMVVLVKSVIASYSIHNMAVYKWPKKFIQQCEVAIHNFLWSGDSQVSRSF 705
            G + ++LS    + L++SV++S  ++ + V K P   I++ E   ++FLW   ++  R  
Sbjct: 1625 GWENKILSPGSRITLLRSVLSSLPMYLLQVLKPPAIVIEKIERLFNSFLWGDSNEGKRMH 1684

Query: 706  MVAYDK 723
              A++K
Sbjct: 1685 WAAWNK 1690


>emb|CCA66054.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1355

 Score =  134 bits (337), Expect = 3e-29
 Identities = 83/237 (35%), Positives = 133/237 (56%), Gaps = 7/237 (2%)
 Frame = +1

Query: 1    GFSQSWC*WIIIILKSARISVLVNGSPEGFFSIQRGMRQGDPLSPLIFLLIEDVLSRNIT 180
            GF   W   I+  + S   S ++NGS  G     RG+RQGDPLSP +F+++ D  S+ I 
Sbjct: 601  GFDGRWVNLIMEFVSSVTYSFIINGSVCGSVVPARGLRQGDPLSPYLFIMVADAFSKMIQ 660

Query: 181  KLFQQGKMK-TMVTRKGISPTHLFFADDIMIFCKGNMRSLRNLVDLLSSYQRASGQTVSR 357
            +  Q  ++     +R G   +HLFFADD ++F + N +    +VD+L+ Y+ ASGQ ++ 
Sbjct: 661  RKVQDKQLHGAKASRSGPEISHLFFADDSLLFTRANRQECTIIVDILNQYELASGQKINY 720

Query: 358  EKSNLFYGGG-SLGRRATISKFLGMNVVSFPDTYLGVRAMPG----AVKYNQVKKIKGQL 522
            EKS + Y  G S+ ++  ++  L M  V   + YLG+ ++ G    A+  + + +I  +L
Sbjct: 721  EKSEVSYSRGVSVSQKDELTNILNMRQVDRHEKYLGIPSISGRSKKAIFDSLIDRIWKKL 780

Query: 523  AG*KGRMLSF*DMVVLVKSVIASYSIHNMAVYKWPKKFIQQCEVAIHNFLW-SGDSQ 690
             G K ++LS     VL+KSVI +   + M VYK+P   IQ+ + A+  F W S D+Q
Sbjct: 781  QGWKEKLLSRAGKEVLLKSVIQAIPTYLMGVYKFPVFIIQKIQSAMARFWWGSSDTQ 837


>ref|XP_004292011.1| PREDICTED: uncharacterized protein LOC101291306 [Fragaria vesca
           subsp. vesca]
          Length = 948

 Score =  133 bits (334), Expect = 7e-29
 Identities = 77/246 (31%), Positives = 131/246 (53%), Gaps = 6/246 (2%)
 Frame = +1

Query: 1   GFSQSWC*WIIIILKSARISVLVNGSPEGFFSIQRGMRQGDPLSPLIFLLIEDVLSRNIT 180
           GF++ W   ++  + S   S+++NG P  +FS  RG+RQGDPLSP +FL++ + LS  +T
Sbjct: 250 GFARPWVNLVLACVSSVSFSIVLNGKPGRYFSPSRGLRQGDPLSPYLFLIVSEALSLRLT 309

Query: 181 KLFQQGKMKTMVTRKGI-SPTHLFFADDIMIFCKGNMRSLRNLVDLLSSYQRASGQTVSR 357
           K   +  +  +   +G  + +HLFFADD + F K  + ++  L  +   Y RASGQ +SR
Sbjct: 310 KAVNEKHLLGIKLCRGCPTLSHLFFADDALFFVKATLSNVSKLAAIFEEYCRASGQVISR 369

Query: 358 EKSNLFYGGGSLGRRATI-SKFLGMNVVSFPDTYLGVRAMPGAVKYNQV----KKIKGQL 522
           EKS++F+   +  + A +  + +G   V  P  YLG+  + G +K + +    ++I  +L
Sbjct: 370 EKSSIFFSPNTPAQMARLMCELMGFVEVENPGKYLGLPTIWGRLKKDALSYITERINRKL 429

Query: 523 AG*KGRMLSF*DMVVLVKSVIASYSIHNMAVYKWPKKFIQQCEVAIHNFLWSGDSQVSRS 702
            G K + LS+     L+KSV        M+ +  PK    Q   AI NF W     +++ 
Sbjct: 430 DGWKEKNLSWAGKETLIKSVAMVIPSFPMSCFLLPKYLGNQINSAISNFWWGKSESINKI 489

Query: 703 FMVAYD 720
             + ++
Sbjct: 490 HWICWE 495


>ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
            gi|508710342|gb|EOY02239.1| Uncharacterized protein
            TCM_016763 [Theobroma cacao]
          Length = 2127

 Score =  132 bits (333), Expect = 9e-29
 Identities = 80/234 (34%), Positives = 129/234 (55%), Gaps = 5/234 (2%)
 Frame = +1

Query: 1    GFSQSWC*WIIIILKSARISVLVNGSPEGFFSIQRGMRQGDPLSPLIFLLIEDVLSRNIT 180
            GF+  W   I   + +   S+L+NGS  G+F  +RG+RQGD +SP++F+L  D LSR + 
Sbjct: 1358 GFNAHWINMIKSCISNCWFSLLINGSLAGYFKSERGLRQGDSISPMLFILAADYLSRGLN 1417

Query: 181  KLFQQGKMKTMVTRKGISPTHLFFADDIMIFCKGNMRSLRNLVDLLSSYQRASGQTVSRE 360
             LF        ++   +  +HL FADDI+IF  G   +L+ ++  L  Y++ SGQ V+ +
Sbjct: 1418 HLFSCYSSLQYLSGCQMPISHLSFADDIVIFTNGGRSALQKILSFLQEYEQVSGQKVNHQ 1477

Query: 361  KSNLFYGGG-SLGRRATISKFLGMNVVSFPDTYLGVRAMPGAVKY----NQVKKIKGQLA 525
            KS      G SL RR  IS   G    + P TYLG     G  K     + + KI+ +++
Sbjct: 1478 KSCFITANGCSLSRRQIISHTTGFQHKTLPVTYLGAPLHKGPKKVLLFDSLISKIRDRIS 1537

Query: 526  G*KGRMLSF*DMVVLVKSVIASYSIHNMAVYKWPKKFIQQCEVAIHNFLWSGDS 687
            G + ++LS    + L++SV++S  ++ + V K P   I++ +   ++FLW GDS
Sbjct: 1538 GWENKILSPGGRITLLRSVLSSLPMYLLQVLKPPVTVIERIDRLFNSFLW-GDS 1590


>ref|XP_006357717.1| PREDICTED: uncharacterized protein LOC102595469 [Solanum tuberosum]
          Length = 1079

 Score =  131 bits (329), Expect = 3e-28
 Identities = 78/250 (31%), Positives = 131/250 (52%), Gaps = 7/250 (2%)
 Frame = +1

Query: 1    GFSQSWC*WIIIILKSARISVLVNGSPEGFFSIQRGMRQGDPLSPLIFLLIEDVLSRNIT 180
            GF + W   +   + S   S++VNG+   FF  +RG+RQGDP+SP +F++  + LS  + 
Sbjct: 388  GFCEIWIDMVFRHISSNWYSLIVNGNRHDFFQSKRGLRQGDPISPALFVISAEYLSLKLN 447

Query: 181  KLFQQGKMKTM-VTRKGISPTHLFFADDIMIFCKGNMRSLRNLVDLLSSYQRASGQTVSR 357
            +L       +  + +KG    HL FA+D+++F  G  RSL  L++ L++Y+R SGQ +++
Sbjct: 448  ELNNNTDFSSFSMNKKGPRINHLAFANDVILFSSGCRRSLDLLMETLNNYERVSGQKINK 507

Query: 358  EKSNLFYGGGSLGR-RATISKFLGMNVVSFPDTYLGVRAMPGAVKY----NQVKKIKGQL 522
             KS++        + R  + +  GM   S P  YLG     G   Y      + KI  ++
Sbjct: 508  SKSSVSLSSKENEQARQRVQEITGMTYRSLPIKYLGCPLYEGRKDYALFSEMMSKILHKI 567

Query: 523  AG*KGRMLSF*DMVVLVKSVIASYSIHNMAVYKWPKKFIQQCEVAIHNFLWSGDSQVSRS 702
             G + + LS    VVL+K V+ S S+H+ A    P   + Q E  ++ FLW G S+ ++ 
Sbjct: 568  GGWQNKFLSIGGRVVLIKHVLMSLSVHSFAAIHPPIGVLNQMEKMLNRFLWGGSSEKTKM 627

Query: 703  FMVAYDK-CY 729
               +++  CY
Sbjct: 628  HWASWESMCY 637


>ref|XP_007010390.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
            gi|508727303|gb|EOY19200.1| Retrotransposon,
            unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score =  131 bits (329), Expect = 3e-28
 Identities = 81/245 (33%), Positives = 132/245 (53%), Gaps = 6/245 (2%)
 Frame = +1

Query: 1    GFSQSWC*WIIIILKSARISVLVNGSPEGFFSIQRGMRQGDPLSPLIFLLIEDVLSRNIT 180
            GF+  W   I   + +   SVL+NG   G+F  +RG+RQGD +SP++F+L  + LSR I 
Sbjct: 565  GFNDMWIDMIRRCITNCWFSVLINGHSAGYFKSERGLRQGDSISPMLFILAAEYLSRGIN 624

Query: 181  KLFQQGKMKTMVTRKGISPTHLFFADDIMIFCKGNMRSLRNLVDLLSSYQRASGQTVSRE 360
            +LF +       +   ++ +HL FADDIMIF  G+   L  +++ L  Y++ SGQ V+ +
Sbjct: 625  ELFSRYISLHYHSGCSLNISHLAFADDIMIFTNGSKSVLEKILEFLQEYEQISGQRVNHQ 684

Query: 361  KSNLFYGGGSL--GRRATISKFLGMNVVSFPDTYLGVRAMPGAVKY----NQVKKIKGQL 522
            KS  F    ++   RR  IS+ +G    + P TYLG     G  K     + + KI+ ++
Sbjct: 685  KS-CFVTANNMPSSRRQIISQTIGFLHKTLPITYLGAPLFKGPKKVMLFDSLINKIRERI 743

Query: 523  AG*KGRMLSF*DMVVLVKSVIASYSIHNMAVYKWPKKFIQQCEVAIHNFLWSGDSQVSRS 702
             G + ++LS    + L++SV++S  I+ + V K P   IQ+ E   ++FLW      +R 
Sbjct: 744  TGWENKILSPGGRITLLRSVLSSMPIYLLQVLKPPACVIQKIERLFNSFLWGSSMDSTRI 803

Query: 703  FMVAY 717
               A+
Sbjct: 804  HWTAW 808


>gb|AAC33961.1| contains similarity to reverse trancriptase (Pfam: rvt.hmm, score:
            42.57) [Arabidopsis thaliana]
          Length = 1662

 Score =  131 bits (329), Expect = 3e-28
 Identities = 76/236 (32%), Positives = 126/236 (53%), Gaps = 6/236 (2%)
 Frame = +1

Query: 1    GFSQSWC*WIIIILKSARISVLVNGSPEGFFSIQRGMRQGDPLSPLIFLLIEDVLSRNIT 180
            GFS++W  WI+  +KS   SVLVNG P G    QRG+RQGDPLSP +F+L  D+L+  I 
Sbjct: 979  GFSETWIKWIMGAVKSVNYSVLVNGIPHGTIQPQRGIRQGDPLSPYLFILCADILNHLIK 1038

Query: 181  KLFQQGKMKTMVTRKGI-SPTHLFFADDIMIFCKGNMRSLRNLVDLLSSYQRASGQTVSR 357
                +G ++ +    G+   THL FADD + FC+ N+R+ + L D+   Y+  SGQ ++ 
Sbjct: 1039 NRVAEGDIRGIRIGNGVPGVTHLQFADDSLFFCQSNVRNCQALKDVFDVYEYYSGQKINM 1098

Query: 358  EKSNLFYGGGSLG-RRATISKFLGMNVVSFPDTYLGVRAMPGAVKYNQ----VKKIKGQL 522
             KS + +G    G  +  +   LG+        YLG+    G  K +     ++++K + 
Sbjct: 1099 SKSMITFGSRVHGTTQNRLKNILGIQSHGGGGKYLGLPEQFGRKKRDMFNYIIERVKKRT 1158

Query: 523  AG*KGRMLSF*DMVVLVKSVIASYSIHNMAVYKWPKKFIQQCEVAIHNFLWSGDSQ 690
            +    + LS     +++KSV  S  ++ M+ +K P   + + E  + NF W  +++
Sbjct: 1159 SSWSAKYLSPAGKEIMLKSVAMSMPVYAMSCFKLPLNIVSEIEALLMNFWWEKNAK 1214


>emb|CAB40051.1| putative protein [Arabidopsis thaliana] gi|7267781|emb|CAB81184.1|
            putative protein [Arabidopsis thaliana]
          Length = 1294

 Score =  131 bits (329), Expect = 3e-28
 Identities = 76/236 (32%), Positives = 126/236 (53%), Gaps = 6/236 (2%)
 Frame = +1

Query: 1    GFSQSWC*WIIIILKSARISVLVNGSPEGFFSIQRGMRQGDPLSPLIFLLIEDVLSRNIT 180
            GFS++W  WI+  +KS   SVLVNG P G    QRG+RQGDPLSP +F+L  D+L+  I 
Sbjct: 959  GFSETWIKWIMGAVKSVNYSVLVNGIPHGTIQPQRGIRQGDPLSPYLFILCADILNHLIK 1018

Query: 181  KLFQQGKMKTMVTRKGI-SPTHLFFADDIMIFCKGNMRSLRNLVDLLSSYQRASGQTVSR 357
                +G ++ +    G+   THL FADD + FC+ N+R+ + L D+   Y+  SGQ ++ 
Sbjct: 1019 NRVAEGDIRGIRIGNGVPGVTHLQFADDSLFFCQSNVRNCQALKDVFDVYEYYSGQKINM 1078

Query: 358  EKSNLFYGGGSLG-RRATISKFLGMNVVSFPDTYLGVRAMPGAVKYNQ----VKKIKGQL 522
             KS + +G    G  +  +   LG+        YLG+    G  K +     ++++K + 
Sbjct: 1079 SKSMITFGSRVHGTTQNRLKNILGIQSHGGGGKYLGLPEQFGRKKRDMFNYIIERVKKRT 1138

Query: 523  AG*KGRMLSF*DMVVLVKSVIASYSIHNMAVYKWPKKFIQQCEVAIHNFLWSGDSQ 690
            +    + LS     +++KSV  S  ++ M+ +K P   + + E  + NF W  +++
Sbjct: 1139 SSWSAKYLSPAGKEIMLKSVAMSMPVYAMSCFKLPLNIVSEIEALLMNFWWEKNAK 1194


>gb|EPS72636.1| hypothetical protein M569_02121, partial [Genlisea aurea]
          Length = 1503

 Score =  130 bits (328), Expect = 4e-28
 Identities = 79/231 (34%), Positives = 127/231 (54%), Gaps = 6/231 (2%)
 Frame = +1

Query: 1    GFSQSWC*WIIIILKSARISVLVNGSPEGFFSIQRGMRQGDPLSPLIFLLIEDVLSRNIT 180
            GF  S+   I++ + S   S+++NG   G  + QRG+RQGDPLSP +FL   + LS  + 
Sbjct: 936  GFHISFVELILLAVSSVSYSLVINGDRVGLINPQRGLRQGDPLSPYLFLFCAEGLSSALR 995

Query: 181  KLFQQGKMKTM-VTRKGISPTHLFFADDIMIFCKGNMRSLRNLVDLLSSYQRASGQTVSR 357
               Q   +    VTR+G S +HLFFADD MIFC+ +  +L  + D+L  Y+RASGQ V+ 
Sbjct: 996  AAEQSQSITGFRVTRRGPSISHLFFADDAMIFCEASCAALSRVSDILQDYERASGQKVNT 1055

Query: 358  EKSNLFYGGGSLGRRATI-SKFLGMNVVSFPDTYLGVRAMPGAVK----YNQVKKIKGQL 522
             KS + +   +      I S+ LG  V S  D YLG+ ++ G+ K       ++++  ++
Sbjct: 1056 HKSAMVFSPNTPDSEKEIWSRGLGFLVKSHHDIYLGLPSLTGSSKKRLFSGLLERVNRKI 1115

Query: 523  AG*KGRMLSF*DMVVLVKSVIASYSIHNMAVYKWPKKFIQQCEVAIHNFLW 675
             G   + LS    +VL+K+V+ +   + M+ +  PK F+   + AI  + W
Sbjct: 1116 EGWNSKFLSQAGKLVLIKAVLQAIPAYTMSCFALPKSFLGDLQSAISRYWW 1166


>ref|XP_006358721.1| PREDICTED: uncharacterized protein LOC102596481 [Solanum tuberosum]
          Length = 1135

 Score =  130 bits (327), Expect = 5e-28
 Identities = 80/251 (31%), Positives = 130/251 (51%), Gaps = 8/251 (3%)
 Frame = +1

Query: 1    GFSQSWC*WIIIILKSARISVLVNGSPEGFFSIQRGMRQGDPLSPLIFLLIEDVLSRNIT 180
            GF++     I+ ++ +   SVL+NG   GFF   RG++QGDPLSP +F++  +VLSR + 
Sbjct: 467  GFAERIIDMIVRLISNNWYSVLMNGQSFGFFQSTRGLKQGDPLSPTLFIIAAEVLSRGLN 526

Query: 181  KLFQQGKMKTMVTRKGISP--THLFFADDIMIFCKGNMRSLRNLVDLLSSYQRASGQTVS 354
             LF+          K  SP  +HL +ADD ++FC G   S+R ++++L  Y++ SGQ ++
Sbjct: 527  SLFEDPDYIGYGMPKW-SPVVSHLSYADDTILFCSGQTTSMRKMINILRGYEKVSGQMIN 585

Query: 355  REKSNLFYGGGSLGRRAT-ISKFLGMNVVSFPDTYLGVRAMPGAVK----YNQVKKIKGQ 519
             +KS ++       R    + +  G+   SFP TYLG     G        N +KK+  +
Sbjct: 586  LDKSMIYLHKQVPNRVCNLVKRITGIRQGSFPFTYLGCPIFYGRKNKGHFENLLKKVSNR 645

Query: 520  LAG*KGRMLSF*DMVVLVKSVIASYSIHNMAVYKWPKKFIQQCEVAIHNFLWSGDSQVSR 699
            +   + +++SF +  +L+  V+ S  ++ +A    PK  I Q       F WS  S    
Sbjct: 646  MNTWQNKLMSFGERYILIAHVLQSIPVYLLAAMNPPKSIIDQLHKLFAIFFWSNSSGARN 705

Query: 700  SFMVAYDK-CY 729
               VA+DK CY
Sbjct: 706  KHWVAWDKMCY 716


>ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobroma cacao]
            gi|508710339|gb|EOY02236.1| Uncharacterized protein
            TCM_011923 [Theobroma cacao]
          Length = 1954

 Score =  130 bits (326), Expect = 6e-28
 Identities = 80/246 (32%), Positives = 130/246 (52%), Gaps = 5/246 (2%)
 Frame = +1

Query: 1    GFSQSWC*WIIIILKSARISVLVNGSPEGFFSIQRGMRQGDPLSPLIFLLIEDVLSRNIT 180
            GF+  W   I   + +   S+L+NGS  G+F  +RG+RQGD +SPL+F+L  D LSR I 
Sbjct: 1184 GFNDRWISMIKACISNCWFSLLINGSLVGYFKSERGLRQGDSISPLLFVLAADYLSRGIN 1243

Query: 181  KLFQQGKMKTMVTRKGISPTHLFFADDIMIFCKGNMRSLRNLVDLLSSYQRASGQTVSRE 360
            +LF + K    ++   +  +HL FADDI+IF  G   +L+ ++  L  Y+  SGQ V+ +
Sbjct: 1244 QLFNRHKSLLYLSGCFMPISHLAFADDIVIFTNGCRPALQKILVFLQEYEEVSGQQVNHQ 1303

Query: 361  KSNLFYGGG-SLGRRATISKFLGMNVVSFPDTYLGVRAMPGAVKY----NQVKKIKGQLA 525
            KS      G  + RR  I+   G    + P  YLG     G  K     + + KI+ +++
Sbjct: 1304 KSCFITANGCPMTRRQIIAHTTGFQHKTLPVIYLGAPLHKGPKKVTLFDSLITKIRDRIS 1363

Query: 526  G*KGRMLSF*DMVVLVKSVIASYSIHNMAVYKWPKKFIQQCEVAIHNFLWSGDSQVSRSF 705
            G + + LS    + L++SV++S  ++ + V K P   I++ E   ++FLW   +   R  
Sbjct: 1364 GWENKTLSPGGRITLLRSVLSSLPLYLLQVLKPPVVVIEKIERLFNSFLWGDSTNDKRIH 1423

Query: 706  MVAYDK 723
              A+ K
Sbjct: 1424 WAAWHK 1429


>emb|CAN60483.1| hypothetical protein VITISV_033959 [Vitis vinifera]
          Length = 326

 Score =  129 bits (325), Expect = 8e-28
 Identities = 82/248 (33%), Positives = 130/248 (52%), Gaps = 8/248 (3%)
 Frame = +1

Query: 1   GFSQSWC*WIIIILKSARISVLVNGSPEGFFSIQRGMRQGDPLSPLIFLLIEDVLSRNIT 180
           GF   W  W+   L SA+ SVLVNG P GFF   +G+RQGDPLSP +F++  +VL   I 
Sbjct: 2   GFGPKWVGWMWSCLSSAKFSVLVNGVPAGFFPSTKGLRQGDPLSPYLFIMGMEVLDVLIR 61

Query: 181 KLFQQGKMKTMVTRKGISP----THLFFADDIMIFCKGNMRSLRNLVDLLSSYQRASGQT 348
           +  + G +     R G  P    +HLFFA+D +IFC+     L +L  +L  ++ ASG  
Sbjct: 62  RAVEGGFLSRCNIRGGSGPPLNISHLFFANDTIIFCEARKDHLTHLSWILFWFEAASGLR 121

Query: 349 VSREKSNLFYGGGSLGRRATISKFLGMNVVSFPDTYLGV-RAMPGAVKY---NQVKKIKG 516
           ++  KS +    G +     ++  LG  V S P  YLG+    P    Y      ++++ 
Sbjct: 122 INLAKSEII-PVGEVVEXEELAVELGCRVGSLPSQYLGLPLGAPNRAPYIWDGVEERVRR 180

Query: 517 QLAG*KGRMLSF*DMVVLVKSVIASYSIHNMAVYKWPKKFIQQCEVAIHNFLWSGDSQVS 696
           +LA  K + +S    V L+KS +AS  I+ M +++ PK  +++ E    +FLW G +   
Sbjct: 181 RLALWKRQYISKGGRVTLIKSTLASMPIYQMFIFRMPKVVVRRJEKXQRDFLWGGGNMEG 240

Query: 697 RSFMVAYD 720
           +  +V ++
Sbjct: 241 KVHLVKWE 248

