BLASTX nr result

ID: Rehmannia22_contig00001112 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00001112
         (1854 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004309110.1| PREDICTED: putative ribonuclease H protein A...   444   e-122
gb|ABD28670.2| RNA-directed DNA polymerase (Reverse transcriptas...   406   e-110
ref|XP_004296004.1| PREDICTED: putative ribonuclease H protein A...   374   e-100
gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]   335   5e-89
gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]   323   1e-85
gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]   320   2e-84
gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theob...   311   5e-82
gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]   310   2e-81
gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]   308   7e-81
dbj|BAE79385.1| unnamed protein product [Ipomoea batatas]             307   1e-80
dbj|BAE79382.1| unnamed protein product [Ipomoea batatas]             306   2e-80
gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]   305   3e-80
gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]   305   4e-80
emb|CAN75646.1| hypothetical protein VITISV_031269 [Vitis vinifera]   305   6e-80
dbj|BAE79384.1| unnamed protein product [Ipomoea batatas]             303   1e-79
emb|CAN82037.1| hypothetical protein VITISV_033902 [Vitis vinifera]   303   1e-79
gb|ABE87589.2| RNA-directed DNA polymerase (Reverse transcriptas...   303   2e-79
emb|CAN82456.1| hypothetical protein VITISV_010028 [Vitis vinifera]   302   3e-79
emb|CAN77370.1| hypothetical protein VITISV_033119 [Vitis vinifera]   301   8e-79
emb|CAN74986.1| hypothetical protein VITISV_008771 [Vitis vinifera]   300   1e-78

>ref|XP_004309110.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria
            vesca subsp. vesca]
          Length = 872

 Score =  444 bits (1143), Expect = e-122
 Identities = 234/576 (40%), Positives = 351/576 (60%), Gaps = 3/576 (0%)
 Frame = +1

Query: 133  ANTVDNFRPIVMSNFIFKIITKILATRFARIVRRILSPTQFGFIPGRHIHDCIALVSEGF 312
            A++++ FRPI ++N +FKII KILA R + I  RI+SP Q  F+ GR+I DCI + SE F
Sbjct: 8    ADSIEQFRPITLTNLVFKIILKILALRLSSIASRIVSPQQHAFVVGRNISDCILVTSECF 67

Query: 313  NILHNRS-DSNMILKIDIRKAFDTLSWDFLLYVLQRFGFSEKFVAWISVILNSARISVLL 489
            N+L ++    N+ +K DI KAFDTLSWDFLL+VLQ FGF E FV  + V+L SAR+S+L+
Sbjct: 68   NLLDSKCYGGNVAIKTDITKAFDTLSWDFLLHVLQAFGFHESFVQ-VRVLLLSARLSLLI 126

Query: 490  NGSPVGYFACTRGVRQGDPLSPLLFCIAEEVLGKLISHLVSSHLLKPFMAPRNIVFPSSM 669
            NG   GYF+C +GVRQGDPLSPLLFC+AEEVL + IS LVSS  +K   +PR  + PS +
Sbjct: 127  NGRTYGYFSCGQGVRQGDPLSPLLFCLAEEVLSRGISMLVSSGQVKRIHSPRGTLSPSYV 186

Query: 670  LYADDIVILCAATSGNARYIFDTLGHYASLSGQVFNPVKSKVFFGTGVSHYIRHRIQTIM 849
            L+A D+++ C     N   +      Y S+SGQ+ N  KS+VF G    +  RH I   +
Sbjct: 187  LFAGDVIVFCRGNRQNLLRVMSFFYEYGSVSGQIINKDKSQVFIGK--HNRRRHSISDCL 244

Query: 850  GLSVGSFPTNYLGVPIFKGAPKHGVLRPLFDKIMVKFKRWKGSSLSLAGRVCLVNSIIAS 1029
            G+ +G+ P  YLG PIF G P+    + + DK+ +K   W GS LS+AGR+ L+ S+I S
Sbjct: 245  GIPLGTAPFMYLGAPIFHGKPRVAHFQAIVDKVRLKLSSWVGSFLSMAGRLQLIKSVIYS 304

Query: 1030 SLVHSMLIYKWPIALLKQLEKAMRNFIWTGDINQKGSVVVNWTRCCSPKNEGGLGVRSLI 1209
              V++  +Y+WP++LL+++E+  RNF+W+GDI+++G  +V+WT CC+P +EGGLG++ L 
Sbjct: 305  MFVYTFQVYEWPVSLLRKVERWCRNFLWSGDIDKRGIPLVSWTSCCAPIDEGGLGLKKLD 364

Query: 1210 SANRAFIMKMGWKLLTSDSIVFDTLRRRFLMDNGSIRRSFLPSSIWAGLRPTVEDCQTDS 1389
              N + ++K  W++ TS       +R RF     S RRS+ PSSIW G+R      Q ++
Sbjct: 365  VLNSSLLLKRCWEIFTSSFEGCCFIRNRF-----SKRRSYAPSSIWPGVRKFWGLVQNNT 419

Query: 1390 RWIPGAHSSVNFWTDNWLGYVIADRIGIPHEFRANFRNPISDFFFDNKWHLTMSFVEAYL 1569
            RW+ G    ++FW DN+LG  + +  G  H    +  + +SD+  +  W L         
Sbjct: 420  RWLVGTGDKISFWRDNFLGRPLIEFFG-NHGALNDNSSLVSDYIDNGSWVLPPLLQLNLS 478

Query: 1570 DIVRDIVRCPIA--PDSLDKRVWTRSVDGTVTSKSAYAFIRPSFPSVKWGSWIWSPYIPE 1743
             +   I + PI+  P   DK +W  S  G +T+K A+ F++ + P V WG  +WS +I  
Sbjct: 479  AVCNLICQVPISINPSMEDKLIWQASSTGELTAKQAFLFLQQASPVVPWGKPLWSKFILP 538

Query: 1744 RRTVVVWRAIFGRLTVMDVHRPKGFIGPTACCLCNS 1851
            R ++  W+ + G +    + + +G    + C  C +
Sbjct: 539  RMSLHAWKVMRGTVISYHLLQRRGVALVSRCEFCGN 574


>gb|ABD28670.2| RNA-directed DNA polymerase (Reverse transcriptase) [Medicago
            truncatula]
          Length = 642

 Score =  406 bits (1043), Expect = e-110
 Identities = 203/421 (48%), Positives = 278/421 (66%), Gaps = 1/421 (0%)
 Frame = +1

Query: 4    FYQSCWDIISNDVITAVRHFFQSGSLPDGLNSSFMVLIPKSKDANTVDNFRPIVMSNFIF 183
            F+Q  W+I+  DV  AV  FF++G LP+  N++ ++LIPK+ +A++VD +R I + NF F
Sbjct: 204  FFQIYWNIVKKDVYEAVLDFFKNGWLPNNFNANSIILIPKTPNADSVDQYRTIALVNFKF 263

Query: 184  KIITKILATRFARIVRRILSPTQFGFIPGRHIHDCIALVSEGFNILHNRS-DSNMILKID 360
            KII K+LA R A+I+  I+S  Q GF+ GR+I DCIAL SE  N+L N+S   N+ LKID
Sbjct: 264  KIINKVLADRLAKILPSIISKEQRGFVQGRNIRDCIALTSEAINVLDNKSFGGNLALKID 323

Query: 361  IRKAFDTLSWDFLLYVLQRFGFSEKFVAWISVILNSARISVLLNGSPVGYFACTRGVRQG 540
            + KAFDTL+WDFLL VL+ FGF+E F  WI  IL+S+++ + +NG+  G+F C RGVRQG
Sbjct: 324  VTKAFDTLNWDFLLLVLKTFGFNELFCNWIKTILHSSKMFISMNGAQHGFFNCNRGVRQG 383

Query: 541  DPLSPLLFCIAEEVLGKLISHLVSSHLLKPFMAPRNIVFPSSMLYADDIVILCAATSGNA 720
            DPLSPLLFCI EEVL + IS L    L+    A RN   P    Y DD+++ C A   + 
Sbjct: 384  DPLSPLLFCIVEEVLSRSISILADKGLIDLIAASRNNCLPFHCFYVDDLMVFCKAKMSSL 443

Query: 721  RYIFDTLGHYASLSGQVFNPVKSKVFFGTGVSHYIRHRIQTIMGLSVGSFPTNYLGVPIF 900
              +      YA  SGQ+ N  KS +F G G++    + I  I+G +VGS P  YLG PIF
Sbjct: 444  IVLKSLFTRYADCSGQIMNIRKSFIFAG-GITDTRMNNIVNILGFNVGSLPFTYLGAPIF 502

Query: 901  KGAPKHGVLRPLFDKIMVKFKRWKGSSLSLAGRVCLVNSIIASSLVHSMLIYKWPIALLK 1080
            KG PK    +P+ DK+  K  +WK S LS+AGR+ LV S++ S LVH+M IY WPI +LK
Sbjct: 503  KGKPKGIHFQPIADKVKAKLAKWKASLLSIAGRIQLVKSVVQSMLVHTMSIYSWPIKILK 562

Query: 1081 QLEKAMRNFIWTGDINQKGSVVVNWTRCCSPKNEGGLGVRSLISANRAFIMKMGWKLLTS 1260
            ++EK ++NFIW+GD+ ++  V V W + C+   EGGLGV+SLI  N A  +K+ W L+ S
Sbjct: 563  EMEKWIKNFIWSGDVTKRKMVTVAWRKICADYEEGGLGVKSLICLNEATNLKICWNLMQS 622

Query: 1261 D 1263
            D
Sbjct: 623  D 623


>ref|XP_004296004.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria
            vesca subsp. vesca]
          Length = 751

 Score =  374 bits (959), Expect = e-100
 Identities = 199/521 (38%), Positives = 298/521 (57%), Gaps = 3/521 (0%)
 Frame = +1

Query: 295  LVSEGFNILHNRS-DSNMILKIDIRKAFDTLSWDFLLYVLQRFGFSEKFVAWISVILNSA 471
            +VSEGFN+L  +  D N+ +K+DI KAFDTL+W FL+ VL RFGF  +F   + ++LNSA
Sbjct: 1    MVSEGFNLLDRKIVDGNVGIKVDIAKAFDTLNWQFLIEVLHRFGFGSRFTDLMLILLNSA 60

Query: 472  RISVLLNGSPVGYFACTRGVRQGDPLSPLLFCIAEEVLGKLISHLVSSHLLKPFMAPRNI 651
             +S+L+NGSP G+F+CT+GVRQGDPLSP+LFCIAEE L + ++ L SS  ++    PR  
Sbjct: 61   HLSILINGSPHGFFSCTKGVRQGDPLSPILFCIAEEALSRGLTALFSSKKVRSISLPRGC 120

Query: 652  VFPSSMLYADDIVILCAATSGNARYIFDTLGHYASLSGQVFNPVKSKVFFGTGVSHYIRH 831
               + +LYADD+ I C   + + R +   L +Y + SGQ+ N  KS  + G    H  RH
Sbjct: 121  SL-THVLYADDLFIFCRGDTKSLRQLQSFLDNYGAASGQLVNKDKSTFYLGASHFHR-RH 178

Query: 832  RIQTIMGLSVGSFPTNYLGVPIFKGAPKHGVLRPLFDKIMVKFKRWKGSSLSLAGRVCLV 1011
            +++ I+G  +G+ P +YLGVPIFKG P    L+ L DK   +   WKG  LS+AGRV LV
Sbjct: 179  QVKKILGFKLGTSPFSYLGVPIFKGKPCRKHLQALVDKAKARLAGWKGKLLSMAGRVQLV 238

Query: 1012 NSIIASSLVHSMLIYKWPIALLKQLEKAMRNFIWTGDINQKGSVVVNWTRCCSPKNEGGL 1191
            + +  S L+HS  IY W  +LL  L    RNFIW+GD+  +  V ++W + C+P+NE GL
Sbjct: 239  HDVFQSMLLHSFSIYLWATSLLSHLSACARNFIWSGDLAIRKLVTISWQQVCTPRNEAGL 298

Query: 1192 GVRSLISANRAFIMKMGWKLLTSDSIVFDTLRRRFLMDNGSIRRSFLPSSIWAGLRPTVE 1371
             +R+L +   A ++ + W+ L   S       RRF +    ++  +  SS+W GL+  + 
Sbjct: 299  DLRNLKALYTAGLISLAWQTLLQSSSWGSFACRRFTIFR-HMKFQYFTSSVWHGLKRVLP 357

Query: 1372 DCQTDSRWIPGAHSSVNFWTDNWLGYVIADRIGIPHEFRANFRNPISDFFFDNKWHLTMS 1551
                 SRWI G  +S+ FW+D WL   I  ++ +         + ++DF +D +W L   
Sbjct: 358  LLFEHSRWIIGDGNSILFWSDKWLHSSIIQQLNM-GSLSHLLNSRVADFIWDQQWALPSH 416

Query: 1552 FVEAYLDIVRDIVRCPI--APDSLDKRVWTRSVDGTVTSKSAYAFIRPSFPSVKWGSWIW 1725
            F   + D  + I+  P+   P+S D  +W  S  G  +    Y  +RP F  + W S +W
Sbjct: 417  FSNLFPDCAKQILEIPLPNTPES-DILIWEHSSSGIFSFSDGYELVRPYFEKLDWASSVW 475

Query: 1726 SPYIPERRTVVVWRAIFGRLTVMDVHRPKGFIGPTACCLCN 1848
              +IP R +V+ WR    +L   D  + +G    + C LC+
Sbjct: 476  HSFIPPRYSVLAWRIFHLKLPTDDQLQRRGIPFVSVCQLCS 516


>gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
          Length = 2215

 Score =  335 bits (858), Expect = 5e-89
 Identities = 201/621 (32%), Positives = 319/621 (51%), Gaps = 5/621 (0%)
 Frame = +1

Query: 4    FYQSCWDIISNDVITAVRHFFQSGSLPDGLNSSFMVLIPKSKDANTVDNFRPIVMSNFIF 183
            FYQ CWDII++D+  AV+ FF    +P G+ S+ +VLIPK+  A+    FRPI +   + 
Sbjct: 1304 FYQQCWDIIAHDLFEAVKEFFHGADIPQGMTSTTLVLIPKTTSASKWSEFRPISLCTVMN 1363

Query: 184  KIITKILATRFARIVRRILSPTQFGFIPGRHIHDCIALVSEGFNILHNRS-DSNMILKID 360
            KIITKILA R A+I+  I++  Q GF+ GR I D I L  E    L  ++   N+ LK+D
Sbjct: 1364 KIITKILANRLAKILPSIITENQSGFVGGRLISDNILLAQELIGKLDQKNRGGNVALKLD 1423

Query: 361  IRKAFDTLSWDFLLYVLQRFGFSEKFVAWISVILNSARISVLLNGSPVGYFACTRGVRQG 540
            + KA+D L W FL  VLQ  GF+ +++  I   +++   S+LLNG  VGYF   RG+RQG
Sbjct: 1424 MMKAYDRLDWSFLFKVLQHLGFNAQWIGMIQKCISNCWFSLLLNGRTVGYFKSERGLRQG 1483

Query: 541  DPLSPLLFCIAEEVLGKLISHLVSSHLLKPFMAPRNIVFPSSMLYADDIVILCAATSGNA 720
            D +SP LF +A E L + ++ L   +    + +  ++   S + +ADD++I    +    
Sbjct: 1484 DSISPQLFILAAEYLARGLNALYDQYPSLHYSSGCSLSV-SHLAFADDVIIFANGSKSAL 1542

Query: 721  RYIFDTLGHYASLSGQVFNPVKSKVFFGTGVSHYIRHRIQTIMGLSVGSFPTNYLGVPIF 900
            + I   L  Y  LSGQ  NP KS V   T ++   R  I    G S    P  YLG P++
Sbjct: 1543 QKIMAFLQEYEKLSGQRINPQKSCVVTHTNMASSRRQIILQATGFSHRPLPITYLGAPLY 1602

Query: 901  KGAPKHGVLRPLFDKIMVKFKRWKGSSLSLAGRVCLVNSIIASSLVHSMLIYKWPIALLK 1080
            KG  K  +   L  KI  +   W+  +LS  GR+ L+ S ++S  ++ + + K P+ +L+
Sbjct: 1603 KGHKKVMLFNDLVAKIEERITGWENKTLSPGGRITLLRSTLSSLPIYLLQVLKPPVIVLE 1662

Query: 1081 QLEKAMRNFIWTGDINQKGSVVVNWTRCCSPKNEGGLGVRSLISANRAFIMKMGWKLLTS 1260
            ++ + + NF+W G    K     +W +   P  EGGL +R++     AF MK+ W+  T+
Sbjct: 1663 RINRLLNNFLWGGSTASKRIHWASWGKIALPIAEGGLDIRNVEDVCEAFSMKLWWRFRTT 1722

Query: 1261 DSIVFDTLRRRFLMDNGSIRRSFLP----SSIWAGLRPTVEDCQTDSRWIPGAHSSVNFW 1428
            +S+    +R ++    G +     P    S  W  +       + + RW  G H  + FW
Sbjct: 1723 NSLWTQFMRAKYC--GGQLPTDVQPKLHDSQTWKRMVTISSITEQNIRWRIG-HGELFFW 1779

Query: 1429 TDNWLGYVIADRIGIPHEFRANFRNPISDFFFDNKWHLTMSFVEAYLDIVRDIVRCPIAP 1608
             D W+G    + +   ++  A+    +SDFF +N W++         ++V +IV+ PI  
Sbjct: 1780 HDCWMG---EEPLVNRNQAFASSMAQVSDFFLNNSWNVEKLKTVLQQEVVEEIVKIPIDT 1836

Query: 1609 DSLDKRVWTRSVDGTVTSKSAYAFIRPSFPSVKWGSWIWSPYIPERRTVVVWRAIFGRLT 1788
             S DK  WT + +G  ++KSA+  IR         ++IW   +P   +  +WR +   + 
Sbjct: 1837 SSNDKAYWTTTPNGDFSTKSAWQLIRNRKVENPVFNFIWHKSVPLTTSFFLWRLLHDWIP 1896

Query: 1789 VMDVHRPKGFIGPTACCLCNS 1851
            V    + KGF   + C  C S
Sbjct: 1897 VELKMKTKGFQLASRCRCCKS 1917


>gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
          Length = 2367

 Score =  323 bits (829), Expect = 1e-85
 Identities = 199/621 (32%), Positives = 314/621 (50%), Gaps = 5/621 (0%)
 Frame = +1

Query: 4    FYQSCWDIISNDVITAVRHFFQSGSLPDGLNSSFMVLIPKSKDANTVDNFRPIVMSNFIF 183
            FYQ CW+II++D++ AVR FF   ++P G+ S+ ++L+PK   A+   +FRPI +   + 
Sbjct: 1511 FYQQCWNIIAHDLLDAVRDFFHGANIPRGVTSTTLILLPKKPSASKWSDFRPISLCTVMN 1570

Query: 184  KIITKILATRFARIVRRILSPTQFGFIPGRHIHDCIALVSEGFNILHNRS-DSNMILKID 360
            KIITK+L+ R A+I+  I++  Q GF+ GR I D I L  E    L+ +S   N+ LK+D
Sbjct: 1571 KIITKLLSNRLAKILPSIITENQSGFVGGRLISDNILLAQELIGKLNTKSRGGNLALKLD 1630

Query: 361  IRKAFDTLSWDFLLYVLQRFGFSEKFVAWISVILNSARISVLLNGSPVGYFACTRGVRQG 540
            + KA+D L W FL+ VLQ FGF+++++  I   +++   S+LLNG   GYF   RG+RQG
Sbjct: 1631 MMKAYDRLDWSFLIKVLQHFGFNDQWIGMIQKCISNCWFSLLLNGRTEGYFKFERGLRQG 1690

Query: 541  DPLSPLLFCIAEEVLGKLISHLVSSHLLKPFMAPRNIVFPSSMLYADDIVILCAATSGNA 720
            DP+SP LF IA E L + ++ L   +    +    +I   S + +ADD++I    +    
Sbjct: 1691 DPISPQLFLIAAEYLSRGLNALYEQYPSLHYSTGVSIPV-SHLAFADDVLIFTNGSKSAL 1749

Query: 721  RYIFDTLGHYASLSGQVFNPVKSKVFFGTGVSHYIRHRIQTIMGLSVGSFPTNYLGVPIF 900
            + I   L  Y  +S Q  N  KS     T VS   R  I    G +    P  YLG P++
Sbjct: 1750 QRILAFLQEYEEISRQRINAQKSCFVTHTNVSSSRRQIIAQTTGFNHQLLPITYLGAPLY 1809

Query: 901  KGAPKHGVLRPLFDKIMVKFKRWKGSSLSLAGRVCLVNSIIASSLVHSMLIYKWPIALLK 1080
            KG  K  +   L  KI  +   W+   LS  GR+ L+ S++ S  ++   + K P+ +L+
Sbjct: 1810 KGHKKVILFNDLVAKIEERITGWENKILSPGGRITLLKSVLTSLPIYLFQVLKPPVCVLE 1869

Query: 1081 QLEKAMRNFIWTGDINQKGSVVVNWTRCCSPKNEGGLGVRSLISANRAFIMKMGWKLLTS 1260
            ++ +   +F+W G    K     +W +   P  EGGL +RSL     AF MK+ W+  T+
Sbjct: 1870 RINRIFNSFLWGGSAASKKIHWTSWAKISLPVKEGGLDIRSLAEVFEAFSMKLWWRFRTT 1929

Query: 1261 DSIVFDTLRRRFLMDNGSIRRSFLP----SSIWAGLRPTVEDCQTDSRWIPGAHSSVNFW 1428
            DS+    +R ++    G +     P    S  W  +  +    + + RW  G   ++ FW
Sbjct: 1930 DSLWTRFMRMKYC--RGQLPMHTQPKLHDSQTWKRMVASSAITEQNMRWRVG-QGNLFFW 1986

Query: 1429 TDNWLGYVIADRIGIPHEFRANFRNPISDFFFDNKWHLTMSFVEAYLDIVRDIVRCPIAP 1608
             D W+G      I   HEF  +    + DFF +N W +         ++V +I + PI  
Sbjct: 1987 HDCWMGE--TPLISSNHEFSLSMVQ-VCDFFMNNSWDIEKLKTVLQQEVVDEIAKIPIDA 2043

Query: 1609 DSLDKRVWTRSVDGTVTSKSAYAFIRPSFPSVKWGSWIWSPYIPERRTVVVWRAIFGRLT 1788
             S D+  W  + +G  ++KSA+  IR         ++IW   IP   +  +WR +   + 
Sbjct: 2044 MSKDEAYWAPTPNGEFSTKSAWQLIRKREVVNPVFNFIWHKAIPLTTSFFLWRLLHDWIP 2103

Query: 1789 VMDVHRPKGFIGPTACCLCNS 1851
            V    + KGF   + C  C S
Sbjct: 2104 VELRMKSKGFQLASRCRCCRS 2124


>gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
          Length = 2251

 Score =  320 bits (819), Expect = 2e-84
 Identities = 197/623 (31%), Positives = 316/623 (50%), Gaps = 7/623 (1%)
 Frame = +1

Query: 4    FYQSCWDIISNDVITAVRHFFQSGSLPDGLNSSFMVLIPKSKDANTVDNFRPIVMSNFIF 183
            FYQ CW+ I++D++ AVR FF   ++P G+ S+ +VL+PK   A+    FRPI +   + 
Sbjct: 1341 FYQQCWNTIAHDLLDAVRDFFHGANIPRGVTSTTLVLLPKKSSASKWSEFRPISLCTVMN 1400

Query: 184  KIITKILATRFARIVRRILSPTQFGFIPGRHIHDCIALVSEGFNILHNRS-DSNMILKID 360
            KIITK+L+ R A+I+  I++  Q GF+ GR I D I L  E    L  +S   N+ LK+D
Sbjct: 1401 KIITKLLSNRLAKILPSIITENQSGFVGGRLISDNILLAQELIRKLDTKSRGGNLALKLD 1460

Query: 361  IRKAFDTLSWDFLLYVLQRFGFSEKFVAWISVILNSARISVLLNGSPVGYFACTRGVRQG 540
            + KA+D L W FL+ VLQ FGF+E+++  I   +++   S+LLNG   GYF   RG+RQG
Sbjct: 1461 MMKAYDRLDWSFLIKVLQHFGFNEQWIGMIQKCISNCWFSLLLNGRIEGYFKSERGLRQG 1520

Query: 541  DPLSPLLFCIAEEVLGKLISHLVSSHLLKPFMA--PRNIVFPSSMLYADDIVILCAATSG 714
            D +SP LF +A E L + ++ L   +    + +  P ++   S + +ADD++I    +  
Sbjct: 1521 DSISPQLFILAAEYLSRGLNALYDQYPSLHYSSGVPLSV---SHLAFADDVLIFTNGSKS 1577

Query: 715  NARYIFDTLGHYASLSGQVFNPVKSKVFFGTGVSHYIRHRIQTIMGLSVGSFPTNYLGVP 894
              + I   L  Y  +SGQ  N  KS     T + +  R  I    G +    P  YLG P
Sbjct: 1578 ALQRILVFLQEYEEISGQRINAQKSCFVTHTNIPNSRRQIIAQATGFNHQLLPITYLGAP 1637

Query: 895  IFKGAPKHGVLRPLFDKIMVKFKRWKGSSLSLAGRVCLVNSIIASSLVHSMLIYKWPIAL 1074
            ++KG  K  +   L  KI  +   W+   LS  GR+ L+ S++AS  ++ + + K P+ +
Sbjct: 1638 LYKGHKKVILFNDLVAKIEERITGWENKILSPGGRITLLRSVLASLPIYLLQVLKPPVCV 1697

Query: 1075 LKQLEKAMRNFIWTGDINQKGSVVVNWTRCCSPKNEGGLGVRSLISANRAFIMKMGWKLL 1254
            L+++ +   +F+W G    K     +W +   P  EGGL +RSL     AF MK+ W+  
Sbjct: 1698 LERVNRLFNSFLWGGSAASKRIHWASWAKIALPVTEGGLDIRSLAEVFEAFSMKLWWRFR 1757

Query: 1255 TSDSIVFDTLRRRFLMDNGSIRRSFLP----SSIWAGLRPTVEDCQTDSRWIPGAHSSVN 1422
            T+DS+    +R ++    G +     P    S  W  +  +    +   RW  G   +V 
Sbjct: 1758 TTDSLWTRFMRMKYC--RGQLPMQTQPKLHDSQTWKRMLTSSTITEQHMRWRVG-QGNVF 1814

Query: 1423 FWTDNWLGYVIADRIGIPHEFRANFRNPISDFFFDNKWHLTMSFVEAYLDIVRDIVRCPI 1602
            FW D W+G   A  I    EF ++    + DFF +N W++         ++V +I + PI
Sbjct: 1815 FWHDCWMGE--APLISSNQEFTSSMVQ-VCDFFTNNSWNIEKLKTVLQQEVVDEIAKIPI 1871

Query: 1603 APDSLDKRVWTRSVDGTVTSKSAYAFIRPSFPSVKWGSWIWSPYIPERRTVVVWRAIFGR 1782
               + D+  WT + +G  ++KSA+  IR         ++IW   +P   +  +WR +   
Sbjct: 1872 DTMNKDEAYWTPTPNGDFSTKSAWQLIRKRKVVNPVFNFIWHKTVPLTTSFFLWRLLHDW 1931

Query: 1783 LTVMDVHRPKGFIGPTACCLCNS 1851
            + V    + KG    + C  C S
Sbjct: 1932 IPVELKMKSKGLQLASRCRCCKS 1954


>gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score =  311 bits (798), Expect = 5e-82
 Identities = 192/621 (30%), Positives = 309/621 (49%), Gaps = 5/621 (0%)
 Frame = +1

Query: 4    FYQSCWDIISNDVITAVRHFFQSGSLPDGLNSSFMVLIPKSKDANTVDNFRPIVMSNFIF 183
            FYQ CW II+ D++ AVR FF+    P G+ S+ +VL+ K  DA T  +FRPI +   + 
Sbjct: 425  FYQQCWPIIAEDLLAAVRDFFKGAVFPRGVTSTTLVLLAKKPDAATWSDFRPISLCTILN 484

Query: 184  KIITKILATRFARIVRRILSPTQFGFIPGRHIHDCIALVSEGFN-ILHNRSDSNMILKID 360
            KI+TK+LA R ++++  ++S  Q GF+ GR I+D I L  E    I +     N++LK+D
Sbjct: 485  KIVTKLLANRLSKVLPSLISENQSGFVSGRLINDNILLAQELIGKIDYKARGGNVVLKLD 544

Query: 361  IRKAFDTLSWDFLLYVLQRFGFSEKFVAWISVILNSARISVLLNGSPVGYFACTRGVRQG 540
            + KA+D L+WDFL+ VL+RFGF++ ++  I   + +   SVL+NG   GYF   RG+RQG
Sbjct: 545  MMKAYDRLNWDFLILVLERFGFNDMWIDMIRRCITNCWFSVLINGHSAGYFKSERGLRQG 604

Query: 541  DPLSPLLFCIAEEVLGKLISHLVSSHLLKPFMAPRNIVFPSSMLYADDIVILCAATSGNA 720
            D +SP+LF +A E L + I+ L S ++   + +  ++   S + +ADDI+I    +    
Sbjct: 605  DSISPMLFILAAEYLSRGINELFSRYISLHYHSGCSLNI-SHLAFADDIMIFTNGSKSVL 663

Query: 721  RYIFDTLGHYASLSGQVFNPVKSKVFFGTGVSHYIRHRIQTIMGLSVGSFPTNYLGVPIF 900
              I + L  Y  +SGQ  N  KS       +    R  I   +G    + P  YLG P+F
Sbjct: 664  EKILEFLQEYEQISGQRVNHQKSCFVTANNMPSSRRQIISQTIGFLHKTLPITYLGAPLF 723

Query: 901  KGAPKHGVLRPLFDKIMVKFKRWKGSSLSLAGRVCLVNSIIASSLVHSMLIYKWPIALLK 1080
            KG  K  +   L +KI  +   W+   LS  GR+ L+ S+++S  ++ + + K P  +++
Sbjct: 724  KGPKKVMLFDSLINKIRERITGWENKILSPGGRITLLRSVLSSMPIYLLQVLKPPACVIQ 783

Query: 1081 QLEKAMRNFIWTGDINQKGSVVVNWTRCCSPKNEGGLGVRSLISANRAFIMKMGWKLLTS 1260
            ++E+   +F+W   ++        W     P +EGGLG+RSL  +  AF  K+ W+  T 
Sbjct: 784  KIERLFNSFLWGSSMDSTRIHWTAWHNITFPSSEGGLGIRSLKDSFDAFSAKLWWRFDTC 843

Query: 1261 DSIVFDTLRRRFLMDNGSIRRSFLP----SSIWAGLRPTVEDCQTDSRWIPGAHSSVNFW 1428
             S+    +R ++    G I  +  P    S+ W  L           RW  G    + FW
Sbjct: 844  QSLWVRYMRLKYC--TGQIHHNIAPKPHDSATWKPLLAGRATASQQIRWRIG-KGDIFFW 900

Query: 1429 TDNWLGYVIADRIGIPHEFRANFRNPISDFFFDNKWHLTMSFVEAYLDIVRDIVRCPIAP 1608
             D W+G    + +       +     ++ FF D+ W +          IV +I++ PI+ 
Sbjct: 901  HDAWMG---DEPLVNSFPSFSQSMMKVNYFFNDDAWDVDKLKTFIPNAIVEEILKIPISR 957

Query: 1609 DSLDKRVWTRSVDGTVTSKSAYAFIRPSFPSVKWGSWIWSPYIPERRTVVVWRAIFGRLT 1788
            +  D   W  + +G  + KSA+  +R        G  IW   IP   +  +WR +   L 
Sbjct: 958  EKEDIAYWALTANGDFSIKSAWELLRQRKQVNLVGQLIWHKSIPLTVSFFLWRTLHNWLP 1017

Query: 1789 VMDVHRPKGFIGPTACCLCNS 1851
            V    + KG    + C  C S
Sbjct: 1018 VEVRMKAKGIQLASKCLCCKS 1038


>gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
          Length = 2249

 Score =  310 bits (793), Expect = 2e-81
 Identities = 194/621 (31%), Positives = 309/621 (49%), Gaps = 5/621 (0%)
 Frame = +1

Query: 4    FYQSCWDIISNDVITAVRHFFQSGSLPDGLNSSFMVLIPKSKDANTVDNFRPIVMSNFIF 183
            FYQ CW+II+ D++ AVR FF   ++P G+ S+ ++L+PK   A+   +FRPI +   + 
Sbjct: 1339 FYQQCWNIIAQDLLDAVRDFFHGANIPRGVTSTTLILLPKKSSASKWSDFRPISLCTVMN 1398

Query: 184  KIITKILATRFARIVRRILSPTQFGFIPGRHIHDCIALVSEGFNILHNRS-DSNMILKID 360
            KIITK+L+ R A+++  I++  Q GF+ GR I D I L  E    L+ +S   N+ LK+D
Sbjct: 1399 KIITKLLSNRLAKVLPSIITENQSGFVGGRLISDNILLAQELIGKLNTKSRGGNLALKLD 1458

Query: 361  IRKAFDTLSWDFLLYVLQRFGFSEKFVAWISVILNSARISVLLNGSPVGYFACTRGVRQG 540
            + KA+D L W FL  VLQ FGF+ +++  I   +++   S+LLNG   GYF   RG+RQG
Sbjct: 1459 MMKAYDKLDWSFLFKVLQHFGFNGQWIKMIQKCISNCWFSLLLNGRTEGYFKSERGLRQG 1518

Query: 541  DPLSPLLFCIAEEVLGKLISHLVSSHLLKPFMAPRNIVFPSSMLYADDIVILCAATSGNA 720
            D +SP LF IA E L + ++ L   +    + +  +I   S + +ADD++I    +    
Sbjct: 1519 DSISPQLFIIAAEYLSRGLNALYDQYPSLHYSSGVSISV-SHLAFADDVLIFTNGSKSAL 1577

Query: 721  RYIFDTLGHYASLSGQVFNPVKSKVFFGTGVSHYIRHRIQTIMGLSVGSFPTNYLGVPIF 900
            + I   L  Y  +SGQ  N  KS     T VS   R  I    G S       YLG P++
Sbjct: 1578 QRILAFLQEYQEISGQRINVQKSCFVTHTNVSSSRRQIIAQTTGFSHQLLLITYLGAPLY 1637

Query: 901  KGAPKHGVLRPLFDKIMVKFKRWKGSSLSLAGRVCLVNSIIASSLVHSMLIYKWPIALLK 1080
            KG  K  +   L  KI  +   W+   LS  GR+ L+ S++AS  ++ + + K PI +L+
Sbjct: 1638 KGHKKVILFNDLVAKIEERITGWENKILSPGGRITLLRSVLASLPIYLLQVLKPPICVLE 1697

Query: 1081 QLEKAMRNFIWTGDINQKGSVVVNWTRCCSPKNEGGLGVRSLISANRAFIMKMGWKLLTS 1260
            ++ +   +F+W G    K     +W +   P  EGGL +R+L     AF MK+ W+  T 
Sbjct: 1698 RVNRIFNSFLWGGSAASKKIHWASWAKISLPIKEGGLDIRNLAEVFEAFSMKLWWRFRTI 1757

Query: 1261 DSIVFDTLRRRFLMDNGSIRRSFLP----SSIWAGLRPTVEDCQTDSRWIPGAHSSVNFW 1428
            DS+    +R ++    G +     P    S  W  +       + + RW  G    + FW
Sbjct: 1758 DSLWTRFMRMKYC--RGQLPMHTQPKLHDSQTWKRMVANSAITEQNMRWRVG-QGKLFFW 1814

Query: 1429 TDNWLGYVIADRIGIPHEFRANFRNPISDFFFDNKWHLTMSFVEAYLDIVRDIVRCPIAP 1608
             D W+G      +   ++  +     + DFF +N W +         ++V +I + PI  
Sbjct: 1815 HDCWMG---ETPLTSSNQELSLSMVQVCDFFMNNSWDIEKLKTVLQQEVVDEIAKIPIDA 1871

Query: 1609 DSLDKRVWTRSVDGTVTSKSAYAFIRPSFPSVKWGSWIWSPYIPERRTVVVWRAIFGRLT 1788
             S D+  W  + +G  ++KSA+  IR         ++IW   +P   +  +WR +   + 
Sbjct: 1872 MSKDEAYWAPTPNGEFSTKSAWQLIRKREVVNPVFNFIWHKTVPLTISFFLWRLLHDWIP 1931

Query: 1789 VMDVHRPKGFIGPTACCLCNS 1851
            V    + KGF   + C  C S
Sbjct: 1932 VELKMKSKGFQLASRCRCCKS 1952


>gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]
          Length = 1954

 Score =  308 bits (788), Expect = 7e-81
 Identities = 199/625 (31%), Positives = 318/625 (50%), Gaps = 9/625 (1%)
 Frame = +1

Query: 4    FYQSCWDIISNDVITAVRHFFQSGSLPDGLNSSFMVLIPKSKDANTVDNFRPIVMSNFIF 183
            FYQ CWDII  D++ AV  FF    +P G+ S+ +VL+PK  ++    +FRPI +   + 
Sbjct: 1044 FYQHCWDIIKQDLLEAVLDFFNGTPMPQGVTSTTLVLLPKKPNSCQWSDFRPISLCTVLN 1103

Query: 184  KIITKILATRFARIVRRILSPTQFGFIPGRHIHDCIALVSEGFNILHNRS-DSNMILKID 360
            KI+TK LA R ++I+  I+S  Q GF+ GR I D I L  E    L  ++   N++LK+D
Sbjct: 1104 KIVTKTLANRLSKILPSIISENQSGFVNGRLISDNILLAQELVGKLDAKARGGNVVLKLD 1163

Query: 361  IRKAFDTLSWDFLLYVLQRFGFSEKFVAWISVILNSARISVLLNGSPVGYFACTRGVRQG 540
            + KA+D L+WDFL  ++++FGF++++++ I   +++   S+L+NGS VGYF   RG+RQG
Sbjct: 1164 MAKAYDRLNWDFLYLMMKQFGFNDRWISMIKACISNCWFSLLINGSLVGYFKSERGLRQG 1223

Query: 541  DPLSPLLFCIAEEVLGKLISHLVSSHLLKPFMAPRNIVFPSSML-YADDIVILCAATSGN 717
            D +SPLLF +A + L + I+ L + H  K  +       P S L +ADDIVI        
Sbjct: 1224 DSISPLLFVLAADYLSRGINQLFNRH--KSLLYLSGCFMPISHLAFADDIVIFTNGCRPA 1281

Query: 718  ARYIFDTLGHYASLSGQVFNPVKSKVFFGTGVSHYIRHRIQTIMGLSVGSFPTNYLGVPI 897
             + I   L  Y  +SGQ  N  KS      G     R  I    G    + P  YLG P+
Sbjct: 1282 LQKILVFLQEYEEVSGQQVNHQKSCFITANGCPMTRRQIIAHTTGFQHKTLPVIYLGAPL 1341

Query: 898  FKGAPKHGVLRPLFDKIMVKFKRWKGSSLSLAGRVCLVNSIIASSLVHSMLIYKWPIALL 1077
             KG  K  +   L  KI  +   W+  +LS  GR+ L+ S+++S  ++ + + K P+ ++
Sbjct: 1342 HKGPKKVTLFDSLITKIRDRISGWENKTLSPGGRITLLRSVLSSLPLYLLQVLKPPVVVI 1401

Query: 1078 KQLEKAMRNFIWTGDINQKGSVVVNWTRCCSPKNEGGLGVRSLISANRAFIMKMGWKLLT 1257
            +++E+   +F+W    N K      W +   P +EGGL +R L     AF +K+ W+  T
Sbjct: 1402 EKIERLFNSFLWGDSTNDKRIHWAAWHKLTFPCSEGGLDIRRLTDMFDAFSLKLWWRFST 1461

Query: 1258 SDSIVFDTLRRRFLMDNGSIRRSFLP----SSIWAGLRPTVEDCQTDSRWIPGAHSSVNF 1425
             + +    L+ ++ M  G I     P    S +W  +    E    ++RW  G   S+ F
Sbjct: 1462 CEGLWTKFLKTKYCM--GQIPHYVHPKLHDSQVWKRMVRGREVAIQNTRWRIG-KGSLFF 1518

Query: 1426 WTDNWLGYVIADR---IGIPHEFRANFRNPISDFFFDNKWHLTMSFVEAYLDIVRDIVRC 1596
            W D W+G    D+      PH FR N  + + +FF  + W +    +   +++V +I++ 
Sbjct: 1519 WHDCWMG----DQPLVTSFPH-FR-NDMSTVHNFFNGHNWDVDKLNLYLPMNLVDEILQI 1572

Query: 1597 PIAPDSLDKRVWTRSVDGTVTSKSAYAFIRPSFPSVKWGSWIWSPYIPERRTVVVWRAIF 1776
            PI     D   W+ + +G  +++SA+  IR         S +W   IP   +  +WR   
Sbjct: 1573 PIDRSQDDVAYWSLTSNGEFSTRSAWEAIRLRKSPNVLCSLLWHKSIPLSISFFLWRVFH 1632

Query: 1777 GRLTVMDVHRPKGFIGPTACCLCNS 1851
              + V    + KGF   + C  CNS
Sbjct: 1633 NWIPVDIRLKEKGFHLASKCICCNS 1657


>dbj|BAE79385.1| unnamed protein product [Ipomoea batatas]
          Length = 1366

 Score =  307 bits (786), Expect = 1e-80
 Identities = 194/627 (30%), Positives = 307/627 (48%), Gaps = 13/627 (2%)
 Frame = +1

Query: 4    FYQSCWDIISNDVITAVRHFFQSGSLPDGLNSSFMVLIPKSKDANTVDNFRPIVMSNFIF 183
            FYQ  W  +   +   V H F++GS       +FM LIPK     T  +FRPI + N  F
Sbjct: 460  FYQQFWGEVGPAMTDMVNHAFENGSTYISQLQAFMTLIPKKDTPETAADFRPITLLNVSF 519

Query: 184  KIITKILATRFARIVRRILSPTQFGFIPGRHIHDCIALVSEGFNILHN--RSDSNMILKI 357
            K+I+K+L  R   I+  I+ P Q  F+PGR   D + L  E  + ++N  R    MILK+
Sbjct: 520  KVISKVLVNRLRPIMSNIIGPHQNSFLPGRSTMDNVILTQEVVHSMNNPRRKKKQMILKV 579

Query: 358  DIRKAFDTLSWDFLLYVLQRFGFSEKFVAWISVILNSARISVLLNGSPVGYFACTRGVRQ 537
            D++KA+D++SWD+L   L+ FGF  + +  I   L  + +++L NG  +  F   RG+RQ
Sbjct: 580  DLQKAYDSVSWDYLEETLEDFGFPRRLIDLILFSLQESSLAILWNGGRLPPFKPGRGLRQ 639

Query: 538  GDPLSPLLFCIAEEVLGKLISHLVSSHLLKPFMAPRNIVFPSSMLYADDIVILCAATSGN 717
            GDPL+P LF +  E L   I   V++   KP    R     S + +ADD+++   A+   
Sbjct: 640  GDPLAPYLFNLVMERLAHDIQTRVNARTWKPVHITRGGTGISHLFFADDLMLFGEASEHQ 699

Query: 718  ARYIFDTLGHYASLSGQVFNPVKSKVFFGTGVSHYIRHRIQTIMGLSVGSFPTNYLGVPI 897
            A+ +FD L  +++ SG   N  KS +F  + V+  ++  I +I+ + V      YLG+P+
Sbjct: 700  AQIMFDCLDSFSNASGLKVNFSKSLLFCSSNVNAGLKRAIGSILQVPVAESLGTYLGIPM 759

Query: 898  FKGAPKHGVLRPLFDKIMVKFKRWKGSSLSLAGRVCLVNSIIASSLVHSMLIYKWPIALL 1077
             K          + DK+  K   WK SSL++AGR  LV + +A+   ++M +   P++  
Sbjct: 760  LKERVSRNTFNAVIDKMRTKLSSWKASSLNMAGRRVLVQASLATVPTYTMQVMALPVSTC 819

Query: 1078 KQLEKAMRNFIWTGDINQKGSVVVNWTRCCSPKNEGGLGVRSLISANRAFIMKMGWKLLT 1257
             +++K  RNF+W  D N +    VNW   C P+NEGGLG+R     NRAF+ KM W++ +
Sbjct: 820  NEIDKTCRNFLWGHDTNTRKLHSVNWAEICKPRNEGGLGLRMARDFNRAFLTKMAWQIFS 879

Query: 1258 S-DSIVFDTLRRRFLMDNGSIRRSFLPSSIWAGLRPTVEDCQTDS---RWIPGAHSSVNF 1425
            + D +    LR +++ +   +      +  W G R  ++     +   +W  G    +NF
Sbjct: 880  NIDKLWVKVLREKYVKNADFLHLQSQSNCSW-GWRSIMKGKDVLAGAIKWNVGNGRKINF 938

Query: 1426 WTDNWLG----YVIADRIGIPHEFRANFRNPISDFFFDNKWHLTMSFVEAYLDIVRDIVR 1593
            W D W+G        D I  PH       + I+      +W  T +        + D+VR
Sbjct: 939  WNDWWVGDGPLASNTDCINQPHMTDIKVEDLITS---QRRWD-TGALHNILPTNMIDMVR 994

Query: 1594 C-PIAPDS--LDKRVWTRSVDGTVTSKSAYAFIRPSFPSVKWGSWIWSPYIPERRTVVVW 1764
              PIA +S   D   W  S  G VT  SAY+ I       +   WIW     E+  + +W
Sbjct: 995  ATPIAINSEQEDFLSWPHSTTGMVTVSSAYSLIAGHDGDDRSHDWIWRATCTEKIKLFMW 1054

Query: 1765 RAIFGRLTVMDVHRPKGFIGPTACCLC 1845
            + +   L V    + +G     +C +C
Sbjct: 1055 KIVKNGLMVNVERKRRGLADAASCPVC 1081


>dbj|BAE79382.1| unnamed protein product [Ipomoea batatas]
          Length = 1366

 Score =  306 bits (785), Expect = 2e-80
 Identities = 194/627 (30%), Positives = 307/627 (48%), Gaps = 13/627 (2%)
 Frame = +1

Query: 4    FYQSCWDIISNDVITAVRHFFQSGSLPDGLNSSFMVLIPKSKDANTVDNFRPIVMSNFIF 183
            FYQ  W  +   +   V H F++GS       +FM LIPK     T  +FRPI + N  F
Sbjct: 460  FYQQFWGEVGPAMTDMVNHAFENGSTYISQLQAFMTLIPKKDTPETAADFRPITLLNASF 519

Query: 184  KIITKILATRFARIVRRILSPTQFGFIPGRHIHDCIALVSEGFNILHN--RSDSNMILKI 357
            K+I+K+L  R   I+  I+ P Q  F+PGR   D + L  E  + ++N  R    MILK+
Sbjct: 520  KVISKVLVNRLRPIMSNIIGPHQNSFLPGRSTMDNVILTQEVVHSMNNPRRKKKQMILKV 579

Query: 358  DIRKAFDTLSWDFLLYVLQRFGFSEKFVAWISVILNSARISVLLNGSPVGYFACTRGVRQ 537
            D++KA+D++SWD+L   L+ FGF  + +  I   L  + +++L NG  +  F   RG+RQ
Sbjct: 580  DLQKAYDSVSWDYLEETLEDFGFPRRLIDLILFSLQESSLAILWNGGRLPPFKPGRGLRQ 639

Query: 538  GDPLSPLLFCIAEEVLGKLISHLVSSHLLKPFMAPRNIVFPSSMLYADDIVILCAATSGN 717
            GDPL+P LF +  E L   I   V++   KP    R     S + +ADD+++   A+   
Sbjct: 640  GDPLAPYLFNLVMERLAHDIQTRVNARTWKPVHITRGGTGISHLFFADDLMLFGEASEHQ 699

Query: 718  ARYIFDTLGHYASLSGQVFNPVKSKVFFGTGVSHYIRHRIQTIMGLSVGSFPTNYLGVPI 897
            A+ +FD L  +++ SG   N  KS +F  + V+  ++  I +I+ + V      YLG+P+
Sbjct: 700  AQIMFDCLDSFSNASGLKVNFSKSLLFCSSNVNAGLKRAIGSILQVPVAESLGTYLGIPM 759

Query: 898  FKGAPKHGVLRPLFDKIMVKFKRWKGSSLSLAGRVCLVNSIIASSLVHSMLIYKWPIALL 1077
             K          + DK+  K   WK SSL++AGR  LV + +A+   ++M +   P++  
Sbjct: 760  LKERVSRNTFNAVIDKMRTKLSSWKASSLNMAGRRVLVQASLATVPTYTMQVMALPVSTC 819

Query: 1078 KQLEKAMRNFIWTGDINQKGSVVVNWTRCCSPKNEGGLGVRSLISANRAFIMKMGWKLLT 1257
             +++K  RNF+W  D N +    VNW   C P+NEGGLG+R     NRAF+ KM W++ +
Sbjct: 820  NEIDKTCRNFLWGHDTNTRKLHSVNWAEICKPRNEGGLGLRMARDFNRAFLTKMAWQIFS 879

Query: 1258 S-DSIVFDTLRRRFLMDNGSIRRSFLPSSIWAGLRPTVEDCQTDS---RWIPGAHSSVNF 1425
            + D +    LR +++ +   +      +  W G R  ++     +   +W  G    +NF
Sbjct: 880  NIDKLWVKVLREKYVKNADFLHLQSQSNCSW-GWRSIMKGKDVLAGAIKWNVGNGRKINF 938

Query: 1426 WTDNWLG----YVIADRIGIPHEFRANFRNPISDFFFDNKWHLTMSFVEAYLDIVRDIVR 1593
            W D W+G        D I  PH       + I+      +W  T +        + D+VR
Sbjct: 939  WNDWWVGDGPLASNTDCINQPHMTDIKVEDLITS---QRRWD-TGALHNILPTNMIDMVR 994

Query: 1594 C-PIAPDS--LDKRVWTRSVDGTVTSKSAYAFIRPSFPSVKWGSWIWSPYIPERRTVVVW 1764
              PIA +S   D   W  S  G VT  SAY+ I       +   WIW     E+  + +W
Sbjct: 995  ATPIAINSEQEDFLSWPHSTTGMVTVSSAYSLIAGHDGDDRSHDWIWRATCTEKIKLFMW 1054

Query: 1765 RAIFGRLTVMDVHRPKGFIGPTACCLC 1845
            + +   L V    + +G     +C +C
Sbjct: 1055 KIVKNGLMVNVERKRRGLADAASCPVC 1081


>gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
          Length = 2127

 Score =  305 bits (782), Expect = 3e-80
 Identities = 197/626 (31%), Positives = 315/626 (50%), Gaps = 10/626 (1%)
 Frame = +1

Query: 4    FYQSCWDIISNDVITAVRHFFQSGSLPDGLNSSFMVLIPKSKDANTVDNFRPIVMSNFIF 183
            FYQ CWDII ND++ AV  FF+   LP G+ S+ +VL+PK  +A     +RPI +   + 
Sbjct: 1218 FYQHCWDIIKNDLLDAVLDFFRGSPLPRGVTSTTLVLLPKKPNACHWSEYRPISLCTVLN 1277

Query: 184  KIITKILATRFARIVRRILSPTQFGFIPGRHIHDCIALVSEGFNILHNRS-DSNMILKID 360
            KI+TK+LA R ++I+  I+S  Q GF+ GR I D I L  E    +  +S   N++LK+D
Sbjct: 1278 KIVTKLLANRLSKILPSIISENQSGFVNGRLISDNILLAQELIGKIDAKSRGGNVVLKLD 1337

Query: 361  IRKAFDTLSWDFLLYVLQRFGFSEKFVAWISVILNSARISVLLNGSPVGYFACTRGVRQG 540
            + KA+D L+WDFL  +++ FGF+  ++  I   +++   S+L+NGS  GYF   RG+RQG
Sbjct: 1338 MAKAYDRLNWDFLYLMMEHFGFNAHWINMIKSCISNCWFSLLINGSLAGYFKSERGLRQG 1397

Query: 541  DPLSPLLFCIAEEVLGKLISHLVSSHLLKPFMAPRNIVFPSSMLYADDIVILCAATSGNA 720
            D +SP+LF +A + L + ++HL S +    +++   +   S + +ADDIVI         
Sbjct: 1398 DSISPMLFILAADYLSRGLNHLFSCYSSLQYLSGCQMPI-SHLSFADDIVIFTNGGRSAL 1456

Query: 721  RYIFDTLGHYASLSGQVFNPVKSKVFFGTGVSHYIRHRIQTIMGLSVGSFPTNYLGVPIF 900
            + I   L  Y  +SGQ  N  KS      G S   R  I    G    + P  YLG P+ 
Sbjct: 1457 QKILSFLQEYEQVSGQKVNHQKSCFITANGCSLSRRQIISHTTGFQHKTLPVTYLGAPLH 1516

Query: 901  KGAPKHGVLRPLFDKIMVKFKRWKGSSLSLAGRVCLVNSIIASSLVHSMLIYKWPIALLK 1080
            KG  K  +   L  KI  +   W+   LS  GR+ L+ S+++S  ++ + + K P+ +++
Sbjct: 1517 KGPKKVLLFDSLISKIRDRISGWENKILSPGGRITLLRSVLSSLPMYLLQVLKPPVTVIE 1576

Query: 1081 QLEKAMRNFIWTGDINQKGSVVVNWTRCCSPKNEGGLGVRSLISANRAFIMKMGWKLLTS 1260
            ++++   +F+W      K      W +   P  EGGLG+R L     AF +K+ W+  T 
Sbjct: 1577 RIDRLFNSFLWGDSTECKKMHWAEWAKISFPCAEGGLGIRKLEDVCAAFTLKLWWRFQTG 1636

Query: 1261 DSIVFDTLRRRFLMDNGSIRRSFLP----SSIWAGLRPTVEDCQTDSRWIPGAHSSVNFW 1428
            +S+    LR ++ +  G I     P    S +W  +    E    + RW  G    + FW
Sbjct: 1637 NSLWTQFLRTKYCL--GRIPHHIQPKLHDSHVWKRMISGREMALQNIRWKIG-KGDLFFW 1693

Query: 1429 TDNWLGYVIADRIGIPHEFRANFRNPIS---DFFFDNKWHL--TMSFVEAYLDIVRDIVR 1593
             D W+G    D+  +   F   F+N +S    F+  + W +    SF+   L  V +I++
Sbjct: 1694 HDCWMG----DK-PLAASF-PEFQNDMSHGYHFYNGDTWDVDKLRSFLPTIL--VEEILQ 1745

Query: 1594 CPIAPDSLDKRVWTRSVDGTVTSKSAYAFIRPSFPSVKWGSWIWSPYIPERRTVVVWRAI 1773
             P      D   WT + +G  +++SA+  IR    S    S+IW   IP   +  +W+ +
Sbjct: 1746 VPFDKSREDVAYWTLTSNGDFSTRSAWEMIRQRQTSNALCSFIWHRSIPLSISFFLWKTL 1805

Query: 1774 FGRLTVMDVHRPKGFIGPTACCLCNS 1851
               + V    + KG    + C  CNS
Sbjct: 1806 HNWIPVELRMKEKGIQLASKCVCCNS 1831


>gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
          Length = 2214

 Score =  305 bits (781), Expect = 4e-80
 Identities = 200/628 (31%), Positives = 323/628 (51%), Gaps = 12/628 (1%)
 Frame = +1

Query: 4    FYQSCWDIISNDVITAVRHFFQSGSLPDGLNSSFMVLIPKSKDANTVDNFRPIVMSNFIF 183
            FYQ CWDII  D+  AV  FF+   LP G+ S+ +VL+PK+++ +    FRPI +   + 
Sbjct: 1305 FYQHCWDIIKQDLFEAVLDFFKGSPLPRGITSTTLVLLPKTQNVSQWSEFRPISLCTVLN 1364

Query: 184  KIITKILATRFARIVRRILSPTQFGFIPGRHIHDCIALVSEGFNILHNRS-DSNMILKID 360
            KI+TK+LA R ++I+  I+S  Q GF+ GR I D I L  E  + ++ RS   N++LK+D
Sbjct: 1365 KIVTKLLANRLSKILPSIISENQSGFVNGRLISDNILLAQELVDKINARSRGGNVVLKLD 1424

Query: 361  IRKAFDTLSWDFLLYVLQRFGFSEKFVAWISVILNSARISVLLNGSPVGYFACTRGVRQG 540
            + KA+D L+W+FL  ++++FGF+  ++  I   +++   S+L+NGS VGYF   RG+RQG
Sbjct: 1425 MAKAYDRLNWEFLYLMMEQFGFNALWINMIKACISNCWFSLLINGSLVGYFKSERGLRQG 1484

Query: 541  DPLSPLLFCIAEEVLGKLISHLVSSHLLKPFMAPRNIVFPSSMLYADDIVILCAATSGNA 720
            D +SP LF +A E L + ++ L S +    +++  ++   S + +ADDIVI         
Sbjct: 1485 DSISPSLFILAAEYLSRGLNQLFSRYNSLHYLSGCSMSV-SHLAFADDIVIFTNGCHSAL 1543

Query: 721  RYIFDTLGHYASLSGQVFNPVKSKVFFGTGVSHYIRHRIQTIMGLSVGSFPTNYLGVPIF 900
            + I   L  Y  +SGQ  N  KS      G     R  I  + G    + P  YLG P+ 
Sbjct: 1544 QKILVFLQEYEQVSGQQVNHQKSCFITANGCPLSRRQIIAQVTGFQHKTLPVTYLGAPLH 1603

Query: 901  KGAPKHGVLRPLFDKIMVKFKRWKGSSLSLAGRVCLVNSIIASSLVHSMLIYKWPIALLK 1080
            KG  K  +   L  KI  +   W+   LS   R+ L+ S+++S  ++ + + K P  +++
Sbjct: 1604 KGPKKVFLFDSLISKIRDRISGWENKILSPGSRITLLRSVLSSLPMYLLQVLKPPAIVIE 1663

Query: 1081 QLEKAMRNFIWTGDINQ-KGSVVVNWTRCCSPKNEGGLGVRSLISANRAFIMKMGWKLLT 1257
            ++E+   +F+W GD N+ K      W +   P +EGGL +R+L     AF +K+ W+  T
Sbjct: 1664 KIERLFNSFLW-GDSNEGKRMHWAAWNKINFPCSEGGLDIRNLKDVFDAFTLKLWWRFYT 1722

Query: 1258 SDSIVFDTLRRRFLMDNGSIRRSFLP----SSIWAGLRPTVEDCQTDSRWIPGAHSSVNF 1425
             DS+    L+ ++ +  G I     P    SSIW  +    +    ++RW  G    + F
Sbjct: 1723 CDSLWTLFLKTKYCL--GRIPHYVQPKIHSSSIWKRITGGRDVTIQNTRWKIG-RGELFF 1779

Query: 1426 WTDNWLGYVIADR---IGIPHEFRANFRNPIS---DFFFDNKWHLTMSFVEAYLDIVRDI 1587
            W D W+G    D+   I  P     +FRN +S    F+  + W +    +   ++++ +I
Sbjct: 1780 WHDCWMG----DQPLVISFP-----SFRNDMSFVHKFYKGDSWDVDKLRLFLPVNLIYEI 1830

Query: 1588 VRCPIAPDSLDKRVWTRSVDGTVTSKSAYAFIRPSFPSVKWGSWIWSPYIPERRTVVVWR 1767
            +  P      D   WT + +G  ++KSA+  IR        GS IW   IP   +  +WR
Sbjct: 1831 LLIPFDRTQQDVAYWTLTSNGEFSTKSAWETIRQQQSHNTLGSLIWHRSIPLSISFFIWR 1890

Query: 1768 AIFGRLTVMDVHRPKGFIGPTACCLCNS 1851
            A+   + V    + KG    + C  CNS
Sbjct: 1891 ALNNWIPVELRMKGKGIHLASKCVCCNS 1918


>emb|CAN75646.1| hypothetical protein VITISV_031269 [Vitis vinifera]
          Length = 1701

 Score =  305 bits (780), Expect = 6e-80
 Identities = 187/645 (28%), Positives = 314/645 (48%), Gaps = 31/645 (4%)
 Frame = +1

Query: 4    FYQSCWDIISNDVITAVRHFFQSGSLPDGLNSSFMVLIPKSKDANTVDNFRPIVMSNFIF 183
            F+Q  WD+   +++  +  F + G     LN++F+VLIPK   A  + +FRPI +   ++
Sbjct: 950  FWQFYWDVAKEEIMGFLLDFHERGRFVRSLNATFLVLIPKKPSAEDLRDFRPISLVGGLY 1009

Query: 184  KIITKILATRFARIVRRILSPTQFGFIPGRHIHDCIALVSEGFNILHNRSDSNMILKIDI 363
            K++ K+LA R  ++V +++S  Q  F+ GR I D   + +E  + L  R++S ++ K+D+
Sbjct: 1010 KLLAKVLANRLKKVVGKVVSSAQNAFVEGRQILDAALIANEAIDSLLKRNESGVLCKLDL 1069

Query: 364  RKAFDTLSWDFLLYVLQRFGFSEKFVAWISVILNSARISVLLNGSPVGYFACTRGVRQGD 543
             KA+D ++W+FLL+VLQ  GF EK++ WIS  ++ A  SVL+NG+P GYF  +RG+RQGD
Sbjct: 1070 EKAYDHINWNFLLFVLQNMGFGEKWIGWISWCISIATFSVLINGTPEGYFNSSRGLRQGD 1129

Query: 544  PLSPLLFCIAEEVLGKLISHLVSSHLLKPFMAP---RNIVFPSSMLYADDIVILCAATSG 714
            PLSP LF I  E L +LI+  V    L          N    S +L+ DD ++ C A+  
Sbjct: 1130 PLSPYLFVIGMEALSRLINRAVGGGFLSGCRVDGRGGNGALVSHLLFDDDTLVFCEASED 1189

Query: 715  NARYIFDTLGHYASLSGQVFNPVKSKVFFGTGVSHYIRHRIQTIMGLSVGSFPTNYLGVP 894
               ++   L  + ++SG   N  KS++     V +     ++   G  VG  P++YLG+P
Sbjct: 1190 QMVHLSWLLMWFEAISGLRINLDKSEILPVGRVENLENLALEA--GYKVGRLPSSYLGIP 1247

Query: 895  IFKGAPKHGVLRPLFDKIMVKFKRWKGSSLSLAGRVCLVNSIIASSLVHSMLIYKWPIAL 1074
            +        V   + ++   +   WK   +   GR+ L+ S ++S  ++ M + + P  +
Sbjct: 1248 LGANHKSVAVWDGVEERFRKRLALWKRQFIFKGGRITLIRSTLSSMPIYLMSLLRMPRVV 1307

Query: 1075 LKQLEKAMRNFIWTGDINQKGSVVVNWTRCCSPKNEGGLGVRSLISANRAFIMKMGWKL- 1251
              +LEK  R+F+W G   ++   +VNW   C  K +GGLGVR L   NRA + K  W+  
Sbjct: 1308 CLRLEKIQRDFLWGGGALERKPHLVNWDTVCMDKRKGGLGVRRLSILNRALLCKWNWRFA 1367

Query: 1252 LTSDSIVFDTLRRRFLMDNG-----SIRRSFLPSSIWAGLRPTVEDCQTDSRWIPGAHSS 1416
            +  +++    + R+F  + G      +R S+     W  +R      Q    ++ G    
Sbjct: 1368 IERENLWRHVISRKFGEEEGGWSSRDVRESY-GVGFWKEIRKEGALMQKKVAFLVGNGRR 1426

Query: 1417 VNFWTDNWLG-----------YVIA-----------DRIGIPHEFRANFRNPISDFFFDN 1530
            V FW D W G           Y  A           D  G+   + A F  P +D+  + 
Sbjct: 1427 VKFWKDLWWGNVPLCNSFPSLYAFASSKEAWVEEFWDTSGVEGVWSARFSRPFNDWEVEE 1486

Query: 1531 KWHLTMSFVEAYLDIVRDIVRCPIAPDSLDKRVWTRSVDGTVTSKSAYAFIRPSFPSVKW 1710
                    VE  L  +R     P+  DS+   +W  + +G+ + +S Y  +      +  
Sbjct: 1487 --------VERLLLTIRGARLSPLMEDSM---MWKVTSNGSFSVRSLYNDLSSRRAGLFP 1535

Query: 1711 GSWIWSPYIPERRTVVVWRAIFGRLTVMDVHRPKGFIGPTACCLC 1845
               IW+P +P +     W A +G++  MD  + +G+     C LC
Sbjct: 1536 HGLIWNPSVPSKVCFFAWEASWGKVLTMDQFKKRGWAVANRCFLC 1580


>dbj|BAE79384.1| unnamed protein product [Ipomoea batatas]
          Length = 1898

 Score =  303 bits (777), Expect = 1e-79
 Identities = 189/626 (30%), Positives = 301/626 (48%), Gaps = 12/626 (1%)
 Frame = +1

Query: 4    FYQSCWDIISNDVITAVRHFFQSGSLPDGLNSSFMVLIPKSKDANTVDNFRPIVMSNFIF 183
            FYQ  W  +   +   V H F++GS       +FM LIPK     T  +FRPI + N  F
Sbjct: 992  FYQQFWGEVGPAMTDMVNHAFENGSTYISQLQAFMTLIPKKDTPETAADFRPITLPNVSF 1051

Query: 184  KIITKILATRFARIVRRILSPTQFGFIPGRHIHDCIALVSEGFNILHN--RSDSNMILKI 357
            K+I+K+L  R   I+  I+ P Q  F+PGR   D + L  E  + ++N  R    MILK+
Sbjct: 1052 KVISKVLVNRLRPIMSNIIGPHQNSFLPGRSTMDNVILTQEVVHSMNNPRRKKKQMILKV 1111

Query: 358  DIRKAFDTLSWDFLLYVLQRFGFSEKFVAWISVILNSARISVLLNGSPVGYFACTRGVRQ 537
            D++KA+D++SWD+L   L+ FGF  + +  I   L  + +++L NG     F   RG+RQ
Sbjct: 1112 DLQKAYDSVSWDYLEETLEDFGFPRRLIDLILFSLQESSLAILWNGGRPPPFKPGRGLRQ 1171

Query: 538  GDPLSPLLFCIAEEVLGKLISHLVSSHLLKPFMAPRNIVFPSSMLYADDIVILCAATSGN 717
            GDPL P LF +  E L   I   V++   KP    R     S + +ADD+++   A+   
Sbjct: 1172 GDPLVPYLFNLVMERLAHDIQTRVNARTWKPVHITRGGTGISHLFFADDLMLFGEASEHQ 1231

Query: 718  ARYIFDTLGHYASLSGQVFNPVKSKVFFGTGVSHYIRHRIQTIMGLSVGSFPTNYLGVPI 897
            A+ +FD L  ++  SG   N  KS +F  + V+  ++  I +I+ + V      YLG+P+
Sbjct: 1232 AQIMFDCLDSFSDASGLKVNFSKSLLFCSSNVNAGLKRAIGSILQVPVAESLGTYLGIPM 1291

Query: 898  FKGAPKHGVLRPLFDKIMVKFKRWKGSSLSLAGRVCLVNSIIASSLVHSMLIYKWPIALL 1077
             K          + DK+  K   WK SSL++AGR  LV + +A+   ++M +   P++  
Sbjct: 1292 LKERVSRNTFNAVIDKMRTKLSSWKASSLNMAGRRVLVQASLATVPTYTMQVMALPVSTC 1351

Query: 1078 KQLEKAMRNFIWTGDINQKGSVVVNWTRCCSPKNEGGLGVRSLISANRAFIMKMGWKLLT 1257
             +++K  RNF+W  D N +    VNW   C P+NEGGLG+R     NRAF+ KM W++ +
Sbjct: 1352 NEIDKTCRNFLWGHDTNTRKLHSVNWAEICKPRNEGGLGLRMARDFNRAFLTKMAWQIFS 1411

Query: 1258 S-DSIVFDTLRRRFLMDNGSIRRSFLPSSIWAGLRPTVEDCQTDS---RWIPGAHSSVNF 1425
            + D +    LR +++ +   +      +  W G R  ++     +   +W  G    +NF
Sbjct: 1412 NIDKLWVKVLREKYVKNADFLHLQSQSNCSW-GWRSIMKGKDVLAGAIKWNVGNGRKINF 1470

Query: 1426 WTDNWLG----YVIADRIGIPHEFRANFRNPISDFFFDNKWHLTMSFVEAYLDIVRDIVR 1593
            W D W+G        D I  PH       +  +      +W          ++++  +  
Sbjct: 1471 WNDWWVGDGPLASNTDCINQPHMTDIKVEDLTTS---QRRWDTGALHNILPINMIDMVRA 1527

Query: 1594 CPIAPDS--LDKRVWTRSVDGTVTSKSAYAFIRPSFPSVKWGSWIWSPYIPERRTVVVWR 1767
             PIA +S   D   W  S  G VT  SAY+ I       +   WIW     E+  + +W+
Sbjct: 1528 TPIAINSEQEDFPSWPHSTTGMVTVSSAYSLIAGHDGDGRSHDWIWRATCTEKIKLFMWK 1587

Query: 1768 AIFGRLTVMDVHRPKGFIGPTACCLC 1845
             +   L V    + +G     +C +C
Sbjct: 1588 IVKNGLMVNVERKRRGLADAASCPVC 1613


>emb|CAN82037.1| hypothetical protein VITISV_033902 [Vitis vinifera]
          Length = 1109

 Score =  303 bits (777), Expect = 1e-79
 Identities = 185/645 (28%), Positives = 313/645 (48%), Gaps = 31/645 (4%)
 Frame = +1

Query: 4    FYQSCWDIISNDVITAVRHFFQSGSLPDGLNSSFMVLIPKSKDANTVDNFRPIVMSNFIF 183
            F+Q CWD++  +++  +  F + G     LNS+F+VLIPK   A  + +FRPI +   ++
Sbjct: 358  FWQFCWDVVKEEIMGFLLEFHERGRFVRSLNSTFLVLIPKKPGAEDLRDFRPISLVGGLY 417

Query: 184  KIITKILATRFARIVRRILSPTQFGFIPGRHIHDCIALVSEGFNILHNRSDSNMILKIDI 363
            K++ K+LA R  ++V +++S  Q  F+ GR I D   + +E  + L  R++  ++ K+D+
Sbjct: 418  KLLAKVLANRLKKVVGKVVSSAQNAFVEGRQILDAALIANEAIDSLLKRNECGLLCKLDL 477

Query: 364  RKAFDTLSWDFLLYVLQRFGFSEKFVAWISVILNSARISVLLNGSPVGYFACTRGVRQGD 543
             KA+D ++W+FL+ VLQ  GF EK++ WIS  +++A  SVL+NG+P G+F  +RG+RQGD
Sbjct: 478  EKAYDHINWNFLMVVLQSMGFGEKWIGWISWCISTATFSVLINGTPEGFFNSSRGLRQGD 537

Query: 544  PLSPLLFCIAEEVLGKLISHLVSSHLLKPFMAP---RNIVFPSSMLYADDIVILCAATSG 714
            P+SP LF I  E L +LI   V    L          N    S +L+ADD ++ C A+  
Sbjct: 538  PISPYLFVIGMEALSRLIHRAVEGGFLSGCRVDGRGGNGALVSHLLFADDTLVFCEASED 597

Query: 715  NARYIFDTLGHYASLSGQVFNPVKSKVFFGTGVSHYIRHRIQTIMGLSVGSFPTNYLGVP 894
               ++   L  + ++SG   N  KS++     V +     ++   G  VG  P++YLG+P
Sbjct: 598  QMVHLSWLLMWFEAISGLRINLDKSEILPVGRVENLENLALEA--GCKVGXLPSSYLGIP 655

Query: 895  IFKGAPKHGVLRPLFDKIMVKFKRWKGSSLSLAGRVCLVNSIIASSLVHSMLIYKWPIAL 1074
            +        V   + ++   +   WK   +S  GR+ L+ S ++S  ++ M + + P  +
Sbjct: 656  LGANHKSVAVWDGVEERFRKRLALWKRQFISKGGRITLIRSTLSSMPIYLMSLLRMPRVV 715

Query: 1075 LKQLEKAMRNFIWTGDINQKGSVVVNWTRCCSPKNEGGLGVRSLISANRAFIMKMGWKL- 1251
              +LEK  R+F+W G   ++   +VNW   C  K +GGLGVR L   NRA + K  W+  
Sbjct: 716  SLRLEKIQRDFLWGGGALERKPHLVNWDTVCMDKRKGGLGVRRLSILNRALLCKWNWRFA 775

Query: 1252 LTSDSIVFDTLRRRFLMDNG-----SIRRSFLPSSIWAGLRPTVEDCQTDSRWIPGAHSS 1416
            +  ++     + R+F  + G      +R S+     W  +R      Q    ++ G    
Sbjct: 776  IERENFWRHVISRKFGEEEGGWSSREVRESY-GVGFWKEIRKEGALMQNKVAFLVGNGRR 834

Query: 1417 VNFWTDNWLG-----------YVIA-----------DRIGIPHEFRANFRNPISDFFFDN 1530
            V FW D W G           Y  A           D  G+   +   F  P +D+  + 
Sbjct: 835  VKFWKDIWWGNLPLCNSFPSLYAFAXSKEAWVEEFWDTSGVEGAWCPRFSRPFNDWEVEE 894

Query: 1531 KWHLTMSFVEAYLDIVRDIVRCPIAPDSLDKRVWTRSVDGTVTSKSAYAFIRPSFPSVKW 1710
                    VE  L  +R     PI     D+ +W  + + + + KS Y  +      +  
Sbjct: 895  --------VERLLLTIRGARLSPIME---DRMMWKVTSNESFSVKSLYNDLSSRRAGLFP 943

Query: 1711 GSWIWSPYIPERRTVVVWRAIFGRLTVMDVHRPKGFIGPTACCLC 1845
               IW+P +P + +   W A +G++  MD  + +G+     C +C
Sbjct: 944  HGLIWNPSVPSKVSFFAWEAAWGKVLTMDQLKKRGWAVANRCFMC 988


>gb|ABE87589.2| RNA-directed DNA polymerase (Reverse transcriptase); Ribonuclease H;
            Endonuclease/exonuclease/phosphatase [Medicago
            truncatula]
          Length = 1246

 Score =  303 bits (776), Expect = 2e-79
 Identities = 156/305 (51%), Positives = 201/305 (65%), Gaps = 1/305 (0%)
 Frame = +1

Query: 4    FYQSCWDIISNDVITAVRHFFQSGSLPDGLNSSFMVLIPKSKDANTVDNFRPIVMSNFIF 183
            FYQ+ WDI+  DVI +V+ FF SG L   +NS+ +VLIPK   A  + ++RPI ++NF F
Sbjct: 469  FYQTYWDIVGADVIQSVQDFFISGQLAQNINSNLIVLIPKVPGARVMGDYRPIALANFQF 528

Query: 184  KIITKILATRFARIVRRILSPTQFGFIPGRHIHDCIALVSEGFNILHNRS-DSNMILKID 360
            KII+KILA R A I  RI+S  Q GFI  R I  C+ L SE  N+L  R    N+ LK+D
Sbjct: 529  KIISKILADRLADITMRIISVEQRGFIRDRDISKCVILASEAINLLEKRQYGGNVALKVD 588

Query: 361  IRKAFDTLSWDFLLYVLQRFGFSEKFVAWISVILNSARISVLLNGSPVGYFACTRGVRQG 540
            I KAFDTL W+FLL VLQRFGF EKFV WI VIL SAR+SVL+NG  VG+F C+ GVRQG
Sbjct: 589  IAKAFDTLDWNFLLAVLQRFGFDEKFVHWILVILQSARLSVLVNGKAVGFFTCSHGVRQG 648

Query: 541  DPLSPLLFCIAEEVLGKLISHLVSSHLLKPFMAPRNIVFPSSMLYADDIVILCAATSGNA 720
            DPLSPLLFC+ EEVL + +S   +   L P    R + FP+ +LYADD++I C  T  N 
Sbjct: 649  DPLSPLLFCLVEEVLSRALSMAATDGQLIPMSYCRGVSFPTHILYADDVLIFCTGTKRNI 708

Query: 721  RYIFDTLGHYASLSGQVFNPVKSKVFFGTGVSHYIRHRIQTIMGLSVGSFPTNYLGVPIF 900
            R +      Y+ +SGQ+ N  KS+ FF + ++      I +++G +VGS P  YLG PIF
Sbjct: 709  RRLIKIFSQYSEVSGQLINNAKSR-FFTSAMTGSRVQMISSLLGFNVGSLPFTYLGCPIF 767

Query: 901  KGAPK 915
            +G PK
Sbjct: 768  RGKPK 772



 Score =  111 bits (277), Expect = 1e-21
 Identities = 64/185 (34%), Positives = 95/185 (51%), Gaps = 2/185 (1%)
 Frame = +1

Query: 1147 VNWTRCCSPKNEGGLGVRSLISANRAFIMKMGWKLLTSDSIVFDTLRRRFLMDNGSIRRS 1326
            V+W   C P +EGGL ++S    N A ++K+ W LL+S+S     L+RRF      IR  
Sbjct: 776  VSWKILCRPWSEGGLDIKSTRLINNAAMLKLAWNLLSSNSQWAVLLKRRFFSQGQPIRY- 834

Query: 1327 FLPSSIWAGLRPTVEDCQTDSRWIPGAHSSVNFWTDNWLGYVIADRIGIPHEFRANFRNP 1506
            F+ SS+W G++  +   + +  WI G    +N WT+NWLG  +     I   F A+F   
Sbjct: 835  FVKSSVWHGVKNHMSILRQNKLWIVGTGDRINLWTNNWLGEPLVTLFNIDPFFHASFTGK 894

Query: 1507 ISDFFFDNKWHLTMSFVEAYLDIVRDIVRCPIA--PDSLDKRVWTRSVDGTVTSKSAYAF 1680
            +S+   +  W L  S +   +      +  P    PDSL   VWT S DG +TSK A +F
Sbjct: 895  VSEVIVNGNWDLPASLLVPEVTSRLASITLPRTELPDSL---VWTHSADGQLTSKHAVSF 951

Query: 1681 IRPSF 1695
            +R +F
Sbjct: 952  LRNAF 956


>emb|CAN82456.1| hypothetical protein VITISV_010028 [Vitis vinifera]
          Length = 4128

 Score =  302 bits (774), Expect = 3e-79
 Identities = 186/643 (28%), Positives = 329/643 (51%), Gaps = 28/643 (4%)
 Frame = +1

Query: 1    SFYQSCWDIISNDVITAVRHFFQSGSLPDGLNSSFMVLIPKSKDANTVDNFRPIVMSNFI 180
            +F+  CWD++  ++I   R F+  G+    LNS+F++LIPK +    + +FRPI +   +
Sbjct: 2865 AFWLFCWDVVKPEIIGLFREFYLHGTFQRSLNSTFLLLIPKKEGTEDLKDFRPISLVGSV 2924

Query: 181  FKIITKILATRFARIVRRILSPTQFGFIPGRHIHDCIALVSEGFNILHNRSDSNMILKID 360
            +K++ K+LA R   ++  ++S +Q  F+ GR I D + + +E  +     +   ++LK+D
Sbjct: 2925 YKLLAKVLANRLKTVMGEVISDSQHAFVHGRQILDXVLIANEALDSRLKDNIPGLLLKMD 2984

Query: 361  IRKAFDTLSWDFLLYVLQRFGFSEKFVAWISVILNSARISVLLNGSPVGYFACTRGVRQG 540
            I KAFD ++W+FL+ V+ + GF  +++ WI    ++   S+L+NGSP G+F  +RG+RQG
Sbjct: 2985 IEKAFDHVNWNFLMEVMSKMGFGHRWINWIKWCCSTTSFSILINGSPSGFFRSSRGLRQG 3044

Query: 541  DPLSPLLFCIAEEVLGKLISHLVSSHLLKPFMA---PRNIVFPSSMLYADDIVILCAATS 711
            DPLSP LF +A E L +L+S   + + +  F         +  S +L+ADD +I C A +
Sbjct: 3045 DPLSPYLFLLAMEALSQLLSRARNGNFISGFRVGGRGSEGLVVSHLLFADDTLIFCDADA 3104

Query: 712  GNARYIFDTLGHYASLSGQVFNPVKSKVF-FGTGVSHYIRHRIQTIMGLSVGSFPTNYLG 888
               +Y+  T   + ++SG   N  K++    G  +       +  ++G  +GS PT+YLG
Sbjct: 3105 DQLQYLSWTFMWFEAISGLKVNLNKTEAIPVGEDIP---METLAAVLGCKIGSLPTSYLG 3161

Query: 889  VPIFKGAPKHGVLRPLFDKIMVKFKR----WKGSSLSLAGRVCLVNSIIASSLVHSMLIY 1056
            +P+  GAP   +   ++D +  +F++    WK   LS  GR+ L+ S ++S   + + ++
Sbjct: 3162 LPL--GAPYKSI--RVWDAVEERFRKRLSLWKRQYLSKGGRLTLLKSTLSSLPTYFLSLF 3217

Query: 1057 KWPIALLKQLEKAMRNFIWTGDINQKGSVVVNWTRCCSPKNEGGLGVRSLISANRAFIMK 1236
              P  +  +LEK  R+F+W G   +K   +V+W   C+ K +GGLG+RSL + N+A + K
Sbjct: 3218 VIPKRVCARLEKIQRDFLWGGGALEKKPHLVSWKVVCADKKKGGLGIRSLATFNKALLGK 3277

Query: 1237 MGWKLLTSDSIVFD--TLRRRFLMDNG---SIRRSFLPSSIWAGLRPTVEDCQTDSRWIP 1401
              W+    +  ++    L +  L + G      R++    +W  +R   E+ ++ SR+I 
Sbjct: 3278 WLWRFANENEPLWKQIILSKYDLQEGGWCSKDARNWYGVGVWKAIRKGWENFRSHSRFII 3337

Query: 1402 GAHSSVNFWTDNWLG-YVIADRIGIPHEFRANFRNPISDFFFDNK----W------HLTM 1548
            G  + V FW D W G   + +   I      N    +++ + +++    W      HL  
Sbjct: 3338 GDGTKVKFWKDLWCGNQSLKETFPILFNLSVNKEGWVAEAWEEDEGGXSWGLRFNRHLND 3397

Query: 1549 SFVEAYLDIVRDIVRCPIAPDSLDKRVWTRSVDGTVTSKSAYAFI----RPSFPSVKWGS 1716
              V     ++  +    I     D   W  +  GT + KS Y+      +P FP+     
Sbjct: 3398 WEVGEVESLLSKLHPLTIRRGVEDMFRWKENKIGTFSVKSFYSSFSRDSKPPFPA----R 3453

Query: 1717 WIWSPYIPERRTVVVWRAIFGRLTVMDVHRPKGFIGPTACCLC 1845
             IW+P++P R +   W A + RL   D  +  G+  P  C LC
Sbjct: 3454 TIWTPWVPIRASFFGWEAAWNRLLTTDRLKRIGWSIPNRCFLC 3496



 Score =  214 bits (545), Expect = 1e-52
 Identities = 135/449 (30%), Positives = 225/449 (50%), Gaps = 13/449 (2%)
 Frame = +1

Query: 133  ANTVDNFRPIVMSNFIFKIITKILATRFARIVRRILSPTQFGFIPGRHIHDCIALVSEGF 312
            A  + +FRPI +    +K++ K+LA R  + +  ++S  Q  FI  R I D   + +E  
Sbjct: 1215 AKELKDFRPISLVGSFYKLLAKVLANRLKQXIGEVVSEYQHAFIRNRQILDAALIANETV 1274

Query: 313  NILHNRSDSNMILKIDIRKAFDTLSWDFLLYVLQRFGFSEKFVAWISVILNSARISVLLN 492
            +     +   ++LK+DI KAFD ++WD L+ V+ + GF +K++ WIS  +++   S+L+N
Sbjct: 1275 DSRLKVNIPGLLLKLDIEKAFDHVNWDCLVSVMSKMGFGQKWINWISWCISTTNFSILIN 1334

Query: 493  GSPVGYFACTRGVRQGDPLSPLLFCIAEEVLGKLISHLVSSHLLKPFMAPRNIVFPSSML 672
            G+P  +F  TRG+RQGDPLSP LF +  E                               
Sbjct: 1335 GTPSDFFRSTRGLRQGDPLSPYLFLLVME------------------------------- 1363

Query: 673  YADDIVILCAATSGNARYIFDTLGHYASLSGQVFNPVKSKVFFGTGVSHYIRHRIQTIMG 852
                      A SG  RY+   L  + ++SG   N  KS+V    G   Y+ + I +++G
Sbjct: 1364 ----------ADSGQLRYLSWVLLWFEAISGLXVNRDKSEVI-PVGRVDYLEN-IVSVLG 1411

Query: 853  LSVGSFPTNYLGVPIFKGAPKHGVLRPLFDKIMVKFKR----WKGSSLSLAGRVCLVNSI 1020
              +G+ P++YLG+P+  GAP       ++D +  +F++    WK   LS  GR+ L+ S 
Sbjct: 1412 CRIGNLPSSYLGLPL--GAPFKSPR--VWDVVEERFRKCLSLWKRQYLSKGGRLTLIKST 1467

Query: 1021 IASSLVHSMLIYKWPIALLKQLEKAMRNFIWTGDINQKGSVVVNWTRCCSPKNEGGLGVR 1200
            ++S  ++ M ++  P  +  ++EK  R+F+W G   +K   +VNW+  C+   +GGLG+R
Sbjct: 1468 LSSLPIYLMSLFVIPRKVCARIEKIQRDFLWGGGALEKKPHLVNWSAVCTDMRQGGLGIR 1527

Query: 1201 SLISANRAFIMKMGWKLLTSDSIVFDTLRRRFLMDN---------GSIRRSFLPSSIWAG 1353
            SL++ NRA + K  WK     SI  ++L ++ ++D              R      +W  
Sbjct: 1528 SLVALNRALLGKWNWKF----SIERNSLWKQVIIDKYGEEEGGWCSKEVRGAYGVGLWKA 1583

Query: 1354 LRPTVEDCQTDSRWIPGAHSSVNFWTDNW 1440
            +R   E  ++ SR+I G    V FW D W
Sbjct: 1584 IRKDWEIIRSRSRFIVGNGRKVKFWKDLW 1612


>emb|CAN77370.1| hypothetical protein VITISV_033119 [Vitis vinifera]
          Length = 1190

 Score =  301 bits (770), Expect = 8e-79
 Identities = 177/641 (27%), Positives = 320/641 (49%), Gaps = 26/641 (4%)
 Frame = +1

Query: 1    SFYQSCWDIISNDVITAVRHFFQSGSLPDGLNSSFMVLIPKSKDANTVDNFRPIVMSNFI 180
            +F+Q+CWD    +++   +  +   S    LN++F+V+IPK   A  +  FRPI +   +
Sbjct: 414  AFWQACWDFAKEEIVELFQELYDQKSFAKSLNATFLVIIPKKGGAEDLGEFRPISLLGGL 473

Query: 181  FKIITKILATRFARIVRRILSPTQFGFIPGRHIHDCIALVSEGFNILHNRSDSNMILKID 360
            +K++ K+LA R   ++ +++S  Q  F+ GR I D   + +E  +    R +  ++ K+D
Sbjct: 474  YKLMAKVLANRLKMVLDKVVSVDQNAFVRGRQILDASLIANEVVDYWQKRKEKGLVCKLD 533

Query: 361  IRKAFDTLSWDFLLYVLQRFGFSEKFVAWISVILNSARISVLLNGSPVGYFACTRGVRQG 540
            I KA+D++SW FLL VL++ GF  +++ W+    ++A+ SV +NG+P G+F+ ++G+RQG
Sbjct: 534  IEKAYDSISWSFLLKVLKKMGFGSRWMDWMWWCFSTAKFSVFINGAPAGFFSSSKGLRQG 593

Query: 541  DPLSPLLFCIAEEVLGKLISHLVSSHLLKPFM---APRNIVFPSSMLYADDIVILCAATS 711
            DP+SP LF +  EVL  LI   V  + +            +  S +L+ADD +I C A+ 
Sbjct: 594  DPISPYLFILGMEVLSALIRRAVQGNFISGCRLRGRGEEEIMVSHLLFADDTIIFCEASK 653

Query: 712  GNARYIFDTLGHYASLSGQVFNPVKSKVFFGTGVSHYIRHRIQTIMGLSVGSFPTNYLGV 891
                ++   L  + + SG   N  KS++     + +     ++  +G  +GSFP  YLG+
Sbjct: 654  DQLTHLGWILAWFEAASGLRINLAKSELIPVGEIDNVEEMAVE--LGCRIGSFPVKYLGL 711

Query: 892  PIFKGAPKHGVLRPLFDKIMVKFK----RWKGSSLSLAGRVCLVNSIIASSLVHSMLIYK 1059
            P+     +H  L P++D +  + +    RWK   LS  GR+ L+ S + S  ++ M I++
Sbjct: 712  PL---GARHKAL-PMWDGVEERMRRRLARWKRQYLSKGGRITLIKSTLVSIPIYQMSIFR 767

Query: 1060 WPIALLKQLEKAMRNFIWTGDINQKGSVVVNWTRCCSPKNEGGLGVRSLISANRAFIMKM 1239
             P +++K+LEK  R+F+W      +   +VNW   C+ K++GGLG+R + S N+A + K 
Sbjct: 768  MPKSVVKRLEKLQRDFLWGXGNTARKIHLVNWKVXCTQKDKGGLGIRRMGSLNKALLGKW 827

Query: 1240 GWKLLTSDSIVF-DTLRRRFLMDNGSIR----RSFLPSSIWAGLRPTVEDCQTDSRWIPG 1404
             W+      +++   +  +  ++ G  +    R  +   +W  +   +  C  + ++  G
Sbjct: 828  IWRFAVEKDVLWKKVIGVKHGLEGGGWKSKEARGLVGVGVWKEILKEMGWCWNNMKFKVG 887

Query: 1405 AHSSVNFWTDNWLGYVIADRIGIPHEFRANFRNPISDFFFDNK-----WHLT-------- 1545
              + V FWTD+W G     +        A  RN   D  +D +     W+L+        
Sbjct: 888  RGNKVMFWTDHWCGNEALSQAFPQICALAACRNAAVDEVWDPRLGQGGWNLSLVRDSNDW 947

Query: 1546 -MSFVEAYLDIVRDIVRCPIAPDSLDKRVWTRSVDGTVTSKSAYAFIRPSFPSVKWGSWI 1722
             +  +E  L ++RDI    + P+  D  +W      +   + AY  +    P V  G  I
Sbjct: 948  ELGLIEDLLFLLRDI---RVTPEE-DSVLWKGGDSDSFRIRGAYNLLAAPNPLVFPGKNI 1003

Query: 1723 WSPYIPERRTVVVWRAIFGRLTVMDVHRPKGFIGPTACCLC 1845
            W   +P +     W A + ++  +D  +  G+  P  C LC
Sbjct: 1004 WVDMVPSKVAFFAWEATWEKILTLDRLQVHGWQLPNCCFLC 1044


>emb|CAN74986.1| hypothetical protein VITISV_008771 [Vitis vinifera]
          Length = 1971

 Score =  300 bits (768), Expect = 1e-78
 Identities = 182/634 (28%), Positives = 314/634 (49%), Gaps = 20/634 (3%)
 Frame = +1

Query: 4    FYQSCWDIISNDVITAVRHFFQSGSLPDGLNSSFMVLIPKSKDANTVDNFRPIVMSNFIF 183
            F+Q CWD++  +++  +  F + G     LNS+F+VLIPK   A  + +FRPI +   ++
Sbjct: 768  FWQFCWDVVKEEIMGFLLEFHERGRFVRSLNSTFLVLIPKKAGAEDLRDFRPISLVGGLY 827

Query: 184  KIITKILATRFARIVRRILSPTQFGFIPGRHIHDCIALVSEGFNILHNRSDSNMILKIDI 363
            K++ K+LA R  ++V +++S  Q  F+ GR I D   + +E  + L  R++  ++ K+D+
Sbjct: 828  KLLAKVLANRLKKVVGKVVSSAQNAFVEGRQILDAALIANEAIDSLLKRNERGVLCKLDL 887

Query: 364  RKAFDTLSWDFLLYVLQRFGFSEKFVAWISVILNSARISVLLNGSPVGYFACTRGVRQGD 543
             KA+D ++W+FLL+VLQ  GF EK++ WIS  +++A  SVL+NG+P GYF  +RG+RQGD
Sbjct: 888  EKAYDHINWNFLLFVLQSMGFGEKWIGWISWCISTATFSVLINGTPEGYFNSSRGLRQGD 947

Query: 544  PLSPLLFCIAEEVLGKLISHLVSSHLLKPFMAPR---NIVFPSSMLYADDIVILCAATSG 714
            PLSP LF +  E L +LI   V    L          N    S +L+ADD ++ C A+  
Sbjct: 948  PLSPYLFVLGMEALSRLIHRAVGGGFLSGCRVNGRGGNGALVSHLLFADDTLVFCEASED 1007

Query: 715  NARYIFDTLGHYASLSGQVFNPVKSKVFFGTGVSHYIRHRIQTIMGLSVGSFPTNYLGVP 894
               ++   L  + ++SG   N  KS++     V +     ++   G  VG  P++YLG+P
Sbjct: 1008 QMVHLSWLLMWFEAISGLRINLDKSEILPVGRVENLENLALEA--GCKVGRLPSSYLGIP 1065

Query: 895  IFKGAPKHGVLRPLFDKIMVKFKRWKGSSLSLAGRVCLVNSIIASSLVHSMLIYKWPIAL 1074
            +        V   + +K   +   WK   +S  GR+ L+ S ++S  ++ M + + P  +
Sbjct: 1066 LGANHKSVAVWDGVEEKFRKRLALWKRQFISKGGRITLIRSTLSSMPIYLMSLLRIPRVV 1125

Query: 1075 LKQLEKAMRNFIWTGDINQKGSVVVNWTRCCSPKNEGGLGVRSLISANRAFIMKMGWKL- 1251
              +LEK  R+F+W G   ++   +VNW   C  K +GGLGVR L   N A + K   +  
Sbjct: 1126 SLRLEKIQRDFLWGGGALERKPHLVNWDTVCMDKRKGGLGVRRLSILNXALLCKWNXRFA 1185

Query: 1252 LTSDSIVFDTLRRRFLMDNGS-----IRRSFLPSSIWAGLRPTVEDCQTDSRWIPGAHSS 1416
            +  ++     + R+F  + G      +R S+    +W  +R      Q    ++ G    
Sbjct: 1186 IEXENFWRHVISRKFGEEEGGWSSREVRXSY-GVGLWKEIRKEGALMQNKVAFVVGNGRR 1244

Query: 1417 VNFWTDNWLGYV-IADRIGIPHEFRANFRNPISDFFF----DNKWHLTMSF------VEA 1563
            V FW D W G + + +     + F  +    + +++     +  W    S       VE 
Sbjct: 1245 VKFWKDIWWGNLALCNSFPSLYAFAXSKEAWVEEYWDTSXGEGAWSPRFSRPFNDWEVEE 1304

Query: 1564 YLDIVRDIVRCPIAPDSLDKRVWTRSVDGTVTSKSAYAFIRPSFPSVKWGSWIWSPYIPE 1743
               ++  I    + P   D+ +W  + +G  + KS Y  +      +     IW+P +P 
Sbjct: 1305 VERLLLTIRGARLXPLMEDRMMWKANXNGIFSVKSLYNDLFSRRAGJFPHGLIWNPXVPS 1364

Query: 1744 RRTVVVWRAIFGRLTVMDVHRPKGFIGPTACCLC 1845
            + +   W A +G++  MD  + +G+     C LC
Sbjct: 1365 KVSFFAWEASWGKVLTMDQLKKRGWXVANRCFLC 1398

