BLASTX nr result

ID: Rheum21_contig00019259 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00019259
         (1218 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ABW81176.1| non-LTR reverse transcriptase [Arabidopsis cebenn...   187   8e-45
emb|CCA66009.1| hypothetical protein [Beta vulgaris subsp. vulga...   135   4e-31
gb|EOY08302.1| Uncharacterized protein TCM_022640 [Theobroma cacao]   130   6e-28
emb|CAB10338.1| hypothetical protein [Arabidopsis thaliana] gi|7...   124   8e-26
gb|AAC63844.1| putative non-LTR retroelement reverse transcripta...   123   1e-25
emb|CCA65997.1| hypothetical protein [Beta vulgaris subsp. vulga...   118   3e-24
ref|XP_006354967.1| PREDICTED: uncharacterized protein LOC102599...   117   1e-23
gb|AAD37019.2| putative non-LTR retrolelement reverse transcript...    98   7e-22
emb|CCA66054.1| hypothetical protein [Beta vulgaris subsp. vulga...   107   1e-20
emb|CCA66044.1| hypothetical protein [Beta vulgaris subsp. vulga...   103   1e-19
emb|CCA66040.1| hypothetical protein [Beta vulgaris subsp. vulga...   100   2e-18
gb|EOY06958.1| Uncharacterized protein TCM_021520 [Theobroma cacao]    98   6e-18
emb|CCA66036.1| hypothetical protein [Beta vulgaris subsp. vulga...    91   1e-17
dbj|BAE79384.1| unnamed protein product [Ipomoea batatas]              96   3e-17
dbj|BAE79385.1| unnamed protein product [Ipomoea batatas]              95   7e-17
dbj|BAE79382.1| unnamed protein product [Ipomoea batatas]              95   7e-17
gb|AAS55787.1| hypothetical protein [Oryza sativa Japonica Group...    94   9e-17
gb|EOY08210.1| Uncharacterized protein TCM_022554 [Theobroma cacao]    94   1e-16
gb|EOY08834.1| Uncharacterized protein TCM_024073 [Theobroma cacao]    94   1e-16
gb|EMJ21964.1| hypothetical protein PRUPE_ppa026078mg, partial [...    91   1e-15

>gb|ABW81176.1| non-LTR reverse transcriptase [Arabidopsis cebennensis]
          Length = 464

 Score =  187 bits (475), Expect = 8e-45
 Identities = 97/231 (41%), Positives = 129/231 (55%)
 Frame = +1

Query: 361  MNCLIWNCCGAGNDGFRGACRYMLRHHDIDVLVLLEPRISGVRAQRVCDRFSFSNCVRVK 540
            MNCL+WNC GA    FR + RY+L+  + D+L L E    G R QR C    F N  RV 
Sbjct: 1    MNCLLWNCRGANKPNFRCSIRYILKKFNTDILALFETHAGGDREQRSCQGLGFENSFRVD 60

Query: 541  AKGFRGGIWVLWHHSRASLVVENLHRHFIHMSMADCNDWVHFVFVHAPPRVSDRRRLWTG 720
              G  GG+W+LW      + + N    FIH  + +  + +H V V+A P  S R  LW  
Sbjct: 61   VVGQSGGLWLLWKAEVGEVTIVNSSSQFIHARIVNGVEALHLVTVYAAPSASRRSGLWEK 120

Query: 721  LSSNLENITAPLFVGRDFNCILSLSERMGGAGALHADSIEFLQCMNLLGLIDMGFSGTPF 900
            L   ++ I  PL +G DFN I+   ER+GG G L  DS+ F   ++ L LIDMGF G  +
Sbjct: 121  LRDVVQAIDEPLIIGGDFNTIIRTDERIGGNGQLSPDSVAFGNWVSDLFLIDMGFKGNRY 180

Query: 901  TWARGVNSDFYVAKRLDCLFSNAAGRLRWLDARVIHLPRYSSDHSPLLLSL 1053
            TW RG  +  ++AKRLD +F  A  RL+W +A V HL   SSDH+PL + L
Sbjct: 181  TWKRGRAASNFIAKRLDRVFCCAHSRLKWQEAFVKHLAVLSSDHTPLYIQL 231


>emb|CCA66009.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1378

 Score =  135 bits (341), Expect(2) = 4e-31
 Identities = 80/244 (32%), Positives = 127/244 (52%), Gaps = 3/244 (1%)
 Frame = +1

Query: 370  LIWNCCGAGNDGFRGACRYMLRHHDIDVLVLLEPRISGVRAQRVCDRFSFSNCVRVKAKG 549
            ++WN  G G        R ++R ++  VL L+E  ISG +AQR+CDR  FS   RV+A+G
Sbjct: 6    MVWNVQGVGTK--LTILRELMRINNPTVLALVETHISGDQAQRICDRIGFSGQTRVEAEG 63

Query: 550  FRGGIWVLWHHSRASLVVENLHRHFIHMSMADCND--WVHFVFVHAPPRVSDRRRLWTGL 723
            FRGGIW+ W     ++     H   + + +    D  W+ F  ++A P  + R+ LW  L
Sbjct: 64   FRGGIWLFWKSEEVTVTPYGSHSQHLTVEIRRIGDDPWL-FSAIYASPDSTLRKELWREL 122

Query: 724  SSNLENITAPLFVGRDFNCILSLSERMGG-AGALHADSIEFLQCMNLLGLIDMGFSGTPF 900
                   T P  +  DFN   SL ER G  +  +     +F   +    LID+GF+G   
Sbjct: 123  EQIKNQYTGPWLLAGDFNETSSLCERNGSESSEMQRRCKDFANWIENNALIDLGFTGPAH 182

Query: 901  TWARGVNSDFYVAKRLDCLFSNAAGRLRWLDARVIHLPRYSSDHSPLLLSLAPTLVVIGV 1080
            TW+RG++   + + RLD   +N+  +L++ +  V +LP+  SDH P+L+S +    V  +
Sbjct: 183  TWSRGLSPTTFKSARLDRGLANSEWKLKFTEGVVRNLPKSQSDHCPILISTSGFAPVPRI 242

Query: 1081 VDPF 1092
            + PF
Sbjct: 243  IKPF 246



 Score = 27.3 bits (59), Expect(2) = 4e-31
 Identities = 12/39 (30%), Positives = 19/39 (48%)
 Frame = +3

Query: 1098 ETACLTHPHFMSYVEEVWVRDGPASVALLNMKNRLLQWN 1214
            + A L H  F  +V + W  D P    L +  ++L +WN
Sbjct: 249  QAAWLNHQVFCEFVRKNWNADAPIVPFLKSFADKLNKWN 287


>gb|EOY08302.1| Uncharacterized protein TCM_022640 [Theobroma cacao]
          Length = 531

 Score =  130 bits (326), Expect(2) = 6e-28
 Identities = 79/219 (36%), Positives = 116/219 (52%), Gaps = 2/219 (0%)
 Frame = +1

Query: 403  GFRGACRYMLRHHDIDVLVLLEPRISGVRAQRVCDRFSFSNCVRVKAKGFRGGIWVLWHH 582
            GF      + R H   ++VLLE R+SG  A +V     F    RV+  GF  GI VLW  
Sbjct: 48   GFLHFADDLQRIHHFSIMVLLEQRVSGTIADKVIRSVKFDRSHRVEVIGFLEGIQVLWIE 107

Query: 583  SRASLVVENLHRHFIHMSMADC--NDWVHFVFVHAPPRVSDRRRLWTGLSSNLENITAPL 756
                L++ N H   IH+++ D     W+  + V+       +R LW  LSS  +++T P 
Sbjct: 108  HLQILIIRN-HDQCIHLNVRDGFGESWL-LIAVYGHLDHKTKRALWAELSSFAKHVTCPW 165

Query: 757  FVGRDFNCILSLSERMGGAGALHADSIEFLQCMNLLGLIDMGFSGTPFTWARGVNSDFYV 936
             + RDFN  L   E++GG+       + F + ++  GLID+GF G+ +TW RG+     V
Sbjct: 166  LLSRDFNAFLYAHEKVGGSSQGSKFCLYFQRLISAYGLIDLGFKGSKYTWKRGL-----V 220

Query: 937  AKRLDCLFSNAAGRLRWLDARVIHLPRYSSDHSPLLLSL 1053
            ++R+D    N   RL++ +A V HLPR  SDH PLL+SL
Sbjct: 221  SERIDWAICNTDWRLKFHEATVQHLPRVKSDHRPLLISL 259



 Score = 22.3 bits (46), Expect(2) = 6e-28
 Identities = 10/39 (25%), Positives = 16/39 (41%)
 Frame = +3

Query: 1098 ETACLTHPHFMSYVEEVWVRDGPASVALLNMKNRLLQWN 1214
            + A L+H  F  +V++ W        AL    +    WN
Sbjct: 274  QAAWLSHSKFSDFVKQNWDSSSDIQGALKKFSDSAHVWN 312


>emb|CAB10338.1| hypothetical protein [Arabidopsis thaliana]
           gi|7268308|emb|CAB78602.1| hypothetical protein
           [Arabidopsis thaliana]
          Length = 655

 Score =  124 bits (311), Expect = 8e-26
 Identities = 62/153 (40%), Positives = 84/153 (54%)
 Frame = +1

Query: 343 ILN*FLMNCLIWNCCGAGNDGFRGACRYMLRHHDIDVLVLLEPRISGVRAQRVCDRFSFS 522
           +++ F+MNCL+WNC G     FR + RY+L+  D DVL L E    G RA R+C R  F 
Sbjct: 502 VIHIFMMNCLLWNCRGPNKPIFRRSIRYVLKKFDTDVLALFETHAGGDRAGRICQRLGFE 561

Query: 523 NCVRVKAKGFRGGIWVLWHHSRASLVVENLHRHFIHMSMADCNDWVHFVFVHAPPRVSDR 702
           N  RV A G   G+W+LW  S  ++ + +    FIH  +    + VH V V+A P VS R
Sbjct: 562 NQFRVDAVGQSSGLWLLWRSSVGNVEIVSSTSQFIHAKILTGLESVHLVVVYAAPSVSRR 621

Query: 703 RRLWTGLSSNLENITAPLFVGRDFNCILSLSER 801
             LW  L   +  +  PL +G DFN IL + ER
Sbjct: 622 SGLWNELREAVSGLDGPLIIGGDFNTILRVDER 654


>gb|AAC63844.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1231

 Score =  123 bits (309), Expect = 1e-25
 Identities = 64/134 (47%), Positives = 80/134 (59%)
 Frame = +1

Query: 658  VHFVFVHAPPRVSDRRRLWTGLSSNLENITAPLFVGRDFNCILSLSERMGGAGALHADSI 837
            +H + V+A P VS R  LW  L   +  +  PL +G DFN IL + ERMGG G L  DS+
Sbjct: 1    MHLIVVYAAPSVSRRSGLWGELKDVVNGLEGPLLIGGDFNTILWVDERMGGNGRLSPDSL 60

Query: 838  EFLQCMNLLGLIDMGFSGTPFTWARGVNSDFYVAKRLDCLFSNAAGRLRWLDARVIHLPR 1017
             F   +N L LID+GF G  FTW RG      VAKRLD +F  A  RL+W +A V HLP 
Sbjct: 61   AFGDWINELSLIDLGFKGNKFTWRRGRQESTVVAKRLDRVFVCAHARLKWQEAVVSHLPF 120

Query: 1018 YSSDHSPLLLSLAP 1059
             +SDH+PL + L P
Sbjct: 121  MASDHAPLYVQLEP 134


>emb|CCA65997.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1363

 Score =  118 bits (296), Expect(2) = 3e-24
 Identities = 74/232 (31%), Positives = 111/232 (47%), Gaps = 3/232 (1%)
 Frame = +1

Query: 361  MNCLIWNCCGAGNDGFRGACRYMLRHHDIDVLVLLEPRISGVRAQRVCDRFSFSNCVRVK 540
            M  +IWN  GA +  F      +++ H  D+L+LLE + S +RA +   R  + N   + 
Sbjct: 1    MKAIIWNVRGANSKAFLWHALDLVKMHKPDLLILLETKCSSLRADQATKRLGYVNFRIIP 60

Query: 541  AKGFRGGIWVLWHHSRASLVVENL---HRHFIHMSMADCNDWVHFVFVHAPPRVSDRRRL 711
            A G RGGIW++W    A +   +    H H +    +D  + V    +HAP  VS+R + 
Sbjct: 61   AFGKRGGIWLMWKADIALVHYADYQPNHFHALFKLRSDIPE-VLLTGMHAPSVVSERNKY 119

Query: 712  WTGLSSNLENITAPLFVGRDFNCILSLSERMGGAGALHADSIEFLQCMNLLGLIDMGFSG 891
            W  L+ +      P  V  D N +L  +E+MGG         +    +    L+D+GF G
Sbjct: 120  WVDLTEDSPPRGTPWLVAGDMNEVLHGNEKMGGRQVGKEQGKQCKDWIAANALLDLGFQG 179

Query: 892  TPFTWARGVNSDFYVAKRLDCLFSNAAGRLRWLDARVIHLPRYSSDHSPLLL 1047
              FTW  G      + +RLD    N+     + D +VIHLPR  SDH PLL+
Sbjct: 180  PKFTWTNGRTGGSLIKERLDRALVNSEWLDLFPDTKVIHLPRTFSDHCPLLI 231



 Score = 21.2 bits (43), Expect(2) = 3e-24
 Identities = 6/12 (50%), Positives = 8/12 (66%)
 Frame = +3

Query: 1116 HPHFMSYVEEVW 1151
            HP F + +EE W
Sbjct: 253  HPDFTNVIEETW 264


>ref|XP_006354967.1| PREDICTED: uncharacterized protein LOC102599840 [Solanum tuberosum]
          Length = 288

 Score =  117 bits (292), Expect = 1e-23
 Identities = 68/243 (27%), Positives = 124/243 (51%), Gaps = 3/243 (1%)
 Frame = +1

Query: 358  LMNCLIWNCCGAGNDGFRGACRYMLRHHDIDVLVLLEPRISGVRAQRVCDRFSFSNCVRV 537
            LM   +WNC GA N  F    R ++  H+  +L L E R+  +   ++     +++ ++V
Sbjct: 30   LMKIFLWNCRGANNAKFMNNIRALIDSHNPTILALTETRMEDL--DKILQALDYTDVIQV 87

Query: 538  KAKGFRGGIWVLWHHSRASLVVENLHRHFIHMSMADCNDWVHFVF--VHAPPRVSDRRRL 711
             A G+ GGI +LW +S  ++    +    IH+++   + +  F F  ++A      R+ L
Sbjct: 88   PAFGYSGGIALLWRNSEINVEPFVITEQEIHVTIKVSSSFPKFFFSVIYAKNTYGLRKIL 147

Query: 712  WTGLSSNLENITAPLFVGRDFNCILSLSERMGGAGALHADSIEFLQCMNLLGLIDMGFSG 891
            W  L +    I  P  V  DFN + + SE++GG    +    +F+ C++ + +ID+GF+G
Sbjct: 148  WENLKNLTARIKGPWLVCGDFNEVTNASEKLGGRPINNTKCAKFISCLDDMDMIDLGFTG 207

Query: 892  TPFTWA-RGVNSDFYVAKRLDCLFSNAAGRLRWLDARVIHLPRYSSDHSPLLLSLAPTLV 1068
              +TW+ +  N++  + +R+D   SN +    + D+ V HLPR  SDH    L++    +
Sbjct: 208  QKYTWSNKHKNNNTLIMERIDRFLSNHSWLNLFPDSHVHHLPRTHSDHVLFFLTVKEAPI 267

Query: 1069 VIG 1077
              G
Sbjct: 268  TQG 270


>gb|AAD37019.2| putative non-LTR retrolelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 855

 Score = 97.8 bits (242), Expect(2) = 7e-22
 Identities = 56/175 (32%), Positives = 80/175 (45%)
 Frame = +1

Query: 535  VKAKGFRGGIWVLWHHSRASLVVENLHRHFIHMSMADCNDWVHFVFVHAPPRVSDRRRLW 714
            V A+G  GGIW+LW      + +      FIH  + +    +H + V+A P VS R  LW
Sbjct: 508  VDARGQSGGIWLLWKSEVGDVSIVESAEQFIHAKVGNGLAAIHLLAVYAAPSVSRRSGLW 567

Query: 715  TGLSSNLENITAPLFVGRDFNCILSLSERMGGAGALHADSIEFLQCMNLLGLIDMGFSGT 894
            + LS  ++++  P+ VG                                    DMGF G 
Sbjct: 568  SLLSRIVQSVDEPIIVG------------------------------------DMGFKGN 591

Query: 895  PFTWARGVNSDFYVAKRLDCLFSNAAGRLRWLDARVIHLPRYSSDHSPLLLSLAP 1059
             FTW RG     +VAKRLD +      RL+W +A V HLP ++SDH+P+ + L P
Sbjct: 592  KFTWKRGRVESTFVAKRLDRVLCRPQTRLKWQEASVTHLPFFASDHAPIYIQLEP 646



 Score = 34.3 bits (77), Expect(2) = 7e-22
 Identities = 15/39 (38%), Positives = 22/39 (56%)
 Frame = +3

Query: 1098 ETACLTHPHFMSYVEEVWVRDGPASVALLNMKNRLLQWN 1214
            E A LTH  F   ++  W  +G   VAL  +K++L +WN
Sbjct: 660  EAAWLTHSGFKDLLQASWNTEGETPVALAALKSKLKKWN 698


>emb|CCA66054.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1355

 Score =  107 bits (266), Expect = 1e-20
 Identities = 73/234 (31%), Positives = 112/234 (47%), Gaps = 5/234 (2%)
 Frame = +1

Query: 361  MNCLIWNCCGAGNDGFRGACRYMLRHHDIDVLVLLEPRISGVRAQRVCDRFSFSNCVRVK 540
            MN L WNC G GN       R        D++ + E  I+ +  + +     FSN   V 
Sbjct: 1    MNILCWNCRGLGNPWSVRQLRSWSNQFAPDIIFVSETMINKIEVEALKSWLGFSNAFGVA 60

Query: 541  AKGFRGGIWVLWHHSRASLVVENLHRHFIHMSMADCNDWVHFVFVHAPPRVSDRRRLWTG 720
            + G  GG+ + W       +V +  +H I   + D N    FV V+   +  ++   W+ 
Sbjct: 61   SVGRAGGLCLYWKEEVMFSLV-SFSQHHICGDVEDGNKKWRFVGVYGWAKEEEKHLTWSL 119

Query: 721  LSSNLENITAPLFVGRDFNCILSLSERMGGAGALHADSIEFLQCMNLLGLIDMGFSGTPF 900
            L    E+ + P+ +G DFN ILS +E+ GGA  +  + I F   ++ L L D+G+ GT +
Sbjct: 120  LRHLCEDTSLPILLGGDFNEILSAAEKEGGANRVRREMINFRDTLDTLALRDLGYVGTWY 179

Query: 901  TWARGVNSDFYVAKRLD-CLFSNAAGRLRWLDARVIHLP----RYSSDHSPLLL 1047
            TW RG +    + +RLD  L SN+     WLD     +P    RY SDHS ++L
Sbjct: 180  TWERGRSPSTCIRERLDRYLCSNS-----WLDLYPDSVPEHTIRYKSDHSAIVL 228


>emb|CCA66044.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1355

 Score =  103 bits (258), Expect = 1e-19
 Identities = 63/229 (27%), Positives = 105/229 (45%)
 Frame = +1

Query: 361  MNCLIWNCCGAGNDGFRGACRYMLRHHDIDVLVLLEPRISGVRAQRVCDRFSFSNCVRVK 540
            MN L WNC G GN       R     +  D++ L E  I+   ++ +  R  F+N   V 
Sbjct: 1    MNILCWNCRGVGNPRTVRQLRKWSTFYAPDIMFLSETMINKTESEALKSRLGFANAFGVS 60

Query: 541  AKGFRGGIWVLWHHSRASLVVENLHRHFIHMSMADCNDWVHFVFVHAPPRVSDRRRLWTG 720
            ++G  GG+ V W    +  +V     H           W  FV ++   +  ++   W+ 
Sbjct: 61   SRGRAGGLCVFWREELSFSLVSFSQHHICGDIDDGAKKW-RFVGIYGWAKEEEKHHTWSL 119

Query: 721  LSSNLENITAPLFVGRDFNCILSLSERMGGAGALHADSIEFLQCMNLLGLIDMGFSGTPF 900
            +    E+++ P+ +G DFN I+S  E+ GGA  +     +F + M+ L L D+G++G   
Sbjct: 120  MRFLCEDLSRPILMGGDFNEIMSYEEKEGGADRVRRGMYQFRETMDDLFLRDLGYNGVWH 179

Query: 901  TWARGVNSDFYVAKRLDCLFSNAAGRLRWLDARVIHLPRYSSDHSPLLL 1047
            TW RG +    + +RLD    + +    + +  V H  RY SDH  + L
Sbjct: 180  TWERGNSLSTCIRERLDRFVCSPSWATMYPNTIVDHSMRYKSDHLAICL 228


>emb|CCA66040.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1362

 Score =  100 bits (248), Expect = 2e-18
 Identities = 64/232 (27%), Positives = 109/232 (46%), Gaps = 3/232 (1%)
 Frame = +1

Query: 361  MNCLIWNCCGAGNDGFRGACRYMLRHHDIDVLVLLEPRISGVRAQRVCDRFSFSNCVRVK 540
            M  L WNC G  N     A   +      +++ ++E  +     +++  R  F N + + 
Sbjct: 1    MKLLSWNCQGLANPWTVNALHSLCWRDRPNIVFVMETMVDSQVLEKIRKRCGFMNGLCLS 60

Query: 541  AKGFRGGIWVLWHHSRASLVVENLHRHFIHMSMADCND---WVHFVFVHAPPRVSDRRRL 711
            + G  GG+ + W+     + VE+   H IH  + D N    W + + ++  P  S++   
Sbjct: 61   SNGNSGGMGLWWNEM--DVTVESFSAHHIHAVVLDENKNPIW-NAMGIYGWPETSNKHLT 117

Query: 712  WTGLSSNLENITAPLFVGRDFNCILSLSERMGGAGALHADSIEFLQCMNLLGLIDMGFSG 891
            W+ L    +  + P+    DFN I S+ E+ GGA         F + ++   + D+G+ G
Sbjct: 118  WSLLRRLKQQCSLPVLFFGDFNEITSIEEKEGGAPRCERVMDAFREVIDDCAVKDLGYVG 177

Query: 892  TPFTWARGVNSDFYVAKRLDCLFSNAAGRLRWLDARVIHLPRYSSDHSPLLL 1047
              FTW RG +    + +RLD + +N      +    V+HLPRY SDH+PLLL
Sbjct: 178  NRFTWQRGNSPSTLIRERLDRMLANDEWCDNFPSWEVVHLPRYRSDHAPLLL 229


>gb|EOY06958.1| Uncharacterized protein TCM_021520 [Theobroma cacao]
          Length = 754

 Score = 98.2 bits (243), Expect = 6e-18
 Identities = 72/240 (30%), Positives = 111/240 (46%), Gaps = 5/240 (2%)
 Frame = +1

Query: 358  LMNCLIWNCCGAGNDGFRGACRYMLRHHDIDVLVLLEPRISGVRAQRVCDRFSFSNCVRV 537
            ++NCL+WN  G      +   + +   H + +LV+LEP ++  R   +  R  F N +  
Sbjct: 201  MINCLLWNVRGIAGTAVQRRLKKLKLMHKVKLLVVLEPMVNTSRINYIKRRLGFDNAL-- 258

Query: 538  KAKGFRGGIWVLWHHSRA-SLVVENLHRHFIHMSMADCNDWVHFVFVHAPPRVSDRRRLW 714
                    IW+   +     +V++ +    + +S       V+  FV+A     +RR LW
Sbjct: 259  --SNCSHKIWLFCSNEICCEVVLDQIQCLHVKLSSPWLPHPVYTSFVYAKCTRLERRELW 316

Query: 715  TGLSSNLENITAPLFVGRDFNCILSLSERMGGA----GALHADSIEFLQCMNLLGLIDMG 882
            + L    +++ AP  VG DFN I+S  ER+ GA    G++   S   L C    GL+D G
Sbjct: 317  SNLRIISDSMQAPWLVGGDFNSIVSCDERLHGAIPHDGSMEDLSSTLLDC----GLLDAG 372

Query: 883  FSGTPFTWARGVNSDFYVAKRLDCLFSNAAGRLRWLDARVIHLPRYSSDHSPLLLSLAPT 1062
            F G  FTW         + +RLD +  N      +   RV HL R  SDH PLL+S + T
Sbjct: 373  FEGNSFTWTNN-----RMFQRLDRVVYNHEWAEFFSSTRVQHLNRDGSDHCPLLISCSNT 427


>emb|CCA66036.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1369

 Score = 91.3 bits (225), Expect(2) = 1e-17
 Identities = 60/232 (25%), Positives = 100/232 (43%), Gaps = 4/232 (1%)
 Frame = +1

Query: 370  LIWNCCGAGNDGFRGACRYMLRHHDIDVLVLLEPRISGVRAQRVCDRFSFSNCVRVKAKG 549
            L WNC G G+     A R +L   +  ++ L E ++     + V  +  + + V V  +G
Sbjct: 5    LSWNCRGMGSPSALSALRRLLASENPQIVFLSETKLKSYEMESVKKKLKWEHMVAVDCEG 64

Query: 550  F----RGGIWVLWHHSRASLVVENLHRHFIHMSMADCNDWVHFVFVHAPPRVSDRRRLWT 717
                 RGG+ +LW       V+     H   +   +      F  ++  P    + +   
Sbjct: 65   ECRKRRGGLAMLWRSEIKVQVMSMSSNHIDIVVGEEAQGEWRFTGIYGYPEEEHKDKTGA 124

Query: 718  GLSSNLENITAPLFVGRDFNCILSLSERMGGAGALHADSIEFLQCMNLLGLIDMGFSGTP 897
             LS+       P   G DFN +L  SE+ GG G    ++  F   M     +D+GF G  
Sbjct: 125  LLSALARASRRPWLCGGDFNLMLVASEKKGGDGFNSREADIFRNAMEECHFMDLGFVGYE 184

Query: 898  FTWARGVNSDFYVAKRLDCLFSNAAGRLRWLDARVIHLPRYSSDHSPLLLSL 1053
            FTW      D  + +RLD   +N   ++++  + V HLP+  SDH P++ S+
Sbjct: 185  FTWTNNRGGDANIQERLDRFVANDLWKIKFPGSFVSHLPKRKSDHVPIVASV 236



 Score = 26.6 bits (57), Expect(2) = 1e-17
 Identities = 10/26 (38%), Positives = 15/26 (57%)
 Frame = +3

Query: 1137 VEEVWVRDGPASVALLNMKNRLLQWN 1214
            V+E W+R   A + L    N+LL W+
Sbjct: 268  VKETWMRGTDAGINLARTANKLLSWS 293


>dbj|BAE79384.1| unnamed protein product [Ipomoea batatas]
          Length = 1898

 Score = 95.9 bits (237), Expect = 3e-17
 Identities = 68/231 (29%), Positives = 101/231 (43%), Gaps = 1/231 (0%)
 Frame = +1

Query: 355  FLMNCLIWNCCGAGNDGFRGACRYMLRHHDIDVLVLLEPRISGVRAQ-RVCDRFSFSNCV 531
            FLM  L WNC G  N   R   + +L     D L LLE R S       +  R   +N  
Sbjct: 531  FLMRILSWNCRGIANSRVRRFVKDLLSTTKADALCLLEIRSSKAEKMIALASRLGLTNHF 590

Query: 532  RVKAKGFRGGIWVLWHHSRASLVVENLHRHFIHMSMADCNDWVHFVFVHAPPRVSDRRRL 711
             V   GF GG+ +LW  +  +L V + +   IH   +         F +  P    +   
Sbjct: 591  IVNPLGFAGGLLLLWKPA-LNLSVISHNSQAIHTLASHRLGNCFITFAYIRPNTFAKCGF 649

Query: 712  WTGLSSNLENITAPLFVGRDFNCILSLSERMGGAGALHADSIEFLQCMNLLGLIDMGFSG 891
            W        +I +P  V  D N I +  E+ G +   +     F+   +  GL+D G SG
Sbjct: 650  WEYCKQLANSIQSPWMVVGDLNDIATSDEQWGSSSLNYTSLQNFVDAYSDCGLLDPGSSG 709

Query: 892  TPFTWARGVNSDFYVAKRLDCLFSNAAGRLRWLDARVIHLPRYSSDHSPLL 1044
              FTW R + +     +RLD +  N + +L + +A+V  LPR  SDH+P+L
Sbjct: 710  PNFTWCRFIGNRVVQRRRLDRVLWNVSAQLTFPEAKVSVLPRLCSDHNPIL 760


>dbj|BAE79385.1| unnamed protein product [Ipomoea batatas]
          Length = 1366

 Score = 94.7 bits (234), Expect = 7e-17
 Identities = 67/229 (29%), Positives = 100/229 (43%), Gaps = 1/229 (0%)
 Frame = +1

Query: 361  MNCLIWNCCGAGNDGFRGACRYMLRHHDIDVLVLLEPRISGVRAQ-RVCDRFSFSNCVRV 537
            M  L WNC G  N   R   + +L     D L LLE R S       +  R   +N   V
Sbjct: 1    MRILSWNCRGIANSRVRRFVKDLLSTTKADALCLLEIRSSKAEKMIALASRLGLTNHFIV 60

Query: 538  KAKGFRGGIWVLWHHSRASLVVENLHRHFIHMSMADCNDWVHFVFVHAPPRVSDRRRLWT 717
               GF GG+ +LW  +  +L V + +   IH   +         F +  P    + R W 
Sbjct: 61   NPLGFAGGLLLLWKPA-LNLSVISHNSQAIHTLASHRLGNCFITFAYIRPNTFAKCRFWE 119

Query: 718  GLSSNLENITAPLFVGRDFNCILSLSERMGGAGALHADSIEFLQCMNLLGLIDMGFSGTP 897
                   +I +P  V  D N I +  E+ G +   +     F+   +  GL+D G SG  
Sbjct: 120  YCKQLANSIQSPWMVVGDLNDIATSDEQWGSSSLNYTSLQNFVDAYSDCGLLDPGSSGPN 179

Query: 898  FTWARGVNSDFYVAKRLDCLFSNAAGRLRWLDARVIHLPRYSSDHSPLL 1044
            FTW R + +     +RLD +  N + +L + +A+V  LPR  SDH+P+L
Sbjct: 180  FTWCRFIGNRVVQRRRLDRVLWNVSAQLTFPEAKVSVLPRLCSDHNPIL 228


>dbj|BAE79382.1| unnamed protein product [Ipomoea batatas]
          Length = 1366

 Score = 94.7 bits (234), Expect = 7e-17
 Identities = 67/229 (29%), Positives = 100/229 (43%), Gaps = 1/229 (0%)
 Frame = +1

Query: 361  MNCLIWNCCGAGNDGFRGACRYMLRHHDIDVLVLLEPRISGVRAQ-RVCDRFSFSNCVRV 537
            M  L WNC G  N   R   + +L     D L LLE R S       +  R   +N   V
Sbjct: 1    MRILSWNCRGIANSRVRRFVKDLLSTTKADALCLLEIRSSKAEKMIALASRLGLTNHFIV 60

Query: 538  KAKGFRGGIWVLWHHSRASLVVENLHRHFIHMSMADCNDWVHFVFVHAPPRVSDRRRLWT 717
               GF GG+ +LW  +  +L V + +   IH   +         F +  P    + R W 
Sbjct: 61   NPLGFAGGLLLLWKPA-LNLSVISHNSQAIHTLASHRLGNCFITFAYIRPNTFAKCRFWE 119

Query: 718  GLSSNLENITAPLFVGRDFNCILSLSERMGGAGALHADSIEFLQCMNLLGLIDMGFSGTP 897
                   +I +P  V  D N I +  E+ G +   +     F+   +  GL+D G SG  
Sbjct: 120  YCKQLANSIQSPWMVVGDLNDIATSDEQWGSSSLNYTSLQNFVDAYSDCGLLDPGSSGPN 179

Query: 898  FTWARGVNSDFYVAKRLDCLFSNAAGRLRWLDARVIHLPRYSSDHSPLL 1044
            FTW R + +     +RLD +  N + +L + +A+V  LPR  SDH+P+L
Sbjct: 180  FTWCRFIGNRVVQRRRLDRVLWNVSAQLTFPEAKVSVLPRLCSDHNPIL 228


>gb|AAS55787.1| hypothetical protein [Oryza sativa Japonica Group]
            gi|54291856|gb|AAV32224.1| hypothetical protein [Oryza
            sativa Japonica Group]
          Length = 1936

 Score = 94.4 bits (233), Expect = 9e-17
 Identities = 67/231 (29%), Positives = 109/231 (47%), Gaps = 2/231 (0%)
 Frame = +1

Query: 361  MNCLIWNCCGAGNDGFRGACRYMLRHHDIDVLVLLEPRISGVRAQRVCDRFSFSNCVRVK 540
            M+CL WNC G GN       R +++     ++ L E R S  +  R+  + +F   V V 
Sbjct: 636  MSCLAWNCRGLGNTATVQDLRALIQKAGSQLVFLCETRQSVEKMSRLRRKLAFRGFVGVS 695

Query: 541  AKGFRGGIWVLWHHSRASLVVENLHRHFI--HMSMADCNDWVHFVFVHAPPRVSDRRRLW 714
            ++G  GG+ + W  S  S+ V+++++ +I  ++ ++      H  FV+  PRV +R R+W
Sbjct: 696  SEGKSGGLALYWDES-VSVDVKDINKRYIDAYVRLSPDEPQWHITFVYGEPRVENRHRMW 754

Query: 715  TGLSSNLENITAPLFVGRDFNCILSLSERMGGAGALHADSIEFLQCMNLLGLIDMGFSGT 894
            + L +  ++   P  V  DFN  L   E              F   +    L D+GF G 
Sbjct: 755  SLLRTIRQSSALPWMVIGDFNETLWQFEHFSKNPRCETQMQNFRDALYDCDLQDLGFKGV 814

Query: 895  PFTWARGVNSDFYVAKRLDCLFSNAAGRLRWLDARVIHLPRYSSDHSPLLL 1047
            P T+    +    V  RLD   ++   R  + +A+V HL    SDHSP+LL
Sbjct: 815  PHTYDNRRDGWRNVKVRLDRAVADDKWRDLFPEAQVSHLVSPCSDHSPILL 865


>gb|EOY08210.1| Uncharacterized protein TCM_022554 [Theobroma cacao]
          Length = 669

 Score = 94.0 bits (232), Expect = 1e-16
 Identities = 52/147 (35%), Positives = 79/147 (53%), Gaps = 1/147 (0%)
 Frame = +1

Query: 376 WNCCGAGNDGFRGACRYMLRHHDIDVLVLLEPRISGVRAQRVCDRFSFSNCVRVKAKGFR 555
           WNC GA +  F    + +++ + I++L+LLEPRISG  A     +  F    RV+  GF 
Sbjct: 247 WNCHGASDKKFIRVVKDLVKSYAINMLILLEPRISGRLADGTIKQLGFDYSHRVEFVGFS 306

Query: 556 GGIWVLWHHSRASLVVENLHRHFIHMSMAD-CNDWVHFVFVHAPPRVSDRRRLWTGLSSN 732
           GGIW LW  +    +++N H   +HM++ D  N++  F  V+  P  + R +LW  LSS 
Sbjct: 307 GGIWCLWKENVKLHIIKN-HNQCVHMTIEDKPNEFWFFTTVYGNPSPNIRCQLWEELSSF 365

Query: 733 LENITAPLFVGRDFNCILSLSERMGGA 813
              +T P  +  DFN  L   E+ GG+
Sbjct: 366 ENTVTGPWLLANDFNAFLYSHEKAGGS 392


>gb|EOY08834.1| Uncharacterized protein TCM_024073 [Theobroma cacao]
          Length = 660

 Score = 93.6 bits (231), Expect = 1e-16
 Identities = 55/184 (29%), Positives = 82/184 (44%)
 Frame = +1

Query: 496  RVCDRFSFSNCVRVKAKGFRGGIWVLWHHSRASLVVENLHRHFIHMSMADCNDWVHFVFV 675
            ++C ++ F N  +VKA G+ GGIWV W+     + V        H+ +    +      +
Sbjct: 184  KMCCKYGFQNYFKVKANGYSGGIWVFWNAEVIEVEVLAYSSQLTHLLLNPSKEQWLLTEI 243

Query: 676  HAPPRVSDRRRLWTGLSSNLENITAPLFVGRDFNCILSLSERMGGAGALHADSIEFLQCM 855
            +  P V +R+ LW  L     +   P  V  DFN I+S  E+ G          + L CM
Sbjct: 244  YGSPLVKERKHLWDSLKLASNDQDIPWMVIGDFNQIISPDEKHGRNSVNLTQCNQLLNCM 303

Query: 856  NLLGLIDMGFSGTPFTWARGVNSDFYVAKRLDCLFSNAAGRLRWLDARVIHLPRYSSDHS 1035
            +   L D   SG  FTW        Y   RLD +F N    + + +   I+LPR  SDH 
Sbjct: 304  SYCNLYDFEASGFKFTWWNKKEGLDYTQVRLDRVFVNDRWHVMFPNVVAINLPRTHSDHH 363

Query: 1036 PLLL 1047
            P+L+
Sbjct: 364  PVLV 367


>gb|EMJ21964.1| hypothetical protein PRUPE_ppa026078mg, partial [Prunus persica]
          Length = 400

 Score = 90.9 bits (224), Expect = 1e-15
 Identities = 56/201 (27%), Positives = 100/201 (49%), Gaps = 2/201 (0%)
 Frame = +1

Query: 454  LVLLEPRISGVRAQRVCDRFSFSNCVRVKAKGFRGGIWVLWHHSRASLVVENLHRHFIHM 633
            L+ ++ + S VRA+ +  R  F     +   G  GGI +LW+ S  ++ + + H  FIH 
Sbjct: 133  LISVDLQASRVRAEVIAQRLGFDGFFCIPGLGQCGGIILLWNPSFINITILDYHERFIHY 192

Query: 634  SMADCNDWVHF--VFVHAPPRVSDRRRLWTGLSSNLENITAPLFVGRDFNCILSLSERMG 807
             + D  D  ++   FV+A P+   +++LW  +       +    +  DFN + + SE++G
Sbjct: 193  QVQDIIDHKNWKATFVYAYPQKHKQKQLWIDILGLKPTASEAWILMGDFNNVCTPSEKLG 252

Query: 808  GAGALHADSIEFLQCMNLLGLIDMGFSGTPFTWARGVNSDFYVAKRLDCLFSNAAGRLRW 987
            G+ +L +   +F   +N    I +  +G PFTW  G   +  + +RLD +  N      +
Sbjct: 253  GSISLPSAMADFNGFINDSETISLNAAGIPFTWCNGHRDNSVIYERLDRVLLNPNWLNLY 312

Query: 988  LDARVIHLPRYSSDHSPLLLS 1050
             +  + +LP   SDH P+LLS
Sbjct: 313  PNCAIQNLPILRSDHGPILLS 333