BLASTX nr result

ID: Sinomenium22_contig00035351 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00035351
         (975 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002527444.1| protein dimerization, putative [Ricinus comm...   369   1e-99
ref|XP_003632266.1| PREDICTED: uncharacterized protein LOC100854...   368   2e-99
emb|CAN70085.1| hypothetical protein VITISV_003006 [Vitis vinifera]   367   3e-99
ref|XP_006484968.1| PREDICTED: uncharacterized protein LOC102615...   353   8e-95
ref|XP_006424350.1| hypothetical protein CICLE_v10028008mg [Citr...   352   1e-94
ref|XP_007014534.1| Uncharacterized protein TCM_039722 [Theobrom...   140   1e-30
ref|XP_007144620.1| hypothetical protein PHAVU_007G170700g [Phas...   119   1e-24
ref|XP_007132504.1| hypothetical protein PHAVU_011G099800g [Phas...   117   7e-24
ref|XP_007158104.1| hypothetical protein PHAVU_002G124400g [Phas...   117   9e-24
ref|XP_004159512.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   112   3e-22
ref|XP_004147940.1| PREDICTED: uncharacterized protein LOC101222...   112   3e-22
ref|XP_006579099.1| PREDICTED: uncharacterized protein LOC102660...   110   1e-21
ref|XP_007039961.1| HAT transposon superfamily [Theobroma cacao]...   106   2e-20
ref|XP_006396596.1| hypothetical protein EUTSA_v10029312mg [Eutr...   105   3e-20
ref|XP_006842452.1| hypothetical protein AMTR_s00077p00056600 [A...   105   4e-20
ref|XP_003538648.1| PREDICTED: uncharacterized protein LOC100805...   104   5e-20
ref|XP_006477267.1| PREDICTED: uncharacterized protein LOC102627...   103   1e-19
ref|XP_003611303.1| hypothetical protein MTR_5g012510 [Medicago ...   100   2e-18
emb|CAN68527.1| hypothetical protein VITISV_044224 [Vitis vinifera]   100   2e-18
ref|XP_007144025.1| hypothetical protein PHAVU_007G122800g [Phas...    99   3e-18

>ref|XP_002527444.1| protein dimerization, putative [Ricinus communis]
           gi|223533179|gb|EEF34936.1| protein dimerization,
           putative [Ricinus communis]
          Length = 633

 Score =  369 bits (947), Expect = 1e-99
 Identities = 178/283 (62%), Positives = 211/283 (74%)
 Frame = +3

Query: 120 MPSESDKWGWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 299
           MPSESDKWGW+HVSVFGGF+  +GTKRWKCNHCNLRYNGSYSRVRAHLLGF+GVGVKSCP
Sbjct: 1   MPSESDKWGWEHVSVFGGFDRGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFSGVGVKSCP 60

Query: 300 AIDRSLRAAFHILEEERLAXXXXXXXXXXXXXXXXXXXQPPFSSTGKVFGKEDVDDVVAR 479
           AIDRSLR AF ILEEERL                    Q   S   K   KEDVDD+VAR
Sbjct: 61  AIDRSLREAFQILEEERLVRKKKKNSANGKPGKRTRISQASIS--WKTITKEDVDDIVAR 118

Query: 480 FFFADGLNFNIINSPYFHEMAKAIGSFGPGYEPPSVEKLWGSFLSKEKARLEKAVVPIRE 659
           FF+ADGLN +++NSPYFHEM KAIG+FG GYE PS++KL  SFL KEK R+EK++  +RE
Sbjct: 119 FFYADGLNIDVVNSPYFHEMVKAIGAFGSGYELPSIDKLSDSFLGKEKGRIEKSLALLRE 178

Query: 660 SWTHTSCTILCLNQLDGTLGCFNVHIFASSTRGLIFLQSIDLKEDDRADTIFNEVLSKVI 839
           SW HT CTILC+ +LDG +GCF+++IF SS RGLIFL+++D+ + D  D +    LS  I
Sbjct: 179 SWPHTGCTILCVGRLDGAIGCFHINIFVSSPRGLIFLKAVDVDDCDEGDHVLAGALSDAI 238

Query: 840 MYVGPANVLQVIIYPGHASNFSEPFINSKFPQIFWSQCTSRSI 968
           + VGP+NVLQ+I + G A   SE +I SKFP IFWS CTS SI
Sbjct: 239 LEVGPSNVLQIISHLGDACKSSESYILSKFPHIFWSPCTSHSI 281


>ref|XP_003632266.1| PREDICTED: uncharacterized protein LOC100854857 [Vitis vinifera]
          Length = 635

 Score =  368 bits (945), Expect = 2e-99
 Identities = 181/283 (63%), Positives = 209/283 (73%)
 Frame = +3

Query: 120 MPSESDKWGWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 299
           MP+ESDKWGWKHVSVFGGF+  +GTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP
Sbjct: 1   MPTESDKWGWKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 60

Query: 300 AIDRSLRAAFHILEEERLAXXXXXXXXXXXXXXXXXXXQPPFSSTGKVFGKEDVDDVVAR 479
           AIDRSLR AF ILEEERLA                   QP  +   K   KEDVDD+VAR
Sbjct: 61  AIDRSLREAFQILEEERLARKKKRTSGSGKTGKRIRTSQPSVTCVWKTIAKEDVDDIVAR 120

Query: 480 FFFADGLNFNIINSPYFHEMAKAIGSFGPGYEPPSVEKLWGSFLSKEKARLEKAVVPIRE 659
           FF+ADGL+FNI+NSPYF EM KAI +FGPGYEPP+ EKL   FLSKEKA++EKA+  +RE
Sbjct: 121 FFYADGLDFNIVNSPYFLEMTKAIAAFGPGYEPPTTEKLSDLFLSKEKAKIEKAMALVRE 180

Query: 660 SWTHTSCTILCLNQLDGTLGCFNVHIFASSTRGLIFLQSIDLKEDDRADTIFNEVLSKVI 839
           SW HT CTILC+N+L  T G +  +IF SS RGL+FL+++D+ + D  D +F +VLS  I
Sbjct: 181 SWPHTGCTILCVNRLCRTQGRYYTNIFVSSPRGLMFLKALDINDGDGMDNMFVDVLSDAI 240

Query: 840 MYVGPANVLQVIIYPGHASNFSEPFINSKFPQIFWSQCTSRSI 968
           M V P NVLQ+I   GHAS   E  I SKF  +FWS CTS SI
Sbjct: 241 MEVEPTNVLQIISNLGHASESFESLILSKFRHLFWSPCTSHSI 283


>emb|CAN70085.1| hypothetical protein VITISV_003006 [Vitis vinifera]
          Length = 635

 Score =  367 bits (943), Expect = 3e-99
 Identities = 180/283 (63%), Positives = 209/283 (73%)
 Frame = +3

Query: 120 MPSESDKWGWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 299
           MP+ESDKWGWKHVSVFGGF+  +GTKRWKCNHCN+RYNGSYSRVRAHLLGFTGVGVKSCP
Sbjct: 1   MPTESDKWGWKHVSVFGGFDKGSGTKRWKCNHCNIRYNGSYSRVRAHLLGFTGVGVKSCP 60

Query: 300 AIDRSLRAAFHILEEERLAXXXXXXXXXXXXXXXXXXXQPPFSSTGKVFGKEDVDDVVAR 479
           AIDRSLR AF ILEEERLA                   QP  +   K   KEDVDD+VAR
Sbjct: 61  AIDRSLREAFQILEEERLARKKKRTSGSGKTGKRIRTSQPSVTCVWKTIAKEDVDDIVAR 120

Query: 480 FFFADGLNFNIINSPYFHEMAKAIGSFGPGYEPPSVEKLWGSFLSKEKARLEKAVVPIRE 659
           FF+ADGL+FNI+NSPYF EM KAI +FGPGYEPP+ EKL   FLSKEKA++EKA+  +RE
Sbjct: 121 FFYADGLDFNIVNSPYFLEMTKAIAAFGPGYEPPTTEKLSDLFLSKEKAKIEKAMALVRE 180

Query: 660 SWTHTSCTILCLNQLDGTLGCFNVHIFASSTRGLIFLQSIDLKEDDRADTIFNEVLSKVI 839
           SW HT CTILC+N+L  T G +  +IF SS RGL+FL+++D+ + D  D +F +VLS  I
Sbjct: 181 SWPHTGCTILCVNRLCRTQGRYYTNIFVSSPRGLMFLKALDINDGDGMDNMFVDVLSDAI 240

Query: 840 MYVGPANVLQVIIYPGHASNFSEPFINSKFPQIFWSQCTSRSI 968
           M V P NVLQ+I   GHAS   E  I SKF  +FWS CTS SI
Sbjct: 241 MEVEPTNVLQIISNLGHASESFESLILSKFRHLFWSPCTSHSI 283


>ref|XP_006484968.1| PREDICTED: uncharacterized protein LOC102615434 isoform X1 [Citrus
           sinensis] gi|568863036|ref|XP_006484969.1| PREDICTED:
           uncharacterized protein LOC102615434 isoform X2 [Citrus
           sinensis]
          Length = 636

 Score =  353 bits (905), Expect = 8e-95
 Identities = 172/285 (60%), Positives = 206/285 (72%)
 Frame = +3

Query: 120 MPSESDKWGWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 299
           MPSESDKWGW+HVSVFGGFE  +GTKRWKCNHCNLRYNGSYSRVRAHLLGF+GVGVKSCP
Sbjct: 1   MPSESDKWGWEHVSVFGGFERGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFSGVGVKSCP 60

Query: 300 AIDRSLRAAFHILEEERLAXXXXXXXXXXXXXXXXXXXQPPFSSTGKVFGKEDVDDVVAR 479
           AIDRS+R  F ILEEER+A                   Q   S   K   KEDVD++VAR
Sbjct: 61  AIDRSMRETFQILEEERIARKKKRTSGIAKHGKRIRACQS--SIVSKAISKEDVDEMVAR 118

Query: 480 FFFADGLNFNIINSPYFHEMAKAIGSFGPGYEPPSVEKLWGSFLSKEKARLEKAVVPIRE 659
           FF+A GLN N++NSPYF EM ++I +FG GY+ PS+E L  SFLSKEK ++EK +  +RE
Sbjct: 119 FFYAAGLNVNVVNSPYFLEMVRSIAAFGHGYDLPSLENLSDSFLSKEKGKIEKFIASVRE 178

Query: 660 SWTHTSCTILCLNQLDGTLGCFNVHIFASSTRGLIFLQSIDLKEDDRADTIFNEVLSKVI 839
           SW HT CTILC++ LDG LGCF   IF SS RGL+FL+++DL + D A+ +F  VLS  I
Sbjct: 179 SWPHTGCTILCVSSLDGRLGCFPTGIFVSSPRGLVFLKALDLDDTDEAENLFITVLSDAI 238

Query: 840 MYVGPANVLQVIIYPGHASNFSEPFINSKFPQIFWSQCTSRSIHI 974
           + VGP NVLQ+I + GHA    E  + SKFP IF S CT +SIH+
Sbjct: 239 LEVGPKNVLQIISHLGHACKSYESLVLSKFPHIFLSPCTLQSIHM 283


>ref|XP_006424350.1| hypothetical protein CICLE_v10028008mg [Citrus clementina]
           gi|557526284|gb|ESR37590.1| hypothetical protein
           CICLE_v10028008mg [Citrus clementina]
          Length = 636

 Score =  352 bits (904), Expect = 1e-94
 Identities = 172/285 (60%), Positives = 206/285 (72%)
 Frame = +3

Query: 120 MPSESDKWGWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 299
           MPSESDKWGW+HVSVFGGFE  +GTKRWKCNHCNLRYNGSYSRVRAHLLGF+GVGVKSCP
Sbjct: 1   MPSESDKWGWEHVSVFGGFERGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFSGVGVKSCP 60

Query: 300 AIDRSLRAAFHILEEERLAXXXXXXXXXXXXXXXXXXXQPPFSSTGKVFGKEDVDDVVAR 479
           AIDRS+R  F ILEEER+A                   Q   S   K   KEDVD++VAR
Sbjct: 61  AIDRSMRETFQILEEERIARKKKRTSGIAKHGKRIRACQS--SIVSKAISKEDVDEMVAR 118

Query: 480 FFFADGLNFNIINSPYFHEMAKAIGSFGPGYEPPSVEKLWGSFLSKEKARLEKAVVPIRE 659
           FF+A GLN N++NSPYF EM ++I +FG GY+ PS+E L  SFLSKEK ++EK +  +RE
Sbjct: 119 FFYAAGLNVNVVNSPYFLEMVRSIAAFGHGYDLPSLENLSDSFLSKEKGKIEKFIASVRE 178

Query: 660 SWTHTSCTILCLNQLDGTLGCFNVHIFASSTRGLIFLQSIDLKEDDRADTIFNEVLSKVI 839
           SW HT CTILC++ LDG LGCF   IF SS RGL+FL+++DL + D A+ +F  VLS  I
Sbjct: 179 SWPHTGCTILCVSSLDGQLGCFPTGIFVSSPRGLVFLKALDLDDTDEAENLFITVLSDAI 238

Query: 840 MYVGPANVLQVIIYPGHASNFSEPFINSKFPQIFWSQCTSRSIHI 974
           + VGP NVLQ+I + GHA    E  + SKFP IF S CT +SIH+
Sbjct: 239 LDVGPKNVLQIISHLGHACKSYESLVLSKFPHIFLSPCTLQSIHM 283


>ref|XP_007014534.1| Uncharacterized protein TCM_039722 [Theobroma cacao]
           gi|508784897|gb|EOY32153.1| Uncharacterized protein
           TCM_039722 [Theobroma cacao]
          Length = 381

 Score =  140 bits (352), Expect = 1e-30
 Identities = 64/79 (81%), Positives = 71/79 (89%)
 Frame = +3

Query: 120 MPSESDKWGWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 299
           M SE DKWGW+HV+VFG F+  +GTKRWKCNHCNLRYNGSYSRVRAHLL F+GVGVKSC 
Sbjct: 1   MASEFDKWGWEHVTVFGVFDRGSGTKRWKCNHCNLRYNGSYSRVRAHLLRFSGVGVKSCL 60

Query: 300 AIDRSLRAAFHILEEERLA 356
           AI+R+LR AFHILEEERLA
Sbjct: 61  AINRTLREAFHILEEERLA 79



 Score = 69.7 bits (169), Expect = 2e-09
 Identities = 32/56 (57%), Positives = 41/56 (73%)
 Frame = +3

Query: 555 SFGPGYEPPSVEKLWGSFLSKEKARLEKAVVPIRESWTHTSCTILCLNQLDGTLGC 722
           +FG GYEPPS++KL   FLSKEK R+EK++  +RESW HT  T+LC+    G LGC
Sbjct: 92  TFGCGYEPPSMDKLSDCFLSKEKGRIEKSITLVRESWPHTGYTVLCV----GCLGC 143


>ref|XP_007144620.1| hypothetical protein PHAVU_007G170700g [Phaseolus vulgaris]
           gi|561017810|gb|ESW16614.1| hypothetical protein
           PHAVU_007G170700g [Phaseolus vulgaris]
          Length = 612

 Score =  119 bits (299), Expect = 1e-24
 Identities = 88/301 (29%), Positives = 131/301 (43%), Gaps = 25/301 (8%)
 Frame = +3

Query: 147 WKHVSVF----GGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRS 314
           W++VS      GG     G    KC+ CN  +NGSY++VRAHLL  TGVGV+ CP +  S
Sbjct: 18  WRYVSKLRKTPGG-----GNNMIKCSLCNFSFNGSYTQVRAHLLKLTGVGVRICPKVTPS 72

Query: 315 LRAAFHILEEERLAXXXXXXXXXXXXXXXXXXXQPPFSSTGKVFGKED------------ 458
               F  L+ E                       PP S  G     +D            
Sbjct: 73  KLVEFKKLDNEATLKIEGLKQKEVHL--------PPVSDEGNQTNSDDNPKFKGSLQAAF 124

Query: 459 -------VDDVVARFFFADGLNFNIINSPYFHEMAK--AIGSFGPGYEPPSVEKLWGSFL 611
                  +D  +AR F++ GL F++  SPY+       A  S    Y PP+  KL G  L
Sbjct: 125 NIQARDTLDCEIARMFYSSGLPFHLSRSPYYRSAFSYAANTSNLSEYVPPTYNKLRGHLL 184

Query: 612 SKEKARLEKAVVPIRESWTHTSCTILCLNQLDGTLGCFNVHIFASSTRGLIFLQSIDLKE 791
           SKE++ +E  + PI+ SW     TI+     D       +   A++  G +FL+SI+   
Sbjct: 185 SKERSHVENLLQPIQNSWNQKGVTIVSDGWSDPQRKPL-IDFMATTESGSVFLKSINRYG 243

Query: 792 DDRADTIFNEVLSKVIMYVGPANVLQVIIYPGHASNFSEPFINSKFPQIFWSQCTSRSIH 971
           + +      + +  VIM VG  NV+Q+I         +   I  +FP I+WS C   +++
Sbjct: 244 EIKDKDFIAKHIRDVIMEVGQNNVVQIITDNADVCKAAGMLIELEFPSIYWSPCVVHTLN 303

Query: 972 I 974
           +
Sbjct: 304 L 304


>ref|XP_007132504.1| hypothetical protein PHAVU_011G099800g [Phaseolus vulgaris]
           gi|561005504|gb|ESW04498.1| hypothetical protein
           PHAVU_011G099800g [Phaseolus vulgaris]
          Length = 511

 Score =  117 bits (293), Expect = 7e-24
 Identities = 80/289 (27%), Positives = 134/289 (46%), Gaps = 13/289 (4%)
 Frame = +3

Query: 147 WKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAI------- 305
           W++VS      T  G    +C+ CN   NGSY+RVRAHLL  TG GV+SCP +       
Sbjct: 18  WRYVSKLRK-TTGGGNNMIQCSLCNFILNGSYTRVRAHLLKLTGAGVRSCPYVTASKLVE 76

Query: 306 ----DRSLRAAFHILEEERLAXXXXXXXXXXXXXXXXXXXQPPFSSTGKVFGKEDVDDVV 473
               D   +     L++++++                   +    +   +  ++ +D  +
Sbjct: 77  LKKLDNEAKLKIEGLKQKKVSLPPVSDEGNQTRSDVNPKFKGFLQAAFNIQMRDTLDCEI 136

Query: 474 ARFFFADGLNFNIINSPYFHE-MAKAIGSFG-PGYEPPSVEKLWGSFLSKEKARLEKAVV 647
           AR F++ GL F++  SPY+    + A  +    GY PP+  KL G  LSKE++ +E  + 
Sbjct: 137 ARMFYSSGLPFHLAISPYYRSAFSNATNTSNLSGYVPPTYNKLRGPLLSKERSHVENLLQ 196

Query: 648 PIRESWTHTSCTILCLNQLDGTLGCFNVHIFASSTRGLIFLQSIDLKEDDRADTIFNEVL 827
           PIR SW     TI+     D       ++  A +  G +FL+S+D   + +      + +
Sbjct: 197 PIRNSWNQKGVTIVSDGWSDPQRRPL-INFMAITEWGSMFLKSVDGSGEIKDKEFIAKHM 255

Query: 828 SKVIMYVGPANVLQVIIYPGHASNFSEPFINSKFPQIFWSQCTSRSIHI 974
             VIM VG  NV+Q+I         +   I  +FP I+W+ C   ++++
Sbjct: 256 RDVIMEVGHNNVVQIITDNAVVCKAAGMLIGYEFPSIYWTPCAVHTLNL 304


>ref|XP_007158104.1| hypothetical protein PHAVU_002G124400g [Phaseolus vulgaris]
           gi|561031519|gb|ESW30098.1| hypothetical protein
           PHAVU_002G124400g [Phaseolus vulgaris]
          Length = 591

 Score =  117 bits (292), Expect = 9e-24
 Identities = 78/283 (27%), Positives = 128/283 (45%), Gaps = 7/283 (2%)
 Frame = +3

Query: 147 WKHVSVF----GGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRS 314
           W++VS      GG     G    KC+ C+  +NGSY+RVRAHLL  TG  ++    +D  
Sbjct: 18  WRYVSKLRKTPGG-----GNNMIKCSSCDFSFNGSYTRVRAHLLRITGEELEFFKKLDNE 72

Query: 315 LRAAFHILEEERL-AXXXXXXXXXXXXXXXXXXXQPPFSSTGKVFGKEDVDDVVARFFFA 491
                  L+++++                     +    +   + G++ VD  VAR F++
Sbjct: 73  ASLKIEYLKKKKVPLPHVSDEGKQTNNNDLNPKLKGSLQAAFNIQGRDTVDCAVARMFYS 132

Query: 492 DGLNFNIINSPYFHEMAKAIGSFG--PGYEPPSVEKLWGSFLSKEKARLEKAVVPIRESW 665
            GL F++  +PY+        +     GY PP+  KL G  LSKE+  +E  + PIR SW
Sbjct: 133 SGLPFHLARNPYYRNAFSVATNTSNLSGYVPPTYNKLRGPLLSKERRHVENLLQPIRNSW 192

Query: 666 THTSCTILCLNQLDGTLGCFNVHIFASSTRGLIFLQSIDLKEDDRADTIFNEVLSKVIMY 845
                TI+     D       ++  A +  G +FL+S+D   + +      + +  VIM 
Sbjct: 193 NQKGVTIVSDGWSDPQRRPL-INFMAITESGPMFLKSVDGSGEIKDKDFIAKHIRDVIME 251

Query: 846 VGPANVLQVIIYPGHASNFSEPFINSKFPQIFWSQCTSRSIHI 974
           VGP NV+Q+I             I  +FP I+W+ C   ++++
Sbjct: 252 VGPKNVVQIITDNASVCKAVGMLIELEFPSIYWTPCVVHTLNL 294


>ref|XP_004159512.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101222344
           [Cucumis sativus]
          Length = 673

 Score =  112 bits (279), Expect = 3e-22
 Identities = 62/186 (33%), Positives = 99/186 (53%)
 Frame = +3

Query: 414 QPPFSSTGKVFGKEDVDDVVARFFFADGLNFNIINSPYFHEMAKAIGSFGPGYEPPSVEK 593
           QPP     K   K++ D  VA FFF + + F+   S Y+ EM  AI  +G GY+ PS EK
Sbjct: 125 QPPIDDAQKQ-KKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEK 183

Query: 594 LWGSFLSKEKARLEKAVVPIRESWTHTSCTILCLNQLDGTLGCFNVHIFASSTRGLIFLQ 773
           L  + L K K  +  +    R+ W  T CTILC +  DG    F V I  + ++G +FL+
Sbjct: 184 LKSTLLDKVKGDIHSSYKKHRDEWKETGCTILCDSWSDGQTKSFLV-ISVTCSKGTLFLK 242

Query: 774 SIDLKEDDRADTIFNEVLSKVIMYVGPANVLQVIIYPGHASNFSEPFINSKFPQIFWSQC 953
           S+D+   +   T  +++L  +I+ VG  NV+Q+I     +  ++   + +K+  +FWS C
Sbjct: 243 SVDISGHEDDATYLSDLLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWSPC 302

Query: 954 TSRSIH 971
            S  ++
Sbjct: 303 VSYCVN 308


>ref|XP_004147940.1| PREDICTED: uncharacterized protein LOC101222344 [Cucumis sativus]
          Length = 673

 Score =  112 bits (279), Expect = 3e-22
 Identities = 62/186 (33%), Positives = 99/186 (53%)
 Frame = +3

Query: 414 QPPFSSTGKVFGKEDVDDVVARFFFADGLNFNIINSPYFHEMAKAIGSFGPGYEPPSVEK 593
           QPP     K   K++ D  VA FFF + + F+   S Y+ EM  AI  +G GY+ PS EK
Sbjct: 125 QPPIDDAQKQ-KKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEK 183

Query: 594 LWGSFLSKEKARLEKAVVPIRESWTHTSCTILCLNQLDGTLGCFNVHIFASSTRGLIFLQ 773
           L  + L K K  +  +    R+ W  T CTILC +  DG    F V I  + ++G +FL+
Sbjct: 184 LKSTLLDKVKGDIHSSYKKHRDEWKETGCTILCDSWSDGQTKSFLV-ISVTCSKGTLFLK 242

Query: 774 SIDLKEDDRADTIFNEVLSKVIMYVGPANVLQVIIYPGHASNFSEPFINSKFPQIFWSQC 953
           S+D+   +   T  +++L  +I+ VG  NV+Q+I     +  ++   + +K+  +FWS C
Sbjct: 243 SVDISGHEDDATYLSDLLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWSPC 302

Query: 954 TSRSIH 971
            S  ++
Sbjct: 303 VSYCVN 308


>ref|XP_006579099.1| PREDICTED: uncharacterized protein LOC102660479 [Glycine max]
          Length = 765

 Score =  110 bits (274), Expect = 1e-21
 Identities = 77/285 (27%), Positives = 128/285 (44%), Gaps = 9/285 (3%)
 Frame = +3

Query: 147 WKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAI-DRSLRA 323
           W  V++        G + W CN C      SYSRV+AHLL   G G+ +CP + D  L  
Sbjct: 21  WSFVTIKEKIGDGGGNRLWSCNFCEKVVKSSYSRVKAHLLRICGSGIDTCPKVTDAYLVY 80

Query: 324 AFHILEEERLAXXXXXXXXXXXXXXXXXXXQPP----FSSTGKVFGKEDVDDV---VARF 482
              + EE                        PP     S+    F  ED + +   +AR 
Sbjct: 81  LRRVCEEAESILKSKNVPLPTDKRTPTPPTLPPKRRKSSNIESAFNIEDRNHLRAEIARM 140

Query: 483 FFADGLNFNIINSPYF-HEMAKAIGSFGPGYEPPSVEKLWGSFLSKEKARLEKAVVPIRE 659
           F++  L+F++  +PYF    + A      G+ PPS   L  S L +E++ +E+ + PI+ 
Sbjct: 141 FYSASLSFHLARNPYFVSSYSFAANCNLSGFLPPSYNALRTSLLQQERSYIERLLQPIKS 200

Query: 660 SWTHTSCTILCLNQLDGTLGCFNVHIFASSTRGLIFLQSIDLKEDDRADTIFNEVLSKVI 839
            W+    T++     D  +    ++  A S  G +FL++ID  ++ +      ++L  VI
Sbjct: 201 LWSLKGVTLVVDGWTDAQIRPL-INFMAISEEGPMFLKAIDGSKEYKDKHYMFDLLKDVI 259

Query: 840 MYVGPANVLQVIIYPGHASNFSEPFINSKFPQIFWSQCTSRSIHI 974
             VGP +V+QVI    +    +   I  +FP IFW+ C   ++++
Sbjct: 260 KEVGPQSVVQVITDNAYVCKAAGLLIEVEFPHIFWTPCVVHTLNL 304


>ref|XP_007039961.1| HAT transposon superfamily [Theobroma cacao]
           gi|508777206|gb|EOY24462.1| HAT transposon superfamily
           [Theobroma cacao]
          Length = 674

 Score =  106 bits (264), Expect = 2e-20
 Identities = 63/187 (33%), Positives = 96/187 (51%), Gaps = 1/187 (0%)
 Frame = +3

Query: 414 QPPFSSTGKVFGKEDVDDVVARFFFADGLNFNIINSPYFHEMAKAIGSFGPGYEPPSVEK 593
           + P    G+   +ED D  +A FFF + + F+   S Y+ EM  AI   G GY+ PS E 
Sbjct: 126 EQPAVDDGQKQKQEDADKKIAVFFFHNSIPFSAAKSMYYQEMVDAIAKCGVGYKAPSYEN 185

Query: 594 LWGSFLSKEKARLEKAVVPIRESWTHTSCTILCLNQLDGTLGCFNVHIFASSTRGLIFLQ 773
           L  + L K K  +       R+ W  T CTILC +  DG    F V    +  +G +FL+
Sbjct: 186 LRSTLLEKVKGDIHDCYKKYRDEWKETGCTILCDSWSDGRTKSF-VIFSVTCPKGTLFLK 244

Query: 774 SIDLK-EDDRADTIFNEVLSKVIMYVGPANVLQVIIYPGHASNFSEPFINSKFPQIFWSQ 950
           S+D+   +D A  +F E+L  V++ VG  NV+QVI     +  ++   + +K+  +FWS 
Sbjct: 245 SVDVSGHEDDASYLF-ELLESVVLEVGLENVIQVITDTAASYVYAGRLLMAKYSSLFWSP 303

Query: 951 CTSRSIH 971
           C S  I+
Sbjct: 304 CASYCIN 310


>ref|XP_006396596.1| hypothetical protein EUTSA_v10029312mg [Eutrema salsugineum]
           gi|557097613|gb|ESQ38049.1| hypothetical protein
           EUTSA_v10029312mg [Eutrema salsugineum]
          Length = 671

 Score =  105 bits (262), Expect = 3e-20
 Identities = 73/277 (26%), Positives = 125/277 (45%), Gaps = 1/277 (0%)
 Frame = +3

Query: 147 WKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLRAA 326
           W +VS       + GT+++KC+ C+    GSYSRVRAHLLG    G+  C  + R+ + A
Sbjct: 30  WSYVSKLEKQGEKGGTRKFKCSFCSEIRQGSYSRVRAHLLGIKYAGIVVCKKVPRTEKLA 89

Query: 327 FHILEEERLAXXXXXXXXXXXXXXXXXXXQPPFSSTGKVFGKEDVDDVVARFFFADGLNF 506
              LEEE                           S  K     D  + + R FF  GL  
Sbjct: 90  MQRLEEE-------FEKKKNESGPREVSLPCEVGSALKKRKAADSPNEIGRMFFTGGLAS 142

Query: 507 NIINSPYFHEMAK-AIGSFGPGYEPPSVEKLWGSFLSKEKARLEKAVVPIRESWTHTSCT 683
           N+  +P++H   + A  +   GY PP   KL  + L KE+  +EK + P++ +W     T
Sbjct: 143 NLARNPHYHRAFQFAAANKIDGYVPPGYNKLQTTLLEKERNHVEKLLDPLKSTWKEGGVT 202

Query: 684 ILCLNQLDGTLGCFNVHIFASSTRGLIFLQSIDLKEDDRADTIFNEVLSKVIMYVGPANV 863
           I+     D  L    ++  A+S  G +F+++++   + +     + ++ +VI  V   NV
Sbjct: 203 IVSDGWSD-PLKKPLINSMATSGNGPVFIKAVNYFGEVKDRVFISGLMEEVINKVWKQNV 261

Query: 864 LQVIIYPGHASNFSEPFINSKFPQIFWSQCTSRSIHI 974
           +Q+I         +   I S +  I+W+ C   ++++
Sbjct: 262 VQIITDNAANCKAAGDIIESMYSHIYWTPCVVHTLNL 298


>ref|XP_006842452.1| hypothetical protein AMTR_s00077p00056600 [Amborella trichopoda]
           gi|548844538|gb|ERN04127.1| hypothetical protein
           AMTR_s00077p00056600 [Amborella trichopoda]
          Length = 435

 Score =  105 bits (261), Expect = 4e-20
 Identities = 73/294 (24%), Positives = 123/294 (41%), Gaps = 18/294 (6%)
 Frame = +3

Query: 147 WKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLRAA 326
           W++  V  G      T++ +C  C    +   +R R HL G +   V  C  +   +R  
Sbjct: 9   WQYGKVVDG-----KTQKVECCFCGANMSSGITRFRNHLAGVSKKDVAPCKQVPDEVRML 63

Query: 327 FHIL---------------EEERLAXXXXXXXXXXXXXXXXXXX---QPPFSSTGKVFGK 452
            + L               +E  LA                       P   +     GK
Sbjct: 64  AYNLVKTKDKEADAKKQRKKELTLASRHESTSMGESLLSPSPIRTLLHPSIENIWPKRGK 123

Query: 453 EDVDDVVARFFFADGLNFNIINSPYFHEMAKAIGSFGPGYEPPSVEKLWGSFLSKEKARL 632
           E VDD++ +FFF +GL FN+  S Y+  +  AI ++G GY+ PS E L    L   K  +
Sbjct: 124 ELVDDLMGKFFFDNGLPFNVARSRYYQPLIDAIAAYGVGYKGPSSETLRTDILQNVKEEV 183

Query: 633 EKAVVPIRESWTHTSCTILCLNQLDGTLGCFNVHIFASSTRGLIFLQSIDLKEDDRADTI 812
           +K V   R+ W  T CTI+  +  D       ++   +  +G +FL+S D+         
Sbjct: 184 QKFVDDRRKDWAETGCTIMSDSWTDARDRSL-INFLVACPKGTVFLRSADITAHVNDPKY 242

Query: 813 FNEVLSKVIMYVGPANVLQVIIYPGHASNFSEPFINSKFPQIFWSQCTSRSIHI 974
            + +  ++I  VGP NV+Q+I   G +       +  K+P++FW+ C +  + +
Sbjct: 243 LSNLFEEIIQEVGPENVVQIITDIGDSFKAVGNILCGKYPKLFWAGCATHGVDL 296


>ref|XP_003538648.1| PREDICTED: uncharacterized protein LOC100805582 isoform X1 [Glycine
           max] gi|571487050|ref|XP_006590550.1| PREDICTED:
           uncharacterized protein LOC100805582 isoform X2 [Glycine
           max]
          Length = 675

 Score =  104 bits (260), Expect = 5e-20
 Identities = 58/173 (33%), Positives = 89/173 (51%)
 Frame = +3

Query: 450 KEDVDDVVARFFFADGLNFNIINSPYFHEMAKAIGSFGPGYEPPSVEKLWGSFLSKEKAR 629
           ++D D  +A FFF + + F+   S Y+ EM  A+   G GY+ PS EKL  + L K KA 
Sbjct: 139 QDDADRKLAIFFFHNSIPFSAAKSIYYQEMVDAVAQCGVGYKAPSYEKLRSTLLEKVKAD 198

Query: 630 LEKAVVPIRESWTHTSCTILCLNQLDGTLGCFNVHIFASSTRGLIFLQSIDLKEDDRADT 809
           +       R+ W  T CT+LC N  DG  G   V   A   +G +FL+S+D+   +   T
Sbjct: 199 IHSDYKKYRDEWKETGCTVLCDNWSDGRTGSLAVFSVA-CPKGTLFLKSVDVSGHENDST 257

Query: 810 IFNEVLSKVIMYVGPANVLQVIIYPGHASNFSEPFINSKFPQIFWSQCTSRSI 968
              E+L  V++ VG  NV+QVI     +   +   + +++  +FWS C +  I
Sbjct: 258 YLFELLESVVLEVGAENVVQVITDASASYVCAGRLLIARYSFLFWSPCVAYCI 310


>ref|XP_006477267.1| PREDICTED: uncharacterized protein LOC102627361 [Citrus sinensis]
          Length = 674

 Score =  103 bits (257), Expect = 1e-19
 Identities = 60/176 (34%), Positives = 90/176 (51%), Gaps = 3/176 (1%)
 Frame = +3

Query: 450 KEDVDDVVARFFFADGLNFNIINSPYFHEMAKAIGSFGPGYEPPSVEKLWGSFLSKEKAR 629
           ++D D  +A FFF + + F+   S Y+ EM  AI   G GY  PS EKL  + L K K  
Sbjct: 138 QDDTDKKIAVFFFHNSIPFSAAKSMYYQEMVNAIAECGVGYIAPSYEKLRSTLLEKVKVD 197

Query: 630 LEKAVVPIRESWTHTSCTILCLNQLD---GTLGCFNVHIFASSTRGLIFLQSIDLKEDDR 800
           ++      RE W  T CTILC N  D    +L  F+V    +  +G +FL+S+D+   + 
Sbjct: 198 IDDCCKKYREEWKETGCTILCDNWSDERTKSLVVFSV----ACPKGTLFLKSVDVSGHEE 253

Query: 801 ADTIFNEVLSKVIMYVGPANVLQVIIYPGHASNFSEPFINSKFPQIFWSQCTSRSI 968
             T   E+L  V++ VG  NV+QVI        ++   + +K+  +FWS C +  I
Sbjct: 254 DATFLFELLESVVLDVGVENVIQVITDSAACYVYAGRLLMTKYSSLFWSPCAAYCI 309


>ref|XP_003611303.1| hypothetical protein MTR_5g012510 [Medicago truncatula]
           gi|355512638|gb|AES94261.1| hypothetical protein
           MTR_5g012510 [Medicago truncatula]
          Length = 725

 Score = 99.8 bits (247), Expect = 2e-18
 Identities = 49/176 (27%), Positives = 92/176 (52%), Gaps = 2/176 (1%)
 Frame = +3

Query: 453 EDVDDVVARFFFADGLNFNIINSPYFHEMAKAIGSFGPGYEPPSVEKLWGSFLSKEKARL 632
           E  D  +A++F A  + FN  NSPYF     A+   G GY+ PS+  L G  L+K     
Sbjct: 161 EKCDLALAKWFIAASIPFNAANSPYFQSAVDALCCMGAGYKAPSIHDLRGPLLNKWVDET 220

Query: 633 EKAVVPIRESWTHTSCTILCLNQLDGTLGCFNVHIFASSTRGLIFLQSIDLKEDDRADTI 812
           +K +   RE W +T CT++     DG      ++      +G +F++S+D     +   +
Sbjct: 221 KKKIEKYREIWKNTGCTLMADGWTDGVRRTL-INFLVYCPKGTVFIKSVDASGASKTGEM 279

Query: 813 FNEVLSKVIMYVGPANVLQVIIYPGHASNF--SEPFINSKFPQIFWSQCTSRSIHI 974
             ++  ++++Y+GP NV+Q++    +A+N+  +   +  +FP ++WS C +  I++
Sbjct: 280 LFKLFKEIVLYIGPENVVQIV--TDNAANYVAAGRLLEKEFPGLYWSPCAAHCINL 333


>emb|CAN68527.1| hypothetical protein VITISV_044224 [Vitis vinifera]
          Length = 926

 Score = 99.8 bits (247), Expect = 2e-18
 Identities = 61/174 (35%), Positives = 93/174 (53%), Gaps = 4/174 (2%)
 Frame = +3

Query: 450 KEDVDDVVARFFFADGLNFNIINSPYFHEMAKAIGSFGPGYEPPSVEKLWGSFLSKEKAR 629
           ++D D  VA FFF + + F+   S Y+ EM  AI   G GY+ PS EKL  + + K K  
Sbjct: 390 QDDADKKVAVFFFHNSVPFSAAKSMYYQEMVDAIAECGVGYKAPSYEKLRSTLMEKVKCD 449

Query: 630 LEKAVVPIRESWTHTSCTILCLNQLDG---TLGCFNVHIFASSTRGLIFLQSIDLK-EDD 797
           +      +R+ W  T CTILC    DG   +L  F+V    +  +G +FL+S+D+    D
Sbjct: 450 VNDCCKKLRDGWRXTGCTILCDCWSDGRTKSLXVFSV----TCPKGTLFLKSVDISGHAD 505

Query: 798 RADTIFNEVLSKVIMYVGPANVLQVIIYPGHASNFSEPFINSKFPQIFWSQCTS 959
            A  +F E+L  V++ VG  NV+QVI     +  ++   + +K+  +FWS C S
Sbjct: 506 DAHYLF-ELLESVVLEVGLENVVQVITDSAASYVYAGRLLMAKYTTLFWSPCAS 558


>ref|XP_007144025.1| hypothetical protein PHAVU_007G122800g [Phaseolus vulgaris]
           gi|561017215|gb|ESW16019.1| hypothetical protein
           PHAVU_007G122800g [Phaseolus vulgaris]
          Length = 550

 Score = 99.0 bits (245), Expect = 3e-18
 Identities = 80/278 (28%), Positives = 119/278 (42%), Gaps = 2/278 (0%)
 Frame = +3

Query: 147 WKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLRAA 326
           W++VS      T  G    +C+ CN  +NGSY+RVRAHLL   G GV+ CP +  S    
Sbjct: 20  WRYVSKLRK-TTGGGNNMIQCSLCNFIFNGSYTRVRAHLLKLMGAGVRRCPYVTTSKLVE 78

Query: 327 FHILEEERLAXXXXXXXXXXXXXXXXXXXQPPFSSTGKVFGKEDVDDVVARFFFADGLNF 506
              L+ E                      Q  F+    +  ++ +D  +AR F++ GL F
Sbjct: 79  LKKLDNEAKLKIEGNQTRSDVNPKFKGSLQAAFN----IQARDTLDCDIARMFYSSGLPF 134

Query: 507 NIINSPYFHEMAK--AIGSFGPGYEPPSVEKLWGSFLSKEKARLEKAVVPIRESWTHTSC 680
           ++  SPY+       A  S   GY  P+  KL G  LSKE++ +E  + P R SW     
Sbjct: 135 HLARSPYYRSAFSNAANTSKLSGYVAPTYNKLRGPLLSKERSHVENLLQPTRHSWNQKGV 194

Query: 681 TILCLNQLDGTLGCFNVHIFASSTRGLIFLQSIDLKEDDRADTIFNEVLSKVIMYVGPAN 860
           TI+     D       ++    +  G +FL+SID  E D      N V+ K       A 
Sbjct: 195 TIVSDGWSDPQRRPL-INFMTITESGPMFLKSID--ESD------NVVVCKA------AG 239

Query: 861 VLQVIIYPGHASNFSEPFINSKFPQIFWSQCTSRSIHI 974
           +L                I SKFP I+W+ C   ++++
Sbjct: 240 ML----------------IESKFPSIYWTPCVVHTLNL 261

