BLASTX nr result

ID: Catharanthus22_contig00003695 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00003695
         (1624 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002265131.1| PREDICTED: uncharacterized protein LOC100268...   333   1e-88
ref|XP_006350362.1| PREDICTED: uncharacterized protein LOC102588...   328   3e-87
ref|XP_006437510.1| hypothetical protein CICLE_v10032158mg [Citr...   320   1e-84
ref|XP_006484658.1| PREDICTED: uncharacterized protein LOC102619...   318   3e-84
ref|XP_006484657.1| PREDICTED: uncharacterized protein LOC102619...   318   3e-84
ref|XP_004231527.1| PREDICTED: uncharacterized protein LOC101255...   318   3e-84
gb|EOX92620.1| Uncharacterized protein TCM_001539 [Theobroma cacao]   318   4e-84
ref|XP_004307383.1| PREDICTED: uncharacterized protein LOC101294...   310   9e-82
ref|XP_006406456.1| hypothetical protein EUTSA_v10021111mg [Eutr...   302   2e-79
gb|EXB61829.1| hypothetical protein L484_012263 [Morus notabilis]     300   1e-78
ref|NP_566649.1| uncharacterized protein [Arabidopsis thaliana] ...   297   8e-78
ref|XP_003629796.1| hypothetical protein MTR_8g086630 [Medicago ...   296   2e-77
ref|XP_002315057.1| hypothetical protein POPTR_0010s17720g [Popu...   296   2e-77
gb|EMJ20588.1| hypothetical protein PRUPE_ppa020238mg, partial [...   294   8e-77
ref|XP_002885329.1| hypothetical protein ARALYDRAFT_479498 [Arab...   293   2e-76
ref|XP_006298144.1| hypothetical protein CARUB_v10014192mg [Caps...   292   2e-76
ref|XP_002528748.1| conserved hypothetical protein [Ricinus comm...   290   9e-76
ref|XP_003524967.1| PREDICTED: uncharacterized protein LOC100792...   290   1e-75
gb|AGV54555.1| hypothetical protein [Phaseolus vulgaris]              288   6e-75
gb|ESW31471.1| hypothetical protein PHAVU_002G240600g [Phaseolus...   286   1e-74

>ref|XP_002265131.1| PREDICTED: uncharacterized protein LOC100268166 [Vitis vinifera]
            gi|297743783|emb|CBI36666.3| unnamed protein product
            [Vitis vinifera]
          Length = 320

 Score =  333 bits (854), Expect = 1e-88
 Identities = 167/269 (62%), Positives = 200/269 (74%)
 Frame = +3

Query: 402  RKRPRSALGLSAGEDISDFLADESLSSFDWDDHGQLEAEEDNESPWKEAVIYKRNPSLSH 581
            R +P SAL  +A  +   F  +E+   FDW+D  ++E   D  SPW+ AV+YKRNPS+ H
Sbjct: 57   RNKPHSALKFTARYNFESF-DEENTKKFDWNDEREIE---DTGSPWEGAVVYKRNPSILH 112

Query: 582  TEYCTTLESLGLANLSTDVSKSRASVMGLRVTKTVKDHPDGTPVLISLDVTRKKHKLRLD 761
             E+CTTLE LGL  LST++SKSRASVMGLRVTK  KD+P GTPV IS+DVTRKKHKLRLD
Sbjct: 113  VEHCTTLERLGLGKLSTEISKSRASVMGLRVTKAAKDYPQGTPVHISIDVTRKKHKLRLD 172

Query: 762  GIIRTVITVGCNRCGEPAAKSIFSNFSLLLTEEPIQELETINMGIIYGKEKSRGLGNTEV 941
            G++RTVIT+GCNRCGEPAA+ IFSNFSLLLTEEPI+E E INMG+I+G E  +   +TE 
Sbjct: 173  GLLRTVITLGCNRCGEPAAECIFSNFSLLLTEEPIEEQEVINMGVIFG-EDDKLKTSTES 231

Query: 942  XXXXXXXXXXXXXXLYFPPEEKEIDISKHIRDLVHVEITIDELCDPKCKGLCLGCGMNLN 1121
                          LYFPPEE EIDISKHIRD+VH+EITI+ +CD +CKG+CL CG+NLN
Sbjct: 232  SEEDDEASIDLDDWLYFPPEETEIDISKHIRDMVHLEITINAVCDSRCKGICLKCGINLN 291

Query: 1122 VDSCQCRVEEMDRKSYGPLGNLRKQILQK 1208
              SC C  EE+  K YGPLG LRKQI QK
Sbjct: 292  TASCNCSKEEVKEKGYGPLGVLRKQIQQK 320


>ref|XP_006350362.1| PREDICTED: uncharacterized protein LOC102588036 [Solanum tuberosum]
          Length = 302

 Score =  328 bits (842), Expect = 3e-87
 Identities = 166/276 (60%), Positives = 202/276 (73%)
 Frame = +3

Query: 378  DANLRADFRKRPRSALGLSAGEDISDFLADESLSSFDWDDHGQLEAEEDNESPWKEAVIY 557
            D + R +FRK+    +     E  S+F+       FDW+D  + E EED +SPW+ AV+Y
Sbjct: 36   DCHRRLNFRKKGSKFV---VREQKSNFV------DFDWEDEDEYE-EEDQDSPWEGAVVY 85

Query: 558  KRNPSLSHTEYCTTLESLGLANLSTDVSKSRASVMGLRVTKTVKDHPDGTPVLISLDVTR 737
            KRN S++H EYCTTLE LGL  LST VSK RASVMGLRVTK V D+PDGTPVL+S DVTR
Sbjct: 86   KRNSSVTHLEYCTTLERLGLGKLSTKVSKCRASVMGLRVTKQVNDYPDGTPVLVSFDVTR 145

Query: 738  KKHKLRLDGIIRTVITVGCNRCGEPAAKSIFSNFSLLLTEEPIQELETINMGIIYGKEKS 917
            KKHKLRLDGIIRTVI + CNRCGEPAA+SIFSNFSLLL+EEPI+E ET++MGI++G++K 
Sbjct: 146  KKHKLRLDGIIRTVIALPCNRCGEPAAESIFSNFSLLLSEEPIKEPETLDMGIMFGEDKF 205

Query: 918  RGLGNTEVXXXXXXXXXXXXXXLYFPPEEKEIDISKHIRDLVHVEITIDELCDPKCKGLC 1097
            +   N E               LYFP EEK IDISK IRDLVH+EITI+ +CDPKCKGLC
Sbjct: 206  KSFVNMEEEMEENDGWIPLEDQLYFPGEEKMIDISKQIRDLVHIEITINAVCDPKCKGLC 265

Query: 1098 LGCGMNLNVDSCQCRVEEMDRKSYGPLGNLRKQILQ 1205
            L CG NLNV  C C +++++ K YGPLG L+KQ+ Q
Sbjct: 266  LKCGANLNVSRCNCHMQKVEEKGYGPLGGLKKQMQQ 301


>ref|XP_006437510.1| hypothetical protein CICLE_v10032158mg [Citrus clementina]
            gi|557539706|gb|ESR50750.1| hypothetical protein
            CICLE_v10032158mg [Citrus clementina]
          Length = 315

 Score =  320 bits (820), Expect = 1e-84
 Identities = 155/260 (59%), Positives = 199/260 (76%)
 Frame = +3

Query: 420  ALGLSAGEDISDFLADESLSSFDWDDHGQLEAEEDNESPWKEAVIYKRNPSLSHTEYCTT 599
            A+  +  +D   F  DE+  S+DW+D  Q + EED  SPW+ A+IYKRNPS++H EYCTT
Sbjct: 56   AISNAIAKDSKSFTEDET-ESYDWED--QEDVEEDAGSPWEGAIIYKRNPSITHLEYCTT 112

Query: 600  LESLGLANLSTDVSKSRASVMGLRVTKTVKDHPDGTPVLISLDVTRKKHKLRLDGIIRTV 779
            LE LGL  LST+VS+SRAS MGLRVTK VKD+P+GTPV IS+DVT+KK KLRLDGIIRTV
Sbjct: 113  LERLGLGKLSTEVSRSRASAMGLRVTKAVKDYPNGTPVQISIDVTKKKQKLRLDGIIRTV 172

Query: 780  ITVGCNRCGEPAAKSIFSNFSLLLTEEPIQELETINMGIIYGKEKSRGLGNTEVXXXXXX 959
            +T+GCNRCGEPAA+S+FS+FS+LL+E+PI+E E I++G+++G++KS+             
Sbjct: 173  LTLGCNRCGEPAAQSVFSDFSVLLSEQPIEEPEIIDIGMMFGEDKSKSSTGNGSEEEDDD 232

Query: 960  XXXXXXXXLYFPPEEKEIDISKHIRDLVHVEITIDELCDPKCKGLCLGCGMNLNVDSCQC 1139
                    LYFP EEKEIDISK+IRD+VH+EITI+ +CDP CKG+CL CG NLN  +C C
Sbjct: 233  ASIDWDDRLYFPLEEKEIDISKNIRDMVHLEITINVICDPSCKGICLKCGTNLNTSTCNC 292

Query: 1140 RVEEMDRKSYGPLGNLRKQI 1199
              EE+  K+YGPLGNLRKQ+
Sbjct: 293  SKEEVKGKTYGPLGNLRKQM 312


>ref|XP_006484658.1| PREDICTED: uncharacterized protein LOC102619910 isoform X2 [Citrus
            sinensis]
          Length = 315

 Score =  318 bits (816), Expect = 3e-84
 Identities = 154/260 (59%), Positives = 198/260 (76%)
 Frame = +3

Query: 420  ALGLSAGEDISDFLADESLSSFDWDDHGQLEAEEDNESPWKEAVIYKRNPSLSHTEYCTT 599
            A+  +  +D   F  DE+  S+DW+D  Q + EED  SPW+ A+IYKRNPS++H EYCTT
Sbjct: 56   AISNAIAKDSKSFTEDET-ESYDWED--QEDVEEDAGSPWEGAIIYKRNPSITHLEYCTT 112

Query: 600  LESLGLANLSTDVSKSRASVMGLRVTKTVKDHPDGTPVLISLDVTRKKHKLRLDGIIRTV 779
            LE LGL  LST+VS+SRAS MGLRVTK VKD+P+GTPV IS+DVT+KK KLRLDGIIRTV
Sbjct: 113  LERLGLGKLSTEVSRSRASAMGLRVTKAVKDYPNGTPVQISIDVTKKKQKLRLDGIIRTV 172

Query: 780  ITVGCNRCGEPAAKSIFSNFSLLLTEEPIQELETINMGIIYGKEKSRGLGNTEVXXXXXX 959
            +T+GCNRCGEPA +S+FS+FS+LL+E+PI+E E I++G+++G++KS+             
Sbjct: 173  LTLGCNRCGEPATQSVFSDFSVLLSEQPIEEPEIIDIGMMFGEDKSKSSTGNGSEEEDDD 232

Query: 960  XXXXXXXXLYFPPEEKEIDISKHIRDLVHVEITIDELCDPKCKGLCLGCGMNLNVDSCQC 1139
                    LYFP EEKEIDISK+IRD+VH+EITI+ +CDP CKG+CL CG NLN  +C C
Sbjct: 233  ASIDWDDRLYFPLEEKEIDISKNIRDMVHLEITINVICDPSCKGICLKCGTNLNTSTCNC 292

Query: 1140 RVEEMDRKSYGPLGNLRKQI 1199
              EE+  K+YGPLGNLRKQ+
Sbjct: 293  SKEEVKGKTYGPLGNLRKQM 312


>ref|XP_006484657.1| PREDICTED: uncharacterized protein LOC102619910 isoform X1 [Citrus
            sinensis]
          Length = 338

 Score =  318 bits (816), Expect = 3e-84
 Identities = 154/260 (59%), Positives = 198/260 (76%)
 Frame = +3

Query: 420  ALGLSAGEDISDFLADESLSSFDWDDHGQLEAEEDNESPWKEAVIYKRNPSLSHTEYCTT 599
            A+  +  +D   F  DE+  S+DW+D  Q + EED  SPW+ A+IYKRNPS++H EYCTT
Sbjct: 79   AISNAIAKDSKSFTEDET-ESYDWED--QEDVEEDAGSPWEGAIIYKRNPSITHLEYCTT 135

Query: 600  LESLGLANLSTDVSKSRASVMGLRVTKTVKDHPDGTPVLISLDVTRKKHKLRLDGIIRTV 779
            LE LGL  LST+VS+SRAS MGLRVTK VKD+P+GTPV IS+DVT+KK KLRLDGIIRTV
Sbjct: 136  LERLGLGKLSTEVSRSRASAMGLRVTKAVKDYPNGTPVQISIDVTKKKQKLRLDGIIRTV 195

Query: 780  ITVGCNRCGEPAAKSIFSNFSLLLTEEPIQELETINMGIIYGKEKSRGLGNTEVXXXXXX 959
            +T+GCNRCGEPA +S+FS+FS+LL+E+PI+E E I++G+++G++KS+             
Sbjct: 196  LTLGCNRCGEPATQSVFSDFSVLLSEQPIEEPEIIDIGMMFGEDKSKSSTGNGSEEEDDD 255

Query: 960  XXXXXXXXLYFPPEEKEIDISKHIRDLVHVEITIDELCDPKCKGLCLGCGMNLNVDSCQC 1139
                    LYFP EEKEIDISK+IRD+VH+EITI+ +CDP CKG+CL CG NLN  +C C
Sbjct: 256  ASIDWDDRLYFPLEEKEIDISKNIRDMVHLEITINVICDPSCKGICLKCGTNLNTSTCNC 315

Query: 1140 RVEEMDRKSYGPLGNLRKQI 1199
              EE+  K+YGPLGNLRKQ+
Sbjct: 316  SKEEVKGKTYGPLGNLRKQM 335


>ref|XP_004231527.1| PREDICTED: uncharacterized protein LOC101255042 [Solanum
            lycopersicum]
          Length = 298

 Score =  318 bits (816), Expect = 3e-84
 Identities = 155/241 (64%), Positives = 187/241 (77%)
 Frame = +3

Query: 483  FDWDDHGQLEAEEDNESPWKEAVIYKRNPSLSHTEYCTTLESLGLANLSTDVSKSRASVM 662
            FDW+D  + E E D +SPW+ AV+YKRN S++H +Y TTLE LGL  LST VSK RASVM
Sbjct: 58   FDWEDEYEDEYE-DEDSPWEGAVVYKRNSSVTHLDYYTTLERLGLGKLSTKVSKCRASVM 116

Query: 663  GLRVTKTVKDHPDGTPVLISLDVTRKKHKLRLDGIIRTVITVGCNRCGEPAAKSIFSNFS 842
            GLRVT+ VKD+PDGTPVLIS DVTR KHKLRLDGIIRTVI + CNRCGEPAA+SIFSNFS
Sbjct: 117  GLRVTRQVKDYPDGTPVLISFDVTRMKHKLRLDGIIRTVIALPCNRCGEPAAESIFSNFS 176

Query: 843  LLLTEEPIQELETINMGIIYGKEKSRGLGNTEVXXXXXXXXXXXXXXLYFPPEEKEIDIS 1022
            LLL+EEP++E ET++MGI++G +K +   N E               LYFP +EK IDIS
Sbjct: 177  LLLSEEPLKEAETLDMGIMFGDDKFKSFVNVEEEMEENDGWIPLEDQLYFPGDEKMIDIS 236

Query: 1023 KHIRDLVHVEITIDELCDPKCKGLCLGCGMNLNVDSCQCRVEEMDRKSYGPLGNLRKQIL 1202
            KHIRDLVH+EITI+ +CDPKCKGLCL CG NLNV+ C C +E+++ K YGPLG L+KQ+ 
Sbjct: 237  KHIRDLVHIEITINAVCDPKCKGLCLKCGANLNVNRCSCHMEKIEEKGYGPLGGLKKQMQ 296

Query: 1203 Q 1205
            Q
Sbjct: 297  Q 297


>gb|EOX92620.1| Uncharacterized protein TCM_001539 [Theobroma cacao]
          Length = 324

 Score =  318 bits (815), Expect = 4e-84
 Identities = 155/253 (61%), Positives = 192/253 (75%)
 Frame = +3

Query: 450  SDFLADESLSSFDWDDHGQLEAEEDNESPWKEAVIYKRNPSLSHTEYCTTLESLGLANLS 629
            S++  +E+  +FDW+D   +E   D  SPW+ AV+Y+RNPS++H EYCTTLE LGL  LS
Sbjct: 74   SEYFTEENTITFDWEDQEDIE---DIGSPWEGAVMYRRNPSITHLEYCTTLERLGLGKLS 130

Query: 630  TDVSKSRASVMGLRVTKTVKDHPDGTPVLISLDVTRKKHKLRLDGIIRTVITVGCNRCGE 809
            +D+SKSRASVMGLRVT+ VKD+P+GTPV IS+DVTRKK K+RLDGII+TVIT+GCNRCGE
Sbjct: 131  SDISKSRASVMGLRVTRAVKDYPNGTPVQISIDVTRKKQKMRLDGIIKTVITLGCNRCGE 190

Query: 810  PAAKSIFSNFSLLLTEEPIQELETINMGIIYGKEKSRGLGNTEVXXXXXXXXXXXXXXLY 989
            PAA+ IFSNFS+LL+EEPI+E E I+MG  + +      G+ +               LY
Sbjct: 191  PAAEGIFSNFSVLLSEEPIEEPEIIDMGATFEEGFKSVYGSNQEVEEDDDASIDWDDRLY 250

Query: 990  FPPEEKEIDISKHIRDLVHVEITIDELCDPKCKGLCLGCGMNLNVDSCQCRVEEMDRKSY 1169
            FPPEEKEIDISKHIRD+VH+EITI+ +CDP+CKG+CL CG NLN  SC C  EE+  K Y
Sbjct: 251  FPPEEKEIDISKHIRDMVHLEITINAVCDPRCKGICLKCGTNLNTSSCNCS-EEIKEKGY 309

Query: 1170 GPLGNLRKQILQK 1208
            GPLGNL KQI QK
Sbjct: 310  GPLGNLGKQIQQK 322


>ref|XP_004307383.1| PREDICTED: uncharacterized protein LOC101294601 [Fragaria vesca
            subsp. vesca]
          Length = 324

 Score =  310 bits (795), Expect = 9e-82
 Identities = 161/287 (56%), Positives = 200/287 (69%), Gaps = 5/287 (1%)
 Frame = +3

Query: 363  YLSSLDANLRADFRKRPRSALG---LSAGEDISDFLADESLSSFDWDDHGQLEAEEDNES 533
            YLS +  N+    R++P   L    ++  +     + DE     D  D G  E  ED +S
Sbjct: 43   YLSCITENIHTVLRRKPNDILMSTVMNCTKPNFQSITDEDTVFIDLGDQGN-EDGEDIDS 101

Query: 534  PWKEAVIYKRNPSLSHTEYCTTLESLGLANLSTDVSKSRASVMGLRVTKTVKDHPDGTPV 713
            PW+ AV+YKRN S++H EYCTTLE LGL NLST VSKSRASVMGLRVTK VKD+P+GTPV
Sbjct: 102  PWEGAVVYKRNASITHVEYCTTLERLGLGNLSTTVSKSRASVMGLRVTKAVKDYPNGTPV 161

Query: 714  LISLDVTRKKHKLRLDGIIRTVITVGCNRCGEPAAKSIFSNFSLLLTEEPIQELETINMG 893
             IS+D+TR+K KLRLDGII+TVIT+ CNRCG+PAA+SIFSNFSLLLT+EPI+E + INMG
Sbjct: 162  QISIDITRRKQKLRLDGIIKTVITLTCNRCGDPAAESIFSNFSLLLTDEPIEEPDIINMG 221

Query: 894  IIYG--KEKSRGLGNTEVXXXXXXXXXXXXXXLYFPPEEKEIDISKHIRDLVHVEITIDE 1067
            +IYG   +   G G  E               LYF PE+KEIDISKHIRD VH+EITI  
Sbjct: 222  VIYGDNAKTHTGFGGEE----NEDDSIDFEDQLYFRPEDKEIDISKHIRDSVHLEITISA 277

Query: 1068 LCDPKCKGLCLGCGMNLNVDSCQCRVEEMDRKSYGPLGNLRKQILQK 1208
             C+P CKGLCL CG NLN  +C C  +E+ + ++GPLGNL+KQ+ +K
Sbjct: 278  TCNPNCKGLCLNCGKNLNTSNCICGKQEVKKTTFGPLGNLKKQMQKK 324


>ref|XP_006406456.1| hypothetical protein EUTSA_v10021111mg [Eutrema salsugineum]
            gi|557107602|gb|ESQ47909.1| hypothetical protein
            EUTSA_v10021111mg [Eutrema salsugineum]
          Length = 331

 Score =  302 bits (774), Expect = 2e-79
 Identities = 153/255 (60%), Positives = 184/255 (72%), Gaps = 6/255 (2%)
 Frame = +3

Query: 465  DESLSSFDWDDHGQLEAEEDNESPWKEAVIYKRNPSLSHTEYCTTLESLGLANLSTDVSK 644
            + S    DW+D   +E   D  SPW+ +V+Y+RN S++H EYCTTLE LGL  LST VSK
Sbjct: 77   ENSTIDIDWEDEEDIE---DTGSPWEGSVMYRRNASVTHVEYCTTLERLGLGRLSTQVSK 133

Query: 645  SRASVMGLRVTKTVKDHPDGTPVLISLDVTRKKHKLRLDGIIRTVITVGCNRCGEPAAKS 824
             RAS MGLRVTK VKD+PDGTPV +S+DV RKK KLRLDGI+RTVIT+GCNRCGEPA +S
Sbjct: 134  KRASAMGLRVTKDVKDYPDGTPVQVSVDVIRKKKKLRLDGIVRTVITLGCNRCGEPAGES 193

Query: 825  IFSNFSLLLTEEPIQELETINMGIIYGKEKSR---GLGNTE---VXXXXXXXXXXXXXXL 986
            IFSNFSLLLTEEP++E + I++G  +GK+K+    GL N E                  L
Sbjct: 194  IFSNFSLLLTEEPVEEPDVIDLGFTFGKDKANSFSGLSNDEEDNADDDDDDSLIDWEDKL 253

Query: 987  YFPPEEKEIDISKHIRDLVHVEITIDELCDPKCKGLCLGCGMNLNVDSCQCRVEEMDRKS 1166
            +FPPE KEIDISKHIRDLVH+EITI+ +CD  CKG+CL CG NLN   C C  EE D K 
Sbjct: 254  HFPPEVKEIDISKHIRDLVHLEITINAICDAACKGMCLKCGANLNKRKCDCGREEKD-KG 312

Query: 1167 YGPLGNLRKQILQKK 1211
            YGPLGNLRKQ+ +K+
Sbjct: 313  YGPLGNLRKQMQEKE 327


>gb|EXB61829.1| hypothetical protein L484_012263 [Morus notabilis]
          Length = 281

 Score =  300 bits (768), Expect = 1e-78
 Identities = 157/278 (56%), Positives = 194/278 (69%), Gaps = 6/278 (2%)
 Frame = +3

Query: 384  NLRADFRKRPRSAL---GLSAGEDISDFLADESLSSFDWDDHGQLEAEEDNESPWKEAVI 554
            +++  FRK+  + L    L   +   D    E+  S D++D  Q + EED   PW+ AVI
Sbjct: 4    SIQTIFRKKVPNVLRSTALDCTKHDYDHSNSENTVSLDFED--QEKEEEDTGCPWEGAVI 61

Query: 555  YKRNPSLSHTEYCTTLESLGLANLSTDVSKSRASVMGLRVTKTVKDHPDGTPVLISLDVT 734
            YKRN S+SH EYCTTLE LGL +LST++SKSRAS MGLRVTK VKD+P GTPV +S+DV 
Sbjct: 62   YKRNSSISHIEYCTTLERLGLGSLSTELSKSRASAMGLRVTKAVKDYPFGTPVQVSVDVM 121

Query: 735  RKKHKLRLDGIIRTVITVGCNRCGEPAAKSIFSNFSLLLTEEPIQELETINMGIIYGKEK 914
            RKK KLRLDGI++TVIT+GCN CG PAA+SIFS+FSLLLTEEP++E + IN+G I+G  K
Sbjct: 122  RKKQKLRLDGIVKTVITLGCNSCGGPAAQSIFSDFSLLLTEEPVEEPDIINLGTIHGDNK 181

Query: 915  SR---GLGNTEVXXXXXXXXXXXXXXLYFPPEEKEIDISKHIRDLVHVEITIDELCDPKC 1085
            SR   GLG+                 LYFPP EKEIDISKHIRDLVH+EI I  +CDP C
Sbjct: 182  SRPYSGLGDD--GEEDDDASIDFEDRLYFPPGEKEIDISKHIRDLVHLEINIKAICDPNC 239

Query: 1086 KGLCLGCGMNLNVDSCQCRVEEMDRKSYGPLGNLRKQI 1199
            KG C  CG NLN   C C  +E+ + SYGPLGNL++Q+
Sbjct: 240  KGFCFKCGANLNTSRCTCSKQEVKKSSYGPLGNLKQQM 277


>ref|NP_566649.1| uncharacterized protein [Arabidopsis thaliana]
            gi|11994192|dbj|BAB01295.1| unnamed protein product
            [Arabidopsis thaliana] gi|21593774|gb|AAM65741.1| unknown
            [Arabidopsis thaliana] gi|109946589|gb|ABG48473.1|
            At3g19810 [Arabidopsis thaliana]
            gi|110742135|dbj|BAE98996.1| hypothetical protein
            [Arabidopsis thaliana] gi|332642771|gb|AEE76292.1|
            uncharacterized protein AT3G19810 [Arabidopsis thaliana]
          Length = 321

 Score =  297 bits (761), Expect = 8e-78
 Identities = 149/249 (59%), Positives = 179/249 (71%)
 Frame = +3

Query: 465  DESLSSFDWDDHGQLEAEEDNESPWKEAVIYKRNPSLSHTEYCTTLESLGLANLSTDVSK 644
            + S    DW+D  ++E   D  SPW+ +V+Y+RN S++H EYCTTLE LGL  LSTDVSK
Sbjct: 77   ETSTIDMDWEDQEEIE---DTGSPWEGSVMYRRNASVTHVEYCTTLERLGLGRLSTDVSK 133

Query: 645  SRASVMGLRVTKTVKDHPDGTPVLISLDVTRKKHKLRLDGIIRTVITVGCNRCGEPAAKS 824
             RAS MGLRVTK VKD+PDGTPV +S+DV RKK KLRLDGI+RTVIT+GCNRCGE   +S
Sbjct: 134  KRASAMGLRVTKDVKDYPDGTPVQVSVDVIRKKKKLRLDGIVRTVITLGCNRCGESTGES 193

Query: 825  IFSNFSLLLTEEPIQELETINMGIIYGKEKSRGLGNTEVXXXXXXXXXXXXXXLYFPPEE 1004
            IFSNFSLLLTEEP++E + I++G  +G +K  G    E               L+FPPE 
Sbjct: 194  IFSNFSLLLTEEPVEEPDVIDLGFTFGNDKEEG----EDDDDNDDSWIDWEDKLHFPPEV 249

Query: 1005 KEIDISKHIRDLVHVEITIDELCDPKCKGLCLGCGMNLNVDSCQCRVEEMDRKSYGPLGN 1184
            KEIDISKHIRDLVH+EITI  +CD  CKG+CL CG NLN   C C  EE D K YGPLGN
Sbjct: 250  KEIDISKHIRDLVHLEITITAICDSACKGMCLKCGANLNKRKCDCGREEKD-KGYGPLGN 308

Query: 1185 LRKQILQKK 1211
            LR+Q+ QK+
Sbjct: 309  LREQMQQKE 317


>ref|XP_003629796.1| hypothetical protein MTR_8g086630 [Medicago truncatula]
            gi|355523818|gb|AET04272.1| hypothetical protein
            MTR_8g086630 [Medicago truncatula]
          Length = 317

 Score =  296 bits (757), Expect = 2e-77
 Identities = 146/250 (58%), Positives = 184/250 (73%), Gaps = 1/250 (0%)
 Frame = +3

Query: 453  DFLADESLSSFDWDDHGQLEAEEDNESPWKEAVIYKRNPSLSHTEYCTTLESLGLANLST 632
            D   +E  +SFDW D  + E +ED   PW+ AVIYKRN S+ H EYCTTLE LGL NLST
Sbjct: 68   DLYTEEGTTSFDWGDEEEEEIDEDEGLPWEGAVIYKRNASILHLEYCTTLERLGLGNLST 127

Query: 633  DVSKSRASVMGLRVTKTVKDHPDGTPVLISLDVTRKKHKLRLDGIIRTVITVGCNRCGEP 812
            DVSK++ASVMGLR+TK VKD P+GTP+ IS+DVTRKK KLRLDGII+TV+T+ CNRC  P
Sbjct: 128  DVSKNKASVMGLRITKAVKDFPNGTPIQISIDVTRKKKKLRLDGIIKTVLTLVCNRCCMP 187

Query: 813  AAKSIFSNFSLLLTEE-PIQELETINMGIIYGKEKSRGLGNTEVXXXXXXXXXXXXXXLY 989
            +A+SIFS FSLLLTEE P+ E ET++ G+I+G++K   LG +                LY
Sbjct: 188  SAESIFSEFSLLLTEEPPVNEPETMDFGVIFGEDKIPTLGKS--GDDDEDALIDLDDQLY 245

Query: 990  FPPEEKEIDISKHIRDLVHVEITIDELCDPKCKGLCLGCGMNLNVDSCQCRVEEMDRKSY 1169
            FPPEEK+IDISK+IRD VH+EIT++ +CD  CKG+CL CG N N  +C C  EE+  +S+
Sbjct: 246  FPPEEKQIDISKNIRDRVHLEITMNSVCDSGCKGVCLKCGQNFNTGNCSCSKEEVKEESF 305

Query: 1170 GPLGNLRKQI 1199
            GPL NLR+Q+
Sbjct: 306  GPLRNLREQM 315


>ref|XP_002315057.1| hypothetical protein POPTR_0010s17720g [Populus trichocarpa]
            gi|566191354|ref|XP_006378596.1| hypothetical protein
            POPTR_0010s17720g [Populus trichocarpa]
            gi|566191357|ref|XP_006378597.1| hypothetical protein
            POPTR_0010s17720g [Populus trichocarpa]
            gi|222864097|gb|EEF01228.1| hypothetical protein
            POPTR_0010s17720g [Populus trichocarpa]
            gi|550330028|gb|ERP56393.1| hypothetical protein
            POPTR_0010s17720g [Populus trichocarpa]
            gi|550330029|gb|ERP56394.1| hypothetical protein
            POPTR_0010s17720g [Populus trichocarpa]
          Length = 322

 Score =  296 bits (757), Expect = 2e-77
 Identities = 148/243 (60%), Positives = 181/243 (74%)
 Frame = +3

Query: 480  SFDWDDHGQLEAEEDNESPWKEAVIYKRNPSLSHTEYCTTLESLGLANLSTDVSKSRASV 659
            S +WDD  + +AE D ESPW+ A+IYKRN S+SH EYCTTLE LGL  LST++SKSRASV
Sbjct: 82   SLNWDDQEEEDAE-DMESPWEGAIIYKRNSSISHVEYCTTLERLGLGKLSTEISKSRASV 140

Query: 660  MGLRVTKTVKDHPDGTPVLISLDVTRKKHKLRLDGIIRTVITVGCNRCGEPAAKSIFSNF 839
            MGLRVTK VKD+P GTPV IS+DVT+KK +LRLDGII+TVIT+GC RCGEP A+ IFSNF
Sbjct: 141  MGLRVTKAVKDYPLGTPVQISIDVTKKKKRLRLDGIIKTVITLGCYRCGEPVAEGIFSNF 200

Query: 840  SLLLTEEPIQELETINMGIIYGKEKSRGLGNTEVXXXXXXXXXXXXXXLYFPPEEKEIDI 1019
            SLLL+EEP+ E E INMG ++G +K +     E               L+FPPE+KEIDI
Sbjct: 201  SLLLSEEPVAEPEIINMGKVFGNDKLKSSIFEE--EDGDEASIEWDDRLHFPPEDKEIDI 258

Query: 1020 SKHIRDLVHVEITIDELCDPKCKGLCLGCGMNLNVDSCQCRVEEMDRKSYGPLGNLRKQI 1199
            SK +RD+VHVEIT+D +CDP CKGLCL CG NLN  SC C  E+   +  GPL +L+KQ+
Sbjct: 259  SKPLRDMVHVEITLDVICDPSCKGLCLECGTNLNKSSCNCSKEKEKERGPGPLKDLKKQM 318

Query: 1200 LQK 1208
            L +
Sbjct: 319  LSE 321


>gb|EMJ20588.1| hypothetical protein PRUPE_ppa020238mg, partial [Prunus persica]
          Length = 241

 Score =  294 bits (752), Expect = 8e-77
 Identities = 146/232 (62%), Positives = 175/232 (75%)
 Frame = +3

Query: 516  EEDNESPWKEAVIYKRNPSLSHTEYCTTLESLGLANLSTDVSKSRASVMGLRVTKTVKDH 695
            E++  SPW+ AVIYKRN S+SH EYCTTLE LGL NLST+VSKS+ASVMGLRVTK VKD+
Sbjct: 15   EDETGSPWEGAVIYKRNTSISHVEYCTTLERLGLGNLSTEVSKSKASVMGLRVTKAVKDY 74

Query: 696  PDGTPVLISLDVTRKKHKLRLDGIIRTVITVGCNRCGEPAAKSIFSNFSLLLTEEPIQEL 875
            P GTPV IS+D+TRKK KLRLDGII+TVI + C+RC +PAA+ IFSNFSLLLT+EPI+E 
Sbjct: 75   PQGTPVQISIDITRKKQKLRLDGIIKTVIALTCSRCEDPAAECIFSNFSLLLTDEPIEEP 134

Query: 876  ETINMGIIYGKEKSRGLGNTEVXXXXXXXXXXXXXXLYFPPEEKEIDISKHIRDLVHVEI 1055
            E INMG+IYG     G G  +               LYF P +KEIDISKHIRD+VH+EI
Sbjct: 135  EIINMGVIYGDTGISGQGEED-----DEGTIDFEDQLYFRPGDKEIDISKHIRDMVHLEI 189

Query: 1056 TIDELCDPKCKGLCLGCGMNLNVDSCQCRVEEMDRKSYGPLGNLRKQILQKK 1211
            TI   C+P CKGLCL CG NLN  SC C  ++  +K +GPLGNL+KQ+ Q+K
Sbjct: 190  TITATCNPSCKGLCLSCGKNLNTGSCNCS-KQQAKKGFGPLGNLKKQLQQQK 240


>ref|XP_002885329.1| hypothetical protein ARALYDRAFT_479498 [Arabidopsis lyrata subsp.
            lyrata] gi|297331169|gb|EFH61588.1| hypothetical protein
            ARALYDRAFT_479498 [Arabidopsis lyrata subsp. lyrata]
          Length = 317

 Score =  293 bits (749), Expect = 2e-76
 Identities = 146/249 (58%), Positives = 179/249 (71%)
 Frame = +3

Query: 465  DESLSSFDWDDHGQLEAEEDNESPWKEAVIYKRNPSLSHTEYCTTLESLGLANLSTDVSK 644
            + S    DW+D  ++E   D  SPW+ +V+Y+RN S +H EYCTTLE LGL  LST+VSK
Sbjct: 74   ETSTIDMDWEDQEEIE---DTGSPWEGSVMYRRNASATHVEYCTTLERLGLGRLSTEVSK 130

Query: 645  SRASVMGLRVTKTVKDHPDGTPVLISLDVTRKKHKLRLDGIIRTVITVGCNRCGEPAAKS 824
             RAS MGLRVTK VKD+PDGTPV +S+DV RKK KLRLDGI+RTVIT+GCNRCGE   +S
Sbjct: 131  KRASAMGLRVTKDVKDYPDGTPVQVSVDVIRKKKKLRLDGIVRTVITLGCNRCGESTGES 190

Query: 825  IFSNFSLLLTEEPIQELETINMGIIYGKEKSRGLGNTEVXXXXXXXXXXXXXXLYFPPEE 1004
            IFSNFSLLLTE+P++E + I++G  +G +K  G  + +               L+FPPE 
Sbjct: 191  IFSNFSLLLTEDPVEEPDVIDLGFTFGGDKEEGEDDDD-----DDSWIDWEDTLHFPPEV 245

Query: 1005 KEIDISKHIRDLVHVEITIDELCDPKCKGLCLGCGMNLNVDSCQCRVEEMDRKSYGPLGN 1184
            KEIDISKHIRDLVH+EITI  +CD  CKG+CL CG NLN   C C  EE D K YGPLGN
Sbjct: 246  KEIDISKHIRDLVHLEITITAICDSACKGMCLKCGANLNKRKCDCGREEKD-KGYGPLGN 304

Query: 1185 LRKQILQKK 1211
            LR+Q+ QK+
Sbjct: 305  LREQMQQKE 313


>ref|XP_006298144.1| hypothetical protein CARUB_v10014192mg [Capsella rubella]
            gi|482566853|gb|EOA31042.1| hypothetical protein
            CARUB_v10014192mg [Capsella rubella]
          Length = 323

 Score =  292 bits (748), Expect = 2e-76
 Identities = 145/253 (57%), Positives = 182/253 (71%), Gaps = 4/253 (1%)
 Frame = +3

Query: 465  DESLSSFDWDDHGQLEAEEDNESPWKEAVIYKRNPSLSHTEYCTTLESLGLANLSTDVSK 644
            + S    DW+D  ++E   D  SPW+ +V+Y+RN S++H EYCTTLE LGL  LST+VSK
Sbjct: 71   ETSTIDMDWEDQEEIE---DIGSPWEGSVMYRRNASVTHVEYCTTLERLGLGRLSTEVSK 127

Query: 645  SRASVMGLRVTKTVKDHPDGTPVLISLDVTRKKHKLRLDGIIRTVITVGCNRCGEPAAKS 824
             RAS MGLRVTK VKD+PDGTPV I++DV RKK KLRLDGI++TVIT+GCNRCGE   +S
Sbjct: 128  KRASAMGLRVTKDVKDYPDGTPVQIAVDVIRKKKKLRLDGIVKTVITLGCNRCGESTGES 187

Query: 825  IFSNFSLLLTEEPIQELETINMGIIYGKEKSRGLG----NTEVXXXXXXXXXXXXXXLYF 992
            IFSNFSLLLTE+P++E + I++G  +G +K+        + E               L+F
Sbjct: 188  IFSNFSLLLTEDPVEEPDVIDLGFTFGSDKANSFSGLSDDKEETEDDDDSWIDWEDKLHF 247

Query: 993  PPEEKEIDISKHIRDLVHVEITIDELCDPKCKGLCLGCGMNLNVDSCQCRVEEMDRKSYG 1172
            PPE KEIDISKHIRDLVH+EITI  +CDP CKG+CL CG NLN   C+C  EE D K YG
Sbjct: 248  PPEAKEIDISKHIRDLVHLEITITAICDPGCKGMCLKCGANLNKRKCECGREEKD-KGYG 306

Query: 1173 PLGNLRKQILQKK 1211
            PLGNLR+++ QK+
Sbjct: 307  PLGNLREKMQQKE 319


>ref|XP_002528748.1| conserved hypothetical protein [Ricinus communis]
            gi|223531842|gb|EEF33660.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 313

 Score =  290 bits (743), Expect = 9e-76
 Identities = 150/246 (60%), Positives = 178/246 (72%), Gaps = 3/246 (1%)
 Frame = +3

Query: 480  SFDWDDHGQLEAEEDNESPWKEAVIYKRNPSLSHTEYCTTLESLGLANLSTDVSKSRASV 659
            S  WDD  + E  ED ESPW+ A+IYKRNPS+SH EYCTTLE LGL  +ST+VSKSRASV
Sbjct: 70   SLGWDDLEE-ENPEDMESPWEGAIIYKRNPSVSHIEYCTTLERLGLGKVSTEVSKSRASV 128

Query: 660  MGLRVTKTVKDHPDGTPVLISLDVTRKKHKLRLDGIIRTVITVGCNRCGEPAAKSIFSNF 839
            MGLRVTK VKD P GTPV IS+DVTRKK KLRLDGII+TV+T+ CNRCG P A SI+SNF
Sbjct: 129  MGLRVTKAVKDFPLGTPVQISIDVTRKKQKLRLDGIIKTVLTLTCNRCGVPTAGSIYSNF 188

Query: 840  SLLLTEEPIQELETINMGIIYGKEK---SRGLGNTEVXXXXXXXXXXXXXXLYFPPEEKE 1010
            SLLL+EE I+E E ++MG+I+G++K   S   G  E                YFPPEEKE
Sbjct: 189  SLLLSEEQIEEPEIVDMGMIFGEDKFESSAASGYEE--EDDDDASIDWDDRFYFPPEEKE 246

Query: 1011 IDISKHIRDLVHVEITIDELCDPKCKGLCLGCGMNLNVDSCQCRVEEMDRKSYGPLGNLR 1190
            IDISK+IRDLVH+EI  + +CD  CKG+CL CG NLN  SC C  E+   K YGPL +L+
Sbjct: 247  IDISKNIRDLVHIEIADNAICDASCKGVCLNCGTNLNTSSCSCSKEKNKEKGYGPLKDLK 306

Query: 1191 KQILQK 1208
            KQ+  K
Sbjct: 307  KQMQPK 312


>ref|XP_003524967.1| PREDICTED: uncharacterized protein LOC100792185 isoform X1 [Glycine
            max] gi|571455764|ref|XP_006580175.1| PREDICTED:
            uncharacterized protein LOC100792185 isoform X2 [Glycine
            max]
          Length = 318

 Score =  290 bits (742), Expect = 1e-75
 Identities = 149/256 (58%), Positives = 187/256 (73%)
 Frame = +3

Query: 432  SAGEDISDFLADESLSSFDWDDHGQLEAEEDNESPWKEAVIYKRNPSLSHTEYCTTLESL 611
            S G D    + DESL    WDD    E  ED  SPW+ AVIYKRN ++ H EYCTTLE L
Sbjct: 69   SKGHDFES-INDESLG---WDDD---EEVEDMGSPWEGAVIYKRNATILHLEYCTTLERL 121

Query: 612  GLANLSTDVSKSRASVMGLRVTKTVKDHPDGTPVLISLDVTRKKHKLRLDGIIRTVITVG 791
            GLA LS+DVSK+RA+ MGLRVTK VKD P+GTPV IS+DVTRKK KLRLDGII+TVIT+ 
Sbjct: 122  GLAKLSSDVSKTRAAAMGLRVTKAVKDFPNGTPVQISIDVTRKKKKLRLDGIIKTVITLL 181

Query: 792  CNRCGEPAAKSIFSNFSLLLTEEPIQELETINMGIIYGKEKSRGLGNTEVXXXXXXXXXX 971
            CNRC  P+A+SIFS FSLLLT+EPI+E ETI+MG+I+G++K    GN+            
Sbjct: 182  CNRCCAPSAESIFSEFSLLLTDEPIEEPETIDMGVIFGEDKLTTSGNSG-EDDDDDALID 240

Query: 972  XXXXLYFPPEEKEIDISKHIRDLVHVEITIDELCDPKCKGLCLGCGMNLNVDSCQCRVEE 1151
                LYFPP++++IDISK+IRD VH+EIT++ +C P CKG+CL CG N N  +C C  EE
Sbjct: 241  MDDQLYFPPQQRQIDISKNIRDRVHLEITMNSVCGPGCKGMCLKCGQNFNTGNCNCSKEE 300

Query: 1152 MDRKSYGPLGNLRKQI 1199
            +  KS+GPLGNL++++
Sbjct: 301  VQEKSFGPLGNLKEKM 316


>gb|AGV54555.1| hypothetical protein [Phaseolus vulgaris]
          Length = 315

 Score =  288 bits (736), Expect = 6e-75
 Identities = 144/240 (60%), Positives = 180/240 (75%)
 Frame = +3

Query: 480  SFDWDDHGQLEAEEDNESPWKEAVIYKRNPSLSHTEYCTTLESLGLANLSTDVSKSRASV 659
            S D +  G  + E D  SPW+ AV+YKRN S+ H EYCTTLE LGLA LSTDVSK+RA+ 
Sbjct: 75   SIDNESLGFDDDEVDTGSPWEGAVVYKRNASILHLEYCTTLERLGLAKLSTDVSKTRAAA 134

Query: 660  MGLRVTKTVKDHPDGTPVLISLDVTRKKHKLRLDGIIRTVITVGCNRCGEPAAKSIFSNF 839
            MGLRVTK V++ P+GTPV IS+DVTRKK KLRLDGII+TVIT+ CNRC  P+A+SIFS F
Sbjct: 135  MGLRVTKAVREFPNGTPVQISIDVTRKKKKLRLDGIIKTVITLLCNRCCMPSAESIFSEF 194

Query: 840  SLLLTEEPIQELETINMGIIYGKEKSRGLGNTEVXXXXXXXXXXXXXXLYFPPEEKEIDI 1019
            SLLLTEEPI+E ETI+MG+I+G++K    GN+                LYFP +EK+IDI
Sbjct: 195  SLLLTEEPIEEPETIDMGVIFGEDKLTTSGNSG-QDDDEDALIDLEDQLYFPSQEKQIDI 253

Query: 1020 SKHIRDLVHVEITIDELCDPKCKGLCLGCGMNLNVDSCQCRVEEMDRKSYGPLGNLRKQI 1199
            SK+IRD VH+EIT++ +CDP CKG+CL CG N N  +C C  EE+  KSYGPLGNL++++
Sbjct: 254  SKNIRDRVHLEITMNSVCDPGCKGMCLKCGQNFNTGNCMCSNEEVKEKSYGPLGNLKEKM 313


>gb|ESW31471.1| hypothetical protein PHAVU_002G240600g [Phaseolus vulgaris]
          Length = 315

 Score =  286 bits (733), Expect = 1e-74
 Identities = 144/240 (60%), Positives = 179/240 (74%)
 Frame = +3

Query: 480  SFDWDDHGQLEAEEDNESPWKEAVIYKRNPSLSHTEYCTTLESLGLANLSTDVSKSRASV 659
            S D +  G  + E D  SPW+ AV+YKRN S+ H EYCTTLE LGLA LSTDVSK+RA+ 
Sbjct: 75   SIDNESLGFDDDEVDTGSPWEGAVVYKRNASILHLEYCTTLERLGLAKLSTDVSKTRAAA 134

Query: 660  MGLRVTKTVKDHPDGTPVLISLDVTRKKHKLRLDGIIRTVITVGCNRCGEPAAKSIFSNF 839
            MGLRVTK V++ P+GTPV IS+DVTRKK KLRLDGII+TVIT+ CNRC  P+A+SIFS F
Sbjct: 135  MGLRVTKAVREFPNGTPVQISIDVTRKKKKLRLDGIIKTVITLLCNRCCMPSAESIFSEF 194

Query: 840  SLLLTEEPIQELETINMGIIYGKEKSRGLGNTEVXXXXXXXXXXXXXXLYFPPEEKEIDI 1019
            SLLLTEEPI+E ETI+MG+I+G++K    GN                 LYFP +EK+IDI
Sbjct: 195  SLLLTEEPIEEPETIDMGVIFGEDKLTTSGNGG-QDDDEDALIDLEDQLYFPSQEKQIDI 253

Query: 1020 SKHIRDLVHVEITIDELCDPKCKGLCLGCGMNLNVDSCQCRVEEMDRKSYGPLGNLRKQI 1199
            SK+IRD VH+EIT++ +CDP CKG+CL CG N N  +C C  EE+  KSYGPLGNL++++
Sbjct: 254  SKNIRDRVHLEITMNSVCDPGCKGMCLKCGQNFNTGNCMCSNEEVKEKSYGPLGNLKEKM 313

