BLASTX nr result

ID: Catharanthus22_contig00012022 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00012022
         (1124 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EXB74572.1| hypothetical protein L484_026269 [Morus notabilis]     255   2e-65
ref|XP_002515974.1| conserved hypothetical protein [Ricinus comm...   251   3e-64
gb|EOY26911.1| Sequence-specific DNA binding, putative isoform 3...   250   8e-64
ref|XP_006426887.1| hypothetical protein CICLE_v10026320mg [Citr...   248   2e-63
ref|XP_006343045.1| PREDICTED: protein SAWADEE HOMEODOMAIN HOMOL...   248   3e-63
ref|XP_004235649.1| PREDICTED: uncharacterized protein LOC101256...   242   2e-61
ref|XP_002283948.1| PREDICTED: uncharacterized protein LOC100258...   240   7e-61
ref|XP_006426886.1| hypothetical protein CICLE_v10026320mg [Citr...   235   3e-59
ref|XP_002299736.1| hypothetical protein POPTR_0001s19000g [Popu...   233   8e-59
gb|EOY26909.1| F9L1.16, putative isoform 1 [Theobroma cacao]          231   5e-58
ref|NP_849666.2| uncharacterized protein [Arabidopsis thaliana] ...   229   2e-57
emb|CAN77675.1| hypothetical protein VITISV_013721 [Vitis vinifera]   228   3e-57
ref|XP_004155951.1| PREDICTED: uncharacterized protein LOC101230...   228   4e-57
ref|XP_004141662.1| PREDICTED: uncharacterized protein LOC101213...   225   2e-56
ref|XP_004287547.1| PREDICTED: uncharacterized protein LOC101293...   224   4e-56
ref|XP_004510301.1| PREDICTED: uncharacterized protein LOC101512...   223   8e-56
ref|NP_001031048.1| uncharacterized protein [Arabidopsis thalian...   219   1e-54
gb|AFK39040.1| unknown [Medicago truncatula]                          219   2e-54
ref|XP_002890094.1| hypothetical protein ARALYDRAFT_334813 [Arab...   215   2e-53
ref|XP_003529753.1| PREDICTED: protein SAWADEE HOMEODOMAIN HOMOL...   213   9e-53

>gb|EXB74572.1| hypothetical protein L484_026269 [Morus notabilis]
          Length = 259

 Score =  255 bits (652), Expect = 2e-65
 Identities = 130/249 (52%), Positives = 168/249 (67%), Gaps = 1/249 (0%)
 Frame = -1

Query: 1022 RLEDDFESEFTLAEIVEMESLYKQMGEKSADQQFCEELATKFSSSVYRHGKSFIKWQQVH 843
            ++  +  SEFTLAEI+EME++YK++ E+S  Q+FC++LA  FS S  R GKS I W+QV 
Sbjct: 7    KISRNSSSEFTLAEILEMENIYKEVEEQSLGQEFCQDLAMSFSGSSTRAGKSTITWEQVQ 66

Query: 842  SWFHDTKKELEAK-ISSLHQRPAVLAIKSAPGXXXXXXXXXXXXXXXXKAQKPAAETPRR 666
            +WF D  K+L  +  SS   +   L  +SA                  K+ +     P  
Sbjct: 67   NWFEDKHKKLHPESTSSAVDKHKELNPESASFELVVHLSDSKTSSIVPKSSQTPEGRPSS 126

Query: 665  PIAERIAELSELAFEAKSAKDLAWYDVATFIHYRITSTAELEVRVRFAGFGKDQDEWINV 486
               E + +L ELA+EAKS+KD AWYDVA F+ YR  +T ELEVRVRF+GFGK++DEW+NV
Sbjct: 127  SHDEGMMDLHELAYEAKSSKDNAWYDVAAFLTYRFLNTGELEVRVRFSGFGKEEDEWVNV 186

Query: 485  RRGVRQRSIPLEPSECDKVKVGDLVLCFREDEHHAIYCDAFIEEIERTSHDNEVCKCIFV 306
            R GVR+RSIPLEPSECDKV VGDLVLCF+E EHHA+YCDA++  I+R  HD   C+CIFV
Sbjct: 187  RTGVRERSIPLEPSECDKVNVGDLVLCFQEREHHAVYCDAYVVNIQRRLHDLNGCRCIFV 246

Query: 305  VRYDLDDFE 279
            +RYD DD E
Sbjct: 247  IRYDDDDTE 255


>ref|XP_002515974.1| conserved hypothetical protein [Ricinus communis]
            gi|223544879|gb|EEF46394.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 285

 Score =  251 bits (642), Expect = 3e-64
 Identities = 128/256 (50%), Positives = 168/256 (65%), Gaps = 2/256 (0%)
 Frame = -1

Query: 1001 SEFTLAEIVEMESLYKQMGEKSADQQFCEELATKFSSSVYRHGKSFIKWQQVHSWFHDTK 822
            SEFTLAE+VEME++YK++GE+S D +FCE LAT FS +  R GK  I W+QV SWF D +
Sbjct: 49   SEFTLAEMVEMENIYKELGEESLDSEFCERLATSFSFTANRAGKPAITWEQVQSWFEDRQ 108

Query: 821  KELEAKISSLHQRPAVLAIKSAPGXXXXXXXXXXXXXXXXKAQKPAAETPR--RPIAERI 648
            KE   ++S     P+ L++K                       K +++ P   R    ++
Sbjct: 109  KESRPRVS-----PSPLSLK---------------LFVDLSNAKISSDAPESSRNSKGKV 148

Query: 647  AELSELAFEAKSAKDLAWYDVATFIHYRITSTAELEVRVRFAGFGKDQDEWINVRRGVRQ 468
             +LSEL FEA+S++D AWYDVA F++YR+ ST ELE RVRF+GF    DEW+NV+R VR+
Sbjct: 149  TDLSELIFEARSSRDNAWYDVAAFLNYRVLSTGELEARVRFSGFRNTDDEWVNVKRAVRE 208

Query: 467  RSIPLEPSECDKVKVGDLVLCFREDEHHAIYCDAFIEEIERTSHDNEVCKCIFVVRYDLD 288
            RSIPLEPSEC +VKVGDLVLCFRE    A+YCDA +  I+R  H+   C+CIFVVRYD D
Sbjct: 209  RSIPLEPSECHRVKVGDLVLCFRERFDQAVYCDAHVVGIQRRPHEAASCRCIFVVRYDHD 268

Query: 287  DFEDKVPPSRICIRPT 240
            + E+     R+C RPT
Sbjct: 269  NTEEAAQLERLCCRPT 284


>gb|EOY26911.1| Sequence-specific DNA binding, putative isoform 3 [Theobroma cacao]
          Length = 246

 Score =  250 bits (638), Expect = 8e-64
 Identities = 125/256 (48%), Positives = 172/256 (67%)
 Frame = -1

Query: 1010 DFESEFTLAEIVEMESLYKQMGEKSADQQFCEELATKFSSSVYRHGKSFIKWQQVHSWFH 831
            D  SEFTLAEI+EME++YK++GEK+ +++FC+ELAT FS S  R GKS + WQQV  WF 
Sbjct: 8    DSVSEFTLAEILEMENIYKEIGEKTLNKEFCQELATNFSCSSNRMGKSAVTWQQVQIWFQ 67

Query: 830  DTKKELEAKISSLHQRPAVLAIKSAPGXXXXXXXXXXXXXXXXKAQKPAAETPRRPIAER 651
            + + E ++K     QRP+ +A++                     + KP     R     +
Sbjct: 68   EKQMETQSK-----QRPSPMALE------------LFVDLSSANSSKPPGSLRRHK--GK 108

Query: 650  IAELSELAFEAKSAKDLAWYDVATFIHYRITSTAELEVRVRFAGFGKDQDEWINVRRGVR 471
            + +L EL+FEA+S+KD AWYDV +F+ YR+ ST ELEVRVRF+GF K +DEW+NV + VR
Sbjct: 109  VEDLKELSFEARSSKDYAWYDVDSFLTYRVLSTGELEVRVRFSGFAKTEDEWVNVEKAVR 168

Query: 470  QRSIPLEPSECDKVKVGDLVLCFREDEHHAIYCDAFIEEIERTSHDNEVCKCIFVVRYDL 291
            +RSIPLEPSEC+ VK+GDLVLC+++ EH+ +Y DA + +I+R  HD   C CIFVV YD 
Sbjct: 169  ERSIPLEPSECNIVKIGDLVLCYQDREHYQVYYDAHVVDIQRRVHDVRGCSCIFVVCYDH 228

Query: 290  DDFEDKVPPSRICIRP 243
            D  ++KVP  R+C RP
Sbjct: 229  DYSKEKVPLQRLCCRP 244


>ref|XP_006426887.1| hypothetical protein CICLE_v10026320mg [Citrus clementina]
            gi|568822531|ref|XP_006465684.1| PREDICTED: protein
            SAWADEE HOMEODOMAIN HOMOLOG 1-like [Citrus sinensis]
            gi|557528877|gb|ESR40127.1| hypothetical protein
            CICLE_v10026320mg [Citrus clementina]
          Length = 245

 Score =  248 bits (634), Expect = 2e-63
 Identities = 129/259 (49%), Positives = 167/259 (64%)
 Frame = -1

Query: 1016 EDDFESEFTLAEIVEMESLYKQMGEKSADQQFCEELATKFSSSVYRHGKSFIKWQQVHSW 837
            ++D   +FTLAEI EMES+YK++GE S  Q++C+ LAT FS S  R  +  I W QV SW
Sbjct: 3    DEDSWPDFTLAEIKEMESMYKEIGEASLTQEYCKALATSFSFSASRAARPAITWLQVQSW 62

Query: 836  FHDTKKELEAKISSLHQRPAVLAIKSAPGXXXXXXXXXXXXXXXXKAQKPAAETPRRPIA 657
            F D +K+ +AK  S  +   +                         + +P  E   +PI 
Sbjct: 63   FRDKQKKSQAKSKSSSKDLKLFI---------------DLCGESISSNEP--EMSDKPIG 105

Query: 656  ERIAELSELAFEAKSAKDLAWYDVATFIHYRITSTAELEVRVRFAGFGKDQDEWINVRRG 477
             RI+EL ELAFEA+S+KD AWYDVA+F+ YR+T   ELEVRVRF+GF   +DEW+NV+  
Sbjct: 106  SRISELKELAFEARSSKDDAWYDVASFLTYRVTCAGELEVRVRFSGFNNTEDEWVNVKTA 165

Query: 476  VRQRSIPLEPSECDKVKVGDLVLCFREDEHHAIYCDAFIEEIERTSHDNEVCKCIFVVRY 297
            VRQRSIPLE SEC KV VGDLVLC++E E  A+YCDA + +I+R  HD E C+CIFVVRY
Sbjct: 166  VRQRSIPLEQSECVKVNVGDLVLCYQEREDQAVYCDAHVLDIQRRVHDTEGCQCIFVVRY 225

Query: 296  DLDDFEDKVPPSRICIRPT 240
            D D  E++V   R+C RPT
Sbjct: 226  DHDFSEEQVKVERLCCRPT 244


>ref|XP_006343045.1| PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 1-like [Solanum
            tuberosum]
          Length = 307

 Score =  248 bits (633), Expect = 3e-63
 Identities = 135/279 (48%), Positives = 176/279 (63%), Gaps = 16/279 (5%)
 Frame = -1

Query: 1031 DSLRLEDDFESEFTLAEIVEMESLYKQMGEKSADQQFCEELATKFSSSVYRHGKSFIKWQ 852
            D L   D+   +FTLAE +EM + +K +  KS  Q+ C+E ATKFSSS +R GKS IK +
Sbjct: 2    DDLMETDEELMDFTLAEAMEMTTFFKGLKGKSISQELCQEFATKFSSSPFRTGKSLIKGE 61

Query: 851  QVHSWFHDTKKELEAKIS--------SLHQRPAV-----LAIKSAPGXXXXXXXXXXXXX 711
            QV SWF D KK   A++           ++ P V        KS                
Sbjct: 62   QVQSWFLDKKKPKAAEVPVDDYVEHVDDYEEPVVPKRRGRKPKSKNTSSSLVVYKKYDAC 121

Query: 710  XXXKAQKPAAETPRRP---IAERIAELSELAFEAKSAKDLAWYDVATFIHYRITSTAELE 540
               +  + A + P+RP    AE   EL+ LAFEA SAKDLAWYDVA+F+++R+  T ELE
Sbjct: 122  GYTRLPECAYDMPQRPRVSAAEMAKELTGLAFEALSAKDLAWYDVASFLNFRVLYTGELE 181

Query: 539  VRVRFAGFGKDQDEWINVRRGVRQRSIPLEPSECDKVKVGDLVLCFREDEHHAIYCDAFI 360
            VRVRFAGFG ++DEW+NV+RGVR+RS+PLEPSEC K+ VGD V+CFREDE+ A+Y D+ +
Sbjct: 182  VRVRFAGFGNEEDEWVNVKRGVRERSVPLEPSECVKLSVGDPVMCFREDEYLAVYGDSEV 241

Query: 359  EEIERTSHDNEVCKCIFVVRYDLDDFEDKVPPSRICIRP 243
             EI+R  HDN  C CIFVVRYDLD  E+K+   ++C RP
Sbjct: 242  VEIQRNLHDNTRCTCIFVVRYDLDKAEEKITLDKMCCRP 280


>ref|XP_004235649.1| PREDICTED: uncharacterized protein LOC101256958 [Solanum
            lycopersicum]
          Length = 304

 Score =  242 bits (618), Expect = 2e-61
 Identities = 133/279 (47%), Positives = 173/279 (62%), Gaps = 16/279 (5%)
 Frame = -1

Query: 1031 DSLRLEDDFESEFTLAEIVEMESLYKQMGEKSADQQFCEELATKFSSSVYRHGKSFIKWQ 852
            D L   D+   +FTLAE +EM + +K +  KS  Q+ C+E A KFSSS +R GKS IK +
Sbjct: 2    DDLMETDEELMDFTLAEAMEMTTFFKGLKGKSISQELCQEFANKFSSSPFRTGKSIIKGE 61

Query: 851  QVHSWFHDTKKELEAKISSL--------HQRPAV-----LAIKSAPGXXXXXXXXXXXXX 711
            QV SWF D +K   A++           ++ P V        KS                
Sbjct: 62   QVKSWFLDKQKPKAAEVPDDDYVEHVDDYEEPIVPKRRGRKPKSKNTSSSLVVYKKYDAC 121

Query: 710  XXXKAQKPAAETPRRP---IAERIAELSELAFEAKSAKDLAWYDVATFIHYRITSTAELE 540
               +  + A + P+RP    AE   EL  L+FEA SAKDLAWYDV +F+++R+  T ELE
Sbjct: 122  GYTRLPECAYDLPQRPRVSAAEMAKELRGLSFEALSAKDLAWYDVGSFLNFRVLYTGELE 181

Query: 539  VRVRFAGFGKDQDEWINVRRGVRQRSIPLEPSECDKVKVGDLVLCFREDEHHAIYCDAFI 360
            VRVRFAGFG ++DEW+NV+RGVR+RS+PLEPSEC K+ VGD V+CFREDE+ A+Y DA +
Sbjct: 182  VRVRFAGFGNEEDEWVNVKRGVRERSVPLEPSECVKLSVGDPVMCFREDEYLAVYGDAEV 241

Query: 359  EEIERTSHDNEVCKCIFVVRYDLDDFEDKVPPSRICIRP 243
             EI+R  HDN  C CIFVVRYDLD  E+K+   +IC RP
Sbjct: 242  VEIQRNLHDNTRCTCIFVVRYDLDKAEEKIVLDKICCRP 280


>ref|XP_002283948.1| PREDICTED: uncharacterized protein LOC100258357 [Vitis vinifera]
           gi|297743205|emb|CBI36072.3| unnamed protein product
           [Vitis vinifera]
          Length = 247

 Score =  240 bits (613), Expect = 7e-61
 Identities = 123/257 (47%), Positives = 169/257 (65%), Gaps = 5/257 (1%)
 Frame = -1

Query: 995 FTLAEIVEMESLYKQMGEKSADQQFCEELATKFSSSVYRHGKSFIKWQQVHSWFHDTKKE 816
           FT +EI+EME+L+++ GE++  Q+FC++LAT FS+S    G   + W++V  WF   +KE
Sbjct: 11  FTQSEILEMENLFEEFGEETLGQEFCQDLATSFSASPGCSGNMPVGWKEVRDWFQTKQKE 70

Query: 815 LEAKISSLHQRP-AVLAIKSAPGXXXXXXXXXXXXXXXXKAQKPAAETPRRPIAER---- 651
           L A+++S    P  + A+  AP                      +   P+  I  R    
Sbjct: 71  LVARVTSSPVAPRGIDALPEAP---------------------MSNNAPQNSIVPRGDMV 109

Query: 650 IAELSELAFEAKSAKDLAWYDVATFIHYRITSTAELEVRVRFAGFGKDQDEWINVRRGVR 471
            A+LSEL +EAKS+KD AWYDVA F+ YR+ S+ ELE RVRF+GFG ++DEW+NV++G+R
Sbjct: 110 AADLSELTYEAKSSKDDAWYDVAAFLTYRVLSSGELEARVRFSGFGNEEDEWVNVKKGIR 169

Query: 470 QRSIPLEPSECDKVKVGDLVLCFREDEHHAIYCDAFIEEIERTSHDNEVCKCIFVVRYDL 291
           +RSIPLEPSEC +V+VGDLVLCF+E    A+YCDA I EI+R  HD + C+CIFVVRYD 
Sbjct: 170 KRSIPLEPSECYRVRVGDLVLCFQERSDQAVYCDAHIIEIQRRLHDIKGCRCIFVVRYDH 229

Query: 290 DDFEDKVPPSRICIRPT 240
           D  E+KV   R+C RPT
Sbjct: 230 DHGEEKVNLKRLCCRPT 246


>ref|XP_006426886.1| hypothetical protein CICLE_v10026320mg [Citrus clementina]
            gi|557528876|gb|ESR40126.1| hypothetical protein
            CICLE_v10026320mg [Citrus clementina]
          Length = 256

 Score =  235 bits (599), Expect = 3e-59
 Identities = 125/256 (48%), Positives = 163/256 (63%), Gaps = 4/256 (1%)
 Frame = -1

Query: 1016 EDDFESEFTLAEIVEMESLYKQMGEKSADQQFCEELATKFSSSVYRHGKSFIKWQQVHSW 837
            ++D   +FTLAEI EMES+YK++GE S  Q++C+ LAT FS S  R  +  I W QV SW
Sbjct: 3    DEDSWPDFTLAEIKEMESMYKEIGEASLTQEYCKALATSFSFSASRAARPAITWLQVQSW 62

Query: 836  FHDTKKELEAKISSLHQRPAVLAIKSAPGXXXXXXXXXXXXXXXXKAQKPAAETPRRPIA 657
            F D +K+ +AK  S  +   +                         + +P  E   +PI 
Sbjct: 63   FRDKQKKSQAKSKSSSKDLKLFI---------------DLCGESISSNEP--EMSDKPIG 105

Query: 656  ERIAELSELAFEAKSAKDLAWYDVATFIHYRITSTAELEVRVRFAGFGKDQDEWINVRRG 477
             RI+EL ELAFEA+S+KD AWYDVA+F+ YR+T   ELEVRVRF+GF   +DEW+NV+  
Sbjct: 106  SRISELKELAFEARSSKDDAWYDVASFLTYRVTCAGELEVRVRFSGFNNTEDEWVNVKTA 165

Query: 476  VRQRSIPLEPSECDKVKVGDLVLCFREDEHHAIYCDAFIEEIERTSHDNEVCKCIFVVRY 297
            VRQRSIPLE SEC KV VGDLVLC++E E  A+YCDA + +I+R  HD E C+CIFVVRY
Sbjct: 166  VRQRSIPLEQSECVKVNVGDLVLCYQEREDQAVYCDAHVLDIQRRVHDTEGCQCIFVVRY 225

Query: 296  DLD----DFEDKVPPS 261
            D D    + ++ V PS
Sbjct: 226  DHDFSEVNLQNSVIPS 241


>ref|XP_002299736.1| hypothetical protein POPTR_0001s19000g [Populus trichocarpa]
           gi|222846994|gb|EEE84541.1| hypothetical protein
           POPTR_0001s19000g [Populus trichocarpa]
          Length = 239

 Score =  233 bits (595), Expect = 8e-59
 Identities = 124/253 (49%), Positives = 164/253 (64%)
 Frame = -1

Query: 998 EFTLAEIVEMESLYKQMGEKSADQQFCEELATKFSSSVYRHGKSFIKWQQVHSWFHDTKK 819
           EFTL+E++EME+++K++ E     QFCE+LA+ FS +  R GK  I  +QV SWF D  K
Sbjct: 4   EFTLSEMLEMENMFKELEEGPLAPQFCEKLASSFSLAPSRDGKQAITPRQVKSWFQDRLK 63

Query: 818 ELEAKISSLHQRPAVLAIKSAPGXXXXXXXXXXXXXXXXKAQKPAAETPRRPIAERIAEL 639
           + + +++S +    + A  S                    A   A E+ ++ +    ++L
Sbjct: 64  KSQPRVASSNMALKLFADLSDAS-----------------ASFGATESSQK-LKGNASDL 105

Query: 638 SELAFEAKSAKDLAWYDVATFIHYRITSTAELEVRVRFAGFGKDQDEWINVRRGVRQRSI 459
           SEL FEA S+KD AWYDVA+F++YR+  + ELEVRVRFAGF    DEW+NVRR VR+RSI
Sbjct: 106 SELIFEALSSKDNAWYDVASFLNYRVVCSGELEVRVRFAGFRNTDDEWVNVRRAVRERSI 165

Query: 458 PLEPSECDKVKVGDLVLCFREDEHHAIYCDAFIEEIERTSHDNEVCKCIFVVRYDLDDFE 279
           PLE SEC +VKVGDLVLCF+E E  A+YCDA I EI R  HD   C+C FVVRYD DDFE
Sbjct: 166 PLESSECQRVKVGDLVLCFQEREERAVYCDAHIVEINRKLHDINGCRCTFVVRYDHDDFE 225

Query: 278 DKVPPSRICIRPT 240
           ++V   R+C RPT
Sbjct: 226 EEVRLDRLCGRPT 238


>gb|EOY26909.1| F9L1.16, putative isoform 1 [Theobroma cacao]
          Length = 320

 Score =  231 bits (588), Expect = 5e-58
 Identities = 125/296 (42%), Positives = 172/296 (58%), Gaps = 40/296 (13%)
 Frame = -1

Query: 1010 DFESEFTLAEIVEMESLYKQMGEKSADQQFCEELATKFSSSVYRHGKSFIKWQQVHSWFH 831
            D  SEFTLAEI+EME++YK++GEK+ +++FC+ELAT FS S  R GKS + WQQV  WF 
Sbjct: 42   DSVSEFTLAEILEMENIYKEIGEKTLNKEFCQELATNFSCSSNRMGKSAVTWQQVQIWFQ 101

Query: 830  DTKKELEAKISSLHQRPAVLAIKSAPGXXXXXXXXXXXXXXXXKAQKPAAETPRRPIAER 651
            + + E ++K     QRP+ +A++                     + KP     R     +
Sbjct: 102  EKQMETQSK-----QRPSPMALE------------LFVDLSSANSSKPPGSLRRHK--GK 142

Query: 650  IAELSELAFEAKSAKDLAWYDVATFIHYRITSTAELEVRVRFAGFGKDQDEWINVRRGVR 471
            + +L EL+FEA+S+KD AWYDV +F+ YR+ ST ELEVRVRF+GF K +DEW+NV + VR
Sbjct: 143  VEDLKELSFEARSSKDYAWYDVDSFLTYRVLSTGELEVRVRFSGFAKTEDEWVNVEKAVR 202

Query: 470  QRSIPLEPSECDKVKVGDLVLCF------------------------------------- 402
            +RSIPLEPSEC+ VK+GDLVLC+                                     
Sbjct: 203  ERSIPLEPSECNIVKIGDLVLCYQTQRPDDLLIFPCNSPVYGLALRSNGRVYLTDIDGEG 262

Query: 401  ---REDEHHAIYCDAFIEEIERTSHDNEVCKCIFVVRYDLDDFEDKVPPSRICIRP 243
               ++ EH+ +Y DA + +I+R  HD   C CIFVV YD D  ++KVP  R+C RP
Sbjct: 263  TVRKDREHYQVYYDAHVVDIQRRVHDVRGCSCIFVVCYDHDYSKEKVPLQRLCCRP 318


>ref|NP_849666.2| uncharacterized protein [Arabidopsis thaliana]
            gi|75215641|sp|Q9XI47.1|SHH1_ARATH RecName: Full=Protein
            SAWADEE HOMEODOMAIN HOMOLOG 1; AltName: Full=DNA-binding
            transcription factor 1
            gi|5103848|gb|AAD39678.1|AC007591_43 F9L1.16 [Arabidopsis
            thaliana] gi|332191165|gb|AEE29286.1| uncharacterized
            protein AT1G15215 [Arabidopsis thaliana]
          Length = 258

 Score =  229 bits (584), Expect = 2e-57
 Identities = 115/254 (45%), Positives = 162/254 (63%), Gaps = 1/254 (0%)
 Frame = -1

Query: 1001 SEFTLAEIVEMESLYKQMGEKSADQQFCEELATKFSSSVYRHGKSFIKWQQVHSWFHDT- 825
            +EFTL+EIV+ME+LYK++G++S  + FC+ +A+ FS SV R+GKS I W+QV  WF +  
Sbjct: 12   TEFTLSEIVDMENLYKELGDQSLHKDFCQTVASTFSCSVNRNGKSSITWKQVQIWFQEKL 71

Query: 824  KKELEAKISSLHQRPAVLAIKSAPGXXXXXXXXXXXXXXXXKAQKPAAETPRRPIAERIA 645
            K + + K  +L   P  +   S P                   Q    +          +
Sbjct: 72   KHQSQPKSKTLPSPPLQIHDLSNPSSYASNASNATFVGNSTFVQTRKGKA---------S 122

Query: 644  ELSELAFEAKSAKDLAWYDVATFIHYRITSTAELEVRVRFAGFGKDQDEWINVRRGVRQR 465
            +L++LAFEAKSA+D AWYDV++F+ YR+  T ELEVRVRF+GF    DEW+NV+  VR+R
Sbjct: 123  DLADLAFEAKSARDYAWYDVSSFLTYRVLRTGELEVRVRFSGFDNRHDEWVNVKTSVRER 182

Query: 464  SIPLEPSECDKVKVGDLVLCFREDEHHAIYCDAFIEEIERTSHDNEVCKCIFVVRYDLDD 285
            SIP+EPSEC +V VGDL+LCF+E E  A+YCD  +  I+R  HD+  C C+F+VRY+LD+
Sbjct: 183  SIPVEPSECGRVNVGDLLLCFQEREDQALYCDGHVLNIKRGIHDHARCNCVFLVRYELDN 242

Query: 284  FEDKVPPSRICIRP 243
             E+ +   RIC RP
Sbjct: 243  TEESLGLERICRRP 256


>emb|CAN77675.1| hypothetical protein VITISV_013721 [Vitis vinifera]
          Length = 266

 Score =  228 bits (582), Expect = 3e-57
 Identities = 117/247 (47%), Positives = 162/247 (65%), Gaps = 5/247 (2%)
 Frame = -1

Query: 995 FTLAEIVEMESLYKQMGEKSADQQFCEELATKFSSSVYRHGKSFIKWQQVHSWFHDTKKE 816
           FT +EI+EME+L+++ GE++  Q+FC++LAT FS+S    G   + W++V  WF   +KE
Sbjct: 11  FTQSEILEMENLFEEFGEETLGQEFCQDLATSFSASPGCSGNMSVGWKEVRDWFQTKQKE 70

Query: 815 LEAKISSLHQRP-AVLAIKSAPGXXXXXXXXXXXXXXXXKAQKPAAETPRRPIAER---- 651
           L A+++S    P  + A+  AP                      +   P+  I  R    
Sbjct: 71  LVARVTSSPVAPRGIDALPEAP---------------------MSNNAPQNSIVPRGDMV 109

Query: 650 IAELSELAFEAKSAKDLAWYDVATFIHYRITSTAELEVRVRFAGFGKDQDEWINVRRGVR 471
            A+LSEL +EAKS+KD AWYDVA F+ YR+ S+ ELE RVRF+GFG ++DEW+NV++G+R
Sbjct: 110 AADLSELTYEAKSSKDDAWYDVAAFLTYRVLSSGELEARVRFSGFGNEEDEWVNVKKGIR 169

Query: 470 QRSIPLEPSECDKVKVGDLVLCFREDEHHAIYCDAFIEEIERTSHDNEVCKCIFVVRYDL 291
           +RSIPLEPSEC +V+VGDLVLCF+E    A+YCDA I EI+R  HD + C+CIFVVRYD 
Sbjct: 170 KRSIPLEPSECYRVRVGDLVLCFQERSDQAVYCDAHIIEIQRRLHDIKGCRCIFVVRYDH 229

Query: 290 DDFEDKV 270
           D  E+ V
Sbjct: 230 DHGENSV 236


>ref|XP_004155951.1| PREDICTED: uncharacterized protein LOC101230634 [Cucumis sativus]
          Length = 279

 Score =  228 bits (580), Expect = 4e-57
 Identities = 122/269 (45%), Positives = 164/269 (60%), Gaps = 6/269 (2%)
 Frame = -1

Query: 1028 SLRLEDDFESEFTLAEIVEMESLYKQMGEKSADQQFCEELATKFSSSVYRHGKSFIKWQQ 849
            S +L DD   EFTLAEIVEM+++ K   +++  Q+F +++A  FS S +R  KS +  + 
Sbjct: 6    SSKLLDDSSFEFTLAEIVEMDNILKDSRDQTLGQEFFQDVALHFSCSPWRAAKSPVTTEH 65

Query: 848  VHSWFHDTKKELEAKISSLHQRPAV------LAIKSAPGXXXXXXXXXXXXXXXXKAQKP 687
            VH+WF + +KEL A        P        L   S+P                     P
Sbjct: 66   VHAWFENRRKELRASSKKARPPPPPPSELPPLPTPSSPPPSPPPKLLLYHSESDFLTHAP 125

Query: 686  AAETPRRPIAERIAELSELAFEAKSAKDLAWYDVATFIHYRITSTAELEVRVRFAGFGKD 507
            ++  P      +  +LSELAFEA S++D AWYDVA+F+ YR+    EL+ RVR+AGF KD
Sbjct: 126  SSGPPE--FKGKATDLSELAFEAFSSRDHAWYDVASFLTYRVNCHGELDARVRYAGFTKD 183

Query: 506  QDEWINVRRGVRQRSIPLEPSECDKVKVGDLVLCFREDEHHAIYCDAFIEEIERTSHDNE 327
            +DEW+NV RGVR RSIPLE SEC +VKVGDLVLCF+E + HA+Y DA + EI+R  HD  
Sbjct: 184  EDEWVNVGRGVRDRSIPLESSECYRVKVGDLVLCFQERQDHALYFDAHVVEIQRRLHDIG 243

Query: 326  VCKCIFVVRYDLDDFEDKVPPSRICIRPT 240
             C+CIFVVRY+ D  E+KV   R+C RP+
Sbjct: 244  GCRCIFVVRYEHDRHEEKVHIGRLCCRPS 272


>ref|XP_004141662.1| PREDICTED: uncharacterized protein LOC101213827 [Cucumis sativus]
          Length = 287

 Score =  225 bits (574), Expect = 2e-56
 Identities = 122/277 (44%), Positives = 164/277 (59%), Gaps = 14/277 (5%)
 Frame = -1

Query: 1028 SLRLEDDFESEFTLAEIVEMESLYKQMGEKSADQQFCEELATKFSSSVYRHGKSFIKWQQ 849
            S +L DD   EFTLAEIVEM+++ K   +++  Q+F +++A  FS S +R  KS +  + 
Sbjct: 6    SSKLLDDSSFEFTLAEIVEMDNILKDSRDQTLGQEFFQDVALHFSCSPWRAAKSPVTTEH 65

Query: 848  VHSWFHDTKKELEAKISSLHQRPAV--------------LAIKSAPGXXXXXXXXXXXXX 711
            VH+WF + +KEL A        P                L   S+P              
Sbjct: 66   VHAWFENRRKELRASSKKARPPPPPPSEPPPPPPSELPPLPTPSSPPPSPPPKLLLYHSE 125

Query: 710  XXXKAQKPAAETPRRPIAERIAELSELAFEAKSAKDLAWYDVATFIHYRITSTAELEVRV 531
                   P++  P      +  +LSELAFEA S++D AWYDVA+F+ YR+    EL+ RV
Sbjct: 126  SDFLTHAPSSGPPE--FKGKATDLSELAFEAFSSRDHAWYDVASFLTYRVNCHGELDARV 183

Query: 530  RFAGFGKDQDEWINVRRGVRQRSIPLEPSECDKVKVGDLVLCFREDEHHAIYCDAFIEEI 351
            R+AGF KD+DEW+NV RGVR RSIPLE SEC +VKVGDLVLCF+E + HA+Y DA + EI
Sbjct: 184  RYAGFRKDEDEWVNVGRGVRDRSIPLESSECYRVKVGDLVLCFQERQDHALYFDAHVVEI 243

Query: 350  ERTSHDNEVCKCIFVVRYDLDDFEDKVPPSRICIRPT 240
            +R  HD   C+CIFVVRY+ D  E+KV   R+C RP+
Sbjct: 244  QRRLHDISGCRCIFVVRYEHDRHEEKVHIGRLCCRPS 280


>ref|XP_004287547.1| PREDICTED: uncharacterized protein LOC101293712 [Fragaria vesca
            subsp. vesca]
          Length = 302

 Score =  224 bits (572), Expect = 4e-56
 Identities = 120/257 (46%), Positives = 161/257 (62%)
 Frame = -1

Query: 1010 DFESEFTLAEIVEMESLYKQMGEKSADQQFCEELATKFSSSVYRHGKSFIKWQQVHSWFH 831
            D  S FT +EI+++E+++K+  +++  Q F + LA  FS    R GKS I  QQV  WF 
Sbjct: 23   DSLSVFTHSEIMKLENIFKETPQQALTQGFFQNLAIDFSCQPSRLGKSDITQQQVEGWFQ 82

Query: 830  DTKKELEAKISSLHQRPAVLAIKSAPGXXXXXXXXXXXXXXXXKAQKPAAETPRRPIAER 651
              +KE++AK ++            + G                        + + P    
Sbjct: 83   SRRKEIQAKGTT------------SSGAFDWVVESHEDLSDLTMFSNEPDNSQKDPC--- 127

Query: 650  IAELSELAFEAKSAKDLAWYDVATFIHYRITSTAELEVRVRFAGFGKDQDEWINVRRGVR 471
            I +LSELAFEAKS+KD AWYDVA+F+ YR+ S+ ELEVRVR+AGFG+++DEW+NVRR VR
Sbjct: 128  ITDLSELAFEAKSSKDGAWYDVASFLTYRVVSSGELEVRVRYAGFGREEDEWVNVRRAVR 187

Query: 470  QRSIPLEPSECDKVKVGDLVLCFREDEHHAIYCDAFIEEIERTSHDNEVCKCIFVVRYDL 291
             RSIPLE SEC KVKVGDLVLCF+E E  A+Y DA + EI+R  HD   C+CIFVVR+D 
Sbjct: 188  DRSIPLEESECHKVKVGDLVLCFQEREDEAVYFDALVVEIQRNLHDQTGCRCIFVVRFDH 247

Query: 290  DDFEDKVPPSRICIRPT 240
            D  +++VP  RIC RP+
Sbjct: 248  DKSKEQVPLGRICCRPS 264


>ref|XP_004510301.1| PREDICTED: uncharacterized protein LOC101512036 [Cicer arietinum]
          Length = 269

 Score =  223 bits (569), Expect = 8e-56
 Identities = 115/253 (45%), Positives = 159/253 (62%)
 Frame = -1

Query: 998 EFTLAEIVEMESLYKQMGEKSADQQFCEELATKFSSSVYRHGKSFIKWQQVHSWFHDTKK 819
           ++++ EI+E+E +Y + GE S DQ FC+E+AT FSSS  R GK+ + W+QVH WF   ++
Sbjct: 13  KYSMDEILELERIYNEKGEHSLDQSFCKEIATNFSSSSNRVGKTSVSWEQVHQWFQSKQR 72

Query: 818 ELEAKISSLHQRPAVLAIKSAPGXXXXXXXXXXXXXXXXKAQKPAAETPRRPIAERIAEL 639
           E     S  HQ      + S+P                  + +    +   P   + A+L
Sbjct: 73  E-----SKDHQ------VASSP-----DGLNLYVDLSDKSSSRTGHGSSPDPEGTQAADL 116

Query: 638 SELAFEAKSAKDLAWYDVATFIHYRITSTAELEVRVRFAGFGKDQDEWINVRRGVRQRSI 459
           S+L FEA S KD AW+DVA F++YR+ ST ELEVRVR+ GFGK++DEWINVR GVR+RSI
Sbjct: 117 SDLTFEAVSIKDNAWHDVAMFLNYRVLSTGELEVRVRYHGFGKEEDEWINVREGVRERSI 176

Query: 458 PLEPSECDKVKVGDLVLCFREDEHHAIYCDAFIEEIERTSHDNEVCKCIFVVRYDLDDFE 279
           PLE S+C KVK GDLVLCF     +A+YCDA + +I+R  HD++ C C F VR+  D  E
Sbjct: 177 PLEASDCHKVKEGDLVLCFHVKSDYALYCDARVLKIQRRIHDSKECSCSFTVRFYHDKSE 236

Query: 278 DKVPPSRICIRPT 240
           ++V  + +C RPT
Sbjct: 237 EEVSWTSLCCRPT 249


>ref|NP_001031048.1| uncharacterized protein [Arabidopsis thaliana]
            gi|332191167|gb|AEE29288.1| uncharacterized protein
            AT1G15215 [Arabidopsis thaliana]
          Length = 252

 Score =  219 bits (559), Expect = 1e-54
 Identities = 110/242 (45%), Positives = 155/242 (64%), Gaps = 1/242 (0%)
 Frame = -1

Query: 1001 SEFTLAEIVEMESLYKQMGEKSADQQFCEELATKFSSSVYRHGKSFIKWQQVHSWFHDT- 825
            +EFTL+EIV+ME+LYK++G++S  + FC+ +A+ FS SV R+GKS I W+QV  WF +  
Sbjct: 12   TEFTLSEIVDMENLYKELGDQSLHKDFCQTVASTFSCSVNRNGKSSITWKQVQIWFQEKL 71

Query: 824  KKELEAKISSLHQRPAVLAIKSAPGXXXXXXXXXXXXXXXXKAQKPAAETPRRPIAERIA 645
            K + + K  +L   P  +   S P                   Q    +          +
Sbjct: 72   KHQSQPKSKTLPSPPLQIHDLSNPSSYASNASNATFVGNSTFVQTRKGKA---------S 122

Query: 644  ELSELAFEAKSAKDLAWYDVATFIHYRITSTAELEVRVRFAGFGKDQDEWINVRRGVRQR 465
            +L++LAFEAKSA+D AWYDV++F+ YR+  T ELEVRVRF+GF    DEW+NV+  VR+R
Sbjct: 123  DLADLAFEAKSARDYAWYDVSSFLTYRVLRTGELEVRVRFSGFDNRHDEWVNVKTSVRER 182

Query: 464  SIPLEPSECDKVKVGDLVLCFREDEHHAIYCDAFIEEIERTSHDNEVCKCIFVVRYDLDD 285
            SIP+EPSEC +V VGDL+LCF+E E  A+YCD  +  I+R  HD+  C C+F+VRY+LD+
Sbjct: 183  SIPVEPSECGRVNVGDLLLCFQEREDQALYCDGHVLNIKRGIHDHARCNCVFLVRYELDN 242

Query: 284  FE 279
             E
Sbjct: 243  TE 244


>gb|AFK39040.1| unknown [Medicago truncatula]
          Length = 270

 Score =  219 bits (557), Expect = 2e-54
 Identities = 114/253 (45%), Positives = 150/253 (59%)
 Frame = -1

Query: 998 EFTLAEIVEMESLYKQMGEKSADQQFCEELATKFSSSVYRHGKSFIKWQQVHSWFHDTKK 819
           + +L EI+E+E +Y  +GEKS D  FC+++A  FSSS    GK+ + W+QV  W  +   
Sbjct: 13  KLSLDEILELERIYNDVGEKSLDPNFCKDIAANFSSSSNSDGKTSLTWEQVQQWLQNKHT 72

Query: 818 ELEAKISSLHQRPAVLAIKSAPGXXXXXXXXXXXXXXXXKAQKPAAETPRRPIAERIAEL 639
           E +   +S           S  G                    P      +P   + A+L
Sbjct: 73  ETKGHFAS-----------SPEGLNLVVDLSGKSSSIKGNKSSP------KPKGIQAADL 115

Query: 638 SELAFEAKSAKDLAWYDVATFIHYRITSTAELEVRVRFAGFGKDQDEWINVRRGVRQRSI 459
           SELAFEA S KD AW+DV+ F++YR+  T ELEVRVR+ GFGKD+DEWINV+ GVRQRSI
Sbjct: 116 SELAFEAVSIKDNAWHDVSMFLNYRVLCTGELEVRVRYHGFGKDEDEWINVKYGVRQRSI 175

Query: 458 PLEPSECDKVKVGDLVLCFREDEHHAIYCDAFIEEIERTSHDNEVCKCIFVVRYDLDDFE 279
           PLE SEC KVK G LVLCF     +A+YCDA + +I+R  HD+E C CIF VR+  D FE
Sbjct: 176 PLEASECHKVKEGHLVLCFHVKSDYALYCDAIVLKIQRREHDSEECSCIFTVRFYHDKFE 235

Query: 278 DKVPPSRICIRPT 240
           ++V    +C RPT
Sbjct: 236 EEVRWDSLCCRPT 248


>ref|XP_002890094.1| hypothetical protein ARALYDRAFT_334813 [Arabidopsis lyrata subsp.
           lyrata] gi|297335936|gb|EFH66353.1| hypothetical protein
           ARALYDRAFT_334813 [Arabidopsis lyrata subsp. lyrata]
          Length = 276

 Score =  215 bits (548), Expect = 2e-53
 Identities = 110/259 (42%), Positives = 162/259 (62%), Gaps = 11/259 (4%)
 Frame = -1

Query: 986 AEIVEMESLYKQMGEKSADQQFCEELATKFS----------SSVYRHGKSFIKWQQVHSW 837
           A+IV+ME+LYK++G++S  + FC+ +A+ FS           SV R+GKS + W+Q+ SW
Sbjct: 34  AKIVDMENLYKELGDQSLHKDFCQTVASTFSFMSSSIVSQSCSVNRNGKSTVTWKQIQSW 93

Query: 836 FHDT-KKELEAKISSLHQRPAVLAIKSAPGXXXXXXXXXXXXXXXXKAQKPAAETPRRPI 660
           F +  K++ + K  +L   P  +   S P                  A     +T +   
Sbjct: 94  FQEKLKQQSQPKFKTLPSPPLQIHDLSNPSCYA--------------ANATFVQTRKG-- 137

Query: 659 AERIAELSELAFEAKSAKDLAWYDVATFIHYRITSTAELEVRVRFAGFGKDQDEWINVRR 480
             + ++L++LAFEAKSA+D AWYDV++F+ YR+  T ELEVRVRF+GF    DEW+NV+ 
Sbjct: 138 --KASDLADLAFEAKSARDYAWYDVSSFLTYRVLRTGELEVRVRFSGFDNRHDEWVNVKT 195

Query: 479 GVRQRSIPLEPSECDKVKVGDLVLCFREDEHHAIYCDAFIEEIERTSHDNEVCKCIFVVR 300
            VR+RSIPLEPSEC +V +GDL+LCF+E +  A+YCD  +  I+R  HD+  C C+F+VR
Sbjct: 196 SVRERSIPLEPSECGRVNIGDLLLCFQERDDQALYCDGHVVNIKRGIHDHRRCNCVFLVR 255

Query: 299 YDLDDFEDKVPPSRICIRP 243
           YDLD+ E+ +   +IC RP
Sbjct: 256 YDLDNTEEPLGLEKICRRP 274


>ref|XP_003529753.1| PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 1-like [Glycine max]
          Length = 273

 Score =  213 bits (543), Expect = 9e-53
 Identities = 114/248 (45%), Positives = 153/248 (61%)
 Frame = -1

Query: 983 EIVEMESLYKQMGEKSADQQFCEELATKFSSSVYRHGKSFIKWQQVHSWFHDTKKELEAK 804
           EI+E+E +Y+ MG K  +++ C E+A +FSSS    GK+ + WQQV  WF + ++ L  K
Sbjct: 18  EILELERIYEDMGGKVLNRKSCLEIAKRFSSSSNGAGKTSLSWQQVRLWFKNNQRMLLGK 77

Query: 803 ISSLHQRPAVLAIKSAPGXXXXXXXXXXXXXXXXKAQKPAAETPRRPIAERIAELSELAF 624
             S      + A                       A+ P     +    ++ A L +L F
Sbjct: 78  DISSSDLLKISA---------------------DLAESPLLGNGK---GKQAAALDDLGF 113

Query: 623 EAKSAKDLAWYDVATFIHYRITSTAELEVRVRFAGFGKDQDEWINVRRGVRQRSIPLEPS 444
           EA+S KD+AW+DV+ F++YR+ ST ELEVRVR+AGFGK+QDEW+NV+ GVR+RSIPLEPS
Sbjct: 114 EARSTKDIAWHDVSMFLNYRVLSTGELEVRVRYAGFGKEQDEWMNVKLGVRERSIPLEPS 173

Query: 443 ECDKVKVGDLVLCFREDEHHAIYCDAFIEEIERTSHDNEVCKCIFVVRYDLDDFEDKVPP 264
           EC KVK GDLVLCF E E +A+YCDA I +I R  HD   C C F+VR+  D+ E+ V  
Sbjct: 174 ECHKVKDGDLVLCFLEREDYALYCDARIVKIHRKIHDPTECTCTFIVRFVHDNTEEGVSF 233

Query: 263 SRICIRPT 240
            RIC RPT
Sbjct: 234 DRICCRPT 241

