BLASTX nr result

ID: Catharanthus23_contig00011909 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00011909
         (1150 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006360599.1| PREDICTED: uncharacterized protein LOC102594...   285   2e-74
ref|XP_004234759.1| PREDICTED: uncharacterized protein LOC101264...   279   2e-72
ref|XP_002280431.1| PREDICTED: uncharacterized protein LOC100267...   273   1e-70
ref|XP_002308356.2| hypothetical protein POPTR_0006s20910g [Popu...   270   6e-70
ref|XP_006429915.1| hypothetical protein CICLE_v10012299mg [Citr...   269   2e-69
ref|XP_006492831.1| PREDICTED: uncharacterized protein LOC102618...   266   9e-69
gb|EOY09543.1| Uncharacterized protein TCM_024954 [Theobroma cacao]   264   6e-68
ref|XP_004136559.1| PREDICTED: uncharacterized protein LOC101219...   257   7e-66
gb|EMJ03627.1| hypothetical protein PRUPE_ppa009807mg [Prunus pe...   256   1e-65
ref|XP_004303355.1| PREDICTED: uncharacterized protein LOC101312...   255   2e-65
emb|CBI29994.3| unnamed protein product [Vitis vinifera]              253   1e-64
ref|XP_002530364.1| conserved hypothetical protein [Ricinus comm...   253   1e-64
gb|EXC30859.1| hypothetical protein L484_028038 [Morus notabilis]     250   9e-64
ref|XP_004303356.1| PREDICTED: uncharacterized protein LOC101312...   248   3e-63
ref|XP_006411465.1| hypothetical protein EUTSA_v10017089mg [Eutr...   241   5e-61
gb|AAO37222.1| hypothetical protein [Arabidopsis thaliana]            238   4e-60
ref|XP_002879962.1| hypothetical protein ARALYDRAFT_345995 [Arab...   236   2e-59
ref|NP_181726.1| uncharacterized protein [Arabidopsis thaliana] ...   235   2e-59
ref|XP_006294802.1| hypothetical protein CARUB_v10023853mg [Caps...   231   5e-58
ref|XP_006576143.1| PREDICTED: uncharacterized protein LOC100305...   223   1e-55

>ref|XP_006360599.1| PREDICTED: uncharacterized protein LOC102594828 isoform X1 [Solanum
           tuberosum] gi|565389725|ref|XP_006360600.1| PREDICTED:
           uncharacterized protein LOC102594828 isoform X2 [Solanum
           tuberosum]
          Length = 279

 Score =  285 bits (729), Expect = 2e-74
 Identities = 147/255 (57%), Positives = 188/255 (73%), Gaps = 1/255 (0%)
 Frame = +3

Query: 186 RSHLLHQKPHNCCSNPYPSDTNQHPSVFPKFSFISLKKSPS-NLLSTISAIGAANEGTIP 362
           R  L  Q+ H+C    Y  +++ + ++       S    PS NL S  SAI ++N+GT+ 
Sbjct: 21  RQRLSLQQAHSCYQR-YAFNSHGNITLSIHVQLASPHPFPSSNLKSCCSAIASSNDGTVS 79

Query: 363 VINYEDLMMKDWSFLENDETNSEEELMQKTDQIISGAEIEETSKVVISIGSEGFVDRLMN 542
           +IN+ED+M KDWSFLE  +  S EE  QK D+IIS  EI ETSKVVI+I S+ FVDR+++
Sbjct: 80  MINFEDVMEKDWSFLEYPD--SSEEHKQKIDEIISAGEITETSKVVIAICSDEFVDRVVD 137

Query: 543 SSPCEQLLVVHDSLLTLACIKEKYDKIKCWQGELIYLPEKWGSFDAFFLYFLPGLSSQLE 722
           SS C+QLLVVHDSL  LACIKEKYDK+KCWQGELIY+PEKW  FD  FLYFLP L  +L+
Sbjct: 138 SSNCKQLLVVHDSLFMLACIKEKYDKVKCWQGELIYIPEKWTPFDVVFLYFLPALPFELD 197

Query: 723 QVLEMLAKRSLPGARLVISHPQGRQGAQEQKKKYPDIVVSELPDKKTLQTAAANYSFQID 902
           Q+L+ L KR  PGAR+VISHPQGRQ  +EQ+K+YPD+VVS LP+K  LQ  AA++SF++ 
Sbjct: 198 QILDALRKRCSPGARVVISHPQGRQMVEEQQKQYPDVVVSNLPEKMLLQNVAAHHSFEVV 257

Query: 903 KYVDEPGFYLAVLKF 947
           K+VDEP FYLA+LKF
Sbjct: 258 KFVDEPAFYLAILKF 272


>ref|XP_004234759.1| PREDICTED: uncharacterized protein LOC101264102 [Solanum
           lycopersicum]
          Length = 275

 Score =  279 bits (713), Expect = 2e-72
 Identities = 137/235 (58%), Positives = 178/235 (75%)
 Frame = +3

Query: 243 DTNQHPSVFPKFSFISLKKSPSNLLSTISAIGAANEGTIPVINYEDLMMKDWSFLENDET 422
           D +  P +FP           SNL S  SAI ++N+GT+ +IN+ED+M KDWSFLE+ ++
Sbjct: 50  DQSASPHLFPS----------SNLKSCCSAIASSNDGTVSMINFEDVMEKDWSFLEHPDS 99

Query: 423 NSEEELMQKTDQIISGAEIEETSKVVISIGSEGFVDRLMNSSPCEQLLVVHDSLLTLACI 602
           ++E +  QK D+IIS  EI ETSKV+I+I S+ FVDR++ SS C+QLLVVHDSL  LACI
Sbjct: 100 SAEHK--QKIDEIISAGEITETSKVMIAISSDEFVDRVVESSICKQLLVVHDSLFMLACI 157

Query: 603 KEKYDKIKCWQGELIYLPEKWGSFDAFFLYFLPGLSSQLEQVLEMLAKRSLPGARLVISH 782
           KEKYDK+ CWQGE+IY+PEKW  FD  FLYFLP L  +L+Q+L+ L K   PGAR+VISH
Sbjct: 158 KEKYDKVMCWQGEVIYIPEKWTPFDVVFLYFLPALPFELDQILDALRKCCSPGARVVISH 217

Query: 783 PQGRQGAQEQKKKYPDIVVSELPDKKTLQTAAANYSFQIDKYVDEPGFYLAVLKF 947
           PQGRQ  +EQ+K+YPD+VVS LP+K  LQ  AA++SF++ K+VDEP FYLA+LKF
Sbjct: 218 PQGRQMVEEQQKQYPDVVVSNLPEKMLLQNVAAHHSFEVVKFVDEPAFYLAILKF 272


>ref|XP_002280431.1| PREDICTED: uncharacterized protein LOC100267633 [Vitis vinifera]
          Length = 271

 Score =  273 bits (697), Expect = 1e-70
 Identities = 135/247 (54%), Positives = 180/247 (72%)
 Frame = +3

Query: 213 HNCCSNPYPSDTNQHPSVFPKFSFISLKKSPSNLLSTISAIGAANEGTIPVINYEDLMMK 392
           H+C    Y   ++ H   +P F  +S     S    T+ A   +NE  + VI++ED M K
Sbjct: 27  HHC----YSHYSSYHRGNYPSFPLLS----SSIHRLTVGAATPSNEEAVSVIDFEDFMEK 78

Query: 393 DWSFLENDETNSEEELMQKTDQIISGAEIEETSKVVISIGSEGFVDRLMNSSPCEQLLVV 572
           DWSFL++D TNSEEE  QKTD IIS   I E S+V++S G+E FVD+L++SSPC+ LLVV
Sbjct: 79  DWSFLDSDGTNSEEEHKQKTDWIISKGNIGENSRVLVSTGAEEFVDQLVDSSPCQLLLVV 138

Query: 573 HDSLLTLACIKEKYDKIKCWQGELIYLPEKWGSFDAFFLYFLPGLSSQLEQVLEMLAKRS 752
           HDSL  LA IKEKYDK+KCWQGELIY+PEKW  FD  FLYFLP L  +L+++   LAKR 
Sbjct: 139 HDSLFVLAGIKEKYDKVKCWQGELIYVPEKWTPFDVVFLYFLPALPFELDRIFGELAKRC 198

Query: 753 LPGARLVISHPQGRQGAQEQKKKYPDIVVSELPDKKTLQTAAANYSFQIDKYVDEPGFYL 932
           LPGAR+VISH QGR+  ++Q+++YPD+++S+LPDK TLQ  AA++SF++ ++V+EP FYL
Sbjct: 199 LPGARVVISHLQGREVLEQQRRQYPDVIISDLPDKMTLQKVAADHSFEMTEFVEEPSFYL 258

Query: 933 AVLKFRD 953
           AVL FR+
Sbjct: 259 AVLNFRE 265


>ref|XP_002308356.2| hypothetical protein POPTR_0006s20910g [Populus trichocarpa]
           gi|550336758|gb|EEE91879.2| hypothetical protein
           POPTR_0006s20910g [Populus trichocarpa]
          Length = 275

 Score =  270 bits (691), Expect = 6e-70
 Identities = 129/210 (61%), Positives = 167/210 (79%)
 Frame = +3

Query: 324 ISAIGAANEGTIPVINYEDLMMKDWSFLENDETNSEEELMQKTDQIISGAEIEETSKVVI 503
           I+A   ++EG + VIN+ED + KDWSFL+++E+NS+E   Q   +IIS   IEETS+V++
Sbjct: 64  IAAAVPSDEGPVSVINFEDFIEKDWSFLDSEESNSKEH-DQNIGRIISAGRIEETSRVLV 122

Query: 504 SIGSEGFVDRLMNSSPCEQLLVVHDSLLTLACIKEKYDKIKCWQGELIYLPEKWGSFDAF 683
           S+GSEGFVDRL+++SPC  LL+VHDSL  LAC+KEKYDK+KCWQGELI++ EKW   D  
Sbjct: 123 SLGSEGFVDRLVDTSPCSLLLIVHDSLFLLACVKEKYDKVKCWQGELIHVSEKWAPLDVV 182

Query: 684 FLYFLPGLSSQLEQVLEMLAKRSLPGARLVISHPQGRQGAQEQKKKYPDIVVSELPDKKT 863
           FLYFLP L  +L++VL  LAKR  PGARLVISHPQGR+  ++QKK+Y D+V S+LPDK T
Sbjct: 183 FLYFLPALPFKLDEVLGSLAKRCSPGARLVISHPQGREVLEQQKKQYQDVVTSDLPDKMT 242

Query: 864 LQTAAANYSFQIDKYVDEPGFYLAVLKFRD 953
           LQ AAAN+SF++ +YVDEPGFYL VL+  D
Sbjct: 243 LQKAAANHSFEMVEYVDEPGFYLTVLRLSD 272


>ref|XP_006429915.1| hypothetical protein CICLE_v10012299mg [Citrus clementina]
           gi|567874653|ref|XP_006429916.1| hypothetical protein
           CICLE_v10012299mg [Citrus clementina]
           gi|557531972|gb|ESR43155.1| hypothetical protein
           CICLE_v10012299mg [Citrus clementina]
           gi|557531973|gb|ESR43156.1| hypothetical protein
           CICLE_v10012299mg [Citrus clementina]
          Length = 305

 Score =  269 bits (687), Expect = 2e-69
 Identities = 147/261 (56%), Positives = 182/261 (69%), Gaps = 4/261 (1%)
 Frame = +3

Query: 189 SHLLHQKPHNCCSNPYPSDTNQHPSVFPKFSFISLKKSPSNLLSTIS-AIGAAN---EGT 356
           S  L  KP   C+     + + H    P  S  SLK S    LST   +IGAA+   EGT
Sbjct: 51  SPYLRPKPQYICT----FNRHHHDHDLPTLSH-SLKPSLQLSLSTRKMSIGAASPPDEGT 105

Query: 357 IPVINYEDLMMKDWSFLENDETNSEEELMQKTDQIISGAEIEETSKVVISIGSEGFVDRL 536
           + VIN+ED   KDWSFL++DE N +E + Q+ DQIIS  EI+E+SKV++SI SE FVDR+
Sbjct: 106 VSVINFEDFTEKDWSFLDSDELNFKEHI-QRIDQIISAGEIDESSKVLVSISSEEFVDRV 164

Query: 537 MNSSPCEQLLVVHDSLLTLACIKEKYDKIKCWQGELIYLPEKWGSFDAFFLYFLPGLSSQ 716
           + SSP   LLVVHDSL  LA IKEKYD +KCWQGELIY+P+KWG  D  FLYFLP +   
Sbjct: 165 VESSP-SLLLVVHDSLFVLAGIKEKYDTVKCWQGELIYVPDKWGPLDVVFLYFLPAMPFP 223

Query: 717 LEQVLEMLAKRSLPGARLVISHPQGRQGAQEQKKKYPDIVVSELPDKKTLQTAAANYSFQ 896
           L+QV E LA R  PGAR+VISHPQGR+  Q+Q+K++PD++VS+LPD+ TLQ  AAN+ FQ
Sbjct: 224 LDQVFETLANRCSPGARVVISHPQGREALQKQRKQFPDVIVSDLPDQMTLQKVAANHCFQ 283

Query: 897 IDKYVDEPGFYLAVLKFRDNK 959
           ID +VDE GFYL VLKF   K
Sbjct: 284 IDNFVDESGFYLVVLKFSKAK 304


>ref|XP_006492831.1| PREDICTED: uncharacterized protein LOC102618593 isoform X1 [Citrus
           sinensis] gi|568879797|ref|XP_006492832.1| PREDICTED:
           uncharacterized protein LOC102618593 isoform X2 [Citrus
           sinensis]
          Length = 274

 Score =  266 bits (681), Expect = 9e-69
 Identities = 146/261 (55%), Positives = 183/261 (70%), Gaps = 4/261 (1%)
 Frame = +3

Query: 189 SHLLHQKPHNCCSNPYPSDTNQHPSVFPKFSFISLKKSPSNLLSTI-SAIGAAN---EGT 356
           S  L  KP   C+     + + H    P  S  SLK S    LST  ++IGAA+   EGT
Sbjct: 20  SPYLRPKPQYICT----FNRHHHDYNLPTLSH-SLKPSLQLSLSTRKTSIGAASPPDEGT 74

Query: 357 IPVINYEDLMMKDWSFLENDETNSEEELMQKTDQIISGAEIEETSKVVISIGSEGFVDRL 536
           + VIN+ED   KDWSFL++DE N +E + Q+ DQIIS  EI ++SKV++SI SE FVDR+
Sbjct: 75  VSVINFEDFTEKDWSFLDSDELNFKEHI-QRIDQIISAGEIGKSSKVLVSISSEEFVDRV 133

Query: 537 MNSSPCEQLLVVHDSLLTLACIKEKYDKIKCWQGELIYLPEKWGSFDAFFLYFLPGLSSQ 716
           + SSP   LLVVHDSL  LA IKEKYD +KCWQGELIY+P+KWG  D  FLYFLP +   
Sbjct: 134 VESSP-SLLLVVHDSLFALAGIKEKYDTVKCWQGELIYVPDKWGPLDVVFLYFLPAMPFP 192

Query: 717 LEQVLEMLAKRSLPGARLVISHPQGRQGAQEQKKKYPDIVVSELPDKKTLQTAAANYSFQ 896
           L+QV E LA R  PGAR+VISHPQGR+  Q+Q+K++PD++VS+LPD+ TLQ  AAN+SF+
Sbjct: 193 LDQVFETLANRCSPGARVVISHPQGREALQKQRKQFPDVIVSDLPDQMTLQKVAANHSFE 252

Query: 897 IDKYVDEPGFYLAVLKFRDNK 959
           ID +VDE GFYL VLKF   K
Sbjct: 253 IDNFVDESGFYLVVLKFSKAK 273


>gb|EOY09543.1| Uncharacterized protein TCM_024954 [Theobroma cacao]
          Length = 276

 Score =  264 bits (674), Expect = 6e-68
 Identities = 131/217 (60%), Positives = 164/217 (75%)
 Frame = +3

Query: 297 KSPSNLLSTISAIGAANEGTIPVINYEDLMMKDWSFLENDETNSEEELMQKTDQIISGAE 476
           KS  +  S      A+NEG + VIN ED   KDWSFL++D+ NSE+ + Q  D+I S  E
Sbjct: 56  KSLCSHQSLAGTANASNEGAVSVINIEDFYEKDWSFLDSDDLNSEQ-VRQNIDRITSAGE 114

Query: 477 IEETSKVVISIGSEGFVDRLMNSSPCEQLLVVHDSLLTLACIKEKYDKIKCWQGELIYLP 656
           IEETS+V++SIGSEGFVD L+ SSP + LLVVHDS+L LA IKEKYD++KCWQGELI +P
Sbjct: 115 IEETSRVLVSIGSEGFVDHLVESSPSQLLLVVHDSILILAGIKEKYDEVKCWQGELIGVP 174

Query: 657 EKWGSFDAFFLYFLPGLSSQLEQVLEMLAKRSLPGARLVISHPQGRQGAQEQKKKYPDIV 836
           EKW   D  FLYFLP L  +L+Q+  +LAKR  PGARLVISHPQGR   Q+Q K++PDI+
Sbjct: 175 EKWSPLDVVFLYFLPALPFKLDQIFTLLAKRCSPGARLVISHPQGRAVLQQQGKQFPDII 234

Query: 837 VSELPDKKTLQTAAANYSFQIDKYVDEPGFYLAVLKF 947
           V+ LPDK TLQ  AA++SF++ ++ DEPGFYLAVLKF
Sbjct: 235 VANLPDKTTLQRVAADHSFEMTEFEDEPGFYLAVLKF 271


>ref|XP_004136559.1| PREDICTED: uncharacterized protein LOC101219545 [Cucumis sativus]
           gi|449518537|ref|XP_004166298.1| PREDICTED:
           uncharacterized protein LOC101227832 [Cucumis sativus]
          Length = 270

 Score =  257 bits (656), Expect = 7e-66
 Identities = 124/226 (54%), Positives = 173/226 (76%)
 Frame = +3

Query: 270 PKFSFISLKKSPSNLLSTISAIGAANEGTIPVINYEDLMMKDWSFLENDETNSEEELMQK 449
           P  SF +L  S S   S+  +    NEG + V+N+EDL+ KD+SFL++D+ +S EE  QK
Sbjct: 44  PLISFPALHISNSIACSSTPS----NEGVVSVVNFEDLVEKDFSFLDSDDFSSIEEHGQK 99

Query: 450 TDQIISGAEIEETSKVVISIGSEGFVDRLMNSSPCEQLLVVHDSLLTLACIKEKYDKIKC 629
             +IIS  EI E+S+V++SI SEGFVD+L   +P   LLVVHDS+LTLACIKEKYDK+KC
Sbjct: 100 IRRIISAGEIVESSQVMVSISSEGFVDQLFQLAPSRSLLVVHDSILTLACIKEKYDKVKC 159

Query: 630 WQGELIYLPEKWGSFDAFFLYFLPGLSSQLEQVLEMLAKRSLPGARLVISHPQGRQGAQE 809
           WQGE+IY+PEKWG FDA FLY+LP +  +L+ +   L++R + GARLVISHP GR+  ++
Sbjct: 160 WQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGALSERCVAGARLVISHPNGRKALEQ 219

Query: 810 QKKKYPDIVVSELPDKKTLQTAAANYSFQIDKYVDEPGFYLAVLKF 947
           +++++PD+VVS+LPD+ TLQ AAA++SF + +++DE GFYLA+LKF
Sbjct: 220 EQQQFPDVVVSDLPDRMTLQKAAADHSFDLTEFIDEHGFYLAILKF 265


>gb|EMJ03627.1| hypothetical protein PRUPE_ppa009807mg [Prunus persica]
          Length = 276

 Score =  256 bits (654), Expect = 1e-65
 Identities = 122/210 (58%), Positives = 157/210 (74%)
 Frame = +3

Query: 318 STISAIGAANEGTIPVINYEDLMMKDWSFLENDETNSEEELMQKTDQIISGAEIEETSKV 497
           S   A    +EG + VIN++D   KDWSFL++ + +S  +     D+II+  EIEETS+V
Sbjct: 62  SFCGATAPPDEGVVSVINFDDFAEKDWSFLDSADFSSGPDYNLNIDRIITAGEIEETSRV 121

Query: 498 VISIGSEGFVDRLMNSSPCEQLLVVHDSLLTLACIKEKYDKIKCWQGELIYLPEKWGSFD 677
           ++SIGSEGFVDR++ SSPC  LLVVHDSL  LA IKEKYDK+KCWQGELIY+P+KW   D
Sbjct: 122 MVSIGSEGFVDRVVESSPCNLLLVVHDSLFVLAGIKEKYDKVKCWQGELIYVPDKWAPLD 181

Query: 678 AFFLYFLPGLSSQLEQVLEMLAKRSLPGARLVISHPQGRQGAQEQKKKYPDIVVSELPDK 857
             FLYFLP +   L++    LA+  L GARLVISHPQGR+  ++Q+++YPD+V S+LP+K
Sbjct: 182 VVFLYFLPAMPFTLDEAFGALARCFLAGARLVISHPQGREVLEQQRQQYPDVVTSDLPEK 241

Query: 858 KTLQTAAANYSFQIDKYVDEPGFYLAVLKF 947
           KTLQ  AA +SF++  YVDEPGFYLAVLKF
Sbjct: 242 KTLQEVAAQHSFELTDYVDEPGFYLAVLKF 271


>ref|XP_004303355.1| PREDICTED: uncharacterized protein LOC101312459 isoform 1 [Fragaria
           vesca subsp. vesca]
          Length = 280

 Score =  255 bits (652), Expect = 2e-65
 Identities = 130/244 (53%), Positives = 173/244 (70%), Gaps = 1/244 (0%)
 Frame = +3

Query: 219 CCSNPYPSDTNQHPSVFPKFSFISLKKSPSNLLSTI-SAIGAANEGTIPVINYEDLMMKD 395
           C  N Y S  +     FP  S  SL+ S S + +T+ S  G   EG + VIN+ED+  KD
Sbjct: 37  CHRNTYVSPLSLS---FPLHSHYSLRTSSSLVGATVPSDEGVVTEGVVSVINFEDVAEKD 93

Query: 396 WSFLENDETNSEEELMQKTDQIISGAEIEETSKVVISIGSEGFVDRLMNSSPCEQLLVVH 575
           WSFL++D+ +SE++ + K ++IIS  EI+ETS+V++S+GSE FVD+L+ SSPC  LLVVH
Sbjct: 94  WSFLDSDDFSSEQDKL-KVERIISAGEIQETSRVMVSVGSEAFVDQLVESSPCSMLLVVH 152

Query: 576 DSLLTLACIKEKYDKIKCWQGELIYLPEKWGSFDAFFLYFLPGLSSQLEQVLEMLAKRSL 755
           DSL  LA IKEKYDK+KCWQGELIY+P+KW   D  FLYFLP +     QV   +AK   
Sbjct: 153 DSLFVLAGIKEKYDKVKCWQGELIYVPDKWAPLDVVFLYFLPAVPFNHGQVFGAVAKCFS 212

Query: 756 PGARLVISHPQGRQGAQEQKKKYPDIVVSELPDKKTLQTAAANYSFQIDKYVDEPGFYLA 935
           PGARLVISHPQGR+  ++Q+++YPD++ SELP+K+TL+  AA + F++  YVDE G YLA
Sbjct: 213 PGARLVISHPQGREVFEKQRQQYPDVITSELPEKRTLKEVAAEHFFELTDYVDEQGLYLA 272

Query: 936 VLKF 947
           VL F
Sbjct: 273 VLTF 276


>emb|CBI29994.3| unnamed protein product [Vitis vinifera]
          Length = 196

 Score =  253 bits (646), Expect = 1e-64
 Identities = 119/190 (62%), Positives = 154/190 (81%)
 Frame = +3

Query: 384 MMKDWSFLENDETNSEEELMQKTDQIISGAEIEETSKVVISIGSEGFVDRLMNSSPCEQL 563
           M KDWSFL++D TNSEEE  QKTD IIS   I E S+V++S G+E FVD+L++SSPC+ L
Sbjct: 1   MEKDWSFLDSDGTNSEEEHKQKTDWIISKGNIGENSRVLVSTGAEEFVDQLVDSSPCQLL 60

Query: 564 LVVHDSLLTLACIKEKYDKIKCWQGELIYLPEKWGSFDAFFLYFLPGLSSQLEQVLEMLA 743
           LVVHDSL  LA IKEKYDK+KCWQGELIY+PEKW  FD  FLYFLP L  +L+++   LA
Sbjct: 61  LVVHDSLFVLAGIKEKYDKVKCWQGELIYVPEKWTPFDVVFLYFLPALPFELDRIFGELA 120

Query: 744 KRSLPGARLVISHPQGRQGAQEQKKKYPDIVVSELPDKKTLQTAAANYSFQIDKYVDEPG 923
           KR LPGAR+VISH QGR+  ++Q+++YPD+++S+LPDK TLQ  AA++SF++ ++V+EP 
Sbjct: 121 KRCLPGARVVISHLQGREVLEQQRRQYPDVIISDLPDKMTLQKVAADHSFEMTEFVEEPS 180

Query: 924 FYLAVLKFRD 953
           FYLAVL FR+
Sbjct: 181 FYLAVLNFRE 190


>ref|XP_002530364.1| conserved hypothetical protein [Ricinus communis]
           gi|223530111|gb|EEF32025.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 276

 Score =  253 bits (645), Expect = 1e-64
 Identities = 127/228 (55%), Positives = 169/228 (74%)
 Frame = +3

Query: 264 VFPKFSFISLKKSPSNLLSTISAIGAANEGTIPVINYEDLMMKDWSFLENDETNSEEELM 443
           +FP  S  SL    S++ +T+ +    +EG + VI++ED + KDWSFL+ ++ NS+E   
Sbjct: 48  LFPSNSRYSLHTHQSSIGATVPS---NDEGPVSVIHFEDFIEKDWSFLDFEDLNSKEH-K 103

Query: 444 QKTDQIISGAEIEETSKVVISIGSEGFVDRLMNSSPCEQLLVVHDSLLTLACIKEKYDKI 623
           QK  QIIS  EIEETS++++SIGSE FVD+L+++SP   L VVHDSL  LA IKEKYDK+
Sbjct: 104 QKVGQIISAGEIEETSRILVSIGSEEFVDQLVDTSPKSHLFVVHDSLFLLAMIKEKYDKV 163

Query: 624 KCWQGELIYLPEKWGSFDAFFLYFLPGLSSQLEQVLEMLAKRSLPGARLVISHPQGRQGA 803
           KCWQGELI++PEKW   D  FLYFLP L  +L+QV   LA+R   GAR+++SH QGR+  
Sbjct: 164 KCWQGELIHVPEKWAPLDVVFLYFLPVLPFKLDQVFGTLAQRCSQGARVIVSHLQGREVL 223

Query: 804 QEQKKKYPDIVVSELPDKKTLQTAAANYSFQIDKYVDEPGFYLAVLKF 947
           ++Q+K+Y D++VSELPDK TLQ  AAN SFQ+ +YVDEPG YLAVL+F
Sbjct: 224 EKQRKQYQDVIVSELPDKMTLQNVAANNSFQVTEYVDEPGLYLAVLRF 271


>gb|EXC30859.1| hypothetical protein L484_028038 [Morus notabilis]
          Length = 278

 Score =  250 bits (638), Expect = 9e-64
 Identities = 133/262 (50%), Positives = 179/262 (68%), Gaps = 23/262 (8%)
 Frame = +3

Query: 237 PSDTNQHP---SVFPKFSFISLKK------------SPS---NLL----STISAIGAANE 350
           PS  +Q P    + P+ SF  L +            SPS   NLL    S+I   G ++E
Sbjct: 14  PSSISQFPPYHQIQPRNSFTILHRRNHHLHCPLSLTSPSISHNLLRIRRSSIVETGPSDE 73

Query: 351 GTIPVINYE-DLMMKDWSFLENDETNSEEELMQKTDQIISGAEIEETSKVVISIGSEGFV 527
           G IP+I++E D + KDWSFL++ +  S ++ +QK  +II+  +IEE+S+V++S  SE F+
Sbjct: 74  GAIPLIDFEEDFVEKDWSFLDSGDLTSNQDYIQKVGRIIAAGQIEESSRVMVSATSEVFI 133

Query: 528 DRLMNSSPCEQLLVVHDSLLTLACIKEKYDKIKCWQGELIYLPEKWGSFDAFFLYFLPGL 707
           D L+ SSPC  LLVVHDSL  LA IKEK+DK+KCWQGELIY+PEKWG  D  FLYFLP L
Sbjct: 134 DELVYSSPCNLLLVVHDSLFVLAGIKEKHDKVKCWQGELIYVPEKWGLLDVAFLYFLPAL 193

Query: 708 SSQLEQVLEMLAKRSLPGARLVISHPQGRQGAQEQKKKYPDIVVSELPDKKTLQTAAANY 887
              L+ V   LAK  LPGARLVI HPQGR+  ++Q+K+YPD+++S LP+K TL+  AA++
Sbjct: 194 PFTLDHVFGALAKCCLPGARLVICHPQGREVLEQQRKQYPDVIISNLPEKVTLEKVAADH 253

Query: 888 SFQIDKYVDEPGFYLAVLKFRD 953
            F + ++VD+P FYLAVLKFRD
Sbjct: 254 PFDLVEFVDDPEFYLAVLKFRD 275


>ref|XP_004303356.1| PREDICTED: uncharacterized protein LOC101312459 isoform 2 [Fragaria
           vesca subsp. vesca]
          Length = 218

 Score =  248 bits (633), Expect = 3e-63
 Identities = 117/204 (57%), Positives = 155/204 (75%)
 Frame = +3

Query: 336 GAANEGTIPVINYEDLMMKDWSFLENDETNSEEELMQKTDQIISGAEIEETSKVVISIGS 515
           G   EG + VIN+ED+  KDWSFL++D+ +SE++ + K ++IIS  EI+ETS+V++S+GS
Sbjct: 12  GVVTEGVVSVINFEDVAEKDWSFLDSDDFSSEQDKL-KVERIISAGEIQETSRVMVSVGS 70

Query: 516 EGFVDRLMNSSPCEQLLVVHDSLLTLACIKEKYDKIKCWQGELIYLPEKWGSFDAFFLYF 695
           E FVD+L+ SSPC  LLVVHDSL  LA IKEKYDK+KCWQGELIY+P+KW   D  FLYF
Sbjct: 71  EAFVDQLVESSPCSMLLVVHDSLFVLAGIKEKYDKVKCWQGELIYVPDKWAPLDVVFLYF 130

Query: 696 LPGLSSQLEQVLEMLAKRSLPGARLVISHPQGRQGAQEQKKKYPDIVVSELPDKKTLQTA 875
           LP +     QV   +AK   PGARLVISHPQGR+  ++Q+++YPD++ SELP+K+TL+  
Sbjct: 131 LPAVPFNHGQVFGAVAKCFSPGARLVISHPQGREVFEKQRQQYPDVITSELPEKRTLKEV 190

Query: 876 AANYSFQIDKYVDEPGFYLAVLKF 947
           AA + F++  YVDE G YLAVL F
Sbjct: 191 AAEHFFELTDYVDEQGLYLAVLTF 214


>ref|XP_006411465.1| hypothetical protein EUTSA_v10017089mg [Eutrema salsugineum]
           gi|557112634|gb|ESQ52918.1| hypothetical protein
           EUTSA_v10017089mg [Eutrema salsugineum]
          Length = 266

 Score =  241 bits (614), Expect = 5e-61
 Identities = 124/255 (48%), Positives = 169/255 (66%)
 Frame = +3

Query: 183 DRSHLLHQKPHNCCSNPYPSDTNQHPSVFPKFSFISLKKSPSNLLSTISAIGAANEGTIP 362
           D SHLL  +P          +  Q  S  PK   +S      +   ++S+     EGT+ 
Sbjct: 20  DSSHLLSLQPKKL-------NRLQSSSFSPKLVSLSQSYVSRHRAFSVSSTSPPPEGTVS 72

Query: 363 VINYEDLMMKDWSFLENDETNSEEELMQKTDQIISGAEIEETSKVVISIGSEGFVDRLMN 542
           V+++ +   KDWSFLE+ E +S E   QK ++II   EI E+S+V++SI SE FVD L+ 
Sbjct: 73  VVDFHE---KDWSFLESMELDSPEHT-QKMEKIIKAGEISESSRVLVSISSEAFVDCLVE 128

Query: 543 SSPCEQLLVVHDSLLTLACIKEKYDKIKCWQGELIYLPEKWGSFDAFFLYFLPGLSSQLE 722
           SS  + LL+VHDSL  LAC+KEKYDK+KCWQGELIY+PEKW   D  FLYFLP L  +L+
Sbjct: 129 SSASQLLLIVHDSLFVLACVKEKYDKVKCWQGELIYVPEKWSPLDVVFLYFLPALPFELD 188

Query: 723 QVLEMLAKRSLPGARLVISHPQGRQGAQEQKKKYPDIVVSELPDKKTLQTAAANYSFQID 902
           +V + L++R   GAR+VISHPQGRQG ++Q+K++ D+VVS+LPD  TL   A  +SF++ 
Sbjct: 189 EVFKTLSQRCSSGARVVISHPQGRQGLEQQRKEFSDVVVSDLPDNSTLMNVAKKHSFELT 248

Query: 903 KYVDEPGFYLAVLKF 947
           ++VDE G YLAVLKF
Sbjct: 249 QFVDEQGLYLAVLKF 263


>gb|AAO37222.1| hypothetical protein [Arabidopsis thaliana]
          Length = 266

 Score =  238 bits (606), Expect = 4e-60
 Identities = 113/199 (56%), Positives = 153/199 (76%)
 Frame = +3

Query: 348 EGTIPVINYEDLMMKDWSFLENDETNSEEELMQKTDQIISGAEIEETSKVVISIGSEGFV 527
           EGT+ V+++ +   KDWSFLE+ E +S E   QK ++II   E+ E+S+V++SIGSE FV
Sbjct: 68  EGTVSVVDFHE---KDWSFLESMEIDSTEHT-QKIERIIKAGELSESSRVLVSIGSETFV 123

Query: 528 DRLMNSSPCEQLLVVHDSLLTLACIKEKYDKIKCWQGELIYLPEKWGSFDAFFLYFLPGL 707
           DRL+ SSP + LL+VHDSL TLACIKEKYDK+KCWQGE+IY+PEKW   DA FLYFLP L
Sbjct: 124 DRLVESSPSQLLLIVHDSLFTLACIKEKYDKVKCWQGEMIYVPEKWSPLDAVFLYFLPAL 183

Query: 708 SSQLEQVLEMLAKRSLPGARLVISHPQGRQGAQEQKKKYPDIVVSELPDKKTLQTAAANY 887
              L+++ + L++R   GAR+VISHPQGR G ++Q+K++ D+VVS+LPD+ TL   A  +
Sbjct: 184 PFDLDELFKTLSQRCSSGARVVISHPQGRLGLEQQRKEFSDVVVSDLPDESTLSNVAEKH 243

Query: 888 SFQIDKYVDEPGFYLAVLK 944
           SF++ ++VDE G YLAVLK
Sbjct: 244 SFELTQFVDEQGLYLAVLK 262


>ref|XP_002879962.1| hypothetical protein ARALYDRAFT_345995 [Arabidopsis lyrata subsp.
           lyrata] gi|297325801|gb|EFH56221.1| hypothetical protein
           ARALYDRAFT_345995 [Arabidopsis lyrata subsp. lyrata]
          Length = 263

 Score =  236 bits (601), Expect = 2e-59
 Identities = 114/199 (57%), Positives = 149/199 (74%)
 Frame = +3

Query: 348 EGTIPVINYEDLMMKDWSFLENDETNSEEELMQKTDQIISGAEIEETSKVVISIGSEGFV 527
           EGT+ V ++ +   KDWSFLE+ E  S E   QK ++II   EI E+S+V++SI SE FV
Sbjct: 65  EGTVSVFDFHE---KDWSFLESMEIESTEHT-QKIERIIKAGEISESSRVLVSISSEAFV 120

Query: 528 DRLMNSSPCEQLLVVHDSLLTLACIKEKYDKIKCWQGELIYLPEKWGSFDAFFLYFLPGL 707
           DRL+ SSP + LL+VHDSL TLAC+KEKYDK+KCWQGELIY+PEKW   DA FLYFLP L
Sbjct: 121 DRLVESSPSQLLLIVHDSLFTLACVKEKYDKVKCWQGELIYVPEKWSPLDAVFLYFLPAL 180

Query: 708 SSQLEQVLEMLAKRSLPGARLVISHPQGRQGAQEQKKKYPDIVVSELPDKKTLQTAAANY 887
              L+ + + L++R   GAR+VISHPQGRQG ++Q+K++ D+VVS+LPD  TL+  A   
Sbjct: 181 PFDLDDLFKTLSQRCSSGARVVISHPQGRQGLEQQRKEFSDVVVSDLPDDSTLRNVAKKR 240

Query: 888 SFQIDKYVDEPGFYLAVLK 944
           SF++ ++VDE G YLAVLK
Sbjct: 241 SFELTQFVDEQGLYLAVLK 259


>ref|NP_181726.1| uncharacterized protein [Arabidopsis thaliana]
           gi|2257716|gb|AAB63558.1| hypothetical protein
           [Arabidopsis thaliana] gi|18491233|gb|AAL69441.1|
           At2g41950/T6D20.26 [Arabidopsis thaliana]
           gi|61742665|gb|AAX55153.1| hypothetical protein
           At2g41950 [Arabidopsis thaliana]
           gi|330254961|gb|AEC10055.1| uncharacterized protein
           AT2G41950 [Arabidopsis thaliana]
          Length = 266

 Score =  235 bits (600), Expect = 2e-59
 Identities = 112/199 (56%), Positives = 152/199 (76%)
 Frame = +3

Query: 348 EGTIPVINYEDLMMKDWSFLENDETNSEEELMQKTDQIISGAEIEETSKVVISIGSEGFV 527
           EGT+ V+++ +   KDWSFLE+ E +S E   QK ++II   E+ E+S+V++SI SE FV
Sbjct: 68  EGTVSVVDFHE---KDWSFLESMEIDSTEHT-QKIERIIKAGELSESSRVLVSISSETFV 123

Query: 528 DRLMNSSPCEQLLVVHDSLLTLACIKEKYDKIKCWQGELIYLPEKWGSFDAFFLYFLPGL 707
           DRL+ SSP + LL+VHDSL TLACIKEKYDK+KCWQGE+IY+PEKW   DA FLYFLP L
Sbjct: 124 DRLVESSPSQLLLIVHDSLFTLACIKEKYDKVKCWQGEMIYVPEKWSPLDAVFLYFLPAL 183

Query: 708 SSQLEQVLEMLAKRSLPGARLVISHPQGRQGAQEQKKKYPDIVVSELPDKKTLQTAAANY 887
              L+++ + L++R   GAR+VISHPQGR G ++Q+K++ D+VVS+LPD+ TL   A  +
Sbjct: 184 PFDLDELFKTLSQRCSSGARVVISHPQGRLGLEQQRKEFSDVVVSDLPDESTLSNVAEKH 243

Query: 888 SFQIDKYVDEPGFYLAVLK 944
           SF++ ++VDE G YLAVLK
Sbjct: 244 SFELTQFVDEQGLYLAVLK 262


>ref|XP_006294802.1| hypothetical protein CARUB_v10023853mg [Capsella rubella]
           gi|482563510|gb|EOA27700.1| hypothetical protein
           CARUB_v10023853mg [Capsella rubella]
          Length = 266

 Score =  231 bits (588), Expect = 5e-58
 Identities = 107/199 (53%), Positives = 149/199 (74%)
 Frame = +3

Query: 348 EGTIPVINYEDLMMKDWSFLENDETNSEEELMQKTDQIISGAEIEETSKVVISIGSEGFV 527
           E T+ V+ + +   KDWSFLE+ E +S E   QK ++II   EI E+S++++S+ SE FV
Sbjct: 68  EETVSVVGFHE---KDWSFLESMENDSTEHT-QKMERIIKAGEISESSRILVSMSSEAFV 123

Query: 528 DRLMNSSPCEQLLVVHDSLLTLACIKEKYDKIKCWQGELIYLPEKWGSFDAFFLYFLPGL 707
           DRL+ SSP + LL+VHDSL TLAC+KEKYDK+KCWQGE+IY+PEKW   DA FLYF+P L
Sbjct: 124 DRLVESSPSQLLLIVHDSLFTLACVKEKYDKVKCWQGEMIYVPEKWSPLDAVFLYFIPAL 183

Query: 708 SSQLEQVLEMLAKRSLPGARLVISHPQGRQGAQEQKKKYPDIVVSELPDKKTLQTAAANY 887
              L+++   L++R   GAR+VISHPQGRQG ++Q+K++ D+VVS+LPD   L+  A  +
Sbjct: 184 PFDLDELFNTLSQRCSSGARVVISHPQGRQGLEQQRKEFSDVVVSDLPDDSVLRDVAKRH 243

Query: 888 SFQIDKYVDEPGFYLAVLK 944
           SF++ +++DE G YLAVLK
Sbjct: 244 SFELSQFIDEQGLYLAVLK 262


>ref|XP_006576143.1| PREDICTED: uncharacterized protein LOC100305496 isoform X1 [Glycine
           max]
          Length = 250

 Score =  223 bits (568), Expect = 1e-55
 Identities = 116/233 (49%), Positives = 161/233 (69%), Gaps = 3/233 (1%)
 Frame = +3

Query: 252 QHPSVFPKFSFISLKKSPSNLLSTISAIGAA---NEGTIPVINYEDLMMKDWSFLENDET 422
           +H  +F   S I L + P+ L +T S I A     EG    ++  +++ KDWS L+  E 
Sbjct: 23  KHQPIFRTLSPIPLTQKPTFLRATDSNIDAPISLPEGA-SFVSIPEIIEKDWSVLDCAEH 81

Query: 423 NSEEELMQKTDQIISGAEIEETSKVVISIGSEGFVDRLMNSSPCEQLLVVHDSLLTLACI 602
            +       TD+II+   IE++S+V++S GSE FVD L   +P   + VVHDSLLTLACI
Sbjct: 82  RT-------TDRIIASGNIEQSSRVLVSTGSEDFVDSLAGLTP--SVFVVHDSLLTLACI 132

Query: 603 KEKYDKIKCWQGELIYLPEKWGSFDAFFLYFLPGLSSQLEQVLEMLAKRSLPGARLVISH 782
           KEKYD++KCWQGE+IY+PEKW  FDA FLYFLP L  +L Q+LE LA +  PG R++ISH
Sbjct: 133 KEKYDRVKCWQGEIIYVPEKWAPFDAVFLYFLPALPFKLHQILESLAGKCAPGGRVIISH 192

Query: 783 PQGRQGAQEQKKKYPDIVVSELPDKKTLQTAAANYSFQIDKYVDEPGFYLAVL 941
           P+G++  ++Q+K+YPD+VVS+LP+K  LQ+ AA +SF + ++VDEPG YLAVL
Sbjct: 193 PKGKEVLEQQRKQYPDVVVSDLPNKTYLQSVAAAHSFDVAEFVDEPGLYLAVL 245

