BLASTX nr result
ID: Forsythia22_contig00018638
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00018638
         (1072 letters)

Database: ./nr
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_011092235.1| PREDICTED: uncharacterized protein LOC105172...   438   e-120
ref|XP_009783127.1| PREDICTED: uncharacterized protein LOC104231...   419   e-114
ref|XP_009783126.1| PREDICTED: uncharacterized protein LOC104231...   417   e-114
ref|XP_006358484.1| PREDICTED: uncharacterized protein LOC102593...   403   e-109
emb|CDP17763.1| unnamed protein product [Coffea canephora]            401   e-109
ref|XP_004230386.1| PREDICTED: uncharacterized protein LOC101247...   400   e-109
ref|XP_012842312.1| PREDICTED: uncharacterized protein LOC105962...   373   e-100
ref|XP_002304112.2| hypothetical protein POPTR_0003s03710g [Popu...   355   4e-95
ref|XP_011009424.1| PREDICTED: uncharacterized protein LOC105114...   353   1e-94
ref|XP_010252239.1| PREDICTED: uncharacterized protein LOC104593...   352   3e-94
gb|KDO53849.1| hypothetical protein CISIN_1g014334mg [Citrus sin...   349   2e-93
ref|XP_006431360.1| hypothetical protein CICLE_v10001110mg [Citr...   348   3e-93
gb|KHG17286.1| DNA-3-methyladenine glycosylase 1 [Gossypium arbo...   348   4e-93
ref|XP_006470786.1| PREDICTED: uncharacterized protein LOC102629...   348   5e-93
ref|XP_011009421.1| PREDICTED: uncharacterized protein LOC105114...   347   7e-93
ref|XP_007023216.1| Uncharacterized protein isoform 1 [Theobroma...   345   2e-92
ref|XP_012442875.1| PREDICTED: uncharacterized protein LOC105767...   345   3e-92
ref|XP_012442874.1| PREDICTED: uncharacterized protein LOC105767...   338   4e-90
ref|XP_012078851.1| PREDICTED: uncharacterized protein LOC105639...
336 2e-89 ref|XP_010104208.1| hypothetical protein L484_002408 [Morus nota... 334 6e-89 >ref|XP_011092235.1| PREDICTED: uncharacterized protein LOC105172486 [Sesamum indicum] Length = 503 Score = 438 bits (1127), Expect = e-120 Identities = 232/394 (58%), Positives = 267/394 (67%), Gaps = 50/394 (12%) Frame = -2 Query: 1068 QVRRMLRLSDEENWRIRDFQEVHEEAKNRGFGRVFRSPTLFEDMVKCILLCNCQWSRTLS 889 QVRRMLRLS+ EN R+ +F E+H+EAK RGFGRVFRSPTLFEDMVKCILLCNCQWSRTLS Sbjct: 101 QVRRMLRLSEAENRRMNEFHELHKEAKGRGFGRVFRSPTLFEDMVKCILLCNCQWSRTLS 160 Query: 888 MAKAXXXXXXXXXXXXXXXXLANAANKTISGCQTVETSQFVPKTPAGKESKKNPGVRKSS 709 MA+A A A N TIS CQT E FVPKTPA KESK+ GVRK S Sbjct: 161 MAQALCELQLELQHPLSSAANAMAENGTISSCQTTEMKHFVPKTPAVKESKRRLGVRKCS 220 Query: 708 INSANRFAEVKETEENANLERSVQISDCFQ------------------------------ 619 IN + +A++ E S +IS+C Q Sbjct: 221 INLESGYADILAVEAAERKTSSAEISECSQETGKLTPTFTSPDVKDFLQKSDSWQTSTSD 280 Query: 618 --PLEGKEVS------------------SCTKIGNFPSPRELVSLDEKFLAKRCNLGYRA 499 PLEG E + T IGNFPSPREL LD KFLA+RC+LGYRA Sbjct: 281 LLPLEGPEGKPDSSFVPVLQTLVETEGYAGTAIGNFPSPRELAGLDVKFLARRCSLGYRA 340 Query: 498 GRIMNLAREIVDGRVRLRELEDICGTLSLPEYDKLAEKLKVIDGFGPFTCANVLMCMGFY 319 R++NLA+++++GR+ L ELE C TL+L +YDKLAEKL+ IDGFGPFTCANVLMCMGFY Sbjct: 341 ARVINLAQQVIEGRIPLTELEYACDTLNLSKYDKLAEKLRAIDGFGPFTCANVLMCMGFY 400 Query: 318 HVIPTDSETIRHLEQVHAKSSMIRTVQRDVEVIYGKYAPFQFLAYWSEVWQFYEEWFGKL 139 HV+PTDSETIRHL+QVHAKSS I+TVQ DVE IYGKYAPFQFLAYWSE+W FYEEWFG L Sbjct: 401 HVVPTDSETIRHLKQVHAKSSTIQTVQGDVEKIYGKYAPFQFLAYWSEIWHFYEEWFGNL 460 Query: 138 SEMSPSDYKLITAANMRPKRNGKNKRIKLSVADI 37 SEM S YKLITAANMRPK N ++KR+KLS D+ Sbjct: 461 SEMHHSSYKLITAANMRPKTN-RSKRMKLSPKDM 493 >ref|XP_009783127.1| PREDICTED: uncharacterized protein LOC104231771 isoform X2 [Nicotiana sylvestris] Length = 480 Score = 419 bits (1076), Expect = e-114 Identities = 213/350 (60%), Positives = 255/350 (72%), Gaps = 7/350 (2%) Frame = -2 Query: 1068 QVRRMLRLSDEENWRIRDFQEVHEEAKNRGFGRVFRSPTLFEDMVKCILLCNCQWSRTLS 889 QVRRMLRLS EEN R+R FQE+ EAK 
RGFGRVFRSPTLFEDMVKC+LLCNCQWSRTLS Sbjct: 127 QVRRMLRLSVEENERVRKFQEICGEAKERGFGRVFRSPTLFEDMVKCVLLCNCQWSRTLS 186 Query: 888 MAKAXXXXXXXXXXXXXXXXLANAANKTISGCQTVETSQFVPKTPAGKESKKNPGVRKSS 709 MA+A L+ A N T ++ F PKTPAGKES+K GV Sbjct: 187 MAEALCELQLELNRPSSAVLLSAADNLNQFKGVTAKSEHFSPKTPAGKESRKRAGVYGCC 246 Query: 708 INSANRFAEVKETEENANLERSVQISDCF-------QPLEGKEVSSCTKIGNFPSPRELV 550 N R EV+E + + + ++ + P +E+SS +IGNFPSP+EL Sbjct: 247 RNLLERLTEVEEIVDEGKADATTEVCEVSTSAPFNADPSVDRELSSFNQIGNFPSPKELA 306 Query: 549 SLDEKFLAKRCNLGYRAGRIMNLAREIVDGRVRLRELEDICGTLSLPEYDKLAEKLKVID 370 LDE FLAKRC LGYRAGRI+ LA+ IV+GR+ L+ELE+ C SL YDK+AE+L+ ID Sbjct: 307 GLDESFLAKRCGLGYRAGRIIKLAKGIVEGRISLKELEEACCNPSLSNYDKMAEQLREID 366 Query: 369 GFGPFTCANVLMCMGFYHVIPTDSETIRHLEQVHAKSSMIRTVQRDVEVIYGKYAPFQFL 190 GFGPFTCANVLMC+G+ HVIPTDSETIRHL+QVHA++S I+ VQ+DVE IY KYAPFQFL Sbjct: 367 GFGPFTCANVLMCLGYCHVIPTDSETIRHLKQVHARTSSIQKVQKDVEKIYAKYAPFQFL 426 Query: 189 AYWSEVWQFYEEWFGKLSEMSPSDYKLITAANMRPKRNGKNKRIKLSVAD 40 AYWSEVW FYEEWFGK+SEM SDYKLITAANMRPKR+GK K++K++ A+ Sbjct: 427 AYWSEVWHFYEEWFGKVSEMPHSDYKLITAANMRPKRSGKCKKLKITPAE 476 >ref|XP_009783126.1| PREDICTED: uncharacterized protein LOC104231771 isoform X1 [Nicotiana sylvestris] Length = 502 Score = 417 bits (1072), Expect = e-114 Identities = 220/372 (59%), Positives = 259/372 (69%), Gaps = 29/372 (7%) Frame = -2 Query: 1068 QVRRMLRLSDEENWRIRDFQEVHEEAKNRGFGRVFRSPTLFEDMVKCILLCNCQWSRTLS 889 QVRRMLRLS EEN R+R FQE+ EAK RGFGRVFRSPTLFEDMVKC+LLCNCQWSRTLS Sbjct: 127 QVRRMLRLSVEENERVRKFQEICGEAKERGFGRVFRSPTLFEDMVKCVLLCNCQWSRTLS 186 Query: 888 MAKAXXXXXXXXXXXXXXXXLANAANKTISGCQTVETSQFVPKTPAGKESKKNPGVRKSS 709 MA+A L+ A N T ++ F PKTPAGKES+K GV Sbjct: 187 MAEALCELQLELNRPSSAVLLSAADNLNQFKGVTAKSEHFSPKTPAGKESRKRAGVYGCC 246 Query: 708 INSANRFAEVKETEENANLERSV------------QISDCFQ-----------------P 616 N R EV+E + + SV QI+D FQ P Sbjct: 247 RNLLERLTEVEEIVDEGKADVSVKPAFSDGKEAVLQITDAFQATTEVCEVSTSAPFNADP 306 Query: 615 LEGKEVSSCTKIGNFPSPRELVSLDEKFLAKRCNLGYRAGRIMNLAREIVDGRVRLRELE 436 +E+SS 
+IGNFPSP+EL LDE FLAKRC LGYRAGRI+ LA+ IV+GR+ L+ELE Sbjct: 307 SVDRELSSFNQIGNFPSPKELAGLDESFLAKRCGLGYRAGRIIKLAKGIVEGRISLKELE 366 Query: 435 DICGTLSLPEYDKLAEKLKVIDGFGPFTCANVLMCMGFYHVIPTDSETIRHLEQVHAKSS 256 + C SL YDK+AE+L+ IDGFGPFTCANVLMC+G+ HVIPTDSETIRHL+QVHA++S Sbjct: 367 EACCNPSLSNYDKMAEQLREIDGFGPFTCANVLMCLGYCHVIPTDSETIRHLKQVHARTS 426 Query: 255 MIRTVQRDVEVIYGKYAPFQFLAYWSEVWQFYEEWFGKLSEMSPSDYKLITAANMRPKRN 76 I+ VQ+DVE IY KYAPFQFLAYWSEVW FYEEWFGK+SEM SDYKLITAANMRPKR+ Sbjct: 427 SIQKVQKDVEKIYAKYAPFQFLAYWSEVWHFYEEWFGKVSEMPHSDYKLITAANMRPKRS 486 Query: 75 GKNKRIKLSVAD 40 GK K++K++ A+ Sbjct: 487 GKCKKLKITPAE 498 >ref|XP_006358484.1| PREDICTED: uncharacterized protein LOC102593287 isoform X1 [Solanum tuberosum] gi|565385158|ref|XP_006358485.1| PREDICTED: uncharacterized protein LOC102593287 isoform X2 [Solanum tuberosum] Length = 485 Score = 403 bits (1035), Expect = e-109 Identities = 211/371 (56%), Positives = 259/371 (69%), Gaps = 28/371 (7%) Frame = -2 Query: 1068 QVRRMLRLSDEENWRIRDFQEVHEEAKNRGFGRVFRSPTLFEDMVKCILLCNCQWSRTLS 889 QVRRM+RLS EEN R++ FQE+ EAK+RG GRVFRSPTLFEDMVKC+LLCNCQWSRTLS Sbjct: 111 QVRRMVRLSVEENKRVKQFQEICGEAKDRGLGRVFRSPTLFEDMVKCMLLCNCQWSRTLS 170 Query: 888 MAKAXXXXXXXXXXXXXXXXLANAANKTISGCQTVETSQFVPKTPAGKESKKNPGVRKSS 709 MA+A + N+ T ++ F P+TPAGKES+K G S Sbjct: 171 MAEALCELQLELNCPSSAASFPDPDNQNQLKGVTFKSEHFTPRTPAGKESRKRAGAYGCS 230 Query: 708 INSANRFAEVKETEE--------------------NANLER-SVQISDC-------FQPL 613 R EV+E + +NL R + ++ D P Sbjct: 231 RKLLERLTEVEEIIDIGKPGVTVTPAFSVGEEVLKKSNLCRDTTEVCDVGTSAPFNLDPS 290 Query: 612 EGKEVSSCTKIGNFPSPRELVSLDEKFLAKRCNLGYRAGRIMNLAREIVDGRVRLRELED 433 E +++SS ++GNFPSP+EL SLDE FLAKRC LGYRAGRI+ LA+ IV+G ++L+ELE+ Sbjct: 291 EDRKLSSFNQLGNFPSPKELASLDESFLAKRCGLGYRAGRIIKLAKGIVEGSIQLKELEE 350 Query: 432 ICGTLSLPEYDKLAEKLKVIDGFGPFTCANVLMCMGFYHVIPTDSETIRHLEQVHAKSSM 253 C SL +YDK+AE+L+ IDGFGPFTCANVLMC+G+YHVIPTDSETIRHL+QVHA++S Sbjct: 351 ACSNPSLSDYDKMAEQLREIDGFGPFTCANVLMCLGYYHVIPTDSETIRHLKQVHARTST 410 Query: 252 
IRTVQRDVEVIYGKYAPFQFLAYWSEVWQFYEEWFGKLSEMSPSDYKLITAANMRPKRNG 73 I+ VQRDVE IYGKYAPFQFLAYWSEVW FYEE FGKLSEM S+YKLITAANMR KRNG Sbjct: 411 IQNVQRDVENIYGKYAPFQFLAYWSEVWHFYEERFGKLSEMPHSEYKLITAANMRRKRNG 470 Query: 72 KNKRIKLSVAD 40 K K++K++ A+ Sbjct: 471 KCKKLKITSAE 481 >emb|CDP17763.1| unnamed protein product [Coffea canephora] Length = 430 Score = 401 bits (1030), Expect = e-109 Identities = 204/364 (56%), Positives = 251/364 (68%), Gaps = 20/364 (5%) Frame = -2 Query: 1071 DQVRRMLRLSDEENWRIRDFQEVHEEAKNRGFGRVFRSPTLFEDMVKCILLCNCQWSRTL 892 +QVRRMLRLS+E+N +RDFQE+H EAK R FGR+FRSPTLFEDM+KCILLCNCQW R+L Sbjct: 67 NQVRRMLRLSEEDNRTVRDFQEIHTEAKEREFGRIFRSPTLFEDMIKCILLCNCQWPRSL 126 Query: 891 SMAKAXXXXXXXXXXXXXXXXLANAANKTISGCQTVETSQFVPKTPAGKESKKNPGVRKS 712 SMA A N T S QT ++ F+PKTPAGKE+K+ V+K Sbjct: 127 SMATALCELQWELQYPLSRD---KVHNDTDSRSQTADSEHFIPKTPAGKETKRKMEVQKC 183 Query: 711 SINSANRFAEVKETEENANLERSV--------------------QISDCFQPLEGKEVSS 592 N AN+F + E ++ + Q+ D F +G E + Sbjct: 184 PENLANKFTDANAVGEEVSVFKMACDHVLHCSKMVGDGRLINFPQLDD-FSCSDGSEPYN 242 Query: 591 CTKIGNFPSPRELVSLDEKFLAKRCNLGYRAGRIMNLAREIVDGRVRLRELEDICGTLSL 412 C +IGNFPSP EL SLDE LA+RCNLGYRA RI+ LA+ +V G ++L ELE+ +L Sbjct: 243 CCRIGNFPSPNELASLDESVLARRCNLGYRASRILKLAQLVVQGGIKLGELEETGRQPTL 302 Query: 411 PEYDKLAEKLKVIDGFGPFTCANVLMCMGFYHVIPTDSETIRHLEQVHAKSSMIRTVQRD 232 Y+ LAE+LK IDGFGPFTCANVLMCMGFYHVIP+DSETIRH++QVHA+ + I+ V D Sbjct: 303 SSYNILAEQLKEIDGFGPFTCANVLMCMGFYHVIPSDSETIRHMKQVHARQTTIKAVDGD 362 Query: 231 VEVIYGKYAPFQFLAYWSEVWQFYEEWFGKLSEMSPSDYKLITAANMRPKRNGKNKRIKL 52 +E+IYGKYAPFQFLAYWSEVW FYE+WFGK SEM P++YKLITA NMRPKRN K KR K+ Sbjct: 363 LEIIYGKYAPFQFLAYWSEVWSFYEDWFGKPSEMPPTNYKLITATNMRPKRNAKCKRKKI 422 Query: 51 SVAD 40 SV++ Sbjct: 423 SVSE 426 >ref|XP_004230386.1| PREDICTED: uncharacterized protein LOC101247758 [Solanum lycopersicum] Length = 483 Score = 400 bits (1029), Expect = e-109 Identities = 212/368 (57%), Positives = 254/368 (69%), Gaps = 28/368 (7%) Frame = -2 Query: 1068 
QVRRMLRLSDEENWRIRDFQEVHEEAKNRGFGRVFRSPTLFEDMVKCILLCNCQWSRTLS 889 QVRRM+RLS EEN R++ FQE+ EAK RGFGRVFRSPTLFEDMVKC+LLCNCQWSRTLS Sbjct: 109 QVRRMVRLSVEENKRVKLFQEICGEAKERGFGRVFRSPTLFEDMVKCMLLCNCQWSRTLS 168 Query: 888 MAKAXXXXXXXXXXXXXXXXLANAANKTISGCQTVETSQFVPKTPAGKESKKNPGVRKSS 709 MA+A + N+ T ++ F P+TPAGKE +K G S Sbjct: 169 MAEALCELQLELNCPSSAASFPDPDNQNQLKGVTSKSEHFTPRTPAGKELRKRAGAYGCS 228 Query: 708 INSANRFAEVKETEE--------------------NANL--------ERSVQISDCFQPL 613 N R EV+E + +NL E SV P Sbjct: 229 RNLLERLNEVEEIVDIDKPGVTVTPAFSVGEEVLQKSNLCQDTTEVWEVSVSAPLNPDPS 288 Query: 612 EGKEVSSCTKIGNFPSPRELVSLDEKFLAKRCNLGYRAGRIMNLAREIVDGRVRLRELED 433 E +++SS ++GNFPSP++L SLDE FLAKRC LGYRAGRI+ LA+ IV+G ++L ELE+ Sbjct: 289 EDRKLSSFNQLGNFPSPKQLASLDESFLAKRCGLGYRAGRIIKLAKGIVEGSIQLNELEE 348 Query: 432 ICGTLSLPEYDKLAEKLKVIDGFGPFTCANVLMCMGFYHVIPTDSETIRHLEQVHAKSSM 253 C SL YDK+AE+L+ IDGFGPFTCANVLMC+G+YHVIPTDSETIRHL+QVHA++S Sbjct: 349 ACSNPSLSNYDKMAEQLREIDGFGPFTCANVLMCLGYYHVIPTDSETIRHLKQVHARTST 408 Query: 252 IRTVQRDVEVIYGKYAPFQFLAYWSEVWQFYEEWFGKLSEMSPSDYKLITAANMRPKRNG 73 I+ VQRDVE IYGKYAPFQFLAYWSEVW FYEE FGKLSEM S+YKLITAANMRPKRNG Sbjct: 409 IQNVQRDVENIYGKYAPFQFLAYWSEVWHFYEERFGKLSEMPHSEYKLITAANMRPKRNG 468 Query: 72 KNKRIKLS 49 K K++K++ Sbjct: 469 KCKKLKIA 476 >ref|XP_012842312.1| PREDICTED: uncharacterized protein LOC105962546 [Erythranthe guttatus] Length = 311 Score = 373 bits (957), Expect = e-100 Identities = 198/342 (57%), Positives = 237/342 (69%), Gaps = 3/342 (0%) Frame = -2 Query: 1056 MLRLSDEENWRIRDFQEVHEEAKNRGFGRVFRSPTLFEDMVKCILLCNCQWSRTLSMAKA 877 MLRLSD+EN R+ DFQ+VHE+AK GFGRVFRSPTLFEDM+KC+LLCNCQWSRTLSMA++ Sbjct: 1 MLRLSDQENRRVVDFQKVHEKAKETGFGRVFRSPTLFEDMIKCMLLCNCQWSRTLSMAQS 60 Query: 876 XXXXXXXXXXXXXXXXLANAANKTISGCQTVETSQFVPKTPAGKESKKNPGVRKSSINSA 697 NA+N T E + F PKTPA KES K Sbjct: 61 LCELQSELQNPLP-----NASNITAPKSPKTEVNLFAPKTPARKESNKK----------- 104 Query: 696 NRFAEVKETEENANLERSVQISDCFQPLEGKEVSSCTKIGNFPSPRELVSLDEKFLAKRC 517 + LE DC+ + T I NFPSP EL +L+ +FLAKRC Sbjct: 105 
------------SRLE-----VDCY---------ASTTIANFPSPSELANLEVEFLAKRC 138 Query: 516 NLGYRAGRIMNLAREIVDGRVRLRELEDIC--GTLS-LPEYDKLAEKLKVIDGFGPFTCA 346 NLGYRA R++NLAR +++G V+L E+E C T+S L +YDKLAEKL+VIDGFGPFTCA Sbjct: 139 NLGYRASRVINLARGVIEGSVKLTEIEFACEYDTVSNLSDYDKLAEKLRVIDGFGPFTCA 198 Query: 345 NVLMCMGFYHVIPTDSETIRHLEQVHAKSSMIRTVQRDVEVIYGKYAPFQFLAYWSEVWQ 166 NVLMC+G+YHVIPTDSETIRHL+QVHAK+S +T++RD+E IYGKYAPFQFLAYWSEVW+ Sbjct: 199 NVLMCIGYYHVIPTDSETIRHLKQVHAKTSTKKTIERDLEDIYGKYAPFQFLAYWSEVWR 258 Query: 165 FYEEWFGKLSEMSPSDYKLITAANMRPKRNGKNKRIKLSVAD 40 FYEEWFG LSEM S YKLITAANMRPK+ +KR K+ + D Sbjct: 259 FYEEWFGNLSEMPRSSYKLITAANMRPKKASGSKRTKVPLED 300 >ref|XP_002304112.2| hypothetical protein POPTR_0003s03710g [Populus trichocarpa] gi|550342350|gb|EEE79091.2| hypothetical protein POPTR_0003s03710g [Populus trichocarpa] Length = 489 Score = 355 bits (910), Expect = 4e-95 Identities = 202/377 (53%), Positives = 241/377 (63%), Gaps = 39/377 (10%) Frame = -2 Query: 1068 QVRRMLRLSDEENWRIRDFQEVHEEAKNR-------GFG-RVFRSPTLFEDMVKCILLCN 913 QV RMLRLS+ + R+F+++ E A GFG RVFRSPTLFEDMVKCILLCN Sbjct: 113 QVVRMLRLSETDERNAREFRKIAEAAAAEENNSWLTGFGGRVFRSPTLFEDMVKCILLCN 172 Query: 912 CQWSRTLSMAKAXXXXXXXXXXXXXXXXLANAANKTISGCQTVETSQFVPKTPAGKESKK 733 CQW RTLSMA+A +A A N T+ F+P T AGKESK+ Sbjct: 173 CQWPRTLSMARALCELQCELQCKSSGVFVAQAVNATVKNKCNDTAHNFIPNTSAGKESKR 232 Query: 732 NPGVRKSSINSANRFAEVKET-EENANLER-SVQISDCFQPLEGKEVSSCTK-------- 583 N K + N A++ E + E +ANL+ S I + LE E SC + Sbjct: 233 NIRASKVTKNLASKIVETETLLEADANLKTDSAHIGR--ETLESVENDSCARCSSRHGSD 290 Query: 582 --------------------IGNFPSPRELVSLDEKFLAKRCNLGYRAGRIMNLAREIVD 463 I NFPSPREL +LDE FLAKRCNLGYRA RI+ LA+ IV+ Sbjct: 291 SWAPDSLQSQHGIQPGVNKMICNFPSPRELANLDESFLAKRCNLGYRAIRIIKLAQSIVE 350 Query: 462 GRVRLRELEDICGT-LSLPEYDKLAEKLKVIDGFGPFTCANVLMCMGFYHVIPTDSETIR 286 GR+ LRE+E+ C S Y+KLA++ + IDGFGPFTCANVLMCMGFYH+IPTDSET+R Sbjct: 351 GRIPLREVEEDCANGASSSCYNKLADQFRQIDGFGPFTCANVLMCMGFYHIIPTDSETVR 410 Query: 285 
HLEQVHAKSSMIRTVQRDVEVIYGKYAPFQFLAYWSEVWQFYEEWFGKLSEMSPSDYKLI 106 HL+QVHAK S I+TVQRDVE IYGKYAPFQFLAYW+E+W FYE+ FGKLSE+ SDYKLI Sbjct: 411 HLKQVHAKKSTIQTVQRDVEEIYGKYAPFQFLAYWAELWHFYEKRFGKLSEIPTSDYKLI 470 Query: 105 TAANMRPKRNGKNKRIK 55 TA+NMR K KNKR K Sbjct: 471 TASNMRSKGGQKNKRTK 487 >ref|XP_011009424.1| PREDICTED: uncharacterized protein LOC105114550 isoform X2 [Populus euphratica] Length = 483 Score = 353 bits (906), Expect = 1e-94 Identities = 199/371 (53%), Positives = 238/371 (64%), Gaps = 33/371 (8%) Frame = -2 Query: 1068 QVRRMLRLSDEENWRIRDFQEVHEEAKNR---GFG-RVFRSPTLFEDMVKCILLCNCQWS 901 QV RMLRLS+ + R+F+++ E N GFG RVFRSPTLFEDMVKCILLCNCQW Sbjct: 111 QVVRMLRLSETDERNAREFRKMAEAENNSWLTGFGGRVFRSPTLFEDMVKCILLCNCQWP 170 Query: 900 RTLSMAKAXXXXXXXXXXXXXXXXLANAANKTISGCQTVETSQFVPKTPAGKESKKNPGV 721 RTLSMA+A +A A N T+ F+P T AGKESK+N Sbjct: 171 RTLSMARALCELQCELQCKSSGVFVAQAVNATVKNKCNDTAHNFIPNTSAGKESKRNIRE 230 Query: 720 RKSSINSANRFAEVKET-EENANLE-----------RSVQISDCFQPLEGKEVSSCTK-- 583 K S N A++ E E +ANL+ SV+ C + + SC Sbjct: 231 SKVSKNLASKIVETGTLLEADANLKTDSAHIGRETLESVENDSCARCISCHGSDSCAPDS 290 Query: 582 --------------IGNFPSPRELVSLDEKFLAKRCNLGYRAGRIMNLAREIVDGRVRLR 445 I NFPSPREL +LDE FLAKRCNLGYRA RI+ LA+ IV+GR+ LR Sbjct: 291 LQSQHGIQPGVNKMICNFPSPRELANLDESFLAKRCNLGYRAIRIIKLAQSIVEGRIPLR 350 Query: 444 ELEDICGT-LSLPEYDKLAEKLKVIDGFGPFTCANVLMCMGFYHVIPTDSETIRHLEQVH 268 E+E+ C S Y+KLA++ + IDGFGPFTCANVLMC+GFYH+IPTDSET+RHL+QVH Sbjct: 351 EIEEGCANGASSSCYNKLADQFRQIDGFGPFTCANVLMCLGFYHIIPTDSETVRHLKQVH 410 Query: 267 AKSSMIRTVQRDVEVIYGKYAPFQFLAYWSEVWQFYEEWFGKLSEMSPSDYKLITAANMR 88 AK S I+TVQRDVE IYG YAPFQFLAYW+E+W FYE+ FGKLSE+ SDYKLITA+NMR Sbjct: 411 AKKSTIQTVQRDVEEIYGNYAPFQFLAYWAELWHFYEKRFGKLSEIPISDYKLITASNMR 470 Query: 87 PKRNGKNKRIK 55 K KNKR K Sbjct: 471 SKGGHKNKRTK 481 >ref|XP_010252239.1| PREDICTED: uncharacterized protein LOC104593879 [Nelumbo nucifera] Length = 493 Score = 352 bits (903), Expect = 3e-94 Identities = 196/379 (51%), Positives = 232/379 (61%), Gaps = 46/379 (12%) Frame = 
-2 Query: 1068 QVRRMLRLSDEENWRIRDFQEVHEEAKNRGFGRVFRSPTLFEDMVKCILLCNCQWSRTLS 889 QV RMLRLSD + IR+F ++H EAK RGFGRVFRSPTLFEDMVKCILLCNCQW RTL+ Sbjct: 114 QVTRMLRLSDSDERNIREFHKIHHEAKERGFGRVFRSPTLFEDMVKCILLCNCQWPRTLA 173 Query: 888 MAKAXXXXXXXXXXXXXXXXLANAANKTISGCQTVETSQFVPKTPAGKESKKNPGVRKSS 709 MAKA + ++ S C + F PKTP G++SKK V K S Sbjct: 174 MAKALFELQSDLKCNSLGCSDSQGSSLD-SRCSKAKYEDFFPKTPIGRDSKKRRAVHKIS 232 Query: 708 INSANRFAEVKETEENANL---ERSVQISDCFQ-----------PLEGKEVSS------- 592 +N ++F + E E A++ S + C Q PLEG E Sbjct: 233 LNLDSKFKKA-ENELEADVYGKTNSDHPTQCLQLKEKISATLASPLEGDESQEHCCYNKQ 291 Query: 591 -CTK------------------------IGNFPSPRELVSLDEKFLAKRCNLGYRAGRIM 487 CTK IGNFP+PRE+ L+E LAKRCNLGYRA RI+ Sbjct: 292 LCTKVKVDANPALDLQFSEDKVSGTNGKIGNFPNPREIAGLNEALLAKRCNLGYRASRIL 351 Query: 486 NLAREIVDGRVRLRELEDICGTLSLPEYDKLAEKLKVIDGFGPFTCANVLMCMGFYHVIP 307 LA+ IV G+++LRELE+ C S Y L K + IDGFGPFTCANVLMCMGFY +IP Sbjct: 352 KLAQSIVQGKLQLRELEEDCNGESSSLYAMLFNKFREIDGFGPFTCANVLMCMGFYEMIP 411 Query: 306 TDSETIRHLEQVHAKSSMIRTVQRDVEVIYGKYAPFQFLAYWSEVWQFYEEWFGKLSEMS 127 DSETIRHL+QVHA+ S I++V RDVE IYG YAPFQFLAYWSE+W FY FGKLSEM Sbjct: 412 VDSETIRHLKQVHARQSTIQSVHRDVEKIYGGYAPFQFLAYWSELWHFYGARFGKLSEML 471 Query: 126 PSDYKLITAANMRPKRNGK 70 PS+Y LITA+NMR KR K Sbjct: 472 PSEYHLITASNMRTKRTNK 490 >gb|KDO53849.1| hypothetical protein CISIN_1g014334mg [Citrus sinensis] Length = 426 Score = 349 bits (896), Expect = 2e-93 Identities = 195/369 (52%), Positives = 236/369 (63%), Gaps = 30/369 (8%) Frame = -2 Query: 1068 QVRRMLRLSDEENWRIRDFQEV-----HEEAKNRGF-----GRVFRSPTLFEDMVKCILL 919 QV+RMLRLS+ + +RDF+ + EE + + GRVFRSPTLFEDMVKC+LL Sbjct: 74 QVKRMLRLSEADERNVRDFKRIVRQVAQEEGEESQYMTDFSGRVFRSPTLFEDMVKCMLL 133 Query: 918 CNCQWSRTLSMAKAXXXXXXXXXXXXXXXXLANAANKTISGCQTVETSQFVPKTPAGKES 739 CNCQW RTLSMA+A + C + F+P+TPAGKES Sbjct: 134 CNCQWPRTLSMARALCELQWE-----------------LQHCSPSISEDFIPQTPAGKES 176 Query: 738 KKNPGVRKSSINSANRFAEVKETEEN-------------ANLERSVQISDCFQPLEGKEV 598 K+ V K + +R AE K + E+ N++ S +D L G Sbjct: 177 
KRRQKVSKVASKLTSRIAESKASSEDYMNLKLDCAGVLEENVQPSFPQNDIESDLHGLNE 236 Query: 597 SSCT-------KIGNFPSPRELVSLDEKFLAKRCNLGYRAGRIMNLAREIVDGRVRLREL 439 S T +IGNFPSPREL +LDE FLAKRCNLGYRAGRI+ LAR IVDG+++LREL Sbjct: 237 LSTTDPPSARDRIGNFPSPRELANLDESFLAKRCNLGYRAGRILKLARGIVDGQIQLREL 296 Query: 438 EDICGTLSLPEYDKLAEKLKVIDGFGPFTCANVLMCMGFYHVIPTDSETIRHLEQVHAKS 259 ED+C SL Y KLAE+L I+GFGPFT NVL+C+GFYHVIPTDSETIRHL+QVHA++ Sbjct: 297 EDMCNEASLTAYVKLAEQLSQINGFGPFTRNNVLVCIGFYHVIPTDSETIRHLKQVHARN 356 Query: 258 SMIRTVQRDVEVIYGKYAPFQFLAYWSEVWQFYEEWFGKLSEMSPSDYKLITAANMRPKR 79 +TVQ E IYGKYAPFQFLAYWSE+W FYE+ FGKLSEM SDYKLITA+NM K Sbjct: 357 CTSKTVQMIAESIYGKYAPFQFLAYWSELWHFYEKRFGKLSEMPYSDYKLITASNMGIKN 416 Query: 78 NGKNKRIKL 52 K KR K+ Sbjct: 417 IRKVKRTKI 425 >ref|XP_006431360.1| hypothetical protein CICLE_v10001110mg [Citrus clementina] gi|557533482|gb|ESR44600.1| hypothetical protein CICLE_v10001110mg [Citrus clementina] Length = 454 Score = 348 bits (894), Expect = 3e-93 Identities = 195/369 (52%), Positives = 238/369 (64%), Gaps = 30/369 (8%) Frame = -2 Query: 1068 QVRRMLRLSDEENWRIRDFQEV-----HEEAKNRGF-----GRVFRSPTLFEDMVKCILL 919 QV+RMLRLS+ + +RDF+ + EE + + GRVFRSPTLFEDMVKC+LL Sbjct: 102 QVKRMLRLSEADERNVRDFKRIVRQVAQEEGEESQYMTDFSGRVFRSPTLFEDMVKCMLL 161 Query: 918 CNCQWSRTLSMAKAXXXXXXXXXXXXXXXXLANAANKTISGCQTVETSQFVPKTPAGKES 739 CNCQW RTL+MA+A + C + F+P+TPAGKES Sbjct: 162 CNCQWPRTLNMARALCELQWE-----------------LQHCSPSISEDFIPQTPAGKES 204 Query: 738 KKNPGVRKSSINSANRFAEVKETEEN---------ANLERSVQIS----DCFQPLEGKEV 598 K+ V K + +R AE K + E+ LE +VQ S D L G Sbjct: 205 KRRQKVSKVASKLTSRIAESKASSEDDMNLKLDCTGALEENVQPSFPRNDIESDLHGLNE 264 Query: 597 -------SSCTKIGNFPSPRELVSLDEKFLAKRCNLGYRAGRIMNLAREIVDGRVRLREL 439 S+C +IGNFPSPREL +LDE FLAKRCNLGYRAGRI+ LA+ IVDG+++LREL Sbjct: 265 LSTTDPPSACDRIGNFPSPRELANLDESFLAKRCNLGYRAGRILKLAQGIVDGQIQLREL 324 Query: 438 EDICGTLSLPEYDKLAEKLKVIDGFGPFTCANVLMCMGFYHVIPTDSETIRHLEQVHAKS 259 ED C SL Y+KLAE+L I+GFGPFT NVL+C+GFYHVIPTDSETIRHL+QVHA++ Sbjct: 325 
EDTCNEASLTTYNKLAEQLSQINGFGPFTRNNVLVCIGFYHVIPTDSETIRHLKQVHARN 384 Query: 258 SMIRTVQRDVEVIYGKYAPFQFLAYWSEVWQFYEEWFGKLSEMSPSDYKLITAANMRPKR 79 +TVQ E IYGKY+PFQFLAYWSE+W FYE+ FGKLSEM SDYKLITA+NM K Sbjct: 385 CTSKTVQIIAESIYGKYSPFQFLAYWSELWHFYEKRFGKLSEMPYSDYKLITASNMGIKN 444 Query: 78 NGKNKRIKL 52 K KR K+ Sbjct: 445 IRKVKRTKI 453 >gb|KHG17286.1| DNA-3-methyladenine glycosylase 1 [Gossypium arboreum] Length = 451 Score = 348 bits (893), Expect = 4e-93 Identities = 190/353 (53%), Positives = 234/353 (66%), Gaps = 9/353 (2%) Frame = -2 Query: 1071 DQVRRMLRLSDEENWRIRDFQEV------HEEAKN--RGF-GRVFRSPTLFEDMVKCILL 919 +QV RMLRLS+ E ++R+F+ + EEA R F GRVFRSPTLFEDMVKCILL Sbjct: 125 NQVSRMLRLSESEENKVREFRSIVEALHGEEEATEYLRSFSGRVFRSPTLFEDMVKCILL 184 Query: 918 CNCQWSRTLSMAKAXXXXXXXXXXXXXXXXLANAANKTISGCQTVETSQFVPKTPAGKES 739 CNCQ+SRTLSMAKA IS + E F+PKTPAGKES Sbjct: 185 CNCQFSRTLSMAKALCELQFEI-------------QHQISSSKAAE-DDFIPKTPAGKES 230 Query: 738 KKNPGVRKSSINSANRFAEVKETEENANLERSVQISDCFQPLEGKEVSSCTKIGNFPSPR 559 K+ V K SI ++ E K ++L+ S ++ D +G+FPSP Sbjct: 231 KRKLRVSKVSIRLESKLTESKVDNSVSDLQLSQELHDF------------VGMGSFPSPE 278 Query: 558 ELVSLDEKFLAKRCNLGYRAGRIMNLAREIVDGRVRLRELEDICGTLSLPEYDKLAEKLK 379 EL LDE FLAKRCNLGYRA RI+ LA+ +V G ++L +LE+ C SL YDKL+++L+ Sbjct: 279 ELAKLDESFLAKRCNLGYRASRILKLAQGVVQGNIQLTQLEEDCKETSLSSYDKLSQRLR 338 Query: 378 VIDGFGPFTCANVLMCMGFYHVIPTDSETIRHLEQVHAKSSMIRTVQRDVEVIYGKYAPF 199 IDGFGPFTCANVLMCMGFYHVIP DSETIRHL+QVH+KS ++TV RDVE+IY KYAPF Sbjct: 339 QIDGFGPFTCANVLMCMGFYHVIPADSETIRHLKQVHSKSCTVQTVGRDVELIYAKYAPF 398 Query: 198 QFLAYWSEVWQFYEEWFGKLSEMSPSDYKLITAANMRPKRNGKNKRIKLSVAD 40 QFLAYW+E+W FY + FGKLSE+ SDYKLITA+NM+ K+ KR K S + Sbjct: 399 QFLAYWAEMWHFYGQRFGKLSELPVSDYKLITASNMKHKKIATRKRSKTSAEE 451 >ref|XP_006470786.1| PREDICTED: uncharacterized protein LOC102629917 isoform X1 [Citrus sinensis] Length = 454 Score = 348 bits (892), Expect = 5e-93 Identities = 194/370 (52%), Positives = 237/370 (64%), Gaps = 30/370 (8%) Frame = -2 Query: 1068 
QVRRMLRLSDEENWRIRDFQEV-----HEEAKNRGF-----GRVFRSPTLFEDMVKCILL 919 QV+RMLRLS+ + +R+F+ + EE + + GRVFRSPTLFEDMVKC+LL Sbjct: 102 QVKRMLRLSEADERNVREFKRIVRQVAQEEGEETQYMEDFSGRVFRSPTLFEDMVKCMLL 161 Query: 918 CNCQWSRTLSMAKAXXXXXXXXXXXXXXXXLANAANKTISGCQTVETSQFVPKTPAGKES 739 CNCQW RTLSMA+A + C + F+P+TPAGKES Sbjct: 162 CNCQWPRTLSMARALCELQWE-----------------LQHCSPSISEDFIPQTPAGKES 204 Query: 738 KKNPGVRKSSINSANRFAEVKETEEN-------------ANLERSVQISDCFQPLEGKEV 598 K+ V K + +R AE K + E+ N++ S +D L G Sbjct: 205 KRRQKVSKVASKLTSRIAESKASSEDYMNLKLDCAGVLEENVQPSFPQNDIESDLHGLNE 264 Query: 597 SSCT-------KIGNFPSPRELVSLDEKFLAKRCNLGYRAGRIMNLAREIVDGRVRLREL 439 S T +IGNFPSPREL +LDE FLAKRCNLGYRAGRI+ LAR IVDG+++LREL Sbjct: 265 LSTTDPPSARDRIGNFPSPRELANLDESFLAKRCNLGYRAGRILKLARGIVDGQIQLREL 324 Query: 438 EDICGTLSLPEYDKLAEKLKVIDGFGPFTCANVLMCMGFYHVIPTDSETIRHLEQVHAKS 259 ED+C SL Y KLAE+L I+GFGPFT NVL+C+GFYHVIPTDSETIRHL+QVHA++ Sbjct: 325 EDMCNEASLTAYVKLAEQLSQINGFGPFTRNNVLVCIGFYHVIPTDSETIRHLKQVHARN 384 Query: 258 SMIRTVQRDVEVIYGKYAPFQFLAYWSEVWQFYEEWFGKLSEMSPSDYKLITAANMRPKR 79 +TVQ E IYGKYAPFQFLAYWSE+W FYE+ FGKLSEM SDYKLITA+NM K Sbjct: 385 CTSKTVQMIAESIYGKYAPFQFLAYWSELWHFYEKRFGKLSEMPYSDYKLITASNMGIKN 444 Query: 78 NGKNKRIKLS 49 + KR K+S Sbjct: 445 IRQVKRTKIS 454 >ref|XP_011009421.1| PREDICTED: uncharacterized protein LOC105114550 isoform X1 [Populus euphratica] gi|743930350|ref|XP_011009422.1| PREDICTED: uncharacterized protein LOC105114550 isoform X1 [Populus euphratica] Length = 487 Score = 347 bits (891), Expect = 7e-93 Identities = 199/375 (53%), Positives = 238/375 (63%), Gaps = 37/375 (9%) Frame = -2 Query: 1068 QVRRMLRLSDEENWRIRDFQEVHEEAKNR---GFG-RVFRSPTLFEDMVKCILLCNCQWS 901 QV RMLRLS+ + R+F+++ E N GFG RVFRSPTLFEDMVKCILLCNCQW Sbjct: 111 QVVRMLRLSETDERNAREFRKMAEAENNSWLTGFGGRVFRSPTLFEDMVKCILLCNCQWP 170 Query: 900 RTLSMAKAXXXXXXXXXXXXXXXXLANAANKTISGCQTVETSQFVPKTPAGKESKKNPGV 721 RTLSMA+A +A A N T+ F+P T AGKESK+N Sbjct: 171 RTLSMARALCELQCELQCKSSGVFVAQAVNATVKNKCNDTAHNFIPNTSAGKESKRNIRE 230 Query: 720 
RKSSINSANRFAEVKET-EENANLE-----------RSVQISDCFQPLEGKEVSSCTK-- 583 K S N A++ E E +ANL+ SV+ C + + SC Sbjct: 231 SKVSKNLASKIVETGTLLEADANLKTDSAHIGRETLESVENDSCARCISCHGSDSCAPDS 290 Query: 582 --------------IGNFPSPRELVSLDEKFLAKRCNLGYRAGRIMNLAREIVDGRVRLR 445 I NFPSPREL +LDE FLAKRCNLGYRA RI+ LA+ IV+GR+ LR Sbjct: 291 LQSQHGIQPGVNKMICNFPSPRELANLDESFLAKRCNLGYRAIRIIKLAQSIVEGRIPLR 350 Query: 444 ELEDICGT-LSLPEYDKLAEKLKVIDGFGPFTCANVLMCMGFYHVIPTDSETIRHLEQ-- 274 E+E+ C S Y+KLA++ + IDGFGPFTCANVLMC+GFYH+IPTDSET+RHL+Q Sbjct: 351 EIEEGCANGASSSCYNKLADQFRQIDGFGPFTCANVLMCLGFYHIIPTDSETVRHLKQLS 410 Query: 273 --VHAKSSMIRTVQRDVEVIYGKYAPFQFLAYWSEVWQFYEEWFGKLSEMSPSDYKLITA 100 VHAK S I+TVQRDVE IYG YAPFQFLAYW+E+W FYE+ FGKLSE+ SDYKLITA Sbjct: 411 IQVHAKKSTIQTVQRDVEEIYGNYAPFQFLAYWAELWHFYEKRFGKLSEIPISDYKLITA 470 Query: 99 ANMRPKRNGKNKRIK 55 +NMR K KNKR K Sbjct: 471 SNMRSKGGHKNKRTK 485 >ref|XP_007023216.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508778582|gb|EOY25838.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 467 Score = 345 bits (886), Expect = 2e-92 Identities = 188/348 (54%), Positives = 235/348 (67%), Gaps = 10/348 (2%) Frame = -2 Query: 1071 DQVRRMLRLSDEENWRIRDFQEV------HEEAKN---RGF-GRVFRSPTLFEDMVKCIL 922 +QV RMLRLS+EE ++R+F+++ EEA R F GRVFRSPTLFEDMVKCIL Sbjct: 139 NQVSRMLRLSEEEESKVREFRKIVEALHGEEEAAAECLRSFSGRVFRSPTLFEDMVKCIL 198 Query: 921 LCNCQWSRTLSMAKAXXXXXXXXXXXXXXXXLANAANKTISGCQTVETSQFVPKTPAGKE 742 LCNCQ+SRTLSMAKA + SG + E F+PKTPAG E Sbjct: 199 LCNCQFSRTLSMAKALCELQFE-------------TQRPFSGVRAAE-DDFIPKTPAGNE 244 Query: 741 SKKNPGVRKSSINSANRFAEVKETEENANLERSVQISDCFQPLEGKEVSSCTKIGNFPSP 562 K+ V K S+ +FAE + ++L+ S ++ E + +G+FPSP Sbjct: 245 LKRKLRVSKVSMRLEGKFAEPRADHSKSDLQPSQELD---------EPHAYKGMGSFPSP 295 Query: 561 RELVSLDEKFLAKRCNLGYRAGRIMNLAREIVDGRVRLRELEDICGTLSLPEYDKLAEKL 382 EL +LDE FLAKRCNLGYRA RI+ LA+ IV G ++L +LE+ C +SL Y+KLAE+L Sbjct: 296 EELANLDESFLAKRCNLGYRASRILKLAKGIVQGIIQLMQLEEGCKEISLSSYNKLAEQL 355 Query: 381 
KVIDGFGPFTCANVLMCMGFYHVIPTDSETIRHLEQVHAKSSMIRTVQRDVEVIYGKYAP 202 + IDGFGPFTCANVLMCMGFYHVIP DSETIRHL+QVH+KSS ++TV RDVE IY KYAP Sbjct: 356 RQIDGFGPFTCANVLMCMGFYHVIPADSETIRHLKQVHSKSSTMQTVGRDVEGIYAKYAP 415 Query: 201 FQFLAYWSEVWQFYEEWFGKLSEMSPSDYKLITAANMRPKRNGKNKRI 58 FQFLAYW+E+W +YE+ FGKLSEM YKLITA+NM+ K K ++ Sbjct: 416 FQFLAYWAELWHYYEQRFGKLSEMPFCGYKLITASNMKMKATSKRTKV 463 >ref|XP_012442875.1| PREDICTED: uncharacterized protein LOC105767847 isoform X2 [Gossypium raimondii] gi|763789632|gb|KJB56628.1| hypothetical protein B456_009G128100 [Gossypium raimondii] Length = 428 Score = 345 bits (885), Expect = 3e-92 Identities = 189/353 (53%), Positives = 236/353 (66%), Gaps = 9/353 (2%) Frame = -2 Query: 1071 DQVRRMLRLSDEENWRIRDFQEV------HEEAKN--RGF-GRVFRSPTLFEDMVKCILL 919 +QV RMLRLS+ E ++R+F+ + EEA R F GRVFRSPTLFEDMVKCILL Sbjct: 102 NQVSRMLRLSESEENKVREFRSIVEALHGEEEATEYLRSFSGRVFRSPTLFEDMVKCILL 161 Query: 918 CNCQWSRTLSMAKAXXXXXXXXXXXXXXXXLANAANKTISGCQTVETSQFVPKTPAGKES 739 CNCQ+SRTLSMAKA IS + E F+PKTPAGKES Sbjct: 162 CNCQFSRTLSMAKALCELQFEI-------------QHQISSSKAAE-DDFIPKTPAGKES 207 Query: 738 KKNPGVRKSSINSANRFAEVKETEENANLERSVQISDCFQPLEGKEVSSCTKIGNFPSPR 559 K+ V K S+ ++F E K ++L+ S + PL+ +G+FPSP Sbjct: 208 KRKLRVSKVSMRLESKFTESKVDNSVSDLQLSQE------PLD------FVGMGSFPSPE 255 Query: 558 ELVSLDEKFLAKRCNLGYRAGRIMNLAREIVDGRVRLRELEDICGTLSLPEYDKLAEKLK 379 EL +LDE FLAKRCNLGYRA RI+ LA+ +V G ++L +LE+ C S YDKL+++L+ Sbjct: 256 ELANLDESFLAKRCNLGYRASRILKLAQGVVQGNIQLTQLEEDCKETSFSSYDKLSQRLR 315 Query: 378 VIDGFGPFTCANVLMCMGFYHVIPTDSETIRHLEQVHAKSSMIRTVQRDVEVIYGKYAPF 199 IDGFGPFTCANVLMCMGFYHVIP DSETIRHL+QVH+KS ++TV RDVE+IY KYAPF Sbjct: 316 QIDGFGPFTCANVLMCMGFYHVIPADSETIRHLKQVHSKSCTVQTVGRDVELIYAKYAPF 375 Query: 198 QFLAYWSEVWQFYEEWFGKLSEMSPSDYKLITAANMRPKRNGKNKRIKLSVAD 40 QFLAYW+E+W FY + FGKLSE+ SDYKL+TA+NM+ K+ KR K S + Sbjct: 376 QFLAYWAEMWHFYGQRFGKLSELPVSDYKLMTASNMKNKKIATRKRSKTSAEE 428 >ref|XP_012442874.1| PREDICTED: uncharacterized protein LOC105767847 isoform X1 [Gossypium raimondii] 
gi|763789633|gb|KJB56629.1| hypothetical protein B456_009G128100 [Gossypium raimondii] Length = 435 Score = 338 bits (867), Expect = 4e-90 Identities = 189/360 (52%), Positives = 236/360 (65%), Gaps = 16/360 (4%) Frame = -2 Query: 1071 DQVRRMLRLSDEENWRIRDFQEV------HEEAKN--RGF-GRVFRSPTLFEDMVKCILL 919 +QV RMLRLS+ E ++R+F+ + EEA R F GRVFRSPTLFEDMVKCILL Sbjct: 102 NQVSRMLRLSESEENKVREFRSIVEALHGEEEATEYLRSFSGRVFRSPTLFEDMVKCILL 161 Query: 918 CNCQ-------WSRTLSMAKAXXXXXXXXXXXXXXXXLANAANKTISGCQTVETSQFVPK 760 CNCQ +SRTLSMAKA IS + E F+PK Sbjct: 162 CNCQAPPTFYRFSRTLSMAKALCELQFEI-------------QHQISSSKAAE-DDFIPK 207 Query: 759 TPAGKESKKNPGVRKSSINSANRFAEVKETEENANLERSVQISDCFQPLEGKEVSSCTKI 580 TPAGKESK+ V K S+ ++F E K ++L+ S + PL+ + Sbjct: 208 TPAGKESKRKLRVSKVSMRLESKFTESKVDNSVSDLQLSQE------PLD------FVGM 255 Query: 579 GNFPSPRELVSLDEKFLAKRCNLGYRAGRIMNLAREIVDGRVRLRELEDICGTLSLPEYD 400 G+FPSP EL +LDE FLAKRCNLGYRA RI+ LA+ +V G ++L +LE+ C S YD Sbjct: 256 GSFPSPEELANLDESFLAKRCNLGYRASRILKLAQGVVQGNIQLTQLEEDCKETSFSSYD 315 Query: 399 KLAEKLKVIDGFGPFTCANVLMCMGFYHVIPTDSETIRHLEQVHAKSSMIRTVQRDVEVI 220 KL+++L+ IDGFGPFTCANVLMCMGFYHVIP DSETIRHL+QVH+KS ++TV RDVE+I Sbjct: 316 KLSQRLRQIDGFGPFTCANVLMCMGFYHVIPADSETIRHLKQVHSKSCTVQTVGRDVELI 375 Query: 219 YGKYAPFQFLAYWSEVWQFYEEWFGKLSEMSPSDYKLITAANMRPKRNGKNKRIKLSVAD 40 Y KYAPFQFLAYW+E+W FY + FGKLSE+ SDYKL+TA+NM+ K+ KR K S + Sbjct: 376 YAKYAPFQFLAYWAEMWHFYGQRFGKLSELPVSDYKLMTASNMKNKKIATRKRSKTSAEE 435 >ref|XP_012078851.1| PREDICTED: uncharacterized protein LOC105639414 [Jatropha curcas] gi|643722707|gb|KDP32457.1| hypothetical protein JCGZ_13382 [Jatropha curcas] Length = 481 Score = 336 bits (861), Expect = 2e-89 Identities = 184/375 (49%), Positives = 237/375 (63%), Gaps = 35/375 (9%) Frame = -2 Query: 1068 QVRRMLRLSDEENWRIRDFQEVHEEAKNRGF-------GRVFRSPTLFEDMVKCILLCNC 910 QV RMLRLSD + IR+F+++ + F GRVFRSPTLFEDMVKCILLCNC Sbjct: 117 QVLRMLRLSDADEMNIREFRKIIAMGEGEEFDWMKGFSGRVFRSPTLFEDMVKCILLCNC 176 Query: 909 
QWSRTLSMAKAXXXXXXXXXXXXXXXXLANAANKTISGCQTVETSQFVPKTPAGKESKKN 730 QWSRTLSMA+A + + + Q + + F+PKTP GKES+K Sbjct: 177 QWSRTLSMARALCELQLELQFH----------SSSCTKAQQTDMNNFIPKTPVGKESQKR 226 Query: 729 PG-VRKSSINSANRFAEVKETEENA--------------NLERSVQISD----------- 628 G V +S N + + K + NL + I+ Sbjct: 227 KGRVSSASSNLSTKLLVTKMDWDEVDTCLTMVDTRIKRENLTPNFSINSIEDNSCGICKS 286 Query: 627 CFQP--LEGKEVSSCTKIGNFPSPRELVSLDEKFLAKRCNLGYRAGRIMNLAREIVDGRV 454 C P ++ + + C +I NFPSP EL +LDE+FL+KRC LGYRAGRI+ L++ IV+GR+ Sbjct: 287 CVGPSGIQSLQQTQCKRIWNFPSPWELANLDERFLSKRCGLGYRAGRIIKLSQGIVEGRI 346 Query: 453 RLRELEDICGTLSLPEYDKLAEKLKVIDGFGPFTCANVLMCMGFYHVIPTDSETIRHLEQ 274 +RELE +C SL Y++LA++LK IDGFGPFT ANVLMCMGFYHVIP DSET+RH++Q Sbjct: 347 PMRELEQVCNGGSLNSYNELADQLKEIDGFGPFTRANVLMCMGFYHVIPADSETVRHIKQ 406 Query: 273 VHAKSSMIRTVQRDVEVIYGKYAPFQFLAYWSEVWQFYEEWFGKLSEMSPSDYKLITAAN 94 VHAK+S I+TV + +E IYGKY P QFLAYW+E+W FYE+ FGK EM S+YKLITA+N Sbjct: 407 VHAKNSTIQTVHKHIEEIYGKYTPLQFLAYWTELWHFYEQRFGKFYEMPCSEYKLITASN 466 Query: 93 MRPKRNGKNKRIKLS 49 MR K + K KR K+S Sbjct: 467 MRNKGSCKIKRAKIS 481 >ref|XP_010104208.1| hypothetical protein L484_002408 [Morus notabilis] gi|587962478|gb|EXC47697.1| hypothetical protein L484_002408 [Morus notabilis] Length = 472 Score = 334 bits (857), Expect = 6e-89 Identities = 189/371 (50%), Positives = 228/371 (61%), Gaps = 33/371 (8%) Frame = -2 Query: 1068 QVRRMLRLSDEENWRIRDFQEVHEEAKNRGFGRVFRSPTLFEDMVKCILLCNCQWSRTLS 889 QV RMLRLS E R+F EV+ G GRVFRSPTLFEDMVKCILLCNCQW RTLS Sbjct: 101 QVSRMLRLSQTEERICREFSEVY--GCGSGLGRVFRSPTLFEDMVKCILLCNCQWPRTLS 158 Query: 888 MAKAXXXXXXXXXXXXXXXXLANAANKTISGCQTVETSQFVPKTPAGKESKKNPGVRKSS 709 MA+A + +KT+ FVPKTPAGKE K+ K+S Sbjct: 159 MAQALCDLQRELQLQ-------SVPSKTVD---------FVPKTPAGKEPKRKVEKLKAS 202 Query: 708 INSANRF-AEVKETEENANLERSVQISDCFQPLEGKEVSSCTKI---------------- 580 ++F A+ E E+ + + S+ IS + SS + Sbjct: 203 TCLTSQFDAQSNEGLESHSNDLSIDISQPTPSAQNLSPSSLLSVPMENVTCEESYGVDSA 262 Query: 579 ----------------GNFPSPRELVSLDEKFLAKRCNLGYRAGRIMNLAREIVDGRVRL 448 G+FP+P 
EL LDEKFLAKRC LGYRAGRI+ LAR IV+GR++L Sbjct: 263 SLCNPQILRDREFEGTGDFPTPTELAKLDEKFLAKRCKLGYRAGRILKLARGIVEGRIQL 322 Query: 447 RELEDICGTLSLPEYDKLAEKLKVIDGFGPFTCANVLMCMGFYHVIPTDSETIRHLEQVH 268 RELE+ C SL Y KLA +L+ IDGFGPFTCANVLMCMGFYHVIP+DSETIRHL+QVH Sbjct: 323 RELEETCMERSLCSYSKLAVQLRQIDGFGPFTCANVLMCMGFYHVIPSDSETIRHLQQVH 382 Query: 267 AKSSMIRTVQRDVEVIYGKYAPFQFLAYWSEVWQFYEEWFGKLSEMSPSDYKLITAANMR 88 ++S +RT++RDV+ IY KY PFQFLAYWSE+W FYE+ FGK+SEM S YKL TA+NM+ Sbjct: 383 GRNSTVRTIERDVQQIYAKYEPFQFLAYWSELWHFYEKKFGKISEMPCSAYKLFTASNMK 442 Query: 87 PKRNGKNKRIK 55 K N R K Sbjct: 443 TKAERPNNRKK 453