BLASTX nr result
ID: Cocculus23_contig00014479
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00014479
         (1254 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits)  Value

ref|XP_006437510.1| hypothetical protein CICLE_v10032158mg [Citr...   343   1e-91
ref|XP_006484658.1| PREDICTED: uncharacterized protein LOC102619...   341   3e-91
ref|XP_006484657.1| PREDICTED: uncharacterized protein LOC102619...   341   3e-91
ref|XP_002265131.1| PREDICTED: uncharacterized protein LOC100268...   338   4e-90
ref|XP_007048463.1| Uncharacterized protein TCM_001539 [Theobrom...   330   1e-87
ref|XP_006350362.1| PREDICTED: uncharacterized protein LOC102588...   325   3e-86
ref|XP_006406456.1| hypothetical protein EUTSA_v10021111mg [Eutr...   323   1e-85
ref|NP_566649.1| uncharacterized protein [Arabidopsis thaliana] ...   318   2e-84
ref|XP_002885329.1| hypothetical protein ARALYDRAFT_479498 [Arab...   315   3e-83
ref|XP_002528748.1| conserved hypothetical protein [Ricinus comm...   314   4e-83
gb|EXB61829.1| hypothetical protein L484_012263 [Morus notabilis]     313   1e-82
ref|XP_006298144.1| hypothetical protein CARUB_v10014192mg [Caps...   311   3e-82
ref|XP_002315057.1| hypothetical protein POPTR_0010s17720g [Popu...   310   8e-82
ref|XP_004231527.1| PREDICTED: uncharacterized protein LOC101255...   309   2e-81
ref|XP_007219389.1| hypothetical protein PRUPE_ppa020238mg, part...   305   4e-80
ref|XP_007159477.1| hypothetical protein PHAVU_002G240600g [Phas...   304   5e-80
gb|AGV54555.1| hypothetical protein [Phaseolus vulgaris]              304   5e-80
ref|XP_004307383.1| PREDICTED: uncharacterized protein LOC101294...   300   9e-79
ref|XP_003524967.1| PREDICTED: uncharacterized protein LOC100792...
299 1e-78 ref|XP_003629796.1| hypothetical protein MTR_8g086630 [Medicago ... 297 6e-78 >ref|XP_006437510.1| hypothetical protein CICLE_v10032158mg [Citrus clementina] gi|557539706|gb|ESR50750.1| hypothetical protein CICLE_v10032158mg [Citrus clementina] Length = 315 Score = 343 bits (879), Expect = 1e-91 Identities = 165/239 (69%), Positives = 201/239 (84%), Gaps = 5/239 (2%) Frame = +3 Query: 339 FDWGDENMV----GSPWEGAVVYRRDPSVTHLEYCTTLERLGLGKLSTEVSRSRASAMGM 506 +DW D+ V GSPWEGA++Y+R+PS+THLEYCTTLERLGLGKLSTEVSRSRASAMG+ Sbjct: 76 YDWEDQEDVEEDAGSPWEGAIIYKRNPSITHLEYCTTLERLGLGKLSTEVSRSRASAMGL 135 Query: 507 RVTKAVKDYPLGTPVLVSLDVCRKKRKLRLDGIIRTVLSLGCNRCGEPAAESVFSNFALL 686 RVTKAVKDYP GTPV +S+DV +KK+KLRLDGIIRTVL+LGCNRCGEPAA+SVFS+F++L Sbjct: 136 RVTKAVKDYPNGTPVQISIDVTKKKQKLRLDGIIRTVLTLGCNRCGEPAAQSVFSDFSVL 195 Query: 687 LTEEPIEEAEVVDMGLIFGEDKSRSAV-SXXXXXXXXXXXXXXXRLYFPAEEKEIDISKQ 863 L+E+PIEE E++D+G++FGEDKS+S+ + RLYFP EEKEIDISK Sbjct: 196 LSEQPIEEPEIIDIGMMFGEDKSKSSTGNGSEEEDDDASIDWDDRLYFPLEEKEIDISKN 255 Query: 864 IRDLVHIEITINAICDSSCKGLCLRCGANLNNSVCNCSKKQEQKEKSYGPLGNLREQLQ 1040 IRD+VH+EITIN ICD SCKG+CL+CG NLN S CNCS K+E K K+YGPLGNLR+Q++ Sbjct: 256 IRDMVHLEITINVICDPSCKGICLKCGTNLNTSTCNCS-KEEVKGKTYGPLGNLRKQME 313 >ref|XP_006484658.1| PREDICTED: uncharacterized protein LOC102619910 isoform X2 [Citrus sinensis] Length = 315 Score = 341 bits (875), Expect = 3e-91 Identities = 164/239 (68%), Positives = 200/239 (83%), Gaps = 5/239 (2%) Frame = +3 Query: 339 FDWGDENMV----GSPWEGAVVYRRDPSVTHLEYCTTLERLGLGKLSTEVSRSRASAMGM 506 +DW D+ V GSPWEGA++Y+R+PS+THLEYCTTLERLGLGKLSTEVSRSRASAMG+ Sbjct: 76 YDWEDQEDVEEDAGSPWEGAIIYKRNPSITHLEYCTTLERLGLGKLSTEVSRSRASAMGL 135 Query: 507 RVTKAVKDYPLGTPVLVSLDVCRKKRKLRLDGIIRTVLSLGCNRCGEPAAESVFSNFALL 686 RVTKAVKDYP GTPV +S+DV +KK+KLRLDGIIRTVL+LGCNRCGEPA +SVFS+F++L Sbjct: 136 RVTKAVKDYPNGTPVQISIDVTKKKQKLRLDGIIRTVLTLGCNRCGEPATQSVFSDFSVL 195 Query: 687 LTEEPIEEAEVVDMGLIFGEDKSRSAV-SXXXXXXXXXXXXXXXRLYFPAEEKEIDISKQ 863 L+E+PIEE E++D+G++FGEDKS+S+ + RLYFP EEKEIDISK 
Sbjct: 196 LSEQPIEEPEIIDIGMMFGEDKSKSSTGNGSEEEDDDASIDWDDRLYFPLEEKEIDISKN 255 Query: 864 IRDLVHIEITINAICDSSCKGLCLRCGANLNNSVCNCSKKQEQKEKSYGPLGNLREQLQ 1040 IRD+VH+EITIN ICD SCKG+CL+CG NLN S CNCS K+E K K+YGPLGNLR+Q++ Sbjct: 256 IRDMVHLEITINVICDPSCKGICLKCGTNLNTSTCNCS-KEEVKGKTYGPLGNLRKQME 313 >ref|XP_006484657.1| PREDICTED: uncharacterized protein LOC102619910 isoform X1 [Citrus sinensis] Length = 338 Score = 341 bits (875), Expect = 3e-91 Identities = 164/239 (68%), Positives = 200/239 (83%), Gaps = 5/239 (2%) Frame = +3 Query: 339 FDWGDENMV----GSPWEGAVVYRRDPSVTHLEYCTTLERLGLGKLSTEVSRSRASAMGM 506 +DW D+ V GSPWEGA++Y+R+PS+THLEYCTTLERLGLGKLSTEVSRSRASAMG+ Sbjct: 99 YDWEDQEDVEEDAGSPWEGAIIYKRNPSITHLEYCTTLERLGLGKLSTEVSRSRASAMGL 158 Query: 507 RVTKAVKDYPLGTPVLVSLDVCRKKRKLRLDGIIRTVLSLGCNRCGEPAAESVFSNFALL 686 RVTKAVKDYP GTPV +S+DV +KK+KLRLDGIIRTVL+LGCNRCGEPA +SVFS+F++L Sbjct: 159 RVTKAVKDYPNGTPVQISIDVTKKKQKLRLDGIIRTVLTLGCNRCGEPATQSVFSDFSVL 218 Query: 687 LTEEPIEEAEVVDMGLIFGEDKSRSAV-SXXXXXXXXXXXXXXXRLYFPAEEKEIDISKQ 863 L+E+PIEE E++D+G++FGEDKS+S+ + RLYFP EEKEIDISK Sbjct: 219 LSEQPIEEPEIIDIGMMFGEDKSKSSTGNGSEEEDDDASIDWDDRLYFPLEEKEIDISKN 278 Query: 864 IRDLVHIEITINAICDSSCKGLCLRCGANLNNSVCNCSKKQEQKEKSYGPLGNLREQLQ 1040 IRD+VH+EITIN ICD SCKG+CL+CG NLN S CNCS K+E K K+YGPLGNLR+Q++ Sbjct: 279 IRDMVHLEITINVICDPSCKGICLKCGTNLNTSTCNCS-KEEVKGKTYGPLGNLRKQME 336 >ref|XP_002265131.1| PREDICTED: uncharacterized protein LOC100268166 [Vitis vinifera] gi|297743783|emb|CBI36666.3| unnamed protein product [Vitis vinifera] Length = 320 Score = 338 bits (866), Expect = 4e-90 Identities = 167/265 (63%), Positives = 201/265 (75%), Gaps = 3/265 (1%) Frame = +3 Query: 255 SDHHPSRSLNFTCRDATITNLDLEDPIDFDWGDENMV---GSPWEGAVVYRRDPSVTHLE 425 S + P +L FT R + D E+ FDW DE + GSPWEGAVVY+R+PS+ H+E Sbjct: 56 SRNKPHSALKFTAR-YNFESFDEENTKKFDWNDEREIEDTGSPWEGAVVYKRNPSILHVE 114 Query: 426 YCTTLERLGLGKLSTEVSRSRASAMGMRVTKAVKDYPLGTPVLVSLDVCRKKRKLRLDGI 605 +CTTLERLGLGKLSTE+S+SRAS MG+RVTKA KDYP GTPV +S+DV RKK KLRLDG+ Sbjct: 115 
HCTTLERLGLGKLSTEISKSRASVMGLRVTKAAKDYPQGTPVHISIDVTRKKHKLRLDGL 174 Query: 606 IRTVLSLGCNRCGEPAAESVFSNFALLLTEEPIEEAEVVDMGLIFGEDKSRSAVSXXXXX 785 +RTV++LGCNRCGEPAAE +FSNF+LLLTEEPIEE EV++MG+IFGED + Sbjct: 175 LRTVITLGCNRCGEPAAECIFSNFSLLLTEEPIEEQEVINMGVIFGEDDKLKTSTESSEE 234 Query: 786 XXXXXXXXXXRLYFPAEEKEIDISKQIRDLVHIEITINAICDSSCKGLCLRCGANLNNSV 965 LYFP EE EIDISK IRD+VH+EITINA+CDS CKG+CL+CG NLN + Sbjct: 235 DDEASIDLDDWLYFPPEETEIDISKHIRDMVHLEITINAVCDSRCKGICLKCGINLNTAS 294 Query: 966 CNCSKKQEQKEKSYGPLGNLREQLQ 1040 CNCS K+E KEK YGPLG LR+Q+Q Sbjct: 295 CNCS-KEEVKEKGYGPLGVLRKQIQ 318 >ref|XP_007048463.1| Uncharacterized protein TCM_001539 [Theobroma cacao] gi|508700724|gb|EOX92620.1| Uncharacterized protein TCM_001539 [Theobroma cacao] Length = 324 Score = 330 bits (845), Expect = 1e-87 Identities = 168/280 (60%), Positives = 208/280 (74%), Gaps = 9/280 (3%) Frame = +3 Query: 228 LPQLISATPSDHHPSRSLNFT--CRDATITNLDL---EDPIDFDWGDENMV---GSPWEG 383 LP ++ + S+SLN RD+ N + E+ I FDW D+ + GSPWEG Sbjct: 43 LPWSVTLNSQKNFRSKSLNVLKPVRDSINQNSEYFTEENTITFDWEDQEDIEDIGSPWEG 102 Query: 384 AVVYRRDPSVTHLEYCTTLERLGLGKLSTEVSRSRASAMGMRVTKAVKDYPLGTPVLVSL 563 AV+YRR+PS+THLEYCTTLERLGLGKLS+++S+SRAS MG+RVT+AVKDYP GTPV +S+ Sbjct: 103 AVMYRRNPSITHLEYCTTLERLGLGKLSSDISKSRASVMGLRVTRAVKDYPNGTPVQISI 162 Query: 564 DVCRKKRKLRLDGIIRTVLSLGCNRCGEPAAESVFSNFALLLTEEPIEEAEVVDMGLIFG 743 DV RKK+K+RLDGII+TV++LGCNRCGEPAAE +FSNF++LL+EEPIEE E++DMG F Sbjct: 163 DVTRKKQKMRLDGIIKTVITLGCNRCGEPAAEGIFSNFSVLLSEEPIEEPEIIDMGATFE 222 Query: 744 ED-KSRSAVSXXXXXXXXXXXXXXXRLYFPAEEKEIDISKQIRDLVHIEITINAICDSSC 920 E KS + RLYFP EEKEIDISK IRD+VH+EITINA+CD C Sbjct: 223 EGFKSVYGSNQEVEEDDDASIDWDDRLYFPPEEKEIDISKHIRDMVHLEITINAVCDPRC 282 Query: 921 KGLCLRCGANLNNSVCNCSKKQEQKEKSYGPLGNLREQLQ 1040 KG+CL+CG NLN S CNCS +E KEK YGPLGNL +Q+Q Sbjct: 283 KGICLKCGTNLNTSSCNCS--EEIKEKGYGPLGNLGKQIQ 320 >ref|XP_006350362.1| PREDICTED: uncharacterized protein LOC102588036 [Solanum tuberosum] Length = 302 Score = 325 bits (833), Expect = 3e-86 Identities = 165/242 
(68%), Positives = 191/242 (78%), Gaps = 6/242 (2%) Frame = +3 Query: 333 IDFDWGDENMV-----GSPWEGAVVYRRDPSVTHLEYCTTLERLGLGKLSTEVSRSRASA 497 +DFDW DE+ SPWEGAVVY+R+ SVTHLEYCTTLERLGLGKLST+VS+ RAS Sbjct: 60 VDFDWEDEDEYEEEDQDSPWEGAVVYKRNSSVTHLEYCTTLERLGLGKLSTKVSKCRASV 119 Query: 498 MGMRVTKAVKDYPLGTPVLVSLDVCRKKRKLRLDGIIRTVLSLGCNRCGEPAAESVFSNF 677 MG+RVTK V DYP GTPVLVS DV RKK KLRLDGIIRTV++L CNRCGEPAAES+FSNF Sbjct: 120 MGLRVTKQVNDYPDGTPVLVSFDVTRKKHKLRLDGIIRTVIALPCNRCGEPAAESIFSNF 179 Query: 678 ALLLTEEPIEEAEVVDMGLIFGEDKSRSAVS-XXXXXXXXXXXXXXXRLYFPAEEKEIDI 854 +LLL+EEPI+E E +DMG++FGEDK +S V+ +LYFP EEK IDI Sbjct: 180 SLLLSEEPIKEPETLDMGIMFGEDKFKSFVNMEEEMEENDGWIPLEDQLYFPGEEKMIDI 239 Query: 855 SKQIRDLVHIEITINAICDSSCKGLCLRCGANLNNSVCNCSKKQEQKEKSYGPLGNLREQ 1034 SKQIRDLVHIEITINA+CD CKGLCL+CGANLN S CNC Q+ +EK YGPLG L++Q Sbjct: 240 SKQIRDLVHIEITINAVCDPKCKGLCLKCGANLNVSRCNC-HMQKVEEKGYGPLGGLKKQ 298 Query: 1035 LQ 1040 +Q Sbjct: 299 MQ 300 >ref|XP_006406456.1| hypothetical protein EUTSA_v10021111mg [Eutrema salsugineum] gi|557107602|gb|ESQ47909.1| hypothetical protein EUTSA_v10021111mg [Eutrema salsugineum] Length = 331 Score = 323 bits (828), Expect = 1e-85 Identities = 158/246 (64%), Positives = 193/246 (78%), Gaps = 10/246 (4%) Frame = +3 Query: 333 IDFDWGDENMV---GSPWEGAVVYRRDPSVTHLEYCTTLERLGLGKLSTEVSRSRASAMG 503 ID DW DE + GSPWEG+V+YRR+ SVTH+EYCTTLERLGLG+LST+VS+ RASAMG Sbjct: 81 IDIDWEDEEDIEDTGSPWEGSVMYRRNASVTHVEYCTTLERLGLGRLSTQVSKKRASAMG 140 Query: 504 MRVTKAVKDYPLGTPVLVSLDVCRKKRKLRLDGIIRTVLSLGCNRCGEPAAESVFSNFAL 683 +RVTK VKDYP GTPV VS+DV RKK+KLRLDGI+RTV++LGCNRCGEPA ES+FSNF+L Sbjct: 141 LRVTKDVKDYPDGTPVQVSVDVIRKKKKLRLDGIVRTVITLGCNRCGEPAGESIFSNFSL 200 Query: 684 LLTEEPIEEAEVVDMGLIFGEDKSRS-------AVSXXXXXXXXXXXXXXXRLYFPAEEK 842 LLTEEP+EE +V+D+G FG+DK+ S +L+FP E K Sbjct: 201 LLTEEPVEEPDVIDLGFTFGKDKANSFSGLSNDEEDNADDDDDDSLIDWEDKLHFPPEVK 260 Query: 843 EIDISKQIRDLVHIEITINAICDSSCKGLCLRCGANLNNSVCNCSKKQEQKEKSYGPLGN 1022 EIDISK IRDLVH+EITINAICD++CKG+CL+CGANLN C+C + E+K+K YGPLGN Sbjct: 261 
EIDISKHIRDLVHLEITINAICDAACKGMCLKCGANLNKRKCDCGR--EEKDKGYGPLGN 318 Query: 1023 LREQLQ 1040 LR+Q+Q Sbjct: 319 LRKQMQ 324 >ref|NP_566649.1| uncharacterized protein [Arabidopsis thaliana] gi|11994192|dbj|BAB01295.1| unnamed protein product [Arabidopsis thaliana] gi|21593774|gb|AAM65741.1| unknown [Arabidopsis thaliana] gi|109946589|gb|ABG48473.1| At3g19810 [Arabidopsis thaliana] gi|110742135|dbj|BAE98996.1| hypothetical protein [Arabidopsis thaliana] gi|332642771|gb|AEE76292.1| uncharacterized protein AT3G19810 [Arabidopsis thaliana] Length = 321 Score = 318 bits (816), Expect = 2e-84 Identities = 155/239 (64%), Positives = 187/239 (78%), Gaps = 3/239 (1%) Frame = +3 Query: 333 IDFDWGDENMV---GSPWEGAVVYRRDPSVTHLEYCTTLERLGLGKLSTEVSRSRASAMG 503 ID DW D+ + GSPWEG+V+YRR+ SVTH+EYCTTLERLGLG+LST+VS+ RASAMG Sbjct: 81 IDMDWEDQEEIEDTGSPWEGSVMYRRNASVTHVEYCTTLERLGLGRLSTDVSKKRASAMG 140 Query: 504 MRVTKAVKDYPLGTPVLVSLDVCRKKRKLRLDGIIRTVLSLGCNRCGEPAAESVFSNFAL 683 +RVTK VKDYP GTPV VS+DV RKK+KLRLDGI+RTV++LGCNRCGE ES+FSNF+L Sbjct: 141 LRVTKDVKDYPDGTPVQVSVDVIRKKKKLRLDGIVRTVITLGCNRCGESTGESIFSNFSL 200 Query: 684 LLTEEPIEEAEVVDMGLIFGEDKSRSAVSXXXXXXXXXXXXXXXRLYFPAEEKEIDISKQ 863 LLTEEP+EE +V+D+G FG DK +L+FP E KEIDISK Sbjct: 201 LLTEEPVEEPDVIDLGFTFGNDKEE---GEDDDDNDDSWIDWEDKLHFPPEVKEIDISKH 257 Query: 864 IRDLVHIEITINAICDSSCKGLCLRCGANLNNSVCNCSKKQEQKEKSYGPLGNLREQLQ 1040 IRDLVH+EITI AICDS+CKG+CL+CGANLN C+C + E+K+K YGPLGNLREQ+Q Sbjct: 258 IRDLVHLEITITAICDSACKGMCLKCGANLNKRKCDCGR--EEKDKGYGPLGNLREQMQ 314 >ref|XP_002885329.1| hypothetical protein ARALYDRAFT_479498 [Arabidopsis lyrata subsp. lyrata] gi|297331169|gb|EFH61588.1| hypothetical protein ARALYDRAFT_479498 [Arabidopsis lyrata subsp. 
lyrata] Length = 317 Score = 315 bits (807), Expect = 3e-83 Identities = 154/239 (64%), Positives = 185/239 (77%), Gaps = 3/239 (1%) Frame = +3 Query: 333 IDFDWGDENMV---GSPWEGAVVYRRDPSVTHLEYCTTLERLGLGKLSTEVSRSRASAMG 503 ID DW D+ + GSPWEG+V+YRR+ S TH+EYCTTLERLGLG+LSTEVS+ RASAMG Sbjct: 78 IDMDWEDQEEIEDTGSPWEGSVMYRRNASATHVEYCTTLERLGLGRLSTEVSKKRASAMG 137 Query: 504 MRVTKAVKDYPLGTPVLVSLDVCRKKRKLRLDGIIRTVLSLGCNRCGEPAAESVFSNFAL 683 +RVTK VKDYP GTPV VS+DV RKK+KLRLDGI+RTV++LGCNRCGE ES+FSNF+L Sbjct: 138 LRVTKDVKDYPDGTPVQVSVDVIRKKKKLRLDGIVRTVITLGCNRCGESTGESIFSNFSL 197 Query: 684 LLTEEPIEEAEVVDMGLIFGEDKSRSAVSXXXXXXXXXXXXXXXRLYFPAEEKEIDISKQ 863 LLTE+P+EE +V+D+G FG DK L+FP E KEIDISK Sbjct: 198 LLTEDPVEEPDVIDLGFTFGGDKEEG----EDDDDDDSWIDWEDTLHFPPEVKEIDISKH 253 Query: 864 IRDLVHIEITINAICDSSCKGLCLRCGANLNNSVCNCSKKQEQKEKSYGPLGNLREQLQ 1040 IRDLVH+EITI AICDS+CKG+CL+CGANLN C+C + E+K+K YGPLGNLREQ+Q Sbjct: 254 IRDLVHLEITITAICDSACKGMCLKCGANLNKRKCDCGR--EEKDKGYGPLGNLREQMQ 310 >ref|XP_002528748.1| conserved hypothetical protein [Ricinus communis] gi|223531842|gb|EEF33660.1| conserved hypothetical protein [Ricinus communis] Length = 313 Score = 314 bits (805), Expect = 4e-83 Identities = 163/268 (60%), Positives = 201/268 (75%), Gaps = 6/268 (2%) Frame = +3 Query: 255 SDHHPSRSL----NFTCRDATITNLDLEDPIDFDWGDENMVGSPWEGAVVYRRDPSVTHL 422 S+ + SRS+ FT +L+ E+P D + SPWEGA++Y+R+PSV+H+ Sbjct: 52 SNKNNSRSIPNNQKFTDVSLGWDDLEEENPEDME--------SPWEGAIIYKRNPSVSHI 103 Query: 423 EYCTTLERLGLGKLSTEVSRSRASAMGMRVTKAVKDYPLGTPVLVSLDVCRKKRKLRLDG 602 EYCTTLERLGLGK+STEVS+SRAS MG+RVTKAVKD+PLGTPV +S+DV RKK+KLRLDG Sbjct: 104 EYCTTLERLGLGKVSTEVSKSRASVMGLRVTKAVKDFPLGTPVQISIDVTRKKQKLRLDG 163 Query: 603 IIRTVLSLGCNRCGEPAAESVFSNFALLLTEEPIEEAEVVDMGLIFGEDK--SRSAVSXX 776 II+TVL+L CNRCG P A S++SNF+LLL+EE IEE E+VDMG+IFGEDK S +A Sbjct: 164 IIKTVLTLTCNRCGVPTAGSIYSNFSLLLSEEQIEEPEIVDMGMIFGEDKFESSAASGYE 223 Query: 777 XXXXXXXXXXXXXRLYFPAEEKEIDISKQIRDLVHIEITINAICDSSCKGLCLRCGANLN 956 R YFP EEKEIDISK IRDLVHIEI NAICD+SCKG+CL CG NLN Sbjct: 224 
EEDDDDASIDWDDRFYFPPEEKEIDISKNIRDLVHIEIADNAICDASCKGVCLNCGTNLN 283 Query: 957 NSVCNCSKKQEQKEKSYGPLGNLREQLQ 1040 S C+CS K++ KEK YGPL +L++Q+Q Sbjct: 284 TSSCSCS-KEKNKEKGYGPLKDLKKQMQ 310 >gb|EXB61829.1| hypothetical protein L484_012263 [Morus notabilis] Length = 281 Score = 313 bits (802), Expect = 1e-82 Identities = 155/262 (59%), Positives = 195/262 (74%), Gaps = 6/262 (2%) Frame = +3 Query: 273 RSLNFTCRDATITNLDLEDPIDFDWGD----ENMVGSPWEGAVVYRRDPSVTHLEYCTTL 440 RS C + + E+ + D+ D E G PWEGAV+Y+R+ S++H+EYCTTL Sbjct: 18 RSTALDCTKHDYDHSNSENTVSLDFEDQEKEEEDTGCPWEGAVIYKRNSSISHIEYCTTL 77 Query: 441 ERLGLGKLSTEVSRSRASAMGMRVTKAVKDYPLGTPVLVSLDVCRKKRKLRLDGIIRTVL 620 ERLGLG LSTE+S+SRASAMG+RVTKAVKDYP GTPV VS+DV RKK+KLRLDGI++TV+ Sbjct: 78 ERLGLGSLSTELSKSRASAMGLRVTKAVKDYPFGTPVQVSVDVMRKKQKLRLDGIVKTVI 137 Query: 621 SLGCNRCGEPAAESVFSNFALLLTEEPIEEAEVVDMGLIFGEDKSR--SAVSXXXXXXXX 794 +LGCN CG PAA+S+FS+F+LLLTEEP+EE +++++G I G++KSR S + Sbjct: 138 TLGCNSCGGPAAQSIFSDFSLLLTEEPVEEPDIINLGTIHGDNKSRPYSGLGDDGEEDDD 197 Query: 795 XXXXXXXRLYFPAEEKEIDISKQIRDLVHIEITINAICDSSCKGLCLRCGANLNNSVCNC 974 RLYFP EKEIDISK IRDLVH+EI I AICD +CKG C +CGANLN S C C Sbjct: 198 ASIDFEDRLYFPPGEKEIDISKHIRDLVHLEINIKAICDPNCKGFCFKCGANLNTSRCTC 257 Query: 975 SKKQEQKEKSYGPLGNLREQLQ 1040 S KQE K+ SYGPLGNL++Q+Q Sbjct: 258 S-KQEVKKSSYGPLGNLKQQMQ 278 >ref|XP_006298144.1| hypothetical protein CARUB_v10014192mg [Capsella rubella] gi|482566853|gb|EOA31042.1| hypothetical protein CARUB_v10014192mg [Capsella rubella] Length = 323 Score = 311 bits (798), Expect = 3e-82 Identities = 151/244 (61%), Positives = 186/244 (76%), Gaps = 8/244 (3%) Frame = +3 Query: 333 IDFDWGDENMV---GSPWEGAVVYRRDPSVTHLEYCTTLERLGLGKLSTEVSRSRASAMG 503 ID DW D+ + GSPWEG+V+YRR+ SVTH+EYCTTLERLGLG+LSTEVS+ RASAMG Sbjct: 75 IDMDWEDQEEIEDIGSPWEGSVMYRRNASVTHVEYCTTLERLGLGRLSTEVSKKRASAMG 134 Query: 504 MRVTKAVKDYPLGTPVLVSLDVCRKKRKLRLDGIIRTVLSLGCNRCGEPAAESVFSNFAL 683 +RVTK VKDYP GTPV +++DV RKK+KLRLDGI++TV++LGCNRCGE ES+FSNF+L Sbjct: 135 
LRVTKDVKDYPDGTPVQIAVDVIRKKKKLRLDGIVKTVITLGCNRCGESTGESIFSNFSL 194 Query: 684 LLTEEPIEEAEVVDMGLIFGEDKSRSAV-----SXXXXXXXXXXXXXXXRLYFPAEEKEI 848 LLTE+P+EE +V+D+G FG DK+ S +L+FP E KEI Sbjct: 195 LLTEDPVEEPDVIDLGFTFGSDKANSFSGLSDDKEETEDDDDSWIDWEDKLHFPPEAKEI 254 Query: 849 DISKQIRDLVHIEITINAICDSSCKGLCLRCGANLNNSVCNCSKKQEQKEKSYGPLGNLR 1028 DISK IRDLVH+EITI AICD CKG+CL+CGANLN C C + E+K+K YGPLGNLR Sbjct: 255 DISKHIRDLVHLEITITAICDPGCKGMCLKCGANLNKRKCECGR--EEKDKGYGPLGNLR 312 Query: 1029 EQLQ 1040 E++Q Sbjct: 313 EKMQ 316 >ref|XP_002315057.1| hypothetical protein POPTR_0010s17720g [Populus trichocarpa] gi|566191354|ref|XP_006378596.1| hypothetical protein POPTR_0010s17720g [Populus trichocarpa] gi|566191357|ref|XP_006378597.1| hypothetical protein POPTR_0010s17720g [Populus trichocarpa] gi|222864097|gb|EEF01228.1| hypothetical protein POPTR_0010s17720g [Populus trichocarpa] gi|550330028|gb|ERP56393.1| hypothetical protein POPTR_0010s17720g [Populus trichocarpa] gi|550330029|gb|ERP56394.1| hypothetical protein POPTR_0010s17720g [Populus trichocarpa] Length = 322 Score = 310 bits (794), Expect = 8e-82 Identities = 147/258 (56%), Positives = 196/258 (75%), Gaps = 5/258 (1%) Frame = +3 Query: 279 LNFTCRDATITNLDLEDPIDFDWGDENM-----VGSPWEGAVVYRRDPSVTHLEYCTTLE 443 + FT R A +N + +W D+ + SPWEGA++Y+R+ S++H+EYCTTLE Sbjct: 63 IRFTTRHAVYSNSQKFTDVSLNWDDQEEEDAEDMESPWEGAIIYKRNSSISHVEYCTTLE 122 Query: 444 RLGLGKLSTEVSRSRASAMGMRVTKAVKDYPLGTPVLVSLDVCRKKRKLRLDGIIRTVLS 623 RLGLGKLSTE+S+SRAS MG+RVTKAVKDYPLGTPV +S+DV +KK++LRLDGII+TV++ Sbjct: 123 RLGLGKLSTEISKSRASVMGLRVTKAVKDYPLGTPVQISIDVTKKKKRLRLDGIIKTVIT 182 Query: 624 LGCNRCGEPAAESVFSNFALLLTEEPIEEAEVVDMGLIFGEDKSRSAVSXXXXXXXXXXX 803 LGC RCGEP AE +FSNF+LLL+EEP+ E E+++MG +FG DK +S++ Sbjct: 183 LGCYRCGEPVAEGIFSNFSLLLSEEPVAEPEIINMGKVFGNDKLKSSI-FEEEDGDEASI 241 Query: 804 XXXXRLYFPAEEKEIDISKQIRDLVHIEITINAICDSSCKGLCLRCGANLNNSVCNCSKK 983 RL+FP E+KEIDISK +RD+VH+EIT++ ICD SCKGLCL CG NLN S CNCSK+ Sbjct: 242 
EWDDRLHFPPEDKEIDISKPLRDMVHVEITLDVICDPSCKGLCLECGTNLNKSSCNCSKE 301 Query: 984 QEQKEKSYGPLGNLREQL 1037 +E KE+ GPL +L++Q+ Sbjct: 302 KE-KERGPGPLKDLKKQM 318 >ref|XP_004231527.1| PREDICTED: uncharacterized protein LOC101255042 [Solanum lycopersicum] Length = 298 Score = 309 bits (791), Expect = 2e-81 Identities = 155/242 (64%), Positives = 189/242 (78%), Gaps = 6/242 (2%) Frame = +3 Query: 333 IDFDWGDE-----NMVGSPWEGAVVYRRDPSVTHLEYCTTLERLGLGKLSTEVSRSRASA 497 +DFDW DE SPWEGAVVY+R+ SVTHL+Y TTLERLGLGKLST+VS+ RAS Sbjct: 56 VDFDWEDEYEDEYEDEDSPWEGAVVYKRNSSVTHLDYYTTLERLGLGKLSTKVSKCRASV 115 Query: 498 MGMRVTKAVKDYPLGTPVLVSLDVCRKKRKLRLDGIIRTVLSLGCNRCGEPAAESVFSNF 677 MG+RVT+ VKDYP GTPVL+S DV R K KLRLDGIIRTV++L CNRCGEPAAES+FSNF Sbjct: 116 MGLRVTRQVKDYPDGTPVLISFDVTRMKHKLRLDGIIRTVIALPCNRCGEPAAESIFSNF 175 Query: 678 ALLLTEEPIEEAEVVDMGLIFGEDKSRSAVS-XXXXXXXXXXXXXXXRLYFPAEEKEIDI 854 +LLL+EEP++EAE +DMG++FG+DK +S V+ +LYFP +EK IDI Sbjct: 176 SLLLSEEPLKEAETLDMGIMFGDDKFKSFVNVEEEMEENDGWIPLEDQLYFPGDEKMIDI 235 Query: 855 SKQIRDLVHIEITINAICDSSCKGLCLRCGANLNNSVCNCSKKQEQKEKSYGPLGNLREQ 1034 SK IRDLVHIEITINA+CD CKGLCL+CGANLN + C+C ++ +EK YGPLG L++Q Sbjct: 236 SKHIRDLVHIEITINAVCDPKCKGLCLKCGANLNVNRCSC-HMEKIEEKGYGPLGGLKKQ 294 Query: 1035 LQ 1040 +Q Sbjct: 295 MQ 296 >ref|XP_007219389.1| hypothetical protein PRUPE_ppa020238mg, partial [Prunus persica] gi|462415851|gb|EMJ20588.1| hypothetical protein PRUPE_ppa020238mg, partial [Prunus persica] Length = 241 Score = 305 bits (780), Expect = 4e-80 Identities = 146/239 (61%), Positives = 187/239 (78%) Frame = +3 Query: 324 EDPIDFDWGDENMVGSPWEGAVVYRRDPSVTHLEYCTTLERLGLGKLSTEVSRSRASAMG 503 ED + D GDE+ GSPWEGAV+Y+R+ S++H+EYCTTLERLGLG LSTEVS+S+AS MG Sbjct: 5 EDAVFLDLGDEDETGSPWEGAVIYKRNTSISHVEYCTTLERLGLGNLSTEVSKSKASVMG 64 Query: 504 MRVTKAVKDYPLGTPVLVSLDVCRKKRKLRLDGIIRTVLSLGCNRCGEPAAESVFSNFAL 683 +RVTKAVKDYP GTPV +S+D+ RKK+KLRLDGII+TV++L C+RC +PAAE +FSNF+L Sbjct: 65 LRVTKAVKDYPQGTPVQISIDITRKKQKLRLDGIIKTVIALTCSRCEDPAAECIFSNFSL 124 Query: 684 
LLTEEPIEEAEVVDMGLIFGEDKSRSAVSXXXXXXXXXXXXXXXRLYFPAEEKEIDISKQ 863 LLT+EPIEE E+++MG+I+G+ + +S +LYF +KEIDISK Sbjct: 125 LLTDEPIEEPEIINMGVIYGD----TGISGQGEEDDEGTIDFEDQLYFRPGDKEIDISKH 180 Query: 864 IRDLVHIEITINAICDSSCKGLCLRCGANLNNSVCNCSKKQEQKEKSYGPLGNLREQLQ 1040 IRD+VH+EITI A C+ SCKGLCL CG NLN CNCSK +Q +K +GPLGNL++QLQ Sbjct: 181 IRDMVHLEITITATCNPSCKGLCLSCGKNLNTGSCNCSK--QQAKKGFGPLGNLKKQLQ 237 >ref|XP_007159477.1| hypothetical protein PHAVU_002G240600g [Phaseolus vulgaris] gi|561032892|gb|ESW31471.1| hypothetical protein PHAVU_002G240600g [Phaseolus vulgaris] Length = 315 Score = 304 bits (779), Expect = 5e-80 Identities = 147/238 (61%), Positives = 188/238 (78%) Frame = +3 Query: 327 DPIDFDWGDENMVGSPWEGAVVYRRDPSVTHLEYCTTLERLGLGKLSTEVSRSRASAMGM 506 + + FD DE GSPWEGAVVY+R+ S+ HLEYCTTLERLGL KLST+VS++RA+AMG+ Sbjct: 79 ESLGFD-DDEVDTGSPWEGAVVYKRNASILHLEYCTTLERLGLAKLSTDVSKTRAAAMGL 137 Query: 507 RVTKAVKDYPLGTPVLVSLDVCRKKRKLRLDGIIRTVLSLGCNRCGEPAAESVFSNFALL 686 RVTKAV+++P GTPV +S+DV RKK+KLRLDGII+TV++L CNRC P+AES+FS F+LL Sbjct: 138 RVTKAVREFPNGTPVQISIDVTRKKKKLRLDGIIKTVITLLCNRCCMPSAESIFSEFSLL 197 Query: 687 LTEEPIEEAEVVDMGLIFGEDKSRSAVSXXXXXXXXXXXXXXXRLYFPAEEKEIDISKQI 866 LTEEPIEE E +DMG+IFGEDK ++ + +LYFP++EK+IDISK I Sbjct: 198 LTEEPIEEPETIDMGVIFGEDKLTTSGNGGQDDDEDALIDLEDQLYFPSQEKQIDISKNI 257 Query: 867 RDLVHIEITINAICDSSCKGLCLRCGANLNNSVCNCSKKQEQKEKSYGPLGNLREQLQ 1040 RD VH+EIT+N++CD CKG+CL+CG N N C CS +E KEKSYGPLGNL+E++Q Sbjct: 258 RDRVHLEITMNSVCDPGCKGMCLKCGQNFNTGNCMCS-NEEVKEKSYGPLGNLKEKMQ 314 >gb|AGV54555.1| hypothetical protein [Phaseolus vulgaris] Length = 315 Score = 304 bits (779), Expect = 5e-80 Identities = 147/238 (61%), Positives = 188/238 (78%) Frame = +3 Query: 327 DPIDFDWGDENMVGSPWEGAVVYRRDPSVTHLEYCTTLERLGLGKLSTEVSRSRASAMGM 506 + + FD DE GSPWEGAVVY+R+ S+ HLEYCTTLERLGL KLST+VS++RA+AMG+ Sbjct: 79 ESLGFD-DDEVDTGSPWEGAVVYKRNASILHLEYCTTLERLGLAKLSTDVSKTRAAAMGL 137 Query: 507 RVTKAVKDYPLGTPVLVSLDVCRKKRKLRLDGIIRTVLSLGCNRCGEPAAESVFSNFALL 686 RVTKAV+++P GTPV +S+DV RKK+KLRLDGII+TV++L 
CNRC P+AES+FS F+LL Sbjct: 138 RVTKAVREFPNGTPVQISIDVTRKKKKLRLDGIIKTVITLLCNRCCMPSAESIFSEFSLL 197 Query: 687 LTEEPIEEAEVVDMGLIFGEDKSRSAVSXXXXXXXXXXXXXXXRLYFPAEEKEIDISKQI 866 LTEEPIEE E +DMG+IFGEDK ++ + +LYFP++EK+IDISK I Sbjct: 198 LTEEPIEEPETIDMGVIFGEDKLTTSGNSGQDDDEDALIDLEDQLYFPSQEKQIDISKNI 257 Query: 867 RDLVHIEITINAICDSSCKGLCLRCGANLNNSVCNCSKKQEQKEKSYGPLGNLREQLQ 1040 RD VH+EIT+N++CD CKG+CL+CG N N C CS +E KEKSYGPLGNL+E++Q Sbjct: 258 RDRVHLEITMNSVCDPGCKGMCLKCGQNFNTGNCMCS-NEEVKEKSYGPLGNLKEKMQ 314 >ref|XP_004307383.1| PREDICTED: uncharacterized protein LOC101294601 [Fragaria vesca subsp. vesca] Length = 324 Score = 300 bits (768), Expect = 9e-79 Identities = 151/268 (56%), Positives = 197/268 (73%), Gaps = 5/268 (1%) Frame = +3 Query: 252 PSDHHPSRSLNFTCRDATITNLDLEDPIDFDWGDENM-----VGSPWEGAVVYRRDPSVT 416 P+D S +N C ++ ED + D GD+ + SPWEGAVVY+R+ S+T Sbjct: 59 PNDILMSTVMN--CTKPNFQSITDEDTVFIDLGDQGNEDGEDIDSPWEGAVVYKRNASIT 116 Query: 417 HLEYCTTLERLGLGKLSTEVSRSRASAMGMRVTKAVKDYPLGTPVLVSLDVCRKKRKLRL 596 H+EYCTTLERLGLG LST VS+SRAS MG+RVTKAVKDYP GTPV +S+D+ R+K+KLRL Sbjct: 117 HVEYCTTLERLGLGNLSTTVSKSRASVMGLRVTKAVKDYPNGTPVQISIDITRRKQKLRL 176 Query: 597 DGIIRTVLSLGCNRCGEPAAESVFSNFALLLTEEPIEEAEVVDMGLIFGEDKSRSAVSXX 776 DGII+TV++L CNRCG+PAAES+FSNF+LLLT+EPIEE ++++MG+I+G D +++ Sbjct: 177 DGIIKTVITLTCNRCGDPAAESIFSNFSLLLTDEPIEEPDIINMGVIYG-DNAKTHTGFG 235 Query: 777 XXXXXXXXXXXXXRLYFPAEEKEIDISKQIRDLVHIEITINAICDSSCKGLCLRCGANLN 956 +LYF E+KEIDISK IRD VH+EITI+A C+ +CKGLCL CG NLN Sbjct: 236 GEENEDDSIDFEDQLYFRPEDKEIDISKHIRDSVHLEITISATCNPNCKGLCLNCGKNLN 295 Query: 957 NSVCNCSKKQEQKEKSYGPLGNLREQLQ 1040 S C C KQE K+ ++GPLGNL++Q+Q Sbjct: 296 TSNCICG-KQEVKKTTFGPLGNLKKQMQ 322 >ref|XP_003524967.1| PREDICTED: uncharacterized protein LOC100792185 isoform X1 [Glycine max] gi|571455764|ref|XP_006580175.1| PREDICTED: uncharacterized protein LOC100792185 isoform X2 [Glycine max] Length = 318 Score = 299 bits (766), Expect = 1e-78 Identities = 141/235 (60%), Positives = 186/235 (79%), Gaps = 3/235 (1%) Frame = +3 
Query: 345 WGDENMV---GSPWEGAVVYRRDPSVTHLEYCTTLERLGLGKLSTEVSRSRASAMGMRVT 515 W D+ V GSPWEGAV+Y+R+ ++ HLEYCTTLERLGL KLS++VS++RA+AMG+RVT Sbjct: 84 WDDDEEVEDMGSPWEGAVIYKRNATILHLEYCTTLERLGLAKLSSDVSKTRAAAMGLRVT 143 Query: 516 KAVKDYPLGTPVLVSLDVCRKKRKLRLDGIIRTVLSLGCNRCGEPAAESVFSNFALLLTE 695 KAVKD+P GTPV +S+DV RKK+KLRLDGII+TV++L CNRC P+AES+FS F+LLLT+ Sbjct: 144 KAVKDFPNGTPVQISIDVTRKKKKLRLDGIIKTVITLLCNRCCAPSAESIFSEFSLLLTD 203 Query: 696 EPIEEAEVVDMGLIFGEDKSRSAVSXXXXXXXXXXXXXXXRLYFPAEEKEIDISKQIRDL 875 EPIEE E +DMG+IFGEDK ++ + +LYFP ++++IDISK IRD Sbjct: 204 EPIEEPETIDMGVIFGEDKLTTSGNSGEDDDDDALIDMDDQLYFPPQQRQIDISKNIRDR 263 Query: 876 VHIEITINAICDSSCKGLCLRCGANLNNSVCNCSKKQEQKEKSYGPLGNLREQLQ 1040 VH+EIT+N++C CKG+CL+CG N N CNCS K+E +EKS+GPLGNL+E++Q Sbjct: 264 VHLEITMNSVCGPGCKGMCLKCGQNFNTGNCNCS-KEEVQEKSFGPLGNLKEKMQ 317 >ref|XP_003629796.1| hypothetical protein MTR_8g086630 [Medicago truncatula] gi|355523818|gb|AET04272.1| hypothetical protein MTR_8g086630 [Medicago truncatula] Length = 317 Score = 297 bits (761), Expect = 6e-78 Identities = 157/309 (50%), Positives = 205/309 (66%), Gaps = 18/309 (5%) Frame = +3 Query: 168 LCTNPSNLNFRLFNSNQ-SPFLPQLISATPSDH----------HPSRSLNFTCRDATITN 314 LC + N + F+ N +PFL + + + H + C+++ Sbjct: 10 LCFSQWNNSLTTFSCNSFTPFLHRTVPICKARHIGGVYFRTESNVLHKYVSKCKESGRDL 69 Query: 315 LDLEDPIDFDWGDENMV------GSPWEGAVVYRRDPSVTHLEYCTTLERLGLGKLSTEV 476 E FDWGDE G PWEGAV+Y+R+ S+ HLEYCTTLERLGLG LST+V Sbjct: 70 YTEEGTTSFDWGDEEEEEIDEDEGLPWEGAVIYKRNASILHLEYCTTLERLGLGNLSTDV 129 Query: 477 SRSRASAMGMRVTKAVKDYPLGTPVLVSLDVCRKKRKLRLDGIIRTVLSLGCNRCGEPAA 656 S+++AS MG+R+TKAVKD+P GTP+ +S+DV RKK+KLRLDGII+TVL+L CNRC P+A Sbjct: 130 SKNKASVMGLRITKAVKDFPNGTPIQISIDVTRKKKKLRLDGIIKTVLTLVCNRCCMPSA 189 Query: 657 ESVFSNFALLLTEE-PIEEAEVVDMGLIFGEDKSRSAVSXXXXXXXXXXXXXXXRLYFPA 833 ES+FS F+LLLTEE P+ E E +D G+IFGEDK + +LYFP Sbjct: 190 ESIFSEFSLLLTEEPPVNEPETMDFGVIFGEDKI-PTLGKSGDDDEDALIDLDDQLYFPP 248 Query: 834 EEKEIDISKQIRDLVHIEITINAICDSSCKGLCLRCGANLNNSVCNCSKKQEQKEKSYGP 1013 EEK+IDISK IRD 
VH+EIT+N++CDS CKG+CL+CG N N C+CS K+E KE+S+GP Sbjct: 249 EEKQIDISKNIRDRVHLEITMNSVCDSGCKGVCLKCGQNFNTGNCSCS-KEEVKEESFGP 307 Query: 1014 LGNLREQLQ 1040 L NLREQ+Q Sbjct: 308 LRNLREQMQ 316