BLASTX nr result
ID: Cocculus23_contig00033450
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00033450
         (1131 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_006486394.1| PREDICTED: uncharacterized protein LOC102626...   378   e-102
ref|XP_007009265.1| HAT and BED zinc finger domain-containing pr...   371   e-100
ref|XP_007163431.1| hypothetical protein PHAVU_001G234100g [Phas...   367   6e-99
ref|XP_007163430.1| hypothetical protein PHAVU_001G234100g [Phas...   367   6e-99
gb|ADN34075.1| DNA binding protein [Cucumis melo subsp. melo]         363   9e-98
ref|XP_002524204.1| DNA binding protein, putative [Ricinus commu...   363   9e-98
ref|XP_003552872.1| PREDICTED: uncharacterized protein LOC100806...   360   6e-97
ref|XP_004169404.1| PREDICTED: uncharacterized protein LOC101226...   358   2e-96
ref|XP_006346820.1| PREDICTED: uncharacterized protein LOC102591...   357   7e-96
ref|XP_003538417.1| PREDICTED: uncharacterized protein LOC100817...   353   7e-95
ref|XP_004307479.1| PREDICTED: uncharacterized protein LOC101302...   351   3e-94
ref|XP_004240774.1| PREDICTED: uncharacterized protein LOC101254...   347   4e-93
ref|XP_002316272.2| hypothetical protein POPTR_0010s20835g [Popu...   335   3e-89
ref|XP_007049027.1| HAT transposon superfamily, putative [Theobr...   325   2e-86
ref|XP_006651967.1| PREDICTED: uncharacterized protein LOC102714...   293   7e-77
ref|NP_001154234.1| hAT transposon superfamily [Arabidopsis thal...   292   2e-76
gb|AAM98154.1| putative protein [Arabidopsis thaliana]                292   2e-76
gb|EAY92386.1| hypothetical protein OsI_14116 [Oryza sativa Indi...   289   2e-75
ref|NP_001051738.1| Os03g0822900 [Oryza sativa Japonica Group] g...
289 2e-75 ref|XP_006380932.1| hypothetical protein POPTR_0006s02210g [Popu... 285 2e-74 >ref|XP_006486394.1| PREDICTED: uncharacterized protein LOC102626522 [Citrus sinensis] Length = 745 Score = 378 bits (971), Expect = e-102 Identities = 203/376 (53%), Positives = 253/376 (67%) Frame = +3 Query: 3 RIKEHLACQKGNASCCQRVPPDVRDTMQQSLEGVSVKKRKKQKIAEEIKSITPGKDEMDA 182 RIKEHLACQKGNAS C RVP DVR MQQSL+GV VKK+KKQKIAEEI + P E+ A Sbjct: 49 RIKEHLACQKGNASTCSRVPLDVRLAMQQSLDGVVVKKKKKQKIAEEITNNNPTFGEVYA 108 Query: 183 FGTQPDVNTGLQLLVTSTPNVLEPDTGMFMRRDXXXXXXXXXXXXXXXIENVSPSSIIPD 362 F Q DV GL LL S N E + + + RD +N S ++ Sbjct: 109 FTDQGDVTPGLPLLDDS--NTPEACSNLVVSRDVISNTTGDKRKRWRG-KNSSVNAYT-- 163 Query: 363 GNMIPISELTSRSIKEKDQVHMAIARFLYDVGASLDAVNSPYFQPMIDAIALRGPGVEAI 542 G MI S +R + + MA+ RFLYD+GA LDAVNS YFQPM+DAIA GP Sbjct: 164 GAMISASLDATRG---NNPIFMAVGRFLYDIGAPLDAVNSEYFQPMVDAIASGGPEAAMP 220 Query: 543 SYHDLRGWILKNSIEEVNGFFNRCKATWGRAGCSILVDEWITEMGRVLLNFFVHCSEGIV 722 SYHD+RGWILKNS+EEV +R TWG+ GCSILVD+W TE GR LL F +C EG V Sbjct: 221 SYHDIRGWILKNSVEEVKNDVDRYTTTWGKTGCSILVDQWNTEAGRTLLCFLAYCPEGTV 280 Query: 723 FLKSIDVSDILRSSDALYELLKXXXXXXXXSNVLQVITDTAEHYVIAGKRLTATFRTIYW 902 FLKS+D S I+ SSDALYELLK +VLQVIT + E ++ AG+RLT TF T+YW Sbjct: 281 FLKSVDASGIMNSSDALYELLKQVVEEVGVRHVLQVITSSEEQFIAAGRRLTDTFPTLYW 340 Query: 903 TPCAARSINSMLDDIGKVEWISVTLDSAKSITRFIYNHGVVLNMMRRYTGGRDLVQPTIS 1082 TPCAAR ++ +L+D K+EWI+ ++ A+++TRF+YNH VVLNM+RRYT G D+V+P I+ Sbjct: 341 TPCAARCLDLILEDFAKLEWINAIIEQARAVTRFVYNHSVVLNMLRRYTFGNDIVEPGIT 400 Query: 1083 RFATDFVTLKSMVNLK 1130 R AT+F TL+ M++LK Sbjct: 401 RSATNFTTLRRMISLK 416 >ref|XP_007009265.1| HAT and BED zinc finger domain-containing protein, putative [Theobroma cacao] gi|508726178|gb|EOY18075.1| HAT and BED zinc finger domain-containing protein, putative [Theobroma cacao] Length = 749 Score = 371 bits (953), Expect = e-100 Identities = 190/376 (50%), Positives = 251/376 (66%) Frame = +3 Query: 3 RIKEHLACQKGNASCCQRVPPDVRDTMQQSLEGVSVKKRKKQKIAEEIKSITPGKDEMDA 182 RIKEHLA 
QKGNAS C VP DVR M++SL+GV VKKRKKQKIAEE+ + E+D Sbjct: 49 RIKEHLAGQKGNASTCFHVPSDVRLLMRESLDGVEVKKRKKQKIAEEMSNANQVSSEIDT 108 Query: 183 FGTQPDVNTGLQLLVTSTPNVLEPDTGMFMRRDXXXXXXXXXXXXXXXIENVSPSSIIPD 362 + Q D NTGL L+ P+ L+P + + + R+ + S + Sbjct: 109 YDNQVDTNTGL--LMIEGPDTLQPSSSLLVNREGTSNVSGDRR------KRGKGKSSAAE 160 Query: 363 GNMIPISELTSRSIKEKDQVHMAIARFLYDVGASLDAVNSPYFQPMIDAIALRGPGVEAI 542 N + ++ + + + + VH+AI RFL+D+GA LDAVNS YFQPM+DAI G GV Sbjct: 161 SNALVVNTVGLGAKRVNNHVHVAIGRFLFDIGAPLDAVNSVYFQPMVDAIISGGSGVLMP 220 Query: 543 SYHDLRGWILKNSIEEVNGFFNRCKATWGRAGCSILVDEWITEMGRVLLNFFVHCSEGIV 722 S DL+GWILK S+EEV ++ A W R GCSILV++W T+ GR+LLNF V+C EG V Sbjct: 221 SCSDLQGWILKKSVEEVKSDNDKVTAAWVRTGCSILVNQWNTQTGRILLNFLVYCPEGTV 280 Query: 723 FLKSIDVSDILRSSDALYELLKXXXXXXXXSNVLQVITDTAEHYVIAGKRLTATFRTIYW 902 FLKS+D S ++ SSDALYELLK +VLQVIT+ E Y++AG+RL TF T+YW Sbjct: 281 FLKSVDASSVINSSDALYELLKQVVEEVGSKHVLQVITNAEEQYIVAGRRLAETFPTLYW 340 Query: 903 TPCAARSINSMLDDIGKVEWISVTLDSAKSITRFIYNHGVVLNMMRRYTGGRDLVQPTIS 1082 TPCAA IN +L+D K+EWI+V ++ A+SITRF+YNH VVLNM+RRYT G D+V+P ++ Sbjct: 341 TPCAAHCINLILEDFAKLEWINVIIEQARSITRFVYNHSVVLNMVRRYTLGNDIVEPAVT 400 Query: 1083 RFATDFVTLKSMVNLK 1130 AT+F TLK M++LK Sbjct: 401 CSATNFTTLKQMIDLK 416 >ref|XP_007163431.1| hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris] gi|561036895|gb|ESW35425.1| hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris] Length = 756 Score = 367 bits (941), Expect = 6e-99 Identities = 192/381 (50%), Positives = 250/381 (65%), Gaps = 5/381 (1%) Frame = +3 Query: 3 RIKEHLACQKGNASCCQRVPPDVRDTMQQSLEGVSVKKRKKQKIAEEIKSITPGKDEMDA 182 RIKEHLACQKGNAS C RVP DVR MQQSL+GV VKKR+KQKI EEI S+ P +++ Sbjct: 49 RIKEHLACQKGNASTCSRVPHDVRLHMQQSLDGVVVKKRRKQKIEEEIMSVNPLTTVVNS 108 Query: 183 F--GTQPDVNTGLQLLVT--STPNVLEPDTGMFMRRDXXXXXXXXXXXXXXXIENVSPSS 350 Q DVN GLQ + ++ V+ P GM + + +P++ Sbjct: 109 LPNNNQVDVNQGLQAIGVDHNSSLVVNPGEGMSKNMERRKKMRA----------SKNPAA 158 Query: 351 IIPDGNMIPISELTSRSIKEKDQ-VHMAIARFLYDVGASLDAVNSPYFQPMIDAIALRGP 527 I + + E K D 
+HMAI RFLYD+GA DAVNS YF M+DAI+ RG Sbjct: 159 IYANSEGVVAVEKNGLFPKRVDNHIHMAIGRFLYDIGAPFDAVNSVYFHEMVDAISSRGA 218 Query: 528 GVEAISYHDLRGWILKNSIEEVNGFFNRCKATWGRAGCSILVDEWITEMGRVLLNFFVHC 707 G E S+H+LRGWILKNS+EEV +RCK TWGR GCSILVD+W TE GRVL++F +C Sbjct: 219 GFERPSHHELRGWILKNSVEEVKNDIDRCKMTWGRTGCSILVDQWATETGRVLISFLAYC 278 Query: 708 SEGIVFLKSIDVSDILRSSDALYELLKXXXXXXXXSNVLQVITDTAEHYVIAGKRLTATF 887 EG+VFLKS+D ++I S+D LY+++K VLQVIT E Y +AG+RLT TF Sbjct: 279 PEGVVFLKSMDATEISTSADFLYDMIKQVVDEVGVGQVLQVITSGEEQYAVAGRRLTDTF 338 Query: 888 RTIYWTPCAARSINSMLDDIGKVEWISVTLDSAKSITRFIYNHGVVLNMMRRYTGGRDLV 1067 T+YW+P AA I+ +L+D G +EWIS ++ AKS+TRF+YN+ +L M++RYT G D+V Sbjct: 339 PTLYWSPSAAHCIDFILEDFGNLEWISAVIEQAKSVTRFVYNYSAILIMVKRYTLGNDIV 398 Query: 1068 QPTISRFATDFVTLKSMVNLK 1130 P+ S+FAT+F TLK MV+LK Sbjct: 399 DPSFSQFATNFTTLKRMVDLK 419 >ref|XP_007163430.1| hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris] gi|561036894|gb|ESW35424.1| hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris] Length = 869 Score = 367 bits (941), Expect = 6e-99 Identities = 192/381 (50%), Positives = 250/381 (65%), Gaps = 5/381 (1%) Frame = +3 Query: 3 RIKEHLACQKGNASCCQRVPPDVRDTMQQSLEGVSVKKRKKQKIAEEIKSITPGKDEMDA 182 RIKEHLACQKGNAS C RVP DVR MQQSL+GV VKKR+KQKI EEI S+ P +++ Sbjct: 162 RIKEHLACQKGNASTCSRVPHDVRLHMQQSLDGVVVKKRRKQKIEEEIMSVNPLTTVVNS 221 Query: 183 F--GTQPDVNTGLQLLVT--STPNVLEPDTGMFMRRDXXXXXXXXXXXXXXXIENVSPSS 350 Q DVN GLQ + ++ V+ P GM + + +P++ Sbjct: 222 LPNNNQVDVNQGLQAIGVDHNSSLVVNPGEGMSKNMERRKKMRA----------SKNPAA 271 Query: 351 IIPDGNMIPISELTSRSIKEKDQ-VHMAIARFLYDVGASLDAVNSPYFQPMIDAIALRGP 527 I + + E K D +HMAI RFLYD+GA DAVNS YF M+DAI+ RG Sbjct: 272 IYANSEGVVAVEKNGLFPKRVDNHIHMAIGRFLYDIGAPFDAVNSVYFHEMVDAISSRGA 331 Query: 528 GVEAISYHDLRGWILKNSIEEVNGFFNRCKATWGRAGCSILVDEWITEMGRVLLNFFVHC 707 G E S+H+LRGWILKNS+EEV +RCK TWGR GCSILVD+W TE GRVL++F +C Sbjct: 332 GFERPSHHELRGWILKNSVEEVKNDIDRCKMTWGRTGCSILVDQWATETGRVLISFLAYC 391 Query: 708 SEGIVFLKSIDVSDILRSSDALYELLKXXXXXXXXSNVLQVITDTAEHYVIAGKRLTATF 
887 EG+VFLKS+D ++I S+D LY+++K VLQVIT E Y +AG+RLT TF Sbjct: 392 PEGVVFLKSMDATEISTSADFLYDMIKQVVDEVGVGQVLQVITSGEEQYAVAGRRLTDTF 451 Query: 888 RTIYWTPCAARSINSMLDDIGKVEWISVTLDSAKSITRFIYNHGVVLNMMRRYTGGRDLV 1067 T+YW+P AA I+ +L+D G +EWIS ++ AKS+TRF+YN+ +L M++RYT G D+V Sbjct: 452 PTLYWSPSAAHCIDFILEDFGNLEWISAVIEQAKSVTRFVYNYSAILIMVKRYTLGNDIV 511 Query: 1068 QPTISRFATDFVTLKSMVNLK 1130 P+ S+FAT+F TLK MV+LK Sbjct: 512 DPSFSQFATNFTTLKRMVDLK 532 >gb|ADN34075.1| DNA binding protein [Cucumis melo subsp. melo] Length = 752 Score = 363 bits (931), Expect = 9e-98 Identities = 181/376 (48%), Positives = 259/376 (68%) Frame = +3 Query: 3 RIKEHLACQKGNASCCQRVPPDVRDTMQQSLEGVSVKKRKKQKIAEEIKSITPGKDEMDA 182 RIKEHLA QKGNAS C VPP+V++ MQ+SL+GV +KKRK+QK+ EE+ ++ E+DA Sbjct: 49 RIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVNAMTAEVDA 108 Query: 183 FGTQPDVNTGLQLLVTSTPNVLEPDTGMFMRRDXXXXXXXXXXXXXXXIENVSPSSIIPD 362 D+++ + L+ + P L+ ++ + + + + S S + + Sbjct: 109 ISNHMDMDSSIHLIEVAEP--LDTNSALLLTHEEGTSNKVGRKKGS---KGKSSSCLDRE 163 Query: 363 GNMIPISELTSRSIKEKDQVHMAIARFLYDVGASLDAVNSPYFQPMIDAIALRGPGVEAI 542 +IP S ++++QVHMAI RFLYD+GASL+AVNS YFQPMI++IAL G G+ Sbjct: 164 MIVIPNGGGILDSNRDRNQVHMAIGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPP 223 Query: 543 SYHDLRGWILKNSIEEVNGFFNRCKATWGRAGCSILVDEWITEMGRVLLNFFVHCSEGIV 722 SYHD+RGWILKNS+EEV G F+RCKATWG GCS++VD+W TE GR +LNF V+C +G V Sbjct: 224 SYHDIRGWILKNSVEEVRGDFDRCKATWGMTGCSVMVDQWCTEAGRTMLNFLVYCPKGTV 283 Query: 723 FLKSIDVSDILRSSDALYELLKXXXXXXXXSNVLQVITDTAEHYVIAGKRLTATFRTIYW 902 FL+S+D S I+ S D LYELLK +V+QVIT E++ IAG++L+ T+ T+YW Sbjct: 284 FLESVDASGIMDSPDLLYELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYW 343 Query: 903 TPCAARSINSMLDDIGKVEWISVTLDSAKSITRFIYNHGVVLNMMRRYTGGRDLVQPTIS 1082 TPCAA ++ +L DIG +E ++ ++ A+SITRF+YN+ +VLNM+R+ T G D+V+P ++ Sbjct: 344 TPCAASCVDLILADIGNIEDVNTVIEQARSITRFVYNNSMVLNMVRKCTFGNDIVEPCLT 403 Query: 1083 RFATDFVTLKSMVNLK 1130 R AT+F TL MV+LK Sbjct: 404 RSATNFATLNRMVDLK 419 >ref|XP_002524204.1| DNA binding protein, putative [Ricinus communis] 
gi|223536481|gb|EEF38128.1| DNA binding protein, putative [Ricinus communis] Length = 753 Score = 363 bits (931), Expect = 9e-98 Identities = 197/381 (51%), Positives = 256/381 (67%), Gaps = 5/381 (1%) Frame = +3 Query: 3 RIKEHLACQKGNASCCQRVPPDVRDTMQQSLEGVSVKKRKKQKIAEEIKSITP--GKDEM 176 RIKEHLA QKGNAS C +VP DV+ MQQSL+GV VKKRKKQKIAEEI ++ P G E+ Sbjct: 50 RIKEHLAGQKGNASTCLQVPTDVKLIMQQSLDGVVVKKRKKQKIAEEITNLNPVIGGGEI 109 Query: 177 DAFGT-QPDVNTGLQLLVTSTPNVLEPDTGMFMRRDXXXXXXXXXXXXXXXIEN--VSPS 347 + F Q +V+TG++L+ S NV+EP + + + + + + Sbjct: 110 EVFANDQIEVSTGMELIGVS--NVIEPSSSLLISGQEGKANKGGERRKRGRSKGSGANAN 167 Query: 348 SIIPDGNMIPISELTSRSIKEKDQVHMAIARFLYDVGASLDAVNSPYFQPMIDAIALRGP 527 +I+ + + + + + D VHMAI RFLYD+GA LDAVNS YFQPM+DAIA G Sbjct: 168 AIVSMNS----NRMALGAKRVNDHVHMAIGRFLYDIGAPLDAVNSVYFQPMVDAIASGGL 223 Query: 528 GVEAISYHDLRGWILKNSIEEVNGFFNRCKATWGRAGCSILVDEWITEMGRVLLNFFVHC 707 V S HDLRGWILKNS+EEV ++ ATW R GCS+LVD+W T MGR LL+F V+C Sbjct: 224 DVGMPSCHDLRGWILKNSVEEVKTEVDKHMATWARTGCSVLVDQWNTLMGRTLLSFLVYC 283 Query: 708 SEGIVFLKSIDVSDILRSSDALYELLKXXXXXXXXSNVLQVITDTAEHYVIAGKRLTATF 887 SEG+VFLKS+D SDI+ SSDALYEL+K +VLQVIT E Y++ G+RLT TF Sbjct: 284 SEGVVFLKSVDASDIINSSDALYELIKKVVEEVGVRHVLQVITSMEEQYIVVGRRLTDTF 343 Query: 888 RTIYWTPCAARSINSMLDDIGKVEWISVTLDSAKSITRFIYNHGVVLNMMRRYTGGRDLV 1067 T+Y PCAA I+ +L+D K+EWIS + A+SITRF+YNH VVLNM++RYT G ++V Sbjct: 344 PTLYRAPCAAHCIDLILEDFAKLEWISTVILQARSITRFVYNHSVVLNMVKRYTFGSEIV 403 Query: 1068 QPTISRFATDFVTLKSMVNLK 1130 ++ FAT+F TLK MV+LK Sbjct: 404 ATGLTHFATNFETLKRMVDLK 424 >ref|XP_003552872.1| PREDICTED: uncharacterized protein LOC100806265 isoform X1 [Glycine max] gi|571542833|ref|XP_006601996.1| PREDICTED: uncharacterized protein LOC100806265 isoform X2 [Glycine max] Length = 758 Score = 360 bits (924), Expect = 6e-97 Identities = 188/380 (49%), Positives = 247/380 (65%), Gaps = 4/380 (1%) Frame = +3 Query: 3 RIKEHLACQKGNASCCQRVPPDVRDTMQQSLEGVSVKKRKKQKIAEEIKSITPGKDEMDA 182 RIKEHLACQKGNAS C RVP DVR MQQSL+GV VKKR+KQ+I EEI S+ P +++ Sbjct: 49 
RIKEHLACQKGNASTCSRVPHDVRLHMQQSLDGVVVKKRRKQRIEEEIMSVNPLTTVVNS 108 Query: 183 FGTQP---DVNTGLQLLVTSTPNVLEPDTGMFMRRDXXXXXXXXXXXXXXXIENVSPSSI 353 DVN GLQ + + L + G M R+ +P+++ Sbjct: 109 LPNNNQVVDVNQGLQAIGVEHNSTLVVNPGEGMSRNMERRKKMRAAK--------NPAAV 160 Query: 354 IPDGNMIPISELTSRSIKEKDQ-VHMAIARFLYDVGASLDAVNSPYFQPMIDAIALRGPG 530 + + E K+ D ++MAI RFLYD+GA DAVN +FQ M+DAIA +G G Sbjct: 161 YANSEDVVAVEKNGLFPKKMDNHIYMAIGRFLYDIGAPFDAVNLVFFQEMVDAIASKGTG 220 Query: 531 VEAISYHDLRGWILKNSIEEVNGFFNRCKATWGRAGCSILVDEWITEMGRVLLNFFVHCS 710 E S+H+LRGWILKNS+EEV +RCK TWGR GCSILVD+W TE R+L++F +C Sbjct: 221 FERPSHHELRGWILKNSVEEVKNDIDRCKMTWGRTGCSILVDQWTTETSRILISFLAYCP 280 Query: 711 EGIVFLKSIDVSDILRSSDALYELLKXXXXXXXXSNVLQVITDTAEHYVIAGKRLTATFR 890 EG+VFLKS+D ++IL S D LY+L+K V+QVIT E Y IAG+RL TF Sbjct: 281 EGLVFLKSLDATEILTSPDFLYDLIKQVVEEIGVGKVVQVITSGEEQYGIAGRRLMDTFP 340 Query: 891 TIYWTPCAARSINSMLDDIGKVEWISVTLDSAKSITRFIYNHGVVLNMMRRYTGGRDLVQ 1070 T+YW+P AA I+ +L+D G +EWIS ++ AKS+TRF+YN+ +LNM++RYT G D+V Sbjct: 341 TLYWSPSAAHCIDLILEDFGNLEWISAVIEQAKSVTRFVYNYSAILNMVKRYTLGNDIVD 400 Query: 1071 PTISRFATDFVTLKSMVNLK 1130 P+ SRFAT+F TLK MV+LK Sbjct: 401 PSFSRFATNFTTLKRMVDLK 420 >ref|XP_004169404.1| PREDICTED: uncharacterized protein LOC101226173 [Cucumis sativus] Length = 752 Score = 358 bits (920), Expect = 2e-96 Identities = 179/376 (47%), Positives = 257/376 (68%) Frame = +3 Query: 3 RIKEHLACQKGNASCCQRVPPDVRDTMQQSLEGVSVKKRKKQKIAEEIKSITPGKDEMDA 182 RIKEHLA QKGNAS C VPP+V++ MQ+SL+GV +KKRK+QK+ EE+ ++ E+D Sbjct: 49 RIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNVNTMTGEVDG 108 Query: 183 FGTQPDVNTGLQLLVTSTPNVLEPDTGMFMRRDXXXXXXXXXXXXXXXIENVSPSSIIPD 362 D+++ + L+ + P LE ++ + + + + S S + + Sbjct: 109 ISNHMDMDSSIHLIEVAEP--LETNSVLLLTHEKGTSNKVGRKKGS---KGKSSSCLERE 163 Query: 363 GNMIPISELTSRSIKEKDQVHMAIARFLYDVGASLDAVNSPYFQPMIDAIALRGPGVEAI 542 +IP S ++++QVHMA+ RFLYD+GASL+AVNS YFQPMI++IAL G G+ Sbjct: 164 MIVIPNGGGILDSNRDRNQVHMAVGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPP 223 Query: 543 
SYHDLRGWILKNSIEEVNGFFNRCKATWGRAGCSILVDEWITEMGRVLLNFFVHCSEGIV 722 SYHD+RGWILKNS+EEV F+RCKATWG GCS++VD+W TE GR +LNF V+C +G V Sbjct: 224 SYHDIRGWILKNSMEEVRSDFDRCKATWGITGCSVMVDQWCTEAGRTMLNFLVYCPKGTV 283 Query: 723 FLKSIDVSDILRSSDALYELLKXXXXXXXXSNVLQVITDTAEHYVIAGKRLTATFRTIYW 902 FL+S+D S I+ S D LYELLK +V+QVIT E++ IAG++L+ T+ T+YW Sbjct: 284 FLESVDASGIMDSPDLLYELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYW 343 Query: 903 TPCAARSINSMLDDIGKVEWISVTLDSAKSITRFIYNHGVVLNMMRRYTGGRDLVQPTIS 1082 TPCAA ++ +L DIG +E ++ ++ A+SITRF+YN+ +VLNM+R+ T G D+V+P ++ Sbjct: 344 TPCAASCVDLILGDIGNIEGVNTVIEQARSITRFVYNNSMVLNMVRKCTFGNDIVEPCLT 403 Query: 1083 RFATDFVTLKSMVNLK 1130 R AT+F TL MV+LK Sbjct: 404 RSATNFATLNRMVDLK 419 >ref|XP_006346820.1| PREDICTED: uncharacterized protein LOC102591442 [Solanum tuberosum] Length = 755 Score = 357 bits (915), Expect = 7e-96 Identities = 187/377 (49%), Positives = 246/377 (65%), Gaps = 1/377 (0%) Frame = +3 Query: 3 RIKEHLACQKGNASCCQRVPPDVRDTMQQSLEGVSVKKRKKQKIAEEIKSITPGKDEMDA 182 RIKEHLA QKGNAS C RV PDVR MQ SL GV +KKRKKQK+AEEI + G D Sbjct: 49 RIKEHLAGQKGNASTCLRVQPDVRLLMQDSLNGVVMKKRKKQKLAEEITTYNAGTATSDI 108 Query: 183 FGTQPDV-NTGLQLLVTSTPNVLEPDTGMFMRRDXXXXXXXXXXXXXXXIENVSPSSIIP 359 D Q+ + P +E + +F+ RD + S S+ Sbjct: 109 AAEFTDTCGLDTQVDLLPMPQAIEHTSNLFLNRDQGPNNIGARKKKSRIRKGASSSN--N 166 Query: 360 DGNMIPISELTSRSIKEKDQVHMAIARFLYDVGASLDAVNSPYFQPMIDAIALRGPGVEA 539 + ++PI++ S + + VHMA+ARFL D LDAVNS YFQPMID IA +GP V A Sbjct: 167 NAMLLPINQ----SKRVNNHVHMAVARFLLDARVPLDAVNSVYFQPMIDVIASQGPQVSA 222 Query: 540 ISYHDLRGWILKNSIEEVNGFFNRCKATWGRAGCSILVDEWITEMGRVLLNFFVHCSEGI 719 SYH+LR W+LK S++EV ++C +TW R+GCS+LVDEWIT G+ LLNF V+C EG Sbjct: 223 PSYHELRSWVLKASVQEVRNDIDQCSSTWARSGCSVLVDEWITGKGKTLLNFLVYCPEGT 282 Query: 720 VFLKSIDVSDILRSSDALYELLKXXXXXXXXSNVLQVITDTAEHYVIAGKRLTATFRTIY 899 +FL+S+D S ++ S+D LYELLK NVLQV+T E Y+IAGKRLT + T++ Sbjct: 283 MFLRSVDASTLINSTDYLYELLKEVVEEVGVRNVLQVVTSNEERYIIAGKRLTDAYPTLF 342 Query: 900 WTPCAARSINSMLDDIGKVEWISVTLDSAKSITRFIYNHGVVLNMMRRYTGGRDLVQPTI 1079 
WTPCAA SI+ ML+D+ K+EWI ++ AKSI+RFIYN+ ++L+MMR++T G DLV + Sbjct: 343 WTPCAAHSIDLMLEDLKKLEWIDTIMEQAKSISRFIYNNNILLSMMRKFTLGVDLVDLGV 402 Query: 1080 SRFATDFVTLKSMVNLK 1130 +R ATDF+TLK MVN+K Sbjct: 403 TRSATDFLTLKRMVNIK 419 >ref|XP_003538417.1| PREDICTED: uncharacterized protein LOC100817502 isoform X1 [Glycine max] gi|571489936|ref|XP_006591345.1| PREDICTED: uncharacterized protein LOC100817502 isoform X2 [Glycine max] gi|571489939|ref|XP_006591346.1| PREDICTED: uncharacterized protein LOC100817502 isoform X3 [Glycine max] Length = 759 Score = 353 bits (906), Expect = 7e-95 Identities = 186/381 (48%), Positives = 247/381 (64%), Gaps = 5/381 (1%) Frame = +3 Query: 3 RIKEHLACQKGNASCCQRVPPDVRDTMQQSLEGVSVKKRKKQKIAEEIKSITPGKDEMDA 182 RIKEHLACQKGNAS C RVP DVR MQQSL+GV VKKR+KQ+I EEI S+ P +++ Sbjct: 49 RIKEHLACQKGNASTCSRVPHDVRLHMQQSLDGVVVKKRRKQRIEEEIMSVNPLTTVVNS 108 Query: 183 FGTQP----DVNTGLQLLVTSTPNVLEPDTGMFMRRDXXXXXXXXXXXXXXXIENVSPSS 350 DVN GLQ + + L + G M R+ +P++ Sbjct: 109 LPNNNNRVVDVNQGLQAIGVEHNSSLVVNPGEGMSRNMERRKKMRATK--------NPAA 160 Query: 351 IIPDGNMIPISELTSRSIKEKDQ-VHMAIARFLYDVGASLDAVNSPYFQPMIDAIALRGP 527 + + + E K+ D ++MAI RFLYD+GA DAVNS YFQ M+DAIA RG Sbjct: 161 VYANSEGVIAVEKNGLFPKKMDNHIYMAIGRFLYDIGAPFDAVNSVYFQEMVDAIASRGV 220 Query: 528 GVEAISYHDLRGWILKNSIEEVNGFFNRCKATWGRAGCSILVDEWITEMGRVLLNFFVHC 707 G E +H+LRGWILKNS+EEV +RCK TWGR GCSILVD+W TE G++L++F +C Sbjct: 221 GFERPWHHELRGWILKNSVEEVKNDIDRCKMTWGRTGCSILVDQWTTETGKILISFLAYC 280 Query: 708 SEGIVFLKSIDVSDILRSSDALYELLKXXXXXXXXSNVLQVITDTAEHYVIAGKRLTATF 887 EG+VFL+S+D ++I S+D LY+L+K V+QVIT E Y IAG+RLT TF Sbjct: 281 PEGLVFLRSLDATEISTSADFLYDLIKQVVEEVGAGQVVQVITSGEEQYGIAGRRLTDTF 340 Query: 888 RTIYWTPCAARSINSMLDDIGKVEWISVTLDSAKSITRFIYNHGVVLNMMRRYTGGRDLV 1067 T+Y +P AA I+ +L+D G +EWIS ++ A+S+TRF+YN+ +LNM++RYT G D+V Sbjct: 341 PTLYLSPSAAHCIDLILEDFGNLEWISAVIEQARSVTRFVYNYSAILNMVKRYTLGNDIV 400 Query: 1068 QPTISRFATDFVTLKSMVNLK 1130 P+ S FAT+F TLK MV+LK Sbjct: 401 DPSFSHFATNFTTLKRMVDLK 421 >ref|XP_004307479.1| PREDICTED: 
uncharacterized protein LOC101302111 [Fragaria vesca subsp. vesca] Length = 754 Score = 351 bits (901), Expect = 3e-94 Identities = 187/378 (49%), Positives = 252/378 (66%), Gaps = 2/378 (0%) Frame = +3 Query: 3 RIKEHLACQKGNASCCQRVPPDVRDTMQQSLEGVSVKKRKKQKIAEEIKSITPGKD-EMD 179 RIKEHLA QKGNAS C RVPPDVR MQQSL+GV VKKR +QK+ EEI +ITP +D ++D Sbjct: 45 RIKEHLAGQKGNASTCLRVPPDVRGLMQQSLDGVVVKKRNRQKLDEEITNITPPQDGDVD 104 Query: 180 AFG-TQPDVNTGLQLLVTSTPNVLEPDTGMFMRRDXXXXXXXXXXXXXXXIENVSPSSII 356 + G TQ DVN +QL+ S +EP + + + R+ ++ S + Sbjct: 105 SLGGTQSDVNNAVQLVGVS----VEPISRLLVNREGVTSVRSMDRRKRGRGKSSWSSHGV 160 Query: 357 PDGNMIPISELTSRSIKEKDQVHMAIARFLYDVGASLDAVNSPYFQPMIDAIALRGPGVE 536 + L SR + VH AI RFL+D+GA +AVNS YFQPMIDAIA GPG+E Sbjct: 161 H--GVCNGGALVSRKVNS--YVHEAIGRFLFDIGAPPEAVNSAYFQPMIDAIASGGPGME 216 Query: 537 AISYHDLRGWILKNSIEEVNGFFNRCKATWGRAGCSILVDEWITEMGRVLLNFFVHCSEG 716 + HDLR WILKNS+EE ++ +ATWGR GCSILVD+W TE+ V+L+F V+ EG Sbjct: 217 PPTCHDLRSWILKNSVEEARNNIDKHRATWGRTGCSILVDQWNTELDNVMLSFLVYSPEG 276 Query: 717 IVFLKSIDVSDILRSSDALYELLKXXXXXXXXSNVLQVITDTAEHYVIAGKRLTATFRTI 896 VFL+S+D S I+ SSDALY+LL+ +V+QVIT E +V+AG+RL TF + Sbjct: 277 TVFLESVDASAIINSSDALYDLLRRVVEDVGVGDVVQVITSGEEQFVVAGRRLADTFPNL 336 Query: 897 YWTPCAARSINSMLDDIGKVEWISVTLDSAKSITRFIYNHGVVLNMMRRYTGGRDLVQPT 1076 +W PCAAR ++ +L+D G ++WI ++ A+SIT+F+YNH VVLN++RR T G D+V+P Sbjct: 337 FWIPCAARCLDLILEDFGSLDWIHAVIEQARSITKFVYNHNVVLNLVRRSTFGNDIVEPG 396 Query: 1077 ISRFATDFVTLKSMVNLK 1130 ++RF T F TLK +V+LK Sbjct: 397 VTRFGTSFTTLKRLVDLK 414 >ref|XP_004240774.1| PREDICTED: uncharacterized protein LOC101254391 [Solanum lycopersicum] Length = 748 Score = 347 bits (891), Expect = 4e-93 Identities = 191/382 (50%), Positives = 246/382 (64%), Gaps = 6/382 (1%) Frame = +3 Query: 3 RIKEHLACQKGNASCCQRVPPDVRDTMQQSLEGVSVKKRKKQKIAEEIKSITP--GKDEM 176 RIKEHLA QKGNAS C RV PDVR MQ SL GV +KKRKKQK+AEEI + D Sbjct: 49 RIKEHLAGQKGNASTCLRVQPDVRLLMQDSLNGVVMKKRKKQKLAEEITTYNAIDTSDIA 108 Query: 177 
DAFGTQPDVNTGLQLLVTSTPNVLEPDTGMFMRRDXXXXXXXXXXXXXXXIENVSPSSII 356 F +NT + LL S +E + +F+ RD N S I Sbjct: 109 AEFTDTCGLNTQVDLLPMS--QAIEHTSSLFLNRDQGP-------------NNRKKKSRI 153 Query: 357 PDG----NMIPISELTSRSIKEKDQVHMAIARFLYDVGASLDAVNSPYFQPMIDAIALRG 524 G N +PI ++S + +QVHMA+ARFL D LDAVNS YFQPMID IA +G Sbjct: 154 RKGASSSNNLPI---INQSKRVNNQVHMAVARFLLDARVPLDAVNSVYFQPMIDVIASQG 210 Query: 525 PGVEAISYHDLRGWILKNSIEEVNGFFNRCKATWGRAGCSILVDEWITEMGRVLLNFFVH 704 P V A SYHDLR W+LK+S++EV ++C +TW R GCS+L+DE IT G++LLNF V+ Sbjct: 211 PPVSAPSYHDLRSWVLKSSVQEVRTDIDQCSSTWARTGCSVLIDELITGKGKILLNFLVY 270 Query: 705 CSEGIVFLKSIDVSDILRSSDALYELLKXXXXXXXXSNVLQVITDTAEHYVIAGKRLTAT 884 C +G +FL+S+D S ++ S+D LYELLK NVLQV+T E YVIAGKRLT Sbjct: 271 CPQGTMFLRSVDASTLINSTDYLYELLKEVVDEIGVRNVLQVVTSNEERYVIAGKRLTDA 330 Query: 885 FRTIYWTPCAARSINSMLDDIGKVEWISVTLDSAKSITRFIYNHGVVLNMMRRYTGGRDL 1064 + T++WTPCAA SI+ ML+D K+EWI ++ AKSI+RFIYN+ ++L+MMR++T G DL Sbjct: 331 YPTLFWTPCAAHSIDLMLEDFNKLEWIDTIMEQAKSISRFIYNNNILLSMMRKFTLGVDL 390 Query: 1065 VQPTISRFATDFVTLKSMVNLK 1130 V ++R ATDF+TLK M N+K Sbjct: 391 VDLGVTRSATDFLTLKRMQNIK 412 >ref|XP_002316272.2| hypothetical protein POPTR_0010s20835g [Populus trichocarpa] gi|550330253|gb|EEF02443.2| hypothetical protein POPTR_0010s20835g [Populus trichocarpa] Length = 608 Score = 335 bits (858), Expect = 3e-89 Identities = 185/382 (48%), Positives = 239/382 (62%), Gaps = 6/382 (1%) Frame = +3 Query: 3 RIKEHLACQKGNASCCQRVPPDVRDTMQQSLEGVSVKKRKKQKIAEEIKSITPGKDEMDA 182 RIKEHLA QKGNA+ C +VP DVR MQQSL+GV VKKRKKQKIAEEI ++ P E+ Sbjct: 49 RIKEHLAGQKGNAATCVQVPSDVRLMMQQSLDGVVVKKRKKQKIAEEITNLNPVSSEIGV 108 Query: 183 FGTQPDVNTGLQLLVTSTPNVLEPDTGMF------MRRDXXXXXXXXXXXXXXXIENVSP 344 F DVNTG++L T + ++P + + M + + N Sbjct: 109 F--DKDVNTGMEL--TGVTDAIDPVSSLLVTGEDGMGKKGGERRKRGRGRGRGSVTNAK- 163 Query: 345 SSIIPDGNMIPISELTSRSIKEKDQVHMAIARFLYDVGASLDAVNSPYFQPMIDAIALRG 524 +++ G+ +P+S ++ D +HMAI RFLYD+GASLDAVNS YFQ M+ AIA G Sbjct: 164 -AVVTMGSGMPLSG----GKRKNDHIHMAIGRFLYDIGASLDAVNSAYFQLMVQAIASGG 218 Query: 
525 PGVEAISYHDLRGWILKNSIEEVNGFFNRCKATWGRAGCSILVDEWITEMGRVLLNFFVH 704 V SYHDLRGW+LKNS+EEV ++ ATW R GCS+LVD+W T MGR L+NF V+ Sbjct: 219 SEVVVPSYHDLRGWVLKNSVEEVKNDVDKHIATWERTGCSVLVDQWNTVMGRTLINFLVY 278 Query: 705 CSEGIVFLKSIDVSDILRSSDALYELLKXXXXXXXXSNVLQVITDTAEHYVIAGKRLTAT 884 C EG+VFLKS+D SDI+ DALYELLK +VLQVIT E + AG+RL T Sbjct: 279 CPEGVVFLKSVDASDIINLPDALYELLKQVVEEIGARHVLQVITRMEEQLICAGRRLADT 338 Query: 885 FRTIYWTPCAARSINSMLDDIGKVEWISVTLDSAKSITRFIYNHGVVLNMMRRYTGGRDL 1064 F +YW PCAA ++ +L+D K+EWI+ ++ A+SITRF+YNH Sbjct: 339 FPNLYWAPCAAHCLDLILEDFAKLEWINSVIEQARSITRFVYNH---------------- 382 Query: 1065 VQPTISRFATDFVTLKSMVNLK 1130 +P ISRFAT+F TLK MV+LK Sbjct: 383 -KPGISRFATNFGTLKRMVDLK 403 >ref|XP_007049027.1| HAT transposon superfamily, putative [Theobroma cacao] gi|508701288|gb|EOX93184.1| HAT transposon superfamily, putative [Theobroma cacao] Length = 750 Score = 325 bits (833), Expect = 2e-86 Identities = 172/379 (45%), Positives = 234/379 (61%), Gaps = 3/379 (0%) Frame = +3 Query: 3 RIKEHLACQKGNASCCQRVPPDVRDTMQQSLEGVSVKKRKKQKIAEEIKSI---TPGKDE 173 R KEHLA +KG C++VPP VR MQ+SL GV +K+ KQ E+ + +P E Sbjct: 49 RFKEHLAGRKGQGPICEQVPPGVRALMQESLNGVLLKQDNKQNAIPELLACGGSSPHAGE 108 Query: 174 MDAFGTQPDVNTGLQLLVTSTPNVLEPDTGMFMRRDXXXXXXXXXXXXXXXIENVSPSSI 353 +D DVN G++ + N LEPD+ + + + S+ Sbjct: 109 IDKSAYSDDVNNGVKPI--QVLNSLEPDSSLVLNGKGEVSQGIRDSK-----KRGRDRSL 161 Query: 354 IPDGNMIPISELTSRSIKEKDQVHMAIARFLYDVGASLDAVNSPYFQPMIDAIALRGPGV 533 + + + S+L SI ++ VHMAI RFLYD+G +LDAVNS YFQPMIDAIA G G+ Sbjct: 162 LANSHSCAKSDLALVSIGAENPVHMAIGRFLYDIGVNLDAVNSVYFQPMIDAIASTGSGI 221 Query: 534 EAISYHDLRGWILKNSIEEVNGFFNRCKATWGRAGCSILVDEWITEMGRVLLNFFVHCSE 713 S DLRGWILKN +EEV +R K WG+ GCSILV++W + GR LL+F V+C + Sbjct: 222 VPPSSQDLRGWILKNVMEEVKDDIDRNKTMWGKTGCSILVEQWSPKSGRTLLSFLVYCPQ 281 Query: 714 GIVFLKSIDVSDILRSSDALYELLKXXXXXXXXSNVLQVITDTAEHYVIAGKRLTATFRT 893 VFLKS+D S ++ S+D L ELLK NV+QVIT+ E Y +AGKRL +F + Sbjct: 282 ATVFLKSVDASRVIFSADHLNELLKQVVEEVGVENVVQVITNCEEQYFLAGKRLMESFPS 341 Query: 894 
IYWTPCAARSINSMLDDIGKVEWISVTLDSAKSITRFIYNHGVVLNMMRRYTGGRDLVQP 1073 +YW PC ++ ML+D +EWIS T++ AKS+TRF+YNH VVLNMMRR+T D+V+P Sbjct: 342 LYWAPCLVHCVDMMLEDFANLEWISETIEQAKSVTRFVYNHSVVLNMMRRFTFHNDIVEP 401 Query: 1074 TISRFATDFVTLKSMVNLK 1130 ++RFA++F TLK M +LK Sbjct: 402 AVTRFASNFATLKRMADLK 420 >ref|XP_006651967.1| PREDICTED: uncharacterized protein LOC102714280 [Oryza brachyantha] Length = 787 Score = 293 bits (751), Expect = 7e-77 Identities = 169/400 (42%), Positives = 236/400 (59%), Gaps = 24/400 (6%) Frame = +3 Query: 3 RIKEHLACQKGNASCCQRVPPDVRDTMQQSLEGVSVKKRKKQKIAEEIKSIT-------- 158 R KEHLA + GNASCC +VPP+V++TM SL+ V+ KK++KQ +AE I+ +T Sbjct: 58 RFKEHLARRPGNASCCPKVPPEVQETMHHSLDVVAAKKKRKQSLAEGIRRMTHSAPPAAA 117 Query: 159 -----PGKDEMDAFGTQPDVNTGLQL----LVTSTPNVLEPDTGMFMRRDXXXXXXXXXX 311 G EM++ +N L L L + P E +R Sbjct: 118 PPVDATGAAEMESPIRMIPLNEVLDLGSVPLEETPPEAREMKGSTSKKRKKLAARHASAA 177 Query: 312 XXXXXIENVSPSSIIPDGNMIPISELTS-------RSIKEKDQVHMAIARFLYDVGASLD 470 +N +P + P M+ + + +S K+QV+MAI RFLYD G SL+ Sbjct: 178 PPAH--QNPAPQTQ-PFHQMVMAFDAAASQLRHFDQSASNKEQVYMAIGRFLYDAGVSLE 234 Query: 471 AVNSPYFQPMIDAIALRGPGVEAISYHDLRGWILKNSIEEVNGFFNRCKATWGRAGCSIL 650 AVNS YFQPM++A+A G EA SYHD RG ILK S++EV K +W R GC++L Sbjct: 235 AVNSVYFQPMLEAVASAGGRPEAFSYHDFRGSILKKSLDEVTAQVEFYKGSWTRTGCTLL 294 Query: 651 VDEWITEMGRVLLNFFVHCSEGIVFLKSIDVSDILRSSDALYELLKXXXXXXXXSNVLQV 830 DEW T+ GR L+NF V+C EG +FLKS+D +D++ SSD LYELLK NV+QV Sbjct: 295 ADEWTTDRGRTLINFSVYCPEGTMFLKSVDATDMVVSSDPLYELLKNVVEEVGEKNVVQV 354 Query: 831 ITDTAEHYVIAGKRLTATFRTIYWTPCAARSINSMLDDIGKVEWISVTLDSAKSITRFIY 1010 IT+ +E + +AGKRL TF T++W+PC+ + I+ ML+D KV I+ + +AK+IT FIY Sbjct: 355 ITNNSEIHAVAGKRLGETFPTLFWSPCSFQCIDGMLEDFSKVGAINEIICNAKAITGFIY 414 Query: 1011 NHGVVLNMMRRYTGGRDLVQPTISRFATDFVTLKSMVNLK 1130 N LN+M+R+ G+DL+ +R A +FVTLK+M NLK Sbjct: 415 NSAFALNLMKRHLHGKDLLVRAETRAAMNFVTLKNMYNLK 454 >ref|NP_001154234.1| hAT transposon superfamily [Arabidopsis thaliana] gi|240255844|ref|NP_193238.5| hAT transposon superfamily [Arabidopsis 
thaliana] gi|332658140|gb|AEE83540.1| hAT transposon superfamily [Arabidopsis thaliana] gi|332658141|gb|AEE83541.1| hAT transposon superfamily [Arabidopsis thaliana] Length = 768 Score = 292 bits (747), Expect = 2e-76 Identities = 161/385 (41%), Positives = 230/385 (59%), Gaps = 9/385 (2%) Frame = +3 Query: 3 RIKEHLACQKGNASCCQRVPPDVRDTMQQSLEGVSVKKRKKQKIAEE---IKSITPGKDE 173 R+KEHLA +KG + C +VP DVR +QQ ++G ++RK+ K + E + S+ P E Sbjct: 49 RVKEHLAGKKGQGTICDQVPEDVRLFLQQCIDGTVRRQRKRHKSSSEPLSVASLPP--IE 106 Query: 174 MDAFGTQPDVNTGLQL-----LVTSTPNVLEPDTGMFMRRDXXXXXXXXXXXXXXXIENV 338 D QPDVN G + +V ++L T R + Sbjct: 107 GDMMVVQPDVNDGFKSPGSSDVVVQNESLLSGRTKQRTYRSKKNAFENGSASNNVDLIGR 166 Query: 339 SPSSIIPDGNMIPISELTSRSIKEKDQ-VHMAIARFLYDVGASLDAVNSPYFQPMIDAIA 515 ++IP + + + S ++++ +HMAI RFL+ +GA DAVNS FQPMIDAIA Sbjct: 167 DMDNLIPVA-ISSVKNIVHPSFRDRENTIHMAIGRFLFGIGADFDAVNSVNFQPMIDAIA 225 Query: 516 LRGPGVEAISYHDLRGWILKNSIEEVNGFFNRCKATWGRAGCSILVDEWITEMGRVLLNF 695 G GV A ++ DLRGWILKN +EE+ + CKA W R GCSILV+E ++ G +LNF Sbjct: 226 SGGFGVSAPTHDDLRGWILKNCVEEMAKEIDECKAMWKRTGCSILVEELNSDKGFKVLNF 285 Query: 696 FVHCSEGIVFLKSIDVSDILRSSDALYELLKXXXXXXXXSNVLQVITDTAEHYVIAGKRL 875 V+C E +VFLKS+D S++L S+D L+ELL +NV+QVIT ++YV AGKRL Sbjct: 286 LVYCPEKVVFLKSVDASEVLSSADKLFELLSELVEEVGSTNVVQVITKCDDYYVDAGKRL 345 Query: 876 TATFRTIYWTPCAARSINSMLDDIGKVEWISVTLDSAKSITRFIYNHGVVLNMMRRYTGG 1055 + ++YW PCAA I+ ML++ GK+ WIS T++ A++ITRF+YNH VLN+M ++T G Sbjct: 346 MLVYPSLYWVPCAAHCIDQMLEEFGKLGWISETIEQAQAITRFVYNHSGVLNLMWKFTSG 405 Query: 1056 RDLVQPTISRFATDFVTLKSMVNLK 1130 D++ P S AT+F TL + LK Sbjct: 406 NDILLPAFSSSATNFATLGRIAELK 430 >gb|AAM98154.1| putative protein [Arabidopsis thaliana] Length = 768 Score = 292 bits (747), Expect = 2e-76 Identities = 161/385 (41%), Positives = 230/385 (59%), Gaps = 9/385 (2%) Frame = +3 Query: 3 RIKEHLACQKGNASCCQRVPPDVRDTMQQSLEGVSVKKRKKQKIAEE---IKSITPGKDE 173 R+KEHLA +KG + C +VP DVR +QQ ++G ++RK+ K + E + S+ P E Sbjct: 49 RVKEHLAGKKGQGTICDQVPEDVRLFLQQCIDGTVRRQRKRHKSSSEPLSVASLPP--IE 106 
Query: 174 MDAFGTQPDVNTGLQL-----LVTSTPNVLEPDTGMFMRRDXXXXXXXXXXXXXXXIENV 338 D QPDVN G + +V ++L T R + Sbjct: 107 GDMMVVQPDVNDGFKSPGSSDVVVQNESLLSGRTKQRTYRSKKNAFENGSASNNVDLIGR 166 Query: 339 SPSSIIPDGNMIPISELTSRSIKEKDQ-VHMAIARFLYDVGASLDAVNSPYFQPMIDAIA 515 ++IP + + + S ++++ +HMAI RFL+ +GA DAVNS FQPMIDAIA Sbjct: 167 DMDNLIPVA-ISSVKNIVHPSFRDRENTIHMAIGRFLFGIGADFDAVNSVNFQPMIDAIA 225 Query: 516 LRGPGVEAISYHDLRGWILKNSIEEVNGFFNRCKATWGRAGCSILVDEWITEMGRVLLNF 695 G GV A ++ DLRGWILKN +EE+ + CKA W R GCSILV+E ++ G +LNF Sbjct: 226 SGGFGVSAPTHDDLRGWILKNCVEEMAKEIDECKAMWKRTGCSILVEELNSDKGFKVLNF 285 Query: 696 FVHCSEGIVFLKSIDVSDILRSSDALYELLKXXXXXXXXSNVLQVITDTAEHYVIAGKRL 875 V+C E +VFLKS+D S++L S+D L+ELL +NV+QVIT ++YV AGKRL Sbjct: 286 LVYCPEKVVFLKSVDASEVLSSADKLFELLSELVEEVGSTNVVQVITKCDDYYVDAGKRL 345 Query: 876 TATFRTIYWTPCAARSINSMLDDIGKVEWISVTLDSAKSITRFIYNHGVVLNMMRRYTGG 1055 + ++YW PCAA I+ ML++ GK+ WIS T++ A++ITRF+YNH VLN+M ++T G Sbjct: 346 MLVYPSLYWVPCAAHCIDQMLEEFGKLGWISETIEQAQAITRFVYNHSGVLNLMWKFTSG 405 Query: 1056 RDLVQPTISRFATDFVTLKSMVNLK 1130 D++ P S AT+F TL + LK Sbjct: 406 NDILLPAFSSSATNFATLGRIAELK 430 >gb|EAY92386.1| hypothetical protein OsI_14116 [Oryza sativa Indica Group] Length = 796 Score = 289 bits (739), Expect = 2e-75 Identities = 167/408 (40%), Positives = 228/408 (55%), Gaps = 32/408 (7%) Frame = +3 Query: 3 RIKEHLACQKGNASCCQRVPPDVRDTMQQSLEGVSVKKRKKQKIAEEIKSITPGKDEMDA 182 R KEHLA + GNA CC +VP +V++TM SL+ V+ KK++KQ +AE I+ IT A Sbjct: 58 RFKEHLARRPGNACCCPKVPREVQETMLHSLDAVAAKKKRKQSLAEGIRRITHSAPAAAA 117 Query: 183 FGTQP---------------DVNTGLQL----LVTSTPNVLEPDTGMFMRRDXXXXXXXX 305 + P +N L L L + P E + +R Sbjct: 118 SASPPAPADAAEMESPIHMIPLNEVLDLGSVPLEETPPETREMKGSISKKRKKLAARQAS 177 Query: 306 XXXXXXXIENVSPSSIIPDGNMIPISELT-------------SRSIKEKDQVHMAIARFL 446 +N P P G P ++ + K+QV+MAI RFL Sbjct: 178 TAPLAH--QNQQPLQSTPAGLTQPFHQMVVAFDSAASQLRHFDQPGSNKEQVYMAIGRFL 235 Query: 447 YDVGASLDAVNSPYFQPMIDAIALRGPGVEAISYHDLRGWILKNSIEEVNGFFNRCKATW 626 YD G SL+AVNS YFQPM++A+A G EA SYHD RG ILK S++EV K +W 
Sbjct: 236 YDAGVSLEAVNSVYFQPMLEAVASAGGKPEAFSYHDFRGSILKKSLDEVTAQLEFYKGSW 295 Query: 627 GRAGCSILVDEWITEMGRVLLNFFVHCSEGIVFLKSIDVSDILRSSDALYELLKXXXXXX 806 R GC++L DEW T+ GR L+NF V+C EG +FLKS+D +DI+ SSD LYELLK Sbjct: 296 TRTGCTLLADEWTTDRGRTLINFSVYCPEGTMFLKSVDATDIVVSSDPLYELLKNVVEEV 355 Query: 807 XXSNVLQVITDTAEHYVIAGKRLTATFRTIYWTPCAARSINSMLDDIGKVEWISVTLDSA 986 NV+QVIT+ +E + +AGKRL TF T++W+ C+ + I+ ML+D KV I+ + +A Sbjct: 356 GEKNVVQVITNNSEIHAVAGKRLCETFPTLFWSQCSFQCIDGMLEDFSKVGAINEIICNA 415 Query: 987 KSITRFIYNHGVVLNMMRRYTGGRDLVQPTISRFATDFVTLKSMVNLK 1130 K IT FIYN N+M+R+ G+DL+ P +R A +FVTLK+M NLK Sbjct: 416 KVITGFIYNSAFAFNLMKRHLHGKDLLVPAETRAAMNFVTLKNMYNLK 463 >ref|NP_001051738.1| Os03g0822900 [Oryza sativa Japonica Group] gi|108711817|gb|ABF99612.1| hAT family dimerisation domain containing protein, expressed [Oryza sativa Japonica Group] gi|113550209|dbj|BAF13652.1| Os03g0822900 [Oryza sativa Japonica Group] gi|215704668|dbj|BAG94296.1| unnamed protein product [Oryza sativa Japonica Group] gi|222626069|gb|EEE60201.1| hypothetical protein OsJ_13162 [Oryza sativa Japonica Group] Length = 796 Score = 289 bits (739), Expect = 2e-75 Identities = 167/408 (40%), Positives = 228/408 (55%), Gaps = 32/408 (7%) Frame = +3 Query: 3 RIKEHLACQKGNASCCQRVPPDVRDTMQQSLEGVSVKKRKKQKIAEEIKSITPGKDEMDA 182 R KEHLA + GNA CC +VP +V++TM SL+ V+ KK++KQ +AE I+ IT A Sbjct: 58 RFKEHLANRPGNACCCPKVPREVQETMLHSLDAVAAKKKRKQSLAEGIRRITHSAPAAAA 117 Query: 183 FGTQP---------------DVNTGLQL----LVTSTPNVLEPDTGMFMRRDXXXXXXXX 305 + P +N L L L + P E + +R Sbjct: 118 SASPPAPADAAEMESPIHMIPLNEVLDLGSVPLEETPPETREMKGSISKKRKKLAARQAS 177 Query: 306 XXXXXXXIENVSPSSIIPDGNMIPISELT-------------SRSIKEKDQVHMAIARFL 446 +N P P G P ++ + K+QV+MAI RFL Sbjct: 178 TAPLAH--QNQQPLQSTPAGLTQPFHQMVVAFDSAASQLMHFDQPGSNKEQVYMAIGRFL 235 Query: 447 YDVGASLDAVNSPYFQPMIDAIALRGPGVEAISYHDLRGWILKNSIEEVNGFFNRCKATW 626 YD G SL+AVNS YFQPM++A+A G EA SYHD RG ILK S++EV K +W Sbjct: 236 YDAGVSLEAVNSVYFQPMLEAVASAGGKPEAFSYHDFRGSILKKSLDEVTAQLEFYKGSW 295 Query: 627 
GRAGCSILVDEWITEMGRVLLNFFVHCSEGIVFLKSIDVSDILRSSDALYELLKXXXXXX 806 R GC++L DEW T+ GR L+NF V+C EG +FLKS+D +DI+ SSD LYELLK Sbjct: 296 TRTGCTLLADEWTTDRGRTLINFSVYCPEGTMFLKSVDATDIVVSSDPLYELLKNVVEEV 355 Query: 807 XXSNVLQVITDTAEHYVIAGKRLTATFRTIYWTPCAARSINSMLDDIGKVEWISVTLDSA 986 NV+QVIT+ +E + +AGKRL TF T++W+ C+ + I+ ML+D KV I+ + +A Sbjct: 356 GEKNVVQVITNNSEIHAVAGKRLCETFPTLFWSQCSFQCIDGMLEDFSKVGAINEIICNA 415 Query: 987 KSITRFIYNHGVVLNMMRRYTGGRDLVQPTISRFATDFVTLKSMVNLK 1130 K IT FIYN N+M+R+ G+DL+ P +R A +FVTLK+M NLK Sbjct: 416 KVITGFIYNSAFAFNLMKRHLHGKDLLVPAETRAAMNFVTLKNMYNLK 463 >ref|XP_006380932.1| hypothetical protein POPTR_0006s02210g [Populus trichocarpa] gi|550335284|gb|ERP58729.1| hypothetical protein POPTR_0006s02210g [Populus trichocarpa] Length = 847 Score = 285 bits (730), Expect = 2e-74 Identities = 157/381 (41%), Positives = 224/381 (58%), Gaps = 5/381 (1%) Frame = +3 Query: 3 RIKEHLACQK-GNASCCQRVPPDVRDTMQQSLEGVSVKKRKKQKIAEE----IKSITPGK 167 R KEHLA + G C RVP DVRD M+Q L + V++RKK+K E + S G+ Sbjct: 156 RFKEHLAGRNSGGVPSCTRVPSDVRDLMEQHLSPIVVRQRKKRKSKREKLDDVDSPPGGE 215 Query: 168 DEMDAFGTQPDVNTGLQLLVTSTPNVLEPDTGMFMRRDXXXXXXXXXXXXXXXIENVSPS 347 D D+ T L+ + N++E ++ + + + + Sbjct: 216 DVYIFADYSDDMITPLRAVAAC--NLVEVNSDFLLDGEGTSNGNLGTRKSAIAVAASDDA 273 Query: 348 SIIPDGNMIPISELTSRSIKEKDQVHMAIARFLYDVGASLDAVNSPYFQPMIDAIALRGP 527 + + S + +H RFLYD+GASLDA++S + QP+ID +A P Sbjct: 274 DAL----------IAMGSETADNPIHAIWGRFLYDIGASLDAMDSNFSQPLIDTVAYGRP 323 Query: 528 GVEAISYHDLRGWILKNSIEEVNGFFNRCKATWGRAGCSILVDEWITEMGRVLLNFFVHC 707 G+ A S+ DLRG ILK+ +EEV N+ K W + GCS+LV+E +E G LNF V+C Sbjct: 324 GIAAPSHQDLRGRILKSLVEEVKSDINQYKTRWVKTGCSLLVEECNSESGVTTLNFLVYC 383 Query: 708 SEGIVFLKSIDVSDILRSSDALYELLKXXXXXXXXSNVLQVITDTAEHYVIAGKRLTATF 887 S+G VFLKS+D S+++ S+D LYELLK N+LQVIT+ EHY+ AGK+L TF Sbjct: 384 SKGTVFLKSVDASNLIHSTDGLYELLKLMVEEVGAGNILQVITNGEEHYIAAGKKLMDTF 443 Query: 888 RTIYWTPCAARSINSMLDDIGKVEWISVTLDSAKSITRFIYNHGVVLNMMRRYTGGRDLV 1067 ++YW PCAAR I+ +L+DIGK++WI+ L+ AKS+TRF+YN+ VLN+MR++T G D+V 
Sbjct: 444 PSLYWAPCAARCIDLILEDIGKLDWINTVLEQAKSVTRFVYNNSAVLNLMRKFTSGSDIV 503 Query: 1068 QPTISRFATDFVTLKSMVNLK 1130 Q I+R AT+F LK M N K Sbjct: 504 QQGITRSATNFTALKRMANFK 524