BLASTX nr result
ID: Akebia24_contig00023331
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00023331 (1526 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_002280689.1| PREDICTED: uncharacterized protein LOC100260...  315  4e-83
emb|CAN72489.1| hypothetical protein VITISV_028959 [Vitis vinifera]  310  1e-81
emb|CAN64751.1| hypothetical protein VITISV_011968 [Vitis vinifera]  299  2e-78
ref|XP_002273017.2| PREDICTED: uncharacterized protein LOC100260...  298  6e-78
ref|XP_007148047.1| hypothetical protein PHAVU_006G176000g [Phas...  285  4e-74
ref|XP_006362410.1| PREDICTED: uncharacterized protein LOC102590...  284  7e-74
ref|XP_007025374.1| Sequence-specific DNA binding transcription ...  275  4e-71
ref|XP_003542983.1| PREDICTED: ESF1 homolog [Glycine max]            274  9e-71
ref|XP_004233061.1| PREDICTED: uncharacterized protein LOC101252...  273  1e-70
gb|EYU43336.1| hypothetical protein MIMGU_mgv1a010263mg [Mimulus...  273  2e-70
ref|XP_007025373.1| Sequence-specific DNA binding transcription ...  272  3e-70
gb|EYU28085.1| hypothetical protein MIMGU_mgv1a010659mg [Mimulus...  270  2e-69
ref|XP_004485895.1| PREDICTED: ensconsin-like [Cicer arietinum]      268  4e-69
ref|XP_003543305.1| PREDICTED: uncharacterized protein LOC100811...  267  8e-69
ref|XP_004233060.1| PREDICTED: uncharacterized protein LOC101252...  267  1e-68
ref|XP_004505494.1| PREDICTED: uncharacterized protein LOC101509...  266  2e-68
ref|XP_004293945.1| PREDICTED: uncharacterized protein LOC101295...  266  2e-68
ref|XP_003539553.1| PREDICTED: uncharacterized protein LOC100784...  266  2e-68
ref|XP_002303549.1| hypothetical protein POPTR_0003s11850g [Popu...
265 3e-68 ref|XP_004505493.1| PREDICTED: uncharacterized protein LOC101509... 265 5e-68 >ref|XP_002280689.1| PREDICTED: uncharacterized protein LOC100260870 [Vitis vinifera] Length = 271 Score = 315 bits (806), Expect = 4e-83 Identities = 169/281 (60%), Positives = 197/281 (70%), Gaps = 1/281 (0%) Frame = +1 Query: 271 DGTETPT-TTVPQRPSHTPLPIREDCWSEDATYTLVEAWGERHLELNRGNLRQKHWQEVA 447 DGT TP+ T P RP PL REDCW+EDAT+TL+EAWG+R+LELNRGNLRQKHWQEVA Sbjct: 3 DGTGTPSPATAPPRPLQ-PLACREDCWTEDATHTLIEAWGDRYLELNRGNLRQKHWQEVA 61 Query: 448 DAVNARHGAVKKARRTDVQCKNRIDTLKKKYKVEKARVLGSGGTLTSNWPFYSRLDALIG 627 DAVNA HG +KKARRTDVQCKNRIDTLKKKYK+EK+RV S G LTS WPFY RLDALIG Sbjct: 62 DAVNALHGHLKKARRTDVQCKNRIDTLKKKYKIEKSRVSDSNGALTSQWPFYERLDALIG 121 Query: 628 TNEPSKKVAQSPLLALPLPYRKNPPLLPHASLVPVAPRSAREKRSAPIAAVDDSFFXXXX 807 +N P+KK + P YRK PP+LP VPV PRS KR AP+ A D+SF Sbjct: 122 SNMPAKKPS-------PPVYRKTPPMLPP---VPVGPRSVMHKRPAPVTA-DESFRRNFS 170 Query: 808 XXXXXXXXXXXXXXXXXXXXXXXKDGEVDGVGELAQAIVRFGEIYERVEGVKQRQMVELE 987 DG +GV ELAQAIVRFGEIYE+VE KQ+QM++LE Sbjct: 171 VVAAAAAAVEEVEEAESARSESDGDGGREGVKELAQAIVRFGEIYEKVEESKQKQMIDLE 230 Query: 988 KQRLEFMKSLEFQRMQLFMDWQVQLEKIKRAKRAGSDDIYN 1110 +R++F + LE QRM+LFMD QVQLEKIK AKR+ +D Y+ Sbjct: 231 VKRMQFARDLEIQRMKLFMDTQVQLEKIKHAKRSSGNDSYS 271 >emb|CAN72489.1| hypothetical protein VITISV_028959 [Vitis vinifera] Length = 484 Score = 310 bits (794), Expect = 1e-81 Identities = 167/277 (60%), Positives = 194/277 (70%), Gaps = 1/277 (0%) Frame = +1 Query: 271 DGTETPT-TTVPQRPSHTPLPIREDCWSEDATYTLVEAWGERHLELNRGNLRQKHWQEVA 447 DGT TP+ T P RP PL REDCW+EDAT+TL+EAWG+R+LELNRGNLRQKHWQEVA Sbjct: 3 DGTGTPSPATAPPRPLQ-PLACREDCWTEDATHTLIEAWGDRYLELNRGNLRQKHWQEVA 61 Query: 448 DAVNARHGAVKKARRTDVQCKNRIDTLKKKYKVEKARVLGSGGTLTSNWPFYSRLDALIG 627 DAVNA HG +KKARRTDVQCKNRIDTLKKKYK+EK+RV S G LTS WPFY RLDALIG Sbjct: 62 DAVNALHGHLKKARRTDVQCKNRIDTLKKKYKIEKSRVSDSNGALTSQWPFYERLDALIG 121 Query: 628 TNEPSKKVAQSPLLALPLPYRKNPPLLPHASLVPVAPRSAREKRSAPIAAVDDSFFXXXX 807 +N P+KK + P YRK PP+LP VPV PRS KR AP+ 
A D+SF Sbjct: 122 SNMPAKKPS-------PPVYRKTPPMLPP---VPVGPRSVMHKRPAPVTA-DESFRRNFS 170 Query: 808 XXXXXXXXXXXXXXXXXXXXXXXKDGEVDGVGELAQAIVRFGEIYERVEGVKQRQMVELE 987 DG +GV ELAQAIVRFGEIYE+VE KQ+QM++LE Sbjct: 171 VVAAAAAAVEEVEEAESARSESDGDGGREGVKELAQAIVRFGEIYEKVEESKQKQMIDLE 230 Query: 988 KQRLEFMKSLEFQRMQLFMDWQVQLEKIKRAKRAGSD 1098 +R++F + LE QRM+LFMD QVQLEKIK AKR+ + Sbjct: 231 VKRMQFARDLEIQRMKLFMDTQVQLEKIKHAKRSSGN 267 >emb|CAN64751.1| hypothetical protein VITISV_011968 [Vitis vinifera] Length = 304 Score = 299 bits (766), Expect = 2e-78 Identities = 166/283 (58%), Positives = 197/283 (69%), Gaps = 11/283 (3%) Frame = +1 Query: 286 PTTTVPQRPSHTPLPIREDCWSEDATYTLVEAWGERHLELNRGNLRQKHWQEVADAVNAR 465 P +T PQR + P REDCWSEDAT TLV+AWG R++ELNRGNLRQK WQEVADAVNAR Sbjct: 9 PPSTAPQRSA----PFREDCWSEDATSTLVDAWGRRYIELNRGNLRQKDWQEVADAVNAR 64 Query: 466 HGAVKKARRTDVQCKNRIDTLKKKYKVEKARVLGSGGTLTSNWPFYSRLDALIGTNEPSK 645 HG VKKARRTDVQCKNRIDT+KKKYK+EKARV S G LTS+WPF+SRLDALIG +K Sbjct: 65 HGHVKKARRTDVQCKNRIDTIKKKYKIEKARVTTSNGALTSSWPFFSRLDALIGPTLSAK 124 Query: 646 KVAQ-SPLLALPLPYRKNP-PLLPHASLVPVAPRSAREKRSAPIAAVDDSFFXXXXXXXX 819 K + SP LALPLPY K P P P AS+ + +KR P+ AVDDS+F Sbjct: 125 KPSSASPPLALPLPYWKTPSPAAPSASV-----GALPQKR--PMPAVDDSYFRRNYSAVA 177 Query: 820 XXXXXXXXXXXXXXXXXXXKD---------GEVDGVGELAQAIVRFGEIYERVEGVKQRQ 972 ++ G+VDG+ +LA+AI RFGEIYE+VE KQ+Q Sbjct: 178 AAAAAEAVDEDEDEDEEEEEEESRWSAERSGDVDGMRQLARAIERFGEIYEKVEAEKQKQ 237 Query: 973 MVELEKQRLEFMKSLEFQRMQLFMDWQVQLEKIKRAKRAGSDD 1101 M ELEKQR++F K +EFQRM++FMD QVQLEKIKRAKR+G +D Sbjct: 238 MFELEKQRMQFAKDVEFQRMKMFMDTQVQLEKIKRAKRSGLND 280 >ref|XP_002273017.2| PREDICTED: uncharacterized protein LOC100260025 [Vitis vinifera] Length = 279 Score = 298 bits (762), Expect = 6e-78 Identities = 165/278 (59%), Positives = 194/278 (69%), Gaps = 9/278 (3%) Frame = +1 Query: 286 PTTTVPQRPSHTPLPIREDCWSEDATYTLVEAWGERHLELNRGNLRQKHWQEVADAVNAR 465 P +T PQR + P REDCWSEDAT TLV+AWG R++ELNRGNLRQK WQEVADAVNAR Sbjct: 9 
PPSTAPQRSA----PFREDCWSEDATSTLVDAWGRRYIELNRGNLRQKDWQEVADAVNAR 64 Query: 466 HGAVKKARRTDVQCKNRIDTLKKKYKVEKARVLGSGGTLTSNWPFYSRLDALIGTNEPSK 645 HG VKKARRTDVQCKNRIDT+KKKYK+EKARV S G LTS+WPF+SRLDALIG +K Sbjct: 65 HGHVKKARRTDVQCKNRIDTIKKKYKIEKARVTTSNGALTSSWPFFSRLDALIGPTLSAK 124 Query: 646 KVAQ-SPLLALPLPYRKNP-PLLPHASLVPVAPRSAREKRSAPIAAVDDSFF-------X 798 K + SP LALPLPY K P P P AS+ + +KR P+ AVDDS+F Sbjct: 125 KPSSASPPLALPLPYWKTPSPAAPSASV-----GALPQKR--PMPAVDDSYFRRNYSAVA 177 Query: 799 XXXXXXXXXXXXXXXXXXXXXXXXXXKDGEVDGVGELAQAIVRFGEIYERVEGVKQRQMV 978 + G+VDG+ +LA+AI RFGEIYE+VE KQ+QM Sbjct: 178 AAAAAEAVDEDEDEEEEEEESRWSAERSGDVDGMRQLARAIERFGEIYEKVEAEKQKQMF 237 Query: 979 ELEKQRLEFMKSLEFQRMQLFMDWQVQLEKIKRAKRAG 1092 ELEKQR++F K +EFQRM++FMD QVQLEKIKRAKR+G Sbjct: 238 ELEKQRMQFAKDVEFQRMKMFMDTQVQLEKIKRAKRSG 275 >ref|XP_007148047.1| hypothetical protein PHAVU_006G176000g [Phaseolus vulgaris] gi|561021270|gb|ESW20041.1| hypothetical protein PHAVU_006G176000g [Phaseolus vulgaris] Length = 301 Score = 285 bits (729), Expect = 4e-74 Identities = 156/293 (53%), Positives = 190/293 (64%), Gaps = 26/293 (8%) Frame = +1 Query: 310 PSHTPLPIREDCWSEDATYTLVEAWGERHLELNRGNLRQKHWQEVADAVNARHGAVKKAR 489 P P+P+REDCWSE+A+ TLV+AWG R+LELNRGNLRQK WQ+VADAVNA HG KK Sbjct: 10 PPSRPVPVREDCWSEEASSTLVDAWGRRYLELNRGNLRQKDWQDVADAVNALHGHTKKTH 69 Query: 490 RTDVQCKNRIDTLKKKYKVEKARVLGSGGTLTSNWPFYSRLDALIGTN-----EPSKKVA 654 RTDVQCKNRIDT+KKKYK+EKARV S GTL+S+WPFY RLDALIG N PS + Sbjct: 70 RTDVQCKNRIDTIKKKYKIEKARVSSSNGTLSSSWPFYERLDALIGPNFNAKKPPSSSPS 129 Query: 655 QSPLLALP-LPYRKNPPLLPHASLVPVAPRSAREKRSAPIAAVDDSFFXXXXXXXXXXXX 831 SP +ALP LP+RKN P ++ P A + +KRSA AA+D+ +F Sbjct: 130 PSPPVALPLLPHRKNLSSSPAIAVTPTAV-ALPQKRSAAAAAMDEGYFRRNYSAVAAAAA 188 Query: 832 XXXXXXXXXXXXXXXKDG--------------------EVDGVGELAQAIVRFGEIYERV 951 D E +G+ LA+AI RFGE+YERV Sbjct: 189 AAEADEEEEEEEEEEADDVEEEEEEDEGRGSEVEEGEKEREGMRRLAKAIERFGEVYERV 248 Query: 952 EGVKQRQMVELEKQRLEFMKSLEFQRMQLFMDWQVQLEKIKRAKRAGSDDIYN 1110 EG K RQMV+LEKQR++F K LE QRMQ+FMD QVQLE+IKR 
KR+GS+D+Y+ Sbjct: 249 EGQKLRQMVDLEKQRMQFAKDLEVQRMQMFMDTQVQLERIKRGKRSGSNDMYS 301 >ref|XP_006362410.1| PREDICTED: uncharacterized protein LOC102590724 isoform X1 [Solanum tuberosum] Length = 283 Score = 284 bits (727), Expect = 7e-74 Identities = 155/275 (56%), Positives = 186/275 (67%), Gaps = 9/275 (3%) Frame = +1 Query: 313 SHTPLPIREDCWSEDATYTLVEAWGERHLELNRGNLRQKHWQEVADAVNARHGAVKKARR 492 S PLP REDCWSE AT+TLV+AWG R++ELNRGNLRQK WQ+V+D+VN HG KK+RR Sbjct: 14 SSRPLPFREDCWSEQATWTLVDAWGRRYMELNRGNLRQKDWQQVSDSVNVLHGHSKKSRR 73 Query: 493 TDVQCKNRIDTLKKKYKVEKARVLGSGGTLTSNWPFYSRLDALIGTNEPSKKVAQSPLLA 672 TDVQCKNRIDTLKKKYKVEKA+++ S GTLTS+WPF+ RLD LIG ++ KKV + Sbjct: 74 TDVQCKNRIDTLKKKYKVEKAKIIESNGTLTSSWPFFERLDVLIGNSD--KKVTPVMVSP 131 Query: 673 LPLPYRKNPPL---LPHASLVPVAPRSAREKRSAPIAAVDDSFF------XXXXXXXXXX 825 LPLP +PP+ LP+ V P S +KR P A D+S F Sbjct: 132 LPLP-MSSPPMGVPLPYRRSAMVTPVSLPQKRQLP--AFDESCFRRNYSAVAAAAAAGGG 188 Query: 826 XXXXXXXXXXXXXXXXXKDGEVDGVGELAQAIVRFGEIYERVEGVKQRQMVELEKQRLEF 1005 + E DG+ LA+AI RFG+IYERVEG+KQRQMVELEKQR++F Sbjct: 189 EEDYDDEEEEEDVEMTEESWEEDGMRRLAKAIERFGDIYERVEGMKQRQMVELEKQRMQF 248 Query: 1006 MKSLEFQRMQLFMDWQVQLEKIKRAKRAGSDDIYN 1110 K LE QRMQLFMD QVQLEKIK KR+GSDD+Y+ Sbjct: 249 AKDLEVQRMQLFMDTQVQLEKIKHTKRSGSDDLYS 283 >ref|XP_007025374.1| Sequence-specific DNA binding transcription factors isoform 2 [Theobroma cacao] gi|508780740|gb|EOY27996.1| Sequence-specific DNA binding transcription factors isoform 2 [Theobroma cacao] Length = 301 Score = 275 bits (703), Expect = 4e-71 Identities = 156/294 (53%), Positives = 191/294 (64%), Gaps = 18/294 (6%) Frame = +1 Query: 283 TPTTTVPQRPSHT---PLPIREDCWSEDATYTLVEAWGERHLELNRGNLRQKHWQEVADA 453 TP+TT + T PLP+REDCWSE+AT TLV+AWG R+LELNRGNLRQK WQ+VADA Sbjct: 9 TPSTTPTPLSTTTHSRPLPVREDCWSEEATSTLVDAWGRRYLELNRGNLRQKDWQDVADA 68 Query: 454 VNARHGAVKKARRTDVQCKNRIDTLKKKYKVEKARVLGSGGTLTSNWPFYSRLDALIGTN 633 VNA HG KK RTDVQCKNRIDT+KKKYK+EKARV S GTLTS+WPF+ RLDALIG+N Sbjct: 69 
VNALHGHTKKTHRTDVQCKNRIDTIKKKYKIEKARVTSSNGTLTSSWPFFERLDALIGSN 128 Query: 634 EPSKKVAQSPLLALPLPYRKNPPLLPHASLV--PVAPRSAREKRSAPIAAV------DDS 789 +KK + SP ++ P P + P +P + V P+ R SA + A+ DD Sbjct: 129 FSAKKPSPSPKIS-PKPSPRLSPRIPGSPPVALPLPMPYRRTPASATVVALPQKRPADDG 187 Query: 790 FFXXXXXXXXXXXXXXXXXXXXXXXXXXXKD------GEVDGVGELAQAIVRFGEIYERV 951 +F + E +G+ LA+AI RFGE+YERV Sbjct: 188 YFRRNYSAVAAAAAAAAAETDEEEGEESEAEESEGEGEEREGMSRLARAIERFGEVYERV 247 Query: 952 EGVKQRQMVELEKQRLEFMKSLEFQRMQLFMDWQVQLEKIKRAKR-AGSDDIYN 1110 EG K RQMVELEKQR++F K LE QRM++FMD QVQLE+IKR KR +GS DIY+ Sbjct: 248 EGEKLRQMVELEKQRMQFAKDLEVQRMRMFMDTQVQLERIKRGKRSSGSSDIYS 301 >ref|XP_003542983.1| PREDICTED: ESF1 homolog [Glycine max] Length = 312 Score = 274 bits (700), Expect = 9e-71 Identities = 156/307 (50%), Positives = 192/307 (62%), Gaps = 34/307 (11%) Frame = +1 Query: 292 TTVPQRPSHTPLPIREDCWSEDATYTLVEAWGERHLELNRGNLRQKHWQEVADAVNARHG 471 TT P P P+P+REDCWSE+A+ TLV+AWG R+LELNRGNLRQK WQ+VADAVNA HG Sbjct: 10 TTTP--PPSRPVPVREDCWSEEASSTLVDAWGRRYLELNRGNLRQKDWQDVADAVNALHG 67 Query: 472 AVKKARRTDVQCKNRIDTLKKKYKVEKARVLGSGGTLTSNWPFYSRLDALIGTNEPSKKV 651 KK RTDVQCKNRIDT+KKKYK+EKARV S GTL+S+WPFY RLDALIG N +KK Sbjct: 68 HTKKTHRTDVQCKNRIDTIKKKYKIEKARVSASNGTLSSSWPFYERLDALIGPNFSAKKS 127 Query: 652 AQSPL----LALP-LPYRK------NPPLLPHASLVPVAPRSAREKRSAPIAAVDDSFFX 798 SP +ALP LP+RK NP P ++ P A +++ SA AA+D+ +F Sbjct: 128 TSSPSPSPPVALPLLPHRKNQNQNQNPSSSPAIAVPPTAVALPQKRSSA--AAMDEGYFR 185 Query: 799 XXXXXXXXXXXXXXXXXXXXXXXXXXK-----------------------DGEVDGVGEL 909 D E +G+ L Sbjct: 186 RNYSAVAAAAAAAEADEDDEEEDEEEAEDLEEEEEEECEEEGRGSEVEEGDKEREGMRRL 245 Query: 910 AQAIVRFGEIYERVEGVKQRQMVELEKQRLEFMKSLEFQRMQLFMDWQVQLEKIKRAKRA 1089 A+AI RFGE+YERVE K RQMV+LEKQR++F K LE QRM++FMD QVQLE+IKR KR+ Sbjct: 246 AKAIERFGEVYERVEAQKLRQMVDLEKQRMQFAKDLEVQRMEMFMDTQVQLERIKRGKRS 305 Query: 1090 GSDDIYN 1110 GS+D+Y+ Sbjct: 306 GSNDMYS 312 >ref|XP_004233061.1| PREDICTED: uncharacterized protein LOC101252652 isoform 2 [Solanum lycopersicum] Length = 281 Score = 273 
bits (698), Expect = 1e-70 Identities = 151/273 (55%), Positives = 181/273 (66%), Gaps = 7/273 (2%) Frame = +1 Query: 313 SHTPLPIREDCWSEDATYTLVEAWGERHLELNRGNLRQKHWQEVADAVNARHGAVKKARR 492 S PLP REDCWSE AT+TLV+AWG R++ELNRGNLRQK WQ V+D+VNA HG +K R Sbjct: 14 SSRPLPFREDCWSEQATWTLVDAWGRRYMELNRGNLRQKDWQRVSDSVNALHGHSRKTHR 73 Query: 493 TDVQCKNRIDTLKKKYKVEKARVLGSGGTLTSNWPFYSRLDALIGTNEPSKKVAQSPLLA 672 TDVQCKNRIDTLKKKYKVEKA+++ S GTLTS+WPF+ RLD LI ++ KKV + Sbjct: 74 TDVQCKNRIDTLKKKYKVEKAKIIESNGTLTSSWPFFERLDMLIENSD--KKVTPVMVSL 131 Query: 673 LPLPYRKNPPL---LPHASLVPVAPRSAREKRSAPIAAVDDSFF----XXXXXXXXXXXX 831 LPLP +PP+ LP+ V P S +KR P A D+S F Sbjct: 132 LPLP-MSSPPMGVPLPYRRSAVVTPVSLPQKRQLP--AFDESCFRRNYSAVAAAAAAGGG 188 Query: 832 XXXXXXXXXXXXXXXKDGEVDGVGELAQAIVRFGEIYERVEGVKQRQMVELEKQRLEFMK 1011 + E DG+ LA+AI FGEIYERVEG+KQRQMVELEKQR++F K Sbjct: 189 EEDYDDEEEDVEMTEESWEEDGMRRLAKAIESFGEIYERVEGMKQRQMVELEKQRMQFAK 248 Query: 1012 SLEFQRMQLFMDWQVQLEKIKRAKRAGSDDIYN 1110 LE QRMQLFMD QVQLEKIK K +GS+D+Y+ Sbjct: 249 DLEVQRMQLFMDTQVQLEKIKHTKLSGSNDLYS 281 >gb|EYU43336.1| hypothetical protein MIMGU_mgv1a010263mg [Mimulus guttatus] Length = 317 Score = 273 bits (697), Expect = 2e-70 Identities = 158/318 (49%), Positives = 193/318 (60%), Gaps = 38/318 (11%) Frame = +1 Query: 271 DGTETPTTTVPQRPSHT-PLPIREDCWSEDATYTLVEAWGERHLELNRGNLRQKHWQEVA 447 D TE+ T P S + +P REDCW+++AT TLV+ WG R+LELNRGNLRQK WQEVA Sbjct: 3 DLTESSTPQAPPSASSSRQVPFREDCWTQEATSTLVDVWGRRYLELNRGNLRQKDWQEVA 62 Query: 448 DAVNARHGAVKKARRTDVQCKNRIDTLKKKYKVEKARVLGSGGTLTSNWPFYSRLDALIG 627 DAVN+RHG KK RTDVQCKNRIDTLKKKYKVEK+++ S GTLTS+WPF+ RLD LIG Sbjct: 63 DAVNSRHGLTKKTHRTDVQCKNRIDTLKKKYKVEKSKISDSNGTLTSSWPFFPRLDHLIG 122 Query: 628 --TNEPSKKVAQ-----------------SPLLALPLPYRKNPPLLPHASLVPVAPRSAR 750 N+ K AQ SP + +PLPYRK LL A + PV Sbjct: 123 PNLNKTPNKTAQAASPLQSSQFTPPMSLPSPPMGVPLPYRK---LLTSAMVTPVNLPILP 179 Query: 751 EKRSAPIAAVDDSFF-----------------XXXXXXXXXXXXXXXXXXXXXXXXXXXK 879 +KR P+ D+S+F + Sbjct: 180 
QKRPLPLQVADESYFRRNYSAMAAAAAAAEDEAEAEEDDGESEGEEEAAEGSDRAVEEEE 239 Query: 880 DGEVDGVGELAQAIVRFGEIYERVEGVKQRQMVELEKQRLEFMKSLEFQRMQLFMDWQVQ 1059 +G G+ LA+AI RFGEIYE+VE +KQRQM+ELEKQR++F K LE QRM+LFMD QVQ Sbjct: 240 EGGEHGMKRLAKAIERFGEIYEKVESMKQRQMIELEKQRMQFAKDLEVQRMRLFMDTQVQ 299 Query: 1060 LEKIKRAKRAGS-DDIYN 1110 LEKIK+AKR+GS DDIY+ Sbjct: 300 LEKIKQAKRSGSNDDIYS 317 >ref|XP_007025373.1| Sequence-specific DNA binding transcription factors isoform 1 [Theobroma cacao] gi|508780739|gb|EOY27995.1| Sequence-specific DNA binding transcription factors isoform 1 [Theobroma cacao] Length = 358 Score = 272 bits (695), Expect = 3e-70 Identities = 155/293 (52%), Positives = 189/293 (64%), Gaps = 18/293 (6%) Frame = +1 Query: 283 TPTTTVPQRPSHT---PLPIREDCWSEDATYTLVEAWGERHLELNRGNLRQKHWQEVADA 453 TP+TT + T PLP+REDCWSE+AT TLV+AWG R+LELNRGNLRQK WQ+VADA Sbjct: 9 TPSTTPTPLSTTTHSRPLPVREDCWSEEATSTLVDAWGRRYLELNRGNLRQKDWQDVADA 68 Query: 454 VNARHGAVKKARRTDVQCKNRIDTLKKKYKVEKARVLGSGGTLTSNWPFYSRLDALIGTN 633 VNA HG KK RTDVQCKNRIDT+KKKYK+EKARV S GTLTS+WPF+ RLDALIG+N Sbjct: 69 VNALHGHTKKTHRTDVQCKNRIDTIKKKYKIEKARVTSSNGTLTSSWPFFERLDALIGSN 128 Query: 634 EPSKKVAQSPLLALPLPYRKNPPLLPHASLV--PVAPRSAREKRSAPIAAV------DDS 789 +KK + SP ++ P P + P +P + V P+ R SA + A+ DD Sbjct: 129 FSAKKPSPSPKIS-PKPSPRLSPRIPGSPPVALPLPMPYRRTPASATVVALPQKRPADDG 187 Query: 790 FFXXXXXXXXXXXXXXXXXXXXXXXXXXXKD------GEVDGVGELAQAIVRFGEIYERV 951 +F + E +G+ LA+AI RFGE+YERV Sbjct: 188 YFRRNYSAVAAAAAAAAAETDEEEGEESEAEESEGEGEEREGMSRLARAIERFGEVYERV 247 Query: 952 EGVKQRQMVELEKQRLEFMKSLEFQRMQLFMDWQVQLEKIKRAKR-AGSDDIY 1107 EG K RQMVELEKQR++F K LE QRM++FMD QVQLE+IKR KR +GS IY Sbjct: 248 EGEKLRQMVELEKQRMQFAKDLEVQRMRMFMDTQVQLERIKRGKRSSGSSGIY 300 >gb|EYU28085.1| hypothetical protein MIMGU_mgv1a010659mg [Mimulus guttatus] Length = 306 Score = 270 bits (689), Expect = 2e-69 Identities = 160/325 (49%), Positives = 198/325 (60%), Gaps = 45/325 (13%) Frame = +1 Query: 271 DGTETPTTTVPQRPSHTPLPIREDCWSEDATYTLVEAWGERHLELNRGNLRQKHWQEVAD 450 D TE+ T PQ P+ +P 
REDCW+++AT TLV+AWG R+L+LNRGNLRQK WQEVAD Sbjct: 3 DLTESST---PQAPASRQVPFREDCWTQEATSTLVDAWGRRYLDLNRGNLRQKDWQEVAD 59 Query: 451 AVNARHGAVKKARRTDVQCKNRIDTLKKKYKVEKARVLGSGGTLTSNWPFYSRLDALIGT 630 VNA HG KK RRTDVQCKNRIDTLKKKYKVEKA++ SGG +S WPF+ RL+ LIG+ Sbjct: 60 VVNAHHGHTKKTRRTDVQCKNRIDTLKKKYKVEKAKITDSGGAASSPWPFFPRLEYLIGS 119 Query: 631 --NEPSKKVAQSPL---------------LALPLPYRK-------NPPLLPHASLVPVAP 738 N+ KV+ SP+ LA+PLPYR +PP+LP Sbjct: 120 NLNKTQGKVSPSPMPLASYTPPLCLPAPPLAVPLPYRNTRTPAVFSPPILP--------- 170 Query: 739 RSAREKRSAPIAAVDDSFFXXXXXXXXXXXXXXXXXXXXXXXXXXXKDGEVDG------- 897 +KR P AV++S+F DGE+DG Sbjct: 171 ----QKRPMPSPAVEESYF-----RKNYSAMAAAAAAAAAEDEAEEDDGELDGDEAEGSE 221 Query: 898 --------VGE-----LAQAIVRFGEIYERVEGVKQRQMVELEKQRLEFMKSLEFQRMQL 1038 VGE LA+AI FGEI+E+VE +KQRQM+ELEKQR++F K LE QRMQL Sbjct: 222 DRAAEEEEVGETGMRRLAKAIESFGEIFEKVESMKQRQMIELEKQRMQFAKDLEVQRMQL 281 Query: 1039 FMDWQVQLEKIKRAKRAG-SDDIYN 1110 FMD Q+QL+KIK+AKR+G SDDIY+ Sbjct: 282 FMDTQIQLQKIKQAKRSGSSDDIYS 306 >ref|XP_004485895.1| PREDICTED: ensconsin-like [Cicer arietinum] Length = 306 Score = 268 bits (686), Expect = 4e-69 Identities = 151/299 (50%), Positives = 190/299 (63%), Gaps = 32/299 (10%) Frame = +1 Query: 310 PSHTPLPIREDCWSEDATYTLVEAWGERHLELNRGNLRQKHWQEVADAVNARHGAVKKAR 489 P P+P+REDCWSE+A+ TLV+AWG R+LELNRGNLRQK WQ+VADAVNA HG KK Sbjct: 10 PPSRPVPVREDCWSEEASSTLVDAWGRRYLELNRGNLRQKDWQDVADAVNALHGHTKKTH 69 Query: 490 RTDVQCKNRIDTLKKKYKVEKARVLGSGGTLTSNWPFYSRLDALIGTNEPSKKVAQSPL- 666 RTDVQCKNRIDT+KKKYK+EKARV S G ++S+WPF+ RLDALIG N +KK + SP Sbjct: 70 RTDVQCKNRIDTIKKKYKIEKARVSSSNGVVSSSWPFFERLDALIGPNFSAKKSSSSPSP 129 Query: 667 ---LALP-LPYRKNPPLLPHASLVPVAPRSAREKRSAPIAAVDDSFFXXXXXXXXXXXXX 834 +ALP LP+RK P LP S+ P A + +KRS AA+D+ +F Sbjct: 130 SPPVALPLLPHRKFHPSLPAISVPPTAV-ALPQKRSI-AAAMDNGYFRRNYSAVAAAAAA 187 Query: 835 XXXXXXXXXXXXXXKDGEV---------------------------DGVGELAQAIVRFG 933 + E +G+ LA+AI +FG Sbjct: 188 AEADEEEEEEEEEEVEEEEEVEVEVEEHEHEEEGRGSEVEEGDKEREGMKRLAKAIEKFG 247 Query: 934 
EIYERVEGVKQRQMVELEKQRLEFMKSLEFQRMQLFMDWQVQLEKIKRAKRAGSDDIYN 1110 E+YERVEG K RQMV+LEKQR++F K LE QRM++FMD QVQLE+IKR KR+GS+D+Y+ Sbjct: 248 EMYERVEGQKLRQMVDLEKQRMQFAKDLEVQRMKMFMDTQVQLERIKRGKRSGSNDMYS 306 >ref|XP_003543305.1| PREDICTED: uncharacterized protein LOC100811154 [Glycine max] Length = 306 Score = 267 bits (683), Expect = 8e-69 Identities = 153/301 (50%), Positives = 183/301 (60%), Gaps = 25/301 (8%) Frame = +1 Query: 274 GTETPTTTVPQRPSHTPLPIREDCWSEDATYTLVEAWGERHLELNRGNLRQKHWQEVADA 453 G P P +P P REDCWSE+AT+TL+EAWG+RHLELNRGNLRQ+HWQEVADA Sbjct: 6 GAPPHADVTPPPPRQSPFPGREDCWSEEATFTLIEAWGQRHLELNRGNLRQRHWQEVADA 65 Query: 454 VNARHGAVK-KARRTDVQCKNRIDTLKKKYKVEKARVLGSGGTLTSNWPFYSRLDALIGT 630 VNARHG V KARRTDVQCKNRIDTLKKKYK+EKARV SG + T+ WPF+ RLD LIG Sbjct: 66 VNARHGHVSTKARRTDVQCKNRIDTLKKKYKIEKARVSDSGDSATT-WPFFRRLDFLIGD 124 Query: 631 NEPSKKVAQSPLLALPLPYRKNPPL-LPHASLVPVAPRSAREKRSA---PIAAVDDSFFX 798 N P+KK SP + R PP P +++PV PRS +KR A P +A DS Sbjct: 125 NFPAKK--PSPPATAGITRRSTPPAKSPPWAVIPVGPRSGTKKRPAAAKPASASPDSVAN 182 Query: 799 XXXXXXXXXXXXXXXXXXXXXXXXXXKDGE--------------------VDGVGELAQA 918 +G G E+A+A Sbjct: 183 SYFRRNFSVFAAAAAAAAAAAADSENSNGSKWSSGSEKGTMKKKRTRGDWEFGYREMAEA 242 Query: 919 IVRFGEIYERVEGVKQRQMVELEKQRLEFMKSLEFQRMQLFMDWQVQLEKIKRAKRAGSD 1098 + +FGEIYERVEG KQRQMVELEKQR++F K LE QRM+LFM+ QV L+KI R+KR+ + Sbjct: 243 LEKFGEIYERVEGAKQRQMVELEKQRMQFAKDLETQRMKLFMETQVHLQKINRSKRSSAS 302 Query: 1099 D 1101 D Sbjct: 303 D 303 >ref|XP_004233060.1| PREDICTED: uncharacterized protein LOC101252652 isoform 1 [Solanum lycopersicum] Length = 317 Score = 267 bits (682), Expect = 1e-68 Identities = 149/269 (55%), Positives = 177/269 (65%), Gaps = 7/269 (2%) Frame = +1 Query: 313 SHTPLPIREDCWSEDATYTLVEAWGERHLELNRGNLRQKHWQEVADAVNARHGAVKKARR 492 S PLP REDCWSE AT+TLV+AWG R++ELNRGNLRQK WQ V+D+VNA HG +K R Sbjct: 14 SSRPLPFREDCWSEQATWTLVDAWGRRYMELNRGNLRQKDWQRVSDSVNALHGHSRKTHR 73 Query: 493 TDVQCKNRIDTLKKKYKVEKARVLGSGGTLTSNWPFYSRLDALIGTNEPSKKVAQSPLLA 672 TDVQCKNRIDTLKKKYKVEKA+++ S GTLTS+WPF+ 
RLD LI ++ KKV + Sbjct: 74 TDVQCKNRIDTLKKKYKVEKAKIIESNGTLTSSWPFFERLDMLIENSD--KKVTPVMVSL 131 Query: 673 LPLPYRKNPPL---LPHASLVPVAPRSAREKRSAPIAAVDDSFF----XXXXXXXXXXXX 831 LPLP +PP+ LP+ V P S +KR P A D+S F Sbjct: 132 LPLP-MSSPPMGVPLPYRRSAVVTPVSLPQKRQLP--AFDESCFRRNYSAVAAAAAAGGG 188 Query: 832 XXXXXXXXXXXXXXXKDGEVDGVGELAQAIVRFGEIYERVEGVKQRQMVELEKQRLEFMK 1011 + E DG+ LA+AI FGEIYERVEG+KQRQMVELEKQR++F K Sbjct: 189 EEDYDDEEEDVEMTEESWEEDGMRRLAKAIESFGEIYERVEGMKQRQMVELEKQRMQFAK 248 Query: 1012 SLEFQRMQLFMDWQVQLEKIKRAKRAGSD 1098 LE QRMQLFMD QVQLEKIK K +GS+ Sbjct: 249 DLEVQRMQLFMDTQVQLEKIKHTKLSGSN 277 >ref|XP_004505494.1| PREDICTED: uncharacterized protein LOC101509429 isoform X2 [Cicer arietinum] Length = 302 Score = 266 bits (680), Expect = 2e-68 Identities = 155/289 (53%), Positives = 182/289 (62%), Gaps = 18/289 (6%) Frame = +1 Query: 289 TTTVPQRPSHTPLPIREDCWSEDATYTLVEAWGERHLELNRGNLRQKHWQEVADAVNARH 468 T+ P+ PS +P REDCWSEDAT+TL++AWGE +L+LNRGNLRQKHWQEVADAVN H Sbjct: 12 TSAPPRPPSVSPFTGREDCWSEDATFTLIDAWGEHYLDLNRGNLRQKHWQEVADAVNDIH 71 Query: 469 --GAVKKARRTDVQCKNRIDTLKKKYKVEKARVLGSGGTLTSNWPFYSRLDALIGTNEPS 642 G +KARRTDVQCKNRIDTLKKKYK+EKARV + G S W F+SRLDALIG P Sbjct: 72 AAGNNRKARRTDVQCKNRIDTLKKKYKIEKARVSETDGGYQSPWSFFSRLDALIGDTFPI 131 Query: 643 KKVAQSPLLALPLPYRKNPPLLPHASLV---PVAPRSAREKRSAPIAAVDDSFFXXXXXX 813 KK++ + K PPL P + + PV PRS +KR A +A DD+ F Sbjct: 132 KKLSPPANVRTTPAAVKPPPLPPPPAWIISHPVGPRSGTQKRPA-LANRDDASFRRNFSA 190 Query: 814 XXXXXXXXXXXXXXXXXXXXXKDG--------EVD-----GVGELAQAIVRFGEIYERVE 954 G E D G ELAQAI RFG+IYERVE Sbjct: 191 FAAAAAAAAEAESEESEEWRSSSGTGSGKKGKESDKNLEFGFRELAQAIERFGDIYERVE 250 Query: 955 GVKQRQMVELEKQRLEFMKSLEFQRMQLFMDWQVQLEKIKRAKRAGSDD 1101 KQRQMVELEKQR++F K LEFQRMQLFM+ QVQL+KIKR+KR+ D Sbjct: 251 DAKQRQMVELEKQRMQFAKDLEFQRMQLFMETQVQLQKIKRSKRSSGSD 299 >ref|XP_004293945.1| PREDICTED: uncharacterized protein LOC101295202 isoform 2 [Fragaria vesca subsp. 
vesca] Length = 310 Score = 266 bits (680), Expect = 2e-68 Identities = 156/318 (49%), Positives = 189/318 (59%), Gaps = 31/318 (9%) Frame = +1 Query: 250 NLTETAMDGTETPTTTVPQRPSHTPLPIREDCWSEDATYTLVEAWGERHLELNRGNLRQK 429 +LTE+ T T T P P+P+REDCWSEDAT TL++AWG R++ELNRGNLRQK Sbjct: 3 DLTESLTPSTATATPNSSSNPR--PMPVREDCWSEDATSTLIDAWGRRYVELNRGNLRQK 60 Query: 430 HWQEVADAVNARHGAVKKARRTDVQCKNRIDTLKKKYKVEKARVLGSGGTLTSNWPFYSR 609 WQEVADAVNA HG KK RTDVQCKNRIDT+KKKYKVEKARV SGG S+WPF+ R Sbjct: 61 DWQEVADAVNALHGHTKKTHRTDVQCKNRIDTIKKKYKVEKARV--SGG-FNSSWPFFDR 117 Query: 610 LDALIGTNEPSKKVAQSPLLALPLP--YRKNPPLLPHASLVPVAPRSAREKRSAPIAAVD 783 LD+LIG+ + SP +A+PLP YRK P P + V + +KRSA AA++ Sbjct: 118 LDSLIGSTVKKPSPSLSPPVAVPLPLAYRKPSPARPVVTAVALP-----QKRSASAAALN 172 Query: 784 DSFFXXXXXXXXXXXXXXXXXXXXXXXXXXXK-------------------DGEVDGVG- 903 + FF + DGE+ G G Sbjct: 173 EGFFRMNYSAVAAAAAAENDDDDDEDEEDEEEEEDNDDGDEEAEVEKERDSDGEIGGGGG 232 Query: 904 ---------ELAQAIVRFGEIYERVEGVKQRQMVELEKQRLEFMKSLEFQRMQLFMDWQV 1056 LA+AI RFGE+Y+RVE K RQM ELEKQR++F K LE QRM +FMD QV Sbjct: 233 GGVGGEGLKRLARAIERFGEVYQRVEADKIRQMTELEKQRMQFAKDLEIQRMSMFMDTQV 292 Query: 1057 QLEKIKRAKRAGSDDIYN 1110 QLE+IK KR GS+DIY+ Sbjct: 293 QLERIKHGKRPGSNDIYS 310 >ref|XP_003539553.1| PREDICTED: uncharacterized protein LOC100784918 [Glycine max] Length = 307 Score = 266 bits (679), Expect = 2e-68 Identities = 156/294 (53%), Positives = 185/294 (62%), Gaps = 24/294 (8%) Frame = +1 Query: 292 TTVPQRPSHTPLPIREDCWSEDATYTLVEAWGERHLELNRGNLRQKHWQEVADAVNARHG 471 T P RPS P P REDCWSE+AT+TL+EAWG+RHLELNRGNLRQ+HWQEVADAVNA HG Sbjct: 14 TPPPPRPS--PFPGREDCWSEEATFTLIEAWGQRHLELNRGNLRQRHWQEVADAVNALHG 71 Query: 472 AVK-KARRTDVQCKNRIDTLKKKYKVEKARVLGSGGTLTSNWPFYSRLDALIGTNEPSKK 648 V KARRTDVQCKNRIDTLKKKYK+EKARV SG + T+ WPF+ RLD LIG N P+KK Sbjct: 72 HVSAKARRTDVQCKNRIDTLKKKYKIEKARVSDSGDSATT-WPFFRRLDFLIGDNFPAKK 130 Query: 649 VAQSPLLALPLPYRKNPPL-LPHASLVPVAPRSAREKR-------SAPIAAVDDSFF--- 795 + P + + R PP P ++PV PRS +KR SA +V DS+F Sbjct: 131 
PSPPPPSSAGVTRRSTPPAKSPLWPVIPVGPRSGTKKRPAQAKPASASPDSVADSYFRRN 190 Query: 796 ---------XXXXXXXXXXXXXXXXXXXXXXXXXXXKDGEVD---GVGELAQAIVRFGEI 939 K G D G E+A+A+ RFGEI Sbjct: 191 FSVFAAAAAAAAAEADSENSDGSKWSSGSEKGTMKKKRGRGDWEFGYREMAEALERFGEI 250 Query: 940 YERVEGVKQRQMVELEKQRLEFMKSLEFQRMQLFMDWQVQLEKIKRAKRAGSDD 1101 YERVE KQRQMVELEKQR++F K LE QRM+LFM+ QV L+KI R+KR+ + D Sbjct: 251 YERVEEAKQRQMVELEKQRMQFAKDLETQRMKLFMETQVHLQKINRSKRSSASD 304 >ref|XP_002303549.1| hypothetical protein POPTR_0003s11850g [Populus trichocarpa] gi|222840981|gb|EEE78528.1| hypothetical protein POPTR_0003s11850g [Populus trichocarpa] Length = 290 Score = 265 bits (678), Expect = 3e-68 Identities = 152/286 (53%), Positives = 184/286 (64%), Gaps = 14/286 (4%) Frame = +1 Query: 283 TPTTTVPQRPSHTPLPIREDCWSEDATYTLVEAWGERHLELNRGNLRQKHWQEVADAVNA 462 TP+TT P PLPIREDCWSE+AT TLV+AWG R+LELNRGNLRQK WQ+VADAVNA Sbjct: 11 TPSTT----PHSRPLPIREDCWSEEATSTLVDAWGRRYLELNRGNLRQKDWQDVADAVNA 66 Query: 463 RHGAVKKARRTDVQCKNRIDTLKKKYKVEKARVLGSGGTLTSNWPFYSRLDALIGTNEPS 642 HG KK RTDVQCKNRIDT+KKKYK+EK+ V+ S GTLTS+WPF+ RLDALIG+N S Sbjct: 67 LHGHTKKTYRTDVQCKNRIDTIKKKYKIEKSHVVSSNGTLTSSWPFFERLDALIGSNFNS 126 Query: 643 ---KKVAQSPLLALPLP--YRKNPPLLPHASLVPVAPRSA-REKRSAPIAAVDDSFF--- 795 K ++ SP +ALPLP YR+ P + P A A +KR P VDD +F Sbjct: 127 SGKKHLSPSPPVALPLPPSYRRTPQVSSTPPPQPPALAVALPQKRPLP---VDDDYFRRN 183 Query: 796 -----XXXXXXXXXXXXXXXXXXXXXXXXXXXKDGEVDGVGELAQAIVRFGEIYERVEGV 960 +D E +G+ LA AI RFGE+YERVE Sbjct: 184 YSAMAAAAAAVESDSEEDEDEEFEGGERERAEEDVEGEGIKRLALAIERFGEVYERVESE 243 Query: 961 KQRQMVELEKQRLEFMKSLEFQRMQLFMDWQVQLEKIKRAKRAGSD 1098 K +QMV+LEKQR++F K LE +RM++F + QVQLEKIK+ KRA D Sbjct: 244 KLKQMVDLEKQRMKFAKDLEMERMRIFTETQVQLEKIKKGKRAPED 289 >ref|XP_004505493.1| PREDICTED: uncharacterized protein LOC101509429 isoform X1 [Cicer arietinum] Length = 303 Score = 265 bits (676), Expect = 5e-68 Identities = 157/290 (54%), Positives = 184/290 (63%), Gaps = 19/290 (6%) Frame = +1 Query: 289 
TTTVPQRPSHTPLPIREDCWSEDATYTLVEAWGERHLELNRGNLRQKHWQEVADAVNARH 468 T+ P+ PS +P REDCWSEDAT+TL++AWGE +L+LNRGNLRQKHWQEVADAVN H Sbjct: 12 TSAPPRPPSVSPFTGREDCWSEDATFTLIDAWGEHYLDLNRGNLRQKHWQEVADAVNDIH 71 Query: 469 --GAVKKARRTDVQCKNRIDTLKKKYKVEKARVLGSGGTLTSNWPFYSRLDALIGTNEPS 642 G +KARRTDVQCKNRIDTLKKKYK+EKARV + G S W F+SRLDALIG P Sbjct: 72 AAGNNRKARRTDVQCKNRIDTLKKKYKIEKARVSETDGGYQSPWSFFSRLDALIGDTFPI 131 Query: 643 KKVAQSPLLALPLPYRKNPPLLPHASLV---PVAPRSAREKRSAPIAAVDDSFFXXXXXX 813 KK++ + K PPL P + + PV PRS +KR A +A DD+ F Sbjct: 132 KKLSPPANVRTTPAAVKPPPLPPPPAWIISHPVGPRSGTQKRPA-LANRDDASFRRNFSA 190 Query: 814 XXXXXXXXXXXXXXXXXXXXXKDG--------EVD-----GVGELAQAIVRFGEIYERVE 954 G E D G ELAQAI RFG+IYERVE Sbjct: 191 FAAAAAAAAEAESEESEEWRSSSGTGSGKKGKESDKNLEFGFRELAQAIERFGDIYERVE 250 Query: 955 GVKQRQMVELEKQRLEFMKSLEFQRMQLFMDWQVQLEKIKRAKR-AGSDD 1101 KQRQMVELEKQR++F K LEFQRMQLFM+ QVQL+KIKR+KR +GS D Sbjct: 251 DAKQRQMVELEKQRMQFAKDLEFQRMQLFMETQVQLQKIKRSKRSSGSAD 300