BLASTX nr result
ID: Mentha28_contig00001875
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00001875
         (2296 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits)  Value

gb|EYU46216.1| hypothetical protein MIMGU_mgv1a023243mg, partial...   735   0.0
ref|XP_006362316.1| PREDICTED: uncharacterized protein LOC102579...   553   e-154
ref|XP_004251353.1| PREDICTED: uncharacterized protein LOC101256...   548   e-153
emb|CBI24209.3| unnamed protein product [Vitis vinifera]              532   e-148
ref|XP_007214563.1| hypothetical protein PRUPE_ppa000168mg [Prun...   529   e-147
gb|EYU35929.1| hypothetical protein MIMGU_mgv1a024138mg, partial...   522   e-145
ref|XP_007015163.1| DNA binding,zinc ion binding,DNA binding, pu...   517   e-143
ref|XP_007015161.1| DNA binding,zinc ion binding,DNA binding, pu...   517   e-143
ref|XP_004291756.1| PREDICTED: uncharacterized protein LOC101311...   516   e-143
ref|XP_002274937.2| PREDICTED: uncharacterized protein LOC100260...   514   e-143
gb|EXC04604.1| Nucleosome-remodeling factor subunit BPTF [Morus ...   511   e-142
ref|XP_002513535.1| hypothetical protein RCOM_1578820 [Ricinus c...   510   e-142
ref|XP_002313363.2| hypothetical protein POPTR_0009s05370g [Popu...   506   e-140
ref|XP_006590775.1| PREDICTED: uncharacterized protein LOC100800...   503   e-139
ref|XP_003538947.1| PREDICTED: uncharacterized protein LOC100800...   503   e-139
ref|XP_002299794.2| hypothetical protein POPTR_0001s26130g, part...   501   e-139
ref|XP_003540620.1| PREDICTED: uncharacterized protein LOC100791...   499   e-138
ref|XP_006470705.1| PREDICTED: uncharacterized protein LOC102628...   497   e-138
ref|XP_006446212.1| hypothetical protein CICLE_v10014020mg [Citr... 
494 e-137 ref|XP_004505800.1| PREDICTED: uncharacterized protein LOC101501... 484 e-134 >gb|EYU46216.1| hypothetical protein MIMGU_mgv1a023243mg, partial [Mimulus guttatus] Length = 1772 Score = 735 bits (1897), Expect = 0.0 Identities = 386/648 (59%), Positives = 451/648 (69%), Gaps = 3/648 (0%) Frame = +3 Query: 6 TVGSATHIVSSSARASSKHGISRKRAKSSDVSG-PSSNAATGLSLFWWRGGRGSRSLFNW 182 T+GSA+HIVSSSAR SSKHGI RK K+SDV PSSNAA GLSLFWWRGG SR LFNW Sbjct: 1125 TMGSASHIVSSSARVSSKHGIGRKSIKNSDVERTPSSNAAKGLSLFWWRGGTSSRKLFNW 1184 Query: 183 KVLPHSLASKAARQGGRRKIPGILYPDSGEYAKRTKYDSWRAAVETSRSVEQLALQVREL 362 K LP SLASKAARQGG +KIP ILYPD+G+YAKRTKY +WRAAVE+S SV+QLALQVREL Sbjct: 1185 KSLPRSLASKAARQGGCKKIPTILYPDNGDYAKRTKYVAWRAAVESSTSVDQLALQVREL 1244 Query: 363 DANIRWDDIGNNKLLSMIEMDSKKPVRSFKKVVIRRKCSEGATVRYLLDFGKRRFIPDIV 542 DANI+WDDIGNN LLS I+ DSKKP RSFKKVVIRRKCSEGA VRYLLDFGKRRFIPD+V Sbjct: 1245 DANIKWDDIGNNNLLSKIDKDSKKPARSFKKVVIRRKCSEGAVVRYLLDFGKRRFIPDVV 1304 Query: 543 VRYGSKLEDFSNEKKRYWLEETHVPLYLLKAFEEKRIARKSNPMKSGKLCESSQIIKKPL 722 +++GS LED S+ KKRYWLEE++VPL+LLKAFEEK+IARKSN MKSG LCESS ++KP Sbjct: 1305 LKHGSILEDSSSAKKRYWLEESYVPLHLLKAFEEKKIARKSNQMKSGNLCESSGKLRKPF 1364 Query: 723 KEKGFSYLFSRAERLEYDQCEQCKKDVLIREAVSCQRCKGFFHKWHVRKSAGSITRQRKY 902 K+KGF YLF+RAERLE QC CKKDVLIR ++ FFHK H+RKSAGS+T + Y Sbjct: 1365 KDKGFQYLFARAERLENYQCGHCKKDVLIRYNIALIYFYSFFHKRHIRKSAGSVTTECTY 1424 Query: 903 TCQKCQDGKFVKTDARKGKPELPKLRNASKSVXXXXXXXXXXXXXXXXQVNLTNKKRVPF 1082 TC KCQ GK VK D R+G E KL+ + S +VN +K VP Sbjct: 1425 TCHKCQSGKLVKVDTREGISESSKLKKSFHS------RKGKKKGKEKPKVNPKGRKGVPL 1478 Query: 1083 VVPLRRSARNAERVAKVTVQNSXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRMPVTSSYW 1262 VVPLRRSARNA RV K+ ++N+ +R PV SSYW Sbjct: 1479 VVPLRRSARNAARVTKLALKNTKVKKRKRGRKAKAEKVIPKKSKNKSLKNKRTPVNSSYW 1538 Query: 1263 LNGLRLSRRPGDER--HLRNRKLLVLSGEVDCILNKTICSLCREVEHKSELNYVGCEICG 1436 LNGL+ SRRP DER H RNR LLVLSGEV +K CSLC EVEHKS LNYV CEICG Sbjct: 1539 LNGLQFSRRPNDERLAHFRNRMLLVLSGEVTSFQDKPKCSLCSEVEHKSVLNYVSCEICG 1598 Query: 1437 
DWFHGDALNLGADKIENLIGFKCHMCLNKTPPICPHHCPVESSNPEIFSENNAKIECTEE 1616 WFHGDALNLGA +I NLIGFKC+ CLNK PP+CPHHC S+ ++ ENN EC E Sbjct: 1599 VWFHGDALNLGAGEIGNLIGFKCYTCLNKKPPVCPHHCSTGSNKADLVLENNTNTECVVE 1658 Query: 1617 DPHCLASPDNELAHQKSHFNDKSADTLSAVNMEKQLQVSVGELDLKDEDLEMAEKIFLGN 1796 + N +S D VNMEKQ S+ D KD++ +E I L N Sbjct: 1659 NS-----------------NKESNDLFLTVNMEKQSSESISASDQKDKEFPSSENILLPN 1701 Query: 1797 DPIELGGKKGDVSSTLETESTIQNSDMVREAECPTIIHDLVENGMANN 1940 D ++ KKG+ + +ETE+TI NSDMV++ EC + +LVE+G+ NN Sbjct: 1702 DFVD---KKGEALNAVETEATIHNSDMVKKDECLPLTQNLVEDGLTNN 1746 >ref|XP_006362316.1| PREDICTED: uncharacterized protein LOC102579382 [Solanum tuberosum] Length = 1718 Score = 553 bits (1426), Expect = e-154 Identities = 294/585 (50%), Positives = 377/585 (64%), Gaps = 15/585 (2%) Frame = +3 Query: 9 VGSATHIVSSSARASSKHGISRKRAKSSDVS-GPSSNAATGLSLFWWRGGRGSRSLFNWK 185 +GS HI+ +S+R +HGI +K+++ + PSSNA +GLSLFWWRGGR SR LFNWK Sbjct: 1054 MGSGHHIIINSSRV--RHGIGKKKSRHLEPEVNPSSNAGSGLSLFWWRGGRLSRRLFNWK 1111 Query: 186 VLPHSLASKAARQGGRRKIPGILYPDSGEYAKRTKYDSWRAAVETSRSVEQLALQVRELD 365 +LP SLA KAARQGG +KIP +LYPD+ ++AKR K +WRAAVETSR+VEQLALQVR+LD Sbjct: 1112 LLPQSLARKAARQGGCKKIPDMLYPDNSDFAKRNKCIAWRAAVETSRTVEQLALQVRDLD 1171 Query: 366 ANIRWDDIGNNKLLSMIEMDSKKPVRSFKKVVIRRKCSEGATVRYLLDFGKRRFIPDIVV 545 A+IRWDDIGN +L++I+ + +K VRSFKK +R+K SEG+ V+YLLDFGKRRF+PDIVV Sbjct: 1172 AHIRWDDIGNTNILAIIDKEFQKAVRSFKKATVRKKSSEGSVVKYLLDFGKRRFLPDIVV 1231 Query: 546 RYGSKLEDFSNEKKRYWLEETHVPLYLLKAFEEKRIARKSNPMKSGKLCESSQIIKKPLK 725 R G+ E+ S E+KRYWLEE+H+PL+L+K FEEKRIARKS+ + GK E+ +I+KKPLK Sbjct: 1232 RCGTIPEEASTERKRYWLEESHMPLHLVKGFEEKRIARKSSKITVGKHRETKRIMKKPLK 1291 Query: 726 EKGFSYLFSRAERLEYDQCEQCKKDVLIREAVSCQRCKGFFHKWHVRKSAGSITRQRKYT 905 EKGF+YLF +AER EY QC C KDVLIREAVSCQ CKGFFHK HVRKS G + + K+T Sbjct: 1292 EKGFAYLFLKAERSEYYQCGHCNKDVLIREAVSCQYCKGFFHKRHVRKSTGVVAAEFKHT 1351 Query: 906 CQKCQDGKFVKTDARKGKPELPKLRNASKSVXXXXXXXXXXXXXXXXQVNLTNKKRVPFV 1085 C KC D V+ + ++G+ E+ K ASK++ L + K+ P V Sbjct: 1352 
CHKCMDVNNVRKNVKRGRIEMQKSEEASKALRPLRLKIISGGTKNKQPAQLLSSKKKPVV 1411 Query: 1086 VPLRRSARNAERVA---------KVTVQNSXXXXXXXXXXXXXXXXXXXXXXXXXXXXQR 1238 +PLRRSAR A+ V K S +R Sbjct: 1412 IPLRRSARRAKFVVVQNKKIGRKKGKQTKSGRGRGRPRKQAKVDISEKKKPAEVAWRRKR 1471 Query: 1239 MPVTSSYWLNGLRLSRRPGDER--HLRNRKLLVLSGEVDCILNKTICSLCREVEHKSELN 1412 M + YWLNGL LS++P DER R++KLLVLSGE+ ++ C LC E+E+ N Sbjct: 1472 MQLCRIYWLNGLLLSQKPKDERVTLFRSKKLLVLSGELGGTADQPKCCLCGELEYTPTSN 1531 Query: 1413 YVGCEICGDWFHGDALNLGADKIENLIGFKCHMCLNKTPPICPHHCPVESSNPEIFSENN 1592 Y+ CE+CGDWFHGDA L A++I LIGFKCH C +TPP C H +S ++ E Sbjct: 1532 YIACEVCGDWFHGDAFGLTAERITKLIGFKCHECRQRTPPFCAHLHASDSKGKQVMLEGT 1591 Query: 1593 AKIECTEEDPHC---LASPDNELAHQKSHFNDKSADTLSAVNMEK 1718 EC D C L S L QKSH ND+S + + EK Sbjct: 1592 ---ECRAADETCDIELVSSKGPL-EQKSHLNDESGSCFTGDSGEK 1632 >ref|XP_004251353.1| PREDICTED: uncharacterized protein LOC101256352 [Solanum lycopersicum] Length = 1884 Score = 548 bits (1412), Expect = e-153 Identities = 293/586 (50%), Positives = 374/586 (63%), Gaps = 16/586 (2%) Frame = +3 Query: 9 VGSATHIVSSSARASSKHGISRKRAKSSDVS-GPSSNAATGLSLFWWRGGRGSRSLFNWK 185 +GS HI+ +S+R +HGI +K+A+ + PSSNA +GLSLFWWRGGR SR LFNWK Sbjct: 1228 MGSGHHIIINSSRV--RHGIGKKKARHLEPEVNPSSNAGSGLSLFWWRGGRLSRRLFNWK 1285 Query: 186 VLPHSLASKAARQGGRRKIPGILYPDSGEYAKRTKYDSWRAAVETSRSVEQLALQVRELD 365 +LP SLA KAARQGG +KIP +LYPD+ ++AKR K +WRAAVETSR+VEQLALQVR+LD Sbjct: 1286 LLPQSLARKAARQGGCKKIPDMLYPDNSDFAKRNKCIAWRAAVETSRTVEQLALQVRDLD 1345 Query: 366 ANIRWDDIGNNKLLSMIEMDSKKPVRSFKKVVIRRKCSEGATVRYLLDFGKRRFIPDIVV 545 A+IRWDDIGN +L++I+ + +K VRSFKK +R+K SEG+ V+YLLDFGKRRF+PDIVV Sbjct: 1346 AHIRWDDIGNTNILAIIDKEFQKAVRSFKKATVRKKSSEGSVVKYLLDFGKRRFLPDIVV 1405 Query: 546 RYGSKLEDFSNEKKRYWLEETHVPLYLLKAFEEKRIARKSNPMKSGKLCESSQIIKKPLK 725 R G+ E+ S E+KRYWLEE H+PL+L+K FEEKRIARKS+ + GK E+ +I+KKPLK Sbjct: 1406 RCGTVPEEASTERKRYWLEEAHMPLHLVKGFEEKRIARKSSKITVGKHRETKRIMKKPLK 1465 Query: 726 EKGFSYLFSRAERLEYDQCEQCKKDVLIREAVSCQRCKGFFHKWHVRKSAGSITRQRKYT 905 EKGF+YLF +AER EY QC C KDVLIREAVSCQ 
CKGFFHK HVRKS G + + K+T Sbjct: 1466 EKGFAYLFLKAERSEYYQCGHCNKDVLIREAVSCQYCKGFFHKRHVRKSTGVVAAEFKHT 1525 Query: 906 CQKCQDGKFVKTDARKGKPELPKLRNASKSVXXXXXXXXXXXXXXXXQVNLTNKKRVPFV 1085 C KC D V+ + ++G+ E+ K ASK++ + K+ P V Sbjct: 1526 CHKCMDVNNVRKNVKRGRIEMQKSEEASKALRPLRLKVISGGTKNKQPAQSPSSKKKPVV 1585 Query: 1086 VPLRRSARNAERVA---------KVTVQNSXXXXXXXXXXXXXXXXXXXXXXXXXXXXQR 1238 +PLRRSAR A+ V K S +R Sbjct: 1586 MPLRRSARRAKFVVVQNKKIGRKKGKQTKSGRGRGRPRKHAKVDISEKKKPAEVAWRRKR 1645 Query: 1239 MPVTSSYWLNGLRLSRRPGDER--HLRNRKLLVLSGEVDCILNKTICSLCREVEHKSELN 1412 M + YWLNGL LS++P DER R++KLLVLSGE+ ++ CSLC E+E+ N Sbjct: 1646 MQLCRIYWLNGLLLSQKPKDERVTLFRSKKLLVLSGELGGAADQPKCSLCGELEYTPTSN 1705 Query: 1413 YVGCEICGDWFHGDALNLGADKIENLIGFKCHMCLNKTPPICPHHCPVESSNPEIFSENN 1592 Y+ CE+CGDWFHGDA L A++I LIGFKCH C + PP C H S ++ E+ Sbjct: 1706 YIACEVCGDWFHGDAFGLTAERITKLIGFKCHECRQRNPPFCAHLHATNSKGKQVMWEST 1765 Query: 1593 AKIECTEEDP----HCLASPDNELAHQKSHFNDKSADTLSAVNMEK 1718 EC D L+S QKSH ND+S + N EK Sbjct: 1766 ---ECKSADETFDIESLSSKGP--LEQKSHLNDESGSCFTGDNGEK 1806 >emb|CBI24209.3| unnamed protein product [Vitis vinifera] Length = 1805 Score = 532 bits (1370), Expect = e-148 Identities = 289/543 (53%), Positives = 349/543 (64%), Gaps = 30/543 (5%) Frame = +3 Query: 6 TVGSATHIVSSSARASSKHGISRKRAKSSD-VSGPSSNAATGLSLFWWRGGRGSRSLFNW 182 T+GSA+HIV SS RASSK G+ +KR + S VS PSSNAATGLSLFWWRGGR SR LFNW Sbjct: 1045 TMGSASHIVISS-RASSKLGVGKKRTRCSGFVSKPSSNAATGLSLFWWRGGRLSRKLFNW 1103 Query: 183 KVLPHSLASKAARQGGRRKIPGILYPDSGEYAKRTKYDSWRAAVETSRSVEQLALQVREL 362 KVLP SLASKAARQ G KIPGILYP+S E+AKR KY WR+AVETS SVEQLAL VREL Sbjct: 1104 KVLPRSLASKAARQAGCTKIPGILYPESSEFAKRNKYVVWRSAVETSTSVEQLALLVREL 1163 Query: 363 DANIRWDDIGNNKLLSMIEMDSKKPVRSFKKVVIRRKCSEGATVRYLLDFGKRRFIPDIV 542 D NIRWDDI N L ++ +++K +R F+KV+IRRKC EG +YLLDFGKR+ IPD+V Sbjct: 1164 DLNIRWDDIENTHPLFKLDKEARKSIRPFRKVIIRRKCIEGTISKYLLDFGKRKIIPDVV 1223 Query: 543 VRYGSKLEDFSNEKKRYWLEETHVPLYLLKAFEEKRIARKSNPMKSGKLCESSQIIKKPL 722 V++GS LE+ S+E+K+YWL+E+HVPL+LLKAFEEKRIARKS+ + SGKL E + 
+KKP Sbjct: 1224 VKHGSILEESSSERKKYWLDESHVPLHLLKAFEEKRIARKSSNINSGKLNEGGREMKKPS 1283 Query: 723 KEKGFSYLFSRAERLEYDQCEQCKKDVLIREAVSCQRCKGFFHKWHVRKSAGSITRQRKY 902 K+KGFSYLF +AER E QC CKKDVL REAVSCQ CKG+FHK HVRKSAGSI+ + Y Sbjct: 1284 KDKGFSYLFLKAERSENYQCGHCKKDVLTREAVSCQYCKGYFHKRHVRKSAGSISAECTY 1343 Query: 903 TCQKCQDGKFVKTDARKGKPELPKLRNASKSVXXXXXXXXXXXXXXXXQ----------- 1049 TC KCQDGK +K +A+ G + K + S + + Sbjct: 1344 TCHKCQDGKPMKINAKIGNVQSQKGKKGSTDLYKKKGKAYKNCRLLGSKSGKKIFTKEQP 1403 Query: 1050 ---------------VNLTNKKRVPFVVPLRRSARNAE-RVAKVTVQNSXXXXXXXXXXX 1181 V K+ V VVPLRRSAR + R K + + Sbjct: 1404 VRSCKGRKPSTGKRPVRSLVKREVSTVVPLRRSARKIKFRTPKKPKKET----------- 1452 Query: 1182 XXXXXXXXXXXXXXXXXQRMPVTSSYWLNGLRLSRRPGDER--HLRNRKLLVLSGEVDCI 1355 +R V SYWLNGL LSR P D+R R +L V S ++ + Sbjct: 1453 -----------SWKKKKRRTLVCYSYWLNGLLLSRMPNDDRVMQFRRERLFVPSEHLNVV 1501 Query: 1356 LNKTICSLCREVEHKSELNYVGCEICGDWFHGDALNLGADKIENLIGFKCHMCLNKTPPI 1535 ++K C LC E H LNY+ CEICGDWFHGDA L + I NLIGF+CH C +TPP Sbjct: 1502 IDKPTCHLCAEAGHTPMLNYINCEICGDWFHGDAFGLDVETIGNLIGFRCHECCKRTPPA 1561 Query: 1536 CPH 1544 CPH Sbjct: 1562 CPH 1564 >ref|XP_007214563.1| hypothetical protein PRUPE_ppa000168mg [Prunus persica] gi|462410428|gb|EMJ15762.1| hypothetical protein PRUPE_ppa000168mg [Prunus persica] Length = 1545 Score = 529 bits (1363), Expect = e-147 Identities = 276/552 (50%), Positives = 369/552 (66%), Gaps = 12/552 (2%) Frame = +3 Query: 6 TVGSATHIVSSSARASSKHGISRKRAKSSDVSG-PSSNAATGLSLFWWRGGRGSRSLFNW 182 T+GSA+H+V+S RA SK+ I+RKR K SD+ P+SNAA+GL +FWWRGGR SR +F+W Sbjct: 987 TMGSASHVVTS-LRAYSKNFINRKRPKCSDIEPTPTSNAASGLGMFWWRGGRLSRQVFSW 1045 Query: 183 KVLPHSLASKAARQGGRRKIPGILYPDSGEYAKRTKYDSWRAAVETSRSVEQLALQVREL 362 KVLP SL SKAARQ G KI GILYP++ EYAKR+K SWRAAVE S SVEQLALQVREL Sbjct: 1046 KVLPRSLTSKAARQAGCSKILGILYPENSEYAKRSKSVSWRAAVEASTSVEQLALQVREL 1105 Query: 363 DANIRWDDIGNNKLLSMIEMDSKKPVRSFKKVVIRRKCSEGATVRYLLDFGKRRFIPDIV 542 D NIRW+DI N+ L ++ +S+K ++ FKKV++RRKCSEG V YLLDFGKRR IPDIV Sbjct: 1106 
DLNIRWNDIENSHPLPTLDKESRKSIKLFKKVIVRRKCSEGKVVNYLLDFGKRRGIPDIV 1165 Query: 543 VRYGSKLEDFSNEKKRYWLEETHVPLYLLKAFEEKRIARKSNPMKSGKLCESSQIIKKPL 722 ++GS LE+ S+E+K+YWL+E+++PL+LLK FEE+RIARKS+ ++SGK+ E ++ K+P Sbjct: 1166 KKHGSVLEELSSERKKYWLDESYLPLHLLKNFEERRIARKSSDVRSGKVIEVGRVAKRPR 1225 Query: 723 KEKGFSYLFSRAERLEYDQCEQCKKDVLIREAVSCQRCKGFFHKWHVRKSAGSITRQRKY 902 ++KGF YLFS+AER EY +C C KDVL+REAVSCQ CKGFFHK H RKSAG++ + KY Sbjct: 1226 EKKGFMYLFSKAERSEYHKCGHCNKDVLMREAVSCQYCKGFFHKRHARKSAGAVVARCKY 1285 Query: 903 TCQKCQDGKFVKTDARK-------GKPELPKLRNASKSVXXXXXXXXXXXXXXXXQVNLT 1061 TC +CQ+G K D ++ GK + K +N+ Q+ L Sbjct: 1286 TCHRCQNGLCAKIDTKRRKVETKGGKVQSQKCKNSQTERRSLRLKNNKKALAGGQQLRLK 1345 Query: 1062 NKKRVPFVVPLRRSARNAERVAKVTVQN-SXXXXXXXXXXXXXXXXXXXXXXXXXXXXQR 1238 N K++P VPLRRS R +V + +QN +R Sbjct: 1346 NSKKIPASVPLRRSPR---KVKCLPLQNKKRSKRKKGKKSKSNTTTCKKPKRVTSWQKKR 1402 Query: 1239 MPVTSSYWLNGLRLSRRPGDERHL--RNRKLLVLSGEVDCILNKTICSLCREVEHKSELN 1412 V SYWLNGL LSR+P DER + R++KLL SG IL++ C LC E + S LN Sbjct: 1403 TQVCHSYWLNGLLLSRKPNDERAMLFRDKKLLAHSGCSPVILDQLKCPLCCEASYTSALN 1462 Query: 1413 YVGCEICGDWFHGDALNLGADKIENLIGFKCHMCLNKTPPICPHHCPVESSNPEIF-SEN 1589 Y+ CEIC WFH +A L ++ I+ L+GF+CHMC + PP+CPH V++ ++ ++N Sbjct: 1463 YISCEICRVWFHAEAFGLSSENIDKLVGFRCHMCRQRNPPVCPHLVVVKTDVSQLAEAQN 1522 Query: 1590 NAKIECTEEDPH 1625 +A ++ +EE P+ Sbjct: 1523 DAGVDFSEEVPN 1534 >gb|EYU35929.1| hypothetical protein MIMGU_mgv1a024138mg, partial [Mimulus guttatus] Length = 936 Score = 522 bits (1344), Expect = e-145 Identities = 277/517 (53%), Positives = 345/517 (66%), Gaps = 4/517 (0%) Frame = +3 Query: 6 TVGSATHIVSSSARASSKHGISRKRAKSSDVS-GPSSNAATGLSLFWWRGGRGSRSLFNW 182 T+GSA I SS R S + R RAK S + + +A GL L WW+G + SR LFN Sbjct: 421 TMGSACLIAKSSRRVSLNNETKRTRAKCSKLEITQTPKSACGLRLLWWKGDKASRELFNC 480 Query: 183 KVLPHSLASKAARQGGRRKIPGILYPDSGEYAKRTKYDSWRAAVETSRSVEQLALQVREL 362 KVLP SLASKAARQGG +KI G+ YP+SG+ AKRT+Y +WRAAVETS+SVE+LALQVREL Sbjct: 481 KVLPRSLASKAARQGGFKKISGVQYPESGDTAKRTRYTAWRAAVETSKSVEKLALQVREL 540 Query: 363 
DANIRWDDIGNNKLLSMIEMDSKKPVRSFKKVVIRRKCSEGATVRYLLDFGKRRFIPDIV 542 DA+IRW DIGN + S + +SKKP++SFKKV+IR+K EG VRYLLDFG++R IPDI Sbjct: 541 DAHIRWGDIGNKQFPSKQDKESKKPIKSFKKVIIRKKSCEGEIVRYLLDFGRKRCIPDIA 600 Query: 543 VRYGSKLEDFSNEKKRYWLEETHVPLYLLKAFEEKRIARKSNPMKSGKLCESSQIIKKPL 722 V++GS ED S+E K+YWLE++HVPL+L+KAFEEK+IARKS+ SG+ ESS+ + KPL Sbjct: 601 VKHGSLHEDSSSESKQYWLEDSHVPLHLIKAFEEKKIARKSSKTISGEHNESSKTVVKPL 660 Query: 723 KEKGFSYLFSRAERLEYDQCEQCKKDVLIREAVSCQRCKGFFHKWHVRKSAGSITRQRKY 902 ++KG YLF RAERL+ QC CK+DV IREAVSCQ CKGFFHK H ++S GS T + Y Sbjct: 661 RKKGLEYLFERAERLQNHQCGHCKEDVNIREAVSCQYCKGFFHKIHAQESGGSSTAESTY 720 Query: 903 TCQKCQDGKFVKTDARKGKPELPKLRNASKSVXXXXXXXXXXXXXXXXQVNLTNKKRVPF 1082 TC +CQD K V+ DA KGK ELPK + K+ V+L P Sbjct: 721 TCHECQDRKVVQVDAGKGKTELPKRKKKMKAPKPLDSKKGKAVSKEEHPVDLKTIPEDPV 780 Query: 1083 VVPL-RRSARNAERVAKVTVQNSXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRMPVTSSY 1259 V P RRS RNAER++K+ Q+ +R PV SSY Sbjct: 781 VAPAARRSVRNAERISKLVQQSRKIKKRKRNKRKKDLLKKISKRIRRK---KRTPVNSSY 837 Query: 1260 WLNGLRLSRRPGDERHL--RNRKLLVLSGEVDCILNKTICSLCREVEHKSELNYVGCEIC 1433 WLNGL LSRR D+R + R++KLL+LSGE ++ CSLC E+E+ E+NYV CEIC Sbjct: 838 WLNGLHLSRRTNDDRLMDFRSKKLLLLSGEAIPDSDEPKCSLCSELEYTPEMNYVACEIC 897 Query: 1434 GDWFHGDALNLGADKIENLIGFKCHMCLNKTPPICPH 1544 WFHGDAL L ADKI ++ GFKCH CL K P ICP+ Sbjct: 898 RVWFHGDALGLTADKINHIFGFKCHNCLEKRPLICPN 934 >ref|XP_007015163.1| DNA binding,zinc ion binding,DNA binding, putative isoform 3 [Theobroma cacao] gi|590584387|ref|XP_007015164.1| DNA binding,zinc ion binding,DNA binding, putative isoform 3 [Theobroma cacao] gi|508785526|gb|EOY32782.1| DNA binding,zinc ion binding,DNA binding, putative isoform 3 [Theobroma cacao] gi|508785527|gb|EOY32783.1| DNA binding,zinc ion binding,DNA binding, putative isoform 3 [Theobroma cacao] Length = 1859 Score = 517 bits (1331), Expect = e-143 Identities = 302/679 (44%), Positives = 405/679 (59%), Gaps = 49/679 (7%) Frame = +3 Query: 6 TVGSATHIVSSSARASSKHGISRKRAKSSD-VSGPSSNAATGLSLFWWRGGRGSRSLFNW 182 
T+GSA+H+V++S+RAS+KHGI+RKR +S+D S P+SN A G S+ WWRGGR SR LFNW Sbjct: 1127 TMGSASHVVTASSRASAKHGIARKRGRSNDGESNPTSNPAAGPSICWWRGGRVSRQLFNW 1186 Query: 183 KVLPHSLASKAARQGGRRKIPGILYPDSGEYAKRTKYDSWRAAVETSRSVEQLALQVREL 362 KVLP SLASKAARQGG +KIPGILYP+S ++A+R+K +WRAAVE+S S+EQLALQVREL Sbjct: 1187 KVLPRSLASKAARQGGGKKIPGILYPESSDFARRSKSMAWRAAVESSTSIEQLALQVREL 1246 Query: 363 DANIRWDDIGNNKLLSMIEMDSKKPVRSFKKVVIRRKCSEGATVRYLLDFGKRRFIPDIV 542 D+NIRWDDI N L +++ D KK +R FKK V+RRK EG V+YLLDFGKRR IPD+V Sbjct: 1247 DSNIRWDDIENTHALPILDKDFKKSIRLFKKCVVRRKSIEGDGVKYLLDFGKRRIIPDVV 1306 Query: 543 VRYGSKLEDFSNEKKRYWLEETHVPLYLLKAFEEKRIARKSNPMKSGKLCESSQIIKKPL 722 +R+G+ +E+ S+E+K+YWL E++VPL+LLK+FEEKRIARKS+ M SGK E + K Sbjct: 1307 MRHGTAVEESSSERKKYWLNESYVPLHLLKSFEEKRIARKSSKMISGKSSEIIRDAKNSS 1366 Query: 723 KEKGFSYLFSRAERLEYDQCEQCKKDVLIREAVSCQRCKGFFHKWHVRKSAGSITRQRKY 902 K++GFSYLFS+AER EY QC C KDVLIREAV C CKGFFHK HVRKSAG+I + Y Sbjct: 1367 KKRGFSYLFSKAERSEYYQCGHCNKDVLIREAVRCHICKGFFHKRHVRKSAGAIIAECTY 1426 Query: 903 TCQKCQDGKF------VKTDARKGK-------------PELP-KLRNASKSVXXXXXXXX 1022 TC +CQDGK +DA++GK +LP K + AS + Sbjct: 1427 TCHRCQDGKSNVNAKRGGSDAKRGKGDTKGGKTNTKSAKKLPQKSKKASTNCKSMRSKDN 1486 Query: 1023 XXXXXXXXQVNLTNKKRVPFVVPLRRSARNAERVAKVTVQNSXXXXXXXXXXXXXXXXXX 1202 + K+V VPLRRS R ++ ++VQ Sbjct: 1487 KKSIAIRMSLRSQKDKKVTAGVPLRRSPR---KIKYISVQKKKPGRCKKSKQKSKKKAPK 1543 Query: 1203 XXXXXXXXXXQRMPVTSSYWLNGLRLSRRPGDERHLR-NRKLLVLSGE-VDCILNKTICS 1376 +R SYWLNGLRLS +P DER ++ RK+L E ++ LN+ C Sbjct: 1544 KTKICTSWQKKRTRAYHSYWLNGLRLSSKPDDERVMQFQRKMLFAPSEHMNVSLNQPKCL 1603 Query: 1377 LCREVEHKSELNYVGCEICGDWFHGDALNLGADKIENLIGFKCHMCLNKTPPICPHHCPV 1556 LC E + S NYV CEIC +WFHGDA L ++ +IGF+CH+C +TPP+CP+ Sbjct: 1604 LCCEAGYASSSNYVACEICEEWFHGDAYGLNSENKSKIIGFRCHVCCKRTPPVCPNMVAT 1663 Query: 1557 ESSNPEIFS-ENNAKIECTEEDPHCLASP---------------------DNELAHQKSH 1670 ++ +N+ + E +EE SP D+E H++ Sbjct: 1664 RIDGSQLAEMQNSVRTESSEELHGAFPSPCHVNLKTESPSSETRQGLLADDDECFHKEEQ 1723 Query: 1671 
FNDKSADTLSAVNMEKQLQVSVGELDLKDEDLEMAEKIFLGNDPIELGGKKGDVSSTLET 1850 S +T +E +L+ S G L K + ++ + + N+ ++ D STLE Sbjct: 1724 LG-TSLETSQGPILEYKLE-SNGTLLDKKQGIDAQQ---ISNNELKPNTLTSDEKSTLE- 1777 Query: 1851 ESTIQNSDM----VREAEC 1895 ES I + + V +AEC Sbjct: 1778 ESRINSGHITATAVDKAEC 1796 >ref|XP_007015161.1| DNA binding,zinc ion binding,DNA binding, putative isoform 1 [Theobroma cacao] gi|508785524|gb|EOY32780.1| DNA binding,zinc ion binding,DNA binding, putative isoform 1 [Theobroma cacao] Length = 1931 Score = 517 bits (1331), Expect = e-143 Identities = 302/679 (44%), Positives = 405/679 (59%), Gaps = 49/679 (7%) Frame = +3 Query: 6 TVGSATHIVSSSARASSKHGISRKRAKSSD-VSGPSSNAATGLSLFWWRGGRGSRSLFNW 182 T+GSA+H+V++S+RAS+KHGI+RKR +S+D S P+SN A G S+ WWRGGR SR LFNW Sbjct: 1127 TMGSASHVVTASSRASAKHGIARKRGRSNDGESNPTSNPAAGPSICWWRGGRVSRQLFNW 1186 Query: 183 KVLPHSLASKAARQGGRRKIPGILYPDSGEYAKRTKYDSWRAAVETSRSVEQLALQVREL 362 KVLP SLASKAARQGG +KIPGILYP+S ++A+R+K +WRAAVE+S S+EQLALQVREL Sbjct: 1187 KVLPRSLASKAARQGGGKKIPGILYPESSDFARRSKSMAWRAAVESSTSIEQLALQVREL 1246 Query: 363 DANIRWDDIGNNKLLSMIEMDSKKPVRSFKKVVIRRKCSEGATVRYLLDFGKRRFIPDIV 542 D+NIRWDDI N L +++ D KK +R FKK V+RRK EG V+YLLDFGKRR IPD+V Sbjct: 1247 DSNIRWDDIENTHALPILDKDFKKSIRLFKKCVVRRKSIEGDGVKYLLDFGKRRIIPDVV 1306 Query: 543 VRYGSKLEDFSNEKKRYWLEETHVPLYLLKAFEEKRIARKSNPMKSGKLCESSQIIKKPL 722 +R+G+ +E+ S+E+K+YWL E++VPL+LLK+FEEKRIARKS+ M SGK E + K Sbjct: 1307 MRHGTAVEESSSERKKYWLNESYVPLHLLKSFEEKRIARKSSKMISGKSSEIIRDAKNSS 1366 Query: 723 KEKGFSYLFSRAERLEYDQCEQCKKDVLIREAVSCQRCKGFFHKWHVRKSAGSITRQRKY 902 K++GFSYLFS+AER EY QC C KDVLIREAV C CKGFFHK HVRKSAG+I + Y Sbjct: 1367 KKRGFSYLFSKAERSEYYQCGHCNKDVLIREAVRCHICKGFFHKRHVRKSAGAIIAECTY 1426 Query: 903 TCQKCQDGKF------VKTDARKGK-------------PELP-KLRNASKSVXXXXXXXX 1022 TC +CQDGK +DA++GK +LP K + AS + Sbjct: 1427 TCHRCQDGKSNVNAKRGGSDAKRGKGDTKGGKTNTKSAKKLPQKSKKASTNCKSMRSKDN 1486 Query: 1023 XXXXXXXXQVNLTNKKRVPFVVPLRRSARNAERVAKVTVQNSXXXXXXXXXXXXXXXXXX 1202 + K+V VPLRRS R ++ ++VQ Sbjct: 1487 
KKSIAIRMSLRSQKDKKVTAGVPLRRSPR---KIKYISVQKKKPGRCKKSKQKSKKKAPK 1543 Query: 1203 XXXXXXXXXXQRMPVTSSYWLNGLRLSRRPGDERHLR-NRKLLVLSGE-VDCILNKTICS 1376 +R SYWLNGLRLS +P DER ++ RK+L E ++ LN+ C Sbjct: 1544 KTKICTSWQKKRTRAYHSYWLNGLRLSSKPDDERVMQFQRKMLFAPSEHMNVSLNQPKCL 1603 Query: 1377 LCREVEHKSELNYVGCEICGDWFHGDALNLGADKIENLIGFKCHMCLNKTPPICPHHCPV 1556 LC E + S NYV CEIC +WFHGDA L ++ +IGF+CH+C +TPP+CP+ Sbjct: 1604 LCCEAGYASSSNYVACEICEEWFHGDAYGLNSENKSKIIGFRCHVCCKRTPPVCPNMVAT 1663 Query: 1557 ESSNPEIFS-ENNAKIECTEEDPHCLASP---------------------DNELAHQKSH 1670 ++ +N+ + E +EE SP D+E H++ Sbjct: 1664 RIDGSQLAEMQNSVRTESSEELHGAFPSPCHVNLKTESPSSETRQGLLADDDECFHKEEQ 1723 Query: 1671 FNDKSADTLSAVNMEKQLQVSVGELDLKDEDLEMAEKIFLGNDPIELGGKKGDVSSTLET 1850 S +T +E +L+ S G L K + ++ + + N+ ++ D STLE Sbjct: 1724 LG-TSLETSQGPILEYKLE-SNGTLLDKKQGIDAQQ---ISNNELKPNTLTSDEKSTLE- 1777 Query: 1851 ESTIQNSDM----VREAEC 1895 ES I + + V +AEC Sbjct: 1778 ESRINSGHITATAVDKAEC 1796 >ref|XP_004291756.1| PREDICTED: uncharacterized protein LOC101311539 [Fragaria vesca subsp. 
vesca] Length = 1773 Score = 516 bits (1330), Expect = e-143 Identities = 279/584 (47%), Positives = 374/584 (64%), Gaps = 27/584 (4%) Frame = +3 Query: 6 TVGSATHIVSSSARASSKHGISRKRAKSSDV-SGPSSNAATGLSLFWWRGGRGSRSLFNW 182 T+GSA+H+V+S RA SK+ SRKR K SD+ S PSSNA +GL +FWWRGGR SR +F+W Sbjct: 1175 TMGSASHVVTS-LRACSKNMNSRKRPKFSDIDSNPSSNAGSGLGMFWWRGGRLSRQVFSW 1233 Query: 183 KVLPHSLASKAARQGGRRKIPGILYPDSGEYAKRTKYDSWRAAVETSRSVEQLALQVREL 362 K+LP SL SKAARQGG KI GILYP++ EYAKR+KY +WRA VETS S E LALQVREL Sbjct: 1234 KILPRSLTSKAARQGGCTKIMGILYPENSEYAKRSKYIAWRATVETSTSAEHLALQVREL 1293 Query: 363 DANIRWDDIGNNKLLSMIEMDSKKPVRSFKKVVIRRKCSEGATVRYLLDFGKRRFIPDIV 542 +NIRWDDI N L +++ +S K ++ F+KV++RRKCSE V+YLLDFGKRR IPDI+ Sbjct: 1294 YSNIRWDDIENTHPLPILDKESTKSLKLFRKVIVRRKCSEKEAVKYLLDFGKRRAIPDII 1353 Query: 543 VRYGSKLEDFSNEKKRYWLEETHVPLYLLKAFEEKRIARKSNPMKSGKLCESSQIIKKPL 722 ++GS LE+ S+EKK+YWLEE+++PL+LLK FEEKRIARKS+ KSGK ++IK+P Sbjct: 1354 RKHGSVLEEPSSEKKKYWLEESYLPLHLLKNFEEKRIARKSSDGKSGKAIADGKVIKRPQ 1413 Query: 723 KEKGFSYLFSRAERLEYDQCEQCKKDVLIREAVSCQRCKGFFHKWHVRKSAGSITRQRKY 902 EKGF+YLF++AER EY +C C KDVLIREAVSCQ C+GFFHK H +KSAG+I + Y Sbjct: 1414 DEKGFAYLFAKAERSEYYKCGHCHKDVLIREAVSCQFCRGFFHKRHAKKSAGAIVSECTY 1473 Query: 903 TCQKCQDGKFVKTDARKG---------------------KPELPKLRNASKSVXXXXXXX 1019 TC +CQ+G K D ++G K + KL+++ Sbjct: 1474 TCHRCQNGVSSKIDTKRGKVDKKRGKVGRKRGPVETKLVKVQSQKLKSSQTDRRSLRLKS 1533 Query: 1020 XXXXXXXXXQVNLTNKKRVPFVVPLRRSARNAERVAKVTVQN-SXXXXXXXXXXXXXXXX 1196 QV L N K+VP V LRRS R + +T+QN Sbjct: 1534 KRKPLAGGRQVQLKNTKKVP-VTLLRRSPR---KTKSLTLQNKKQSKRKKGKQSKSKKGT 1589 Query: 1197 XXXXXXXXXXXXQRMPVTSSYWLNGLRLSRRPGDERHL--RNRKLLVLSGEVDCILNKTI 1370 +R V SYWLNGL+ SR+P DER + R++KLL SG IL++ Sbjct: 1590 YKKQKIGTSWQKKRTKVYRSYWLNGLQFSRKPDDERVVLFRDKKLLANSGCSSNILSQLK 1649 Query: 1371 CSLCREVEHKSELNYVGCEICGDWFHGDALNLGADKIENLIGFKCHMCLNKTPPICPHHC 1550 C LC E E+ S L+Y+GCE+CG+WFHG+A L ++ I LIGF+CH+C PP+CPH Sbjct: 1650 CQLCCESEYASTLDYIGCELCGEWFHGEAFGLASENIHKLIGFRCHVCRKTEPPLCPHLV 1709 Query: 1551 
PVESSNPEI-FSENNAKIECTEEDPHCLAS-PDNELAHQKSHFN 1676 V++ ++ ++N+ + C+E+ P+ + + + H++S N Sbjct: 1710 VVKTDVSQLPEAQNDGSVNCSEDVPNAVPTLSEITGGHRRSSLN 1753 >ref|XP_002274937.2| PREDICTED: uncharacterized protein LOC100260139 [Vitis vinifera] Length = 1976 Score = 514 bits (1325), Expect = e-143 Identities = 290/586 (49%), Positives = 351/586 (59%), Gaps = 73/586 (12%) Frame = +3 Query: 6 TVGSATHIVSSSARASSKHGISRKRAKSSD-VSGPSSNAATGLSLFWWRGGRGSRSLFNW 182 T+GSA+HIV SS RASSK G+ +KR + S VS PSSNAATGLSLFWWRGGR SR LFNW Sbjct: 1090 TMGSASHIVISS-RASSKLGVGKKRTRCSGFVSKPSSNAATGLSLFWWRGGRLSRKLFNW 1148 Query: 183 KVLPHSLASKAARQGGRRKIPGILYPDSGEYAKRTKYDSWRAAVETSRSVEQLALQVREL 362 KVLP SLASKAARQ G KIPGILYP+S E+AKR KY WR+AVETS SVEQLAL VREL Sbjct: 1149 KVLPRSLASKAARQAGCTKIPGILYPESSEFAKRNKYVVWRSAVETSTSVEQLALLVREL 1208 Query: 363 DANIRWDDIGNNKLLSMIEMDSKKPVRSFKKVVIRRKCSEGATVRYLLDFGKRRFIPDIV 542 D NIRWDDI N L ++ +++K +R F+KV+IRRKC EG +YLLDFGKR+ IPD+V Sbjct: 1209 DLNIRWDDIENTHPLFKLDKEARKSIRPFRKVIIRRKCIEGTISKYLLDFGKRKIIPDVV 1268 Query: 543 VRYGSKLEDFSNEKKRYWLEETHVPLYLLKAFEEKRIARKSNPMKSGKLCESSQIIKKPL 722 V++GS LE+ S+E+K+YWL+E+HVPL+LLKAFEEKRIARKS+ + SGKL E + +KKP Sbjct: 1269 VKHGSILEESSSERKKYWLDESHVPLHLLKAFEEKRIARKSSNINSGKLNEGGREMKKPS 1328 Query: 723 KEKGFSYLFSRAERLEYDQCEQCKKDVLIREAVSCQRCK--------------------- 839 K+KGFSYLF +AER E QC CKKDVL REAVSCQ CK Sbjct: 1329 KDKGFSYLFLKAERSENYQCGHCKKDVLTREAVSCQYCKGNLIFNKPYLFVPCYFIYGFE 1388 Query: 840 -------GFFHKWHVRKSAGSITRQRKYTCQKCQDGKFVKTDARKGKPELPKLRNASKSV 998 G+FHK HVRKSAGSI+ + YTC KCQDGK +K +A+ G + K + S + Sbjct: 1389 VTLVIMPGYFHKRHVRKSAGSISAECTYTCHKCQDGKPMKINAKIGNVQSQKGKKGSTDL 1448 Query: 999 XXXXXXXXXXXXXXXXQ--------------------------VNLTNKKRVPFVVPLRR 1100 + V K+ V VVPLRR Sbjct: 1449 YKKKGKAYKNCRLLGSKSGKKIFTKEQPVRSCKGRKPSTGKRPVRSLVKREVSTVVPLRR 1508 Query: 1101 SARNAERVAKVTVQN----------------SXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1232 SAR ++ V++QN Sbjct: 1509 SAR---KIKFVSLQNKNLEEQDKGKQEKGKQEKGKQVKSMKSKKRTPKKPKKETSWKKKK 1565 Query: 1233 
QRMPVTSSYWLNGLRLSRRPGDER--HLRNRKLLVLSGEVDCILNKTICSLCREVEHKSE 1406 +R V SYWLNGL LSR P D+R R +L V S ++ +++K C LC E H Sbjct: 1566 RRTLVCYSYWLNGLLLSRMPNDDRVMQFRRERLFVPSEHLNVVIDKPTCHLCAEAGHTPM 1625 Query: 1407 LNYVGCEICGDWFHGDALNLGADKIENLIGFKCHMCLNKTPPICPH 1544 LNY+ CEICGDWFHGDA L + I NLIGF+CH C +TPP CPH Sbjct: 1626 LNYINCEICGDWFHGDAFGLDVETIGNLIGFRCHECCKRTPPACPH 1671 >gb|EXC04604.1| Nucleosome-remodeling factor subunit BPTF [Morus notabilis] Length = 1761 Score = 511 bits (1316), Expect = e-142 Identities = 270/550 (49%), Positives = 356/550 (64%), Gaps = 13/550 (2%) Frame = +3 Query: 6 TVGSATHIVSSSARASSKHGISRKRAKSSDVSGPSSNAATGLSLFWWRGGRGSRSLFNWK 185 +VGSA+HIV+SSAR S K+ I RKR + SGP+ N A+GL +FWWRGGR SR +FNWK Sbjct: 1137 SVGSASHIVTSSARGSLKNVIGRKRPITE--SGPTLNTASGLGIFWWRGGRLSRKVFNWK 1194 Query: 186 VLPHSLASKAARQGGRRKIPGILYPDSGEYAKRTKYDSWRAAVETSRSVEQLALQVRELD 365 VLP SL SKAARQGG KIPGILYP++ EYAKR+KY +W+AAVETS S EQLA QVRELD Sbjct: 1195 VLPCSLVSKAARQGGCTKIPGILYPENSEYAKRSKYVAWQAAVETSTSAEQLAFQVRELD 1254 Query: 366 ANIRWDDIGNNKLLSMIEMDSKKPVRSFKKVVIRRKCSEGATVRYLLDFGKRRFIPDIVV 545 ++I+WDDI N L +++ +S+K +R FKKV++RRK +G V+YLLDFGKRR IPD+V Sbjct: 1255 SHIKWDDIENTHPLPVLDKESRKSIRLFKKVIVRRKSVQGGLVKYLLDFGKRRAIPDVVS 1314 Query: 546 RYGSKLEDFSNEKKRYWLEETHVPLYLLKAFEEKRIARKSNPMKSGKLCESSQIIKKPLK 725 ++GS +E+ S+E+K+YWL+E+++PL+LLK FEEKRIARKS KSGK + ++K+P + Sbjct: 1315 KHGSMVEESSSERKKYWLDESYLPLHLLKNFEEKRIARKSTDNKSGKSVDYGSVMKRPQQ 1374 Query: 726 EKGFSYLFSRAERLEYDQCEQCKKDVLIREAVSCQRCKGFFHKWHVRKSAGSITRQRKYT 905 +KGF+YLFS+AER EY QC C KDVLIREAVSCQ CKGFFHK HV+KSAG+I + YT Sbjct: 1375 KKGFAYLFSKAERSEYYQCGHCNKDVLIREAVSCQHCKGFFHKRHVKKSAGAIIAECTYT 1434 Query: 906 CQKCQDGKFVKTDARKGKPELPKLRNASKSVXXXXXXXXXXXXXXXXQVNLT-------- 1061 C +CQ+G K D +KGK SK +V+ Sbjct: 1435 CHRCQNGVRAKIDTKKGKTAKKGGNVKSKQSKNIQTDRRSSQLKSNKKVSTVGQKGQSKK 1494 Query: 1062 NKKRVPFVVPLRRSARNAERVAKVT-VQN-SXXXXXXXXXXXXXXXXXXXXXXXXXXXXQ 1235 N K +P VPLRRS R A+ ++ +QN + Sbjct: 1495 NSKAIP-AVPLRRSTRKAKCLSLPNKLQNKKHRGRKKGKQVKAKKATQEKTKKGTSCRKK 1553 
Query: 1236 RMPVTSSYWLNGLRLSRRPGDERHL--RNRKLLVLSGEVDCILNKTICSLCREVEHKSEL 1409 R V+ SYWLNGL LSR+P DER + R++ L + N+ C LC E +KS L Sbjct: 1554 RTAVSHSYWLNGLLLSRKPNDERVVLFRDKSFLAPPEQSSDTPNQPKCQLCDEAGYKSTL 1613 Query: 1410 NYVGCEICGDWFHGDALNLGADKIENLIGFKCHMCLNKTPPICPHHCPVESSNPEIFS-E 1586 NYV CE C +WFH DA+ + + I+ +IGF+CH C +TPP+C H ++S ++ + Sbjct: 1614 NYVACETCREWFHADAIGIHPENIDIVIGFRCHTCCERTPPVCLHSVTMQSDVSQLAEVQ 1673 Query: 1587 NNAKIECTEE 1616 N A ++CTEE Sbjct: 1674 NTAAVDCTEE 1683 >ref|XP_002513535.1| hypothetical protein RCOM_1578820 [Ricinus communis] gi|223547443|gb|EEF48938.1| hypothetical protein RCOM_1578820 [Ricinus communis] Length = 1915 Score = 510 bits (1314), Expect = e-142 Identities = 272/584 (46%), Positives = 372/584 (63%), Gaps = 17/584 (2%) Frame = +3 Query: 3 PTVGSATHIVSSSARASSKHGISRKRAKSSDV-SGPSSNAATGLSLFWWRGGRGSRSLFN 179 P +GSA+HIV +S RASSK+GIS+KRA+ S+ S PSSN+++GLS+ WWRGGR SR LF+ Sbjct: 1196 PRMGSASHIVMASLRASSKNGISKKRARFSEFDSNPSSNSSSGLSMLWWRGGRLSRQLFS 1255 Query: 180 WKVLPHSLASKAARQGGRRKIPGILYPDSGEYAKRTKYDSWRAAVETSRSVEQLALQVRE 359 WKVLPHSLASK ARQ G KI G+LYP++ ++AKR+KY +WRAAVE+S +VEQ+ALQVRE Sbjct: 1256 WKVLPHSLASKGARQAGCMKISGMLYPENSDFAKRSKYIAWRAAVESSNTVEQIALQVRE 1315 Query: 360 LDANIRWDDIGNNKLLSMIEMDSKKPVRSFKKVVIRRKCSEGATVRYLLDFGKRRFIPDI 539 LD+NIRWD+IGN L M++ +S+K +R FKKV+IRRK E +YLLDFGKR+ IP+I Sbjct: 1316 LDSNIRWDEIGNRNPLLMMDKESRKSIRLFKKVIIRRKSMELEGAKYLLDFGKRKCIPEI 1375 Query: 540 VVRYGSKLEDFSNEKKRYWLEETHVPLYLLKAFEEKRIARKSNPMKSGKLCESSQIIKKP 719 V + GS +E+ S+E+K+YWL E++VPLYLLK+FE+KRIAR+S+ M SGKL ++S +KKP Sbjct: 1376 VSKNGSIVEESSSERKKYWLNESYVPLYLLKSFEQKRIARRSSKMTSGKLSDASVSMKKP 1435 Query: 720 LKEKGFSYLFSRAERLEYDQCEQCKKDVLIREAVSCQRCKGFFHKWHVRKSAGSITRQRK 899 LK++GFSYLF++AER E+ QC C KDV +REAV CQ CKGFFHK HVRKSAGS++ + K Sbjct: 1436 LKKRGFSYLFAKAERPEHHQCGHCNKDVPVREAVCCQYCKGFFHKRHVRKSAGSMSAECK 1495 Query: 900 YTCQKCQDGKFVKTDARKGKPELPKLRNASKSVXXXXXXXXXXXXXXXXQVNLTNK---- 1067 YTC +C GK++K D++ GK + + +N ++S +K Sbjct: 1496 
YTCHRCVAGKYMKMDSKTGKNDEKRGKNKNRSTKTHNQKSKKTTVGSSSVHPKNSKKTLR 1555 Query: 1068 ----------KRVPFVVPLRRSARNAERVAKVTVQN-SXXXXXXXXXXXXXXXXXXXXXX 1214 K+ VVPLRRS R A+ ++QN Sbjct: 1556 SSRLLRSQKNKKATVVVPLRRSPRKAK---LNSLQNKKSRGRKKGKQAKPKKTTGKKPTK 1612 Query: 1215 XXXXXXQRMPVTSSYWLNGLRLSRRPGDERHLRNRKLLVLSGEVDCILNKTICSLCREVE 1394 +R ++WLNGL L+R+P DER + R+ L+ I ++ C LC E Sbjct: 1613 VTSWRKKRTQAYHNFWLNGLFLTRKPDDERVMHFRRKRFLAPSESAIHDQPKCHLCSEAG 1672 Query: 1395 HKSELNYVGCEICGDWFHGDALNLGADKIENLIGFKCHMCLNKTPPICPHHCPVESSNPE 1574 + S L+Y+ CEICG+W+HG A L A+ LIGF+CHMC N PP+CP + + Sbjct: 1673 NTSTLSYISCEICGEWYHGAAFGLDAENSNKLIGFRCHMCRNCKPPVCPFVAVTRNHESQ 1732 Query: 1575 IFS-ENNAKIECTEEDPHCLASPDNELAHQKSHFNDKSADTLSA 1703 + S EN+ + E + E + + P Q S N+ +L A Sbjct: 1733 MASAENDVENELSIEGTNLVEHPTETNLFQDSLLNEDHRGSLPA 1776 >ref|XP_002313363.2| hypothetical protein POPTR_0009s05370g [Populus trichocarpa] gi|550331079|gb|EEE87318.2| hypothetical protein POPTR_0009s05370g [Populus trichocarpa] Length = 1934 Score = 506 bits (1304), Expect = e-140 Identities = 299/681 (43%), Positives = 399/681 (58%), Gaps = 37/681 (5%) Frame = +3 Query: 6 TVGSATHIVSSSARASSKHGISRKRAKSSDV-SGPSSNAATGLSLFWWRGGRGSRSLFNW 182 T+GS++H V++S+RAS K+GI RKR +S++ S P +N A+GL +FWWRGGR SR LF+W Sbjct: 1225 TMGSSSHFVTASSRASLKNGIGRKRVRSTECQSNPCANPASGLGMFWWRGGRLSRRLFSW 1284 Query: 183 KVLPHSLASKAARQGGRRKIPGILYPDSGEYAKRTKYDSWRAAVETSRSVEQLALQVREL 362 KVLP SL SKAARQ G KI GILYP++ ++AKR+K+ +W+AAVE+S +VEQLALQVRE Sbjct: 1285 KVLPCSLTSKAARQAGCMKIAGILYPENSDFAKRSKHVTWQAAVESSVTVEQLALQVREF 1344 Query: 363 DANIRWDDIGNNKLLSMIEMDSKKPVRSFKKVVIRRKCSEGATVRYLLDFGKRRFIPDIV 542 D+NIRWD+I N LSM++ + +K R FKKV+IRRKC E T +YLLDFGKRR IP+IV Sbjct: 1345 DSNIRWDEIQNTHPLSMLDKELRKSFRLFKKVIIRRKCVEEGT-KYLLDFGKRRSIPEIV 1403 Query: 543 VRYGSKLEDFSNEKKRYWLEETHVPLYLLKAFEEKRIARKSNPMKSGKLCESSQIIKKPL 722 ++ GS +E+ S+E+K+YWL E++VP YLLK+FEE++IAR+S+ M SGKL E+S ++KKPL Sbjct: 1404 LKNGSMIEESSSERKKYWLNESYVPFYLLKSFEERKIARRSSKMNSGKLSEASVLVKKPL 1463 Query: 723 
KEKGFSYLFSRAERLEYDQCEQCKKDVLIREAVSCQRCKGFFHKWHVRKSAGSITRQRKY 902 K++GFSYLF+RAER EY QC C KDV IREAV CQ CKGFFHK HVRKSAG+IT + Y Sbjct: 1464 KQRGFSYLFARAERSEYHQCGHCHKDVPIREAVCCQNCKGFFHKRHVRKSAGAITAKCIY 1523 Query: 903 TCQKCQDG---KFVKTDARKGKPELPKLRNASKS------------VXXXXXXXXXXXXX 1037 TC +C G K VKT+A+ K + + +N+ KS V Sbjct: 1524 TCHRCHYGKNAKTVKTNAKTVKTDTKRRKNSIKSTKVQEQKSKKATVVRNSVRLKNSKKA 1583 Query: 1038 XXXQVNLTNKKRVPFVVPLRRSARNAERVAKVTVQNSXXXXXXXXXXXXXXXXXXXXXXX 1217 L ++ R VVPLR SAR A++ K Sbjct: 1584 LRGSQPLQSRNRKVTVVPLRCSARKAKQ--KALQNKKVVGRKRGRPAKSKKGANKKPKRG 1641 Query: 1218 XXXXXQRMPVTSSYWLNGLRLSRRPGDERHLRNRKLLVLSGEVDCILNKTICSLCREVEH 1397 +R SYW NGL LSR DER R+ +++ I ++ C LC E + Sbjct: 1642 TLLHKKRTDTCHSYWRNGLLLSRNSDDERVTHFREKSLIAPSESAIDDQPKCHLCCEAGY 1701 Query: 1398 KSELNYVGCEICGDWFHGDALNLGADKIENLIGFKCHMCLNKTPPICPHHCPVESSNPEI 1577 S NY+ CEICG+WFHGDA L A+ I LIGF+CHMCL KTPPICPH Sbjct: 1702 TSISNYISCEICGEWFHGDAFGLDAENINKLIGFRCHMCLKKTPPICPHAATTSHEVEIA 1761 Query: 1578 FSENNAKIECTEEDPHCLASPDNELAHQKSHFNDKSADTLSAVNMEKQL-------QVSV 1736 +N+ E +E+ D L ++ H S +V++E QL Q V Sbjct: 1762 EVQNDVGTELPKEE------TDGTLHQEEDH--PGSLLVSESVHVEGQLGTALDSNQSFV 1813 Query: 1737 GELDLKDEDLEMAEKIFLGNDPIEL--GGKKGDV-----SSTLETESTIQN-------SD 1874 E L+ E+ + D I+ K D+ S L E+TI++ SD Sbjct: 1814 SESKLEAENGHALANVIENTDAIQTLHENLKPDLLTSPNESHLVEENTIKSGDDGIVTSD 1873 Query: 1875 MVREAECPTIIHDLVENGMAN 1937 + + DL+E G+A+ Sbjct: 1874 DAAQLSSCKVGVDLIETGLAS 1894 >ref|XP_006590775.1| PREDICTED: uncharacterized protein LOC100800973 isoform X2 [Glycine max] Length = 1738 Score = 503 bits (1296), Expect = e-139 Identities = 272/571 (47%), Positives = 359/571 (62%), Gaps = 34/571 (5%) Frame = +3 Query: 6 TVGSATHIVSSSARASSKHGISRKRAKSSDV-SGPSSNAATGLSLFWWRGGRGSRSLFNW 182 T+GSATHIV SS+R SS+HGI RKRA+++D+ + SSN A+GL ++WWRGGR SR LFN Sbjct: 1161 TMGSATHIVVSSSRTSSRHGIGRKRARNTDIETSSSSNTASGLGMYWWRGGRLSRKLFNC 1220 Query: 183 KVLPHSLASKAARQGGRRKIPGILYPDSGEYAKRTKYDSWRAAVETSRSVEQLALQVREL 362 K LPHSL +KAARQGG RKIPGILYP++ ++A+R+++ 
+WRAAVE S S EQLALQVREL Sbjct: 1221 KALPHSLVTKAARQGGCRKIPGILYPENSDFARRSRFVAWRAAVEMSTSAEQLALQVREL 1280 Query: 363 DANIRWDDIGNNKLLSMIEMDSKKPVRSFKKVVIRRKCSEGATVRYLLDFGKRRFIPDIV 542 +NIRW DI NN L +++ +S+K VR FKK +IRRKC+EG +V+YL+DFGKRR IPD+V Sbjct: 1281 YSNIRWHDIENNHSLYVLDKESRKSVRLFKKSIIRRKCTEGQSVKYLIDFGKRRAIPDVV 1340 Query: 543 VRYGSKLEDFSNEKKRYWLEETHVPLYLLKAFEEKRIARKSNPMKSGKLCESSQIIKKPL 722 ++ GS LE S+E+K+YWLEET+VPL+LLK FEEKRI RKS K GK+ E ++ KK Sbjct: 1341 IKQGSLLEQSSSERKKYWLEETYVPLHLLKNFEEKRIVRKSTDKKLGKILEIGRVNKKIP 1400 Query: 723 KEKGFSYLFSRAERLEYDQCEQCKKDVLIREAVSCQRCKGFFHKWHVRKSAGSITRQRKY 902 ++KGFSYLF+R ER + QC C KDV +R+AV C CKG+FHK HVRKS+G+ T Y Sbjct: 1401 QQKGFSYLFTRLERSDCHQCGHCNKDVAMRDAVRCLHCKGYFHKRHVRKSSGTRTTGSSY 1460 Query: 903 TCQKCQDGKFVKTDARKGK--PELPKLRNASKSV-------XXXXXXXXXXXXXXXXQVN 1055 +C +CQDG KT+ K K +L K++ + + QV Sbjct: 1461 SCHRCQDGLQAKTNTNKRKVDSKLQKIQAKKRKIVPSVCKSLNLKGNKKASSKNKIRQVR 1520 Query: 1056 LTNKKRVPFVVPLRRSARNAE------------RVAKVTVQNSXXXXXXXXXXXXXXXXX 1199 N K +P +PLRRS R A+ + K T +N Sbjct: 1521 SRNSKNIPSSIPLRRSTRKAKSLYMHSQLNGGHKKGKSTKKN---VGRKKGKQSQTKKVT 1577 Query: 1200 XXXXXXXXXXXQRMPVT----------SSYWLNGLRLSRRPGDERHL--RNRKLLVLSGE 1343 +++PVT +SYWLNGL+LSR+ DER + + +K +V S + Sbjct: 1578 PQKSKETTDQYKKLPVTTAHKKRTRTCNSYWLNGLQLSRKSNDERVMLFKEKKCVVSSED 1637 Query: 1344 VDCILNKTICSLCREVEHKSELNYVGCEICGDWFHGDALNLGADKIENLIGFKCHMCLNK 1523 ++ C LC ++ LNY+ CEICGDWFHGDA L + LIGFKCH+CL++ Sbjct: 1638 FSGSVDYPKCCLC--CGNECTLNYIACEICGDWFHGDAFGLNVENTRQLIGFKCHVCLDR 1695 Query: 1524 TPPICPHHCPVESSNPEIFSENNAKIECTEE 1616 T PICPH N +E+NA IEC EE Sbjct: 1696 TAPICPH----LKINALSRTESNAAIECAEE 1722 >ref|XP_003538947.1| PREDICTED: uncharacterized protein LOC100800973 isoform X1 [Glycine max] Length = 1735 Score = 503 bits (1296), Expect = e-139 Identities = 272/571 (47%), Positives = 359/571 (62%), Gaps = 34/571 (5%) Frame = +3 Query: 6 TVGSATHIVSSSARASSKHGISRKRAKSSDV-SGPSSNAATGLSLFWWRGGRGSRSLFNW 182 T+GSATHIV SS+R SS+HGI RKRA+++D+ + SSN A+GL ++WWRGGR SR LFN Sbjct: 1161 
TMGSATHIVVSSSRTSSRHGIGRKRARNTDIETSSSSNTASGLGMYWWRGGRLSRKLFNC 1220 Query: 183 KVLPHSLASKAARQGGRRKIPGILYPDSGEYAKRTKYDSWRAAVETSRSVEQLALQVREL 362 K LPHSL +KAARQGG RKIPGILYP++ ++A+R+++ +WRAAVE S S EQLALQVREL Sbjct: 1221 KALPHSLVTKAARQGGCRKIPGILYPENSDFARRSRFVAWRAAVEMSTSAEQLALQVREL 1280 Query: 363 DANIRWDDIGNNKLLSMIEMDSKKPVRSFKKVVIRRKCSEGATVRYLLDFGKRRFIPDIV 542 +NIRW DI NN L +++ +S+K VR FKK +IRRKC+EG +V+YL+DFGKRR IPD+V Sbjct: 1281 YSNIRWHDIENNHSLYVLDKESRKSVRLFKKSIIRRKCTEGQSVKYLIDFGKRRAIPDVV 1340 Query: 543 VRYGSKLEDFSNEKKRYWLEETHVPLYLLKAFEEKRIARKSNPMKSGKLCESSQIIKKPL 722 ++ GS LE S+E+K+YWLEET+VPL+LLK FEEKRI RKS K GK+ E ++ KK Sbjct: 1341 IKQGSLLEQSSSERKKYWLEETYVPLHLLKNFEEKRIVRKSTDKKLGKILEIGRVNKKIP 1400 Query: 723 KEKGFSYLFSRAERLEYDQCEQCKKDVLIREAVSCQRCKGFFHKWHVRKSAGSITRQRKY 902 ++KGFSYLF+R ER + QC C KDV +R+AV C CKG+FHK HVRKS+G+ T Y Sbjct: 1401 QQKGFSYLFTRLERSDCHQCGHCNKDVAMRDAVRCLHCKGYFHKRHVRKSSGTRTTGSSY 1460 Query: 903 TCQKCQDGKFVKTDARKGK--PELPKLRNASKSV-------XXXXXXXXXXXXXXXXQVN 1055 +C +CQDG KT+ K K +L K++ + + QV Sbjct: 1461 SCHRCQDGLQAKTNTNKRKVDSKLQKIQAKKRKIVPSVCKSLNLKGNKKASSKNKIRQVR 1520 Query: 1056 LTNKKRVPFVVPLRRSARNAE------------RVAKVTVQNSXXXXXXXXXXXXXXXXX 1199 N K +P +PLRRS R A+ + K T +N Sbjct: 1521 SRNSKNIPSSIPLRRSTRKAKSLYMHSQLNGGHKKGKSTKKN---VGRKKGKQSQTKKVT 1577 Query: 1200 XXXXXXXXXXXQRMPVT----------SSYWLNGLRLSRRPGDERHL--RNRKLLVLSGE 1343 +++PVT +SYWLNGL+LSR+ DER + + +K +V S + Sbjct: 1578 PQKSKETTDQYKKLPVTTAHKKRTRTCNSYWLNGLQLSRKSNDERVMLFKEKKCVVSSED 1637 Query: 1344 VDCILNKTICSLCREVEHKSELNYVGCEICGDWFHGDALNLGADKIENLIGFKCHMCLNK 1523 ++ C LC ++ LNY+ CEICGDWFHGDA L + LIGFKCH+CL++ Sbjct: 1638 FSGSVDYPKCCLC--CGNECTLNYIACEICGDWFHGDAFGLNVENTRQLIGFKCHVCLDR 1695 Query: 1524 TPPICPHHCPVESSNPEIFSENNAKIECTEE 1616 T PICPH N +E+NA IEC EE Sbjct: 1696 TAPICPH----LKINALSRTESNAAIECAEE 1722 >ref|XP_002299794.2| hypothetical protein POPTR_0001s26130g, partial [Populus trichocarpa] gi|550348214|gb|EEE84599.2| hypothetical protein POPTR_0001s26130g, partial [Populus trichocarpa] Length = 
1815 Score = 501 bits (1290), Expect = e-139 Identities = 293/660 (44%), Positives = 394/660 (59%), Gaps = 18/660 (2%) Frame = +3 Query: 6 TVGSATHIVSSSARASSKHGISRKRAKSSDV-SGPSSNAATGLSLFWWRGGRGSRSLFNW 182 T+GS++H+V++S+RASSK+GI RKRA+S++ S P +N+A+GLS+FWWRGGR SR LF+W Sbjct: 1185 TMGSSSHVVTTSSRASSKNGIGRKRARSTEFESKPCANSASGLSMFWWRGGRLSRRLFSW 1244 Query: 183 KVLPHSLASKAARQGGRRKIPGILYPDSGEYAKRTKYDSWRAAVETSRSVEQLALQVREL 362 KVLP SL SKAARQ G KIPGILYP++ ++AKR+K+ +W+AAV +S + EQLALQVRE Sbjct: 1245 KVLPCSLISKAARQAGCMKIPGILYPENSDFAKRSKHVAWQAAVGSSTTAEQLALQVREF 1304 Query: 363 DANIRWDDIGNNKLLSMIEMDSKKPVRSFKKVVIRRKCSEGATVRYLLDFGKRRFIPDIV 542 D+NIRWD+I N LSM++ + +K R FKKV+IRRKC E +YLLDFGKRR IP++V Sbjct: 1305 DSNIRWDEIENTHPLSMLDKELRKSFRLFKKVIIRRKCVEEEGAKYLLDFGKRRCIPEVV 1364 Query: 543 VRYGSKLEDFSNEKKRYWLEETHVPLYLLKAFEEKRIARKSNPMKSGKLCESSQIIKKPL 722 + G +E+ S+E+K+YWL E++VPL+LLK+FEEK+IAR+S+ + SGKL ++ + KPL Sbjct: 1365 SKNGFMIEESSSERKKYWLNESYVPLHLLKSFEEKKIARRSSKISSGKLSDACAAVNKPL 1424 Query: 723 KEKGFSYLFSRAERLEYDQCEQCKKDVLIREAVSCQRCKGFFHKWHVRKSAGSITRQRKY 902 K++GFSYLF+RAER EY QC CKKDVLIREAV CQ CKG FHK H RKSAG+I + Y Sbjct: 1425 KKRGFSYLFARAERSEYHQCGHCKKDVLIREAVCCQLCKGSFHKRHARKSAGAIMAKCTY 1484 Query: 903 TCQKCQDGKFVK--------TDARKG------KPELPKLRNASKSVXXXXXXXXXXXXXX 1040 TC +C GK VK D ++G K + KL+ A+ Sbjct: 1485 TCHRCHYGKNVKKTNAKTVNIDNKRGKNSKITKVQERKLKKATVDRNSVRLKNSKKALKG 1544 Query: 1041 XXQVNLTNKKRVPFVVPLRRSARNAERVAKVTVQNSXXXXXXXXXXXXXXXXXXXXXXXX 1220 + N K+V VVPLRRSAR A++ K Sbjct: 1545 SRPILSRNNKKVT-VVPLRRSARKAKQ--KALQNKKALGCKRGRPAKSKKGANKKPKKGT 1601 Query: 1221 XXXXQRMPVTSSYWLNGLRLSRRPGDER--HLRNRKLLVLSGEVDCILNKTICSLCREVE 1394 +R SYWLNGL LSR+P DER H R ++ + S V I ++ C LC E Sbjct: 1602 SLHRKRTDTYYSYWLNGLLLSRKPDDERVAHFREKRYIAQSDSV--IDDQPKCHLCCEAG 1659 Query: 1395 HKSELNYVGCEICGDWFHGDALNLGADKIENLIGFKCHMCLNKTPPICPHHCPVESSNPE 1574 S +Y+ CE+CG+WFHGDA L A+ I LIGF+CHMCL KTPPICPH Sbjct: 1660 STSISSYISCEMCGEWFHGDAFGLDAENINKLIGFRCHMCLEKTPPICPHAAATSHEFEI 1719 Query: 1575 
IFSENNAKIECTEEDPHCLASPDNELAHQKSHFNDKSADTLSAVNMEKQLQVSVGELDLK 1754 +N+ +I+ +E D+ L ++ H D +V++E QL + Sbjct: 1720 GEVQNDVEIDFPKE------GTDSILHLEEDHSGILPVD--ESVHVEGQLGTGL------ 1765 Query: 1755 DEDLEMAEKIFLGNDPIELGGKKGD-VSSTLETESTIQNSDMVREAECPTIIHDLVENGM 1931 D + A K +LG + G + + +E IQ S+ E P +I EN M Sbjct: 1766 DSNQSFASK-------SKLGAENGHALDNVMENSDAIQTSN---ENLKPDLITSSNENHM 1815 >ref|XP_003540620.1| PREDICTED: uncharacterized protein LOC100791832 [Glycine max] Length = 1702 Score = 499 bits (1284), Expect = e-138 Identities = 263/565 (46%), Positives = 350/565 (61%), Gaps = 28/565 (4%) Frame = +3 Query: 6 TVGSATHIVSSSARASSKHGISRKRAKSSDV-SGPSSNAATGLSLFWWRGGRGSRSLFNW 182 T+GSATHIV SS+R SS+HGI RKRA++SD+ + SSN A+GL ++WWRGGR SR LFN Sbjct: 1131 TMGSATHIVVSSSRTSSRHGIGRKRARNSDIETSSSSNTASGLGMYWWRGGRLSRKLFNC 1190 Query: 183 KVLPHSLASKAARQGGRRKIPGILYPDSGEYAKRTKYDSWRAAVETSRSVEQLALQVREL 362 K LPHSL +KAARQGG RKIPGILYP++ ++A+R+++ +WRAAVE S S EQLALQVREL Sbjct: 1191 KALPHSLVTKAARQGGCRKIPGILYPENSDFARRSRFVAWRAAVEMSTSAEQLALQVREL 1250 Query: 363 DANIRWDDIGNNKLLSMIEMDSKKPVRSFKKVVIRRKCSEGATVRYLLDFGKRRFIPDIV 542 +NIRW DI NN L +++ +S+K VR FKK ++RRKC+EG +V++L+DFGKRR IPD+V Sbjct: 1251 YSNIRWHDIENNYSLYVLDKESRKSVRLFKKSIVRRKCTEGGSVKFLIDFGKRRAIPDVV 1310 Query: 543 VRYGSKLEDFSNEKKRYWLEETHVPLYLLKAFEEKRIARKSNPMKSGKLCESSQIIKKPL 722 +++GS LE ++E+K+YWLEE++VPL+LLK FEEKRI RKS K GK+ E ++ KK Sbjct: 1311 IKHGSLLEQSASERKKYWLEESYVPLHLLKNFEEKRIVRKSTDKKLGKILEIGRVNKKIP 1370 Query: 723 KEKGFSYLFSRAERLEYDQCEQCKKDVLIREAVSCQRCKGFFHKWHVRKSAGSITRQRKY 902 +++GFSYLF+R ER + QC C KDV +R+AV C CKG+FHK H RKS G T Y Sbjct: 1371 QQRGFSYLFTRLERSDCHQCRHCNKDVAMRDAVRCLHCKGYFHKRHARKSGGKRTTGSSY 1430 Query: 903 TCQKCQDGKFVKTDARKGK--PELPKLRNASKSV-------XXXXXXXXXXXXXXXXQVN 1055 +C +CQDG KT+ K K +L K++ + Q Sbjct: 1431 SCHRCQDGLHAKTNTNKRKVDSKLQKIQAKKRKTVPSVCKPVNLKGNKKALSNNKIRQAR 1490 Query: 1056 LTNKKRVPFVVPLRRSARNAERVAKVTVQNSXXXXXXXXXXXXXXXXXXXXXXXXXXXXQ 1235 N K +P +PLRRS R A+ + + N + Sbjct: 1491 
SRNSKNIPSSIPLRRSTRKAKSLYMQSQLNGGHKKGKKNVGRKKGKQGKTKKVIPQKSKE 1550 Query: 1236 ----------------RMPVTSSYWLNGLRLSRRPGDERHL--RNRKLLVLSGEVDCILN 1361 R + +SYWLNGL+LSR+P DER + + +K + S + L+ Sbjct: 1551 TTGQYKKSEVTTARKKRTKICNSYWLNGLQLSRKPNDERVMLFKEKKRVASSKDFSGSLD 1610 Query: 1362 KTICSLCREVEHKSELNYVGCEICGDWFHGDALNLGADKIENLIGFKCHMCLNKTPPICP 1541 C LC ++ LNY+ CEICGDWFHGDA L + LIGFKCH+CL++T PICP Sbjct: 1611 HPKCCLC--CGNECTLNYIACEICGDWFHGDAFGLNVENARQLIGFKCHVCLDRTAPICP 1668 Query: 1542 HHCPVESSNPEIFSENNAKIECTEE 1616 H N +E+NA IEC EE Sbjct: 1669 H----LKVNALSCTESNAAIECGEE 1689 >ref|XP_006470705.1| PREDICTED: uncharacterized protein LOC102628496 [Citrus sinensis] Length = 1761 Score = 497 bits (1280), Expect = e-138 Identities = 271/575 (47%), Positives = 360/575 (62%), Gaps = 15/575 (2%) Frame = +3 Query: 6 TVGSATHIVSSSARASSKHGISRKRAKSSDVSGPSSNAATGLSLFWWRGGRGSRSLFNWK 185 TVGSA+HIV +S+RA+SK G RK+A+ D PS+ AA GLSL WWRGGR S LF+WK Sbjct: 1082 TVGSASHIVIASSRANSKAGAGRKKARDFD-GNPSTKAAGGLSLCWWRGGRLSCQLFSWK 1140 Query: 186 VLPHSLASKAARQGGRRKIPGILYPDSGEYAKRTKYDSWRAAVETSRSVEQLALQVRELD 365 LP SL SKAARQ G KIPGILYP++ ++A+R++ +WRAAVE+S SVEQLA+QVRE D Sbjct: 1141 RLPRSLVSKAARQAGCMKIPGILYPENSDFARRSRTVAWRAAVESSTSVEQLAIQVREFD 1200 Query: 366 ANIRWDDIGNNKLLSMIEMDSKKPVRSFKKVVIRRKCSEGATVRYLLDFGKRRFIPDIVV 545 +N+RWDDI N L ++ + +K VR FKK +IRRKC + V+YL+DFGKRR +PDIV+ Sbjct: 1201 SNVRWDDIENTHPLCTMDKEFRKSVRLFKKAIIRRKCLKEEGVKYLVDFGKRRSVPDIVI 1260 Query: 546 RYGSKLEDFSNEKKRYWLEETHVPLYLLKAFEEKRIARKSNPMKSGKLCESSQIIKKPLK 725 R+GS E+ S+ +K+YWL E++VPL+LLK+FEE+R+ARKS + SGKL E ++IKK L+ Sbjct: 1261 RHGSMAEESSSGRKKYWLNESYVPLHLLKSFEERRVARKSPKLSSGKLSEPFRVIKKSLR 1320 Query: 726 EKGFSYLFSRAERLEYDQCEQCKKDVLIREAVSCQRCKGFFHKWHVRKSAGSITRQRKYT 905 ++GFSYLFS+A R EY QC C KDVLIR+AV CQ CKG+FHK H+RKSAG++T + KYT Sbjct: 1321 DRGFSYLFSKAARSEYYQCGHCSKDVLIRDAVCCQDCKGYFHKRHIRKSAGAVTTECKYT 1380 Query: 906 CQKCQDGKFVKTDARKGKPELPKLRNASKSVXXXXXXXXXXXXXXXXQ------------ 1049 C +CQDG+F K D R K K + ++SV Q Sbjct: 1381 
CYQCQDGRF-KKDTRTAKNGTKKGKMNTRSVKVKSQKSKKTTGRRSVQSKNSKKTVVGGR 1439 Query: 1050 -VNLTNKKRVPFVVPLRRSARNAERVAKVTVQNSXXXXXXXXXXXXXXXXXXXXXXXXXX 1226 + N K+V +PLRRSAR A+ V+VQN Sbjct: 1440 SLRSRNDKKVA-AIPLRRSARRAK---LVSVQNRKHAGRKRGRPKSKKKTSRKPKKTTSL 1495 Query: 1227 XXQRMPVTSSYWLNGLRLSRRPGDERHLR--NRKLLVLSGEVDCILNKTICSLCREVEHK 1400 +R SYWLNGL LSR+P D+R ++ + L S + L++ C LC E EH Sbjct: 1496 QKKRTQSYYSYWLNGLFLSRKPDDDRVMQFTRKNFLAASELLTDTLDQPKCYLCHEAEHT 1555 Query: 1401 SELNYVGCEICGDWFHGDALNLGADKIENLIGFKCHMCLNKTPPICPHHCPVESSNPEIF 1580 S NY+ CEICG+W+HGDA L + I LIGF+CH+C +T P+C + S ++ Sbjct: 1556 STSNYIACEICGEWYHGDAFGLKVENISKLIGFRCHVCRKRT-PVCSCMVSMGSDGSQLE 1614 Query: 1581 SENNAKIECTEEDPHCLASPDNELAHQKSHFNDKS 1685 ++ N KI C+EE L+ P KS+ D S Sbjct: 1615 AQTNYKIGCSEE----LSKPVVPFGELKSNPMDNS 1645 >ref|XP_006446212.1| hypothetical protein CICLE_v10014020mg [Citrus clementina] gi|557548823|gb|ESR59452.1| hypothetical protein CICLE_v10014020mg [Citrus clementina] Length = 1761 Score = 494 bits (1273), Expect = e-137 Identities = 271/575 (47%), Positives = 358/575 (62%), Gaps = 15/575 (2%) Frame = +3 Query: 6 TVGSATHIVSSSARASSKHGISRKRAKSSDVSGPSSNAATGLSLFWWRGGRGSRSLFNWK 185 TVGSA+HIV +S+RA+SK G RK+A+ D PS+ AA GLSL WWRGGR S LF+WK Sbjct: 1082 TVGSASHIVIASSRANSKAGAGRKKARDFD-GNPSTKAAGGLSLCWWRGGRLSCQLFSWK 1140 Query: 186 VLPHSLASKAARQGGRRKIPGILYPDSGEYAKRTKYDSWRAAVETSRSVEQLALQVRELD 365 LP SL SKAARQ G KIPGILYP++ ++A+R++ +WRAAVE+S SVEQLA+QVRE D Sbjct: 1141 RLPRSLVSKAARQAGCMKIPGILYPENSDFARRSRNVAWRAAVESSTSVEQLAIQVREFD 1200 Query: 366 ANIRWDDIGNNKLLSMIEMDSKKPVRSFKKVVIRRKCSEGATVRYLLDFGKRRFIPDIVV 545 +N+RWDDI N L ++ + +K VR FKK +IRRKC + V+YL+DFGKRR +PDIV+ Sbjct: 1201 SNVRWDDIENTHPLCTMDKEFRKSVRLFKKAIIRRKCLKEEGVKYLVDFGKRRSVPDIVI 1260 Query: 546 RYGSKLEDFSNEKKRYWLEETHVPLYLLKAFEEKRIARKSNPMKSGKLCESSQIIKKPLK 725 R+GS E+ S+ +K+YWL E++VPL+LLK+FEE+R+ARKS + SGKL E +IKK L+ Sbjct: 1261 RHGSMAEESSSGRKKYWLNESYVPLHLLKSFEERRVARKSPKLSSGKLSEPFGVIKKSLR 1320 Query: 726 
EKGFSYLFSRAERLEYDQCEQCKKDVLIREAVSCQRCKGFFHKWHVRKSAGSITRQRKYT 905 +GFSYLFS+A R EY QC C KDVLIR+AV CQ CKG+FHK H+RKSAG++T + KYT Sbjct: 1321 YRGFSYLFSKAARSEYYQCGHCSKDVLIRDAVCCQDCKGYFHKRHIRKSAGAVTTECKYT 1380 Query: 906 CQKCQDGKFVKTDARKGKPELPKLRNASKSVXXXXXXXXXXXXXXXXQ------------ 1049 C +CQDG+F K D R K K + ++SV Q Sbjct: 1381 CYQCQDGRF-KKDTRTAKNGTKKGKMNTRSVKVKSQKSKKTTGRRSVQSKNSKKTVVGGR 1439 Query: 1050 -VNLTNKKRVPFVVPLRRSARNAERVAKVTVQNSXXXXXXXXXXXXXXXXXXXXXXXXXX 1226 + N K+V +PLRRSAR A+ V+VQN Sbjct: 1440 SLRSRNDKKVA-AIPLRRSARRAK---LVSVQNRKHAGRKRGRPKSKKKTSRKPKKTTSL 1495 Query: 1227 XXQRMPVTSSYWLNGLRLSRRPGDERHLR--NRKLLVLSGEVDCILNKTICSLCREVEHK 1400 +R SYWLNGL LSR+P D+R ++ + L S + L++ C LC E EH Sbjct: 1496 QKKRTQSYYSYWLNGLFLSRKPDDDRVMQFTRKNFLAASELLTDTLDQPKCYLCHEAEHT 1555 Query: 1401 SELNYVGCEICGDWFHGDALNLGADKIENLIGFKCHMCLNKTPPICPHHCPVESSNPEIF 1580 S NY+ CEICG+W+HGDA L + I LIGF+CH+C +T P+C + S ++ Sbjct: 1556 STSNYIACEICGEWYHGDAFGLKVENISKLIGFRCHVCRKRT-PVCSCMVSMGSDGSQLE 1614 Query: 1581 SENNAKIECTEEDPHCLASPDNELAHQKSHFNDKS 1685 ++ N KI C+EE L+ P KS+ D S Sbjct: 1615 AQTNYKIGCSEE----LSKPVVPFGELKSNPMDNS 1645 >ref|XP_004505800.1| PREDICTED: uncharacterized protein LOC101501088, partial [Cicer arietinum] Length = 1746 Score = 484 bits (1246), Expect = e-134 Identities = 267/569 (46%), Positives = 351/569 (61%), Gaps = 32/569 (5%) Frame = +3 Query: 6 TVGSATHIVSSSARASSKHGISRKRAKSSDV-SGPSSNAATGLSLFWWRGGRGSRSLFNW 182 T+GSATHIV SAR SS+HG+ RKR++ S++ S +SN GL ++WWRGGR SR LFNW Sbjct: 1189 TMGSATHIVVGSARTSSRHGVGRKRSRHSNIESSSASNTTGGLGMYWWRGGRVSRKLFNW 1248 Query: 183 KVLPHSLASKAARQGGRRKIPGILYPDSGEYAKRTKYDSWRAAVETSRSVEQLALQVREL 362 KVLP S +KAARQ GR KIPGILYP++ ++AKR++Y +WRA+VE S SVEQLALQVREL Sbjct: 1249 KVLPRSFITKAARQAGRTKIPGILYPENSDFAKRSRYVAWRASVEISTSVEQLALQVREL 1308 Query: 363 DANIRWDDIGNNKLLSMIEMDSKKPVRSFKKVVIRRKCSEGATVRYLLDFGKRRFIPDIV 542 +NIRW DI NN L +++ +S+K VR FKK ++RRKC++G +V+YLLDFGKRR IPD+V Sbjct: 1309 YSNIRWHDIENNHPLYVLDKESRKSVRLFKKAIVRRKCTDGQSVKYLLDFGKRRAIPDVV 1368 Query: 543 
VRYGSKLEDFSNEKKRYWLEETHVPLYLLKAFEEKRIARKSNPMKSGKLCESSQIIKKPL 722 +++GS LE S+EKK+YWL E++VPL+L+K FEE+RI RKSN GK E ++ + P Sbjct: 1369 IKHGSLLEQPSSEKKKYWLNESYVPLHLVKNFEERRIVRKSNDKTLGKFLEIGRVKRVP- 1427 Query: 723 KEKGFSYLFSRAERLEYDQCEQCKKDVLIREAVSCQRCKGFFHKWHVRKSAGSITRQRKY 902 +++GFSYLFSR E+ + QC CKKDV I EAVSC CKGFFHK H +KS G+ + Y Sbjct: 1428 EQRGFSYLFSRMEKSNFHQCGHCKKDVPISEAVSCLYCKGFFHKRHAKKSGGTRATECTY 1487 Query: 903 TCQKCQDGKFVKTDARKGK--PELPKLRNAS-KSV----XXXXXXXXXXXXXXXXQVNLT 1061 +C++CQDG VKT+ K K +L K+++ + KSV QV Sbjct: 1488 SCRRCQDGLHVKTNTNKRKIGSKLQKIQSQNCKSVPLVCKSVKLKGKKKASSKVQQVISR 1547 Query: 1062 NKKRVPFVVPLRRSARNAERV---------AKVTVQNSXXXXXXXXXXXXXXXXXXXXXX 1214 N K + +VPLRRS R A+ + K +Q+ Sbjct: 1548 NSKNISSIVPLRRSTRKAKSLYLRNQMIGGRKNGIQSKRNVGRKKGKQSKSKKVTSQKPK 1607 Query: 1215 XXXXXXQRMPVT----------SSYWLNGLRLSRRPGDERHL--RNRKLLV---LSGEVD 1349 ++ VT +SYWLNGLR SR+P DER + + +K + SG D Sbjct: 1608 EPTGQHKKFAVTRACKKRTELCNSYWLNGLRFSRKPNDERVMLFKEKKHITSEDFSGSRD 1667 Query: 1350 CILNKTICSLCREVEHKSELNYVGCEICGDWFHGDALNLGADKIENLIGFKCHMCLNKTP 1529 C C LC E S NY+ CEICGDWFHGDA L + LIGF+CH+C ++ Sbjct: 1668 C----PKCCLCCGDEATS--NYIACEICGDWFHGDAFGLSVENARQLIGFRCHVCRDRIA 1721 Query: 1530 PICPHHCPVESSNPEIFSENNAKIECTEE 1616 PICPH N +E +A IEC EE Sbjct: 1722 PICPH----VKINALSHTEQDATIECQEE 1746