BLASTX nr result
ID: Mentha25_contig00012302
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00012302
         (1020 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

ref|XP_002276725.2| PREDICTED: structure-specific endonuclease s...   204   4e-50
gb|EYU46004.1| hypothetical protein MIMGU_mgv1a007264mg [Mimulus...   201   4e-49
ref|XP_006361297.1| PREDICTED: uncharacterized protein LOC102597...   199   1e-48
ref|XP_003537333.1| PREDICTED: uncharacterized protein LOC100801...   188   3e-45
ref|XP_007146860.1| hypothetical protein PHAVU_006G076300g [Phas...   187   6e-45
ref|XP_006410121.1| hypothetical protein EUTSA_v10016841mg [Eutr...   184   4e-44
ref|XP_004246967.1| PREDICTED: uncharacterized protein LOC101267...   184   5e-44
ref|XP_006431991.1| hypothetical protein CICLE_v10001469mg [Citr...   182   1e-43
ref|XP_007205306.1| hypothetical protein PRUPE_ppa006794mg [Prun...   175   2e-41
ref|XP_004294742.1| PREDICTED: uncharacterized protein LOC101299...   175   3e-41
gb|EXC19560.1| Structure-specific endonuclease subunit [Morus no...   173   1e-40
ref|XP_007017048.1| Excinuclease ABC [Theobroma cacao] gi|508787...   172   2e-40
ref|XP_007205311.1| hypothetical protein PRUPE_ppa006827mg [Prun...   171   6e-40
ref|XP_002517715.1| nuclease, putative [Ricinus communis] gi|223...   169   2e-39
ref|XP_004955835.1| PREDICTED: uncharacterized protein LOC101777...   169   2e-39
ref|XP_006294402.1| hypothetical protein CARUB_v10023419mg, part...   168   3e-39
emb|CBI15837.3| unnamed protein product [Vitis vinifera]              168   4e-39
ref|XP_002461708.1| hypothetical protein SORBIDRAFT_02g006850 [S...   167   6e-39
ref|NP_001132010.1| hypothetical protein [Zea mays] gi|194693186... 
166 1e-38 ref|XP_004145233.1| PREDICTED: uncharacterized protein LOC101203... 164 4e-38 >ref|XP_002276725.2| PREDICTED: structure-specific endonuclease subunit SLX1 homolog 2-like [Vitis vinifera] Length = 364 Score = 204 bits (520), Expect = 4e-50 Identities = 129/290 (44%), Positives = 162/290 (55%), Gaps = 24/290 (8%) Frame = +2 Query: 2 YGFPTNVAALQFEWAWQHPVESLAVRKAAASFKSLSGLANKIKLAYTMLTLPPWQSLNLT 181 YGFPTNV+ALQFEWAWQHP ESLAVRKAAA FKSLSG+ANKIKLAYTM TLP WQSLNLT Sbjct: 80 YGFPTNVSALQFEWAWQHPTESLAVRKAAAGFKSLSGIANKIKLAYTMFTLPAWQSLNLT 139 Query: 182 VNLFSTKYRHYTSACPALPEQMRTQVCSMDDLPCYN-ADSCINDDLDDIESECSFQKSTD 358 VN FSTKY +++ CP LPE MR QV MD+LPCY+ +D D+ E E ++ + Sbjct: 140 VNFFSTKYTKHSAGCPILPEHMRVQVSPMDELPCYSGSDQSFFDNARGDEKEELGERGSS 199 Query: 359 KESAGVVGEEEVDHFHNYYHISEEDMHH-RDGASPESSGYLARTWVGSQSK--------- 508 + V E + I E + D SPE +T + + Sbjct: 200 SDGFDQVIAHEETALEQFGWIEEHGLRQPGDSPSPEVVHCSGKTQENAMRQPADLSTSKD 259 Query: 509 ---------SSPIRQEDQG----IDKDSSSRLVEETGSDELLNKLPATAMDADESLPMTK 649 SP+R +DKD+S + + + +LPAT + AD P Sbjct: 260 EHRSPFCLIDSPVRTSSHSTEGTLDKDTSG--LSKENKVLTMKQLPAT-VAADRGKPKIS 316 Query: 650 SPDMSSREVEIIDIFTPSPCYMEKSGSKMKRRRPNMCPEVIDLTNSPIFV 799 S D +S E+E+ID+ + SP Y K KRR + PE+IDLTNSPIFV Sbjct: 317 SLD-TSCEIEVIDLLSCSPDYRTNPCFK-KRRATTVHPEIIDLTNSPIFV 364 >gb|EYU46004.1| hypothetical protein MIMGU_mgv1a007264mg [Mimulus guttatus] Length = 413 Score = 201 bits (511), Expect = 4e-49 Identities = 137/333 (41%), Positives = 177/333 (53%), Gaps = 67/333 (20%) Frame = +2 Query: 2 YGFPTNVAALQFEWAWQHPVESLAVRKAAASFKSLSGLANKIKLAYTMLTLPPWQSLNLT 181 +GFPTNVAALQFEWAWQHPVESLAVRKAA +FKSLSG+ANKIKLAYTMLTLPPWQSLNLT Sbjct: 89 HGFPTNVAALQFEWAWQHPVESLAVRKAAVNFKSLSGIANKIKLAYTMLTLPPWQSLNLT 148 Query: 182 VNLFSTKYRHYTSACPALPEQMRTQVCSMDDLPCYN-ADSC--INDDLDDIESECSFQKS 352 VNLFSTKY+ +TS CPALPEQMRT++ MDDLPCYN A+ C +NDD DD E +S Sbjct: 149 VNLFSTKYQTHTSGCPALPEQMRTKISPMDDLPCYNIANDCPIVNDD-DDDGCEGLSHES 207 Query: 353 TDKESAGVVGEEEVDHFHNYYHISEEDMHHRDGASPESSGYL-ARTWVGSQSKSSPIRQE 529 T++ES+ + 
D FH Y+ +EE +H +S SG A W + + E Sbjct: 208 TEEESS-----RKNDDFH--YYSNEEYIHRETESSGCCSGKTRAEVWPKNSPPITEATDE 260 Query: 530 DQGIDKD------------------SSSRLVEETGSDELLNKLPATAMDAD--------- 628 + D SS +T +++ P D D Sbjct: 261 ESSTKNDGFPNCVGSKEEYIHREIESSGCCAAKTRAEDWPKNSPQITEDEDKGQFFIVNN 320 Query: 629 -ESLPMTKSPDM---------SSREVEIIDIFTPSPCYMEKSGSK--------------- 733 ES P+ S + + + ++++ T +EK + Sbjct: 321 NESPPVRTSFSLHNSSCIAGNARKNHRLVELITEVDEPLEKESAAAATRLVATDKDEVEI 380 Query: 734 ---------MKRRRP--NMCPEVIDLTNSPIFV 799 K+RRP + P+VIDLTNSP++V Sbjct: 381 IDIITPLPCKKKRRPTSSFFPDVIDLTNSPMYV 413 >ref|XP_006361297.1| PREDICTED: uncharacterized protein LOC102597488 [Solanum tuberosum] Length = 369 Score = 199 bits (507), Expect = 1e-48 Identities = 124/276 (44%), Positives = 164/276 (59%), Gaps = 10/276 (3%) Frame = +2 Query: 2 YGFPTNVAALQFEWAWQHPVESLAVRKAAASFKSLSGLANKIKLAYTMLTLPPWQSLNLT 181 YGFPTNV+ALQFEWAWQHPVES AVR+AAASFK+L G+ANKIKLAY MLTLP WQSLNLT Sbjct: 104 YGFPTNVSALQFEWAWQHPVESRAVRQAAASFKTLGGVANKIKLAYAMLTLPEWQSLNLT 163 Query: 182 VNLFSTKYRHYTSACPALPEQMRTQVCSMDDLPCYNADSCINDDLDDIESECSFQKSTDK 361 VN FSTKY+ +++ CP+LPE MR +C++D+LPCY D+ E E S ++ TD+ Sbjct: 164 VNFFSTKYKMHSAGCPSLPEHMRVHICALDELPCYTGID--RDEYSTNEWENS-EELTDE 220 Query: 362 ESAGVVGEEEVDHFHNYYHISEEDMHHRDGASPESSGYLARTWVGSQSK---SSPIRQED 532 SA + E D H D + T S SP+ + Sbjct: 221 ISASSTNSNSSFSNQDKDSTDENDDEHTDWKELDERAGENSTCGREHSYIIIDSPVERSS 280 Query: 533 QGI-------DKDSSSRLVEETGSDELLNKLPATAMDADESLPMTKSPDMSSREVEIIDI 691 + DK L +E G ++ NK+ +T D+SL TK+ + S ++E+ID+ Sbjct: 281 SILGDFFHIADKKERHELDDEFG-EKQANKMCST--KTDDSL-ATKNAGLPS-DIEVIDV 335 Query: 692 FTPSPCYMEKSGSKMKRRRPNMCPEVIDLTNSPIFV 799 FTP PC ++ K +RR CPE+IDLT+SPI+V Sbjct: 336 FTP-PCSKVRADHK-RRRFSASCPEIIDLTDSPIYV 369 >ref|XP_003537333.1| PREDICTED: uncharacterized protein LOC100801307 [Glycine max] Length = 380 Score = 188 bits (477), Expect = 3e-45 Identities = 113/288 (39%), Positives = 159/288 (55%), Gaps = 22/288 (7%) Frame = +2 Query: 2 
YGFPTNVAALQFEWAWQHPVESLAVRKAAASFKSLSGLANKIKLAYTMLTLPPWQSLNLT 181 YGFPTNV+ALQFEWAWQHPVESLAVRKAA FKSLSG+ANKIKLAYTMLTLP WQS+N+T Sbjct: 91 YGFPTNVSALQFEWAWQHPVESLAVRKAAVEFKSLSGIANKIKLAYTMLTLPSWQSMNIT 150 Query: 182 VNLFSTKYRHYTSACPALPEQMRTQVCSMDDLPCYNAD-SCINDDLDDIESECSFQKSTD 358 VN FSTKY + + CP+LP M+T+ S+D+LPCYN ++++ DD E F + Sbjct: 151 VNFFSTKYMKHCAGCPSLPVHMKTKFGSLDELPCYNKGIDGLSENEDDTIDEVQFDDNNI 210 Query: 359 KESAGV-------VGEEEVDHFHNYYHISEEDMHHRDGAS---PESSGYLARTWVGSQSK 508 S V V + + ++ ISE +++ + P + + ++ S Sbjct: 211 STSGSVPDVSDDLVTPDSPQNPNDGDKISEAFEWNKESEAREPPLGNSFASQEQSQLFSS 270 Query: 509 SSPIRQEDQGIDKDSSSRLVEETGSDELLNK------LPATAMDADESLPMTKSPDMS-- 664 ++P+ + + ++EE ++NK P +L K+ D+ Sbjct: 271 TTPLTMKSSSTTSLQRAEIIEEDDFMSVMNKSDADLSQPEPEQSGATTLVANKNRDVGRT 330 Query: 665 ---SREVEIIDIFTPSPCYMEKSGSKMKRRRPNMCPEVIDLTNSPIFV 799 E EIID+ TPSP K +R ++ + IDLTNSP F+ Sbjct: 331 FVVPHETEIIDLSTPSPSCRSVLDRKKRRVSSSVGTDFIDLTNSPNFI 378 >ref|XP_007146860.1| hypothetical protein PHAVU_006G076300g [Phaseolus vulgaris] gi|561020083|gb|ESW18854.1| hypothetical protein PHAVU_006G076300g [Phaseolus vulgaris] Length = 374 Score = 187 bits (475), Expect = 6e-45 Identities = 117/287 (40%), Positives = 162/287 (56%), Gaps = 21/287 (7%) Frame = +2 Query: 2 YGFPTNVAALQFEWAWQHPVESLAVRKAAASFKSLSGLANKIKLAYTMLTLPPWQSLNLT 181 YGFPTNV+ALQFEWAWQHPVESLAVRKAA FKSLSG+ANKIKLAYTMLTLP WQS+N+T Sbjct: 90 YGFPTNVSALQFEWAWQHPVESLAVRKAAVEFKSLSGIANKIKLAYTMLTLPSWQSMNIT 149 Query: 182 VNLFSTKYRHYTSACPALPEQMRTQVCSMDDLPCYNADSCI---NDDLDDIESECSFQKS 352 VN FSTKY + + CP+LP M+T++ +D+LPCY+ + +D++DD+E + + S Sbjct: 150 VNFFSTKYMKHCAGCPSLPAHMKTKIGPLDELPCYSINGLSENEDDNIDDVEFDDNNNTS 209 Query: 353 T--------------DKESAGVVGEEEVDHFHNYYHISEEDMHHRDGASPESSGYLARTW 490 D + GE+ + F + SE +S E ++ T Sbjct: 210 ASGSVPDVSDDLDSPDSPKNQIHGEKISEAFDEWIKESEARESGNSFSSQEQRLPVSSTT 269 Query: 491 VGSQSKSSPIR---QEDQGIDKDSSSRLVEETGSDELLNKLPATAMDADESLPMTKSPDM 661 + SS I Q + I++ ++ +GS ++A+ + S + Sbjct: 270 
PLTMKSSSTITTPLQRIEIIEEADFMNVINRSGSGLSQPAQSGGTLEANTN-RTAGSTAV 328 Query: 662 SSREVEIIDIFTPSP-CYMEKSGSKMKRRRPNMCPEVIDLTNSPIFV 799 E EIID+ TPSP C + ++ KRR P+ + IDLTNSP FV Sbjct: 329 VPHEAEIIDLSTPSPSCGIV---NRKKRRVPSFVTDFIDLTNSPNFV 372 >ref|XP_006410121.1| hypothetical protein EUTSA_v10016841mg [Eutrema salsugineum] gi|557111290|gb|ESQ51574.1| hypothetical protein EUTSA_v10016841mg [Eutrema salsugineum] Length = 364 Score = 184 bits (468), Expect = 4e-44 Identities = 127/296 (42%), Positives = 159/296 (53%), Gaps = 30/296 (10%) Frame = +2 Query: 2 YGFPTNVAALQFEWAWQHPVESLAVRKAAASFKSLSGLANKIKLAYTMLTLPPWQSLNLT 181 YGFPTNV+ALQFEWAWQHP ESLAVR+AAA+FKS SGL +KIKLAYTMLTLP W SLNLT Sbjct: 86 YGFPTNVSALQFEWAWQHPRESLAVREAAAAFKSFSGLGSKIKLAYTMLTLPAWNSLNLT 145 Query: 182 VNLFSTKYRHYTSACPALPEQMRTQVCSMDDLPCY-NADSCINDDLDDIESECSFQKSTD 358 VN FSTKY H+ P+LP M+ QVC+MDDLPC+ D+ N +D ES S ++ D Sbjct: 146 VNYFSTKYAHHGGLSPSLPPHMKVQVCAMDDLPCFTKLDN--NSQPEDEESLDSHEEEED 203 Query: 359 KESAGVVGEEEVDHFHNYYHISEEDMHHRD---GASPES--SGYLAR-TWVGSQSKSSPI 520 + N ++ E+++H RD PE+ LA T GS Sbjct: 204 DRRNEIQPGNLTTSSSNDLYLGEKELHDRDFEKAKQPEAVLDDRLANFTGFGSL------ 257 Query: 521 RQEDQGIDKDSSSRLVEETGSDELLNKLPATAMD---------------ADESLPMTKSP 655 D+ ++ + S V GS E + K P T D D T Sbjct: 258 ---DESVEDEVSHITV---GSIEAMEKEPETVFDDRLANFTGFGLEDIVEDVISHSTMEK 311 Query: 656 D--------MSSREVEIIDIFTPSPCYMEKSGSKMKRRRPNMCPEVIDLTNSPIFV 799 D S+ EVE+ID+ TPSP + G MKR+R + E IDLT SP F+ Sbjct: 312 DCWRRSNLITSTTEVEVIDLMTPSPSC--RVGPSMKRQRVS---EFIDLTRSPSFI 362 >ref|XP_004246967.1| PREDICTED: uncharacterized protein LOC101267927 [Solanum lycopersicum] Length = 350 Score = 184 bits (467), Expect = 5e-44 Identities = 114/285 (40%), Positives = 157/285 (55%), Gaps = 19/285 (6%) Frame = +2 Query: 2 YGFPTNVAALQFEWAWQHPVESLAVRKAAASFKSLSGLANKIKLAYTMLTLPPWQSLNLT 181 YGFPTNV+ALQFEWAWQHPVES AVR+AAASFK+L G+ANKIKLAYTMLTLP WQSLNLT Sbjct: 81 YGFPTNVSALQFEWAWQHPVESRAVRQAAASFKTLGGVANKIKLAYTMLTLPEWQSLNLT 140 Query: 182 
VNLFSTKYRHYTSACPALPEQMRTQVCSMDDLPCYNA-------DSCINDDL------DD 322 VN FSTKY+ +++ CP+LPE MR +C++D+LPCY + C D+L D Sbjct: 141 VNFFSTKYKMHSAGCPSLPEHMRVHICALDELPCYTGIDRDEWENICALDELPSYTGIDR 200 Query: 323 IESECSFQKSTDKESAGVVGEEEVDHFHNYYHISEE----DMHHRDGASPESSGYLARTW 490 E E + + +E + F N E+ ++ R G + + Sbjct: 201 DEWENREECESSEELTDEISTNSNSSFSNQDKDDEQTDWRELDERAGENSTRGREHSYII 260 Query: 491 VGSQSKSSPIRQED--QGIDKDSSSRLVEETGSDELLNKLPATAMDADESLPMTKSPDMS 664 + S ++ Q D DK +L +E G ++ NK+ + + LP Sbjct: 261 IDSPAERLCSIQGDFFHIADKKERHQLDDEFGENQA-NKMYDSLATKNAGLPC------- 312 Query: 665 SREVEIIDIFTPSPCYMEKSGSKMKRRRPNMCPEVIDLTNSPIFV 799 ++E+ID+FTP +RR PE+IDLT+SP++V Sbjct: 313 --DIEVIDVFTPPV-----RADNKRRRLSASVPEIIDLTDSPVYV 350 >ref|XP_006431991.1| hypothetical protein CICLE_v10001469mg [Citrus clementina] gi|568827655|ref|XP_006468166.1| PREDICTED: uncharacterized protein LOC102631105 [Citrus sinensis] gi|557534113|gb|ESR45231.1| hypothetical protein CICLE_v10001469mg [Citrus clementina] Length = 386 Score = 182 bits (463), Expect = 1e-43 Identities = 120/285 (42%), Positives = 158/285 (55%), Gaps = 25/285 (8%) Frame = +2 Query: 2 YGFPTNVAALQFEWAWQHPVESLAVRKAAASFKSLSGLANKIKLAYTMLTLPPWQSLNLT 181 YGFPTNV+ALQFEWAWQHP+ESLAVR+AAA+FKS SG+ANKIKLAYTML LP W+SLN+T Sbjct: 107 YGFPTNVSALQFEWAWQHPMESLAVRRAAATFKSFSGVANKIKLAYTMLNLPNWESLNIT 166 Query: 182 VNLFSTKYRHYTSACPALPEQMRTQVCSMDDLPCY-NADSCINDDLDDIESECSFQKSTD 358 VN FSTKY ++S+CP LPE M+ QV SMD+LPCY D + D D + E + S + Sbjct: 167 VNYFSTKYSKHSSSCPNLPEHMKVQVRSMDELPCYTERDERLLGDEDSLGDEEYDEASEN 226 Query: 359 KESAGVVGEEEVDHFHNYYHIS-EEDMHHRDGASPESSGYLARTWVGSQSKSSPIRQEDQ 535 S + +F + Y S ED + + G + R S QE Sbjct: 227 SGSLEETRGDVTINFSSDYSFSIYEDAYEQCGQFKQYGNEQPR----DSSCLEVNCQEPF 282 Query: 536 GIDKDSSSRLVEETGSDELLNKLP-------ATAMDADESLPMTKSPDMSSR-------- 670 G+ + V + S E N+L ATA++ +E+ + ++ Sbjct: 283 GLLSSLETTSVISSTSAEDTNELGRQRSEQCATAVNDEENQQFAQRQSITIEVANKDQLQ 342 Query: 671 --------EVEIIDIFTPSPCYMEKSGSKMKRRRPNMCPEVIDLT 781 VE+ID+ TPSP E S SK KRR ++CP +IDLT Sbjct: 343 
VQSSTGLPNVEVIDLLTPSPNCREMSYSK-KRRVSSLCPVIIDLT 386 >ref|XP_007205306.1| hypothetical protein PRUPE_ppa006794mg [Prunus persica] gi|462400948|gb|EMJ06505.1| hypothetical protein PRUPE_ppa006794mg [Prunus persica] Length = 395 Score = 175 bits (444), Expect = 2e-41 Identities = 124/324 (38%), Positives = 164/324 (50%), Gaps = 58/324 (17%) Frame = +2 Query: 2 YGFPTNVAALQFEWAWQHPVESLAVRKAAASFKSLSGLANKIKLAYTMLTLPPWQSLNLT 181 YGFPTNV+ALQFEWAWQ+P S AVR+AAASFKSL GL +KIKLAYTMLTLPPWQSLN+T Sbjct: 83 YGFPTNVSALQFEWAWQNPTVSKAVRQAAASFKSLGGLVSKIKLAYTMLTLPPWQSLNIT 142 Query: 182 VNLFSTKYRHYTSACPALPEQMRTQVCSMDDLPCYNADSC--INDDLDDIESECSFQKST 355 VN FST+Y +++ C LPEQM+ +VCSMD+LP SC I+DDL + E E ++ Sbjct: 143 VNFFSTQYTKHSAGCLRLPEQMKVKVCSMDELP-----SCTKISDDLFENEDEWCNEREF 197 Query: 356 DKE-----------------SAGVVGEEEVDHFHNYYHISEEDMHHRDGASPE------- 463 D+ + VGE+E +Y+ E D DG E Sbjct: 198 DEHMNTNDQQSDSGKRINEVCSKEVGEDE------WYNGRECDEAVNDGTLQEETLSDLI 251 Query: 464 ----------------SSGYLARTWVGSQSK------SSPIRQEDQGIDKDSSSRLVEET 577 + Y VG +SP+R + + + ++T Sbjct: 252 VQSSADDQQDNTGKTINKAYRCSQEVGEDCTEQFGFIASPMRMPSSNVTTSFDTEVTKDT 311 Query: 578 GS-DELLNKLPATAMDADESLPMTKSPDMSSRE--------VEIIDIFTPSP-CYMEKSG 727 GS D + KL AM+ E L + D S E+ID+ TP+P C G Sbjct: 312 GSADAISVKLGRPAMEQLEQLTTIVADDDQSPSRSYLRPCGAEVIDLTTPAPLCRSHLCG 371 Query: 728 SKMKRRRPNMCPEVIDLTNSPIFV 799 K R ++ P++IDLT SP F+ Sbjct: 372 K--KSRVASVYPQIIDLTKSPNFI 393 >ref|XP_004294742.1| PREDICTED: uncharacterized protein LOC101299940 [Fragaria vesca subsp. 
vesca] Length = 400 Score = 175 bits (443), Expect = 3e-41 Identities = 117/325 (36%), Positives = 165/325 (50%), Gaps = 59/325 (18%) Frame = +2 Query: 2 YGFPTNVAALQFEWAWQHPVESLAVRKAAASFKSLSGLANKIKLAYTMLTLPPWQSLNLT 181 YGFPTN +ALQFEWAWQ+P S AVRKAAA+FKSL G ANKIKLAYTMLTLPPW+SLNLT Sbjct: 80 YGFPTNTSALQFEWAWQNPYVSKAVRKAAANFKSLGGFANKIKLAYTMLTLPPWESLNLT 139 Query: 182 VNLFSTKYRHYTSACPALPEQMRTQVCSMDDLPCYNADSCINDDLDDIESECSFQKSTDK 361 VN FST++ + + CP LPEQM+ ++C MD+LP SCI+DD+ D E E +K D+ Sbjct: 140 VNFFSTEHTKHAAGCPRLPEQMKVKICPMDELP-----SCISDDVSDNEDEWYNEKENDE 194 Query: 362 E------SAGVVGEEEVDHFHNYYHISE----------EDMHHRDGASPES--------- 466 S VV D ++ + S ED + D S E+ Sbjct: 195 TMNISTLSEPVVPNSADDQHNDIGNRSNEVYAQDKEVGEDEWYNDKVSDEAMNSGLSWEE 254 Query: 467 --SGYLAR-----TWVGSQSKSSPIRQEDQGIDKDSSSRLVEETGSDELLNKLPATAMDA 625 S ++ R + + + SS + + ++ + +D + + N +P+ +A Sbjct: 255 TLSNFMVRDSANDLEMDTGNTSSQVSRCNEEVQEDITGEFITSPLRMPYSNVIPSFDTEA 314 Query: 626 DESLPM----TKSPDMSSR-----------------------EVEIIDIFTPSPCYMEKS 724 +++ + T D +R + E++D+ TPSP Sbjct: 315 SKNIGLFDDSTVELDRPARKQSPAIIVADEEQSPRNSYLRPCDSEVVDLITPSPLCRNGL 374 Query: 725 GSKMKRRRPNMCPEVIDLTNSPIFV 799 K K R P PE+IDLT SP F+ Sbjct: 375 CGK-KSRVPTSYPEIIDLTKSPNFI 398 >gb|EXC19560.1| Structure-specific endonuclease subunit [Morus notabilis] Length = 378 Score = 173 bits (438), Expect = 1e-40 Identities = 116/285 (40%), Positives = 153/285 (53%), Gaps = 21/285 (7%) Frame = +2 Query: 2 YGFPTNVAALQFEWAWQHPVESLAVRKAAASFKSLSGLANKIKLAYTMLTLPPWQSLNLT 181 +GFP+NV+ALQFEWAWQHP ESLAVRKAAASFKSLSG+ANKIKLAYTMLTLP WQSLN+T Sbjct: 88 HGFPSNVSALQFEWAWQHPNESLAVRKAAASFKSLSGIANKIKLAYTMLTLPSWQSLNIT 147 Query: 182 VNLFSTKYRHYTSACPALPEQMRTQVCSMDDLPCYNADSCINDDLDDIESECSFQKSTDK 361 VN FSTKY +++ C +LP+ + ++C MD+LPCY + D E+E + + ++ Sbjct: 148 VNYFSTKYTQHSAGCLSLPQHKKVKICPMDELPCY-----VKGDEGLFENEGEWD-NEER 201 Query: 362 ESAGVVGEEEVDHFHNYYHISEEDMHHRDGASPESSGYLARTWVGSQSKSSPIRQEDQGI 541 + AG E + N + E+ H ++G QS + + Sbjct: 202 
DEAGSGSESAEETLSNSMFGNTEE-HDKNGLGKLYGWITEGEDCREQSTFAELPARPSSN 260 Query: 542 DKDSSS---RLVEETGSDELLN----KLPATAMDADESL-----PMTKSPDMSSREVEII 685 S S ++TG L K A D +SL S + EVEII Sbjct: 261 VSSSGSLAGEFTDDTGISGLFKDESFKSKRPAKDPSKSLVTIDDDQPPSSHIVPSEVEII 320 Query: 686 DIFTPSP-CYMEKSGSKMKRRRPNMCP-------EVIDLTN-SPI 793 D+ TPSP C G+K +R N P EV+DLT SP+ Sbjct: 321 DVTTPSPLCRSSLWGNKANKRARNKEPHNAPGEVEVVDLTTPSPL 365 >ref|XP_007017048.1| Excinuclease ABC [Theobroma cacao] gi|508787411|gb|EOY34667.1| Excinuclease ABC [Theobroma cacao] Length = 460 Score = 172 bits (436), Expect = 2e-40 Identities = 82/120 (68%), Positives = 97/120 (80%), Gaps = 8/120 (6%) Frame = +2 Query: 2 YGFPTNVAALQFEWAWQHPVESLAVRKAAASFKSLSGLANKIKLAYTMLTLPPWQSLNLT 181 YGFPTNV+ALQFEWAWQHP ES+AVR+AAA+FKSLSG+ANKIKLAYTMLTLP WQSLN+T Sbjct: 118 YGFPTNVSALQFEWAWQHPQESVAVREAAATFKSLSGVANKIKLAYTMLTLPAWQSLNIT 177 Query: 182 VNLFSTKYRHYTSACPALPEQMRTQVCSMDDLPCYN-------ADSCIN-DDLDDIESEC 337 VN FSTKYR ++ CP+LPEQM+ QVCSM++LPCY D C N D+ D++ C Sbjct: 178 VNYFSTKYRKDSACCPSLPEQMKVQVCSMNELPCYTEQDEFEYKDDCDNLDEYDEVNDTC 237 >ref|XP_007205311.1| hypothetical protein PRUPE_ppa006827mg [Prunus persica] gi|462400953|gb|EMJ06510.1| hypothetical protein PRUPE_ppa006827mg [Prunus persica] Length = 393 Score = 171 bits (432), Expect = 6e-40 Identities = 118/316 (37%), Positives = 161/316 (50%), Gaps = 50/316 (15%) Frame = +2 Query: 2 YGFPTNVAALQFEWAWQHPVESLAVRKAAASFKSLSGLANKIKLAYTMLTLPPWQSLNLT 181 YGFPTNV+ALQFEWAWQ+P S AVR+AAASFKSL GLA+KIKLAYTMLTLPPWQSLN+T Sbjct: 83 YGFPTNVSALQFEWAWQNPTVSKAVRQAAASFKSLGGLASKIKLAYTMLTLPPWQSLNIT 142 Query: 182 VNLFSTKYRHYTSACPALPEQMRTQVCSMDDLP---------CYNADSCIND-------- 310 +N FST+Y +++ CP LPEQM+ +VCSMD+LP N D N+ Sbjct: 143 INFFSTQYTKHSAGCPRLPEQMKVKVCSMDELPSCTKLSDDLLENEDEWCNEGEFDEDMN 202 Query: 311 DLDDIESECSFQKSTDKESAGVVGEEEVDHFHNYYHISEEDMHHRDGASPE--SSGYLAR 484 DD +S+ + + + VGE+E +Y+ E D DG E SS + + Sbjct: 203 TTDDQQSDSGNRMNEVYRCSKEVGEDE------WYNGRECDEAMNDGTLQEETSSDLIVQ 256 Query: 485 
TWVGSQSK--------------------------SSPIRQEDQGIDKDSSSRLVEETGS- 583 + Q +SP+R + + + ++ GS Sbjct: 257 SSADDQQDNTAKTNKAHQGSQEVGEDCTEQFGFIASPVRTPSSNVTTSFGTEVTKDIGSA 316 Query: 584 DELLNKLPATAMDADESLPMT-KSPDMSSRE---VEIIDIFTPSPCYMEKSGSKMKRRRP 751 D + KL AM+ ++ +SP S E+ID+ TP+ K R P Sbjct: 317 DAISVKLGQPAMEQLTTIVADHQSPSRSYLRPCGAEVIDLTTPASLCRSHLCGKKSRVAP 376 Query: 752 NMCPEVIDLTNSPIFV 799 + P +IDLT SP F+ Sbjct: 377 -VYPRIIDLTKSPNFI 391 >ref|XP_002517715.1| nuclease, putative [Ricinus communis] gi|223543113|gb|EEF44647.1| nuclease, putative [Ricinus communis] Length = 413 Score = 169 bits (428), Expect = 2e-39 Identities = 122/333 (36%), Positives = 165/333 (49%), Gaps = 73/333 (21%) Frame = +2 Query: 2 YGFPTNVAALQFEWAWQHPVESLAVRKAAASFKSLSGLANKIKLAYTMLTLPPWQSLNLT 181 YGFPTNV+ALQFEWAWQHP+ESLAVR+AAA+FKS SG+ANKIKLAYTML L WQSLN+T Sbjct: 83 YGFPTNVSALQFEWAWQHPMESLAVRQAAATFKSFSGVANKIKLAYTMLNLSAWQSLNIT 142 Query: 182 VNLFSTKYRHYTSACPALPEQMRTQVCSMDDLPCY---NADSCINDDLDDIESECSFQKS 352 VN FSTKY ++ACP+LPE M+ QVC + +LPCY S D +D + ++ Sbjct: 143 VNYFSTKYSILSAACPSLPEHMKIQVCPVVELPCYKETGESSLECQDAEDGFDDKENYEN 202 Query: 353 TDKESAGVVGE------EEVDHFHNYYHISEEDMHHRDGASPESSGY-----------LA 481 T ES V G+ + +D F ++ E +D S + Y Sbjct: 203 TTSESGAVKGKTVEFQSQSLDKFPDFNRGEEIAFEGQDSNSNKDEEYNEVSQKNGTLDQI 262 Query: 482 RTWVGSQSKSSPIRQEDQGIDK-----DSSSR--LVEETGSD---------------ELL 595 RT Q S +D +K D S+R ++ T +D Sbjct: 263 RTDAFGQISSDNSHTDDWTCEKFGSCEDYSTRHPSLKNTSADYPPAPKVDCARPFGFPTS 322 Query: 596 NKLPATAMDADESLPMTKS-------------PDMSSR-----------------EVEII 685 N L TA P++++ D+ SR E+E+I Sbjct: 323 NSLVRTASSLCTGFPISETSNGDELMLINNSVSDLGSRNGKILTGKDDKDKPIPQEIEVI 382 Query: 686 DIFTPSP-CYMEKSGSKMKRRRPNMCPEVIDLT 781 D+ +PSP C + S+ KRR +CP++IDLT Sbjct: 383 DLLSPSPECRI--MSSRKKRRFLTVCPQIIDLT 413 >ref|XP_004955835.1| PREDICTED: uncharacterized protein LOC101777363 [Setaria italica] Length = 377 Score = 169 bits (427), Expect = 2e-39 Identities = 106/292 (36%), Positives = 156/292 (53%), Gaps = 26/292 (8%) Frame = +2 Query: 2 
YGFPTNVAALQFEWAWQHPVESLAVRKAAASFKSLSGLANKIKLAYTMLTLPPWQSLNLT 181 YGFP+NVAALQFEWAWQHP ESLAVRKAAA FKSL G+ NK+KLAYTML LP W+SLNLT Sbjct: 105 YGFPSNVAALQFEWAWQHPAESLAVRKAAAEFKSLGGIGNKVKLAYTMLNLPSWESLNLT 164 Query: 182 VNLFSTKYRHYTSACPALPEQMRTQVCSMDDLPCYNADSCINDDLDDIESECSFQKSTDK 361 VN FS+K +T+ CP+LP QM+T VC+M+DL C +A+ ++D DD+ + Q ++ Sbjct: 165 VNFFSSKNTKFTAGCPSLPSQMKTVVCAMEDLQC-SAEGPSSED-DDLSQDP--QDQQEQ 220 Query: 362 ESAGVVGEEEVDHFHNYYHISEEDMHHRDGASPESSGYLARTWVGSQSKSSPIRQED--Q 535 + + +E H+ H + + S A+ VG + P +ED Sbjct: 221 SDSPLQDDEHSQHYEQSGHCWQ-----------QPSSDQAQPMVGQTGIAGPDVEEDPID 269 Query: 536 GIDKDSSSRLVEETGSDELLNKLPATAMD--------ADESLPMTKSP------------ 655 G S +++ + P ++ A E P SP Sbjct: 270 GFGPRKWSEILDIRTEVDEPRTSPRCSLSLSGDDCGTATEDEPGHLSPLLMFGAAGSDDG 329 Query: 656 --DMSSREVEIIDIFTPSPCYMEKSGSKMKRRR--PNMCPEVIDLTNSPIFV 799 + +++D+ TP+P +++RR ++CP++IDLT+SP+ + Sbjct: 330 GGHILDGSADVVDLVTPTPV------GRLRRRGCVASVCPKIIDLTSSPVVI 375 >ref|XP_006294402.1| hypothetical protein CARUB_v10023419mg, partial [Capsella rubella] gi|482563110|gb|EOA27300.1| hypothetical protein CARUB_v10023419mg, partial [Capsella rubella] Length = 382 Score = 168 bits (426), Expect = 3e-39 Identities = 113/297 (38%), Positives = 153/297 (51%), Gaps = 31/297 (10%) Frame = +2 Query: 2 YGFPTNVAALQFEWAWQHPVESLAVRKAAASFKSLSGLANKIKLAYTMLTLPPWQSLNLT 181 YGFPTNV+ALQFEWAWQHP ESLAVR+AAA+FKS G+A KIKL YTML LP W SLNLT Sbjct: 92 YGFPTNVSALQFEWAWQHPRESLAVREAAAAFKSFPGIAGKIKLVYTMLNLPAWNSLNLT 151 Query: 182 VNLFSTKYRHYTSACPALPEQMRTQVCSMDDLPCY----NADSCINDDLDDIESECSFQK 349 VN FS+KY HY P+LP M+ +VC+M+DLP + N+ +D+ ++ E + Sbjct: 152 VNYFSSKYAHYGGLAPSLPLHMKVEVCAMEDLPYFTKLDNSSQPEDDESPEVNEEAEDED 211 Query: 350 STDKESAGVVGEEEVDHFHNYYHISEEDMHHRDGASP------------ESSGYLARTWV 493 S + + D + + D H P G L V Sbjct: 212 SNQSQPGNSGASSQDDLYPGEKEL--HDRHFEKAKEPVTVLDEDRLANFSGFGSLEEEAV 269 Query: 494 GSQSKSSPIRQEDQGIDKDSSS----RLVEETG-------SDELLNKLPATAMDADESLP 640 + SP+ + +DK+ + RL TG DE ++ +A E Sbjct: 270 
EDEVSHSPVGSIEV-MDKEPETVFVDRLANFTGFGLVEIVEDEEVSHGTVRNTEAMEKDS 328 Query: 641 MTKSPDMSSR----EVEIIDIFTPSPCYMEKSGSKMKRRRPNMCPEVIDLTNSPIFV 799 + ++S +VE+ID+ TPSP ++GS MKRRR + E IDLT SP F+ Sbjct: 329 WIRRNLITSTTTEVDVEVIDLMTPSPSC--RAGSSMKRRRVS---EFIDLTRSPNFI 380 >emb|CBI15837.3| unnamed protein product [Vitis vinifera] Length = 346 Score = 168 bits (425), Expect = 4e-39 Identities = 88/156 (56%), Positives = 103/156 (66%), Gaps = 2/156 (1%) Frame = +2 Query: 2 YGFPTNVAALQFEWAWQHPVESLAVRKAAASFKSLSGLANKIKLAYTMLTLPPWQSLNLT 181 YGFPTNV+ALQFEWAWQHP ESLAVRKAAA FKSLSG+ANKIKLAYTM TLP WQSLNLT Sbjct: 80 YGFPTNVSALQFEWAWQHPTESLAVRKAAAGFKSLSGIANKIKLAYTMFTLPAWQSLNLT 139 Query: 182 VNLFSTKYRHYTSACPALPEQMRTQVCSMDDLPCYN-ADSCINDDLDDIESECSFQKSTD 358 VN FSTKY +++ CP LPE MR QV MD+LPCY+ +D D+ E E ++ + Sbjct: 140 VNFFSTKYTKHSAGCPILPEHMRVQVSPMDELPCYSGSDQSFFDNARGDEKEELGERGSS 199 Query: 359 KESAGVVGEEEVDHFHNYYHISEEDMHH-RDGASPE 463 + V E + I E + D SPE Sbjct: 200 SDGFDQVIAHEETALEQFGWIEEHGLRQPGDSPSPE 235 >ref|XP_002461708.1| hypothetical protein SORBIDRAFT_02g006850 [Sorghum bicolor] gi|241925085|gb|EER98229.1| hypothetical protein SORBIDRAFT_02g006850 [Sorghum bicolor] Length = 386 Score = 167 bits (423), Expect = 6e-39 Identities = 103/279 (36%), Positives = 152/279 (54%), Gaps = 13/279 (4%) Frame = +2 Query: 2 YGFPTNVAALQFEWAWQHPVESLAVRKAAASFKSLSGLANKIKLAYTMLTLPPWQSLNLT 181 YGFP+NVAALQFEWAWQHP ESLAVRKAAA FKSLSG+ NK+KLAYTML LP W++LNL Sbjct: 112 YGFPSNVAALQFEWAWQHPTESLAVRKAAAEFKSLSGIGNKVKLAYTMLNLPSWENLNLA 171 Query: 182 VNLFSTKYRHYTSACPALPEQMRTQVCSMDDLPCYNADS-CINDDLDDIESECSFQKSTD 358 VN FS+K +T+ CP+LP QM+T VC+M+DL C AD +D +DI Q + + Sbjct: 172 VNFFSSKNTKFTAGCPSLPSQMKTVVCAMEDLQCQQADGPSSEEDGNDIRDPEEPQDNDE 231 Query: 359 KESAGVVGEEEVDHFHNYYHISEED----MHHRDGASPESSGYLARTWVGSQSKSSPIRQ 526 + S + + H + S +D M + G + + S + + Sbjct: 232 ELSDSSLRDGYSYSDHCFQQPSSDDQVQPMDEQTGTAGSDVEDDLADELAPAMGWSQLLE 291 Query: 527 EDQGIDKDSSSRLVEETGSDELLNKLPATAMDADESLPMTKSPDMSSR------EVEIID 688 + ++ +S L + E + + + + +P S D R +++D 
Sbjct: 292 ARRELNGPRTSPLCSLSPCSEDVGLEEGSGLMSPLLMPNASSDDDDGRGRRILYGNDVVD 351 Query: 689 IFTPSPCYMEKSGSKMKRRR--PNMCPEVIDLTNSPIFV 799 + TP+P ++ RR ++CP++IDLT+SPI + Sbjct: 352 LVTPTPV------GRLPRRDCVSSICPKIIDLTSSPIVI 384 >ref|NP_001132010.1| hypothetical protein [Zea mays] gi|194693186|gb|ACF80677.1| unknown [Zea mays] gi|195627240|gb|ACG35450.1| hypothetical protein [Zea mays] gi|414884064|tpg|DAA60078.1| TPA: hypothetical protein ZEAMMB73_892976 [Zea mays] Length = 377 Score = 166 bits (420), Expect = 1e-38 Identities = 110/290 (37%), Positives = 151/290 (52%), Gaps = 24/290 (8%) Frame = +2 Query: 2 YGFPTNVAALQFEWAWQHPVESLAVRKAAASFKSLSGLANKIKLAYTMLTLPPWQSLNLT 181 YGFP+NVAALQFEWAWQHP ESLAVRKAAA FKSL G+ NK+KLAYTML LP W+SLNLT Sbjct: 109 YGFPSNVAALQFEWAWQHPTESLAVRKAAAEFKSLGGIGNKVKLAYTMLNLPSWESLNLT 168 Query: 182 VNLFSTKYRHYTSACPALPEQMRTQVCSMDDLPCYNADSCINDDLDDIESE--------- 334 VN FS+K +T+ CP+LP QM+ VC M+DL C +D +DI E Sbjct: 169 VNFFSSKNTKFTTGCPSLPSQMKAVVCGMEDLQCQPDGPSSEEDDNDIRDESQDNGEEPP 228 Query: 335 -------------CSFQKSTDKESAGVVGEEEVDHFHNYYHISEEDMHHRDGASPESSGY 475 C Q S+D+ A + E+ + E+D +S E S Sbjct: 229 DSPIRDGFSYSDYCFQQPSSDQ--AQPMDEQTISAGSGV----EDDFVDEFASSMERSEI 282 Query: 476 LARTWVGSQSKSSPIRQEDQGIDKDSSSRLVEETGSDELLNKLPATAMDADESLPMTKSP 655 L + ++SP+ + D L EE G L +P + DA + + Sbjct: 283 LGTRRGLNGPRTSPLCSLGACSNDDG---LEEEAGLMSPL-LMPNASSDAGDGRHILNGN 338 Query: 656 DMSSREVEIIDIFTPSPCYMEKSGSKMKRRR--PNMCPEVIDLTNSPIFV 799 ++D+ TP+P +++RR ++CP++IDLT+SPI + Sbjct: 339 -------HVVDLVTPTPL------GRLRRRDCISSICPKIIDLTSSPIVI 375 >ref|XP_004145233.1| PREDICTED: uncharacterized protein LOC101203492 [Cucumis sativus] gi|449471301|ref|XP_004153269.1| PREDICTED: uncharacterized protein LOC101204996 [Cucumis sativus] gi|449506301|ref|XP_004162709.1| PREDICTED: uncharacterized protein LOC101229010 [Cucumis sativus] Length = 395 Score = 164 bits (416), Expect = 4e-38 Identities = 115/317 (36%), Positives = 166/317 (52%), Gaps = 51/317 (16%) Frame = +2 Query: 2 
YGFPTNVAALQFEWAWQHPVESLAVRKAAASFKSLSGLANKIKLAYTMLTLPPWQSLNLT 181 YGFPTNV+ALQFEWAWQHP ESLAVR AAA+FKSLSG+ANK+KLAYTMLTLP W+ LN+T Sbjct: 89 YGFPTNVSALQFEWAWQHPNESLAVRSAAATFKSLSGVANKVKLAYTMLTLPAWRGLNIT 148 Query: 182 VNLFSTKYRHYTSACPALPEQMRTQVCSMDDLPCYNADSCINDDLDDIESECSFQKSTDK 361 VN FSTK+ + CP+LPE M+ QV +++LPCY+ D D +E+E ++ + ++ Sbjct: 149 VNYFSTKFMKNAAGCPSLPEHMKVQVSPINELPCYS-----EGDQDMLENEGDWEYNRER 203 Query: 362 E---------SAGVVGEEEVDHFHNYY--------HI---SEEDMHHRDGASPES--SGY 475 E S V E +Y H+ ++++ + P S Y Sbjct: 204 EEICGFRVYGSMKEVSNEVPQKLMDYQTGTDGRPPHVLRGCDKELETNEQVPPSSCTPSY 263 Query: 476 LARTWVGSQSKSSPIRQEDQGIDKD-------SSSRLVEETGSDELL------NKLPATA 616 + S + D+G++ D S +V T E++ N+L ++ Sbjct: 264 I------DVGMSYDLCACDEGLENDEREAASCGQSCIVAGTSRTEIVIDDEEENQLEGSS 317 Query: 617 MDAD-----ESLPMTKSPDMS-----------SREVEIIDIFTPSPCYMEKSGSKMKRRR 748 M+ E+L + ++S + E E+ID+ TPSP S + KRR Sbjct: 318 MNLQEQPGRENLTSGIASEISKVSRWNNGWVPTVEYEVIDVSTPSP-DCRTSSHRFKRRV 376 Query: 749 PNMCPEVIDLTNSPIFV 799 + E+IDLT SP F+ Sbjct: 377 TSGKSEMIDLTKSPTFI 393