BLASTX nr result
ID: Catharanthus23_contig00014969
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00014969
         (1366 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_003537333.1| PREDICTED: uncharacterized protein LOC100801...   335   2e-89
gb|ESW18854.1| hypothetical protein PHAVU_006G076300g [Phaseolus...   333   8e-89
ref|XP_006361297.1| PREDICTED: uncharacterized protein LOC102597...   326   1e-86
gb|EOY34667.1| Excinuclease ABC [Theobroma cacao]                     320   9e-85
ref|XP_006431991.1| hypothetical protein CICLE_v10001469mg [Citr...   318   3e-84
ref|XP_002276725.2| PREDICTED: structure-specific endonuclease s...   314   6e-83
gb|EXC19560.1| Structure-specific endonuclease subunit [Morus no...   311   4e-82
ref|XP_004246967.1| PREDICTED: uncharacterized protein LOC101267...   303   1e-79
ref|XP_002517715.1| nuclease, putative [Ricinus communis] gi|223...   303   1e-79
ref|XP_004294742.1| PREDICTED: uncharacterized protein LOC101299...   296   2e-77
ref|XP_004145233.1| PREDICTED: uncharacterized protein LOC101203...   295   4e-77
gb|EMJ06510.1| hypothetical protein PRUPE_ppa006827mg [Prunus pe...   288   3e-75
ref|XP_006410121.1| hypothetical protein EUTSA_v10016841mg [Eutr...   288   4e-75
emb|CBI15837.3| unnamed protein product [Vitis vinifera]              288   5e-75
gb|EMJ06505.1| hypothetical protein PRUPE_ppa006794mg [Prunus pe...   287   8e-75
ref|XP_002319418.2| hypothetical protein POPTR_0013s15190g, part...   284   7e-74
ref|XP_002325655.2| endo/excinuclease amino terminal domain-cont...   280   8e-73
ref|XP_006294402.1| hypothetical protein CARUB_v10023419mg, part...   278   3e-72
ref|NP_180594.2| Excinuclease ABC, C subunit, N-terminal [Arabid...   276   2e-71
ref|XP_004955835.1| PREDICTED: uncharacterized protein LOC101777...   273   2e-70

>ref|XP_003537333.1| PREDICTED: uncharacterized protein LOC100801307 [Glycine max] Length = 380 Score = 335 bits (860), Expect = 2e-89 Identities = 197/392 (50%), Positives = 240/392 (61%), Gaps = 16/392 (4%) Frame = -2 Query: 1224 ETLIRNQGENDENEREIADDEEGGGNAKFFACYLLTSLCPRFKGHTYIGFTVNPRRRIRQ 1045 E ++N G N++NE E D EG G FFACYLLTSL PRFKGHTYIGFTVNPRRRIRQ 13 EESVQNHGHNNQNENE---DCEGNG---FFACYLLTSLSPRFKGHTYIGFTVNPRRRIRQ 66 Query: 1044 HNGEIGSGAWRTKKRRPWEMVLCIYGFPTNVAALQFEWAWQHPIESLAVRNAAVTFKSLS 865 HNGEIG GAWRTKKRRPWEMVLCIYGFPTNV+ALQFEWAWQHP+ESLAVR AAV FKSLS 67 HNGEIGCGAWRTKKRRPWEMVLCIYGFPTNVSALQFEWAWQHPVESLAVRKAAVEFKSLS 126 Query: 864 GLANKIKLAYTMVTLPAWQSLNLTVNFFSTKYQKHIAGCPSLTGHMRVQVCSMDELPCYS 685 G+ANKIKLAYTM+TLP+WQS+N+TVNFFSTKY KH AGCPSL HM+ + S+DELPCY+ 127 GIANKIKLAYTMLTLPSWQSMNITVNFFSTKYMKHCAGCPSLPVHMKTKFGSLDELPCYN 186 Query: 684 G--TNWSMGEDDGWN----GDDCEHSAGSSHECTEDGLTEAQKIGDAAGPLNSCEGTEGD 523 S EDD + D+ ++GS + ++D +T P + +GD 187 KGIDGLSENEDDTIDEVQFDDNNISTSGSVPDVSDDLVT----------PDSPQNPNDGD 236 Query: 522 KHKR--NWMEENETRHEQGLWGEETGLERLNNSLIREEDNWQSLKLDDYPLRASSLCLSG 349 K W +E+E R L NS +E + +++SS 237 KISEAFEWNKESEAREPP-----------LGNSFASQEQSQLFSSTTPLTMKSSSTTSLQ 285 Query: 348 NVTNDAEDTGIPILLNDCGSQYDQLPEQLSPTTTTVVANKE----ETRSISSAVEVIDLF 181 ED + + +N + Q PE TT+VANK T + E+IDL 286 RAEIIEEDDFMSV-MNKSDADLSQ-PEPEQSGATTLVANKNRDVGRTFVVPHETEIIDLS 343 Query: 180 TPSPCCKASTGSKKRRICPEI----IDLTNSP 97 TPSP C++ KKRR+ + IDLTNSP 344 TPSPSCRSVLDRKKRRVSSSVGTDFIDLTNSP 375
>gb|ESW18854.1| hypothetical protein PHAVU_006G076300g [Phaseolus vulgaris] Length = 374 Score = 333 bits (855), Expect = 8e-89 Identities = 195/393 (49%), Positives = 240/393 (61%), Gaps = 8/393 (2%) Frame = -2 Query: 1251 RRKGRKDSSETLIRNQGENDENEREIADDEEGGGNAKFFACYLLTSLCPRFKGHTYIGFT 1072 RR + E ++N G N++NE+E + EG G FFACYLLTSL PR+KGHTYIGFT 4
RRVASVEEEEETLQNHG-NNQNEKE---NSEGNG---FFACYLLTSLSPRYKGHTYIGFT 56 Query: 1071 VNPRRRIRQHNGEIGSGAWRTKKRRPWEMVLCIYGFPTNVAALQFEWAWQHPIESLAVRN 892 VNPRRRIRQHNGEIG GAWRTKKRRPWEMVLCIYGFPTNV+ALQFEWAWQHP+ESLAVR Sbjct: 57 VNPRRRIRQHNGEIGCGAWRTKKRRPWEMVLCIYGFPTNVSALQFEWAWQHPVESLAVRK 116 Query: 891 AAVTFKSLSGLANKIKLAYTMVTLPAWQSLNLTVNFFSTKYQKHIAGCPSLTGHMRVQVC 712 AAV FKSLSG+ANKIKLAYTM+TLP+WQS+N+TVNFFSTKY KH AGCPSL HM+ ++ Sbjct: 117 AAVEFKSLSGIANKIKLAYTMLTLPSWQSMNITVNFFSTKYMKHCAGCPSLPAHMKTKIG 176 Query: 711 SMDELPCYSGTNWSMGEDDGWNGDDCEHSAGSSHECTEDGLTEAQKIGDAAGPLNSCEGT 532 +DELPCYS S EDD N DD E ++ + + + P N G Sbjct: 177 PLDELPCYSINGLSENEDD--NIDDVEFDDNNNTSASGSVPDVSDDLDSPDSPKNQIHGE 234 Query: 531 EGDKHKRNWMEENETRHEQGLWGEETGLERLNNSLIREEDNWQSLKLDDYPLRASSLCLS 352 + + W++E+E R E+G NS +E +++SS + Sbjct: 235 KISEAFDEWIKESEAR--------ESG-----NSFSSQEQRLPVSSTTPLTMKSSSTITT 281 Query: 351 G-NVTNDAEDTGIPILLNDCGSQYDQLPEQLSPTTTTVVANKEETRSISSAV----EVID 187 E+ ++N GS Q P Q + T+ AN T ++ V E+ID Sbjct: 282 PLQRIEIIEEADFMNVINRSGSGLSQ-PAQ---SGGTLEANTNRTAGSTAVVPHEAEIID 337 Query: 186 LFTPSPCCKASTGSKKRRI---CPEIIDLTNSP 97 L TPSP C KKRR+ + IDLTNSP Sbjct: 338 LSTPSPSC-GIVNRKKRRVPSFVTDFIDLTNSP 369 >ref|XP_006361297.1| PREDICTED: uncharacterized protein LOC102597488 [Solanum tuberosum] Length = 369 Score = 326 bits (836), Expect = 1e-86 Identities = 186/403 (46%), Positives = 235/403 (58%), Gaps = 3/403 (0%) Frame = -2 Query: 1287 KPRERSDRRTMGRRKGRKDSSETLIRNQGENDENEREIADDEEGGGNAKFFACYLLTSLC 1108 + R R R MG+RK R++ + E+ E E +FFACYLLTS+C Sbjct: 10 RERGREREREMGKRKERREQKKVCSEGGDESKEVEEN-----------RFFACYLLTSMC 58 Query: 1107 PRFKGHTYIGFTVNPRRRIRQHNGEIGSGAWRTKKRRPWEMVLCIYGFPTNVAALQFEWA 928 PRFKGHTYIGFTVNPRRRIRQHNGE+ GA RTK++RPWEM+LCIYGFPTNV+ALQFEWA Sbjct: 59 PRFKGHTYIGFTVNPRRRIRQHNGEVRMGALRTKRKRPWEMILCIYGFPTNVSALQFEWA 118 Query: 927 WQHPIESLAVRNAAVTFKSLSGLANKIKLAYTMVTLPAWQSLNLTVNFFSTKYQKHIAGC 748 WQHP+ES AVR AA +FK+L G+ANKIKLAY M+TLP WQSLNLTVNFFSTKY+ H AGC Sbjct: 119 
WQHPVESRAVRQAAASFKTLGGVANKIKLAYAMLTLPEWQSLNLTVNFFSTKYKMHSAGC 178 Query: 747 PSLTGHMRVQVCSMDELPCYSGTNWSMGEDDGWNGDDCEHSAGSSHECTEDGLTEAQKIG 568 PSL HMRV +C++DELPCY+G + + W +S E T++ + Sbjct: 179 PSLPEHMRVHICALDELPCYTGIDRDEYSTNEWE---------NSEELTDEISASSTNSN 229 Query: 567 DAAGPLNSCEGTEGDKHKRNWMEENETRHEQGLWGEETGLERLNNSLIREEDNWQSLKLD 388 + + E D +W E +E E G E S + Sbjct: 230 SSFSNQDKDSTDENDDEHTDWKELDERAGENSTCGRE-----------------HSYIII 272 Query: 387 DYPLRASSLCLSGNVTNDAEDTGIPILLNDCGSQYDQLPEQLSPTTTTVVANKEETRSIS 208 D P+ SS L G+ + A+ L ++ G + Q + S T +A K + Sbjct: 273 DSPVERSSSIL-GDFFHIADKKERHELDDEFGEK--QANKMCSTKTDDSLATK--NAGLP 327 Query: 207 SAVEVIDLFTPSPCCKASTGSKKRRI---CPEIIDLTNSPMSV 88 S +EVID+FTP PC K K+RR CPEIIDLT+SP+ V Sbjct: 328 SDIEVIDVFTP-PCSKVRADHKRRRFSASCPEIIDLTDSPIYV 369 >gb|EOY34667.1| Excinuclease ABC [Theobroma cacao] Length = 460 Score = 320 bits (820), Expect = 9e-85 Identities = 202/449 (44%), Positives = 250/449 (55%), Gaps = 68/449 (15%) Frame = -2 Query: 1248 RKGRKDSSETLIRN----------QGENDENEREIADDEEGGGNAKFFACYLLTSLCPRF 1099 RK + SETLI QG E RE DD++G FFACYLLTSL PR Sbjct: 16 RKRKAAGSETLINYYRQRRKSRDLQGGKAEEIRESGDDDKGKQGKGFFACYLLTSLSPRH 75 Query: 1098 KGHTYIGFTVNPRRRIRQHNGEIGSGAWRTKKRRPWEMVLCIYGFPTNVAALQFEWAWQH 919 KGHTYIGFTVNPRRRIRQHNGEIGSGAWRTK +RPWEMV+CIYGFPTNV+ALQFEWAWQH Sbjct: 76 KGHTYIGFTVNPRRRIRQHNGEIGSGAWRTKSKRPWEMVICIYGFPTNVSALQFEWAWQH 135 Query: 918 PIESLAVRNAAVTFKSLSGLANKIKLAYTMVTLPAWQSLNLTVNFFSTKYQKHIAGCPSL 739 P ES+AVR AA TFKSLSG+ANKIKLAYTM+TLPAWQSLN+TVN+FSTKY+K A CPSL Sbjct: 136 PQESVAVREAAATFKSLSGVANKIKLAYTMLTLPAWQSLNITVNYFSTKYRKDSACCPSL 195 Query: 738 TGHMRVQVCSMDELPCYSGTNWSMGEDDGWNGDDCEHSAGSSHECTEDGLTEAQKIGDAA 559 M+VQVCSM+ELPCY+ + +DD N D+ + + C T ++ +A+ Sbjct: 196 PEQMKVQVCSMNELPCYTEQDEFEYKDDCDNLDEYDE---VNDTCETVWETYPDEVVNAS 252 Query: 558 GPLNSCEGTEGDKHKRNWMEENETR----------------------------------- 484 E + ++EE +TR Sbjct: 253 ADNFLSSIHEASHEEFEYIEEYKTRKPVDSSTLGVHNIQPQVFIDSPTSKTSSIATGLPM 312 Query: 483 
-------HEQGLWGEETGLERLNNSLIREEDNWQSLKLDDYPLRAS-SLCLSGNVTNDAE 328 E+ L G E+ +S + Q D P+R S S S + AE Sbjct: 313 VGPQPWVREKLLTTTYEGYEKTGDSFTFGIYHTQPFDCDYSPVRTSPSFVTSLSDGETAE 372 Query: 327 DTGIPILLNDCGSQ----------YDQLPEQLSPTTTTVVANKEE--TRSISSAVEVIDL 184 I+ +C S+ D+ P + P TT V +++ + +++ VEVIDL Sbjct: 373 GANTSIIEKECPSRKQFTAVVAANEDRQPGE-KPLTTDVAVMEDQLPSSTVAHEVEVIDL 431 Query: 183 FTPSPCCKASTGSKKRRI---CPEIIDLT 106 TPSP C+ + SKKRRI PEIIDLT Sbjct: 432 LTPSPACRLISDSKKRRISMFSPEIIDLT 460 >ref|XP_006431991.1| hypothetical protein CICLE_v10001469mg [Citrus clementina] gi|568827655|ref|XP_006468166.1| PREDICTED: uncharacterized protein LOC102631105 [Citrus sinensis] gi|557534113|gb|ESR45231.1| hypothetical protein CICLE_v10001469mg [Citrus clementina] Length = 386 Score = 318 bits (815), Expect = 3e-84 Identities = 202/416 (48%), Positives = 250/416 (60%), Gaps = 30/416 (7%) Frame = -2 Query: 1263 RTMGRRKGRK--DSSETLI---------RNQGENDENEREIADDEEGGGNAKFFACYLLT 1117 R M +RKG K SETLI ++ E +E E++ D +G FFACYLLT Sbjct: 4 REMPKRKGSKAVHDSETLISKSKTLDPVKDDFEEEEEEQKAKDQRKG-----FFACYLLT 58 Query: 1116 SLCPRFKGHTYIGFTVNPRRRIRQHNGEIGSGAWRTKKRRPWEMVLCIYGFPTNVAALQF 937 SLCPRFKGHTYIGFTVNPRRRIRQHNGEI GA RTKKRRPWEMVLCIYGFPTNV+ALQF Sbjct: 59 SLCPRFKGHTYIGFTVNPRRRIRQHNGEIRCGAVRTKKRRPWEMVLCIYGFPTNVSALQF 118 Query: 936 EWAWQHPIESLAVRNAAVTFKSLSGLANKIKLAYTMVTLPAWQSLNLTVNFFSTKYQKHI 757 EWAWQHP+ESLAVR AA TFKS SG+ANKIKLAYTM+ LP W+SLN+TVN+FSTKY KH Sbjct: 119 EWAWQHPMESLAVRRAAATFKSFSGVANKIKLAYTMLNLPNWESLNITVNYFSTKYSKHS 178 Query: 756 AGCPSLTGHMRVQVCSMDELPCYSGTNWSMGEDDGWNGDDCEHSAGSSHECTEDGLTEAQ 577 + CP+L HM+VQV SMDELPCY+ D+ GD ED L + + Sbjct: 179 SSCPNLPEHMKVQVRSMDELPCYTE------RDERLLGD-------------EDSLGD-E 218 Query: 576 KIGDAAGPLNSCEGTEGDKHKRNWMEENETRHEQGLWGEETG-LERLNNSLIREEDNWQS 400 + +A+ S E T GD + + + +E E+ G ++ N R+ + Sbjct: 219 EYDEASENSGSLEETRGDVTINFSSDYSFSIYEDAY--EQCGQFKQYGNEQPRDSSCLEV 276 Query: 399 LKLDDYPLRASSLCLSGNVTNDAEDTGIPILLNDCGSQY---------DQLPEQLSP--T 253 + + L +S S + AEDT N+ G Q D+ +Q + + 
Sbjct: 277 NCQEPFGLLSSLETTSVISSTSAEDT------NELGRQRSEQCATAVNDEENQQFAQRQS 330 Query: 252 TTTVVANKEETRSISSA----VEVIDLFTPSPCCKASTGSKKRRI---CPEIIDLT 106 T VANK++ + SS VEVIDL TPSP C+ + SKKRR+ CP IIDLT Sbjct: 331 ITIEVANKDQLQVQSSTGLPNVEVIDLLTPSPNCREMSYSKKRRVSSLCPVIIDLT 386 >ref|XP_002276725.2| PREDICTED: structure-specific endonuclease subunit SLX1 homolog 2-like [Vitis vinifera] Length = 364 Score = 314 bits (804), Expect = 6e-83 Identities = 194/408 (47%), Positives = 235/408 (57%), Gaps = 18/408 (4%) Frame = -2 Query: 1257 MGRRKGRKDSSETLIRNQGENDENEREIADDEEGGGNAKFFACYLLTSLCPRFKGHTYIG 1078 M +RKGR + SE + ++ + D+ FFACYLL SL PR KGH+YIG Sbjct: 1 MTKRKGRSEISEETLNSEEKGDD----------------FFACYLLASLSPRHKGHSYIG 44 Query: 1077 FTVNPRRRIRQHNGEIGSGAWRTKKRRPWEMVLCIYGFPTNVAALQFEWAWQHPIESLAV 898 FTVNPRRRIRQHNGEI GAW+TK++RPWEMVLCIYGFPTNV+ALQFEWAWQHP ESLAV Sbjct: 45 FTVNPRRRIRQHNGEITCGAWKTKRKRPWEMVLCIYGFPTNVSALQFEWAWQHPTESLAV 104 Query: 897 RNAAVTFKSLSGLANKIKLAYTMVTLPAWQSLNLTVNFFSTKYQKHIAGCPSLTGHMRVQ 718 R AA FKSLSG+ANKIKLAYTM TLPAWQSLNLTVNFFSTKY KH AGCP L HMRVQ Sbjct: 105 RKAAAGFKSLSGIANKIKLAYTMFTLPAWQSLNLTVNFFSTKYTKHSAGCPILPEHMRVQ 164 Query: 717 VCSMDELPCYSGTNWSMGEDDGWNGDDCEH--SAGSSHECTEDGLTEAQKIGDAAGPLNS 544 V MDELPCYSG++ S D GD+ E GSS + + + + + G Sbjct: 165 VSPMDELPCYSGSDQSF--FDNARGDEKEELGERGSSSDGFDQVIAHEETALEQFG---- 218 Query: 543 CEGTEGDKHKRNWMEEN-----------ETRHEQGLWGEETGLERLNNSLIREEDNWQSL 397 W+EE+ E H G E + + S ++E Sbjct: 219 ------------WIEEHGLRQPGDSPSPEVVHCSGKTQENAMRQPADLSTSKDEHR-SPF 265 Query: 396 KLDDYPLRASSLCLSGNVTNDAEDTGIPILLNDCGSQYDQLPEQLSPTTTTVVANKEETR 217 L D P+R SS G T D + +G+ + L + P T K + Sbjct: 266 CLIDSPVRTSSHSTEG--TLDKDTSGL-------SKENKVLTMKQLPATVAADRGKPKIS 316 Query: 216 SI--SSAVEVIDLFTPSPCCKASTGSKKRR---ICPEIIDLTNSPMSV 88 S+ S +EVIDL + SP + + KKRR + PEIIDLTNSP+ V Sbjct: 317 SLDTSCEIEVIDLLSCSPDYRTNPCFKKRRATTVHPEIIDLTNSPIFV 364 >gb|EXC19560.1| Structure-specific endonuclease subunit [Morus notabilis] Length = 378 Score = 311 bits (797), Expect = 4e-82 Identities = 
180/377 (47%), Positives = 231/377 (61%), Gaps = 6/377 (1%) Frame = -2 Query: 1251 RRKGRKDSSETLIRNQGENDENEREIADDEEGGGNAKFFACYLLTSLCPRFKGHTYIGFT 1072 R++ +++ SETL + E EI DD E G F+ACYLL SL PR KGHTYIGFT Sbjct: 4 RKRAQREPSETLTQ------ELTVEIGDDGERKG---FYACYLLVSLSPRHKGHTYIGFT 54 Query: 1071 VNPRRRIRQHNGEIGSGAWRTKKRRPWEMVLCIYGFPTNVAALQFEWAWQHPIESLAVRN 892 VNPRRRIRQHNGEIG GAWRTKKRRPWEMVLCI+GFP+NV+ALQFEWAWQHP ESLAVR Sbjct: 55 VNPRRRIRQHNGEIGCGAWRTKKRRPWEMVLCIHGFPSNVSALQFEWAWQHPNESLAVRK 114 Query: 891 AAVTFKSLSGLANKIKLAYTMVTLPAWQSLNLTVNFFSTKYQKHIAGCPSLTGHMRVQVC 712 AA +FKSLSG+ANKIKLAYTM+TLP+WQSLN+TVN+FSTKY +H AGC SL H +V++C Sbjct: 115 AAASFKSLSGIANKIKLAYTMLTLPSWQSLNITVNYFSTKYTQHSAGCLSLPQHKKVKIC 174 Query: 711 SMDELPCYSGTNWSMGEDDG-WNGDDCEHSAGSSHECTEDGLTEAQKIGDAAGPLNSCEG 535 MDELPCY + + E++G W+ ++ AGS E E+ L+ Sbjct: 175 PMDELPCYVKGDEGLFENEGEWDNEE-RDEAGSGSESAEETLS----------------- 216 Query: 534 TEGDKHKRNWMEENETRHEQGLWGEETGLERLNNSLIREEDNWQSLKLDDYPLRASS-LC 358 N M N H ++ GL +L + ED + + P R SS + Sbjct: 217 --------NSMFGNTEEH------DKNGLGKLYGWITEGEDCREQSTFAELPARPSSNVS 262 Query: 357 LSGNVTND-AEDTGIPILLNDCGSQYDQLPEQLSPTTTTVVANKEETRS---ISSAVEVI 190 SG++ + +DTGI L D + P+ + V + ++ S + S VE+I Sbjct: 263 SSGSLAGEFTDDTGISGLFKD--ESFKSKRPAKDPSKSLVTIDDDQPPSSHIVPSEVEII 320 Query: 189 DLFTPSPCCKASTGSKK 139 D+ TPSP C++S K Sbjct: 321 DVTTPSPLCRSSLWGNK 337 >ref|XP_004246967.1| PREDICTED: uncharacterized protein LOC101267927 [Solanum lycopersicum] Length = 350 Score = 303 bits (775), Expect = 1e-79 Identities = 182/412 (44%), Positives = 239/412 (58%), Gaps = 22/412 (5%) Frame = -2 Query: 1257 MGRRKGRKDSSETLIRNQGENDENEREIADDEEGGGNAKFFACYLLTSLCPRFKGHTYIG 1078 MG+RK +K D+ ++E+ EG ++FFACYLLTS+CPRFKGHTYIG Sbjct: 1 MGKRKEQKKVCH--------RDDEDKEV----EG---SRFFACYLLTSMCPRFKGHTYIG 45 Query: 1077 FTVNPRRRIRQHNGEIGSGAWRTKKRRPWEMVLCIYGFPTNVAALQFEWAWQHPIESLAV 898 FTVNPRRRIRQHNGE+ GA RTK++RPWEM+LCIYGFPTNV+ALQFEWAWQHP+ES AV Sbjct: 46 FTVNPRRRIRQHNGEVRMGALRTKRKRPWEMILCIYGFPTNVSALQFEWAWQHPVESRAV 105 Query: 897 
RNAAVTFKSLSGLANKIKLAYTMVTLPAWQSLNLTVNFFSTKYQKHIAGCPSLTGHMRVQ 718 R AA +FK+L G+ANKIKLAYTM+TLP WQSLNLTVNFFSTKY+ H AGCPSL HMRV Sbjct: 106 RQAAASFKTLGGVANKIKLAYTMLTLPEWQSLNLTVNFFSTKYKMHSAGCPSLPEHMRVH 165 Query: 717 VCSMDELPCYSGTN---W-------------SMGEDDGWNGDDCEHSAGSSHECTEDGLT 586 +C++DELPCY+G + W + D+ N ++CE SS E T++ T Sbjct: 166 ICALDELPCYTGIDRDEWENICALDELPSYTGIDRDEWENREECE----SSEELTDEIST 221 Query: 585 EAQKIGDAAGPLNSCEGTEGDKHKRNWMEENETRHEQGLWGEETGLERLNNSLIREEDNW 406 + +S + D + +W E +E E G E Sbjct: 222 NSN---------SSFSNQDKDDEQTDWRELDERAGENSTRGRE----------------- 255 Query: 405 QSLKLDDYPLRASSLC-LSGNVTNDAEDTGIPILLNDCGSQYDQLPEQLSPTTTTVVANK 229 S + D P A LC + G+ + A+ + QL ++ + + Sbjct: 256 HSYIIIDSP--AERLCSIQGDFFHIADK-----------KERHQLDDEFGENQANKMYDS 302 Query: 228 EETRS--ISSAVEVIDLFTPSPCCKASTGSKKRRI---CPEIIDLTNSPMSV 88 T++ + +EVID+FTP +K+RR+ PEIIDLT+SP+ V Sbjct: 303 LATKNAGLPCDIEVIDVFTP----PVRADNKRRRLSASVPEIIDLTDSPVYV 350 >ref|XP_002517715.1| nuclease, putative [Ricinus communis] gi|223543113|gb|EEF44647.1| nuclease, putative [Ricinus communis] Length = 413 Score = 303 bits (775), Expect = 1e-79 Identities = 187/399 (46%), Positives = 232/399 (58%), Gaps = 44/399 (11%) Frame = -2 Query: 1170 DDEEGGGNAKFFACYLLTSLCPRFKGHTYIGFTVNPRRRIRQHNGEIGSGAWRTKKRRPW 991 D+EEG G F+ACYLLTSLCPRFKGHTYIGFTVNPRRRIRQHNGEI SGA+RTKKRRPW Sbjct: 20 DEEEGKG---FYACYLLTSLCPRFKGHTYIGFTVNPRRRIRQHNGEIRSGAFRTKKRRPW 76 Query: 990 EMVLCIYGFPTNVAALQFEWAWQHPIESLAVRNAAVTFKSLSGLANKIKLAYTMVTLPAW 811 EMV CIYGFPTNV+ALQFEWAWQHP+ESLAVR AA TFKS SG+ANKIKLAYTM+ L AW Sbjct: 77 EMVFCIYGFPTNVSALQFEWAWQHPMESLAVRQAAATFKSFSGVANKIKLAYTMLNLSAW 136 Query: 810 QSLNLTVNFFSTKYQKHIAGCPSLTGHMRVQVCSMDELPCYSGTNWSM--------GEDD 655 QSLN+TVN+FSTKY A CPSL HM++QVC + ELPCY T S G DD Sbjct: 137 QSLNITVNYFSTKYSILSAACPSLPEHMKIQVCPVVELPCYKETGESSLECQDAEDGFDD 196 Query: 654 GWNGDDCEHSAGSSHECTEDGLTEA-QKIGD-AAGPLNSCEGTEGDKHKRNWMEENETRH 481 N ++ +G+ T + +++ K D G + EG + + +K E NE Sbjct: 197 KENYENTTSESGAVKGKTVEFQSQSLDKFPDFNRGEEIAFEGQDSNSNKDE--EYNEVSQ 
254 Query: 480 EQGLWGE-ETGLERLNNSLIREEDNWQSLKL---DDYPLRASSL--------------C- 358 + G + T +S D+W K +DY R SL C Sbjct: 255 KNGTLDQIRTDAFGQISSDNSHTDDWTCEKFGSCEDYSTRHPSLKNTSADYPPAPKVDCA 314 Query: 357 ------LSGNVTNDAED--TGIPILLNDCGSQYDQLPEQLSPTTT----TVVANKEETRS 214 S ++ A TG PI G + + +S + + ++ + Sbjct: 315 RPFGFPTSNSLVRTASSLCTGFPISETSNGDELMLINNSVSDLGSRNGKILTGKDDKDKP 374 Query: 213 ISSAVEVIDLFTPSPCCKASTGSKKRR---ICPEIIDLT 106 I +EVIDL +PSP C+ + KKRR +CP+IIDLT Sbjct: 375 IPQEIEVIDLLSPSPECRIMSSRKKRRFLTVCPQIIDLT 413 >ref|XP_004294742.1| PREDICTED: uncharacterized protein LOC101299940 [Fragaria vesca subsp. vesca] Length = 400 Score = 296 bits (757), Expect = 2e-77 Identities = 183/409 (44%), Positives = 237/409 (57%), Gaps = 36/409 (8%) Frame = -2 Query: 1215 IRNQGENDENEREIADDEEGGGNAKFFACYLLTSLCPRFKGHTYIGFTVNPRRRIRQHNG 1036 +R Q + +E +EE GG +FFACYLLTS CPR+KGHTYIGFTVNPRRRIRQHNG Sbjct: 1 MRQQRSKNPSETLTMPEEEEGG--RFFACYLLTSRCPRYKGHTYIGFTVNPRRRIRQHNG 58 Query: 1035 EIGSGAWRTKKRRPWEMVLCIYGFPTNVAALQFEWAWQHPIESLAVRNAAVTFKSLSGLA 856 EIG GAWRTKK+RPWEM LCIYGFPTN +ALQFEWAWQ+P S AVR AA FKSL G A Sbjct: 59 EIGRGAWRTKKKRPWEMALCIYGFPTNTSALQFEWAWQNPYVSKAVRKAAANFKSLGGFA 118 Query: 855 NKIKLAYTMVTLPAWQSLNLTVNFFSTKYQKHIAGCPSLTGHMRVQVCSMDELPCYSGTN 676 NKIKLAYTM+TLP W+SLNLTVNFFST++ KH AGCP L M+V++C MDELP + Sbjct: 119 NKIKLAYTMLTLPPWESLNLTVNFFSTEHTKHAAGCPRLPEQMKVKICPMDELPSCISDD 178 Query: 675 WSMGEDDGWN---GDDCEHSAGSSHECTEDGLTEAQK-IGDAAGPLNSCEGTEGDKHKRN 508 S ED+ +N D+ + + S + + IG+ + + + + G+ N Sbjct: 179 VSDNEDEWYNEKENDETMNISTLSEPVVPNSADDQHNDIGNRSNEVYAQDKEVGEDEWYN 238 Query: 507 WMEENETRHEQGLWGEETGLERLNNSLIREEDNWQSLKLD-------------------- 388 +E + GL EET L+N ++R+ N L++D Sbjct: 239 DKVSDEAMN-SGLSWEET----LSNFMVRDSAN--DLEMDTGNTSSQVSRCNEEVQEDIT 291 Query: 387 ----DYPLRAS-SLCLSGNVTNDAEDTGIPILLNDCGSQYDQLPEQLSPTTTTVVANKEE 223 PLR S + T +++ G L +D + D+ + SP +VA++E+ Sbjct: 292 GEFITSPLRMPYSNVIPSFDTEASKNIG---LFDDSTVELDRPARKQSP--AIIVADEEQ 346 Query: 222 TRSIS----SAVEVIDLFTPSPCCKASTGSKKRRI---CPEIIDLTNSP 97 + S 
EV+DL TPSP C+ KK R+ PEIIDLT SP Sbjct: 347 SPRNSYLRPCDSEVVDLITPSPLCRNGLCGKKSRVPTSYPEIIDLTKSP 395 >ref|XP_004145233.1| PREDICTED: uncharacterized protein LOC101203492 [Cucumis sativus] gi|449471301|ref|XP_004153269.1| PREDICTED: uncharacterized protein LOC101204996 [Cucumis sativus] gi|449506301|ref|XP_004162709.1| PREDICTED: uncharacterized protein LOC101229010 [Cucumis sativus] Length = 395 Score = 295 bits (754), Expect = 4e-77 Identities = 179/411 (43%), Positives = 231/411 (56%), Gaps = 30/411 (7%) Frame = -2 Query: 1239 RKDSSETLIRNQGENDENEREIADDEEGGGNAKFFACYLLTSLCPRFKGHTYIGFTVNPR 1060 RK+ E E +++E E +E G FF+CYLL S CPRFKGHTYIGFTVNP+ Sbjct: 4 RKEKPEICKTTDEEKEDDEEEERGNEVNG----FFSCYLLASACPRFKGHTYIGFTVNPK 59 Query: 1059 RRIRQHNGEIGSGAWRTKKRRPWEMVLCIYGFPTNVAALQFEWAWQHPIESLAVRNAAVT 880 RRIRQHNGEI GAWRTK++RPWEMVLCIYGFPTNV+ALQFEWAWQHP ESLAVR+AA T Sbjct: 60 RRIRQHNGEIRCGAWRTKRKRPWEMVLCIYGFPTNVSALQFEWAWQHPNESLAVRSAAAT 119 Query: 879 FKSLSGLANKIKLAYTMVTLPAWQSLNLTVNFFSTKYQKHIAGCPSLTGHMRVQVCSMDE 700 FKSLSG+ANK+KLAYTM+TLPAW+ LN+TVN+FSTK+ K+ AGCPSL HM+VQV ++E Sbjct: 120 FKSLSGVANKVKLAYTMLTLPAWRGLNITVNYFSTKFMKNAAGCPSLPEHMKVQVSPINE 179 Query: 699 LPCYSGTNWSMGEDDGWNGDDCEHSAGSSHECTEDGLTEAQKIGDAAGP--LNSCEGTEG 526 LPCYS + M E++G D E++ C +++ + ++ GT+G Sbjct: 180 LPCYSEGDQDMLENEG----DWEYNREREEICGFRVYGSMKEVSNEVPQKLMDYQTGTDG 235 Query: 525 DKHK--RNWMEENETRH-----------------------EQGLWGEETGLERLNNSLIR 421 R +E ET ++GL +E S I Sbjct: 236 RPPHVLRGCDKELETNEQVPPSSCTPSYIDVGMSYDLCACDEGLENDEREAASCGQSCIV 295 Query: 420 EEDNWQSLKLDDYPLRASSLCLSGNVTNDAEDTGIPILLNDCGSQYDQLPEQLSPTTTTV 241 + + +DD L G+ N E G L + S+ ++ + TV Sbjct: 296 AGTSRTEIVIDD----EEENQLEGSSMNLQEQPGRENLTSGIASEISKVSRWNNGWVPTV 351 Query: 240 VANKEETRSISSAVEVIDLFTPSPCCKASTGSKKRRIC---PEIIDLTNSP 97 EVID+ TPSP C+ S+ KRR+ E+IDLT SP Sbjct: 352 ------------EYEVIDVSTPSPDCRTSSHRFKRRVTSGKSEMIDLTKSP 390 >gb|EMJ06510.1| hypothetical protein PRUPE_ppa006827mg [Prunus persica] Length = 393 Score = 288 bits (738), Expect = 3e-75 Identities = 177/407 
(43%), Positives = 227/407 (55%), Gaps = 23/407 (5%) Frame = -2 Query: 1248 RKGRKDSSETLIRNQGENDENEREIADDEEGGGNAKFFACYLLTSLCPRFKGHTYIGFTV 1069 R+ RK SE GE E E +FFACYLLTS PR+KGHTYIGFTV Sbjct: 2 RQRRKIGSEIPENRIGEEKEAEE-----------GRFFACYLLTSRSPRYKGHTYIGFTV 50 Query: 1068 NPRRRIRQHNGEIGSGAWRTKKRRPWEMVLCIYGFPTNVAALQFEWAWQHPIESLAVRNA 889 NPRRRIRQHNGEIG GAWRTK++RPWEMVLCIYGFPTNV+ALQFEWAWQ+P S AVR A Sbjct: 51 NPRRRIRQHNGEIGQGAWRTKRKRPWEMVLCIYGFPTNVSALQFEWAWQNPTVSKAVRQA 110 Query: 888 AVTFKSLSGLANKIKLAYTMVTLPAWQSLNLTVNFFSTKYQKHIAGCPSLTGHMRVQVCS 709 A +FKSL GLA+KIKLAYTM+TLP WQSLN+T+NFFST+Y KH AGCP L M+V+VCS Sbjct: 111 AASFKSLGGLASKIKLAYTMLTLPPWQSLNITINFFSTQYTKHSAGCPRLPEQMKVKVCS 170 Query: 708 MDELP-CYSGTNWSMGEDDGWNGDDCEHSAGSSHECTEDGLTEAQKIGDAAGPLNS---C 541 MDELP C ++ + +D W + E ED T + D+ +N C Sbjct: 171 MDELPSCTKLSDDLLENEDEWCNEG---------EFDEDMNTTDDQQSDSGNRMNEVYRC 221 Query: 540 EGTEGDKHKRNWMEENETRHEQGLWGEETGLERLNNSLIREEDNWQSLK----------- 394 G+ N E +E ++ L E + + +S ++DN Sbjct: 222 SKEVGEDEWYNGRECDEAMNDGTLQEETSSDLIVQSSADDQQDNTAKTNKAHQGSQEVGE 281 Query: 393 --LDDYPLRASSLCLSGNVTNDAEDTGIPILLNDCGSQYDQLPEQLSPTTTTVVANKEE- 223 + + AS + + + T + + + +L + TT+VA+ + Sbjct: 282 DCTEQFGFIASPVRTPSSNVTTSFGTEVTKDIGSADAISVKLGQPAMEQLTTIVADHQSP 341 Query: 222 TRSI--SSAVEVIDLFTPSPCCKASTGSKKRRIC---PEIIDLTNSP 97 +RS EVIDL TP+ C++ KK R+ P IIDLT SP Sbjct: 342 SRSYLRPCGAEVIDLTTPASLCRSHLCGKKSRVAPVYPRIIDLTKSP 388 >ref|XP_006410121.1| hypothetical protein EUTSA_v10016841mg [Eutrema salsugineum] gi|557111290|gb|ESQ51574.1| hypothetical protein EUTSA_v10016841mg [Eutrema salsugineum] Length = 364 Score = 288 bits (737), Expect = 4e-75 Identities = 180/392 (45%), Positives = 221/392 (56%), Gaps = 5/392 (1%) Frame = -2 Query: 1257 MGRRKGRKDSSETLIRNQGENDENEREIADDEEGGGNAK-FFACYLLTSLCPRFKGHTYI 1081 M ++GR+ + +TL +A+D G K FFACY+LTSL PR KGHTYI Sbjct: 1 MREKRGRRGNPKTL-----------DSVAEDGVTGKEGKGFFACYILTSLSPRHKGHTYI 49 Query: 1080 GFTVNPRRRIRQHNGEIGSGAWRTKKRRPWEMVLCIYGFPTNVAALQFEWAWQHPIESLA 901 GFTVNPRRRIRQHNGEI 
SGA+RTKK+RPWEMVLCIYGFPTNV+ALQFEWAWQHP ESLA Sbjct: 50 GFTVNPRRRIRQHNGEITSGAYRTKKKRPWEMVLCIYGFPTNVSALQFEWAWQHPRESLA 109 Query: 900 VRNAAVTFKSLSGLANKIKLAYTMVTLPAWQSLNLTVNFFSTKYQKHIAGCPSLTGHMRV 721 VR AA FKS SGL +KIKLAYTM+TLPAW SLNLTVN+FSTKY H PSL HM+V Sbjct: 110 VREAAAAFKSFSGLGSKIKLAYTMLTLPAWNSLNLTVNYFSTKYAHHGGLSPSLPPHMKV 169 Query: 720 QVCSMDELPCYSG-TNWSMGEDDGWNGDDCEHSAGSSHECTEDGLTEAQKIGDAAGPLNS 544 QVC+MD+LPC++ N S ED E S S E +D E Q N Sbjct: 170 QVCAMDDLPCFTKLDNNSQPED--------EESLDSHEEEEDDRRNEIQPGNLTTSSSND 221 Query: 543 CEGTEGDKHKRNWMEENETRHE-QGLWGEETGLERLNNSLIREEDNWQSLKLDDYPLRAS 367 E + H R++ + + TG L+ S+ ED + + Sbjct: 222 LYLGEKELHDRDFEKAKQPEAVLDDRLANFTGFGSLDESV---EDEVSHITVGSIEA--- 275 Query: 366 SLCLSGNVTNDAEDTGIPILLNDCGSQYDQLPEQLSPTTTTVVANKEETRSISSA--VEV 193 + + E L N G + + E + +T + I+S VEV Sbjct: 276 -------MEKEPETVFDDRLANFTGFGLEDIVEDVISHSTMEKDCWRRSNLITSTTEVEV 328 Query: 192 IDLFTPSPCCKASTGSKKRRICPEIIDLTNSP 97 IDL TPSP C+ K++R+ E IDLT SP Sbjct: 329 IDLMTPSPSCRVGPSMKRQRV-SEFIDLTRSP 359 >emb|CBI15837.3| unnamed protein product [Vitis vinifera] Length = 346 Score = 288 bits (736), Expect = 5e-75 Identities = 146/219 (66%), Positives = 163/219 (74%), Gaps = 2/219 (0%) Frame = -2 Query: 1257 MGRRKGRKDSSETLIRNQGENDENEREIADDEEGGGNAKFFACYLLTSLCPRFKGHTYIG 1078 M +RKGR + SE + ++ + D+ FFACYLL SL PR KGH+YIG Sbjct: 1 MTKRKGRSEISEETLNSEEKGDD----------------FFACYLLASLSPRHKGHSYIG 44 Query: 1077 FTVNPRRRIRQHNGEIGSGAWRTKKRRPWEMVLCIYGFPTNVAALQFEWAWQHPIESLAV 898 FTVNPRRRIRQHNGEI GAW+TK++RPWEMVLCIYGFPTNV+ALQFEWAWQHP ESLAV Sbjct: 45 FTVNPRRRIRQHNGEITCGAWKTKRKRPWEMVLCIYGFPTNVSALQFEWAWQHPTESLAV 104 Query: 897 RNAAVTFKSLSGLANKIKLAYTMVTLPAWQSLNLTVNFFSTKYQKHIAGCPSLTGHMRVQ 718 R AA FKSLSG+ANKIKLAYTM TLPAWQSLNLTVNFFSTKY KH AGCP L HMRVQ Sbjct: 105 RKAAAGFKSLSGIANKIKLAYTMFTLPAWQSLNLTVNFFSTKYTKHSAGCPILPEHMRVQ 164 Query: 717 VCSMDELPCYSGTNWSMGEDDGWNGDDCEH--SAGSSHE 607 V MDELPCYSG++ S D GD+ E GSS + Sbjct: 165 VSPMDELPCYSGSDQSF--FDNARGDEKEELGERGSSSD 201 >gb|EMJ06505.1| hypothetical protein 
PRUPE_ppa006794mg [Prunus persica] Length = 395 Score = 287 bits (734), Expect = 8e-75 Identities = 185/414 (44%), Positives = 236/414 (57%), Gaps = 29/414 (7%) Frame = -2 Query: 1251 RRKGRKDSSETLIRNQGENDENEREIADDEEGGGNAKFFACYLLTSLCPRFKGHTYIGFT 1072 RRK + ETLI + E++E +FFACYLL+S PR+KGHTYIGFT Sbjct: 4 RRKIGSEIPETLIGEEKESEEG--------------RFFACYLLSSRSPRYKGHTYIGFT 49 Query: 1071 VNPRRRIRQHNGEIGSGAWRTKKRRPWEMVLCIYGFPTNVAALQFEWAWQHPIESLAVRN 892 VNPRRRIRQHNGEI GAWRTK++RPWEMVLCIYGFPTNV+ALQFEWAWQ+P S AVR Sbjct: 50 VNPRRRIRQHNGEIAQGAWRTKRKRPWEMVLCIYGFPTNVSALQFEWAWQNPTVSKAVRQ 109 Query: 891 AAVTFKSLSGLANKIKLAYTMVTLPAWQSLNLTVNFFSTKYQKHIAGCPSLTGHMRVQVC 712 AA +FKSL GL +KIKLAYTM+TLP WQSLN+TVNFFST+Y KH AGC L M+V+VC Sbjct: 110 AAASFKSLGGLVSKIKLAYTMLTLPPWQSLNITVNFFSTQYTKHSAGCLRLPEQMKVKVC 169 Query: 711 SMDELP-CYSGTNWSMGEDDGWNGDDCEHSAGSSHECTEDGLTEAQKIGDAAGPLNSCEG 535 SMDELP C ++ +D W C H T D +++ K + C Sbjct: 170 SMDELPSCTKISDDLFENEDEW----CNEREFDEHMNTNDQQSDSGKRINEV-----CSK 220 Query: 534 TEGDKHKRNWMEENETRHEQGLWGEETGLERLNNSLIREEDNW-----------QSLKLD 388 G+ N E +E ++ L E + +S ++DN Q + D Sbjct: 221 EVGEDEWYNGRECDEAVNDGTLQEETLSDLIVQSSADDQQDNTGKTINKAYRCSQEVGED 280 Query: 387 ---DYPLRASSLCL-SGNVTND-----AEDTGIPILLN-DCGSQYDQLPEQLSPTTTTVV 238 + AS + + S NVT +DTG ++ G + EQL TT+V Sbjct: 281 CTEQFGFIASPMRMPSSNVTTSFDTEVTKDTGSADAISVKLGRPAMEQLEQL----TTIV 336 Query: 237 ANKEETRSIS----SAVEVIDLFTPSPCCKASTGSKKRRIC---PEIIDLTNSP 97 A+ +++ S S EVIDL TP+P C++ KK R+ P+IIDLT SP Sbjct: 337 ADDDQSPSRSYLRPCGAEVIDLTTPAPLCRSHLCGKKSRVASVYPQIIDLTKSP 390 >ref|XP_002319418.2| hypothetical protein POPTR_0013s15190g, partial [Populus trichocarpa] gi|550325896|gb|EEE95341.2| hypothetical protein POPTR_0013s15190g, partial [Populus trichocarpa] Length = 431 Score = 284 bits (726), Expect = 7e-74 Identities = 150/267 (56%), Positives = 179/267 (67%), Gaps = 6/267 (2%) Frame = -2 Query: 1191 ENEREIADDEEGGGNAKFFACYLLTSLCPRFKGHTYIGFTVNPRRRIRQHNGEIGSGAWR 1012 +N +E+ + E+G FFACYLLTSLCPRFKGHTYIGFTVNPRRRIRQHNGE+ SGA R Sbjct: 9 
KNPQELGEAEKGKNG--FFACYLLTSLCPRFKGHTYIGFTVNPRRRIRQHNGELRSGACR 66 Query: 1011 TKKRRPWEMVLCIYGFPTNVAALQFEWAWQHPIESLAVRNAAVTFKSLSGLANKIKLAYT 832 TKKRRPWEMV CIYGFPTNVAALQFEWAWQHP ES+AVR AA FKS SG+ANKIKLAYT Sbjct: 67 TKKRRPWEMVFCIYGFPTNVAALQFEWAWQHPTESVAVRQAAAAFKSFSGVANKIKLAYT 126 Query: 831 MVTLPAWQSLNLTVNFFSTKYQKHIAGCPSLTGHMRVQVCSMDELPCYSGTNWSMGE--- 661 M+ LP+WQSLN+T+N+FST Y+ H GCPSL +M+VQ+C MDELPCY + + E Sbjct: 127 MLNLPSWQSLNITINYFSTNYKVHSVGCPSLPKNMKVQICPMDELPCYCDSGDILFEERE 186 Query: 660 -DDGWNGDDCEHSAGSSHECTEDGLTEAQKIGDAAGPLNS--CEGTEGDKHKRNWMEENE 490 +D W+G++ E+ S G EA + L+ C GD N E Sbjct: 187 NEDAWDGEE-EYERASD----GSGTFEANLVELVVSSLDELPCYNGRGD----NIFE--- 234 Query: 489 TRHEQGLWGEETGLERLNNSLIREEDN 409 G +GE E N S + E+ N Sbjct: 235 -----GGYGETASREACNKSAVHEKYN 256 >ref|XP_002325655.2| endo/excinuclease amino terminal domain-containing family protein, partial [Populus trichocarpa] gi|550317584|gb|EEF00037.2| endo/excinuclease amino terminal domain-containing family protein, partial [Populus trichocarpa] Length = 212 Score = 280 bits (717), Expect = 8e-73 Identities = 132/194 (68%), Positives = 154/194 (79%), Gaps = 4/194 (2%) Frame = -2 Query: 1152 GNAKFFACYLLTSLCPRFKGHTYIGFTVNPRRRIRQHNGEIGSGAWRTKKRRPWEMVLCI 973 G FFACYLLTSLCPRFKGHTYIGFTVNPRRRIRQHNGE+ SGA RTKKRRPWEMV+C+ Sbjct: 11 GKNGFFACYLLTSLCPRFKGHTYIGFTVNPRRRIRQHNGELRSGACRTKKRRPWEMVICV 70 Query: 972 YGFPTNVAALQFEWAWQHPIESLAVRNAAVTFKSLSGLANKIKLAYTMVTLPAWQSLNLT 793 YGFPTNVAALQFEWAWQHP ES+AVR AA FKS SG+ANKIKLAYTM+ LP+WQSLN+T Sbjct: 71 YGFPTNVAALQFEWAWQHPTESVAVRQAAAAFKSFSGVANKIKLAYTMLNLPSWQSLNIT 130 Query: 792 VNFFSTKYQKHIAGCPSLTGHMRVQVCSMDELPCYSGTNWSMGE----DDGWNGDDCEHS 625 VN+FST+Y+ H AGCPSL +M+VQ+C M+ELPCYS ++ E +D W+G++ Sbjct: 131 VNYFSTQYKVHSAGCPSLPKNMKVQICPMNELPCYSDFVDNLFEERDDEDAWDGEEEYER 190 Query: 624 AGSSHECTEDGLTE 583 A + L E Sbjct: 191 ASDGSGMVDANLVE 204 >ref|XP_006294402.1| hypothetical protein CARUB_v10023419mg, partial [Capsella rubella] gi|482563110|gb|EOA27300.1| hypothetical protein CARUB_v10023419mg, 
partial [Capsella rubella] Length = 382 Score = 278 bits (712), Expect = 3e-72 Identities = 176/407 (43%), Positives = 227/407 (55%), Gaps = 14/407 (3%) Frame = -2 Query: 1275 RSDRRTMGRRKGRKDSSETLIRNQGENDENEREIADDEEGGGNAKFFACYLLTSLCPRFK 1096 RSDR + + R+ + +TL D + +EG G FFACYLLTSL PR K Sbjct: 1 RSDRERETKMRERRGNRKTL-------DPAGEDGVTGKEGKG---FFACYLLTSLSPRHK 50 Query: 1095 GHTYIGFTVNPRRRIRQHNGEIGSGAWRTKKRRPWEMVLCIYGFPTNVAALQFEWAWQHP 916 G TYIGFTVNPRRRIRQHNGEI GAWRTKK+RPWEMVLCIYGFPTNV+ALQFEWAWQHP Sbjct: 51 GQTYIGFTVNPRRRIRQHNGEITCGAWRTKKKRPWEMVLCIYGFPTNVSALQFEWAWQHP 110 Query: 915 IESLAVRNAAVTFKSLSGLANKIKLAYTMVTLPAWQSLNLTVNFFSTKYQKHIAGCPSLT 736 ESLAVR AA FKS G+A KIKL YTM+ LPAW SLNLTVN+FS+KY + PSL Sbjct: 111 RESLAVREAAAAFKSFPGIAGKIKLVYTMLNLPAWNSLNLTVNYFSSKYAHYGGLAPSLP 170 Query: 735 GHMRVQVCSMDELPCYSG-TNWSMGEDDGWNGDDCEHSAGSSHECTEDGLTEAQKIGDAA 559 HM+V+VC+M++LP ++ N S EDD S + E ++ ++Q A Sbjct: 171 LHMKVEVCAMEDLPYFTKLDNSSQPEDD--------ESPEVNEEAEDEDSNQSQPGNSGA 222 Query: 558 GPLNSCEGTEGDKHKRNWMEENE--TRHEQGLWGEETGLERLNNSLIREEDNWQ---SLK 394 + E + H R++ + E T ++ +G L + +E + S++ Sbjct: 223 SSQDDLYPGEKELHDRHFEKAKEPVTVLDEDRLANFSGFGSLEEEAVEDEVSHSPVGSIE 282 Query: 393 LDDYPLRASSLCLSGNVTN-------DAEDTGIPILLNDCGSQYDQ-LPEQLSPTTTTVV 238 + D + N T + E+ + N + D + L +TTT V Sbjct: 283 VMDKEPETVFVDRLANFTGFGLVEIVEDEEVSHGTVRNTEAMEKDSWIRRNLITSTTTEV 342 Query: 237 ANKEETRSISSAVEVIDLFTPSPCCKASTGSKKRRICPEIIDLTNSP 97 VEVIDL TPSP C+A + K+RR+ E IDLT SP Sbjct: 343 -----------DVEVIDLMTPSPSCRAGSSMKRRRV-SEFIDLTRSP 377 >ref|NP_180594.2| Excinuclease ABC, C subunit, N-terminal [Arabidopsis thaliana] gi|51968920|dbj|BAD43152.1| hypothetical protein [Arabidopsis thaliana] gi|51968928|dbj|BAD43156.1| hypothetical protein [Arabidopsis thaliana] gi|51971411|dbj|BAD44370.1| hypothetical protein [Arabidopsis thaliana] gi|66792676|gb|AAY56440.1| At2g30350 [Arabidopsis thaliana] gi|330253280|gb|AEC08374.1| Excinuclease ABC, C subunit, N-terminal [Arabidopsis thaliana] Length = 368 Score = 276 bits (705), Expect = 
2e-71 Identities = 171/376 (45%), Positives = 212/376 (56%), Gaps = 16/376 (4%) Frame = -2 Query: 1176 IADDEEGGGNAK-FFACYLLTSLCPRFKGHTYIGFTVNPRRRIRQHNGEIGSGAWRTKKR 1000 + +D G + K FFACYLLTSL PR KG TYIGFTVNPRRRIRQHNGEI SGAWRTKK+ Sbjct: 14 VGEDGVTGKDGKGFFACYLLTSLSPRHKGQTYIGFTVNPRRRIRQHNGEITSGAWRTKKK 73 Query: 999 RPWEMVLCIYGFPTNVAALQFEWAWQHPIESLAVRNAAVTFKSLSGLANKIKLAYTMVTL 820 RPWEMVLCIYGFPTNV+ALQFEWAWQHP ES+AVR AA FKS SG+A+KIKL YTM+ L Sbjct: 74 RPWEMVLCIYGFPTNVSALQFEWAWQHPRESVAVREAAAAFKSFSGVASKIKLVYTMLNL 133 Query: 819 PAWQSLNLTVNFFSTKYQKHIAGCPSLTGHMRVQVCSMDELPCYSGTNWSMGEDDGWNGD 640 PAW SLNLTVN+FS+KY H PSL HM+VQVC+M++L ++ + S +D Sbjct: 134 PAWNSLNLTVNYFSSKYAHHGGKSPSLPLHMKVQVCAMEDLQYFTKVDDSSQPED----- 188 Query: 639 DCEHSAGSSHECTEDGLTEAQKIGDAAGPLNSCEGTEGDKHKRNWMEENETRHEQ----- 475 E S + E +D ++ + P NS + D+H E E+ Sbjct: 189 --EESPEVNEEDDDDDDDDSSNLSQ---PGNSNTSSSDDRHFEKAKEPVTVFDEEDRLAN 243 Query: 474 ----GLWGEETGLE----RLNNSLIREEDNWQSLKLDDYPLRASSLCLSG--NVTNDAED 325 GL EE +E + IR + +D R +S G + D Sbjct: 244 FSGFGLLDEEETVEDEVSHITVGSIRATEKEPETVFND---RLASFTCFGLVEIVEDEVS 300 Query: 324 TGIPILLNDCGSQYDQLPEQLSPTTTTVVANKEETRSISSAVEVIDLFTPSPCCKASTGS 145 G + + ++ TTT V VEVIDL TPSP C+A + Sbjct: 301 HGTIGSTEAMEKECRKRRNHITSTTTEV------------DVEVIDLMTPSPSCRAGSSM 348 Query: 144 KKRRICPEIIDLTNSP 97 K+RR+ E IDLT SP Sbjct: 349 KRRRV-SEFIDLTMSP 363 >ref|XP_004955835.1| PREDICTED: uncharacterized protein LOC101777363 [Setaria italica] Length = 377 Score = 273 bits (697), Expect = 2e-70 Identities = 159/366 (43%), Positives = 202/366 (55%), Gaps = 9/366 (2%) Frame = -2 Query: 1158 GGGNAKFFACYLLTSLCPRFKGHTYIGFTVNPRRRIRQHNGEIGSGAWRTKKRRPWEMVL 979 GGG FF CYLL SLCPR K TYIGFTVNPRRRIRQHNGEI SGAWRT++ RPWEMVL Sbjct: 46 GGG---FFCCYLLRSLCPRSKIRTYIGFTVNPRRRIRQHNGEIASGAWRTRRGRPWEMVL 102 Query: 978 CIYGFPTNVAALQFEWAWQHPIESLAVRNAAVTFKSLSGLANKIKLAYTMVTLPAWQSLN 799 CIYGFP+NVAALQFEWAWQHP ESLAVR AA FKSL G+ NK+KLAYTM+ LP+W+SLN Sbjct: 103 CIYGFPSNVAALQFEWAWQHPAESLAVRKAAAEFKSLGGIGNKVKLAYTMLNLPSWESLN 162 Query: 798 
LTVNFFSTKYQKHIAGCPSLTGHMRVQVCSMDELPCYSGTNWSMGEDDGWNGDDCEHSAG 619 LTVNFFS+K K AGCPSL M+ VC+M++L C + S +D + D + + Sbjct: 163 LTVNFFSSKNTKFTAGCPSLPSQMKTVVCAMEDLQCSAEGPSSEDDDLSQDPQDQQEQSD 222 Query: 618 S-------SHECTEDGLTEAQKIGDAAGPLNSCEGTEGDKHKRNWMEENETRHEQGLWGE 460 S S + G Q D A P+ G G + + ++ R + Sbjct: 223 SPLQDDEHSQHYEQSGHCWQQPSSDQAQPMVGQTGIAGPDVEEDPIDGFGPRKWSEILDI 282 Query: 459 ETGLERLNNSLIREEDNWQSLKLDDYPLRASSLCLSGNVTNDAEDTGIPILLNDCGSQYD 280 T ++ S SL LSG +DCG+ + Sbjct: 283 RTEVDEPRTS------------------PRCSLSLSG---------------DDCGTATE 309 Query: 279 QLPEQLSPTTTTVVANKEE--TRSISSAVEVIDLFTPSPCCKASTGSKKRRICPEIIDLT 106 P LSP A ++ + + +V+DL TP+P + +CP+IIDLT Sbjct: 310 DEPGHLSPLLMFGAAGSDDGGGHILDGSADVVDLVTPTPVGRLRRRGCVASVCPKIIDLT 369 Query: 105 NSPMSV 88 +SP+ + Sbjct: 370 SSPVVI 375