BLASTX nr result

ID: Atropa21_contig00017658 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00017658
         (2027 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006361297.1| PREDICTED: uncharacterized protein LOC102597...   512   e-142
ref|XP_004246967.1| PREDICTED: uncharacterized protein LOC101267...   471   e-130
ref|XP_002276725.2| PREDICTED: structure-specific endonuclease s...   325   5e-86
gb|EMJ06505.1| hypothetical protein PRUPE_ppa006794mg [Prunus pe...   311   7e-82
ref|XP_006431991.1| hypothetical protein CICLE_v10001469mg [Citr...   310   1e-81
gb|EMJ06510.1| hypothetical protein PRUPE_ppa006827mg [Prunus pe...   309   3e-81
ref|XP_004294742.1| PREDICTED: uncharacterized protein LOC101299...   308   8e-81
ref|XP_003537333.1| PREDICTED: uncharacterized protein LOC100801...   306   2e-80
gb|ESW18854.1| hypothetical protein PHAVU_006G076300g [Phaseolus...   306   3e-80
gb|EXC19560.1| Structure-specific endonuclease subunit [Morus no...   298   6e-78
ref|XP_002517715.1| nuclease, putative [Ricinus communis] gi|223...   297   1e-77
ref|XP_004145233.1| PREDICTED: uncharacterized protein LOC101203...   296   3e-77
emb|CBI15837.3| unnamed protein product [Vitis vinifera]              295   4e-77
gb|EOY34667.1| Excinuclease ABC [Theobroma cacao]                     293   2e-76
ref|XP_002325655.2| endo/excinuclease amino terminal domain-cont...   285   4e-74
ref|XP_002319418.2| hypothetical protein POPTR_0013s15190g, part...   283   3e-73
ref|XP_006410121.1| hypothetical protein EUTSA_v10016841mg [Eutr...   281   8e-73
ref|NP_180594.2| Excinuclease ABC, C subunit, N-terminal [Arabid...   277   1e-71
ref|XP_006294402.1| hypothetical protein CARUB_v10023419mg, part...   275   7e-71
gb|EOY18169.1| Excinuclease ABC [Theobroma cacao]                     273   2e-70

>ref|XP_006361297.1| PREDICTED: uncharacterized protein LOC102597488 [Solanum tuberosum]
          Length = 369

 Score =  512 bits (1319), Expect = e-142
 Identities = 259/340 (76%), Positives = 286/340 (84%), Gaps = 2/340 (0%)
 Frame = -1

Query: 1892 QESRFFACYLLTSICPRFKGHTYIGFTVNPRRRIRQHNGEVGMGAWRTKKKRPWEMILCI 1713
            +E+RFFACYLLTS+CPRFKGHTYIGFTVNPRRRIRQHNGEV MGA RTK+KRPWEMILCI
Sbjct: 44   EENRFFACYLLTSMCPRFKGHTYIGFTVNPRRRIRQHNGEVRMGALRTKRKRPWEMILCI 103

Query: 1712 YGFPTNVSALQFEWAWQHPVESRAVRQAATSFKTLGGVANKIKLAYTMLTLPEWRSLNLT 1533
            YGFPTNVSALQFEWAWQHPVESRAVRQAA SFKTLGGVANKIKLAY MLTLPEW+SLNLT
Sbjct: 104  YGFPTNVSALQFEWAWQHPVESRAVRQAAASFKTLGGVANKIKLAYAMLTLPEWQSLNLT 163

Query: 1532 VNFFSTKYKMHSAGCPSLPEHVRVQVCALDELPCYTGIDRDEYTTNDWDNSEEYEESSEK 1353
            VNFFSTKYKMHSAGCPSLPEH+RV +CALDELPCYTGIDRDEY+TN+W+NSEE  +    
Sbjct: 164  VNFFSTKYKMHSAGCPSLPEHMRVHICALDELPCYTGIDRDEYSTNEWENSEELTDE--- 220

Query: 1352 LTDGRSTDSNSSFSNQ--DSTEENVDEHIDWREELNVHAEEISTPSREDPEHSYIIIDSP 1179
                 ST+SNSSFSNQ  DST+EN DEH DW+ EL+  A E ST  R   EHSYIIIDSP
Sbjct: 221  -ISASSTNSNSSFSNQDKDSTDENDDEHTDWK-ELDERAGENSTCGR---EHSYIIIDSP 275

Query: 1178 LRTSSPIQSGSFDIADNKDRYELDDEFGEELGQQANMLCSTKTDAYHDPLPTKNAGLSSE 999
            +  SS I    F IAD K+R+ELDDEFGE   +QAN +CSTKTD   D L TKNAGL S+
Sbjct: 276  VERSSSILGDFFHIADKKERHELDDEFGE---KQANKMCSTKTD---DSLATKNAGLPSD 329

Query: 998  VEVIDLFTPPCYKVGADNKKRRLSTTYPEIIDLTDSPIHV 879
            +EVID+FTPPC KV AD+K+RR S + PEIIDLTDSPI+V
Sbjct: 330  IEVIDVFTPPCSKVRADHKRRRFSASCPEIIDLTDSPIYV 369


>ref|XP_004246967.1| PREDICTED: uncharacterized protein LOC101267927 [Solanum
            lycopersicum]
          Length = 350

 Score =  471 bits (1212), Expect = e-130
 Identities = 252/360 (70%), Positives = 277/360 (76%), Gaps = 19/360 (5%)
 Frame = -1

Query: 1901 DDEQE----SRFFACYLLTSICPRFKGHTYIGFTVNPRRRIRQHNGEVGMGAWRTKKKRP 1734
            DDE +    SRFFACYLLTS+CPRFKGHTYIGFTVNPRRRIRQHNGEV MGA RTK+KRP
Sbjct: 14   DDEDKEVEGSRFFACYLLTSMCPRFKGHTYIGFTVNPRRRIRQHNGEVRMGALRTKRKRP 73

Query: 1733 WEMILCIYGFPTNVSALQFEWAWQHPVESRAVRQAATSFKTLGGVANKIKLAYTMLTLPE 1554
            WEMILCIYGFPTNVSALQFEWAWQHPVESRAVRQAA SFKTLGGVANKIKLAYTMLTLPE
Sbjct: 74   WEMILCIYGFPTNVSALQFEWAWQHPVESRAVRQAAASFKTLGGVANKIKLAYTMLTLPE 133

Query: 1553 WRSLNLTVNFFSTKYKMHSAGCPSLPEHVRVQVCALDELPCYTGIDRDEYTT-------- 1398
            W+SLNLTVNFFSTKYKMHSAGCPSLPEH+RV +CALDELPCYTGIDRDE+          
Sbjct: 134  WQSLNLTVNFFSTKYKMHSAGCPSLPEHMRVHICALDELPCYTGIDRDEWENICALDELP 193

Query: 1397 -------NDWDNSEEYEESSEKLTDGRSTDSNSSFSNQDSTEENVDEHIDWREELNVHAE 1239
                   ++W+N EE  ESSE+LTD  ST+SNSSFSNQD      DE  DWR EL+  A 
Sbjct: 194  SYTGIDRDEWENREEC-ESSEELTDEISTNSNSSFSNQDKD----DEQTDWR-ELDERAG 247

Query: 1238 EISTPSREDPEHSYIIIDSPLRTSSPIQSGSFDIADNKDRYELDDEFGEELGQQANMLCS 1059
            E ST  R   EHSYIIIDSP      IQ   F IAD K+R++LDDEFGE    QAN +  
Sbjct: 248  ENSTRGR---EHSYIIIDSPAERLCSIQGDFFHIADKKERHQLDDEFGE---NQANKM-- 299

Query: 1058 TKTDAYHDPLPTKNAGLSSEVEVIDLFTPPCYKVGADNKKRRLSTTYPEIIDLTDSPIHV 879
                  +D L TKNAGL  ++EVID+FTPP   V ADNK+RRLS + PEIIDLTDSP++V
Sbjct: 300  ------YDSLATKNAGLPCDIEVIDVFTPP---VRADNKRRRLSASVPEIIDLTDSPVYV 350


>ref|XP_002276725.2| PREDICTED: structure-specific endonuclease subunit SLX1 homolog
            2-like [Vitis vinifera]
          Length = 364

 Score =  325 bits (833), Expect = 5e-86
 Identities = 182/365 (49%), Positives = 223/365 (61%), Gaps = 25/365 (6%)
 Frame = -1

Query: 1898 DEQESRFFACYLLTSICPRFKGHTYIGFTVNPRRRIRQHNGEVGMGAWRTKKKRPWEMIL 1719
            +E+   FFACYLL S+ PR KGH+YIGFTVNPRRRIRQHNGE+  GAW+TK+KRPWEM+L
Sbjct: 18   EEKGDDFFACYLLASLSPRHKGHSYIGFTVNPRRRIRQHNGEITCGAWKTKRKRPWEMVL 77

Query: 1718 CIYGFPTNVSALQFEWAWQHPVESRAVRQAATSFKTLGGVANKIKLAYTMLTLPEWRSLN 1539
            CIYGFPTNVSALQFEWAWQHP ES AVR+AA  FK+L G+ANKIKLAYTM TLP W+SLN
Sbjct: 78   CIYGFPTNVSALQFEWAWQHPTESLAVRKAAAGFKSLSGIANKIKLAYTMFTLPAWQSLN 137

Query: 1538 LTVNFFSTKYKMHSAGCPSLPEHVRVQVCALDELPCYTGIDRDEYTTNDWDNSEEYEESS 1359
            LTVNFFSTKY  HSAGCP LPEH+RVQV  +DELPCY+G D+  +     D  EE     
Sbjct: 138  LTVNFFSTKYTKHSAGCPILPEHMRVQVSPMDELPCYSGSDQSFFDNARGDEKEEL---- 193

Query: 1358 EKLTDGRSTDSNSSFSNQDSTEENVDEHIDWREELN------------VHAE-------- 1239
                 G    S+  F    + EE   E   W EE              VH          
Sbjct: 194  -----GERGSSSDGFDQVIAHEETALEQFGWIEEHGLRQPGDSPSPEVVHCSGKTQENAM 248

Query: 1238 ----EISTPSREDPEHSYIIIDSPLRTSSPIQSGSFDIADNKDRYELDDEFGEELGQQAN 1071
                ++ST S+++    + +IDSP+RTSS    G+ D    KD   L  E   ++     
Sbjct: 249  RQPADLST-SKDEHRSPFCLIDSPVRTSSHSTEGTLD----KDTSGLSKE--NKVLTMKQ 301

Query: 1070 MLCSTKTDAYHDPLPTKNAGLSSEVEVIDLFT-PPCYKVGADNKKRRLSTTYPEIIDLTD 894
            +  +   D     + + +   S E+EVIDL +  P Y+     KKRR +T +PEIIDLT+
Sbjct: 302  LPATVAADRGKPKISSLDT--SCEIEVIDLLSCSPDYRTNPCFKKRRATTVHPEIIDLTN 359

Query: 893  SPIHV 879
            SPI V
Sbjct: 360  SPIFV 364


>gb|EMJ06505.1| hypothetical protein PRUPE_ppa006794mg [Prunus persica]
          Length = 395

 Score =  311 bits (797), Expect = 7e-82
 Identities = 179/374 (47%), Positives = 222/374 (59%), Gaps = 37/374 (9%)
 Frame = -1

Query: 1898 DEQESRFFACYLLTSICPRFKGHTYIGFTVNPRRRIRQHNGEVGMGAWRTKKKRPWEMIL 1719
            + +E RFFACYLL+S  PR+KGHTYIGFTVNPRRRIRQHNGE+  GAWRTK+KRPWEM+L
Sbjct: 21   ESEEGRFFACYLLSSRSPRYKGHTYIGFTVNPRRRIRQHNGEIAQGAWRTKRKRPWEMVL 80

Query: 1718 CIYGFPTNVSALQFEWAWQHPVESRAVRQAATSFKTLGGVANKIKLAYTMLTLPEWRSLN 1539
            CIYGFPTNVSALQFEWAWQ+P  S+AVRQAA SFK+LGG+ +KIKLAYTMLTLP W+SLN
Sbjct: 81   CIYGFPTNVSALQFEWAWQNPTVSKAVRQAAASFKSLGGLVSKIKLAYTMLTLPPWQSLN 140

Query: 1538 LTVNFFSTKYKMHSAGCPSLPEHVRVQVCALDELPCYTGIDRDEYTTND-WDNSEEYEES 1362
            +TVNFFST+Y  HSAGC  LPE ++V+VC++DELP  T I  D +   D W N  E++E 
Sbjct: 141  ITVNFFSTQYTKHSAGCLRLPEQMKVKVCSMDELPSCTKISDDLFENEDEWCNEREFDEH 200

Query: 1361 SEKLTDGRSTDSNSSFSNQDSTEENVDEHIDWRE----------------ELNVH----- 1245
                T+ + +DS    +   S E   DE  + RE                +L V      
Sbjct: 201  MN--TNDQQSDSGKRINEVCSKEVGEDEWYNGRECDEAVNDGTLQEETLSDLIVQSSADD 258

Query: 1244 -----------AEEISTPSREDPEHSYIIIDSPLRTSSPIQSGSFDIADNKDRYELDD-- 1104
                       A   S    ED    +  I SP+R  S   + SFD    KD    D   
Sbjct: 259  QQDNTGKTINKAYRCSQEVGEDCTEQFGFIASPMRMPSSNVTTSFDTEVTKDTGSADAIS 318

Query: 1103 -EFGEELGQQANMLCSTKTDAYHDPLPTKNAGLSSEVEVIDLFTP-PCYKVGADNKKRRL 930
             + G    +Q   L +   D   D  P+++       EVIDL TP P  +     KK R+
Sbjct: 319  VKLGRPAMEQLEQLTTIVAD--DDQSPSRSYLRPCGAEVIDLTTPAPLCRSHLCGKKSRV 376

Query: 929  STTYPEIIDLTDSP 888
            ++ YP+IIDLT SP
Sbjct: 377  ASVYPQIIDLTKSP 390


>ref|XP_006431991.1| hypothetical protein CICLE_v10001469mg [Citrus clementina]
            gi|568827655|ref|XP_006468166.1| PREDICTED:
            uncharacterized protein LOC102631105 [Citrus sinensis]
            gi|557534113|gb|ESR45231.1| hypothetical protein
            CICLE_v10001469mg [Citrus clementina]
          Length = 386

 Score =  310 bits (795), Expect = 1e-81
 Identities = 176/352 (50%), Positives = 226/352 (64%), Gaps = 19/352 (5%)
 Frame = -1

Query: 1895 EQESRFFACYLLTSICPRFKGHTYIGFTVNPRRRIRQHNGEVGMGAWRTKKKRPWEMILC 1716
            +Q   FFACYLLTS+CPRFKGHTYIGFTVNPRRRIRQHNGE+  GA RTKK+RPWEM+LC
Sbjct: 46   DQRKGFFACYLLTSLCPRFKGHTYIGFTVNPRRRIRQHNGEIRCGAVRTKKRRPWEMVLC 105

Query: 1715 IYGFPTNVSALQFEWAWQHPVESRAVRQAATSFKTLGGVANKIKLAYTMLTLPEWRSLNL 1536
            IYGFPTNVSALQFEWAWQHP+ES AVR+AA +FK+  GVANKIKLAYTML LP W SLN+
Sbjct: 106  IYGFPTNVSALQFEWAWQHPMESLAVRRAAATFKSFSGVANKIKLAYTMLNLPNWESLNI 165

Query: 1535 TVNFFSTKYKMHSAGCPSLPEHVRVQVCALDELPCYTGIDRDEYTTNDWDN--SEEYEES 1362
            TVN+FSTKY  HS+ CP+LPEH++VQV ++DELPCYT  +RDE    D D+   EEY+E+
Sbjct: 166  TVNYFSTKYSKHSSSCPNLPEHMKVQVRSMDELPCYT--ERDERLLGDEDSLGDEEYDEA 223

Query: 1361 SEKLTDGRSTDSNSSFSNQDSTEENVDEHIDWREELNVHAEEISTPSREDPEHSYII--- 1191
            SE    G   ++    +   S++ +   + D  E+      +      E P  S  +   
Sbjct: 224  SE--NSGSLEETRGDVTINFSSDYSFSIYEDAYEQCG----QFKQYGNEQPRDSSCLEVN 277

Query: 1190 ------IDSPLRTSSPIQSGSFDIADNKDRYE-------LDDEFGEELGQQANMLCSTKT 1050
                  + S L T+S I S S +  +   R         ++DE  ++  Q+ ++   T  
Sbjct: 278  CQEPFGLLSSLETTSVISSTSAEDTNELGRQRSEQCATAVNDEENQQFAQRQSI---TIE 334

Query: 1049 DAYHDPLPTKNAGLSSEVEVIDLFTP-PCYKVGADNKKRRLSTTYPEIIDLT 897
             A  D L  +++     VEVIDL TP P  +  + +KKRR+S+  P IIDLT
Sbjct: 335  VANKDQLQVQSSTGLPNVEVIDLLTPSPNCREMSYSKKRRVSSLCPVIIDLT 386


>gb|EMJ06510.1| hypothetical protein PRUPE_ppa006827mg [Prunus persica]
          Length = 393

 Score =  309 bits (792), Expect = 3e-81
 Identities = 181/372 (48%), Positives = 224/372 (60%), Gaps = 35/372 (9%)
 Frame = -1

Query: 1898 DEQESRFFACYLLTSICPRFKGHTYIGFTVNPRRRIRQHNGEVGMGAWRTKKKRPWEMIL 1719
            + +E RFFACYLLTS  PR+KGHTYIGFTVNPRRRIRQHNGE+G GAWRTK+KRPWEM+L
Sbjct: 21   EAEEGRFFACYLLTSRSPRYKGHTYIGFTVNPRRRIRQHNGEIGQGAWRTKRKRPWEMVL 80

Query: 1718 CIYGFPTNVSALQFEWAWQHPVESRAVRQAATSFKTLGGVANKIKLAYTMLTLPEWRSLN 1539
            CIYGFPTNVSALQFEWAWQ+P  S+AVRQAA SFK+LGG+A+KIKLAYTMLTLP W+SLN
Sbjct: 81   CIYGFPTNVSALQFEWAWQNPTVSKAVRQAAASFKSLGGLASKIKLAYTMLTLPPWQSLN 140

Query: 1538 LTVNFFSTKYKMHSAGCPSLPEHVRVQVCALDELPCYTGIDRDEYTTND-WDNSEEYEES 1362
            +T+NFFST+Y  HSAGCP LPE ++V+VC++DELP  T +  D     D W N  E++E 
Sbjct: 141  ITINFFSTQYTKHSAGCPRLPEQMKVKVCSMDELPSCTKLSDDLLENEDEWCNEGEFDED 200

Query: 1361 SEKLTDGRSTDSNSSFSN--QDSTEENVDEHIDWRE----------------ELNVH--- 1245
                TD + +DS +  +   + S E   DE  + RE                +L V    
Sbjct: 201  M-NTTDDQQSDSGNRMNEVYRCSKEVGEDEWYNGRECDEAMNDGTLQEETSSDLIVQSSA 259

Query: 1244 ------------AEEISTPSREDPEHSYIIIDSPLRTSSPIQSGSFDIADNKDRYELDDE 1101
                        A + S    ED    +  I SP+RT S   + SF     KD     D 
Sbjct: 260  DDQQDNTAKTNKAHQGSQEVGEDCTEQFGFIASPVRTPSSNVTTSFGTEVTKD-IGSADA 318

Query: 1100 FGEELGQQANMLCSTKTDAYHDPLPTKNAGLSSEVEVIDLFTPPCY-KVGADNKKRRLST 924
               +LGQ A    +T    +    P+++       EVIDL TP    +     KK R++ 
Sbjct: 319  ISVKLGQPAMEQLTTIVADHQS--PSRSYLRPCGAEVIDLTTPASLCRSHLCGKKSRVAP 376

Query: 923  TYPEIIDLTDSP 888
             YP IIDLT SP
Sbjct: 377  VYPRIIDLTKSP 388


>ref|XP_004294742.1| PREDICTED: uncharacterized protein LOC101299940 [Fragaria vesca
            subsp. vesca]
          Length = 400

 Score =  308 bits (788), Expect = 8e-81
 Identities = 175/381 (45%), Positives = 227/381 (59%), Gaps = 43/381 (11%)
 Frame = -1

Query: 1901 DDEQESRFFACYLLTSICPRFKGHTYIGFTVNPRRRIRQHNGEVGMGAWRTKKKRPWEMI 1722
            ++E+  RFFACYLLTS CPR+KGHTYIGFTVNPRRRIRQHNGE+G GAWRTKKKRPWEM 
Sbjct: 17   EEEEGGRFFACYLLTSRCPRYKGHTYIGFTVNPRRRIRQHNGEIGRGAWRTKKKRPWEMA 76

Query: 1721 LCIYGFPTNVSALQFEWAWQHPVESRAVRQAATSFKTLGGVANKIKLAYTMLTLPEWRSL 1542
            LCIYGFPTN SALQFEWAWQ+P  S+AVR+AA +FK+LGG ANKIKLAYTMLTLP W SL
Sbjct: 77   LCIYGFPTNTSALQFEWAWQNPYVSKAVRKAAANFKSLGGFANKIKLAYTMLTLPPWESL 136

Query: 1541 NLTVNFFSTKYKMHSAGCPSLPEHVRVQVCALDELPCYTGIDRDEYTTNDWDNSEEYEES 1362
            NLTVNFFST++  H+AGCP LPE ++V++C +DELP     D  +   ++W N +E +E+
Sbjct: 137  NLTVNFFSTEHTKHAAGCPRLPEQMKVKICPMDELPSCISDDVSD-NEDEWYNEKENDET 195

Query: 1361 ------SEKLTDGRSTDSNSSFSNQDS-------------------TEENVDEHIDWREE 1257
                  SE +    + D ++   N+ +                   ++E ++  + W E 
Sbjct: 196  MNISTLSEPVVPNSADDQHNDIGNRSNEVYAQDKEVGEDEWYNDKVSDEAMNSGLSWEET 255

Query: 1256 LNVH----------------AEEISTPSREDPEH-SYIIIDSPLRTSSPIQSGSFDIADN 1128
            L+                  + ++S  + E  E  +   I SPLR        SFD   +
Sbjct: 256  LSNFMVRDSANDLEMDTGNTSSQVSRCNEEVQEDITGEFITSPLRMPYSNVIPSFDTEAS 315

Query: 1127 KDRYELDDEFGEELGQQANMLCSTKTDAYHDPLPTKNAGLSSEVEVIDLFTP-PCYKVGA 951
            K+   L D+   EL + A         A  +  P  +     + EV+DL TP P  + G 
Sbjct: 316  KN-IGLFDDSTVELDRPARKQSPAIIVADEEQSPRNSYLRPCDSEVVDLITPSPLCRNGL 374

Query: 950  DNKKRRLSTTYPEIIDLTDSP 888
              KK R+ T+YPEIIDLT SP
Sbjct: 375  CGKKSRVPTSYPEIIDLTKSP 395


>ref|XP_003537333.1| PREDICTED: uncharacterized protein LOC100801307 [Glycine max]
          Length = 380

 Score =  306 bits (785), Expect = 2e-80
 Identities = 166/349 (47%), Positives = 220/349 (63%), Gaps = 11/349 (3%)
 Frame = -1

Query: 1901 DDEQESRFFACYLLTSICPRFKGHTYIGFTVNPRRRIRQHNGEVGMGAWRTKKKRPWEMI 1722
            +D + + FFACYLLTS+ PRFKGHTYIGFTVNPRRRIRQHNGE+G GAWRTKK+RPWEM+
Sbjct: 28   EDCEGNGFFACYLLTSLSPRFKGHTYIGFTVNPRRRIRQHNGEIGCGAWRTKKRRPWEMV 87

Query: 1721 LCIYGFPTNVSALQFEWAWQHPVESRAVRQAATSFKTLGGVANKIKLAYTMLTLPEWRSL 1542
            LCIYGFPTNVSALQFEWAWQHPVES AVR+AA  FK+L G+ANKIKLAYTMLTLP W+S+
Sbjct: 88   LCIYGFPTNVSALQFEWAWQHPVESLAVRKAAVEFKSLSGIANKIKLAYTMLTLPSWQSM 147

Query: 1541 NLTVNFFSTKYKMHSAGCPSLPEHVRVQVCALDELPCYT-GIDRDEYTTNDWDNSEEYEE 1365
            N+TVNFFSTKY  H AGCPSLP H++ +  +LDELPCY  GID      +D  +  ++++
Sbjct: 148  NITVNFFSTKYMKHCAGCPSLPVHMKTKFGSLDELPCYNKGIDGLSENEDDTIDEVQFDD 207

Query: 1364 SSEKLTDGRSTDSNSSFSNQDSTE-----ENVDEHIDWREELNVHAEEISTPSREDPEHS 1200
            ++   T G   D +      DS +     + + E  +W +E       +        +  
Sbjct: 208  NNIS-TSGSVPDVSDDLVTPDSPQNPNDGDKISEAFEWNKESEAREPPLGNSFASQEQSQ 266

Query: 1199 YIIIDSPL--RTSSPIQSGSFDIADNKDRYELDDEFGEELGQ-QANMLCSTKTDAYHDPL 1029
                 +PL  ++SS       +I +  D   + ++   +L Q +     +T   A  +  
Sbjct: 267  LFSSTTPLTMKSSSTTSLQRAEIIEEDDFMSVMNKSDADLSQPEPEQSGATTLVANKNRD 326

Query: 1028 PTKNAGLSSEVEVIDLFTP-PCYKVGADNKKRRLSTTY-PEIIDLTDSP 888
              +   +  E E+IDL TP P  +   D KKRR+S++   + IDLT+SP
Sbjct: 327  VGRTFVVPHETEIIDLSTPSPSCRSVLDRKKRRVSSSVGTDFIDLTNSP 375


>gb|ESW18854.1| hypothetical protein PHAVU_006G076300g [Phaseolus vulgaris]
          Length = 374

 Score =  306 bits (783), Expect = 3e-80
 Identities = 166/350 (47%), Positives = 220/350 (62%), Gaps = 12/350 (3%)
 Frame = -1

Query: 1901 DDEQESRFFACYLLTSICPRFKGHTYIGFTVNPRRRIRQHNGEVGMGAWRTKKKRPWEMI 1722
            ++ + + FFACYLLTS+ PR+KGHTYIGFTVNPRRRIRQHNGE+G GAWRTKK+RPWEM+
Sbjct: 27   ENSEGNGFFACYLLTSLSPRYKGHTYIGFTVNPRRRIRQHNGEIGCGAWRTKKRRPWEMV 86

Query: 1721 LCIYGFPTNVSALQFEWAWQHPVESRAVRQAATSFKTLGGVANKIKLAYTMLTLPEWRSL 1542
            LCIYGFPTNVSALQFEWAWQHPVES AVR+AA  FK+L G+ANKIKLAYTMLTLP W+S+
Sbjct: 87   LCIYGFPTNVSALQFEWAWQHPVESLAVRKAAVEFKSLSGIANKIKLAYTMLTLPSWQSM 146

Query: 1541 NLTVNFFSTKYKMHSAGCPSLPEHVRVQVCALDELPCYT--GIDRDEYTTNDWDNSEEYE 1368
            N+TVNFFSTKY  H AGCPSLP H++ ++  LDELPCY+  G+  +E   +D  +  E++
Sbjct: 147  NITVNFFSTKYMKHCAGCPSLPAHMKTKIGPLDELPCYSINGLSENE---DDNIDDVEFD 203

Query: 1367 ESSEKLTDGRSTDSNSSFSNQDSTE-----ENVDEHID-WREELNVHAEEISTPSRED-- 1212
            +++     G   D +    + DS +     E + E  D W +E        S  S+E   
Sbjct: 204  DNNNTSASGSVPDVSDDLDSPDSPKNQIHGEKISEAFDEWIKESEARESGNSFSSQEQRL 263

Query: 1211 --PEHSYIIIDSPLRTSSPIQSGSFDIADNKDRYELDDEFGEELGQQANMLCSTKTDAYH 1038
                 + + + S    ++P+Q    +I +  D   + +  G  L Q A        +A  
Sbjct: 264  PVSSTTPLTMKSSSTITTPLQ--RIEIIEEADFMNVINRSGSGLSQPAQ--SGGTLEANT 319

Query: 1037 DPLPTKNAGLSSEVEVIDLFTPPCYKVGADNKKRRLSTTYPEIIDLTDSP 888
            +      A +  E E+IDL TP       + KKRR+ +   + IDLT+SP
Sbjct: 320  NRTAGSTAVVPHEAEIIDLSTPSPSCGIVNRKKRRVPSFVTDFIDLTNSP 369


>gb|EXC19560.1| Structure-specific endonuclease subunit [Morus notabilis]
          Length = 378

 Score =  298 bits (763), Expect = 6e-78
 Identities = 166/349 (47%), Positives = 210/349 (60%), Gaps = 14/349 (4%)
 Frame = -1

Query: 1901 DDEQESRFFACYLLTSICPRFKGHTYIGFTVNPRRRIRQHNGEVGMGAWRTKKKRPWEMI 1722
            DD +   F+ACYLL S+ PR KGHTYIGFTVNPRRRIRQHNGE+G GAWRTKK+RPWEM+
Sbjct: 25   DDGERKGFYACYLLVSLSPRHKGHTYIGFTVNPRRRIRQHNGEIGCGAWRTKKRRPWEMV 84

Query: 1721 LCIYGFPTNVSALQFEWAWQHPVESRAVRQAATSFKTLGGVANKIKLAYTMLTLPEWRSL 1542
            LCI+GFP+NVSALQFEWAWQHP ES AVR+AA SFK+L G+ANKIKLAYTMLTLP W+SL
Sbjct: 85   LCIHGFPSNVSALQFEWAWQHPNESLAVRKAAASFKSLSGIANKIKLAYTMLTLPSWQSL 144

Query: 1541 NLTVNFFSTKYKMHSAGCPSLPEHVRVQVCALDELPCYTGIDRDEY-TTNDWDNSEEYEE 1365
            N+TVN+FSTKY  HSAGC SLP+H +V++C +DELPCY   D   +    +WDN EE +E
Sbjct: 145  NITVNYFSTKYTQHSAGCLSLPQHKKVKICPMDELPCYVKGDEGLFENEGEWDN-EERDE 203

Query: 1364 SSEKLTDGRSTDSNSSFSNQDSTEEN-VDEHIDWREELNVHAEEISTPSREDPEHSYIII 1188
            +         T SNS F N +  ++N + +   W  E             ED        
Sbjct: 204  AGSGSESAEETLSNSMFGNTEEHDKNGLGKLYGWITE------------GEDCREQSTFA 251

Query: 1187 DSPLRTSSPIQSGSFDIADNKDRYELDDEFGEE--LGQQANMLCSTKTDAYHDPLPTKNA 1014
            + P R SS + S      +  D   +   F +E    ++     S       D  P  + 
Sbjct: 252  ELPARPSSNVSSSGSLAGEFTDDTGISGLFKDESFKSKRPAKDPSKSLVTIDDDQPPSSH 311

Query: 1013 GLSSEVEVIDLFTP-PCYKVG---------ADNKKRRLSTTYPEIIDLT 897
             + SEVE+ID+ TP P  +           A NK+   +    E++DLT
Sbjct: 312  IVPSEVEIIDVTTPSPLCRSSLWGNKANKRARNKEPHNAPGEVEVVDLT 360


>ref|XP_002517715.1| nuclease, putative [Ricinus communis] gi|223543113|gb|EEF44647.1|
            nuclease, putative [Ricinus communis]
          Length = 413

 Score =  297 bits (760), Expect = 1e-77
 Identities = 176/403 (43%), Positives = 229/403 (56%), Gaps = 68/403 (16%)
 Frame = -1

Query: 1901 DDEQESRFFACYLLTSICPRFKGHTYIGFTVNPRRRIRQHNGEVGMGAWRTKKKRPWEMI 1722
            D+E+   F+ACYLLTS+CPRFKGHTYIGFTVNPRRRIRQHNGE+  GA+RTKK+RPWEM+
Sbjct: 20   DEEEGKGFYACYLLTSLCPRFKGHTYIGFTVNPRRRIRQHNGEIRSGAFRTKKRRPWEMV 79

Query: 1721 LCIYGFPTNVSALQFEWAWQHPVESRAVRQAATSFKTLGGVANKIKLAYTMLTLPEWRSL 1542
             CIYGFPTNVSALQFEWAWQHP+ES AVRQAA +FK+  GVANKIKLAYTML L  W+SL
Sbjct: 80   FCIYGFPTNVSALQFEWAWQHPMESLAVRQAAATFKSFSGVANKIKLAYTMLNLSAWQSL 139

Query: 1541 NLTVNFFSTKYKMHSAGCPSLPEHVRVQVCALDELPCYTGIDRDEYTTND----WDNSEE 1374
            N+TVN+FSTKY + SA CPSLPEH+++QVC + ELPCY           D    +D+ E 
Sbjct: 140  NITVNYFSTKYSILSAACPSLPEHMKIQVCPVVELPCYKETGESSLECQDAEDGFDDKEN 199

Query: 1373 YEESSEK--LTDGRSTDSNS---------------SFSNQDSTEENVDEH---------- 1275
            YE ++ +     G++ +  S               +F  QDS     +E+          
Sbjct: 200  YENTTSESGAVKGKTVEFQSQSLDKFPDFNRGEEIAFEGQDSNSNKDEEYNEVSQKNGTL 259

Query: 1274 ------------------IDWREELNVHAEEIST--PSREDPEHSY-------------- 1197
                               DW  E     E+ ST  PS ++    Y              
Sbjct: 260  DQIRTDAFGQISSDNSHTDDWTCEKFGSCEDYSTRHPSLKNTSADYPPAPKVDCARPFGF 319

Query: 1196 IIIDSPLRTSSPIQSGSFDIAD--NKDRYELDDEFGEELGQQANMLCSTKTDAYHDPLPT 1023
               +S +RT+S + +G F I++  N D   L +    +LG +   + + K D        
Sbjct: 320  PTSNSLVRTASSLCTG-FPISETSNGDELMLINNSVSDLGSRNGKILTGKDD-------- 370

Query: 1022 KNAGLSSEVEVIDLFTP-PCYKVGADNKKRRLSTTYPEIIDLT 897
            K+  +  E+EVIDL +P P  ++ +  KKRR  T  P+IIDLT
Sbjct: 371  KDKPIPQEIEVIDLLSPSPECRIMSSRKKRRFLTVCPQIIDLT 413


>ref|XP_004145233.1| PREDICTED: uncharacterized protein LOC101203492 [Cucumis sativus]
            gi|449471301|ref|XP_004153269.1| PREDICTED:
            uncharacterized protein LOC101204996 [Cucumis sativus]
            gi|449506301|ref|XP_004162709.1| PREDICTED:
            uncharacterized protein LOC101229010 [Cucumis sativus]
          Length = 395

 Score =  296 bits (757), Expect = 3e-77
 Identities = 173/378 (45%), Positives = 222/378 (58%), Gaps = 37/378 (9%)
 Frame = -1

Query: 1901 DDEQESR------FFACYLLTSICPRFKGHTYIGFTVNPRRRIRQHNGEVGMGAWRTKKK 1740
            DDE+E R      FF+CYLL S CPRFKGHTYIGFTVNP+RRIRQHNGE+  GAWRTK+K
Sbjct: 20   DDEEEERGNEVNGFFSCYLLASACPRFKGHTYIGFTVNPKRRIRQHNGEIRCGAWRTKRK 79

Query: 1739 RPWEMILCIYGFPTNVSALQFEWAWQHPVESRAVRQAATSFKTLGGVANKIKLAYTMLTL 1560
            RPWEM+LCIYGFPTNVSALQFEWAWQHP ES AVR AA +FK+L GVANK+KLAYTMLTL
Sbjct: 80   RPWEMVLCIYGFPTNVSALQFEWAWQHPNESLAVRSAAATFKSLSGVANKVKLAYTMLTL 139

Query: 1559 PEWRSLNLTVNFFSTKYKMHSAGCPSLPEHVRVQVCALDELPCYTGIDRDEY-TTNDWDN 1383
            P WR LN+TVN+FSTK+  ++AGCPSLPEH++VQV  ++ELPCY+  D+D      DW  
Sbjct: 140  PAWRGLNITVNYFSTKFMKNAAGCPSLPEHMKVQVSPINELPCYSEGDQDMLENEGDW-- 197

Query: 1382 SEEYEESSEKLTDGRSTDSNSSFSNQ------------DSTEENVDEHIDWREELNVHAE 1239
              EY    E++   R   S    SN+            D    +V    D   E N    
Sbjct: 198  --EYNREREEICGFRVYGSMKEVSNEVPQKLMDYQTGTDGRPPHVLRGCDKELETNEQVP 255

Query: 1238 EIS-TPSREDPEHSYII------IDSPLRTSSPIQSGSFDIADNKDRYELDDEFGEEL-G 1083
              S TPS  D   SY +      +++  R ++           ++    +DDE   +L G
Sbjct: 256  PSSCTPSYIDVGMSYDLCACDEGLENDEREAASCGQSCIVAGTSRTEIVIDDEEENQLEG 315

Query: 1082 QQANMLCSTKTDAYHDPLPTKNAGLSS---------EVEVIDLFTP-PCYKVGADNKKRR 933
               N+      +     + ++ + +S          E EVID+ TP P  +  +   KRR
Sbjct: 316  SSMNLQEQPGRENLTSGIASEISKVSRWNNGWVPTVEYEVIDVSTPSPDCRTSSHRFKRR 375

Query: 932  LSTTYPEIIDLTDSPIHV 879
            +++   E+IDLT SP  +
Sbjct: 376  VTSGKSEMIDLTKSPTFI 393


>emb|CBI15837.3| unnamed protein product [Vitis vinifera]
          Length = 346

 Score =  295 bits (756), Expect = 4e-77
 Identities = 152/277 (54%), Positives = 180/277 (64%), Gaps = 24/277 (8%)
 Frame = -1

Query: 1898 DEQESRFFACYLLTSICPRFKGHTYIGFTVNPRRRIRQHNGEVGMGAWRTKKKRPWEMIL 1719
            +E+   FFACYLL S+ PR KGH+YIGFTVNPRRRIRQHNGE+  GAW+TK+KRPWEM+L
Sbjct: 18   EEKGDDFFACYLLASLSPRHKGHSYIGFTVNPRRRIRQHNGEITCGAWKTKRKRPWEMVL 77

Query: 1718 CIYGFPTNVSALQFEWAWQHPVESRAVRQAATSFKTLGGVANKIKLAYTMLTLPEWRSLN 1539
            CIYGFPTNVSALQFEWAWQHP ES AVR+AA  FK+L G+ANKIKLAYTM TLP W+SLN
Sbjct: 78   CIYGFPTNVSALQFEWAWQHPTESLAVRKAAAGFKSLSGIANKIKLAYTMFTLPAWQSLN 137

Query: 1538 LTVNFFSTKYKMHSAGCPSLPEHVRVQVCALDELPCYTGIDRDEYTTNDWDNSEEYEESS 1359
            LTVNFFSTKY  HSAGCP LPEH+RVQV  +DELPCY+G D+  +     D  EE     
Sbjct: 138  LTVNFFSTKYTKHSAGCPILPEHMRVQVSPMDELPCYSGSDQSFFDNARGDEKEEL---- 193

Query: 1358 EKLTDGRSTDSNSSFSNQDSTEENVDEHIDWREELN------------VHAE-------- 1239
                 G    S+  F    + EE   E   W EE              VH          
Sbjct: 194  -----GERGSSSDGFDQVIAHEETALEQFGWIEEHGLRQPGDSPSPEVVHCSGKTQENAM 248

Query: 1238 ----EISTPSREDPEHSYIIIDSPLRTSSPIQSGSFD 1140
                ++ST S+++    + +IDSP+RTSS    G+ D
Sbjct: 249  RQPADLST-SKDEHRSPFCLIDSPVRTSSHSTEGTLD 284


>gb|EOY34667.1| Excinuclease ABC [Theobroma cacao]
          Length = 460

 Score =  293 bits (750), Expect = 2e-76
 Identities = 150/254 (59%), Positives = 179/254 (70%), Gaps = 5/254 (1%)
 Frame = -1

Query: 1895 EQESRFFACYLLTSICPRFKGHTYIGFTVNPRRRIRQHNGEVGMGAWRTKKKRPWEMILC 1716
            +Q   FFACYLLTS+ PR KGHTYIGFTVNPRRRIRQHNGE+G GAWRTK KRPWEM++C
Sbjct: 57   KQGKGFFACYLLTSLSPRHKGHTYIGFTVNPRRRIRQHNGEIGSGAWRTKSKRPWEMVIC 116

Query: 1715 IYGFPTNVSALQFEWAWQHPVESRAVRQAATSFKTLGGVANKIKLAYTMLTLPEWRSLNL 1536
            IYGFPTNVSALQFEWAWQHP ES AVR+AA +FK+L GVANKIKLAYTMLTLP W+SLN+
Sbjct: 117  IYGFPTNVSALQFEWAWQHPQESVAVREAAATFKSLSGVANKIKLAYTMLTLPAWQSLNI 176

Query: 1535 TVNFFSTKYKMHSAGCPSLPEHVRVQVCALDELPCYTGIDRDEYTTNDWDNSEEYEESSE 1356
            TVN+FSTKY+  SA CPSLPE ++VQVC+++ELPCYT  D  EY  +D DN +EY+E ++
Sbjct: 177  TVNYFSTKYRKDSACCPSLPEQMKVQVCSMNELPCYTEQDEFEY-KDDCDNLDEYDEVND 235

Query: 1355 KLTDGRSTDS----NSSFSN-QDSTEENVDEHIDWREELNVHAEEISTPSREDPEHSYII 1191
                   T      N+S  N   S  E   E  ++ EE        S+          + 
Sbjct: 236  TCETVWETYPDEVVNASADNFLSSIHEASHEEFEYIEEYKTRKPVDSSTLGVHNIQPQVF 295

Query: 1190 IDSPLRTSSPIQSG 1149
            IDSP   +S I +G
Sbjct: 296  IDSPTSKTSSIATG 309


>ref|XP_002325655.2| endo/excinuclease amino terminal domain-containing family protein,
            partial [Populus trichocarpa] gi|550317584|gb|EEF00037.2|
            endo/excinuclease amino terminal domain-containing family
            protein, partial [Populus trichocarpa]
          Length = 212

 Score =  285 bits (730), Expect = 4e-74
 Identities = 127/185 (68%), Positives = 157/185 (84%), Gaps = 4/185 (2%)
 Frame = -1

Query: 1898 DEQESRFFACYLLTSICPRFKGHTYIGFTVNPRRRIRQHNGEVGMGAWRTKKKRPWEMIL 1719
            ++ ++ FFACYLLTS+CPRFKGHTYIGFTVNPRRRIRQHNGE+  GA RTKK+RPWEM++
Sbjct: 9    EKGKNGFFACYLLTSLCPRFKGHTYIGFTVNPRRRIRQHNGELRSGACRTKKRRPWEMVI 68

Query: 1718 CIYGFPTNVSALQFEWAWQHPVESRAVRQAATSFKTLGGVANKIKLAYTMLTLPEWRSLN 1539
            C+YGFPTNV+ALQFEWAWQHP ES AVRQAA +FK+  GVANKIKLAYTML LP W+SLN
Sbjct: 69   CVYGFPTNVAALQFEWAWQHPTESVAVRQAAAAFKSFSGVANKIKLAYTMLNLPSWQSLN 128

Query: 1538 LTVNFFSTKYKMHSAGCPSLPEHVRVQVCALDELPCYTGIDRDEYTTND----WDNSEEY 1371
            +TVN+FST+YK+HSAGCPSLP++++VQ+C ++ELPCY+    + +   D    WD  EEY
Sbjct: 129  ITVNYFSTQYKVHSAGCPSLPKNMKVQICPMNELPCYSDFVDNLFEERDDEDAWDGEEEY 188

Query: 1370 EESSE 1356
            E +S+
Sbjct: 189  ERASD 193


>ref|XP_002319418.2| hypothetical protein POPTR_0013s15190g, partial [Populus trichocarpa]
            gi|550325896|gb|EEE95341.2| hypothetical protein
            POPTR_0013s15190g, partial [Populus trichocarpa]
          Length = 431

 Score =  283 bits (723), Expect = 3e-73
 Identities = 126/185 (68%), Positives = 153/185 (82%), Gaps = 4/185 (2%)
 Frame = -1

Query: 1898 DEQESRFFACYLLTSICPRFKGHTYIGFTVNPRRRIRQHNGEVGMGAWRTKKKRPWEMIL 1719
            ++ ++ FFACYLLTS+CPRFKGHTYIGFTVNPRRRIRQHNGE+  GA RTKK+RPWEM+ 
Sbjct: 18   EKGKNGFFACYLLTSLCPRFKGHTYIGFTVNPRRRIRQHNGELRSGACRTKKRRPWEMVF 77

Query: 1718 CIYGFPTNVSALQFEWAWQHPVESRAVRQAATSFKTLGGVANKIKLAYTMLTLPEWRSLN 1539
            CIYGFPTNV+ALQFEWAWQHP ES AVRQAA +FK+  GVANKIKLAYTML LP W+SLN
Sbjct: 78   CIYGFPTNVAALQFEWAWQHPTESVAVRQAAAAFKSFSGVANKIKLAYTMLNLPSWQSLN 137

Query: 1538 LTVNFFSTKYKMHSAGCPSLPEHVRVQVCALDELPCYTG----IDRDEYTTNDWDNSEEY 1371
            +T+N+FST YK+HS GCPSLP++++VQ+C +DELPCY      +  +    + WD  EEY
Sbjct: 138  ITINYFSTNYKVHSVGCPSLPKNMKVQICPMDELPCYCDSGDILFEERENEDAWDGEEEY 197

Query: 1370 EESSE 1356
            E +S+
Sbjct: 198  ERASD 202


>ref|XP_006410121.1| hypothetical protein EUTSA_v10016841mg [Eutrema salsugineum]
            gi|557111290|gb|ESQ51574.1| hypothetical protein
            EUTSA_v10016841mg [Eutrema salsugineum]
          Length = 364

 Score =  281 bits (719), Expect = 8e-73
 Identities = 160/340 (47%), Positives = 208/340 (61%), Gaps = 9/340 (2%)
 Frame = -1

Query: 1880 FFACYLLTSICPRFKGHTYIGFTVNPRRRIRQHNGEVGMGAWRTKKKRPWEMILCIYGFP 1701
            FFACY+LTS+ PR KGHTYIGFTVNPRRRIRQHNGE+  GA+RTKKKRPWEM+LCIYGFP
Sbjct: 30   FFACYILTSLSPRHKGHTYIGFTVNPRRRIRQHNGEITSGAYRTKKKRPWEMVLCIYGFP 89

Query: 1700 TNVSALQFEWAWQHPVESRAVRQAATSFKTLGGVANKIKLAYTMLTLPEWRSLNLTVNFF 1521
            TNVSALQFEWAWQHP ES AVR+AA +FK+  G+ +KIKLAYTMLTLP W SLNLTVN+F
Sbjct: 90   TNVSALQFEWAWQHPRESLAVREAAAAFKSFSGLGSKIKLAYTMLTLPAWNSLNLTVNYF 149

Query: 1520 STKYKMHSAGCPSLPEHVRVQVCALDELPCYTGIDRDEYTTNDWDNSEEYEESSEKLTDG 1341
            STKY  H    PSLP H++VQVCA+D+LPC+T +D +    ++       EE  ++  + 
Sbjct: 150  STKYAHHGGLSPSLPPHMKVQVCAMDDLPCFTKLDNNSQPEDEESLDSHEEEEDDRRNEI 209

Query: 1340 RSTDSNSSFSNQDSTEENVDEHIDWREELNVHAEEISTPSREDPEHSYIIIDSPLRTS-S 1164
            +  +  +S SN     E   + +  R+       E     R      +  +D  +    S
Sbjct: 210  QPGNLTTSSSNDLYLGE---KELHDRDFEKAKQPEAVLDDRLANFTGFGSLDESVEDEVS 266

Query: 1163 PIQSGSFDIADNKDRYELDDE------FG-EELGQQANMLCSTKTDAYHDPLPTKNAGLS 1005
             I  GS +  + +     DD       FG E++ +      + + D +     +     +
Sbjct: 267  HITVGSIEAMEKEPETVFDDRLANFTGFGLEDIVEDVISHSTMEKDCWR---RSNLITST 323

Query: 1004 SEVEVIDLFTP-PCYKVGADNKKRRLSTTYPEIIDLTDSP 888
            +EVEVIDL TP P  +VG   K++R+S    E IDLT SP
Sbjct: 324  TEVEVIDLMTPSPSCRVGPSMKRQRVS----EFIDLTRSP 359


>ref|NP_180594.2| Excinuclease ABC, C subunit, N-terminal [Arabidopsis thaliana]
            gi|51968920|dbj|BAD43152.1| hypothetical protein
            [Arabidopsis thaliana] gi|51968928|dbj|BAD43156.1|
            hypothetical protein [Arabidopsis thaliana]
            gi|51971411|dbj|BAD44370.1| hypothetical protein
            [Arabidopsis thaliana] gi|66792676|gb|AAY56440.1|
            At2g30350 [Arabidopsis thaliana]
            gi|330253280|gb|AEC08374.1| Excinuclease ABC, C subunit,
            N-terminal [Arabidopsis thaliana]
          Length = 368

 Score =  277 bits (708), Expect = 1e-71
 Identities = 164/359 (45%), Positives = 210/359 (58%), Gaps = 28/359 (7%)
 Frame = -1

Query: 1880 FFACYLLTSICPRFKGHTYIGFTVNPRRRIRQHNGEVGMGAWRTKKKRPWEMILCIYGFP 1701
            FFACYLLTS+ PR KG TYIGFTVNPRRRIRQHNGE+  GAWRTKKKRPWEM+LCIYGFP
Sbjct: 27   FFACYLLTSLSPRHKGQTYIGFTVNPRRRIRQHNGEITSGAWRTKKKRPWEMVLCIYGFP 86

Query: 1700 TNVSALQFEWAWQHPVESRAVRQAATSFKTLGGVANKIKLAYTMLTLPEWRSLNLTVNFF 1521
            TNVSALQFEWAWQHP ES AVR+AA +FK+  GVA+KIKL YTML LP W SLNLTVN+F
Sbjct: 87   TNVSALQFEWAWQHPRESVAVREAAAAFKSFSGVASKIKLVYTMLNLPAWNSLNLTVNYF 146

Query: 1520 STKYKMHSAGCPSLPEHVRVQVCALDELPCYTGIDRDEYTTNDWDNSEEYEESSEKLTDG 1341
            S+KY  H    PSLP H++VQVCA+++L  +T +D D     D ++ E  EE  +   D 
Sbjct: 147  SSKYAHHGGKSPSLPLHMKVQVCAMEDLQYFTKVD-DSSQPEDEESPEVNEEDDD---DD 202

Query: 1340 RSTDSNSSFSNQDSTEENVDEHID-WREELNVHAEE-----------ISTPSREDPEHSY 1197
                SN S     +T  + D H +  +E + V  EE           +      + E S+
Sbjct: 203  DDDSSNLSQPGNSNTSSSDDRHFEKAKEPVTVFDEEDRLANFSGFGLLDEEETVEDEVSH 262

Query: 1196 IIIDSPLRTSSPIQS------------GSFDIADNKDRYEL---DDEFGEELGQQANMLC 1062
            I + S   T    ++            G  +I +++  +      +   +E  ++ N + 
Sbjct: 263  ITVGSIRATEKEPETVFNDRLASFTCFGLVEIVEDEVSHGTIGSTEAMEKECRKRRNHIT 322

Query: 1061 STKTDAYHDPLPTKNAGLSSEVEVIDLFTP-PCYKVGADNKKRRLSTTYPEIIDLTDSP 888
            ST T+               +VEVIDL TP P  + G+  K+RR+S    E IDLT SP
Sbjct: 323  STTTEV--------------DVEVIDLMTPSPSCRAGSSMKRRRVS----EFIDLTMSP 363


>ref|XP_006294402.1| hypothetical protein CARUB_v10023419mg, partial [Capsella rubella]
            gi|482563110|gb|EOA27300.1| hypothetical protein
            CARUB_v10023419mg, partial [Capsella rubella]
          Length = 382

 Score =  275 bits (702), Expect = 7e-71
 Identities = 160/362 (44%), Positives = 209/362 (57%), Gaps = 31/362 (8%)
 Frame = -1

Query: 1880 FFACYLLTSICPRFKGHTYIGFTVNPRRRIRQHNGEVGMGAWRTKKKRPWEMILCIYGFP 1701
            FFACYLLTS+ PR KG TYIGFTVNPRRRIRQHNGE+  GAWRTKKKRPWEM+LCIYGFP
Sbjct: 36   FFACYLLTSLSPRHKGQTYIGFTVNPRRRIRQHNGEITCGAWRTKKKRPWEMVLCIYGFP 95

Query: 1700 TNVSALQFEWAWQHPVESRAVRQAATSFKTLGGVANKIKLAYTMLTLPEWRSLNLTVNFF 1521
            TNVSALQFEWAWQHP ES AVR+AA +FK+  G+A KIKL YTML LP W SLNLTVN+F
Sbjct: 96   TNVSALQFEWAWQHPRESLAVREAAAAFKSFPGIAGKIKLVYTMLNLPAWNSLNLTVNYF 155

Query: 1520 STKYKMHSAGCPSLPEHVRVQVCALDELPCYTGIDRDEYTTNDWDNSEEYEESSEKLTDG 1341
            S+KY  +    PSLP H++V+VCA+++LP +T +D      +  D S E  E +E     
Sbjct: 156  SSKYAHYGGLAPSLPLHMKVEVCAMEDLPYFTKLDNSSQPED--DESPEVNEEAEDEDSN 213

Query: 1340 RSTDSNSSFSNQD----STEENVDEHIDWREEL----------------NVHAEEISTPS 1221
            +S   NS  S+QD      +E  D H +  +E                 ++  E +    
Sbjct: 214  QSQPGNSGASSQDDLYPGEKELHDRHFEKAKEPVTVLDEDRLANFSGFGSLEEEAVEDEV 273

Query: 1220 REDPEHSYIIIDSPLRTSSPIQSGSF------DIADNKD----RYELDDEFGEELGQQAN 1071
               P  S  ++D    T    +  +F      +I ++++         +   ++   + N
Sbjct: 274  SHSPVGSIEVMDKEPETVFVDRLANFTGFGLVEIVEDEEVSHGTVRNTEAMEKDSWIRRN 333

Query: 1070 MLCSTKTDAYHDPLPTKNAGLSSEVEVIDLFTP-PCYKVGADNKKRRLSTTYPEIIDLTD 894
            ++ ST T+               +VEVIDL TP P  + G+  K+RR+S    E IDLT 
Sbjct: 334  LITSTTTEV--------------DVEVIDLMTPSPSCRAGSSMKRRRVS----EFIDLTR 375

Query: 893  SP 888
            SP
Sbjct: 376  SP 377


>gb|EOY18169.1| Excinuclease ABC [Theobroma cacao]
          Length = 225

 Score =  273 bits (699), Expect = 2e-70
 Identities = 128/180 (71%), Positives = 151/180 (83%)
 Frame = -1

Query: 1895 EQESRFFACYLLTSICPRFKGHTYIGFTVNPRRRIRQHNGEVGMGAWRTKKKRPWEMILC 1716
            +Q   FFACYLLTS+ PR KGHTYIGFTVNPRRRIRQHNGE+G GAWRTK K PWEM++C
Sbjct: 9    KQGEGFFACYLLTSLSPRHKGHTYIGFTVNPRRRIRQHNGEIGSGAWRTKGKHPWEMVIC 68

Query: 1715 IYGFPTNVSALQFEWAWQHPVESRAVRQAATSFKTLGGVANKIKLAYTMLTLPEWRSLNL 1536
            IYG PT+VSALQFEWAWQHP ES AVR+AA +FK+L  VANKIKLAYTMLTLP  ++LN+
Sbjct: 69   IYGLPTDVSALQFEWAWQHPQESVAVREAAATFKSLSEVANKIKLAYTMLTLPAGQNLNI 128

Query: 1535 TVNFFSTKYKMHSAGCPSLPEHVRVQVCALDELPCYTGIDRDEYTTNDWDNSEEYEESSE 1356
            TV++FSTKY+  SA CPSLPE ++VQ C+LDELPCYT  D  EY  +D DNS+EY+E ++
Sbjct: 129  TVDYFSTKYRKDSACCPSLPEQMKVQACSLDELPCYTEQDEFEY-KDDCDNSDEYDEVND 187

