BLASTX nr result

ID: Sinomenium21_contig00017774 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00017774
         (1491 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002264475.2| PREDICTED: endonuclease III-like protein 1-l...   462   e-127
emb|CBI36652.3| unnamed protein product [Vitis vinifera]              457   e-126
ref|XP_002534117.1| endonuclease III, putative [Ricinus communis...   452   e-124
ref|XP_006443060.1| hypothetical protein CICLE_v10020813mg [Citr...   448   e-123
ref|XP_007034067.1| DNA glycosylase superfamily protein isoform ...   448   e-123
ref|XP_006494172.1| PREDICTED: endonuclease III-like protein 1-l...   444   e-122
ref|XP_007034069.1| DNA glycosylase superfamily protein isoform ...   440   e-121
ref|XP_004241397.1| PREDICTED: endonuclease III-like protein 1-l...   426   e-116
ref|XP_006410282.1| hypothetical protein EUTSA_v10016815mg [Eutr...   419   e-114
emb|CAC16135.1| endonuclease III homologue [Arabidopsis thaliana]     416   e-113
ref|NP_565725.1| protein NTH1 [Arabidopsis thaliana] gi|75206080...   416   e-113
gb|AAM61598.1| putative endonuclease [Arabidopsis thaliana]           415   e-113
ref|NP_001077988.1| protein NTH1 [Arabidopsis thaliana] gi|17380...   415   e-113
ref|XP_007131959.1| hypothetical protein PHAVU_011G055100g [Phas...   413   e-113
ref|XP_002309812.1| endonuclease-related family protein [Populus...   413   e-113
ref|XP_003539044.2| PREDICTED: endonuclease III-like protein 1-l...   411   e-112
ref|XP_004507328.1| PREDICTED: endonuclease III-like protein 1-l...   411   e-112
ref|XP_002881177.1| predicted protein [Arabidopsis lyrata subsp....   411   e-112
ref|XP_007131960.1| hypothetical protein PHAVU_011G055100g [Phas...   410   e-112
ref|XP_002889575.1| hypothetical protein ARALYDRAFT_470604 [Arab...   406   e-110

>ref|XP_002264475.2| PREDICTED: endonuclease III-like protein 1-like [Vitis vinifera]
          Length = 355

 Score =  462 bits (1189), Expect = e-127
 Identities = 232/343 (67%), Positives = 267/343 (77%)
 Frame = +2

Query: 242  PETRSVSAKLQSKHEIPNAEPNAEIRVSARKRRSKRTIETRMEEHKIESLQQKKKLCDLP 421
            P   ++ +K  +  E PN    +E+RV  RK+R K  +ET  +E K E  QQK  +C+LP
Sbjct: 10   PLLPALQSKTSAHEETPNGVSGSEVRVFVRKKRVKMAVETPEKEIKAEPQQQK--ICELP 67

Query: 422  DIEEFAYGKVNGSAEMSKGHLEPISNILPMRSTNDSPIMPKVDAPANWKEVLDGIRNMRS 601
            DIEEF Y K   S  + K   +P S++ P  +   S I P  + PANW+++L+GIR MRS
Sbjct: 68   DIEEFTYRKGKRSTHLRKS--KPTSDVPPGGTEITSSIRPAAELPANWEKILEGIRKMRS 125

Query: 602  SEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQNGLLAPDAI 781
            SEDAPVDSMGCEKAGS LPP+ERRFAVLVSSLLSSQTKD VTHGAIQRL QNGLL  DAI
Sbjct: 126  SEDAPVDSMGCEKAGSSLPPRERRFAVLVSSLLSSQTKDNVTHGAIQRLLQNGLLVADAI 185

Query: 782  DSADEETIKNLIYPVGFYSRKACNLKKIAKICLMEYGGDIPSSLKELLLLPGIGPKMAHL 961
            D ADE T+K+LIYPVGFYSRKA NLKKIAKICLM+Y GDIPSSL+ELLLLPGIGPKMAHL
Sbjct: 186  DKADEATVKSLIYPVGFYSRKAGNLKKIAKICLMKYDGDIPSSLEELLLLPGIGPKMAHL 245

Query: 962  VMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGLKTLSPEETRESLQLWLPKDEWIAIN 1141
            VMNV WNNVQGICVDTHVHRICNRL WVS+ GT  KT  PEETRESLQLWLPK+EW+ IN
Sbjct: 246  VMNVAWNNVQGICVDTHVHRICNRLGWVSRRGTKQKTSLPEETRESLQLWLPKEEWVPIN 305

Query: 1142 PLLVGFGQTLCTPLKPRCEMCSINGLCPSAFKETASRMHRTKK 1270
            PLLVGFGQT+CTPL+PRC +C ++ LCPSAFKE  S   + KK
Sbjct: 306  PLLVGFGQTICTPLRPRCGVCGVSDLCPSAFKEAQSPSSKMKK 348


>emb|CBI36652.3| unnamed protein product [Vitis vinifera]
          Length = 379

 Score =  457 bits (1175), Expect = e-126
 Identities = 232/346 (67%), Positives = 267/346 (77%), Gaps = 3/346 (0%)
 Frame = +2

Query: 242  PETRSVSAKLQSKHEIPNAEPNAEIRVSARKRRSKRTIETRMEEHKIESLQQKKKLCDLP 421
            P   ++ +K  +  E PN    +E+RV  RK+R K  +ET  +E K E  QQK  +C+LP
Sbjct: 31   PLLPALQSKTSAHEETPNGVSGSEVRVFVRKKRVKMAVETPEKEIKAEPQQQK--ICELP 88

Query: 422  DIEEFAYGKVNGSAEMSKGHLEPISNILPMRSTNDSPIMPKVDAPANWKEVLDGIRNMRS 601
            DIEEF Y K   S  + K   +P S++ P  +   S I P  + PANW+++L+GIR MRS
Sbjct: 89   DIEEFTYRKGKRSTHLRKS--KPTSDVPPGGTEITSSIRPAAELPANWEKILEGIRKMRS 146

Query: 602  SEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTHG---AIQRLHQNGLLAP 772
            SEDAPVDSMGCEKAGS LPP+ERRFAVLVSSLLSSQTKD VTHG   AIQRL QNGLL  
Sbjct: 147  SEDAPVDSMGCEKAGSSLPPRERRFAVLVSSLLSSQTKDNVTHGNAGAIQRLLQNGLLVA 206

Query: 773  DAIDSADEETIKNLIYPVGFYSRKACNLKKIAKICLMEYGGDIPSSLKELLLLPGIGPKM 952
            DAID ADE T+K+LIYPVGFYSRKA NLKKIAKICLM+Y GDIPSSL+ELLLLPGIGPKM
Sbjct: 207  DAIDKADEATVKSLIYPVGFYSRKAGNLKKIAKICLMKYDGDIPSSLEELLLLPGIGPKM 266

Query: 953  AHLVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGLKTLSPEETRESLQLWLPKDEWI 1132
            AHLVMNV WNNVQGICVDTHVHRICNRL WVS+ GT  KT  PEETRESLQLWLPK+EW+
Sbjct: 267  AHLVMNVAWNNVQGICVDTHVHRICNRLGWVSRRGTKQKTSLPEETRESLQLWLPKEEWV 326

Query: 1133 AINPLLVGFGQTLCTPLKPRCEMCSINGLCPSAFKETASRMHRTKK 1270
             INPLLVGFGQT+CTPL+PRC +C ++ LCPSAFKE  S   + KK
Sbjct: 327  PINPLLVGFGQTICTPLRPRCGVCGVSDLCPSAFKEAQSPSSKMKK 372


>ref|XP_002534117.1| endonuclease III, putative [Ricinus communis]
            gi|223525829|gb|EEF28268.1| endonuclease III, putative
            [Ricinus communis]
          Length = 357

 Score =  452 bits (1162), Expect = e-124
 Identities = 232/355 (65%), Positives = 271/355 (76%), Gaps = 10/355 (2%)
 Frame = +2

Query: 239  MPETRSVSAKLQSKHEIP----------NAEPNAEIRVSARKRRSKRTIETRMEEHKIES 388
            MP TR  S  LQSK EI           N       RV  RK+R+KRT+E   +E K+E+
Sbjct: 1    MPITRFSSKSLQSKTEIQILSSDPIPGSNEATEPASRVYVRKKRAKRTLEVAEKELKVET 60

Query: 389  LQQKKKLCDLPDIEEFAYGKVNGSAEMSKGHLEPISNILPMRSTNDSPIMPKVDAPANWK 568
             + K+    LPDIE+F++   NGSA + K   +P  ++LP+ +     I P  + PANW+
Sbjct: 61   KEVKQSA--LPDIEDFSFKGTNGSAYLRKS--KPSRDVLPVDNEVACTIRPSDEPPANWE 116

Query: 569  EVLDGIRNMRSSEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRL 748
             VL+GIR MRSSEDAPVD+MGCEKAGSFLP KERRFAVLVSSL+SSQTKD VTHGA+QRL
Sbjct: 117  IVLEGIRKMRSSEDAPVDTMGCEKAGSFLPSKERRFAVLVSSLMSSQTKDHVTHGAVQRL 176

Query: 749  HQNGLLAPDAIDSADEETIKNLIYPVGFYSRKACNLKKIAKICLMEYGGDIPSSLKELLL 928
            HQN LL  DAID ADE TIK+LIYPVGFY+RKA NLKKIAKICLM+Y GDIP SL++LL 
Sbjct: 177  HQNSLLTADAIDKADETTIKDLIYPVGFYTRKASNLKKIAKICLMKYDGDIPRSLEDLLS 236

Query: 929  LPGIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGLKTLSPEETRESLQL 1108
            LPGIGPKMAHLVMNV W++VQGICVDTHVHRICNRL WVS+PGT  KT +PEETR +LQL
Sbjct: 237  LPGIGPKMAHLVMNVAWDDVQGICVDTHVHRICNRLGWVSRPGTEQKTSNPEETRVALQL 296

Query: 1109 WLPKDEWIAINPLLVGFGQTLCTPLKPRCEMCSINGLCPSAFKETASRMHRTKKS 1273
            WLPK+EW+ INPLLVGFGQT+CTPL+PRC MCSI   CPSAFKET+S   + KKS
Sbjct: 297  WLPKEEWVPINPLLVGFGQTICTPLRPRCGMCSITEFCPSAFKETSSPASKMKKS 351


>ref|XP_006443060.1| hypothetical protein CICLE_v10020813mg [Citrus clementina]
            gi|557545322|gb|ESR56300.1| hypothetical protein
            CICLE_v10020813mg [Citrus clementina]
          Length = 357

 Score =  448 bits (1153), Expect = e-123
 Identities = 229/361 (63%), Positives = 276/361 (76%), Gaps = 10/361 (2%)
 Frame = +2

Query: 227  LIRQMPETRSVSAKL-QSKHEIPNAEPNAEIRVSARKRRSKRTIETRMEEHKIESLQQKK 403
            ++ +MP +R  S +L Q       + PN E+RV  R++R K  ++   EE K E+  + K
Sbjct: 4    ILLKMPNSRFYSKRLLQPNANFSTSPPNPELRVFVRRKRQKNALQISKEEPKNEAPIEHK 63

Query: 404  KLCDLPDIEEFAYGKVNGSAEMSKGHLEPISNILPMRSTNDSPIMP---------KVDAP 556
              C LPDIEEFAY + NGSA  SK     I+     +ST D P++          + + P
Sbjct: 64   S-CGLPDIEEFAYKEANGSALSSK-----IAG--KSKSTQDMPVVGTEVASLNRMRGEPP 115

Query: 557  ANWKEVLDGIRNMRSSEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTHGA 736
            ANW+ VL+GIR MR+SEDAPVDSMGCEKAGS LPP+ERRFAVL+SSLLSSQTKD VTHGA
Sbjct: 116  ANWERVLEGIRKMRTSEDAPVDSMGCEKAGSSLPPRERRFAVLISSLLSSQTKDNVTHGA 175

Query: 737  IQRLHQNGLLAPDAIDSADEETIKNLIYPVGFYSRKACNLKKIAKICLMEYGGDIPSSLK 916
            IQRL QNGLL  +AID ADE TIK+LIYPVGFY+RKA N+KKIA ICL +Y GDIPSSL 
Sbjct: 176  IQRLLQNGLLTAEAIDKADEATIKDLIYPVGFYTRKASNMKKIAPICLTKYDGDIPSSLD 235

Query: 917  ELLLLPGIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGLKTLSPEETRE 1096
            ELLLLPGIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRL WVS+PG   KT SPE+TRE
Sbjct: 236  ELLLLPGIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLGWVSQPGRKQKTSSPEQTRE 295

Query: 1097 SLQLWLPKDEWIAINPLLVGFGQTLCTPLKPRCEMCSINGLCPSAFKETASRMHRTKKSS 1276
             LQLWLPK+EW+ INPLLVGFGQT+CTP++PRC MCS++ LCPSAFK+++S   +++KS+
Sbjct: 296  VLQLWLPKEEWVPINPLLVGFGQTICTPIRPRCGMCSVSELCPSAFKDSSSPSSKSRKSA 355

Query: 1277 R 1279
            +
Sbjct: 356  Q 356


>ref|XP_007034067.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao]
            gi|508713096|gb|EOY04993.1| DNA glycosylase superfamily
            protein isoform 1 [Theobroma cacao]
          Length = 387

 Score =  448 bits (1153), Expect = e-123
 Identities = 231/363 (63%), Positives = 278/363 (76%), Gaps = 13/363 (3%)
 Frame = +2

Query: 236  QMPETRSVSAKLQSKH--EIPNAEPNA-----------EIRVSARKRRSKRTIETRMEEH 376
            +MP+TR     L S    E+P+++PN             +RV  RK+R K+T++   E  
Sbjct: 24   KMPKTRLAFKTLSSSSTTEVPSSDPNPGSETTDNVSVPAVRVFTRKKRVKKTVDVVQEIP 83

Query: 377  KIESLQQKKKLCDLPDIEEFAYGKVNGSAEMSKGHLEPISNILPMRSTNDSPIMPKVDAP 556
            K E+  +  KLC LPDIEEFAY KV+G +   K   +  S+ + + +   SP+    +AP
Sbjct: 84   KAEN--KGLKLCGLPDIEEFAYKKVDGPSLSGKS--KSTSDEINVGTGIASPVGIGGNAP 139

Query: 557  ANWKEVLDGIRNMRSSEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTHGA 736
            ANW++VL+GIR MRS+EDAPVD+MGCEKAGS LPPKERRFAVL+SSLLSSQTKD VTHGA
Sbjct: 140  ANWEKVLEGIRKMRSAEDAPVDTMGCEKAGSVLPPKERRFAVLISSLLSSQTKDHVTHGA 199

Query: 737  IQRLHQNGLLAPDAIDSADEETIKNLIYPVGFYSRKACNLKKIAKICLMEYGGDIPSSLK 916
            IQRL QN L+ PDAID ADE TIK+LIYPVGFY+RKA N+KKIAKICLM+Y GDIPSSL+
Sbjct: 200  IQRLIQNCLMTPDAIDKADEATIKDLIYPVGFYTRKAINVKKIAKICLMKYDGDIPSSLE 259

Query: 917  ELLLLPGIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGLKTLSPEETRE 1096
            ELLLLPGIGPKMAHLVMN+ W++VQGICVDTHVHRICNRL WVS+PGT  KTL PEETR 
Sbjct: 260  ELLLLPGIGPKMAHLVMNIAWDDVQGICVDTHVHRICNRLGWVSRPGTKQKTLYPEETRV 319

Query: 1097 SLQLWLPKDEWIAINPLLVGFGQTLCTPLKPRCEMCSINGLCPSAFKETASRMHRTKKSS 1276
            +LQ WLPK+EW+ INPLLVGFGQT+CTPL+P+CE+CSI   CPSAFKET+S   + KKS 
Sbjct: 320  ALQQWLPKEEWVPINPLLVGFGQTICTPLRPQCEVCSITEFCPSAFKETSSPSSKVKKSG 379

Query: 1277 RVK 1285
              K
Sbjct: 380  VTK 382


>ref|XP_006494172.1| PREDICTED: endonuclease III-like protein 1-like [Citrus sinensis]
          Length = 357

 Score =  444 bits (1143), Expect = e-122
 Identities = 228/361 (63%), Positives = 275/361 (76%), Gaps = 10/361 (2%)
 Frame = +2

Query: 227  LIRQMPETRSVSAKL-QSKHEIPNAEPNAEIRVSARKRRSKRTIETRMEEHKIESLQQKK 403
            ++ +MP +R  S +L Q       + PN E+RV  R++R K  ++   EE K E+  + K
Sbjct: 4    ILLKMPNSRFYSKRLLQPNANFSTSPPNPELRVFVRRKRQKNALQISKEEPKNEAPIEHK 63

Query: 404  KLCDLPDIEEFAYGKVNGSAEMSKGHLEPISNILPMRSTNDSPIMP---------KVDAP 556
              C LPDIEEFAY + NGSA  SK     I+     +ST D P++          + + P
Sbjct: 64   S-CGLPDIEEFAYKEANGSALSSK-----IAG--KSKSTQDMPVVGTEVASLNRMRGEPP 115

Query: 557  ANWKEVLDGIRNMRSSEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTHGA 736
            ANW+ VL+GIR MR+SEDAPVDSMGCEKAGS LPP+ERRFAVL+SSLLSSQTKD VTHGA
Sbjct: 116  ANWERVLEGIRKMRTSEDAPVDSMGCEKAGSSLPPRERRFAVLISSLLSSQTKDNVTHGA 175

Query: 737  IQRLHQNGLLAPDAIDSADEETIKNLIYPVGFYSRKACNLKKIAKICLMEYGGDIPSSLK 916
            IQRL QNGLL  +AID ADE TIK+LIY VGFY+RKA N+KKIA ICL +Y GDIPSSL 
Sbjct: 176  IQRLLQNGLLTAEAIDKADEATIKDLIYLVGFYTRKASNMKKIAPICLTKYDGDIPSSLD 235

Query: 917  ELLLLPGIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGLKTLSPEETRE 1096
            ELLLLPGIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRL WVS+PG   KT SPE+TRE
Sbjct: 236  ELLLLPGIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLGWVSQPGRKQKTSSPEQTRE 295

Query: 1097 SLQLWLPKDEWIAINPLLVGFGQTLCTPLKPRCEMCSINGLCPSAFKETASRMHRTKKSS 1276
             LQLWLPK+EW+ INPLLVGFGQT+CTP++PRC MCS++ LCPSAFK+++S   +++KS+
Sbjct: 296  VLQLWLPKEEWVPINPLLVGFGQTICTPIRPRCGMCSVSELCPSAFKDSSSPSSKSRKSA 355

Query: 1277 R 1279
            +
Sbjct: 356  Q 356


>ref|XP_007034069.1| DNA glycosylase superfamily protein isoform 3 [Theobroma cacao]
            gi|508713098|gb|EOY04995.1| DNA glycosylase superfamily
            protein isoform 3 [Theobroma cacao]
          Length = 364

 Score =  440 bits (1132), Expect = e-121
 Identities = 230/363 (63%), Positives = 269/363 (74%), Gaps = 13/363 (3%)
 Frame = +2

Query: 236  QMPETRSVSAKLQSKH--EIPNAEPNA-----------EIRVSARKRRSKRTIETRMEEH 376
            +MP+TR     L S    E+P+++PN             +RV  RK+R K+T++   E  
Sbjct: 24   KMPKTRLAFKTLSSSSTTEVPSSDPNPGSETTDNVSVPAVRVFTRKKRVKKTVDVVQEIP 83

Query: 377  KIESLQQKKKLCDLPDIEEFAYGKVNGSAEMSKGHLEPISNILPMRSTNDSPIMPKVDAP 556
            K E+  +  KLC LPDIEEFAY KV+G                P  S N         AP
Sbjct: 84   KAEN--KGLKLCGLPDIEEFAYKKVDG----------------PSLSGN---------AP 116

Query: 557  ANWKEVLDGIRNMRSSEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTHGA 736
            ANW++VL+GIR MRS+EDAPVD+MGCEKAGS LPPKERRFAVL+SSLLSSQTKD VTHGA
Sbjct: 117  ANWEKVLEGIRKMRSAEDAPVDTMGCEKAGSVLPPKERRFAVLISSLLSSQTKDHVTHGA 176

Query: 737  IQRLHQNGLLAPDAIDSADEETIKNLIYPVGFYSRKACNLKKIAKICLMEYGGDIPSSLK 916
            IQRL QN L+ PDAID ADE TIK+LIYPVGFY+RKA N+KKIAKICLM+Y GDIPSSL+
Sbjct: 177  IQRLIQNCLMTPDAIDKADEATIKDLIYPVGFYTRKAINVKKIAKICLMKYDGDIPSSLE 236

Query: 917  ELLLLPGIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGLKTLSPEETRE 1096
            ELLLLPGIGPKMAHLVMN+ W++VQGICVDTHVHRICNRL WVS+PGT  KTL PEETR 
Sbjct: 237  ELLLLPGIGPKMAHLVMNIAWDDVQGICVDTHVHRICNRLGWVSRPGTKQKTLYPEETRV 296

Query: 1097 SLQLWLPKDEWIAINPLLVGFGQTLCTPLKPRCEMCSINGLCPSAFKETASRMHRTKKSS 1276
            +LQ WLPK+EW+ INPLLVGFGQT+CTPL+P+CE+CSI   CPSAFKET+S   + KKS 
Sbjct: 297  ALQQWLPKEEWVPINPLLVGFGQTICTPLRPQCEVCSITEFCPSAFKETSSPSSKVKKSG 356

Query: 1277 RVK 1285
              K
Sbjct: 357  VTK 359


>ref|XP_004241397.1| PREDICTED: endonuclease III-like protein 1-like [Solanum
            lycopersicum]
          Length = 380

 Score =  426 bits (1095), Expect = e-116
 Identities = 219/329 (66%), Positives = 254/329 (77%), Gaps = 9/329 (2%)
 Frame = +2

Query: 311  EIRVSARKRRSKRTIETRMEEHKIESLQQKKKLCDLPDIEEFAYGK--VNGSAEMSKGHL 484
            E+RV  R++R K+T+E   +E K ES  +K  L  LPDIE+F+Y K   +  +  SK   
Sbjct: 56   ELRVFIRRKRVKKTVEVIAKEVKEESSGKKVMLVRLPDIEDFSYSKDITHPQSTPSKTVR 115

Query: 485  EPISNILPMRSTND-------SPIMPKVDAPANWKEVLDGIRNMRSSEDAPVDSMGCEKA 643
                  LP     +        P+ P    P+NW++VL+GIR MRS+EDAPVDSMGCEKA
Sbjct: 116  LTGEKTLPQLMQTEIKGFSLSDPLQP----PSNWEKVLEGIRKMRSAEDAPVDSMGCEKA 171

Query: 644  GSFLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQNGLLAPDAIDSADEETIKNLIYP 823
            GS LP KERRFAVLVSSLLSSQTKD V HGA+QRL QNGLLA DAIDSA+EETIK+LIYP
Sbjct: 172  GSSLPAKERRFAVLVSSLLSSQTKDQVNHGAVQRLLQNGLLAADAIDSANEETIKSLIYP 231

Query: 824  VGFYSRKACNLKKIAKICLMEYGGDIPSSLKELLLLPGIGPKMAHLVMNVGWNNVQGICV 1003
            VGFY+RKA NLKK+AKICL +Y GDIPSSL+ELLLLPGIGPKMAHLVMNV W NVQGICV
Sbjct: 232  VGFYTRKASNLKKVAKICLSKYNGDIPSSLEELLLLPGIGPKMAHLVMNVAWENVQGICV 291

Query: 1004 DTHVHRICNRLRWVSKPGTGLKTLSPEETRESLQLWLPKDEWIAINPLLVGFGQTLCTPL 1183
            DTHVHRI NRL WVS+PGT  KT +PEETRESLQLWLPK+EW+ INPLLVGFGQT+CTPL
Sbjct: 292  DTHVHRISNRLEWVSRPGTKQKTRTPEETRESLQLWLPKEEWVPINPLLVGFGQTICTPL 351

Query: 1184 KPRCEMCSINGLCPSAFKETASRMHRTKK 1270
            +PRC +C+++ LCPSAFKE AS     KK
Sbjct: 352  RPRCAICTVSDLCPSAFKEAASPSSTPKK 380


>ref|XP_006410282.1| hypothetical protein EUTSA_v10016815mg [Eutrema salsugineum]
            gi|557111451|gb|ESQ51735.1| hypothetical protein
            EUTSA_v10016815mg [Eutrema salsugineum]
          Length = 373

 Score =  419 bits (1077), Expect = e-114
 Identities = 209/349 (59%), Positives = 260/349 (74%), Gaps = 7/349 (2%)
 Frame = +2

Query: 260  SAKLQSKHEIPNAEPN-------AEIRVSARKRRSKRTIETRMEEHKIESLQQKKKLCDL 418
            S+K   +H +P+++P        +E RV  RK+R K+     +E  K   +  +K+LC L
Sbjct: 34   SSKPTQQHSLPDSDPEPAKPASGSETRVYTRKKRLKQEAFQPLE--KDSCINTQKQLCRL 91

Query: 419  PDIEEFAYGKVNGSAEMSKGHLEPISNILPMRSTNDSPIMPKVDAPANWKEVLDGIRNMR 598
            PDIEEFAY K N  +  S+   E    +  +++  +        AP NW +VL+GIR MR
Sbjct: 92   PDIEEFAYKK-NTRSSSSRRSTETSITVTSVKTAGN--------APENWVKVLEGIRQMR 142

Query: 599  SSEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQNGLLAPDA 778
            SSEDAPVDSMGC+KAGSFLPP ERRFAVL+ +LLSSQTKD V + AI RLHQNGLL P+A
Sbjct: 143  SSEDAPVDSMGCDKAGSFLPPTERRFAVLLGALLSSQTKDEVNNAAIHRLHQNGLLTPEA 202

Query: 779  IDSADEETIKNLIYPVGFYSRKACNLKKIAKICLMEYGGDIPSSLKELLLLPGIGPKMAH 958
            +D ADE T++ LIYPVGFY+RKA  +KKIAKICL++Y GDIPSSL +LL LPGIGPKMAH
Sbjct: 203  VDKADESTLRELIYPVGFYTRKATYMKKIAKICLVKYNGDIPSSLDDLLALPGIGPKMAH 262

Query: 959  LVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGLKTLSPEETRESLQLWLPKDEWIAI 1138
            L++++ WN+VQGICVDTHVHRICNRL WVS+PGT  KT SPEETR +LQ WLPK+EW+AI
Sbjct: 263  LILHIAWNDVQGICVDTHVHRICNRLGWVSRPGTKQKTSSPEETRVALQQWLPKEEWVAI 322

Query: 1139 NPLLVGFGQTLCTPLKPRCEMCSINGLCPSAFKETASRMHRTKKSSRVK 1285
            NPLLVGFGQT+CTPL+PRCE CS+  LCP+AFKE +S   + KKS + K
Sbjct: 323  NPLLVGFGQTICTPLRPRCETCSVTKLCPAAFKEASSPSSKLKKSKQSK 371


>emb|CAC16135.1| endonuclease III homologue [Arabidopsis thaliana]
          Length = 354

 Score =  416 bits (1070), Expect = e-113
 Identities = 214/355 (60%), Positives = 258/355 (72%), Gaps = 9/355 (2%)
 Frame = +2

Query: 248  TRSVSAKLQ-----SKHEIPNAEPNAEIRVSARKRRSKRTIETRMEEHKIESLQQKKKLC 412
            ++ +S K Q     S  E+      +E RV  RK+R K+     +E++  + +   K LC
Sbjct: 12   SKHISLKTQHPLSDSNSELAYGASGSETRVYTRKKRLKQEPFEPLEKYSGKGVNTHK-LC 70

Query: 413  DLPDIEEFAYGKVNGSAEMSKGHLEPISNILPMRSTNDSPIMPKVDA----PANWKEVLD 580
             LPDIE+FAY K  GS   S             RST  S  +  V      P NW EVL+
Sbjct: 71   GLPDIEDFAYKKTIGSPSSS-------------RSTETSITVTSVKTAGYPPENWVEVLE 117

Query: 581  GIRNMRSSEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQNG 760
            GIR MRSSEDAPVDSMGC+KAGSFLPP ERRFAVL+ +LLSSQTKD V + AI RLHQNG
Sbjct: 118  GIRQMRSSEDAPVDSMGCDKAGSFLPPTERRFAVLLGALLSSQTKDQVNNAAIHRLHQNG 177

Query: 761  LLAPDAIDSADEETIKNLIYPVGFYSRKACNLKKIAKICLMEYGGDIPSSLKELLLLPGI 940
            LL P+A+D ADE TIK LIYPVGFY+RKA  +KKIA+ICL++Y GDIPSSL +LL LPGI
Sbjct: 178  LLTPEAVDKADESTIKELIYPVGFYTRKATYMKKIARICLVKYDGDIPSSLDDLLSLPGI 237

Query: 941  GPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGLKTLSPEETRESLQLWLPK 1120
            GPKMAHL++++ WN+VQGICVDTHVHRICNRL WVS+PGT  KT SPEETR +LQ WLPK
Sbjct: 238  GPKMAHLILHIAWNDVQGICVDTHVHRICNRLGWVSRPGTKQKTTSPEETRVALQQWLPK 297

Query: 1121 DEWIAINPLLVGFGQTLCTPLKPRCEMCSINGLCPSAFKETASRMHRTKKSSRVK 1285
            +EW+AINPLLVGFGQ +CTPL+PRCE CS++ LCP+AFKET+S   + KKS+R K
Sbjct: 298  EEWVAINPLLVGFGQMICTPLRPRCEACSVSKLCPAAFKETSSPSSKLKKSNRSK 352


>ref|NP_565725.1| protein NTH1 [Arabidopsis thaliana]
            gi|75206080|sp|Q9SIC4.2|NTH1_ARATH RecName:
            Full=Endonuclease III homolog 1, chloroplastic;
            Short=AtNTH1; AltName: Full=Bifunctional DNA
            N-glycoslyase/DNA-(apurinic or apyrimidinic site) lyase
            1; Short=DNA glycoslyase/AP lyase 1; Flags: Precursor
            gi|20198157|gb|AAD26474.2| putative endonuclease
            [Arabidopsis thaliana] gi|330253455|gb|AEC08549.1|
            protein NTH1 [Arabidopsis thaliana]
          Length = 379

 Score =  416 bits (1068), Expect = e-113
 Identities = 213/355 (60%), Positives = 258/355 (72%), Gaps = 9/355 (2%)
 Frame = +2

Query: 248  TRSVSAKLQ-----SKHEIPNAEPNAEIRVSARKRRSKRTIETRMEEHKIESLQQKKKLC 412
            ++ +S K Q     S  E+      +E RV  RK+R K+     +E++  + +   K LC
Sbjct: 37   SKHISLKTQHPLSDSNSELAYGASGSETRVYTRKKRLKQEPFEPLEKYSGKGVNTHK-LC 95

Query: 413  DLPDIEEFAYGKVNGSAEMSKGHLEPISNILPMRSTNDSPIMPKVDA----PANWKEVLD 580
             LPDIE+FAY K  GS   S             RST  S  +  V      P NW EVL+
Sbjct: 96   GLPDIEDFAYKKTIGSPSSS-------------RSTETSITVTSVKTAGYPPENWVEVLE 142

Query: 581  GIRNMRSSEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQNG 760
            GIR MRSSEDAPVDSMGC+KAGSFLPP ERRFAVL+ +LLSSQTKD V + AI RLHQNG
Sbjct: 143  GIRQMRSSEDAPVDSMGCDKAGSFLPPTERRFAVLLGALLSSQTKDQVNNAAIHRLHQNG 202

Query: 761  LLAPDAIDSADEETIKNLIYPVGFYSRKACNLKKIAKICLMEYGGDIPSSLKELLLLPGI 940
            LL P+A+D ADE TIK LIYPVGFY+RKA  +KKIA+ICL++Y GDIPSSL +LL LPGI
Sbjct: 203  LLTPEAVDKADESTIKELIYPVGFYTRKATYMKKIARICLVKYDGDIPSSLDDLLSLPGI 262

Query: 941  GPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGLKTLSPEETRESLQLWLPK 1120
            GPKMAHL++++ WN+VQGICVDTHVHRICNRL WVS+PGT  KT SPEETR +LQ WLPK
Sbjct: 263  GPKMAHLILHIAWNDVQGICVDTHVHRICNRLGWVSRPGTKQKTTSPEETRVALQQWLPK 322

Query: 1121 DEWIAINPLLVGFGQTLCTPLKPRCEMCSINGLCPSAFKETASRMHRTKKSSRVK 1285
            +EW+AINPLLVGFGQ +CTP++PRCE CS++ LCP+AFKET+S   + KKS+R K
Sbjct: 323  EEWVAINPLLVGFGQMICTPIRPRCEACSVSKLCPAAFKETSSPSSKLKKSNRSK 377


>gb|AAM61598.1| putative endonuclease [Arabidopsis thaliana]
          Length = 379

 Score =  415 bits (1067), Expect = e-113
 Identities = 214/365 (58%), Positives = 263/365 (72%), Gaps = 12/365 (3%)
 Frame = +2

Query: 227  LIRQMPETRSVSAKLQSKHEIPNAEPNAEIRVSARKRRSK-RTIETRMEEHKIESLQQKK 403
            +IRQ+    S S  +  K + P ++ N+E+   A    ++  T + R+++   E L++  
Sbjct: 26   MIRQIHGAVSSSKHISLKTQHPLSDSNSELAYGASGSETRVYTRKKRLKQEPFEPLEKDS 85

Query: 404  -------KLCDLPDIEEFAYGKVNGSAEMSKGHLEPISNILPMRSTNDSPIMPKV----D 550
                   KLC LPDIE+FAY K  GS   S             RST  S  +  V    +
Sbjct: 86   GKGVNTHKLCGLPDIEDFAYKKTIGSPSSS-------------RSTETSITVTSVKTAGN 132

Query: 551  APANWKEVLDGIRNMRSSEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTH 730
             P NW  VL+GIR MRSSEDAPVDSMGC+KAGSFLPP ERRFAVL+ +LLSSQTKD V +
Sbjct: 133  PPENWVGVLEGIRQMRSSEDAPVDSMGCDKAGSFLPPTERRFAVLLGALLSSQTKDQVNN 192

Query: 731  GAIQRLHQNGLLAPDAIDSADEETIKNLIYPVGFYSRKACNLKKIAKICLMEYGGDIPSS 910
             AI RLHQNGLL P+A+D ADE TIK LIYPVGFY+RKA  +KKIA+ICL++Y GDIPSS
Sbjct: 193  AAIHRLHQNGLLTPEAVDKADESTIKELIYPVGFYTRKATYMKKIARICLVKYDGDIPSS 252

Query: 911  LKELLLLPGIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGLKTLSPEET 1090
            L +LL LPGIGPKMAHL++++ WN+VQGICVDTHVHRICNRL WVS+PGT  KT SPEET
Sbjct: 253  LDDLLSLPGIGPKMAHLILHIAWNDVQGICVDTHVHRICNRLGWVSRPGTKQKTTSPEET 312

Query: 1091 RESLQLWLPKDEWIAINPLLVGFGQTLCTPLKPRCEMCSINGLCPSAFKETASRMHRTKK 1270
            R +LQ WLPK+EW+AINPLLVGFGQ +CTPL+PRCE CS++ LCP+AFKET+S   + KK
Sbjct: 313  RVALQQWLPKEEWVAINPLLVGFGQMICTPLRPRCEACSVSKLCPAAFKETSSPSSKLKK 372

Query: 1271 SSRVK 1285
            S+R K
Sbjct: 373  SNRSK 377


>ref|NP_001077988.1| protein NTH1 [Arabidopsis thaliana] gi|17380754|gb|AAL36207.1|
            putative endonuclease [Arabidopsis thaliana]
            gi|20259623|gb|AAM14168.1| putative endonuclease
            [Arabidopsis thaliana] gi|330253456|gb|AEC08550.1|
            protein NTH1 [Arabidopsis thaliana]
          Length = 377

 Score =  415 bits (1066), Expect = e-113
 Identities = 215/362 (59%), Positives = 261/362 (72%), Gaps = 11/362 (3%)
 Frame = +2

Query: 233  RQMPETRSVSAKLQSKHEIPNAEPNA-------EIRVSARKRRSKRTIETRMEEHKIESL 391
            RQ+    S S  +  K + P ++ N+       E RV  RK+R K+     +E++  + +
Sbjct: 28   RQIHGAVSSSKHISLKTQHPLSDSNSAYGASGSETRVYTRKKRLKQEPFEPLEKYSGKGV 87

Query: 392  QQKKKLCDLPDIEEFAYGKVNGSAEMSKGHLEPISNILPMRSTNDSPIMPKVDA----PA 559
               K LC LPDIE+FAY K  GS   S             RST  S  +  V      P 
Sbjct: 88   NTHK-LCGLPDIEDFAYKKTIGSPSSS-------------RSTETSITVTSVKTAGYPPE 133

Query: 560  NWKEVLDGIRNMRSSEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTHGAI 739
            NW EVL+GIR MRSSEDAPVDSMGC+KAGSFLPP ERRFAVL+ +LLSSQTKD V + AI
Sbjct: 134  NWVEVLEGIRQMRSSEDAPVDSMGCDKAGSFLPPTERRFAVLLGALLSSQTKDQVNNAAI 193

Query: 740  QRLHQNGLLAPDAIDSADEETIKNLIYPVGFYSRKACNLKKIAKICLMEYGGDIPSSLKE 919
             RLHQNGLL P+A+D ADE TIK LIYPVGFY+RKA  +KKIA+ICL++Y GDIPSSL +
Sbjct: 194  HRLHQNGLLTPEAVDKADESTIKELIYPVGFYTRKATYMKKIARICLVKYDGDIPSSLDD 253

Query: 920  LLLLPGIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGLKTLSPEETRES 1099
            LL LPGIGPKMAHL++++ WN+VQGICVDTHVHRICNRL WVS+PGT  KT SPEETR +
Sbjct: 254  LLSLPGIGPKMAHLILHIAWNDVQGICVDTHVHRICNRLGWVSRPGTKQKTTSPEETRVA 313

Query: 1100 LQLWLPKDEWIAINPLLVGFGQTLCTPLKPRCEMCSINGLCPSAFKETASRMHRTKKSSR 1279
            LQ WLPK+EW+AINPLLVGFGQ +CTP++PRCE CS++ LCP+AFKET+S   + KKS+R
Sbjct: 314  LQQWLPKEEWVAINPLLVGFGQMICTPIRPRCEACSVSKLCPAAFKETSSPSSKLKKSNR 373

Query: 1280 VK 1285
             K
Sbjct: 374  SK 375


>ref|XP_007131959.1| hypothetical protein PHAVU_011G055100g [Phaseolus vulgaris]
            gi|561004959|gb|ESW03953.1| hypothetical protein
            PHAVU_011G055100g [Phaseolus vulgaris]
          Length = 359

 Score =  413 bits (1062), Expect = e-113
 Identities = 215/338 (63%), Positives = 254/338 (75%), Gaps = 5/338 (1%)
 Frame = +2

Query: 305  NAEIRVSARKRRSKRTIETRMEEHKIESLQQKKKL-----CDLPDIEEFAYGKVNGSAEM 469
            N+++RV  R+ +  R +  ++EE     L Q  K+       LP+IE+FAY   N     
Sbjct: 27   NSKVRVFVRRNKKPRKMAVKLEEEDHLPLTQDHKVPVTQKFGLPEIEDFAYCGGNELTRR 86

Query: 470  SKGHLEPISNILPMRSTNDSPIMPKVDAPANWKEVLDGIRNMRSSEDAPVDSMGCEKAGS 649
             K  +E  S++  + S   S   P   +PA+W++VL+GIR MRSS DAPVD+MGCEKAG 
Sbjct: 87   RKSEME--SDVASVASEVAST-RPGGKSPAHWEKVLEGIRKMRSSADAPVDTMGCEKAGD 143

Query: 650  FLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQNGLLAPDAIDSADEETIKNLIYPVG 829
             LPPKERRFAVLVSSLLSSQTKD VTHGAIQRL QN LL P+AI++ DEETIK LIYPVG
Sbjct: 144  TLPPKERRFAVLVSSLLSSQTKDPVTHGAIQRLLQNDLLTPEAINNVDEETIKKLIYPVG 203

Query: 830  FYSRKACNLKKIAKICLMEYGGDIPSSLKELLLLPGIGPKMAHLVMNVGWNNVQGICVDT 1009
            FY+RKA NLKKIA ICLM+Y GDIPSS+ +LLLLPGIGPKMAHLVMN GWNNVQGICVDT
Sbjct: 204  FYTRKATNLKKIANICLMKYHGDIPSSIDQLLLLPGIGPKMAHLVMNAGWNNVQGICVDT 263

Query: 1010 HVHRICNRLRWVSKPGTGLKTLSPEETRESLQLWLPKDEWIAINPLLVGFGQTLCTPLKP 1189
            HVHRICNRL WVS+ GT  KT +PEETRESLQ WLPK+EW+ INPLLVGFGQT+CTPL+P
Sbjct: 264  HVHRICNRLGWVSRLGTNQKTSTPEETRESLQRWLPKEEWVPINPLLVGFGQTICTPLRP 323

Query: 1190 RCEMCSINGLCPSAFKETASRMHRTKKSSRVKGS*LNK 1303
            RC  CS+  LCPSAFKET++    +  SS+ K   LNK
Sbjct: 324  RCGECSVRDLCPSAFKETSN----SSPSSKSKKPGLNK 357


>ref|XP_002309812.1| endonuclease-related family protein [Populus trichocarpa]
            gi|222852715|gb|EEE90262.1| endonuclease-related family
            protein [Populus trichocarpa]
          Length = 362

 Score =  413 bits (1062), Expect = e-113
 Identities = 219/359 (61%), Positives = 257/359 (71%), Gaps = 8/359 (2%)
 Frame = +2

Query: 233  RQMPETRSVSAKLQSKHEI--------PNAEPNAEIRVSARKRRSKRTIETRMEEHKIES 388
            ++MP TR  S  LQSK EI        PN     E+RV  RKR+ K T+E   +E K+E 
Sbjct: 23   KKMPNTRFSSKSLQSKTEISTSDTVPGPNEVSVPEVRVFVRKRKVKTTVEAAEKEVKVEP 82

Query: 389  LQQKKKLCDLPDIEEFAYGKVNGSAEMSKGHLEPISNILPMRSTNDSPIMPKVDAPANWK 568
              +K+KL  LPDIEEFAY K NG A + K  L+   N+LP+ S   S I P  + P NW 
Sbjct: 83   --RKQKLSALPDIEEFAYKKGNGPALIRK--LKSTENVLPVDSEAASTIRPAGEPPLNWD 138

Query: 569  EVLDGIRNMRSSEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRL 748
            +VL+GI  MRSSEDAPVD+MGCEKAG  LPP      V++S+            GAIQRL
Sbjct: 139  KVLEGIHKMRSSEDAPVDTMGCEKAGISLPP-----GVVLSA------------GAIQRL 181

Query: 749  HQNGLLAPDAIDSADEETIKNLIYPVGFYSRKACNLKKIAKICLMEYGGDIPSSLKELLL 928
             QN LL  DAID ADE  IK+LIYPVGFY+RKA NLKKIAKICL++Y GDIPSSL++LL 
Sbjct: 182  QQNNLLTADAIDKADETAIKDLIYPVGFYTRKASNLKKIAKICLLKYDGDIPSSLEDLLS 241

Query: 929  LPGIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGLKTLSPEETRESLQL 1108
            LPGIGPKMAHLVMN+ WNNVQGICVDTHVHRICNRL WV++PGT  KT +PEETRE+LQL
Sbjct: 242  LPGIGPKMAHLVMNIAWNNVQGICVDTHVHRICNRLGWVARPGTKQKTSTPEETREALQL 301

Query: 1109 WLPKDEWIAINPLLVGFGQTLCTPLKPRCEMCSINGLCPSAFKETASRMHRTKKSSRVK 1285
            WLPKDEW+ INPLLVGFGQT+CTPL+PRC MC I+  CPSAFKET+S   + K+S   K
Sbjct: 302  WLPKDEWVPINPLLVGFGQTICTPLRPRCGMCCISEFCPSAFKETSSPASKQKRSGGSK 360


>ref|XP_003539044.2| PREDICTED: endonuclease III-like protein 1-like [Glycine max]
          Length = 357

 Score =  411 bits (1056), Expect = e-112
 Identities = 214/339 (63%), Positives = 253/339 (74%), Gaps = 6/339 (1%)
 Frame = +2

Query: 305  NAEIRVSARKRRSKRTIETRMEEHKIESLQQK-KKLCDLPDIEEFAYGKVN-----GSAE 466
            ++++RV  R+ +  R +  ++E+   + L+        LP+IEEFAY         G +E
Sbjct: 28   HSQVRVFMRRNKRPRNMALKLEQSDHQDLKVPVTHKFGLPEIEEFAYCGAKELTQCGKSE 87

Query: 467  MSKGHLEPISNILPMRSTNDSPIMPKVDAPANWKEVLDGIRNMRSSEDAPVDSMGCEKAG 646
            M    +   S +   RS+ +SP        A W++VL+GIR MR S DAPVD+MGCEKAG
Sbjct: 88   MGSDAIPVASEVASTRSSGESP--------AQWEKVLEGIRKMRCSADAPVDTMGCEKAG 139

Query: 647  SFLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQNGLLAPDAIDSADEETIKNLIYPV 826
              LPPKERRFAVLVSSLLSSQTKD VTHGAIQRL QN LL  DAI+ ADEETIK LIYPV
Sbjct: 140  ETLPPKERRFAVLVSSLLSSQTKDPVTHGAIQRLLQNDLLTADAINDADEETIKKLIYPV 199

Query: 827  GFYSRKACNLKKIAKICLMEYGGDIPSSLKELLLLPGIGPKMAHLVMNVGWNNVQGICVD 1006
            GFY+RKA NLKKIA ICLM+Y GDIPSS+++LLLLPGIGPKMAHLVMNVGWNNVQGICVD
Sbjct: 200  GFYTRKASNLKKIANICLMKYDGDIPSSIEQLLLLPGIGPKMAHLVMNVGWNNVQGICVD 259

Query: 1007 THVHRICNRLRWVSKPGTGLKTLSPEETRESLQLWLPKDEWIAINPLLVGFGQTLCTPLK 1186
            THVHRICNRL WVS+ GT  KT +PEETRE LQ WLPK+EW+ INPLLVGFGQT+CTPL+
Sbjct: 260  THVHRICNRLGWVSRLGTKQKTSTPEETREELQRWLPKEEWVPINPLLVGFGQTICTPLR 319

Query: 1187 PRCEMCSINGLCPSAFKETASRMHRTKKSSRVKGS*LNK 1303
            PRC  CSI+ LCPSAFKET+   + +  SS+ K S LNK
Sbjct: 320  PRCGECSISELCPSAFKETS---NSSPSSSKSKKSGLNK 355


>ref|XP_004507328.1| PREDICTED: endonuclease III-like protein 1-like [Cicer arietinum]
          Length = 387

 Score =  411 bits (1056), Expect = e-112
 Identities = 208/324 (64%), Positives = 246/324 (75%), Gaps = 3/324 (0%)
 Frame = +2

Query: 323  SARKRRSKRTIETRMEE-HKIESLQQKKKLCDLPDIEEFAYGKVNGSAEMSKGHLEPISN 499
            S   +R+K    T++++ H +   Q  KK   LP+IE+FAY   N   +  K  +     
Sbjct: 62   SNNNKRAKGITTTKLQQNHHLPPTQTHKKFGGLPEIEDFAYRGPNELTQFRKSEISSDVI 121

Query: 500  ILPMRSTNDSPIMPKVDAPANWKEVLDGIRNMRSSEDAPVDSMGCEKAGSFLPPKERRFA 679
            + P   +  +    + ++PA+W+E L+GIR MR S DAPVD+MGCEKAGS LPPKERRFA
Sbjct: 122  VKPAEESEVASAAHRSESPADWEETLEGIRKMRCSADAPVDTMGCEKAGSTLPPKERRFA 181

Query: 680  VLVSSLLSSQTKDGVTHGAIQRLHQNGLLAPDAIDSADEETIKNLIYPVGFYSRKACNLK 859
            VLVSSLLSSQTKD V HGAIQRL QN LL PDAI++ADEETIK LIYPVGFY+RKA NLK
Sbjct: 182  VLVSSLLSSQTKDHVNHGAIQRLLQNDLLTPDAINNADEETIKKLIYPVGFYTRKATNLK 241

Query: 860  KIAKICLMEYGGDIPSSLKELLLLPGIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLR 1039
            KIA ICLM+YGGDIPS+L++LLLLPGIGPKMAHLVMNV WNNVQGICVDTHVHRICNRL 
Sbjct: 242  KIANICLMKYGGDIPSTLEQLLLLPGIGPKMAHLVMNVAWNNVQGICVDTHVHRICNRLG 301

Query: 1040 WVSKPGTGLKTLSPEETRESLQLWLPKDEWIAINPLLVGFGQTLCTPLKPRCEMCSINGL 1219
            WVS+ GT  KTL+PEETRESLQ WLP++EW  INPLLVGFGQT+CTPL+PRC  C I+ L
Sbjct: 302  WVSRLGTKQKTLTPEETRESLQRWLPREEWDPINPLLVGFGQTICTPLRPRCGECGISHL 361

Query: 1220 CPSAFKET--ASRMHRTKKSSRVK 1285
            C SAFKE   +S   ++ KS R K
Sbjct: 362  CLSAFKEASDSSSFSKSTKSRRNK 385


>ref|XP_002881177.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
            gi|297327016|gb|EFH57436.1| predicted protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 354

 Score =  411 bits (1056), Expect = e-112
 Identities = 207/349 (59%), Positives = 254/349 (72%), Gaps = 3/349 (0%)
 Frame = +2

Query: 248  TRSVSAKLQ---SKHEIPNAEPNAEIRVSARKRRSKRTIETRMEEHKIESLQQKKKLCDL 418
            ++ +S+K Q   S     N    +  RV  RK+R K+     +E +  + +   K+L  L
Sbjct: 13   SKPISSKTQRPLSDSNSANGASGSVTRVYTRKKRLKQEASEPLEINPGKGVNTHKQLRGL 72

Query: 419  PDIEEFAYGKVNGSAEMSKGHLEPISNILPMRSTNDSPIMPKVDAPANWKEVLDGIRNMR 598
            PDIE+FAY K  GS         P S      S   + +    + P NW +VL+GIR MR
Sbjct: 73   PDIEDFAYKKTIGS---------PSSRRSTETSITVTSVKTAGNPPENWVKVLEGIRQMR 123

Query: 599  SSEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTHGAIQRLHQNGLLAPDA 778
            SSEDAPVDSMGC+KAGSFLPP ERRFAVL+ +LLSSQTKD V + AI RLHQN LL P+A
Sbjct: 124  SSEDAPVDSMGCDKAGSFLPPTERRFAVLLGALLSSQTKDQVNNAAIHRLHQNSLLTPEA 183

Query: 779  IDSADEETIKNLIYPVGFYSRKACNLKKIAKICLMEYGGDIPSSLKELLLLPGIGPKMAH 958
            +D ADE TI+ LIYPVGFY+RKA  +KKIA+ICL++Y GDIPSSL +LL LPGIGPKMAH
Sbjct: 184  VDKADESTIRELIYPVGFYTRKATYMKKIARICLVKYNGDIPSSLDDLLSLPGIGPKMAH 243

Query: 959  LVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGLKTLSPEETRESLQLWLPKDEWIAI 1138
            L++++ WN+VQGICVDTHVHRICNRL WVS+PGT  KT SPEETR +LQ WLPK+EW+AI
Sbjct: 244  LILHIAWNDVQGICVDTHVHRICNRLGWVSRPGTKQKTTSPEETRVALQQWLPKEEWVAI 303

Query: 1139 NPLLVGFGQTLCTPLKPRCEMCSINGLCPSAFKETASRMHRTKKSSRVK 1285
            NPLLVGFGQT+CTPL+PRCE CS+  LCP+AFKET+S   + KKS+R K
Sbjct: 304  NPLLVGFGQTICTPLRPRCEACSVTKLCPAAFKETSSPSSKLKKSNRSK 352


>ref|XP_007131960.1| hypothetical protein PHAVU_011G055100g [Phaseolus vulgaris]
            gi|561004960|gb|ESW03954.1| hypothetical protein
            PHAVU_011G055100g [Phaseolus vulgaris]
          Length = 408

 Score =  410 bits (1055), Expect = e-112
 Identities = 221/380 (58%), Positives = 270/380 (71%), Gaps = 22/380 (5%)
 Frame = +2

Query: 230  IRQMPETRSVSAKLQSKHEIPNAE-------PNA----------EIRVSARKRRSKRTIE 358
            +R+  + R ++ KL+ +  +P  +       PN+          + RV  R+ ++ R + 
Sbjct: 34   VRRNKKPRKMAVKLEEEDHLPLTQDHKVPVTPNSATSFIEASHSKARVFVRRNKNPRKMA 93

Query: 359  TRMEEHK-IESLQQKK----KLCDLPDIEEFAYGKVNGSAEMSKGHLEPISNILPMRSTN 523
             ++EE   + S Q  K    +   LP+IE+FAY   N      K  +E  S++  + S  
Sbjct: 94   VKLEEEDHLPSTQDHKVPVTQKFGLPEIEDFAYCGGNELTRRRKSEME--SDVASVASEV 151

Query: 524  DSPIMPKVDAPANWKEVLDGIRNMRSSEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLS 703
             S   P   +PA+W++VL+GIR MRSS DAPVD+MGCEKAG  LPPKERRFAVLVSSLLS
Sbjct: 152  AST-RPGGKSPAHWEKVLEGIRKMRSSADAPVDTMGCEKAGDTLPPKERRFAVLVSSLLS 210

Query: 704  SQTKDGVTHGAIQRLHQNGLLAPDAIDSADEETIKNLIYPVGFYSRKACNLKKIAKICLM 883
            SQTKD VTHGAIQRL QN LL P+AI++ DEETIK LIYPVGFY+RKA NLKKIA ICLM
Sbjct: 211  SQTKDPVTHGAIQRLLQNDLLTPEAINNVDEETIKKLIYPVGFYTRKATNLKKIANICLM 270

Query: 884  EYGGDIPSSLKELLLLPGIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTG 1063
            +Y GDIPSS+ +LLLLPGIGPKMAHLVMN GWNNVQGICVDTHVHRICNRL WVS+ GT 
Sbjct: 271  KYHGDIPSSIDQLLLLPGIGPKMAHLVMNAGWNNVQGICVDTHVHRICNRLGWVSRLGTN 330

Query: 1064 LKTLSPEETRESLQLWLPKDEWIAINPLLVGFGQTLCTPLKPRCEMCSINGLCPSAFKET 1243
             KT +PEETRESLQ WLPK+EW+ INPLLVGFGQT+CTPL+PRC  CS+  LCPSAFKET
Sbjct: 331  QKTSTPEETRESLQRWLPKEEWVPINPLLVGFGQTICTPLRPRCGECSVRDLCPSAFKET 390

Query: 1244 ASRMHRTKKSSRVKGS*LNK 1303
            ++    +  SS+ K   LNK
Sbjct: 391  SN----SSPSSKSKKPGLNK 406


>ref|XP_002889575.1| hypothetical protein ARALYDRAFT_470604 [Arabidopsis lyrata subsp.
            lyrata] gi|297335417|gb|EFH65834.1| hypothetical protein
            ARALYDRAFT_470604 [Arabidopsis lyrata subsp. lyrata]
          Length = 384

 Score =  406 bits (1044), Expect = e-110
 Identities = 205/365 (56%), Positives = 260/365 (71%), Gaps = 8/365 (2%)
 Frame = +2

Query: 215  RRVCLIRQMPETRSVSAKLQSKHEIPNAEPNA-------EIRVSARKRRSKRTIETRMEE 373
            RR+     +   +S+SA+  +     N +  A       E RVS RK+R K+     +++
Sbjct: 23   RRMYAAATLSSAKSISAESLNPRPDSNFDSGAAIGTSESETRVSLRKKRLKQEDLEPVQQ 82

Query: 374  HKIESLQQKKKLCDLPDIEEFAYGKVNGSAEMSKGHLEPISNILPMRSTNDSPIMPKVDA 553
                 +  +K++C LPDIEE  Y K NGSA      +        ++ST  S  +     
Sbjct: 83   CSSRGINARKEMCGLPDIEESPYKKTNGSASSRTSKINSF-----IKSTEASTSIKTAGI 137

Query: 554  PA-NWKEVLDGIRNMRSSEDAPVDSMGCEKAGSFLPPKERRFAVLVSSLLSSQTKDGVTH 730
            P  NWK+VL+GI+ M+SSE+AP +++ C++ GSFLPPKERRF VL+ +LLSSQTK+ +T 
Sbjct: 138  PPENWKKVLEGIQKMKSSEEAPANAVECDRTGSFLPPKERRFYVLIGTLLSSQTKEHITG 197

Query: 731  GAIQRLHQNGLLAPDAIDSADEETIKNLIYPVGFYSRKACNLKKIAKICLMEYGGDIPSS 910
             A++RLHQNGLL P+AID ADE TIK LIYPVGFY+RKA N+KK+AKICLM+Y GDIP +
Sbjct: 198  AAVERLHQNGLLTPEAIDKADESTIKELIYPVGFYTRKATNVKKVAKICLMKYDGDIPRT 257

Query: 911  LKELLLLPGIGPKMAHLVMNVGWNNVQGICVDTHVHRICNRLRWVSKPGTGLKTLSPEET 1090
            L+ELL LPG+GPK+AHLV++V WN+VQGICVDTHVHRICNRL WVSKPGT  KTLSPEET
Sbjct: 258  LEELLSLPGVGPKIAHLVLHVAWNDVQGICVDTHVHRICNRLGWVSKPGTKQKTLSPEET 317

Query: 1091 RESLQLWLPKDEWIAINPLLVGFGQTLCTPLKPRCEMCSINGLCPSAFKETASRMHRTKK 1270
            R +LQ WLPK+EW+AIN LLVGFGQT+CTPL+PRC  CSI  LCPSAFKET S   + KK
Sbjct: 318  RVALQQWLPKEEWVAINFLLVGFGQTICTPLRPRCGTCSITELCPSAFKETPSTSSKLKK 377

Query: 1271 SSRVK 1285
            S + K
Sbjct: 378  SIKSK 382

