BLASTX nr result

ID: Akebia26_contig00024886 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia26_contig00024886
         (799 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI15085.3| unnamed protein product [Vitis vinifera]              253   8e-65
ref|XP_002283633.1| PREDICTED: endonuclease III-like [Vitis vini...   253   8e-65
ref|XP_006439743.1| hypothetical protein CICLE_v10021561mg [Citr...   248   2e-63
ref|XP_007036110.1| DNA glycosylase superfamily protein isoform ...   248   2e-63
ref|XP_007036109.1| DNA glycosylase superfamily protein isoform ...   248   2e-63
ref|XP_007036108.1| DNA glycosylase superfamily protein isoform ...   248   2e-63
ref|XP_006476720.1| PREDICTED: protein ROS1-like isoform X3 [Cit...   245   1e-62
ref|XP_006476719.1| PREDICTED: protein ROS1-like isoform X2 [Cit...   245   1e-62
ref|XP_006476718.1| PREDICTED: protein ROS1-like isoform X1 [Cit...   245   1e-62
ref|XP_004299588.1| PREDICTED: DEMETER-like protein 3-like [Frag...   243   6e-62
ref|XP_002511456.1| Endonuclease III, putative [Ricinus communis...   243   6e-62
ref|XP_002875868.1| predicted protein [Arabidopsis lyrata subsp....   237   4e-60
ref|XP_006404333.1| hypothetical protein EUTSA_v10010580mg [Eutr...   233   5e-59
ref|XP_002321564.2| hypothetical protein POPTR_0015s08260g [Popu...   232   1e-58
ref|XP_006291571.1| hypothetical protein CARUB_v10017731mg [Caps...   231   2e-58
ref|NP_566893.1| DNA glycosylase superfamily protein [Arabidopsi...   231   2e-58
gb|EXB42063.1| Protein ROS1 [Morus notabilis]                         231   2e-58
ref|XP_006836744.1| hypothetical protein AMTR_s00088p00146000 [A...   231   2e-58
ref|XP_004236146.1| PREDICTED: endonuclease III-like [Solanum ly...   228   3e-57
ref|XP_006345014.1| PREDICTED: protein ROS1-like [Solanum tubero...   226   1e-56

>emb|CBI15085.3| unnamed protein product [Vitis vinifera]
          Length = 310

 Score =  253 bits (645), Expect = 8e-65
 Identities = 136/228 (59%), Positives = 158/228 (69%), Gaps = 11/228 (4%)
 Frame = -1

Query: 685 MQRNRKRKNLHSIS-----KPQIKXXXXXXXXXXXXPRPTSQECQSVRDSLLTLHGFPQE 521
           MQR+RKRK   S S       +              PRPT  EC++VRD LL LHGFPQ 
Sbjct: 1   MQRSRKRKQEESSSCSKESATKSARNDVVVDPYPSHPRPTPVECRAVRDDLLALHGFPQR 60

Query: 520 FAKYRRTDY------LNPDYSSNGSEPSQLVKSEPSSTSQEPQKESVLDGLISILLSQNT 359
           F KYR+          +P     G  P +L  S+    +   QKESVLDGL+SI+LSQNT
Sbjct: 61  FEKYRKLRLPPLPHTSSPGLDGGGGTPVKLDPSDGDDVNGSSQKESVLDGLVSIILSQNT 120

Query: 358 TDANSTRAFASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGK 179
           TD NS RAFASLKS FPTW+DVLAA+ K +EN+I+CGGLAVTKASCIK +L+ LLE+KGK
Sbjct: 121 TDVNSQRAFASLKSAFPTWQDVLAADSKSIENAIRCGGLAVTKASCIKKMLSCLLERKGK 180

Query: 178 LCLEYLRGMSTDEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHV 35
           LCLEYLR ++ DEIK EL  FKGIGPKTVACVLMFHLQ+DDFPVDTHV
Sbjct: 181 LCLEYLRDLTVDEIKTELSHFKGIGPKTVACVLMFHLQRDDFPVDTHV 228


>ref|XP_002283633.1| PREDICTED: endonuclease III-like [Vitis vinifera]
          Length = 310

 Score =  253 bits (645), Expect = 8e-65
 Identities = 136/228 (59%), Positives = 158/228 (69%), Gaps = 11/228 (4%)
 Frame = -1

Query: 685 MQRNRKRKNLHSIS-----KPQIKXXXXXXXXXXXXPRPTSQECQSVRDSLLTLHGFPQE 521
           MQR+RKRK   S S       +              PRPT  EC++VRD LL LHGFPQ 
Sbjct: 1   MQRSRKRKQEESSSCSKESATKSARNDVVVDPYPSHPRPTPVECRAVRDDLLALHGFPQR 60

Query: 520 FAKYRRTDY------LNPDYSSNGSEPSQLVKSEPSSTSQEPQKESVLDGLISILLSQNT 359
           F KYR+          +P     G  P +L  S+    +   QKESVLDGL+SI+LSQNT
Sbjct: 61  FEKYRKLRLPPLPHTSSPGLDGGGGTPVKLDPSDGDDVNGSSQKESVLDGLVSIILSQNT 120

Query: 358 TDANSTRAFASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGK 179
           TD NS RAFASLKS FPTW+DVLAA+ K +EN+I+CGGLAVTKASCIK +L+ LLE+KGK
Sbjct: 121 TDVNSQRAFASLKSAFPTWQDVLAADSKSIENAIRCGGLAVTKASCIKKMLSCLLERKGK 180

Query: 178 LCLEYLRGMSTDEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHV 35
           LCLEYLR ++ DEIK EL  FKGIGPKTVACVLMFHLQ+DDFPVDTHV
Sbjct: 181 LCLEYLRDLTVDEIKTELSHFKGIGPKTVACVLMFHLQRDDFPVDTHV 228


>ref|XP_006439743.1| hypothetical protein CICLE_v10021561mg [Citrus clementina]
           gi|557542005|gb|ESR52983.1| hypothetical protein
           CICLE_v10021561mg [Citrus clementina]
          Length = 281

 Score =  248 bits (632), Expect = 2e-63
 Identities = 132/224 (58%), Positives = 157/224 (70%)
 Frame = -1

Query: 685 MQRNRKRKNLHSISKPQIKXXXXXXXXXXXXPRPTSQECQSVRDSLLTLHGFPQEFAKYR 506
           MQ++RKRK        Q++             RPT++EC+ +RD LL LHGFP EF KYR
Sbjct: 1   MQKSRKRK--------QVEVTETRQDPYPTHSRPTAEECRGIRDELLALHGFPPEFVKYR 52

Query: 505 RTDYLNPDYSSNGSEPSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTTDANSTRAFAS 326
                +       S P  +      S   E ++ESVLDGL+  LLSQNTT+ANS +AFAS
Sbjct: 53  NQRLKHNMTRDKNSVPLDM------SEYDEGEEESVLDGLVKTLLSQNTTEANSLKAFAS 106

Query: 325 LKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLEYLRGMST 146
           LKS FPTWE VLAAE K +EN+I+CGGLA TKA+CIKN+L  LLE KGKLCLEYLRG+S 
Sbjct: 107 LKSTFPTWEHVLAAEQKCIENAIRCGGLAPTKAACIKNILKCLLESKGKLCLEYLRGLSI 166

Query: 145 DEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVSFQVVRA 14
           DEIKAEL  F+GIGPKTVACVLMFHLQQDDFPVDTHV F++ +A
Sbjct: 167 DEIKAELSRFRGIGPKTVACVLMFHLQQDDFPVDTHV-FEISKA 209


>ref|XP_007036110.1| DNA glycosylase superfamily protein isoform 3 [Theobroma cacao]
           gi|508773355|gb|EOY20611.1| DNA glycosylase superfamily
           protein isoform 3 [Theobroma cacao]
          Length = 264

 Score =  248 bits (632), Expect = 2e-63
 Identities = 134/225 (59%), Positives = 158/225 (70%)
 Frame = -1

Query: 688 KMQRNRKRKNLHSISKPQIKXXXXXXXXXXXXPRPTSQECQSVRDSLLTLHGFPQEFAKY 509
           KMQ++RKRK L  I                   RPT  EC+SVRD LL LHGFP EF KY
Sbjct: 2   KMQKSRKRKQL-GIDGHSKTPKITTEEPYPSHHRPTPDECRSVRDELLALHGFPAEFLKY 60

Query: 508 RRTDYLNPDYSSNGSEPSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTTDANSTRAFA 329
           R    +        +EP+   KSEP + + +  +ESVLDGL+  +LSQNTT+ NS +AFA
Sbjct: 61  RHQRLIK-------TEPTIDAKSEPLNNNYDDGEESVLDGLVKTVLSQNTTELNSQKAFA 113

Query: 328 SLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLEYLRGMS 149
           SLKS FPTWEDVLAAE K +EN+I+CGGLA  KASCIKN+L  L E+KGKLC EYLR +S
Sbjct: 114 SLKSAFPTWEDVLAAESKNLENAIRCGGLAPRKASCIKNVLRCLHERKGKLCFEYLRDLS 173

Query: 148 TDEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVSFQVVRA 14
            DEIKAEL  FKG+GPKTVACVLMF+LQQDDFPVDTHV F++ RA
Sbjct: 174 IDEIKAELSNFKGVGPKTVACVLMFNLQQDDFPVDTHV-FEIARA 217


>ref|XP_007036109.1| DNA glycosylase superfamily protein isoform 2 [Theobroma cacao]
           gi|508773354|gb|EOY20610.1| DNA glycosylase superfamily
           protein isoform 2 [Theobroma cacao]
          Length = 292

 Score =  248 bits (632), Expect = 2e-63
 Identities = 134/225 (59%), Positives = 158/225 (70%)
 Frame = -1

Query: 688 KMQRNRKRKNLHSISKPQIKXXXXXXXXXXXXPRPTSQECQSVRDSLLTLHGFPQEFAKY 509
           KMQ++RKRK L  I                   RPT  EC+SVRD LL LHGFP EF KY
Sbjct: 2   KMQKSRKRKQL-GIDGHSKTPKITTEEPYPSHHRPTPDECRSVRDELLALHGFPAEFLKY 60

Query: 508 RRTDYLNPDYSSNGSEPSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTTDANSTRAFA 329
           R    +        +EP+   KSEP + + +  +ESVLDGL+  +LSQNTT+ NS +AFA
Sbjct: 61  RHQRLIK-------TEPTIDAKSEPLNNNYDDGEESVLDGLVKTVLSQNTTELNSQKAFA 113

Query: 328 SLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLEYLRGMS 149
           SLKS FPTWEDVLAAE K +EN+I+CGGLA  KASCIKN+L  L E+KGKLC EYLR +S
Sbjct: 114 SLKSAFPTWEDVLAAESKNLENAIRCGGLAPRKASCIKNVLRCLHERKGKLCFEYLRDLS 173

Query: 148 TDEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVSFQVVRA 14
            DEIKAEL  FKG+GPKTVACVLMF+LQQDDFPVDTHV F++ RA
Sbjct: 174 IDEIKAELSNFKGVGPKTVACVLMFNLQQDDFPVDTHV-FEIARA 217


>ref|XP_007036108.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao]
           gi|508773353|gb|EOY20609.1| DNA glycosylase superfamily
           protein isoform 1 [Theobroma cacao]
          Length = 446

 Score =  248 bits (632), Expect = 2e-63
 Identities = 134/225 (59%), Positives = 158/225 (70%)
 Frame = -1

Query: 688 KMQRNRKRKNLHSISKPQIKXXXXXXXXXXXXPRPTSQECQSVRDSLLTLHGFPQEFAKY 509
           KMQ++RKRK L  I                   RPT  EC+SVRD LL LHGFP EF KY
Sbjct: 2   KMQKSRKRKQL-GIDGHSKTPKITTEEPYPSHHRPTPDECRSVRDELLALHGFPAEFLKY 60

Query: 508 RRTDYLNPDYSSNGSEPSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTTDANSTRAFA 329
           R    +        +EP+   KSEP + + +  +ESVLDGL+  +LSQNTT+ NS +AFA
Sbjct: 61  RHQRLIK-------TEPTIDAKSEPLNNNYDDGEESVLDGLVKTVLSQNTTELNSQKAFA 113

Query: 328 SLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLEYLRGMS 149
           SLKS FPTWEDVLAAE K +EN+I+CGGLA  KASCIKN+L  L E+KGKLC EYLR +S
Sbjct: 114 SLKSAFPTWEDVLAAESKNLENAIRCGGLAPRKASCIKNVLRCLHERKGKLCFEYLRDLS 173

Query: 148 TDEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVSFQVVRA 14
            DEIKAEL  FKG+GPKTVACVLMF+LQQDDFPVDTHV F++ RA
Sbjct: 174 IDEIKAELSNFKGVGPKTVACVLMFNLQQDDFPVDTHV-FEIARA 217


>ref|XP_006476720.1| PREDICTED: protein ROS1-like isoform X3 [Citrus sinensis]
          Length = 258

 Score =  245 bits (626), Expect = 1e-62
 Identities = 130/224 (58%), Positives = 158/224 (70%)
 Frame = -1

Query: 685 MQRNRKRKNLHSISKPQIKXXXXXXXXXXXXPRPTSQECQSVRDSLLTLHGFPQEFAKYR 506
           MQ++RKRK        Q++             RPT++EC+ +RD LL LHGFP EF KYR
Sbjct: 1   MQKSRKRK--------QVEVTETRQDPYPTHSRPTAEECRGIRDELLALHGFPPEFVKYR 52

Query: 505 RTDYLNPDYSSNGSEPSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTTDANSTRAFAS 326
                +       S P  + + +      E ++ESVLDGL+  +LSQNTT+ANS +AFAS
Sbjct: 53  NQRLKHNMTRDKNSVPLDMNEYD------EGEEESVLDGLVKTVLSQNTTEANSLKAFAS 106

Query: 325 LKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLEYLRGMST 146
           LKS FPTWE VLAAE K +EN+I+CGGLA TKA+CIKN+L  LLE KGKLCLEYLRG+S 
Sbjct: 107 LKSTFPTWEHVLAAEQKCIENAIRCGGLAPTKAACIKNILKCLLESKGKLCLEYLRGLSI 166

Query: 145 DEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVSFQVVRA 14
           DEIKAEL  F+GIGPKTVACVLMFHLQQDDFPVDTHV F++ +A
Sbjct: 167 DEIKAELSRFRGIGPKTVACVLMFHLQQDDFPVDTHV-FEISKA 209


>ref|XP_006476719.1| PREDICTED: protein ROS1-like isoform X2 [Citrus sinensis]
          Length = 278

 Score =  245 bits (626), Expect = 1e-62
 Identities = 130/224 (58%), Positives = 158/224 (70%)
 Frame = -1

Query: 685 MQRNRKRKNLHSISKPQIKXXXXXXXXXXXXPRPTSQECQSVRDSLLTLHGFPQEFAKYR 506
           MQ++RKRK        Q++             RPT++EC+ +RD LL LHGFP EF KYR
Sbjct: 1   MQKSRKRK--------QVEVTETRQDPYPTHSRPTAEECRGIRDELLALHGFPPEFVKYR 52

Query: 505 RTDYLNPDYSSNGSEPSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTTDANSTRAFAS 326
                +       S P  + + +      E ++ESVLDGL+  +LSQNTT+ANS +AFAS
Sbjct: 53  NQRLKHNMTRDKNSVPLDMNEYD------EGEEESVLDGLVKTVLSQNTTEANSLKAFAS 106

Query: 325 LKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLEYLRGMST 146
           LKS FPTWE VLAAE K +EN+I+CGGLA TKA+CIKN+L  LLE KGKLCLEYLRG+S 
Sbjct: 107 LKSTFPTWEHVLAAEQKCIENAIRCGGLAPTKAACIKNILKCLLESKGKLCLEYLRGLSI 166

Query: 145 DEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVSFQVVRA 14
           DEIKAEL  F+GIGPKTVACVLMFHLQQDDFPVDTHV F++ +A
Sbjct: 167 DEIKAELSRFRGIGPKTVACVLMFHLQQDDFPVDTHV-FEISKA 209


>ref|XP_006476718.1| PREDICTED: protein ROS1-like isoform X1 [Citrus sinensis]
          Length = 281

 Score =  245 bits (626), Expect = 1e-62
 Identities = 130/224 (58%), Positives = 158/224 (70%)
 Frame = -1

Query: 685 MQRNRKRKNLHSISKPQIKXXXXXXXXXXXXPRPTSQECQSVRDSLLTLHGFPQEFAKYR 506
           MQ++RKRK        Q++             RPT++EC+ +RD LL LHGFP EF KYR
Sbjct: 1   MQKSRKRK--------QVEVTETRQDPYPTHSRPTAEECRGIRDELLALHGFPPEFVKYR 52

Query: 505 RTDYLNPDYSSNGSEPSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTTDANSTRAFAS 326
                +       S P  + + +      E ++ESVLDGL+  +LSQNTT+ANS +AFAS
Sbjct: 53  NQRLKHNMTRDKNSVPLDMNEYD------EGEEESVLDGLVKTVLSQNTTEANSLKAFAS 106

Query: 325 LKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLEYLRGMST 146
           LKS FPTWE VLAAE K +EN+I+CGGLA TKA+CIKN+L  LLE KGKLCLEYLRG+S 
Sbjct: 107 LKSTFPTWEHVLAAEQKCIENAIRCGGLAPTKAACIKNILKCLLESKGKLCLEYLRGLSI 166

Query: 145 DEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVSFQVVRA 14
           DEIKAEL  F+GIGPKTVACVLMFHLQQDDFPVDTHV F++ +A
Sbjct: 167 DEIKAELSRFRGIGPKTVACVLMFHLQQDDFPVDTHV-FEISKA 209


>ref|XP_004299588.1| PREDICTED: DEMETER-like protein 3-like [Fragaria vesca subsp.
           vesca]
          Length = 286

 Score =  243 bits (620), Expect = 6e-62
 Identities = 134/225 (59%), Positives = 162/225 (72%), Gaps = 1/225 (0%)
 Frame = -1

Query: 685 MQRNRKRKN-LHSISKPQIKXXXXXXXXXXXXPRPTSQECQSVRDSLLTLHGFPQEFAKY 509
           M +NRKRK    +   P++              RPT +EC SVRD LL LHGFP+EFAKY
Sbjct: 1   MPKNRKRKEQAEADHNPKLPTKTTPKDPYPNHARPTREECVSVRDDLLALHGFPKEFAKY 60

Query: 508 RRTDYLNPDYSSNGSEPSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTTDANSTRAFA 329
           R     +   +SNG +    V SEP       +KESVLDGL+  LLSQNTT++NS +AFA
Sbjct: 61  REQRLSSQ--ASNGHDND--VSSEPLD-----EKESVLDGLVRTLLSQNTTESNSLKAFA 111

Query: 328 SLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLEYLRGMS 149
           SLKS FPTWE+VLAA+ + +E++I+CGGLA TKASCIKN+L+ LLEKK KLCLEYLR +S
Sbjct: 112 SLKSAFPTWEEVLAADSQSLESAIRCGGLAKTKASCIKNMLSCLLEKKEKLCLEYLRDLS 171

Query: 148 TDEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVSFQVVRA 14
            DEIKAEL  FKGIGPKTVACVLMF LQQDDFPVDTHV +++ +A
Sbjct: 172 VDEIKAELSHFKGIGPKTVACVLMFQLQQDDFPVDTHV-YEIAKA 215


>ref|XP_002511456.1| Endonuclease III, putative [Ricinus communis]
           gi|223550571|gb|EEF52058.1| Endonuclease III, putative
           [Ricinus communis]
          Length = 291

 Score =  243 bits (620), Expect = 6e-62
 Identities = 129/226 (57%), Positives = 159/226 (70%), Gaps = 2/226 (0%)
 Frame = -1

Query: 685 MQRNRKRK--NLHSISKPQIKXXXXXXXXXXXXPRPTSQECQSVRDSLLTLHGFPQEFAK 512
           MQ+NRKRK  +  + +K                PRPT +EC  +RDSLL  HGFPQEFAK
Sbjct: 1   MQKNRKRKLKSAETETKSAKINNGNKEEPYPTHPRPTPEECLCIRDSLLAFHGFPQEFAK 60

Query: 511 YRRTDYLNPDYSSNGSEPSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTTDANSTRAF 332
           YR+      D             ++ S  + +  +E+VLDGL+  +LSQNTT+ NS RAF
Sbjct: 61  YRKQRLGGDD------------DNKSSDVNSDTAEETVLDGLVKTVLSQNTTEVNSQRAF 108

Query: 331 ASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLEYLRGM 152
            +LKS+FPTW+DVLAAE K++EN+I+CGGLA  KASCIKN+L  LLEKKGK+CLEYLR M
Sbjct: 109 DNLKSDFPTWQDVLAAEPKWIENAIRCGGLAPAKASCIKNILNCLLEKKGKICLEYLRDM 168

Query: 151 STDEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVSFQVVRA 14
           S DEIKAEL  FKG+GPKTVACVLMFHLQQ+DFPVDTHV F++ +A
Sbjct: 169 SVDEIKAELSQFKGVGPKTVACVLMFHLQQEDFPVDTHV-FEIAKA 213


>ref|XP_002875868.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
           gi|297321706|gb|EFH52127.1| predicted protein
           [Arabidopsis lyrata subsp. lyrata]
          Length = 294

 Score =  237 bits (604), Expect = 4e-60
 Identities = 127/228 (55%), Positives = 161/228 (70%), Gaps = 4/228 (1%)
 Frame = -1

Query: 685 MQRNRKRKNLHS----ISKPQIKXXXXXXXXXXXXPRPTSQECQSVRDSLLTLHGFPQEF 518
           M + +KRK L+        P IK             RPT++EC+ VRD+LL+LHGFP EF
Sbjct: 1   MSKAQKRKRLNQDDGESKTPAIKSTVDGSNPYPTLLRPTAEECREVRDALLSLHGFPPEF 60

Query: 517 AKYRRTDYLNPDYSSNGSEPSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTTDANSTR 338
           A YRR   L    + +G +    +KSEP   ++E   ESVLDGL+ ILLSQNTT++NS R
Sbjct: 61  ANYRR-QRLRSLSAVDGHDTQCTMKSEPLDEAEE---ESVLDGLVKILLSQNTTESNSQR 116

Query: 337 AFASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLEYLR 158
           AFASLK+ FP WEDVLAAE K +E++I+CGGLA  KA CIKN+L RL  ++G LCLEYLR
Sbjct: 117 AFASLKAAFPNWEDVLAAESKSIESAIRCGGLAPKKAVCIKNILNRLQTERGVLCLEYLR 176

Query: 157 GMSTDEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVSFQVVRA 14
           G+S +E+K EL  FKGIGPKTV+CVLMF+LQ +DFPVDTHV F++ +A
Sbjct: 177 GLSVEEVKTELSHFKGIGPKTVSCVLMFNLQHNDFPVDTHV-FEIAKA 223


>ref|XP_006404333.1| hypothetical protein EUTSA_v10010580mg [Eutrema salsugineum]
           gi|557105452|gb|ESQ45786.1| hypothetical protein
           EUTSA_v10010580mg [Eutrema salsugineum]
          Length = 302

 Score =  233 bits (595), Expect = 5e-59
 Identities = 124/231 (53%), Positives = 158/231 (68%), Gaps = 7/231 (3%)
 Frame = -1

Query: 685 MQRNRKRKNLH----SISKPQIKXXXXXXXXXXXXPRPTSQECQSVRDSLLTLHGFPQEF 518
           M +++KR  LH        P  K             RPTS EC+ VRD+LL+LHGFP EF
Sbjct: 1   MSKSQKRTRLHLDDGDSKTPATKSTVYGGDPYPSHLRPTSDECRDVRDALLSLHGFPPEF 60

Query: 517 AKYRRTDYLNPDYSSNGSEPSQLVKSEPSSTSQEPQ---KESVLDGLISILLSQNTTDAN 347
             YRR   L    + +G      +KSEP   + + +   +E+VLDGL+ ILLSQNTT+ N
Sbjct: 61  DSYRR-QRLRSSSAVDGYHTHCTMKSEPLEAANDEKDEIEETVLDGLVKILLSQNTTEIN 119

Query: 346 STRAFASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLE 167
           S RAFASLK+ FP WEDVL AE K +EN+I+CGGLA  KA CIKN+L+RL  ++G+LCLE
Sbjct: 120 SQRAFASLKAAFPKWEDVLGAEPKSIENAIRCGGLAPKKAVCIKNILSRLQSERGRLCLE 179

Query: 166 YLRGMSTDEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVSFQVVRA 14
           YLRG+S +E+K EL  FKGIGPKTV+CVLMF+LQ +DFPVDTHV F++ +A
Sbjct: 180 YLRGLSVEEVKTELSHFKGIGPKTVSCVLMFNLQHNDFPVDTHV-FEIAKA 229


>ref|XP_002321564.2| hypothetical protein POPTR_0015s08260g [Populus trichocarpa]
           gi|550322300|gb|EEF05691.2| hypothetical protein
           POPTR_0015s08260g [Populus trichocarpa]
          Length = 306

 Score =  232 bits (592), Expect = 1e-58
 Identities = 129/241 (53%), Positives = 164/241 (68%), Gaps = 17/241 (7%)
 Frame = -1

Query: 685 MQRNRKRKNLHSISKPQIKXXXXXXXXXXXXP-------RPTSQECQSVRDSLLTLHGFP 527
           MQ   KRK  H + KP+                      RPT +EC+++RDSLL  HGFP
Sbjct: 1   MQTGHKRKQQHEL-KPRTNKKSAETISNIKEEEPFPTHARPTPEECRAIRDSLLAFHGFP 59

Query: 526 QEFAKYRRT-DYL--------NPDYSSN-GSEPSQLVKSEPSSTSQEPQKESVLDGLISI 377
           QEFAKYR+   YL        +P   +N   +   +VK E     +E ++ESVLDGL+  
Sbjct: 60  QEFAKYRKQRPYLITLQDKEESPHLINNCDGKNDNVVKVEEE---EEEEEESVLDGLVKT 116

Query: 376 LLSQNTTDANSTRAFASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRL 197
           +LSQNTT+ NS RAF +LKS FPTWE+VLAAE KF+E++I+CGGLA TKA+CI+N+L+ L
Sbjct: 117 VLSQNTTEVNSQRAFLNLKSAFPTWENVLAAESKFIEDAIRCGGLAPTKAACIRNILSSL 176

Query: 196 LEKKGKLCLEYLRGMSTDEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVSFQVVR 17
           +EK G+LCLEYLR +   EIKAEL  FKGIGPKTVACVLMF+LQ+DDFPVDTHV F++ +
Sbjct: 177 MEKNGRLCLEYLRDLPVAEIKAELSHFKGIGPKTVACVLMFNLQKDDFPVDTHV-FEIAK 235

Query: 16  A 14
           A
Sbjct: 236 A 236


>ref|XP_006291571.1| hypothetical protein CARUB_v10017731mg [Capsella rubella]
           gi|482560278|gb|EOA24469.1| hypothetical protein
           CARUB_v10017731mg [Capsella rubella]
          Length = 298

 Score =  231 bits (590), Expect = 2e-58
 Identities = 127/229 (55%), Positives = 160/229 (69%), Gaps = 5/229 (2%)
 Frame = -1

Query: 685 MQRNRKRKNLHS----ISKPQIKXXXXXXXXXXXXPRPTSQECQSVRDSLLTLHGFPQEF 518
           M + +KRK L+        P IK             RPT++EC+ VRD+LL+LHGFP EF
Sbjct: 1   MSKAQKRKRLNQGDGESKTPVIKSAVDGGDPYPALLRPTAEECRDVRDALLSLHGFPPEF 60

Query: 517 AKYRRTDY-LNPDYSSNGSEPSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTTDANST 341
           A YRR    L      +G++ +  VK EP   ++E   ESVLDGL+ ILLSQNTT++NS 
Sbjct: 61  ASYRRKRLRLFSAVDDHGTQCT--VKPEPLDEAEE---ESVLDGLVKILLSQNTTESNSL 115

Query: 340 RAFASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLEYL 161
           RAFASLK+ FP WEDVLAAE   +EN+I+CGGLA  KA CIKN+L RL  +KG LCLEYL
Sbjct: 116 RAFASLKAAFPKWEDVLAAESISIENAIRCGGLAPKKAVCIKNILNRLQNEKGVLCLEYL 175

Query: 160 RGMSTDEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVSFQVVRA 14
           R +S DE+K+EL  FKG+GPKTV+CVLMF+LQ +DFPVDTHV F++ +A
Sbjct: 176 RSLSVDEVKSELSQFKGVGPKTVSCVLMFNLQHNDFPVDTHV-FEIAKA 223


>ref|NP_566893.1| DNA glycosylase superfamily protein [Arabidopsis thaliana]
           gi|332644814|gb|AEE78335.1| DNA glycosylase superfamily
           protein [Arabidopsis thaliana]
          Length = 293

 Score =  231 bits (590), Expect = 2e-58
 Identities = 124/229 (54%), Positives = 161/229 (70%), Gaps = 5/229 (2%)
 Frame = -1

Query: 685 MQRNRKRKNLHSIS----KPQIKXXXXXXXXXXXXPRPTSQECQSVRDSLLTLHGFPQEF 518
           M + +KRK L+        P  K             RPT++EC+ VRD+LL+LHGFP EF
Sbjct: 1   MSKAQKRKRLNKYDGESKTPANKSTVDGGNPYPTLLRPTAEECRDVRDALLSLHGFPPEF 60

Query: 517 AKYRRTDYLNPDYSSNGSEPSQL-VKSEPSSTSQEPQKESVLDGLISILLSQNTTDANST 341
           A YRR    +  +S+     +Q  +KSE   T  E ++ESVLDGL+ ILLSQNTT++NS 
Sbjct: 61  ANYRRQRLRS--FSAVDDHDTQCNLKSE---TLNETEEESVLDGLVKILLSQNTTESNSQ 115

Query: 340 RAFASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLEYL 161
           RAFASLK+ FP W+DVL AE K +EN+I+CGGLA  KA CIKN+L RL  ++G+LCLEYL
Sbjct: 116 RAFASLKATFPKWDDVLNAESKSIENAIRCGGLAPKKAVCIKNILNRLQNERGRLCLEYL 175

Query: 160 RGMSTDEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVSFQVVRA 14
           RG+S +E+K EL  FKG+GPKTV+CVLMF+LQ +DFPVDTHV F++ +A
Sbjct: 176 RGLSVEEVKTELSHFKGVGPKTVSCVLMFNLQHNDFPVDTHV-FEIAKA 223


>gb|EXB42063.1| Protein ROS1 [Morus notabilis]
          Length = 308

 Score =  231 bits (589), Expect = 2e-58
 Identities = 122/191 (63%), Positives = 139/191 (72%)
 Frame = -1

Query: 586 PTSQECQSVRDSLLTLHGFPQEFAKYRRTDYLNPDYSSNGSEPSQLVKSEPSSTSQEPQK 407
           PT  +C++VRD LL LHGFPQEFAKYRR        + NG E                 K
Sbjct: 68  PTPDQCRAVRDDLLALHGFPQEFAKYRR----QKPTTDNGEESES--------------K 109

Query: 406 ESVLDGLISILLSQNTTDANSTRAFASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKA 227
           ESVLDGL+  +LSQNTT+ANS RAFASLKS FPTWE VL A+ K +E++I+CGGLA  KA
Sbjct: 110 ESVLDGLVMTVLSQNTTEANSQRAFASLKSAFPTWEQVLNADSKCIEDAIRCGGLAPKKA 169

Query: 226 SCIKNLLTRLLEKKGKLCLEYLRGMSTDEIKAELLGFKGIGPKTVACVLMFHLQQDDFPV 47
           SCIKN L  LLE+KGKLCLEYL   S DE+KAEL  FKGIGPKTVACVLMFHLQQDDFPV
Sbjct: 170 SCIKNTLRSLLERKGKLCLEYLLDFSVDEVKAELSCFKGIGPKTVACVLMFHLQQDDFPV 229

Query: 46  DTHVSFQVVRA 14
           DTHV F++ +A
Sbjct: 230 DTHV-FEIAKA 239


>ref|XP_006836744.1| hypothetical protein AMTR_s00088p00146000 [Amborella trichopoda]
           gi|548839304|gb|ERM99597.1| hypothetical protein
           AMTR_s00088p00146000 [Amborella trichopoda]
          Length = 305

 Score =  231 bits (589), Expect = 2e-58
 Identities = 122/194 (62%), Positives = 147/194 (75%), Gaps = 2/194 (1%)
 Frame = -1

Query: 589 RPTSQECQSVRDSLLTLHGFPQEFAKYRRTDYLNPDYSSNGSEPSQLVKSEPSSTSQEP- 413
           RPT QEC  VRD+L++LHGFP+EFA++RR + +  D      E  Q    +       P 
Sbjct: 53  RPTPQECLIVRDALISLHGFPEEFAEFRRKEAVVND----SFEEKQQKLDDEGEVRIAPL 108

Query: 412 -QKESVLDGLISILLSQNTTDANSTRAFASLKSNFPTWEDVLAAELKFVENSIKCGGLAV 236
            Q  SVLDGL+S++LSQNTTD NS RAF SLK  FPTWEDV AAE K V N+IKCGGLA 
Sbjct: 109 IQGGSVLDGLVSVILSQNTTDVNSRRAFESLKLAFPTWEDVHAAESKSVVNTIKCGGLAE 168

Query: 235 TKASCIKNLLTRLLEKKGKLCLEYLRGMSTDEIKAELLGFKGIGPKTVACVLMFHLQQDD 56
           TKASCIKN+L+ LLE+KGK+CL+YLR M  D+IKAEL  FKG+GPKTVACVLMF+LQ+DD
Sbjct: 169 TKASCIKNILSALLEQKGKICLDYLREMPIDKIKAELRHFKGVGPKTVACVLMFYLQKDD 228

Query: 55  FPVDTHVSFQVVRA 14
           FPVDTHV F++V+A
Sbjct: 229 FPVDTHV-FRIVKA 241


>ref|XP_004236146.1| PREDICTED: endonuclease III-like [Solanum lycopersicum]
          Length = 301

 Score =  228 bits (580), Expect = 3e-57
 Identities = 116/191 (60%), Positives = 144/191 (75%)
 Frame = -1

Query: 589 RPTSQECQSVRDSLLTLHGFPQEFAKYRRTDYLNPDYSSNGSEPSQLVKSEPSSTSQEPQ 410
           +PT +EC++VRD LL LHGFP+EF KYR+   L+            +   E   +  EP 
Sbjct: 52  QPTPEECRAVRDDLLALHGFPKEFIKYRKQRSLD-----------HIKYEEDDISGAEPC 100

Query: 409 KESVLDGLISILLSQNTTDANSTRAFASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTK 230
            ESVLDGLI+ +LSQNTT+ANS +AFASLKS+FPTWE VLAA+ K VE++I+CGGLA TK
Sbjct: 101 TESVLDGLINTILSQNTTEANSQKAFASLKSSFPTWECVLAADAKLVEDTIRCGGLAPTK 160

Query: 229 ASCIKNLLTRLLEKKGKLCLEYLRGMSTDEIKAELLGFKGIGPKTVACVLMFHLQQDDFP 50
            SCIK +L+ LL+KKG LCLEYLR +S +EIK EL  F+GIGPKTVACVLMF LQ+DDFP
Sbjct: 161 TSCIKGILSSLLQKKGNLCLEYLRELSIEEIKRELSCFRGIGPKTVACVLMFQLQRDDFP 220

Query: 49  VDTHVSFQVVR 17
           VDTH+ FQ+ +
Sbjct: 221 VDTHI-FQIAK 230


>ref|XP_006345014.1| PREDICTED: protein ROS1-like [Solanum tuberosum]
          Length = 301

 Score =  226 bits (575), Expect = 1e-56
 Identities = 114/191 (59%), Positives = 144/191 (75%)
 Frame = -1

Query: 589 RPTSQECQSVRDSLLTLHGFPQEFAKYRRTDYLNPDYSSNGSEPSQLVKSEPSSTSQEPQ 410
           +PT +EC++VRD LL LHGFP+EF KYR+   L+            +   E  ++  +  
Sbjct: 52  QPTPEECRAVRDDLLALHGFPKEFIKYRKQRSLD-----------HIEYEEDDTSGADSS 100

Query: 409 KESVLDGLISILLSQNTTDANSTRAFASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTK 230
            ESVLDGLI+ +LSQNTT+ANS +AFASLKS+FPTWE VLAA+ K VE++I+CGGLA TK
Sbjct: 101 TESVLDGLINTILSQNTTEANSQKAFASLKSSFPTWECVLAADAKLVEDTIRCGGLAPTK 160

Query: 229 ASCIKNLLTRLLEKKGKLCLEYLRGMSTDEIKAELLGFKGIGPKTVACVLMFHLQQDDFP 50
            SCIK +L+ LL+KKG LCLEYLR +S +EIK EL  F+GIGPKTVACVLMF LQ+DDFP
Sbjct: 161 TSCIKGILSSLLQKKGNLCLEYLRELSIEEIKRELSCFRGIGPKTVACVLMFQLQRDDFP 220

Query: 49  VDTHVSFQVVR 17
           VDTH+ FQ+ +
Sbjct: 221 VDTHI-FQIAK 230

