BLASTX nr result

ID: Rauwolfia21_contig00015650 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00015650
         (1035 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002321564.2| hypothetical protein POPTR_0015s08260g [Popu...   370   e-100
ref|XP_004236146.1| PREDICTED: endonuclease III-like [Solanum ly...   369   1e-99
ref|XP_002283633.1| PREDICTED: endonuclease III-like [Vitis vini...   368   2e-99
ref|XP_006345014.1| PREDICTED: protein ROS1-like [Solanum tubero...   367   4e-99
emb|CBI15085.3| unnamed protein product [Vitis vinifera]              367   5e-99
ref|XP_002511456.1| Endonuclease III, putative [Ricinus communis...   364   3e-98
gb|EOY20610.1| DNA glycosylase superfamily protein isoform 2 [Th...   360   5e-97
gb|EXB42063.1| Protein ROS1 [Morus notabilis]                         359   9e-97
ref|XP_006476718.1| PREDICTED: protein ROS1-like isoform X1 [Cit...   353   5e-95
ref|XP_006439743.1| hypothetical protein CICLE_v10021561mg [Citr...   353   6e-95
ref|XP_006476719.1| PREDICTED: protein ROS1-like isoform X2 [Cit...   344   3e-92
ref|XP_003525486.1| PREDICTED: uncharacterized protein LOC100802...   343   6e-92
ref|XP_004299588.1| PREDICTED: DEMETER-like protein 3-like [Frag...   342   2e-91
ref|XP_004508835.1| PREDICTED: protein ROS1-like [Cicer arietinu...   341   3e-91
ref|XP_006404333.1| hypothetical protein EUTSA_v10010580mg [Eutr...   337   5e-90
gb|ESW27384.1| hypothetical protein PHAVU_003G197200g [Phaseolus...   330   4e-88
gb|EOY20611.1| DNA glycosylase superfamily protein isoform 3 [Th...   328   2e-87
gb|EOY20609.1| DNA glycosylase superfamily protein isoform 1 [Th...   328   2e-87
ref|XP_002875868.1| predicted protein [Arabidopsis lyrata subsp....   326   1e-86
ref|XP_003608916.1| Ultraviolet N-glycosylase/AP lyase [Medicago...   323   9e-86

>ref|XP_002321564.2| hypothetical protein POPTR_0015s08260g [Populus trichocarpa]
           gi|550322300|gb|EEF05691.2| hypothetical protein
           POPTR_0015s08260g [Populus trichocarpa]
          Length = 306

 Score =  370 bits (949), Expect = e-100
 Identities = 181/293 (61%), Positives = 220/293 (75%), Gaps = 4/293 (1%)
 Frame = -3

Query: 943 QSPSIKSAKRAKRPKTIVVKGKGEPFPDHLRPTPEECRAVRDTLLANHGFPEEFIKYRRQ 764
           Q   +K     K  +TI    + EPFP H RPTPEECRA+RD+LLA HGFP+EF KYR+Q
Sbjct: 9   QQHELKPRTNKKSAETISNIKEEEPFPTHARPTPEECRAIRDSLLAFHGFPQEFAKYRKQ 68

Query: 763 RMVVKDANNGDKNXXXXXXXXXXXXXXEG----DGEPKESVLDGLVSTILSQNTTDVNSQ 596
           R  +    + +++                    + E +ESVLDGLV T+LSQNTT+VNSQ
Sbjct: 69  RPYLITLQDKEESPHLINNCDGKNDNVVKVEEEEEEEEESVLDGLVKTVLSQNTTEVNSQ 128

Query: 595 RAFASLKSAFPTWEDVLAAERKHIEDTIRCGGLAPSKASCIKGILESLFAKRGKLCLEYV 416
           RAF +LKSAFPTWE+VLAAE K IED IRCGGLAP+KA+CI+ IL SL  K G+LCLEY+
Sbjct: 129 RAFLNLKSAFPTWENVLAAESKFIEDAIRCGGLAPTKAACIRNILSSLMEKNGRLCLEYL 188

Query: 415 RDLSVEEIKAELSHFKGIGPKTVACVLMFQLQQDDFPVDTHIFQIAKSIGWLPAMADLKK 236
           RDL V EIKAELSHFKGIGPKTVACVLMF LQ+DDFPVDTH+F+IAK+IGW+P +AD  K
Sbjct: 189 RDLPVAEIKAELSHFKGIGPKTVACVLMFNLQKDDFPVDTHVFEIAKAIGWVPPVADRNK 248

Query: 235 AYVHLNFRIPDDLKFDLNCLLFTHGKICRQCSKKGDSKSKKETYSNPCPLLAY 77
            Y+HLN RIP +LKFDLNCLL+THGK+CR+C+KK  S+ +KET+ + CPLL Y
Sbjct: 249 TYLHLNHRIPKELKFDLNCLLYTHGKLCRKCTKKSGSQQRKETHDDSCPLLNY 301


>ref|XP_004236146.1| PREDICTED: endonuclease III-like [Solanum lycopersicum]
          Length = 301

 Score =  369 bits (946), Expect = 1e-99
 Identities = 185/293 (63%), Positives = 222/293 (75%), Gaps = 1/293 (0%)
 Frame = -3

Query: 937 PSIKSAKRAKRPKTIVVKGKGEPFPDHLRPTPEECRAVRDTLLANHGFPEEFIKYRRQRM 758
           P  KS+++A    T       EPFPD+ +PTPEECRAVRD LLA HGFP+EFIKYR+QR 
Sbjct: 26  PPSKSSRKAN--VTAGSSNDSEPFPDYSQPTPEECRAVRDDLLALHGFPKEFIKYRKQRS 83

Query: 757 VVKDANNGDKNXXXXXXXXXXXXXXEGDGEP-KESVLDGLVSTILSQNTTDVNSQRAFAS 581
           +       D                    EP  ESVLDGL++TILSQNTT+ NSQ+AFAS
Sbjct: 84  LDHIKYEEDD---------------ISGAEPCTESVLDGLINTILSQNTTEANSQKAFAS 128

Query: 580 LKSAFPTWEDVLAAERKHIEDTIRCGGLAPSKASCIKGILESLFAKRGKLCLEYVRDLSV 401
           LKS+FPTWE VLAA+ K +EDTIRCGGLAP+K SCIKGIL SL  K+G LCLEY+R+LS+
Sbjct: 129 LKSSFPTWECVLAADAKLVEDTIRCGGLAPTKTSCIKGILSSLLQKKGNLCLEYLRELSI 188

Query: 400 EEIKAELSHFKGIGPKTVACVLMFQLQQDDFPVDTHIFQIAKSIGWLPAMADLKKAYVHL 221
           EEIK ELS F+GIGPKTVACVLMFQLQ+DDFPVDTHIFQIAK++ W+PA AD+KK Y+HL
Sbjct: 189 EEIKRELSCFRGIGPKTVACVLMFQLQRDDFPVDTHIFQIAKTLHWVPAAADVKKTYIHL 248

Query: 220 NFRIPDDLKFDLNCLLFTHGKICRQCSKKGDSKSKKETYSNPCPLLAYFNDSL 62
           N RIPD+LKFDLNCL++THGK+CR+CS KG +K KKE +   CPLL   +D++
Sbjct: 249 NRRIPDELKFDLNCLIYTHGKVCRECSGKGSNKPKKEQFDKLCPLLGQSSDAI 301


>ref|XP_002283633.1| PREDICTED: endonuclease III-like [Vitis vinifera]
          Length = 310

 Score =  368 bits (944), Expect = 2e-99
 Identities = 185/311 (59%), Positives = 227/311 (72%), Gaps = 3/311 (0%)
 Frame = -3

Query: 976 KKTHKIKVQSSQSPSIKSAKRAKRPKTIVVKGKGEPFPDHLRPTPEECRAVRDTLLANHG 797
           +++ K K + S S S +SA ++ R   +V     +P+P H RPTP ECRAVRD LLA HG
Sbjct: 2   QRSRKRKQEESSSCSKESATKSARNDVVV-----DPYPSHPRPTPVECRAVRDDLLALHG 56

Query: 796 FPEEFIKYRRQRMVVKDANNG---DKNXXXXXXXXXXXXXXEGDGEPKESVLDGLVSTIL 626
           FP+ F KYR+ R+      +    D                      KESVLDGLVS IL
Sbjct: 57  FPQRFEKYRKLRLPPLPHTSSPGLDGGGGTPVKLDPSDGDDVNGSSQKESVLDGLVSIIL 116

Query: 625 SQNTTDVNSQRAFASLKSAFPTWEDVLAAERKHIEDTIRCGGLAPSKASCIKGILESLFA 446
           SQNTTDVNSQRAFASLKSAFPTW+DVLAA+ K IE+ IRCGGLA +KASCIK +L  L  
Sbjct: 117 SQNTTDVNSQRAFASLKSAFPTWQDVLAADSKSIENAIRCGGLAVTKASCIKKMLSCLLE 176

Query: 445 KRGKLCLEYVRDLSVEEIKAELSHFKGIGPKTVACVLMFQLQQDDFPVDTHIFQIAKSIG 266
           ++GKLCLEY+RDL+V+EIK ELSHFKGIGPKTVACVLMF LQ+DDFPVDTH+ QI K+IG
Sbjct: 177 RKGKLCLEYLRDLTVDEIKTELSHFKGIGPKTVACVLMFHLQRDDFPVDTHVIQIGKAIG 236

Query: 265 WLPAMADLKKAYVHLNFRIPDDLKFDLNCLLFTHGKICRQCSKKGDSKSKKETYSNPCPL 86
           W+PA+AD KKAY+HLN RIPD+LKFDLNCLLFTHGK+C +C++KG ++ +KE++ + CPL
Sbjct: 237 WVPAVADRKKAYLHLNRRIPDELKFDLNCLLFTHGKLCHECTQKGANQKRKESHESSCPL 296

Query: 85  LAYFNDSLEPS 53
           L Y  D  + S
Sbjct: 297 LTYCGDMFKSS 307


>ref|XP_006345014.1| PREDICTED: protein ROS1-like [Solanum tuberosum]
          Length = 301

 Score =  367 bits (942), Expect = 4e-99
 Identities = 185/295 (62%), Positives = 221/295 (74%)
 Frame = -3

Query: 946 SQSPSIKSAKRAKRPKTIVVKGKGEPFPDHLRPTPEECRAVRDTLLANHGFPEEFIKYRR 767
           S  P  KS+K+A    T       EPFPD+ +PTPEECRAVRD LLA HGFP+EFIKYR+
Sbjct: 23  SPCPPSKSSKKAN--VTAGPFNDSEPFPDYSQPTPEECRAVRDDLLALHGFPKEFIKYRK 80

Query: 766 QRMVVKDANNGDKNXXXXXXXXXXXXXXEGDGEPKESVLDGLVSTILSQNTTDVNSQRAF 587
           QR +       D                 G     ESVLDGL++TILSQNTT+ NSQ+AF
Sbjct: 81  QRSLDHIEYEEDDTS--------------GADSSTESVLDGLINTILSQNTTEANSQKAF 126

Query: 586 ASLKSAFPTWEDVLAAERKHIEDTIRCGGLAPSKASCIKGILESLFAKRGKLCLEYVRDL 407
           ASLKS+FPTWE VLAA+ K +EDTIRCGGLAP+K SCIKGIL SL  K+G LCLEY+R+L
Sbjct: 127 ASLKSSFPTWECVLAADAKLVEDTIRCGGLAPTKTSCIKGILSSLLQKKGNLCLEYLREL 186

Query: 406 SVEEIKAELSHFKGIGPKTVACVLMFQLQQDDFPVDTHIFQIAKSIGWLPAMADLKKAYV 227
           S+EEIK ELS F+GIGPKTVACVLMFQLQ+DDFPVDTHIFQIAK++ W+PA AD+KK Y+
Sbjct: 187 SIEEIKRELSCFRGIGPKTVACVLMFQLQRDDFPVDTHIFQIAKTLHWVPAAADVKKTYI 246

Query: 226 HLNFRIPDDLKFDLNCLLFTHGKICRQCSKKGDSKSKKETYSNPCPLLAYFNDSL 62
           HLN RIPD+LKFDLNCL++THGK+CR+CS KG +K KKE     CPLL   ++++
Sbjct: 247 HLNQRIPDELKFDLNCLIYTHGKVCRECSGKGSNKPKKEQCDKLCPLLGQSSNAI 301


>emb|CBI15085.3| unnamed protein product [Vitis vinifera]
          Length = 310

 Score =  367 bits (941), Expect = 5e-99
 Identities = 184/306 (60%), Positives = 225/306 (73%), Gaps = 3/306 (0%)
 Frame = -3

Query: 976 KKTHKIKVQSSQSPSIKSAKRAKRPKTIVVKGKGEPFPDHLRPTPEECRAVRDTLLANHG 797
           +++ K K + S S S +SA ++ R   +V     +P+P H RPTP ECRAVRD LLA HG
Sbjct: 2   QRSRKRKQEESSSCSKESATKSARNDVVV-----DPYPSHPRPTPVECRAVRDDLLALHG 56

Query: 796 FPEEFIKYRRQRMVVKDANNG---DKNXXXXXXXXXXXXXXEGDGEPKESVLDGLVSTIL 626
           FP+ F KYR+ R+      +    D                      KESVLDGLVS IL
Sbjct: 57  FPQRFEKYRKLRLPPLPHTSSPGLDGGGGTPVKLDPSDGDDVNGSSQKESVLDGLVSIIL 116

Query: 625 SQNTTDVNSQRAFASLKSAFPTWEDVLAAERKHIEDTIRCGGLAPSKASCIKGILESLFA 446
           SQNTTDVNSQRAFASLKSAFPTW+DVLAA+ K IE+ IRCGGLA +KASCIK +L  L  
Sbjct: 117 SQNTTDVNSQRAFASLKSAFPTWQDVLAADSKSIENAIRCGGLAVTKASCIKKMLSCLLE 176

Query: 445 KRGKLCLEYVRDLSVEEIKAELSHFKGIGPKTVACVLMFQLQQDDFPVDTHIFQIAKSIG 266
           ++GKLCLEY+RDL+V+EIK ELSHFKGIGPKTVACVLMF LQ+DDFPVDTH+ QI K+IG
Sbjct: 177 RKGKLCLEYLRDLTVDEIKTELSHFKGIGPKTVACVLMFHLQRDDFPVDTHVIQIGKAIG 236

Query: 265 WLPAMADLKKAYVHLNFRIPDDLKFDLNCLLFTHGKICRQCSKKGDSKSKKETYSNPCPL 86
           W+PA+AD KKAY+HLN RIPD+LKFDLNCLLFTHGK+C +C++KG ++ +KE++ + CPL
Sbjct: 237 WVPAVADRKKAYLHLNRRIPDELKFDLNCLLFTHGKLCHECTQKGANQKRKESHESSCPL 296

Query: 85  LAYFND 68
           L Y  D
Sbjct: 297 LTYCGD 302


>ref|XP_002511456.1| Endonuclease III, putative [Ricinus communis]
           gi|223550571|gb|EEF52058.1| Endonuclease III, putative
           [Ricinus communis]
          Length = 291

 Score =  364 bits (935), Expect = 3e-98
 Identities = 176/297 (59%), Positives = 222/297 (74%)
 Frame = -3

Query: 955 VQSSQSPSIKSAKRAKRPKTIVVKGKGEPFPDHLRPTPEECRAVRDTLLANHGFPEEFIK 776
           +Q ++   +KSA+   +   I    K EP+P H RPTPEEC  +RD+LLA HGFP+EF K
Sbjct: 1   MQKNRKRKLKSAETETKSAKINNGNKEEPYPTHPRPTPEECLCIRDSLLAFHGFPQEFAK 60

Query: 775 YRRQRMVVKDANNGDKNXXXXXXXXXXXXXXEGDGEPKESVLDGLVSTILSQNTTDVNSQ 596
           YR+QR+   D N                         +E+VLDGLV T+LSQNTT+VNSQ
Sbjct: 61  YRKQRLGGDDDNKSSD---------------VNSDTAEETVLDGLVKTVLSQNTTEVNSQ 105

Query: 595 RAFASLKSAFPTWEDVLAAERKHIEDTIRCGGLAPSKASCIKGILESLFAKRGKLCLEYV 416
           RAF +LKS FPTW+DVLAAE K IE+ IRCGGLAP+KASCIK IL  L  K+GK+CLEY+
Sbjct: 106 RAFDNLKSDFPTWQDVLAAEPKWIENAIRCGGLAPAKASCIKNILNCLLEKKGKICLEYL 165

Query: 415 RDLSVEEIKAELSHFKGIGPKTVACVLMFQLQQDDFPVDTHIFQIAKSIGWLPAMADLKK 236
           RD+SV+EIKAELS FKG+GPKTVACVLMF LQQ+DFPVDTH+F+IAK++GW+P +AD  K
Sbjct: 166 RDMSVDEIKAELSQFKGVGPKTVACVLMFHLQQEDFPVDTHVFEIAKALGWVPEVADRNK 225

Query: 235 AYVHLNFRIPDDLKFDLNCLLFTHGKICRQCSKKGDSKSKKETYSNPCPLLAYFNDS 65
            Y+HLN RIP++LKFDLNCLL+THGK+CR+C KK  ++S+KE++ + CPLL+Y N S
Sbjct: 226 TYLHLNQRIPNELKFDLNCLLYTHGKLCRKCIKKRGNQSRKESHDDSCPLLSYCNSS 282


>gb|EOY20610.1| DNA glycosylase superfamily protein isoform 2 [Theobroma cacao]
          Length = 292

 Score =  360 bits (924), Expect = 5e-97
 Identities = 173/270 (64%), Positives = 210/270 (77%)
 Frame = -3

Query: 874 EPFPDHLRPTPEECRAVRDTLLANHGFPEEFIKYRRQRMVVKDANNGDKNXXXXXXXXXX 695
           EP+P H RPTP+ECR+VRD LLA HGFP EF+KYR QR++  +     K+          
Sbjct: 27  EPYPSHHRPTPDECRSVRDELLALHGFPAEFLKYRHQRLIKTEPTIDAKSEPLNNNYD-- 84

Query: 694 XXXXEGDGEPKESVLDGLVSTILSQNTTDVNSQRAFASLKSAFPTWEDVLAAERKHIEDT 515
                 DGE  ESVLDGLV T+LSQNTT++NSQ+AFASLKSAFPTWEDVLAAE K++E+ 
Sbjct: 85  ------DGE--ESVLDGLVKTVLSQNTTELNSQKAFASLKSAFPTWEDVLAAESKNLENA 136

Query: 514 IRCGGLAPSKASCIKGILESLFAKRGKLCLEYVRDLSVEEIKAELSHFKGIGPKTVACVL 335
           IRCGGLAP KASCIK +L  L  ++GKLC EY+RDLS++EIKAELS+FKG+GPKTVACVL
Sbjct: 137 IRCGGLAPRKASCIKNVLRCLHERKGKLCFEYLRDLSIDEIKAELSNFKGVGPKTVACVL 196

Query: 334 MFQLQQDDFPVDTHIFQIAKSIGWLPAMADLKKAYVHLNFRIPDDLKFDLNCLLFTHGKI 155
           MF LQQDDFPVDTH+F+IA++IGW+PA AD KK Y+HLN RIP+ LKFDLNCLL+THGK+
Sbjct: 197 MFNLQQDDFPVDTHVFEIARAIGWVPATADRKKTYLHLNRRIPNKLKFDLNCLLYTHGKL 256

Query: 154 CRQCSKKGDSKSKKETYSNPCPLLAYFNDS 65
           CR+C+ KG S+ K     + CPL  Y  +S
Sbjct: 257 CRKCTMKGSSQQKSARNDDSCPLCTYCKNS 286


>gb|EXB42063.1| Protein ROS1 [Morus notabilis]
          Length = 308

 Score =  359 bits (922), Expect = 9e-97
 Identities = 186/310 (60%), Positives = 223/310 (71%), Gaps = 6/310 (1%)
 Frame = -3

Query: 988 KKMQK--KTHKIKVQSSQSPSIKSAKRAKRPKTIVVKGKGE----PFPDHLRPTPEECRA 827
           KKMQK  K  + ++Q        S K++   +   + G  E    P+P H  PTP++CRA
Sbjct: 16  KKMQKLRKRKQSELQPHNQAHFLSNKKSSAKRAPPISGLSEVAKDPYPTHQWPTPDQCRA 75

Query: 826 VRDTLLANHGFPEEFIKYRRQRMVVKDANNGDKNXXXXXXXXXXXXXXEGDGEPKESVLD 647
           VRD LLA HGFP+EF KYRRQ+      +NG+++                  E KESVLD
Sbjct: 76  VRDDLLALHGFPQEFAKYRRQKPTT---DNGEES------------------ESKESVLD 114

Query: 646 GLVSTILSQNTTDVNSQRAFASLKSAFPTWEDVLAAERKHIEDTIRCGGLAPSKASCIKG 467
           GLV T+LSQNTT+ NSQRAFASLKSAFPTWE VL A+ K IED IRCGGLAP KASCIK 
Sbjct: 115 GLVMTVLSQNTTEANSQRAFASLKSAFPTWEQVLNADSKCIEDAIRCGGLAPKKASCIKN 174

Query: 466 ILESLFAKRGKLCLEYVRDLSVEEIKAELSHFKGIGPKTVACVLMFQLQQDDFPVDTHIF 287
            L SL  ++GKLCLEY+ D SV+E+KAELS FKGIGPKTVACVLMF LQQDDFPVDTH+F
Sbjct: 175 TLRSLLERKGKLCLEYLLDFSVDEVKAELSCFKGIGPKTVACVLMFHLQQDDFPVDTHVF 234

Query: 286 QIAKSIGWLPAMADLKKAYVHLNFRIPDDLKFDLNCLLFTHGKICRQCSKKGDSKSKKET 107
           +IAK++GWLPA AD  KAY+HLN RIP++LKFDLNCLL+THGK+CR+C KKG S+ KK +
Sbjct: 235 EIAKALGWLPAGADRNKAYLHLNQRIPNELKFDLNCLLYTHGKMCRKCIKKGGSQIKKGS 294

Query: 106 YSNPCPLLAY 77
             + CPLL Y
Sbjct: 295 SDDSCPLLHY 304


>ref|XP_006476718.1| PREDICTED: protein ROS1-like isoform X1 [Citrus sinensis]
          Length = 281

 Score =  353 bits (907), Expect = 5e-95
 Identities = 176/285 (61%), Positives = 212/285 (74%)
 Frame = -3

Query: 919 KRAKRPKTIVVKGKGEPFPDHLRPTPEECRAVRDTLLANHGFPEEFIKYRRQRMVVKDAN 740
           K  KR +  V + + +P+P H RPT EECR +RD LLA HGFP EF+KYR QR+  K   
Sbjct: 3   KSRKRKQVEVTETRQDPYPTHSRPTAEECRGIRDELLALHGFPPEFVKYRNQRL--KHNM 60

Query: 739 NGDKNXXXXXXXXXXXXXXEGDGEPKESVLDGLVSTILSQNTTDVNSQRAFASLKSAFPT 560
             DKN                +GE +ESVLDGLV T+LSQNTT+ NS +AFASLKS FPT
Sbjct: 61  TRDKNSVPLDMNEYD------EGE-EESVLDGLVKTVLSQNTTEANSLKAFASLKSTFPT 113

Query: 559 WEDVLAAERKHIEDTIRCGGLAPSKASCIKGILESLFAKRGKLCLEYVRDLSVEEIKAEL 380
           WE VLAAE+K IE+ IRCGGLAP+KA+CIK IL+ L   +GKLCLEY+R LS++EIKAEL
Sbjct: 114 WEHVLAAEQKCIENAIRCGGLAPTKAACIKNILKCLLESKGKLCLEYLRGLSIDEIKAEL 173

Query: 379 SHFKGIGPKTVACVLMFQLQQDDFPVDTHIFQIAKSIGWLPAMADLKKAYVHLNFRIPDD 200
           S F+GIGPKTVACVLMF LQQDDFPVDTH+F+I+K+IGW+P  AD  K Y+HLN RIP +
Sbjct: 174 SRFRGIGPKTVACVLMFHLQQDDFPVDTHVFEISKAIGWVPTAADRNKTYLHLNQRIPKE 233

Query: 199 LKFDLNCLLFTHGKICRQCSKKGDSKSKKETYSNPCPLLAYFNDS 65
           LKFDLNCLL+THGK+CR C KKG ++ +KE+  N CPLL Y   S
Sbjct: 234 LKFDLNCLLYTHGKLCRNCIKKGGNRQRKESAGNLCPLLNYCEKS 278


>ref|XP_006439743.1| hypothetical protein CICLE_v10021561mg [Citrus clementina]
           gi|557542005|gb|ESR52983.1| hypothetical protein
           CICLE_v10021561mg [Citrus clementina]
          Length = 281

 Score =  353 bits (906), Expect = 6e-95
 Identities = 176/285 (61%), Positives = 212/285 (74%)
 Frame = -3

Query: 919 KRAKRPKTIVVKGKGEPFPDHLRPTPEECRAVRDTLLANHGFPEEFIKYRRQRMVVKDAN 740
           K  KR +  V + + +P+P H RPT EECR +RD LLA HGFP EF+KYR QR+  K   
Sbjct: 3   KSRKRKQVEVTETRQDPYPTHSRPTAEECRGIRDELLALHGFPPEFVKYRNQRL--KHNM 60

Query: 739 NGDKNXXXXXXXXXXXXXXEGDGEPKESVLDGLVSTILSQNTTDVNSQRAFASLKSAFPT 560
             DKN                +GE +ESVLDGLV T+LSQNTT+ NS +AFASLKS FPT
Sbjct: 61  TRDKNSVPLDMSEYD------EGE-EESVLDGLVKTLLSQNTTEANSLKAFASLKSTFPT 113

Query: 559 WEDVLAAERKHIEDTIRCGGLAPSKASCIKGILESLFAKRGKLCLEYVRDLSVEEIKAEL 380
           WE VLAAE+K IE+ IRCGGLAP+KA+CIK IL+ L   +GKLCLEY+R LS++EIKAEL
Sbjct: 114 WEHVLAAEQKCIENAIRCGGLAPTKAACIKNILKCLLESKGKLCLEYLRGLSIDEIKAEL 173

Query: 379 SHFKGIGPKTVACVLMFQLQQDDFPVDTHIFQIAKSIGWLPAMADLKKAYVHLNFRIPDD 200
           S F+GIGPKTVACVLMF LQQDDFPVDTH+F+I+K+IGW+P  AD  K Y+HLN RIP +
Sbjct: 174 SRFRGIGPKTVACVLMFHLQQDDFPVDTHVFEISKAIGWVPTAADRNKTYLHLNQRIPKE 233

Query: 199 LKFDLNCLLFTHGKICRQCSKKGDSKSKKETYSNPCPLLAYFNDS 65
           LKFDLNCLL+THGK+CR C KKG ++ +KE+  N CPLL Y   S
Sbjct: 234 LKFDLNCLLYTHGKLCRNCIKKGGNRQRKESAGNLCPLLNYCEKS 278


>ref|XP_006476719.1| PREDICTED: protein ROS1-like isoform X2 [Citrus sinensis]
          Length = 278

 Score =  344 bits (883), Expect = 3e-92
 Identities = 171/277 (61%), Positives = 207/277 (74%)
 Frame = -3

Query: 919 KRAKRPKTIVVKGKGEPFPDHLRPTPEECRAVRDTLLANHGFPEEFIKYRRQRMVVKDAN 740
           K  KR +  V + + +P+P H RPT EECR +RD LLA HGFP EF+KYR QR+  K   
Sbjct: 3   KSRKRKQVEVTETRQDPYPTHSRPTAEECRGIRDELLALHGFPPEFVKYRNQRL--KHNM 60

Query: 739 NGDKNXXXXXXXXXXXXXXEGDGEPKESVLDGLVSTILSQNTTDVNSQRAFASLKSAFPT 560
             DKN                +GE +ESVLDGLV T+LSQNTT+ NS +AFASLKS FPT
Sbjct: 61  TRDKNSVPLDMNEYD------EGE-EESVLDGLVKTVLSQNTTEANSLKAFASLKSTFPT 113

Query: 559 WEDVLAAERKHIEDTIRCGGLAPSKASCIKGILESLFAKRGKLCLEYVRDLSVEEIKAEL 380
           WE VLAAE+K IE+ IRCGGLAP+KA+CIK IL+ L   +GKLCLEY+R LS++EIKAEL
Sbjct: 114 WEHVLAAEQKCIENAIRCGGLAPTKAACIKNILKCLLESKGKLCLEYLRGLSIDEIKAEL 173

Query: 379 SHFKGIGPKTVACVLMFQLQQDDFPVDTHIFQIAKSIGWLPAMADLKKAYVHLNFRIPDD 200
           S F+GIGPKTVACVLMF LQQDDFPVDTH+F+I+K+IGW+P  AD  K Y+HLN RIP +
Sbjct: 174 SRFRGIGPKTVACVLMFHLQQDDFPVDTHVFEISKAIGWVPTAADRNKTYLHLNQRIPKE 233

Query: 199 LKFDLNCLLFTHGKICRQCSKKGDSKSKKETYSNPCP 89
           LKFDLNCLL+THGK+CR C KKG ++ +KE+  N  P
Sbjct: 234 LKFDLNCLLYTHGKLCRNCIKKGGNRQRKESAGNILP 270


>ref|XP_003525486.1| PREDICTED: uncharacterized protein LOC100802952 [Glycine max]
          Length = 284

 Score =  343 bits (880), Expect = 6e-92
 Identities = 177/307 (57%), Positives = 217/307 (70%)
 Frame = -3

Query: 979 QKKTHKIKVQSSQSPSIKSAKRAKRPKTIVVKGKGEPFPDHLRPTPEECRAVRDTLLANH 800
           +K+  K +V+    P  KS  RA   +T  VK   +PFP H RPTP+EC AVRDTLLA H
Sbjct: 3   KKRKRKQQVKRDGEPKPKSV-RAGSTRTDNVK---DPFPSHARPTPQECEAVRDTLLALH 58

Query: 799 GFPEEFIKYRRQRMVVKDANNGDKNXXXXXXXXXXXXXXEGDGEPKESVLDGLVSTILSQ 620
           G P E  KYR+     +                          +P E VLDGLV T+LSQ
Sbjct: 59  GIPPELAKYRKLPPSDEPVQL----------------------QPPEPVLDGLVRTVLSQ 96

Query: 619 NTTDVNSQRAFASLKSAFPTWEDVLAAERKHIEDTIRCGGLAPSKASCIKGILESLFAKR 440
           NTT+ NSQ+AFASLKS+FP+WE VL AE K +E+ IRCGGLAP+KASCIK +L  L  +R
Sbjct: 97  NTTEANSQKAFASLKSSFPSWEQVLWAESKDVENAIRCGGLAPTKASCIKNVLRCLRERR 156

Query: 439 GKLCLEYVRDLSVEEIKAELSHFKGIGPKTVACVLMFQLQQDDFPVDTHIFQIAKSIGWL 260
           G+LCLEY+RDLSV+E+KAELS FKGIGPKTVACVLMF LQQDDFPVDTHIF+IAK++GW+
Sbjct: 157 GELCLEYLRDLSVDEVKAELSLFKGIGPKTVACVLMFNLQQDDFPVDTHIFEIAKTMGWV 216

Query: 259 PAMADLKKAYVHLNFRIPDDLKFDLNCLLFTHGKICRQCSKKGDSKSKKETYSNPCPLLA 80
           PA+A+  K+Y+HLN R+P++LKFDLNCLL+THGK+C QCS K  +K  K+   N CPLL 
Sbjct: 217 PAVANRNKSYLHLNQRVPNELKFDLNCLLYTHGKLCHQCSGKKGNKQGKKCDDNSCPLLN 276

Query: 79  YFNDSLE 59
           Y  DS+E
Sbjct: 277 YDKDSVE 283


>ref|XP_004299588.1| PREDICTED: DEMETER-like protein 3-like [Fragaria vesca subsp.
           vesca]
          Length = 286

 Score =  342 bits (876), Expect = 2e-91
 Identities = 172/288 (59%), Positives = 209/288 (72%), Gaps = 4/288 (1%)
 Frame = -3

Query: 928 KSAKRAKRPKTIVVKGKGEPFPDHLRPTPEECRAVRDTLLANHGFPEEFIKYRRQRMVVK 749
           + A+    PK        +P+P+H RPT EEC +VRD LLA HGFP+EF KYR QR+  +
Sbjct: 9   EQAEADHNPKLPTKTTPKDPYPNHARPTREECVSVRDDLLALHGFPKEFAKYREQRLSSQ 68

Query: 748 DANNGDKNXXXXXXXXXXXXXXEGDGEPKESVLDGLVSTILSQNTTDVNSQRAFASLKSA 569
            +N  D +                  + KESVLDGLV T+LSQNTT+ NS +AFASLKSA
Sbjct: 69  ASNGHDNDVSSEPL------------DEKESVLDGLVRTLLSQNTTESNSLKAFASLKSA 116

Query: 568 FPTWEDVLAAERKHIEDTIRCGGLAPSKASCIKGILESLFAKRGKLCLEYVRDLSVEEIK 389
           FPTWE+VLAA+ + +E  IRCGGLA +KASCIK +L  L  K+ KLCLEY+RDLSV+EIK
Sbjct: 117 FPTWEEVLAADSQSLESAIRCGGLAKTKASCIKNMLSCLLEKKEKLCLEYLRDLSVDEIK 176

Query: 388 AELSHFKGIGPKTVACVLMFQLQQDDFPVDTHIFQIAKSIGWLPAMADLKKAYVHLNFRI 209
           AELSHFKGIGPKTVACVLMFQLQQDDFPVDTH+++IAK++ W+P  AD  K Y+HLN  I
Sbjct: 177 AELSHFKGIGPKTVACVLMFQLQQDDFPVDTHVYEIAKAMAWVPVGADRNKTYLHLNQWI 236

Query: 208 PDDLKFDLNCLLFTHGKICRQCSKKGDSKSKKETY----SNPCPLLAY 77
           PD+LKFDLNCLL+THGK+CR+C KKG S  K++      SN CPLL Y
Sbjct: 237 PDELKFDLNCLLYTHGKLCRKCIKKGGSTGKQQEKESEDSNSCPLLRY 284


>ref|XP_004508835.1| PREDICTED: protein ROS1-like [Cicer arietinum]
           gi|502152248|ref|XP_004508836.1| PREDICTED: protein
           ROS1-like [Cicer arietinum]
          Length = 285

 Score =  341 bits (874), Expect = 3e-91
 Identities = 172/307 (56%), Positives = 211/307 (68%)
 Frame = -3

Query: 988 KKMQKKTHKIKVQSSQSPSIKSAKRAKRPKTIVVKGKGEPFPDHLRPTPEECRAVRDTLL 809
           KK ++K    + +   + S+K+++     + +      EPFP H  PTP+EC  +RDTLL
Sbjct: 3   KKRKRKQEAKRNEERNAKSVKASQIQTENENLK-----EPFPSHSGPTPQECLDIRDTLL 57

Query: 808 ANHGFPEEFIKYRRQRMVVKDANNGDKNXXXXXXXXXXXXXXEGDGEPKESVLDGLVSTI 629
           A HG P E  KYR+ +    D  N D                     P E+VLDGLV TI
Sbjct: 58  ALHGLPPELAKYRKSQQQTDDTINPD---------------------PPETVLDGLVRTI 96

Query: 628 LSQNTTDVNSQRAFASLKSAFPTWEDVLAAERKHIEDTIRCGGLAPSKASCIKGILESLF 449
           LSQNTT+ NS +AFASLKS+FPTWE V  AE K +E+ IRCGGLAP+KASCIK +L  L 
Sbjct: 97  LSQNTTESNSNKAFASLKSSFPTWEHVHGAESKELENAIRCGGLAPTKASCIKNLLRCLL 156

Query: 448 AKRGKLCLEYVRDLSVEEIKAELSHFKGIGPKTVACVLMFQLQQDDFPVDTHIFQIAKSI 269
            KRGK CLEY+RDLSV +IKAELS FKGIGPKTVACVLMF LQQDDFPVDTHIF+IAK+I
Sbjct: 157 EKRGKFCLEYLRDLSVAQIKAELSLFKGIGPKTVACVLMFNLQQDDFPVDTHIFEIAKTI 216

Query: 268 GWLPAMADLKKAYVHLNFRIPDDLKFDLNCLLFTHGKICRQCSKKGDSKSKKETYSNPCP 89
           GW+PA+AD  K Y+HLN RIP++LKFDLNCLL+THGK C +CS K  +K +K+   N CP
Sbjct: 217 GWVPAVADRNKTYLHLNQRIPNELKFDLNCLLYTHGKFCSKCSSKRGNKQQKKFNDNSCP 276

Query: 88  LLAYFND 68
           LL Y+ +
Sbjct: 277 LLNYYKE 283


>ref|XP_006404333.1| hypothetical protein EUTSA_v10010580mg [Eutrema salsugineum]
           gi|557105452|gb|ESQ45786.1| hypothetical protein
           EUTSA_v10010580mg [Eutrema salsugineum]
          Length = 302

 Score =  337 bits (864), Expect = 5e-90
 Identities = 173/283 (61%), Positives = 205/283 (72%), Gaps = 4/283 (1%)
 Frame = -3

Query: 913 AKRPKTIVVKGKGEPFPDHLRPTPEECRAVRDTLLANHGFPEEFIKYRRQRMVVKDANNG 734
           +K P T      G+P+P HLRPT +ECR VRD LL+ HGFP EF  YRRQR+    A +G
Sbjct: 17  SKTPATKSTVYGGDPYPSHLRPTSDECRDVRDALLSLHGFPPEFDSYRRQRLRSSSAVDG 76

Query: 733 DKNXXXXXXXXXXXXXXEGDGEPKESVLDGLVSTILSQNTTDVNSQRAFASLKSAFPTWE 554
                            E D E +E+VLDGLV  +LSQNTT++NSQRAFASLK+AFP WE
Sbjct: 77  YHTHCTMKSEPLEAANDEKD-EIEETVLDGLVKILLSQNTTEINSQRAFASLKAAFPKWE 135

Query: 553 DVLAAERKHIEDTIRCGGLAPSKASCIKGILESLFAKRGKLCLEYVRDLSVEEIKAELSH 374
           DVL AE K IE+ IRCGGLAP KA CIK IL  L ++RG+LCLEY+R LSVEE+K ELSH
Sbjct: 136 DVLGAEPKSIENAIRCGGLAPKKAVCIKNILSRLQSERGRLCLEYLRGLSVEEVKTELSH 195

Query: 373 FKGIGPKTVACVLMFQLQQDDFPVDTHIFQIAKSIGWLPAMADLKKAYVHLNFRIPDDLK 194
           FKGIGPKTV+CVLMF LQ +DFPVDTH+F+IAK+IGW+P  AD  K YVHLN RIPD+LK
Sbjct: 196 FKGIGPKTVSCVLMFNLQHNDFPVDTHVFEIAKAIGWVPKTADRNKTYVHLNRRIPDELK 255

Query: 193 FDLNCLLFTHGKICRQCSK---KGDSKSKKETYS-NPCPLLAY 77
           FDLNCLL+THGK+C  C K   K  +KSK +  S + CPLL +
Sbjct: 256 FDLNCLLYTHGKLCSNCKKNVAKPKAKSKAKVSSPDDCPLLGF 298


>gb|ESW27384.1| hypothetical protein PHAVU_003G197200g [Phaseolus vulgaris]
          Length = 282

 Score =  330 bits (847), Expect = 4e-88
 Identities = 164/270 (60%), Positives = 198/270 (73%)
 Frame = -3

Query: 874 EPFPDHLRPTPEECRAVRDTLLANHGFPEEFIKYRRQRMVVKDANNGDKNXXXXXXXXXX 695
           +PFP H RPTPEEC AVRDTLLA HG P E  KYR+ + +  DA                
Sbjct: 34  DPFPSHARPTPEECEAVRDTLLALHGIPPELAKYRKLQPL-NDAVQP------------- 79

Query: 694 XXXXEGDGEPKESVLDGLVSTILSQNTTDVNSQRAFASLKSAFPTWEDVLAAERKHIEDT 515
                   E  E VLDGLV T+LSQNTT+ NSQ+AF SLKS+FPTWE V  AE K +E+ 
Sbjct: 80  --------ESPEPVLDGLVRTVLSQNTTEANSQKAFVSLKSSFPTWEHVFGAESKDVENA 131

Query: 514 IRCGGLAPSKASCIKGILESLFAKRGKLCLEYVRDLSVEEIKAELSHFKGIGPKTVACVL 335
           IRCGGLAP+KASCIK +L  L  +RG+LCLEY+RDLSV+E KAELS FKGIGPKTVACVL
Sbjct: 132 IRCGGLAPTKASCIKNMLRCLRERRGQLCLEYLRDLSVDEAKAELSLFKGIGPKTVACVL 191

Query: 334 MFQLQQDDFPVDTHIFQIAKSIGWLPAMADLKKAYVHLNFRIPDDLKFDLNCLLFTHGKI 155
           MF LQQDDFPVDTHIF+I+K++GW+P++AD  K+Y+HLN RIP++LKFDLNCL+FTHGK+
Sbjct: 192 MFNLQQDDFPVDTHIFEISKTMGWVPSVADRNKSYLHLNQRIPNELKFDLNCLMFTHGKL 251

Query: 154 CRQCSKKGDSKSKKETYSNPCPLLAYFNDS 65
           CR+CS K  ++  K+     CPLL Y  +S
Sbjct: 252 CRKCSSKKGNQQGKKGNDKSCPLLNYCKES 281


>gb|EOY20611.1| DNA glycosylase superfamily protein isoform 3 [Theobroma cacao]
          Length = 264

 Score =  328 bits (841), Expect = 2e-87
 Identities = 159/237 (67%), Positives = 190/237 (80%)
 Frame = -3

Query: 874 EPFPDHLRPTPEECRAVRDTLLANHGFPEEFIKYRRQRMVVKDANNGDKNXXXXXXXXXX 695
           EP+P H RPTP+ECR+VRD LLA HGFP EF+KYR QR++  +     K+          
Sbjct: 27  EPYPSHHRPTPDECRSVRDELLALHGFPAEFLKYRHQRLIKTEPTIDAKSEPLNNNYD-- 84

Query: 694 XXXXEGDGEPKESVLDGLVSTILSQNTTDVNSQRAFASLKSAFPTWEDVLAAERKHIEDT 515
                 DGE  ESVLDGLV T+LSQNTT++NSQ+AFASLKSAFPTWEDVLAAE K++E+ 
Sbjct: 85  ------DGE--ESVLDGLVKTVLSQNTTELNSQKAFASLKSAFPTWEDVLAAESKNLENA 136

Query: 514 IRCGGLAPSKASCIKGILESLFAKRGKLCLEYVRDLSVEEIKAELSHFKGIGPKTVACVL 335
           IRCGGLAP KASCIK +L  L  ++GKLC EY+RDLS++EIKAELS+FKG+GPKTVACVL
Sbjct: 137 IRCGGLAPRKASCIKNVLRCLHERKGKLCFEYLRDLSIDEIKAELSNFKGVGPKTVACVL 196

Query: 334 MFQLQQDDFPVDTHIFQIAKSIGWLPAMADLKKAYVHLNFRIPDDLKFDLNCLLFTH 164
           MF LQQDDFPVDTH+F+IA++IGW+PA AD KK Y+HLN RIP+ LKFDLNCLL+TH
Sbjct: 197 MFNLQQDDFPVDTHVFEIARAIGWVPATADRKKTYLHLNRRIPNKLKFDLNCLLYTH 253


>gb|EOY20609.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao]
          Length = 446

 Score =  328 bits (841), Expect = 2e-87
 Identities = 159/237 (67%), Positives = 190/237 (80%)
 Frame = -3

Query: 874 EPFPDHLRPTPEECRAVRDTLLANHGFPEEFIKYRRQRMVVKDANNGDKNXXXXXXXXXX 695
           EP+P H RPTP+ECR+VRD LLA HGFP EF+KYR QR++  +     K+          
Sbjct: 27  EPYPSHHRPTPDECRSVRDELLALHGFPAEFLKYRHQRLIKTEPTIDAKSEPLNNNYD-- 84

Query: 694 XXXXEGDGEPKESVLDGLVSTILSQNTTDVNSQRAFASLKSAFPTWEDVLAAERKHIEDT 515
                 DGE  ESVLDGLV T+LSQNTT++NSQ+AFASLKSAFPTWEDVLAAE K++E+ 
Sbjct: 85  ------DGE--ESVLDGLVKTVLSQNTTELNSQKAFASLKSAFPTWEDVLAAESKNLENA 136

Query: 514 IRCGGLAPSKASCIKGILESLFAKRGKLCLEYVRDLSVEEIKAELSHFKGIGPKTVACVL 335
           IRCGGLAP KASCIK +L  L  ++GKLC EY+RDLS++EIKAELS+FKG+GPKTVACVL
Sbjct: 137 IRCGGLAPRKASCIKNVLRCLHERKGKLCFEYLRDLSIDEIKAELSNFKGVGPKTVACVL 196

Query: 334 MFQLQQDDFPVDTHIFQIAKSIGWLPAMADLKKAYVHLNFRIPDDLKFDLNCLLFTH 164
           MF LQQDDFPVDTH+F+IA++IGW+PA AD KK Y+HLN RIP+ LKFDLNCLL+TH
Sbjct: 197 MFNLQQDDFPVDTHVFEIARAIGWVPATADRKKTYLHLNRRIPNKLKFDLNCLLYTH 253


>ref|XP_002875868.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
           gi|297321706|gb|EFH52127.1| predicted protein
           [Arabidopsis lyrata subsp. lyrata]
          Length = 294

 Score =  326 bits (835), Expect = 1e-86
 Identities = 171/292 (58%), Positives = 199/292 (68%), Gaps = 2/292 (0%)
 Frame = -3

Query: 946 SQSPSIKSAKRAKRPKTIVVKGKGEPFPDHLRPTPEECRAVRDTLLANHGFPEEFIKYRR 767
           S++P+IKS           V G   P+P  LRPT EECR VRD LL+ HGFP EF  YRR
Sbjct: 17  SKTPAIKST----------VDGSN-PYPTLLRPTAEECREVRDALLSLHGFPPEFANYRR 65

Query: 766 QRMVVKDANNGDKNXXXXXXXXXXXXXXEGDGEPKESVLDGLVSTILSQNTTDVNSQRAF 587
           QR+    A +G                   D   +ESVLDGLV  +LSQNTT+ NSQRAF
Sbjct: 66  QRLRSLSAVDGHDTQCTMKSEPL-------DEAEEESVLDGLVKILLSQNTTESNSQRAF 118

Query: 586 ASLKSAFPTWEDVLAAERKHIEDTIRCGGLAPSKASCIKGILESLFAKRGKLCLEYVRDL 407
           ASLK+AFP WEDVLAAE K IE  IRCGGLAP KA CIK IL  L  +RG LCLEY+R L
Sbjct: 119 ASLKAAFPNWEDVLAAESKSIESAIRCGGLAPKKAVCIKNILNRLQTERGVLCLEYLRGL 178

Query: 406 SVEEIKAELSHFKGIGPKTVACVLMFQLQQDDFPVDTHIFQIAKSIGWLPAMADLKKAYV 227
           SVEE+K ELSHFKGIGPKTV+CVLMF LQ +DFPVDTH+F+IAK++GW+P  AD  K YV
Sbjct: 179 SVEEVKTELSHFKGIGPKTVSCVLMFNLQHNDFPVDTHVFEIAKALGWVPKTADRNKTYV 238

Query: 226 HLNFRIPDDLKFDLNCLLFTHGKICRQCSKKGDSKSKKETYSNP--CPLLAY 77
           HLN RIPD+LKFDLNCLL+THGK+C  C K       K   ++P  CPL+ +
Sbjct: 239 HLNRRIPDELKFDLNCLLYTHGKLCSNCKKTVAKPKAKARVASPDECPLVGF 290


>ref|XP_003608916.1| Ultraviolet N-glycosylase/AP lyase [Medicago truncatula]
           gi|355509971|gb|AES91113.1| Ultraviolet N-glycosylase/AP
           lyase [Medicago truncatula]
          Length = 280

 Score =  323 bits (827), Expect = 9e-86
 Identities = 166/309 (53%), Positives = 203/309 (65%), Gaps = 8/309 (2%)
 Frame = -3

Query: 979 QKKTHKIKVQ--------SSQSPSIKSAKRAKRPKTIVVKGKGEPFPDHLRPTPEECRAV 824
           +K+  K+K +        S Q P IK+    + PK         PFP H  PTP+EC  +
Sbjct: 3   KKRKRKVKTERDGDRNPNSVQVPQIKT----ENPKN--------PFPSHSAPTPQECLEI 50

Query: 823 RDTLLANHGFPEEFIKYRRQRMVVKDANNGDKNXXXXXXXXXXXXXXEGDGEPKESVLDG 644
           RD LL+ HG P E  KYR+ +                              EP E+VLDG
Sbjct: 51  RDNLLSLHGIPPELAKYRKSQQTNDTV------------------------EPPETVLDG 86

Query: 643 LVSTILSQNTTDVNSQRAFASLKSAFPTWEDVLAAERKHIEDTIRCGGLAPSKASCIKGI 464
           LV TILSQNTT+ NS +AFASLKS FPTWE V  AE K +E+ IRCGGLAP+KA CIK +
Sbjct: 87  LVRTILSQNTTEANSNKAFASLKSLFPTWEHVHGAESKELENAIRCGGLAPTKAKCIKNL 146

Query: 463 LESLFAKRGKLCLEYVRDLSVEEIKAELSHFKGIGPKTVACVLMFQLQQDDFPVDTHIFQ 284
           L  L  ++GK+CLEY+RDLSV+E+KAELS FKGIGPKTV+CVLMF LQ DDFPVDTHIF+
Sbjct: 147 LSCLLERKGKMCLEYLRDLSVDEVKAELSLFKGIGPKTVSCVLMFNLQLDDFPVDTHIFE 206

Query: 283 IAKSIGWLPAMADLKKAYVHLNFRIPDDLKFDLNCLLFTHGKICRQCSKKGDSKSKKETY 104
           IAK++GW+PA AD  K Y+HLN RIPD+LKFDLNCLL+THGK+C  CS K  +K +K+  
Sbjct: 207 IAKTMGWVPAAADRNKTYLHLNQRIPDELKFDLNCLLYTHGKLCSNCSSKRGNKQQKKFN 266

Query: 103 SNPCPLLAY 77
            + CPLL Y
Sbjct: 267 DSSCPLLNY 275

