BLASTX nr result

ID: Forsythia23_contig00018725 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia23_contig00018725
         (1240 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002276173.1| PREDICTED: uncharacterized protein LOC100267...   412   e-112
ref|XP_010656606.1| PREDICTED: uncharacterized protein LOC100267...   407   e-111
emb|CDP10232.1| unnamed protein product [Coffea canephora]            405   e-110
ref|XP_007011936.1| DNA glycosylase superfamily protein isoform ...   404   e-110
ref|XP_010103669.1| Putative Glutamine amidotransferase [Morus n...   403   e-109
ref|XP_008242987.1| PREDICTED: uncharacterized protein LOC103341...   399   e-108
ref|XP_011083975.1| PREDICTED: uncharacterized protein LOC105166...   399   e-108
ref|XP_007204814.1| hypothetical protein PRUPE_ppa026720mg [Prun...   399   e-108
ref|XP_009795108.1| PREDICTED: uncharacterized protein LOC104241...   397   e-108
ref|XP_002324538.1| methyladenine glycosylase family protein [Po...   397   e-108
ref|XP_009617886.1| PREDICTED: uncharacterized protein LOC104110...   396   e-107
ref|XP_011018029.1| PREDICTED: uncharacterized protein LOC105121...   392   e-106
ref|XP_012462430.1| PREDICTED: uncharacterized protein LOC105782...   391   e-106
ref|XP_004287130.1| PREDICTED: uncharacterized protein LOC101313...   391   e-106
ref|XP_010275821.1| PREDICTED: uncharacterized protein LOC104610...   390   e-105
ref|XP_012449856.1| PREDICTED: uncharacterized protein LOC105772...   390   e-105
ref|XP_011083973.1| PREDICTED: uncharacterized protein LOC105166...   390   e-105
ref|XP_002529378.1| DNA-3-methyladenine glycosylase, putative [R...   390   e-105
gb|KHG05578.1| guaA [Gossypium arboreum]                              389   e-105
gb|KHG15995.1| putative GMP synthase [glutamine-hydrolyzing] [Go...   388   e-105

>ref|XP_002276173.1| PREDICTED: uncharacterized protein LOC100267363 isoform X2 [Vitis
            vinifera] gi|297743642|emb|CBI36525.3| unnamed protein
            product [Vitis vinifera]
          Length = 375

 Score =  412 bits (1058), Expect = e-112
 Identities = 223/348 (64%), Positives = 255/348 (73%), Gaps = 6/348 (1%)
 Frame = -2

Query: 1050 MSGAPRERSTNLADSQPRH----AGNKAFRLKNAQKPVLKTLRKVEESIKDEGVELDKNK 883
            MSG PR RS N+ADS+ R     AGNK  R  + +KP  K LRK E++ KD+  E  K  
Sbjct: 1    MSGGPRVRSMNVADSEVRPVLGPAGNKTMRSLSGRKPATKPLRKAEKATKDD--EEIKAL 58

Query: 882  PISVTKLSNP-LHSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMYRTS 706
            P S    S+P  HSVSVP VLRR E +L                    SRASTGR+ R+S
Sbjct: 59   PSSNGAASSPPSHSVSVPLVLRRQEQLLHSNLSLNASCSSDASTDSFHSRASTGRITRSS 118

Query: 705  STSNWRKQLGSKAKIVVPNGVPEPLPDDVQVKRRCAWVTANTDPSYVTFHDEEWGVPAHD 526
            ST+  R+   SK K++V +GV E  PD ++ KRRCAWVT NTD SY+ FHDEEWGVP HD
Sbjct: 119  STAR-RRSYASKPKVIVSDGVSESPPDGLKAKRRCAWVTPNTDLSYIAFHDEEWGVPVHD 177

Query: 525  DNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXXXXX 346
            D KLFELLVL GALAE+TWP ILS+RHIFREVFADFDPI+VAK+NEKK++A         
Sbjct: 178  DKKLFELLVLSGALAELTWPTILSKRHIFREVFADFDPIAVAKLNEKKLMAPGSIASSLI 237

Query: 345  SELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKADLISK 166
            SELKLR II+NARQ+SKVIDEFGSFD+YIW FV HKPIV RFRYPR VPV+T KAD+ISK
Sbjct: 238  SELKLRGIIENARQMSKVIDEFGSFDEYIWSFVNHKPIVSRFRYPRHVPVKTPKADVISK 297

Query: 165  DLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDC-DAAEVKQEE 25
            DLVRRGFRSVGPTV+YSFMQVAGITNDHLISCFRF DC  AAEVK+EE
Sbjct: 298  DLVRRGFRSVGPTVIYSFMQVAGITNDHLISCFRFQDCVTAAEVKEEE 345


>ref|XP_010656606.1| PREDICTED: uncharacterized protein LOC100267363 isoform X1 [Vitis
            vinifera] gi|731407750|ref|XP_010656607.1| PREDICTED:
            uncharacterized protein LOC100267363 isoform X1 [Vitis
            vinifera] gi|731407752|ref|XP_010656608.1| PREDICTED:
            uncharacterized protein LOC100267363 isoform X1 [Vitis
            vinifera] gi|731407754|ref|XP_010656609.1| PREDICTED:
            uncharacterized protein LOC100267363 isoform X1 [Vitis
            vinifera]
          Length = 376

 Score =  407 bits (1046), Expect = e-111
 Identities = 223/349 (63%), Positives = 255/349 (73%), Gaps = 7/349 (2%)
 Frame = -2

Query: 1050 MSGAPRERSTNLADSQPRH----AGNKAFRLKNAQKPVLKTLRKVEESIKDEGVELDKNK 883
            MSG PR RS N+ADS+ R     AGNK  R  + +KP  K LRK E++ KD+  E  K  
Sbjct: 1    MSGGPRVRSMNVADSEVRPVLGPAGNKTMRSLSGRKPATKPLRKAEKATKDD--EEIKAL 58

Query: 882  PISVTKLSNP-LHSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMYRTS 706
            P S    S+P  HSVSVP VLRR E +L                    SRASTGR+ R+S
Sbjct: 59   PSSNGAASSPPSHSVSVPLVLRRQEQLLHSNLSLNASCSSDASTDSFHSRASTGRITRSS 118

Query: 705  STSNWRKQLGSKAKIVVPNGVPEPLPDDVQVKRRCAWVTANT-DPSYVTFHDEEWGVPAH 529
            ST+  R+   SK K++V +GV E  PD ++ KRRCAWVT NT D SY+ FHDEEWGVP H
Sbjct: 119  STAR-RRSYASKPKVIVSDGVSESPPDGLKAKRRCAWVTPNTADLSYIAFHDEEWGVPVH 177

Query: 528  DDNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXXXX 349
            DD KLFELLVL GALAE+TWP ILS+RHIFREVFADFDPI+VAK+NEKK++A        
Sbjct: 178  DDKKLFELLVLSGALAELTWPTILSKRHIFREVFADFDPIAVAKLNEKKLMAPGSIASSL 237

Query: 348  XSELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKADLIS 169
             SELKLR II+NARQ+SKVIDEFGSFD+YIW FV HKPIV RFRYPR VPV+T KAD+IS
Sbjct: 238  ISELKLRGIIENARQMSKVIDEFGSFDEYIWSFVNHKPIVSRFRYPRHVPVKTPKADVIS 297

Query: 168  KDLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDC-DAAEVKQEE 25
            KDLVRRGFRSVGPTV+YSFMQVAGITNDHLISCFRF DC  AAEVK+EE
Sbjct: 298  KDLVRRGFRSVGPTVIYSFMQVAGITNDHLISCFRFQDCVTAAEVKEEE 346


>emb|CDP10232.1| unnamed protein product [Coffea canephora]
          Length = 380

 Score =  405 bits (1041), Expect = e-110
 Identities = 217/358 (60%), Positives = 249/358 (69%), Gaps = 10/358 (2%)
 Frame = -2

Query: 1050 MSGAPRERSTNLADSQPRHA----GNKAFRLKNAQKPVLKTLRKVEESIKDEGVELDKN- 886
            MSGAPR R  ++ DS+ R      GNKA R    +KPV K     E+S  +  V  DKN 
Sbjct: 1    MSGAPRMRPMSVGDSEVRTVLVPGGNKAQRSLRVKKPVTKAWGNAEKSTDEVEVVEDKNG 60

Query: 885  --KPISVTKLSNPLHSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMYR 712
               P SVT LS PL+S   PS+LRR + +L                    SRASTGR+YR
Sbjct: 61   PSSPTSVTDLSPPLNSSRFPSILRRQDSLLHSSLSLSASCSSDASTDSFHSRASTGRIYR 120

Query: 711  TSSTSNWRKQLGSKAKIVVPNGVPEPLPDDVQVKRRCAWVTANTDPSYVTFHDEEWGVPA 532
            T   +N +K L SKAKIV PNGV     D +  KR CAWVT  TDP+Y TFHDEEWGVP 
Sbjct: 121  TRIIANRKKHLASKAKIVGPNGVSGSTSDGLPAKRTCAWVTPTTDPAYATFHDEEWGVPV 180

Query: 531  HDDNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXXX 352
            HDD +LFELLVLCGAL+E+TWP+ILSRR IFREVFADFDP  VAK+NEKKIIA       
Sbjct: 181  HDDKRLFELLVLCGALSELTWPSILSRRQIFREVFADFDPTVVAKLNEKKIIAPGNTASS 240

Query: 351  XXSELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKADLI 172
              SEL+LRAII+NARQ+SKVIDEFGSFDKYIW FV HKP+V RFRYPRQ+PV+T KAD+I
Sbjct: 241  LLSELRLRAIIENARQISKVIDEFGSFDKYIWSFVNHKPLVSRFRYPRQIPVKTPKADVI 300

Query: 171  SKDLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDCDAAEVKQE---EDDIKEK 7
            SKDL+RRGFR VGPTVVYSFMQVAG+TNDHL+SCFRF DC   E K E   ED  ++K
Sbjct: 301  SKDLMRRGFRCVGPTVVYSFMQVAGLTNDHLVSCFRFQDCMTPEGKAEASVEDIAQQK 358


>ref|XP_007011936.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao]
            gi|590572766|ref|XP_007011937.1| DNA glycosylase
            superfamily protein isoform 1 [Theobroma cacao]
            gi|590572769|ref|XP_007011938.1| DNA glycosylase
            superfamily protein isoform 1 [Theobroma cacao]
            gi|590572773|ref|XP_007011939.1| DNA glycosylase
            superfamily protein isoform 1 [Theobroma cacao]
            gi|508782299|gb|EOY29555.1| DNA glycosylase superfamily
            protein isoform 1 [Theobroma cacao]
            gi|508782300|gb|EOY29556.1| DNA glycosylase superfamily
            protein isoform 1 [Theobroma cacao]
            gi|508782301|gb|EOY29557.1| DNA glycosylase superfamily
            protein isoform 1 [Theobroma cacao]
            gi|508782302|gb|EOY29558.1| DNA glycosylase superfamily
            protein isoform 1 [Theobroma cacao]
          Length = 379

 Score =  404 bits (1039), Expect = e-110
 Identities = 219/353 (62%), Positives = 253/353 (71%), Gaps = 6/353 (1%)
 Frame = -2

Query: 1050 MSGAPRERSTNLADSQPRH----AGNKAFRLKNAQKPVLKTLRKVEESIKDEGVELDKNK 883
            MSGAPR RS N+ADS+ R     AGNKA  L +A+KP  K LRKVE+S  +  V  +K  
Sbjct: 1    MSGAPRMRSMNVADSEARPVLGPAGNKAGSL-SARKPASKPLRKVEKSPVEVTVAEEKKA 59

Query: 882  PIS--VTKLSNPLHSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMYRT 709
              S  V  LS   HSVSVPSVLRRHE +L                    SRASTGR+ R+
Sbjct: 60   LPSSTVNSLSPKTHSVSVPSVLRRHEQLLHSNLSLNASCSSDASTDSFHSRASTGRLIRS 119

Query: 708  SSTSNWRKQLGSKAKIVVPNGVPEPLPDDVQVKRRCAWVTANTDPSYVTFHDEEWGVPAH 529
            +S  N RK   SK + VV +G  +  PD    K+RCAWVT NTDPSYV FHDEEWGVP H
Sbjct: 120  NSVGNRRKPYASKPRSVVSDGGLDSPPDGSHQKKRCAWVTPNTDPSYVAFHDEEWGVPVH 179

Query: 528  DDNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXXXX 349
            DD KLFELLVL GAL+E+TWPAILS+RHI REVF DFD ++V+K+NEKK++         
Sbjct: 180  DDRKLFELLVLSGALSELTWPAILSKRHIVREVFVDFDAVAVSKLNEKKLVTPGSIASSL 239

Query: 348  XSELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKADLIS 169
             SELKLRAII+NARQ+SKVIDEFGSFD+YIW FV HKPIV RFRYPRQVPV+T KAD+IS
Sbjct: 240  LSELKLRAIIENARQISKVIDEFGSFDEYIWSFVNHKPIVSRFRYPRQVPVKTPKADVIS 299

Query: 168  KDLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDCDAAEVKQEEDDIKE 10
            KDLVRRGFRSVGPTV+YSFMQVAGITNDHL SCFRF +C  A   +EE+ IK+
Sbjct: 300  KDLVRRGFRSVGPTVIYSFMQVAGITNDHLTSCFRFQECITAAEGKEENGIKD 352


>ref|XP_010103669.1| Putative Glutamine amidotransferase [Morus notabilis]
            gi|587908671|gb|EXB96612.1| Putative Glutamine
            amidotransferase [Morus notabilis]
          Length = 383

 Score =  403 bits (1036), Expect = e-109
 Identities = 221/357 (61%), Positives = 259/357 (72%), Gaps = 8/357 (2%)
 Frame = -2

Query: 1050 MSGAPRERSTNLADSQPRH----AGNKAFRLKNAQKPVLKTLRKVEESIKDEGVELDKNK 883
            MSGAPR RS N+ADS+ R     AGNKA    + +K   KT RKV++S  +  +  +K K
Sbjct: 1    MSGAPRVRSMNVADSESRPVLGLAGNKAGTWSSTRKSTSKTPRKVDKSPDEVTLSEEKKK 60

Query: 882  P--ISVTKLSNP-LHSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMY- 715
               +S T  ++P LHS SVPSVLRRHE +L                    SRASTGR+  
Sbjct: 61   TRQVSSTGATSPQLHSSSVPSVLRRHEQLLHSNLSLNASCSSDASTDSFHSRASTGRLLT 120

Query: 714  RTSSTSNWRKQLGSKAKIVVPNGVPEPLPDDVQVKRRCAWVTANTDPSYVTFHDEEWGVP 535
            R+ ST + RKQL S+ + VV +G  E  PDD Q K+RCAWVT NT+P YV FHDEEWGVP
Sbjct: 121  RSYSTGSRRKQLVSRTRSVVSDGGLESPPDDSQQKKRCAWVTPNTEPCYVAFHDEEWGVP 180

Query: 534  AHDDNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXX 355
             HDD KLFELLVL GALAE+TWPAILS+RHIFREVFADFDP +V+K+NEKKI+A      
Sbjct: 181  VHDDRKLFELLVLSGALAELTWPAILSKRHIFREVFADFDPAAVSKLNEKKIMAPGSTAS 240

Query: 354  XXXSELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKADL 175
               SELKLRAII+N RQ+SKVIDEFGSFD YIW FV +KPIV +FRYPRQVPV+T KAD+
Sbjct: 241  SLLSELKLRAIIENGRQISKVIDEFGSFDNYIWSFVNNKPIVSKFRYPRQVPVKTPKADV 300

Query: 174  ISKDLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDCDAAEVKQEEDDIKEKA 4
            ISKDLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRF +C  A   ++E+ IK +A
Sbjct: 301  ISKDLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFQECLNAAEGKDENGIKNEA 357


>ref|XP_008242987.1| PREDICTED: uncharacterized protein LOC103341267 isoform X1 [Prunus
            mume]
          Length = 378

 Score =  399 bits (1025), Expect = e-108
 Identities = 215/354 (60%), Positives = 256/354 (72%), Gaps = 5/354 (1%)
 Frame = -2

Query: 1050 MSGAPRERSTNLADSQPRH----AGNKAFRLKNAQKPVLKTLRKVEE-SIKDEGVELDKN 886
            MSGAPR RS N+ADS+ R     AGNKA    +A+KPV K LRK E+ + K    E  K 
Sbjct: 1    MSGAPRVRSINVADSESRPVLGPAGNKAGTF-SARKPVSKPLRKAEKLAEKVASAEEKKT 59

Query: 885  KPISVTKLSNPLHSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMYRTS 706
            +  S+   S  LHS SVPSVLRRHE +L                    SRASTGR+ R++
Sbjct: 60   RQSSMLTTSPQLHSPSVPSVLRRHEQLLHSNFSLNASCSSDASTDSFHSRASTGRLTRSN 119

Query: 705  STSNWRKQLGSKAKIVVPNGVPEPLPDDVQVKRRCAWVTANTDPSYVTFHDEEWGVPAHD 526
            S  + RKQ  SK + VV +G  +  PD  Q K+RCAWVT NTDP Y  FHDEEWG+P HD
Sbjct: 120  SAGSRRKQYVSKPRSVVSDGGLDSPPDGSQSKKRCAWVTPNTDPCYAAFHDEEWGLPVHD 179

Query: 525  DNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXXXXX 346
            D KLFELLVL GALAE++WPAILS++HIFREVFADFDP++V+K+NEKK+IA         
Sbjct: 180  DKKLFELLVLSGALAELSWPAILSKKHIFREVFADFDPVAVSKLNEKKLIAPGSTASSLL 239

Query: 345  SELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKADLISK 166
            SELKLRAII+NARQ++KVI+EFGSFDKYIW FV +KPIV RFRYPRQVP +T KAD+ISK
Sbjct: 240  SELKLRAIIENARQMTKVIEEFGSFDKYIWSFVNNKPIVSRFRYPRQVPAKTPKADVISK 299

Query: 165  DLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDCDAAEVKQEEDDIKEKA 4
            DLVRRGFRSVGPTV+YSFMQVAGITNDHL+SCFRF +C  A   +E+  IK++A
Sbjct: 300  DLVRRGFRSVGPTVIYSFMQVAGITNDHLVSCFRFQECLNAAEGKEDYGIKDEA 353


>ref|XP_011083975.1| PREDICTED: uncharacterized protein LOC105166349 isoform X2 [Sesamum
            indicum]
          Length = 372

 Score =  399 bits (1024), Expect = e-108
 Identities = 221/359 (61%), Positives = 257/359 (71%), Gaps = 10/359 (2%)
 Frame = -2

Query: 1050 MSGAPRERSTNLADSQPRH----AGNKAFRLKNAQKPVLKTLRKVEESIKDEGVELDKNK 883
            MSG  + RSTN+ADS+ R      GNKA RL +++K V+K L+K    ++D+   L    
Sbjct: 1    MSGTAKIRSTNMADSEVRPILGPGGNKAQRLIDSRKHVVKPLKKEAVPVEDKNGSL---- 56

Query: 882  PISVTKLSNPL-HSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMYRTS 706
            P S    S+PL H VSVPS L RHE +L                    SRASTGR+ RT 
Sbjct: 57   PASTRAESSPLLHYVSVPSTLHRHESLLCSNLSLSASCSSDASTDSFHSRASTGRICRTI 116

Query: 705  STSNWRKQLGSKAKI-VVPNGVPEPLPDDVQVKRRCAWVTANTDPSYVTFHDEEWGVPAH 529
            S S+ RK+L  KA+   V NGV E L + VQ KRRCAWVTANTDP YV FHDEEWGVP H
Sbjct: 117  SKSS-RKELALKARNGAVSNGVTESLTEGVQAKRRCAWVTANTDPIYVAFHDEEWGVPTH 175

Query: 528  DDNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXXXX 349
            DD KLFE LVL GALAE+TWPAILS+RHIFREVF DFDP +VAK++EKKIIA        
Sbjct: 176  DDRKLFEFLVLSGALAELTWPAILSKRHIFREVFVDFDPTAVAKLSEKKIIAPGSPASSL 235

Query: 348  XSELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKADLIS 169
             SELKLR+II+NARQVS+VIDEFGSFDKYIW FV +KPIVG FRYPRQVPV+T KAD+IS
Sbjct: 236  LSELKLRSIIENARQVSRVIDEFGSFDKYIWSFVNYKPIVGSFRYPRQVPVKTPKADVIS 295

Query: 168  KDLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDCDAAEVKQ----EEDDIKEKA 4
            KDLVRRGFRSVGPT++YSFMQ AGITNDHL+SCFRFH+C AA+ K+       D +EKA
Sbjct: 296  KDLVRRGFRSVGPTIIYSFMQGAGITNDHLMSCFRFHECGAAKAKEGSPLTNKDEEEKA 354


>ref|XP_007204814.1| hypothetical protein PRUPE_ppa026720mg [Prunus persica]
            gi|462400345|gb|EMJ06013.1| hypothetical protein
            PRUPE_ppa026720mg [Prunus persica]
          Length = 378

 Score =  399 bits (1024), Expect = e-108
 Identities = 214/354 (60%), Positives = 256/354 (72%), Gaps = 5/354 (1%)
 Frame = -2

Query: 1050 MSGAPRERSTNLADSQPRH----AGNKAFRLKNAQKPVLKTLRKVEE-SIKDEGVELDKN 886
            MSGAPR RS N+ADS+ R     AGNKA    +A+KPV K LRK E+ + K    E  K 
Sbjct: 1    MSGAPRVRSINVADSESRPVLGPAGNKAGTF-SARKPVSKPLRKAEKLAEKVASAEEKKT 59

Query: 885  KPISVTKLSNPLHSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMYRTS 706
            +  S+   S  LHS SVPSVLRRHE +L                    SRASTGR+ R++
Sbjct: 60   RQSSMLTTSPQLHSPSVPSVLRRHEQLLHSNFSLNASCSSDASTDSFHSRASTGRLTRSN 119

Query: 705  STSNWRKQLGSKAKIVVPNGVPEPLPDDVQVKRRCAWVTANTDPSYVTFHDEEWGVPAHD 526
            S  + RKQ  SK + VV +G  +  PD  Q K+RCAWVT NTDP Y  FHDEEWG+P HD
Sbjct: 120  SAGSRRKQYVSKPRSVVSDGGLDSPPDGSQSKKRCAWVTPNTDPCYAAFHDEEWGLPVHD 179

Query: 525  DNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXXXXX 346
            D KLFELLVL GALAE++WPAILS++HIFREVFADFDP++++K+NEKK+IA         
Sbjct: 180  DKKLFELLVLSGALAELSWPAILSKKHIFREVFADFDPVAISKLNEKKLIAPGSNASSLL 239

Query: 345  SELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKADLISK 166
            SELKLRAII+NARQ++KVI+EFGSFDKYIW FV +KPIV RFRYPRQVP +T KAD+ISK
Sbjct: 240  SELKLRAIIENARQMTKVIEEFGSFDKYIWSFVNNKPIVSRFRYPRQVPAKTPKADVISK 299

Query: 165  DLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDCDAAEVKQEEDDIKEKA 4
            DL+RRGFRSVGPTV+YSFMQVAGITNDHL+SCFRF +C  A   +EE  IK++A
Sbjct: 300  DLMRRGFRSVGPTVIYSFMQVAGITNDHLVSCFRFQECLNAAEGKEEYGIKDEA 353


>ref|XP_009795108.1| PREDICTED: uncharacterized protein LOC104241850 [Nicotiana
            sylvestris] gi|698498404|ref|XP_009795109.1| PREDICTED:
            uncharacterized protein LOC104241850 [Nicotiana
            sylvestris]
          Length = 368

 Score =  397 bits (1020), Expect = e-108
 Identities = 216/352 (61%), Positives = 252/352 (71%), Gaps = 5/352 (1%)
 Frame = -2

Query: 1050 MSGAPRERSTNLADSQPRH----AGNKAFRLKNAQKPVLKTLRKVEESIKDEGVELDKNK 883
            MSGA R RS N ADS+ R     AGNKA R   ++K V K  RK  +S K+E    D+  
Sbjct: 1    MSGASRVRSMNAADSEARPVLGLAGNKALRSPGSRKSVSKPTRKAVKS-KEEDKNGDQPS 59

Query: 882  PISVTKLSNPLHSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMYRTSS 703
            P         LHS  VPS+LRR E  L                    S ASTGR+YR SS
Sbjct: 60   P--------SLHSFDVPSILRRQES-LYSNLSLSASCSSDASTDSFHSSASTGRIYRMSS 110

Query: 702  TSNWRKQLGSKAKIVVPNGVPEPLPDDVQVKRRCAWVTANTDPSYVTFHDEEWGVPAHDD 523
            TS+ RKQL SK+K +V + + +   D +Q K+RC+WVT NTDPSY  FHDEEWGVP HDD
Sbjct: 111  TSSRRKQLASKSKRIVSDDISDSSIDGLQSKKRCSWVTPNTDPSYADFHDEEWGVPVHDD 170

Query: 522  NKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXXXXXS 343
             KLFELLVLCGALAE++WP+IL +RHIFREVFADFDPI VAK+NEKKI+A         S
Sbjct: 171  KKLFELLVLCGALAELSWPSILCKRHIFREVFADFDPIVVAKLNEKKILAPGSTACSLLS 230

Query: 342  ELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKADLISKD 163
            ELKLRAII+NARQ+SKVIDEFGSFDKYIW FV +KPIV  FRYPRQVPV+T+KADLISKD
Sbjct: 231  ELKLRAIIENARQMSKVIDEFGSFDKYIWSFVNNKPIVSGFRYPRQVPVKTAKADLISKD 290

Query: 162  LVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDC-DAAEVKQEEDDIKE 10
            L+RRGFR VGPTVVYSFMQV+GITNDHLISCFRFHDC ++AE K+++ +  E
Sbjct: 291  LIRRGFRGVGPTVVYSFMQVSGITNDHLISCFRFHDCVESAEAKEKDSNKDE 342


>ref|XP_002324538.1| methyladenine glycosylase family protein [Populus trichocarpa]
            gi|222865972|gb|EEF03103.1| methyladenine glycosylase
            family protein [Populus trichocarpa]
          Length = 380

 Score =  397 bits (1020), Expect = e-108
 Identities = 223/361 (61%), Positives = 254/361 (70%), Gaps = 15/361 (4%)
 Frame = -2

Query: 1050 MSGAPRERSTNLADSQPRHA----GNKAFRLKNAQKPVLKTLRKVEESIKDEGVELDKNK 883
            MSGAPR RS N+ADS+ R      GN      +A+KPV K  RKVE+S   E V+L + K
Sbjct: 1    MSGAPRVRSMNVADSEARSVLGPTGNNKAGPLSARKPVSKQSRKVEKS--PEEVKLGEEK 58

Query: 882  PI----SVTKLSNPLHSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMY 715
                  +V  LS   HS+++ SVLRRHEL+L                    SRASTGR+ 
Sbjct: 59   KTLTVPAVGTLSPKSHSLNISSVLRRHELLLHSNLSLNASCSSDASTDSFHSRASTGRLT 118

Query: 714  RTSSTSNWRKQLGSKAKIVVPNGVPE--PLPDDVQVKRRCAWVTANTDPSYVTFHDEEWG 541
            R++S    RKQ   + +  V  G  E  P PDD Q K+ CAWVT NTDP Y TFHDEEWG
Sbjct: 119  RSNSAGTRRKQYVLRPRSFVSEGGLESPPSPDDSQSKKSCAWVTPNTDPCYATFHDEEWG 178

Query: 540  VPAHDDNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXX 361
            VP HDD KLFELLVL GALAE+TWPAILS+RHIFREVFADFDPI+V+K NEKKI+A    
Sbjct: 179  VPIHDDRKLFELLVLSGALAELTWPAILSKRHIFREVFADFDPIAVSKFNEKKILAPGST 238

Query: 360  XXXXXSELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKA 181
                 SELKLRAI++NARQ+SKVIDEFGSFDKYIW FV +KPIV RFRYPRQVPV+T KA
Sbjct: 239  ATSLLSELKLRAIVENARQISKVIDEFGSFDKYIWSFVNYKPIVSRFRYPRQVPVKTPKA 298

Query: 180  DLISKDLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDC-DAAEVKQE----EDDI 16
            D ISKDLVRRGFRSVGPTV+YSFMQVAGITNDHLISCFRF +C DAAE K E     +DI
Sbjct: 299  DAISKDLVRRGFRSVGPTVIYSFMQVAGITNDHLISCFRFQECLDAAEGKVENGIKSEDI 358

Query: 15   K 13
            K
Sbjct: 359  K 359


>ref|XP_009617886.1| PREDICTED: uncharacterized protein LOC104110152 [Nicotiana
            tomentosiformis]
          Length = 372

 Score =  396 bits (1017), Expect = e-107
 Identities = 216/352 (61%), Positives = 253/352 (71%), Gaps = 5/352 (1%)
 Frame = -2

Query: 1050 MSGAPRERSTNLADSQPRH----AGNKAFRLKNAQKPVLKTLRKVEESIKDEGVELDKNK 883
            MSGA R RS N ADS+ R     AGNKA R   ++K V K  RK  +S ++  +E DKN 
Sbjct: 1    MSGASRVRSMNAADSEARPVLGLAGNKALRSPGSRKSVSKPTRKAVKSKEEVEME-DKNG 59

Query: 882  PISVTKLSNPLHSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMYRTSS 703
                 + S  LHS  VPS+LRR E  L                    S ASTGR+YR SS
Sbjct: 60   H----QPSPSLHSFDVPSILRRQES-LYSNLSLSASCSSDASTDSFHSSASTGRIYRMSS 114

Query: 702  TSNWRKQLGSKAKIVVPNGVPEPLPDDVQVKRRCAWVTANTDPSYVTFHDEEWGVPAHDD 523
            TS+ RKQL SK+K +V + + +   D +Q K++C WVT NTDPSY  FHDEEWGVP HDD
Sbjct: 115  TSSRRKQLASKSKRIVSDDISDSSIDGLQSKKKCGWVTPNTDPSYADFHDEEWGVPVHDD 174

Query: 522  NKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXXXXXS 343
             KLFELLVLCGALAE++WP+IL +RHIFREVF DFDPI VAK+NEKKI+A         S
Sbjct: 175  KKLFELLVLCGALAELSWPSILCKRHIFREVFTDFDPIVVAKLNEKKILAPGSTACSLLS 234

Query: 342  ELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKADLISKD 163
            ELKLRAII+NARQ+SKVIDEFGSFDKYIW FV +KPIV  FRYPRQVPV+T+KADLISKD
Sbjct: 235  ELKLRAIIENARQMSKVIDEFGSFDKYIWSFVNNKPIVSGFRYPRQVPVKTAKADLISKD 294

Query: 162  LVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDC-DAAEVKQEEDDIKE 10
            L+RRGFR VGPTVVYSFMQVAGITNDHLISCFRFHDC ++AE K+++ +  E
Sbjct: 295  LIRRGFRGVGPTVVYSFMQVAGITNDHLISCFRFHDCVESAEAKEKDSNKDE 346


>ref|XP_011018029.1| PREDICTED: uncharacterized protein LOC105121177 [Populus euphratica]
          Length = 380

 Score =  392 bits (1007), Expect = e-106
 Identities = 218/359 (60%), Positives = 252/359 (70%), Gaps = 13/359 (3%)
 Frame = -2

Query: 1050 MSGAPRERSTNLADSQPRHA----GNKAFRLKNAQKPVLKTLRKVEESIKDEGVELDKNK 883
            MSGAPR +S N+ DS+ R      GN      +A+KP  K LRKVE+S ++  +  +K  
Sbjct: 1    MSGAPRVKSMNVTDSEARSVLGPTGNNKAGPLSARKPASKQLRKVEKSAEEVRLGEEKKT 60

Query: 882  PI--SVTKLSNPLHSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMYRT 709
             I  +V  LS   HS+++ SVL RHEL+L                    SRASTGR+ R+
Sbjct: 61   LIVPAVGTLSPKSHSLNISSVLLRHELLLHSNLSLNASCSSDASTDSFHSRASTGRLTRS 120

Query: 708  SSTSNWRKQLGSKAKIVVPNGVPE--PLPDDVQVKRRCAWVTANTDPSYVTFHDEEWGVP 535
            +S    +KQ  S+ +  V  G  E  P P+D Q K+ CAWVT NTDP Y TFHDEEWGVP
Sbjct: 121  NSAGTRKKQYVSRPRSFVSEGGLESPPSPNDSQSKKSCAWVTPNTDPCYATFHDEEWGVP 180

Query: 534  AHDDNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXX 355
             HDD KLFELLVL GALAE+TWPAILS+RHIFREVFADFDP++V+K NEKKIIA      
Sbjct: 181  IHDDRKLFELLVLSGALAELTWPAILSKRHIFREVFADFDPVAVSKFNEKKIIAPGSTAT 240

Query: 354  XXXSELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKADL 175
               SELKLRAII+NARQ+SKVIDEFGSFDKYIW FV  KPIV RFRYPRQVPV+T KAD 
Sbjct: 241  SLLSELKLRAIIENARQISKVIDEFGSFDKYIWSFVNFKPIVSRFRYPRQVPVKTPKADA 300

Query: 174  ISKDLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDC-DAAEVKQE----EDDIK 13
            ISKDLVRRGFRSVGPTV+YSFMQVAGITNDHLISCFRF +C DAAE K E     +DIK
Sbjct: 301  ISKDLVRRGFRSVGPTVIYSFMQVAGITNDHLISCFRFQECLDAAEGKGENGIKSEDIK 359


>ref|XP_012462430.1| PREDICTED: uncharacterized protein LOC105782309 [Gossypium raimondii]
            gi|763815982|gb|KJB82834.1| hypothetical protein
            B456_013G216100 [Gossypium raimondii]
            gi|763815983|gb|KJB82835.1| hypothetical protein
            B456_013G216100 [Gossypium raimondii]
          Length = 381

 Score =  391 bits (1005), Expect = e-106
 Identities = 218/356 (61%), Positives = 254/356 (71%), Gaps = 7/356 (1%)
 Frame = -2

Query: 1050 MSGAPRERSTNLADSQPRH----AGNKAFRLKNAQKPVLKTLRKVEES-IKDEGVELDKN 886
            MSGAPR RS N  DS+ R     AGNKA  L +A+KP  K LRKVE+S ++    E  K+
Sbjct: 1    MSGAPRLRSMNAPDSEARPVLGPAGNKAGSL-SARKPASKPLRKVEKSPVEVTATEEKKS 59

Query: 885  KPIS-VTKLSNPLHSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMYRT 709
             P S V+ LS   HSVSVPSVLRRHE +L                    SRASTGR+ R+
Sbjct: 60   LPSSIVSSLSPKKHSVSVPSVLRRHEKLLHSNLSLNASCSSDASTDSFHSRASTGRLIRS 119

Query: 708  SSTSNWRKQLGSKAKIVVPNGVPEPLPDDVQVKRRCAWVTANTDPSYVTFHDEEWGVPAH 529
            +S  + RK   SK +  V +   +   D    K+RCAWVT NTDPSY TFHDEEWGVP H
Sbjct: 120  NSVGSRRKPYVSKPRSFVSDSGSDSPSDGSHQKKRCAWVTPNTDPSYATFHDEEWGVPVH 179

Query: 528  DDNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXXXX 349
            DD KLFELLVL GAL+E+TWPAILS+R +FREVF DFDP +V+K+NEKK+IA        
Sbjct: 180  DDKKLFELLVLSGALSELTWPAILSKRQMFREVFMDFDPAAVSKLNEKKLIAPGSVSSSL 239

Query: 348  XSELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKADLIS 169
             SELKLRAII+NARQ+SKVIDEFGSFD+YIW FV HKPI+ +FRYPRQVPV+T KAD+IS
Sbjct: 240  LSELKLRAIIENARQISKVIDEFGSFDEYIWSFVNHKPIISKFRYPRQVPVKTPKADVIS 299

Query: 168  KDLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDC-DAAEVKQEEDDIKEKA 4
            KDLVRRGFRSVGPTV+YSFMQVAGITNDHL  CFRF +C  AAE K+ E  IKE+A
Sbjct: 300  KDLVRRGFRSVGPTVIYSFMQVAGITNDHLTGCFRFQECITAAEGKEVE--IKERA 353


>ref|XP_004287130.1| PREDICTED: uncharacterized protein LOC101313540 [Fragaria vesca
            subsp. vesca]
          Length = 429

 Score =  391 bits (1004), Expect = e-106
 Identities = 209/354 (59%), Positives = 251/354 (70%), Gaps = 5/354 (1%)
 Frame = -2

Query: 1050 MSGAPRERSTNLADSQPRH----AGNKAFRLKNAQKPVLKTLRKVEESIKD-EGVELDKN 886
            MSGAPR +S N+A+S+ R     AGNK     +A+KP  K LRK E+ +++    E  K 
Sbjct: 1    MSGAPRVKSINVANSESRSVLGPAGNKGGAF-SARKPATKPLRKTEKMVEEFTSAEDKKT 59

Query: 885  KPISVTKLSNPLHSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMYRTS 706
            +  S    S  LHS+SVPSVLRRHE +L                    SRASTGR+ R++
Sbjct: 60   QQSSKLSTSPQLHSLSVPSVLRRHEQLLQSNFSLNASCSSDASTDSFHSRASTGRLIRSN 119

Query: 705  STSNWRKQLGSKAKIVVPNGVPEPLPDDVQVKRRCAWVTANTDPSYVTFHDEEWGVPAHD 526
            S  + RKQ  SK + VV +G  +  P   Q K+RCAWVT NTDP YV FHDEEWG+P HD
Sbjct: 120  SVGSRRKQYVSKPRSVVSDGGLDSPPGGSQSKKRCAWVTPNTDPCYVAFHDEEWGLPVHD 179

Query: 525  DNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXXXXX 346
            D KLFELLVL GALAE++WP ILS+RHIFREVFADFDP+ V++ NEKKI+A         
Sbjct: 180  DKKLFELLVLSGALAELSWPLILSKRHIFREVFADFDPVDVSEFNEKKIMAPGSVASSLL 239

Query: 345  SELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKADLISK 166
            SE KLRAI++NARQ++KVIDEFGSFDKYIW FV +KPIV RFRYPRQVP +T KAD+ISK
Sbjct: 240  SESKLRAILENARQMTKVIDEFGSFDKYIWSFVNNKPIVSRFRYPRQVPAKTPKADVISK 299

Query: 165  DLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDCDAAEVKQEEDDIKEKA 4
            DLVRRGFRSVGPTV+YSFMQVAGITNDHL+SCFRF DC  A   +EE+  KE++
Sbjct: 300  DLVRRGFRSVGPTVIYSFMQVAGITNDHLVSCFRFQDCLNAAEGKEENRTKEES 353


>ref|XP_010275821.1| PREDICTED: uncharacterized protein LOC104610746 [Nelumbo nucifera]
          Length = 387

 Score =  390 bits (1002), Expect = e-105
 Identities = 212/349 (60%), Positives = 243/349 (69%), Gaps = 5/349 (1%)
 Frame = -2

Query: 1050 MSGAPRERSTNLADSQPRH----AGNKAFRLKNAQKPVLKTLRKVEESIKDEGVELDKNK 883
            MSGAPR RS N+ADS+ R     AGNK   L   +KP  K LRKVE++   E V+ +K  
Sbjct: 1    MSGAPRVRSINVADSEARPVLGPAGNKTRSLVT-RKPASKPLRKVEKT--PEAVDEEKKA 57

Query: 882  PISVTKLSNP-LHSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMYRTS 706
            P S    S P L  VSVPS+LRRHE  L                    SRASTGR+ RT 
Sbjct: 58   PSSPVAASPPKLQPVSVPSILRRHEF-LHSNLSLNASCSSDASSDSVYSRASTGRLIRTR 116

Query: 705  STSNWRKQLGSKAKIVVPNGVPEPLPDDVQVKRRCAWVTANTDPSYVTFHDEEWGVPAHD 526
            ST + RK   S+ + VVP+   +  PD ++ K+RCAWVT NTDP Y  FHDEEWGVP HD
Sbjct: 117  STPSRRKYSISRPEKVVPDSASDSSPDSIETKKRCAWVTPNTDPCYAAFHDEEWGVPVHD 176

Query: 525  DNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXXXXX 346
            D KLFELLVL GALAE+TWP ILS+RHIFREVF+DFDP++V+K+NEKKI A         
Sbjct: 177  DKKLFELLVLSGALAELTWPTILSKRHIFREVFSDFDPVAVSKLNEKKITAPGSTASSLL 236

Query: 345  SELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKADLISK 166
            SELKLRAII+NARQ+ KVIDEFGSFD YIW FV HKPI+ +FRYPRQVPV+  KAD+ISK
Sbjct: 237  SELKLRAIIENARQICKVIDEFGSFDNYIWSFVNHKPIISKFRYPRQVPVKIPKADVISK 296

Query: 165  DLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDCDAAEVKQEEDD 19
            DLVRRGFRSVGPTVVYSFMQVAGITNDHLI+CFRF  C       E DD
Sbjct: 297  DLVRRGFRSVGPTVVYSFMQVAGITNDHLINCFRFQVCMDTPTVSEGDD 345


>ref|XP_012449856.1| PREDICTED: uncharacterized protein LOC105772910 [Gossypium raimondii]
            gi|763798832|gb|KJB65787.1| hypothetical protein
            B456_010G113100 [Gossypium raimondii]
            gi|763798836|gb|KJB65791.1| hypothetical protein
            B456_010G113100 [Gossypium raimondii]
          Length = 374

 Score =  390 bits (1001), Expect = e-105
 Identities = 212/352 (60%), Positives = 252/352 (71%), Gaps = 5/352 (1%)
 Frame = -2

Query: 1050 MSGAPRERSTNLADSQPRH----AGNKAFRLKNAQKPVLKTLRKVEESIKDEGVELDKNK 883
            M G PR RS N+ADS+ R     AGNK   L +A+KP  K  RK+E+   +  +  +KN 
Sbjct: 1    MFGPPRLRSMNMADSEARPVLGPAGNKTGSL-SARKPGSKPSRKIEKCSAEATLAEEKNG 59

Query: 882  PISVTKLSNPLHSVSV-PSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMYRTS 706
             +  +K+++  HSVSV PSVLRRHE +L                    SRASTGR+  ++
Sbjct: 60   -LQSSKVNS--HSVSVVPSVLRRHEQLLHSNLSLNASCSSDASTDSFHSRASTGRLIWSN 116

Query: 705  STSNWRKQLGSKAKIVVPNGVPEPLPDDVQVKRRCAWVTANTDPSYVTFHDEEWGVPAHD 526
            S    RK   S  + VV +G  + LP D   ++RCAWVT NTDPSYV FHDEEWGVP HD
Sbjct: 117  SVGTRRKPFPSTPRSVVSDGGLDSLPGDSHRRKRCAWVTPNTDPSYVAFHDEEWGVPVHD 176

Query: 525  DNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXXXXX 346
            D KLFELLVL GAL+E+TWPAILS+RHIFREVFADFDP++V+K+NEKK+IA         
Sbjct: 177  DKKLFELLVLAGALSELTWPAILSKRHIFREVFADFDPLAVSKLNEKKLIAPGSTASSLL 236

Query: 345  SELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKADLISK 166
            SELKLRAI++NA Q+SKVIDEFGSFDKYIW FV HKPIV RFRYPRQVPV+T KAD+ISK
Sbjct: 237  SELKLRAIVENAHQISKVIDEFGSFDKYIWSFVNHKPIVSRFRYPRQVPVKTPKADVISK 296

Query: 165  DLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDCDAAEVKQEEDDIKE 10
            DLVRRGFRSVGPTV+YSFMQV+GITNDHL SCFRF DC  A   +EE+ IKE
Sbjct: 297  DLVRRGFRSVGPTVIYSFMQVSGITNDHLTSCFRFQDCITAAEGKEENGIKE 348


>ref|XP_011083973.1| PREDICTED: uncharacterized protein LOC105166349 isoform X1 [Sesamum
            indicum] gi|747073987|ref|XP_011083974.1| PREDICTED:
            uncharacterized protein LOC105166349 isoform X1 [Sesamum
            indicum]
          Length = 384

 Score =  390 bits (1001), Expect = e-105
 Identities = 221/371 (59%), Positives = 257/371 (69%), Gaps = 22/371 (5%)
 Frame = -2

Query: 1050 MSGAPRERSTNLADSQPRH----AGNKAFRLKNAQKPVLKTLRKVEESIKDEGVELDKNK 883
            MSG  + RSTN+ADS+ R      GNKA RL +++K V+K L+K    ++D+   L    
Sbjct: 1    MSGTAKIRSTNMADSEVRPILGPGGNKAQRLIDSRKHVVKPLKKEAVPVEDKNGSL---- 56

Query: 882  PISVTKLSNPL-HSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMYRTS 706
            P S    S+PL H VSVPS L RHE +L                    SRASTGR+ RT 
Sbjct: 57   PASTRAESSPLLHYVSVPSTLHRHESLLCSNLSLSASCSSDASTDSFHSRASTGRICRTI 116

Query: 705  STSNWRKQLGSKAKI-VVPNGVPEPLPDDVQVKRRCAWVTANTDPSYVTFHDEEWGVPAH 529
            S S+ RK+L  KA+   V NGV E L + VQ KRRCAWVTANTDP YV FHDEEWGVP H
Sbjct: 117  SKSS-RKELALKARNGAVSNGVTESLTEGVQAKRRCAWVTANTDPIYVAFHDEEWGVPTH 175

Query: 528  DDNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXXXX 349
            DD KLFE LVL GALAE+TWPAILS+RHIFREVF DFDP +VAK++EKKIIA        
Sbjct: 176  DDRKLFEFLVLSGALAELTWPAILSKRHIFREVFVDFDPTAVAKLSEKKIIAPGSPASSL 235

Query: 348  XSELKLRAIIDNARQVSK------------VIDEFGSFDKYIWGFVKHKPIVGRFRYPRQ 205
             SELKLR+II+NARQVS+            VIDEFGSFDKYIW FV +KPIVG FRYPRQ
Sbjct: 236  LSELKLRSIIENARQVSRVRLTFSRFCKHQVIDEFGSFDKYIWSFVNYKPIVGSFRYPRQ 295

Query: 204  VPVRTSKADLISKDLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDCDAAEVKQ-- 31
            VPV+T KAD+ISKDLVRRGFRSVGPT++YSFMQ AGITNDHL+SCFRFH+C AA+ K+  
Sbjct: 296  VPVKTPKADVISKDLVRRGFRSVGPTIIYSFMQGAGITNDHLMSCFRFHECGAAKAKEGS 355

Query: 30   --EEDDIKEKA 4
                 D +EKA
Sbjct: 356  PLTNKDEEEKA 366


>ref|XP_002529378.1| DNA-3-methyladenine glycosylase, putative [Ricinus communis]
            gi|223531126|gb|EEF32974.1| DNA-3-methyladenine
            glycosylase, putative [Ricinus communis]
          Length = 380

 Score =  390 bits (1001), Expect = e-105
 Identities = 214/356 (60%), Positives = 247/356 (69%), Gaps = 10/356 (2%)
 Frame = -2

Query: 1050 MSGAPRERSTNLADSQPRHA----GNKAFRLKNAQKPVLKTLRKVEESIKDEGVELDKNK 883
            MSGAPR RS N+ADS+ R      GN      +A+KP  K LRKVE S   E V+L + K
Sbjct: 1    MSGAPRVRSMNVADSETRPVLGPTGNNKAGSLSAKKPASKQLRKVETS--PEAVKLGQEK 58

Query: 882  PI----SVTKLSNPLHSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMY 715
             +    + + LS   HSVSVPSVLRRHE +L                    SRASTGR+ 
Sbjct: 59   KLVTVPTASALSPKSHSVSVPSVLRRHEQLLHSNLSLNASCSSDASTDSFHSRASTGRLT 118

Query: 714  RTSSTSNWRKQLGSKAKIVVPNGVPEPLP--DDVQVKRRCAWVTANTDPSYVTFHDEEWG 541
            R++S    RKQ   K + VV +G  E  P  D  Q K+ CAWVT N DP Y  FHDEEWG
Sbjct: 119  RSNSLGTRRKQYALKPRSVVSDGGLESPPPSDGSQAKKSCAWVTPNADPCYTAFHDEEWG 178

Query: 540  VPAHDDNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXX 361
            +P HDD KLFELLVL GALAE+TWPAILS+RHIFREVFA+FDP+ V+K NEKKIIA    
Sbjct: 179  IPVHDDKKLFELLVLSGALAELTWPAILSKRHIFREVFANFDPVVVSKFNEKKIIAPGST 238

Query: 360  XXXXXSELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKA 181
                 SE+KLRAII+NARQ+SKV DE GSFDKYIW FV +KPIV RFRYPRQVPV+T KA
Sbjct: 239  ASSLLSEIKLRAIIENARQISKVTDELGSFDKYIWSFVNYKPIVSRFRYPRQVPVKTPKA 298

Query: 180  DLISKDLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDCDAAEVKQEEDDIK 13
            D+ISKDLVRRGFRSVGPTVVYSFMQVAG+TNDHLISCFRF +C  A   +EE+ +K
Sbjct: 299  DVISKDLVRRGFRSVGPTVVYSFMQVAGLTNDHLISCFRFQECINAAEGKEENGVK 354


>gb|KHG05578.1| guaA [Gossypium arboreum]
          Length = 381

 Score =  389 bits (1000), Expect = e-105
 Identities = 214/355 (60%), Positives = 251/355 (70%), Gaps = 6/355 (1%)
 Frame = -2

Query: 1050 MSGAPRERSTNLADSQPRH----AGNKAFRLKNAQKPVLKTLRKVEESIKD-EGVELDKN 886
            M GAPR RS N  DS+ R     AGNKA  L +A+KP  K LRKVE+S  +    E  K+
Sbjct: 1    MLGAPRLRSMNAPDSEARPVLGPAGNKAGSL-SARKPASKPLRKVEKSPAEVTATEEKKS 59

Query: 885  KPIS-VTKLSNPLHSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMYRT 709
             P S V+ LS   HSVSVPSVLRRHE +L                    SRASTGR+ R+
Sbjct: 60   LPSSIVSSLSPKKHSVSVPSVLRRHEKLLHSNLSLNASCSSDASTDSFHSRASTGRLIRS 119

Query: 708  SSTSNWRKQLGSKAKIVVPNGVPEPLPDDVQVKRRCAWVTANTDPSYVTFHDEEWGVPAH 529
            +S  + RK   SK +  V +   +   D    K+RCAWVT NTDPSY TFHDEEWGVP H
Sbjct: 120  NSVGSRRKPYASKPRSFVSDSGSDSPSDGSHQKKRCAWVTPNTDPSYATFHDEEWGVPVH 179

Query: 528  DDNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXXXX 349
            DD KLFELLVL GAL+E+TWPAILS+R +FREVF DFDP +V+K+NEKK+IA        
Sbjct: 180  DDKKLFELLVLSGALSELTWPAILSKRQMFREVFMDFDPAAVSKLNEKKLIAPGSVSSSL 239

Query: 348  XSELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKADLIS 169
             SELKLRAII+NARQ+SKVIDEFGSFD+YIW FV HKPI+ +FRYPRQVPV+T KAD+IS
Sbjct: 240  LSELKLRAIIENARQISKVIDEFGSFDEYIWSFVNHKPIISKFRYPRQVPVKTPKADVIS 299

Query: 168  KDLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDCDAAEVKQEEDDIKEKA 4
            KDLVRRGFRSVGPTV+YSFMQVAGITNDHL  CFRF +C  A  + +E +IKE+A
Sbjct: 300  KDLVRRGFRSVGPTVIYSFMQVAGITNDHLTGCFRFQECTTA-AEGKEVEIKERA 353


>gb|KHG15995.1| putative GMP synthase [glutamine-hydrolyzing] [Gossypium arboreum]
          Length = 374

 Score =  388 bits (997), Expect = e-105
 Identities = 212/352 (60%), Positives = 252/352 (71%), Gaps = 5/352 (1%)
 Frame = -2

Query: 1050 MSGAPRERSTNLADSQPRH----AGNKAFRLKNAQKPVLKTLRKVEESIKDEGVELDKNK 883
            M G PR RS N ADS+ R     AGNK   L +A+KP  K LRK+E+   +  +  +KN 
Sbjct: 1    MFGPPRLRSMNTADSEARPVLGPAGNKTGSL-SARKPGSKPLRKIEKCSAEATLAEEKNG 59

Query: 882  PISVTKLSNPLHSVSV-PSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMYRTS 706
             +  +K+++  HSVSV PSVLRRHE +L                    SRASTGR+  ++
Sbjct: 60   -LPSSKVNS--HSVSVVPSVLRRHEQLLHSNLSLNASCSSDASTDSFHSRASTGRLIWSN 116

Query: 705  STSNWRKQLGSKAKIVVPNGVPEPLPDDVQVKRRCAWVTANTDPSYVTFHDEEWGVPAHD 526
            S    RK   S  + VV +G  +  P D   ++RCAWVT NTDPSYV FHDEEWGVP HD
Sbjct: 117  SVGTRRKPFPSTPRSVVSDGGLDSPPGDSHRRKRCAWVTLNTDPSYVAFHDEEWGVPVHD 176

Query: 525  DNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXXXXX 346
            D KLFELLVL GAL+E+TWPAILS+RHIFREVFADFDP++V+K+NEKK+IA         
Sbjct: 177  DKKLFELLVLAGALSELTWPAILSKRHIFREVFADFDPLAVSKLNEKKLIAPGSTASSLL 236

Query: 345  SELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKADLISK 166
            SELKLRAI++NA Q+SKVI+EFGSFDKYIWGFV HKPIV RFRYPRQVPV+T KAD+ISK
Sbjct: 237  SELKLRAIVENAHQISKVINEFGSFDKYIWGFVNHKPIVSRFRYPRQVPVKTPKADVISK 296

Query: 165  DLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDCDAAEVKQEEDDIKE 10
            DLVRRGFRSVGPTV+YSFMQVAGITNDHL SCFRF DC  A   +EE+ IK+
Sbjct: 297  DLVRRGFRSVGPTVIYSFMQVAGITNDHLTSCFRFQDCITAAEGKEENGIKD 348

