BLASTX nr result

ID: Forsythia21_contig00002608 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia21_contig00002608
         (1696 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011073325.1| PREDICTED: uncharacterized protein LOC105158...   505   e-140
ref|XP_011082395.1| PREDICTED: uncharacterized protein LOC105165...   497   e-137
ref|XP_012852805.1| PREDICTED: uncharacterized protein LOC105972...   479   e-132
ref|XP_012856196.1| PREDICTED: uncharacterized protein LOC105975...   457   e-125
emb|CDO98228.1| unnamed protein product [Coffea canephora]            450   e-123
ref|XP_009617988.1| PREDICTED: uncharacterized protein LOC104110...   428   e-117
ref|XP_009802477.1| PREDICTED: uncharacterized protein LOC104248...   424   e-115
ref|XP_004235942.1| PREDICTED: uncharacterized protein LOC101247...   423   e-115
ref|XP_006341344.1| PREDICTED: uncharacterized protein LOC102594...   421   e-115
ref|XP_010265584.1| PREDICTED: uncharacterized protein LOC104603...   407   e-110
ref|XP_010275821.1| PREDICTED: uncharacterized protein LOC104610...   393   e-106
ref|XP_007011936.1| DNA glycosylase superfamily protein isoform ...   387   e-104
ref|XP_009358441.1| PREDICTED: uncharacterized protein LOC103949...   381   e-102
ref|XP_008242987.1| PREDICTED: uncharacterized protein LOC103341...   380   e-102
ref|XP_003527169.1| PREDICTED: uncharacterized protein LOC100801...   380   e-102
ref|XP_012442673.1| PREDICTED: uncharacterized protein LOC105767...   378   e-102
ref|XP_002529378.1| DNA-3-methyladenine glycosylase, putative [R...   378   e-102
gb|KHG02440.1| putative GMP synthase [glutamine-hydrolyzing] [Go...   377   e-101
ref|XP_007204814.1| hypothetical protein PRUPE_ppa026720mg [Prun...   377   e-101
ref|XP_002276173.1| PREDICTED: uncharacterized protein LOC100267...   377   e-101

>ref|XP_011073325.1| PREDICTED: uncharacterized protein LOC105158309 [Sesamum indicum]
          Length = 397

 Score =  505 bits (1301), Expect = e-140
 Identities = 262/397 (65%), Positives = 304/397 (76%), Gaps = 17/397 (4%)
 Frame = -1

Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDFDEFNGKKSP 1196
            MSGPPRVKSM+FT+ E RPVLGPAGNK+RS ELRKP+ KP SEK Q+  D DE  GKKSP
Sbjct: 1    MSGPPRVKSMNFTEPEARPVLGPAGNKSRSAELRKPVLKPKSEKTQRPPDIDESKGKKSP 60

Query: 1195 VTV------TDHLALKVDSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXT-GR 1037
              +      ++ +   V  +     AASIL Q++ NLSLN                + GR
Sbjct: 61   AALESPELASEKIPSPVGFRRSGSSAASILRQRQANLSLNASCSSDASSDSSQSRASTGR 120

Query: 1036 ISRRRVTLTPTMRRKQQCSPKERSAQ------KSFDGESEDINL----AKKRCAWVTSNT 887
            ISRR  T TP ++RK QCS K    +      K+  GESE + +     KKRCAWVTSNT
Sbjct: 121  ISRRSATPTPPLKRKPQCSSKGGKIENKEGYGKNVGGESESLVVDGAAVKKRCAWVTSNT 180

Query: 886  DPYYAALHDEEWGVPVHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFDPIAVS 707
            DP YAA HDEEWGVPVHDDKKLFELLSFSTALAE+TWPIIL+KRHIFREVFL FDP+AVS
Sbjct: 181  DPSYAAFHDEEWGVPVHDDKKLFELLSFSTALAEITWPIILSKRHIFREVFLGFDPVAVS 240

Query: 706  KLNEKKVATPGSPASSLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKPIIGNF 527
            KLNEKK+ATPG+PA SLLSELKLRAI+ENARQICKII++LGSF+KYIW FVN KPI+GNF
Sbjct: 241  KLNEKKIATPGNPACSLLSELKLRAIVENARQICKIINELGSFDKYIWGFVNYKPIVGNF 300

Query: 526  RYPRQVPIKTSKADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAG 347
            RYPRQVPI+TSKADTISKDLVRRGFRGVGPTVVYSFMQV+GITNDHL+SCFR+ +C+ AG
Sbjct: 301  RYPRQVPIRTSKADTISKDLVRRGFRGVGPTVVYSFMQVAGITNDHLISCFRHHDCVIAG 360

Query: 346  DLRDGDEGLKTMNEGKTAEEISELELGRAIDDLGFTT 236
            DLRD +E + + +EGK  E+I ELEL R IDDL  ++
Sbjct: 361  DLRDKNEDVTSKHEGKPPEDIMELELVRDIDDLSLSS 397


>ref|XP_011082395.1| PREDICTED: uncharacterized protein LOC105165174 [Sesamum indicum]
          Length = 395

 Score =  497 bits (1280), Expect = e-137
 Identities = 265/396 (66%), Positives = 304/396 (76%), Gaps = 17/396 (4%)
 Frame = -1

Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDFDEFNGKKSP 1196
            MSGPP V+SM+F + E RPVLGP GNKARSVELRKPI KP SEK ++S + D+  GKK P
Sbjct: 1    MSGPPMVQSMNFAEPEDRPVLGPTGNKARSVELRKPILKPKSEKTRQSPEADK--GKKPP 58

Query: 1195 VTV------TDHLALKVDSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXT-GR 1037
             T+      T+ +   V  +     AASIL Q++PNLSLN                + GR
Sbjct: 59   ATLHSPEITTEKIPSPVGFRRNASSAASILRQRQPNLSLNASCSSDASTDSSHSRASTGR 118

Query: 1036 ISRRRVTLTPTMRRKQQCSPKERSAQK------SFDGESEDI----NLAKKRCAWVTSNT 887
            I RR  T TP +++KQQ SPK    +K      S  GESE I    +L KKRCAWVTSNT
Sbjct: 119  IGRRTGTSTPPLKKKQQFSPKGERIEKMAGNGKSVGGESEGIECDGSLVKKRCAWVTSNT 178

Query: 886  DPYYAALHDEEWGVPVHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFDPIAVS 707
            DP YAA HDEEWG+PVHDDKKLFELLSFSTALAELTWP+IL+KR IFR+VFLDFDPIAVS
Sbjct: 179  DPSYAAFHDEEWGLPVHDDKKLFELLSFSTALAELTWPVILSKRPIFRDVFLDFDPIAVS 238

Query: 706  KLNEKKVATPGSPASSLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKPIIGNF 527
            KLN+KK+AT GSPASSLLSELKLRAIIENARQICKIID++GSF+KYIW FVN KPI+GNF
Sbjct: 239  KLNDKKIATQGSPASSLLSELKLRAIIENARQICKIIDEVGSFDKYIWGFVNYKPIVGNF 298

Query: 526  RYPRQVPIKTSKADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAG 347
            RYPRQVPIKTSKADTISKDLVRRG RGVGPTVVYSFMQV+GITNDHL++CFRYQ+CIAAG
Sbjct: 299  RYPRQVPIKTSKADTISKDLVRRGLRGVGPTVVYSFMQVAGITNDHLINCFRYQDCIAAG 358

Query: 346  DLRDGDEGLKTMNEGKTAEEISELELGRAIDDLGFT 239
            DLRD +EG+ + NE    E++ ELEL R IDDL  +
Sbjct: 359  DLRDKNEGITSNNEENPPEDLRELELVRDIDDLNLS 394


>ref|XP_012852805.1| PREDICTED: uncharacterized protein LOC105972398 [Erythranthe
            guttatus] gi|604305450|gb|EYU24594.1| hypothetical
            protein MIMGU_mgv1a007518mg [Erythranthe guttata]
          Length = 404

 Score =  479 bits (1232), Expect = e-132
 Identities = 258/403 (64%), Positives = 293/403 (72%), Gaps = 24/403 (5%)
 Frame = -1

Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDFDEFNGKK-- 1202
            MSGPP VKSM+F + E RPVLGPAGNKARSVELRKPI K  SEK QK  D DE  G    
Sbjct: 1    MSGPPLVKSMNFAEPEARPVLGPAGNKARSVELRKPILKQKSEKTQKPLDADEAKGNTAP 60

Query: 1201 ------SPVTVTDHLALKVDSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXT- 1043
                  SP   T+ +   V  K     AASIL Q++PNLS+N                + 
Sbjct: 61   SPAAFLSPEMKTEKIPSPVGFKKNASSAASILRQRQPNLSMNASCSSDASTDSSHSRAST 120

Query: 1042 GRISRRRVTLTPTMRRKQQCSPKERSAQ------KSFDGESEDI----NLAKKRCAWVTS 893
            GR+ RR  T TP +RRK QCSPK    +      K+   ES+ +    +L KKRCAWVTS
Sbjct: 121  GRLLRRSATFTPPLRRKHQCSPKGERIEMIEGNGKNVGSESDGVVLDGSLVKKRCAWVTS 180

Query: 892  NTDPYYAALHDEEWGVPVHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFDPIA 713
            NTDP YAA HDEEWG+PVHDDKKLFELLS STALAEL+WP+IL+KR IFR+VFLDFDP A
Sbjct: 181  NTDPLYAAFHDEEWGLPVHDDKKLFELLSLSTALAELSWPVILSKRSIFRDVFLDFDPAA 240

Query: 712  VSKLNEKKVATPGSPASSLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKPIIG 533
            VSKLN+KK+ATPGSPASSLLSE KLRAI+ENARQICKIID+LGSF+KYIW FVN KPI G
Sbjct: 241  VSKLNDKKIATPGSPASSLLSEQKLRAIVENARQICKIIDELGSFDKYIWGFVNYKPIAG 300

Query: 532  NFRYPRQVPIKTSKADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIA 353
            NFRY RQVPIKTSKADTISKDLVRRGFRGVGPTVVYSFMQV+GITNDHL++CFRYQ+CI 
Sbjct: 301  NFRYSRQVPIKTSKADTISKDLVRRGFRGVGPTVVYSFMQVAGITNDHLINCFRYQDCII 360

Query: 352  AGD--LRDGDE---GLKTMNEGKTAEEISELELGRAIDDLGFT 239
            AGD  LRD +     + + NE   AE+ SEL+L   IDDL  +
Sbjct: 361  AGDLILRDNNNNNWSIASKNEVNLAEDFSELDLATEIDDLNLS 403


>ref|XP_012856196.1| PREDICTED: uncharacterized protein LOC105975546 [Erythranthe
            guttatus] gi|604302147|gb|EYU21733.1| hypothetical
            protein MIMGU_mgv1a024334mg [Erythranthe guttata]
          Length = 390

 Score =  457 bits (1176), Expect = e-125
 Identities = 245/390 (62%), Positives = 292/390 (74%), Gaps = 11/390 (2%)
 Frame = -1

Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDFDEFNGKKSP 1196
            MSGPPRVK M   + E RPVLGP GNKARSVELRKP+ K  SEK Q++QD D+  GKKSP
Sbjct: 1    MSGPPRVKLMTSAELEARPVLGPTGNKARSVELRKPMLKSKSEKAQRAQDVDDSKGKKSP 60

Query: 1195 VTVT--DHLALKVDSK---WINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXT-GRI 1034
              +   +    K+ S      NGR+A+    Q+ ++SLN                + GRI
Sbjct: 61   TALQLPETKPEKIPSPVGFMKNGRSAASFFMQR-SMSLNVSCSSDASSDSSHSRASTGRI 119

Query: 1033 SRRRVTLTPTMRRKQQCSPKERSAQKSFDGESEDIN---LAKKRCAWVTSNTDPYYAALH 863
            S R  T TP ++R QQ S K    +K   GE E ++   + KKRCAWVT+NTDP YAA H
Sbjct: 120  SWRSGTPTPPLKRNQQSSFKRERIEKIVGGEGEVVDGAAVVKKRCAWVTANTDPLYAAFH 179

Query: 862  DEEWGVPVHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFDPIAVSKLNEKKVA 683
            DEEWG+ VHDDKKLFELLSFSTALAELTWP+IL+KRH+FREVFLDFDP AVSKLN+KK+A
Sbjct: 180  DEEWGLAVHDDKKLFELLSFSTALAELTWPVILSKRHLFREVFLDFDPNAVSKLNDKKIA 239

Query: 682  TPGSPASSLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKPIIGNFRYPRQVPI 503
            TPGSPASSLLS+L LRAI ENAR+ICKIID+ GSF+KYIW FVN KPI+GNFRYPR VPI
Sbjct: 240  TPGSPASSLLSDLNLRAITENARRICKIIDEFGSFDKYIWGFVNHKPIVGNFRYPRLVPI 299

Query: 502  KTSKADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAGDLRD-GDE 326
            KTSKADTISKDLV+RGFRGVGPTVVYSFMQV+GITNDHL+SCFR+++CI A DL D  +E
Sbjct: 300  KTSKADTISKDLVKRGFRGVGPTVVYSFMQVAGITNDHLISCFRHRDCITACDLSDKSNE 359

Query: 325  GLKT-MNEGKTAEEISELELGRAIDDLGFT 239
            G+ T  NE K+ + I+E+EL R I+D+  +
Sbjct: 360  GITTSKNEVKSLDNITEMELVRDINDVSLS 389


>emb|CDO98228.1| unnamed protein product [Coffea canephora]
          Length = 399

 Score =  450 bits (1158), Expect = e-123
 Identities = 240/402 (59%), Positives = 286/402 (71%), Gaps = 22/402 (5%)
 Frame = -1

Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARS-VELRKPIGKPNSEKVQKSQDFDEFNGKKS 1199
            MSGPPRV+SM+  +SEVRPVLGPAGNK RS +ELRKP+ KP    V K Q+ ++   KKS
Sbjct: 1    MSGPPRVRSMNHAESEVRPVLGPAGNKTRSALELRKPVSKPKISSVNKMQEGED---KKS 57

Query: 1198 PVTVTDHLALKVD-SKWINGRAASILGQQ-----------KPNLSLNXXXXXXXXXXXXX 1055
            P TVT    L     K   G +A+I+ QQ           + NLS+N             
Sbjct: 58   PATVTMEKDLSPSPKKKFGGASAAIMSQQQQRQEVKSFLMRSNLSMNASCSSDASTDSSQ 117

Query: 1054 XXXT-GRISRRRVTLTPTMRRKQQCSPKERSAQK--------SFDGESEDINLAKKRCAW 902
               + G+ISRR +T TP  R++Q C PK    +K        +  G ++D ++A+KRCAW
Sbjct: 118  SRASTGKISRRSLTPTPIRRKQQHCGPKVEKLEKVGSEVDSVAVVGLADD-SVARKRCAW 176

Query: 901  VTSNTDPYYAALHDEEWGVPVHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFD 722
            VT NTDP YAA HDEEWGVP H+DKKLFE LS STALAEL WP ILNKRH FREVF DFD
Sbjct: 177  VTPNTDPSYAAFHDEEWGVPAHEDKKLFEFLSLSTALAELPWPTILNKRHTFREVFQDFD 236

Query: 721  PIAVSKLNEKKVATPGSPASSLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKP 542
            P+AVSKLNEKK+ATPGSPASSLLSELKLRAI+ENARQ CKII++ GSFEKYIW FVN KP
Sbjct: 237  PVAVSKLNEKKIATPGSPASSLLSELKLRAIVENARQACKIIEEFGSFEKYIWGFVNYKP 296

Query: 541  IIGNFRYPRQVPIKTSKADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQE 362
            I+G+FRYPRQVPIKTSKAD ISKDLVRRGFRG+GPTVVYSFMQV+GITNDHL+SCFR+++
Sbjct: 297  IVGHFRYPRQVPIKTSKADAISKDLVRRGFRGIGPTVVYSFMQVAGITNDHLISCFRFRD 356

Query: 361  CIAAGDLRDGDEGLKTMNEGKTAEEISELELGRAIDDLGFTT 236
            C+  GD R+ D+ L    EGK AE+ +E      +D L  +T
Sbjct: 357  CVDVGDGRNKDDDLIATIEGKQAEDSAESGFEERLDALSLST 398


>ref|XP_009617988.1| PREDICTED: uncharacterized protein LOC104110244 [Nicotiana
            tomentosiformis]
          Length = 398

 Score =  428 bits (1101), Expect = e-117
 Identities = 236/410 (57%), Positives = 277/410 (67%), Gaps = 30/410 (7%)
 Frame = -1

Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKP----------NSEKVQKSQD 1226
            MSG PRVKSM+   SEVRPVLGPAGNKARSVELRKP  KP             K +K Q 
Sbjct: 1    MSGGPRVKSMNHADSEVRPVLGPAGNKARSVELRKPTEKPIKTNNKPAETEESKGKKFQG 60

Query: 1225 FDEFNGKKSPVTVTDHLALKVDSKWINGRAASILGQQ--------KPNLSLNXXXXXXXX 1070
             D     KSPV  +             G   SIL QQ        +PNLSLN        
Sbjct: 61   ADPLPQSKSPVAASKKC----------GSVPSILRQQQDHRTLLMRPNLSLNASCSSDAS 110

Query: 1069 XXXXXXXXT-GRISRRRVTLTPTMRRKQQCSPKERSAQKSFDGESE-----------DIN 926
                    + G++SR   +LTP   R++QCSPK   ++KS     E           D +
Sbjct: 111  TDSSHSRASTGKLSRG--SLTPKSGRRKQCSPKVDKSEKSGKSVGEVESLSPSPVSGDAS 168

Query: 925  LAKKRCAWVTSNTDPYYAALHDEEWGVPVHDDKKLFELLSFSTALAELTWPIILNKRHIF 746
            + KKRCAWVT  TDP YAA HDEEWGVPVHDDKKLFELLS  TALAEL+WP IL+KRH F
Sbjct: 169  VIKKRCAWVTPTTDPSYAAFHDEEWGVPVHDDKKLFELLSLCTALAELSWPAILSKRHTF 228

Query: 745  REVFLDFDPIAVSKLNEKKVATPGSPASSLLSELKLRAIIENARQICKIIDDLGSFEKYI 566
            REVF +FDP+AVSKLNEKK+A PGSPAS+LLSE+KLRAI+ENARQ CKIID+LGSF+KYI
Sbjct: 229  REVFQNFDPVAVSKLNEKKIAPPGSPASTLLSEVKLRAIVENARQTCKIIDELGSFDKYI 288

Query: 565  WSFVNSKPIIGNFRYPRQVPIKTSKADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHL 386
            W FVN+KPI+  FRY RQVP+KTSKA+ ISKDLV+RGFRGVGPTVVYSFMQV+GITNDHL
Sbjct: 289  WGFVNNKPIVSQFRYARQVPMKTSKAEGISKDLVKRGFRGVGPTVVYSFMQVAGITNDHL 348

Query: 385  VSCFRYQECIAAGDLRDGDEGLKTMNEGKTAEEISELELGRAIDDLGFTT 236
            +SCFR+ +C+AA D  + D+GL    E K  ++ +E+ L RAIDD   +T
Sbjct: 349  ISCFRFHDCVAAIDGMENDDGLAAKTEVKQLKDETEMGLIRAIDDFNLST 398


>ref|XP_009802477.1| PREDICTED: uncharacterized protein LOC104248004 [Nicotiana
            sylvestris]
          Length = 399

 Score =  424 bits (1090), Expect = e-115
 Identities = 239/402 (59%), Positives = 280/402 (69%), Gaps = 22/402 (5%)
 Frame = -1

Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDFDEFNGKKSP 1196
            MSG PRVKSM+   SEVRPVLGPAGNKARSVELRKPI KP      K  + +E  GKK P
Sbjct: 1    MSGGPRVKSMNHADSEVRPVLGPAGNKARSVELRKPIEKPVKTN-NKPAETEESKGKKFP 59

Query: 1195 -VTVTDHLALKVDSKWINGRAASILGQQ--------KPNLSLNXXXXXXXXXXXXXXXXT 1043
                       V +    G   SIL QQ        +PNLSLN                +
Sbjct: 60   GADPLPQSKSPVAASKKCGSVPSILRQQQDHRTLLMRPNLSLNASCSSDASTDSSHSRAS 119

Query: 1042 -GRISRRRVTLTPTMRRKQQCSPKERSAQKSFD--GESE---------DINLAKKRCAWV 899
             G++SR   +LTP   R++QCSPK   ++KS    GESE         D ++ KKRCAWV
Sbjct: 120  TGKLSRG--SLTPKSGRRKQCSPKVDKSEKSGKSVGESESLSPSPVSGDASVIKKRCAWV 177

Query: 898  TSNTDPYYAALHDEEWGVPVHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFDP 719
            T  TDP YAA HDEEWGVPVHDDKKLFELLS  TALAEL+WP IL+KRH FREVF +FDP
Sbjct: 178  TPTTDPSYAAFHDEEWGVPVHDDKKLFELLSLCTALAELSWPAILSKRHTFREVFQNFDP 237

Query: 718  IAVSKLNEKKVATPGSPASSLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKPI 539
            +AVSKLNEKK+A PGSPAS+LLSE+KLRAIIENARQ CKIID+LGSF+KY+W FVN+KPI
Sbjct: 238  VAVSKLNEKKIAPPGSPASTLLSEVKLRAIIENARQTCKIIDELGSFDKYMWGFVNNKPI 297

Query: 538  IGNFRYPRQVPIKTSKADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQEC 359
            +  FRY RQVP+KTSKA+ ISKDLV+RGFRGVGPTVVYSFMQV+GITNDHL+SCFR+ +C
Sbjct: 298  VSQFRYARQVPMKTSKAEGISKDLVKRGFRGVGPTVVYSFMQVAGITNDHLISCFRFHDC 357

Query: 358  IAAGDLRDGDEGLKTMNEGK-TAEEISELELGRAIDDLGFTT 236
            +AA D  D D+GL    E K   ++ +E+ L RAI D   +T
Sbjct: 358  VAAIDGMDKDDGLVAKTEVKQQLKDETEMGLIRAIADFNLST 399


>ref|XP_004235942.1| PREDICTED: uncharacterized protein LOC101247118 [Solanum
            lycopersicum]
          Length = 395

 Score =  423 bits (1087), Expect = e-115
 Identities = 234/402 (58%), Positives = 281/402 (69%), Gaps = 22/402 (5%)
 Frame = -1

Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDFDEFNGKKSP 1196
            MSG PRVK M+   SEVR VLGPAGNKARSVELRKP+ KP    V+K+ + +E  GKK  
Sbjct: 1    MSGGPRVKLMNNADSEVRSVLGPAGNKARSVELRKPVEKP----VKKAAESEESKGKKFE 56

Query: 1195 VTVTDHLALKVDSKWINGRAASILGQQ--------KPNLSLNXXXXXXXXXXXXXXXXTG 1040
             T +     +  ++   G   SIL QQ        +PNLSLN                + 
Sbjct: 57   GTDS---VPQSRARKCGGAVPSILRQQQDHRSLMMRPNLSLNASCSSDASTDSSHSRAST 113

Query: 1039 RISRRRVTLTPTMRRKQQCS-PK----ERSAQKSFDGES-------EDINLAKKRCAWVT 896
                 R ++TPT  R++QCS PK    E+  +   +GES       +D ++ KKRCAWVT
Sbjct: 114  TGKMSRGSVTPTAGRRKQCSSPKVVKSEKIGKTVGEGESLASSPTPDDASVMKKRCAWVT 173

Query: 895  SNTDPYYAALHDEEWGVPVHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFDPI 716
             NTDP YAA HDEEWGV VHDDKKLFELLS  TALAEL+WP IL+KRH+FREVF +FDP+
Sbjct: 174  PNTDPSYAAFHDEEWGVSVHDDKKLFELLSLCTALAELSWPAILSKRHMFREVFQNFDPV 233

Query: 715  AVSKLNEKKVATPGSPASSLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKPII 536
            AVSKLNEKK+A PGSPAS+LLSE+KLRA+IENARQ CKIID+LGSF+KYIW FVN+KPI+
Sbjct: 234  AVSKLNEKKIAPPGSPASTLLSEVKLRAVIENARQTCKIIDELGSFDKYIWGFVNNKPIV 293

Query: 535  GNFRYPRQVPIKTSKADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECI 356
              FRY RQVP+KTSKA+ ISKDLV+RGFRGVGPTVVYSFMQV+GITNDHL+SCFR+ +C+
Sbjct: 294  SQFRYARQVPMKTSKAEGISKDLVKRGFRGVGPTVVYSFMQVAGITNDHLISCFRFHDCV 353

Query: 355  AAGDLRDGDEGLKTMNEGKTAEEISELELG--RAIDDLGFTT 236
            AA D  D D+GL    E K  +   E E+G  RAIDD   +T
Sbjct: 354  AATDGTDKDDGLAAKTEVKQLQLKDETEMGLIRAIDDFNLST 395


>ref|XP_006341344.1| PREDICTED: uncharacterized protein LOC102594169 [Solanum tuberosum]
          Length = 399

 Score =  421 bits (1083), Expect = e-115
 Identities = 234/403 (58%), Positives = 281/403 (69%), Gaps = 23/403 (5%)
 Frame = -1

Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDFDEFNGKKSP 1196
            MSG PRVK M+   SEVR VLGPAGNKARSVELRKP+ KP    ++K+ + +E  GKK  
Sbjct: 1    MSGGPRVKLMNNADSEVRSVLGPAGNKARSVELRKPVEKP----IKKAAESEESKGKKFE 56

Query: 1195 VT--VTDHLALKVDSKWINGRAASILGQQ--------KPNLSLNXXXXXXXXXXXXXXXX 1046
             T  V    A    SK   G   SIL QQ        +PNLSLN                
Sbjct: 57   GTDSVPQSRAPVAASKKCGGAVPSILRQQQDHRSLMMRPNLSLNASCSSDASTDSSHSRA 116

Query: 1045 TGRISRRRVTLTPTMRRKQQCS-PK----ERSAQKSFDGES-------EDINLAKKRCAW 902
            +      R ++TPT  R++QCS PK    E+  +   +G+S        D ++ KKRCAW
Sbjct: 117  STTGKLSRGSVTPTAGRRKQCSSPKVVKSEKIGKTVGEGQSLASSPTPGDASVMKKRCAW 176

Query: 901  VTSNTDPYYAALHDEEWGVPVHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFD 722
            VT NTDP YAA HDEEWGV +HDDKKLFELLS  TALAEL+WP IL+KRH+FREVF +FD
Sbjct: 177  VTPNTDPSYAAFHDEEWGVSIHDDKKLFELLSLCTALAELSWPAILSKRHMFREVFQNFD 236

Query: 721  PIAVSKLNEKKVATPGSPASSLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKP 542
            P+AVSKLNEKK+A PGSPAS+LLSE+KLRA+IENARQ CKIID+LGSF+KYIW FVN+KP
Sbjct: 237  PVAVSKLNEKKIAPPGSPASTLLSEVKLRAVIENARQTCKIIDELGSFDKYIWGFVNNKP 296

Query: 541  IIGNFRYPRQVPIKTSKADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQE 362
            I+  FRY RQVP+KTSKA+ ISKDLV+RGFRGVGPTVVYSFMQV+GITNDHL+SCFR+ +
Sbjct: 297  IVSQFRYARQVPMKTSKAEGISKDLVKRGFRGVGPTVVYSFMQVAGITNDHLISCFRFHD 356

Query: 361  CIAAGDLRDGDEGLKTMNEGK-TAEEISELELGRAIDDLGFTT 236
            C+AA D  D D+GL    E K   ++ +E+ L RAIDD   +T
Sbjct: 357  CVAATDGTDKDDGLAAKTEVKQQLKDETEMGLIRAIDDFNLST 399


>ref|XP_010265584.1| PREDICTED: uncharacterized protein LOC104603287 [Nelumbo nucifera]
          Length = 380

 Score =  407 bits (1045), Expect = e-110
 Identities = 228/381 (59%), Positives = 262/381 (68%), Gaps = 5/381 (1%)
 Frame = -1

Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDFDEFNG-KKS 1199
            MSG PRV+SM+   S+ RPVLGP GNK  S+  RKP+ KP   KV+KS +    NG KK+
Sbjct: 1    MSGAPRVRSMNVADSDARPVLGPTGNKTGSLVTRKPVSKP-LRKVEKSPEVA--NGEKKT 57

Query: 1198 PVTVTDHLALKVDSKWINGRAASILGQQK---PNLSLNXXXXXXXXXXXXXXXXT-GRIS 1031
            P +       K+ S  +     SIL + +    NLSLN                + GRI 
Sbjct: 58   PSSPVAPSPPKLQSASV----PSILRRHEFLHSNLSLNASCSSDASSDSVYSRASTGRII 113

Query: 1030 RRRVTLTPTMRRKQQCSPKERSAQKSFDGESEDINLAKKRCAWVTSNTDPYYAALHDEEW 851
            R   T + T RRK+  S  E+ A  S    S +    K+RCAWVT NTDP YAA HDEEW
Sbjct: 114  R---TSSTTSRRKRSISRPEKVAPDSVSDSSSESIQTKRRCAWVTPNTDPCYAAFHDEEW 170

Query: 850  GVPVHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFDPIAVSKLNEKKVATPGS 671
            GVPVHDDKKLFE L  S ALAEL WP+IL+KRHIFREVF DFDP+AVSKLNEKK+ TPG 
Sbjct: 171  GVPVHDDKKLFEFLVLSGALAELPWPVILSKRHIFREVFADFDPVAVSKLNEKKITTPGG 230

Query: 670  PASSLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKPIIGNFRYPRQVPIKTSK 491
             A SLLSELKLRAIIENARQICK+ID+ GSF  YIWSFVN KPII  FRYPRQVP+KT K
Sbjct: 231  TAISLLSELKLRAIIENARQICKVIDEFGSFNNYIWSFVNHKPIISKFRYPRQVPVKTPK 290

Query: 490  ADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAGDLRDGDEGLKTM 311
            AD ISKDLVRRGFR VGPTV+YSFMQV+GITNDHL++CFRYQECI A    + DEG K  
Sbjct: 291  ADVISKDLVRRGFRSVGPTVIYSFMQVAGITNDHLINCFRYQECIDATAAIE-DEGSKAK 349

Query: 310  NEGKTAEEISELELGRAIDDL 248
             E K  E+I  LELG+AID+L
Sbjct: 350  AEEKKTEDIINLELGKAIDEL 370


>ref|XP_010275821.1| PREDICTED: uncharacterized protein LOC104610746 [Nelumbo nucifera]
          Length = 387

 Score =  393 bits (1010), Expect = e-106
 Identities = 222/391 (56%), Positives = 256/391 (65%), Gaps = 15/391 (3%)
 Frame = -1

Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDF--DEFNGKK 1202
            MSG PRV+S++   SE RPVLGPAGNK RS+  RKP  KP   KV+K+ +   +E     
Sbjct: 1    MSGAPRVRSINVADSEARPVLGPAGNKTRSLVTRKPASKP-LRKVEKTPEAVDEEKKAPS 59

Query: 1201 SPVTVTDHLALKVDSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXT-GRISRR 1025
            SPV  +      V    I  R   +      NLSLN                + GR+ R 
Sbjct: 60   SPVAASPPKLQPVSVPSILRRHEFL----HSNLSLNASCSSDASSDSVYSRASTGRLIRT 115

Query: 1024 RVTLTPTMRRKQQCSPKERSAQKSFDGESEDINLAKKRCAWVTSNTDPYYAALHDEEWGV 845
            R T +   RRK   S  E+    S    S D    KKRCAWVT NTDP YAA HDEEWGV
Sbjct: 116  RSTPS---RRKYSISRPEKVVPDSASDSSPDSIETKKRCAWVTPNTDPCYAAFHDEEWGV 172

Query: 844  PVHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFDPIAVSKLNEKKVATPGSPA 665
            PVHDDKKLFELL  S ALAELTWP IL+KRHIFREVF DFDP+AVSKLNEKK+  PGS A
Sbjct: 173  PVHDDKKLFELLVLSGALAELTWPTILSKRHIFREVFSDFDPVAVSKLNEKKITAPGSTA 232

Query: 664  SSLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKPIIGNFRYPRQVPIKTSKAD 485
            SSLLSELKLRAIIENARQICK+ID+ GSF+ YIWSFVN KPII  FRYPRQVP+K  KAD
Sbjct: 233  SSLLSELKLRAIIENARQICKVIDEFGSFDNYIWSFVNHKPIISKFRYPRQVPVKIPKAD 292

Query: 484  TISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAGDLRDGDE------- 326
             ISKDLVRRGFR VGPTVVYSFMQV+GITNDHL++CFR+Q C+    + +GD+       
Sbjct: 293  VISKDLVRRGFRSVGPTVVYSFMQVAGITNDHLINCFRFQVCMDTPTVSEGDDKLRIGKA 352

Query: 325  -----GLKTMNEGKTAEEISELELGRAIDDL 248
                 G K   E K  E++ + ELG+A+D L
Sbjct: 353  EETPTGSKGTAEEKKTEDMIKSELGKAMDKL 383


>ref|XP_007011936.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao]
            gi|590572766|ref|XP_007011937.1| DNA glycosylase
            superfamily protein isoform 1 [Theobroma cacao]
            gi|590572769|ref|XP_007011938.1| DNA glycosylase
            superfamily protein isoform 1 [Theobroma cacao]
            gi|590572773|ref|XP_007011939.1| DNA glycosylase
            superfamily protein isoform 1 [Theobroma cacao]
            gi|508782299|gb|EOY29555.1| DNA glycosylase superfamily
            protein isoform 1 [Theobroma cacao]
            gi|508782300|gb|EOY29556.1| DNA glycosylase superfamily
            protein isoform 1 [Theobroma cacao]
            gi|508782301|gb|EOY29557.1| DNA glycosylase superfamily
            protein isoform 1 [Theobroma cacao]
            gi|508782302|gb|EOY29558.1| DNA glycosylase superfamily
            protein isoform 1 [Theobroma cacao]
          Length = 379

 Score =  387 bits (993), Expect = e-104
 Identities = 215/384 (55%), Positives = 263/384 (68%), Gaps = 4/384 (1%)
 Frame = -1

Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQ-DFDEFNGKKS 1199
            MSG PR++SM+   SE RPVLGPAGNKA S+  RKP  KP   KV+KS  +      KK+
Sbjct: 1    MSGAPRMRSMNVADSEARPVLGPAGNKAGSLSARKPASKP-LRKVEKSPVEVTVAEEKKA 59

Query: 1198 -PVTVTDHLALKVDSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXTGRISRRR 1022
             P +  + L+ K  S  +     S+L + +  L  N                  R S  R
Sbjct: 60   LPSSTVNSLSPKTHSVSV----PSVLRRHEQLLHSNLSLNASCSSDASTDSFHSRASTGR 115

Query: 1021 VTLTPTM-RRKQQCSPKERSAQKSFDGESE-DINLAKKRCAWVTSNTDPYYAALHDEEWG 848
            +  + ++  R++  + K RS       +S  D +  KKRCAWVT NTDP Y A HDEEWG
Sbjct: 116  LIRSNSVGNRRKPYASKPRSVVSDGGLDSPPDGSHQKKRCAWVTPNTDPSYVAFHDEEWG 175

Query: 847  VPVHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFDPIAVSKLNEKKVATPGSP 668
            VPVHDD+KLFELL  S AL+ELTWP IL+KRHI REVF+DFD +AVSKLNEKK+ TPGS 
Sbjct: 176  VPVHDDRKLFELLVLSGALSELTWPAILSKRHIVREVFVDFDAVAVSKLNEKKLVTPGSI 235

Query: 667  ASSLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKPIIGNFRYPRQVPIKTSKA 488
            ASSLLSELKLRAIIENARQI K+ID+ GSF++YIWSFVN KPI+  FRYPRQVP+KT KA
Sbjct: 236  ASSLLSELKLRAIIENARQISKVIDEFGSFDEYIWSFVNHKPIVSRFRYPRQVPVKTPKA 295

Query: 487  DTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAGDLRDGDEGLKTMN 308
            D ISKDLVRRGFR VGPTV+YSFMQV+GITNDHL SCFR+QECI A + ++ + G+K M 
Sbjct: 296  DVISKDLVRRGFRSVGPTVIYSFMQVAGITNDHLTSCFRFQECITAAEGKE-ENGIKDMP 354

Query: 307  EGKTAEEISELELGRAIDDLGFTT 236
            E K  E + E +L  AID+L F++
Sbjct: 355  EEKKTENVMESKLSIAIDELSFSS 378


>ref|XP_009358441.1| PREDICTED: uncharacterized protein LOC103949071 [Pyrus x
            bretschneideri]
          Length = 378

 Score =  381 bits (978), Expect = e-102
 Identities = 210/382 (54%), Positives = 251/382 (65%), Gaps = 2/382 (0%)
 Frame = -1

Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDFDEFNGKKSP 1196
            MSG PRV+S++   SE RPVLGPAGNKA +   RKP  KP    ++K++ F E       
Sbjct: 1    MSGAPRVRSINVADSESRPVLGPAGNKAGTFSARKPASKP----LRKAEKFSEEVSSAEE 56

Query: 1195 VTVTDHLALKVDSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXTGRISRRRVT 1016
                    L    +  + +  S+L + +  L  N                  R S  R+ 
Sbjct: 57   KKTHQSPMLTTSPQPHSPKVHSVLRRHEQLLHSNFSLNASCSSDASTDSFQSRASTGRLI 116

Query: 1015 LTPTM--RRKQQCSPKERSAQKSFDGESEDINLAKKRCAWVTSNTDPYYAALHDEEWGVP 842
             + ++  RRKQ  S               D + +KKRCAWVT N DP YAA HDEEWG+P
Sbjct: 117  RSNSVGSRRKQYVSKPRSVVSDGGLDSPPDGSQSKKRCAWVTPNADPCYAAFHDEEWGLP 176

Query: 841  VHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFDPIAVSKLNEKKVATPGSPAS 662
            VHDDKKLFELL  S ALAEL+WP IL+K+HIFREVF DFDPIAVSKLNEKK+ +PGS AS
Sbjct: 177  VHDDKKLFELLVLSGALAELSWPAILSKKHIFREVFADFDPIAVSKLNEKKLISPGSAAS 236

Query: 661  SLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKPIIGNFRYPRQVPIKTSKADT 482
            SLLSELKLRAIIENARQ  K+I++ GSF+KYIWSFVN+KPI   FRYPRQVP+KT KAD 
Sbjct: 237  SLLSELKLRAIIENARQTTKVIEEFGSFDKYIWSFVNNKPIESRFRYPRQVPVKTPKADV 296

Query: 481  ISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAGDLRDGDEGLKTMNEG 302
            ISKDLVRRGFR VGPTV+YSFMQV+GITNDHLVSCFR+QEC+ A +  DG+ G+K    G
Sbjct: 297  ISKDLVRRGFRSVGPTVIYSFMQVAGITNDHLVSCFRFQECVNAAE-GDGENGIKD-EAG 354

Query: 301  KTAEEISELELGRAIDDLGFTT 236
            K  E   E EL  AID L F++
Sbjct: 355  KKTENGIESELSVAIDKLSFSS 376


>ref|XP_008242987.1| PREDICTED: uncharacterized protein LOC103341267 isoform X1 [Prunus
            mume]
          Length = 378

 Score =  380 bits (977), Expect = e-102
 Identities = 208/382 (54%), Positives = 253/382 (66%), Gaps = 2/382 (0%)
 Frame = -1

Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDFDEFNGKKSP 1196
            MSG PRV+S++   SE RPVLGPAGNKA +   RKP+ KP    ++K++   E       
Sbjct: 1    MSGAPRVRSINVADSESRPVLGPAGNKAGTFSARKPVSKP----LRKAEKLAEKVASAEE 56

Query: 1195 VTVTDHLALKVDSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXTGRISRRRVT 1016
                    L    +  +    S+L + +  L  N                  R S  R+T
Sbjct: 57   KKTRQSSMLTTSPQLHSPSVPSVLRRHEQLLHSNFSLNASCSSDASTDSFHSRASTGRLT 116

Query: 1015 LTPTM--RRKQQCSPKERSAQKSFDGESEDINLAKKRCAWVTSNTDPYYAALHDEEWGVP 842
             + +   RRKQ  S               D + +KKRCAWVT NTDP YAA HDEEWG+P
Sbjct: 117  RSNSAGSRRKQYVSKPRSVVSDGGLDSPPDGSQSKKRCAWVTPNTDPCYAAFHDEEWGLP 176

Query: 841  VHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFDPIAVSKLNEKKVATPGSPAS 662
            VHDDKKLFELL  S ALAEL+WP IL+K+HIFREVF DFDP+AVSKLNEKK+  PGS AS
Sbjct: 177  VHDDKKLFELLVLSGALAELSWPAILSKKHIFREVFADFDPVAVSKLNEKKLIAPGSTAS 236

Query: 661  SLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKPIIGNFRYPRQVPIKTSKADT 482
            SLLSELKLRAIIENARQ+ K+I++ GSF+KYIWSFVN+KPI+  FRYPRQVP KT KAD 
Sbjct: 237  SLLSELKLRAIIENARQMTKVIEEFGSFDKYIWSFVNNKPIVSRFRYPRQVPAKTPKADV 296

Query: 481  ISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAGDLRDGDEGLKTMNEG 302
            ISKDLVRRGFR VGPTV+YSFMQV+GITNDHLVSCFR+QEC+ A + ++ D G+K   E 
Sbjct: 297  ISKDLVRRGFRSVGPTVIYSFMQVAGITNDHLVSCFRFQECLNAAEGKE-DYGIKDEAEK 355

Query: 301  KTAEEISELELGRAIDDLGFTT 236
            KT   I E +L  A+D+L F++
Sbjct: 356  KTENGI-ESDLSVAMDELSFSS 376


>ref|XP_003527169.1| PREDICTED: uncharacterized protein LOC100801026 isoform X1 [Glycine
            max] gi|571461733|ref|XP_006582090.1| PREDICTED:
            uncharacterized protein LOC100801026 isoform X2 [Glycine
            max] gi|571461735|ref|XP_006582091.1| PREDICTED:
            uncharacterized protein LOC100801026 isoform X3 [Glycine
            max] gi|734430051|gb|KHN45352.1| Putative GMP synthase
            [glutamine-hydrolyzing] [Glycine soja]
          Length = 383

 Score =  380 bits (976), Expect = e-102
 Identities = 204/382 (53%), Positives = 252/382 (65%), Gaps = 2/382 (0%)
 Frame = -1

Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDFDEFNGKKSP 1196
            MSG PR++SM+   SE RPVLGPAGNK  S+  RK   KP  +KV K  D      +K P
Sbjct: 1    MSGAPRLRSMNVADSEARPVLGPAGNKTGSLSSRKTASKPLRKKVDKLLDEIASVKEKKP 60

Query: 1195 VTVTDHLALKVDSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXTGRISRRRVT 1016
              V          +  +   + +L + +  L  N                  R S  R+T
Sbjct: 61   HQVLLSSVATSSPQSHSASVSLLLPRHEQLLHSNLSLNASCSSDASTDSFHSRASTGRLT 120

Query: 1015 LTPTM--RRKQQCSPKERSAQKSFDGESEDINLAKKRCAWVTSNTDPYYAALHDEEWGVP 842
             + ++  RRK   S     A         D + + KRCAWVT NT+P YA  HDEEWGVP
Sbjct: 121  RSYSLGSRRKPYVSKPRSVASDGVLESPTDGSQSNKRCAWVTPNTEPCYATFHDEEWGVP 180

Query: 841  VHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFDPIAVSKLNEKKVATPGSPAS 662
            VHDDKKLFELL  S+ LAE TWP IL+KRHIFREVF+DF+P+AVSKLNEKK+ TPG+ AS
Sbjct: 181  VHDDKKLFELLVLSSVLAEHTWPAILSKRHIFREVFVDFEPVAVSKLNEKKIMTPGTIAS 240

Query: 661  SLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKPIIGNFRYPRQVPIKTSKADT 482
            SLLSE+KLRAIIENARQI K+ID+ GSF+KYIWSFVN KPI+  FRYPRQVP+KT KAD 
Sbjct: 241  SLLSEVKLRAIIENARQISKVIDEFGSFDKYIWSFVNHKPIVSRFRYPRQVPVKTPKADV 300

Query: 481  ISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAGDLRDGDEGLKTMNEG 302
            ISKDLVRRGFRGVGPTVVYSFMQV+G+T DHL+SCFR++ECIAA + ++ +  +    + 
Sbjct: 301  ISKDLVRRGFRGVGPTVVYSFMQVAGLTIDHLISCFRFEECIAAAEGKEENGIMDNHADQ 360

Query: 301  KTAEEISELELGRAIDDLGFTT 236
            K +E I E +L  A++DL F +
Sbjct: 361  KESENIMESDLSIAMEDLSFAS 382


>ref|XP_012442673.1| PREDICTED: uncharacterized protein LOC105767651 isoform X1 [Gossypium
            raimondii] gi|763787989|gb|KJB54985.1| hypothetical
            protein B456_009G057100 [Gossypium raimondii]
            gi|763787990|gb|KJB54986.1| hypothetical protein
            B456_009G057100 [Gossypium raimondii]
          Length = 379

 Score =  378 bits (971), Expect = e-102
 Identities = 209/384 (54%), Positives = 255/384 (66%), Gaps = 4/384 (1%)
 Frame = -1

Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDFDEFNGKKS- 1199
            MSG PR++SM+ T SE RPVLGPAGNKA S+  RKP  KP+ +  + S +      KK+ 
Sbjct: 1    MSGAPRLRSMNVTDSEARPVLGPAGNKAGSLSARKPASKPSKKVEKSSVEVTVVEEKKAL 60

Query: 1198 PVTVTDHLALKVDSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXTGRISRRRV 1019
            P +  + L+ K  S  +     S+L + +  L  +                  R S  R+
Sbjct: 61   PSSTVNSLSPKTHSLSV----PSVLRRHERLLHSSLSLNASCSSDASTDSFQSRASTGRL 116

Query: 1018 TLTPTM--RRKQQCS-PKERSAQKSFDGESEDINLAKKRCAWVTSNTDPYYAALHDEEWG 848
            +   ++  RRK   S PK   +  S D  S   +  KKRC WVT NTDP YAA HDEEWG
Sbjct: 117  SRCGSLGSRRKPYASKPKSLVSDDSLDLSSNSSH-HKKRCTWVTPNTDPSYAAFHDEEWG 175

Query: 847  VPVHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFDPIAVSKLNEKKVATPGSP 668
            VPVHDDKKLFELL  S +L+ELTW  IL+KRHIFREVF+DFDP+AVSKLNEKK+   GS 
Sbjct: 176  VPVHDDKKLFELLVLSGSLSELTWSAILSKRHIFREVFMDFDPVAVSKLNEKKLIAHGSV 235

Query: 667  ASSLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKPIIGNFRYPRQVPIKTSKA 488
            ASSLLSEL LRAI+ENARQI K+ID+  SF++YIWSFVN KPI+  FRYPRQVP+KT KA
Sbjct: 236  ASSLLSELMLRAIVENARQISKVIDEFRSFDQYIWSFVNHKPIVSRFRYPRQVPVKTPKA 295

Query: 487  DTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAGDLRDGDEGLKTMN 308
            D ISKDLVRRGFR VGPTV+YSFMQV+GITNDHL SCFR+QECI A + ++ +   K   
Sbjct: 296  DVISKDLVRRGFRSVGPTVIYSFMQVAGITNDHLTSCFRFQECITAAEAKE-ENVTKDTT 354

Query: 307  EGKTAEEISELELGRAIDDLGFTT 236
            E K    +   EL  AID+L F+T
Sbjct: 355  EKKETVNVINTELSVAIDELSFST 378


>ref|XP_002529378.1| DNA-3-methyladenine glycosylase, putative [Ricinus communis]
            gi|223531126|gb|EEF32974.1| DNA-3-methyladenine
            glycosylase, putative [Ricinus communis]
          Length = 380

 Score =  378 bits (971), Expect = e-102
 Identities = 210/385 (54%), Positives = 259/385 (67%), Gaps = 5/385 (1%)
 Frame = -1

Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGN-KARSVELRKPIGKPNSEKVQKSQDFDEFNGKKS 1199
            MSG PRV+SM+   SE RPVLGP GN KA S+  +KP  K    KV+ S +  +   +K 
Sbjct: 1    MSGAPRVRSMNVADSETRPVLGPTGNNKAGSLSAKKPASK-QLRKVETSPEAVKLGQEKK 59

Query: 1198 PVTVTDHLALKVDSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXTGRISRRRV 1019
             VTV    AL   S  ++    S+L + +  L  N                  R S  R+
Sbjct: 60   LVTVPTASALSPKSHSVS--VPSVLRRHEQLLHSNLSLNASCSSDASTDSFHSRASTGRL 117

Query: 1018 TLTPTM-RRKQQCSPKERSAQKSFDGES---EDINLAKKRCAWVTSNTDPYYAALHDEEW 851
            T + ++  R++Q + K RS       ES    D + AKK CAWVT N DP Y A HDEEW
Sbjct: 118  TRSNSLGTRRKQYALKPRSVVSDGGLESPPPSDGSQAKKSCAWVTPNADPCYTAFHDEEW 177

Query: 850  GVPVHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFDPIAVSKLNEKKVATPGS 671
            G+PVHDDKKLFELL  S ALAELTWP IL+KRHIFREVF +FDP+ VSK NEKK+  PGS
Sbjct: 178  GIPVHDDKKLFELLVLSGALAELTWPAILSKRHIFREVFANFDPVVVSKFNEKKIIAPGS 237

Query: 670  PASSLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKPIIGNFRYPRQVPIKTSK 491
             ASSLLSE+KLRAIIENARQI K+ D+LGSF+KYIWSFVN KPI+  FRYPRQVP+KT K
Sbjct: 238  TASSLLSEIKLRAIIENARQISKVTDELGSFDKYIWSFVNYKPIVSRFRYPRQVPVKTPK 297

Query: 490  ADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAGDLRDGDEGLKTM 311
            AD ISKDLVRRGFR VGPTVVYSFMQV+G+TNDHL+SCFR+QECI A + ++ + G+K  
Sbjct: 298  ADVISKDLVRRGFRSVGPTVVYSFMQVAGLTNDHLISCFRFQECINAAEGKE-ENGVKV- 355

Query: 310  NEGKTAEEISELELGRAIDDLGFTT 236
             E K  + + E ++  A+D+L F++
Sbjct: 356  -EDKITDGVVESQISIAMDELSFSS 379


>gb|KHG02440.1| putative GMP synthase [glutamine-hydrolyzing] [Gossypium arboreum]
          Length = 379

 Score =  377 bits (969), Expect = e-101
 Identities = 211/385 (54%), Positives = 258/385 (67%), Gaps = 5/385 (1%)
 Frame = -1

Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDFDEFNGKKS- 1199
            MSG PR++SM+ T SE RPVLGPAGNKA S+  RKP  K +S+KV+KS       G+K  
Sbjct: 1    MSGAPRLRSMNVTDSEARPVLGPAGNKAGSLSARKPASK-SSKKVEKSSVEVTVVGEKKA 59

Query: 1198 -PVTVTDHLALKVDSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXTGRISRRR 1022
             P +  + L+ K  S  +     S+L + +  L  +                  R S  R
Sbjct: 60   LPSSTVNSLSPKTHSLSV----PSVLRRHERLLHSSLSLNASCSSDASTDSFQSRASTGR 115

Query: 1021 VTLTPTM--RRKQQCS-PKERSAQKSFDGESEDINLAKKRCAWVTSNTDPYYAALHDEEW 851
            +    ++  RRK   S PK   +  S D  S   +  KKRCAWVT +TDP YAA HDEEW
Sbjct: 116  LNRCDSLGSRRKPYASKPKSVVSDDSLDLSSNSSH-PKKRCAWVTPSTDPSYAAFHDEEW 174

Query: 850  GVPVHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFDPIAVSKLNEKKVATPGS 671
            GVPVHDD+KLFELL  S +L+ELTW  IL+KRHIFREVF+DFDP+AVSKLNEKK+   GS
Sbjct: 175  GVPVHDDRKLFELLVLSGSLSELTWSAILSKRHIFREVFIDFDPVAVSKLNEKKLIAHGS 234

Query: 670  PASSLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKPIIGNFRYPRQVPIKTSK 491
             ASSLLSELKLR I+ENARQI K+ID+ GSF++YIWSFVN KPI+  FRYPRQVP+KT K
Sbjct: 235  VASSLLSELKLRVIVENARQISKVIDEFGSFDQYIWSFVNHKPIVSRFRYPRQVPVKTPK 294

Query: 490  ADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAGDLRDGDEGLKTM 311
            AD ISKDLVRRGFR VGPTV+YSFMQV+GITNDHL SCFR+QECI A + ++ +   K  
Sbjct: 295  ADVISKDLVRRGFRSVGPTVIYSFMQVAGITNDHLTSCFRFQECITAAEAKE-ENVTKDP 353

Query: 310  NEGKTAEEISELELGRAIDDLGFTT 236
             E K    +   EL  AID+L F++
Sbjct: 354  TEKKETVNVINTELSVAIDELSFSS 378


>ref|XP_007204814.1| hypothetical protein PRUPE_ppa026720mg [Prunus persica]
            gi|462400345|gb|EMJ06013.1| hypothetical protein
            PRUPE_ppa026720mg [Prunus persica]
          Length = 378

 Score =  377 bits (968), Expect = e-101
 Identities = 205/382 (53%), Positives = 253/382 (66%), Gaps = 2/382 (0%)
 Frame = -1

Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDFDEFNGKKSP 1196
            MSG PRV+S++   SE RPVLGPAGNKA +   RKP+ KP    ++K++   E       
Sbjct: 1    MSGAPRVRSINVADSESRPVLGPAGNKAGTFSARKPVSKP----LRKAEKLAEKVASAEE 56

Query: 1195 VTVTDHLALKVDSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXTGRISRRRVT 1016
                    L    +  +    S+L + +  L  N                  R S  R+T
Sbjct: 57   KKTRQSSMLTTSPQLHSPSVPSVLRRHEQLLHSNFSLNASCSSDASTDSFHSRASTGRLT 116

Query: 1015 LTPTM--RRKQQCSPKERSAQKSFDGESEDINLAKKRCAWVTSNTDPYYAALHDEEWGVP 842
             + +   RRKQ  S               D + +KKRCAWVT NTDP YAA HDEEWG+P
Sbjct: 117  RSNSAGSRRKQYVSKPRSVVSDGGLDSPPDGSQSKKRCAWVTPNTDPCYAAFHDEEWGLP 176

Query: 841  VHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFDPIAVSKLNEKKVATPGSPAS 662
            VHDDKKLFELL  S ALAEL+WP IL+K+HIFREVF DFDP+A+SKLNEKK+  PGS AS
Sbjct: 177  VHDDKKLFELLVLSGALAELSWPAILSKKHIFREVFADFDPVAISKLNEKKLIAPGSNAS 236

Query: 661  SLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKPIIGNFRYPRQVPIKTSKADT 482
            SLLSELKLRAIIENARQ+ K+I++ GSF+KYIWSFVN+KPI+  FRYPRQVP KT KAD 
Sbjct: 237  SLLSELKLRAIIENARQMTKVIEEFGSFDKYIWSFVNNKPIVSRFRYPRQVPAKTPKADV 296

Query: 481  ISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAGDLRDGDEGLKTMNEG 302
            ISKDL+RRGFR VGPTV+YSFMQV+GITNDHLVSCFR+QEC+ A + ++ + G+K   E 
Sbjct: 297  ISKDLMRRGFRSVGPTVIYSFMQVAGITNDHLVSCFRFQECLNAAEGKE-EYGIKDEAEK 355

Query: 301  KTAEEISELELGRAIDDLGFTT 236
            KT   I E +L  A+D+L F++
Sbjct: 356  KTENGI-ESDLSVAMDELSFSS 376


>ref|XP_002276173.1| PREDICTED: uncharacterized protein LOC100267363 isoform X2 [Vitis
            vinifera] gi|297743642|emb|CBI36525.3| unnamed protein
            product [Vitis vinifera]
          Length = 375

 Score =  377 bits (967), Expect = e-101
 Identities = 208/392 (53%), Positives = 255/392 (65%), Gaps = 12/392 (3%)
 Frame = -1

Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKA-RSVELRKPIGKPNSEKVQKSQDFDEFNGKKS 1199
            MSG PRV+SM+   SEVRPVLGPAGNK  RS+  RKP  KP  +  + ++D +E     S
Sbjct: 1    MSGGPRVRSMNVADSEVRPVLGPAGNKTMRSLSGRKPATKPLRKAEKATKDDEEIKALPS 60

Query: 1198 PVTVTDHLALKVDSKWINGRAAS----------ILGQQKPNLSLNXXXXXXXXXXXXXXX 1049
                             NG A+S          +L +Q+  L  N               
Sbjct: 61   S----------------NGAASSPPSHSVSVPLVLRRQEQLLHSNLSLNASCSSDASTDS 104

Query: 1048 XTGRISRRRVTLTPTMRRKQQCSPKERSAQKSFDGESEDINL-AKKRCAWVTSNTDPYYA 872
               R S  R+T + +  R++  + K +        ES    L AK+RCAWVT NTD  Y 
Sbjct: 105  FHSRASTGRITRSSSTARRRSYASKPKVIVSDGVSESPPDGLKAKRRCAWVTPNTDLSYI 164

Query: 871  ALHDEEWGVPVHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFDPIAVSKLNEK 692
            A HDEEWGVPVHDDKKLFELL  S ALAELTWP IL+KRHIFREVF DFDPIAV+KLNEK
Sbjct: 165  AFHDEEWGVPVHDDKKLFELLVLSGALAELTWPTILSKRHIFREVFADFDPIAVAKLNEK 224

Query: 691  KVATPGSPASSLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKPIIGNFRYPRQ 512
            K+  PGS ASSL+SELKLR IIENARQ+ K+ID+ GSF++YIWSFVN KPI+  FRYPR 
Sbjct: 225  KLMAPGSIASSLISELKLRGIIENARQMSKVIDEFGSFDEYIWSFVNHKPIVSRFRYPRH 284

Query: 511  VPIKTSKADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAGDLRDG 332
            VP+KT KAD ISKDLVRRGFR VGPTV+YSFMQV+GITNDHL+SCFR+Q+C+ A +++  
Sbjct: 285  VPVKTPKADVISKDLVRRGFRSVGPTVIYSFMQVAGITNDHLISCFRFQDCVTAAEVK-- 342

Query: 331  DEGLKTMNEGKTAEEISELELGRAIDDLGFTT 236
            +E + T    +    + E EL RAID+L F++
Sbjct: 343  EEEITTGAAEEKKSNVIESELSRAIDELSFSS 374


Top