BLASTX nr result

ID: Rehmannia25_contig00023731 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia25_contig00023731
         (1204 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002271107.1| PREDICTED: uncharacterized protein LOC100243...   308   2e-81
ref|XP_004245536.1| PREDICTED: uncharacterized protein LOC101262...   291   5e-76
ref|XP_006343882.1| PREDICTED: uncharacterized protein At5g05190...   290   8e-76
ref|XP_006491240.1| PREDICTED: uncharacterized protein At5g05190...   261   4e-67
ref|XP_002320185.2| hypothetical protein POPTR_0014s09140g [Popu...   259   2e-66
ref|XP_006444880.1| hypothetical protein CICLE_v10018757mg [Citr...   259   2e-66
gb|EOX95766.1| Uncharacterized protein isoform 5 [Theobroma cacao]    249   1e-63
gb|EOX95765.1| Uncharacterized protein isoform 4, partial [Theob...   249   1e-63
gb|EOX95764.1| Uncharacterized protein isoform 3, partial [Theob...   249   1e-63
gb|EOX95763.1| Uncharacterized protein isoform 2 [Theobroma cacao]    249   1e-63
gb|EOX95762.1| Uncharacterized protein isoform 1 [Theobroma cacao]    249   1e-63
ref|XP_002533909.1| hypothetical protein RCOM_0237030 [Ricinus c...   249   1e-63
gb|EMJ21467.1| hypothetical protein PRUPE_ppa001028mg [Prunus pe...   248   3e-63
ref|XP_003520868.1| PREDICTED: uncharacterized protein At5g05190...   244   4e-62
ref|XP_003626554.1| hypothetical protein MTR_7g117150 [Medicago ...   243   1e-61
ref|XP_003553779.1| PREDICTED: uncharacterized protein At5g05190...   240   7e-61
gb|ESW19279.1| hypothetical protein PHAVU_006G111100g [Phaseolus...   236   1e-59
ref|XP_002303633.2| hypothetical protein POPTR_0003s13750g [Popu...   236   1e-59
emb|CAN76817.1| hypothetical protein VITISV_044118 [Vitis vinifera]   236   1e-59
ref|XP_004494805.1| PREDICTED: uncharacterized protein LOC101505...   234   7e-59

>ref|XP_002271107.1| PREDICTED: uncharacterized protein LOC100243335 [Vitis vinifera]
          Length = 956

 Score =  308 bits (790), Expect = 2e-81
 Identities = 182/424 (42%), Positives = 253/424 (59%), Gaps = 23/424 (5%)
 Frame = +1

Query: 1    PYHEPFPGYYGDVNNDLFMLHRHENFFHQPACSCVHCYDKNWHMPPKVDPPGLHNRRFHN 180
            PYHE F G Y + N D F  + HE FFHQPACSCV C +KNW +PP+V P     RRF  
Sbjct: 403  PYHEYFSGRYMEYNQDPFASY-HETFFHQPACSCVRCCNKNWQVPPQVPPTTFGKRRFPI 461

Query: 181  EPSNPNLHQHVNPILHRPQGNNSGGSNV--HPQ--QSLTKNSNDRDSENDGFNYHRPRKL 348
            E  NPN + HVNP     +G N  GSN   HP+  Q  T+  +D DS+  GF+ +RPR++
Sbjct: 462  ESKNPNFYHHVNPPTFGSRGYNPRGSNPPSHPRDPQPHTRWPSDIDSDIGGFSQYRPRRV 521

Query: 349  VLAHRSGRVSRPVAGGAPFIACSNCFEILKLPKKHVSLEKTEQKMKCGACSSIILFELGY 528
            V+AH + R+  P+ GGAPFI C NCFE+LK+P+K + ++K ++K++CGACS +   E+  
Sbjct: 522  VVAHGNRRLCHPIVGGAPFITCYNCFELLKVPRKFMLMDKNQRKLQCGACSCVNFLEVEN 581

Query: 529  KGFIASASAHVDQVPTEIDEGSSGTVDENVRYWNDGSNSANMN---TYSNDYDDFQDKFS 699
            K  I S    + +   + D+GS   +D   R     S+ A++N   T S+D+D     F 
Sbjct: 582  KKVIVSVPTQMKRRSPDADDGSCEVLDHYHR-----SSHAHLNVGGTNSDDFDTSGYNFQ 636

Query: 700  PTDKKSN--------SGESEKQXXXXXXXXXXXXXERGPENIL-------SAKLPLTDVK 834
              D + N         GE+ K+             E  P++++       SA+LPL +  
Sbjct: 637  SIDTEPNLPSKDCILIGEAAKRQGLLSSSPSSTEDEESPDSMIGQRDISSSAELPLKEDV 696

Query: 835  SLPDPDLSPQECPDHSPENGLVNRFDKRNKSQRPEQERVCLDRTTSQQSSVKDAAVATEM 1014
            S P      QE  D+S  N  ++R  K NKS+R ++E+V L++ TS+Q+SVKDAAVATEM
Sbjct: 697  SPPLLASPLQENFDYS-SNHAMSRHGKGNKSKRTDEEKVILNKATSRQNSVKDAAVATEM 755

Query: 1015 DVSLNEFSNSCASQDSVEISK-EARPKINKGGESFFAGLIKKSFRDFKKSNQGVEVSGSQ 1191
            +V  NE+ N+  SQ+SVE+SK E RPK NKG +SFFAGLIKKSFRDF +SN  ++ S  +
Sbjct: 756  EVCFNEYLNTGLSQESVEVSKDEDRPKNNKGSDSFFAGLIKKSFRDFTRSNHSMDNSKPK 815

Query: 1192 VFVN 1203
            V VN
Sbjct: 816  VSVN 819


>ref|XP_004245536.1| PREDICTED: uncharacterized protein LOC101262940 [Solanum
            lycopersicum]
          Length = 945

 Score =  291 bits (744), Expect = 5e-76
 Identities = 176/425 (41%), Positives = 249/425 (58%), Gaps = 25/425 (5%)
 Frame = +1

Query: 4    YHEPFPGYYGDVNNDLFMLHRHENFFHQPACSCVHCYDKNWHMPPKVDPPGLHNRRFHNE 183
            Y E +PG++   N++ F+ H HE  FHQ ACSC HC ++N+ +PP + P G  +RR  N 
Sbjct: 390  YPEHYPGHH---NDNFFIPHPHETLFHQSACSCSHCLNQNYQIPPVIQPSGFVSRRSRNG 446

Query: 184  PSNPNLHQHVNPILHRPQGNNSGGS-----NVHPQQSLTKNSNDRDSENDGFNYHR-PRK 345
             +NP LH H+N + + P G  S GS     N H  + LT++S+D +SEN G  Y   PRK
Sbjct: 447  AANPILHHHMNSVGYGPGGYTSEGSSALNKNYHEGRRLTRSSSDLESENGGLGYRGYPRK 506

Query: 346  LVLAHRSGRVSRPVAGGAPFIACSNCFEILKLPKKHVSLEKTEQKMKCGACSSIILFELG 525
            +V+AHR GRV +P+AGGAPFIAC  CFE+LK+PKK +   K+E++M+CG+CS+IILFELG
Sbjct: 507  VVVAHRVGRVYQPIAGGAPFIACCGCFELLKIPKKLMITGKSEKRMRCGSCSAIILFELG 566

Query: 526  YKGFIASASAHVDQVPTEIDEGSSGTVDENVRYWNDGSNSANMNTYSNDYDDFQDKFSPT 705
             K    S S+ V Q+  E   G+S   +EN++  N    +  M+ +S+DYD+    F+ T
Sbjct: 567  SKESGVSFSSQVKQLSAEFAPGTSNVPNENLQNANGCLMNDEMSPWSDDYDNSNYDFADT 626

Query: 706  -------DKKSNSGESEKQXXXXXXXXXXXXXERGPENIL-------SAKLPLTDVKSLP 843
                    +KSNS E EK+             E  PE ++        A++PL D    P
Sbjct: 627  KLESPSRSQKSNSTELEKRYSALSSPSSHSEDELSPERVILRHDLAHRAEIPLEDD---P 683

Query: 844  DPDLSPQECPDH----SPENGLVNRFDKRNKSQRPEQERVCLDRTTSQQSSVKDAAVATE 1011
             P L   +  DH    SP++  V +  K +  +  +QER  LDR+TS+Q+S+KD ++A E
Sbjct: 684  IPLLDSSQ-NDHAYSISPKD--VEKIRKEDMKEHTDQERTILDRSTSRQNSIKDVSMAVE 740

Query: 1012 MDVSLNEFSNSCASQDSVEISKEAR-PKINKGGESFFAGLIKKSFRDFKKSNQGVEVSGS 1188
            MDVS NEF +S  S +S + SKE    K  KGG+SF  G IK+S  +  +S+Q  E   S
Sbjct: 741  MDVSTNEFVHSGVSVESNQSSKEENLSKSYKGGQSFM-GFIKRSLGELSRSHQSSENGRS 799

Query: 1189 QVFVN 1203
             VFVN
Sbjct: 800  NVFVN 804


>ref|XP_006343882.1| PREDICTED: uncharacterized protein At5g05190-like [Solanum tuberosum]
          Length = 946

 Score =  290 bits (742), Expect = 8e-76
 Identities = 171/421 (40%), Positives = 243/421 (57%), Gaps = 21/421 (4%)
 Frame = +1

Query: 4    YHEPFPGYYGDVNNDLFMLHRHENFFHQPACSCVHCYDKNWHMPPKVDPPGLHNRRFHNE 183
            Y E +PG++   N++ F+ H HE  FHQ ACSC HC ++N+ +PP + P G  ++R  N 
Sbjct: 390  YPEHYPGHH---NDNFFIPHPHETLFHQSACSCSHCLNQNYQIPPVIQPSGFVSQRSRNG 446

Query: 184  PSNPNLHQHVNPILHRPQGNNSGGS-----NVHPQQSLTKNSNDRDSENDGFNYHR-PRK 345
            P+NP LH H N + + P G  S GS     N H  + LT++S+D +SEN G  + R PRK
Sbjct: 447  PANPILHHHRNSVGYGPGGYTSEGSSALNKNYHEGRQLTRSSSDLESENGGLGHRRYPRK 506

Query: 346  LVLAHRSGRVSRPVAGGAPFIACSNCFEILKLPKKHVSLEKTEQKMKCGACSSIILFELG 525
            +V+AHR GRV +P+AGGAPFI C  CFE+LK+PKK +   K+E+KM+CG+CS+IILFELG
Sbjct: 507  VVVAHRVGRVYQPIAGGAPFITCCGCFELLKIPKKLMITGKSEKKMRCGSCSAIILFELG 566

Query: 526  YKGFIASASAHVDQVPTEIDEGSSGTVDENVRYWNDGSNSANMNTYSNDYDDFQDKFSPT 705
             K    S S  V Q+  E   G+S   +EN++  N    +  M  +S+DYD+    F+ T
Sbjct: 567  SKESGVSFSTQVKQLSAEFAPGTSDVPNENLQNTNGCLINDEMTPWSDDYDNSNYHFTDT 626

Query: 706  -------DKKSNSGESEKQXXXXXXXXXXXXXERGPENIL-------SAKLPLTDVKSLP 843
                    +KSNS E EK+             E  PE+ +        A++PL D   +P
Sbjct: 627  KLESPSRSQKSNSTELEKRYSALSSPSSHSEDELSPESAIVRHDLAHCAEMPLED-DPIP 685

Query: 844  DPDLSPQECPDHSPENGLVNRFDKRNKSQRPEQERVCLDRTTSQQSSVKDAAVATEMDVS 1023
              D S  +         +V +  K +  +  +QER  LDR+TS+Q+S+KD ++A EMDVS
Sbjct: 686  LLDSSQNDHAYSISPKDVVEKIRKEDMKEHTDQERTILDRSTSRQNSIKDVSMAVEMDVS 745

Query: 1024 LNEFSNSCASQDSVEISKEAR-PKINKGGESFFAGLIKKSFRDFKKSNQGVEVSGSQVFV 1200
             NEF +S  S +S + +KE    K  KGG+SF  G IK+S  +  +S+Q  E   S VFV
Sbjct: 746  TNEFVHSGVSVESNQSTKEENLSKSYKGGQSFM-GFIKRSLGELSRSHQSSENGRSNVFV 804

Query: 1201 N 1203
            N
Sbjct: 805  N 805


>ref|XP_006491240.1| PREDICTED: uncharacterized protein At5g05190-like [Citrus sinensis]
          Length = 915

 Score =  261 bits (667), Expect = 4e-67
 Identities = 160/421 (38%), Positives = 236/421 (56%), Gaps = 20/421 (4%)
 Frame = +1

Query: 1    PYHEPFPGYYGDVNNDLFMLHRHENFFHQPACSCVHCYDKNWHMPPKVDPPGLHNRRFHN 180
            P H  F G Y D N+DLF  ++  + FHQP+CSC +CY+K+  +   V      +  F+N
Sbjct: 364  PSHPYFSGQYIDPNHDLFESYQQNSMFHQPSCSCYYCYNKHHQVSAPVQ-----SSAFNN 418

Query: 181  EPSNPNLHQHVNPILHRPQGNNSGGS----NVHPQQSLTKNSNDRDSENDGFNYHRPRKL 348
              +N  L+ H NP    P+ +N   +    N H  Q  T+  +D +SE   F    PR++
Sbjct: 419  RTNNAMLYHHENPRAFVPRVHNHSAAVPPLNSHGPQVHTRWPSDLNSEMGNFVRCCPRRV 478

Query: 349  VLAHRSGRVSRPVAGGAPFIACSNCFEILKLPKKHVSLEKTEQKMKCGACSSIILFELGY 528
            VL   SGR  RP+AGGAPFI C+NCFE+L+LPK+   + K ++  +CG CS++I F++  
Sbjct: 479  VLTS-SGRRCRPIAGGAPFIVCNNCFELLQLPKRTKLMAKDQKIFQCGTCSTVIDFDVIN 537

Query: 529  KGFIASASAHVDQVPTEIDEGSSGTVDENVRYWNDGSNSANMNTYSNDYD----DFQ--- 687
            K  I S  A    + TE++ GS+G + +   +     +  N N  S+DYD    DFQ   
Sbjct: 538  KKLILSVQAETKGISTEVNGGSNGAMKDYTSHSLGRLDRVNANFSSDDYDNSGYDFQAMD 597

Query: 688  -DKFSPTDKKSNSGESEKQXXXXXXXXXXXXXERGPENIL-------SAKLPLTDVKSLP 843
             +  S TD+  +SG+  +              E  PE ++       S + P    +S P
Sbjct: 598  REPASSTDQFLDSGKPPETHSLRSSTPSISEDEHSPEVLITPREVTHSTQQPTKATQSTP 657

Query: 844  DPDLSPQECPDHSPENGLVNRFDKRNKSQRPEQERVCLDRTTSQQSSVKDAAVATEMDVS 1023
             P    QE  D+S  N +VNRF K N+S R +QE+V  ++ T++Q+S+K+A++ATEM+VS
Sbjct: 658  PPGSPLQEHFDYSSSNHVVNRFAKGNRSSRSDQEKVITNKVTARQNSLKEASLATEMEVS 717

Query: 1024 LNEFSNSCASQDSVEISKE-ARPKINKGGESFFAGLIKKSFRDFKKSNQGVEVSGSQVFV 1200
            LNE+SN+  SQDS + ++E   PK +K  ESFFA +IKKSF+D  +SNQ  E   S V V
Sbjct: 718  LNEYSNAGMSQDSGDATREDDLPKNHKTSESFFANIIKKSFKDLSRSNQTQERGNSNVSV 777

Query: 1201 N 1203
            N
Sbjct: 778  N 778


>ref|XP_002320185.2| hypothetical protein POPTR_0014s09140g [Populus trichocarpa]
            gi|550323811|gb|EEE98500.2| hypothetical protein
            POPTR_0014s09140g [Populus trichocarpa]
          Length = 900

 Score =  259 bits (662), Expect = 2e-66
 Identities = 164/414 (39%), Positives = 228/414 (55%), Gaps = 13/414 (3%)
 Frame = +1

Query: 1    PYHEPFPGYYGDVNNDLFMLHRHENFFHQPACSCVHCYDKNWHMPPKVDPPGLHNRRFHN 180
            P  + F G Y D N DLF  +     FHQP+CSC HCY+K+  +   V P    N RF +
Sbjct: 363  PPRQYFSGQYFDTNPDLFEPYPSNAAFHQPSCSCFHCYEKHHGVSATVPPTSFGNIRFPD 422

Query: 181  EPSNPNLHQHVNPILHRPQGNNS-----GGSNVHPQQSLTKNSNDRDSENDGFNYHRPRK 345
              +NP ++QH N     P  NNS        N    QS  +  +D +SE  GF     R+
Sbjct: 423  MSNNPIMYQHRNSAAFGPHMNNSRIPVPSQLNFRSSQSHKRWPSDLNSEMAGFARPHTRR 482

Query: 346  LVLAHRSGRVSRPVAGGAPFIACSNCFEILKLPKKHVSLEKTEQKMKCGACSSIILFELG 525
            +VLA  S R  RP+AGGAPF+ C NCFE+L+LPKK + +   +QKM+C  CSS+I F + 
Sbjct: 483  VVLASGS-RCCRPIAGGAPFLTCFNCFELLQLPKKVLLMANNQQKMQCSTCSSVINFSVV 541

Query: 526  YKGFIASASAHVDQVPTEIDEGSSGTVDENVRYWNDGSNSANMNTYSNDYD----DFQD- 690
             K  + S +    Q+PTE+D+ S+              N  N N  S+DYD    DFQ  
Sbjct: 542  NKKLMLSVNTEATQIPTEVDDSSNHI------------NRINANFSSDDYDNSGYDFQTV 589

Query: 691  KFSPTDKKSNSGESEKQXXXXXXXXXXXXXERGPENILSAKLPLTDVKSL-PDPDLSP-Q 864
            +  P     NS   ++              E  P+ IL A +  T   SL P P  SP Q
Sbjct: 590  ETDPIGHHLNSTNPQETQSFHSSSPSTSEYENIPD-ILIAPINGTQQASLSPPPPGSPLQ 648

Query: 865  ECPDHSPENGLVNRFDKRNKSQRPEQERVCLDRTTSQQSSVKDAAVATEMDVSLNEFSNS 1044
            +  D+S  N  VNRF K N+S R + ERV  ++  ++Q+S+K+A VATEM+VS  ++SN+
Sbjct: 649  QHFDYSSNNHAVNRFGKGNRSNRADHERVITNKANTRQNSMKEAPVATEMEVSFPDYSNT 708

Query: 1045 CASQDSVEISKE-ARPKINKGGESFFAGLIKKSFRDFKKSNQGVEVSGSQVFVN 1203
             ASQDS ++S+E ++ + NKGG+SFFA +IKKSF+DF +S+Q  E   + V VN
Sbjct: 709  AASQDSGDVSREDSQSRNNKGGDSFFANIIKKSFKDFSRSHQTDEHGRNNVLVN 762


>ref|XP_006444880.1| hypothetical protein CICLE_v10018757mg [Citrus clementina]
            gi|557547142|gb|ESR58120.1| hypothetical protein
            CICLE_v10018757mg [Citrus clementina]
          Length = 915

 Score =  259 bits (661), Expect = 2e-66
 Identities = 161/421 (38%), Positives = 238/421 (56%), Gaps = 20/421 (4%)
 Frame = +1

Query: 1    PYHEPFPGYYGDVNNDLFMLHRHENFFHQPACSCVHCYDKNWHMPPKVDPPGLHNRRFHN 180
            P H  F G Y D N+DLF  ++  + FHQP+CSC +CY+K +H   +V  P + +  F+N
Sbjct: 364  PSHPYFSGQYIDPNHDLFESYQQNSMFHQPSCSCYYCYNK-YH---QVSAP-VQSSAFNN 418

Query: 181  EPSNPNLHQHVNPILHRPQGNNSGGS----NVHPQQSLTKNSNDRDSENDGFNYHRPRKL 348
              +N  L+ H NP    P+ +N   +    N H  Q  T+  +D + E   F    PR++
Sbjct: 419  RTNNAMLYHHENPRAFVPRVHNHSAAVPPLNSHGPQVHTRWPSDLNCEMGNFVRCCPRRV 478

Query: 349  VLAHRSGRVSRPVAGGAPFIACSNCFEILKLPKKHVSLEKTEQKMKCGACSSIILFELGY 528
            VL   SGR  RP+AGGAPFI C+NCFE+L+LPK+   + K ++  +CG CS++I F++  
Sbjct: 479  VLTS-SGRRCRPIAGGAPFIVCNNCFELLQLPKRTKLMAKDQKIFQCGTCSTVIDFDVIN 537

Query: 529  KGFIASASAHVDQVPTEIDEGSSGTVDENVRYWNDGSNSANMNTYSNDYD----DFQ--- 687
            K  I S  A    + TE++ GS+G + +   +     +  N N  S+DYD    DFQ   
Sbjct: 538  KKLILSVQAETKGISTEVNGGSNGAMKDYTSHSLGRLDRVNANFSSDDYDNSGYDFQAMD 597

Query: 688  -DKFSPTDKKSNSGESEKQXXXXXXXXXXXXXERGPENIL-------SAKLPLTDVKSLP 843
             +  S TD+  +SG+  +              E  PE ++       S + P    +S P
Sbjct: 598  REPASSTDQFLDSGKPPETHSLRSSTPSISEDEHSPEVLITPREVTHSTQQPTKATQSTP 657

Query: 844  DPDLSPQECPDHSPENGLVNRFDKRNKSQRPEQERVCLDRTTSQQSSVKDAAVATEMDVS 1023
             P    QE  D+S  N +VNRF K N+S R +QE+V  ++ T++Q+S+K+A++ATEM+VS
Sbjct: 658  PPGSPLQEHFDYSSSNHVVNRFAKGNRSSRSDQEKVITNKVTARQNSLKEASLATEMEVS 717

Query: 1024 LNEFSNSCASQDSVEISKE-ARPKINKGGESFFAGLIKKSFRDFKKSNQGVEVSGSQVFV 1200
            LNE+SN+  SQDS + ++E   PK +K  ESFFA +IKKSF+D  +SNQ  E   S V V
Sbjct: 718  LNEYSNAGMSQDSGDATREDDLPKNHKTSESFFANIIKKSFKDLSRSNQTQERGNSNVSV 777

Query: 1201 N 1203
            N
Sbjct: 778  N 778


>gb|EOX95766.1| Uncharacterized protein isoform 5 [Theobroma cacao]
          Length = 839

 Score =  249 bits (637), Expect = 1e-63
 Identities = 153/423 (36%), Positives = 233/423 (55%), Gaps = 22/423 (5%)
 Frame = +1

Query: 1    PYHEPFPGYYGDVNNDLFMLHRHENFFHQPACSCVHCYDKNWHMPPKVDPPGLHNRRFHN 180
            P H  F G Y + N+D FM +   +  H  +CSC HCY+K+  +P  V P    N+RF +
Sbjct: 365  PPHTYFSGQYIENNHDPFMSYPQSSVLHHASCSCFHCYEKHRRVPAPVPPSAFGNKRFPD 424

Query: 181  EPSNPNLHQHVNPILHRPQGNNSGGS-----NVHPQQSLTKNSNDRDSENDGFNYHRPRK 345
             PSNP  H   NP       +NS  +     NV   Q   +  +D ++E  GF   RP++
Sbjct: 425  VPSNPMYHIE-NPGTFGSHFHNSRTTMPPPLNVRGTQVHARWPSDINTEIGGFVRCRPQR 483

Query: 346  LVLAHRSGRVSRPVAGGAPFIACSNCFEILKLPKKHVSLEKTEQKMKCGACSSIILFELG 525
            +VLA   GR  RP+AGGAPFI C NCFE+L++P+K   + K E K++CGACS++I F + 
Sbjct: 484  VVLAS-GGRHFRPIAGGAPFITCYNCFELLQMPRKLQLIVKNEHKLRCGACSTVINFTVV 542

Query: 526  YKGFIASASAHVDQVPTEIDEGSSGTVDENVRYWNDGSNSANMNTYSNDYDDFQDKFSPT 705
             K  +    A    +  E+D+ S+  V++N  ++  G  +   N  S+DYD     F   
Sbjct: 543  NKKLVLCDHAETKGISVEVDDSSNEVVNDNSSHFR-GRVNRIANFSSDDYDHSGYDFQSM 601

Query: 706  DKKS---------NSGESEKQXXXXXXXXXXXXXERGPENILSAKLPLTDVKSLPDPDLS 858
            D++          NS   ++              E  P+ +++++  +  V+    P LS
Sbjct: 602  DREPVALSMGQALNSVRPQELQNFHSSSPSTSEDENSPDVLIASRDEVNSVEQPIKPTLS 661

Query: 859  P-------QECPDHSPENGLVNRFDKRNKSQRPEQERVCLDRTTSQQSSVKDAAVATEMD 1017
            P       QE  D+S  N  VNRF K N+S R +QE+V  ++ T++Q+S+K+A++ TEM+
Sbjct: 662  PPPAGSPLQEHFDYSSNNRAVNRFGKGNRSSRSDQEKVMSNKATTRQNSLKEASLPTEME 721

Query: 1018 VSLNEFSNSCASQDSVEISKE-ARPKINKGGESFFAGLIKKSFRDFKKSNQGVEVSGSQV 1194
            VS N++SN+  SQDS + ++E  + K+ KGGESFFA +IK+SF+DF +SNQ  E   S +
Sbjct: 722  VSFNDYSNTGISQDSGDATREDDQLKMTKGGESFFANIIKRSFKDFSRSNQTEERGKSNI 781

Query: 1195 FVN 1203
             VN
Sbjct: 782  SVN 784


>gb|EOX95765.1| Uncharacterized protein isoform 4, partial [Theobroma cacao]
          Length = 839

 Score =  249 bits (637), Expect = 1e-63
 Identities = 153/423 (36%), Positives = 233/423 (55%), Gaps = 22/423 (5%)
 Frame = +1

Query: 1    PYHEPFPGYYGDVNNDLFMLHRHENFFHQPACSCVHCYDKNWHMPPKVDPPGLHNRRFHN 180
            P H  F G Y + N+D FM +   +  H  +CSC HCY+K+  +P  V P    N+RF +
Sbjct: 365  PPHTYFSGQYIENNHDPFMSYPQSSVLHHASCSCFHCYEKHRRVPAPVPPSAFGNKRFPD 424

Query: 181  EPSNPNLHQHVNPILHRPQGNNSGGS-----NVHPQQSLTKNSNDRDSENDGFNYHRPRK 345
             PSNP  H   NP       +NS  +     NV   Q   +  +D ++E  GF   RP++
Sbjct: 425  VPSNPMYHIE-NPGTFGSHFHNSRTTMPPPLNVRGTQVHARWPSDINTEIGGFVRCRPQR 483

Query: 346  LVLAHRSGRVSRPVAGGAPFIACSNCFEILKLPKKHVSLEKTEQKMKCGACSSIILFELG 525
            +VLA   GR  RP+AGGAPFI C NCFE+L++P+K   + K E K++CGACS++I F + 
Sbjct: 484  VVLAS-GGRHFRPIAGGAPFITCYNCFELLQMPRKLQLIVKNEHKLRCGACSTVINFTVV 542

Query: 526  YKGFIASASAHVDQVPTEIDEGSSGTVDENVRYWNDGSNSANMNTYSNDYDDFQDKFSPT 705
             K  +    A    +  E+D+ S+  V++N  ++  G  +   N  S+DYD     F   
Sbjct: 543  NKKLVLCDHAETKGISVEVDDSSNEVVNDNSSHFR-GRVNRIANFSSDDYDHSGYDFQSM 601

Query: 706  DKKS---------NSGESEKQXXXXXXXXXXXXXERGPENILSAKLPLTDVKSLPDPDLS 858
            D++          NS   ++              E  P+ +++++  +  V+    P LS
Sbjct: 602  DREPVALSMGQALNSVRPQELQNFHSSSPSTSEDENSPDVLIASRDEVNSVEQPIKPTLS 661

Query: 859  P-------QECPDHSPENGLVNRFDKRNKSQRPEQERVCLDRTTSQQSSVKDAAVATEMD 1017
            P       QE  D+S  N  VNRF K N+S R +QE+V  ++ T++Q+S+K+A++ TEM+
Sbjct: 662  PPPAGSPLQEHFDYSSNNRAVNRFGKGNRSSRSDQEKVMSNKATTRQNSLKEASLPTEME 721

Query: 1018 VSLNEFSNSCASQDSVEISKE-ARPKINKGGESFFAGLIKKSFRDFKKSNQGVEVSGSQV 1194
            VS N++SN+  SQDS + ++E  + K+ KGGESFFA +IK+SF+DF +SNQ  E   S +
Sbjct: 722  VSFNDYSNTGISQDSGDATREDDQLKMTKGGESFFANIIKRSFKDFSRSNQTEERGKSNI 781

Query: 1195 FVN 1203
             VN
Sbjct: 782  SVN 784


>gb|EOX95764.1| Uncharacterized protein isoform 3, partial [Theobroma cacao]
          Length = 855

 Score =  249 bits (637), Expect = 1e-63
 Identities = 153/423 (36%), Positives = 233/423 (55%), Gaps = 22/423 (5%)
 Frame = +1

Query: 1    PYHEPFPGYYGDVNNDLFMLHRHENFFHQPACSCVHCYDKNWHMPPKVDPPGLHNRRFHN 180
            P H  F G Y + N+D FM +   +  H  +CSC HCY+K+  +P  V P    N+RF +
Sbjct: 365  PPHTYFSGQYIENNHDPFMSYPQSSVLHHASCSCFHCYEKHRRVPAPVPPSAFGNKRFPD 424

Query: 181  EPSNPNLHQHVNPILHRPQGNNSGGS-----NVHPQQSLTKNSNDRDSENDGFNYHRPRK 345
             PSNP  H   NP       +NS  +     NV   Q   +  +D ++E  GF   RP++
Sbjct: 425  VPSNPMYHIE-NPGTFGSHFHNSRTTMPPPLNVRGTQVHARWPSDINTEIGGFVRCRPQR 483

Query: 346  LVLAHRSGRVSRPVAGGAPFIACSNCFEILKLPKKHVSLEKTEQKMKCGACSSIILFELG 525
            +VLA   GR  RP+AGGAPFI C NCFE+L++P+K   + K E K++CGACS++I F + 
Sbjct: 484  VVLAS-GGRHFRPIAGGAPFITCYNCFELLQMPRKLQLIVKNEHKLRCGACSTVINFTVV 542

Query: 526  YKGFIASASAHVDQVPTEIDEGSSGTVDENVRYWNDGSNSANMNTYSNDYDDFQDKFSPT 705
             K  +    A    +  E+D+ S+  V++N  ++  G  +   N  S+DYD     F   
Sbjct: 543  NKKLVLCDHAETKGISVEVDDSSNEVVNDNSSHFR-GRVNRIANFSSDDYDHSGYDFQSM 601

Query: 706  DKKS---------NSGESEKQXXXXXXXXXXXXXERGPENILSAKLPLTDVKSLPDPDLS 858
            D++          NS   ++              E  P+ +++++  +  V+    P LS
Sbjct: 602  DREPVALSMGQALNSVRPQELQNFHSSSPSTSEDENSPDVLIASRDEVNSVEQPIKPTLS 661

Query: 859  P-------QECPDHSPENGLVNRFDKRNKSQRPEQERVCLDRTTSQQSSVKDAAVATEMD 1017
            P       QE  D+S  N  VNRF K N+S R +QE+V  ++ T++Q+S+K+A++ TEM+
Sbjct: 662  PPPAGSPLQEHFDYSSNNRAVNRFGKGNRSSRSDQEKVMSNKATTRQNSLKEASLPTEME 721

Query: 1018 VSLNEFSNSCASQDSVEISKE-ARPKINKGGESFFAGLIKKSFRDFKKSNQGVEVSGSQV 1194
            VS N++SN+  SQDS + ++E  + K+ KGGESFFA +IK+SF+DF +SNQ  E   S +
Sbjct: 722  VSFNDYSNTGISQDSGDATREDDQLKMTKGGESFFANIIKRSFKDFSRSNQTEERGKSNI 781

Query: 1195 FVN 1203
             VN
Sbjct: 782  SVN 784


>gb|EOX95763.1| Uncharacterized protein isoform 2 [Theobroma cacao]
          Length = 844

 Score =  249 bits (637), Expect = 1e-63
 Identities = 153/423 (36%), Positives = 233/423 (55%), Gaps = 22/423 (5%)
 Frame = +1

Query: 1    PYHEPFPGYYGDVNNDLFMLHRHENFFHQPACSCVHCYDKNWHMPPKVDPPGLHNRRFHN 180
            P H  F G Y + N+D FM +   +  H  +CSC HCY+K+  +P  V P    N+RF +
Sbjct: 365  PPHTYFSGQYIENNHDPFMSYPQSSVLHHASCSCFHCYEKHRRVPAPVPPSAFGNKRFPD 424

Query: 181  EPSNPNLHQHVNPILHRPQGNNSGGS-----NVHPQQSLTKNSNDRDSENDGFNYHRPRK 345
             PSNP  H   NP       +NS  +     NV   Q   +  +D ++E  GF   RP++
Sbjct: 425  VPSNPMYHIE-NPGTFGSHFHNSRTTMPPPLNVRGTQVHARWPSDINTEIGGFVRCRPQR 483

Query: 346  LVLAHRSGRVSRPVAGGAPFIACSNCFEILKLPKKHVSLEKTEQKMKCGACSSIILFELG 525
            +VLA   GR  RP+AGGAPFI C NCFE+L++P+K   + K E K++CGACS++I F + 
Sbjct: 484  VVLAS-GGRHFRPIAGGAPFITCYNCFELLQMPRKLQLIVKNEHKLRCGACSTVINFTVV 542

Query: 526  YKGFIASASAHVDQVPTEIDEGSSGTVDENVRYWNDGSNSANMNTYSNDYDDFQDKFSPT 705
             K  +    A    +  E+D+ S+  V++N  ++  G  +   N  S+DYD     F   
Sbjct: 543  NKKLVLCDHAETKGISVEVDDSSNEVVNDNSSHFR-GRVNRIANFSSDDYDHSGYDFQSM 601

Query: 706  DKKS---------NSGESEKQXXXXXXXXXXXXXERGPENILSAKLPLTDVKSLPDPDLS 858
            D++          NS   ++              E  P+ +++++  +  V+    P LS
Sbjct: 602  DREPVALSMGQALNSVRPQELQNFHSSSPSTSEDENSPDVLIASRDEVNSVEQPIKPTLS 661

Query: 859  P-------QECPDHSPENGLVNRFDKRNKSQRPEQERVCLDRTTSQQSSVKDAAVATEMD 1017
            P       QE  D+S  N  VNRF K N+S R +QE+V  ++ T++Q+S+K+A++ TEM+
Sbjct: 662  PPPAGSPLQEHFDYSSNNRAVNRFGKGNRSSRSDQEKVMSNKATTRQNSLKEASLPTEME 721

Query: 1018 VSLNEFSNSCASQDSVEISKE-ARPKINKGGESFFAGLIKKSFRDFKKSNQGVEVSGSQV 1194
            VS N++SN+  SQDS + ++E  + K+ KGGESFFA +IK+SF+DF +SNQ  E   S +
Sbjct: 722  VSFNDYSNTGISQDSGDATREDDQLKMTKGGESFFANIIKRSFKDFSRSNQTEERGKSNI 781

Query: 1195 FVN 1203
             VN
Sbjct: 782  SVN 784


>gb|EOX95762.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 921

 Score =  249 bits (637), Expect = 1e-63
 Identities = 153/423 (36%), Positives = 233/423 (55%), Gaps = 22/423 (5%)
 Frame = +1

Query: 1    PYHEPFPGYYGDVNNDLFMLHRHENFFHQPACSCVHCYDKNWHMPPKVDPPGLHNRRFHN 180
            P H  F G Y + N+D FM +   +  H  +CSC HCY+K+  +P  V P    N+RF +
Sbjct: 365  PPHTYFSGQYIENNHDPFMSYPQSSVLHHASCSCFHCYEKHRRVPAPVPPSAFGNKRFPD 424

Query: 181  EPSNPNLHQHVNPILHRPQGNNSGGS-----NVHPQQSLTKNSNDRDSENDGFNYHRPRK 345
             PSNP  H   NP       +NS  +     NV   Q   +  +D ++E  GF   RP++
Sbjct: 425  VPSNPMYHIE-NPGTFGSHFHNSRTTMPPPLNVRGTQVHARWPSDINTEIGGFVRCRPQR 483

Query: 346  LVLAHRSGRVSRPVAGGAPFIACSNCFEILKLPKKHVSLEKTEQKMKCGACSSIILFELG 525
            +VLA   GR  RP+AGGAPFI C NCFE+L++P+K   + K E K++CGACS++I F + 
Sbjct: 484  VVLAS-GGRHFRPIAGGAPFITCYNCFELLQMPRKLQLIVKNEHKLRCGACSTVINFTVV 542

Query: 526  YKGFIASASAHVDQVPTEIDEGSSGTVDENVRYWNDGSNSANMNTYSNDYDDFQDKFSPT 705
             K  +    A    +  E+D+ S+  V++N  ++  G  +   N  S+DYD     F   
Sbjct: 543  NKKLVLCDHAETKGISVEVDDSSNEVVNDNSSHFR-GRVNRIANFSSDDYDHSGYDFQSM 601

Query: 706  DKKS---------NSGESEKQXXXXXXXXXXXXXERGPENILSAKLPLTDVKSLPDPDLS 858
            D++          NS   ++              E  P+ +++++  +  V+    P LS
Sbjct: 602  DREPVALSMGQALNSVRPQELQNFHSSSPSTSEDENSPDVLIASRDEVNSVEQPIKPTLS 661

Query: 859  P-------QECPDHSPENGLVNRFDKRNKSQRPEQERVCLDRTTSQQSSVKDAAVATEMD 1017
            P       QE  D+S  N  VNRF K N+S R +QE+V  ++ T++Q+S+K+A++ TEM+
Sbjct: 662  PPPAGSPLQEHFDYSSNNRAVNRFGKGNRSSRSDQEKVMSNKATTRQNSLKEASLPTEME 721

Query: 1018 VSLNEFSNSCASQDSVEISKE-ARPKINKGGESFFAGLIKKSFRDFKKSNQGVEVSGSQV 1194
            VS N++SN+  SQDS + ++E  + K+ KGGESFFA +IK+SF+DF +SNQ  E   S +
Sbjct: 722  VSFNDYSNTGISQDSGDATREDDQLKMTKGGESFFANIIKRSFKDFSRSNQTEERGKSNI 781

Query: 1195 FVN 1203
             VN
Sbjct: 782  SVN 784


>ref|XP_002533909.1| hypothetical protein RCOM_0237030 [Ricinus communis]
            gi|223526130|gb|EEF28474.1| hypothetical protein
            RCOM_0237030 [Ricinus communis]
          Length = 916

 Score =  249 bits (637), Expect = 1e-63
 Identities = 156/425 (36%), Positives = 239/425 (56%), Gaps = 24/425 (5%)
 Frame = +1

Query: 1    PYHEPFPGYYGDVNNDLFMLHRHENFFHQPACSCVHCYDKNWHMPPKVDPPGLHNRRFHN 180
            P H+ F  +Y D+N+D F  +   + FHQP+CSC HCY+++  +   V P    N+RF +
Sbjct: 357  PSHQYFSRHYFDINSDPFGPYTSNSNFHQPSCSCFHCYERHHGVSAPVPPTAFSNKRFPD 416

Query: 181  EPSNPNLHQHVNPILHRPQGNNSGGSNVHP-----QQSLTKNSNDRDSENDGFNYHRPRK 345
              +NP L+QH N     P  +NS  +   P      QS  +  +D +SE  GF   RPR+
Sbjct: 417  VLNNPMLYQHENRGAFAPHVHNSRTTVPPPLDFRGAQSHARWPSDLNSEMGGFVRCRPRR 476

Query: 346  LVLAHRSGRVSRPVAGGAPFIACSNCFEILKLPKKHVSLEKTEQKMKCGACSSIILFELG 525
            +VLA   G   +P+AGGAPF +C NCFE+L++PKK + + K +QK++CGACS++I F + 
Sbjct: 477  VVLAG-GGCCCQPMAGGAPFFSCFNCFEVLQVPKKVLLMGKNQQKIQCGACSTVIDFAVV 535

Query: 526  YKGFIASASAHVDQVPTEIDEGSSGTVDENVRYWNDGSNSANMNTYSNDYDDFQDKFSPT 705
             K  + S +  V QVP E+D  S+  + E+  Y +D  +  N N  S+DYD+    F   
Sbjct: 536  NKKLVLSINTEVTQVPIEVDNSSTEMIKESTSYSHDHMSRMNTNFSSDDYDNSGYDFQIV 595

Query: 706  D---------KKSNSGESEKQXXXXXXXXXXXXXERGPENIL-------SAKLPLTDVKS 837
            D         +  NS + ++              E  P+ ++       SA+ P+    S
Sbjct: 596  DTDPIALLSGQGLNSMKHQEMNGFHTSSLSTSEDENSPDALIAPREIINSAQQPIKASLS 655

Query: 838  LPDPDLSPQECPDHSP-ENGLVNRFDKRNKSQRPEQERVCL-DRTTSQQSSVKDAAVATE 1011
             P P    Q+  D S   N  VNRF K N+S R +QE+V   ++ T++Q+S+KD+++ATE
Sbjct: 656  PPPPGSPLQQHFDFSSNNNNAVNRFGKGNRSSRSDQEKVMTNNKATTRQNSMKDSSLATE 715

Query: 1012 MDVSLNEFSNSCASQDSVEISKEARP-KINKGGESFFAGLIKKSFRDFKKSNQGVEVSGS 1188
            ++V  +E+S++  SQDS + ++E    K++KGG+SFFA  IKKSF+D  +SNQ  + S S
Sbjct: 716  IEVPFHEYSHTGVSQDSGDANREDNQLKVSKGGDSFFAN-IKKSFKDLSRSNQIDDRSRS 774

Query: 1189 QVFVN 1203
             V VN
Sbjct: 775  NVSVN 779


>gb|EMJ21467.1| hypothetical protein PRUPE_ppa001028mg [Prunus persica]
          Length = 929

 Score =  248 bits (634), Expect = 3e-63
 Identities = 152/431 (35%), Positives = 224/431 (51%), Gaps = 30/431 (6%)
 Frame = +1

Query: 1    PYHEPFPGYYGDVNNDLFMLHRHENFFHQPACSCVHCYDKNWHMPPKVDPPGLHNRRFHN 180
            P H  F G Y + + D + L+ H   FH P C C +CYDK+      V     HN+RF +
Sbjct: 363  PSHPYFSGQYAENSPDPYELYPHSATFHHPTCPCFYCYDKHRRASVPVPSTAFHNKRFPD 422

Query: 181  EPSNPNLHQHVNPILHRPQGNNSGGSNVHP-------------QQSLTKNSNDRDSENDG 321
             P+NP L Q  NP +  P  +N   + + P              Q  T+  ND +S  D 
Sbjct: 423  FPNNPMLAQPENPGMIGPYDHNKPRTAIPPPFHVSQAHTRRPSDQPHTRWPNDLNSHMDS 482

Query: 322  FNYHRPRKLVLAHRSGRVSRPVAGGAPFIACSNCFEILKLPKKHVSLEKTEQKMKCGACS 501
            F + RP ++VLA   GR   P +GGAPF+ C+NCFE+L+LPK+ +  EK +QKM+CGACS
Sbjct: 483  FAHSRPERVVLAS-GGRRCLPFSGGAPFVTCNNCFELLQLPKRVLIGEKNQQKMRCGACS 541

Query: 502  SIILFELGYKGFIASASAHVDQVPTEIDEGSSGTVDENVRYWNDGSNSANMNTYSNDYDD 681
            ++I F +  K  + S  A   Q P+E++  S+  V ++  + +        +  S+DYD+
Sbjct: 542  TVIDFSVSNKKLVLSHHAEAQQNPSEVNISSNEVVKDSTSHSHGRVTRVYAHFSSDDYDN 601

Query: 682  FQDKFSPTDKK---------SNSGESEKQXXXXXXXXXXXXXERGPENILSAK------- 813
                F   D++         S +G+  +              +  PE  ++ K       
Sbjct: 602  SGYDFHSIDREPVLPSTAPSSTTGKPHEMQSFHSSSPSTSEDDCNPEAPIAPKEFTNSIQ 661

Query: 814  LPLTDVKSLPDPDLSPQECPDHSPENGLVNRFDKRNKSQRPEQERVCLDRTTSQQSSVKD 993
             P     S P P    QE  + S  + ++NR  K N+S R +QE+V  ++  S+Q+S+K+
Sbjct: 662  QPTKATFSPPPPGSPLQEHFEFSSNSHVINRLGKGNRSSRSDQEKVKPNKVNSRQNSLKE 721

Query: 994  AAVATEMDVSLNEFSNSCASQDSVEISKEA-RPKINKGGESFFAGLIKKSFRDFKKSNQG 1170
             ++ATEM+VS NE+SN+  SQDS + +KE  +P+ NKG ESF    IKKSFRDF KSNQ 
Sbjct: 722  TSLATEMEVSFNEYSNTGVSQDSWDANKEEDQPRTNKGSESFITNFIKKSFRDFSKSNQT 781

Query: 1171 VEVSGSQVFVN 1203
             E   S V VN
Sbjct: 782  NEHGRSNVSVN 792


>ref|XP_003520868.1| PREDICTED: uncharacterized protein At5g05190-like [Glycine max]
          Length = 904

 Score =  244 bits (624), Expect = 4e-62
 Identities = 161/421 (38%), Positives = 223/421 (52%), Gaps = 20/421 (4%)
 Frame = +1

Query: 1    PYHEPFPGYYGDVNNDLFMLHRHENFFHQPACSCVHCYDKNWHMPPKVDPPGLHNRRFHN 180
            P H  +PG Y D N D + L+ H    H P CSC HCYD          P    N RF +
Sbjct: 354  PLHPYYPGRYVDTNPDSYELYSHNAMLHPPTCSCFHCYDSKQRGSVPALPASFINSRFPD 413

Query: 181  EPSNPNLHQHVNPILHRPQGNNSGGS----NVHPQQSLTKNSNDRDSENDGFNYHRPRKL 348
             P++P L+ H  P    P  +NS  +        +Q   + ++D +SE  GF   RPRK+
Sbjct: 414  TPNDPMLYHHEIPGAFGPHVHNSRTTIPPVTYRQKQLHARWASDFNSEMSGFVRSRPRKV 473

Query: 349  VLAHRSGRVSRPVAGGAPFIACSNCFEILKLPKKHVSLEKT-EQKMKCGACSSIILFELG 525
            +LA  S R   P AGG+PFI+C NCFE+L LPKK + L K  +QK++CGACSS I F + 
Sbjct: 474  MLASSSQR-CYPAAGGSPFISCHNCFELLLLPKKALVLVKNHQQKVQCGACSSEISFAVI 532

Query: 526  YKGFIASASAHVDQVPTEIDEGSSGTVDENVRYWNDGSNSANMNTYSNDYDDFQDKFSPT 705
             K  + S +     VP+  D  S+  V   + +     +    N  S+DY  +   F   
Sbjct: 533  NKKLVISPNLETKGVPSRGDNSSNEVVSSRMSHSRGHVSRTGANFSSDDYSGYD--FHSV 590

Query: 706  DKKS------NSGESEKQXXXXXXXXXXXXXERGPENIL-------SAKLPLTDVKSLPD 846
            D++       NS +S +              E  PE ++       S + P TD  SL  
Sbjct: 591  DREPISLVALNSNKSREMPSFHSSSLSTSEDENSPEAMIAPREATKSIQRPTTD--SLSP 648

Query: 847  PDLSP-QECPDHSPENGLVNRFDKRNKSQRPEQERVCLDRTTSQQSSVKDAAVATEMDVS 1023
            P  SP QE  D+S  N  VNRF K N+S R EQE+  +D+ +++Q+S+K+ A+ATEMDV 
Sbjct: 649  PAGSPLQEYFDYSSNNHAVNRFGKGNQSSRSEQEKTKVDKMSARQNSLKETALATEMDV- 707

Query: 1024 LNEFSNSCASQDSVEISKE-ARPKINKGGESFFAGLIKKSFRDFKKSNQGVEVSGSQVFV 1200
             +++SN+  SQDS + S+E   P+ N+GGESFFA +IKKSFRDF +SN   E S   V V
Sbjct: 708  -HDYSNTGVSQDSGDASREHDHPRSNRGGESFFANIIKKSFRDFSRSNHTDERSKISVTV 766

Query: 1201 N 1203
            N
Sbjct: 767  N 767


>ref|XP_003626554.1| hypothetical protein MTR_7g117150 [Medicago truncatula]
            gi|355501569|gb|AES82772.1| hypothetical protein
            MTR_7g117150 [Medicago truncatula]
          Length = 891

 Score =  243 bits (620), Expect = 1e-61
 Identities = 163/422 (38%), Positives = 225/422 (53%), Gaps = 21/422 (4%)
 Frame = +1

Query: 1    PYHEPFPGYYGDVNNDLFMLHRHEN--FFHQPACSCVHCYDKNWHMPPKVDPPGLHNRRF 174
            P H  FPG Y D N D + L+ H N    HQP+CSC HCYD        + PP      F
Sbjct: 346  PLHPYFPGRYVDPNPDSYELYAHNNNAMLHQPSCSCFHCYDNKRRGSVPMPPPS-----F 400

Query: 175  HNEPSNPNLHQHVNPILHRPQGNNSGGS----NVHPQQSLTKNSNDRDSENDGFNYHRPR 342
             N+PS   L+ H  P  +    +NS  S     +   Q  T+  +D +SE  GF  +R R
Sbjct: 401  PNDPSM--LYHHEIPGGYGSHVHNSKASIPPARLRENQLHTRWPSDFNSEMGGFTRNRHR 458

Query: 343  KLVLAHRSGRVSRPVAGGAPFIACSNCFEILKLPKKHVSLEKT-EQKMKCGACSSIILFE 519
            K+++A  S R   PVAGG+PFI C+NCFE+L+LPKK + L +  +QK++CGACSS I   
Sbjct: 459  KVMVASSSRRC-HPVAGGSPFITCNNCFELLQLPKKALVLARNHQQKVRCGACSSEISVS 517

Query: 520  LGYKGFIASASAHVDQVPTEIDEGSSGTVDENVRYWNDGSNSANMNTYSNDYDDFQDKFS 699
            L  K  + S S  +   P+ +D+ S+  +   V +    +N    N  S+DY  +   F 
Sbjct: 518  LINKKLVISHS-EMKGAPSRVDDSSNEVLSSRVSHTRGLANRNGANFSSDDYSGYD--FL 574

Query: 700  PTDKKS------NSGESEKQXXXXXXXXXXXXXERGPENILSAK-------LPLTDVKSL 840
              DK+       NS +S++              E   E +++ +        P TD  S 
Sbjct: 575  SVDKEPLSAVALNSNKSQEMQSFHSSSPSTSEDENSSEAMIAPREALKSIHRPTTDSLSP 634

Query: 841  PDPDLSPQECPDHSPENGLVNRFDKRNKSQRPEQERVCLDRTTSQQSSVKDAAVATEMDV 1020
            P      QE  DHS  N  VNRF K N+S R EQE+  L++  S+Q+S+K+ AVATEMDV
Sbjct: 635  PSGSSPLQEYVDHSNSNRAVNRFGKGNRSSRSEQEKAKLEKIASRQNSLKETAVATEMDV 694

Query: 1021 SLNEFSNSCASQDSVEISKE-ARPKINKGGESFFAGLIKKSFRDFKKSNQGVEVSGSQVF 1197
              +++SN+  SQDS + S+E   P+ NKGGESFFA +IKKSFRDF +SNQ  +     V 
Sbjct: 695  --HDYSNTGVSQDSRDASREHDHPRSNKGGESFFANIIKKSFRDFSRSNQNDDCGKINVT 752

Query: 1198 VN 1203
            VN
Sbjct: 753  VN 754


>ref|XP_003553779.1| PREDICTED: uncharacterized protein At5g05190-like [Glycine max]
          Length = 911

 Score =  240 bits (613), Expect = 7e-61
 Identities = 159/421 (37%), Positives = 221/421 (52%), Gaps = 20/421 (4%)
 Frame = +1

Query: 1    PYHEPFPGYYGDVNNDLFMLHRHENFFHQPACSCVHCYDKNWHMPPKVDPPGLHNRRFHN 180
            P H  +PG Y D N D + L+ H    H P CSC HCYD          P    N RF +
Sbjct: 361  PLHPYYPGRYADTNPDSYELYSHNAMLHPPTCSCFHCYDNKRRGSVPAPPASFINSRFPD 420

Query: 181  EPSNPNLHQHVNPILHRPQGNNSGGS----NVHPQQSLTKNSNDRDSENDGFNYHRPRKL 348
             P++P L+ H  P    P  +NS  +      H +Q   + ++D +SE  GF   RPRK+
Sbjct: 421  IPNDPMLYHHEIPGSFGPHVHNSRTAIPPMTYHEKQLHARWASDVNSEMGGFVRSRPRKV 480

Query: 349  VLAHRSGRVSRPVAGGAPFIACSNCFEILKLPKKHVSLEKT-EQKMKCGACSSIILFELG 525
            +LA  S R   PVAGG+PFI+C NCFE+L LPKK + L K  +QK++CGACS+ I F + 
Sbjct: 481  MLASSSQR-CYPVAGGSPFISCHNCFELLLLPKKPLVLVKNHQQKVQCGACSTEISFAVI 539

Query: 526  YKGFIASASAHVDQVPTEIDEGSSGTVDENVRYWNDGSNSANMNTYSNDYDDFQDKFSPT 705
             K  + S +       +  D  S+  V  ++ +     N    N  S+DY  +   F   
Sbjct: 540  NKKLVISPNLETKGASSRGDSSSNEVVSSHMSHSRGHVNRTGANFSSDDYSGYD--FHSV 597

Query: 706  DKKS------NSGESEKQXXXXXXXXXXXXXERGPENIL-------SAKLPLTDVKSLPD 846
            D++       NS +S +              E  PE ++       S   P TD  SL  
Sbjct: 598  DREPFSLVALNSNKSREIPSFHSSSLSTSEDENSPETMIAPREATKSIHRPTTD--SLSP 655

Query: 847  PDLSP-QECPDHSPENGLVNRFDKRNKSQRPEQERVCLDRTTSQQSSVKDAAVATEMDVS 1023
            P  SP QE  D+S  N  VNRF K N+S R EQ++  +D+ +S+Q+S+K+ A+ATEMDV 
Sbjct: 656  PAGSPLQEYFDYSNNNHAVNRFGKGNQSSRSEQDKTKVDKMSSRQNSLKETALATEMDV- 714

Query: 1024 LNEFSNSCASQDSVEISKE-ARPKINKGGESFFAGLIKKSFRDFKKSNQGVEVSGSQVFV 1200
             +++SN+  SQDS + S+E   P+  +GGESFFA +IKKSFRDF  SN   + S   V V
Sbjct: 715  -HDYSNNGVSQDSADASREHYHPRSTRGGESFFANIIKKSFRDFSWSNHTDDRSKISVTV 773

Query: 1201 N 1203
            N
Sbjct: 774  N 774


>gb|ESW19279.1| hypothetical protein PHAVU_006G111100g [Phaseolus vulgaris]
          Length = 909

 Score =  236 bits (603), Expect = 1e-59
 Identities = 160/421 (38%), Positives = 217/421 (51%), Gaps = 20/421 (4%)
 Frame = +1

Query: 1    PYHEPFPGYYGDVNNDLFMLHRHEN-FFHQPACSCVHCYDKNWHMPPKVDPPGLHNRRFH 177
            P H  +PG Y D N D + L+ H N   H P+CSC HCYD          P    N RF 
Sbjct: 357  PLHPYYPGRYVDTNPDSYELYSHNNAMLHPPSCSCFHCYDNKRRGSVPAPPASFINSRFP 416

Query: 178  NEPSNPNLHQHVNPILHRPQGNNSGGS----NVHPQQSLTKNSNDRDSENDGFNYHRPRK 345
            + P++P L  H  P+   PQ +NS  +        +Q   +  +D +SE   F   RPRK
Sbjct: 417  DIPNDPMLFHHDIPVAFGPQVHNSRPAIPPATYREKQLHARWGSDFNSEMGSFVRTRPRK 476

Query: 346  LVLAHRSGRVSRPVAGGAPFIACSNCFEILKLPKKHVSLEKTE-QKMKCGACSSIILFEL 522
            ++LA  S R   PVAGG+PFI+C NC E+L LPKK + L K   QK++CG+CSS I   +
Sbjct: 477  VMLA-ASSRRCYPVAGGSPFISCHNCSELLLLPKKALVLVKNRRQKVQCGSCSSEISLAV 535

Query: 523  GYKGFIASASAHVDQVPTEIDEGSSGTVDENVRYWNDGSNSANMNTYSNDYDDFQDKFSP 702
              K  I S       VP+  D  S+  V   + +     N    N  S+DY  +   F  
Sbjct: 536  INKKLIISPILETKGVPSRGDNSSNEVVSSRMSHSRVHGNRTGANFSSDDYSGYD--FHS 593

Query: 703  TDKKS------NSGESEKQXXXXXXXXXXXXXERGPENIL-------SAKLPLTDVKSLP 843
             D++       NS +S +              E  PE ++       S   P TD  S P
Sbjct: 594  VDREPLSMGALNSNKSLEIPSFRSSSLSTSEDENSPEAMIDPREATKSIHPPTTDSLSPP 653

Query: 844  DPDLSPQECPDHSPENGLVNRFDKRNKSQRPEQERVCLDRTTSQQSSVKDAAVATEMDVS 1023
                  QE  D+S  N  VNRF K N+S R EQE+  +D+ +S+Q+S+K+AA+ATEMDV 
Sbjct: 654  PAGSPLQEYFDYSNNNHAVNRFGKGNQSSRSEQEKTKVDKMSSRQNSLKEAALATEMDV- 712

Query: 1024 LNEFSNSCASQDSVEISKE-ARPKINKGGESFFAGLIKKSFRDFKKSNQGVEVSGSQVFV 1200
             +++SN   SQDS + S+E   P+ NKGGESFFA +IKKSFRDF +SN   + S   + V
Sbjct: 713  -HDYSNIGVSQDSGDASREHYHPRSNKGGESFFANIIKKSFRDFSRSNHTDDRSKISITV 771

Query: 1201 N 1203
            N
Sbjct: 772  N 772


>ref|XP_002303633.2| hypothetical protein POPTR_0003s13750g [Populus trichocarpa]
            gi|550343120|gb|EEE78612.2| hypothetical protein
            POPTR_0003s13750g [Populus trichocarpa]
          Length = 934

 Score =  236 bits (602), Expect = 1e-59
 Identities = 151/427 (35%), Positives = 227/427 (53%), Gaps = 26/427 (6%)
 Frame = +1

Query: 1    PYHEPFPGYYGDVNNDLFMLHRHENFFHQPACSCVHCYDKNWHMPPKVDPPGLHNRRFHN 180
            P H+ F G + D ++   +   +    H PAC C HCY+KNWH+P +  P    N++F  
Sbjct: 387  PPHDHFAGQHVDFSHKPLVSDSYGRSHHGPACPCFHCYNKNWHIPSQASPTTFSNKKFPK 446

Query: 181  EPSNPNLHQHVN-----PILHRPQGNNSGGSNVHPQQSLTKNSNDRDSENDGFNYHRPRK 345
              ++   +QH+N     P+L+ PQ N    S   PQ S  +  +D +S+ DGF    P+K
Sbjct: 447  ASTDFCFNQHINAVTHRPLLYHPQANPPALSPRDPQ-SHVRWPSDVESDMDGFPKSCPKK 505

Query: 346  LVLAHRSGRVSRPVAGGAPFIACSNCFEILKLPKKHVSLEKTEQKMKCGACSSIILFELG 525
            +V+A  + ++ R +AGGAPFI+C NCFE+LKLP+K    EK ++K++CG+CS+ IL E+ 
Sbjct: 506  VVIARGNEQLCRSIAGGAPFISCCNCFELLKLPRKLKVREKNQRKLRCGSCSAFILLEIK 565

Query: 526  YKGFIASASAHVDQVPTEIDEGSSGTVDENVRYWNDGSNSANMNTYSNDYDDFQDKFSPT 705
             K  I S  A   Q+  E   G S      V   +DG  +A   T S+D++D    F   
Sbjct: 566  SKRLITSVPAENKQMLAE--AGISSHEVSKVLLNSDGCLNAGGTTCSDDFEDHGYDFQSA 623

Query: 706  D--------KKSNSGESEKQXXXXXXXXXXXXXERGPENIL-------SAKLPLTD---- 828
            D        +K N+ + EK+             E   ++++       +A+LP+ D    
Sbjct: 624  DFKDVLSEERKLNTSKCEKRQSLASSSSISSEEEENLDSLVVERDFSYAAELPVKDEVPS 683

Query: 829  -VKSLPDPDLSPQECPDHSPENGLVNRFDKRNKSQRPEQERVCLDRTTSQQSSVKDAAVA 1005
              +S P  + S      H+      N+ ++ N+    EQE V L++  SQQSSV + +VA
Sbjct: 684  TFQSSPFQEHSGDVLSSHAE-----NKCEQGNRVGWTEQENVILEKNISQQSSV-NVSVA 737

Query: 1006 TEMDVSLNEFSNSCASQDSVEI-SKEARPKINKGGESFFAGLIKKSFRDFKKSNQGVEVS 1182
            TEM+VS NE+ N+  SQDS E+ ++E + KINKG E F  G IKKSFRDF +SNQ +   
Sbjct: 738  TEMEVSFNEYLNTSVSQDSAEVRNEENQLKINKGSEPFLLGFIKKSFRDFSRSNQHLPNE 797

Query: 1183 GSQVFVN 1203
               V +N
Sbjct: 798  KLNVIIN 804


>emb|CAN76817.1| hypothetical protein VITISV_044118 [Vitis vinifera]
          Length = 913

 Score =  236 bits (602), Expect = 1e-59
 Identities = 154/424 (36%), Positives = 224/424 (52%), Gaps = 23/424 (5%)
 Frame = +1

Query: 1    PYHEPFPGYYGDVNNDLFMLHRHENFFHQPACSCVHCYDKNWHMPPKVDPPGLHNRRFHN 180
            P +  F G Y + N++ +  + H+   H P+CSC  CY ++  +P  +    L NRRF +
Sbjct: 356  PPYAYFSGGYMEPNSNPYEPYPHDPNLHHPSCSCFLCYTRHQQVPGSIPTNALLNRRFPD 415

Query: 181  EPSNPNLHQHVNPILHRPQGNNSGGSNV-----HPQQSLTKNSNDRDSENDGFNYHRPRK 345
             P++P  +   NP+   P+  N   +N      H  QS T+  +D +++   F +H P++
Sbjct: 416  IPNDPMSYHRENPVAFGPRVYNPRTANPPPMPSHDSQSHTRLPSDLNTQTSDFVHHLPQR 475

Query: 346  LVLAHRSGR-VSRPVAGGAPFIACSNCFEILKLPKKHVSLEKTEQKMKCGACSSIILFEL 522
             VL +  GR   RP+AGGAPFI C NC E+L+LPKK + ++K +QK++CGACS+II   +
Sbjct: 476  EVLLN--GRHYCRPLAGGAPFITCCNCCELLRLPKKILLVKKNQQKIRCGACSAIIFLAV 533

Query: 523  GYKGFIASASAHVDQVPTEIDEGSSGTVDENVRYWNDGSNSANMNTYSNDYD----DFQD 690
                 +AS     ++   EID+ ++  VDE     +   N  + N  S+DYD    DFQ 
Sbjct: 534  NRHKIVASIHEETEKTSKEIDDSTNQLVDERPSNSHGHVNQYSENFSSDDYDNSAYDFQS 593

Query: 691  K-----FSPTDKKSNSGESEKQXXXXXXXXXXXXXERGPENILSAK-------LPLTDVK 834
                    PTD+  NS + E+              E   E +++ +        P   V 
Sbjct: 594  MDREAGSVPTDQGLNSRKPER-VQNLHSSPSTPENEGSQEGLIAPREVDNPLEQPKKAVL 652

Query: 835  SLPDPDLSPQECPDHSPENGLVNRFDKRNKSQRPEQERVCLDRTTSQQSSVKDAAVATEM 1014
            S P P  S QE  D+S  N  +NRF   N+S R + E+V   +  S QSSVKD +VATEM
Sbjct: 653  SPPPPGSSLQEHFDYSSNNLALNRFGNGNQSSRSDHEKVIPSKAISXQSSVKDVSVATEM 712

Query: 1015 DVSLNEFSNSCASQDSVEISKE-ARPKINKGGESFFAGLIKKSFRDFKKSNQGVEVSGSQ 1191
            +VS NEFSN+  SQDS + S+E     INKG E F AG+IKK  RD  + NQ +E   + 
Sbjct: 713  EVSFNEFSNTGVSQDSGDASREHDHLGINKGEEPFLAGIIKKDLRDSSRPNQTIEQGRNI 772

Query: 1192 VFVN 1203
            V VN
Sbjct: 773  VMVN 776


>ref|XP_004494805.1| PREDICTED: uncharacterized protein LOC101505003 isoform X1 [Cicer
            arietinum] gi|502113930|ref|XP_004494806.1| PREDICTED:
            uncharacterized protein LOC101505003 isoform X2 [Cicer
            arietinum]
          Length = 901

 Score =  234 bits (596), Expect = 7e-59
 Identities = 164/425 (38%), Positives = 226/425 (53%), Gaps = 24/425 (5%)
 Frame = +1

Query: 1    PYHEPFPGYYGDVNNDLFMLHRHEN--FFHQPACSCVHCYDKNWHMPPKVDPPGLHNRRF 174
            P H  FPG Y D N D + L+ H N    H P+CSC HCYD        V P    N RF
Sbjct: 350  PLHPYFPGRYVDPNPDSYELYAHNNNAMLHLPSCSCFHCYDNKRRGSVPVPPASFVNSRF 409

Query: 175  HNEPSNPNLHQHVNPILHRPQGNNSGGS----NVHPQQSLTKNSNDRDSENDGFNYHRPR 342
             + P +P L+ H  P     + +NS  S    +    QS T+  +D +SE      +RPR
Sbjct: 410  PDAPIDPMLYHHEIPGTFGSRVHNSRASIPPAHFRENQSHTRWPSDFNSE---VVRNRPR 466

Query: 343  KLVLAHRSGRVSRPVAGGAPFIACSNCFEILKLPKKH-VSLEKTEQKMKCGACSSIILFE 519
            K++LA  S R  RPVAGG+PFI C NCF +L+LPKK  V L   +Q+++CGACSS I F 
Sbjct: 467  KVMLASSSRRC-RPVAGGSPFITCHNCFRLLQLPKKALVLLRNHQQRVRCGACSSEISFA 525

Query: 520  LGYKGFIASASAHVDQVPTEI-DEGSSGTVDENVRYWNDGSNSANMNTYSNDYDDFQDKF 696
            +  K  +    +  ++  T + D+ S+  +  +V +    +N +  N  S+DY  +   F
Sbjct: 526  VIDKKLVILPHSETNRASTRVVDDNSNEVLSSHVSHSRGHANRSAANFSSDDYSGYD--F 583

Query: 697  SPTDKKS------NSGESEKQXXXXXXXXXXXXXERGPENIL-------SAKLPLTDVKS 837
               DK+       NS  S++              E   E ++       S   P TD  S
Sbjct: 584  LSVDKEPLSVVGLNSNRSQEMQSFHSSSSSTSEDENSSEVLIAPSEAVKSIHRPTTD--S 641

Query: 838  LPDPDLSP-QECPDHSPENG-LVNRFDKRNKSQRPEQERVCLDRTTSQQSSVKDAAVATE 1011
            L  P  SP QE  DHS  N  +VNRF K N+S R EQE+   ++  S+Q+S+K+ AVATE
Sbjct: 642  LSPPSGSPLQEYVDHSNSNNRVVNRFGKGNRSSRSEQEKAKSEKIASRQNSLKETAVATE 701

Query: 1012 MDVSLNEFSNSCASQDSVEISKE-ARPKINKGGESFFAGLIKKSFRDFKKSNQGVEVSGS 1188
            MDV  +++SN+  SQDS + S+E   P+ NKGGESFF+ +IKKSFRDF +SNQ  +    
Sbjct: 702  MDV--HDYSNTGVSQDSRDASREHDHPRSNKGGESFFSNIIKKSFRDFSRSNQTDDRCKI 759

Query: 1189 QVFVN 1203
             V VN
Sbjct: 760  NVTVN 764

