BLASTX nr result

ID: Ophiopogon22_contig00024367 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ophiopogon22_contig00024367
         (1198 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PKA48832.1| Putative ribonuclease H protein [Apostasia shenzh...   314   3e-97
gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]   323   1e-95
gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]   316   5e-93
ref|XP_017973011.1| PREDICTED: uncharacterized protein LOC108661...   310   1e-91
ref|XP_024047909.1| uncharacterized protein LOC112101466 [Citrus...   312   1e-91
gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]   311   3e-91
gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]   310   5e-91
gb|PKA63448.1| putative mitochondrial protein [Apostasia shenzhe...   290   4e-90
gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]   307   6e-90
gb|PNY17656.1| ribonuclease H, partial [Trifolium pratense]           303   3e-89
gb|PNX80358.1| ribonuclease H, partial [Trifolium pratense]           290   5e-88
gb|PNX97372.1| ribonuclease H, partial [Trifolium pratense]           296   5e-88
gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]   300   2e-87
gb|PKU68054.1| integrator complex subunit 11 [Dendrobium catenatum]   281   2e-87
gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]   299   3e-87
gb|PNX96237.1| ribonuclease H [Trifolium pratense]                    297   5e-87
dbj|GAU18648.1| hypothetical protein TSUD_124810 [Trifolium subt...   298   1e-86
ref|XP_023638968.1| uncharacterized protein LOC111830668 [Capsel...   296   1e-86
gb|PNX97216.1| ribonuclease H, partial [Trifolium pratense]           293   2e-86
gb|KZV36392.1| hypothetical protein F511_03833 [Dorcoceras hygro...   290   2e-86

>gb|PKA48832.1| Putative ribonuclease H protein [Apostasia shenzhenica]
          Length = 717

 Score =  314 bits (805), Expect = 3e-97
 Identities = 171/406 (42%), Positives = 244/406 (60%), Gaps = 9/406 (2%)
 Frame = -3

Query: 1193 WKEVSLQ-------EEIFWKQKSHIKWLQEGDANTRFFHAAVKSKQALARIDSIVDEEGN 1035
            W EV  +       EEIFWKQK+ IKW +EG+ANT+FFH  VK K+    +D ++  +GN
Sbjct: 165  WNEVKAKLQFWYNCEEIFWKQKAAIKWWKEGEANTKFFHNLVKKKRKRLFVDHLMGTDGN 224

Query: 1034 RVHGPENIHGVAINFFQQLFSSERHTVADQFLDLIPALVSEEDNDMLMQPFSLAETLEAV 855
             +   E++    + +F QL SSE  T  D     IP +V++ DN+ L+   +L E  EA+
Sbjct: 225  WITTNEDLETSGVEYFGQLLSSEGCTFTDSDFAHIPNMVTDLDNNTLLSTPTLEEVKEAI 284

Query: 854  SSIPADSAPGIDGFSSAFFTHSWDIVGQLVVDAANEFLKLKHLPKFFTHTCIVLIPKTRN 675
             SI  DSAPG DGF S FF + WDI+   ++ AA+ FL   HL + +T + IVL+PK+  
Sbjct: 285  FSIHKDSAPGPDGFGSGFFQYCWDIIKSDLLQAASAFLSGSHLDRAYTSSLIVLVPKSDE 344

Query: 674  PTTMNDFRPISLCMTFYKIVAKLIGGRLAILLPKLISENQSAFIRGRSIFDNISLAQEIS 495
             +T  DFRPISL     K ++K++  RL  ++  +IS NQS F  GR I DNI LAQE+ 
Sbjct: 345  VSTWKDFRPISLSNVKTKFLSKILVNRLRTVISDIISPNQSGFTPGRDISDNILLAQELF 404

Query: 494  SEVGKREGRPNVILSLDMHKAYDRLEWDFLMSTLHRFGFSLAWREVIFACISSCNFSVMF 315
              + K +   N+ L LDM KAYDR+EW F+M  L +FGFS  +R +I   IS+  FS++ 
Sbjct: 405  HSLNKGKRGGNIALKLDMEKAYDRMEWSFVMQMLTKFGFSPIFRNIISNFISNSWFSLLI 464

Query: 314  QGVMKGFIPPSRGIRQGDPLSPCLFILAEDILSRALDHFIGDSDRFTTTN--STCPSHLL 141
             G   GF   SRG++QGDPLSP LFILA + LSR L+  + ++   +  +  +T   HL 
Sbjct: 465  NGKQTGFFKSSRGLKQGDPLSPILFILASEFLSRGLNALMTNNPAISYYSHCATNIFHLA 524

Query: 140  FADDIILFACAKRRSIMRYMEVLRTYQDSSGHRLNLQKCRFFLPAS 3
            +ADD I+F    ++SI++ ++ L  YQ  SG ++N +K  F  P S
Sbjct: 525  YADDCIIFCNGAKKSIVKVLDFLNRYQTCSGQKINKEKSSFICPKS 570


>gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
          Length = 2214

 Score =  323 bits (829), Expect = 1e-95
 Identities = 169/387 (43%), Positives = 237/387 (61%), Gaps = 2/387 (0%)
 Frame = -3

Query: 1172 EEIFWKQKSHIKWLQEGDANTRFFHAAVKSKQALARIDSIVDEEGNRVHGPENIHGVAIN 993
            EE+FW+QKS +KWL EG+ NT+FFH  ++ K+    I  I D+EGN +  P  I    + 
Sbjct: 1182 EELFWQQKSGVKWLVEGERNTKFFHMRMRKKRMRNHIFRIQDQEGNVLEEPHLIQNSGVE 1241

Query: 992  FFQQLFSSERHTVADQFLDLIPALVSEEDNDMLMQPFSLAETLEAVSSIPADSAPGIDGF 813
            FFQ L  +E+  ++     + P ++S  DN+ L    SL E  EAV +I  DS  G DGF
Sbjct: 1242 FFQNLLKAEQCDISRFDPSITPRIISTTDNEFLCATPSLQEVKEAVFNINKDSVAGPDGF 1301

Query: 812  SSAFFTHSWDIVGQLVVDAANEFLKLKHLPKFFTHTCIVLIPKTRNPTTMNDFRPISLCM 633
            SS F+ H WDI+ Q + +A  +F K   LP+  T T +VL+PKT+N +  ++FRPISLC 
Sbjct: 1302 SSLFYQHCWDIIKQDLFEAVLDFFKGSPLPRGITSTTLVLLPKTQNVSQWSEFRPISLCT 1361

Query: 632  TFYKIVAKLIGGRLAILLPKLISENQSAFIRGRSIFDNISLAQEISSEVGKREGRPNVIL 453
               KIV KL+  RL+ +LP +ISENQS F+ GR I DNI LAQE+  ++  R    NV+L
Sbjct: 1362 VLNKIVTKLLANRLSKILPSIISENQSGFVNGRLISDNILLAQELVDKINARSRGGNVVL 1421

Query: 452  SLDMHKAYDRLEWDFLMSTLHRFGFSLAWREVIFACISSCNFSVMFQGVMKGFIPPSRGI 273
             LDM KAYDRL W+FL   + +FGF+  W  +I ACIS+C FS++  G + G+    RG+
Sbjct: 1422 KLDMAKAYDRLNWEFLYLMMEQFGFNALWINMIKACISNCWFSLLINGSLVGYFKSERGL 1481

Query: 272  RQGDPLSPCLFILAEDILSRALDHFIG--DSDRFTTTNSTCPSHLLFADDIILFACAKRR 99
            RQGD +SP LFILA + LSR L+      +S  + +  S   SHL FADDI++F      
Sbjct: 1482 RQGDSISPSLFILAAEYLSRGLNQLFSRYNSLHYLSGCSMSVSHLAFADDIVIFTNGCHS 1541

Query: 98   SIMRYMEVLRTYQDSSGHRLNLQKCRF 18
            ++ + +  L+ Y+  SG ++N QK  F
Sbjct: 1542 ALQKILVFLQEYEQVSGQQVNHQKSCF 1568


>gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
          Length = 2127

 Score =  316 bits (809), Expect = 5e-93
 Identities = 167/387 (43%), Positives = 233/387 (60%), Gaps = 2/387 (0%)
 Frame = -3

Query: 1172 EEIFWKQKSHIKWLQEGDANTRFFHAAVKSKQALARIDSIVDEEGNRVHGPENIHGVAIN 993
            EE+FW+QKS +KWL EG+ NT+FFH  ++ K+  + I  I D EGN      +I   A +
Sbjct: 1095 EELFWQQKSGVKWLVEGENNTKFFHMRMRKKRVRSHIFQIQDSEGNVFDDIHSIQKSATD 1154

Query: 992  FFQQLFSSERHTVADQFLDLIPALVSEEDNDMLMQPFSLAETLEAVSSIPADSAPGIDGF 813
            FF+ L  +E   ++     LIP ++S  DN+ L     L E  EAV +I  DS  G DGF
Sbjct: 1155 FFRDLMQAENCDLSRFDPSLIPRIISSADNEFLCAAPPLQEIKEAVFNINKDSVAGPDGF 1214

Query: 812  SSAFFTHSWDIVGQLVVDAANEFLKLKHLPKFFTHTCIVLIPKTRNPTTMNDFRPISLCM 633
            SS F+ H WDI+   ++DA  +F +   LP+  T T +VL+PK  N    +++RPISLC 
Sbjct: 1215 SSLFYQHCWDIIKNDLLDAVLDFFRGSPLPRGVTSTTLVLLPKKPNACHWSEYRPISLCT 1274

Query: 632  TFYKIVAKLIGGRLAILLPKLISENQSAFIRGRSIFDNISLAQEISSEVGKREGRPNVIL 453
               KIV KL+  RL+ +LP +ISENQS F+ GR I DNI LAQE+  ++  +    NV+L
Sbjct: 1275 VLNKIVTKLLANRLSKILPSIISENQSGFVNGRLISDNILLAQELIGKIDAKSRGGNVVL 1334

Query: 452  SLDMHKAYDRLEWDFLMSTLHRFGFSLAWREVIFACISSCNFSVMFQGVMKGFIPPSRGI 273
             LDM KAYDRL WDFL   +  FGF+  W  +I +CIS+C FS++  G + G+    RG+
Sbjct: 1335 KLDMAKAYDRLNWDFLYLMMEHFGFNAHWINMIKSCISNCWFSLLINGSLAGYFKSERGL 1394

Query: 272  RQGDPLSPCLFILAEDILSRALDHFIG--DSDRFTTTNSTCPSHLLFADDIILFACAKRR 99
            RQGD +SP LFILA D LSR L+H      S ++ +      SHL FADDI++F    R 
Sbjct: 1395 RQGDSISPMLFILAADYLSRGLNHLFSCYSSLQYLSGCQMPISHLSFADDIVIFTNGGRS 1454

Query: 98   SIMRYMEVLRTYQDSSGHRLNLQKCRF 18
            ++ + +  L+ Y+  SG ++N QK  F
Sbjct: 1455 ALQKILSFLQEYEQVSGQKVNHQKSCF 1481


>ref|XP_017973011.1| PREDICTED: uncharacterized protein LOC108661357 [Theobroma cacao]
          Length = 1329

 Score =  310 bits (795), Expect = 1e-91
 Identities = 167/387 (43%), Positives = 228/387 (58%), Gaps = 2/387 (0%)
 Frame = -3

Query: 1172 EEIFWKQKSHIKWLQEGDANTRFFHAAVKSKQALARIDSIVDEEGNRVHGPENIHGVAIN 993
            EE+FWKQKS IKWL EG+ NT+FFH  VK K+  + I  I + +G+ +  P  +    + 
Sbjct: 401  EELFWKQKSGIKWLVEGERNTKFFHMRVKKKRIKSHIFKIQNSDGSWIKEPNVVKSSPVE 460

Query: 992  FFQQLFSSERHTVADQFLDLIPALVSEEDNDMLMQPFSLAETLEAVSSIPADSAPGIDGF 813
            FF  L   E   ++     LIP ++S+ DN  L    S  E  EAV +I  DS  G DGF
Sbjct: 461  FFSSLMKKEPCNMSRFDASLIPTIISDNDNLSLCAEPSKEELKEAVFNIDKDSVAGPDGF 520

Query: 812  SSAFFTHSWDIVGQLVVDAANEFLKLKHLPKFFTHTCIVLIPKTRNPTTMNDFRPISLCM 633
            SS F+   WDIV   +++A  +F     LP+  T T IVL+PK  N +T +DFRPISLC 
Sbjct: 521  SSYFYQQCWDIVANDLLEAVVDFFHGADLPRGITSTTIVLLPKNHNASTWSDFRPISLCN 580

Query: 632  TFYKIVAKLIGGRLAILLPKLISENQSAFIRGRSIFDNISLAQEISSEVGKREGRPNVIL 453
               KI+ K++  RLA +LP +I++NQS F+ GR I DNI LAQE+  ++ ++    N+ L
Sbjct: 581  VLNKIITKILVNRLAKVLPSVITDNQSGFVGGRLISDNILLAQELIGKIDRKSRGGNIAL 640

Query: 452  SLDMHKAYDRLEWDFLMSTLHRFGFSLAWREVIFACISSCNFSVMFQGVMKGFIPPSRGI 273
             LDM KAYDRLEWDFL   L + GF+  W  +I  CIS+C FS++  G   GF    RG+
Sbjct: 641  KLDMMKAYDRLEWDFLFRMLEQLGFNSQWISMIRRCISNCWFSLLINGGAVGFFKSERGL 700

Query: 272  RQGDPLSPCLFILAEDILSRALDHFIGD--SDRFTTTNSTCPSHLLFADDIILFACAKRR 99
            RQGD +SP LFILA D LSR L+       S  + +  S   SHL FADDI++F    + 
Sbjct: 701  RQGDSISPILFILAADYLSRGLNALFAQYPSLHYASDCSMLVSHLAFADDILIFTNGAKS 760

Query: 98   SIMRYMEVLRTYQDSSGHRLNLQKCRF 18
            S+ + +  L+ Y++ S  R+N  K  F
Sbjct: 761  SLQKILSFLQEYEEISRQRINHSKSCF 787


>ref|XP_024047909.1| uncharacterized protein LOC112101466 [Citrus clementina]
          Length = 1651

 Score =  312 bits (799), Expect = 1e-91
 Identities = 164/401 (40%), Positives = 244/401 (60%), Gaps = 5/401 (1%)
 Frame = -3

Query: 1190 KEVSLQEEIFWKQKSHIKWLQEGDANTRFFHAAVKSKQALARIDSIVDEEGNR-VHGPEN 1014
            ++  L+EEIF +Q+S ++W++EGD+NTRFFHA  + K+ +  +  I D+  +  +  P  
Sbjct: 612  QQALLREEIFMRQQSSVRWVREGDSNTRFFHAMFRKKRQIFHVHRIRDDSSSEWITDPSA 671

Query: 1013 IHGVAINFFQQLFSSERHTVADQFLDLIPALVSEEDNDMLMQPFSLAETLEAVSSIPADS 834
            +   A+ F++ L S +         D IP LV+ ED+ +L +   + +   AV SI  +S
Sbjct: 672  VATSAVGFYRGLLSGDAGQFQQADFDTIPTLVTAEDDVVLCREPDIDDVRRAVFSIDPES 731

Query: 833  APGIDGFSSAFFTHSWDIVGQLVVDAANEFLKLKHLPKFFTHTCIVLIPKTRNPTTMNDF 654
            APG DGF S F+   WDIVG+ ++DA  ++ +   +P+ F  T +VL+PK  +P++  DF
Sbjct: 732  APGPDGFCSRFYQVCWDIVGRDLLDAVLDYFRGSAMPRGFQSTLLVLLPKKESPSSWADF 791

Query: 653  RPISLCMTFYKIVAKLIGGRLAILLPKLISENQSAFIRGRSIFDNISLAQEISSEVGKRE 474
            RPISLC    K++ KL+  RL+ +LP++IS  QS F+ GR I DN+ L QE++ ++ +R 
Sbjct: 792  RPISLCNVSNKVITKLLVQRLSTILPRIISPTQSGFVPGRVIHDNVLLVQELTHDLNRRT 851

Query: 473  GRPNVILSLDMHKAYDRLEWDFLMSTLHRFGFSLAWREVIFACISSCNFSVMFQGVMKGF 294
               NV+L LDM KAYDR+ W F++  L  FGFS  W  +I   +    FSV+  G + G+
Sbjct: 852  RGNNVVLKLDMEKAYDRMSWPFILQMLRCFGFSERWISLIRRAVYGPWFSVLVNGAIHGY 911

Query: 293  IPPSRGIRQGDPLSPCLFILAEDILSRALDHFIG--DSDRFTTTNSTCPSHLLFADDIIL 120
             P  RG+RQGDP+SPCLFI+A + LSR L H      S R+ +  ST  SHL FADDI++
Sbjct: 912  FPSERGLRQGDPISPCLFIIAAEFLSRGLVHLYSRYPSVRYRSAASTDISHLSFADDIVI 971

Query: 119  FACAKRRSIMRYMEVLRTYQDSSGHRLNLQKCRFFL--PAS 3
            FA   R S+ R M+ L  YQ  SG  ++  K  F++  PAS
Sbjct: 972  FANGSRCSLQRVMDFLHRYQVVSGQLISRTKSSFYIGKPAS 1012


>gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]
          Length = 1954

 Score =  311 bits (796), Expect = 3e-91
 Identities = 167/387 (43%), Positives = 232/387 (59%), Gaps = 2/387 (0%)
 Frame = -3

Query: 1172 EEIFWKQKSHIKWLQEGDANTRFFHAAVKSKQALARIDSIVDEEGNRVHGPENIHGVAIN 993
            EE+FW+QKS +KWL EG+ NT+FFH  ++ K+    I  I D EGN    P+ I   A+ 
Sbjct: 921  EELFWQQKSGVKWLVEGERNTKFFHLRMRKKRVRNNIFRIQDSEGNIYEDPQYIQNSAVQ 980

Query: 992  FFQQLFSSERHTVADQFLDLIPALVSEEDNDMLMQPFSLAETLEAVSSIPADSAPGIDGF 813
            +FQ L ++E+   +     LIP  +S  DN+ L    SL E  E V +I  DS  G DGF
Sbjct: 981  YFQNLLTAEQCDFSRFDPSLIPRTISITDNEFLCAAPSLKEIKEVVFNIDKDSVAGPDGF 1040

Query: 812  SSAFFTHSWDIVGQLVVDAANEFLKLKHLPKFFTHTCIVLIPKTRNPTTMNDFRPISLCM 633
            SS F+ H WDI+ Q +++A  +F     +P+  T T +VL+PK  N    +DFRPISLC 
Sbjct: 1041 SSLFYQHCWDIIKQDLLEAVLDFFNGTPMPQGVTSTTLVLLPKKPNSCQWSDFRPISLCT 1100

Query: 632  TFYKIVAKLIGGRLAILLPKLISENQSAFIRGRSIFDNISLAQEISSEVGKREGRPNVIL 453
               KIV K +  RL+ +LP +ISENQS F+ GR I DNI LAQE+  ++  +    NV+L
Sbjct: 1101 VLNKIVTKTLANRLSKILPSIISENQSGFVNGRLISDNILLAQELVGKLDAKARGGNVVL 1160

Query: 452  SLDMHKAYDRLEWDFLMSTLHRFGFSLAWREVIFACISSCNFSVMFQGVMKGFIPPSRGI 273
             LDM KAYDRL WDFL   + +FGF+  W  +I ACIS+C FS++  G + G+    RG+
Sbjct: 1161 KLDMAKAYDRLNWDFLYLMMKQFGFNDRWISMIKACISNCWFSLLINGSLVGYFKSERGL 1220

Query: 272  RQGDPLSPCLFILAEDILSRALDH-FIGDSDRFTTTNSTCP-SHLLFADDIILFACAKRR 99
            RQGD +SP LF+LA D LSR ++  F         +    P SHL FADDI++F    R 
Sbjct: 1221 RQGDSISPLLFVLAADYLSRGINQLFNRHKSLLYLSGCFMPISHLAFADDIVIFTNGCRP 1280

Query: 98   SIMRYMEVLRTYQDSSGHRLNLQKCRF 18
            ++ + +  L+ Y++ SG ++N QK  F
Sbjct: 1281 ALQKILVFLQEYEEVSGQQVNHQKSCF 1307


>gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
          Length = 2215

 Score =  310 bits (794), Expect = 5e-91
 Identities = 165/390 (42%), Positives = 236/390 (60%), Gaps = 2/390 (0%)
 Frame = -3

Query: 1190 KEVSLQEEIFWKQKSHIKWLQEGDANTRFFHAAVKSKQALARIDSIVDEEGNRVHGPENI 1011
            K++S+ EEIFWKQKS +KW+ EG+ NT+FFH  ++ K+  + I  I +++GN +  PE +
Sbjct: 1176 KQLSM-EEIFWKQKSGVKWVVEGERNTKFFHMRMQKKRIRSHIFKIQEQDGNWIEDPEQL 1234

Query: 1010 HGVAINFFQQLFSSERHTVADQFLDLIPALVSEEDNDMLMQPFSLAETLEAVSSIPADSA 831
               AI+FF  L  +E          L P+++S+ DN  L    +L E  EAV  I  +SA
Sbjct: 1235 QQSAIDFFSSLLKAESCDDTRFQSSLCPSIISDTDNGFLCAEPTLQEVKEAVFGIDPESA 1294

Query: 830  PGIDGFSSAFFTHSWDIVGQLVVDAANEFLKLKHLPKFFTHTCIVLIPKTRNPTTMNDFR 651
             G DGFSS F+   WDI+   + +A  EF     +P+  T T +VLIPKT + +  ++FR
Sbjct: 1295 AGPDGFSSHFYQQCWDIIAHDLFEAVKEFFHGADIPQGMTSTTLVLIPKTTSASKWSEFR 1354

Query: 650  PISLCMTFYKIVAKLIGGRLAILLPKLISENQSAFIRGRSIFDNISLAQEISSEVGKREG 471
            PISLC    KI+ K++  RLA +LP +I+ENQS F+ GR I DNI LAQE+  ++ ++  
Sbjct: 1355 PISLCTVMNKIITKILANRLAKILPSIITENQSGFVGGRLISDNILLAQELIGKLDQKNR 1414

Query: 470  RPNVILSLDMHKAYDRLEWDFLMSTLHRFGFSLAWREVIFACISSCNFSVMFQGVMKGFI 291
              NV L LDM KAYDRL+W FL   L   GF+  W  +I  CIS+C FS++  G   G+ 
Sbjct: 1415 GGNVALKLDMMKAYDRLDWSFLFKVLQHLGFNAQWIGMIQKCISNCWFSLLLNGRTVGYF 1474

Query: 290  PPSRGIRQGDPLSPCLFILAEDILSRALDHFIGD--SDRFTTTNSTCPSHLLFADDIILF 117
               RG+RQGD +SP LFILA + L+R L+       S  +++  S   SHL FADD+I+F
Sbjct: 1475 KSERGLRQGDSISPQLFILAAEYLARGLNALYDQYPSLHYSSGCSLSVSHLAFADDVIIF 1534

Query: 116  ACAKRRSIMRYMEVLRTYQDSSGHRLNLQK 27
            A   + ++ + M  L+ Y+  SG R+N QK
Sbjct: 1535 ANGSKSALQKIMAFLQEYEKLSGQRINPQK 1564


>gb|PKA63448.1| putative mitochondrial protein [Apostasia shenzhenica]
          Length = 516

 Score =  290 bits (743), Expect = 4e-90
 Identities = 161/381 (42%), Positives = 222/381 (58%), Gaps = 2/381 (0%)
 Frame = -3

Query: 1154 QKSHIKWLQEGDANTRFFHAAVKSKQALARIDSIVDEEGNRVHGPENIHGVAINFFQQLF 975
            +K+ +KW +EGD+NT+FFH  VK K+    ID + +EEG+ +  PE+I      F+  L 
Sbjct: 20   KKAAVKWWKEGDSNTKFFHNTVKKKRKKLNIDKLRNEEGSWISSPEDIENSGTTFYSNLL 79

Query: 974  SSERHTVADQFLDLIPALVSEEDNDMLMQPFSLAETLEAVSSIPADSAPGIDGFSSAFFT 795
             +E    +D     IP ++   DN+ML    +L E  +AV SI  DSAPG DGF+S FF 
Sbjct: 80   KTEGCHFSDTDFHFIPNIIVVHDNEMLTAVPNLDEVKDAVFSIHKDSAPGPDGFNSGFFQ 139

Query: 794  HSWDIVGQLVVDAANEFLKLKHLPKFFTHTCIVLIPKTRNPTTMNDFRPISLCMTFYKIV 615
              WDIV   +++AA  F    HL K +T T IVLIPK+    T  D RPISLC    K +
Sbjct: 140  CCWDIVKDDLLNAARGFFSGYHLDKAYTSTFIVLIPKSNEVNTWKDLRPISLCNVKMKFL 199

Query: 614  AKLIGGRLAILLPKLISENQSAFIRGRSIFDNISLAQEISSEVGKREGRPNVILSLDMHK 435
            +K++  RL+   PK+IS NQ+ F+ GR I +NI LAQE+   +       N+ L LDM K
Sbjct: 200  SKILVKRLSYFFPKIISPNQTGFVPGRGIIENILLAQEVFRTINFDVRGRNIALKLDMEK 259

Query: 434  AYDRLEWDFLMSTLHRFGFSLAWREVIFACISSCNFSVMFQGVMKGFIPPSRGIRQGDPL 255
            AYDR+EW+F++  L++FGFS  +  +I   IS+  FS++  G   GF   SRG+RQGDP+
Sbjct: 260  AYDRVEWNFIIKMLNKFGFSNNFCNLINNLISNVWFSILINGRSTGFFNSSRGLRQGDPI 319

Query: 254  SPCLFILAEDILSRALDHFIGD--SDRFTTTNSTCPSHLLFADDIILFACAKRRSIMRYM 81
            SP LFILA + LSR L+  + +  S R+ T      SHL +ADD I+F    R  I    
Sbjct: 320  SPLLFILASEFLSRGLNQIMNNKPSFRYFTHCDMIISHLAYADDCIIFCNGSRNVINGIR 379

Query: 80   EVLRTYQDSSGHRLNLQKCRF 18
            + L  YQ  SG ++N  K  F
Sbjct: 380  DFLNCYQRCSGQKINKGKSGF 400


>gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
          Length = 2249

 Score =  307 bits (786), Expect = 6e-90
 Identities = 158/387 (40%), Positives = 236/387 (60%), Gaps = 2/387 (0%)
 Frame = -3

Query: 1172 EEIFWKQKSHIKWLQEGDANTRFFHAAVKSKQALARIDSIVDEEGNRVHGPENIHGVAIN 993
            EE+FWKQKS +KW+ EG+ NT+FFH  ++ K+  + I  + D EG  +   E +   AI 
Sbjct: 1216 EELFWKQKSGVKWVVEGERNTKFFHMRMQKKRIRSHIFKVQDPEGRWIEDQEQLKHSAIE 1275

Query: 992  FFQQLFSSERHTVADQFLDLIPALVSEEDNDMLMQPFSLAETLEAVSSIPADSAPGIDGF 813
            +F  L   E    +     LIP+++S  +N++L    SL E  +AV  I ++SA G DGF
Sbjct: 1276 YFSSLLKVEPCYDSRFQSSLIPSIISNSENELLCAEPSLQEVKDAVFGINSESAAGPDGF 1335

Query: 812  SSAFFTHSWDIVGQLVVDAANEFLKLKHLPKFFTHTCIVLIPKTRNPTTMNDFRPISLCM 633
            SS F+   W+I+ Q ++DA  +F    ++P+  T T ++L+PK  + +  +DFRPISLC 
Sbjct: 1336 SSYFYQQCWNIIAQDLLDAVRDFFHGANIPRGVTSTTLILLPKKSSASKWSDFRPISLCT 1395

Query: 632  TFYKIVAKLIGGRLAILLPKLISENQSAFIRGRSIFDNISLAQEISSEVGKREGRPNVIL 453
               KI+ KL+  RLA +LP +I+ENQS F+ GR I DNI LAQE+  ++  +    N+ L
Sbjct: 1396 VMNKIITKLLSNRLAKVLPSIITENQSGFVGGRLISDNILLAQELIGKLNTKSRGGNLAL 1455

Query: 452  SLDMHKAYDRLEWDFLMSTLHRFGFSLAWREVIFACISSCNFSVMFQGVMKGFIPPSRGI 273
             LDM KAYD+L+W FL   L  FGF+  W ++I  CIS+C FS++  G  +G+    RG+
Sbjct: 1456 KLDMMKAYDKLDWSFLFKVLQHFGFNGQWIKMIQKCISNCWFSLLLNGRTEGYFKSERGL 1515

Query: 272  RQGDPLSPCLFILAEDILSRALDHFIGD--SDRFTTTNSTCPSHLLFADDIILFACAKRR 99
            RQGD +SP LFI+A + LSR L+       S  +++  S   SHL FADD+++F    + 
Sbjct: 1516 RQGDSISPQLFIIAAEYLSRGLNALYDQYPSLHYSSGVSISVSHLAFADDVLIFTNGSKS 1575

Query: 98   SIMRYMEVLRTYQDSSGHRLNLQKCRF 18
            ++ R +  L+ YQ+ SG R+N+QK  F
Sbjct: 1576 ALQRILAFLQEYQEISGQRINVQKSCF 1602


>gb|PNY17656.1| ribonuclease H, partial [Trifolium pratense]
          Length = 1291

 Score =  303 bits (776), Expect = 3e-89
 Identities = 158/393 (40%), Positives = 229/393 (58%), Gaps = 5/393 (1%)
 Frame = -3

Query: 1178 LQEEIFWKQKSHIKWLQEGDANTRFFHAAVKSKQALARIDSIVDEEGNRVHGPENIHGVA 999
            +QEE +WKQ++ + WL+EGD NT+FFH +  ++Q   RID +V+E    V     + GV 
Sbjct: 634  IQEEAYWKQRAKMHWLKEGDLNTKFFHMSASARQRTKRIDRLVNEANVEVKTQAELCGVV 693

Query: 998  INFFQQLFSSERHTVADQFLDLIPALVSEEDNDMLMQPFSLAETLEAVSSIPADSAPGID 819
             ++F QLF + R T  D  L LI   +++EDND L+ P +  E  +A+  +  D APG D
Sbjct: 694  QSYFDQLFRA-RATYHDPILSLISPKITQEDNDRLVAPITREELKDALFHMHPDKAPGPD 752

Query: 818  GFSSAFFTHSWDIVGQLVVDAANEFLKLKHLPKFFTHTCIVLIPKTRNPTTMNDFRPISL 639
            GF+ AF+ H W++ G  + +AA E+L+  + P     T I LIPK  NP +M D RPISL
Sbjct: 753  GFNPAFYQHFWELCGNDIYEAAVEWLERGYFPSSLNETNICLIPKCENPVSMKDMRPISL 812

Query: 638  CMTFYKIVAKLIGGRLAILLPKLISENQSAFIRGRSIFDNISLAQEISSEVGKREG--RP 465
            C   YK+++KL+  RL + L K +SE QSAFI GRSI DN+ +A E+   + +R    + 
Sbjct: 813  CNVLYKMISKLLASRLRVCLDKCVSEEQSAFIEGRSILDNVLIATEVIHALKRRTKGLKG 872

Query: 464  NVILSLDMHKAYDRLEWDFLMSTLHRFGFSLAWREVIFACISSCNFSVMFQGVMKGFIPP 285
             + L +D  KAYD+++W F+   L + GF+  W   +  C+SS N+SV+      G + P
Sbjct: 873  ELALKIDFSKAYDKVDWGFMRGMLEKLGFADKWIHWMMLCVSSVNYSVLMNFEKVGPVYP 932

Query: 284  SRGIRQGDPLSPCLFILAEDILSRALDHFIGDSDRF---TTTNSTCPSHLLFADDIILFA 114
             RG+RQGDPLSP LFIL  + L+  L   +   D         +   SHLLFADD  LF 
Sbjct: 933  GRGLRQGDPLSPYLFILVTEGLTSLLKKSVSRGDLHGVQICRGAPTVSHLLFADDCFLFC 992

Query: 113  CAKRRSIMRYMEVLRTYQDSSGHRLNLQKCRFF 15
             A        M++L+TY+++SG  +NL K   F
Sbjct: 993  RANLNETNHLMQILKTYEEASGQEINLSKSEVF 1025


>gb|PNX80358.1| ribonuclease H, partial [Trifolium pratense]
          Length = 700

 Score =  290 bits (742), Expect = 5e-88
 Identities = 156/393 (39%), Positives = 227/393 (57%), Gaps = 5/393 (1%)
 Frame = -3

Query: 1178 LQEEIFWKQKSHIKWLQEGDANTRFFHAAVKSKQALARIDSIVDEEGNRVHGPENIHGVA 999
            +QEEI+WKQ++ + WL+EGD NT+FFH +  ++Q   +I+ +V+ +   V     I  VA
Sbjct: 180  IQEEIYWKQRAKMHWLKEGDMNTKFFHMSASTRQRKKKIEKLVNADNIEVKTQTEICAVA 239

Query: 998  INFFQQLFSSERHTVADQFLDLIPALVSEEDNDMLMQPFSLAETLEAVSSIPADSAPGID 819
             N+F  LF +   TV D  L L+   ++ EDN+ L+ P +  E  EA+  +  D +PG D
Sbjct: 240  KNYFDHLFRANA-TVQDPILSLVSPKITLEDNERLVAPITKDELKEALFQMHPDKSPGPD 298

Query: 818  GFSSAFFTHSWDIVGQLVVDAANEFLKLKHLPKFFTHTCIVLIPKTRNPTTMNDFRPISL 639
            GF+ AF+ H WD+    + +AA E+L+  + P     T I LIPK  +P +M D RPISL
Sbjct: 299  GFNPAFYQHFWDLCSNDIYEAAKEWLERGYFPSSLNETNICLIPKCESPRSMKDMRPISL 358

Query: 638  CMTFYKIVAKLIGGRLAILLPKLISENQSAFIRGRSIFDNISLAQEISSEVGKR-EGRP- 465
            C   YK+V+KL+  RL   L + +SE QSAF+ GRSI DN  +A E+   + +R  GR  
Sbjct: 359  CNVLYKMVSKLLANRLKECLDRCVSEEQSAFVEGRSIVDNALIAIEVIHALKRRTRGRKG 418

Query: 464  NVILSLDMHKAYDRLEWDFLMSTLHRFGFSLAWREVIFACISSCNFSVMFQGVMKGFIPP 285
             + L +D+ KAYD+++W F+   L R GF+  W   +  C+SS N+SV+      G I P
Sbjct: 419  ELALKIDISKAYDKVDWGFMRGMLERLGFANKWIHWMMLCVSSVNYSVLVNFEKVGPIFP 478

Query: 284  SRGIRQGDPLSPCLFILAEDILSRALDHFIGDSDRF---TTTNSTCPSHLLFADDIILFA 114
             RG+RQGDPLSP LFIL  + L+  + + +   D         +   SHLLFADD  LF 
Sbjct: 479  GRGLRQGDPLSPYLFILVTEGLTTLIKNSVAKGDLHGIQICRGAPTVSHLLFADDCFLFC 538

Query: 113  CAKRRSIMRYMEVLRTYQDSSGHRLNLQKCRFF 15
             A        M++L TY+++SG  +NL K   F
Sbjct: 539  RATLDETNHLMKILNTYEEASGQEINLTKSEVF 571


>gb|PNX97372.1| ribonuclease H, partial [Trifolium pratense]
          Length = 1023

 Score =  296 bits (759), Expect = 5e-88
 Identities = 160/393 (40%), Positives = 228/393 (58%), Gaps = 5/393 (1%)
 Frame = -3

Query: 1178 LQEEIFWKQKSHIKWLQEGDANTRFFHAAVKSKQALARIDSIVDEEGNRVHGPENIHGVA 999
            +QE+ +WKQ++ + WLQEGD NT+FFH +  ++Q   +ID +V+E    V     I  VA
Sbjct: 473  VQEDTYWKQRAKMHWLQEGDLNTKFFHMSASARQKSKKIDKLVNEVNIEVRTQSGICEVA 532

Query: 998  INFFQQLFSSERHTVADQFLDLIPALVSEEDNDMLMQPFSLAETLEAVSSIPADSAPGID 819
            +N+F QLF +   T  D  L LI   +++EDND L+ P +  E  +A+  +  D APG D
Sbjct: 533  LNYFDQLFRANA-TNYDSILSLITPKITQEDNDRLVAPITREELKDALFHMHPDKAPGPD 591

Query: 818  GFSSAFFTHSWDIVGQLVVDAANEFLKLKHLPKFFTHTCIVLIPKTRNPTTMNDFRPISL 639
            GF+ AF+   WD+    + +AA E+L+  + P     T I LIPK  NP TM D RPISL
Sbjct: 592  GFNPAFYQQFWDLCSNDIYEAAKEWLERGYFPSSLNETNICLIPKCDNPVTMKDLRPISL 651

Query: 638  CMTFYKIVAKLIGGRLAILLPKLISENQSAFIRGRSIFDNISLAQEISSEVGKREGRPN- 462
            C   YK+++KL+  RL   L K +SE QSAFI GRSI DN  +A E+   + ++    N 
Sbjct: 652  CNVLYKMISKLLANRLKACLEKCVSEEQSAFIEGRSIIDNALVAIEVLHALKRKTRGMNG 711

Query: 461  -VILSLDMHKAYDRLEWDFLMSTLHRFGFSLAWREVIFACISSCNFSVMFQGVMKGFIPP 285
             + L +D+ KAYD+++W FL   L R GF+  W   +  C+SS N+SV+      G I P
Sbjct: 712  ELALKIDISKAYDKVDWGFLRGMLERLGFANRWIHWMMLCVSSVNYSVLVNFDKVGPIFP 771

Query: 284  SRGIRQGDPLSPCLFILAEDILSRALDHFI--GDSDRFTTTNSTCP-SHLLFADDIILFA 114
             RG+RQGDPLSP LFIL  + L+  + + +  GD           P SHLLFADD  LF 
Sbjct: 772  GRGLRQGDPLSPYLFILVTEGLTTLIKNSVIKGDLHGVKVCRGAPPVSHLLFADDCFLFC 831

Query: 113  CAKRRSIMRYMEVLRTYQDSSGHRLNLQKCRFF 15
             +        M++LRTY++++G  +N+ K   F
Sbjct: 832  RSNLSETNHLMQILRTYENATGQEINMTKSEVF 864


>gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
          Length = 2251

 Score =  300 bits (767), Expect = 2e-87
 Identities = 154/387 (39%), Positives = 234/387 (60%), Gaps = 2/387 (0%)
 Frame = -3

Query: 1172 EEIFWKQKSHIKWLQEGDANTRFFHAAVKSKQALARIDSIVDEEGNRVHGPENIHGVAIN 993
            EEIFWKQKS +KW+ EG+ NT+FFH  ++ K+  + I  + + +G  +   E +   AI 
Sbjct: 1218 EEIFWKQKSGVKWVVEGERNTKFFHMRMQKKRIRSHIFKVQEPDGRWIEDQEQLKQSAIE 1277

Query: 992  FFQQLFSSERHTVADQFLDLIPALVSEEDNDMLMQPFSLAETLEAVSSIPADSAPGIDGF 813
            +F  L  +E   ++     LIP+++S  +N++L    +L E  +AV  I  +SA G DGF
Sbjct: 1278 YFSSLLKAEPCDISRFQNSLIPSIISNSENELLCAEPNLQEVKDAVFDIDPESAAGPDGF 1337

Query: 812  SSAFFTHSWDIVGQLVVDAANEFLKLKHLPKFFTHTCIVLIPKTRNPTTMNDFRPISLCM 633
            SS F+   W+ +   ++DA  +F    ++P+  T T +VL+PK  + +  ++FRPISLC 
Sbjct: 1338 SSYFYQQCWNTIAHDLLDAVRDFFHGANIPRGVTSTTLVLLPKKSSASKWSEFRPISLCT 1397

Query: 632  TFYKIVAKLIGGRLAILLPKLISENQSAFIRGRSIFDNISLAQEISSEVGKREGRPNVIL 453
               KI+ KL+  RLA +LP +I+ENQS F+ GR I DNI LAQE+  ++  +    N+ L
Sbjct: 1398 VMNKIITKLLSNRLAKILPSIITENQSGFVGGRLISDNILLAQELIRKLDTKSRGGNLAL 1457

Query: 452  SLDMHKAYDRLEWDFLMSTLHRFGFSLAWREVIFACISSCNFSVMFQGVMKGFIPPSRGI 273
             LDM KAYDRL+W FL+  L  FGF+  W  +I  CIS+C FS++  G ++G+    RG+
Sbjct: 1458 KLDMMKAYDRLDWSFLIKVLQHFGFNEQWIGMIQKCISNCWFSLLLNGRIEGYFKSERGL 1517

Query: 272  RQGDPLSPCLFILAEDILSRALDHFIGD--SDRFTTTNSTCPSHLLFADDIILFACAKRR 99
            RQGD +SP LFILA + LSR L+       S  +++      SHL FADD+++F    + 
Sbjct: 1518 RQGDSISPQLFILAAEYLSRGLNALYDQYPSLHYSSGVPLSVSHLAFADDVLIFTNGSKS 1577

Query: 98   SIMRYMEVLRTYQDSSGHRLNLQKCRF 18
            ++ R +  L+ Y++ SG R+N QK  F
Sbjct: 1578 ALQRILVFLQEYEEISGQRINAQKSCF 1604


>gb|PKU68054.1| integrator complex subunit 11 [Dendrobium catenatum]
          Length = 460

 Score =  281 bits (720), Expect = 2e-87
 Identities = 154/393 (39%), Positives = 223/393 (56%), Gaps = 2/393 (0%)
 Frame = -3

Query: 1175 QEEIFWKQKSHIKWLQEGDANTRFFHAAVKSKQALARIDSIVDEEGNRVHGPENIHGVAI 996
            QEE +W QKS+ K+L +GD NT+FFHA    K+  + I  I+D  G  +   ++I    +
Sbjct: 17   QEEYYWNQKSNAKFLLDGDRNTKFFHALANKKKTRSHIHKIIDMNGTALTTDDSICNSGV 76

Query: 995  NFFQQLFSSERHTVADQFLDLIPALVSEEDNDMLMQPFSLAETLEAVSSIPADSAPGIDG 816
            N+FQ +F+S    V      LIP L++E DN ML Q  +  E    +  + +++  G DG
Sbjct: 77   NYFQHIFNSFTDCVPITIPHLIPNLITEIDNAMLCQNPTEDEIFNVIKDVNSNAVAGPDG 136

Query: 815  FSSAFFTHSWDIVGQLVVDAANEFLKLKHLPKFFTHTCIVLIPKTRNPTTMNDFRPISLC 636
            F++ FF  +W+I+   V+ A  +F      PKFF+ T IVLIPK   PT   DFRPISLC
Sbjct: 137  FTTKFFQDNWEIIKDDVIKAVEDFFAGNSYPKFFSSTYIVLIPKKDGPTQWQDFRPISLC 196

Query: 635  MTFYKIVAKLIGGRLAILLPKLISENQSAFIRGRSIFDNISLAQEISSEVGKREGRPNVI 456
                K+ +K+I  RL  +LP +IS NQ+ +++GRSIFDNI LAQ+I+ ++  +    NVI
Sbjct: 197  TFLNKLNSKIIAKRLINILPNIISLNQTGYVKGRSIFDNILLAQKITHDINVKVKGGNVI 256

Query: 455  LSLDMHKAYDRLEWDFLMSTLHRFGFSLAWREVIFACISSCNFSVMFQGVMKGFIPPSRG 276
              LD+ KAYD + W+FL   L  FGF+  +  +I   I  C+FSV+  G   GF   S+G
Sbjct: 257  FKLDVSKAYDNINWNFLYKVLSLFGFNNFFISLIKNSIEDCHFSVIINGKHNGFFKLSKG 316

Query: 275  IRQGDPLSPCLFILAEDILSRALD--HFIGDSDRFTTTNSTCPSHLLFADDIILFACAKR 102
            +RQGD +SP LFI+A + LSR L+  +       F T      S L FADD ILF     
Sbjct: 317  LRQGDTMSPALFIIAMEYLSRGLNDLYLRNPMLNFRTIRGFSISRLSFADDFILFTNGSI 376

Query: 101  RSIMRYMEVLRTYQDSSGHRLNLQKCRFFLPAS 3
             ++   +  L ++ + SG  +N +K  F +  S
Sbjct: 377  NNVSLLLNFLVSFYNQSGLSINKEKSTFIVGKS 409


>gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
          Length = 2367

 Score =  299 bits (766), Expect = 3e-87
 Identities = 155/387 (40%), Positives = 233/387 (60%), Gaps = 2/387 (0%)
 Frame = -3

Query: 1172 EEIFWKQKSHIKWLQEGDANTRFFHAAVKSKQALARIDSIVDEEGNRVHGPENIHGVAIN 993
            EEIFWKQKS +KW+ EG+ NT+FFH  ++ K+  + I  + + +G  +   E +   AI 
Sbjct: 1388 EEIFWKQKSGVKWVVEGERNTKFFHTRMQKKRIRSHIFKVQEPDGRWIEDQEQLKQSAIK 1447

Query: 992  FFQQLFSSERHTVADQFLDLIPALVSEEDNDMLMQPFSLAETLEAVSSIPADSAPGIDGF 813
            +F  L   E    +     LIP+++S  +N++L    +L E  +AV  I  +SA G DGF
Sbjct: 1448 YFSSLLKFEPCDDSRFQRSLIPSIISNSENELLCAEPNLQEVKDAVFGIDPESAAGPDGF 1507

Query: 812  SSAFFTHSWDIVGQLVVDAANEFLKLKHLPKFFTHTCIVLIPKTRNPTTMNDFRPISLCM 633
            SS F+   W+I+   ++DA  +F    ++P+  T T ++L+PK  + +  +DFRPISLC 
Sbjct: 1508 SSYFYQQCWNIIAHDLLDAVRDFFHGANIPRGVTSTTLILLPKKPSASKWSDFRPISLCT 1567

Query: 632  TFYKIVAKLIGGRLAILLPKLISENQSAFIRGRSIFDNISLAQEISSEVGKREGRPNVIL 453
               KI+ KL+  RLA +LP +I+ENQS F+ GR I DNI LAQE+  ++  +    N+ L
Sbjct: 1568 VMNKIITKLLSNRLAKILPSIITENQSGFVGGRLISDNILLAQELIGKLNTKSRGGNLAL 1627

Query: 452  SLDMHKAYDRLEWDFLMSTLHRFGFSLAWREVIFACISSCNFSVMFQGVMKGFIPPSRGI 273
             LDM KAYDRL+W FL+  L  FGF+  W  +I  CIS+C FS++  G  +G+    RG+
Sbjct: 1628 KLDMMKAYDRLDWSFLIKVLQHFGFNDQWIGMIQKCISNCWFSLLLNGRTEGYFKFERGL 1687

Query: 272  RQGDPLSPCLFILAEDILSRALDHFIGD--SDRFTTTNSTCPSHLLFADDIILFACAKRR 99
            RQGDP+SP LF++A + LSR L+       S  ++T  S   SHL FADD+++F    + 
Sbjct: 1688 RQGDPISPQLFLIAAEYLSRGLNALYEQYPSLHYSTGVSIPVSHLAFADDVLIFTNGSKS 1747

Query: 98   SIMRYMEVLRTYQDSSGHRLNLQKCRF 18
            ++ R +  L+ Y++ S  R+N QK  F
Sbjct: 1748 ALQRILAFLQEYEEISRQRINAQKSCF 1774


>gb|PNX96237.1| ribonuclease H [Trifolium pratense]
          Length = 1283

 Score =  297 bits (760), Expect = 5e-87
 Identities = 159/393 (40%), Positives = 226/393 (57%), Gaps = 5/393 (1%)
 Frame = -3

Query: 1178 LQEEIFWKQKSHIKWLQEGDANTRFFHAAVKSKQALARIDSIVDEEGNRVHGPENIHGVA 999
            +QEE +WKQ++ + WL EGD NT+FFH +  S+Q   +I  +V+EE   V     +  VA
Sbjct: 644  VQEEAYWKQRAKMHWLSEGDLNTKFFHMSASSRQRAKKIGKLVNEENIAVTTQPELCEVA 703

Query: 998  INFFQQLFSSERHTVADQFLDLIPALVSEEDNDMLMQPFSLAETLEAVSSIPADSAPGID 819
            +N+F QLF S   ++ D  L LI  +++ EDN+ L+ P +  E  +A+  +  D APG D
Sbjct: 704  LNYFNQLFKSNS-SMHDPVLSLIAPVITPEDNERLVMPITRVELKDALFQMHPDKAPGPD 762

Query: 818  GFSSAFFTHSWDIVGQLVVDAANEFLKLKHLPKFFTHTCIVLIPKTRNPTTMNDFRPISL 639
            GF+ AF+ H WD+ G  + +AA E+L+  + P     T I LIPK  NP +M D RPISL
Sbjct: 763  GFNPAFYQHFWDLCGNDIFEAAQEWLERGYFPSSLNETNICLIPKCDNPLSMKDLRPISL 822

Query: 638  CMTFYKIVAKLIGGRLAILLPKLISENQSAFIRGRSIFDNISLAQEISSEVGKRE--GRP 465
            C   YK+++KL+  RL   L K +SE QSAFI GRSI DN  +A E+   + +R    + 
Sbjct: 823  CNVLYKMISKLLANRLKSCLDKCVSEEQSAFIEGRSILDNALIAIEVIHALKRRTRGKKG 882

Query: 464  NVILSLDMHKAYDRLEWDFLMSTLHRFGFSLAWREVIFACISSCNFSVMFQGVMKGFIPP 285
             + L +D+ KAYD+++W F+   L R GFS  W   +  C+SS  +SV+      G I P
Sbjct: 883  ELALKIDISKAYDKVDWGFMRGMLERLGFSDKWIHWMMLCVSSVTYSVLVNFEKVGPIFP 942

Query: 284  SRGIRQGDPLSPCLFILAEDILSRALDHFIGDSDRF---TTTNSTCPSHLLFADDIILFA 114
             RG+RQGDPLSP LFIL  + L+R +   +   D         +   SHLLFADD  LF 
Sbjct: 943  GRGLRQGDPLSPYLFILVTEGLTRLIKKSLASGDIHGVQICRGAPMVSHLLFADDCFLFC 1002

Query: 113  CAKRRSIMRYMEVLRTYQDSSGHRLNLQKCRFF 15
             +        M +L+TY ++SG  +NL K   F
Sbjct: 1003 RSTIEETNHLMSILKTYGEASGQEINLSKSEVF 1035


>dbj|GAU18648.1| hypothetical protein TSUD_124810 [Trifolium subterraneum]
          Length = 1742

 Score =  298 bits (762), Expect = 1e-86
 Identities = 160/393 (40%), Positives = 227/393 (57%), Gaps = 5/393 (1%)
 Frame = -3

Query: 1178 LQEEIFWKQKSHIKWLQEGDANTRFFHAAVKSKQALARIDSIVDEEGNRVHGPENIHGVA 999
            +QEE +W+Q++   WL+EGD NTRFFH    S+Q   +I  +V+E+   V     +  VA
Sbjct: 784  VQEETYWRQRAKTHWLKEGDLNTRFFHMTASSRQRAKKIGRLVNEDNMAVTTQPELCEVA 843

Query: 998  INFFQQLFSSERHTVADQFLDLIPALVSEEDNDMLMQPFSLAETLEAVSSIPADSAPGID 819
            +N+F  LF S   T  D  L L+   +++EDN+ L++P +  E  EA+  +  D APG D
Sbjct: 844  LNYFNHLFKSNVAT-HDPILSLVTPKITQEDNEQLVKPITRDELKEALFQMHPDKAPGPD 902

Query: 818  GFSSAFFTHSWDIVGQLVVDAANEFLKLKHLPKFFTHTCIVLIPKTRNPTTMNDFRPISL 639
            GF+ AF+ H W++ G  V  AA E+L+  + P     T I LIPK  NP +M D RPISL
Sbjct: 903  GFNPAFYQHFWEVCGDDVFVAATEWLERGYFPSSLNETNICLIPKCENPVSMKDMRPISL 962

Query: 638  CMTFYKIVAKLIGGRLAILLPKLISENQSAFIRGRSIFDNISLAQEISSEVGKREG--RP 465
            C   YK+++KL+  RL  LL K +SE QSAFI GRSI DN  +A E+   + +R    + 
Sbjct: 963  CNVLYKMISKLLANRLKGLLEKCVSEEQSAFIEGRSILDNALIAIEVIHSLKRRTRGMKS 1022

Query: 464  NVILSLDMHKAYDRLEWDFLMSTLHRFGFSLAWREVIFACISSCNFSVMFQGVMKGFIPP 285
             + L +D+ KAYD+++W F+   L R GFS  W   +  C+SS N+S++      G I P
Sbjct: 1023 ELALKIDISKAYDKVDWGFMRGMLERLGFSNKWIHWMMLCVSSVNYSILVNYEKVGPIFP 1082

Query: 284  SRGIRQGDPLSPCLFILAEDILSRALDHFIGDSDRF---TTTNSTCPSHLLFADDIILFA 114
             RG+RQGDPLSP LFIL  + L+  L   +   D         +   SHLLFADD  LF 
Sbjct: 1083 GRGLRQGDPLSPYLFILITEGLTYLLKRSLARGDLHGVQICRGAPTVSHLLFADDCFLFC 1142

Query: 113  CAKRRSIMRYMEVLRTYQDSSGHRLNLQKCRFF 15
             A      + M++L+TY+++SG  +NL K   F
Sbjct: 1143 RATIAEANQLMQILKTYEEASGQEINLSKSEVF 1175


>ref|XP_023638968.1| uncharacterized protein LOC111830668 [Capsella rubella]
          Length = 1255

 Score =  296 bits (757), Expect = 1e-86
 Identities = 166/388 (42%), Positives = 227/388 (58%), Gaps = 5/388 (1%)
 Frame = -3

Query: 1175 QEEIFWKQKSHIKWLQEGDANTRFFHAAVKSKQALARIDSIVDEEGNRVHGPENIHGVAI 996
            +EE FW  KS   W+Q GD NTRFFHA+ K++ A  R+ SI+DE G R++G ++I   AI
Sbjct: 330  EEERFWHLKSRNLWMQLGDRNTRFFHASTKNRLARNRLTSIMDEGGTRLYGNKDIATEAI 389

Query: 995  NFFQQLFSSERHTVADQFLDLIPALVSEEDNDMLMQPFSLAETLEAVSSIPADSAPGIDG 816
             +F  LF+S   T  D  L  I  LV+ + N  L    +  E   A+ SI A  APG DG
Sbjct: 390  TYFGNLFTSPGPTPLDSVLCNITPLVTSDMNHYLTSEVTGDEIKSALFSIGATKAPGPDG 449

Query: 815  FSSAFFTHSWDIVGQLVVDAANEFLKLKHLPKFFTHTCIVLIPKTRNPTTMNDFRPISLC 636
            F++AF+ H WD VG ++      F    HLP+ + HT + LIPK   P TM D RPISLC
Sbjct: 450  FNAAFYQHYWDTVGTMITMEVQNFFVTGHLPREWNHTNLCLIPKITAPKTMKDLRPISLC 509

Query: 635  MTFYKIVAKLIGGRLAILLPKLISENQSAFIRGRSIFDNISLAQEI--SSEVGKREGRPN 462
               YKI++K++  RL  ++  ++SENQ+AFI GR I DN+ LA E+  S +V +R     
Sbjct: 510  NVLYKIISKILTQRLKGVMSAIVSENQAAFIPGRYITDNVLLAHEVHHSLQVRRRCATSY 569

Query: 461  VILSLDMHKAYDRLEWDFLMSTLHRFGFSLAWREVIFACISSCNFSVMFQGVMKGFIPPS 282
            + +  D+ KAYDR+EW+FL + L R GF   W + I  C+SS +FSV+  G   G    +
Sbjct: 570  MAVKTDISKAYDRIEWNFLRAVLERKGFHPRWVDWIMECVSSVSFSVLINGSPYGNFSAT 629

Query: 281  RGIRQGDPLSPCLFILAEDILSRALDH--FIGDSDRFTTTN-STCPSHLLFADDIILFAC 111
            RG+RQGDPLSP LFIL  D+LS  L      GD +    +N     SHLLFADD + F  
Sbjct: 630  RGLRQGDPLSPSLFILCADVLSSLLSQATSAGDINGIQLSNGGPRLSHLLFADDSLFFLK 689

Query: 110  AKRRSIMRYMEVLRTYQDSSGHRLNLQK 27
            A  ++    M++ + Y D+SG  +N  K
Sbjct: 690  ADHKNSTNLMKIFKAYGDASGQIINFDK 717


>gb|PNX97216.1| ribonuclease H, partial [Trifolium pratense]
          Length = 1094

 Score =  293 bits (751), Expect = 2e-86
 Identities = 157/393 (39%), Positives = 224/393 (56%), Gaps = 5/393 (1%)
 Frame = -3

Query: 1178 LQEEIFWKQKSHIKWLQEGDANTRFFHAAVKSKQALARIDSIVDEEGNRVHGPENIHGVA 999
            +QEE +WKQ++ + WL+EGD NT+FFH +   +Q   +ID +V+E    V     I  VA
Sbjct: 98   VQEETYWKQRAKMHWLKEGDLNTKFFHMSATVRQRAKKIDKLVNEGNIEVKTQSEICEVA 157

Query: 998  INFFQQLFSSERHTVADQFLDLIPALVSEEDNDMLMQPFSLAETLEAVSSIPADSAPGID 819
             N+F  LF +   T  +Q L LI   V+ EDN+ L+ P +  E  EA+  +  D APG D
Sbjct: 158  RNYFDHLFRANA-TTHEQVLALITPKVTREDNERLVAPITREELKEALFQMHPDKAPGPD 216

Query: 818  GFSSAFFTHSWDIVGQLVVDAANEFLKLKHLPKFFTHTCIVLIPKTRNPTTMNDFRPISL 639
            GF+ AF+ H WD+ G  + +AA E+L   + P     T I LIPK  NP +M D RPISL
Sbjct: 217  GFNPAFYQHFWDLCGNDIYEAAKEWLDRGYFPSSLNETNICLIPKCENPVSMKDMRPISL 276

Query: 638  CMTFYKIVAKLIGGRLAILLPKLISENQSAFIRGRSIFDNISLAQEISSEVGKRE--GRP 465
            C   YK+V+KL+  RL   L K +SE QSAF+ GRSI DN  +A E+   + +R    + 
Sbjct: 277  CNVLYKMVSKLLANRLKECLEKCVSEEQSAFVEGRSILDNALIAIEVIHAIKRRTKGWKG 336

Query: 464  NVILSLDMHKAYDRLEWDFLMSTLHRFGFSLAWREVIFACISSCNFSVMFQGVMKGFIPP 285
             + L +D+ KAYD+++W F+   L R GF+  W   +  C+SS N+SV+      G I P
Sbjct: 337  ELALKIDISKAYDKVDWGFMRGMLERLGFANKWIHWMMLCVSSVNYSVLVNFEKVGPIYP 396

Query: 284  SRGIRQGDPLSPCLFILAEDILSRALDHFIGDSDRF---TTTNSTCPSHLLFADDIILFA 114
             RG+RQGDPLSP LF+L  + L+  + + +   D         +   SHLLFADD  LF 
Sbjct: 397  GRGLRQGDPLSPYLFLLVTEGLTALIKNSVARGDLHGVKICRGAPAVSHLLFADDCFLFC 456

Query: 113  CAKRRSIMRYMEVLRTYQDSSGHRLNLQKCRFF 15
             +     +  M++L+ Y+ +SG  +NL K   F
Sbjct: 457  RSSLDETLHLMQILKIYEQASGQEINLTKSEVF 489


>gb|KZV36392.1| hypothetical protein F511_03833 [Dorcoceras hygrometricum]
          Length = 884

 Score =  290 bits (741), Expect = 2e-86
 Identities = 169/395 (42%), Positives = 222/395 (56%), Gaps = 3/395 (0%)
 Frame = -3

Query: 1184 VSLQEEIFWKQKSHIKWLQEGDANTRFFHAAVKSKQALARIDSIVDEEGNRVHGPENIHG 1005
            ++  E  FWKQK+  KWL++G+ NT+ FH  V+ K    +I  I D+ GN +  P  I  
Sbjct: 325  ITAMESDFWKQKAACKWLEDGERNTKLFHNLVRKKHVANKIFRIWDD-GNCLTSPTLIQQ 383

Query: 1004 VAINFFQQLFSSERHTVADQFLDLIPALVSEEDNDMLMQPFSLAETLEAVSSIPADSAPG 825
                FF+ L + E   +A   L      +S+ +N  +    SL E    V SI  DS  G
Sbjct: 384  SGAFFFESLLTGEPSALAAPDLSYFSHEISDLENISIAATPSLEEVKAVVFSIHRDSVAG 443

Query: 824  IDGFSSAFFTHSWDIVGQLVVDAANEFLKLKHLPKFFTHTCIVLIPKTRNPTTMNDFRPI 645
             DGFSSAFF H W IV Q V  A  +F +   LP+ FT T I LIPK   P   +DFRPI
Sbjct: 444  PDGFSSAFFQHCWQIVHQDVFRAVLDFFQGTPLPQSFTSTTISLIPKCEGPRAWSDFRPI 503

Query: 644  SLCMTFYKIVAKLIGGRLAILLPKLISENQSAFIRGRSIFDNISLAQEISSEVGKREGRP 465
            SLC    KI++KL+  RL  ++ +LIS NQS F+ GR I DNI LAQE++  +  +    
Sbjct: 504  SLCNVTNKIISKLLYSRLRNVVGRLISPNQSGFVPGRLISDNILLAQELTHRLNCKTHGG 563

Query: 464  NVILSLDMHKAYDRLEWDFLMSTLHRFGFSLAWREVIFACISSCNFSVMFQGVMKGFIPP 285
            NVIL LDM KAYDR++W FL++ +   GFS     ++  CIS+C+FS+   G   GF   
Sbjct: 564  NVILKLDMAKAYDRVQWSFLLNIMRHLGFSDTVVGMVSRCISACHFSIKINGTSAGFFKS 623

Query: 284  SRGIRQGDPLSPCLFILAEDILSRALDHFIGDSDRFTTTNSTCP---SHLLFADDIILFA 114
            +RG+RQGDPLSP LFIL  + LSR LD     S      NS C    SHL +ADDI++FA
Sbjct: 624  TRGLRQGDPLSPLLFILGAEYLSRGLDRLF-LSRPSLCYNSGCDVRISHLAYADDIVIFA 682

Query: 113  CAKRRSIMRYMEVLRTYQDSSGHRLNLQKCRFFLP 9
                R I R +E L  Y+  SG  +N+ K    LP
Sbjct: 683  NGNIRGIKRIIEFLHHYESCSGQSVNVHKSSIILP 717


Top