BLASTX nr result
ID: Alisma22_contig00035739
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Alisma22_contig00035739
         (741 letters)

Database: ./nr
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

XP_011457604.1 PREDICTED: hornerin-like, partial [Fragaria vesca...    69   2e-15
XP_011092648.1 PREDICTED: uncharacterized protein LOC105172769 [...    69   3e-15
EOY03078.1 Retrotransposon protein, putative [Theobroma cacao]         73   2e-13
EOY14099.1 DNA/RNA polymerases superfamily protein [Theobroma ca...    73   3e-13
EOY08454.1 DNA/RNA polymerases superfamily protein [Theobroma ca...    74   4e-13
EOY26421.1 DNA/RNA polymerases superfamily protein [Theobroma ca...    74   5e-13
XP_011462316.1 PREDICTED: uncharacterized protein LOC105350950 [...    63   9e-13
EOY08404.1 Retrotransposon-like protein [Theobroma cacao]              73   9e-13
XP_012490570.1 PREDICTED: uncharacterized protein LOC105803109 [...    57   1e-12
XP_007220718.1 hypothetical protein PRUPE_ppa022673mg [Prunus pe...    73   1e-12
XP_016733510.1 PREDICTED: uncharacterized protein LOC107944194, ...    56   4e-12
EOY03146.1 Retrotransposon protein, putative [Theobroma cacao]         69   5e-12
XP_012487752.1 PREDICTED: uncharacterized protein LOC105800943 [...    57   9e-12
EOY16854.1 DNA/RNA polymerases superfamily protein [Theobroma ca...    70   9e-12
XP_012466477.1 PREDICTED: uncharacterized protein LOC105785086 [...    54   1e-11
EOY17430.1 Uncharacterized protein TCM_036595 [Theobroma cacao]        73   2e-11
XP_017216862.1 PREDICTED: uncharacterized protein LOC108194427 [...    64   2e-11
XP_016734104.1 PREDICTED: uncharacterized protein LOC107944788 [...    54   2e-11
XP_011466845.1 PREDICTED: uncharacterized protein LOC105352182 [...
58 3e-11 EOY26377.1 DNA/RNA polymerases superfamily protein [Theobroma ca... 73 3e-11 >XP_011457604.1 PREDICTED: hornerin-like, partial [Fragaria vesca subsp. vesca] Length = 297 Score = 68.9 bits (167), Expect(2) = 2e-15 Identities = 50/128 (39%), Positives = 66/128 (51%), Gaps = 4/128 (3%) Frame = +2 Query: 74 TEFQ--GCHQCRQMRHFKRNCLQLQQSASHGSQSQYQNVLRQAPAGG--PGSSVASDSRG 241 ++FQ GC QC Q+ HFKR+C L Q A++ + QA G G+ + +RG Sbjct: 89 SQFQLGGCFQCGQLDHFKRDCPLLTQGATYAP----TQAMGQASTSGSSSGTHAMAPARG 144 Query: 242 GAAPSAREGQRGHHGRGIASRRFFSLTQQEGIVSSQVITREILFFNSISVTVLIDLGATH 421 G P +GQRG A R ++TQQEG S VI +L F +LID GATH Sbjct: 145 GFQPG--KGQRGRPATTHA--RLHAMTQQEGRTSPDVIIGTLLIFGH-PAFILIDPGATH 199 Query: 422 SFIQSSLS 445 SF+ S S Sbjct: 200 SFMSSRFS 207 Score = 41.6 bits (96), Expect(2) = 2e-15 Identities = 20/44 (45%), Positives = 27/44 (61%) Frame = +1 Query: 547 GCLLTFGGYTIAVNLMTLYINSFQVILGMDCLETYRVQLDCWKE 678 GC L G V+L+ L I F VILGMD LE +R +DC+++ Sbjct: 239 GCGLLVEGLNFEVDLIPLDIVEFDVILGMDFLEAHRAMIDCFRK 282 >XP_011092648.1 PREDICTED: uncharacterized protein LOC105172769 [Sesamum indicum] Length = 957 Score = 68.9 bits (167), Expect(2) = 3e-15 Identities = 49/148 (33%), Positives = 71/148 (47%), Gaps = 9/148 (6%) Frame = +2 Query: 29 TCDICGRPHS*QCWGTEF--QGCHQCRQMRHFKRNC----LQLQQSASHGSQSQYQ--NV 184 +C CGR H CW E + C++C H RNC + + +S + GSQSQ + Sbjct: 262 SCSTCGRQHQGPCWRREDIPKICYRCGGRGHIARNCSSQTIGVVESVASGSQSQSSEGSS 321 Query: 185 LRQAPAG-GPGSSVASDSRGGAAPSAREGQRGHHGRGIASRRFFSLTQQEGIVSSQVITR 361 R A G G G + G S G +G R +++T++E S+ VI+ Sbjct: 322 GRGANRGRGRGRGTGNRDSGHTIGSGMRGPGAQGTQGQTQARIYNITREEAPASNDVISG 381 Query: 362 EILFFNSISVTVLIDLGATHSFIQSSLS 445 IL F+ I VLID G+THS+I S + Sbjct: 382 TILLFD-IMAYVLIDPGSTHSYISSEFA 408 Score = 40.8 bits (94), Expect(2) = 3e-15 Identities = 19/51 (37%), Positives = 30/51 (58%) Frame = +1 Query: 526 VLTEFCPGCLLTFGGYTIAVNLMTLYINSFQVILGMDCLETYRVQLDCWKE 678 V+ F G L+ G + V+L+ L + F VILGMD L ++ +DC+K+ Sbjct: 433 
VVNSFRKGSLVRIGDVNLPVDLIVLDLKEFDVILGMDWLAQHKAIVDCYKK 483 >EOY03078.1 Retrotransposon protein, putative [Theobroma cacao] Length = 1263 Score = 73.2 bits (178), Expect(2) = 2e-13 Identities = 54/145 (37%), Positives = 71/145 (48%), Gaps = 5/145 (3%) Frame = +2 Query: 11 SRREMRTCDICGRPHS*QCWGTEFQGCHQCRQMRHFKRNCLQLQQSASHGSQSQYQNVLR 190 S + R+CD CGR HS C+ T + C+ C Q+ H +R+CL QS S Sbjct: 72 SSKVTRSCDTCGRRHSGWCFLTT-RTCYGCGQLGHIRRDCLMAHQSPDSACGSTQPASST 130 Query: 191 QAPAGGPGSSVA-SDSRGGAAPSAREGQRGHH----GRGIASRRFFSLTQQEGIVSSQVI 355 + A G V+ S RG S R H GRG R F+LTQQE S+ V+ Sbjct: 131 PSVAVSSGREVSGSRGRGAGTSSQDRPSRSRHQSSVGRG--QVRVFTLTQQEAQTSNAVV 188 Query: 356 TREILFFNSISVTVLIDLGATHSFI 430 + IL +++ VL D GATHSFI Sbjct: 189 S-GILSVCNMNARVLFDPGATHSFI 212 Score = 30.4 bits (67), Expect(2) = 2e-13 Identities = 15/41 (36%), Positives = 21/41 (51%) Frame = +1 Query: 550 CLLTFGGYTIAVNLMTLYINSFQVILGMDCLETYRVQLDCW 672 C++ +VNL+ L F VILGMD L +DC+ Sbjct: 250 CVVRVKDKDTSVNLVVLDTLDFDVILGMDWLSPCHASVDCY 290 >EOY14099.1 DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1502 Score = 72.8 bits (177), Expect(2) = 3e-13 Identities = 55/146 (37%), Positives = 78/146 (53%), Gaps = 6/146 (4%) Frame = +2 Query: 11 SRREMRTCDICGRPHS*QCWGTEFQGCHQCRQMRHFKRNCLQLQQSASHGSQSQYQNVLR 190 S + +R+CD CGR HS +C+ T + C++C Q H +R+C QS+ S Sbjct: 373 SSQVIRSCDTCGRRHSGRCFLTT-RTCYECGQPGHIRRDCPMAHQSSDSARGSTQLASSA 431 Query: 191 QAPAGGPGSSVASDSRGGAAPSAREGQ---RGHH---GRGIASRRFFSLTQQEGIVSSQV 352 + A G V S SRG A ++ +G+ GH GRG A R F+LTQQE S+ V Sbjct: 432 PSVAVSSGREV-SGSRGRGAGTSSQGRPSGSGHQSSIGRGQA--RVFTLTQQEAQTSNAV 488 Query: 353 ITREILFFNSISVTVLIDLGATHSFI 430 ++ IL +++ V D GATHSFI Sbjct: 489 VS-GILSVCNMNARVQFDPGATHSFI 513 Score = 30.4 bits (67), Expect(2) = 3e-13 Identities = 15/41 (36%), Positives = 21/41 (51%) Frame = +1 Query: 550 CLLTFGGYTIAVNLMTLYINSFQVILGMDCLETYRVQLDCW 672 C++ +VNL+ L F VILGMD L +DC+ Sbjct: 551 CVVRVKDKDTSVNLVVLDTLDFDVILGMDWLSPCHASVDCY 591 >EOY08454.1 DNA/RNA polymerases superfamily 
protein [Theobroma cacao] Length = 1400 Score = 74.3 bits (181), Expect(2) = 4e-13 Identities = 53/145 (36%), Positives = 75/145 (51%), Gaps = 5/145 (3%) Frame = +2 Query: 11 SRREMRTCDICGRPHS*QCWGTEFQGCHQCRQMRHFKRNCLQLQQS--ASHGSQSQYQNV 184 S + +R+CD CGR HS +C+ T + C+ C Q H +R+C QS ++ GS + Sbjct: 310 SSQVIRSCDTCGRRHSGRCFLTT-KTCYGCGQPGHIRRDCPMAHQSPDSARGSTQPASSA 368 Query: 185 LRQAPAGGPGSSVASDSRGGAAPSAREGQRGHH---GRGIASRRFFSLTQQEGIVSSQVI 355 A + G S + G + R GH GRG A R F+LTQQE S+ V+ Sbjct: 369 PSVAVSSGQEVSGSRGRGAGTSSQGRPSGSGHQSSIGRGQA--RVFALTQQEAQTSNAVV 426 Query: 356 TREILFFNSISVTVLIDLGATHSFI 430 + IL +++ VL D GATHSFI Sbjct: 427 S-SILSVCNMNARVLFDPGATHSFI 450 Score = 28.5 bits (62), Expect(2) = 4e-13 Identities = 14/41 (34%), Positives = 21/41 (51%) Frame = +1 Query: 550 CLLTFGGYTIAVNLMTLYINSFQVILGMDCLETYRVQLDCW 672 C++ +VNL+ L F VILGM+ L +DC+ Sbjct: 488 CVVRVKDKDTSVNLVVLDTLDFDVILGMNWLSPCHASVDCY 528 >EOY26421.1 DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 334 Score = 73.9 bits (180), Expect(2) = 5e-13 Identities = 53/145 (36%), Positives = 74/145 (51%), Gaps = 5/145 (3%) Frame = +2 Query: 11 SRREMRTCDICGRPHS*QCWGTEFQGCHQCRQMRHFKRNCLQLQQSA--SHGSQSQYQNV 184 S + +R+CD CGR HS +C+ T + C+ C Q H +R+C QS + GS + Sbjct: 9 SSQVIRSCDTCGRRHSGRCFLTT-KTCYGCGQPGHIRRDCPMAHQSPDFARGSTQPASSA 67 Query: 185 LRQAPAGGPGSSVASDSRGGAAPSAREGQRGHH---GRGIASRRFFSLTQQEGIVSSQVI 355 L + G S + G + R GH GRG A R F+LTQQE S+ V+ Sbjct: 68 LSVVVSSGREVSGSRGKGAGTSSQGRPSGSGHQSSIGRGQA--RVFALTQQEAQTSNAVV 125 Query: 356 TREILFFNSISVTVLIDLGATHSFI 430 + IL +++ VL D GATHSFI Sbjct: 126 S-GILSVCNMNARVLFDPGATHSFI 149 Score = 28.5 bits (62), Expect(2) = 5e-13 Identities = 14/41 (34%), Positives = 21/41 (51%) Frame = +1 Query: 550 CLLTFGGYTIAVNLMTLYINSFQVILGMDCLETYRVQLDCW 672 C++ +VNL+ L F VILGM+ L +DC+ Sbjct: 187 CVVRVKDKDTSVNLVVLDTLDFDVILGMNWLSPCHASVDCY 227 >XP_011462316.1 PREDICTED: uncharacterized protein LOC105350950 [Fragaria vesca subsp. 
vesca] Length = 531 Score = 62.8 bits (151), Expect(2) = 9e-13 Identities = 45/118 (38%), Positives = 59/118 (50%), Gaps = 1/118 (0%) Frame = +2 Query: 86 GCHQCRQMRHFKRNCLQLQQSASHGSQSQYQNV-LRQAPAGGPGSSVASDSRGGAAPSAR 262 GC +C + HFKR+C +L Q + + YQ A GS +S +RGG P Sbjct: 112 GCFECGEPGHFKRDCPRLAQGV---APTFYQTAGQTSVGASSSGSRASSAARGG--PQQG 166 Query: 263 EGQRGHHGRGIASRRFFSLTQQEGIVSSQVITREILFFNSISVTVLIDLGATHSFIQS 436 GQR GR R ++T QEG S +VI + F + T LID GATHSF+ S Sbjct: 167 RGQR---GRPTTQARVHAMTFQEGRTSPEVIIGRLFIFGQPAFT-LIDPGATHSFMSS 220 Score = 38.9 bits (89), Expect(2) = 9e-13 Identities = 17/43 (39%), Positives = 27/43 (62%) Frame = +1 Query: 550 CLLTFGGYTIAVNLMTLYINSFQVILGMDCLETYRVQLDCWKE 678 C + GY + NL+ L + F VILGMD LE ++ +DC+++ Sbjct: 256 CEVLVEGYNLEANLIPLEMVDFDVILGMDFLEAHQALVDCFQK 298 >EOY08404.1 Retrotransposon-like protein [Theobroma cacao] Length = 654 Score = 73.2 bits (178), Expect(2) = 9e-13 Identities = 56/146 (38%), Positives = 77/146 (52%), Gaps = 6/146 (4%) Frame = +2 Query: 11 SRREMRTCDICGRPHS*QCWGTEFQGCHQCRQMRHFKRNCLQLQQSASHGSQSQYQNVLR 190 S + +R+CD CGR HS +C+ T + C+ C Q H +R+C QS S Sbjct: 213 SSQVIRSCDTCGRRHSGRCFLTT-KTCYGCGQPGHIRRDCPMAHQSPDSARGSTQPASSA 271 Query: 191 QAPAGGPGSSVASDSRGGAAPSAREGQ---RGHH---GRGIASRRFFSLTQQEGIVSSQV 352 + A G V S SRG A ++ +G+ GH GRG A R F+LTQQE S+ V Sbjct: 272 PSVAVSSGREV-SGSRGRGAGTSSQGKPSGSGHQSSIGRGQA--RVFALTQQEAQTSNAV 328 Query: 353 ITREILFFNSISVTVLIDLGATHSFI 430 ++ IL +++ VL D GATHSFI Sbjct: 329 VS-GILSVCNMNARVLFDPGATHSFI 353 Score = 28.5 bits (62), Expect(2) = 9e-13 Identities = 14/41 (34%), Positives = 21/41 (51%) Frame = +1 Query: 550 CLLTFGGYTIAVNLMTLYINSFQVILGMDCLETYRVQLDCW 672 C++ +VNL+ L F VILGM+ L +DC+ Sbjct: 391 CVVRVKDKDTSVNLVVLDTLDFDVILGMNWLSPCHASVDCY 431 >XP_012490570.1 PREDICTED: uncharacterized protein LOC105803109 [Gossypium raimondii] Length = 1107 Score = 57.0 bits (136), Expect(2) = 1e-12 Identities = 44/156 (28%), Positives = 65/156 (41%), Gaps = 11/156 (7%) Frame = +2 Query: 8 
PSRREMRTCDICGRPHS*QCWGTEFQGCHQCRQMRHFKRNCLQLQQSASHGSQSQYQ--- 178 P E CD CG+ H +CW + GC +C HF ++C + Q S SQ Sbjct: 283 PREGENPECDYCGKRHFGECW-KKIGGCFRCGSTEHFVKDCPKTQSSTPATSQRSISTAR 341 Query: 179 --------NVLRQAPAGGPGSSVASDSRGGAAPSAREGQRGHHGRGIASRRFFSLTQQEG 334 +VLR+ G GS +A+ AP +R + T++EG Sbjct: 342 GRGMFRGGSVLRRGGV-GRGSDIATQQSEARAP---------------ARAYVVRTREEG 385 Query: 335 IVSSQVITREILFFNSISVTVLIDLGATHSFIQSSL 442 ++ + I S V LID G++HS+I S L Sbjct: 386 --NAHDVVTGIFLLYSEPVYALIDPGSSHSYINSKL 419 Score = 44.3 bits (103), Expect(2) = 1e-12 Identities = 21/57 (36%), Positives = 31/57 (54%) Frame = +1 Query: 526 VLTEFCPGCLLTFGGYTIAVNLMTLYINSFQVILGMDCLETYRVQLDCWKEAGCLST 696 ++ + CP C L T V+L+ + F VILGMD L + V LDC+K+ + T Sbjct: 438 LVNQVCPRCPLIIQNKTFPVDLLIMPFGDFDVILGMDWLSEHEVILDCYKKKFSIQT 494 >XP_007220718.1 hypothetical protein PRUPE_ppa022673mg [Prunus persica] Length = 1506 Score = 72.8 bits (177), Expect(2) = 1e-12 Identities = 53/149 (35%), Positives = 71/149 (47%), Gaps = 1/149 (0%) Frame = +2 Query: 2 TGPSRREMRTCDICGRPHS*QCW-GTEFQGCHQCRQMRHFKRNCLQLQQSASHGSQSQYQ 178 +G RR C CGR HS C GT GC+ C Q HF+++C Q+ Sbjct: 333 SGSGRRSRPQCARCGRYHSGPCQQGTT--GCYYCGQPGHFQKDCPLFPQTRE-------- 382 Query: 179 NVLRQAPAGGPGSSVASDSRGGAAPSAREGQRGHHGRGIASRRFFSLTQQEGIVSSQVIT 358 AP G SS + ++ A QRG GR A+ R ++++QQE S +VIT Sbjct: 383 --TTDAPTPGTASS-SGGAQTSVASHGSSQQRGRGGRSRATGRVYNMSQQEAHASPEVIT 439 Query: 359 REILFFNSISVTVLIDLGATHSFIQSSLS 445 + F I VLID GATHSF+ S + Sbjct: 440 GILPVF-GIPARVLIDPGATHSFVTPSFA 467 Score = 28.5 bits (62), Expect(2) = 1e-12 Identities = 13/38 (34%), Positives = 21/38 (55%) Frame = +1 Query: 565 GGYTIAVNLMTLYINSFQVILGMDCLETYRVQLDCWKE 678 G +L+ L + VILGMD L +R +DC+++ Sbjct: 505 GNVFFEADLIPLGMVDLDVILGMDWLARHRASVDCFRK 542 >XP_016733510.1 PREDICTED: uncharacterized protein LOC107944194, partial [Gossypium hirsutum] Length = 2080 Score = 56.2 bits (134), Expect(2) = 4e-12 Identities = 43/146 (29%), Positives = 69/146 (47%), Gaps = 1/146 (0%) Frame = +2 Query: 8 
PSRR-EMRTCDICGRPHS*QCWGTEFQGCHQCRQMRHFKRNCLQLQQSASHGSQSQYQNV 184 PSR ++ C CG+ H +CW +GC +C HF R+C ++ + SQ Sbjct: 313 PSREIDIPDCQHCGKKHRGECWKLT-RGCFRCGSTDHFIRDCPKVDSTVPVTSQRSVSTA 371 Query: 185 LRQAPAGGPGSSVASDSRGGAAPSAREGQRGHHGRGIASRRFFSLTQQEGIVSSQVITRE 364 + G G SV SRGG+ + + + +R + T++EG + V+T Sbjct: 372 --KGRGLGRGGSV---SRGGSIRRSNDIATQQSEAKVPARAYVVRTREEGD-AHDVVTGI 425 Query: 365 ILFFNSISVTVLIDLGATHSFIQSSL 442 L ++ V LID G++HS+I S L Sbjct: 426 FLLYSE-PVYALIDPGSSHSYINSKL 450 Score = 43.1 bits (100), Expect(2) = 4e-12 Identities = 19/57 (33%), Positives = 31/57 (54%) Frame = +1 Query: 526 VLTEFCPGCLLTFGGYTIAVNLMTLYINSFQVILGMDCLETYRVQLDCWKEAGCLST 696 ++ + CP C L T ++L+ + F +ILGMD L + V LDC+K+ + T Sbjct: 476 LVNQICPRCPLIIQNKTFPIDLLIMPFGDFDIILGMDWLAEHGVVLDCYKKKFSIQT 532 >EOY03146.1 Retrotransposon protein, putative [Theobroma cacao] Length = 1480 Score = 68.6 bits (166), Expect(2) = 5e-12 Identities = 51/145 (35%), Positives = 74/145 (51%), Gaps = 5/145 (3%) Frame = +2 Query: 11 SRREMRTCDICGRPHS*QCWGTEFQGCHQCRQMRHFKRNCLQLQQS--ASHGSQSQYQNV 184 S + +R+CD CG HS +C+ T + C+ C Q H ++C QS ++ GS + Sbjct: 361 SSQVIRSCDTCGIRHSGRCFLTT-KTCYGCGQPGHIMKDCPMAHQSPDSARGSTQPASSA 419 Query: 185 LRQAPAGGPGSSVASDSRGGAAPSAREGQRGHH---GRGIASRRFFSLTQQEGIVSSQVI 355 A + G S + G + R + GH GRG A R F+LTQQE S+ V+ Sbjct: 420 PSVAVSSGLEVSGSRGRGAGTSSQGRPSRSGHQSSIGRGQA--RVFALTQQEAQTSNAVV 477 Query: 356 TREILFFNSISVTVLIDLGATHSFI 430 + IL +++ VL D GATHSFI Sbjct: 478 SG-ILSVCNMNARVLFDPGATHSFI 501 Score = 30.4 bits (67), Expect(2) = 5e-12 Identities = 15/41 (36%), Positives = 21/41 (51%) Frame = +1 Query: 550 CLLTFGGYTIAVNLMTLYINSFQVILGMDCLETYRVQLDCW 672 C++ +VNL+ L F VILGMD L +DC+ Sbjct: 539 CVVRVKDKDTSVNLVVLDTLDFDVILGMDWLSPCHASVDCY 579 >XP_012487752.1 PREDICTED: uncharacterized protein LOC105800943 [Gossypium raimondii] Length = 808 Score = 57.0 bits (136), Expect(2) = 9e-12 Identities = 45/147 (30%), Positives = 70/147 (47%), Gaps = 1/147 (0%) Frame = +2 Query: 5 
GPSRR-EMRTCDICGRPHS*QCWGTEFQGCHQCRQMRHFKRNCLQLQQSASHGSQSQYQN 181 GPSR ++ C CG+ H +CW + C + R HF R+C + + + SQ Sbjct: 253 GPSRNIDIPDCKHCGKKHLGECWRIT-RRCFRYRSTDHFIRDCPKNEGAIPAASQRSVST 311 Query: 182 VLRQAPAGGPGSSVASDSRGGAAPSAREGQRGHHGRGIASRRFFSLTQQEGIVSSQVITR 361 V + G GSS+ SRGG + + + +R + T++EG V V+T Sbjct: 312 V--RGRGSGRGSSI---SRGGGIRRSSDIATQQSEAKVPARAYVVRTREEGDVHD-VVTG 365 Query: 362 EILFFNSISVTVLIDLGATHSFIQSSL 442 L ++ V LID G++HS+I S L Sbjct: 366 IFLLYSE-PVYALIDPGSSHSYINSKL 391 Score = 41.2 bits (95), Expect(2) = 9e-12 Identities = 20/52 (38%), Positives = 28/52 (53%) Frame = +1 Query: 541 CPGCLLTFGGYTIAVNLMTLYINSFQVILGMDCLETYRVQLDCWKEAGCLST 696 C CLL +V+L+ + F +ILGMD L Y V LDC+K+ + T Sbjct: 422 CRRCLLMIHDKMFSVDLLIMPFGDFDIILGMDWLSEYGVILDCYKKRFSIQT 473 >EOY16854.1 DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 737 Score = 69.7 bits (169), Expect(2) = 9e-12 Identities = 55/146 (37%), Positives = 76/146 (52%), Gaps = 6/146 (4%) Frame = +2 Query: 11 SRREMRTCDICGRPHS*QCWGTEFQGCHQCRQMRHFKRNCLQLQQSASHGSQSQYQNVLR 190 S + +R+CD GR HS +C+ T + C++C Q H +R+C QS S Sbjct: 373 SSQVIRSCDTYGRRHSGRCFLTT-KTCYRCGQPGHIRRDCPMAHQSPDSARGSTQPASSA 431 Query: 191 QAPAGGPGSSVASDSRGGAAPSAREGQ---RGHH---GRGIASRRFFSLTQQEGIVSSQV 352 + G V S SRG A ++ +G+ GH GRG A R F+LTQQE S+ V Sbjct: 432 PSVTVSSGREV-SGSRGRGAGTSSQGRPSGSGHQSSIGRGQA--RVFALTQQEAQTSNAV 488 Query: 353 ITREILFFNSISVTVLIDLGATHSFI 430 ++ IL +I+ VL D GATHSFI Sbjct: 489 VSG-ILSVCNINARVLFDPGATHSFI 513 Score = 28.5 bits (62), Expect(2) = 9e-12 Identities = 14/41 (34%), Positives = 21/41 (51%) Frame = +1 Query: 550 CLLTFGGYTIAVNLMTLYINSFQVILGMDCLETYRVQLDCW 672 C++ +VNL+ L F VILGM+ L +DC+ Sbjct: 551 CVVRVKDKDTSVNLVVLDTIDFDVILGMNWLSPCHASVDCY 591 >XP_012466477.1 PREDICTED: uncharacterized protein LOC105785086 [Gossypium raimondii] Length = 780 Score = 53.9 bits (128), Expect(2) = 1e-11 Identities = 42/147 (28%), Positives = 70/147 (47%), Gaps = 1/147 (0%) Frame = +2 Query: 5 
GPSRR-EMRTCDICGRPHS*QCWGTEFQGCHQCRQMRHFKRNCLQLQQSASHGSQSQYQN 181 GPSR ++ C+ CG+ H +CW + C +C HF R+C + + + SQ Sbjct: 272 GPSRNIDIPDCEHCGKKHLGECWRIT-RRCFRCGSTDHFIRDCQKNEGALPAASQRSVST 330 Query: 182 VLRQAPAGGPGSSVASDSRGGAAPSAREGQRGHHGRGIASRRFFSLTQQEGIVSSQVITR 361 + G GSS+ SR G+ + + + +R + T++EG + V+T Sbjct: 331 A--RGRGSGRGSSL---SREGSIRRSSDIATQQSEAKVPARAYVVRTREEGD-AHDVVTG 384 Query: 362 EILFFNSISVTVLIDLGATHSFIQSSL 442 L ++ V LID G++HS+I S L Sbjct: 385 IFLLYSE-PVYALIDPGSSHSYINSKL 410 Score = 43.5 bits (101), Expect(2) = 1e-11 Identities = 21/52 (40%), Positives = 29/52 (55%) Frame = +1 Query: 541 CPGCLLTFGGYTIAVNLMTLYINSFQVILGMDCLETYRVQLDCWKEAGCLST 696 C C L G T +V+L+ + F +ILGMD L Y V LDC+K+ + T Sbjct: 441 CRRCPLMIHGKTFSVDLLIMPFGDFDIILGMDWLSEYGVILDCYKKRFSIQT 492 >EOY17430.1 Uncharacterized protein TCM_036595 [Theobroma cacao] Length = 324 Score = 73.2 bits (178), Expect = 2e-11 Identities = 53/145 (36%), Positives = 75/145 (51%), Gaps = 5/145 (3%) Frame = +2 Query: 11 SRREMRTCDICGRPHS*QCWGTEFQGCHQCRQMRHFKRNCLQLQQS--ASHGSQSQYQNV 184 S + +R+CD CGR HS +C+ T + C+ C Q H +R+C QS ++ GS + Sbjct: 137 SSQVIRSCDTCGRRHSGRCFLTT-KTCYGCGQPGHIRRDCPMAHQSPDSARGSTQPASSA 195 Query: 185 LRQAPAGGPGSSVASDSRGGAAPSAREGQRGHH---GRGIASRRFFSLTQQEGIVSSQVI 355 A + G S + G + R GH GRG A R F+LTQQE S+ V+ Sbjct: 196 PSVAVSSGREVSGSRGRGAGTSSQGRPSGSGHQSSIGRGQA--RVFALTQQEAQTSNAVV 253 Query: 356 TREILFFNSISVTVLIDLGATHSFI 430 + IL +++ VL D GATHSFI Sbjct: 254 S-GILSVCNMNARVLFDPGATHSFI 277 >XP_017216862.1 PREDICTED: uncharacterized protein LOC108194427 [Daucus carota subsp. 
sativus] Length = 1810 Score = 63.9 bits (154), Expect(2) = 2e-11 Identities = 43/143 (30%), Positives = 66/143 (46%), Gaps = 4/143 (2%) Frame = +2 Query: 29 TCDICGRPHS*QCWGTEFQGCHQCRQMRHFKRNCLQLQQSASHGSQSQYQNV----LRQA 196 TC CGR H QC + GC+ C + HF R+C +++ S+ QNV + + Sbjct: 315 TCQTCGRQHFGQC-RAQTGGCYLCGEQGHFIRDCPNKRENVQAVSEPSVQNVEVKGVGTS 373 Query: 197 PAGGPGSSVASDSRGGAAPSAREGQRGHHGRGIASRRFFSLTQQEGIVSSQVITREILFF 376 G G + GG S + R F+LT+ E + +VIT ++L + Sbjct: 374 FGRGRGKKGTGSTGGGIGRSQAQSSNPP-----TQARVFALTRGEAEAAPEVITGKVLLY 428 Query: 377 NSISVTVLIDLGATHSFIQSSLS 445 + LID G+THSFI S ++ Sbjct: 429 -QLDAYALIDPGSTHSFISSKMT 450 Score = 33.1 bits (74), Expect(2) = 2e-11 Identities = 14/49 (28%), Positives = 25/49 (51%) Frame = +1 Query: 526 VLTEFCPGCLLTFGGYTIAVNLMTLYINSFQVILGMDCLETYRVQLDCW 672 V+ + C + G + +L+ L F +ILGMD L + ++DC+ Sbjct: 475 VVDQIYRDCPIEIGNTELKADLIVLPFQEFDIILGMDWLTRHHAKVDCY 523 >XP_016734104.1 PREDICTED: uncharacterized protein LOC107944788 [Gossypium hirsutum] Length = 580 Score = 54.3 bits (129), Expect(2) = 2e-11 Identities = 43/152 (28%), Positives = 66/152 (43%), Gaps = 5/152 (3%) Frame = +2 Query: 2 TGPSRREMRTCDI-----CGRPHS*QCWGTEFQGCHQCRQMRHFKRNCLQLQQSASHGSQ 166 TG R R DI CG+ H +CW +GC +C HF R+C ++ + SQ Sbjct: 200 TGSVRGPSREIDIPDYQHCGKKHRGECWKLT-RGCFRCGSTDHFIRDCSKVDSTVPVTSQ 258 Query: 167 SQYQNVLRQAPAGGPGSSVASDSRGGAAPSAREGQRGHHGRGIASRRFFSLTQQEGIVSS 346 + G G S+ SRGG+ + + + +R + T++EG + Sbjct: 259 RSVSTA--RGRGLGRGGSI---SRGGSIRRSSDIATQQSEAKVPARAYVVRTREEG--DA 311 Query: 347 QVITREILFFNSISVTVLIDLGATHSFIQSSL 442 + I S V LID G++HS+I S L Sbjct: 312 HDVVTGIFLLYSEPVYALIDPGSSHSYINSKL 343 Score = 42.7 bits (99), Expect(2) = 2e-11 Identities = 19/57 (33%), Positives = 31/57 (54%) Frame = +1 Query: 526 VLTEFCPGCLLTFGGYTIAVNLMTLYINSFQVILGMDCLETYRVQLDCWKEAGCLST 696 ++ + CP C L T ++L+ + F +ILGMD L + V LDC+K+ + T Sbjct: 369 LVNQVCPRCPLIIQNKTFPIDLLIMPFGDFDIILGMDWLAEHGVVLDCYKKKFSIQT 425 >XP_011466845.1 PREDICTED: uncharacterized protein LOC105352182 [Fragaria vesca 
subsp. vesca] Length = 232 Score = 57.8 bits (138), Expect(2) = 3e-11 Identities = 43/117 (36%), Positives = 57/117 (48%), Gaps = 1/117 (0%) Frame = +2 Query: 89 CHQCRQMRHFKRNCLQLQQSASHGSQSQYQNV-LRQAPAGGPGSSVASDSRGGAAPSARE 265 C +C + HFKR+C +L Q + + YQ A GS +S RGG P Sbjct: 23 CFECGEPGHFKRDCPRLTQGV---APTFYQTAGQTSVGASSSGSRASSAVRGG--PQQGR 77 Query: 266 GQRGHHGRGIASRRFFSLTQQEGIVSSQVITREILFFNSISVTVLIDLGATHSFIQS 436 GQR GR R ++T QEG S +VI + F + T L+D GATHSF+ S Sbjct: 78 GQR---GRPTTQARVHAMTFQEGRTSPEVIIGTLFIFGQPAFT-LMDPGATHSFMSS 130 Score = 38.9 bits (89), Expect(2) = 3e-11 Identities = 17/43 (39%), Positives = 27/43 (62%) Frame = +1 Query: 550 CLLTFGGYTIAVNLMTLYINSFQVILGMDCLETYRVQLDCWKE 678 C + GY + NL+ L + F VILGMD LE ++ +DC+++ Sbjct: 166 CEVLVEGYNLEANLIPLEMIDFDVILGMDFLEAHQALVDCFQK 208 >EOY26377.1 DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 371 Score = 72.8 bits (177), Expect = 3e-11 Identities = 53/145 (36%), Positives = 75/145 (51%), Gaps = 5/145 (3%) Frame = +2 Query: 11 SRREMRTCDICGRPHS*QCWGTEFQGCHQCRQMRHFKRNCLQLQQS--ASHGSQSQYQNV 184 S + +R+CD CGR HS +C+ T + C+ C Q H +R+C QS ++ GS + Sbjct: 216 SSQVIRSCDTCGRRHSGRCFLTT-KTCYGCGQPGHIRRDCPMAHQSPDSARGSTQPASSA 274 Query: 185 LRQAPAGGPGSSVASDSRGGAAPSAREGQRGHH---GRGIASRRFFSLTQQEGIVSSQVI 355 A + G S + G + R GH GRG A R F+LTQQE S+ V+ Sbjct: 275 PSVAVSSGLEVSGSRGRGAGTSSQGRPSGSGHQSSIGRGQA--RVFALTQQEAQTSNAVV 332 Query: 356 TREILFFNSISVTVLIDLGATHSFI 430 + IL +++ VL D GATHSFI Sbjct: 333 S-GILSVCNMNARVLFDPGATHSFI 356