BLASTX nr result
ID: Lithospermum23_contig00008491
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Lithospermum23_contig00008491 (1161 letters)

Database: ./nr
115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

XP_008666385.1 PREDICTED: uncharacterized protein LOC103645032 [...   124   4e-27
JAU32948.1 Retrovirus-related Pol polyprotein from transposon TN...   113   5e-27
XP_008664109.1 PREDICTED: uncharacterized protein LOC103642667, ...   123   5e-27
AAU10819.1 putative polyprotein [Oryza sativa Japonica Group] AA...   121   4e-26
CAN61640.1 hypothetical protein VITISV_021909 [Vitis vinifera]        120   5e-26
XP_008675247.1 PREDICTED: uncharacterized protein LOC103651388 i...   120   5e-26
XP_008671960.1 PREDICTED: uncharacterized protein LOC103649473 [...   120   7e-26
AAL78658.1 Hopscotch polyprotein, partial [Fagus sylvatica]           112   1e-25
JAU84243.1 Retrovirus-related Pol polyprotein from transposon TN...   112   3e-25
OIW15217.1 hypothetical protein TanjilG_08809 [Lupinus angustifo...   111   4e-25
CAA19715.1 putative protein [Arabidopsis thaliana] CAB79576.1 pu...   118   4e-25
ABA97658.1 retrotransposon protein, putative, Ty1-copia subclass...   117   6e-25
OMO78631.1 Integrase, catalytic core [Corchorus capsularis]           115   1e-24
OMO89784.1 Integrase, catalytic core [Corchorus capsularis]           115   2e-24
EOY32308.1 Uncharacterized protein TCM_040047 [Theobroma cacao]       115   2e-24
OMO61427.1 Zinc finger, CCCH-type [Corchorus capsularis]              114   3e-24
XP_008646585.1 PREDICTED: LOW QUALITY PROTEIN: uncharacterized p...   115   3e-24
AAC67200.1 putative retroelement pol polyprotein [Arabidopsis th...
115 3e-24 JAU04955.1 Copia protein, partial [Noccaea caerulescens] 115 4e-24 ABF95666.1 retrotransposon protein, putative, Ty1-copia subclass... 115 4e-24 >XP_008666385.1 PREDICTED: uncharacterized protein LOC103645032 [Zea mays] Length = 1155 Score = 124 bits (310), Expect = 4e-27 Identities = 63/146 (43%), Positives = 88/146 (60%) Frame = +3 Query: 30 GIH*ILLPNNSSAKRGK*AQKNRQMTQKICCLLFQARMPFDFWVEALQYAIKLINILPTP 209 G+H + +S + GK + R + + LLFQA +P +W EAL +L+NILPT Sbjct: 753 GVHLRMSCPYTSPQNGKAERIIRTINDVVRSLLFQASIPPSYWAEALNTTTRLLNILPTK 812 Query: 210 LLNHIFPHEALFGTGPVYTSLRVFGSLCYPNLVRHTTIKSLPLSLPCIYPGPSINHKGFR 389 L PH+ALFGT PVY LRVFG CYPNL+ + K P S+ C++ G S +HKG+R Sbjct: 813 TLRFSTPHQALFGTAPVYNHLRVFGCKCYPNLLATSPHKLAPRSVLCVFLGYSDHHKGYR 872 Query: 390 CLNPENQRVYISRHVRFSELVFSYVD 467 CL+ + ++ ISRHV F E F + + Sbjct: 873 CLDIHSNKIIISRHVVFDETSFPFAE 898 >JAU32948.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Noccaea caerulescens] Length = 129 Score = 113 bits (283), Expect = 5e-27 Identities = 55/120 (45%), Positives = 76/120 (63%) Frame = +3 Query: 87 QKNRQMTQKICCLLFQARMPFDFWVEALQYAIKLINILPTPLLNHIFPHEALFGTGPVYT 266 +++R + + LL A MP +FW A A+ LIN +PT L+ PH LFGT P Y+ Sbjct: 8 RRHRHIVETGLALLSHASMPIEFWTYAFATAVYLINRMPTATLDMHSPHLKLFGTMPNYS 67 Query: 267 SLRVFGSLCYPNLVRHTTIKSLPLSLPCIYPGPSINHKGFRCLNPENQRVYISRHVRFSE 446 LR+FG LCYP L + + K P SLPC++ G S++ F CL+PE+ R+Y+SRHVRF E Sbjct: 68 KLRIFGCLCYPWLRPYASHKLDPRSLPCVFLGYSLSQSAFYCLDPESSRIYVSRHVRFCE 127 >XP_008664109.1 PREDICTED: uncharacterized protein LOC103642667, partial [Zea mays] Length = 945 Score = 123 bits (309), Expect = 5e-27 Identities = 63/146 (43%), Positives = 88/146 (60%) Frame = +3 Query: 30 GIH*ILLPNNSSAKRGK*AQKNRQMTQKICCLLFQARMPFDFWVEALQYAIKLINILPTP 209 G+H + +S + GK + R + + LLFQA +P +W EAL +L+NILPT Sbjct: 762 GVHLRMSCPYTSPQNGKAERIIRTINDVVRSLLFQASIPPSYWAEALNTTTRLLNILPTK 821 Query: 210 LLNHIFPHEALFGTGPVYTSLRVFGSLCYPNLVRHTTIKSLPLSLPCIYPGPSINHKGFR 389 L PH+ALFGT PVY LRVFG CYPNL+ + K P S+ C++ G S 
+HKG+R Sbjct: 822 TLRFSTPHQALFGTAPVYDHLRVFGCKCYPNLLATSPHKLAPRSVLCVFLGYSDHHKGYR 881 Query: 390 CLNPENQRVYISRHVRFSELVFSYVD 467 CL+ + ++ ISRHV F E F + + Sbjct: 882 CLDIHSNKIIISRHVVFDETSFPFAE 907 >AAU10819.1 putative polyprotein [Oryza sativa Japonica Group] AAV24907.1 hypothetical protein [Oryza sativa Japonica Group] Length = 1679 Score = 121 bits (303), Expect = 4e-26 Identities = 65/153 (42%), Positives = 89/153 (58%) Frame = +3 Query: 30 GIH*ILLPNNSSAKRGK*AQKNRQMTQKICCLLFQARMPFDFWVEALQYAIKLINILPTP 209 G+H + ++S + GK + R + + +LFQA++P FWVEAL A LIN PT Sbjct: 838 GVHLRMSCPHTSPQNGKAERILRSLNNIVRSMLFQAKLPGSFWVEALHTATHLINRHPTK 897 Query: 210 LLNHIFPHEALFGTGPVYTSLRVFGSLCYPNLVRHTTIKSLPLSLPCIYPGPSINHKGFR 389 L+ PH AL+GT P Y+ LRVFG CYPNL T K P S C++ G + HKG+R Sbjct: 898 TLDRHTPHFALYGTHPSYSHLRVFGCKCYPNLSATTPHKLAPRSTMCVFLGYPLYHKGYR 957 Query: 390 CLNPENQRVYISRHVRFSELVFSYVDFYKHLSS 488 C +P + RV ISRHV F E F + + +S+ Sbjct: 958 CFDPLSNRVIISRHVVFDEHSFPFTELTNGVSN 990 Score = 48.9 bits (115), Expect(2) = 2e-10 Identities = 20/34 (58%), Positives = 26/34 (76%) Frame = +2 Query: 1058 KLHPDGTIDHYKARLVSQGFK*LPGIDYDQTFSP 1159 K H DG++D YKAR V +GF PG+D+D+TFSP Sbjct: 1225 KFHADGSLDRYKARWVLRGFTQRPGVDFDETFSP 1258 Score = 46.2 bits (108), Expect(2) = 2e-10 Identities = 29/68 (42%), Positives = 35/68 (51%), Gaps = 1/68 (1%) Frame = +3 Query: 813 HHMITRGKAGIVKPRQXXXXXXXXXXPSSPYH-VPTSYTEALKYPHWKLAMIEEYNALLH 989 H M TR K+G KP ++P VP +Y AL P W+ AM EEYNALL Sbjct: 1147 HVMTTRAKSGHHKPVHRLNLH------AAPLSLVPKTYRAALADPLWRAAMEEEYNALLA 1200 Query: 990 NQTWPCSP 1013 N+TW P Sbjct: 1201 NRTWDLVP 1208 >CAN61640.1 hypothetical protein VITISV_021909 [Vitis vinifera] Length = 1361 Score = 120 bits (302), Expect = 5e-26 Identities = 74/198 (37%), Positives = 100/198 (50%), Gaps = 11/198 (5%) Frame = +3 Query: 3 KLQPHFSILGIH*ILLPNNSSAKRGK*AQKNRQMTQKICCLLFQARMPFDFWVEALQYAI 182 K + H GI L + ++ G +K+R +T+ L+F AR+P W EA A+ Sbjct: 554 KFRSHLHSCGIDLRLACPYTPSQNGIVERKHRYVTEIGLTLMFHARVPLSLWDEAFSTAV 613 Query: 183 
KLINILPTPLLNHIFPHEALFGTGPVYTSLRVFGSLCYPNLVRHTTIKSLPLSLPCIYPG 362 LIN LP P L P+E LFG P Y+ LR FG LC+P L ++ K P S PC++ G Sbjct: 614 FLINRLPPPSLAGKTPYELLFGKQPDYSMLRTFGCLCFPYLRDYSPNKLSPKSTPCVFLG 673 Query: 363 PSINHKGFRCLNPENQRVYISRHVRFSELVFSY-----------VDFYKHLSSVPSSLFP 509 S HKGFRCL+ + RVY+SRHV+F E F Y +D+ S Sbjct: 674 YSTLHKGFRCLDRKTHRVYVSRHVQFYEHTFPYNGDSVQNLPSNIDYIHFSESQECVSSS 733 Query: 510 QNCDINNLLPSTSFMTLL 563 N ++ LPS SF L Sbjct: 734 SNVSTSDSLPSPSFSNSL 751 Score = 47.8 bits (112), Expect(2) = 8e-10 Identities = 21/37 (56%), Positives = 28/37 (75%) Frame = +2 Query: 1049 F*VKLHPDGTIDHYKARLVSQGFK*LPGIDYDQTFSP 1159 F KLH DG+I+ +KARLV+QGF + G+D+ TFSP Sbjct: 887 FKTKLHSDGSIERHKARLVAQGFSQVHGLDFGDTFSP 923 Score = 45.1 bits (105), Expect(2) = 8e-10 Identities = 27/71 (38%), Positives = 32/71 (45%), Gaps = 4/71 (5%) Frame = +3 Query: 813 HHMITRGKAGIVKPRQXXXXXXXXXXPSSPYHV----PTSYTEALKYPHWKLAMIEEYNA 980 H MITRGKAGI KPR + P + A K+P W AM +E +A Sbjct: 803 HPMITRGKAGIFKPRLYHAMHISSSSQLFQAFLALKEPRGFKSAAKHPEWLSAMDDEIHA 862 Query: 981 LLHNQTWPCSP 1013 L N TW P Sbjct: 863 LKKNDTWVLVP 873 >XP_008675247.1 PREDICTED: uncharacterized protein LOC103651388 isoform X1 [Zea mays] Length = 1371 Score = 120 bits (302), Expect = 5e-26 Identities = 64/136 (47%), Positives = 78/136 (57%) Frame = +3 Query: 60 SSAKRGK*AQKNRQMTQKICCLLFQARMPFDFWVEALQYAIKLINILPTPLLNHIFPHEA 239 +S + GK + R I LLFQA MP +W EAL A L+NILPT L+ PH A Sbjct: 752 TSPQNGKAGRIIRTTNDVIRSLLFQASMPPSYWAEALHTATLLLNILPTKTLDFSTPHSA 811 Query: 240 LFGTGPVYTSLRVFGSLCYPNLVRHTTIKSLPLSLPCIYPGPSINHKGFRCLNPENQRVY 419 L G+ P Y LRVFG CYPNL K P S CI+ G S +HKG+RCL+P R+ Sbjct: 812 LLGSAPSYDHLRVFGCKCYPNLSTTAPHKLAPRSALCIFIGYSAHHKGYRCLDPTTNRII 871 Query: 420 ISRHVRFSELVFSYVD 467 ISRHV F E F + + Sbjct: 872 ISRHVIFDESTFPFAE 887 Score = 47.0 bits (110), Expect(2) = 8e-09 Identities = 27/67 (40%), Positives = 34/67 (50%) Frame = +3 Query: 813 HHMITRGKAGIVKPRQXXXXXXXXXXPSSPYHVPTSYTEALKYPHWKLAMIEEYNALLHN 992 H M+TR K G P P SP VP ++ AL P+W+ AM EE+ ALL N 
Sbjct: 1050 HRMVTRAKDGFRVP------VLYHAVPLSP--VPKTFRSALADPNWRAAMEEEHTALLQN 1101 Query: 993 QTWPCSP 1013 +TW P Sbjct: 1102 KTWDLVP 1108 Score = 42.4 bits (98), Expect(2) = 8e-09 Identities = 18/34 (52%), Positives = 24/34 (70%) Frame = +2 Query: 1058 KLHPDGTIDHYKARLVSQGFK*LPGIDYDQTFSP 1159 K H DG+++ YKAR V +GF P ID+ +TFSP Sbjct: 1125 KFHADGSLERYKARWVLRGFTQRPAIDFAETFSP 1158 >XP_008671960.1 PREDICTED: uncharacterized protein LOC103649473 [Zea mays] Length = 1477 Score = 120 bits (301), Expect = 7e-26 Identities = 68/168 (40%), Positives = 90/168 (53%) Frame = +3 Query: 30 GIH*ILLPNNSSAKRGK*AQKNRQMTQKICCLLFQARMPFDFWVEALQYAIKLINILPTP 209 G+H + +S + G+ + R + + LLFQA +P +WVEAL A L+N+ PT Sbjct: 783 GVHLRMSCPYTSQQNGRAERVLRTVNNIVRSLLFQAHLPAPYWVEALHTATYLLNLHPTS 842 Query: 210 LLNHIFPHEALFGTGPVYTSLRVFGSLCYPNLVRHTTIKSLPLSLPCIYPGPSINHKGFR 389 LN PH ALFG P Y LRVFG CYPN+ K P S C++ G S HKG+R Sbjct: 843 TLNSSTPHLALFGRPPSYDHLRVFGCKCYPNISATAAHKLAPRSTMCVFLGYSSEHKGYR 902 Query: 390 CLNPENQRVYISRHVRFSELVFSYVDFYKHLSSVPSSLFPQNCDINNL 533 CL+ R+ ISRHV F E F + + + PSS P N D +L Sbjct: 903 CLDISTNRIIISRHVVFDESSFPFAE------TPPSSALPTNLDFLDL 944 >AAL78658.1 Hopscotch polyprotein, partial [Fagus sylvatica] Length = 226 Score = 112 bits (281), Expect = 1e-25 Identities = 61/151 (40%), Positives = 85/151 (56%) Frame = +3 Query: 15 HFSILGIH*ILLPNNSSAKRGK*AQKNRQMTQKICCLLFQARMPFDFWVEALQYAIKLIN 194 H ++ GI + + + G +K+R + + +LF AR+P + W+EA A+ LIN Sbjct: 72 HLNMCGIVQHVSCPGTPEQNGVAERKHRHIVETGLTMLFHARLPKNLWIEAFMTAVYLIN 131 Query: 195 ILPTPLLNHIFPHEALFGTGPVYTSLRVFGSLCYPNLVRHTTIKSLPLSLPCIYPGPSIN 374 LP+ L H P L G P Y SL+VFG C+P L + K P S PCI+ G S Sbjct: 132 RLPSSKLAHDTPFFKLHGVHPDYNSLKVFGCRCFPYLRDYAKNKFEPKSYPCIFIGYSPL 191 Query: 375 HKGFRCLNPENQRVYISRHVRFSELVFSYVD 467 HKG+RCL+P +RVY+SRHV F E + Y D Sbjct: 192 HKGYRCLHPPTKRVYLSRHVVFDEGILPYTD 222 >JAU84243.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Noccaea caerulescens] Length = 244 Score = 112 bits (280), Expect = 3e-25 Identities = 
59/150 (39%), Positives = 85/150 (56%), Gaps = 1/150 (0%) Frame = +3 Query: 15 HFSILGIH*ILLPNNSSAKRGK*AQKNRQMTQKICCLLFQARMPFDFWVEALQYAIKLIN 194 H GI + ++ + G +K+R + + ++F ++MP +WVEAL A LIN Sbjct: 24 HLQTCGIQQFISCPHTPQQNGLAERKHRHLVELGLSMMFDSKMPQKYWVEALFTANFLIN 83 Query: 195 ILPTPLLNHIF-PHEALFGTGPVYTSLRVFGSLCYPNLVRHTTIKSLPLSLPCIYPGPSI 371 +LPT L+ P++ L G P YT+LR FG CYP L + K P SL C++ G + Sbjct: 84 LLPTTALDSTMSPYQRLLGEAPEYTALRTFGCACYPTLRAYAATKFDPRSLKCVFLGYTA 143 Query: 372 NHKGFRCLNPENQRVYISRHVRFSELVFSY 461 +KG+RCL P RVY+SRHV F E VF + Sbjct: 144 KYKGYRCLYPATGRVYLSRHVLFDEEVFPF 173 >OIW15217.1 hypothetical protein TanjilG_08809 [Lupinus angustifolius] Length = 212 Score = 111 bits (277), Expect = 4e-25 Identities = 56/134 (41%), Positives = 79/134 (58%) Frame = +3 Query: 60 SSAKRGK*AQKNRQMTQKICCLLFQARMPFDFWVEALQYAIKLINILPTPLLNHIFPHEA 239 ++++ G+ K+R +T+ +LF +++ FWVEA A+ IN LP+ +L P E Sbjct: 6 TTSQNGRAEHKHRHITETSLTMLFHSQVSTSFWVEAFSTAVYTINRLPSLVLVGKSPFEV 65 Query: 240 LFGTGPVYTSLRVFGSLCYPNLVRHTTIKSLPLSLPCIYPGPSINHKGFRCLNPENQRVY 419 LFG P Y + FG +P L + T K LP S PCI+ G S NHKGFRC NP + R+Y Sbjct: 66 LFGALPNYENFHPFGCRVFPCLRDYVTNKFLPRSAPCIFLGYSANHKGFRCFNPASSRMY 125 Query: 420 ISRHVRFSELVFSY 461 I+RH +F E F Y Sbjct: 126 ITRHAQFDEQFFPY 139 >CAA19715.1 putative protein [Arabidopsis thaliana] CAB79576.1 putative protein [Arabidopsis thaliana] Length = 1318 Score = 118 bits (295), Expect = 4e-25 Identities = 68/165 (41%), Positives = 91/165 (55%), Gaps = 1/165 (0%) Frame = +3 Query: 3 KLQPHFSILGIH*ILLPNNSSAKRGK*AQKNRQMTQKICCLLFQARMPFDFWVEALQYAI 182 K H GI L ++ + G +K+R + + +LFQ+ +P FWVEA A Sbjct: 405 KFLQHLQSHGIQQQLSCPHTPQQNGLAERKHRHLVELGLSMLFQSHVPHKFWVEAFFTAN 464 Query: 183 KLINILPTPLLNH-IFPHEALFGTGPVYTSLRVFGSLCYPNLVRHTTIKSLPLSLPCIYP 359 LIN+LPT L I P+E L+ P YTSLR FGS C+P L + K P SL C++ Sbjct: 465 FLINLLPTSALKESISPYEKLYDKKPDYTSLRSFGSACFPTLRDYAENKFNPCSLKCVFL 524 Query: 360 GPSINHKGFRCLNPENQRVYISRHVRFSELVFSYVDFYKHLSSVP 494 G + +KG+RCL P R+YISRHV F E V+ + YKHL P Sbjct: 525 
GYNEKYKGYRCLYPPTGRLYISRHVIFDESVYPFSHTYKHLHPQP 569 Score = 43.1 bits (100), Expect(2) = 6e-08 Identities = 25/68 (36%), Positives = 29/68 (42%) Frame = +3 Query: 813 HHMITRGKAGIVKPRQXXXXXXXXXXPSSPYHVPTSYTEALKYPHWKLAMIEEYNALLHN 992 H M+TR K GI KP Y P + T ALK+P W AM EE Sbjct: 685 HPMVTRAKVGISKPNPRYVFLSHKVS----YPEPKTVTAALKHPGWTGAMTEEIGNCSET 740 Query: 993 QTWPCSPH 1016 QTW P+ Sbjct: 741 QTWSLVPY 748 Score = 43.1 bits (100), Expect(2) = 6e-08 Identities = 20/37 (54%), Positives = 27/37 (72%) Frame = +2 Query: 1049 F*VKLHPDGTIDHYKARLVSQGFK*LPGIDYDQTFSP 1159 F KLH DGT++ KAR+V++GF GIDY +T+SP Sbjct: 761 FRTKLHADGTLNKLKARIVAKGFLQEEGIDYLETYSP 797 >ABA97658.1 retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Group] Length = 1597 Score = 117 bits (294), Expect = 6e-25 Identities = 68/148 (45%), Positives = 83/148 (56%) Frame = +3 Query: 57 NSSAKRGK*AQKNRQMTQKICCLLFQARMPFDFWVEALQYAIKLINILPTPLLNHIFPHE 236 ++S + GK + R + + LLFQA +P FWVEAL A LIN PT +L H P Sbjct: 765 HTSPQNGKAERILRSINNIMRSLLFQASLPGTFWVEALHTATHLINRHPTKMLQHHTPFF 824 Query: 237 ALFGTGPVYTSLRVFGSLCYPNLVRHTTIKSLPLSLPCIYPGPSINHKGFRCLNPENQRV 416 AL+G P Y LRVFG CYPNL K P S CI+ G ++HKG+RCL+P RV Sbjct: 825 ALYGVHPSYAHLRVFGCKCYPNLSATAPHKLSPRSTMCIFLGYPLHHKGYRCLDPSTNRV 884 Query: 417 YISRHVRFSELVFSYVDFYKHLSSVPSS 500 ISRHV F E F Y + S PSS Sbjct: 885 IISRHVVFDEHSFPYANI-----SSPSS 907 Score = 53.5 bits (127), Expect(2) = 3e-11 Identities = 29/67 (43%), Positives = 35/67 (52%) Frame = +3 Query: 813 HHMITRGKAGIVKPRQXXXXXXXXXXPSSPYHVPTSYTEALKYPHWKLAMIEEYNALLHN 992 H M TRGKAG+ KP Q P VP +Y AL P+W+ AM EE+ AL N Sbjct: 1055 HTMTTRGKAGVRKPVQRLNLHASTLSP-----VPRTYRAALADPYWRTAMEEEFTALTAN 1109 Query: 993 QTWPCSP 1013 +TW P Sbjct: 1110 RTWDLVP 1116 Score = 44.3 bits (103), Expect(2) = 3e-11 Identities = 20/34 (58%), Positives = 24/34 (70%) Frame = +2 Query: 1058 KLHPDGTIDHYKARLVSQGFK*LPGIDYDQTFSP 1159 K DGT+D YKAR V +GF GID+D+TFSP Sbjct: 1133 KFQSDGTLDRYKARWVLRGFSQRLGIDFDETFSP 1166 >OMO78631.1 Integrase, catalytic core 
[Corchorus capsularis] Length = 577 Score = 115 bits (289), Expect = 1e-24 Identities = 67/151 (44%), Positives = 86/151 (56%), Gaps = 2/151 (1%) Frame = +3 Query: 90 KNRQMTQKICCLLFQARMPFDFWVEALQYAIKLINILPTPLLNHIFPHEALFGTGPVYTS 269 K+R + + LLF + P +WVEA AI LIN P+ +L+ P E L+ P Y+ Sbjct: 278 KHRNIVELGLTLLFHSHTPKWYWVEAFGTAIWLINRQPSRVLDWKSPFELLYNKSPDYSC 337 Query: 270 LRVFGSLCYPNLVRHTTIKSLPLSLPCIYPGPSINHKGFRCLNPENQRVYISRHVRFSEL 449 LRVFGS C+P L H+ K P SLPCI+ G S HKG+RCL+P + RVYISRHV F E Sbjct: 338 LRVFGSKCFPFLRSHSKNKLEPRSLPCIFLGYSELHKGYRCLHPPSGRVYISRHVTFDEK 397 Query: 450 VFSYVDFYKHLSSVPSSLF--PQNCDINNLL 536 VF + D P SLF CD+ + Sbjct: 398 VFPFKD--------PGSLFAPSDTCDLTEFI 420 >OMO89784.1 Integrase, catalytic core [Corchorus capsularis] Length = 1048 Score = 115 bits (289), Expect = 2e-24 Identities = 63/152 (41%), Positives = 92/152 (60%), Gaps = 4/152 (2%) Frame = +3 Query: 18 FSIL----GIH*ILLPNNSSAKRGK*AQKNRQMTQKICCLLFQARMPFDFWVEALQYAIK 185 FSIL GI L ++ + G +K+R + +K LL Q+ +P FWVEA Q ++ Sbjct: 529 FSILLDSNGITHQLSCPHTPQQNGVAERKHRHVVEKGLYLLSQSSLPSKFWVEAFQTSLY 588 Query: 186 LINILPTPLLNHIFPHEALFGTGPVYTSLRVFGSLCYPNLVRHTTIKSLPLSLPCIYPGP 365 LIN LPTP+L P+ LFG PVY LR FG CYP+LV + K + C++ G Sbjct: 589 LINRLPTPVLGGKRPYVVLFGKPPVYDHLRTFGCACYPHLVPYNKTKLEFKTRQCVFLGY 648 Query: 366 SINHKGFRCLNPENQRVYISRHVRFSELVFSY 461 + HKG++CL+P+++R+YISR+V F E +F + Sbjct: 649 GVQHKGYKCLDPQSRRIYISRNVAFDENLFPF 680 Score = 46.2 bits (108), Expect(2) = 1e-09 Identities = 28/72 (38%), Positives = 37/72 (51%), Gaps = 5/72 (6%) Frame = +3 Query: 813 HHMITRGKAGIVKPRQXXXXXXXXXX--PSSPYHV---PTSYTEALKYPHWKLAMIEEYN 977 H M+TR K GI KP+ S+ H PT +++A K P W+ AM +E+N Sbjct: 775 HPMMTRLKVGIRKPKALCTTKHPLPACYSSTLEHSSSEPTCFSQASKDPRWRQAMQDEFN 834 Query: 978 ALLHNQTWPCSP 1013 ALL N TW P Sbjct: 835 ALLRNNTWVLVP 846 Score = 46.2 bits (108), Expect(2) = 1e-09 Identities = 20/35 (57%), Positives = 27/35 (77%) Frame = +2 Query: 1055 VKLHPDGTIDHYKARLVSQGFK*LPGIDYDQTFSP 1159 +K DG+I+ YKARLV++GF G+DYD+TFSP Sbjct: 862 
IKYKSDGSIERYKARLVAKGFHQQLGLDYDETFSP 896 >EOY32308.1 Uncharacterized protein TCM_040047 [Theobroma cacao] Length = 678 Score = 115 bits (288), Expect = 2e-24 Identities = 66/191 (34%), Positives = 100/191 (52%), Gaps = 4/191 (2%) Frame = +3 Query: 3 KLQPHFSILGIH*ILLPNNSSAKRGK*AQKNRQMTQKICCLLFQARMPFDFWVEALQYAI 182 K+ + + GI + P + G ++++ + + LL A MP FW A Q A+ Sbjct: 310 KMSKYLATHGISHLTTPPYTLELNGAAERRHKHIVETGLTLLHHASMPLKFWSHAFQVAV 369 Query: 183 KLINILPTPLLNHIFPHEALFGTGPVYTSLRVFGSLCYPNLVRHTTIKSLPLSLPCIYPG 362 IN LPTPLLN P E LF T P Y+ L+VFG LCYP L + K P S PC++ Sbjct: 370 YPINKLPTPLLNLKSPFEILFETPPNYSKLKVFGCLCYPWLKPYNKHKLQPKSKPCVFLR 429 Query: 363 PSINHKGFRCLNPENQRVYISRHVRFSELVFSYVDFYKHLSSVPSSLFPQNCDINN---- 530 SIN ++C + E+QR+++SRHV F E +F + +S + + N + N Sbjct: 430 YSINQSAYKCFDHESQRIFVSRHVMFQEHIFPFTSAKTQTTSRQAMIEEFNALVKNQTWV 489 Query: 531 LLPSTSFMTLL 563 L+P +S T++ Sbjct: 490 LVPPSSKQTVI 500 Score = 44.7 bits (104), Expect(3) = 2e-07 Identities = 21/37 (56%), Positives = 25/37 (67%) Frame = +2 Query: 1049 F*VKLHPDGTIDHYKARLVSQGFK*LPGIDYDQTFSP 1159 F +K PDG+ID YKARLV++GF GIDY T P Sbjct: 506 FKIKRKPDGSIDRYKARLVAKGFHQREGIDYTDTLIP 542 Score = 32.0 bits (71), Expect(3) = 2e-07 Identities = 15/31 (48%), Positives = 19/31 (61%) Frame = +3 Query: 921 YTEALKYPHWKLAMIEEYNALLHNQTWPCSP 1013 +T A + AMIEE+NAL+ NQTW P Sbjct: 462 FTSAKTQTTSRQAMIEEFNALVKNQTWVLVP 492 Score = 26.9 bits (58), Expect(3) = 2e-07 Identities = 10/14 (71%), Positives = 13/14 (92%) Frame = +1 Query: 1003 LVPPTSSQSLIGCK 1044 LVPP+S Q++IGCK Sbjct: 490 LVPPSSKQTVIGCK 503 >OMO61427.1 Zinc finger, CCCH-type [Corchorus capsularis] Length = 1996 Score = 114 bits (285), Expect(2) = 3e-24 Identities = 61/127 (48%), Positives = 79/127 (62%) Frame = +3 Query: 87 QKNRQMTQKICCLLFQARMPFDFWVEALQYAIKLINILPTPLLNHIFPHEALFGTGPVYT 266 +K+R + + LLF + P +WVEA AI LIN P+ +L+ P E L+ P Y+ Sbjct: 595 RKHRNIVELGLTLLFHSHTPKRYWVEAFGTAIWLINRQPSRVLDWKSPFELLYNKSPDYS 654 Query: 267 SLRVFGSLCYPNLVRHTTIKSLPLSLPCIYPGPSINHKGFRCLNPENQRVYISRHVRFSE 446 
LRVFGS C+P L H+ K P SLPCI+ G S HKG+RCL+P + RVYISRHV F E Sbjct: 655 CLRVFGSKCFPFLRSHSKNKLEPRSLPCIFLGYSELHKGYRCLHPPSGRVYISRHVTFDE 714 Query: 447 LVFSYVD 467 VF + D Sbjct: 715 KVFPFKD 721 Score = 27.3 bits (59), Expect(2) = 3e-24 Identities = 11/22 (50%), Positives = 15/22 (68%), Gaps = 2/22 (9%) Frame = +1 Query: 37 IRYFCPTTPRQNG--ENEHKKI 96 ++Y CP TP QNG E +H+ I Sbjct: 579 LQYACPKTPEQNGLVERKHRNI 600 Score = 44.7 bits (104), Expect(3) = 5e-09 Identities = 25/63 (39%), Positives = 29/63 (46%) Frame = +3 Query: 825 TRGKAGIVKPRQXXXXXXXXXXPSSPYHVPTSYTEALKYPHWKLAMIEEYNALLHNQTWP 1004 TR GI KP +S P S ALK+P WK AM EE +AL+ N TW Sbjct: 841 TRQSHGIAKPNPKYFNDDFCFTATSIPIEPKSVKTALKHPDWKTAMEEEIHALMQNDTWE 900 Query: 1005 CSP 1013 P Sbjct: 901 LVP 903 Score = 42.4 bits (98), Expect(3) = 5e-09 Identities = 18/37 (48%), Positives = 27/37 (72%) Frame = +2 Query: 1049 F*VKLHPDGTIDHYKARLVSQGFK*LPGIDYDQTFSP 1159 F K DG+++ KARLV++GF +PG+D+ +TFSP Sbjct: 917 FKTKTKADGSLERLKARLVAKGFNQVPGVDFLETFSP 953 Score = 22.3 bits (46), Expect(3) = 5e-09 Identities = 7/14 (50%), Positives = 12/14 (85%) Frame = +1 Query: 1003 LVPPTSSQSLIGCK 1044 LVP ++S +++GCK Sbjct: 901 LVPQSNSMNIVGCK 914 >XP_008646585.1 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103628087 [Zea mays] Length = 1134 Score = 115 bits (288), Expect = 3e-24 Identities = 63/144 (43%), Positives = 82/144 (56%) Frame = +3 Query: 30 GIH*ILLPNNSSAKRGK*AQKNRQMTQKICCLLFQARMPFDFWVEALQYAIKLINILPTP 209 G+H + +S + G+ + R M + LLFQA +P +W EAL A L+N LPT Sbjct: 489 GVHLRMSCPYTSPQNGRAERMIRTMNDVVRSLLFQASLPVSYWAEALGTATYLLNRLPTK 548 Query: 210 LLNHIFPHEALFGTGPVYTSLRVFGSLCYPNLVRHTTIKSLPLSLPCIYPGPSINHKGFR 389 + H P+ ALFG P Y LRVFG CYPNL T K P S C++ G S +HKG+R Sbjct: 549 AVAHPTPYFALFGVHPSYDHLRVFGCACYPNLASTTPHKLAPRSTRCVFLGYSPDHKGYR 608 Query: 390 CLNPENQRVYISRHVRFSELVFSY 461 CL+ + RV ISRHV F E F + Sbjct: 609 CLDLASHRVLISRHVVFDESDFPF 632 >AAC67200.1 putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 1402 Score = 115 bits (288), Expect 
= 3e-24 Identities = 65/167 (38%), Positives = 93/167 (55%), Gaps = 1/167 (0%) Frame = +3 Query: 3 KLQPHFSILGIH*ILLPNNSSAKRGK*AQKNRQMTQKICCLLFQARMPFDFWVEALQYAI 182 K H GI + ++ + G +K+R + + +LFQ+++P FWVEA A Sbjct: 600 KFLQHLQNHGIQQHISYPHTPQQNGLAERKHRHLVELGLSMLFQSKVPLKFWVEAFFTAN 659 Query: 183 KLINILPTPLLNH-IFPHEALFGTGPVYTSLRVFGSLCYPNLVRHTTIKSLPLSLPCIYP 359 LIN+LPT + I P+E L T P YT+LR FG C+P + + K P SL C++ Sbjct: 660 FLINLLPTSAVEDAISPYEKLHQTTPDYTALRSFGCACFPTMRDYAMNKFDPRSLKCVFL 719 Query: 360 GPSINHKGFRCLNPENQRVYISRHVRFSELVFSYVDFYKHLSSVPSS 500 G + +KG+RCL P RVYISRHV F E + + YKHL S P++ Sbjct: 720 GYNDKYKGYRCLYPPTGRVYISRHVIFDETAYPFSHHYKHLHSQPTT 766 Score = 42.4 bits (98), Expect(2) = 7e-06 Identities = 25/68 (36%), Positives = 28/68 (41%) Frame = +3 Query: 813 HHMITRGKAGIVKPRQXXXXXXXXXXPSSPYHVPTSYTEALKYPHWKLAMIEEYNALLHN 992 H MITR K GI KP Y P + T ALK+P W AM EE Sbjct: 896 HPMITRAKVGITKPNPRYVFLSHKVT----YPEPKTVTAALKHPGWTGAMTEEMGNCSET 951 Query: 993 QTWPCSPH 1016 TW P+ Sbjct: 952 NTWSLVPY 959 Score = 37.0 bits (84), Expect(2) = 7e-06 Identities = 18/37 (48%), Positives = 25/37 (67%) Frame = +2 Query: 1049 F*VKLHPDGTIDHYKARLVSQGFK*LPGIDYDQTFSP 1159 F KLH DGT++ KAR+V++ F GI Y +T+SP Sbjct: 972 FRTKLHADGTLNKLKARIVAKCFLQEEGIGYLETYSP 1008 >JAU04955.1 Copia protein, partial [Noccaea caerulescens] Length = 817 Score = 115 bits (287), Expect = 4e-24 Identities = 61/147 (41%), Positives = 83/147 (56%) Frame = +3 Query: 60 SSAKRGK*AQKNRQMTQKICCLLFQARMPFDFWVEALQYAIKLINILPTPLLNHIFPHEA 239 +S + G+ + R + + LLFQA++P +WVEAL A L+NILP+ +N+ P Sbjct: 18 TSQQNGRAERTLRTINNLVRALLFQAKLPNTYWVEALNMAAHLLNILPSSAINNAIPFTR 77 Query: 240 LFGTGPVYTSLRVFGSLCYPNLVRHTTIKSLPLSLPCIYPGPSINHKGFRCLNPENQRVY 419 LF Y LRVFG LCYPNL+ K P S CI+ G NHKG+RCL+ +R+ Sbjct: 78 LFNKPVSYEHLRVFGCLCYPNLLPTAPNKLSPRSARCIFLGYPTNHKGYRCLDLSTRRII 137 Query: 420 ISRHVRFSELVFSYVDFYKHLSSVPSS 500 ISRHV F E F + SS P++ Sbjct: 138 ISRHVVFDENSFPFTSTLSPSSSPPAA 164 >ABF95666.1 retrotransposon protein, putative, Ty1-copia subclass, expressed 
[Oryza sativa Japonica Group] Length = 976 Score = 115 bits (287), Expect = 4e-24 Identities = 62/137 (45%), Positives = 77/137 (56%) Frame = +3 Query: 57 NSSAKRGK*AQKNRQMTQKICCLLFQARMPFDFWVEALQYAIKLINILPTPLLNHIFPHE 236 ++S + GK + R + + LLFQA +P FWVEAL A LIN PT L H P Sbjct: 339 HTSPQNGKAERTLRSLNNIVRSLLFQASLPASFWVEALYTATHLINRHPTKTLKHHTPFF 398 Query: 237 ALFGTGPVYTSLRVFGSLCYPNLVRHTTIKSLPLSLPCIYPGPSINHKGFRCLNPENQRV 416 AL+GT P Y LRVFG CYPNL K P S C++ +HKG+RC +P RV Sbjct: 399 ALYGTHPSYDHLRVFGCKCYPNLSATAANKLSPRSTLCVFRSYPTDHKGYRCFDPIYNRV 458 Query: 417 YISRHVRFSELVFSYVD 467 YISRHV F E F + + Sbjct: 459 YISRHVVFDEHSFPFAE 475 Score = 48.5 bits (114), Expect(2) = 5e-07 Identities = 22/34 (64%), Positives = 25/34 (73%) Frame = +2 Query: 1058 KLHPDGTIDHYKARLVSQGFK*LPGIDYDQTFSP 1159 K DGT+D YKAR V +GF PGIDYD+TFSP Sbjct: 557 KFQSDGTLDMYKARWVLRGFTQRPGIDYDETFSP 590 Score = 34.7 bits (78), Expect(2) = 5e-07 Identities = 17/43 (39%), Positives = 24/43 (55%), Gaps = 2/43 (4%) Frame = +3 Query: 891 PSSPYHVPTSYTE--ALKYPHWKLAMIEEYNALLHNQTWPCSP 1013 P++P T + AL P+W+ AM +EY AL+ N TW P Sbjct: 498 PAAPPGFTTKIQDKAALADPNWRAAMEDEYTALMANNTWDLVP 540