BLASTX nr result
ID: Lithospermum23_contig00033555
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Lithospermum23_contig00033555
         (410 letters)

Database: ./nr
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

OAY23996.1 hypothetical protein MANES_18G124100 [Manihot esculenta]  79   2e-16
OMO54212.1 Endonuclease/exonuclease/phosphatase [Corchorus olito...  84   2e-16
OMO52105.1 reverse transcriptase [Corchorus capsularis]              84   3e-16
OMP03957.1 hypothetical protein COLO4_10073 [Corchorus olitorius]    79   7e-15
XP_017970300.1 PREDICTED: uncharacterized protein LOC18609430 [T...  76   2e-14
XP_016173438.1 PREDICTED: uncharacterized protein LOC107615941 [...  79   3e-14
XP_017979793.1 PREDICTED: uncharacterized protein LOC18594299 [T...  78   3e-14
EOY14040.1 Retrotransposon, unclassified-like protein [Theobroma...  77   4e-14
EOY13397.1 Uncharacterized protein TCM_031959 [Theobroma cacao]      77   6e-14
EOY00433.1 Uncharacterized protein TCM_010295 [Theobroma cacao]      76   8e-14
XP_015941361.1 PREDICTED: uncharacterized protein LOC107466866 [...  75   4e-13
pir||T00833 RNA-directed DNA polymerase homolog T13L16.7 - Arabi...  74   1e-12
XP_015959674.1 PREDICTED: uncharacterized protein LOC107483570 [...  74   1e-12
OMO72106.1 hypothetical protein CCACVL1_17948 [Corchorus capsula...  71   1e-12
EOY02330.1 Ribonuclease H-like superfamily protein [Theobroma ca...  72   2e-12
OMO95972.1 reverse transcriptase [Corchorus olitorius]              73   2e-12
XP_007204004.1 hypothetical protein PRUPE_ppa025711mg, partial [...  73   2e-12
EOY21370.1 Ribonuclease H-like superfamily protein [Theobroma ca...  73   2e-12
XP_007210987.1 hypothetical protein PRUPE_ppa021345mg [Prunus pe...
72 3e-12 XP_010446023.1 PREDICTED: uncharacterized protein LOC104728787 [... 72 4e-12 >OAY23996.1 hypothetical protein MANES_18G124100 [Manihot esculenta] Length = 128 Score = 79.3 bits (194), Expect = 2e-16 Identities = 40/122 (32%), Positives = 61/122 (50%) Frame = -3 Query: 390 PRVRNFLWKWLHNVLPTKANLRRRGVKIEDCCPLCNSALETAEHVFLYCPVAPRFWYMTH 211 P+++ F+W+ + N L K NL RGVK++ C +C +E+ EH+F +C A W+ Sbjct: 5 PKIKTFIWRAIKNSLIVKENLASRGVKLDSTCCICKLNVESQEHMFFWCAYANLVWFAGP 64 Query: 210 WQF*PSEEKGRSMYEWWRLVSRLLGTEGEDGDVEQLACVIWYWWKNRCSVSFGETQQPLE 31 + P+ S +WW + TE D+ L C WY WK RC V F + L+ Sbjct: 65 SYYKPNLVGFSSFPQWWNDIVENFWTEPYILDMIALTC--WYIWKARCKVVFEKELSGLQ 122 Query: 30 SI 25 I Sbjct: 123 HI 124 >OMO54212.1 Endonuclease/exonuclease/phosphatase [Corchorus olitorius] Length = 1018 Score = 84.3 bits (207), Expect = 2e-16 Identities = 40/131 (30%), Positives = 66/131 (50%), Gaps = 1/131 (0%) Frame = -3 Query: 405 GTQSTPRVRNFLWKWLHNVLPTKANLRRRGVKIEDCCPLCNSALETAEHVFLYCPVAPRF 226 G +S P+VRNF+W+ N++ T+ NL RR + C C +E+ EH+ +CP A Sbjct: 731 GLKSVPKVRNFIWRACKNIISTRENLVRRRHGRDSSCLRCGEEVESLEHIMFFCPFAQAT 790 Query: 225 WYMTHWQF*PSEEKGRSMYEWWRLVSRLLGTEGEDGDVEQLACVIWYWWKNRCSVSF-GE 49 W ++H+ + P E S WW V+ G + ++ + W WK R + F G+ Sbjct: 791 WKLSHFSYSPRREGFTSFKNWWDKVACTFADFGSFSSISLISYLCWNIWKARNAFLFEGQ 850 Query: 48 TQQPLESIFNA 16 + +P+ NA Sbjct: 851 SGEPMRVWNNA 861 >OMO52105.1 reverse transcriptase [Corchorus capsularis] Length = 1565 Score = 84.0 bits (206), Expect = 3e-16 Identities = 40/121 (33%), Positives = 62/121 (51%), Gaps = 1/121 (0%) Frame = -3 Query: 390 PRVRNFLWKWLHNVLPTKANLRRRGVKIEDCCPLCNSALETAEHVFLYCPVAPRFWYMTH 211 P+V+NFLW+ N++PTK NL +R + C C + +E+ EH+ +CP A W +H Sbjct: 1257 PKVKNFLWRSCRNIVPTKENLVKRHCSLFSQCDRCGAEVESLEHILFFCPFAQAVWRASH 1316 Query: 210 WQF*PSEEKGRSMYEWWRLVSRLLGTEGEDGDVEQLACVIWYWWKNRCSVSF-GETQQPL 34 + + P E S +WW + + + G VE + + W WK R S F G P+ Sbjct: 1317 FSYSPRSEGFVSFLKWWEESANTIVSFGSLNVVELIRYLCWNVWKARNSFVFEGREGNPI 1376 Query: 33 E 31 E Sbjct: 1377 E 1377 
>OMP03957.1 hypothetical protein COLO4_10073 [Corchorus olitorius] Length = 307 Score = 79.0 bits (193), Expect = 7e-15 Identities = 40/126 (31%), Positives = 59/126 (46%), Gaps = 1/126 (0%) Frame = -3 Query: 405 GTQSTPRVRNFLWKWLHNVLPTKANLRRRGVKIEDCCPLCNSALETAEHVFLYCPVAPRF 226 G P++RNFLWK N+ PT NL RR C C +ET EH+ +CP A Sbjct: 23 GLNVAPKIRNFLWKASKNINPTGENLVRRHHGRVSICQRCREEVETMEHILFFCPFAQAT 82 Query: 225 WYMTHWQF*PSEEKGRSMYEWWRLVSRLLGTEGEDGDVEQLACVIWYWWKNRCSVSF-GE 49 W ++ + + P +E S +WW V G + + + W+ WK R S F G+ Sbjct: 83 WKVSSFNYSPRKEGFTSFLQWWIQVFNTFVEAGSFSAIGLASYLCWHIWKPRNSFLFEGQ 142 Query: 48 TQQPLE 31 + P + Sbjct: 143 SDDPTQ 148 >XP_017970300.1 PREDICTED: uncharacterized protein LOC18609430 [Theobroma cacao] Length = 216 Score = 76.3 bits (186), Expect = 2e-14 Identities = 39/124 (31%), Positives = 62/124 (50%) Frame = -3 Query: 375 FLWKWLHNVLPTKANLRRRGVKIEDCCPLCNSALETAEHVFLYCPVAPRFWYMTHWQF*P 196 F+WK +H++LPT++ L +RGV IE CPLC +ETA H C + W T F Sbjct: 3 FMWKVIHDILPTRSELIKRGVNIEVMCPLCEIEVETAFHCLCNCQFSRLVWLTTKCGFRD 62 Query: 195 SEEKGRSMYEWWRLVSRLLGTEGEDGDVEQLACVIWYWWKNRCSVSFGETQQPLESIFNA 16 S+ +W + V +L + + E+ C++W WK R V F +++ + Sbjct: 63 ISNFHDSIIDWLQGVFEVLNKD----ETEEFICLLWAIWKTRNVVVFNQSRSTPMVVVEI 118 Query: 15 GIKL 4 G+ L Sbjct: 119 GLDL 122 >XP_016173438.1 PREDICTED: uncharacterized protein LOC107615941 [Arachis ipaensis] Length = 1491 Score = 78.6 bits (192), Expect = 3e-14 Identities = 42/128 (32%), Positives = 63/128 (49%), Gaps = 4/128 (3%) Frame = -3 Query: 387 RVRNFLWKWLHNVLPTKANLRRRGVKIEDCCPLCNSALETAEHVFLYCPVAPRFWYMTHW 208 ++R FLWK + +LP +NL +R ++ C +C ET EH L CP W+ + Sbjct: 930 KIRMFLWKAVEGILPVNSNLYKRRCAVKPSCSICQDENETVEHALLLCPWTRAVWFGSSI 989 Query: 207 QF*PSEEKGRSMYEW-WRLVSRLLGTEGEDGD--VEQLACVIWYWWKNRCSVSFGE-TQQ 40 Q P+ S W W V ++ G+D + + +L CV WY WK R F + T Sbjct: 990 QITPTAYNVASFGRWIWDTVQKIRRETGKDQERILCKLGCVCWYIWKTRNQYIFQQATIN 1049 Query: 39 PLESIFNA 16 P ++I NA Sbjct: 1050 PKQAIINA 1057 >XP_017979793.1 PREDICTED: uncharacterized protein LOC18594299 
[Theobroma cacao] Length = 1056 Score = 78.2 bits (191), Expect = 3e-14 Identities = 40/122 (32%), Positives = 66/122 (54%), Gaps = 1/122 (0%) Frame = -3 Query: 375 FLWKWLHNVLPTKANLRRRGVKIEDCCPLCNSALETAEHVFLYCPVAPRFWYMTHWQF*P 196 FLWK L+ +LPT+ L R + E CP C++ LET H CP+A W+ + W F Sbjct: 745 FLWKTLNGILPTRQALIYRSIIFESNCPSCDNELETDFHCLCCCPLARAVWHFSKWGFTN 804 Query: 195 SEEKGRSMYEWWRLVSRLLGTEGEDGDVEQLACVIWYWWKNR-CSVSFGETQQPLESIFN 19 E S+ +W + ++L E+ ++ ++ C++W WK R + G++ +PL+ I Sbjct: 805 IEVLFSSVQDWIFYIFQML----ENEEISKIGCILWALWKVRNLKIFQGKSYEPLQVIEL 860 Query: 18 AG 13 AG Sbjct: 861 AG 862 >EOY14040.1 Retrotransposon, unclassified-like protein [Theobroma cacao] Length = 366 Score = 77.4 bits (189), Expect = 4e-14 Identities = 39/107 (36%), Positives = 56/107 (52%) Frame = -3 Query: 375 FLWKWLHNVLPTKANLRRRGVKIEDCCPLCNSALETAEHVFLYCPVAPRFWYMTHWQF*P 196 F+W+ LH LPT+A L RR + I C P+CN+ LE H+ C A W W F Sbjct: 155 FIWRVLHGCLPTQATLNRRNIAINACYPMCNADLEMDCHILCECSFAKAVWLACKWGFCD 214 Query: 195 SEEKGRSMYEWWRLVSRLLGTEGEDGDVEQLACVIWYWWKNRCSVSF 55 ++ + S+ EW+ L + L + VE+L CV+W WK S+ F Sbjct: 215 NDHQFSSLKEWFLLRLQKL----DRIIVEELCCVMWAIWKGHNSLVF 257 >EOY13397.1 Uncharacterized protein TCM_031959 [Theobroma cacao] Length = 1217 Score = 77.4 bits (189), Expect = 6e-14 Identities = 39/122 (31%), Positives = 66/122 (54%), Gaps = 1/122 (0%) Frame = -3 Query: 375 FLWKWLHNVLPTKANLRRRGVKIEDCCPLCNSALETAEHVFLYCPVAPRFWYMTHWQF*P 196 FLWK L+ +LPT+ L R + E CP C++ LET H CP+A W+ + W F Sbjct: 906 FLWKTLNGILPTRQALIYRSIIFESNCPSCDNELETDFHCLCCCPLARAVWHFSKWGFTN 965 Query: 195 SEEKGRSMYEWWRLVSRLLGTEGEDGDVEQLACVIWYWWKNR-CSVSFGETQQPLESIFN 19 E S+ +W + +++ E+ ++ ++ C++W WK R + G++ +PL+ I Sbjct: 966 IEVLFSSVQDWIFYIFQMM----ENEEISKIGCILWALWKVRNLKIFQGKSYEPLQVIEL 1021 Query: 18 AG 13 AG Sbjct: 1022 AG 1023 >EOY00433.1 Uncharacterized protein TCM_010295 [Theobroma cacao] Length = 290 Score = 75.9 bits (185), Expect = 8e-14 Identities = 39/124 (31%), Positives = 62/124 (50%) Frame = -3 Query: 375 
FLWKWLHNVLPTKANLRRRGVKIEDCCPLCNSALETAEHVFLYCPVAPRFWYMTHWQF*P 196 F+WK +H++LPT++ L +RGV IE CPLC +ETA H C + W T F Sbjct: 77 FMWKVIHDILPTRSELIKRGVNIELMCPLCEIEVETAFHCLCNCQFSRLVWLTTKCGFRD 136 Query: 195 SEEKGRSMYEWWRLVSRLLGTEGEDGDVEQLACVIWYWWKNRCSVSFGETQQPLESIFNA 16 S+ +W + V +L + + E+ C++W WK R V F +++ + Sbjct: 137 ISNFHDSIIDWLQGVFEVLNKD----ETEEFICLLWAIWKTRNVVVFNQSRSTPMVVVEI 192 Query: 15 GIKL 4 G+ L Sbjct: 193 GLDL 196 >XP_015941361.1 PREDICTED: uncharacterized protein LOC107466866 [Arachis duranensis] Length = 1315 Score = 75.1 bits (183), Expect = 4e-13 Identities = 41/124 (33%), Positives = 59/124 (47%), Gaps = 4/124 (3%) Frame = -3 Query: 375 FLWKWLHNVLPTKANLRRRGVKIEDCCPLCNSALETAEHVFLYCPVAPRFWYMTHWQF*P 196 FLWK + +LP +NL +R ++ C +C ET EH L CP W+ + Q P Sbjct: 903 FLWKAVEGILPVNSNLYKRRCAVKPSCSICQDENETVEHALLLCPWTRAVWFGSSIQITP 962 Query: 195 SEEKGRSMYEW-WRLVSRLLGTEGEDGD--VEQLACVIWYWWKNRCSVSFGE-TQQPLES 28 + S W W V ++ G+D + + L CV WY WK R F + T P ++ Sbjct: 963 TAYNVTSFGRWIWDTVQKIRRETGKDQERVLCNLGCVCWYIWKTRNQYIFQQATINPKQA 1022 Query: 27 IFNA 16 I NA Sbjct: 1023 IINA 1026 >pir||T00833 RNA-directed DNA polymerase homolog T13L16.7 - Arabidopsis thaliana (fragment) Length = 1365 Score = 73.9 bits (180), Expect = 1e-12 Identities = 39/118 (33%), Positives = 63/118 (53%), Gaps = 4/118 (3%) Frame = -3 Query: 396 STPRVRNFLWKWLHNVLPTKANLRRRGVKIEDCCPLCNSALETAEHVFLYCPVAPRFWYM 217 + P++R FLWK LH +P + LR RG++ +D C +C++ ET H+ CP+A + W + Sbjct: 1050 TAPKIRIFLWKALHGAIPVEDRLRTRGIRSDDGCLMCDTENETINHILFECPLARQVWAI 1109 Query: 216 THWQF*PSEEKGRSMYEWWRLVSRLLGTEGEDGDVEQLACV----IWYWWKNRCSVSF 55 TH E S+Y +SRL+ ++ L V +W+ WKNR ++ F Sbjct: 1110 THLSS-AGSEFSNSVY---TNMSRLIDLTQQNDLPHHLRFVSPWILWFLWKNRNALLF 1163 >XP_015959674.1 PREDICTED: uncharacterized protein LOC107483570 [Arachis duranensis] Length = 1522 Score = 73.6 bits (179), Expect = 1e-12 Identities = 40/123 (32%), Positives = 58/123 (47%), Gaps = 4/123 (3%) Frame = -3 Query: 399 QSTPRVRNFLWKWLHNVLPTKANLRRRGVKIEDCCPLCNSALETAEHVFLYCPVAPRFWY 220 
Q +VR FLWK +H +LP NL ++ + + C +C ET EH L CP W+ Sbjct: 1011 QVPQKVRMFLWKAVHRILPVNKNLHQKRITVAPTCSICQREEETIEHALLLCPWTRAVWF 1070 Query: 219 MTHWQF*PSEEKGRSMYEW----WRLVSRLLGTEGEDGDVEQLACVIWYWWKNRCSVSFG 52 ++ Q P+ RS EW R + GTE ++ + L C+ W WK R F Sbjct: 1071 GSNIQIVPTAYNVRSFGEWILDKIRRIKAETGTE-QEKILSNLGCLSWCIWKARNQYIFQ 1129 Query: 51 ETQ 43 T+ Sbjct: 1130 HTK 1132 >OMO72106.1 hypothetical protein CCACVL1_17948 [Corchorus capsularis] Length = 189 Score = 70.9 bits (172), Expect = 1e-12 Identities = 26/61 (42%), Positives = 39/61 (63%) Frame = -3 Query: 405 GTQSTPRVRNFLWKWLHNVLPTKANLRRRGVKIEDCCPLCNSALETAEHVFLYCPVAPRF 226 G P+++ FLW+ HN+LPTK N +RRG+ I+D CP+CN + H+F CP + + Sbjct: 49 GASVQPKIKFFLWRVRHNILPTKLNWQRRGIPIDDACPMCNGTESSLLHIFFTCPFSRKV 108 Query: 225 W 223 W Sbjct: 109 W 109 >EOY02330.1 Ribonuclease H-like superfamily protein [Theobroma cacao] Length = 266 Score = 72.0 bits (175), Expect = 2e-12 Identities = 38/122 (31%), Positives = 65/122 (53%), Gaps = 1/122 (0%) Frame = -3 Query: 375 FLWKWLHNVLPTKANLRRRGVKIEDCCPLCNSALETAEHVFLYCPVAPRFWYMTHWQF*P 196 FLWK L+ +LPT+ L R + E CP C++ LET + CP+A W+ W F Sbjct: 130 FLWKTLNGILPTRQALVYRSILYESNCPSCDNKLETDFYCLCCCPLARVVWHFCKWGFTN 189 Query: 195 SEEKGRSMYEWWRLVSRLLGTEGEDGDVEQLACVIWYWWKNR-CSVSFGETQQPLESIFN 19 E S+ +W + +++ E+ +++++ C +W WK R + G++ +PL+ I Sbjct: 190 IEVLFSSVQDWIFYIFQIM----ENEEIKEIGCFLWALWKVRNLKIFQGKSFEPLQVIEL 245 Query: 18 AG 13 AG Sbjct: 246 AG 247 >OMO95972.1 reverse transcriptase [Corchorus olitorius] Length = 464 Score = 72.8 bits (177), Expect = 2e-12 Identities = 29/91 (31%), Positives = 50/91 (54%) Frame = -3 Query: 387 RVRNFLWKWLHNVLPTKANLRRRGVKIEDCCPLCNSALETAEHVFLYCPVAPRFWYMTHW 208 +++NFLW+ N++PTK NL +R + C CN +E+ EH+ +CP A W +++ Sbjct: 184 KLQNFLWRACRNIIPTKENLVKRHCSYDPVCVRCNEDVESLEHILFFCPFAQAAWKASYF 243 Query: 207 QF*PSEEKGRSMYEWWRLVSRLLGTEGEDGD 115 + P E +WW V+ + G++GD Sbjct: 244 SYSPRREGFVGFLKWWEEVATDIANFGKEGD 274 >XP_007204004.1 hypothetical protein PRUPE_ppa025711mg, partial [Prunus persica] Length = 534 
Score = 72.8 bits (177), Expect = 2e-12 Identities = 37/121 (30%), Positives = 58/121 (47%), Gaps = 2/121 (1%) Frame = -3 Query: 405 GTQSTPRVRNFLWKWLHNVLPTKANLRRRGVKIEDCCPLCNSALETAEHVFLYCPVAPRF 226 G+Q P++ NF W+ + LPT+ L RR + CP+C E+ EH+FL C Sbjct: 289 GSQMVPKLMNFWWRMVRGCLPTRDALFRRHLGTSPLCPICGEFPESVEHLFLLCNWVQLV 348 Query: 225 WYMTHWQF*PSEEKGRSMYEWWRLVSRLLGTEGEDGD--VEQLACVIWYWWKNRCSVSFG 52 W+ + + + SM EW V ++ + G D + Q+ W WK+RCS F Sbjct: 349 WFGGPLNYKINRQSITSMSEWMMQVLKISQSLGYDRKWLISQIVYTCWSIWKSRCSAVFD 408 Query: 51 E 49 + Sbjct: 409 D 409 >EOY21370.1 Ribonuclease H-like superfamily protein [Theobroma cacao] Length = 569 Score = 72.8 bits (177), Expect = 2e-12 Identities = 37/108 (34%), Positives = 53/108 (49%) Frame = -3 Query: 375 FLWKWLHNVLPTKANLRRRGVKIEDCCPLCNSALETAEHVFLYCPVAPRFWYMTHWQF*P 196 F+WK L LPTK L R + +++ C C ET H+ YC A W + W F Sbjct: 258 FMWKALKGALPTKKALSHRKINVDNICVFCQEDEETDFHILCYCQFARATWLSSKWGFRD 317 Query: 195 SEEKGRSMYEWWRLVSRLLGTEGEDGDVEQLACVIWYWWKNRCSVSFG 52 + S+++W VS LG + DV ++AC++W WK R FG Sbjct: 318 TGAHSTSVFDWIFQVSCNLGPK----DVGEIACILWAIWKARNLRIFG 361 >XP_007210987.1 hypothetical protein PRUPE_ppa021345mg [Prunus persica] Length = 759 Score = 72.4 bits (176), Expect = 3e-12 Identities = 39/116 (33%), Positives = 53/116 (45%), Gaps = 8/116 (6%) Frame = -3 Query: 390 PRVRNFLWKWLHNVLPTKANLRRRGVKIEDCCPLCNSALETAEHVFLYCPVAPRFWYMTH 211 P++RNFLW+ L N L T ANL ++ + CPLCN ETAEH+ L CP W+ Sbjct: 450 PKIRNFLWRALRNCLATSANLHKKKIARSPMCPLCNDHPETAEHILLLCPWVEPVWFRCS 509 Query: 210 WQF*PSEEKGRSMYEWWRLVSRLLGTEGEDGDVEQ--------LACVIWYWWKNRC 67 + + S +W LG E G Q +A W WK++C Sbjct: 510 LNLRINRQAVTSFGQW-------LGNVIEKGKTPQERSRCLTVIAYFCWQIWKDKC 558 >XP_010446023.1 PREDICTED: uncharacterized protein LOC104728787 [Camelina sativa] Length = 342 Score = 71.6 bits (174), Expect = 4e-12 Identities = 38/119 (31%), Positives = 60/119 (50%), Gaps = 1/119 (0%) Frame = -3 Query: 390 PRVRNFLWKWLHNVLPTKANLRRRGVKIEDCCPLCNSALETAEHVFLYCPVAPRFWYMTH 211 P+V++F+W+ L + ANLRRRG+ + C C + +ET HV 
CP A + W ++H Sbjct: 33 PKVKHFMWQVLTGSISVSANLRRRGIDCDVGCMRCGADVETINHVIFVCPPARQVWALSH 92 Query: 210 WQF*PSEEKGRSMYEWWRLVSRLLGTEGEDGDVEQLACVIWYWWKNRCSVSF-GETQQP 37 P S+Y V LG+ +E ++WY WK R + F +T++P Sbjct: 93 VPVGPQHFPTDSIYV---NVDHFLGSTNPGSQIEIFPWLMWYIWKVRNACVFENQTERP 148