BLASTX nr result
ID: Lithospermum22_contig00002791
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Lithospermum22_contig00002791
         (1354 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]               120   4e-35
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       114   2e-33
dbj|BAD95408.1| hypothetical protein [Arabidopsis thaliana]            99   2e-29
emb|CAB72467.1| putative protein [Arabidopsis thaliana]               102   3e-26
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   117   7e-24

>gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]
          Length = 872

 Score = 120 bits (301), Expect(3) = 4e-35
 Identities = 70/234 (29%), Positives = 107/234 (45%), Gaps = 6/234 (2%)
 Frame = +3

Query: 6    LPKYVVKEVEKRIRSFLWSGKKEGPYMAKVSWDTVCLPLVEGGLGFKDMFDWNQVCLCKL 185
            LP+ ++E++K SFLWSG + + AK+SWD VC P EGGLG +++ + N V KL
Sbjct: 478  LPRQCIREIDKLCSSFLWSGSEMSSHKAKISWDIVCKPKAEGGLGLRNLKEANDVSCLKL 537

Query: 186  LWNIASRKETLWVQWVHTVRLKGVSVWG-XXXXXXXXXXXXXILRVRPLVQDKYTITV*N 362
            +W I S +LW +WV ++ S+W IL++R + + + V N
Sbjct: 538  VWRIISNSNSLWTKWVAEYLIRKKSIWSLKQSTSMGSWIWRKILKIRDVAKSFSRVEVGN 597

Query: 363  GESVSFLYDSWSSLGPVWDFLSDQERVSLRLPNDVSXXXXXXXXXXXXXXXXXXXXXXDS 542
            GES SF YD WS+ G + D + D+ + L +P + S +
Sbjct: 598  GESASFWYDHWSAHGRLIDTVGDKGTIDLGIPREASVADAWTRRSRRRHRTSLLNEIEEM 657

Query: 543  EA-SLVTFSTGEDKWGWRN*TNGV----YS*KHVWEAVRRRKEKEPWTKWLWSR 689
            A + S ED WR N V +S + W ++ W K +W R
Sbjct: 658  MAYQRIHHSDAEDTVLWRG-KNDVFKPHFSTRDTWHLIKATSSTVSWHKGVWFR 710

 Score = 53.1 bits (126), Expect(3) = 4e-35
 Identities = 21/58 (36%), Positives = 35/58 (60%), Gaps = 3/58 (5%)
 Frame = +1

Query: 703  PRHAFIVWVMFHEKLPTRD*LIRWGI--QVEAECIFC-SGIEDQQHLFFTCSYSARVW 867
            P++A W+ H +LPT D +++W V C+ C + + +HLFF+CSY++ VW
Sbjct: 714  PKYALCTWLAIHNRLPTGDRMLKWNSSGSVSGNCVLCTNNSKTLEHLFFSCSYASTVW 771

 Score = 22.7 bits (47), Expect(3) = 4e-35
 Identities = 8/20 (40%), Positives = 13/20 (65%)
 Frame = +3

Query: 972  LMQLAFMATVYVLWQEWNSK 1031
            L + F AT+Y +W+E N +
Sbjct: 806  LTRYIFQATIYHVWRERNGR 825

>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score = 114 bits (286), Expect(2) = 2e-33
 Identities = 69/240 (28%), Positives = 103/240 (42%), Gaps = 9/240 (3%)
 Frame = +3

Query: 6    LPKYVVKEVEKRIRSFLWSGKKEGPYMAKVSWDTVCLPLVEGGLGFKDMFDWNQVCLCKL 185
            LPK +K +E FLWSG E KVSW +CLP EGGLG + + +WN+ +L
Sbjct: 824  LPKGCIKRIESLCSRFLWSGNIEQAKGIKVSWAALCLPKSEGGLGLRRLLEWNKTLSMRL 883

Query: 186  LWNIASRKETLWVQWVHTVRLKGVSVWGXXXXXXXXXXXXXILRVRPLVQDKYTITV*NG 365
            +W + K++LW W H L S W +L +RPL V NG
Sbjct: 884  IWRLFVAKDSLWADWQHLHHLSRGSFWAVEGGQSDSWTWKRLLSLRPLAHQFLVCKVGNG 943

Query: 366  ESVSFLYDSWSSLGPVWDFLSDQERVSLRLP--NDVSXXXXXXXXXXXXXXXXXXXXXXD 539
            + YD+W+SLGP++ + D SLR+P V+ D
Sbjct: 944  LKADYWYDNWTSLGPLFRIIGDIGPSSLRVPLLAKVASAFSEDGWRLPVSRSAPAKGIHD 1003

Query: 540  SEASLVTFSTGE---DKWGWRN*TNGV----YS*KHVWEAVRRRKEKEPWTKWLWSRSDI 698
            ++ ST + D++ W NG +S WEA+R + + W +W + +
Sbjct: 1004 HLCTVPVPSTAQEDVDRYEWS--VNGFLCQGFSAAKTWEAIRPKATVKSWASSIWFKGAV 1061

 Score = 55.8 bits (133), Expect(2) = 2e-33
 Identities = 25/64 (39%), Positives = 37/64 (57%), Gaps = 1/64 (1%)
 Frame = +1

Query: 700  IPRHAFIVWVMFHEKLPTRD*LIRWGIQVEAECIFCS-GIEDQQHLFFTCSYSARVWRLI 876
            +P++AF +WV +L TR L WG C+ CS E + HL C +SA+VWRL+
Sbjct: 1061 VPKYAFNMWVSHLNRLLTRQRLASWGHIQSDACVLCSFASESRDHLLLICEFSAQVWRLV 1120

Query: 877  LQRL 888
            +R+
Sbjct: 1121 FRRI 1124

>dbj|BAD95408.1| hypothetical protein [Arabidopsis thaliana]
          Length = 478

 Score = 99.4 bits (246), Expect(2) = 2e-29
 Identities = 68/234 (29%), Positives = 100/234 (42%), Gaps = 8/234 (3%)
 Frame = +3

Query: 6    LPKYVVKEVEKRIRSFLWSGKKEGPYMAKVSWDTVCLPLVEGGLGFKDMFDWNQVCLCKL 185
            LP +KE++ SFLWSG + AKV+W VC P EGGLG + + + N+V L KL
Sbjct: 87   LPSACIKEIDSICSSFLWSGPELNTKKAKVAWSDVCTPKDEGGLGIRSLKEANKVSLLKL 146

Query: 186  LWNIASRKETLWVQWVHTVRLKGVSVWG-XXXXXXXXXXXXXILRVRPLVQDKYTITV*N 362
            +W + S +LWVQW+ L+ S W IL+ R L + N
Sbjct: 147  IWRMLS-STSLWVQWLRLYLLRKGSFWSISGNTTLGSWMWKKILKHRALASGFVKHDIHN 205

Query: 363  GESVSFLYDSWSSLGPVWDFLSDQERVSLRLPNDVSXXXXXXXXXXXXXXXXXXXXXXD- 539
            G + SF +D+WS +G + D + + + + S D
Sbjct: 206  GSNTSFWFDNWSKIGRLIDVTGHRGCIDMGITLHASVAEAVVNHRPRRHRHDTLLRIEDV 265

Query: 540  -SEASLVTFSTGEDKWGWRN*TNG-----VYS*KHVWEAVRRRKEKEPWTKWLW 683
            +E ++GED W+ NG ++ K W A R K K W K +W
Sbjct: 266  IAEVRHQGLTSGEDTVRWKG--NGDIFKPCFNTKETWAATREPKLKVNWYKGVW 317

 Score = 57.8 bits (138), Expect(2) = 2e-29
 Identities = 23/72 (31%), Positives = 40/72 (55%), Gaps = 1/72 (1%)
 Frame = +1

Query: 703  PRHAFIVWVMFHEKLPTRD*LIRWGIQVEAECIFCSG-IEDQQHLFFTCSYSARVWRLIL 879
            P+++ + W+ +L T D ++ W ++ C+ C +E + HLFFTC YSA VW +
Sbjct: 323  PKYSVLAWIAIKNRLTTGDRMLSWNAGADSSCVLCHHLVETRDHLFFTCPYSAEVWSTLT 382

Query: 880  QRLKACGGTMCW 915
            ++L + T W
Sbjct: 383  RKLLSQHFTNRW 394

>emb|CAB72467.1| putative protein [Arabidopsis thaliana]
          Length = 762

 Score = 102 bits (254), Expect(2) = 3e-26
 Identities = 68/234 (29%), Positives = 102/234 (43%), Gaps = 8/234 (3%)
 Frame = +3

Query: 6    LPKYVVKEVEKRIRSFLWSGKKEGPYMAKVSWDTVCLPLVEGGLGFKDMFDWNQVCLCKL 185
            LP+ ++E+EK SFLWSG AK+SW+ VC P EGGLG + + + N VC KL
Sbjct: 384  LPRACIQEIEKLCSSFLWSGTNLNSKKAKISWNQVCKPKSEGGLGLRSLKEANDVCCLKL 443

Query: 186  LWNIASRKETLWVQWVHTVRLKGVSVW-GXXXXXXXXXXXXXILRVRPLVQDKYTITV*N 362
            +W I S ++LWV+WV LK W IL+ R + + V N
Sbjct: 444  VWRIISHGDSLWVKWVEHNLLKREIFWIVKENANLGSWIWKKILKYRGVAKRFCKAEVGN 503

Query: 363  GESVSFLYDSWSSLGPVWDFLSDQERVSLRLPNDVSXXXXXXXXXXXXXXXXXXXXXXDS 542
            GES SF +D WS LG + D + + + + +S +
Sbjct: 504  GESTSFWFDDWSLLGRLIDVAGIRGTIDMGISRTMSVADAWTSRRRRHHRQEILNTIEEV 563

Query: 543  EASLVTFSTGEDKWG---WRN*TNGVY----S*KHVWEAVRRRKEKEPWTKWLW 683
            ++ T + + G W+ N +Y S K+ W +R + W K +W
Sbjct: 564  LSTQHQKRTQQQQQGRVLWKG-KNDIYKDKFSTKNTWNYLRTTSNEVAWHKGVW 616

 Score = 43.9 bits (102), Expect(2) = 3e-26
 Identities = 18/47 (38%), Positives = 29/47 (61%), Gaps = 1/47 (2%)
 Frame = +1

Query: 703  PRHAFIVWVMFHEKLPTRD*LIRWGIQVEAECIFC-SGIEDQQHLFF 840
            P+++F +W+ H++L T +I+W +C FC GIE + HLFF
Sbjct: 622  PKYSFCLWLAAHDRLATGARMIKWNRGETGDCTFCRQGIETRDHLFF 668

>dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein
          [Arabidopsis thaliana]
          Length = 1223

 Score = 117 bits (293), Expect = 7e-24
 Identities = 92/361 (25%), Positives = 154/361 (42%), Gaps = 16/361 (4%)
 Frame = +3

Query: 6    LPKYVVKEVEKRIRSFLWSGKKEGPYMAKVSWDTVCLPLVEGGLGFKDMFDWNQVCLCKL 185
            LP+ ++E+EK +FLWSG + AK+SW VC P EGGLG + + + N VC KL
Sbjct: 831  LPRKCIRELEKMCSAFLWSGTEMNSNKAKISWHMVCKPKDEGGLGLRSLKEANDVCCLKL 890

Query: 186  LWNIASRKETLWVQWVHTVRLKGVSVWG-XXXXXXXXXXXXXILRVRPLVQDKYTITV*N 362
            +W I S +LWV+WV L+ S W +L+ R + + + V N
Sbjct: 891  VWKIVSHSNSLWVKWVDQHLLRNASFWEVKQTVSQGSWIWKKLLKYREVAKTLSKVEVGN 950

Query: 363  GESVSFLYDSWSSLGPVWDFLSDQERVSLRLPNDVSXXXXXXXXXXXXXXXXXXXXXXDS 542
            G+ SF YD+WS LG + + D+ + L + ++ D+
Sbjct: 951  GKQTSFWYDNWSDLGQLLERTGDRGLIDLGISRRMTVEEAWTNRRQRRHRNDVYNVIEDA 1010

Query: 543  -EASLVTFSTGEDKWGWRN*TN---GVYS*KHVWEAVRRRKEKEPWTKWLW-SRSDIYTT 707
            + S T + EDK WR ++ +S + W R + PW K +W S + +
Sbjct: 1011 LKKSWDTRTETEDKVLWRGKSDVFRTTFSTRDTWHHTRSTSARVPWHKVIWFSHATPKYS 1070

Query: 708  TCIYCL--GDVP*EITNQRLTNSLGDTGRS*MYIL--------FWY*RPATSILYL*LFG 857
            C + G +P T R+ N ++ ++ TS++++ L
Sbjct: 1071 FCSWLAAHGRLP---TGDRMINWANGIATDCIFCQGTLETRDHLFFTCSFTSVIWVDL-- 1125

Query: 858  QSLEIDSPKIEGMWRDDVLA*EKTVVHREFGG*IIRNRLMQLAFMATVYVLWQEWNSKIF 1037
            + I + W+ + A + HR + L + F AT+Y++W+E N +
Sbjct: 1126 -ARGIFKTQYTSHWQSIIEAITNSQHHR------VEWFLRRYVFQATIYIVWRERNGRRH 1178

Query: 1038 G 1040
            G
Sbjct: 1179 G 1179