BLASTX nr result
ID: Angelica23_contig00023698
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Angelica23_contig00023698 (1486 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003635424.1| PREDICTED: LOW QUALITY PROTEIN: putative rib... 283 8e-74 ref|XP_002522452.1| nucleic acid binding protein, putative [Rici... 251 5e-64 gb|AAG13524.1|AC068924_29 putative non-LTR retroelement reverse ... 228 4e-57 gb|AAP54617.2| retrotransposon protein, putative, unclassified [... 228 4e-57 emb|CCA66036.1| hypothetical protein [Beta vulgaris subsp. vulga... 218 3e-54 >ref|XP_003635424.1| PREDICTED: LOW QUALITY PROTEIN: putative ribonuclease H protein At1g65750-like [Vitis vinifera] Length = 820 Score = 283 bits (724), Expect = 8e-74 Identities = 166/490 (33%), Positives = 242/490 (49%), Gaps = 4/490 (0%) Frame = +1 Query: 19 KHAGGMGFRSLRDFNLALLGKQAWRLVTKPDSLASRVFKARYYPTTSFLEANLGSNPSFI 198 K GG+GFR L DFNLALL KQ WR + PDSL +R+F+ARY+ +SFL A LGSNPS++ Sbjct: 316 KKDGGLGFRKLXDFNLALLAKQGWRFLRNPDSLVTRIFQARYFRNSSFLNAELGSNPSYM 375 Query: 199 WRSICASQFMLLNGVRIRIGSGQSTSILGSP*LPDEANPRITTDNQT-LENAKVSSLMVV 375 WRSI A+Q +L G I SG + G LPD +N I T + KV L+ Sbjct: 376 WRSILAAQGLLKRGCYWSIASGTKVQVWGDSWLPDSSNRLIITPPVAGFDGIKVDELITE 435 Query: 376 GRLEWDEDIIWDVCNERDAXXXXXXXXXXXXXTDQWYWGKEKTRIYTVKSAYRSIH--ED 549 G W ED I D RDA DQ W + YT +S Y ++ Sbjct: 436 GL--WREDFIRDKFMARDADLILSIPLPMSSREDQISWSFDARGEYTARSGYGALRCFRQ 493 Query: 550 KAYQLQREANSGF-WRKMWNLKVPPKVKNLIWLTVTEVTGCLPTRTQLRTKHVEITSMCP 726 + + +S F W ++W + PPK+ N W CLPTR L +HV+ CP Sbjct: 494 STALVASDVDSNFVWAQLWKVTAPPKILNFAW---RAARNCLPTRFALTIRHVDTPMCCP 550 Query: 727 WCNAADETMYHTLVGCEFTIEIWSKIGIHVKEEQGRTIREWLDSNFNAYDKRKTGEIVMI 906 C + ET H LV C ++W + G+ + + + +WL + F D + + + Sbjct: 551 ICRSELETTLHALVECVAARDVWDESGLAMLQGNFGSFVDWLATMFAYCDFVVFAKYLAV 610 Query: 907 CWAIWGARNKLIWEQRNPMGQQIILAAQTFFEQWNTAQDKMNVSMESFIPGDGLDKWMKP 1086 CW +W RN ++W R QQ++ T E W A + + ++ +P KW KP Sbjct: 611 CWGLWWRRNDVVWNGRIWHSQQVVNGCFTMLESWFHANETLATAVT--VPSYS-SKWQKP 667 Query: 1087 ENDMFKVNTDATLFADTGRYNFVFVVRDAEGEMVDAGATCRAGVVQPEVAEALRIKEALS 1266 + K+N D +F D G VF RD +G + A P+V EAL ++E LS Sbjct: 668 DYGWIKINVDGAVFPDKGAIGAVF--RDHQGRFMGGFAKPFPHQTLPKVVEALGVREVLS 725 Query: 1267 WAKTQPRAKMRIEIENLLVVQGVRSNTKMTSYFGGIIEECRYLLKELSLVSLFFVKRSAN 1446 W + R+++ +E + L VVQ ++ + + FG II +C +L+ L V + + +RSAN Sbjct: 726 WIHERSRSRIVVETDCLRVVQAIQHKSCPNTSFGFIIVDCLDVLQHLVDVQVVYARRSAN 785 Query: 1447 RVAHALTRAS 1476 AH L + Sbjct: 786 SAAHCLANGA 795 >ref|XP_002522452.1| nucleic acid binding protein, putative [Ricinus communis] gi|223538337|gb|EEF39944.1| nucleic acid binding protein, putative [Ricinus communis] Length = 483 Score = 251 bits (640), Expect = 5e-64 Identities = 150/470 (31%), Positives = 237/470 (50%), Gaps = 3/470 (0%) Frame = +1 Query: 76 GKQAWRLVTKPDSLASRVFKARYYPTTSFLEANLGSNPSFIWRSICASQFMLLNGVRIRI 255 G++AWRLV PD L +++KARYYP FL + LG+NPS+IW S+ + ++ GV+ +I Sbjct: 14 GRRAWRLVENPDLLVGKLYKARYYPNGDFLSSLLGTNPSYIWASLQQVRDLICKGVQWKI 73 Query: 256 GSGQSTSILGSP*LPDEANPRITTD-NQTLENAKVSSLMVVGRLEWDEDIIWDVCNERDA 432 G G SI L D +N I T+ L V SL+ G +WD++I+ D+ +ERD Sbjct: 74 GKGTEVSIGLHEWLLDASNGFIETNLPDELRLQPVCSLLKPGTCDWDQEILNDLFSERDK 133 Query: 433 XXXXXXXXXXXXXTDQWYWGKEKTRIYTVKSAYRSIHEDKAYQLQREANSGFWRKMWNLK 612 TDQ +W K+ ++VK YR D + + +N WR++W L Sbjct: 134 NLINSIVLSPTVGTDQLFWFKDPKGKFSVKDVYRVQQPD--FSVLYPSNVVVWRRLWKLN 191 Query: 613 VPPKVKNLIWLTVTEVTGCLPTRTQLRTKHVEITSMCPWCNAADETMYHTLVGCEFTIEI 792 + K K +W +T LP RT L + V S CP C + ET+ H LV C+ T + Sbjct: 192 IAAKCKVFMWRALTNR---LPVRTNLVMRKVTEDSSCPCCVSQPETIMHILVLCDVTTQS 248 Query: 793 WSKIGIHVKEEQGRTIREWLDSNFNAYDKRKTGEIVMICWAIWGARNKLIWEQRNPMGQQ 972 W + ++ Q + E S F +D V++ W++W N ++W + + Sbjct: 249 WKYVNLYQFLSQVSNLLEGARSVFEHFDDSIVASFVVLWWSLWTNMNDVVWNGKKLSWRA 308 Query: 973 IILAAQTFFEQWNTAQDKMNVSM--ESFIPGDGLDKWMKPENDMFKVNTDATLFADTGRY 1146 + A +F QW A+ + F GD L W KP +K+N DA+ A+ G+ Sbjct: 309 VASRASSFLFQWGKARKLTDYHQLGRHFAGGDCL--WQKPATGKYKLNVDASSSAERGKS 366 Query: 1147 NFVFVVRDAEGEMVDAGATCRAGVVQPEVAEALRIKEALSWAKTQPRAKMRIEIENLLVV 1326 FV+RD G + R + P+VAEA +KEALSW + +++IE + L + Sbjct: 367 GASFVLRDNAGIWITGVLIIRPYIANPDVAEAWALKEALSWIHAKGMEEVQIETDCLRNI 426 Query: 1327 QGVRSNTKMTSYFGGIIEECRYLLKELSLVSLFFVKRSANRVAHALTRAS 1476 + + SY ++++C+ LL+ L+ +L FV SAN VAH +++A+ Sbjct: 427 ELLEEELHPNSYLLCLLKDCQDLLRVLNRCNLVFVYGSANTVAHMISKAT 476 >gb|AAG13524.1|AC068924_29 putative non-LTR retroelement reverse transcriptase [Oryza sativa Japonica Group] Length = 1382 Score = 228 bits (580), Expect = 4e-57 Identities = 149/502 (29%), Positives = 234/502 (46%), Gaps = 14/502 (2%) Frame = +1 Query: 7 LSIDKHAGGMGFRSLRDFNLALLGKQAWRLVTKPDSLASRVFKARYYPTTSFLEANLGSN 186 LS K GGMGFR FN A+LG+Q WRL+T PDSL SRV K RY+P +SF EA + Sbjct: 859 LSTPKFLGGMGFREFTTFNQAMLGRQCWRLLTDPDSLCSRVLKGRYFPNSSFWEAAQPKS 918 Query: 187 PSFIWRSICASQFMLLNGVRIRIGSGQSTSILGSP*LPDEANPRITTDNQTLENAKVSSL 366 PSF WRS+ + +L GVR +G G++ I +P +TT + +A VS L Sbjct: 919 PSFTWRSLLFGRELLAKGVRWGVGDGKTIKIFSDNWIPGFRPQLVTTLSPFPTDATVSCL 978 Query: 367 MVVGRLEWDEDIIWDVCNERDAXXXXXXXXXXXXXTDQWYWGKEKTRIYTVKSAYRSIHE 546 M WD D+I + A D W +K +Y+V+SAY Sbjct: 979 MNEDARCWDGDLIRSLFPVDIAKEILQIPISRHGDADFASWPHDKLGLYSVRSAYNLARS 1038 Query: 547 DKAYQLQREANSGF----------WRKMWNLKVPPKVKNLIWLTVTEVTGCLPTRTQLRT 696 + + Q + G W+ +W + P K+K +W E CL T QLR Sbjct: 1039 EAFFADQSNSGRGMASRLLESQKDWKGLWKINAPGKMKITLWRAAHE---CLATGFQLRR 1095 Query: 697 KHVEITSMCPWCNAADETMYHTLVGCEFTIEIWSKIGIHVKEEQGR----TIREWLDSNF 864 +H+ T C +CN D+T+ H + C F +IW +I + GR T+R+W+ Sbjct: 1096 RHIPSTDGCVFCN-RDDTVEHVFLFCPFAAQIWEEIKGKCAVKLGRNGFSTMRQWIFDFL 1154 Query: 865 NAYDKRKTGEIVMICWAIWGARNKLIWEQRNPMGQQIILAAQTFFEQWNTAQDKMNVSME 1044 + + W IW ARN Q++++ ++ + K V + Sbjct: 1155 KRGSSHANTLLAVTFWHIWEARNNTKNNNGTVHPQRVVIKILSYVDMILKHNTK-TVDGQ 1213 Query: 1045 SFIPGDGLDKWMKPENDMFKVNTDATLFADTGRYNFVFVVRDAEGEMVDAGATCRAGVVQ 1224 + +W P ++ +N+DA +F+ + ++RD G+ + A + + VV Sbjct: 1214 RGGNTQAIPRWQPPPASVWMINSDAAIFSSSRTMGVGALIRDNTGKCLVACSEMISDVVL 1273 Query: 1225 PEVAEALRIKEALSWAKTQPRAKMRIEIENLLVVQGVRSNTKMTSYFGGIIEECRYLLKE 1404 PE+AEAL I+ AL AK + + + + L V++ ++++ + S G +IE+ + L Sbjct: 1274 PELAEALAIRRALGLAKEEGLEHIVMASDCLTVIRRIQTSGRDRSGVGCVIEDIKKLAST 1333 Query: 1405 LSLVSLFFVKRSANRVAHALTR 1470 L S V R +N AH+L R Sbjct: 1334 FVLCSFMHVNRLSNLAAHSLAR 1355 >gb|AAP54617.2| retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group] gi|125575397|gb|EAZ16681.1| hypothetical protein OsJ_32156 [Oryza sativa Japonica Group] Length = 1339 Score = 228 bits (580), Expect = 4e-57 Identities = 149/502 (29%), Positives = 234/502 (46%), Gaps = 14/502 (2%) Frame = +1 Query: 7 LSIDKHAGGMGFRSLRDFNLALLGKQAWRLVTKPDSLASRVFKARYYPTTSFLEANLGSN 186 LS K GGMGFR FN A+LG+Q WRL+T PDSL SRV K RY+P +SF EA + Sbjct: 816 LSTPKFLGGMGFREFTTFNQAMLGRQCWRLLTDPDSLCSRVLKGRYFPNSSFWEAAQPKS 875 Query: 187 PSFIWRSICASQFMLLNGVRIRIGSGQSTSILGSP*LPDEANPRITTDNQTLENAKVSSL 366 PSF WRS+ + +L GVR +G G++ I +P +TT + +A VS L Sbjct: 876 PSFTWRSLLFGRELLAKGVRWGVGDGKTIKIFSDNWIPGFRPQLVTTLSPFPTDATVSCL 935 Query: 367 MVVGRLEWDEDIIWDVCNERDAXXXXXXXXXXXXXTDQWYWGKEKTRIYTVKSAYRSIHE 546 M WD D+I + A D W +K +Y+V+SAY Sbjct: 936 MNEDARCWDGDLIRSLFPVDIAKEILQIPISRHGDADFASWPHDKLGLYSVRSAYNLARS 995 Query: 547 DKAYQLQREANSGF----------WRKMWNLKVPPKVKNLIWLTVTEVTGCLPTRTQLRT 696 + + Q + G W+ +W + P K+K +W E CL T QLR Sbjct: 996 EAFFADQSNSGRGMASRLLESQKDWKGLWKINAPGKMKITLWRAAHE---CLATGFQLRR 1052 Query: 697 KHVEITSMCPWCNAADETMYHTLVGCEFTIEIWSKIGIHVKEEQGR----TIREWLDSNF 864 +H+ T C +CN D+T+ H + C F +IW +I + GR T+R+W+ Sbjct: 1053 RHIPSTDGCVFCN-RDDTVEHVFLFCPFAAQIWEEIKGKCAVKLGRNGFSTMRQWIFDFL 1111 Query: 865 NAYDKRKTGEIVMICWAIWGARNKLIWEQRNPMGQQIILAAQTFFEQWNTAQDKMNVSME 1044 + + W IW ARN Q++++ ++ + K V + Sbjct: 1112 KRGSSHANTLLAVTFWHIWEARNNTKNNNGTVHPQRVVIKILSYVDMILKHNTK-TVDGQ 1170 Query: 1045 SFIPGDGLDKWMKPENDMFKVNTDATLFADTGRYNFVFVVRDAEGEMVDAGATCRAGVVQ 1224 + +W P ++ +N+DA +F+ + ++RD G+ + A + + VV Sbjct: 1171 RGGNTQAIPRWQPPPASVWMINSDAAIFSSSRTMGVGALIRDNTGKCLVACSEMISDVVL 1230 Query: 1225 PEVAEALRIKEALSWAKTQPRAKMRIEIENLLVVQGVRSNTKMTSYFGGIIEECRYLLKE 1404 PE+AEAL I+ AL AK + + + + L V++ ++++ + S G +IE+ + L Sbjct: 1231 PELAEALAIRRALGLAKEEGLEHIVMASDCLTVIRRIQTSGRDRSGVGCVIEDIKKLAST 1290 Query: 1405 LSLVSLFFVKRSANRVAHALTR 1470 L S V R +N AH+L R Sbjct: 1291 FVLCSFMHVNRLSNLAAHSLAR 1312 >emb|CCA66036.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1369 Score = 218 bits (555), Expect = 3e-54 Identities = 149/503 (29%), Positives = 240/503 (47%), Gaps = 13/503 (2%) Frame = +1 Query: 1 ERLSIDKHAGGMGFRSLRDFNLALLGKQAWRLVTKPDSLASRVFKARYYPTTSFLEANLG 180 E+L + K GG+G R+ FN ALL KQAWR++TKPDSL +RV K +Y+P ++FLEA + Sbjct: 851 EKLFLPKKEGGLGIRNFDVFNRALLAKQAWRILTKPDSLMARVIKGKYFPRSNFLEARVS 910 Query: 181 SNPSFIWRSICASQFMLLNGVRIRIGSGQSTSILGSP*LPD-EANPRITTDNQTLENAKV 357 N SF +SI +++ ++ G+ IG G+ T+I G P +P E T+ + ++ Sbjct: 911 PNMSFTCKSILSARAVIQKGMCRVIGDGRDTTIWGDPWVPSLERYSIAATEGVSEDDGPQ 970 Query: 358 SSLMVVGRLEWDEDIIWDVCNERDAXXXXXXXXXXXXXTDQWYWGKEKTRIYTVKSA-YR 534 ++ W+ +++ + ++ DQW W K +TV+SA Y Sbjct: 971 KVCELISNDRWNVELLNTLFQPWESTAIQRIPVALQKKPDQWMWMMSKNGQFTVRSAYYH 1030 Query: 535 SIHEDK--AYQLQREANSGFWRKMWNLKVPPKVKNLIWLTVTEVTGCLPTRTQLRTKHVE 708 + ED+ R N W+K+W K+PPKVK W + L T +R + + Sbjct: 1031 ELLEDRKTGPSTSRGPNLKLWQKIWKAKIPPKVKLFSW---KAIHNGLAVYTNMRKRGMN 1087 Query: 709 ITSMCPWCNAADETMYHTLVGCEFTIEIW--SKIGIHVKEEQGRTIREWLDSNFNAY-DK 879 I CP C +ET H + GC+ + W S + IH + + R W++S + + D Sbjct: 1088 IDGACPRCGEKEETTEHLIWGCDESSRAWYISPLRIHTGNIEAGSFRIWVESLLDTHKDT 1147 Query: 880 RKTGEIVMICWAIWGARNKLIWEQRNPMGQQIILAA----QTFFEQWNTAQDKMNVSMES 1047 MICW IW RNK ++E++ Q+++ A F E+ ++ Sbjct: 1148 EWWALFWMICWNIWLGRNKWVFEKKKLAFQEVVERAVRGVMEFEEECAHTSPVETLNTHE 1207 Query: 1048 FIPGDGLDKWMKPENDMFKVNTDATLFADTGRYNFVFVVRDAEGEMVDAGATCRAG--VV 1221 + W P M K+N DA +F G VVRDAEG+++ ATC G + Sbjct: 1208 -------NGWSVPPVGMVKLNVDAAVFKHVG-IGMGGVVRDAEGDVL--LATCCGGWAME 1257 Query: 1222 QPEVAEALRIKEALSWAKTQPRAKMRIEIENLLVVQGVRSNTKMTSYFGGIIEECRYLLK 1401 P +AEA ++ L A + +E++ + +R + FG ++++ YL Sbjct: 1258 DPAMAEACSLRYGLKVAYEAGFRNLVVEMDCKKLFLQLRGKASDVTPFGRVVDDILYLAS 1317 Query: 1402 ELSLVSLFFVKRSANRVAHALTR 1470 + S V VKR N+VAH L + Sbjct: 1318 KCSNVVFEHVKRHCNKVAHLLAQ 1340