Re: [ww-users] cleaning out old directories
Status: Beta
Brought to you by:
worden
|
From: Peter L. R. <pl...@st...> - 2011-03-29 17:42:34
|
I always feel nervous when there are large, costly files that WW might
still decide to go ahead and re-make without my intending to. Perhaps
what we need is some easy mechanism to change the status of such files
-- change them to 'source files' for instance. This would be similar to
a "DNE" flag, but would also flag "do not re-make"; so re-making would
require first manual deletion. Then any files not flagged as such could
be deleted without problem.
--peter
On Mon, Mar 28, 2011 at 08:24:01PM -0700, Lee Worden wrote:
> I have some open questions about WorkingWiki's features, where I would
> appreciate your perspective as people who use WW, or even run your own
> WW sites. Please feel free to ignore this if you're busy or not interested.
>
> For quite a while, we've had a need to clean out old data that piles up
> and isn't in use - not in the wiki pages themselves, but in the working
> directories that we use behind the scenes to compute the output of
> latex, R, etc.
>
> Most obvious is the preview sessions - any time you preview a page that
> includes WW data while editing, it makes a copy of the data in the back
> end to keep it separate from the unedited page's data. When you save,
> it gets rid of the copied files by merging with the saved ones, but if
> you abandon the changes without saving (a completely reasonable thing to
> do) the copy is left sitting there, and needs to be cleaned out sometime
> later. These can be quite large - we've seen project directories that
> take up 4GB or even more. That cleanout has been on my to do list, and
> now it's getting done.
>
> More controversially, there are also old projects that eventually need
> to disappear. For instance, if someone creates a project and then
> change its name, the old working directory just sits there abandoned.
> Or if a page once had some WW files on it and now it doesn't, the
> project directory is abandoned. But also if I use the inline latex
> features to add something like $$\alpha + \beta$$ to a page, then change
> it to something else, a project is created to process that latex code,
> and it needs to not be kept forever. So at some point project
> directories need to be cleared away or the disk will eventually fill up
> with files that no one wants. I'll probably do this by erasing things
> that haven't been touched in over 3 months or something.
>
> Generally, this should be harmless even if I erase files that someone is
> using, because they can be remade from the source files - it will just
> mean waiting a minute or two (maybe more...) for them to be made.
> Unfortunately, in the worst case it could mean erasing a directory full
> of output files that can't be easily recreated. I could implement a "Do
> Not Erase" feature to mark particular projects that are sensitive and
> should never be erased.
>
> My first question: Is there a better way to protect project files that
> should be permanent? Does anyone have strong feelings about all this?
>
> Finally, I think background jobs should generally be left alone for as
> long as it takes for people to decide whether to erase them. But there
> is a slight danger: suppose I create a project and run a background job,
> then erase or rename the project. The background job becomes orphaned,
> and it won't show up in any listings. So I should probably do something
> to erase things like that.
>
> One way to address both of these things is to check whether each project
> is actually connected to current pages in one of the wikis - that would
> clear up whether it's orphaned or not. But the directory cleaning is
> done in a separate back-end component ("ProjectEngine") when requested
> by the front end ("WorkingWiki"), and I'm trying to avoid two-way
> communication where ProjectEngine has to ask questions of WorkingWiki
> while it's completing a request, so I'm looking for an alternative...
>
> Lee
>
> ------------------------------------------------------------------------------
> Enable your software for Intel(R) Active Management Technology to meet the
> growing manageability and security demands of your customers. Businesses
> are taking advantage of Intel(R) vPro (TM) technology - will your software
> be a part of the solution? Download the Intel(R) Manageability Checker
> today! http://p.sf.net/sfu/intel-dev2devmar
> _______________________________________________
> workingwiki-users mailing list
> wor...@li...
> https://lists.sourceforge.net/lists/listinfo/workingwiki-users
|