cscope / Feature Requests / #7 Incremental update

Hans-Bernhard Broeker - 2004-02-03

Logged In: YES
user_id=27517

cscope database updates are already incremental.
cscope.out sections for files that haven't been modified are
just copied over into the new cscope.out. But even copying
takes some time.
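
To illustrate the copy-over behaviour described above, here is a minimal Python sketch. It is not cscope's actual code: the names `rebuild_database` and `index_file`, and the idea of holding sections in a dict keyed by file name, are assumptions made for the example.

```python
def rebuild_database(old_sections, source_files, changed, index_file):
    """Build the sections of a new database from an old one.

    old_sections: dict mapping file name -> section text from the old database.
    source_files: ordered list of all files that belong in the new database.
    changed:      set of file names known to have been modified.
    index_file:   hypothetical callable that parses one file into section text.
    """
    new_sections = {}
    for name in source_files:
        if name in old_sections and name not in changed:
            # Unmodified file: copy its old section over verbatim.
            new_sections[name] = old_sections[name]
        else:
            # New or modified file: parse it again.
            new_sections[name] = index_file(name)
    return new_sections
```

Only the changed files are parsed again, but every section still ends up being written into the new database, which is where the copying time goes.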

Generally, even if it's being run from inside an editor,
that doesn't mean cscope can make assumptions about which
files are modified and which are not. It's not even clear
it can request a list of buffers from the editor.

Generally speaking, this is what the -d switch is for, which
avoids updates altogether until triggered manually. The VIM
interface uses that switch, IIRC.

Kirc Doog - 2004-02-07

Logged In: YES
user_id=877665

Ok, I'd like to respond to that, but the SourceForge web page
has no "Submit Followup". I feel silly about asking, but how
do I make a followup to your followup?

Kirc Doog - 2004-02-07

Logged In: YES
user_id=877665

Ok, I get it. Attaching a comment is the same as submitting
a followup. Well, that's cryptic. Oh well, now I know.

I'll submit the followup, um I mean comment shortly.

Kirc Doog - 2004-02-07

Logged In: YES
user_id=877665

Ok, you said that even copying takes time. Ok, I think that is
the underlying problem here. There is something "wrong" if
you have to copy the entire .out file just to update it for
changes made to a single file. Consider that cscope will be
run on millions of lines of source code; changing one file
should not require copying the entire .out file only to update
it with the sections that were changed because of one file.
With millions of lines of code, that is a huge .out to copy just
to update, which is the main time consumer.

You are correct that cscope cannot make assumptions about
which files are modified and which ones are not. The editor
or external program would have to notify cscope (via
command-line option) each time it saves a file. cscope could
sit around passively and only then update the database for
that one file, and hopefully without copying the .out file.

Hans-Bernhard Broeker - 2004-02-09

assigned_to: nobody --> broeker

Hans-Bernhard Broeker - 2004-02-09

Logged In: YES
user_id=27517

Copying of the database contents is a necessity dictated by
its format --- it's essentially a flat text file with no
internal structure that would allow overwriting only parts
of it. Users basically seem to have adapted their style of
working to this fact quite nicely: they just don't rebuild
the database all the time, but rather use -d mode,
particularly when running cscope as a slave of some editor.
They also generally only trigger rebuilds manually, when
search results become just too imprecise to work with, so
the copy/rebuild time is worth spending.
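
A small Python sketch of why a flat file resists partial overwrites. It assumes a deliberately simplified layout in which per-file sections are written back to back; this is not the real cscope.out format, and `section_offsets` is a name invented for the example.

```python
def section_offsets(sections):
    """Return the byte offset at which each section starts in a flat file
    where sections are simply written one after another."""
    offsets = {}
    position = 0
    for name, text in sections:
        offsets[name] = position
        position += len(text.encode("utf-8"))
    return offsets


# The same three files before and after b.c grows by a few bytes.
before = [("a.c", "aaaa\n"), ("b.c", "bb\n"), ("c.c", "cccc\n")]
after = [("a.c", "aaaa\n"), ("b.c", "bbbbbbbb\n"), ("c.c", "cccc\n")]

print(section_offsets(before))
print(section_offsets(after))
```

In this toy layout the section for c.c starts at a different offset once b.c's section changes length, so everything after the edited section has to be written again. Without padding or an index, that amounts to copying the rest of the file.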

What you're asking for is essentially to kill the core
design element of the whole program: the data file format,
and replace it by a full-blown database with
in-place-replaceable records and an index. I've actually
gone that way for a while, trying to replace the rather
unmaintainable invlib.c module, but there are some rather
serious drawbacks. For one thing, DB file sizes would
increase significantly to accommodate the slack space the DB
engine needs to manoeuvre. And we would make cscope, which
currently relies on no external library except curses,
dependent on some DB subsystem the user may well not have
installed.

As to letting the editor inform us which files were updated
--- sorry, but that's not a workable solution, even if we
managed to pull it off. For one thing, there are just too many
different editors out there, so we certainly can't do this
for each of them, on any realistic time budget. This would
make this a half-baked solution, at best. Second, and
worse, the whole assumption that the editor actually knows
which files have been modified since the last DB rebuild is
flawed. Files may have been edited, or be in the process of
being edited, by other users, using other editors, on other
machines in the network. Files may have been changed by
programs that aren't editors at all. The only authority
that has a chance of telling us which files have been
modified and which haven't is the filesystem. So cscope
*has* to check the timestamps. They're the only somewhat
reliable status indicator available, and that's exactly what
it does already.
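
For illustration, a timestamp check of this kind might look roughly like the Python sketch below. It assumes each source file's modification time is compared against that of the database file; the function name `modified_since` is made up for the example, and cscope's real logic may differ in detail.

```python
import os


def modified_since(database_path, source_files):
    """Return the source files that look newer than the database.

    Assumption for this sketch: a file counts as changed if its mtime is
    later than the mtime of the database file itself.
    """
    built_at = os.path.getmtime(database_path)
    stale = []
    for name in source_files:
        try:
            if os.path.getmtime(name) > built_at:
                stale.append(name)
        except OSError:
            # Missing or unreadable file: treat it as changed.
            stale.append(name)
    return stale
```

This works no matter which editor, user, machine or program touched the file, because it asks only the filesystem.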

Kirc Doog - 2004-02-18

Logged In: YES
user_id=877665

I hear what you are saying. There are some assumptions I
was making that are not the same for other cscope users.
I've concluded that my request for "incremental update" is
not appropriate for cscope.

Hans-Bernhard Broeker - 2004-02-18

status: open --> closed