This is the mail archive of the gdb@sources.redhat.com mailing list for the GDB project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: struct environment

From: Daniel Berlin <dberlin at dberlin dot org>
To: David Carlton <carlton at math dot stanford dot edu>
Cc: Andrew Cagney <ac131313 at ges dot redhat dot com>,gdb <gdb at sources dot redhat dot com>
Date: Tue, 17 Sep 2002 15:49:57 -0400
Subject: Re: struct environment

Having done this for regcache and reggroup it is worth it -> you get

to see what actually works rather than what everyone else thinks
should work.  You can also pick out the cleanups that will make the
final change easier.  For instance - tightening up the way GDB
creates symbol tables (DanielB suggested moving psymtab into stabs)
an performs lookups on them.

Personally, I'd rather tighten up the interface to symbol lookups
first and only then move on to optimizations like changing how
reading in a part of the symbol information is implemented.

These are orthogonal issues. Reading in part of the symbol information is *not* an optimization (as the source stands now). It is a necessity controlled *by the symbol lookup*.
If psymtabs were private to stabs, then this would not be the cast anymore, but as it stands now, the readin of psymtabs and conversion to symtabs is controlled by the symbol lookup functions.
If your lookup doesn't do this as well, *and* you don't change how symbol tables are created (ie you do nothing), stuff will break.

  Instead, initially, I think these tables could be simple linear
  lists (that is what ``struct block'' currently implements so it
  can't be any worse :-) (just the global / final table is special
  :-).

For struct block, I'm tracking existing design extremely closely.

No offense, but you shouldn't do that.
The current design does *not* scale. Or at least, not until hash tables were introduced, which happened in the past 3 months.
I've only tested scaling of hash tables up to 100 meg of debug info. Even then, due to the number of blocks one has to search, things start to get a bit slow. There are plenty of projects these days with 500 or 1 gig of debug info.
You *need* to be thinking about these things whenever you design the data structures and algorithms.
The reason GDB was so slow on these large cases before hash tables is precisely because nobody seemed to even consider what would happen 10 years in the future.
It's perfectly reasonable to assume that in 10 years, we will have 10x as much debug info to deal with.
If we don't, great.
If we do, and we haven't even considered it, we are *really* screwed.

Currently, struct block has these options:

* Hash tables.  (Used almost everywhere.)

This is a new development.

* Lists that aren't sorted by name.  (Used in blocks that are the body
  of functions, and in Java's dynamic class table.)
* Lists that are sorted by name.  (Used by mdebugread.c in places
  where hash tables should be used.)

Most of these can't grow, but jv-lang.c and mdebugread.c both have
code by which lists can grow.

When the struct block conversion is done, the only change will be that
sorted lists will go away (since they're only used by mdebugread.c,
which should be converted over to using hash tables instead), and that
the common code in jv-lang.c and mdebugread.c to allow lists that grow
will all be in one place.  No other changes: extremely similar data
structures, exactly the same algorithms.

Which haven't served particularly well for the past 5 years (before that, maybe).
No offense, and not your fault.
And i'm not saying they *aren't* adequate.
It just needs more thought than "it's currently done this way, thus, we should do it that way in the future".

- Am I correct to think that the objective is to create a directed
  acyclic graph of nametabs and have lookups search through each in
  turn.

Not quite as Daniel has explained; C++'s name lookup rules are pretty
complicated.  Which isn't to say that we have to track those name
lookup rules perfectly: e.g. it's not the end of the world if we don't
notice that a name lookup is ambiguous where a C++ compiler would be
required to notice that.

No, we should actually be able to do exactly this.
Even if we *don't*, we should be able to.

 But it is, I think, a bad idea if we miss a
symbol that we should find, or if we return the wrong symbol instead
of the right one.

Which is what happens if you don't track the name lookup rules perfectly.
THe rules specify what is ill formed. If you find symbols you shouldn't, it's not only confusing to the user, it's the wrong symbol.

Follow-Ups:
- Re: struct environment
  - From: David Carlton

References:
- Re: struct environment
  - From: David Carlton

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]