This is the mail archive of the systemtap@sourceware.org mailing list for the systemtap project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: Why systemtap loses events and what can be done about it?

From: Mike Mason <mmlnx at us dot ibm dot com>
To: "Frank Ch. Eigler" <fche at redhat dot com>
Cc: Brad Peters <bpeters at linux dot vnet dot ibm dot com>, ååå <zzh at ncic dot ac dot cn>, systemtap at sources dot redhat dot com
Date: Tue, 04 Sep 2007 12:29:34 -0700
Subject: Re: Why systemtap loses events and what can be done about it?
References: <20070830132846.GA20477@redhat.com> <CC992CB620BE49C98F14E8285A4E2D98@ncicdcos> <20070831035838.GA891@redhat.com> <46D85BE5.6050503@linux.vnet.ibm.com> <y0mbqcmr2a3.fsf@ton.toronto.redhat.com> <46DD8C4D.3030203@us.ibm.com> <20070904173650.GE24070@redhat.com>

Frank Ch. Eigler wrote:

Hi -

On Tue, Sep 04, 2007 at 09:48:13AM -0700, Mike Mason wrote:

I've been toying with the idea of optimizing probe handlers by
implementing them like interrupt handlers, with top and bottom
halves.  [...]


This is unlikely to be faster overall, but is an interesting idea.
Putting blocking-capable constructs into a deferred-work handler could
be an explicit option, though wrought with risks.  (Lack of
synchronization is one: a deferred get_user() value may not resemble
one possibly fetched at the instant of a raw probe point.)

I was thinking specifically of using d_cookies and work queues to safely defer getting full pathnames, similar to what oprofile does. I don't know if a similar approach would work for other data.

You're right, any data that can change between the top and bottom halves would have to be collected in the top half. And, yes, it wouldn't be faster overall, but it would allow us to reenable interrupts faster in some cases. I originally thought that might reduce the number of skipped probes, but now realize I was probably mistaken.

Refresh my memory... are skipped probes only an indication of the number of probes that timed out on locks or are probes skipped for other reasons? Also, why do we only use per cpu variables for aggregations? Is it because of memory concerns or something more than that?

Mike

- FChE

Follow-Ups:
- Re: Why systemtap loses events and what can be done about it?
  - From: Frank Ch. Eigler

References:
- Re: Why systemtap loses events and what can be done about it?
  - From: Frank Ch. Eigler
- Re: Why systemtap loses events and what can be done about it?
  - From: Frank Ch. Eigler
- Re: Why systemtap loses events and what can be done about it?
  - From: Brad Peters
- Re: Why systemtap loses events and what can be done about it?
  - From: Frank Ch. Eigler
- Re: Why systemtap loses events and what can be done about it?
  - From: Mike Mason
- Re: Why systemtap loses events and what can be done about it?
  - From: Frank Ch. Eigler

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]