Rietveld Code Review Tool
Help | Bug tracker | Discussion group | Source code | Sign in
(8462)

Issue 6844002: Fix signal propagation logic for Linux/Android exception handler. (Closed)

Can't Edit
Can't Publish+Mail
Start Review
Created:
9 years, 8 months ago by primiano
Modified:
9 years, 8 months ago
Reviewers:
Mark Mentovai
CC:
google-breakpad-dev_googlegroups.com, cjhopman, torne
Base URL:
http://google-breakpad.googlecode.com/svn/trunk
Visibility:
Public.

Description

Fix signal propagation logic for Linux/Android exception handler.

The current code is relying on info->si_pid to figure out whether
the exception handler was triggered by a signal coming from the kernel
(that will re-trigger until the cause that triggered the signal has
been cleared) or from user-space e.g., kill -SIGNAL pid, which will NOT
automatically re-trigger in the next signal handler in the chain.
While the intentions are good (manually re-triggering user-space
signals), the current implementation mistakenly looks at the si_pid
field in siginfo_t, assuming that it is coming from the kernel if
si_pid == 0.
This is wrong. siginfo_t, in fact, is a union and si_pid is meaningful
only for userspace signals. For signals originated by the kernel,
instead, si_pid overlaps with si_addr (the faulting address).
As a matter of facts, the current implementation is mistakenly
re-triggering the signal using tgkill for most of the kernel-space
signals (unless the fault address is exactly 0x0).
This is not completelly correct for the case of SIGSEGV/SIGBUS. The
next handler in the chain will stil see the signal, but the |siginfo|
and the |context| arguments of the handler will be meaningless
(retriggering a signal with tgkill doesn't preserve them).
Therefore, if the next handler in the chain expects those arguments
to be set, it will fail.
Concretelly, this is causing problems to WebView. In some rare
circumstances, the next handler in the chain is a user-space runtime
which does SIGSEGV handling to implement speculative null pointer
managed exceptions (see as an example
http://www.mono-project.com/docs/advanced/runtime/docs/exception-handling/)

The fix herein proposed consists in using the si_code (see SI_FROMUSER
macros) to determine whether a signal is coming form the kernel
(and therefore just re-establish the next signal handler) or from
userspace (and use the tgkill logic).

Repro case:
This issue is visible in Chrome for Android with this simple repro case:
- Add a non-null pointer dereference in the codebase:
  *((volatile int*)0xbeef) = 42
Without this change: the next handler (the libc trap) prints:
  F/libc  (  595): Fatal signal 11 (SIGSEGV), code 1, fault addr 0x487
  where 0x487 is actually the PID of the process (which is wrong).
With this change: the next handler prints:
  F/libc  (  595): Fatal signal 11 (SIGSEGV), code 1, fault addr 0xbeef
  which is the correct answer.

BUG=chromium:481937

Patch Set 1 #

Patch Set 2 : #

Total comments: 1
Unified diffs Side-by-side diffs Delta from patch set Stats Patch
M src/client/linux/handler/exception_handler.cc View 1 1 chunk +2 lines, -1 line 1 comment Download

Messages

Total messages: 5
primiano
9 years, 8 months ago #1
Mark Mentovai
LGTM. Great description! https://breakpad.appspot.com/6844002/diff/30001/src/client/linux/handler/exception_handler.cc File src/client/linux/handler/exception_handler.cc (right): https://breakpad.appspot.com/6844002/diff/30001/src/client/linux/handler/exception_handler.cc#newcode369 src/client/linux/handler/exception_handler.cc:369: if (info->si_code <= 0 || sig ...
9 years, 8 months ago #2
primiano
On 2015/04/29 20:12:26, Mark Mentovai wrote: > LGTM. Great description! > > https://breakpad.appspot.com/6844002/diff/30001/src/client/linux/handler/exception_handler.cc > File ...
9 years, 8 months ago #3
Mark Mentovai
Linux’ (usually Android’s, actually) headers are screwed-up. If you couldn’t get the macro to work, ...
9 years, 8 months ago #4
primiano
9 years, 8 months ago #5
Message was sent while issue was closed.
On 2015/04/29 20:18:45, Mark Mentovai wrote:
> Linux’ (usually Android’s, actually) headers are screwed-up. If you couldn’t
get
> the macro to work, LGTM as-is.

Thanks.
Landed in r1454
Sign in to reply to this message.

Powered by Google App Engine
RSS Feeds Recent Issues | This issue
This is Rietveld 1004:630ec63f810e-tainted