[Webkit-unassigned] [Bug 95906] New: WKTR often reports an unresponsive WebProcess on Mac bots
bugzilla-daemon at webkit.org
bugzilla-daemon at webkit.org
Wed Sep 5 15:45:34 PDT 2012
https://bugs.webkit.org/show_bug.cgi?id=95906
Summary: WKTR often reports an unresponsive WebProcess on Mac
bots
Product: WebKit
Version: 528+ (Nightly build)
Platform: Unspecified
OS/Version: Unspecified
Status: NEW
Severity: Normal
Priority: P2
Component: Tools / Tests
AssignedTo: webkit-unassigned at lists.webkit.org
ReportedBy: timothy_horton at apple.com
CC: mitz at webkit.org, slewis at apple.com,
simon.fraser at apple.com, jberlin at webkit.org,
dpranke at chromium.org
Some output from after my logging improvement yesterday (http://trac.webkit.org/changeset/127547):
http://build.webkit.org/results/Apple%20MountainLion%20Debug%20WK2%20(Tests)/r127647%20(647)/fast/js/dfg-int32array-crash-log.txt
> No crash log found for WebProcess:16532.
> Process failed to become responsive before timing out.
>
> stdout:
> Timed out waiting for final message from web process
>
> stderr:
> #PROCESS UNRESPONSIVE - WebProcess (pid 16532)
> #EOF
http://build.webkit.org/results/Apple%20Lion%20Release%20WK2%20(Tests)/r127645%20(2940)/svg/animations/mozilla/animateMotion-by-1-crash-log.txt
> No crash log found for WebProcess:33091.
> Process failed to become responsive before timing out.
>
> stdout:
> Failed to reset to consistent state before the first test
> stderr:
> #PROCESS UNRESPONSIVE - WebProcess (pid 33091)
> #EOF
There are dozens upon dozens of examples of this happening, on Lion and Mountain Lion, across many machines. It seems to come in bunches - 20 tests from different shards will fail nearly simultaneously, and the harness will then give up.
I've never reproduced this locally, but it is one of the most significant issues blocking us from greenness on the Mac WK2 test bots, and has been for some time now.
Questions I have:
Is this the same (effectively) as a timeout for a test that doesn't call waitUntilDone (i.e. is this what you get if you have a non-waitUntilDone test, but then it hangs for some reason (infinite loop in JS, WebCore bug, whatever)?)?
If so, why don't we treat it as a timeout instead of a crash? (not that that would really help)
Could this be caused simply by slow/overloaded machines? (this is something I want to test locally)
Why does it come in batches, across many processes? (see http://build.webkit.org/results/Apple%20Lion%20Release%20WK2%20(Tests)/r127645%20(2940)/results.html)
--
Configure bugmail: https://bugs.webkit.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
More information about the webkit-unassigned
mailing list