<html>

    <head>

      <base href="https://bugs.webkit.org/" />

    </head>

    <body>

      <p>

        <div>

            <b><a class="bz_bug_link 

          bz_status_NEW "

   title="NEW - [GTK][Threaded Compositor] Several flaky tests"

   href="https://bugs.webkit.org/show_bug.cgi?id=161242#c20">Comment # 20</a>

              on <a class="bz_bug_link 

          bz_status_NEW "

   title="NEW - [GTK][Threaded Compositor] Several flaky tests"

   href="https://bugs.webkit.org/show_bug.cgi?id=161242">bug 161242</a>

              from <span class="vcard"><a class="email" href="mailto:cgarcia&#64;igalia.com" title="Carlos Garcia Campos &lt;cgarcia&#64;igalia.com&gt;"> <span class="fn">Carlos Garcia Campos</span></a>

</span></b>

        <pre>(In reply to <a href="show_bug.cgi?id=161242#c19">comment #19</a>)

<span class="quote">&gt; (In reply to <a href="show_bug.cgi?id=161242#c18">comment #18</a>)

&gt; &gt; (In reply to <a href="show_bug.cgi?id=161242#c17">comment #17</a>)

&gt; &gt; &gt; (In reply to <a href="show_bug.cgi?id=161242#c16">comment #16</a>)

&gt; &gt; &gt; &gt; You are assuming again that there's a single issue.

&gt; &gt; &gt; 

&gt; &gt; &gt; No, clearly there were many different issues, as you've fixed three

&gt; &gt; &gt; different issues so far....

&gt; &gt; &gt; 

&gt; &gt; &gt; &gt; I think the situation now

&gt; &gt; &gt; &gt; is mostly the same to what we had before switching to the threaded

&gt; &gt; &gt; &gt; compositor.

&gt; &gt; &gt; 

&gt; &gt; &gt; Yes, except for the fact that sometimes a bunch of tests pass when they

&gt; &gt; &gt; usually fail. In build #18023 we have 84 unexpected passes; that never

&gt; &gt; &gt; happened before. What's interesting is these tests either always pass or

&gt; &gt; &gt; always fail in a particular run of run-webkit-tests; they are clearly flaky,

&gt; &gt; &gt; but they don't contribute to the flakiness count.

&gt; &gt; 

&gt; &gt; We should probably mark them as pass, to see what's wrong when they fail

&gt; 

&gt; That don't makes much sense to me.

&gt; 

&gt; You can see the failure diff even if they are expected failures. You just

&gt; have to navigate under

&gt; <a href="https://build.webkit.org/results/GTK%20Linux%2064">https://build.webkit.org/results/GTK%20Linux%2064</a>-

&gt; bit%20Release%20%28Tests%29/ to find them.

&gt; 

&gt; For example: 

&gt; 

&gt; 1) Check the unexpected passes at end of build #18023

&gt; <a href="https://build.webkit.org/builders/GTK%20Linux%2064">https://build.webkit.org/builders/GTK%20Linux%2064</a>-

&gt; bit%20Release%20%28Tests%29/builds/18023/steps/layout-test/logs/stdio and

&gt; pick 1

&gt; 

&gt; 2) I pick: imported/mozilla/svg/blend-difference-stacking.html

&gt; 

&gt; 3) Check that on the next build that test failed. Log of the next build:

&gt; <a href="https://build.webkit.org/builders/GTK%20Linux%2064">https://build.webkit.org/builders/GTK%20Linux%2064</a>-

&gt; bit%20Release%20%28Tests%29/builds/18024/steps/layout-test/logs/stdio &lt;---

&gt; yes it failed

&gt; 

&gt; 4) Go to

&gt; <a href="https://build.webkit.org/results/GTK%20Linux%2064">https://build.webkit.org/results/GTK%20Linux%2064</a>-

&gt; bit%20Release%20%28Tests%29/ and find the results for build #18204 -&gt;

&gt; <a href="https://build.webkit.org/results/GTK%20Linux%2064">https://build.webkit.org/results/GTK%20Linux%2064</a>-

&gt; bit%20Release%20%28Tests%29/r205335%20%2818024%29/

&gt; 

&gt; 5) Now under that folder navigate to imported/mozilla/svg/ and look for the

&gt; blend-difference-stacking diff

&gt; 

&gt; 6) Here you have it:

&gt; <a href="https://build.webkit.org/results/GTK%20Linux%2064">https://build.webkit.org/results/GTK%20Linux%2064</a>-

&gt; bit%20Release%20%28Tests%29/r205335%20%2818024%29/imported/mozilla/svg/blend-

&gt; difference-stacking-diffs.html</span >

hmm, well, not very convenient, but still. I have theory, though. When they unexpectedly pass, they don't actually pass, we just fail to render both the actual and expected files in the same way (for example a fully white image in both cases). That's a problem of the reftests. So, more interesting to see what's failing, which can be also reproduced locally more easily, would be to see what we render when they pass.</pre>

        </div>

      </p>

      <hr>

      <span>You are receiving this mail because:</span>

      <ul>

          <li>You are the assignee for the bug.</li>

      </ul>

    </body>

</html>