
Porting TestExpectations to WPT Metadata #481

Open
KyleJu opened this issue Jun 4, 2020 · 8 comments

Comments

@KyleJu
Contributor

KyleJu commented Jun 4, 2020

Per Dave Tapuska's suggestions, WPT Metadata should be a subset of TestExpectations, but with a focus on Chrome-specific test failures. The triage information in TestExpectations should be ported continuously to WPT Metadata.

For now, we can write an ad-hoc script to port the existing triage information to WPT Metadata and think of an automated way going forward (e.g. through a bot).
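
For illustration only, here is a minimal sketch (not the actual script) of what such a one-off porter might emit for a single entry, assuming the product/url/results layout used by META.yml files in this repo; the test path, bug URL, and function name below are made up:

```python
import yaml  # PyYAML, used here just to render the structure

def expectation_to_metadata(test_path, bug_url, status):
    """Map one TestExpectations entry to a META.yml-style 'links' block.

    The field names mirror the product/url/results layout of META.yml files;
    exactly which directory's META.yml the entry lands in is glossed over here.
    """
    # Drop the Chromium-side prefix so the path matches the upstream WPT tree.
    wpt_path = test_path.replace("external/wpt/", "", 1)
    return {
        "links": [{
            "product": "chrome",
            "url": bug_url,
            "results": [{"test": wpt_path, "status": status}],
        }]
    }

# Example with a hypothetical test and bug:
print(yaml.safe_dump(
    expectation_to_metadata(
        "external/wpt/css/css-grid/alignment/grid-alignment-001.html",
        "https://crbug.com/000000",
        "FAIL",
    ),
    sort_keys=False,
))
```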

@KyleJu
Contributor Author

KyleJu commented Jul 7, 2020

The portable expectations in TestExpectations should meet the following criteria (see the sketch after this list):

  • There exists a bug number;

  • Tags should be [], [ Linux ], [ Release ], or any combination of these;

  • Test name should start with external/wpt/ (excluding expectations for runtime flags);

  • The result is not [ Skip ] (maybe ideally [ Failure ] only? Excludes flaky tests);

  • The same expectation exists and still fails on wpt.fyi (to exclude orphaned expectations and mismatches caused by outdated WPT or Chrome versions).
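
A minimal sketch of how these criteria could be expressed as a filter, assuming an already-parsed expectation dict with bug/tags/test/results fields (these names are illustrative, not the real TestExpectations parser's API):

```python
ALLOWED_TAGS = {"Linux", "Release"}   # [], [ Linux ], [ Release ], or combinations thereof
ALLOWED_RESULTS = {"Failure"}         # excludes [ Skip ] and flaky (multi-result) entries

def is_portable(expectation):
    """Return True if a parsed TestExpectations entry looks portable to WPT Metadata."""
    return (
        bool(expectation.get("bug"))                              # a bug number exists
        and set(expectation.get("tags", ())) <= ALLOWED_TAGS      # only allowed tags
        and expectation["test"].startswith("external/wpt/")       # a real external WPT test
        and set(expectation["results"]) <= ALLOWED_RESULTS        # not Skip, not flaky
        # The last criterion (the test actually fails on wpt.fyi) needs a
        # separate lookup against wpt.fyi run results and is omitted here.
    )
```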

@KyleJu
Contributor Author

KyleJu commented Jul 7, 2020

According to failing-tests, TestExpectations records tests that cannot be rebaselined. I suspect that Chrome-specific failures can go unnoticed during the import process.

@stephenmcgruer
Contributor

The result is not [ Skip ] (maybe ideally [ Failure ] only? Excludes flaky tests);

[ Timeout ] also seems like it would be useful?

Flakes may also be useful, but are obviously harder to track.

@KyleJu
Contributor Author

KyleJu commented Jul 13, 2020

TestExpectations has been ported to WPT Metadata in #278. The selection criteria are those mentioned in the comment above, but only [ Failure ] and [ Timeout ] results are included. Flaky tests are not ported at this point.

Per our discussions offline, the TestExpectations file only records reference test failures, flaky tests and non-deterministic tests. New (or Chrome-specific) failures could most likely go unnoticed during the WPT import process as they are rebaselined automatically. As a result, WPT Metadata isn't a subset of TestExpectations, and TestExpectations is not the single source of truth for Chrome test failures.

Going forward, we should figure out a way to port candidates from TestExpectations to WPT Metadata continuously, e.g. via a bot.
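
To make the idea concrete, one possible shape for such a bot's cycle; every helper name here is a placeholder rather than existing code:

```python
def run_port_cycle(fetch_test_expectations, is_portable,
                   load_existing_metadata, open_metadata_pr):
    """One hypothetical bot iteration: find newly portable entries and propose them.

    All four callables are placeholders; a real bot would read TestExpectations
    from the Chromium tree, compare against this repo's META.yml files, and open
    a pull request for human review rather than committing directly.
    """
    expectations = fetch_test_expectations()
    already_ported = load_existing_metadata()    # e.g. a set of (test, bug) pairs
    candidates = [
        e for e in expectations
        if is_portable(e) and (e["test"], e["bug"]) not in already_ported
    ]
    if candidates:
        open_metadata_pr(candidates)             # propose META.yml additions for review
```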

@KyleJu KyleJu self-assigned this Jul 13, 2020
@KyleJu
Contributor Author

KyleJu commented Aug 17, 2020

A list of portable NeverFixTests candidates has been identified using similar criteria. However, none of them are Chrome-specific failures. I will circle back to this issue when we expand our data ingestion to include all Chrome failures.

@KyleJu
Contributor Author

KyleJu commented Aug 17, 2020

A list of portable NeverFixTests candidates has been identified using similar criteria. However, none of them are Chrome-specific failures. I will circle back to this issue when we expand our data ingestion to include all Chrome failures.

@stephenmcgruer FYI since this issue was raised by the Layout team. Happy to prioritize it if necessary.

@stephenmcgruer
Contributor

(Why is this issue in wpt.fyi not wpt-metadata?)

I've been playing again with importing TestExpectations via #473, after questions from folks who reasonably don't want to retriage tests they have already marked in TestExpectations. From that PR, I have uploaded data for all of css/css-* as PRs:

I then went through the newly linked bugs for problematic cases. Here are the general problems I found:

  1. Tests linked to generic 'import this WPT directory' bugs, which run counter to what wpt.fyi triage is trying to achieve (useful triage data).
  2. Tests linked to bugs that are run_web_tests.py-specific (e.g. lack of fuzzy-reftest support).
  3. Tests linked to bugs that are marked Fixed (or duplicates of Fixed bugs), despite the fact that test failures are still linked to them.
    • In a few cases this turns out to be deliberate, though arguably the test should be in NeverFixTests then.

Also, as a final note, if we do make a regular thing of importing from TestExpectations, we will likely also need to answer:

  1. How to deal with 'retriages', aka when a test failure changes crbug in TestExpectations (or is removed entirely!). We don't look at the Chromium commit diffs here, just the file, so we don't know when this happened.
    • Perhaps we could mark entries as 'came from TestExpectations' and then remove them if they no longer exist there? Hacky! (A sketch of this idea follows below.)
  2. How to deal with tests that pass on wpt.fyi but are in TestExpectations. So far we've just been ignoring that; maybe it's fine to do so.

Thoughts on how we might resolve these welcome; the answer may be that we need to clean up TestExpectations first.
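
On the first question, a rough sketch of the 'came from TestExpectations' idea, assuming we record a marker on each imported entry (the 'imported-from' key is invented for this sketch, not an existing metadata field):

```python
def prune_stale_imports(metadata_entries, current_expectations):
    """Drop imported entries whose source line no longer exists in TestExpectations.

    Each imported entry is assumed to carry an 'imported-from' marker recording
    the TestExpectations test path it came from; entries triaged by hand (no
    marker) are always kept.
    """
    live_tests = {e["test"] for e in current_expectations}
    kept = []
    for entry in metadata_entries:
        source = entry.get("imported-from")
        if source is not None and source not in live_tests:
            continue  # the upstream triage was removed or retriaged; drop ours too
        kept.append(entry)
    return kept
```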

@Hexcles Hexcles transferred this issue from web-platform-tests/wpt.fyi Oct 15, 2020
@Hexcles
Member

Hexcles commented Oct 15, 2020

(Why is this issue in wpt.fyi not wpt-metadata?)

Good point. I've moved the issue to wpt-metadata. (Old issue links should continue to work.)
