Ws/create separate service #325

wswebcreation · 2024-05-05T16:47:28Z

Note

This PR replaces this PR and addresses the feedback

This PR will migrate the OCR service, from https://github.com/wswebcreation/wdio-ocr-service, to the Visual service

🚀 New Feature

Sometimes it can be hard to find an element in a mobile native app or desktop site, with an interactable Canvas, with the default WebdriverIO selectors. In that case, it would be nice if you would be able to use something like OCR (Optical Character Recognition) to interact with elements on your device/screen.

The new @wdio/ocr-service service provides you with the option to interact with elements based on visible text. It will provide multiple commands to:

wait
search
and interact

with an element, all based on text.

The following commands will be added

ocrGetText
ocrGetElementPositionByText
ocrWaitForTextDisplayed
ocrClickOnText
ocrSetValue

A CLI command will also be provided to pre-check text received form image. For a demo check this video

visual-service-ocr.mp4

🐛 Bug Fixes

Polish 💅

Unit Tests were inconsistently failing. We're hitting this error Running multiple test files crashes when canvas is installed (Error: Module did not self-register: '.../canvas/build/Release/canvas.node'.) vitest-dev/vitest#740, which resulted in the following error: Error: Module did not self-register: '/Users/Git/wdio/visual-testing/node_modules/.pnpm/canvas@2.11.2/node_modules/canvas/build/Release/canvas.node'. Setting the threads to 1 will prevent this error from happening. It slows down the tests, but it's better than having them fail

- add ocrGetText command

- update deps

- return filepath

- refactor code a bit - fix properly cropping

- completely remove the node-canvas dependency to draw OCR images

- change element to haystack

- restructure tests

changeset-bot · 2024-05-05T16:47:31Z

🦋 Changeset detected

Latest commit: da83d14

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 2 packages

Name	Type
@wdio/ocr-service	Major
@wdio/visual-service	Patch

Not sure what this means? Click here to learn what changesets are.

Click here if you're a maintainer who wants to add another changeset to this PR

christian-bromann

One comment:

can we remove the ocr prefix in the function and file names? I think it is clear to what they belong to

christian-bromann · 2024-05-06T15:15:06Z

packages/ocr-service/package.json

+    "url": "https://github.com/webdriverio/visual-testing.git"
+  },
+  "bin": {
+    "ocr-service": "./dist/cli.js"


usually for CLIs I create a bin/index.js file that imports ../dist/cli.js because afaik these files need to be excutedable (e.g. chmod +x ./bin/index.js) which we can't do for compiled files. Can you verify this works, maybe this is not needed anymore?

Good one, when I use the current config (npm run watch) and call npx ocr-service it works without changing any rights.
Any other suggestions on how to test this?

wswebcreation · 2024-05-08T16:27:40Z

One comment:

can we remove the ocr prefix in the function and file names? I think it is clear to what they belong to

@christian-bromann

I called the files the same name as the functions that reflect the browser commands. Is this a bad practice? I removed the ocr prefix for all other files

christian-bromann · 2024-05-09T15:01:55Z

I called the files the same name as the functions that reflect the browser commands.

Oh, that makes sense 👍

christian-bromann

LGTM 👍

returned text from system tesseract had unescaped txt in the result which resulted in a parsing error, now the text is abstracted differently

wswebcreation added 30 commits April 8, 2024 09:39

feat: initial commit

14b63de

- add ocrGetText command

test: fix UTs

2acda3f

feat: add system installed tesseract support

473c3a1

- update deps

feat: add ocrGetElementPositionByText

e0f6a44

feat: add ocrClickOnText

82df44d

feat: add ocrSetValue

2ade985

feat: add ocrWaitForTextDisplayed

7b7a1e3

chore: initial refactor of code

896eda9

feat: add fuzzyFindOptions to ocrWaitForTextDisplayed

a422e00

chore: optimize service code

bb6d3bd

tesT: update e2e test

aeeb7ca

feat: add contrast as a configurable option

23e6909

chore: fix some small things

4495f6f

test: fix tests due to an issue with Canvas

1abb515

feat: draw a target where the app will be clicked

ea0ba59

- return filepath

feat: add target and search on words

906cb95

- refactor code a bit - fix properly cropping

test: add E2E test

cfffefb

chore: refactor adding highlights

b2d5121

- completely remove the node-canvas dependency to draw OCR images

test: remove only

12057df

feat: add haystack as coordinates

598c69d

- change element to haystack

test: add first UT

e163a9c

test: add more test to single UT

4ad6ba4

test: add new UT ocrGetElementPositionByText

29872d3

test: add test for ocrGetText

2e283c8

test: add ocrSetValue tests

dc9dfd5

test: add ocrWaitForTextDisplayed tests

7a0d024

test: add fuzzy tests

d4206cd

- restructure tests

chore: add all mocks for fuzzy

d503a66

test: add tests for image processing

0c49211

test: add index utils tests

126a995

wswebcreation added 6 commits May 5, 2024 10:47

feat: add a separate service for OCR

d276f38

chore: revert changes to the visual service

bb39e0f

feat: add cli to ocr-service

905260e

chore: remove cli command from visual service

475b7e4

chore: update release

b2eeb2d

chore: updates after feedback

4101dc9

wswebcreation mentioned this pull request May 5, 2024

feat: add OCR to the Visual Service #287

Closed

wswebcreation requested a review from christian-bromann May 5, 2024 17:40

chore: update changelog

d365882

christian-bromann reviewed May 6, 2024

View reviewed changes

test: add service UTs

c6048df

christian-bromann approved these changes May 9, 2024

View reviewed changes

wswebcreation added 11 commits May 20, 2024 07:40

fix: fix parsing issues

03f0276

returned text from system tesseract had unescaped txt in the result which resulted in a parsing error, now the text is abstracted differently

chore: update deps and add SL OCR tests

36c76ca

test: fix tests for running on Sauce

69db90e

test: add chrome with darkmode for better text recognition

8c5c19a

fix: update cli

fa078a1

test: add CH action step

fa5d877

Merge branch 'main' into ws/create-separate-service

ed8821b

chore: fix ut

12fb024

fix: fix tesseract text response for the js version

67ff92c

test: update chrome baseline

32a7f6d

fix: fix issue #333

813283c

wswebcreation mentioned this pull request May 22, 2024

Native app session not started with appium:app behaves as web session and fails #333

Closed

chore: update release notes

da83d14

wswebcreation merged commit a924dfc into main May 23, 2024
18 checks passed

wswebcreation deleted the ws/create-separate-service branch May 23, 2024 04:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ws/create separate service #325

Ws/create separate service #325

wswebcreation commented May 5, 2024 •

edited

changeset-bot bot commented May 5, 2024 •

edited

christian-bromann left a comment

christian-bromann May 6, 2024

wswebcreation May 8, 2024

wswebcreation commented May 8, 2024

christian-bromann commented May 9, 2024

christian-bromann left a comment

Ws/create separate service #325

Ws/create separate service #325

Conversation

wswebcreation commented May 5, 2024 • edited

🚀 New Feature

🐛 Bug Fixes

Polish 💅

changeset-bot bot commented May 5, 2024 • edited

🦋 Changeset detected

christian-bromann left a comment

Choose a reason for hiding this comment

christian-bromann May 6, 2024

Choose a reason for hiding this comment

wswebcreation May 8, 2024

Choose a reason for hiding this comment

wswebcreation commented May 8, 2024

christian-bromann commented May 9, 2024

christian-bromann left a comment

Choose a reason for hiding this comment

wswebcreation commented May 5, 2024 •

edited

changeset-bot bot commented May 5, 2024 •

edited