
Integrating OpenVINO Runtime in Spark NLP #14200

Conversation


@rajatkrishna rajatkrishna commented Mar 9, 2024

This PR introduces OpenVINO Runtime support in Spark NLP

Description

This PR enables Spark NLP to use the OpenVINO Runtime API for Java to load and run models in several formats, including ONNX, PaddlePaddle, TensorFlow, TensorFlow Lite, and the OpenVINO IR format. OpenVINO also delivers performance improvements on supported Intel hardware, with up to 40% improvement over TensorFlow in benchmarks with no further tuning. You can additionally take advantage of the full optimization and quantization capabilities of the OpenVINO toolkit by exporting/converting the model to the OpenVINO format with the Model Conversion API.

The following annotators have been enabled to work with OpenVINO:

Note: To take advantage of this feature, see these instructions to build OpenVINO jar (Linux), and these to build Spark NLP. OpenVINO is cross-platform. Refer here for Windows build instructions, and here for other platforms.

Motivation and Context

  • Out-of-the-box optimizations and better performance on supported Intel hardware
  • Capable of reading ONNX, PaddlePaddle, TensorFlow and TensorFlow Lite formats directly
  • This work was completed as part of Google Summer of Code 2023

Screenshots (if appropriate):

Types of changes

  • Bug fix (non-breaking change which fixes an issue)
  • Code improvements with no or little impact
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to change)

Checklist:

  • My code follows the code style of this project.
  • My change requires a change to the documentation.
  • I have updated the documentation accordingly.
  • I have read the CONTRIBUTING page.
  • I have added tests to cover my changes.
  • All new and existing tests passed.

@maziyarpanahi maziyarpanahi self-requested a review March 10, 2024 12:20
@maziyarpanahi maziyarpanahi self-assigned this Mar 10, 2024
@maziyarpanahi maziyarpanahi added on-hold cannot be merged right away new-feature Introducing a new feature DON'T MERGE Do not merge this PR labels Mar 10, 2024
* Use CPU by default for OpenVINO inference due to error loading device config in cluster envs
@DevinTDHa DevinTDHa mentioned this pull request May 20, 2024
@maziyarpanahi maziyarpanahi changed the base branch from master to release/540-release-candidate May 21, 2024 12:34
@maziyarpanahi maziyarpanahi merged commit fabc4ab into JohnSnowLabs:release/540-release-candidate May 21, 2024
1 of 4 checks passed
@rajatkrishna rajatkrishna deleted the openvino-integration branch May 31, 2024 18:41