SparkNLP - 995 Introducing MistralAI LLMs #14163

prabod · 2024-02-06T11:48:08Z

Introducing MistralAI LLM models

Description

Mistral 7B, a 7.3 billion-parameter model that stands out for its efficient and effective
performance in natural language processing. Surpassing Llama 2 13B across all benchmarks and
excelling over Llama 1 34B in various aspects, Mistral 7B strikes a balance between English
language tasks and code comprehension, rivaling the capabilities of CodeLlama 7B in the
latter.

Types of changes

New feature (non-breaking change which adds functionality)

Checklist:

My code follows the code style of this project.
My change requires a change to the documentation.
I have updated the documentation accordingly.
I have read the CONTRIBUTING page.
I have added tests to cover my changes.
All new and existing tests passed.

- added beam search support for LLAMA2

changed test to slow test

…s-CasualLM-similar-to-Llama-2-for-text-generation' into SPARKNLP-995-Implement-Mistral-as-CasualLM-similar-to-Llama-2-for-text-generation # Conflicts: # python/sparknlp/annotator/seq2seq/__init__.py # python/sparknlp/annotator/seq2seq/llama2_transformer.py # python/sparknlp/internal/__init__.py # python/test/annotator/seq2seq/llama2_transformer_test.py # src/main/scala/com/johnsnowlabs/ml/ai/Bart.scala # src/main/scala/com/johnsnowlabs/ml/ai/LLAMA2.scala # src/main/scala/com/johnsnowlabs/ml/ai/VisionEncoderDecoder.scala # src/main/scala/com/johnsnowlabs/ml/ai/util/Generation/Generate.scala # src/main/scala/com/johnsnowlabs/ml/onnx/OnnxSerializeModel.scala # src/main/scala/com/johnsnowlabs/ml/onnx/OnnxWrapper.scala # src/main/scala/com/johnsnowlabs/ml/util/LoadExternalModel.scala # src/main/scala/com/johnsnowlabs/nlp/annotators/seq2seq/LLAMA2Transformer.scala # src/test/scala/com/johnsnowlabs/nlp/annotators/seq2seq/LLAMA2TestSpec.scala

prabod added 16 commits January 23, 2024 13:39

introducing LLAMA2

8045e3f

Added option to read model from model path to onnx wrapper

492e6c3

Added option to read model from model path to onnx wrapper

48cb061

updated text description

b4cf4cf

LLAMA2 python API

44bbc87

added method to save onnx_data

95d2587

added position ids

a9b2b5c

- updated Generate.scala to accept onnx tensors

5304c1f

- added beam search support for LLAMA2

updated max input length

46e8f15

updated python default params

c0b2c4f

changed test to slow test

fixed serialization bug

4dbd0d4

Added Mistral Scala API

18c1865

Added Mistral Python API

270f59f

Added Mistral Python tests

dd45a57

updated links

d287d19

updated output

3deac19

prabod changed the title ~~SparkNLP - 995 implement mistral as casual lm similar to llama 2 for text generation~~ SparkNLP - 995 Introducing MistralAI LLMs Feb 6, 2024

prabod self-assigned this Feb 6, 2024

prabod added new-feature Introducing a new feature new model DON'T MERGE Do not merge this PR labels Feb 6, 2024

maziyarpanahi changed the base branch from master to release/540-release-candidate June 6, 2024 11:11

maziyarpanahi self-requested a review June 6, 2024 11:12

prabod added 7 commits June 6, 2024 11:20

LLAMA2 python API

ad9807a

Added Mistral Scala API

5105734

Added Mistral Python API

46105ab

Added Mistral Python tests

1b6b3c3

updated links

08533d1

updated output

76bc68b

Revert to 3deac19

56a29f2

prabod closed this Jun 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SparkNLP - 995 Introducing MistralAI LLMs #14163

SparkNLP - 995 Introducing MistralAI LLMs #14163

prabod commented Feb 6, 2024

SparkNLP - 995 Introducing MistralAI LLMs #14163

SparkNLP - 995 Introducing MistralAI LLMs #14163

Conversation

prabod commented Feb 6, 2024

Description

Types of changes

Checklist: