SparkNLP - 995 Introducing MistralAI LLMs #14163

prabod
Contributor

@prabod prabod commented Feb 6, 2024

Introducing MistralAI LLM models

Description

Mistral 7B is a 7.3 billion-parameter model that stands out for its efficient and effective
performance in natural language processing. It surpasses Llama 2 13B across all benchmarks and
outperforms Llama 1 34B in various aspects. Mistral 7B strikes a balance between English
language tasks and code comprehension, rivaling the capabilities of CodeLlama 7B in the
latter.

Types of changes

  • New feature (non-breaking change which adds functionality)

Checklist:

  • My code follows the code style of this project.
  • My change requires a change to the documentation.
  • I have updated the documentation accordingly.
  • I have read the CONTRIBUTING page.
  • I have added tests to cover my changes.
  • All new and existing tests passed.

@prabod prabod changed the title SparkNLP - 995 implement mistral as casual lm similar to llama 2 for text generation SparkNLP - 995 Introducing MistralAI LLMs Feb 6, 2024
@prabod prabod self-assigned this Feb 6, 2024
@prabod prabod added the new-feature, new model, and DON'T MERGE labels Feb 6, 2024
@maziyarpanahi maziyarpanahi changed the base branch from master to release/540-release-candidate June 6, 2024 11:11
@maziyarpanahi maziyarpanahi self-requested a review June 6, 2024 11:12
…s-CasualLM-similar-to-Llama-2-for-text-generation' into SPARKNLP-995-Implement-Mistral-as-CasualLM-similar-to-Llama-2-for-text-generation

# Conflicts:
#	python/sparknlp/annotator/seq2seq/__init__.py
#	python/sparknlp/annotator/seq2seq/llama2_transformer.py
#	python/sparknlp/internal/__init__.py
#	python/test/annotator/seq2seq/llama2_transformer_test.py
#	src/main/scala/com/johnsnowlabs/ml/ai/Bart.scala
#	src/main/scala/com/johnsnowlabs/ml/ai/LLAMA2.scala
#	src/main/scala/com/johnsnowlabs/ml/ai/VisionEncoderDecoder.scala
#	src/main/scala/com/johnsnowlabs/ml/ai/util/Generation/Generate.scala
#	src/main/scala/com/johnsnowlabs/ml/onnx/OnnxSerializeModel.scala
#	src/main/scala/com/johnsnowlabs/ml/onnx/OnnxWrapper.scala
#	src/main/scala/com/johnsnowlabs/ml/util/LoadExternalModel.scala
#	src/main/scala/com/johnsnowlabs/nlp/annotators/seq2seq/LLAMA2Transformer.scala
#	src/test/scala/com/johnsnowlabs/nlp/annotators/seq2seq/LLAMA2TestSpec.scala
@prabod prabod closed this Jun 9, 2024
Labels
DON'T MERGE, new model, new-feature

1 participant