Skip to content
@ModelCloud

ModelCloud.ai

Our mission is to give allow everyone, including bots, unlimited and free access to llm/ai models.

Pinned Loading

  1. GPTQModel Public

    Production ready LLM model compression/quantization toolkit with hw accelerated inference support for both cpu/gpu via HF, vLLM, and SGLang.

    Python 537 77

  2. Device-SMI Public

    Self-contained Python lib with zero-dependencies that give you a unified device properties for gpu, cpu, and npu. No more calling separate tools such as nvidia-smi or /proc/cpuinfo and parsing it y…

    Python 11 1

Repositories

Showing 10 of 12 repositories
  • GPTQModel Public

    Production ready LLM model compression/quantization toolkit with hw accelerated inference support for both cpu/gpu via HF, vLLM, and SGLang.

    Python 537 Apache-2.0 77 29 12 Updated May 9, 2025
  • lm-evaluation-harness Public Forked from EleutherAI/lm-evaluation-harness

    A framework for few-shot evaluation of language models.

    Python 0 MIT 2,385 0 0 Updated Apr 17, 2025
  • LogBar Public

    A unified Logger and ProgressBar util with zero dependencies.

    Python 4 Apache-2.0 0 1 0 Updated Apr 1, 2025
  • vllm Public Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python 0 Apache-2.0 7,428 0 0 Updated Mar 27, 2025
  • rockthem Public
    Cuda 0 Apache-2.0 0 0 0 Updated Mar 13, 2025
  • Tokenicer Public

    A (nicer) tokenizer you want to use for model inference and training: with all known peventable gotchas normalized or auto-fixed.

    Python 8 Apache-2.0 2 0 1 Updated Mar 12, 2025
  • Python 0 CC-BY-4.0 1 0 0 Updated Mar 6, 2025
  • peft Public Forked from huggingface/peft

    🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

    Python 0 Apache-2.0 1,879 0 0 Updated Mar 4, 2025
  • sglang Public Forked from sgl-project/sglang

    SGLang is a fast serving framework for large language models and vision language models.

    Python 0 Apache-2.0 1,715 0 0 Updated Mar 4, 2025
  • Device-SMI Public

    Self-contained Python lib with zero-dependencies that give you a unified device properties for gpu, cpu, and npu. No more calling separate tools such as nvidia-smi or /proc/cpuinfo and parsing it yourself.

    Python 11 Apache-2.0 1 2 1 Updated Mar 1, 2025