Permute Quantize Finetune Alternatives

Using ideas from product quantization for state-of-the-art neural network compression.
Suggest Alternative
Alternatives To uber-research/permute-quantize-finetune
Project Name Stars Downloads Repos Using This Packages Using This Most Recent Commit Total Releases Latest Release Open Issues License Language
SYSTRAN/faster-whisper 6,940 0 22 about 2 years ago 12 November 26, 2023 140 mit Python
Faster Whisper transcription with CTranslate2
UFund-Me/Qbot 4,799 0 0 over 2 years ago 0 51 mit Jupyter Notebook
[🔥updating ...] AI 自动量化交易机器人 AI-powered Quantitative Investment Research Platform. 📃 online docs: https://ufund-me.github.io/Qbot ✨ :news: qbot-mini: https://github.com/Charmve/iQuant
AutoGPTQ/AutoGPTQ 3,206 0 0 about 2 years ago 0 174 mit Python
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
IntelLabs/nlp-architect 2,924 0 0 over 3 years ago 10 April 12, 2020 14 apache-2.0 Python
A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural networks
Tencent/PocketFlow 2,553 0 0 over 5 years ago 0 73 other Python
An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.
OpenNMT/CTranslate2 2,437 0 23 about 2 years ago 103 December 05, 2023 110 mit C++
Fast inference engine for Transformer models
dvmazur/mixtral-offloading 1,943 0 0 about 2 years ago 0 12 mit Python
Run Mixtral-8x7B models in Colab or consumer desktops
lucidrains/vector-quantize-pytorch 1,627 0 25 about 2 years ago 160 December 06, 2023 27 mit Python
Vector Quantization, in Pytorch
htqin/awesome-model-quantization 1,449 0 0 over 2 years ago 0 0
A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (papers, repositories) that are missed by the repo.
tensorflow/model-optimization 1,445 3 27 over 2 years ago 30 May 26, 2023 207 apache-2.0 Python
A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.
Alternatives To uber-research/permute-quantize-finetune
Select To Compare


Alternative Project Comparisons
Popular Quantization Projects
Popular Deep Learning Projects
Popular Machine Learning Categories
Related Searches
Get A Weekly Email With Trending Projects
No Spam. Unsubscribe easily at any time.
Privacy | About | Terms | Follow Us On Twitter

Downloads, Dependent Repos, Dependent Packages, Total Releases, Latest Releases data powered by Libraries.io.

Copyright 2018-2026 Awesome Open Source.  All rights reserved.