Compactif AI: Extreme Compression of LLMs and AI Models
06 Mar 2024
AI, Machine Learning & Advanced Analytics
In the current context of generative AI, the compression and optimization of the consumption and associated costs of this type of models is key. CompactifAI is a tool based on Tensor Networks that allows compression by up to 85% while maintaining 90% accuracy.