Skip to content

russellgeum/SLM-Pruning-Quantization

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 

Repository files navigation

SLM-Pruning-Quantization

Pruning, Quantization recipe for Small Language Model
Official codebase for (One-Short) Depth Pruning + (PTQ) GPTQ framework for SLM

  • We quantitatively demonstrate the results of applying one-shot pruning and post-training quantization to SLM.
  • This repository plans to expand by demonstrating the results of applying more models and techniques in the future.

Requirements

Directory

Reference

About

Pruning, Quantization recipe for Small Language Model

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published