Skip to content

mhtjsh/llm_reasoning-pruning

Repository files navigation

LLM Reasoning and Pruning Methods (paper implementation)

Paper implementation of pruning techniques of reasoning thought chains, to delete redundant tokens which unnecessary hog up the KV cache memory but do not contribute much in the reasoning or logic building to get to the final solution.

About

Paper implementation of pruning techniques of reasoning thought chains, to delete redundant tokens which hog up the KV cache memory.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors