Local LLM with llamafiles

tomlarkworthy · January 7, 2024, 2:12pm

I just tried out llamafiles and I am blown away on how easy to setup and use they are. Very decent speed on Apple silicon. Very decent code completion.

I have a realtime speed example here:
https://twitter.com/tomlarkworthy/status/1743996860467990532

Example integration

I want to be a bit more aggressive in auto-retrying LLM prompting in a loop, but the cost is a bit prohibitive so I wanted to explore my options a bit. I am really excited by this option. My M1 Pro 16GB is a little underpowered though, I could not run every model. Next laptop refresh I will beg for a 32GB or better.

Topic		Replies	Views
Complex Software Development with ChatGPT & Observablehq Show and Tell	0	212	December 13, 2023
Kudos, notebook to nodejs works Feedback	4	950	January 6, 2021
nice new landing page :) Feedback	4	559	November 15, 2019
Let’s talk about Observable Suggestions Feedback	44	3155	June 3, 2022
My Questions & Wanted to say "Hello" (in a fun special way..) Feedback	1	653	December 5, 2019

Local LLM with llamafiles

Related topics