I have a couple of small projects I am working on around llms (#llm). I want to convert the llama2 project to java8 not java21 due to the prevalence of java8. Also want to build a llm around certain data. I still haven't found a comprehensive guide on what is fully involved. This post is more about, "I confused like a 5 year old and want to highlight certain things" These posts are the best and there is probably weeks worth of review of these videos and projects. And if you are thorough, you really should take apart, the tensor project. Tensorflow, pytorch, all of it to get a full understanding of the process. It is really just more than "run the inference" chat engine. And that is where I got stuck on. They say, all you need is the C file and the token data and model data. That is true but there are a lot of questions I asked. Where are the actual tokens? Where did you get them from? The actual words? What about th...