Regarding the diagram - yeah, it's SO much easier to visualize when you see it. It takes oddly a bit of time just to make boxes fit though. Huge fan of Traefik too :)
Most of my fine-tuning was on 355 (otherwise, 774 is too hard to train).
The GPU with inference is helpful for much faster responses -- for instance, it's easy to run inference (CPU) if you're getting a one way shot of 200 words (no revisions), but if you're constantly changing and revising, then speed ends up mattering a lot for UX.
EDIT: Looked over your articles, I'm literally floored/amazed you're in high school and know this much. The whole world is your oyster.
Most of my fine-tuning was on 355 (otherwise, 774 is too hard to train).
However, the default prompt on https://writeup.ai goes to 774.
The GPU with inference is helpful for much faster responses -- for instance, it's easy to run inference (CPU) if you're getting a one way shot of 200 words (no revisions), but if you're constantly changing and revising, then speed ends up mattering a lot for UX.
EDIT: Looked over your articles, I'm literally floored/amazed you're in high school and know this much. The whole world is your oyster.