• 14 Posts
  • 2 Comments
Joined 1 year ago
Cake day: June 10th, 2023

  • I am actively testing this out. It’s hard to say at the moment. There’s a lot to figure out when deploying a model into a live environment, but I think there’s real value in using these models for technical tasks - especially as they mature and improve over time.

    At the moment, performance is closer to GPT-3.5 than GPT-4, but I wouldn’t be surprised if that changes within the next year or so.


  • After finally having a chance to test some of the new Llama-2 models, I think you’re right. There’s still some work to be done to get them tuned up… I’m going to dust off some of my notes and put out a new index of those other popular gen-1 models later this week.

    I’m very curious to try out some of these docker images, too. Thanks for sharing those! I’ll check them out when I can. I could also make a post about them if you’d like some of your work featured. Just let me know!