ℍ𝕂-𝟞𝟝

  • 0 Posts
  • 106 Comments
Joined 2 years ago
cake
Cake day: July 14th, 2024

help-circle
  • I actually did an experiment on doing just that. For context, I’m an experienced software engineer, whose company buys him a tom of Claude usage so I had time to test out what it can actually do and I feel like I’m capable of judging where it’s good and where it falls short at.

    How Claude Code works is that there are actually multiple models involved, one for doign the coding, one “reasoning” model to keep the chain of thought and the context going, and a bunch of small specialized ones for odd jobs around the thing.

    The thing that doesn’t work yet is that the big reasoning model has to still be big, otherwise it will hallucinate frequently enough to break the workflow. If you could get one of the big models to run locally, you’d be there. However, with recent advances in quantization and MoE models, it’s actually getting nearer fast enough that I would expect it to be generally available in a year or two.

    Today the best I could do was a tool that could take 150 gigs of RAM, 24 gigs of VRAM and AMD’s top of the line card to take 30 minutes what takes Claude Code 1-2. But surprisingly, the output of the model was not bad at all.




  • Oh, there are those as well, I’m not dunking on juniors.

    It’s just that my problems always tend to be caused by mismanagement of people.

    Like just today I had to clean up after a “let’s do a quick and dirty experiment, oh it works so now it’s production, make 200 more features in a month built on top of the quick and dirty let’s just try it code, what do you mean we lost millions because of a regression nobody even noticed” situation.


  • Yeah that shit is more common than people think.

    A big part of the business of cloud providers is that most orgs have no idea how to do shit. Their enterprise consultants are also wildly variable in competence.

    There was also a large amount of useless bullshit that I needed to cut down since being hired at my current spot, but the amount of containers is actually warranted. We do have that traffic, which is both happy and sad, since while business is booming, I have to deal with this.








  • If you look at what AI does, however, it’s mostly classification.

    Not necessarily, a huge use case is regulation and control in the engineering, not the political sense. Like driverless cars, independently flying drones and such. And yeah, they need classification subsystems under the hood to work, but their ultimate outputs are complex control signals, not simple classes.

    And don’t get me wrong, I also like ML and AI as a field, I just don’t like how OpenAI fucked the field with text generators that they got Silicon Valley to worship like gods. I even like LLMs, just not the grotesquely outsized cult around them.






  • I’ve a slight manageable case of ADHD and I tend to obsessively hyperfocus on tasks. It’s a good relationship because I get a lot of shit done well, and enjoy my work.

    If you start forcing me to plan out my day every day, down to 15 minute increments, my productivity drops by around 60%, because I stop concentrating on getting shit done, and start working to rule. Not because I’m vindictive, but because that’s what you asked me to do.