I’m looking to build a low-end ollama LLM server to improve home assistant voice control, Immich image recognition and a few other services. With the current cost of hardware components like memory, I’m looking to build something small, but somewhat expandable.

I have an old micro-atx form factor computer that I’m thinking will be a good option to upgrade. I’d love recommendations on motherboards, processors, and video card combos that would likely be compatible and sufficient to run a decent server while keeping costs lower, basically, the best bang for the buck. I have a couple of M.2 SSDs I can re-purpose. Would prefer the motherboard has 2.5Gbit Ethernet, but otherwise I’m open.

Also recommendations on sites to purchase good quality memory at reasonable prices that ship to the US. I’d be willing to look at lightly used components, too.

Any advice on any of these topics would be greatly appreciated. The advice I’ve found has all been out of date especially with crypto fading so video cards are not as expensive, but LLM data centers eating up and reserving memory before it’s even manufactured.

  • chrash0@lemmy.world
    link
    fedilink
    English
    arrow-up
    13
    ·
    1 day ago

    honestly it’s hard to beat Macs these days in this space for two reasons:

    • unified memory means that you don’t have to load up on RAM just to load the model and then also shell out for a video card with barely enough VRAM to fit a basic language model
    • their supply chain is solid and has mostly avoided the constraints that other OEMs and parts manufacturers are struggling with

    pricing is tough. sure, crypto is on its way out, but GPUs are still the platform of choice for most neural net workloads (outside of SoCs like Apple M-series). i built a PC in late 2024, and it’s easily worth twice what i paid for it.

    • Scipitie@lemmy.dbzer0.com
      link
      fedilink
      English
      arrow-up
      2
      ·
      11 hours ago

      Depends what you want to do… For example I didn’t get python whisper in a container to run on Mac in any way that can be called “performance” and I don’t want my dev workflow to optimize for an OS I despise :D

      • chrash0@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        arrow-down
        1
        ·
        9 hours ago

        in a container

        well there’s your issue. i get not liking the OS, but actively crippling your project will cripple your project.

        containers on macOS do kinda suck

        • Scipitie@lemmy.dbzer0.com
          link
          fedilink
          English
          arrow-up
          1
          arrow-down
          1
          ·
          7 hours ago

          That’s sich a Mac answer it’s unbelievable.

          Describing “A project aimed to be agnostic of it’s environment” as a design mistake and not a inherent flaw of the OS is… Just wow.

          Remember in this thread it’s about the pro and con of Macos as interference hardware. This is a major flaw which comes baked into the hardware. I tested it and find it an unacceptable limitation. It’s important for others to know.

          To state “containerization is the issue” though… Just wow.

          • Jade@programming.dev
            link
            fedilink
            English
            arrow-up
            2
            ·
            6 hours ago

            Unfortunately containerisation on macos usually means running virtualized Linux, which of course is going to add overhead and cut off access to apple APIs and some hardware. So yep. There’s plenty that runs natively.

            • chrash0@lemmy.world
              link
              fedilink
              English
              arrow-up
              1
              ·
              5 hours ago

              thanks for clarifying. it was hard for me to dignify such a comment with a response.

              you’re also going to run into hardware acceleration issues trying to run Metal acceleration with a Linux kernel. i don’t really see a need to containerize these workloads these days anyway with tools like uv.

              it’s a big pain in my ass at times trying to do web dev work with an aarch64-darwin dev env vs the target x86_64-linux. adding in hardware acceleration issues just sounds painful.

              i also just personally don’t like containers. feels like bludgeon of a solution.

      • WASTECH@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        1 hour ago

        I haven’t looked into Asahi Linux in a while now, but I figured the experience would be pretty good by now. You don’t need to “hack” anything to get it to run. Last I read, there were just a few driver issues, but I haven’t looked into it in probably 2-3 years now.

      • chrash0@lemmy.world
        link
        fedilink
        English
        arrow-up
        4
        ·
        1 day ago

        super fair. i am a Linux guy normally. i’m just being honest. i wish there was a better more open alternative.

        if you want to go with the Linux alternative it’s going to cost. get at least 32GB of RAM and at least a 4090 to run the kind of models you’re asking for. it’s the way she goes

      • ryokimball@infosec.pub
        link
        fedilink
        English
        arrow-up
        5
        arrow-down
        2
        ·
        24 hours ago

        The apple silicon is more energy efficient but the latest Intel and AMD CPUs deliver more processing power and can also share a significant amount of RAM to the GPU / AI components.

    • curbstickle@anarchist.nexus
      link
      fedilink
      English
      arrow-up
      6
      ·
      1 day ago

      Going to second this, its all my m2 does right now. Putting together a solution for the office with some m4s.

      Its a lot of bang for the buck specifically for llm use despite being horribly overpriced otherwise.

    • irmadlad@lemmy.world
      link
      fedilink
      English
      arrow-up
      3
      arrow-down
      1
      ·
      edit-2
      1 day ago

      i built a PC in late 2024, and it’s easily worth twice what i paid for it.

      spoiler

      I wrote the vendor and asked him if the decimal was in the right place or was this the model that was beta testing alien technology. Got to be a misprint.