alright folks, let’s get real. we all have our sprawling digital fortresses, carefully constructed brick by brick. but there’s always that one piece of software, that one perfectly tuned instance, where if it so much as hiccuped, you’d be ready to throw the entire homelab out the window and start fresh in a cave. what is it for you? what’s your absolute, non-negotiable, ‘i will personally visit the datacenter if this fails’ self-hosted application? for me, it’s my media server stack. my wife would disown me. don’t let me down, arr suite.

  • curbstickle@anarchist.nexus
    link
    fedilink
    English
    arrow-up
    5
    ·
    16 hours ago

    Well the only way my media server goes down is if the power is out for more than a few hours or my internet connection is down.

    So I’d have to say that.

    • fhoekstra@feddit.nl
      link
      fedilink
      English
      arrow-up
      1
      ·
      11 hours ago

      I don’t believe you, but I’d like to be proven wrong.

      I expect you have a UPS that feeds your hosts and networking equipment and something like ZFS for disk redundancy. This protects against the most common failures and is usually enough, but there are still single points of failure in such a setup, that are not as common, not as hard to deal with through manual intervention, and quite difficult to protect with redundancy.

      I would be surprised if you are protected against the following single points of failure without manual intervention:

      • NAS machine (not just disk) failure. You would need to have a multi-node distributed storage, like Ceph, to protect against this.
      • Networking equipment failure. I think you can do some magic with BGP to do this, but I’m not a network engineer and I’ve never set up a redundant network.
      • curbstickle@anarchist.nexus
        link
        fedilink
        English
        arrow-up
        1
        ·
        7 hours ago

        Ceph for the proxmox cluster, 2x48 port switch + 16 port 10gbit as the core, 2xNAS (technically one is the backup, and there would be a few moments of downtime as the containers restart - a different container with the same config pointed to the backup NAS instead).

        UPS and internet are the SPoF.