Like the issue with modern AI is the data centers and central control no? How feasible would an AI be, whose code is FOSS and that is trained and running decentralized?

  • SnarkoPolo@lemmy.world
    link
    fedilink
    arrow-up
    3
    ·
    7 hours ago

    It’s a capitalist world. Economies of scale will always tip the balance toward centralization, in the hands of billionaires.

  • Mugita Sokio@lemmy.today
    link
    fedilink
    English
    arrow-up
    2
    ·
    10 hours ago

    You should, in fact, look into how centralized AI is potentially harmful to your privacy. I look at decentralizing AI as the means to completely destroy the likes of Grok, GPT, etc.

    • ☭SaltyIcetea☭@lemmy.mlOP
      link
      fedilink
      arrow-up
      1
      ·
      20 hours ago

      no i mean more like a public service, but instead of one centralized power with massive datacenters, it is a distributed zraining and computation. like reddit vs. lemmy for example.

      • klangcola@reddthat.com
        link
        fedilink
        arrow-up
        8
        arrow-down
        1
        ·
        19 hours ago

        Hardest issue would probably be financing, and motivation.

        GPUs are expensive, electricity is expensive. All the current major LLMs are huge loss leaders for giant players with deep pockets. A distributed AI service would be by smaller players without the financing nor the motivation to upfront all the cost.

        There is “folding@home” where you donate time on your hardware for scientific calculations, but that’s quite different from donating time on your hardware to some random unknown stranger to generate AI cat images or summarise a news article.

        Lemmy and Mastodon etc have a comparatively modest monetary (and energy/environmental) cost, and the benefit is building communities and bringing people together. For distributed AI the cost ( monetary and energy/environmental) is higher, and the benefit is limited.

        • 🇵🇸antifa_ceo@lemmy.ml
          link
          fedilink
          English
          arrow-up
          1
          arrow-down
          1
          ·
          edit-2
          14 hours ago

          This is only a problem in a world where we spend no time to optimize these models like we are doing today where we just throw more power at them rather than engineering them to be…better. Look at how China is doing AI - their more limited resources in this regard have forced them to invest in LLM models that work on more modest hardware with much less necessary power. This is necessarily the direction this development must continue to make it a viable product for the average person to engage with without the need for an oppressive mega corporation footing the infrastructure bill (and poisoning the surrounding population at the same time).

          Edit: I’ll add that I am broadly not in favor of AI as a whole but the tech is here and has novel use cases. Making the models more efficient is a necessary step towards seeing this tech’s true usefulness be actualized.

  • JASN_DE@feddit.org
    link
    fedilink
    arrow-up
    6
    ·
    20 hours ago

    Technically? Not a problem. The reason most of them run in data centers is the massive amount of computational and therefore also electrical power you need to run a somewhat useful model.

    Even worse when you need to initially train them. That’ll really hit the wallet.

    There are (by now) vast selection of models you can easily run at home without any outside connection, as long as you have reasonable hardware to run them.

    • village604@adultswim.fan
      link
      fedilink
      English
      arrow-up
      1
      ·
      18 hours ago

      Based on OPs comments in the rest of the thread, they’re talking about a fold@home type system, not a locally run LLM.

  • tal@lemmy.today
    link
    fedilink
    English
    arrow-up
    4
    ·
    20 hours ago

    If you mean distributing inference across many machines, each of which could not individually deal with a large model, using today’s models, not viable with reasonable performance. The problem is that you require a lot of bandwidth between layers; a lot of data moves. When you cluster current systems, you tend to use specialized, high-bandwidth links.

    It might theoretically be possible to build models that are more-amenable to this sort of thing, that have small parts of a model run on nodes that have little data interchange between them. But until they’re built, hard to say.

    I’d also be a little leery of how energy-efficient such a thing is, especially if you want to use CPUs — which are probably more-amenable to be run in a shared fashion than GPUs. Just using CPU time “in the background” also probably won’t work as well as with a system running other tasks, because the limiting factor isn’t heavy crunching on a small amount of data — where a processor can make use of idle cores without much impact to other tasks — but bandwidth to the memory, which is gonna be a bottleneck for the whole system. Also, some fairly substantial memory demands, unless you can also get model size way down.

  • Oka@sopuli.xyz
    link
    fedilink
    arrow-up
    2
    ·
    20 hours ago

    Plausibly feasible, but there would be more people using the service than letting the service borrow their hardware for processing or memory. I imagine it would work like a botnet, where if a user generates a prompt, their machine would borrow other nearby machines to process the command efficiently. So, in order to make it a fair service, perhaps the software does a POST and only operates if you meet a minimum hardware specification, and a stable internet connection.