• Scrollone@feddit.it
      link
      fedilink
      English
      arrow-up
      7
      arrow-down
      1
      ·
      11 hours ago

      Exactly, I just keep using the free plan and when I finish the amount for the day I just switch to another service

      • iocase@lemmy.zip
        link
        fedilink
        English
        arrow-up
        2
        ·
        2 hours ago

        This is why IMO blitz scaling is dumb when your service is a commodity. I’m not any more loyal to Uber than Skip. If more investor money goes into making a cheaper meal or ride on Skip I use that. Consumers are mercenaries about that stuff.

        The “blitz” part of blitz scaling assumes your customers can’t move.

        • Scrollone@feddit.it
          link
          fedilink
          English
          arrow-up
          1
          ·
          36 minutes ago

          Exactly. And LLMs don’t have a way to keep you inside of their walled garden; if any, I prefer starting from a blank slate every time I ask something.

  • nullspace@lemmy.world
    link
    fedilink
    English
    arrow-up
    11
    arrow-down
    2
    ·
    11 hours ago

    I can’t imagine paying for AI when the open source tools have made it so easy to set up a model locally.

    • potustheplant@feddit.nl
      link
      fedilink
      English
      arrow-up
      17
      arrow-down
      4
      ·
      10 hours ago

      Don’t be daft. The vast majority of people don’t have the knowledge or resources to set that up locally.

      • nullspace@lemmy.world
        link
        fedilink
        English
        arrow-up
        3
        arrow-down
        1
        ·
        2 hours ago

        You’re right if we’re talking about the entire population of Earth. With these local models though, other people have already done all the hard stuff. Anyone with an RTX card and just a minimum level of patience can get going.

        • potustheplant@feddit.nl
          link
          fedilink
          English
          arrow-up
          1
          arrow-down
          1
          ·
          1 hour ago

          Minimum for local models is 12gb imo. There are several “rtx” cards that have 8gb. Also, why nvidia? AMD works well too. My previous point stands, still. If you don’t already have the hw, buying a pc today is very expensive. I don’t know if you go out much but it ain’t pretty out there. People arrn’t precisely swimming in cash.

          Also, patience isn’t the only requirement. Keep in mind that some people struggle to even install a program.

    • ranzispa@mander.xyz
      link
      fedilink
      English
      arrow-up
      9
      arrow-down
      1
      ·
      10 hours ago

      Easy to set up, but still needs a 15k $ graphics card and electricity bill. The price you pay openai/anthropic is much cheaper than that for that quality of model.

      Sure, you can setup a small model on a consumer graphics card, but the output will be considerably worse and the processing speed considerably lower.

      For 240€/year you got a subscription to anthropic which will happily ingest a whole repository and process it in about one minute. No matter what latest model GPU you installed on your computer, you won’t be able to do that.

      Sure, this guy was able to run a 26B model on an old CPU: https://point.free/blog/gemma-4-on-a-2016-xeon/

      But that was not easy at all and the speed you get is definitely not the same as the one provided for a very cheap price.

      • zbyte64@awful.systems
        link
        fedilink
        English
        arrow-up
        2
        ·
        2 hours ago

        If you were paying the real price it would be 2 grand a year though. And in 5 years that 15k graphics card will be $200 and sip on electricity by comparison.

        • ranzispa@mander.xyz
          link
          fedilink
          English
          arrow-up
          1
          ·
          6 minutes ago

          A100 is 6 years old and is now sold at over 10k $. If you were paying a higher price it could be cheaper to buy the card, since the prices are low that is not the case.

        • kuerbiskernoel@feddit.org
          link
          fedilink
          English
          arrow-up
          1
          ·
          56 minutes ago

          Currently nearly 5 year old used graphics cards are being sold for their initial price. Not sure how much they’ll get cheaper…

  • mfed1122@discuss.tchncs.de
    link
    fedilink
    English
    arrow-up
    20
    ·
    14 hours ago

    I mean, this is no different than Walmart making prices low until other businesses die out and then raising them.

    It is no different than police shoving all the homeless people and drug addicts into one area of town to crash the property prices, and then evicting them once developers buy everything for cheap.

    They’re purposely operating at a loss in the expectation that they can get ingrained into a ton of workflows, and then gouge everyone absolutely to death while also worsening the quality of the service to make it cheaper for them to run.

    If it weren’t so horrible for the environment, I’d kind of like it, because all the dumbass executives that are signing up for this are going to get exactly what they deserve. You’d think they’d recognize a scheme when they see one.

    • fishy@lemmy.today
      link
      fedilink
      English
      arrow-up
      14
      ·
      14 hours ago

      My CEO (whom I don’t consider a particularly good or bad CEO) spent a day playing with AI then when asked if he’d sign the company up with the service he literally laughed in their faces and said it’s useless. I was honestly shocked because he’s totally into buzzword and popular crap. Gained a lot of respect for him that day.

      • WorldsDumbestMan@lemmy.today
        link
        fedilink
        English
        arrow-up
        2
        ·
        13 hours ago

        An older co-worker seems to ask AI for help during work, we are blue collar. But the Owner of the company does not seem to use it whatsoever.

        I ask Claude on occasion, to see if it will say something smart (it was mostly useless as fuck).

        • Scrollone@feddit.it
          link
          fedilink
          English
          arrow-up
          3
          ·
          11 hours ago

          Honestly I think Claude it’s good at programming. Way better than ChatGPT.

          But I ain’t going to pay for it.

          • ranzispa@mander.xyz
            link
            fedilink
            English
            arrow-up
            4
            ·
            10 hours ago

            Published a library doing some very specific data processing. One of the algorithms I implemented was a bit too slow: it would take about a week to process data. I reckon implementation was a little bit sloppy, but I’ve been implementing a bunch of algorithms from research papers and this was pretty much the published implementation.

            I asked Claude to analyse the implementation and check whether it could be improved, half an hour later I got a 26,000% improvement in performance with exactly the same results passing all tests.

            Of course, I could have done that myself. But optimization had to go down to simd level; I doubt I would have been able to do that in less than a week of work.

  • AdolfSchmitler@lemmy.world
    link
    fedilink
    English
    arrow-up
    15
    ·
    14 hours ago

    Trust me bro we’re so close to profitability bro, just need this IPO to secure funding one last time bro then we’ll be profitable bro I swear.

  • Cornpop@lemmy.world
    link
    fedilink
    English
    arrow-up
    16
    ·
    20 hours ago

    What is the actual “cost” after they buy the hardware, is that $1000 really pure power usage cost?

    • Corkyskog@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      15
      ·
      19 hours ago

      The problem is that the hardware has a 5 or 6 year depreciation schedule on paper, but NVIDIA keeps saying that their next generation chip will be twice as good as their last chip so there is a FOMO schedule of like every two years.

      • nullspace@lemmy.world
        link
        fedilink
        English
        arrow-up
        3
        ·
        11 hours ago

        Would be nice to see that used hardware for sale rather than it being junked as a writeoff.

    • rumba@lemmy.zip
      link
      fedilink
      English
      arrow-up
      5
      ·
      19 hours ago

      that’s the $84,000 question. They’re filling datacenters with the fastest possible equipment and need it to be 10x faster, That hardware is dinosaur fodder a year after they install it.

    • HereIAm@lemmy.world
      link
      fedilink
      English
      arrow-up
      4
      ·
      19 hours ago

      I’m curious as well. My knowledge is probably quite outdated, but from what I understood the training part is what’s expensive and then querying the model is pretty cheap. Is it still true (or was it ever) that the generated answers on search engines are cheaper to generate than the actual search results?

      • CheeseNoodle@lemmy.world
        link
        fedilink
        English
        arrow-up
        10
        ·
        19 hours ago

        I find that hard to believe, I recently had to uninstall co-pilot after it weaseled its way into my search bar. Its not an exageration to say that my PC literally ran cyberpunk 2077 with pathtracting better than it ran the fucking windows search bar with co-pilot.

        • SorryQuick@lemmy.ca
          link
          fedilink
          English
          arrow-up
          3
          ·
          15 hours ago

          Look at the public numbers, it seems true. Copilot on your taskbar is just windows being garbage, not the AI being bad. Just look at self-hosted AI and measure the power costs of your queries. It’s tiny.

      • Shteou@lemmy.world
        link
        fedilink
        English
        arrow-up
        3
        ·
        18 hours ago

        It is sorta. Training is orders of magnitudes more intensive than inference, but we infer billions of times within a model generation.

  • Fizz@lemmy.nz
    link
    fedilink
    English
    arrow-up
    8
    arrow-down
    1
    ·
    19 hours ago

    The author is right and wrong. Its subsidised but not by anthropic. The power users who use their plans to the limit are subsidised by the rest of the users. Im an AI hater but I do think anthropic will be profitable next year. Their revenue growth is insane and looks to just be getting started. Claude code took enterprise by storm and now cowork is out.

  • Fubarberry@sopuli.xyz
    link
    fedilink
    English
    arrow-up
    38
    ·
    1 day ago

    Honestly Google is likely to beat openAI and Anthropic as things are.

    OpenAI and Anthropic have to buy/rent their hardware from Nvidia, while Google is making their own TPU hardware. Google’s hardware costs on AI is way lower, every dollar they spend on it goes a lot farther.

    And unlike the other two, they’re already a profitable company. They’re making record profits right now. They don’t have a desperate need to figure out how to make back billions on their AI models, they can just keep offering Gemini at a comparatively cheap price and wait for anthropic and open AI to bankrupt themselves.

      • Fubarberry@sopuli.xyz
        link
        fedilink
        English
        arrow-up
        2
        ·
        edit-2
        11 hours ago

        That’s definitely costing them more than running it on their own hardware, but it doesn’t mean AI is costing them more than the AI startups. Anthropic for example is already paying SpaceX 1.25 Billion a month for compute, and has agreed to pay Google 200Billion oflcer the next 5 years for access to Google’s compute and TPU chips.

        Google’s deal with xAI specifically lets them terminate the deal with 90 days notice after the end of the year. Google is also investing heavily in building new data centers with their hardware. I’m assuming this deal means they’ve eclipsed their current TPU capacity, and are just looking for a short term bandaid until they can catch up with their new constructions.

    • zbyte64@awful.systems
      link
      fedilink
      English
      arrow-up
      2
      ·
      15 hours ago

      I guess google’s announcement of renting xai compute could have been simply for show to boost SpaceX ipo.

      • Fubarberry@sopuli.xyz
        link
        fedilink
        English
        arrow-up
        1
        ·
        14 hours ago

        They have big plans to build more data centers for themselves, so they definitely want more compute than the have access to right now. But even if they’re paying more to rent xai compute, they’re still paying less overall for hardware/access than their direct AI competition.

    • jj4211@lemmy.world
      link
      fedilink
      English
      arrow-up
      4
      ·
      20 hours ago

      Plus they have a hook with the common folk, the phone steers you toward Gemini (Android phones, obviously, and Apple currently partners with Google for Gemini for iPhone…).

      For Claude and OpenAI, you have to explicitly want to go out of your way to use them, or use them indirectly through another service that has a hook.

      Claude seems to have some software developers explicitly preferring them, though a alot of the corporate money is on Microsoft and Microsoft leveraged Visual Studio and Github to become the business-friendly frontend, and sure, you can use Anthropic models too… Though Microsoft ultimately has control of what is reasonably available and how much each one costs. Anthropic has a shot but I could see Microsoft pivot to really mess with Anthropic. The one gap in Microsoft strategy is the “native AI” workflow where Claude Code has won hearts and minds, but it uses massively more tokens for frankly marginal or sometimes negative value compared to a more curated use in-editor.

      OpenAI I see as the most exposed. Lot’s of data showing they are suffering from people being over the fad of going out of their way to use ChatGPT, especially since their phones have started embracing ‘default’ Chatbot. Software developers that are inclined to use LLM are also inclined to be pretty dismissive of anything other than either Anthropic or open weight models, depending on their inclination. Also Altman seemed the most agressive in committing to spending money they didn’t have, though all of them exhibit this to some extent.

      I predict Microsoft ultimately pivots to in-house models and convinces the businesses to go that way. Apple may continue with Gemini or roll their own eventually. Anthropic currently has the stronger position between OpenAI and them, but I think you are right that both have risk of just being left behind.

    • SunshineJogger@feddit.org
      link
      fedilink
      English
      arrow-up
      10
      arrow-down
      2
      ·
      1 day ago

      I really really really don’t want evil corporation Google to dominate even more.

      I prefer plailny greedy corporations over evil ones

          • jj4211@lemmy.world
            link
            fedilink
            English
            arrow-up
            8
            ·
            19 hours ago

            Google is only worse by virtue of their reach. OpenAI and Anthropic don’t have the reach yet, but they absolutely will get there given the chance.

            Before Google had the reach it has now, it was widely regarded as a comparitive ‘good guy’ and people believed in the “don’t be evil”. Lo and behold once they got going, “don’t be evil” went away.

      • BarneyPiccolo@lemmy.today
        link
        fedilink
        English
        arrow-up
        3
        ·
        20 hours ago

        They’re all evil, so we just have to exploit the ones that offer us some value. If Google is cheaper, and has the ability to damage the others, then Google it is.

      • badgermurphy@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        19 hours ago

        Google is shaping up to fare better than the others, but I dont think that means success. They, too, are spending more than its making, just at a less drunken rate than some competitors.

  • some_guy@lemmy.sdf.org
    link
    fedilink
    English
    arrow-up
    35
    ·
    1 day ago

    It’s gonna come crashing down pretty soon. It’s gonna hurt all of us. It won’t hurt the people responsible nearly enough.

      • jj4211@lemmy.world
        link
        fedilink
        English
        arrow-up
        6
        ·
        19 hours ago

        The thing is this really depends on the speed of some financial events, not some technical failing.

        Notably, if OpenAI has to cancel any of their commitments to buy hardware because they find they have neither the money nor can secure even more debt to cover, that event would potentially cause the bubble to pop, even for hypothetical companies that may have been more responsible and might have a viable business approach. Those commitments are coming up, and a lot of analysis struggles to see how they will fund those commitments.

        The thing with this bubble is that the investors don’t get the nuance and will flee at signs of trouble in any of OpenAI, Anthropic, or a handful of others, and Altman’s leadership has made trouble at OpenAI very likely, but the investors don’t believe it and won’t believe it’s unique to OpenAI, even if it would be.

      • Corkyskog@sh.itjust.works
        link
        fedilink
        English
        arrow-up
        3
        ·
        19 hours ago

        What people? All the credible people I read say that things fall apart Q2/Q3 2027 as debt and profit obligations are due.

        The only thing that changed is now there is an energy crisis coming, so it’s possible that might force the bubble to pop sooner if all the systemic risk aligns.