• GreatAlbatross@feddit.uk
        link
        fedilink
        English
        arrow-up
        67
        ·
        2 months ago

        Or from the sounds of it, doing things more efficiently.
        Fewer cycles required, less hardware required.

        Maybe this was an inevitability, if you cut off access to the fast hardware, you create a natural advantage for more efficient systems.

        • sugar_in_your_tea@sh.itjust.works
          link
          fedilink
          English
          arrow-up
          39
          arrow-down
          1
          ·
          2 months ago

          That’s generally how tech goes though. You throw hardware at the problem until it works, and then you optimize it to run on laptops and eventually phones. Usually hardware improvements and software optimizations meet somewhere in the middle.

          Look at photo and video editing, you used to need a workstation for that, and now you can get most of it on your phone. Surely AI is destined to follow the same path, with local models getting more and more robust until eventually the beefy cloud services are no longer required.

          • jmcs@discuss.tchncs.de
            link
            fedilink
            English
            arrow-up
            44
            arrow-down
            1
            ·
            2 months ago

            The problem for American tech companies is that they didn’t even try to move to stage 2.

            OpenAI is hemorrhaging money even on their most expensive subscription and their entire business plan was to hemorrhage money even faster to the point they would use entire power stations to power their data centers. Their plan makes about as much sense as digging your self out of a hole by trying to dig to the other side of the globe.

            • sugar_in_your_tea@sh.itjust.works
              link
              fedilink
              English
              arrow-up
              17
              ·
              2 months ago

              Hey, my friends and I would’ve made it to China if recess was a bit longer.

              Seriously though, the goal for something like OpenAI shouldn’t be to sell products to end customers, but to license models to companies that sell “solutions.” I see these direct to consumer devices similarly to how GPU manufacturers see reference cards or how Valve sees the Steam Deck: they’re a proof of concept for others to follow.

              OpenAI should be looking to be more like ARM and less like Apple. If they do that, they might just grow into their valuation.

      • theunknownmuncher@lemmy.world
        link
        fedilink
        English
        arrow-up
        36
        arrow-down
        1
        ·
        edit-2
        2 months ago

        China really has nothing to do with it, it could have been anyone. It’s a reaction to realizing that GPT4-equivalent AI models are dramatically cheaper to train than previously thought.

        It being China is a noteable detail because it really drives the nail in the coffin for NVIDIA, since China has been fenced off from having access to NVIDIA’s most expensive AI GPUs that were thought to be required to pull this off.

        It also makes the USA gov look extremely foolish to have made major foreign policy and relationship sacrifices in order to try to delay China by a few years, when it’s January and China has already caught up, those sacrifices did not pay off, in fact they backfired and have benefited China and will allow them to accelerate while hurting USA tech/AI companies

      • golli@lemm.ee
        link
        fedilink
        English
        arrow-up
        26
        ·
        2 months ago

        It’s a reaction to thinking China has better AI

        I don’t think this is the primary reason behind Nvidia’s drop. Because as long as they got a massive technological lead it doesn’t matter as much to them who has the best model, as long as these companies use their GPUs to train them.

        The real change is that the compute resources (which is Nvidia’s product) needed to create a great model suddenly fell of a cliff. Whereas until now the name of the game was that more is better and scale is everything.

        China vs the West (or upstart vs big players) matters to those who are investing in creating those models. So for example Meta, who presumably spends a ton of money on high paying engineers and data centers, and somehow got upstaged by someone else with a fraction of their resources.

          • golli@lemm.ee
            link
            fedilink
            English
            arrow-up
            2
            ·
            1 month ago

            Looking at the market cap of Nvidia vs their competitors the market belives it is, considering they just lost more than AMD/Intel and the likes are worth combined and still are valued at $2.9 billion.

            And with technology i mean both the performance of their hardware and the software stack they’ve created, which is a big part of their dominance.

            • mapumbaa@lemmy.zip
              link
              fedilink
              English
              arrow-up
              2
              ·
              1 month ago

              Yeah. I don’t believe market value is a great indicator in this case. In general, I would say that capital markets are rational at a macro level, but not micro. This is all speculation/gambling.

              My guess is that AMD and Intel are at most 1 year behind Nvidia when it comes to tech stack. “China”, maybe 2 years, probably less.

              However, if you can make chips with 80% performance at 10% price, its a win. People can continue to tell themselves that big tech always will buy the latest and greatest whatever the cost. It does not make it true. I mean, it hasn’t been true for a really long time. Google, Meta and Amazon already make their own chips. That’s probably true for DeepSeek as well.

              • golli@lemm.ee
                link
                fedilink
                English
                arrow-up
                2
                ·
                1 month ago

                Yeah. I don’t believe market value is a great indicator in this case. In general, I would say that capital markets are rational at a macro level, but not micro. This is all speculation/gambling.

                I have to concede that point to some degree, since i guess i hold similar views with Tesla’s value vs the rest of the automotive Industry. But i still think that the basic hirarchy holds true with nvidia being significantly ahead of the pack.

                My guess is that AMD and Intel are at most 1 year behind Nvidia when it comes to tech stack. “China”, maybe 2 years, probably less.

                Imo you are too optimistic with those estimations, particularly with Intel and China, although i am not an expert in the field.

                As i see it AMD seems to have a quite decent product with their instinct cards in the server market on the hardware side, but they wish they’d have something even close to CUDA and its mindshare. Which would take years to replicate. Intel wish they were only a year behind Nvidia. And i’d like to comment on China, but tbh i have little to no knowledge of their state in GPU development. If they are “2 years, probably less” behind as you say, then they should have something like the rtx 4090, which was released end of 2022. But do they have something that even rivals the 2000 or 3000 series cards?

                However, if you can make chips with 80% performance at 10% price, its a win. People can continue to tell themselves that big tech always will buy the latest and greatest whatever the cost. It does not make it true.

                But the issue is they all make their chips at the same manufacturer, TSMC, even Intel in the case of their GPUs. So they can’t really differentiate much on manufacturing costs and are also competing on the same limited supply. So no one can offer 80% of performance at 10% price, or even close to it. Additionally everything around the GPU (datacenters, rack space, power useage during operation etc.) also costs, so it is only part of the overall package cost and you also want to optimize for your limited space. As i understand it datacenter building and power delivery for them is actually another limiting factor right now for the hyperscalers.

                Google, Meta and Amazon already make their own chips. That’s probably true for DeepSeek as well.

                Google yes with their TPUs, but the others all use Nvidia or AMD chips to train. Amazon has their Graviton CPUs, which are quite competitive, but i don’t think they have anything on the GPU side. DeepSeek is way to small and new for custom chips, they evolved out of a hedge fund and just use nvidia GPUs as more or less everyone else.

                • mapumbaa@lemmy.zip
                  link
                  fedilink
                  English
                  arrow-up
                  1
                  ·
                  1 month ago

                  Thanks for high effort reply.

                  The Chinese companies probably use SIMC over TSMC from now on. They were able to do low volume 7 nm last year. Also, Nvidia and “China” are not on the same spot on the tech s-curve. It will be much cheaper for China (and Intel/AMD) to catch up, than it will be for Nvidia to maintain the lead. Technological leaps and reverse engineering vs dimishing returns.

                  Also, expect that the Chinese government throws insane amounts of capital at this sector right now. So unless Stargate becomes a thing (though I believe the Chinese invest much much more), there will not be fair competition (as if that has ever been a thing anywhere anytime). China also have many more tools, like optional command economy. The US has nothing but printing money and manipulating oligarchs on a broken market.

                  I’m not sure about 80/10 exactly of course, but it is in that order of magnitude, if you’re willing to not run newest fancy stuff. I believe the MI300X goes for approx 1/2 of the H100 nowadays and is MUCH better on paper. We don’t know the real performance because of NDA (I believe). It used to be 1/4. If you look at VRAM per $, the ratio is about 1/10 for the 1/4 case. Of course, the price gap will shrink at the same rate as ROCm matures and customers feel its safe to use AMD hardware for training.

                  So, my bet is max 2 years for “China”. At least when it comes to high-end performance per dollar. Max 1 year for AMD and Intel (if Intel survive).

      • nieceandtows@lemmy.world
        link
        fedilink
        English
        arrow-up
        12
        ·
        2 months ago

        From what I understand, it’s more that it takes a lot less money to train your own llms with the same powers with this one than to pay license to one of the expensive ones. Somebody correct me if I’m wrong

        • CheeseNoodle@lemmy.world
          link
          fedilink
          English
          arrow-up
          3
          ·
          1 month ago

          I wouldn’t be surprised if China spent more on AI development than the west did, sure here we spent tens of billions while China only invested a few million but that few million was actually spent on the development while out of the tens of billions all but 5$ was spent on bonuses and yachts.

      • bobalot@lemmy.world
        link
        fedilink
        English
        arrow-up
        6
        arrow-down
        1
        ·
        1 month ago

        Does it still need people spending huge amounts of time to train models?

        After doing neural networks, fuzzy logic, etc. in university, I really question the whole usability of what is called “AI” outside niche use cases.

      • tburkhol@lemmy.world
        link
        fedilink
        English
        arrow-up
        16
        arrow-down
        16
        ·
        2 months ago

        Exactly. Galaxy brains on Wall Street realizing that nvidia’s monopoly pricing power is coming to an end. This was inevitable - China has 4x as many workers as the US, trained in the best labs and best universities in the world, interns at the best companies, then, because of racism, sent back to China. Blocking sales of nvidia chips to China drives them to develop their own hardware, rather than getting them hooked on Western hardware. China’s AI may not be as efficient or as good as the West right now, but it will be cheaper, and it will get better.

    • givesomefucks@lemmy.world
      link
      fedilink
      English
      arrow-up
      44
      arrow-down
      5
      ·
      2 months ago

      It’s coming, Pelosi sold her shares like a month ago.

      It’s going to crash, if not for the reasons she sold for, as more and more people hear she sold, they’re going to sell because they’ll assume she has insider knowledge due to her office.

      Which is why politicians (and spouses) shouldn’t be able to directly invest into individual companies.

      Even if they aren’t doing anything wrong, people will follow them and do what they do. Only a truly ignorant person would believe it doesn’t have an effect on other people.

      • ShinkanTrain@lemmy.ml
        link
        fedilink
        English
        arrow-up
        48
        ·
        edit-2
        2 months ago

        It’s coming, Pelosi sold her shares like a month ago.

        Yeah but only cause she was really disappointed with the 5000 series lineup. Can you blame her for wanting real rasterization improvements?

        • givesomefucks@lemmy.world
          link
          fedilink
          English
          arrow-up
          13
          ·
          2 months ago

          Everyone’s disappointed with the 5000 series…

          They’re giving up on improving rasterazation and focusing on “ai cores” because they’re using gpus to pay for the research into AI.

          “Real” core count is going down on the 5000 series.

          It’s not what gamers want, but they’re counting on people just buying the newest before asking if newer is really better. It’s why they’re already cutting 4000 series production, they just won’t give people the option.

          I think everything under 4070 super is already discontinued

        • Trainguyrom@reddthat.com
          link
          fedilink
          English
          arrow-up
          1
          ·
          1 month ago

          You joke but there’s a lot of grandma/grandpa gamers these days. Remember someone who played PC games back in the 80s would be on their 50s or 60s now. Or even older if they picked up the hobby as an adult in the 80s

    • SuiXi3D@fedia.io
      link
      fedilink
      arrow-up
      17
      arrow-down
      1
      ·
      2 months ago

      I just hope it means I can get a high end GPU for less than a grand one day.

      • NuXCOM_90Percent@lemmy.zip
        link
        fedilink
        English
        arrow-up
        8
        arrow-down
        1
        ·
        2 months ago

        Prices rarely, if ever, go down and there is a push across the board to offload things “to the cloud” for a range of reasons.

        That said: If your focus is on gaming, AMD is REAL good these days and, if you can get past their completely nonsensical naming scheme, you can often get a really good GPU using “last year’s” technology for 500-800 USD (discounted to 400-600 or so).

      • manicdave@feddit.uk
        link
        fedilink
        English
        arrow-up
        6
        arrow-down
        1
        ·
        edit-2
        1 month ago

        I’m using an Rx6700xt which you can get for about £300 and it works fine.

        Edit: try using ollama on your PC. If your CPU is capable, that software should work out the rest.

    • FooBarrington@lemmy.world
      link
      fedilink
      English
      arrow-up
      8
      ·
      2 months ago

      If anything, this will accelerate the AI hype, as big leaps forward have been made without increased resource usage.

      • Alphane Moon@lemmy.world
        link
        fedilink
        English
        arrow-up
        21
        ·
        edit-2
        2 months ago

        Something is got to give. You can’t spend ~$200 billion annually on capex and get a mere $2-3 billion return on this investment.

        I understand that they are searching for a radical breakthrough “that will change everything”, but there is also reasons to be skeptical about this (e.g. documents revealing that Microsoft and OpenAI defined AGI as something that can get them $100 billion in annual revenue as opposed to some specific capabilities).

  • Blackmist@feddit.uk
    link
    fedilink
    English
    arrow-up
    135
    arrow-down
    2
    ·
    1 month ago

    Good. That shit is way overvalued.

    There is no way that Nvidia are worth 3 times as much as TSMC, the company that makes all their shit and more besides.

    I’m sure some of my market tracker funds will lose value, and they should, because they should never have been worth this much to start with.

    • CleoTheWizard@lemmy.world
      link
      fedilink
      English
      arrow-up
      50
      arrow-down
      1
      ·
      1 month ago

      It’s because Nvidia is an American company and also because they make final stage products. American companies right now are all overinflated and almost none of the stocks are worth what they’re at because of foreign trading influence.

      As much as people whine about inflation here, the US didn’t get hit as bad as many other countries and we recovered quickly which means that there is a lot of incentive for other countries to invest here. They pick our top movers, they invest in those. What you’re seeing is people bandwagoning onto certain stocks because the consistent gains create more consistent gains for them.

      The other part is that yes, companies who make products at the end stage tend to be worth a lot more than people trading more fundamental resources or parts. This is true of almost every industry except oil.

      • bobalot@lemmy.world
        link
        fedilink
        English
        arrow-up
        12
        ·
        1 month ago

        It is also because the USA is the reserve currency of the world with open capital markets.

        Savers of the world (including countries like Germany and China who have excess savings due to constrained consumer demand) dump their savings into US assets such as stocks.

        This leads to asset bubbles and an uncompetitively high US dollar.

        • Freefall@lemmy.world
          link
          fedilink
          English
          arrow-up
          8
          ·
          1 month ago

          The current administration is working real hard on removing trust and value of anything American.

          • bobalot@lemmy.world
            link
            fedilink
            English
            arrow-up
            2
            ·
            edit-2
            1 month ago

            The root problem they are trying to fix is real (systemic trade imbalances) but the way they are trying to fix it is terrible and won’t work.

            1. Only a universally applied tariff would work in theory but would require other countries not to retaliate (there will 100% be retaliation).

            2. It doesn’t really solve the root cause, capital inflows into the USA rather than purchasing US goods and services.

            3. Trump wants to maintain being the reserve currency which is a big part of the problem (the strength of currency may not align with domestic conditions, i.e. high when it needs to be low).

      • CheeseNoodle@lemmy.world
        link
        fedilink
        English
        arrow-up
        5
        ·
        1 month ago

        The US is also a regulations haven compared to other developed economies, corporations get away with shit in most places but America is on a whole other level of regulatory capture.

  • barsoap@lemm.ee
    link
    fedilink
    English
    arrow-up
    124
    ·
    edit-2
    2 months ago

    Shovel vendors scrambling for solid ground as prospectors start to understand geology.

    …that is, this isn’t yet the end of the AI bubble. It’s just the end of overvaluing hardware because efficiency increased on the software side, there’s still a whole software-side bubble to contend with.

    • theunknownmuncher@lemmy.world
      link
      fedilink
      English
      arrow-up
      22
      ·
      2 months ago

      there’s still a whole software-side bubble to contend with

      They’re ultimately linked together in some ways (not all). OpenAI has already been losing money on every GPT subscription that they charge a premium for because they had the best product, now that premium must evaporate because there are equivalent AI products on the market that are much cheaper. This will shake things up on the software side too. They probably need more hype to stay afloat

    • jj4211@lemmy.world
      link
      fedilink
      English
      arrow-up
      11
      ·
      1 month ago

      The software side bubble should take a hit here because:

      • Trained model made available for download and offline execution, versus locking it behind a subscription friendly cloud only access. Not the first, but it is more famous.

      • It came from an unexpected organization, which throws a wrench in the assumption that one of the few known entities would “win it”.

    • UnderpantsWeevil@lemmy.world
      link
      fedilink
      English
      arrow-up
      10
      ·
      1 month ago

      …that is, this isn’t yet the end of the AI bubble.

      The “bubble” in AI is predicated on proprietary software that’s been oversold and underdelivered.

      If I can outrun OpenAI’s super secret algorithm with 1/100th the physical resources, the $13B Microsoft handed Sam Altman’s company starts looking like burned capital.

      And the way this blows up the reputation of AI hype-artists makes it harder for investors to be induced to send US firms money. Why not contract with Hangzhou DeepSeek Artificial Intelligence directly, rather than ask OpenAI to adopt a model that’s better than anything they’ve produced to date?

    • meliante@lemmy.world
      link
      fedilink
      English
      arrow-up
      2
      ·
      1 month ago

      I really think GenAI is comparable to the internet in terms of what it will allow mankind in a couple of decades.

      Lots of people thought the internet was a fad and saw no future for it …

      • barsoap@lemm.ee
        link
        fedilink
        English
        arrow-up
        6
        arrow-down
        1
        ·
        edit-2
        1 month ago

        Lots of techies loved the internet, built it, and were all early adopters. Lots of normies didn’t see the point.

        With AI it’s pretty much the other way around: CEOs saying “we don’t need programmers, any more”, while people who understand the tech roll their eyes.

        • oldfart@lemm.ee
          link
          fedilink
          English
          arrow-up
          2
          ·
          1 month ago

          Back then the CEOs were babbling about information superhighways while tech rolled their eyes

        • meliante@lemmy.world
          link
          fedilink
          English
          arrow-up
          1
          arrow-down
          3
          ·
          1 month ago

          I believe programming languages will become obsolete. You’ll still need professionals that will be experts in leading the machines but not nearly as hands on as presently. The same for a lot of professions that exist currently.

          I like to compare GenAI to the assembly line when it was created, but instead of repetitive menial tasks, it’s repetitive mental tasks that it improves/performs.

          • barsoap@lemm.ee
            link
            fedilink
            English
            arrow-up
            7
            arrow-down
            1
            ·
            edit-2
            1 month ago

            Oh great you’re one of them. Look I can’t magically infuse tech literacy into you, you’ll have to learn to program and, crucially, understand how much programming is not about giving computers instructions.

            • meliante@lemmy.world
              link
              fedilink
              English
              arrow-up
              1
              arrow-down
              2
              ·
              1 month ago

              Let’s talk in five years. There’s no point in discussing this right now. You’re set on what you believe you know and I’m set on what I believe I know.

              And, piece of advice, don’t assume others lack tech literacy because they don’t agree with you, it just makes you look like a brat that can’t discuss things maturely and invites the other part to be a prick as well.

              Especially because programming is quite fucking literally giving computers instructions, despite what you believe keyboard monkeys do. You wanker!

              What? You think “developers” are some kind on mythical beings that possess the mystical ability of speaking to the machines in cryptic tongues?

              They’re a dime a dozen, the large majority of “developers” are just cannon fodder that are not worth what they think they are.

              Ironically, the real good ones probably brought about their demise.

              • barsoap@lemm.ee
                link
                fedilink
                English
                arrow-up
                1
                arrow-down
                1
                ·
                edit-2
                1 month ago

                Especially because programming is quite fucking literally giving computers instructions, despite what you believe keyboard monkeys do. You wanker!

                What? You think “developers” are some kind on mythical beings that possess the mystical ability of speaking to the machines in cryptic tongues?

                First off, you’re contradicting yourself: Is programming about “giving instructions in cryptic languages”, or not?

                Then, no: Developers are mythical beings who possess the magical ability of turning vague gesturing full of internal contradictions, wishful thinking, up to right-out psychotic nonsense dreamt up by some random coke-head in a suit, into hard specifications suitable to then go into algorithm selection and finally into code. Typing shit in a cryptic language is the easy part, also, it’s not cryptic, it’s precise.

          • Strider@lemmy.world
            link
            fedilink
            English
            arrow-up
            1
            ·
            1 month ago

            That’s not the way it works. And I’m not even against that.

            It sill won’t work this way a few years later.

            • meliante@lemmy.world
              link
              fedilink
              English
              arrow-up
              1
              ·
              edit-2
              1 month ago

              I’m not talking about this being a snap transition. It will take several years but I do think this tech will evolve in that direction.

              I’ve been working with LLMs since month 1 and in these short 24 months things have progressed in a way that is mind boggling.

              I’ve produced more and better than ever and we’re developing a product that improves and makes some repetitive “sweat shop” tasks regarding documentation a thing of the past for people. It really is cool.

              • Strider@lemmy.world
                link
                fedilink
                English
                arrow-up
                1
                ·
                1 month ago

                In part we agree. However there are two things to consider.

                For one, the llms are plateauing pretty much now. So they are dependant on more quality input. Which, basically, they replace. So perspecively imo the learning will not work to keep this up. (in other fields like nature etc there’s comparatively endless input for training, so it will keep on working there).

                The other thing is, as we likely both agree, this is not intelligence. It has it’s uses. But you said to replace programming, which in my opinion will never work: were missing the critical intelligence element. It might be there at some point. Maybe llm will help there, maybe not, we might see. But for now we don’t have that piece of the puzzle and it will not be able to replace human work with (new) thought put into it.

      • Auli@lemmy.ca
        link
        fedilink
        English
        arrow-up
        2
        ·
        1 month ago

        Sure but you had the .com bubble but it was still useful. Same as AI in a big bubble right now doesn’t mean it won’t be useful.

        • meliante@lemmy.world
          link
          fedilink
          English
          arrow-up
          1
          ·
          1 month ago

          Oh yes, there definitely is a bubble, but I don’t believe that means the tech is worthless, not even close to worthless.

      • Trainguyrom@reddthat.com
        link
        fedilink
        English
        arrow-up
        1
        ·
        1 month ago

        I don’t know. In a lot of usecase AI is kinda crap, but there’s certain usecase where it’s really good. Honestly I don’t think people are giving enough thought to it’s utility in early-middle stages of creative works where an img2img model can take the basic composition from the artist, render it then the artist can go in and modify and perfect it for the final product. Also video games that use generative AI are going to be insane in about 10-15 years. Imagine an open world game where it generates building interiors and NPCs as you interact with them, even tying the stuff the NPCs say into the buildings they’re in, like an old sailer living in a house with lots of pictures of boats and boat models, or the warrior having tons of books about battle and decorative weapons everywhere all in throw away structures that would have previously been closed set dressing. Maybe they’ll even find sane ways to create quests on the fly that don’t feel overly cookie-cutter? Life changing? Of course not, but definitely a cool technology with a lot of potential

        Also realistically I don’t think there’s going to be long term use for AI models that need a quarter of a datacenter just to run, and they’ll all get tuned down to what can run directly on a phone efficiently. Maybe we’ll see some new accelerators become common place maybe we won’t.

  • drascus@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    66
    arrow-down
    3
    ·
    1 month ago

    Okay seriously this technology still baffles me. Like its cool but why invest so much in an unknown like AIs future ? We could invest in people and education and end up with really smart people. For the cost of an education we could end up with smart people who contribute to the economy and society. Instead we are dumping billions into this shit.

    • vga@sopuli.xyz
      link
      fedilink
      English
      arrow-up
      40
      ·
      1 month ago

      For the cost of an education we could end up with smart people who contribute to the economy and society. Instead we are dumping billions into this shit.

      Those are different "we"s.

    • sudo42@lemmy.world
      link
      fedilink
      English
      arrow-up
      31
      arrow-down
      1
      ·
      edit-2
      1 month ago

      Tech/Wall St constantly needs something to hype in order to bring in “investor” money. The “new technology-> product development -> product -> IPO” pipeline is now “straight to pump-and-dump” (for example, see Crypto currency).

      The excitement of the previous hype train (self-driving cars) is no longer bringing in starry-eyed “investors” willing to quickly part ways with OPM. “AI” made a big splash and Tech/Wall St is going to milk it for all they can lest they fall into the same bad economy as that one company that didn’t jam the letters “AI” into their investor summary.

      Tech has laid off a lot of employees, which means they are aware there is nothing else exciting in the near horizon. They also know they have to flog “AI” like crazy before people figure out there’s no “there” there.

      That “investors” scattered like frightened birds at the mere mention of a cheaper version means that they also know this is a bubble. Everyone wants the quick money. More importantly they don’t want to be the suckers left holding the bag.

        • sudo42@lemmy.world
          link
          fedilink
          English
          arrow-up
          2
          ·
          1 month ago

          I follow EV battery tech a little. You’re not wrong that there is a lot of “oh its just around the bend” in battery development and tech development in general. I blame marketing for 80% of that.

          But battery technology is changing drastically. The giant cell phone market is pushing battery tech relentlessly. Add in EV and grid storage demand growth and the potential for some companies to land on top of a money printing machine is definitely there.

          We’re in a golden age of battery research. Exciting for our future, but it will be a while before we consumers will have clear best options.

    • AppleTea@lemmy.zip
      link
      fedilink
      English
      arrow-up
      24
      ·
      1 month ago

      It’s easier to sell people on the idea of a new technology or system that doesn’t have any historical precedent. All you have to do is list the potential upsides.

      Something like a school or a workplace training programme, those are known quantities. There’s a whole bunch of historical and currently-existing projects anyone can look at to gauge the cost. Your pitch has to be somewhat realistic compared to those, or it’s gonna sound really suspect.

    • alvvayson@lemmy.dbzer0.com
      link
      fedilink
      English
      arrow-up
      19
      arrow-down
      1
      ·
      1 month ago

      Because the silicon valley bros had convinced the national security wonks in the Beltway that it was paramount for national security, technological leadership and economic prosperity.

      I think this will go down as the biggest grift in history.

      Kevin Walmsley reported on Deepseek 10 days ago. Last week, the smart money exited big tech. This week the panic starts.

      I’m getting big dot-com 2.0 vibes from all of this.

      https://youtube.com/@inside_china_business

    • _chris@lemmy.world
      link
      fedilink
      English
      arrow-up
      11
      arrow-down
      1
      ·
      1 month ago

      Education doesn’t make a tech CEO ridiculously wealthy, so there’s no draw for said CEOs to promote the shit out of education.

      Plus educated people tend to ask for more salary. Can’t do that and become a billionaire!

    • surph_ninja@lemmy.world
      link
      fedilink
      English
      arrow-up
      14
      arrow-down
      8
      ·
      1 month ago

      And you could pay people to use an abacus instead of a calculator. But the advanced tech improves productivity for everyone, and helps their output.

      If you don’t get the tech, you should play with it more.

      • fuck_u_spez_in_particular@lemmy.world
        link
        fedilink
        English
        arrow-up
        17
        arrow-down
        2
        ·
        1 month ago

        I get the tech, and still agree with the preposter. I’d even go so far as that it probably worsens a lot currently, as it’s generating a lot of bullshit that sounds great on the surface, but in reality is just regurgitated stuff that the AI has no clue of. For example I’m tired of reading AI generated text, when a hand written version would be much more precise and has some character at least…

              • fuck_u_spez_in_particular@lemmy.world
                link
                fedilink
                English
                arrow-up
                2
                ·
                1 month ago

                So unreliable boilerplate generator, you need to debug?

                Right I’ve seen that it’s somewhat nice to quickly generate bash scripts etc.

                It can certainly generate quick’n dirty scripts as a starter. But code quality is often supbar (and often incorrect), which triggers my perfectionism to make it better, at which point I should’ve written it myself…

                But I agree that it can often serve well for exploration, and sometimes you learn new stuff (if you weren’t expert in it at least, and you should always validate whether it’s correct).

                But actual programming in e.g. Rust is a catastrophe with LLMs (more common languages like js work better though).

                • SynopsisTantilize@lemm.ee
                  link
                  fedilink
                  English
                  arrow-up
                  2
                  ·
                  1 month ago

                  I use C# and PS/CMD for my job. I think you’re right. It can create a decent template for setting things up. But it trips on its own dick with anything more intricate than simple 2 step commands.

        • surph_ninja@lemmy.world
          link
          fedilink
          English
          arrow-up
          1
          arrow-down
          10
          ·
          1 month ago

          It’s one thing to be ignorant. It’s quite another to be confidently so in the face of overwhelming evidence that you’re wrong. Impressive.

          • fuck_u_spez_in_particular@lemmy.world
            link
            fedilink
            English
            arrow-up
            4
            arrow-down
            1
            ·
            1 month ago

            confidently so in the face of overwhelming evidence

            That I’d really like to see. And I mean more than the marketing bullshit that AI companies are doing…

            For the record I was one of the first jumping on the AI hype-train (as programmer, and computer-scientist with machine-learning background), following the development of GPT1-4, being excited about having to do less boilerplaty code etc. getting help about rough ideas etc. GPT4 was almost so far as being a help (similar with o1 etc. or Anthropics models). Though I seldom use AI currently (and I’m observing similar with other colleagues and people I know of) because it actually slows me down with my stuff or gives wrong ideas, having to argue, just to see it yet again saturating at a local-minimum (aka it doesn’t get better, no matter what input I try). Just so that I have to do it myself… (which I should’ve done in the first place…).

            Same is true for the image-generative side (i.e. first with GANs now with diffusion-based models).

            I can get into more details about transformer/attention-based-models and its current plateau phase (i.e. more hardware doesn’t actually make things significantly better, it gets exponentially more expensive to make things slightly better) if you really want…

            I hope that we do a breakthrough of course, that a model actually really learns reasoning, but I fear that that will take time, and it might even mean that we need different type of hardware.

            • surph_ninja@lemmy.world
              link
              fedilink
              English
              arrow-up
              1
              arrow-down
              1
              ·
              1 month ago

              Any other AI company, and most of that would be legitimate criticism of the overhype used to generate more funding. But how does any of that apply to DeepSeek, and the code & paper they released?

              • fuck_u_spez_in_particular@lemmy.world
                link
                fedilink
                English
                arrow-up
                1
                ·
                1 month ago

                DeepSeek

                Yeah it’ll be exciting to see where this goes, i.e. if it really develops into a useful tool, for certain. Though I’m slightly cautious non-the less. It’s not doing something significantly different (i.e. it’s still an LLM), it’s just a lot cheaper/efficient to train, and open for everyone (which is great).

                • surph_ninja@lemmy.world
                  link
                  fedilink
                  English
                  arrow-up
                  1
                  ·
                  1 month ago

                  What’s this “if” nonsense? I loaded up a light model of it, and already have put it to work.

      • TheFriar@lemm.ee
        link
        fedilink
        English
        arrow-up
        14
        arrow-down
        2
        ·
        1 month ago

        “Improves productivity for everyone”

        Famously only one class benefits from productivity, while one generates the productivity. Can you explain what you mean, if you don’t mean capitalistic productivity?

        • surph_ninja@lemmy.world
          link
          fedilink
          English
          arrow-up
          4
          arrow-down
          2
          ·
          1 month ago

          I’m referring to output for amount of work put in.

          I’m a socialist. I care about increased output leading to increased comfort for the general public. That the gains are concentrated among the wealthy is not the fault of technology, but rather those who control it.

          Thank god for DeepSeek.

    • Redex@lemmy.world
      link
      fedilink
      English
      arrow-up
      5
      ·
      1 month ago

      Look at it in another way, people think this is the start of an actual AI revolution, as in full blown AGI or close to it or something very capable at least. Personally I don’t think we’re anywhere near something like that with the current technology, I think it’s a dead end, but if there’s even a small possibility of it being true, you want to invest early because the returns will be insane if it pans out. Full blown AGI would revolutionize everything, it would probably be the next industrial revolution after the internet.

      • probably2high@lemmy.world
        link
        fedilink
        English
        arrow-up
        2
        ·
        edit-2
        1 month ago

        Look at it in another way, people think this is the start of an actual AI revolution, as in full blown AGI or close to it or something very capable at least

        I think the bigger threat of revolution (and counter-revolution) is that of open source software. For people that don’t know anything about FOSS, they’ve been told for decades now that [XYZ] software is a tool you need and that’s only possible through the innovative and superhuman-like intelligent CEOs helping us with the opportunity to buy it.

        If everyone finds out that they’re actually the ones stifling progress and development, while manipulating markets to further enrich themselves and whatever other partners that align with that goal, it might disrupt the golden goose model. Not to mention defrauding the countless investors that thought they were holding rocket ship money that was actually snake oil.

        All while another country did that collectively and just said, “here, it’s free. You can even take the code and use it how you personally see fit, because if this thing really is that thing, it should be a tool anyone can access. Oh, and all you other companies, your code is garbage btw. Ours runs on a potato by comparison.”

        I’m just saying, the US has already shown they will go to extreme lengths to keep its citizens from thinking too hard about how its economic model might actually be fucking them while the rich guys just move on to the next thing they’ll sell us.

        ETA: a smaller scale example: the development of Wine, and subsequently Proton finally gave PC gamers a choice to move away from Windows if they wanted to.

    • lightnsfw@reddthat.com
      link
      fedilink
      English
      arrow-up
      4
      arrow-down
      1
      ·
      1 month ago

      How would the investors profit from paying for someone’s education? By giving them a loan? Don’t we have enough problems with the student loan system without involving these assholes more?

  • ChiefGyk3D@infosec.pub
    link
    fedilink
    English
    arrow-up
    53
    arrow-down
    1
    ·
    2 months ago

    My understanding is that DeepSeek still used Nvidia just older models and way more efficiently, which was remarkable. I hope to tinker with the opensource stuff at least with a little Twitch chat bot for my streams I was already planning to do with OpenAI. Will be even more remarkable if I can run this locally.

    However this is embarassing to the western companies working on AI and especially with the $500B announcement of Stargate as it proves we don’t need as high end of an infrastructure to achieve the same results.

    • sunzu2@thebrainbin.org
      link
      fedilink
      arrow-up
      36
      arrow-down
      3
      ·
      2 months ago

      500b of trust me Bros… To shake down US taxpayer for subsidies

      Read between the lines folks

    • Dkarma@lemmy.world
      link
      fedilink
      English
      arrow-up
      7
      arrow-down
      2
      ·
      1 month ago

      It’s really not. This is the ai equivalent of the vc repurposing usa bombs that didn’t explode when dropped.

      Their model is the differentiator here but they had to figure out something more efficient in order to overcome the hardware shortcomings.

      The us companies will soon outpace this by duping the model and running it on faster hw

      • Auli@lemmy.ca
        link
        fedilink
        English
        arrow-up
        1
        arrow-down
        1
        ·
        1 month ago

        Throw more hardware and power at it. Build more power plants so we can use AI.

    • Cocodapuf@lemmy.world
      link
      fedilink
      English
      arrow-up
      5
      arrow-down
      3
      ·
      1 month ago

      My understanding is that DeepSeek still used Nvidia just older models

      That’s the funniest part here, the sell off makes no sense. So what if some companies are better at utilizing AI than others, it all runs in the same hardware. Why sell stock in the hardware company? (Besides the separate issue of it being totally overvalued at the moment)

      This would be kind of like if a study showed that American pilots were more skilled than European pilots, so investors sold stock in airbus… Either way, the pilots still need planes to fly…

      • bobalot@lemmy.world
        link
        fedilink
        English
        arrow-up
        14
        ·
        edit-2
        1 month ago

        Perhaps the stocks were massively overvalued and any negative news was going to start this sell off regardless of its actual impact?

        That is my theory anyway.

  • gerryflap@feddit.nl
    link
    fedilink
    English
    arrow-up
    51
    arrow-down
    3
    ·
    1 month ago

    I’m so happy this happened. This is really a power move from China. The US was really riding the whole AI bubble. By “just” releasing a powerful open-source AI model they’ve fucked the not so open US AI companies. I’m not sure if this was planned from China or whether this is was really just a small company doing this because they wanted to, but either way this really damages the western economy. And its given western consumers a free alternative. A few million dollars invested (if we are to believe the cost figures) for a major disruption.

    • surph_ninja@lemmy.world
      link
      fedilink
      English
      arrow-up
      11
      arrow-down
      22
      ·
      1 month ago

      Socialism/Communism will always outcompete the capitalists. And they know it, which is why the US invades, topples, or sanctions every country that moves towards worker controlled countries.

        • surph_ninja@lemmy.world
          link
          fedilink
          English
          arrow-up
          5
          arrow-down
          9
          ·
          1 month ago

          That you had to qualify it with a date after it had been corrupted by the west, implies that you’re well aware of how well communism served for half a century before that.

          They went from a nation of dirt poor peasants, to a nuclear superpower driving the space race in just a couple of decades. All thanks to communism. And also why China is leaving us in the dust.

          • houstoneulers@lemmy.world
            link
            fedilink
            English
            arrow-up
            6
            arrow-down
            4
            ·
            edit-2
            1 month ago

            There are many instances of communism failing lmao

            There are also many current communist states that have less freedom than many capitalist states

            Also, you need to ask the Uyghurs how they’re feeling about their experience under the communist government you’re speaking so highly of at the moment.

            • surph_ninja@lemmy.world
              link
              fedilink
              English
              arrow-up
              4
              arrow-down
              4
              ·
              1 month ago

              How many of those instances failed due to external factors, such as illegal sanctions or a western coup or western military aggression?

              Which communist states would you say have less freedom than your country? Let’s compare.

              The Uyghur genocide was debunked. Even the US state department was forced to admit they didn’t have the evidence to support their claims. In reality, western intelligence agencies were trying to radicalize the Uyghurs to destabilize the region, but China has been rehabilitating them. The intel community doesn’t like their terrorist fronts to be shut down.

              • houstoneulers@lemmy.world
                link
                fedilink
                English
                arrow-up
                2
                arrow-down
                4
                ·
                1 month ago

                LMAO found the pro-Xi propagandist account

                Either you’re brainwashed, are only reading one-sided articles, or you’re an adolescent with little world experience given how confidently you speak in absolutes, which doesn’t reflect how nuanced the global stage is.

                I’m not saying capitalism is the best, but communism won’t ALWAYS beat out capitalism (as it hasn’t regardless of external factors b/c if those regimes were strong enough they would be able to handle or recover from external pressures) nor does it REQUIRE negatively affecting others as your other comment says. You’re just delulu.

                Remember, while there maybe instances where all versions of a certain class of anything are equal, in most cases they are not. So blanketly categorizing as your have done just reflects your lack of historical perspective.

                • surph_ninja@lemmy.world
                  link
                  fedilink
                  English
                  arrow-up
                  2
                  arrow-down
                  3
                  ·
                  1 month ago

                  You should really drop the overconfidence, and re-evaluate your biases and perspectives. Regurgitating western propaganda almost verbatim is not a good sign that you’re on the right path.

            • BrainInABox@lemmy.ml
              link
              fedilink
              English
              arrow-up
              1
              arrow-down
              4
              ·
              1 month ago

              you need to ask the Uyghurs how they’re feeling about their experience under the communist government

              Everytime people ask regular Uyghurs, they’re usually happy enough with it. I’m guessing you mean ask Adrian Zenz and the Victims of Communism Memorial Foundation to tell the Uyghurs what they think.

            • surph_ninja@lemmy.world
              link
              fedilink
              English
              arrow-up
              3
              arrow-down
              6
              ·
              1 month ago

              Any corrupt leaders are capable of committing genocide. The difference is capitalism requires genocide to continue functioning.

              • comfy@lemmy.ml
                link
                fedilink
                English
                arrow-up
                3
                arrow-down
                3
                ·
                1 month ago

                No it doesn’t. It requires imperialism. The genocides are simply efficient for the imperial machine creating settlements, but it’s not a requirement. They’re evidently avoidable and capitalists just repeatedly decide not to avoid it because they consider it cheaper to commit genocide rather than integrate more passively.

      • Bohurt@lemm.ee
        link
        fedilink
        English
        arrow-up
        6
        arrow-down
        3
        ·
        1 month ago

        You don’t even realise how strong capitalism is in China.

        • surph_ninja@lemmy.world
          link
          fedilink
          English
          arrow-up
          4
          arrow-down
          4
          ·
          1 month ago

          It sounds like you don’t know what “capitalism” means. Market participation exists in other economy types, too. It’s how the means of production are controlled and the profits distributed that defines capitalism vs communism.

          And you don’t lift 800 million people out of poverty under capitalism. Or they’ve done a ridiculously bad job of concentrating profits into the hands of a very small few.

          • Bohurt@lemm.ee
            link
            fedilink
            English
            arrow-up
            1
            arrow-down
            1
            ·
            1 month ago

            The issue with your original comment is that it’s simplified on many levels beyond what is acceptable. China has companies working on delivering highest financial output regardless of other citizens and their rights to have fair share in produced goods. They are by no means controlled by workers (why would they accept e. g. 996?) nor creating fair rules to others economically (e.g. Taobao and their alghorims pushing many sellers to sell bellow profitable levels just to maintain visibility on their site). Put it also into wider perspective: China started to move forward in quality of life only after Deng. US system is by no means bad but it doesn’t make Chinese one perfect.

            • surph_ninja@lemmy.world
              link
              fedilink
              English
              arrow-up
              1
              arrow-down
              1
              ·
              1 month ago

              I don’t think you understand how China’s economy works. Seems very clouded by anti-China propaganda.

              In reality, the working class exercises a great deal of control over the means of production in China, and the 996 culture you’re referring to is in fact illegal.

              https://www.bbc.com/news/world-asia-china-58381538.amp

              Again, capitalism vs communism is not defined by the existence of production/profits/markets, but how control and benefit of those systems is distributed.

        • surph_ninja@lemmy.world
          link
          fedilink
          English
          arrow-up
          5
          arrow-down
          2
          ·
          1 month ago

          Absolutely. More direct democracy. The whole point of representative democracy is issues of time and distance. Now that we can communicate fast and across the globe, average citizens should play a much larger & more active role in directing the government.

          • MutilationWave@lemmy.world
            link
            fedilink
            English
            arrow-up
            2
            arrow-down
            2
            ·
            1 month ago

            How do you solve the problem that half the country can’t even be bothered to participate once every four years?

            Don’t get me wrong, I’m with you 100%, but how would we get people to engage with such a system?

            • comfy@lemmy.ml
              link
              fedilink
              English
              arrow-up
              4
              ·
              1 month ago

              How do you solve the problem that half the country can’t even be bothered to participate once every four years?

              I assume you’re talking about the US electoral system?? That’s very different.

              but how would we get people to engage with such a system?

              By empowering them.

              Consider how the current electoral system disempowers people:

              1. Some people literally cannot vote or risk jeopardizing their job taking the day off, others face voter suppression tactics

              2. The FPTP system (esp. spoiler effect) and the present political circumstances mean that there are really only two viable options for political parties for most people, so many feel that neither option represents them, let alone their individual positions on policy

              3. Politics is widely considered to be corrupt and break electoral promises regularly. There is little faith in either party to represent voters

              But, in a system where you are able to represent yourself at will, engagement is actually rewarding and meaningful. It won’t magically make everyone care, but direct democracy alongside voter rights reform would likely make more people think it’s worth polling.

              • MutilationWave@lemmy.world
                link
                fedilink
                English
                arrow-up
                2
                ·
                1 month ago

                I hope you’re right. I would love to see it. I actually support mandatory voting like in Australia. With mostly current laws everyone could get a mail in ballot. If you don’t want to participate just check that box at the top, sign it, and send it in.

                Your system sounds much better but would require a lot more legislation.

                • comfy@lemmy.ml
                  link
                  fedilink
                  English
                  arrow-up
                  2
                  arrow-down
                  1
                  ·
                  1 month ago

                  Well, it would require more than just legislation change. Truth be told, in the US, a working democracy requires some form of revolution since the people holding all the power benefit from the broken system. But on the other hand, organizations and communities (including territories of hundreds of thousands) practicing direct democracy on a smaller scale have seen success with these strategies.

            • lonerangers1@lemmy.world
              link
              fedilink
              English
              arrow-up
              2
              ·
              1 month ago

              Imagine you had a lemmy instance that every post was a proposal for regulation in your community/region. Anyone can make a post, some will gain traction and support, some will be worthless and fall off quickly. If the proposal gains enough support it then goes to a vote post where people get to make an official vote. Could be to charge $40 for a speeding ticket instead of $50, could be a trade agreement with another region.

              I think this method would give people equity in the system. Maybe it could also be scored on a curve depending on how much it effects you as an individual. Maybe having advanced education on a topic means your say has more weight to it than someone without.

              I was thinking of ways to move towards this and so far my best idea is to build it and run it in parrelel with what we have now. Get it functioning and trusted and simply try to roll over what we have now. I figure something tragic would need to happen to create a power void for full implementation. Like yellowstone erupting or something. I was also thinking that we need to teach the kids. We need to give them tools to build on so they can take this kind of idea to fruition.

              I am just a regular idiot, so feel free to add anything constructive.

              • MutilationWave@lemmy.world
                link
                fedilink
                English
                arrow-up
                2
                ·
                1 month ago

                It’s a great idea. I think half the people just don’t give a fuck at all.

                Among people who say they care- look how rapidly disinformation is spread about anything and everything. Billionaires would be gaming the system from the get-go. I’m just pessimistic. I really do love the idea and I hope we get there some day.

                Based on how Trump 2.0 is going though we might just get that tragedy.

                • lonerangers1@lemmy.world
                  link
                  fedilink
                  English
                  arrow-up
                  3
                  ·
                  1 month ago

                  I don’t think this idea I have involves any billionaires with power. It would be pointless. With everything decentralized there would be no mega corps at all. They wouldn’t have politicians to bribe. They would have to make the majority of people happy with them to be allowed. I also consider that in a world designed for quality of life instead of profit we wouldn’t need to have 9-5 jobs to survive. Our production has been growing rapidly for a long time and all of the proceeds have been getting held by ~1000 people who have centralized profits to themselves. With decentralized communism the economy would be like one big co-op. No company owners, the community would have say in how products are sourced and distributed. How people who invest more in the system are rewarded by the system. Couple things to help understand where my head is at. I think we can decentralize and open source services like amazon, home depot, walmart,… We don’t need oligarchy to come together and use economy of scale. We could have a sales platform free for everyone that could source directly from manufacturers. No mark up, not even in the manufacturing. No profit model at all. This factors in that labor needs are going to plummet. Take media, I predict all media will be AI generated and personalized. You could have a never ending show. One that knows how to keep you entertained. You could even be a character in it where your screen is just the view, so now we are in VR, like a gta map. Now the big change, This will all happen in our heads. check this shit out https://synchron.com/ . We are about to have hivemind irl. I only want to discuss posative implications. I am super fucked up over thinking about what capitalism is going to do with direct access to our subconsciousness.

      • gerryflap@feddit.nl
        link
        fedilink
        English
        arrow-up
        1
        arrow-down
        1
        ·
        1 month ago

        I disagree. Under the right conditions (read: actual competition instead of unregulated monopolies) I think a capitalist system be able to stay ahead, though I think both systems could compete depending on how they’re organized.

        But what I’m more interested in is you view that China is still Socialist/Communist. Isn’t DeepSeek a private company trying to maximize profits for itself by innovating, instead of a public company funded by the people? I don’t really know myself, but my perspective was that this was more of a capitalist vs capitalist situation. With one side (the US) kinda suffering from being so unregulated that innovation dies down.

        • surph_ninja@lemmy.world
          link
          fedilink
          English
          arrow-up
          1
          ·
          1 month ago

          Capitalism will by its very nature always lead to monopolies and depressed innovation. You cannot prevent corruption, while concentrating control of the means of production in the hands of a very few.

          They released DeepSeek for free. It was a side project the company worked on. How is releasing it for free in any way profit seeking?

  • Kazumara@discuss.tchncs.de
    link
    fedilink
    English
    arrow-up
    39
    arrow-down
    2
    ·
    1 month ago

    Hm even with DeepSeek being more efficient, wouldn’t that just mean the rich corps throw the same amount of hardware at it to achieve a better result?

    In the end I’m not convinced this would even reduce hardware demand. It’s funny that this of all things deflates part of the bubble.

    • UnderpantsWeevil@lemmy.world
      link
      fedilink
      English
      arrow-up
      43
      ·
      edit-2
      1 month ago

      Hm even with DeepSeek being more efficient, wouldn’t that just mean the rich corps throw the same amount of hardware at it to achieve a better result?

      Only up to the point where the AI models yield value (which is already heavily speculative). If nothing else, DeepSeek makes Altman’s plan for $1T in new data-centers look like overkill.

      The revelation that you can get 100x gains by optimizing your code rather than throwing endless compute at your model means the value of graphics cards goes down relative to the value of PhD-tier developers. Why burn through a hundred warehouses full of cards to do what a university mathematics department can deliver in half the time?

      • AppleTea@lemmy.zip
        link
        fedilink
        English
        arrow-up
        7
        ·
        edit-2
        1 month ago

        you can get 100x gains by optimizing your code rather than throwing endless compute at your model

        woah, that sounds dangerously close to saying this is all just developing computer software. Don’t you know we’re trying to build God???

        • UnderpantsWeevil@lemmy.world
          link
          fedilink
          English
          arrow-up
          2
          ·
          1 month ago

          Altman insisting that once the model is good enough, it will program itself was the moment I wrote the whole thing off as a flop.

    • peereboominc@lemm.ee
      link
      fedilink
      English
      arrow-up
      14
      ·
      1 month ago

      Maybe but it also means that if a company needs a datacenter with 1000 gpu’s to do it’s AI tasks demand, it will now buy 500.

      Next year it might need more but then AMD could have better gpu’s.

    • mapumbaa@lemmy.zip
      link
      fedilink
      English
      arrow-up
      7
      arrow-down
      2
      ·
      edit-2
      1 month ago

      It will probably not reduce demand. But it will for sure make it impossible to sell insanely overpriced hardware. Now I’m looking forward to buying a PC with a Chinese open source RISCV CPU and GPU. Bye bye Intel, AMD, ARM and Nvidia.

  • index@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    36
    arrow-down
    2
    ·
    1 month ago

    It still rely on nvidia hardware why would it trigger a sell-off? Also why all media are picking up this news? I smell something fishy here…

    • Railcar8095@lemm.ee
      link
      fedilink
      English
      arrow-up
      29
      ·
      1 month ago

      The way I understood it, it’s much more efficient so it should require less hardware.

      Nvidia will sell that hardware, an obscene amount of it, and line will go up. But it will go up slower than nvidia expected because anything other than infinite and always accelerating growth means you’re not good at business.

      • rumba@lemmy.zip
        link
        fedilink
        English
        arrow-up
        5
        ·
        1 month ago

        Back in the day, that would tell me to buy green.

        Of course, that was also long enough ago that you could just swap money from green to red every new staggered product cycle.

    • PhAzE@lemmy.ca
      link
      fedilink
      English
      arrow-up
      18
      ·
      1 month ago

      It requires only 5% of the same hardware that OpenAI needs to do the same thing. So that can mean less quantity of top end cards and it can also run on less powerful cards (not top of the line).

      Should their models become standard or used more commonly, then nvidis sales will drop.

      • b34k@lemmy.world
        link
        fedilink
        English
        arrow-up
        7
        ·
        1 month ago

        Doesn’t this just mean that now we can make models 20x more complex using the same hardware? There’s many more problems that advanced Deep Learning models could potentially solve that are far more interesting and useful than a chat bot.

        I don’t see how this ends up bad for Nvidia in the long run.

        • Isthisreddit@lemmy.world
          link
          fedilink
          English
          arrow-up
          7
          ·
          1 month ago

          Honestly none of this means anything at the moment. This might be some sort of calculated trickery from China to give Nvidia the finger, or Biden the finger, or a finger to Trump’s AI infrastructure announcement a few days ago, or some other motive.

          Maybe this “selloff” is masterminded by the big wall street players (who work hand-in-hand with investor friendly media) to panic retail investors so they can snatch up shares at a discount.

          What I do know is that “AI” is a very fast moving tech and shit that was true a few months ago might not be true tomorrow - no one has a crystal ball so we all just gotta wait and see.

          • alvvayson@lemmy.dbzer0.com
            link
            fedilink
            English
            arrow-up
            2
            ·
            1 month ago

            There could be some trickery on the training side, i.e. maybe they spent way more than $6M to train it.

            But it is clear that they did it without access to the infra that big tech has.

            And on the run side, we can all verify how well it runs and people are also running it locally without internet access. There is no trickery there.

            They are 20x cheaper than OpenAI if you run it on their servers and if you run it yourself, you only need a small investment in relatively affordable servers.

            • PhAzE@lemmy.ca
              link
              fedilink
              English
              arrow-up
              1
              ·
              1 month ago

              Give that statement to maybe not super techy investors, and that could spook them into the sell-off.

    • ArchRecord@lemm.ee
      link
      fedilink
      English
      arrow-up
      16
      ·
      1 month ago

      Here’s someone doing 200 tokens/s (for context, OpenAI doesn’t usually get above 100) on… A Raspberry Pi.

      Yes, the “$75-$120 micro computer the size of a credit card” Raspberry Pi.

      If all these AI models can be run directly on users devices, or on extremely low end hardware, who needs large quantities of top of the line GPUs?

      • aesthelete@lemmy.world
        link
        fedilink
        English
        arrow-up
        21
        ·
        edit-2
        1 month ago

        Thank the fucking sky fairies actually, because even if AI continues to mostly suck it’d be nice if it didn’t swallow up every potable lake in the process. When this shit is efficient that makes it only mildly annoying instead of a complete shitstorm of failure.

      • adoxographer@lemmy.world
        link
        fedilink
        English
        arrow-up
        6
        ·
        1 month ago

        While this is great, the training is where the compute is spent. The news is also about R1 being able to be trained, still on an Nvidia cluster but for 6M USD instead of 500

        • alvvayson@lemmy.dbzer0.com
          link
          fedilink
          English
          arrow-up
          5
          ·
          1 month ago

          True, but training is one-off. And as you say, a factor 100x less costs with this new model. Therefore NVidia just saw 99% of their expected future demand for AI chips evaporate

          Even if they are lying and used more compute, it’s obvious they managed to train it without access to the large amounts of the highest end chips due to export controls.

          Conservatively, I think NVidia is definitely going to have to scale down by 50% and they will have to reduce prices by a lot, too, since VC and government billions will no longer be available to their customers.

          • bestboyfriendintheworld@sh.itjust.works
            link
            fedilink
            English
            arrow-up
            3
            ·
            1 month ago

            True, but training is one-off. And as you say, a factor 100x less costs with this new model. Therefore NVidia just saw 99% of their expected future demand for AI chips evaporate

            It might also lead to 100x more power to train new models.

            • ArchRecord@lemm.ee
              link
              fedilink
              English
              arrow-up
              1
              ·
              1 month ago

              I doubt that will be the case, and I’ll explain why.

              As mentioned in this article,

              SFT (supervised fine-tuning), a standard step in AI development, involves training models on curated datasets to teach step-by-step reasoning, often referred to as chain-of-thought (CoT). It is considered essential for improving reasoning capabilities. DeepSeek challenged this assumption by skipping SFT entirely, opting instead to rely on reinforcement learning (RL) to train the model. This bold move forced DeepSeek-R1 to develop independent reasoning abilities, avoiding the brittleness often introduced by prescriptive datasets.

              This totally changes the way we think about AI training, which is why while OpenAI spent $100m on training GPT-4, running an expected 500,000 GPUs, DeepSeek used about 50,000, and likely spent that same roughly 10% of the cost.

              So while operation, and even training, is now cheaper, it’s also substantially less compute intensive to train models.

              And not only is there less data than ever to train models on that won’t cause them to get worse by regurgitating other worse quality AI-generated content, but even if additional datasets were scrapped entirely in favor of this new RL method, there’s a point at which an LLM is simply good enough.

              If you need to auto generate a corpo-speak email, you can already do that without many issues. Reformat notes or user input? Already possible. Classify tickets by type? Done. Write a silly poem? That’s been possible since pre-ChatGPT. Summarize a webpage? The newest version of ChatGPT will probably do just as well as the last at that.

              At a certain point, spending millions of dollars for a 1% performance improvement doesn’t make sense when the existing model just already does what you need it to do.

              I’m sure we’ll see development, but I doubt we’ll see a massive increase in training just because the cost to run and train the model has gone down.

          • adoxographer@lemmy.world
            link
            fedilink
            English
            arrow-up
            3
            ·
            1 month ago

            I’m not sure. That’s a very static view of the context.

            While china has an AI advantage due to wider adoption, less constraints and overall bigger market, the US has higher tech, and more funds.

            OpenAI, Anthropic, MS and especially X will all be getting massive amounts of backing and will reverse engineer and adopt whatever advantages R1 had. Which while there are some it’s still not a full spectrum competitor.

            I see the is as a small correction that the big players will take advantage of to buy stock, and then pump it with state funds, furthering the gap and ignoring the Chinese advances.

            Regardless, Nvidia always wins. They sell the best shovels. In any scenario the world at large still doesn’t have their Nvidia cluster, think Africa, Oceania, South America, Europe, SEA who doesn’t necessarily align with Chinese interests, India. Plenty to go around.

            • alvvayson@lemmy.dbzer0.com
              link
              fedilink
              English
              arrow-up
              2
              ·
              1 month ago

              Extra funds are only useful if they can provide a competitive advantage.

              Otherwise those investments will not have a positive ROI.

              The case until now was built on the premise that US tech was years ahead and that AI had a strong moat due to high computer requirements for AI.

              We now know that that isn’t true.

              If high compute enables a significant improvement in AI, then that old case could become true again. But the prospects of such a reality happening and staying just got a big hit.

              I think we are in for a dot-com type bubble burst, but it will take a few weeks to see if that’s gonna happen or not.

              • adoxographer@lemmy.world
                link
                fedilink
                English
                arrow-up
                1
                ·
                1 month ago

                Maybe, but there is incentive to not let that happen, and I wouldn’t be surprised if “they” have unpublished tech that will be rushed out.

                The ROI doesn’t matter, it wasn’t there yet it’s the potential for it. The Chinese AIs are also not there yet. The proposition is to reduce FTEs, regardless of cost, as long as cost is less.

                While I see OpenAi and mostly startups and VC reliant companies taking a hit, Nvidia itself as the shovel maker will remain strong.

        • vrighter@discuss.tchncs.de
          link
          fedilink
          English
          arrow-up
          2
          ·
          1 month ago

          if, on a modern gaming pc, you can get breakneck speeds of 5 tokens per second, then actually inference is quite energy intensive too. 5 per second of anything is very slow

        • vrighter@discuss.tchncs.de
          link
          fedilink
          English
          arrow-up
          3
          ·
          1 month ago

          i can also run it on my old pentium from 3 decades ago. I’d have to swap 4MiB of weights in and out constantly, it will be very very slow, but it will work.

      • GenosseFlosse@feddit.org
        link
        fedilink
        English
        arrow-up
        1
        arrow-down
        1
        ·
        1 month ago

        Sure you can run it on low end hardware, but how does the performance (response time for a given prompt) compare to the other models, either local or as a service?

        • ArchRecord@lemm.ee
          link
          fedilink
          English
          arrow-up
          1
          ·
          1 month ago

          That set of tokens/s is the performance, or response time if you’d like to call it that. GPT-o1 tends to get anywhere from 33-60, whereas in the example I showed previously, a Raspberry Pi can do 200 on a distilled model.

          Now, granted, a distilled model will produce worse performance than the full one, as seen in a benchmark comparison done by DeepSeek here (I’ve outlined the most distilled version of the newest DeepSeek model, which is likely the kind that is being run on the Raspberry Pi, albeit likely with some changes made by the author of that post, as well as OpenAI’s two most high-end models of a comparable distillation)

          The gap in quality is relatively small for a model that is likely distilled far past what OpenAI’s “mini” model is, when you consider that even regular laptop/PC hardware is orders of magnitudes more powerful than a Raspberry Pi, or that an external AI accelerator can be bought for as little as $60, the quality in practice could be very comparable with even slightly less distillation, especially with fine-tuning for a given use case (e.g. a local version of DeepSeek in a code development platform would be fine-tuned specifically just to produce code-related results)

          If you get into the region of only cloud-hosted instances of DeepSeek that are running at-scale on GPUs like OpenAI’s models are, the performance is only 1-2 percentage points off from OpenAI’s model, at about 3-6% of the cost, which effectively means 3-6% of the total amount of GPU power being paid for compared to the amount of GPU power OpenAI is paying for.

    • teegus@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      11
      arrow-down
      2
      ·
      1 month ago

      A year ago the price was $62, now after the fall it is $118. Stocks are volatile, what else is new? Pretty much non-news if you ask me.

    • 𝓔𝓶𝓶𝓲𝓮@lemm.ee
      link
      fedilink
      English
      arrow-up
      2
      ·
      edit-2
      1 month ago

      And you should, generally we are amidst the internet world war. It’s not something fishy but digital rotten eggs thrown around by the hundreds.

      The only way to remain sane is to ignore it and scroll on. There is no winning versus geopolitical behemoths as a lone internet adventurer. It’s impossible to tell what’s real and what isn’t
      the first casualty of war is truth

  • RxBrad@infosec.pub
    link
    fedilink
    English
    arrow-up
    34
    arrow-down
    1
    ·
    1 month ago

    Okay, cool…

    So, how much longer before Nvidia stops slapping a “$500-600 RTX XX70” label on a $300 RTX XX60 product with each new generation?

    The thinly-veiled 75-100% price increases aren’t fun for those of us not constantly-touching-themselves over AI.

  • vga@sopuli.xyz
    link
    fedilink
    English
    arrow-up
    33
    ·
    edit-2
    1 month ago

    I should really start looking into shorting stocks. I was looking at the news and Nvidia’s stock and thought “huh, the stock hasn’t reacted to these news at all yet, I should probably short this”.

    And then proceeded to do fuck all.

    I guess this is why some people are rich and others are like me.

    • Knock_Knock_Lemmy_In@lemmy.world
      link
      fedilink
      English
      arrow-up
      11
      ·
      1 month ago

      It’s pretty difficult to open a true short position. Providers like Robinhood create contract for differences which are subject to their TOS.

    • peregrin5@lemm.ee
      link
      fedilink
      English
      arrow-up
      10
      ·
      edit-2
      1 month ago

      It’s been proven that people who do fuckall after throwing their money into mutual funds generally fare better than people actively monitoring and making stock moves.

      You’re probably fine.

      I never bought NVIDIA in the first place so this news doesn’t affect me.

      If anything now would be a good time to buy NVIDIA. But I probably won’t.

      • MutilationWave@lemmy.world
        link
        fedilink
        English
        arrow-up
        3
        ·
        1 month ago

        The vast majority of my invested money is in SPY. I had a lot of “money” wiped out yesterday. It’s already trending back up. I’m holding for now.

  • Mac@mander.xyz
    link
    fedilink
    English
    arrow-up
    32
    ·
    1 month ago

    What the fuck are markets when you can automate making money on them???

    Ive been WTF about the stock market for a long time but now it’s obviously a scam.

    • thistleboy@lemmy.world
      link
      fedilink
      English
      arrow-up
      45
      arrow-down
      1
      ·
      1 month ago

      The stock market is nothing more than a barometer for the relative peace of mind of rich people.

      • nomy@lemmy.zip
        link
        fedilink
        English
        arrow-up
        1
        ·
        1 month ago

        Economics is a social science not a hard science, it’s highly reactive to rumors and speculation. The stock market kind of does just run on vibes.

  • just_another_person@lemmy.world
    link
    fedilink
    English
    arrow-up
    24
    arrow-down
    1
    ·
    2 months ago

    Well, you still need the right kind of hardware to run it, and my money has been on AMD to deliver the solutions for that. Nvidia has gone full-blown stupid on the shit they are selling, and AMD is all about cost and power efficiency, plus they saw the writing on the wall for Nvidia a long time ago and started down the path for FPGA, which I think will ultimately be the same choice for running this stuff.

    • wootz@lemmy.world
      link
      fedilink
      English
      arrow-up
      10
      arrow-down
      1
      ·
      2 months ago

      Built a new PC for the first time in a decade last spring. Went full team red for the first time ever. Very happy with that choice so far.

      • palordrolap@fedia.io
        link
        fedilink
        arrow-up
        2
        ·
        1 month ago

        And it may yet swing back the other way.

        Twenty or so years ago, there was a brief period when going full AMD (or AMD+ATI as it was back then; AMD hadn’t bought ATI yet) made sense, and then the better part of a decade later, Intel+NVIDIA was the better choice.

        And now I have a full AMD PC again.

        Intel are really going to have to turn things around in my eyes if they want it to swing back, though. I really do not like the idea of a CPU hypervisor being a fully fledged OS that I have no access to.

    • NuXCOM_90Percent@lemmy.zip
      link
      fedilink
      English
      arrow-up
      3
      ·
      2 months ago

      From a “compute” perspective (so not consumer graphics), power… doesn’t really matter. There have been decades of research on the topic and it almost always boils down to “Run it at full bore for a shorter period of time” being better (outside of the kinds of corner cases that make for “top tier” thesis work).

      AMD (and Intel) are very popular for their cost to performance ratios. Jensen is the big dog and he prices accordingly. But… while there is a lot of money in adapting models and middleware to AMD, the problem is still that not ALL models and middleware are ported. So it becomes a question of whether it is worth buying AMD when you’ll still want/need nVidia for the latest and greatest. Which tends to be why those orgs tend to be closer to an Azure or AWS where they are selling tiered hardware.

      Which… is the same issue for FPGAs. There is a reason that EVERYBODY did their best to vilify and kill opencl and it is not just because most code was thousands of lines of boilerplate and tens of lines of kernels. Which gets back to “Well. I can run this older model cheap but I still want nvidia for the new stuff…”

      Which is why I think nvidia’s stock dropping is likely more about traders gaming the system than anything else. Because the work to use older models more efficiently and cheaply has already been a thing. And for the new stuff? You still want all the chooch.

      • just_another_person@lemmy.world
        link
        fedilink
        English
        arrow-up
        2
        ·
        2 months ago

        Your assessment is missing the simple fact that FPGA can do things a GPU cannot faster, and more cost efficiently though. Nvidia is the Ford F-150 of the data center world, sure. It’s stupidly huge, ridiculously expensive, and generally not needed unless it’s being used at full utilization all the time. That’s like the only time it makes sense.

        If you want to run your own models that have a specific purpose, say, for scientific work folding proteins, and you might have several custom extensible layers that do different things, N idia hardware and software doesn’t even support this because of the nature of Tensorrt. They JUST announced future support for such things, and it will take quite some time and some vendor lock-in for models to appropriately support it…OR

        Just use FPGAs to do the same work faster now for most of those things. The GenAI bullshit bandwagon finally has a wheel off, and it’s obvious people don’t care about the OpenAI approach to having one model doing everything. Compute work on this is already transitioning to single purpose workloads, which AMD saw coming and is prepared for. Nvidia is still out there selling these F-150s to idiots who just want to piss away money.

        • NuXCOM_90Percent@lemmy.zip
          link
          fedilink
          English
          arrow-up
          4
          ·
          1 month ago

          Your assessment is missing the simple fact that FPGA can do things a GPU cannot faster

          Yes, there are corner cases (many of which no longer exist because of software/compiler enhancements but…). But there is always the argument of “Okay. So we run at 40% efficiency but our GPU is 500% faster so…”

          Nvidia is the Ford F-150 of the data center world, sure. It’s stupidly huge, ridiculously expensive, and generally not needed unless it’s being used at full utilization all the time. That’s like the only time it makes sense.

          You are thinking of this like a consumer where those thoughts are completely valid (just look at how often I pack my hatchback dangerously full on the way to and from Lowes…). But also… everyone should have that one friend with a pickup truck for when they need to move or take a load of stuff down to the dump or whatever. Owning a truck yourself is stupid but knowing someone who does…

          Which gets to the idea of having a fleet of work vehicles versus a personal vehicle. There is a reason so many companies have pickup trucks (maybe not an f150 but something actually practical). Because, yeah, the gas consumption when you are just driving to the office is expensive. But when you don’t have to drive back to headquarters to swap out vehicles when you realize you need to go buy some pipe and get all the fun tools? It pays off pretty fast and the question stops becoming “Are we wasting gas money?” and more “Why do we have a car that we just use for giving quotes on jobs once a month?”

          Which gets back to the data center issue. The vast majority DO have a good range of cards either due to outright buying AMD/Intel or just having older generations of cards that are still in use. And, as a consumer, you can save a lot of money by using a cheaper node. But… they are going to still need the big chonky boys which means they are still going to be paying for Jensen’s new jacket. At which point… how many of the older cards do they REALLY need to keep in service?

          Which gets back down to “is it actually cost effective?” when you likely need

          • just_another_person@lemmy.world
            link
            fedilink
            English
            arrow-up
            3
            ·
            edit-2
            1 month ago

            I’m thinking of this as someone who works in the space, and has for a long time.

            An hour of time for a g4dn instance in AWS is 4x the cost of an FPGA that can do the same work faster in MOST cases. These aren’t edge cases, they are MOST cases. Look at a Sagemaker, AML, GMT pricing for the real cost sinks here as well.

            The raw power and cooling costs contribute to that pricing cost. At the end of the day, every company will choose to do it faster and cheaper, and nothing about Nvidia hardware fits into either of those categories unless you’re talking about milliseconds of timing, which THEN only fits into a mold of OpenAI’s definition.

            None of this bullshit will be a web-based service in a few years, because it’s absolutely unnecessary.

            • NuXCOM_90Percent@lemmy.zip
              link
              fedilink
              English
              arrow-up
              1
              arrow-down
              1
              ·
              1 month ago

              And you are basically a single consumer with a personal car relative to those data centers and cloud computing providers.

              YOUR workload works well with an FPGA. Good for you, take advantage of that to the best degree you can.

              People;/companies who want to run newer models that haven’t been optimized for/don’t support FPGAs? You get back to the case of “Well… I can run a 25% cheaper node for twice as long?”. That isn’t to say that people shouldn’t be running these numbers (most companies WOULD benefit from the cheaper nodes for 24/7 jobs and the like). But your use case is not everyone’s use case.

              And, it once again, boils down to: If people are going to require the latest and greatest nvidia, what incentive is there in spending significant amounts of money getting it to work on a five year old AMD? Which is where smaller businesses and researchers looking for a buyout come into play.

              At the end of the day, every company will choose to do it faster and cheaper, and nothing about Nvidia hardware fits into either of those categories unless you’re talking about milliseconds of timing, which THEN only fits into a mold of OpenAI’s definition.

              Faster is almost always cheaper. There have been decades of research into this and it almost always boils down to it being cheaper to just run at full speed (if you have the ability to) and then turn it off rather than run it longer but at a lower clock speed or with fewer transistors.

              And nVidia wouldn’t even let the word “cheaper” see the glory that is Jensen’s latest jacket that costs more than my car does. But if you are somehow claiming that “faster” doesn’t apply to that company then… you know nothing (… Jon Snow).

              unless you’re talking about milliseconds of timing

              So… its not faster unless you are talking about time?

              Also, milliseconds really DO matter when you are trying to make something responsive and already dealing with round trip times with a client. And they add up quite a bit when you are trying to lower your overall footprint so that you only need 4 notes instead of 5.

              They don’t ALWAYS add up, depending on your use case. But for the data centers that are selling computers by time? Yeah,. time matters.

              So I will just repeat this: Your use case is not everyone’s use case.

              • just_another_person@lemmy.world
                link
                fedilink
                English
                arrow-up
                1
                arrow-down
                1
                ·
                1 month ago

                I mean…I can shut this down pretty simply. Nvidia makes GPUs that are currently used as a blunt force tool, which is dumb, and now that the grift has been blown, OpenAI, Anthropic, Meta, and all the others trying to make a business center around a really simple tooling that is open source, are about to be under so much scrutiny for the cost that everyone will figure out that there are cheaper ways to do this.

                Plus AMD, Con Nvidia. It’s really simple.