swordfish69 2 minutes ago

Can we stop this drawn-out narrative that Deepseek is at the level of Gemini or o3? It’s brilliant in its own way but for some reason a lot of journalists think it’s still at par with American frontier models.

htrp 38 minutes ago

Still parroting the same uninformed takes from January

>DeepSeek claimed to have built its base model for about 5% of the estimated cost of GPT-4

  • juujian 23 minutes ago

    What do we know now that we did not know in Jan? Is there some information on this that I have missed?

    • fzzzy 18 minutes ago

      They didn't include the costs for developing v3, the base model.

      [edit] also they seem to be saying r1 is a base model, which it is not. Very sloppy.

      • xnx 14 minutes ago

        Didn't they also train off of ChatGPT API output?

  • zeroq 19 minutes ago

    I'll put my tinfoil hat on and say it plays to the current US vs China "propaganda" tune, that US is winning on all fronts, but the ice thin and have to support local tech behemoths to full extent to secure our position in this world defining struggle.

  • Analemma_ 23 minutes ago

    Bloomberg still has not retracted (or even really commented on) the Supermicro spy chip story, preferring to hope people just forget about it if they maintain total silence. They're fine if you need to look up where the Nasdaq closed yesterday, but don't expect serious tech reporting from them.