DeepSeek is a Game Changer for AI – Computerphile

By Computerphile / January 28, 2025

An AI model that changed the fortunes of silicon valley overnight. Deep Seek has been released open source, and requires far less hardware and investment. Mike Pound is based at the University of Nottingham.

EXTRA BITS: https://youtu.be/tMm7DYTGJ44

Computerphile is supported by Jane Street. Learn more about them (and exciting career opportunities) at: https://jane-st.co/computerphile

The Deep Seek papers:
https://github.com/deepseek-ai/DeepSeek-V3/blob/main/DeepSeek_V3.pdf
https://github.com/deepseek-ai/DeepSeek-R1/blob/main/DeepSeek_R1.pdf

https://www.deepseek.com/

This video was filmed and edited by Sean Riley.

Computerphile is a sister project to Brady Haran’s Numberphile. More at https://www.bradyharanblog.com

source

41 thoughts on “DeepSeek is a Game Changer for AI – Computerphile”

@JaapvanderVelde
January 30, 2025 at 8:26 am

From this, at this point in time, you'd expect OpenAI to crash (getting way too pricey), NVIDIA to do even better (because everyone will now want hardware, as stuff comes in reach), and companies like Meta being harder to predict, but perhaps more capable of pivoting. As for Microsoft, it'll be interesting to see who they want to buy next and whether they are too deeply into OpenAI.

Reply
@umcasalmuitolouco
January 30, 2025 at 8:29 am

Nice explanation, but it seems it still not possible to reproduce what they have done.

Reply
@MonolithproductionsT
January 30, 2025 at 8:37 am

I like to give each new AI model a simple test to see how reliable its answers will be: I ask for the plot to a specific movie from the 1930s , of which I own one of the last reels in the world, and does not exist on the internet, apart from an IMDB page acknowledging that the movie was made. Every single AI model I've tested has given me a nonsense answer, guessing on the content based on the genre. When a model just straight up tells me it doesn't know anything about that movie, I'll know that it can be reliable and doesn't hallucinate gaps in its knowledge. DeepSeek invented details, so that's a fail on this metric for DeepSeek.

I call it the "humility test" and when AI can finally pass that I would bet that the information it provides will be significantly more accurate.

Reply
@TeddyMack
January 30, 2025 at 8:45 am

Am I the only one that thinks that having smaller specialised areas that can communicate to other specialised areas of the neural network is much closer to how actual brains work

Reply
@falco830
January 30, 2025 at 8:47 am

Careful out there folks, this is new science that has been blessed to the public. The things that can be done with this are quite limitless and unexplored. Horrifying and powerful are the dangerous vocabulary you want to be looking out for.

Reply
@macksryanhold
January 30, 2025 at 8:55 am

Can you please do this fella a favor and adjust your cam white balance next time…

Reply
@charon7320
January 30, 2025 at 8:57 am

so what? the tech will catch up, what deepseek did gonna be outclassed by somebody else, it's how tech goes. now what's interesting is how people who factor the NASDAQ, they could just have sold invidia a week ago, now they buy low due to 'new chinese ai' and one month from now they just make the big money when new gpt model outclasses deepseek. it's all in the rumors and news, and here we are discussing the details.

Reply
@blockshift758
January 30, 2025 at 9:14 am

i've been looking for source code cause everyone keep saying open source. there is no source code.

Reply
@gostfalcon
January 30, 2025 at 9:17 am

AI got stale for 2 years. DeepSeek was the answer for lack of innovation by lazy companies only throwing money at problems with no true value add.

Reply
@EdgeCreativeSydney
January 30, 2025 at 9:27 am

do it smarter not harder

Reply
@TheBruuz
January 30, 2025 at 9:31 am

I would argue NVIDIA is going to sell even more videocards now in the future, since AI seems to be democratizing into mainstream, and there's always going to be a need for the cutting edge too. Companies on the cutting edge will just scale it up even further.

Reply
@23Devvrat
January 30, 2025 at 9:35 am

Do you see the reasoning capabilities of such models being able to solve unsolved maths/physics/compsci problems? If I provide it with an endless source of electricity, state of the art hardware and the correct supervision, would a reasoning model be able to solve the goldbach conjecture eventually? As humans, we have limits to how much we can THINK during a single day which translates to a week to a year to our whole lifespans, a neural network mathematician could think forever at ever increasing speeds. I am really excited to see where this takes subjects like pure maths where thinking is essentially the whole job (unlike physics, applied maths where you have to make experiments and whatnot).

Reply
@Ipowne3g
January 30, 2025 at 9:38 am

Tiananmen square happened.

Reply
@yppahpeek
January 30, 2025 at 9:41 am

6:47 was very much waiting for the number 42 here

Reply
@katiporoka
January 30, 2025 at 9:47 am

Thank you DeepSeek, your answers prove that you are a political propaganda machine.

The server is busy. Please try again later.

Reply
@climbeverest
January 30, 2025 at 9:47 am

Why did this not come out of a university ?

Reply
@climbeverest
January 30, 2025 at 9:48 am

Nice speech

Reply
@computerfis
January 30, 2025 at 9:51 am

I'm so happy for this channel

Reply
@PhillipGonzalez-u8r
January 30, 2025 at 9:53 am

This was super helpful, thanks!

Reply
@Sirius-me5zy
January 30, 2025 at 10:11 am

Deepseek another pump and dump instrument. AI contributed give any quality of life.any benefit the investors.

Reply
@Sirius-me5zy
January 30, 2025 at 10:11 am

Deepseek another pump and dump instrument. AI contributed give any quality of life.any benefit the investors.

Reply
@Sirius-me5zy
January 30, 2025 at 10:11 am

Deepseek another pump and dump instrument. AI contributed give any quality of life.any benefit the investors.

Reply
@deliro8758
January 30, 2025 at 10:21 am

The fact that Deepseek is so openly propaganda is beyond concerning to me, open source or not. The model is babbling about the unification of the motherland and people gobble it up because its better than the things that came before.

Reply
@phild5322
January 30, 2025 at 10:24 am

F*ck Silicon Valley and Nvidia

Reply
@peterwassmuth4014
January 30, 2025 at 10:30 am

Well for now I Wellcome DeepSeek!

ClosedAI and co, in general are way too expensive.

Reply
@schmuckthisout36
January 30, 2025 at 10:32 am

Truly fabulous explanation. Thank you very much.

Reply
@DioTheGreatOne
January 30, 2025 at 10:35 am

Rare China W

Reply
@DennisGentry
January 30, 2025 at 10:35 am

I tried a couple of integrals (1st year calculus) that 4-o could solve with some intermediate prompting. Even with a bunch of help, DeepSeek (R1) could never come up with the completely correct answers. It did fine on programming problems and stuff like coming up with meal suggestions and shopping lists. Claude did better on the programming problems, but can’t really render math in a readable way.

Reply
@modar9251
January 30, 2025 at 10:42 am

Could be what Linux did to Unix.

Reply
@adriangora2415
January 30, 2025 at 10:53 am

cool

Reply
@CHTO82
January 30, 2025 at 10:54 am

Has any of what the company behind Deep Seek claims been confirmed? So far I see alot of claims but everything comes from chinese media.

Reply
@IDK_Mr.M
January 30, 2025 at 10:58 am

Deepfake

Reply
@exaaltare1170
January 30, 2025 at 11:03 am

I think that Chinese just demonstrated how they can burst US economy bubles when it comes to tech😅 and we are all the ones who will benefit from that. That is the most powerful equalizer.

Reply
@alexkakac9087
January 30, 2025 at 11:05 am

I love me some chinese propaganda

Reply
@dinikasaxena
January 30, 2025 at 11:06 am

Yayy!! I'm so glad this is truly open (and just in time for fossdem 😊)

Reply
@NeedsEvidence
January 30, 2025 at 11:09 am

But what about Skynet?

Reply
@noncompulsoryl
January 30, 2025 at 11:18 am

Er, so what is a reward to AI?

Reply
@d3xmeister
January 30, 2025 at 11:20 am

Wow, so much naivety !

Reply
@iceflower7004
January 30, 2025 at 11:21 am

My research group and I had the idea of a sort of “load balanced llm” where we’d have multiple trained models, all trained to be masters of their specific topics. Math models would be trained to have this inner monologue that could solve problems step by step, coding models trained to analyze code, etc, but we turned away from it since initial tests showed we’d need a lot more time to make it work, which wasn’t viable for the sort of project we were trying to do. Kinda cool to see that a similar idea to ours was able to throw Wall Street into a frenzy

Reply
@d3xmeister
January 30, 2025 at 11:22 am

Another huge technology communism contributed with to the free world.

Reply
@nikolaskallianiotis8622
January 30, 2025 at 11:24 am

What deepseek's model actually did was to expose a multibillion group of companies (mostly US based) fraud, insisting that training the so-called "AI" models require continuous investments in new and expensive hardware. Deepseek created an "AI" model that have better performance compared to the well known models at a fraction of the cost in hardware and energy. That's a marvel of engineering and creative thinking, which is 100% HI (Human Intelligence) outcome. It's not a secret that the company, which lost in a single day $600B of capitalization, is Nvidia.

Reply

41 thoughts on “DeepSeek is a Game Changer for AI – Computerphile”

Leave a Comment Cancel Reply