Chinese artificial intelligence (AI) company DeepSeek has sent shockwaves through the tech community with the release of extremely efficient AI models that can compete with cutting-edge products from US companies such as OpenAI and Anthropic. Founded in 2023, DeepSeek has achieved its results with a fraction of the cash and computing power used by its competitors. DeepSeek’s “reasoning” R1 model, released last week, provoked excitement among researchers, shock among investors, and responses from AI heavyweights. The company followed up on January 28 with a model that can work with images as well as text. So what has DeepSeek done, and how…
Author: Tongliang Liu, Associate Professor of Machine Learning and Director of the Sydney AI Centre, University of Sydney