What is DeepSeek AI?

Context: The Chinese start-up DeepSeek, has created a buzz with the launch of its cutting-edge AI models ‘DeepSeek-R1’ & ‘DeepSeek-V3’, claiming they nearly match the capabilities of top AI models in the U.S., while being far more affordable.

Relevance of the Topic: Prelims: Basic understanding of terms like Large Language Models, DeepSeek AI. 

What is DeepSeek?

  • DeepSeek is a Hangzhou-based Chinese startup that has recently launched artificial intelligence (AI) chatbot built on a low-cost Large Language Model (LLM) infrastructure. 
  • The AI is optimised for tasks like maths and coding, making it a strong competitor in the AI space.
DeepSeek

DeepSeek vs global LLMs

  • Low training cost: DeepSeek reportedly trained its model for just $6 million, significantly lower than the estimated $100 million expenditure behind OpenAI's GPT-4.
    • DeepSeek was able to dramatically reduce the cost of building its AI models by using NVIDIA’s H800 chips, an older generation of GPUs in the US.
  • High efficiency & Low cost: The model is being praised for its efficiency, as it uses advanced and lower-grade chips to deliver high performance at a lower cost.
    • E.g., Reportedly, DeepSeek-R1 is 20 to 50 times cheaper to use than OpenAI o1 model (depending on the task).
    • DeepSeek’s R1 may not be quite as advanced as OpenAI’s o3, it is almost on par with OpenAI o1 on several metrics.
  • Innovative & Adaptable:
    • DeepSeek-R1 uses reinforcement learning to naturally (autonomously) evolve its reasoning capabilities. The model self-improves through feedback loops during training, without needing massive labeled datasets.
    • DeepSeek-R1 can transfer (distill) reasoning capabilities into smaller models (SLMs), which are faster and more resource-efficient. Thus, DeepSeek-R1’s reasoning capabilities are scalable across different model sizes, making it highly adaptable.
  • Affordable for users: DeepSeek's paid subscription comes at $0.50 a month, while ChatGPT costs $20.

Concerns:

  • Censorship on digital content & bias:
    • Unlike many Western models, DeepSeek follows China's strict censorship rules. When asked about sensitive topics, it avoids direct answers, reflecting government control over digital content. 
    • Furthermore, the chatbot is expected to have a pro-China bias. 
  • Potential security risks: Experts have warned about potential security risks associated with the DeepSeek AI app, pointing out the need for scrutiny in data privacy and AI ethics.

Global Impact:

  • Sputnik moment: Much like the Sputnik in the 1950s, DeepSeek brings a new technological frontier into the great power competition.
  • Market disruption: The launch of the DeepSeek AI resulted in a historic $600 billion market value drop for Nvidia, a key player in AI chip production. 
  • Policy implications: DeepSeek heralds an escalation of the geopolitical rivalry between the US and China. It risks escalation of the US government's restrictions on advanced chip exports to China.  

Read more: What are Small Language Models? 

Share this with friends ->

Leave a Reply

Your email address will not be published. Required fields are marked *

The maximum upload file size: 20 MB. You can upload: image, document, archive. Drop files here

Discover more from Compass by Rau's IAS

Subscribe now to keep reading and get access to the full archive.

Continue reading